Recognition Method of Chinese Phrase Structure Based on Maximum Entropy

doi:10.3969/j.issn.1000-3428.2011.16.070

Computer Engineering, 2011, Vol. 37, Issue 16: 206-208.

• Networks and Communications •

HUO Ya-ge, HUANG Guang-jun

(Electronic & Information Engineering College, Henan University of Science and Technology, Luoyang 471003, China)

Received: 2011-02-21 Online: 2011-08-20 Published: 2011-08-20

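About the authors: HUO Ya-ge (b. 1984), female, master's student; main research interests: natural language processing and Chinese information retrieval. HUANG Guang-jun, associate professor, Ph.D.
Funding: Supported by the Science and Technology Research Program of Henan Province (102102210159).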

Abstract

To improve computer processing of Chinese-language information and to support better shallow parsing, this paper presents a method for recognizing Chinese phrase structure based on Maximum Entropy (ME). Mutual Information (MI) between words is used to predict the phrase-structure boundaries of a sentence. Atomic and composite feature templates are built for the ME model, and the more effective features are selected to form the final feature set, which is then used to recognize the phrase structure of the sentence. Experiments show that the MI-based ME model achieves good precision and recall.

Key words: shallow parsing, Mutual Information (MI), boundary prediction, Maximum Entropy (ME) model, feature selection
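
The abstract gives no formulas, so the following is only a minimal sketch of the two ideas it names, under stated assumptions. It assumes pointwise mutual information between adjacent words as the MI measure, a fixed threshold below which a phrase boundary is predicted, and a small set of atomic (single-item) and composite (joined-item) feature templates over words and part-of-speech tags. The function names, the threshold, and the specific templates are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def adjacent_pmi(sentences):
    """Pointwise mutual information for each adjacent word pair.

    `sentences` is a list of already-segmented sentences (lists of words).
    Assumption: PMI over adjacent pairs stands in for the paper's MI measure.
    """
    unigrams = Counter()
    bigrams = Counter()
    for words in sentences:
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())
    pmi = {}
    for (a, b), count in bigrams.items():
        p_ab = count / n_bi
        p_a = unigrams[a] / n_uni
        p_b = unigrams[b] / n_uni
        pmi[(a, b)] = math.log2(p_ab / (p_a * p_b))
    return pmi


def predict_boundaries(words, pmi, threshold=0.0):
    """Indices i where a boundary is predicted between words[i] and words[i + 1].

    Weakly associated pairs (PMI below the hypothetical threshold) and
    unseen pairs are treated as boundary candidates.
    """
    return [
        i
        for i in range(len(words) - 1)
        if pmi.get((words[i], words[i + 1]), float("-inf")) < threshold
    ]


def extract_features(words, tags, i):
    """Features for position i from hypothetical atomic and composite templates.

    Atomic templates use one context item; composite templates join two.
    """
    prev_tag = tags[i - 1] if i > 0 else "<BOS>"
    next_tag = tags[i + 1] if i + 1 < len(tags) else "<EOS>"
    atomic = {
        f"w0={words[i]}",
        f"t0={tags[i]}",
        f"t-1={prev_tag}",
        f"t+1={next_tag}",
    }
    composite = {
        f"t-1|t0={prev_tag}|{tags[i]}",
        f"t0|t+1={tags[i]}|{next_tag}",
    }
    return atomic | composite
```

In a pipeline of this shape, the predicted boundaries would narrow the candidate phrase spans, and the extracted feature strings would be passed to an ME classifier. The classifier itself and the feature-selection step are not shown here.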

CLC Number: TP391

HUO Ya-ge, HUANG Guang-jun. Recognition Method of Chinese Phrase Structure Based on Maximum Entropy[J]. Computer Engineering, 2011, 37(16): 206-208.

URL: https://www.ecice06.com/EN/Y2011/V37/I16/206
