Uyghur Homograph Disambiguation Based on Classification and Optimal Mapping Pronunciation

doi:10.3969/j.issn.1000-3428.2012.18.006

Computer Engineering ›› 2012, Vol. 38 ›› Issue (18): 22-25.

Previous Articles Next Articles

Uyghur Homograph Disambiguation Based on Classification and Optimal Mapping Pronunciation

Guljamal Mamateli ^a, Askar Rozi ^b, Gulnar Ali^a, Askar Hamdulla ^a

(a. Institute of Information Science and Engineering; b. Institute of Mathematics and System Science, Xinjiang University, Urumqi 830046, China)

Received:2011-10-24 Revised:2011-12-14 Online:2012-09-20 Published:2012-09-18

基于分类及最佳匹配读音的维吾尔多音词消歧

姑丽加玛丽•麦麦提艾力 ^a，艾斯卡尔•肉孜 ^b，古丽娜尔•艾力 ^a，艾斯卡尔•艾木都拉 ^a

(新疆大学 a. 信息科学与工程学院；b. 数学与系统科学学院，乌鲁木齐 830046)

作者简介:姑丽加玛丽?麦麦提艾力(1984－)，女，博士研究生，主研方向：维吾尔语音合成技术；艾斯卡尔?肉孜、古丽娜尔?艾力，讲师；艾斯卡尔?艾木都拉，教授
基金资助:
国家自然科学基金资助项目(61065005, 61062008)；教育部新世纪优秀人才支持计划基金资助项目(NCET-10-0969)

Abstract

Abstract:

This paper deeply investigates the homograph in Uyghur language and classifies them according to the different features of homograph, disambiguates the first type of homograph according to the mapping relation between the part of speech and pronunciation, disambiguates the second type of homograph according to vowel weakening when suffix attaches to a stem, and optimal pronunciation mapping method is used to disambiguate the third type of homograph by extracting the contextual features of homograph. Log-likelihood ratio is used to select keywords and keyword selection experiment of different window size is also conducted. Experimental result shows that the homograph disambiguation performance of can be got to 20.9% error rate through the research idea of this paper.

Key words: Uyghur language, homograph disambiguation, classification, vowel weakening, optimal mapping pronunciation, keyword selection

摘要：

研究维吾尔语中的多音词现象，根据多音词的不同特点进行分类。利用词性和读音的映射关系消歧第1类多音词。根据词缀连接词干后是否发生元音弱化的特点消歧第2类多音词。提取上下文语境信息，使用最佳匹配读音的方法消歧第3类多音词。采用似然比方法进行关键词选择，并对不同窗口宽度的关键词选取方法进行对比实验。结果表明，该方法可以得到错误率为20.9%的多音词消歧效果。

关键词: 维吾尔语, 多音词消歧, 分类, 元音弱化, 最佳匹配读音, 关键词选取

CLC Number:

TP391

GU Li-Jia-Ma-Li-.Mai-Chi-Ai-Li-a, AI Shi-Ka-Er-.Zi-b, GU Li-Na-Er-.Li-a, AI Shi-Ka-Er-.Mu-Dou-La-a. Uyghur Homograph Disambiguation Based on Classification and Optimal Mapping Pronunciation[J]. Computer Engineering, 2012, 38(18): 22-25.

姑丽加玛丽.麦提艾力a, 艾斯卡尔.孜b, 古丽娜尔.力a, 艾斯卡尔.木都拉a. 基于分类及最佳匹配读音的维吾尔多音词消歧[J]. 计算机工程, 2012, 38(18): 22-25.

/ Recommend / Download Citations

URL:

https://www.ecice06.com/EN/Y2012/V38/I18/22

[1]	ZHANG Heping, FANG Zhijun, LU Junxin, GAO Yongbin. Few-Shot Relation Classification Based on Knowledge-Enhanced Adaptive Prototype Networks [J]. Computer Engineering, 2025, 51(4): 129-136.
[2]	YIN Zhaoliang, HUANG Yuxin, YU Zhengtao, WANG Guanwen, AI Chuanxian. A Method for Analyzing News Themes Involving Cases with Integrated Crime Classification [J]. Computer Engineering, 2025, 51(4): 208-216.
[3]	ZHANG Heping, ZHANG Hegui, XIE Xiaoyao, ZHANG Taihua, ZHANG Sicong, YU Guojun. Network Embedding Based on k-core Decomposition [J]. Computer Engineering, 2025, 51(2): 139-148.
[4]	YANG Wangda, WAN Yaping, ZOU Gang, MIN Xiaoshan, WANG Yi, LU Yucheng. Research on Deep Learning Classification Method for Testing Eye Status of Driving Quality Deficiency [J]. Computer Engineering, 2025, 51(2): 149-158.
[5]	YAO Lifeng, CAI Manchun, ZHU Yi, CHEN Yonghao, ZHANG Yiwen. Encrypted Traffic Classification Model Based on Byte Coding and Pre-Training Tasks [J]. Computer Engineering, 2025, 51(2): 188-201.
[6]	MA Hengzhi, QIAN Yurong, LENG Hongyong, WU Haipeng, TAO Wenbin, ZHANG Yiyang. Review of Research Progress on Knowledge Graph Embedding [J]. Computer Engineering, 2025, 51(2): 18-34.
[7]	WANG Xiang, WEI Yuxin, MAO Guojun. A Graph Pooling Method Fusing Multiple Structures and Features of Graph Data [J]. Computer Engineering, 2025, 51(1): 128-137.
[8]	ZHANG Xinbo, ZHANG Xueying, HUANG Lixia, CHEN Guijun. Classification Algorithm and Application Based on Semi-Supervised Deep Auto-Encoder Network [J]. Computer Engineering, 2025, 51(1): 71-80.
[9]	CAI Junmin, LIANG Zhengyou, SUN Yu, CHEN Ziao. Research on Lightweight Point Cloud Classification Based on Deformable 3D Graph Convolution [J]. Computer Engineering, 2024, 50(9): 255-265.
[10]	WANG Yanguo, LÜ Pengyuan, LAN Jinjiang, LIU Mingzhe, QIN Guanjun, ZHANG Shuohua, ZHOU Yu. Wind Turbine Fault Classification Method Based on Adversarial Training and Transformer [J]. Computer Engineering, 2024, 50(9): 377-384.
[11]	LI Weigang, LI Xuchang, TIAN Zhiqiang, LI Jinling. Research on Point Cloud Classification and Its Robustness Based on Self-Distillation Framework [J]. Computer Engineering, 2024, 50(9): 72-81.
[12]	LI Junyi, LI Xiangyang, LONG Chaoxun, LI Haiyan, LI Hongsong, YU Pengfei. Wild Mushroom Classification Based on Multi-level Region Selection and Cross-layer Feature Fusion [J]. Computer Engineering, 2024, 50(9): 179-188.
[13]	Han CHEN, Chunlei ZHAO, Haoda JIANG, Chundong WANG. Research on App User Intent Recognition Based on Fusion Model and Semantic Network [J]. Computer Engineering, 2024, 50(8): 50-63.
[14]	Lai QIAN, Weiwei ZHAO. Text Classification Method Based on Contrastive Learning and Attention Mechanism [J]. Computer Engineering, 2024, 50(7): 104-111.
[15]	YAN Yinkai, PENG Ningning, YI Lisha. Skewed Time Series Classification Algorithm Based on Persistent Homology [J]. Computer Engineering, 2024, 50(6): 110-123.

Please choose a citation manager

Content to export

Uyghur Homograph Disambiguation Based on Classification and Optimal Mapping Pronunciation

基于分类及最佳匹配读音的维吾尔多音词消歧

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Uyghur Homograph Disambiguation Based on Classification and Optimal Mapping Pronunciation

基于分类及最佳匹配读音的维吾尔多音词消歧

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments