Research on Coefficient of Neighboring Dialect Differences Based on Hidden Markov Model

doi:10.3969/j.issn.1000-3428.2016.04.032

Abstract

Abstract: In the research of quantifying neighboring area’s dialect difference,this paper makes people read the independent word text A in dialect to form the sound file M,uses the HTK tool to structure the acoustic feature parameter set S_M for the file M,calculates and forms the diversity factor.Using this method in continuous i neighboring regions forms the homologous parameter set Si_Mi,while using the sound file Mi to compare to the sound-character (word) mapping table of the sample area (i=0),obtaining the text Ai of the village i.The ratio of text content differences between Ai and A0 (sample area or village) is defined as diversity factor ξ.By analyzing ξ feature of continuous villages,the paper finds that the dialect has less difference when the ξ value is between 0.88 and 1 in neighboring 3 villages(geographic location),while in 9 villages distance,the ξ value(synthesize) less than 0.6,and the ξ value of phrases less than 0.2,this difference changes quickly,so this paper establishes the dialect distance and proposes the concept of dialect radius,confirming that the dialect radius is eight(eight villages).

Key words: dialect sound, coefficient of dialect differences, Hidden Markov Toolkit(HTK) software, Hidden Markov Model(HMM), dialect radius

摘要： 量化邻近地域的方言差异性研究,运用方言朗读独立字词文本A形成声音文件M,使用HTK工具将M文件构造为声学特征参数集S_M,计算方言差异系数。在邻近连续i个地域基础上得到相应的Si_Mi,同时使声音Mi结合对比样本区域(i=0)音-字(词)映射表,形成i村落并对应文本Ai。差异系数ξ定义为Ai与A0(样本区域或村落)之间的文本内容差异之比。分析连续古村落ξ值特征结果表明,方言在邻近3个村落(地理位置)的ξ值介于0.88~1时,差异较小,而当邻近9个村落的ξ值(综合)小于0.6及词组ξ值小于0.2时,差异快速变大,建立方言距离并提出方言半径概念,确认所测试方言的半径为8(8个村落)。

关键词: 方言语音, 方言差异系数, HTK软件, 隐马可夫模型, 方言半径

CLC Number:

TP18

WANG Xuefei,LIU Jun. Research on Coefficient of Neighboring Dialect Differences Based on Hidden Markov Model[J]. Computer Engineering, doi: 10.3969/j.issn.1000-3428.2016.04.032.

王雪飞,刘珺. 基于隐马可夫模型的邻近方言差异系数研究[J]. 计算机工程, doi: 10.3969/j.issn.1000-3428.2016.04.032.

/ / Recommend / Download Citations

URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2016.04.032

http://www.ecice06.com/EN/Y2016/V42/I4/179

References

参考文献［1］Zissman M A.Comparison of Four Approaches to Automatic Language Identification of Telephone Speech［J］.IEEE Transactions on Speech and Audio Processing,1996,4(1):31-34. ［2］Fu Qiang,Murphy P.A Robust Joint Estimation Algorithm of Glottal Source and Vocal Tract Models［J］.IEEE Transactions on Speech and Audio Processing,2006,14(2):492-501. ［3］Jamieson P V.Acoustic Discrimination of Pathological Voice:Sustained Vowels Versus Continuous Speech［J］.Journal of Speech,Language,and Hearing Research,2001,44(2):327-339. ［4］Richardson F S,Campbell W M.Language Recognition with Discriminative Keyword Selection［C］//Pro-ceedings of ICASSP’08.Washington D.C.,USA:IEEE Press,2008:4145-4148. ［5］梁春燕,杨琳,汪俊杰,等.音子配列学语种识别系统中特征选择方法的研究［J］.声学学报,2013,(2):212-213. ［6］钱盛友,许慧燕.基于动态时间规整和神经网络的方言辨识研究［J］.计算机工程与应用,2008,44(10):211-213. ［7］刘杰,莫超.兰州话阻塞辅音的声学研究［D］.兰州:西北民族大学,2012. ［8］Torres-Carrasquillo P A,Reynolds D A,Deller J R.Language Identification Using Gaussianm Ixturemodel Tokenization［C］//Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing.Orlando,USA:IEEE Press,2002:215-223. ［9］Zissman M A.Comparison of Four Approaches to Automatic Language Identification of Telephone Speech［J］.IEEE Transactions on Speech and Audio Processing,1996,4(1):31-34. ［10］顾明亮,马勇.基于高斯混合模型的汉语方言辨识系统［J］.计算机工程与应用2007,43(3):204-205. ［11］Rabiner L R,Juang B H.Fundamentals of Speech Recognition［M］.Trenton,USA:Prentice Hall,1993. ［12］Makhoul J,Schwartz R.The Voice of the Computer is Heard in the Land and it Listens Too［J］.IEEE Spectrum,1997,34(12):39-47. ［13］Scharenborg O.Reaching over the Gap:A Review of Efforts to Link Human and Automatic Speech Reco-gnition Research［J］.Speech Communication,2007,49:336-347. ［14］Brian K,Wng M.Subspace Distribution Clusting Hidden Markov Model［D］.Corvallis,USA:Oregon Graduate Institute of Science and Technology,2010. ［15］Deller J R,Proakis J G,Hansen J H L.Discrete-time Processing of Speech Signals［M］.Macmillian,The Republic of South Africa:Macmillian Publishing Company,1993. ［16］孟庆惠.徽州方言［M］.合肥:安徽人民出版社,2005. ［17］赵日新.徽语的特点和分区［J］.方言,2005,(3):279-286. ［18］Young S,Kershaw D,Odell J.The HTK Book(for HTK Version 3.0)［Z］.Microsoft Corporation,2000. ［19］李春,王作英.基于语音学分类的汉语三音子识别单元的算法［J］.清华大学学报,2003,43(l):16-19. ［20］Qin Zengchang,Lawry J.Decision Tree Learning with Fuzzy Abels［J］.Information Science,2005,173(2):255-275. ［21］Young S.An Application Toolkit for HTK［EB/OL］.(2007-01-01).http://htk.eng.cam.ac.uk. 编辑索书志

[1]	SUN Zhongjun, ZHAI Jiangtao. A Network Application Identification Method for Encrypted Traffic [J]. Computer Engineering, 2020, 46(4): 151-156.
[2]	BAI Lingling, NING Zhenhu, XUE Fei, YANG Yongli. Application of Hidden Markov Model in Malicious Domain Name Detection [J]. Computer Engineering, 2019, 45(9): 161-168.
[3]	HUANG Juanjuan,XU Yuan,ZHU Qunxiong. 3D map matching algorithm for scenic spot based on improved hidden Markov model [J]. Computer Engineering, 2019, 45(6): 259-266.
[4]	WU Jianwei,LI Yanling,ZANG Hanlin. Cognitive Network Throughput Optimization Method Based on Improved Frame Structure [J]. Computer Engineering, 2018, 44(6): 45-49.
[5]	HU Zhilong,WEN Chang,XIE Kai,HE Jianbiao. Voiceprint Password Recognition Algorithm Fusing with HMM-UBM and RVM [J]. Computer Engineering, 2018, 44(11): 129-134.
[6]	LIU Bo,DU Jianqiang,NIE Bin,LIU Lei,ZHANG Xin,HAO Zhulin. Part-of-speech Tagging of Traditional Chinese Medicine Diagnosis Ancient Prose Based on Second-order HMM [J]. Computer Engineering, 2017, 43(7): 211-216.
[7]	CUI Jianguo,GAO Bo,JIANG Liying,YU Mingyue,ZHENG Wei. Application Research of LSSVM and HMM in Aeroengine Condition Prediction [J]. Computer Engineering, 2017, 43(10): 310-315.
[8]	GAO Zhenbin,BAI Xue,YANG Song,HE Jiaji. Hardware Trojan Detection Method Based on Hidden Markov Model [J]. Computer Engineering, 2016, 42(9): 126-131.
[9]	GE Yongkan,YU Fengqin. Improved Speech Synthesis Algorithm Based on Harmonic Plus Noise Excitation Model [J]. Computer Engineering, 2016, 42(12): 278-281,289.
[10]	WANG Xingfu,WANG Yuqi. Sequence Classification Method Based on Neighborhood Information in Unconstrained Space [J]. Computer Engineering, 2016, 42(1): 311-315.
[11]	CAI Wenxue,QIU Zhucheng,HUANG Xiaoyu,XIAO Chaowu,CHEN Kang. Indoor Track Positioning Model Based on WiFi Fingerprint [J]. Computer Engineering, 2015, 41(6): 76-82.
[12]	HUANG Zhen-xiang, PENG Bo, WU Juan, WANG Ru-peng. Gesture Recognition Based on DTW and Combined Discriminative Feature Detector [J]. Computer Engineering, 2014, 40(5): 216-218,223.
[13]	XIAO Jia-lin, ZHAO Yu-qing, WANG Ying. Voice Activity Detection Based on HMM and SVM [J]. Computer Engineering, 2014, 40(1): 203-208.
[14]	LE Juan, DIAO Xi. Algorithm of Beijing Opera Organization Names Entity Recognition Based on HMM [J]. Computer Engineering, 2013, 39(6): 266-271,286.
[15]	FENG Chao, HUANG Kai-Qi, XU Tian-Shun. Communication Situation Estimating Method Based on HMM [J]. Computer Engineering, 2013, 39(2): 6-11.

Please choose a citation manager

Content to export