Computer Engineering ›› 2007, Vol. 33 ›› Issue (05): 1-3. doi: 10.3969/j.issn.1000-3428.2007.05.001

• Doctoral Dissertation •

Application of Time-delay HMM in Gene Splice Donor Sites Prediction

ZHU Hongmei, WANG Jiaxin, ZHAO Yannan, YANG Zehong

  1. (State Key Laboratory of Intelligent Technology and Systems, Department of Computer, Tsinghua University, Beijing 100084)
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2007-03-05 Published: 2007-03-05

Abstract: To enable hidden Markov models (HMMs) to capture dependencies between non-adjacent observation symbols, a time-delay mechanism is introduced into the standard HMM. The technique changes only how emission probabilities are computed in high-order states; all algorithms that apply to HMMs remain essentially unchanged. A first-order time-delay HMM and a first-order standard HMM are designed, and each is applied to the recognition of splice donor sites in rice genes. The results show that the time-delay model discriminates somewhat better than the standard model. Notably, for sites whose features fit the donor signal poorly, the time-delay model assigns much lower scores.

Key words: Hidden Markov model (HMM), Time-delay, Splice donor site prediction, Rice genome
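
The abstract gives no formulas, so the following is only a rough sketch of how a delayed emission term might look. It assumes that "time-delay" means conditioning the symbol at position i on the symbol `delay` positions upstream, rather than on the immediately preceding symbol (`delay=1`, the standard first-order case). It is a simplified position-specific scoring scheme with no hidden-state machinery, not the authors' model. The names `train_delay_emissions` and `log_score`, the pseudocount, and the toy sequences are all hypothetical.

```python
import math
from collections import defaultdict

ALPHABET = "ACGT"


def train_delay_emissions(sites, delay, pseudocount=1.0):
    """Estimate P(x[i] | x[i - delay]) for each position of aligned sites.

    Assumption: positions with i < delay have no delayed context and fall
    back to an unconditioned distribution P(x[i]), keyed by "".
    """
    length = len(sites[0])
    tables = []
    for i in range(length):
        # counts[context][symbol], initialised with the pseudocount
        counts = defaultdict(lambda: dict.fromkeys(ALPHABET, pseudocount))
        for site in sites:
            context = site[i - delay] if i >= delay else ""
            counts[context][site[i]] += 1
        table = {}
        for context, row in counts.items():
            total = sum(row.values())
            table[context] = {base: c / total for base, c in row.items()}
        tables.append(table)
    return tables


def log_score(site, tables, delay):
    """Sum the log emission probabilities of a candidate site."""
    total = 0.0
    for i, table in enumerate(tables):
        context = site[i - delay] if i >= delay else ""
        row = table.get(context)
        # Contexts never seen in training fall back to a uniform guess.
        p = row[site[i]] if row else 1.0 / len(ALPHABET)
        total += math.log(p)
    return total


if __name__ == "__main__":
    # Made-up toy data (not rice sequences): position 2 always copies
    # position 0, while position 1 varies freely.
    toy_sites = ["GAG", "GCG", "GTG", "GGG", "TAT", "TCT", "TTT", "TGT"]
    for d in (1, 2):
        model = train_delay_emissions(toy_sites, delay=d)
        margin = log_score("GAG", model, d) - log_score("GAT", model, d)
        print(f"delay={d}: score margin between GAG and GAT = {margin:.3f}")
```

On this toy data only the `delay=2` tables can see the dependency between positions 0 and 2, so only they separate a consistent site from a violating one. This mirrors, in a very reduced form, the abstract's observation that the time-delay model gives much lower scores to poorly fitting sites.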