作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (9): 181-183. doi: 10.3969/j.issn.1000-3428.2011.09.063

• 人工智能及识别技术 • 上一篇    下一篇

基于频谱分析的串联重复序列识别方法

聂俊岚,毛伟伟,王常武,王宝文,刘文远   

  1. (燕山大学信息科学与工程学院,河北 秦皇岛 066004)
  • 出版日期:2011-05-05 发布日期:2011-05-12
  • 作者简介:聂俊岚(1962-),女,教授,主研方向:生物信息学,虚拟仿真,图像处理;毛伟伟,硕士研究生;王常武,教授;王宝文,副教授;刘文远,教授
  • 基金资助:
    河北省教育厅自然科学研究计划基金资助项目(2009339)

Identification Method of Tandem Repeats Based on Spectral Analysis

NIE Jun-lan, MAO Wei-wei, WANG Chang-wu, WANG Bao-wen, LIU Wen-yuan   

  1. (College of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China)
  • Online:2011-05-05 Published:2011-05-12

摘要: 针对现有串联重复序列识别方法存在的计算量大、灵敏度低等问题,提出一种基于频谱分析的串联重复序列识别方法。该方法采用碱基的电子离子相互作用势作为基因序列数字化表示的方法,通过对数字序列作离散傅里叶变换得到序列中串联重复序列出现的频率,并对基因序列做加窗傅里叶变换,找出串联重复序列存在的位置。实验表明,该方法的计算量较已有方法减少了75%,并能较好地解决已有方法识别灵敏度低的缺点。

关键词: 串联重复序列, 离散傅里叶变换, 电子离子相互作用势, 频谱分析, 信噪比

Abstract: Aiming at the drawbacks of the existing tandem repeats finding methods, such as large number of calculations and feeble sensitivity, this paper presents a tandem repeats identification method which is based on spectral analysis. The technique employs the Electron-Ion Interaction Potential(EIIP) of each nucleotide as the numerical representation for DNA sequence, and obtains the occurrence frequency of the tandem repeats which is buried in the sequence after computing the Discrete Fourier Transform(DFT) of the sequence. The windowed Fourier transform is used, and the tandem repeats location is identified efficiently. Experiment demonstrates that the calculation amount is reduced by 75% compared with the existing methods, and greatly resolves the feeble sensitivity of the existing techniques.

Key words: tandem repeats, Discrete Fourier Transform(DFT), Electron-Ion Interaction Potential(EIIP), spectral analysis, Signal to Noise Ratio(SNR)

中图分类号: