作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 人工智能及识别技术 • 上一篇    下一篇

一种快速精确的个体单体型重建算法

吴璟莉,梁彬彬,李志欣,王 华   

  1. (广西师范大学计算机科学与信息工程学院,广西 桂林 541004)
  • 收稿日期:2012-08-13 出版日期:2013-09-15 发布日期:2013-09-13
  • 作者简介:吴璟莉(1978-),女,副教授、博士、CCF会员,主研方向:生物信息学;梁彬彬,硕士研究生;李志欣,副教授、博士;王 华,硕士研究生
  • 基金资助:
    国家自然科学基金资助项目(61165009);广西自然科学基金资助项目(2011GXNSFB018068, 2012GXNSFAA053219);“八桂学者”工程专项基金资助项目;广西高等学校科学技术研究基金资助项目(2013YB028)

A Fast and Accurate Reconstruction Algorithm for Individual Haplotype

WU Jing-li, LIANG Bin-bin, LI Zhi-xin, WANG Hua   

  1. (College of Computer Science and Information Technology, Guangxi Normal University, Guilin 541004, China)
  • Received:2012-08-13 Online:2013-09-15 Published:2013-09-13

摘要: 在最少错误更正模型的基础上,提出一种重建单体型的启发式算法H-MEC。按照单体型的单核苷酸多态性(SNP)位点顺序依次构建算法步骤,根据某SNP位点取值将覆盖该SNP位点的片段划分为2个集合,利用包含片段数较多集合中的片段进行重建。使用HapMap计划发布的CEPH样本中的60个个体,在1号染色体的单体型上进行实验。结果表明,H-MEC算法在各种参数设置下,能获得较Fast Hare算法和DGS算法更高的单体型重建率。此外,该算法在重建长单体型时也具有较高的执行效率。

关键词: 单核苷酸多态性, 单体型, 最少错误更正, 启发式, 重建

Abstract: A heuristic algorithm for haplotype reconstrucion, named H-MEC, is proposed based on the Minimum Error Correction(MEC) model. H-MEC reconstructs the columns of a pair of haplotypes one by one. It partitions the Single Nucleotide Polymorphisms(SNP) fragments that cover some SNP site into two sets according to the values of the corresponding SNP site, and reconstructs haplotypes by using the fragments of the set which contains more fragments. The experiments are conducted by using the haplotypes on the chromosomes 1 of 60 individuals in the CEPH sample, which are released by the international HapMap project. Experimental results indicate that under various parameter settings, H-MEC can obtain higher reconstruction rate than Fast Hare algorithm and DGS algorithm. Moreover, H-MEC still has high efficiency even for reconstructing long haplotypes.

Key words: Single Nucleotide Polymorphisms(SNP), haplotype, Minimum Error Correction(MEC), heuristic, reconstruction

中图分类号: