作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (2): 166-168. doi: 10.3969/j.issn.1000-3428.2011.02.057

• 人工智能及识别技术 • 上一篇    下一篇

多序列的近似LCS改进算法

胡 婕1,业 宁1,2,罗晓波1,崔 静1,董程玲1   

  1. (1. 南京林业大学信息科学技术学院,南京 210037;2. 山东大学计算机科学技术学院,济南 250101)
  • 出版日期:2011-01-20 发布日期:2011-01-25
  • 作者简介:胡 婕(1987-),女,硕士,主研方向:数据挖掘,生物信息学,算法分析与设计;业 宁,博士;罗晓波、崔 静、董程玲,硕士研究生
  • 基金资助:

    江苏省自然科学基金资助项目(BK2009393)

Improved Approximation Algorithms for Longest Common Subsequence of Multiple Sequences

HU Jie1, YE Ning1,2, LUO Xiao-bo1, CUI Jing1, DONG Cheng-ling1   

  1. (1. School of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, China; 2. School of Computer Science and Technology, Shandong University, Jinan 250101, China)
  • Online:2011-01-20 Published:2011-01-25

摘要:

提出2种针对3条源序列的近似LCS算法,近似因子均为1/|?|。其中,线性近似LCS算法的时空复杂度均为 , 为最长源序列的长度,适于解决大规模问题。递归近似LCS算法时空复杂度均为O(nlogn),适于要求高精度问题。同时,这2种算法都能用于解决多序列的LCS和CLCS问题。实验验证了这2种算法的有效性。

关键词: 生物信息学, 最长公共子序列, 近似因子, 约束

Abstract:

This paper proposes two approximation algorithms for LCS problem of three input strings. Even if the approximation factors of the two algorithms are still 1/|?|, they can both get much better results in comparison of Gotthilf’s approximating LCS algorithm in most cases. Linear approximating LCS algorithm, takes O(n) time and O(n) space, where n is the length of the longest input sequence. It can work well in the case of large scale input strings. Recursive approximating LCS algorithm, whose time and space complexities both are O(nlogn), can receive the best precision among these algorithms at the most time. It is applicable to meet the desire of good precision for corresponding LCS problem. The two algorithms can be used to solve the LCS problem of multiple sequences and CLCS problem of multiple sequences. Experimental results prove the validity of the two algorithms.

Key words: bioinformatics, Longest Common Subsequence(LCS), approximation factor, constraint

中图分类号: