作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (22): 32-33. doi: 10.3969/j.issn.1000-3428.2010.22.011

• 博士论文 • 上一篇    下一篇

寡核苷酸芯片的逐步探针选取算法

彭胜蓝1,周一鸣2   

  1. (1. 景德镇陶瓷学院信息工程学院,江西 景德镇 333403;2. 清华大学生物科学与技术系,北京 100084)
  • 出版日期:2010-11-20 发布日期:2010-11-20
  • 作者简介:彭胜蓝(1975-),男,博士,主研方向:生物信息学;周一鸣,博士
  • 基金资助:

    国家自然科学基金资助项目(60961003)

Stepwise Probe Selection Algorithm for Oligonucleotide Array

PENG Sheng-lan1, ZHOU Yi-ming2   

  1. (1. School of Information Engineering, Jingdezhen Ceramic Institute, Jingdezhen 333403, China;2. Department of Biological Sciences & Biotechnology, Tsinghua University, Beijing 100084, China)
  • Online:2010-11-20 Published:2010-11-20

摘要:

探针集的挑选是寡核苷酸芯片设计过程中最重要的部分。基于合成探针成本的考虑,探针的个数成为评价探针集优劣的一个最重要的指标。一个好的探针挑选算法应该挑选出尽可能少的探针。为此,对探针选取的贪心算法作了改进,提出一个类似于逐步向前回归算法的探针选取算法。该算法在每次向探针集加入边际效用最大的探针的同时,把边际效用没有或者很小的探针从探针集中剔除出去。对HLA等位基因数据的实验结果表明,逐步选取算法得到的探针集优于贪心算法挑选出的探针集。

关键词: 基因分型, 寡核苷酸芯片, 探针选取问题, 贪心算法, 逐步算法

Abstract:

Probe selection plays an essential role in the procedure of designing oligonucleotide array. Considering the cost of synthesizing probes, the size of probe set becomes the most important criterion to evaluate the probe set. A good probe selection algorithm should produce a small number of probes as possible. This paper makes an improvement for the greedy algorithm of Herwig’s which is based on Shannon entropy as a quality criterion. The new algorithm is similar to the forward stepwise regression search algorithm. While adding the probes with maximal marginal effect, those probes with little marginal effect are dropped from the optimization set. Experimental results making on the data sets of HLA alleles show that the probe set produced by the algorithm is superior to the probe set selected by greedy algorithm.

Key words: genotyping, oligonucleotide array, probe selection problem, greedy algorithm, stepwise algorithm

中图分类号: