作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 体系结构与软件技术 • 上一篇    下一篇

基于语义数据的药物网络模型构建与分析

王爽a,b,冯志勇a,b   

  1. (天津大学 a.计算机科学与技术学院; b.天津市认知计算与应用重点实验室,天津 300072)
  • 收稿日期:2015-06-09 出版日期:2016-06-15 发布日期:2016-06-15
  • 作者简介:王爽(1991-),女,硕士研究生,主研方向为语义网络;冯志勇,教授、博士。
  • 基金资助:
    国家自然科学基金资助项目(61173155,61373035);国家“863”计划基金资助项目(2007AA01Z130,2013AA013204)。

Construction and Analysis of Drug Network Model Based on Semantic Data

WANG Shuang a,b,FENG Zhiyong a,b   

  1. (a.School of Computer Science and Technology; b.Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin University,Tianjin 300072,China)
  • Received:2015-06-09 Online:2016-06-15 Published:2016-06-15

摘要: 针对传统生物数据分析方法无法高效处理规模不断增大的生物语义数据集的现状,将基于属性共现的节点相似度算法应用于ChEMBL数据集,构建基于药物天然产物-活性的二部图模型,应用Graphlab框架计算基于活性特征的药物天然产物相似度,并对相似度较高的药物天然产物进行活性推荐。实验结果表明,该方法能有效利用生物数据集的语义信息发现药物天然产物潜在的活性特征,从而指导药物研发早期的活性探测以及药物靶标的发现和选择过程。

关键词: 语义数据, ChEMBL数据集, 活性, Graphlab并行计算, 节点相似度算法

Abstract: For the reason that the traditional biological analysis method is unable to handle the semantic information effectively,this paper applies the node similarity algorithm based on attribute co-occurrence to ChEMBL database and constructs the bipartite graph based on nautral product and activity.Then,with the framework of Graphlab,it calculates the natural product similarity based on activity and recommends the natural products with high similarity.Experimental results show that the method can effectively use the semantic information of biological datasets to find out the potential activities of natural products,thus guiding the activity detection and drug target discovery and selection in the early stage of drug research.

Key words: semantic data, ChEMBL dataset, activity, Graphlab parallel computing, node similarity algorithm

中图分类号: