作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2019, Vol. 45 ›› Issue (2): 154-159. doi: 10.19678/j.issn.1000-3428.0049931

• 人工智能及识别技术 • 上一篇    下一篇

基于网络表示学习的miRNA功能相似性研究

阮璐,熊赟   

  1. 复旦大学 计算机科学技术学院,上海 201203
  • 收稿日期:2018-01-02 出版日期:2019-02-15 发布日期:2019-02-15
  • 作者简介:阮璐(1992—),女,硕士研究生,主研方向为数据挖掘、数据科学;熊赟,教授
  • 基金资助:

    国家高技术研究发展计划(2015AA020105-10);上海市科委基金(16JC1400800,17511105502)

Research on Functional Similarity of miRNA Based on Network Representation Learning

RUAN Lu,XIONG Yun   

  1. School of Computer Science,Fudan University,Shanghai 201203,China
  • Received:2018-01-02 Online:2019-02-15 Published:2019-02-15

摘要:

miRNA是一类重要的非编码小RNA分子,与癌症等疾病有密切的关系。目前研究者已经识别大量miRNA,但是多数miRNA的功能仍然未知。为此,提出一种网络表示学习的miRNA功能相似性计算方法。通过miRNA的相关数据集如目标基因和关联疾病可以有效地计算miRNA的功能相似性,从而预测疾病相关的候选miRNA。利用不同类型生物数据集构建miRNA相关多源网络,采用网络表示学习的方式为网络中的每一个miRNA节点学习一个特征向量,进而使用特征向量来衡量miRNA的相似性。实验结果表明,与DeepWalk方法相比,该方法在同一家族的miRNA中能够取得较高的得分,并且可以在已有的数据库中找到疾病候选miRNA验证记录。

关键词: 功能相似性, 网络表示学习, 网络嵌入, 多源网络, 特征向量

Abstract:

miRNA is an important class of non-coding small RNA molecules that are closely related to diseases such as cancer.Researchers have identified a large number of miRNA,but the function of most miRNA remains unknown.Therefore,a method for calculating the similarity of miRNA functions based on network representation learning is proposed.The functional similarity of miRNA can be efficiently calculated by relevant data sets of miRNA such as target genes and associated diseases,thereby predicting disease-related candidate miRNA.miRNA-related multi-source network is constructed by utilizing different types of biological data sets.The network representation learning method is used to learn a feature vector for each miRNA node in the network,and the similarity of the miRNA is measured by the learned feature vector.Experimental results show that compared with the DeepWalk method,the method can obtain higher scores in the same family of miRNA,and the disease candidate miRNA verification records can be found in the existing database.

Key words: functional similarity, network representation learning, network embedding, multi-source network, feature vector

中图分类号: