作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (8): 38-40. doi: 10.3969/j.issn.1000-3428.2008.08.013

• 博士论文 • 上一篇    下一篇

语义异构生物数据源中的数据集成与更新

杨 森1,夏 燕1,曹顺良2,邓绪斌1,朱扬勇1,2   

  1. (1. 复旦大学上海(国际)数据库研究中心,上海 200433;2. 上海生物信息技术研究中心,上海 200235)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-04-20 发布日期:2008-04-20

Data Integration and Update in Semantic Heterogeneous Biological Data Sources

YANG Sen1, XIA Yan1, CAO Shun-liang2, DENG Xu-bin1, ZHU Yang-yong1,2   

  1. (1. Shanghai (International) Database Research Center, Fudan University, Shanghai 200433; 2. Shanghai Center for Bioinformatics Technology, Shanghai 200235)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-04-20 Published:2008-04-20

摘要: 针对生物数据源的分布性、异构性和动态性等特性,探讨生物信息技术服务支撑系统整体解决方案,构建基于基因本体的信息集成模式以实现生物语义学上的数据集成。设计一种以半结构化形式规范生物元数据及基于MD5算法的增量更新技术,用以解决通用扩展性和效率问题,实现生物数据仓库中数据的共享并提高管理效率。

关键词: 基因本体, 半结构化, 增量更新, MD5算法

Abstract: For the characters of distribution, heterogeneity and dynamic of biological data, a resolution of the service system for bioinformatics technology is presented, and an approach of biological data integration based on Gene Ontology(GO) is proposed in order to realize biological semantic integration. Semi-structured incremental updating method to standardize biological metadata with MD5 algorithm to improve the updating efficiency is designed, which resolves the data sharing and the efficiency of data management in biological data warehouse.

Key words: Gene Ontology(GO), semi-structured, incremental update, MD5 algorithm

中图分类号: