Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2008, Vol. 34 ›› Issue (8): 38-40. doi: 10.3969/j.issn.1000-3428.2008.08.013

• Degree Paper • Previous Articles     Next Articles

Data Integration and Update in Semantic Heterogeneous Biological Data Sources

YANG Sen1, XIA Yan1, CAO Shun-liang2, DENG Xu-bin1, ZHU Yang-yong1,2   

  1. (1. Shanghai (International) Database Research Center, Fudan University, Shanghai 200433; 2. Shanghai Center for Bioinformatics Technology, Shanghai 200235)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-04-20 Published:2008-04-20

语义异构生物数据源中的数据集成与更新

杨 森1,夏 燕1,曹顺良2,邓绪斌1,朱扬勇1,2   

  1. (1. 复旦大学上海(国际)数据库研究中心,上海 200433;2. 上海生物信息技术研究中心,上海 200235)

Abstract: For the characters of distribution, heterogeneity and dynamic of biological data, a resolution of the service system for bioinformatics technology is presented, and an approach of biological data integration based on Gene Ontology(GO) is proposed in order to realize biological semantic integration. Semi-structured incremental updating method to standardize biological metadata with MD5 algorithm to improve the updating efficiency is designed, which resolves the data sharing and the efficiency of data management in biological data warehouse.

Key words: Gene Ontology(GO), semi-structured, incremental update, MD5 algorithm

摘要: 针对生物数据源的分布性、异构性和动态性等特性,探讨生物信息技术服务支撑系统整体解决方案,构建基于基因本体的信息集成模式以实现生物语义学上的数据集成。设计一种以半结构化形式规范生物元数据及基于MD5算法的增量更新技术,用以解决通用扩展性和效率问题,实现生物数据仓库中数据的共享并提高管理效率。

关键词: 基因本体, 半结构化, 增量更新, MD5算法

CLC Number: