Abstract: Biological data sources are distributed, heterogeneous, and dynamic. To address these characteristics, this paper discusses an overall solution for a bioinformatics technology service support system and builds an information integration model based on Gene Ontology (GO) to achieve data integration at the level of biological semantics. It also designs a technique that standardizes biological metadata in semi-structured form, together with an MD5-based incremental update technique, to address general extensibility and efficiency. Together these enable data sharing in the biological data warehouse and improve the efficiency of data management.
Key words: Gene Ontology (GO), semi-structured, incremental update, MD5 algorithm
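The abstract does not spell out the update procedure, so the following is only a minimal sketch of how an MD5-based incremental update might work, not the authors' implementation. It assumes each source record carries a stable identifier and that the warehouse keeps one MD5 digest per record. The names `record_digest` and `plan_update`, and the sample records, are hypothetical.

```python
import hashlib


def record_digest(record: dict) -> str:
    """Return the MD5 hex digest of a record.

    Fields are serialized in sorted-key order so that the digest does not
    depend on the order in which the source happens to list them.
    """
    canonical = "\x1f".join(f"{key}={record[key]}" for key in sorted(record))
    return hashlib.md5(canonical.encode("utf-8")).hexdigest()


def plan_update(stored_digests: dict, incoming: dict):
    """Compare incoming records with the digests already in the warehouse.

    stored_digests maps record ID -> MD5 digest from the previous load.
    incoming maps record ID -> record (a dict of fields) from the new release.
    Returns three lists of record IDs: (to_insert, to_update, to_delete).
    """
    to_insert, to_update = [], []
    for record_id, record in incoming.items():
        digest = record_digest(record)
        if record_id not in stored_digests:
            to_insert.append(record_id)      # new in this release
        elif stored_digests[record_id] != digest:
            to_update.append(record_id)      # content changed
    # Records that were stored before but are absent from the new release.
    to_delete = [rid for rid in stored_digests if rid not in incoming]
    return to_insert, to_update, to_delete


if __name__ == "__main__":
    # Hypothetical records, for illustration only.
    previous = {
        "P1": {"name": "protein A", "go": "GO:0008150"},
        "P2": {"name": "protein B", "go": "GO:0003674"},
    }
    stored = {rid: record_digest(rec) for rid, rec in previous.items()}
    release = {
        "P1": {"name": "protein A", "go": "GO:0008150"},
        "P2": {"name": "protein B", "go": "GO:0005575"},
        "P3": {"name": "protein C", "go": "GO:0003674"},
    }
    print(plan_update(stored, release))
```

Only records whose digest differs from the stored one are rewritten, so an unchanged record costs one hash computation and one comparison rather than a full reload. MD5 serves here purely as a change-detection fingerprint, not as a security measure.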
YANG Sen, XIA Yan, CAO Shun-liang, DENG Xu-bin, ZHU Yang-yong. Data Integration and Update in Semantic Heterogeneous Biological Data Sources[J]. Computer Engineering (计算机工程), 2008, 34(8): 38-40.