作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (17): 256-258,261. doi: 10.3969/j.issn.1000-3428.2011.17.086

• 开发研究与设计技术 • 上一篇    下一篇

面向ETL的数据起源追踪系统

戴超凡,王 涛   

  1. (国防科学技术大学信息系统与管理学院信息系统工程重点实验室,长沙 410073)
  • 收稿日期:2011-02-28 出版日期:2011-09-05 发布日期:2011-09-05
  • 作者简介:戴超凡(1973-),男,副教授、博士,主研方向:智能辅助决策;王 涛,硕士研究生

Data Provenance Tracing System for Extraction-Transform-Load

DAI Chao-fan, WANG Tao   

  1. (Science and Technology on Information Systems Engineering Laboratory, College of Information System and Management, National University of Defense Technology, Changsha 410073, China)
  • Received:2011-02-28 Online:2011-09-05 Published:2011-09-05

摘要: 提出一种面向提取-转换-加载(ETL)过程的数据起源追踪系统,讨论实现的关键技术,包括转换分类、元数据设计、转换序列构建、追踪流程设计以及不同转换的追踪方法。系统将追踪所需的元数据设计在包文件结构中,在逆向追踪时抽取元数据进行相关处理,构建各个层次的转换起源信息图,从而实现数据起源的追踪。

关键词: 数据起源, 起源管理系统, 提取-转换-加载, 同步/异步转换

Abstract: This paper proposes a lineage tracing frame for Extraction-Transform-Load(ETL) process, and discusses some key issues in this system, such as classifying the transformations, the designing of metadata, constructing transformation series, tracing process design and the tracing methods for every type of the transformation. The metadata for tracing is injected into the package file, these information are extracted when tracing query were proposed to support data tracing.

Key words: data provenance, provenance management system, Extraction-Transform-Load(ETL), synchronous/asynchronous transformation

中图分类号: