作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (10): 68-72. doi: 10.3969/j.issn.1000-3428.2009.10.022

• 软件技术与数据库 • 上一篇    下一篇

基于模式树的XETL过程研究

郭有限,张东站

  

  1. (厦门大学计算机科学系,厦门 361005)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-05-20 发布日期:2009-05-20

Research on XETL Process Based on Pattern Tree

GUO You-xian, ZHANG Dong-zhan   

  1. (Department of Computer Science, Xiamen University, Xiamen 361005)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-05-20 Published:2009-05-20

摘要: XML数据与传统的关系型数据存在的差异,使得传统数据仓库的ETL方法已经不适用于XML数据,而目前也没有专门的、有效的适用于XML数据的ETL方法。针对这一问题,提出基于模式树的XML转换处理过程——XETL。从数据模型和谓词模式研究XETL模型,基于XETL模型定义ETL过程中属性选择、空置处理、聚合以及属性重命名4类主要的转换处理操作。

关键词: 模式树, XML数据仓库, XETL过程

Abstract: Because of the existing differences between XML data and the traditional relational data, the traditional method of data warehouse ETL is no longer suitable for dealing with XML data. This paper proposes the XETL method which is based on pattern tree and can be applied to transfer and deal with XML data. This paper starts with the research on XETL pattern based on data model and predicate model, and defines the four main transference operations in the XETL process based on the XETL model, , which are attribute selection, null attribute operation, aggregation and attribute renamed.

Key words: pattern tree, XML data warehouse, XETL process

中图分类号: