
Computer Engineering ›› 2008, Vol. 34 ›› Issue (11): 74-76. doi: 10.3969/j.issn.1000-3428.2008.11.027

• Software Technology and Database •

Design and Implementation of Unicode Data Warehouse ETL

XU Wei, LI Mao-qing   

  1. (Dept. of Automation, Xiamen University, Xiamen 361005)
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2008-06-05 Published: 2008-06-05

Unicode数据仓库ETL的设计与实现

许 威,李茂青   

  1. (厦门大学自动化系,厦门 361005)

Abstract: When Unicode data is loaded, information is lost whenever a source character is not defined in the target character set. To address this, the paper presents a fault-tolerant ETL solution to the source/target character set conversion problem that arises when populating Unicode data from an Oracle 9i database into a Teradata data warehouse. Practical results demonstrate its efficiency and accuracy.
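The failure mode described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: Python codec names stand in for the Oracle and Teradata character sets, and the helper names `find_unmappable` and `encode_with_fallback` are hypothetical. The sketch detects characters that the target encoding cannot represent, substitutes a placeholder, and returns the rejected characters so that a load job could log them rather than abort.

```python
from typing import List, Tuple


def find_unmappable(text: str, target_encoding: str) -> List[Tuple[int, str]]:
    """Return (index, character) pairs that the target encoding cannot represent."""
    unmappable = []
    for index, char in enumerate(text):
        try:
            char.encode(target_encoding)
        except UnicodeEncodeError:
            unmappable.append((index, char))
    return unmappable


def encode_with_fallback(
    text: str, target_encoding: str, replacement: str = "?"
) -> Tuple[bytes, List[Tuple[int, str]]]:
    """Encode text for the target, replacing unmappable characters.

    Assumption: the replacement character itself exists in the target encoding.
    Returns the encoded bytes plus the rejected characters for logging.
    """
    rejected = find_unmappable(text, target_encoding)
    rejected_positions = {index for index, _ in rejected}
    safe_text = "".join(
        replacement if index in rejected_positions else char
        for index, char in enumerate(text)
    )
    return safe_text.encode(target_encoding), rejected


if __name__ == "__main__":
    # "latin-1" is only a stand-in for a narrower target character set.
    encoded, rejected = encode_with_fallback("A\u20acB", "latin-1")
    print(encoded, rejected)
```

Returning the rejected characters alongside the encoded bytes keeps the substitution visible, so a row with lost characters can be audited later instead of failing the whole load or being corrupted silently.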

Key words: character set, data warehouse, Unicode

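Abstract (translated from the Chinese): During Unicode data loading, if a character in the source character set is not defined in the target character set, an error occurs and information is lost. To address this, the paper proposes an ETL design method and implementation for character set conversion from a source Oracle database to a target Teradata data warehouse. Practice shows that the scheme is effective and feasible and improves the fault tolerance of the ETL process.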

Key words (translated from the Chinese): character set, data warehouse, Unicode standard
