Abstract:
During the process of loading Unicode data, the phenomenon so-called a loss of information will be encountered while the source characters are not defined in the target character sets. In light of this situation, this paper delivers a fault tolerant ETL solution on how to resolve the source/target character set conversion problem while populating Unicode data from Oracle 9i database to Teradata data warehouse. Practical result proves its efficiency and accuracy.
Key words:
character set,
data warehouse,
Unicode
摘要: 在Unicode数据装载过程中,如源字符集中的某个字符在目标字符集中没有定义,将会出现错误,产生信息丢失的现象。针对这种情况,该文提出一种从源Oracle数据库到目标Teradata数据仓库字符集转换的ETL设计方法和实现。实践表明该方案有效可行,能提高ETL过程的容错率。
关键词:
字符集,
数据仓库,
统一字符编码标准
CLC Number:
XU Wei; LI Mao-qing. Design and Implementation of Unicode Data Warehouse ETL[J]. Computer Engineering, 2008, 34(11): 74-76.
许 威;李茂青. Unicode数据仓库ETL的设计与实现[J]. 计算机工程, 2008, 34(11): 74-76.