Abstract:
As data warehouses grow in size, ensuring the performance of ad hoc queries over massive data becomes a major challenge. To address this issue, this paper proposes HDW, a parallel data warehouse architecture built on a PC cluster. It employs Google's GFS and Bigtable for distributed storage management and MapReduce to parallelize OLAP computation tasks. In addition, it provides an XMLA-compliant interface for front-end applications. Experiments conducted on an 18-node cluster show that HDW scales well and can quickly process data sets of at least 10 million tuples.
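The MapReduce-style parallelization of OLAP described above can be illustrated with a minimal sketch. This is not HDW's actual implementation; the tuple layout, names, and the single-machine map/reduce functions below are illustrative assumptions, showing only the general idea of expressing a GROUP BY aggregation as a map phase (emit a dimension key with a measure) and a reduce phase (sum measures per key).

```python
from collections import defaultdict

# Hypothetical fact tuples: (region, product, sales) — layout is illustrative.
facts = [("East", "tv", 100), ("West", "tv", 80), ("East", "radio", 50)]

def map_phase(tuples):
    # Emit (dimension key, measure) pairs, one per fact tuple.
    # Here the key is the region and the measure is sales.
    for region, product, sales in tuples:
        yield (region, sales)

def reduce_phase(pairs):
    # Aggregate the measure for each dimension key.
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)

result = reduce_phase(map_phase(facts))
# result == {"East": 150, "West": 80}
```

In a real MapReduce deployment, map tasks would run in parallel over partitions of the fact table stored in the distributed file system, and the framework would shuffle intermediate pairs to reduce tasks by key.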
Key words:
data warehouse,
OLAP,
cluster
YOU Jin-guo; XI Jian-qing; XIAO Yu-hong. Parallel Data Warehouses Architecture Based on PC Cluster[J]. Computer Engineering, 2009, 35(20): 73-75.