[1]李国杰,程学旗.大数据研究:未来科技及经济社会发展的重大战略领域——大数据的研究现状与科学思考[J].中国科学院院刊,2012,27(6):5-15.
[2]黄晓云.基于HDFS的云存储服务系统研究[D].大连:大连海事大学,2010.
[3]肖凌,刘继红,姚建初.分布式数据库系统的研究与应用[J].计算机工程,2001,27(1):33-35.
[4]朱珠.基于Hadoop的海量数据处理模型研究和应用[D].北京:北京邮电大学,2008.
[5]魏士伟,黄文明,康业娜,等.分布式数据库中基于半连接的查询优化算法研究[J].计算机应用,2007,27(1):34-36.
[6]刘大昕,张春林,聂亚杰,等.数据仓库与技术[J].计算机仿真,2003,20(5):40-43.
[7]CHAUDHURI S,DAYAL U.An overview of data warehousing and OLAP technology[J].ACM Sigmod Record,1997,26(1):65-74.
[8]FLORATOU A,MINHAS U F,ZCAN F.SQL-on-Hadoop:full circle back to shared-nothing database architectures[J].VLDB Endowment,2014,7(12):1295-1306.
[9]ABADI D,BABU S,ZCAN F.SQL-on-hadoop systems:tutorial[J].VLDB Endowment,2015,8(12):2050-2051.
[10]BORTHAKUR D.The Hadoop distributed file system:architecture and design[J].Hadoop Project Website,2007,11(11):1-10.
[11]WAA S,FLORIAN M.Beyond conventional data warehousing——massively parallel data processing with Greenplum database[C]//Proceedings of International Workshop on Business Intelligence for the Real-time Enterprise.Berlin,Germany:Springer,2008:235-243.
[12]CHANG L,WANG Z,MA T,et al.HAWQ:a massively parallel processing SQL engine in hadoop[C]//Proceedings of ACM SIGMOD International Conference on Management of Data.New York,USA:ACM Press,2014:1223-1234.
[13]LIU L C H,BAERENWALD L L,PLASEK J M,et al.Hybrid hash join process:USA,US 6263331 B1[P].2001-05-31.
[14]BONCZ P,NEUMANN T,ERLING O,et a.TPC-H analyzed:hidden messages and lessons learned from an influential benchmark[C]//Proceedings of Conference on Performance Evaluation and Benchmarking.Berlin,Germany:Springer,2013:124-132.
[15]GIACOMONI J,MOSELEY T,VACHHARAJANI M.Fast forward for efficient pipeline parallelism:a cache-optimized concurrent lock-free queue[C]//Proceedings of ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming.New York,USA:ACM Press,2008:43-52. |