作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (14): 262-263. doi: 10.3969/j.issn.1000-3428.2010.14.095

• 开发研究与设计技术 • 上一篇    下一篇

Hadoop平台的性能优化研究

栾亚建,黄翀民,龚高晟,赵铁柱   

  1. (华南理工大学广东省计算机网络重点实验室,广州 510640)
  • 出版日期:2010-07-20 发布日期:2010-07-20
  • 作者简介:栾亚建(1985-),男,硕士研究生,主研方向:分布式计算;黄翀民、龚高晟,硕士研究生;赵铁柱,博士研究生
  • 基金资助:

    Sun Microsystems公司研究基金资助项目

Research on Performance Optimization of Hadoop Platform

LUAN Ya-jian, HUANG Chong-min, GONG Gao-sheng, ZHAO Tie-zhu   

  1. (Key Lab of Computer Network of Guangdong Province, South China University of Technology, Guangzhou 510640)
  • Online:2010-07-20 Published:2010-07-20

摘要:

随着基于MapReduce模型的应用程序越来越多,Hadoop性能取决于应用程序。针对上述特性,从应用着手剖析Hadoop存在的局限和不足,提出解决方案,利用作业和任务的多重并发平衡磁盘和网络带宽,减小瓶颈出现的可能性,提高系统性能。

关键词: 分布式计算, 存储, 并发I/O, 性能优化

Abstract:

As applications programs based on MapReduce model growing up, Hadoop’s performance depends on applications. From applications, this paper analyzes the limitations and shortcomings of Hadoop architecture and storage technology, and proposes a feasible and cost-effective solution to address the problem. It builds a series tests to convince the idea. The solution is multi-level parallel, both on job and I/O levels, which makes full use of disk and network bandwidth, reduces I/O bottlenecks and improves performance.

Key words: distributed computing, storage, concurrent I/O, performance optimization

中图分类号: