作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (19): 38-39,4. doi: 10.3969/j.issn.1000-3428.2006.19.014

• 软件技术与数据库 • 上一篇    下一篇

并行I/O中大型多维数据集合分配策略研究

曾碧卿1,陈志刚2

  

  1. (1. 华南师范大学计算机系,佛山 528225;2. 中南大学信息科学与工程学院,长沙 410083)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2006-10-05 发布日期:2006-10-05

Research on Declustering Strategy of Large-scale Multidimensional Dataset of Parallel I/O

ZENG Biqing1, CHEN Zhigang2   

  1. (1. Department of Computer, South China Normal University, Foshan 528225;
    2. School of Information Science and Engineering, Central South University, Changsha 410083)
  • Received:1900-01-01 Revised:1900-01-01 Online:2006-10-05 Published:2006-10-05

摘要: 在大型多维数据集合处理中,对多维数据集合的拆分及其在磁盘上的存储分配是重要的研究课题。由于磁盘的机械运动已形成了数据I/O时的速度瓶颈,因此通过采用并行I/O技术,将多维数据进行有效的拆分,并在多个磁盘间进行分布存储是克服瓶颈的有效办法。基于此,论文中提出了一种多维数据的循环拆分方法,它是对二维数据集合循环拆分分配方法的扩展,性能比较与分析表明了新算法的有 效性。

关键词: 多维数据集, 并行I/O, 数据拆分, 循环分配策略

Abstract: It is an important research topic to process the large-scale multidimensional dataset by program. When the data is input or output, the speed of reading and writing is limited by the mechanism movement of disks itself. The parallel I/O technology is efficient to eliminate the I/O bottleneck. Parallel I/O can decluster the multidimensional dataset and store them. Based on this fact, a new cyclic declustering strategy for the large-scale multidimensional dataset is proposed. The experiment shows its validity.


Key words: Multidimensional dataset, Parallel I/O, Data declustering, Cyclic distributed strategy

中图分类号: