计算机工程 ›› 2010, Vol. 36 ›› Issue (16): 264-266.doi: 10.3969/j.issn.1000-3428.2010.16.094

• 开发研究与设计技术 • 上一篇    下一篇

异构环境下空闲计算资源的共享与聚集

狄振强1,徐培松2   

  1. (1. 曲阜师范大学物理工程学院,曲阜 273165;2. 中国人民解放军92961部队,三亚 572021)
  • 出版日期:2010-08-20 发布日期:2010-08-17
  • 作者简介:狄振强(1965-),男,副教授、硕士,主研方向:高性能计算,网络通信,安全技术;徐培松,助理工程师

Sharing and Aggregation of Free Computing Resource in Heterogeneous Environments

DI Zhen-qiang1, XU Pei-song2   

  1. (1. College of Physics and Engineering, Qufu Normal University, Qufu 273165,2. 92961 Troops of People’s Liberation Army, Sanya 572021)
  • Online:2010-08-20 Published:2010-08-17

摘要: 描述异构环境下的空闲计算资源共享与聚集平台DCSA。DCSA是一种通用的、可扩展与高效并行化的平台,在该环境下可以较容易地使用高级别的方式编写并行程序,把各种不同的软硬环境下的机器整合到单一的系统环境,共同完成高性能科学计算问题。DCSA的通用性得益于统一的XML文档来描述任务与计算结果及灵活的编程模型。利用一个轻量级的检查点机制与高度可配置的通信机制保证了DCSA的容错性与高效性。通过对实际并行例程的案例分析,验证了DCSA的正确性与高效性。

关键词: 高性能计算, 资源共享, 分布式计算, 多线程

Abstract: A computing resource sharing and aggregation platform in the heterogeneous environments called DCSA is presented and described. DCSA is a parallelization platform that enables the programmer to build distributed parallel applications using a high level and easy-to-use parallel programming paradigm. DCSA is designed for a general system by defining XML approach with which the application developer can document and evaluate the results of program. A light-weight check point mechanism and a highly customizable communication mechanism are presented that prove the fault tolerance and efficiency of DCSA. Results obtained from examples analysis show that DCSA is feasible and efficient computing resource sharing platform.

Key words: high performance computing, resource sharing, distributed computing, multi-thread

中图分类号: