Abstract:
Partitioned workload system (PWS) is one job management system using partition and dynamic resources-lease technology, which enhances the manageability of JMS and improves the utilization of cluster system. With optimized communication system, PWS can support cluster systems with thousands of CPUs or nodes. PWS has been applied in Shuguang 4000A super server and other computation fields requiring high performances. the design and implementation of the PWS is introduced.
Key words:
JMS,
cluster system,
partition,
lease,
scalability
摘要: PWS(partitioned workload system)是引入分区化技术以及资源租借技术的作业管理系统(JMS)。分区技术加强了机群系统中资源的可管理性,通过资源租借技术提高了系统资源的利用率。优化的通信方式让PWS能够支持上千节点的大规模机群系统。PWS已经在曙光4000A超级计算机以及相关高性能计算领域中得到了应用。该文介绍了PWS的设计与实现。
关键词:
JMS,
机群系统,
分区,
租借,
扩展性
CLC Number:
ZOU Ming; TU Bi-bo; ZHAN Jian-feng. Large-scale Job Management System Based on Partition and Lease Technologies[J]. Computer Engineering, 2007, 33(17): 99-101.
邹 铭;涂碧波;詹剑锋. 基于分区租借的大规模作业管理系统[J]. 计算机工程, 2007, 33(17): 99-101.