Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2023, Vol. 49 ›› Issue (3): 288-295. doi: 10.19678/j.issn.1000-3428.0063895

• Development Research and Engineering Application • Previous Articles     Next Articles

Optimization of Molecular Dynamics Algorithm for Solid Crystalline Silicon Based on GPU

LI Jing1, ZHU Aiqi2, HAN Lin3, HOU Chaofeng2   

  1. 1. School of Information Engineering, Zhengzhou University, Zhengzhou 450001, China;
    2. Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190, China;
    3. National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
  • Received:2022-02-12 Revised:2022-03-29 Published:2022-05-24

基于GPU的固态晶体硅分子动力学算法优化

李靖1, 祝爱琦2, 韩林3, 侯超峰2   

  1. 1. 郑州大学 信息工程学院, 郑州 450001;
    2. 中国科学院过程工程研究所, 北京 100190;
    3. 郑州大学 国家超级计算郑州中心, 郑州 450001
  • 作者简介:李靖(1995—),男,硕士研究生,主研方向为分子动力学模拟性能优化;祝爱琦,博士研究生;韩林,副教授、博士;侯超峰(通信作者),副研究员、博士。
  • 基金资助:
    国家自然科学基金(21776280,22073103);北京市自然科学基金(JQ21034);河南省重大科技专项项目(201400211300)。

Abstract: Molecular Dynamics(MD) simulations are typically used to investigate the thermodynamic properties of crystalline silicon.Molecular simulations generally require heavy computational loads owing to the complex multibody interaction potential between atoms, resulting in limited time and space scale of calculations.The Graphics Processing Unit (GPU), which adopts parallel multithreading technology and computationally intensive processing, shows significant application potential in MD simulations.Therefore, it is necessary to fully use the characteristics of GPU hardware architecture to improve the space-time scale of MD simulations of solid covalent crystalline silicon to investigate the thermal conduction mechanism of crystalline silicon.Based on the simulation algorithm of solid covalent crystalline silicon MD, a fixed neighbor algorithm design and optimization for the GPU computing platform is proposed.The data structure, branch structure optimization, and other methods are used to solve the time consuming problem of global memory access and branch structure of the fixed neighbor algorithm for MD simulations, reduce data memory access consumption and branch conflict, and change the thread parallel scheduling mode to achieve high performance parallel computing on the GPU computing platform.This effectively solves the computing load problem.The experimental results show that acceleration ratio of LAMMPS double precision solid crystalline silicon MD simulation and double precision fixed neighbor algorithm is 11.62, and the acceleration ratio of HOOMD-blue double precision solid crystal silicon molecular dynamics simulation, double precision fixed neighbor algorithm and single precision fixed neighbor algorithm is 9.39 and 12.18 respectively.

Key words: Molecular Dynamics(MD) simulation, Graphics Processing Unit(GPU), fixed neighbor, data structure, branch structure

摘要: 分子动力学模拟通常用于晶体硅热力学性质的研究,因原子间采用复杂的多体作用势,分子模拟通常面临较高的计算负载,导致计算的时间和空间尺度受限。图形处理器(GPU)采用并行多线程技术,用于计算密集型处理任务,在分子动力学模拟领域中显示巨大的应用潜力。因此,充分利用GPU硬件架构特性提升固态共价晶体硅分子动力学模拟的时空尺度对晶体硅导热机制的研究具有重要意义。基于固态共价晶体硅分子动力学模拟算法,提出面向GPU计算平台的固定邻居算法设计与优化。利用数据结构、分支结构优化等方法解决分子动力学模拟的固定邻居算法全局访存和分支结构的耗时问题,降低数据访存消耗和分支冲突,通过改变线程并行调度方式,在GPU计算平台上实现高性能并行计算,有效解决计算负载问题。实验结果表明,LAMMPS双精度固态晶体硅分子动力学模拟与双精度固定邻居算法的加速比为11.62,HOOMD-blue双精度固态晶体硅分子动力学模拟与双精度固定邻居算法和单精度固定邻居算法的加速比分别为9.39和12.18。

关键词: 分子动力学模拟, 图形处理器, 固定邻居, 数据结构, 分支结构

CLC Number: