[1] WANG Qiao,CAO Zongyan,GAO Liang,et al.PHoToNs-a parallel heterogeneous and threads oriented code for cosmological N-body simulation[J].Research in Astronomy and Astrophysics,2018,18(6):23-62. [2] BARNES J,HUT P.A hierarchical O(NlogN) force-calculation algorithm[J].Nature,1986,324(6096):446-449. [3] HOCKNEY R,EASTWOOD J.Computer simulation using particles[M].[S.l.]:Taylor & Francis,Inc.,1988. [4] GREENGARD L,ROKHLIN V.A fast algorithm for particle simulations[J].Journal of Computational Physics,1987,73(2):325-348. [5] WARREN M S,SALMON J K.Astrophysical N-body simulations using hierarchical tree data structures[C]//Proceedings of 1992 ACM/IEEE Conference on Super-computing.Washington D.C.,USA:IEEE Press,1992:570-576. [6] WARREN M S.2HOT:an improved parallel hashed OCT-tree N-body algorithm for cosmological simulation[J].Scientific Programming,2014,22(2):109-124. [7] ISHIYAMA T,NITADORI K,MAKINO J.4.45 Pflops astrophysical N-body simulation on K computer-the gravitational trillion-body problem[C]//Proceedings of International Conference on High Performance Computing,Networking,Storage and Analysis.Washington D.C.,USA:IEEE Press,2012:1-10. [8] MAKINO J,TAIJI M.Astrophysical N-body simulations on GRAPE-4 special-purpose computer[C]//Proceedings of 1995 ACM/IEEE Conference on Supercomputing.Washington D.C.,USA:IEEE Press,1995:63-75. [9] KAWAI A,FUKUSHIGE T,MAKINO J.$7.0/Mflops astrophysical N-body simulation with treecode on GRAPE-5[C]//Proceedings of 1999 ACM/IEEE Conference on Supercomputing.Washington D.C.,USA:IEEE Press,1999:67-85. [10] MAKINO J,FUKUSHIGE T,KOGA M.A 1.349 Tflops simulation of black holes in a galactic center on GRAPE-6[C]//Proceedings of 2000 ACM/IEEE Conference on Supercomputing.Washington D.C.,USA:IEEE Press,2000:43-53. [11] MAKINO J,KOKUBO E,FUKUSHIGE T.Performance evaluation and tuning of GRAPE-6-towards 40"real" TFlops[C]//Proceedings of 2003 ACM/IEEE Conference on Supercomputing.Washington D.C.,USA:IEEE Press,2003:22-116. [12] HAMADA T,NARUMI T,YOKOTA R,et al.42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence[C]//Proceedings of Conference on High Performance Computing Networking,Storage and Analysis.Washington D.C.,USA:IEEE Press,2009:1-12. [13] HAMADA T,NITADORI K.190 TFlops astrophysical N-body simulation on a cluster of GPUs[C]//Proceedings of 2010 ACM/IEEE International Conference for High Performance Computing,Networking,Storage and Analysis.Washington D.C.,USA:IEEE Press,2010:1-9. [14] BÉDORF J,GABUROV E,FUJII M S,et al.24.77 Pflops on a gravitational tree-code to simulate the Milky way galaxy with 18600 GPUs[C]//Proceedings of International Conference for High Performance Computing,Networking,Storage and Analysis.Washington D.C.,USA:IEEE Press,2014:54-65. [15] IWASAWA M,WANG L,NITADORI K,et al.Global simulation of planetary rings on Sunway TaihuLight[C]//Proceedings of International Conference on Computational Science.Berlin,Germany:Springer,2018:483-495. [16] FANG Jiarui,FU Haohuan,ZHAO Wenlai,et al.swDNN:a library for accelerating deep learning applications on Sunway TaihuLight[C]//Proceedings of 2017 IEEE International Parallel and Distributed Processing Symposium.Washington D.C.,USA:IEEE Press,2017:615-624. [17] XU LEI,XU YING.Investigation of the implementation of N-body problem on GPUs[J].Computer Applications and Software,2012,29(1):92-95.(in Chinese)徐磊,徐莹.多体问题在GPU上实现的讨论[J].计算机应用与软件,2012,29(1):92-95. [18] LI Ning.2DECOMP&FFT-a highly scalable 2D decomposi-tion library and FFT interface[C]//Proceedings of CUG'10.Washington D.C.,USA:IEEE Press,2010:24-27. [19] DEHNEN W.A hierarchical O(N) force calculation algorithm[J].Journal of Computational Physics,2002,179(1):27-42. [20] PRUNET S,PICHON C,AUBERT D,et al.Initial conditions for large cosmological simulations[J].The Astrophysical Journal Letters Supplement Series,2008,178(2):179-188. |