[1]WOODALL T,SHIPMAN G,BOSILCA G,et al.High performance RDMA protocols in HPC[C]//Proceedings of the 13th European PVM/MPI User’s Group Conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface.Berlin,Germany:Springer,2006:76-85.br/
[2]LIAO X K,PANG Z B,WANG K F,et al.High performance interconnect network for Tianhe system[J].Journal of Computer Science and Technology,2015,30(2):259-272.br/
[3]SUR S,JIN H W,CHAI L,et al.RDMA read based rendezvous protocol for MPI over infiniband:design alternatives and benefits[C]//Proceedings of the 11th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming.New York,USA:ACM Press,2006:32-39.br/
[4]LIU J,JIANG W,WYCKOFF P,et al.Design and implementation of MPICH2 over Infiniband with RDMA support[C]//Proceedings of the 18th International Parallel and Distributed Processing Symposium.Washington D.C.,USA:IEEE Press,2004:13-16.br/
[5]LIU J,WU J,PANDA D K.High performance RDMA-based MPI implementation over InfiniBand[J].Inter-national Journal of Parallel Programming,2004,32(3):167-198.br/
[6]MITCHELL C,GENG Y,LI J.Using one-sided RDMA reads to build a fast,CPU-efficient key-value store[C]//Proceedings of USENIX Annual Technical Conference.Washington D.C.,USA:IEEE Press,2013:103-114.br/
[7]余胜生,初莹莹,周敬利,等.基于RDMA协议的零拷贝技术研究[J].计算机工程与应用,2004,40(3):126-128.br/
[8]徐健,侯振龙,龚东磊,等.高速串行数据处理模块的设计与实现[J].计算机工程,2016,42(3):289-294.br/
[9]ANDERSON E,BROOKS J,GRASSL C,et al.Perfor-mance of the Cray T3E multiprocessor[C]//Proceedings of 1997 ACM/IEEE Conference on Supercomputing.New York,USA:ACM Press,1997:1-17.br/
[10]CHEN D,EISLEY N A,HEIDELBERGER P,et al.The IBM Blue Gene/Q interconnection network and message unit[C]//Proceedings of 2011 International Conference on High Performance Computing,Networking,Storage and Analysis.Washington D.C.,USA:IEEE Press,2011:1-10.br/
[11]InfiniBand Trade Association.InfiniBand architecture specification:release 1.3[S].2015.br/
[12]NIEPLOCHA J,TIPPARAJU V,KRISHNAN M,et al.High performance remote memory access communication:the ARMCI approach[J].The International Journal of High Performance Computing Applications,2006,20(2):233-253.br/
[13]STRUMPEN V,CASAVANT T L.Exploiting communica-tion latency hiding for parallel network computing:model and analysis[C]//Proceedings of 1994 International Con-ference on Parallel and Distributed Systems.Washington D.C.,USA:IEEE Press,1994:622-627.br/
[14]NIEPLOCHA J,TIPPARAJU V,KRISHNAN M,et al.Optimizing mechanisms for latency tolerance in remote memory access communication on clusters[C]//Proceedings of IEEE International Conference on Cluster Computing.Washington D.C.,USA:IEEE Press,2003:130-138.br/
[15]刘利,李文龙,陈彧,等.软件流水中隐藏存储延迟的方法[J].软件学报,2005,16(10):1833-1841.br/
[16]CULLER D,KARP R,PATTERSON D,et al.LogP:towards a realistic model of parallel computation[C]//Proceedings of the 4th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming.New York,USA:ACM Press,1993:1-12.br/
[17]ALEXANDROV A,IONESCU M F,SCHAUSER K E,et al.LogGP:incorporating long messages into the LogP model for parallel computation[J].Journal of Parallel and Distributed Computing,1997,44(1):71-79.br/
[18]AL-TAWIL K,MORITZ C A.Performance modeling and evaluation of MPI[J].Journal of Parallel and Distributed Computing,2001,61(2):202-223.br/ |