作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2013, Vol. 39 ›› Issue (2): 293-298. doi: 10.3969/j.issn.1000-3428.2013.02.061

• 开发研究与设计技术 • 上一篇    下一篇

基于龙芯3B的循环规约算法向量化研究

吴 淅 1,2,3,黄章进 1,2,3,顾乃杰 1,2,3   

  1. (1. 中国科学技术大学计算机科学与技术学院,合肥 230027;2. 安徽省计算与通讯软件重点实验室,合肥 230027; 3. 中国科学技术大学与中国科学院沈阳计算技术研究所网络与通信联合实验室,合肥 230027)
  • 收稿日期:2012-03-23 修回日期:2012-05-24 出版日期:2013-02-15 发布日期:2013-02-13
  • 作者简介:吴 淅(1989-),女,硕士研究生,主研方向:计算机体系结构,算法优化;黄章进,副教授、博士;顾乃杰,教授、博士生导师
  • 基金资助:
    国家自然科学基金资助项目(60833004);“核高基”重大专项(2009ZX01028-002-003-005);高等学校学科创新引智计划基金资助项目(B07033)

Vectorization Study for Cyclic Reduction Algorithm Based on Godson 3B

WU Xi 1,2,3, HUANG Zhang-jin 1,2,3, GU Nai-jie 1,2,3   

  1. (1. School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China; 2. Anhui Province Key Laboratory of Computing and Communication Software, Hefei 230027, China; 3. Network and Communication Joint Laboratory, University of Science and Technology of China & Shenyang Institute of Computing Technology, Chinese Academy of Sciences, Hefei 230027, China)
  • Received:2012-03-23 Revised:2012-05-24 Online:2013-02-15 Published:2013-02-13

摘要: 研究国产CPU龙芯3B的体系结构,分析三对角线性方程组求解中的循环规约算法,并结合算法特性,提出循环规约算法的优化方案。采用向量化级数,利用龙芯3B处理器提供的向量扩展指令对该算法在龙芯3B上进行实现和优化。实验结果表明,与非向量化算法相比,向量化算法的性能提升较大。

关键词: 龙芯3B, 三对角线性方程, 循环规约算法, 向量化, 泊松方程, 离散化

Abstract: By analyzing the unique characteristics of reduction cyclic algorithm for solving tri-diagonal linear system, this paper presents an optimized algorithm specially for Godson 3B processor. The reduction cyclic algorithm is implemented on Godson 3B platform based on its vector instructions and vectorization techniques are used to boost performance of the algorithm. Experimental results show that, comparied with non-vectorization algorithm, vectorization algorithm larger enhances the performance.

Key words: Godson 3B, tri-diagonal linear equation, cyclic reduction algorithm, vectorization, Poisson equation, discretization

中图分类号: