作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 开发研究与工程应用 • 上一篇    下一篇

一种高性能四倍精度浮点乘加器的设计与实现

何 军,黄永勤,朱 英   

  1. (上海高性能集成电路设计中心,上海 201204)
  • 收稿日期:2012-12-26 出版日期:2014-02-15 发布日期:2014-02-13
  • 作者简介:何 军(1980-),男,博士研究生,主研方向:微处理器设计;黄永勤、朱 英,高级工程师

Design and Implementation of a High Performance Quadruple Precision Floating-point Multiplier Accumulator

HE Jun, HUANG Yong-qin, ZHU Ying   

  1. (Shanghai High Performance Integrated Circuit Design Centre, Shanghai 201204, China)
  • Received:2012-12-26 Online:2014-02-15 Published:2014-02-13

摘要: 高精度、高性能浮点运算部件是高性能微处理器设计的重要部分。通过对传统双精度浮点乘加运算算法的研究,结合四倍精度浮点数据格式特点,设计并实现一种高性能的四倍精度浮点乘加器(QPFMA),该乘加器支持多种浮点运算,运算延迟为 7拍,全流水结构。采用双路加法器改进算法结构,优化头零预测和规格化移位逻辑,减小运算延迟和硬件开销。通过参数化设计验证方法,实现高效的正确性验证。逻辑综合结果表明,基于65 nm工艺,该QPFMA频率可达1.2 GHz,比现有的QPFMA设计运算延迟减少3拍,频率提高约11.63%。

关键词: 浮点运算, 乘加, 四倍精度, 高精度, 参数化

Abstract: High precision and high performance floating-point unit is an important research object of high performance microprocessor design. According to the characteristic of Quadruple Precision(QP) floating-point data format and research on double precision floating-point multiplier accumulator algorithms, a high performance Quadruple Precision Floating-point Multiplier Accumulator(QPFMA) is designed and realized, which supports multiple floating-point arithmetic with a 7 cycles pipeline. By adopting dual path adder and improving on algorithm architecture, optimizing leading-zero-anticipation and normalization shifter logic, the latency and hardware area is decreased. And by making use of parameterized design and verification methodology, the correction of the QPFMA is verified efficiently. Based on 65 nm technology, as the synthesis results show that the QPFMA can work at 1.2 GHz, with the latency decreased by 3 cycles and the frequency increased by about 11.63% compared with current QPFMA design.

Key words: floating-point arithmetic, multiply-add, Quadruple Precision(QP), high precision, parameterization

中图分类号: