计算机工程 ›› 2013, Vol. 39 ›› Issue (7): 7-10,15.doi: 10.3969/j.issn.1000-3428.2013.07.002

• 专栏 • 上一篇    下一篇

基于扩展寄存器与片上网络的运算阵列设计

张家杰,欧 鹏,俞 政,于学球,虞志益   

  1. (复旦大学专用集成电路与系统国家重点实验室,上海 201203)
  • 收稿日期:2012-08-29 出版日期:2013-07-15 发布日期:2013-07-12
  • 作者简介:张家杰(1987-),男,硕士研究生,主研方向:多核处理器设计;欧 鹏、俞 政、于学球,硕士研究生;虞志益,副研究员
  • 基金项目:

    国家自然科学基金资助项目(61103008);国家科技重大专项基金资助项目(2011ZX03003-003-03);上海市科委集成电路专项基金资助项目(10706200300);上海市青年科技启明星计划基金资助项目(11QA1400500)

Design of Computing Array Based on Extended Register and Network-on-Chip

ZHANG Jia-jie, OU Peng, YU Zheng, YU Xue-qiu, YU Zhi-yi   

  1. (State Key Laboratory of ASIC & System, Fudan University, Shanghai 201203, China)
  • Received:2012-08-29 Online:2013-07-15 Published:2013-07-12

摘要:

为提高多核处理器性能,在传统硬件加速部件的基础上,提出一种新型的运算阵列设计方案。将运算阵列与多核处理器的通信端口映射在扩展寄存器地址空间上,实现阵列与多核处理器的紧密耦合。通过片上网络连接各个运算单元,实现运算阵列的灵活配置和高度共享。在实验系统上实现1 024点快速傅里叶变换和H.264解码器,结果表明,与纯软件实现相比,该方案能使处理器性能和功耗都有所改善。

关键词: 多核处理器, 运算阵列, 扩展寄存器, 片上网络, 快速傅里叶变换, H.264解码器

Abstract:

In order to improve the performance of multi-core processor, based on traditional hardware accelerator, this paper presents a novel computing array design scheme. The communication ports between the computing array and the processor are mapped in the address space of extended register file, which makes the computing array and the processor tightly coupled. The computing units are connected by Network-on-Chip(NoC) which enables the computing array be flexibly configured and highly shared by the multi-core processor. A 1 024-point Fast Fourier Transform(FFT) and an H.264 decoder are implemented on the experimental platform, and results show that the scheme can improve the performance and power consumption significantly compared to pure software solution.

Key words: multi-core processor, computing array, extended register, Network-on-Chip(NoC), Fast Fourier Transform(FFT), H.264 decoder

中图分类号: