Computer Engineering (计算机工程)

• Architecture and Software Technology

Prediction of Coherence Request Broadcast Range Based on Memory-Access Locality

WANG Yunfei, LI Yuan, WANG Biao

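  1. (Shanghai High Performance IC Design Center, Shanghai 200120)
  • Received: 2016-08-24 Published: 2017-10-15 Online: 2017-10-15
  • About the authors: WANG Yunfei (b. 1992), male, master's student, main research interest: microprocessor architecture; LI Yuan and WANG Biao, senior engineers.
  • Funding:

    National Science and Technology Major Project on Core Electronic Devices, High-end Generic Chips and Basic Software ("核高基"), project "Supercomputer Processor Research and Development" (2013ZX0102-8001-001-001).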

Prediction of Coherence Request Broadcast Range Based on Access Locality

WANG Yunfei, LI Yuan, WANG Biao

  1. (Shanghai High Performance IC Design Center, Shanghai 200120, China)
  • Received: 2016-08-24 Online: 2017-10-15 Published: 2017-10-15

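Abstract: The broadcast protocols in wide use today demand high bandwidth, and directory protocols incur long memory-access latency, so neither suits domestic server processors, whose inter-chip direct-connect interfaces have relatively low bandwidth and high latency. To address this, building on a two-level heterogeneous hybrid coherence protocol that uses a directory within each chip and Token broadcast between chips, this paper applies the principle of memory-access locality to predict the broadcast range of inter-chip requests and proposes the HP-SRW protocol. Experimental results show that, compared with the two-level directory protocol, HP-SRW improves time performance by 8.9% and reduces bandwidth demand by 3.1%; compared with the hybrid protocol, it slightly improves time performance and reduces bandwidth demand by 30.6%; compared with the Token protocol, it reduces bandwidth demand by 66.5% at a cost of 4.7% in time performance.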

Keywords: inter-chip direct connection, Cache coherence, Token protocol, directory protocol, hybrid protocol

Abstract:

Snoopy protocols and directory protocols are widely used in modern server systems, but the former demand high bandwidth and the latter incur long latency, so neither is suitable for domestic server processors, whose inter-chip direct-connect interfaces offer relatively low bandwidth and relatively high latency. Building on a hybrid coherence protocol that uses the Token protocol between chips and a directory protocol within each chip, this paper exploits access locality to predict the destinations of inter-chip requests and proposes the HP-SRW protocol. Experimental results show that HP-SRW performs 8.9% better than the two-level directory protocol while requiring 3.1% less bandwidth. Compared with the Hybrid Protocol (HP), it performs slightly better and requires 30.6% less bandwidth. Compared with the Token protocol, HP-SRW reduces inter-chip bandwidth demand by 66.5% at the cost of 4.7% in performance.

Key words: direct connection among chips, Cache coherence, Token protocol, directory protocol, Hybrid Protocol(HP)

CLC number: