Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (3): 299-307. doi: 10.19678/j.issn.1000-3428.0070060

• Computer Architecture and Advanced Computing • Previous Articles     Next Articles

Research and Implementation of Computational Storage System for High-Energy Physics

GAO Yu1,2, CHENG Yaodong1,2,3,*(), ZHANG Minxing1,2, CHENG Yaosong1, BI Yujiang1   

  1. 1. Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
    2. University of Chinese Academy of Sciences, Beijing 100049, China
    3. TIANFU Cosmic Ray Research Center, Chengdu 610213, Sichuan, China
  • Received:2024-07-01 Revised:2024-08-28 Online:2026-03-15 Published:2024-12-04
  • Contact: CHENG Yaodong

面向高能物理的可计算存储系统研究与实现

高宇1,2, 程耀东1,2,3,*(), 张敏行1,2, 程垚松1, 毕玉江1   

  1. 1. 中国科学院高能物理研究所, 北京 100049
    2. 中国科学院大学, 北京 100049
    3. 四川天府新区宇宙线研究中心, 四川 成都 610213
  • 通讯作者: 程耀东
  • 作者简介:

    高宇,男,博士研究生,主研方向为分布式存储、可计算存储

    程耀东(通信作者),研究员、博士

    张敏行,博士研究生

    程垚松,硕士

    毕玉江,副研究员、博士

  • 基金资助:
    国家自然科学基金(12075268); 中国科学院高能物理研究所创新基金(E15451U210)

Abstract:

In high-energy physics experiments, data processing typically involves a compute-storage separated computing model. During the computation process, data must be transferred between the computing and storage nodes. The continuous growth in experimental data and data analysis demands has led to data transfer bottlenecks, reducing the overall processing efficiency of these systems. This paper proposes a computational storage system for high-energy physics. First, the storage software EOS is extended. Computational storage plugins are introduced by building on the original architecture. After parsing user commands, the storage server executes local computations based on the file I/O, thereby reducing data movement, alleviating network pressure, and enhancing data processing efficiency. Second, a computational storage server based on a Central Processing Unit-Field Programmable Gate Array (CPU-FPGA) heterogeneous computing architecture is constructed. Considering the lower computational complexity of I/O-intensive tasks, tasks suitable for parallel computing are offloaded to the FPGA via the PCIe bus, thereby extending the computational capabilities of the storage server. Experimental evaluations show that the computational storage system eliminates queuing time and network latency, thereby shortening the overall execution time of computational tasks. Moreover, leveraging FPGA-based hardware acceleration effectively compensates for the weak computing performance of CPUs in storage servers, thereby enhancing the algorithmic versatility of computational storage devices. In tests based on decoding by LHAASO, the computational storage system achieves a speedup of approximately sixfold.

Key words: computational storage, heterogeneous computing, Field Programmable Gate Array (FPGA), big data, high-energy physics

摘要:

高能物理实验数据处理普遍采用存算分离的计算模式, 计算过程中需要在计算节点和存储节点间传输数据。实验数据和数据分析需求的不断增长造成了数据传输瓶颈, 降低了系统整体的处理效率。针对上述问题提出面向高能物理的可计算存储系统。首先, 对存储软件EOS进行扩展, 在原架构的基础上增加可计算存储插件, 存储服务器解析用户命令后, 在文件I/O的基础上执行本地计算, 减少数据移动, 缓解网络压力, 提升数据处理效率。然后, 构建基于中央处理器-现场可编程门阵列(CPU-FPGA)异构计算架构的可计算存储服务器。针对I/O密集型任务计算复杂度较低的特点, 将适合并行计算的任务通过PCIe总线卸载到FPGA中, 扩展了存储服务器的计算能力。对系统的实验评估结果表明, 可计算存储系统能有效消除排队时间和网络延迟, 进而缩短计算任务的整体执行时间。基于FPGA的硬件加速, 有效弥补了存储服务器中CPU计算性能较弱的缺陷, 提升了可计算存储设备的算法通用性。在基于LHAASO的解码作业的测试中, 可计算存储系统实现了约6倍的速度提升。

关键词: 可计算存储, 异构计算, 现场可编程门阵列, 大数据, 高能物理