基于通信和拓扑感知的SNN分区与映射算法

doi:10.19678/j.issn.1000-3428.0069271

摘要/Abstract

摘要：

脉冲神经网络(SNN)正日益成为研究和模拟大脑各区功能及其相互关联性的重要方法。为了模拟更大规模的脑区域, 并行分布式计算已成为模拟SNN的必然选择。然而, 随着计算规模的增长, 计算节点间的负载不均衡及通信问题成为影响SNN模拟性能的主要因素。针对分布式计算平台, 现有分区算法还无法找到全局最佳分区并有效地将工作负载映射到计算核心上。因此, 提出一种基于通信和拓扑感知的分区与映射算法, 该算法包括分区和拓扑感知映射2个核心步骤。通过引入能够感知SNN连接的分区方法, 提高计算效率并降低通信延迟; 在拓扑感知映射方法中, 利用通信拓扑图和底层网络信息将工作负载高效地分配到各计算节点上, 最小化跨不同计算核心的通信成本。实验结果表明, 在国家超算济南计算中心的并行计算平台上, 采用96进程规模并行模拟SNN基准测试集时, 相比现有先进的分区框架, 所提方法具有更好的负载均衡和通信性能, 同步时间和通信时间分别减少了40%和7.1%, 最终的模拟总时间缩短了30%。

关键词: 脉冲神经网络, 分布式计算, 负载均衡, 超图分区, 拓扑感知映射

Abstract:

Spiking Neural Network (SNN) has become increasingly important for studying and simulating the functions of various brain regions and their interconnections. Parallel-distributed computing has become an inevitable choice for SNN simulations of larger-scale brain regions. However, as the scale of computation increases, SNN simulation performance is affected primarily by load imbalances among computing nodes and communication issues. For distributed computing platforms, existing partitioning algorithms cannot find a globally optimal partition or effectively map workloads to computing cores. Therefore, this study proposes a communication and topology-aware partitioning and mapping algorithm that includes two core steps: partitioning and topology-aware mapping. Introducing a partitioning method that is aware of SNN connections improves the computational efficiency and reduces communication latency. In the topology-aware mapping method, the communication topology graph and underlying network information are utilized to efficiently allocate workloads to computing nodes and minimize the communication costs across different computing cores. Experimental results show that, when simulating SNN benchmark datasets with 96 processes on the parallel computing platform of the National Supercomputing Center in Jinan, the proposed method achieves better load balancing and communication performance than existing state-of-the-art partitioning frameworks. The synchronization and communication times are reduced by 40% and 7.1%, respectively, and the total simulation time is shortened by 30%.

Key words: Spiking Neural Network (SNN), distributed computing, load balancing, hypergraph partitioning, topology-aware mapping

黄尧, 柴志雷. 基于通信和拓扑感知的SNN分区与映射算法[J]. 计算机工程, 2025, 51(5): 219-228.

HUANG Yao, CHAI Zhilei. Communication and Topology-Aware Partitioning and Mapping Algorithm for SNN[J]. Computer Engineering, 2025, 51(5): 219-228.

https://www.ecice06.com/CN/Y2025/V51/I5/219

图/表 10

图1 脉冲通信与交付过程

Fig.1 Pulse communication and delivery process

图2 通信与拓扑感知算法结构

Fig.2 Communication and topology-aware algorithm architecture

图3 多级图划分流程

Fig.3 Procedure of multilevel graph partitioning

图4 从拓扑网络构建延迟矩阵的过程

Fig.4 The process of constructing delay matrix from topological network

图5 皮质微电路模型

Fig.5 Cortical microcircuit model

图6 猕猴视觉多尺度模型

Fig.6 Multi-scale model of macaque visio

图7 不同分区算法的性能对比

Fig.7 Performance comparison of different partitioning algorithms

图8 不同映射算法的通信时间对比

Fig.8 Comparison of communication time between different mapping algorithms

参考文献 32

1	QU P , YANG L , ZHENG W M , et al. A review of basic software for brain-inspired computing. CCF Transactions on High Performance Computing, 2022, 4 (1): 34- 42. doi: 10.1007/s42514-022-00092-1
2	张铁林, 徐波. 脉冲神经网络研究现状及展望. 计算机学报, 2021, 44 (9): 1767- 1785.
	ZHANG T L , XU B . Research advances and perspectives on spiking neural networks. Chinese Journal of Computers, 2021, 44 (9): 1767- 1785.
3	GEWALTIG M O , DIESMANN M . NEST (NEural Simulation Tool). Scholarpedia, 2007, 2 (4): 1430. doi: 10.4249/scholarpedia.1430
4	STIMBERG M , BRETTE R , GOODMAN D F . Brian 2, an intuitive and efficient neural simulator. eLife, 2019, 8, e47314. doi: 10.7554/eLife.47314
5	JI Y, ZHANG Y, LI S, et al. NEUTRAMS: neural network transformation and co-design under neuromorphic hardware constraints[EB/OL]. [2023-08-05]. https://ieeexplore.ieee.org/document/7783724.
6	JORDAN J , IPPEN T , HELIAS M , et al. Extremely scalable spiking neuronal network simulation code: from laptops to exascale computers. Frontiers in Neuroinformatics, 2018, 12, 2. doi: 10.3389/fninf.2018.00002
7	PRONOLD J , JORDAN J , WYLIE B J N , et al. Routing brain traffic through the von Neumann bottleneck: parallel sorting and refactoring. Frontiers in Neuroinformatics, 2021, 15, 785068.
8	BAUTEMBACH D, OIKONOMIDIS I, ARGYROS A. Multi-GPU SNN simulation with static load balancing[EB/OL]. [2023-08-05]. https://ieeexplore.ieee.org/document/9533921.
9	栗学磊, 朱效民, 魏彦杰, 等. 神威太湖之光加速计算在脑神经网络模拟中的应用. 计算机学报, 2020, 43 (6): 1025- 1037.
	LI X L , ZHU X M , WEI Y J , et al. Application of Sunway TaihuLight accelerating in brain neural network simulation. Chinese Journal of Computers, 2020, 43 (6): 1025- 1037.
10	ALBERS J , PRONOLD J , KURTH A C , et al. A modular workflow for performance benchmarking of neuronal network simulations. Frontiers in Neuroinformatics, 2022, 16, 837549. doi: 10.3389/fninf.2022.837549
11	FERNANDEZ-MUSOLES C , COCA D , RICHMOND P . Communication sparsity in distributed spiking neural network simulations to improve scalability. Gene, 2019, 13, 19.
12	BARCHI F, URGESE G, MACⅡ E, et al. Mapping spiking neural networks on multi-core neuromorphic platforms: problem formulation and performance analysis[EB/OL]. [2023-08-05]. https://link.springer.com/chapter/10.1007/978-3-030-23425-6_9.
13	TIDDIA G , GOLOSIO B , ALBERS J , et al. Fast simulation of a multi-area spiking network model of macaque cortex on an MPI-GPU cluster. Frontiers in Neuroinformatics, 2022, 16, 883333. doi: 10.3389/fninf.2022.883333
14	BARCHI F, URGESE G, MACⅡ E, et al. Work-in-progress: impact of graph partitioning on SNN placement for a multi-core neuromorphic architecture[EB/OL]. [2023-08-05]. https://ieeexplore.ieee.org/document/8516831.
15	KARYPIS G , KUMAR V . A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM Journal on Scientific Computing, 1998, 20 (1): 359- 392. doi: 10.1137/S1064827595287997
16	LI S M, GUO S S, ZHANG L M, et al. SNEAP: a fast and efficient toolchain for mapping large-scale spiking neural network onto NoC-based neuromorphic platform[EB/OL]. [2023-08-05]. https://dl.acm.org/doi/abs/10.1145/3386263.3406900.
17	PARK J , YU T , JOSHI S , et al. Hierarchical address event routing for reconfigurable large-scale neuromorphic systems. IEEE Transactions on Neural Networks and Learning Systems, 2017, 28 (10): 2408- 2422. doi: 10.1109/TNNLS.2016.2572164
18	GALLUPPI F, DAVIES S, RAST A, et al. A hierachical configuration system for a massively parallel neural hardware platform[C]//Proceedings of the 9th Conference on Computing Frontiers. New York, USA: ACM Press, 2012: 183-192.
19	LEE M K F , CUI Y N , SOMU T , et al. A system-level simulator for RRAM-based neuromorphic computing chips. ACM Transactions on Architecture and Code Optimization, 2019, 15 (4): 1- 24.
20	BALAJI A , CATTHOOR F , DAS A , et al. Mapping spiking neural networks to neuromorphic hardware. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2019, 28 (1): 76- 86.
21	FURBER S B , LESTER D R , PLANA L A , et al. Overview of the SpiNNaker system architecture. IEEE Transactions on Computers, 2013, 62 (12): 2454- 2467.
22	华夏, 朱铮皓, 徐聪, 等. 基于精准通信建模的脉冲神经网络工作负载自动映射器. 计算机应用, 2023, 43 (3): 827- 834.
	HUA X , ZHU Z H , XU C , et al. Workload automatic mapper for spiking neural network based on precise communication modeling. Journal of Computer Applications, 2023, 43 (3): 827- 834.
23	TRENSCH G , MORRISON A . A system-on-chip based hybrid neuromorphic compute node architecture for reproducible hyper-real-time simulations of spiking neural networks. Frontiers in Neuroinformatics, 2022, 16, 884033.
24	DEVECI M , KAYA K , UÇAR B , et al. Hypergraph partitioning for multiple communication cost metrics: model and methods. Journal of Parallel and Distributed Computing, 2015, 77, 69- 83.
25	QU P , LIN H , PANG M , et al. ENLARGE: an efficient SNN simulation framework on GPU clusters. IEEE Transactions on Parallel and Distributed Systems: A Publication of the IEEE Computer Society, 2023 (9): 34.
26	FARAJ M F. Streaming, local, and multi-level (hyper) graph decomposition[EB/OL]. [2023-08-05]. https://arxiv.org/abs/2308.15617.
27	LIU L T, KUO M T, HUANG S C, et al. A gradient method on the initial partition of Fiduccia-Mattheyses algorithm[EB/OL]. [2023-08-05]. https://www.cs.york.ac.uk/rts/docs/SIGDA-Compendium-1994-2004/papers/1995/iccad95/pdffiles/03d_3.pdf.
28	YAN B C , XIAO L M , QIN G J , et al. QTMS: a quadratic time complexity topology-aware process mapping method for large-scale parallel applications on shared HPC system. Parallel Computing, 2020, 94, 102637.
29	VON KIRCHBACH K , SCHULZ C , TRÄFF J L . Better process mapping and sparse quadratic assignment. ACM Journal of Experimental Algorithmics, 2020, 25, 1- 19.
30	TAILLARD E . Robust taboo search for the quadratic assignment problem. Parallel Computing, 1991, 17 (4/5): 443- 455.
31	POTJANS T C , DIESMANN M . The cell-type specific cortical microcircuit: relating structure and activity in a full-scale spiking network model. Cerebral Cortex, 2014, 24 (3): 785- 806.
32	SCHMIDT M , BAKKER R , HILGETAG C C , et al. Multi-scale account of the network structure of macaque visual cortex. Brain Structure and Function, 2018, 223 (3): 1409- 1435.

[1]	聂雷, 胡字升, 鲍海洲. 基于RSU辅助和自适应分簇的异构车载网络选择方法[J]. 计算机工程, 2025, 51(3): 162-171.
[2]	魏德宾, 乔维维, 张怡. 基于麻雀搜索算法的软件定义卫星网络控制器部署[J]. 计算机工程, 2025, 51(3): 172-179.
[3]	张明, 郭文康, 王海峰. 面向大规模动态图的异构图计算系统设计[J]. 计算机工程, 2025, 51(3): 197-207.
[4]	彭世明, 林士飏, 贾硕, 杨苗会. 基于负载预测的多目标优化任务卸载策略[J]. 计算机工程, 2024, 50(1): 206-215.
[5]	刘向举, 赵犇, 方贤进, 徐杨洋. SDN中基于过程优化的动态负载均衡策略[J]. 计算机工程, 2023, 49(8): 137-145.
[6]	叶钧超, 徐聪, 黄尧, 柴志雷. 基于FPGA的Izhikevich神经元定制计算方法[J]. 计算机工程, 2023, 49(12): 35-45.
[7]	王奎宇, 宋晓勤, 缪娟娟, 张昕婷, 雷磊. 基于SDN的高性能QoS保障低轨道卫星星间路由算法[J]. 计算机工程, 2022, 48(5): 185-190,199.
[8]	刘家航, 郁龚健, 李佩琦, 华夏, 柴志雷, 陈闻杰. 基于SNN神经元重分布的NEST仿真器性能优化[J]. 计算机工程, 2022, 48(3): 189-196.
[9]	贺鹏飞, 范鹏飞, 尹千慧, 王中训, 张桐敬, 梁大伟. 基于负载均衡算法的Hyperledger Fabric共识机制研究[J]. 计算机工程, 2022, 48(11): 170-176.
[10]	李亚朋, 庞建民, 徐金龙, 聂凯. 一种针对线性循环结构的非线性静态调度策略[J]. 计算机工程, 2022, 48(1): 155-162.
[11]	施凌鹏, 朱征, 周俊松, 李鑫, 李静. 面向微服务架构的云系统负载均衡机制[J]. 计算机工程, 2021, 47(9): 44-50,58.
[12]	左攀, 束永安. DCN中基于前馈神经网络的动态多路径负载均衡方法[J]. 计算机工程, 2021, 47(9): 113-119.
[13]	杨海竹, 孙长印, 吴维超, 徐文军. 毫米波微波网络基于匹配算法的小区关联方法[J]. 计算机工程, 2021, 47(8): 210-215,223.
[14]	曹志鹏, 刘勤让, 刘冬培, 张霞. 面向时间敏感网络的流量调度方法[J]. 计算机工程, 2021, 47(7): 168-175,182.
[15]	姚玉坤, 朱克兰, 杨迪, 赵子军. 一种高效低时延的RPL跨层优化机制[J]. 计算机工程, 2021, 47(4): 141-146,179.

选择文件类型/文献管理软件名称

选择包含的内容