Service Function Chain Deployment Method for Large-Scale Network

doi:10.19678/j.issn.1000-3428.0065169

Abstract

Network Function Virtualization (NFV) decouples network functions from hardware middleboxes, deploys them as function instances, and orchestrates those instances into Service Function Chains (SFCs) to deliver network services. To address dynamic SFC deployment in large-scale, resource-constrained networks, a multi-agent group-strategy deployment method is proposed that combines the advantages of centralized Deep Reinforcement Learning (DRL) and traditional distributed methods. The SFC deployment problem is modeled as a Partially Observable Markov Decision Process (POMDP), and an Actor-Critic (AC) agent is deployed on each node. By observing only local node information, each agent can obtain the globally trained policy, which retains the flexibility and adaptability of DRL. Because the local agents control the interaction process, the method avoids the complex control and slow response that centralized DRL methods suffer from in large-scale networks. Following a multithreading idea, the experience of every node is collected and integrated for centralized training, which avoids the insufficient training and inapplicable policies that arise in fully distributed training when some nodes receive little request traffic. Experimental results show that the method is independent of network scale, does not rely on specific scenarios, and adapts well to complex, changing real-world network environments. In relatively complex traffic environments, it raises the deployment success rate by more than 20% over the CDRL and GCASP methods across multiple traffic patterns, while also reducing deployment cost.

Key words: Network Function Virtualization (NFV), Service Function Chain (SFC), Deep Reinforcement Learning (DRL), Partially Observable Markov Decision Process (POMDP), multi-agent
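
The abstract's idea of pooling every node's experience for centralized training can be illustrated with a minimal sketch. It is not the authors' implementation: the `Experience` fields, the `node_worker` and `collect_experience` names, and the placeholder observations, actions, and rewards are all assumptions made for illustration. It only shows one thread per node writing into a shared buffer, so that samples from busy nodes also reach the shared policy used by nodes with little traffic.

```python
import queue
import threading
from dataclasses import dataclass


@dataclass(frozen=True)
class Experience:
    """One training sample (hypothetical layout, not from the paper)."""
    node: str           # node whose local agent produced the sample
    observation: tuple  # local node information only
    action: int
    reward: float


def node_worker(node, num_requests, shared):
    """Simulate one node's agent handling its own incoming requests."""
    for i in range(num_requests):
        observation = (node, i)                 # placeholder local observation
        action = i % 2                          # placeholder decision
        reward = 1.0 if action == 0 else -1.0   # placeholder reward
        shared.put(Experience(node, observation, action, reward))


def collect_experience(traffic):
    """Run one thread per node and pool all samples into a single batch."""
    shared = queue.Queue()  # thread-safe buffer shared by all node agents
    threads = [
        threading.Thread(target=node_worker, args=(node, count, shared))
        for node, count in traffic.items()
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    batch = []
    while not shared.empty():  # safe here: all producers have finished
        batch.append(shared.get())
    return batch
```

Calling `collect_experience({"v1": 50, "v2": 2, "v3": 0})` returns one batch that a central trainer could consume. Nodes `v2` and `v3` contribute few or no samples, yet under a shared policy they would still benefit from what `v1` observed.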

Guanying ZHANG, Peng YI, Dan LI, Di ZHU, Ming MAO. Service Function Chain Deployment Method for Large-Scale Network[J]. Computer Engineering, 2023, 49(8): 122-129.

https://www.ecice06.com/CN/Y2023/V49/I8/122


Symbol	Meaning
$V$	Set of nodes $v$
$L$	Set of links $vv'$
$vv'$	Link with nodes $v$ and $v'$ as endpoints
$R_v$	Available resources of node $v$
$R_{vv'}$	Available bandwidth of link $vv' \in L$
$d_{vv'}$	Delay of link $vv'$
$S$, $s$	$S$ is the set of services; $s$ is a particular service
$C_s$, $c_n$	$C_s$ is the set of components; $c_n$ is a component; $C_s = \langle c_1, \cdots, c_n \rangle$
$f$	Service request, or flow
$s_f$	Service requested by $f$
$v_f$	Node at which the request arrives, i.e., the ingress node
$t_f$	Arrival time of the service request
$a_f$	Bandwidth required by the request
$d_f$	Duration of the request
$x_{c,v}(t)$	Whether component $c$ is instantiated on node $v$ at time $t$
$y_{c,v,f}(t)$	Whether flow $f$ uses the instance of component $c$ on node $v$ at time $t$
$R_v^{c_n}$	Node resources consumed when component $c_n$ is instantiated on node $v$, i.e., when $x_{c_n,v}(t) = 1$
$d_{c,v}^{f}$	Processing delay on node $v$ (for flow $f$ at component $c$)
$z_{vv',f}(t)$	Whether flow $f$ occupies link $vv'$ at time $t$
$w_v(t)$	Utilization of the remaining available bandwidth of the node at time $t$
$w_{vv'}(t)$	Utilization of the remaining available bandwidth of the link at time $t$
$d_{\mathrm{avg}}$	End-to-end delay
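
To make the notation concrete, the sketch below maps a few of the symbols onto plain Python data. It is only an illustration under assumptions: the `Flow` class, the dictionary layouts, and the capacity check in `fits` (total consumption on a node must not exceed $R_v$) are not taken from the source, which lists the symbols but states no constraint here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Flow:
    """A service request f; field names follow the notation table."""
    s_f: str    # requested service
    v_f: str    # ingress node
    t_f: float  # arrival time
    a_f: float  # required bandwidth
    d_f: float  # duration


def node_load(x, demand, v):
    """Resources used on node v: sum of R_v^c over components with x[c, v] == 1.

    x maps (component, node) to 0 or 1 and plays the role of x_{c,v}(t)
    at one fixed time t; demand maps (component, node) to R_v^c.
    """
    return sum(
        demand[(c, node)]
        for (c, node), placed in x.items()
        if node == v and placed == 1
    )


def fits(x, demand, capacity, v):
    """Assumed constraint: the load on node v may not exceed R_v."""
    return node_load(x, demand, v) <= capacity[v]
```

For example, with `x = {("c1", "v1"): 1, ("c2", "v1"): 1, ("c2", "v2"): 0}`, per-instance demands of 2.0 and 3.0 on `v1`, and a capacity of 4.0 on each node, `node_load(x, demand, "v1")` gives 5.0, so `v1` is overloaded while `v2` is empty.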

