Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (4): 349-357. doi: 10.19678/j.issn.1000-3428.0070139

• Next-Generation Networks and Edge Computing • Previous Articles     Next Articles

Research on Optimization of Kubernetes Elastic Scaling Based on Entropy Weight Utilization and Prediction Algorithm

SONG Zhedai, ZHU Jinrong*(), LIANG Chenyue, CHENG Xinyu   

  1. College of Physical Science and Technology, Yangzhou University, Yangzhou 225100, Jiangsu, China
  • Received:2024-07-17 Revised:2024-09-02 Online:2026-04-15 Published:2024-12-10
  • Contact: ZHU Jinrong

基于熵权利用率与预测算法的Kubernetes弹性伸缩优化研究

宋哲代, 朱金荣*(), 梁琛悦, 程心雨   

  1. 扬州大学物理科学与技术学院, 江苏 扬州 225100
  • 通讯作者: 朱金荣
  • 作者简介:

    宋哲代, 男, 硕士, 主研方向为云计算

    朱金荣(通信作者), 教授、硕士

    梁琛悦, 硕士

    程心雨, 硕士

  • 基金资助:
    国家自然科学基金(62375234); 江苏省研究生研究与实践创新计划(KYCX24_3714)

Abstract:

This study proposes an improved elastic scaling strategy based on a composite algorithm that combines entropy weight utilization and a prediction model, to address the issues of single-metric evaluation, latency, and low resource utilization in Kubernetes's built-in elastic scaling strategy. The entropy weight utilization composite algorithm calculates the comprehensive load value of the Kubernetes cluster by focusing on the distribution differences (information entropy method) and overall trends (average utilization weight method) of resource utilization across different nodes, thereby solving the problem of single metric evaluation. Next, this study constructs a predictive model that combines Adaptive Variational Mode Decomposition (AVMD) and the Attention Mechanism-based enhanced Long Short-Term Memory (Attention Mechanism-based LSTM) network to solve the latency and low resource utilization issues by predicting load changes. This model enables the system to quickly respond, expand its capacity at the onset of high traffic, and rapidly scale down to release resources once traffic subsides. Experimental results show that the improved elastic scaling strategy reduces the response time by 52% during the early stage of burst traffic compared with the default Kubernetes scaling strategy, and it rapidly scales down after the traffic subsides to release resources, demonstrating high practical application value.

Key words: Kubernetes cluster, entropy weight utilization composite algorithm, Adaptive Variational Mode Decomposition (AVMD) algorithm, Long Short-Term Memory (LSTM) algorithm, load prediction

摘要:

为解决Kubernetes内置的弹性伸缩策略衡量指标单一、反应滞后和资源利用效率低的问题, 提出一种熵权利用率复合算法结合预测模型的改进弹性伸缩策略。熵权利用率复合算法通过关注多种指标的资源利用率在不同节点上的分布差异(信息熵权法)和整体趋势(平均利用率权重法), 计算Kubernetes集群的综合负载值, 从而解决衡量指标单一的问题。构建自适应变分模态分解(AVMD)算法结合基于注意力机制增强的长短期记忆(Attention Mechanism-based LSTM)算法的预测模型, 通过预测负载变化以解决反应滞后和资源利用率低的问题。该模型根据预测的负载值, 在高流量初期促使系统快速响应进行扩容, 流量结束后迅速缩容以节约资源。实验结果表明, 与Kubernetes伸缩策略相比, 改进弹性伸缩策略在突发流量前期, 请求响应时间降低了52%, 在流量结束后快速缩容释放资源, 具有较高的实际应用价值。

关键词: Kubernetes集群, 熵权利用率复合算法, 自适应变分模态分解算法, 长短期记忆算法, 负载预测