作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (3): 197-203. doi: 10.19678/j.issn.1000-3428.0059944

• 体系结构与软件技术 • 上一篇    下一篇

基于异构Flink集群的节点优先级调度策略

汪文豪1, 史雪荣2   

  1. 1. 南京工业大学 计算机科学与技术学院, 南京 211816;
    2. 盐城师范学院 数学与统计学院, 江苏 盐城 224002
  • 收稿日期:2020-11-09 修回日期:2021-02-20 发布日期:2022-03-11
  • 作者简介:汪文豪(1997-),男,硕士研究生,主研方向为分布式应用;史雪荣,教授、博士。
  • 基金资助:
    国家自然科学基金(11872327);江苏省高等学校自然科学研究项目(20KJA190001)。

Node Priority Scheduling Strategy Based on Heterogeneous Flink Cluster

WANG Wenhao1, SHI Xuerong2   

  1. 1. College of Computer Science and Technology, Nanjing Tech University, Nanjing 211816, China;
    2. College of Mathematics and Statistics, Yancheng Teachers University, Yancheng, Jiangsu 224002, China
  • Received:2020-11-09 Revised:2021-02-20 Published:2022-03-11

摘要: Flink流处理系统默认的任务调度策略在一定程度上忽略了集群异构和节点可用资源,导致集群整体负载不均衡。研究分布式节点的实时性能和集群作业环境,根据实际作业环境的异构分布情况,设计结合异构Flink集群的节点优先级调整方法,以基于Ganglia可扩展分布式集群资源监控系统的集群信息为依据,动态调整适应当前作业环境的节点优先级指数。基于此提出Flink节点动态自适应调度策略,通过实时监测节点的异构状况,并在任务执行过程中根据实时作业环境更新节点优先级指数,为系统任务找到最佳的执行节点完成任务分配。实验结果表明,相比于Flink默认的任务调度策略,基于节点优先级调整方法的自适应调度策略在WorldCount基准测试中的运行时间约平均减少6%,可使异构Flink集群在保持集群低延迟的同时,节点资源利用率和任务执行效率更高。

关键词: Flink集群, 异构集群, 负载不均衡, 节点优先级, 自适应调度

Abstract: The default task scheduling strategy of the Flink stream processing system ignores the cluster heterogeneity and available resources of nodes to a certain extent, resulting in an unbalanced overall cluster load.This study investigates the real-time performance of distributed nodes and the cluster operation environment and designs a node priority-adjustment method based on heterogeneous Flink clusters according to the heterogeneous distribution problem of the actual operation environment.The method dynamically adjusts the node priority index that adapts to the current operating environment based on the cluster information of the Ganglia scalable distributed cluster resource-monitoring system.Based on this, a dynamic adaptive scheduling strategy for link nodes is proposed.By monitoring the heterogeneous status of nodes in real time and updating the node priority index according to the real-time working environment during the task execution process, the best execution node for the system task to complete the task assignment can be found.The experimental results show that compared with Flink's default task scheduling strategy, the adaptive scheduling strategy based on the node priority adjustment method reduces the running time of the WorldCount benchmark by approximately 6% on average.This enables the heterogeneous Flink cluster to maintain low cluster latency while maintaining higher node resource utilization and task execution efficiency.

Key words: Flink cluster, heterogeneous cluster, load unbalancing, node priority, adaptive scheduling

中图分类号: