AI-Curling: An On-Site Curling Analysis and Decision-Making Method

doi:10.19678/j.issn.1000-3428.0069106

Abstract:

To meet the need for intelligent curling training, this paper proposes AI-Curling, an on-site curling decision-making method that combines computer vision with deep Reinforcement Learning (RL). AI-Curling comprises two components: SR-Yolo for curling stone detection and Global Strategy Perception-Monte Carlo Tree Search (GSP-MCTS) for strategy generation. SR-Yolo senses the state of the stones at critical moments and extracts the position and type of each stone in real scenes. To improve the detection accuracy of small targets in large scenes and prevent feature loss caused by inappropriate downsampling, a Shallow Refinement Backbone Network (SRNet) is introduced, which adds layers in the early stages of the network to capture richer feature information. In addition, an Adaptive Feature Optimization Fusion (AFOF) module is introduced into the multiscale fusion network to increase the number of effective samples in each layer, preventing small-scale targets from being submerged in complex backgrounds and noise. GSP-MCTS performs curling match decision analysis by combining the Monte Carlo Tree Search (MCTS) algorithm with a policy-value network. It introduces a kernel function to handle the continuity of the action space and execution uncertainty, and embeds a Global Strategy Perception (GSP) module in the policy-value network to enhance the network's spatial perception. In experiments, SR-Yolo achieved a mean Average Precision (mAP@0.5) of 0.974 on the standard Curling dataset and 0.723 on Curling_hard, a more complex dataset with heavy occlusion. GSP-MCTS achieved a 62% win rate against Curling MCTS, the latest real-scene curling model, indicating that GSP-MCTS has better performance.
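As a reading aid for the mAP@0.5 figures: under this metric a predicted box is normally counted as a true positive when its intersection over union (IoU) with a ground-truth box of the same class is at least 0.5. The following Python sketch illustrates only that matching criterion. The box format `(x1, y1, x2, y2)` and the function names are assumptions made for illustration and are not taken from the paper.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    # Corners of the overlapping rectangle (may be empty).
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, gt_box, threshold=0.5):
    """Hypothetical helper: does the prediction match at the given IoU threshold?"""
    return iou(pred_box, gt_box) >= threshold
```

A full mAP computation additionally ranks predictions by confidence, builds a precision-recall curve per class, and averages the resulting average precision over classes; those steps are omitted here.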

Key words: Reinforcement Learning (RL), deep learning, curling detection, small object detection, Monte Carlo Tree Search (MCTS)
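The abstract states that GSP-MCTS uses a kernel function to cope with a continuous action space and execution uncertainty, but does not give the formulation. One common way to do this in tree search is kernel regression: statistics of nearby sampled shots are shared, so a shot's value estimate is a kernel-weighted average over its neighbours. The Python sketch below follows that general idea. The Gaussian kernel, the bandwidth, the `(action, mean_value, visit_count)` sample format, and all function names are assumptions for illustration, not the authors' implementation.

```python
import math


def gaussian_kernel(a, b, bandwidth=1.0):
    """Similarity between two continuous actions given as equal-length tuples."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def kernel_stats(action, samples, bandwidth=1.0):
    """Kernel-regression value and kernel-density visit count at `action`.

    `samples` is a list of (action, mean_value, visit_count) tuples.
    """
    weights = [gaussian_kernel(action, b, bandwidth) * n for b, _, n in samples]
    density = sum(weights)
    if density == 0.0:
        return 0.0, 0.0
    value = sum(w * v for w, (_, v, _) in zip(weights, samples)) / density
    return value, density


def select_action(samples, c=1.0, bandwidth=1.0):
    """Pick the sampled action with the highest kernel-smoothed UCB score.

    Assumes every sample has visit_count >= 1, so each density is >= 1.
    """
    stats = [kernel_stats(a, samples, bandwidth) for a, _, _ in samples]
    total = sum(density for _, density in stats)

    def score(i):
        value, density = stats[i]
        return value + c * math.sqrt(math.log(total) / density)

    best = max(range(len(samples)), key=score)
    return samples[best][0]
```

In this sketch the bandwidth controls how far information spreads between shots. Tying it to the scale of the execution noise is one plausible choice, since shots that land close together after noise should have similar values, but that link is an assumption here rather than something the abstract specifies.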

SUN Haomiao, LI Zongmin, XIAO Qian, SUN Wenjie, ZHANG Wenxin. AI-Curling: An On-Site Curling Analysis and Decision-Making Method[J]. Computer Engineering (计算机工程), 2025, 51(2): 102-110.

https://www.ecice06.com/CN/Y2025/V51/I2/102

Figures/Tables: 10

Fig.1 Overall framework of AI-Curling

Fig.2 Overall structure of SR-Yolo

Fig.3 Architecture of the AFOF module

Fig.4 Decoupled training framework

Fig.5 Decision-making results of AI-Curling
