作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (5): 316-320. doi: 10.19678/j.issn.1000-3428.0057693

• 开发研究与工程应用 • 上一篇    

基于云端可视化交互的强化学习平台

姚铁锤1,2, 王珏1,2, 王彦棡1,2, 迟学斌1,2, 王晓光1   

  1. 1. 中国科学院计算机网络信息中心, 北京 100190;
    2. 中国科学院大学 计算机科学与技术学院, 北京 100049
  • 收稿日期:2020-03-12 修回日期:2020-05-13 发布日期:2020-05-21
  • 作者简介:姚铁锤(1993-),男,博士研究生,主研方向为强化学习、高性能计算;王珏(通信作者),副研究员;王彦棡、迟学斌,研究员、博士生导师;王晓光,工程师。
  • 基金资助:
    国家重点研发计划“大规模并行计算的工具库和领域相关基础软件包”(2017YFB0202202);“中国科技云”建设工程(二期)项目“超算资源池建设”(XXH13503);国家电网有限公司总部科技项目“电力人工智能实验及公共服务平台技术”(SGGR0000JSJS1800569)。

Reinforcement Learning Platform Based on Cloud Visual Interaction

YAO Tiechui1,2, WANG Jue1,2, WANG Yangang1,2, CHI Xuebin1,2, WANG Xiaoguang1   

  1. 1. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China;
    2. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2020-03-12 Revised:2020-05-13 Published:2020-05-21

摘要: 强化学习是一个与环境交互的学习过程,在实验场景中,训练环境部署的可扩展性和算法验证的便捷性常受限于物理引擎和渲染模块的高耦合性。为对物理引擎和渲染模块进行解耦,构建一种面向物理引擎和渲染模块的云端交互式模型,其中包括操作字典、元素字典和对应的算法接口,并基于该模型实现模拟器。通过集成模拟器、可视化工具和知识管理等组件,搭建支持云端可视化交互的强化学习平台,并以MuJoCo物理引擎为例,验证Web模拟器接入自定义物理引擎的便捷性。实验和分析结果验证了该模型的有效性,其可方便接入平台,实现云端渲染并提高所属集群的利用率。

关键词: 强化学习平台, 物理引擎, 渲染模块, 云端可视化交互, 接口标准

Abstract: Reinforcement learning is a learning process that interacts with the environment.In the experiment environment,the scalability of the training environment deployment and the convenience of algorithm verification are often limited by the high coupling between the physics engine and the rendering module.To solve the problem,this paper proposes a Cloud Interactive Model(CIM) for physics engine and rendering module,which consists of an operation dictionary,element dictionary and relevant algorithm interfaces,and on this basis implements a simulator.Furthermore, this paper integrates the simulator,visualization tools,knowledge management and other components to build a Reinforcement Learning Platform(RLP) supporting cloud visual interaction.By taking the MuJoCo physics engine as an example,the Web simulator is verified for its convenience of access to a custom physics engine.Experimental and analytical results show that this model can be conveniently connected to the platform to realize cloud rendering and improve the utilization rate of its cluster.

Key words: Reinforcement Learning Platform(RLP), physics engine, rendering module, cloud visual interaction, interface standard

中图分类号: