1 |
闫皎洁, 张锲石, 胡希平. 基于强化学习的路径规划技术综述. 计算机工程, 2021, 47 (10): 16- 25.
URL
|
|
YAN J J, ZHANG Q S, HU X P. Review of path planning techniques based on reinforcement learning. Computer Engineering, 2021, 47 (10): 16- 25.
URL
|
2 |
孙辉辉, 胡春鹤, 张军国. 移动机器人运动规划中的深度强化学习方法. 控制与决策, 2021, 36 (6): 1281- 1292.
doi: 10.13195/j.kzyjc.2020.0470
|
|
SUN H H, HU C H, ZHANG J G. Deep reinforcement learning for motion planning of mobile robots. Control and Decision, 2021, 36 (6): 1281- 1292.
doi: 10.13195/j.kzyjc.2020.0470
|
3 |
ZHU K, ZHANG T. Deep reinforcement learning based mobile robot navigation: a review. Tsinghua Science and Technology, 2021, 26 (5): 674- 691.
doi: 10.26599/TST.2021.9010012
|
4 |
黄锐. 基于深度强化学习的移动机器人导航策略研究[D]. 成都: 电子科技大学, 2021.
|
|
HUANG R. Research on navigation strategy of mobile robot based on deep reinforcement learning[D]. Chengdu: University of Electronic Science and Technology of China, 2021. (in Chinese)
|
5 |
|
6 |
KULHANEK J, DERNER E, BABUSKA R. Visual navigation in real-world indoor environments using end-to-end deep reinforcement learning. IEEE Robotics and Automation Letters, 2021, 6 (3): 4345- 4352.
doi: 10.1109/LRA.2021.3068106
|
7 |
YOKOYAMA K, MORIOKA K. Autonomous mobile robot with simple navigation system based on deep reinforcement learning and a monocular camera[C]//Proceedings of 2020 IEEE/SICE International Symposium on System Integration. Honolulu, USA: IEEE Press, 2020: 525-530.
|
8 |
杨思明, 单征, 丁煜, 等. 深度强化学习研究综述. 计算机工程, 2021, 47 (12): 19- 29.
URL
|
|
YANG S M, SHAN Z, DING Y, et al. Survey of research on deep reinforcement learning. Computer Engineering, 2021, 47 (12): 19- 29.
URL
|
9 |
ZENG L K, YAO W, SHUAI H, et al. Resilience assessment for power systems under sequential attacks using double DQN with improved prioritized experience replay. IEEE Systems Journal, 2023, 17 (2): 1865- 1876.
doi: 10.1109/JSYST.2022.3171240
|
10 |
刘颖. 深度强化学习中的经验回放研究[D]. 南京: 东南大学, 2021.
|
|
LIU Y. Research on experience replay in deep reinforcement learning[D]. Nanjing: Southeast University, 2021. (in Chinese)
|
11 |
|
12 |
孙涵彬. 基于经验池重采样的强化学习算法优化[D]. 成都: 电子科技大学, 2022.
|
|
SUN H B. Optimization of reinforcement learning algorithm based on experience pool resampling[D]. Chengdu: University of Electronic Science and Technology of China, 2022. (in Chinese)
|
13 |
陈茜. 基于经验回放机制的深度强化学习算法改进及应用[D]. 南京: 东南大学, 2021.
|
|
CHEN Q. Improvement and application of deep reinforcement learning algorithm based on experience playback mechanism[D]. Nanjing: Southeast University, 2021. (in Chinese)
|
14 |
MORO L, LIKMETA A, PRATI E, et al. Goal-directed planning via hindsight experience replay[C]//Proceedings of International Conference on Learning Representations. Washington D. C., USA: IEEE Press, 2022: 1-16.
|
15 |
LI K Y, LU Y, MENG M Q H. Human-aware robot navigation via reinforcement learning with hindsight experience replay and curriculum learning[C]//Proceedings of 2021 IEEE International Conference on Robotics and Biomimetics. Washington D. C., USA: IEEE Press, 2021: 346-351.
|
16 |
LUU T M, YOO C D. Hindsight goal ranking on replay buffer for sparse reward environment. IEEE Access, 2021, 9, 51996- 52007.
doi: 10.1109/ACCESS.2021.3069975
|
17 |
FANG M, ZHOU T, DU Y, et al. Curriculum-guided hindsight experience replay[C]//Proceedings of Advances in Neural Information Processing Systems. Cambridge, USA: MIT Press, 2019: 32.
|
18 |
VECCHIETTI L F, SEO M, HAR D. Sampling rate decay in hindsight experience replay for robot control. IEEE Transactions on Cybernetics, 2022, 52 (3): 1515- 1526.
doi: 10.1109/TCYB.2020.2990722
|
19 |
|
20 |
|
21 |
张峻伟, 吕帅, 张正昊, 等. 基于样本效率优化的深度强化学习方法综述. 软件学报, 2022, 33 (11): 4217- 4238.
|
|
ZHANG J W, LÜ S, ZHANG Z H, et al. Summary of deep reinforcement learning methods based on sample efficiency optimization. Journal of Software, 2022, 33 (11): 4217- 4238.
|
22 |
WOŁCZYK M, KRUTSYLO A. Remember more by recalling less: investigating the role of batch size in continual learning with experience replay(student abstract). Artificial Intelligence, 2021, 35 (18): 15923- 15924.
|
23 |
VAN HASSELT H, GUEZ A, SILVER D. Deep reinforcement learning with double Q-learning[C]//Proceedings of the 30th AAAI Conference on Artificial Intelligence. New York, USA: ACM Press, 2016: 2094-2100.
|
24 |
WANG Z Y, SCHAUL T, HESSEL M, et al. Dueling network architectures for deep reinforcement learning[C]//Proceedings of the 33rd International Conference on Machine Learning. New York, USA: ACM Press, 2016: 1995-2003.
|
25 |
|
26 |
|
27 |
TAN R R P, IKEDA K, VERGARA J P C. Hindsight-combined and hindsight-prioritized experience replay. Berlin, Germany: Springer, 2020.
|
28 |
袁帅, 张莉莉, 顾琦然, 等. 移动机器人优先采样D3QN路径规划方法研究. 小型微型计算机系统, 2023, 44 (5): 923- 929.
|
|
YUAN S, ZHANG L L, GU Q R, et al. Research on D3QN path planning method of mobile robot priority sampling. Journal of Chinese Computer Systems, 2023, 44 (5): 923- 929.
|