全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

分层强化学习在无人机领域应用综述
A Review of the Application of Hierarchical Reinforcement Learning in the Field of Drones

DOI: 10.12677/AIRR.2024.131008, PP. 66-71

Keywords: 分层强化学习,无人机,人工智能
Hierarchical Reinforcement Learning
, Drone, Artificial Intelligence

Full-Text   Cite this paper   Add to My Lib

Abstract:

分层强化学习是强化学习领域的一个重要分支。基于分而治之的思想,将一个复杂问题分解成多个子问题,最终解决整个问题。近年来,由于传感器能力的提高和人工智能算法的进步,基于分层强化学习的无人机自主导航成为研究热点。本篇文章对国内外发表的具有代表性的文章进行概述,首先分析无人机和分层强化学习的含义,其次重点研究了分层强化学习在无人机轨迹规划和资源分配的优化问题上的应用。
Hierarchical reinforcement learning is an important branch in the field of reinforcement learning. Based on the idea of divide and conquer, a complex problem is decomposed into multiple sub-problems and finally the entire problem is solved. In recent years, due to the improvement of sensor capabilities and the advancement of artificial intelligence algorithms, autonomous drone navigation based on hierarchical reinforcement learning has become a research hotspot. This article provides an overview of representative articles published at home and abroad. First, it analyzes the meaning of UAVs and hierarchical reinforcement learning. Secondly, it focuses on the application of hierarchical reinforcement learning in UAV trajectory planning and resource allocation problems.

References

[1]  Bellman, R. (1954) The Theory of Dynamic Programming. Bulletin of the American Mathematical Society, 60, 503-515.
https://doi.org/10.1090/S0002-9904-1954-09848-8
[2]  Sutton, R.S., Precup, D. and Singh, S. (1999) Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning. Artificial Intelligence, 112, 181-211.
https://doi.org/10.1016/S0004-3702(99)00052-1
[3]  Dietterich, T.G. (2000) Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
https://doi.org/10.1613/jair.639
[4]  Dayan, P. and Hinton, G.E. (1992) Feudal Reinforcement Learning. Advances in Neural Information Processing Systems, 5, 272-278.
[5]  Vezhnevets, A.S., Osindero, S., Schaul, T., et al. (2017) Feudal Networks for Hierarchical Reinforcement Learning. International Conference on Machine Learning, Sydney, 6 August 2017, 3540-3549.
[6]  Kou, K., Yang, G., Zhang, W., et al. (2022) Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning. 2022 International Conference on Automation, Robotics and Computer Engineering (ICARCE), Wuhan, 16-17 December 2022, 1-5.
https://doi.org/10.1109/ICARCE55724.2022.10046655
[7]  Alpdemir, M.N. (2023) A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments. Turkish Journal of Science and Technology, 18, 243-259.
https://doi.org/10.55525/tjst.1219845
[8]  程先峰, 严勇杰. 基于MAXQ分层强化学习的有人机/无人机协同路径规划研究[J]. 信息化研究, 2020, 46(1): 13-19.
https://doi.org/CNKI:SUN:DZGS.0.2020-01-004
[9]  Cheng, Y., Li, D., Wong, W.E., et al. (2022) Multi-UAV Collaborative Path Planning Using Hierarchical Reinforcement Learning and Simulated Annealing. International Journal of Performability Engineering, 18, 463-474.
https://doi.org/10.23940/ijpe.22.07.p1.463474
[10]  Ren, T., Niu, J., Dai, B., et al. (2021) Enabling Efficient Scheduling in Large-Scale UAV-Assisted Mobile-Edge Computing via Hierarchical Reinforcement Learning. IEEE Internet of Things Journal, 9, 7095-7109.
https://doi.org/10.1109/JIOT.2021.3071531
[11]  Zhang, Y., Mou, Z, Gao, F., et al. (2020) Hierarchical Deep Reinforcement Learning for Backscattering Data Collection with Multiple UAVs. IEEE Internet of Things Journal, 8, 3786-3800.
https://doi.org/10.1109/JIOT.2020.3024666

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413