[1] CHRIKI A, TOUATI H, SNOUSSI H, et al.FANET:communication, mobility models and security issues[J].Computer Networks, 2019, 163:106877. [2] 游静, 董超, 吴启晖.大规模无人机自组网分层体系架构研究综述[J].计算机科学, 2020, 47(9):226-231. YOU J, DONG C, WU Q H.Survey of layered architecture in large-scale FANETs[J].Computer Science, 2020, 47(9):226-231.(in Chinese) [3] SRIVASTAVA A, PRAKASH J.Future FANET with application and enabling techniques:anatomization and sustainability issues[J].Computer Science Review, 2021, 39:100359. [4] JEAN-AIMÉ M, MOHAMED-SLIM B M, NICOLAS L.Survey on UAANET routing protocols and network security challenges[EB/OL].[2021-06-05].https://hal-enac.archives-ouvertes.fr/hal-01465993/document. [5] NAZIB R A, MOH S.Reinforcement learning-based routing protocols for vehicular ad hoc networks:a comparative survey[J].IEEE Access, 2021, 9:27552-27587. [6] CHETTIBI S, CHIKHI S.A survey of reinforcement learning based routing protocols for mobile ad-hoc networks[M].Berlin, Germany:Springer, 2011. [7] JUNG W S, YIM J, KO Y B.QGeo:Q-learning-based geographic ad hoc routing protocol for unmanned robotic networks[J].IEEE Communications Letters, 2017, 21(10):2258-2261. [8] LIU J M, WANG Q, HE C T, et al.QMR:Q-learning based multi-objective optimization routing protocol for flying ad hoc networks[J].Computer Communications, 2020, 150:304-316. [9] SINGH K, VERMA A K.TBCS:a trust based clustering scheme for secure communication in flying ad-hoc networks[J].Wireless Personal Communications, 2020, 114(4):3173-3196. [10] WU C, OHZAHATA S, KATO T.Flexible, portable, and practicable solution for routing in VANETs:a fuzzy constraint Q-learning approach[J].IEEE Transactions on Vehicular Technology, 2013, 62(9):4251-4263. [11] ALSHEHRI A, BADAWY A H A, HUANG H.FQ-AGO:fuzzy logic Q-learning based asymmetric link aware and geographic opportunistic routing scheme for MANETs[J].Electronics, 2020, 9(4):576. [12] BIANCHI R A C, RIBEIRO C H C, COSTA A H R.Accelerating autonomous learning by using heuristic selection of actions[J].Journal of Heuristics, 2008, 14(2):135-168. [13] BIANCHI R A C, MARTINS M F, RIBEIRO C H C, et al.Heuristically-accelerated multiagent reinforcement learning[J].IEEE Transactions on Cybernetics, 2014, 44(2):252-265. [14] YANG X Y, ZHANG W L, LU H M, et al.V2V routing in VANET based on heuristic Q-learning[J].International Journal of Computers Communications & Control, 2020, 15(5):12-23. [15] KHANNA N, SACHDEVA M.Study of trust-based mechanism and its component model in MANET:current research state, issues, and future recommendation[J].International Journal of Communication Systems, 2019, 32(12):e4012. [16] XIA H, YU J, TIAN C L, et al.Light-weight trust-enhanced on-demand multi-path routing in mobile ad hoc networks[J].Journal of Network and Computer Applications, 2016, 62:112-127. [17] LIU G S, WANG X, LI X H, et al.ESRQ:an efficient secure routing method in wireless sensor networks based on Q-learning[C]//Proceedings of the 17th IEEE International Conference on Trust, Security and Privacy in Computing and Communications.Washington D.C., USA:IEEE Press, 2018:149-155. [18] DAI C H, XIAO X Y, DING Y Z, et al.Learning based security for VANET with blockchain[C]//Proceedings of IEEE International Conference on Communication Systems.Washington D.C., USA:IEEE Press, 2018:210-215. [19] ZHANG D J, YU F R, YANG R Z, et al.A deep reinforcement learning-based trust management scheme for software-defined vehicular networks[C]//Proceedings of the 8th ACM Symposium on Design and Analysis of Intelligent Vehicular Networks and Applications.New York, USA:ACM Press, 2018:1-7. [20] ZHANG D J, YU F R, YANG R Z.A machine learning approach for software-defined vehicular ad hoc networks with trust management[C]//Proceedings of IEEE Global Communications Conference.Washington D.C., USA:IEEE Press, 2018:1-6. [21] MAAKAR S K, SINGH Y, SINGH R.Flying ad hoc network:a newest research area for ad hoc networks[C]//Proceedings of the 2nd International Conference on Intelligent Communication and Computational Techniques.Washington D.C., USA:IEEE Press, 2019:298-302. [22] YU Y J, QIN Y, GONG H C.A fuzzy Q-learning algorithm for storage optimization in islanding microgrid[J].Journal of Electrical Engineering & Technology, 2021, 16(5):2343-2353. [23] SINGH K, VERMA A K.A fuzzy-based trust model for flying ad hoc networks(FANETs)[J].International Journal of Communication Systems, 2018, 31:23-47. |