Robust Neurocontrol for Autonomous Dynamic Soaring
Abstract
The flight endurance of small unmanned aerial vehicles can be significantly extended through the exploitation of naturally occurring wind phenomena. However, due to the limited computational hardware on board such aircraft and the uncertain, stochastic nature of real-world environments, there is a need for efficient and robust strategies that exhibit generalized behavior. In addressing these objectives, recent efforts have explored the use of artificial intelligence training algorithms and neural networks for the design of autonomous control schemes that exploit such wind phenomena. This study incorporates the Neuroevolution of Augmenting Topologies algorithm with domain randomization to train robust neurocontrollers that can control an aircraft along sustained traveling dynamic soaring trajectories in the presence of uncertainties and disturbances. This work presents the developed strategy for integrating robustness in neural network control systems, provides a method for quantifying and comparing robustness, and introduces an approach for identifying the network characteristics that contribute to the evolved robust behavior.
References
[1] , “Dynamic Soaring: Aerodynamics for Albatrosses,” European Journal of Physics, Vol. 30, No. 1, 2008, pp. 75–84. https://doi.org/10.1088/0143-0807/30/1/008
[2] , “Optimal Patterns of Glider Dynamic Soaring,” Optimal Control Applications and Methods, Vol. 25, No. 2, 2004, pp. 67–89. https://doi.org/10.1002/oca.739
[3] , “Minimum Shear Wind Strength Required for Dynamic Soaring of Albatrosses,” Ibis, Vol. 147, No. 1, 2005, pp. 1–10. https://doi.org/10.1111/j.1474-919x.2004.00295.x
[4] , “Efficient Trajectory Development for Small Unmanned Aircraft Dynamic Soaring Applications,” Journal of Guidance, Control, and Dynamics, Vol. 38, No. 3, 2015, pp. 519–523. https://doi.org/10.2514/1.G000543
[5] , “Real-Time Optimal Techniques for Unmanned Air Vehicles Fuel Saving,” Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering, Vol. 226, No. 10, 2012, pp. 1315–1328. https://doi.org/10.1177/0954410011418881
[6] , “Dubins Path-Based Dynamic Soaring Trajectory Planning and Tracking Control in a Gradient Wind Field,” Optimal Control Applications and Methods, Vol. 38, No. 2, 2017, pp. 147–166. https://doi.org/10.1002/oca.2248
[7] , “Parameterized Trajectory Planning for Dynamic Soaring,” AIAA SciTech 2020 Forum, AIAA Paper 2020-0856, 2020. https://doi.org/10.2514/6.2020-0856
[8] , “Hierarchical Reinforcement Learning Approach Towards Autonomous Cross-Country Soaring,” AIAA SciTech 2021 Forum, AIAA Paper 2021-2010, 2021. https://doi.org/10.2514/6.2021-2010
[9] , “Learning to Soar in Turbulent Environments,” Proceedings of the National Academy of Sciences, Vol. 113, No. 33, 2016, pp. E4877–E4884. https://doi.org/10.1073/pnas.1606075113
[10] , “Glider Soaring via Reinforcement Learning in the Field,” Nature, Vol. 562, No. 7726, 2018, pp. 236–239. https://doi.org/10.1038/s41586-018-0533-0
[11] , “Deep Neural Network-Based Feedback Control for Dynamic Soaring of Unpowered Aircraft,” IFAC-PapersOnLine, Vol. 52, No. 12, 2019, pp. 117–121. https://doi.org/10.1016/j.ifacol.2019.11.079
[12] , “Reinforcement Learning for Autonomous Dynamic Soaring in Shear Winds,” 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, Chicago, IL, 2014, pp. 3423–3428, https://ieeexplore.ieee.org/document/6943039. https://doi.org/10.1109/IROS.2014.6943039
[13] , “Autonomous Soaring Using Reinforcement Learning for Trajectory Generation,” 52nd Aerospace Sciences Meeting, AIAA SciTech Forum, AIAA Paper 2014-0990, 2014. https://doi.org/10.2514/6.2014-0990
[14] , “Learning to Soar: Resource-Constrained Exploration in Reinforcement Learning,” International Journal of Robotics Research, Vol. 34, No. 2, Dec. 2014, pp. 158–172. https://doi.org/10.1177/0278364914553683
[15] , “Neuro-Evolutionary Control for Optimal Dynamic Soaring,” AIAA SciTech 2020 Forum, AIAA Paper 2020-1946, 2020. https://doi.org/10.2514/6.2020-1946
[16] , “Evolving Neural Networks Through Augmenting Topologies,” Evolutionary Computation, Vol. 10, No. 2, 2002, pp. 99–127. https://doi.org/10.1162/106365602320169811
[17] , “Neuroevolutionary Control for Autonomous Soaring,” Aerospace, Vol. 8, No. 9, 2021, Paper 267. https://doi.org/10.3390/aerospace8090267
[18] , “Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World,” 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Inst. of Electrical and Electronics Engineers, New York, 2017, pp. 23–30. https://doi.org/10.1109/IROS.2017.8202133
[19] , “The Soaring of Birds,” Nature, Vol. 27, No. 701, 1883, pp. 534–535. https://doi.org/10.1038/027534a0
[20] , Monte Carlo Theory, Methods and Examples, Vol. 16, 2013, pp. 19–22.
[21] , “Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization,” 2019 International Conference on Unmanned Aircraft Systems (ICUAS), 2019, pp. 523–533, https://ieeexplore.ieee.org/document/8798254. https://doi.org/10.1109/ICUAS.2019.8798254