Next Article in Journal
Aircraft Longitudinal Aerodynamic Parameter Identification of Kernel Extreme Learning Machine Based on Improved Northern Goshawk Algorithm
Previous Article in Journal
Results of the H2Avia Project: Potential of Hydrogen for Global Aviation
Previous Article in Special Issue
Servo-Elastic Control of a Flexible Airship with Multiple Vectored Propellers
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships

1
National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China
2
University of Chinese Academy of Sciences, Beijing 100049, China
*
Authors to whom correspondence should be addressed.
Aerospace 2026, 13(6), 551; https://doi.org/10.3390/aerospace13060551 (registering DOI)
Submission received: 6 May 2026 / Revised: 29 May 2026 / Accepted: 11 June 2026 / Published: 12 June 2026

Abstract

Long-term station-keeping of stratospheric airships is challenged by strong time-varying wind fields, pronounced vertical stratification of wind speed and direction, and limited onboard energy. Existing reinforcement-learning approaches typically rely on instantaneous observations to make reactive decisions and therefore struggle to deliver foresighted control in dynamic environments. This paper proposes a Spatio-Temporal Foresight Reinforcement-Learning framework (STF-RL) that explicitly incorporates future wind information. A Transformer is introduced to model multi-step, multi-altitude forecast wind sequences, and a time–height dual positional encoding is designed to characterize both the temporal evolution and the vertical structure of the wind field. A task-conditioned attention pooling mechanism then extracts the future-wind features most relevant to the current state, which are concatenated with the airship state and fed into an actor–critic network to enable foresighted policy learning. A continuous action space supporting three-dimensional maneuvering is constructed, together with a multi-objective reward that jointly accounts for station-keeping performance, energy consumption and safety. Experimental results show that the proposed method outperforms baseline approaches in station-keeping performance, trajectory stability and energy-utilization efficiency, while exhibiting strong robustness across different wind-field conditions.
Keywords: transformer; reinforcement learning; stratospheric airship; path planning transformer; reinforcement learning; stratospheric airship; path planning

Share and Cite

MDPI and ACS Style

Bu, S.; Xie, W.; Peng, X.; Shen, X.; Ren, J.; Qin, R. A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships. Aerospace 2026, 13, 551. https://doi.org/10.3390/aerospace13060551

AMA Style

Bu S, Xie W, Peng X, Shen X, Ren J, Qin R. A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships. Aerospace. 2026; 13(6):551. https://doi.org/10.3390/aerospace13060551

Chicago/Turabian Style

Bu, Shaofeng, Wenming Xie, Xiaodong Peng, Xuchen Shen, Jingyi Ren, and Runnan Qin. 2026. "A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships" Aerospace 13, no. 6: 551. https://doi.org/10.3390/aerospace13060551

APA Style

Bu, S., Xie, W., Peng, X., Shen, X., Ren, J., & Qin, R. (2026). A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships. Aerospace, 13(6), 551. https://doi.org/10.3390/aerospace13060551

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop