A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships

Bu, Shaofeng; Xie, Wenming; Peng, Xiaodong; Shen, Xuchen; Ren, Jingyi; Qin, Runnan

doi:10.3390/aerospace13060551

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships

by

Shaofeng Bu

^1,2,

Wenming Xie

^1,2,*,

Xiaodong Peng

^1,*,

Xuchen Shen

¹,

Jingyi Ren

¹ and

Runnan Qin

^1,2

¹

National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China

²

University of Chinese Academy of Sciences, Beijing 100049, China

^*

Authors to whom correspondence should be addressed.

Aerospace 2026, 13(6), 551; https://doi.org/10.3390/aerospace13060551 (registering DOI)

Submission received: 6 May 2026 / Revised: 29 May 2026 / Accepted: 11 June 2026 / Published: 12 June 2026

(This article belongs to the Special Issue Design, Characterization and Operations of Lighter-than-Air Flying Vehicles)

Download Versions Notes

Abstract

Long-term station-keeping of stratospheric airships is challenged by strong time-varying wind fields, pronounced vertical stratification of wind speed and direction, and limited onboard energy. Existing reinforcement-learning approaches typically rely on instantaneous observations to make reactive decisions and therefore struggle to deliver foresighted control in dynamic environments. This paper proposes a Spatio-Temporal Foresight Reinforcement-Learning framework (STF-RL) that explicitly incorporates future wind information. A Transformer is introduced to model multi-step, multi-altitude forecast wind sequences, and a time–height dual positional encoding is designed to characterize both the temporal evolution and the vertical structure of the wind field. A task-conditioned attention pooling mechanism then extracts the future-wind features most relevant to the current state, which are concatenated with the airship state and fed into an actor–critic network to enable foresighted policy learning. A continuous action space supporting three-dimensional maneuvering is constructed, together with a multi-objective reward that jointly accounts for station-keeping performance, energy consumption and safety. Experimental results show that the proposed method outperforms baseline approaches in station-keeping performance, trajectory stability and energy-utilization efficiency, while exhibiting strong robustness across different wind-field conditions.

Keywords: transformer; reinforcement learning; stratospheric airship; path planning

Share and Cite

MDPI and ACS Style

Bu, S.; Xie, W.; Peng, X.; Shen, X.; Ren, J.; Qin, R. A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships. Aerospace 2026, 13, 551. https://doi.org/10.3390/aerospace13060551

AMA Style

Bu S, Xie W, Peng X, Shen X, Ren J, Qin R. A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships. Aerospace. 2026; 13(6):551. https://doi.org/10.3390/aerospace13060551

Chicago/Turabian Style

Bu, Shaofeng, Wenming Xie, Xiaodong Peng, Xuchen Shen, Jingyi Ren, and Runnan Qin. 2026. "A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships" Aerospace 13, no. 6: 551. https://doi.org/10.3390/aerospace13060551

APA Style

Bu, S., Xie, W., Peng, X., Shen, X., Ren, J., & Qin, R. (2026). A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships. Aerospace, 13(6), 551. https://doi.org/10.3390/aerospace13060551

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Spatio-Temporal Foresight Reinforcement-Learning Framework for Long-Term Station-Keeping of Stratospheric Airships

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI