Next Article in Journal
Absolute Calibration of Weather Radars Using Metal Spheres Based on Sector Scanning
Previous Article in Journal
Phenology-Adaptive Prediction of Walnut Leaf Area Index from UAV Multispectral Data via Hybrid Feature Selection and SHAP-Enhanced Machine Learning
Previous Article in Special Issue
Unsupervised Oil Spill Detection in Shipborne Radar Imagery Using Autoencoder-Enhanced Q-Learning and Improved Bat Optimization
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

AirplaneGen: Skeleton-Guided Generation of Remote Sensing Images with Multi-Instance Airplanes

1
The School of Electronic Engineering, Xidian University, Xi’an 710071, China
2
The National Key Laboratory of Scattering and Radiation, Shanghai 200438, China
3
The School of Computer Science and Engineering, The Southern University of Science and Technology, Shenzhen 518055, China
4
Emotion Machine Lab, Beijing 100101, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(12), 1940; https://doi.org/10.3390/rs18121940
Submission received: 20 April 2026 / Revised: 4 June 2026 / Accepted: 9 June 2026 / Published: 11 June 2026

Abstract

Generating realistic and controllable aerial images is important for building and evaluating remote sensing recognition systems, especially when real samples of rare aircraft types or dense airport layouts are limited. However, airplane synthesis remains challenging for generic generative models. Aircraft have rigid and symmetric structures, and airport scenes often contain many closely spaced instances; as a result, existing models tend to produce distorted wings and fuselages or merge adjacent airplanes into ambiguous shapes. To address these issues, we propose AirplaneGen, a skeleton-guided latent diffusion framework for multi-airplane remote sensing image generation. AirplaneGen represents each airplane with an editable eight-keypoint skeleton and uses skeleton-derived soft masks to separate instance-level refinement from background-context modeling during denoising. To support this task, we construct MARS20, a benchmark with 2778 high-resolution aerial scenes and 16,673 airplane instances annotated with skeletons, categories, and contextual descriptions. Experiments on MARS20 show that AirplaneGen improves image fidelity, geometric consistency, and instance separation over representative controllable generation methods.
Keywords: remote sensing; image generation; multi-instance; geometric constraint; diffusion model remote sensing; image generation; multi-instance; geometric constraint; diffusion model

Share and Cite

MDPI and ACS Style

Zhu, L.; Ma, Y.; Wu, J.; Fan, Y.; Wang, X.; Tan, M. AirplaneGen: Skeleton-Guided Generation of Remote Sensing Images with Multi-Instance Airplanes. Remote Sens. 2026, 18, 1940. https://doi.org/10.3390/rs18121940

AMA Style

Zhu L, Ma Y, Wu J, Fan Y, Wang X, Tan M. AirplaneGen: Skeleton-Guided Generation of Remote Sensing Images with Multi-Instance Airplanes. Remote Sensing. 2026; 18(12):1940. https://doi.org/10.3390/rs18121940

Chicago/Turabian Style

Zhu, Lingxuan, Yanze Ma, Jiaji Wu, Yanbo Fan, Xiaobing Wang, and Mingzhou Tan. 2026. "AirplaneGen: Skeleton-Guided Generation of Remote Sensing Images with Multi-Instance Airplanes" Remote Sensing 18, no. 12: 1940. https://doi.org/10.3390/rs18121940

APA Style

Zhu, L., Ma, Y., Wu, J., Fan, Y., Wang, X., & Tan, M. (2026). AirplaneGen: Skeleton-Guided Generation of Remote Sensing Images with Multi-Instance Airplanes. Remote Sensing, 18(12), 1940. https://doi.org/10.3390/rs18121940

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop