This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation
by
Zhengsheng Chen
Zhengsheng Chen *
,
Junjie Xu
Junjie Xu ,
Dongdong Guan
Dongdong Guan
,
Xiaolong Zheng
Xiaolong Zheng and
Yujie Li
Yujie Li
PLA Rocket Force University of Engineering, Xi’an 710025, China
*
Author to whom correspondence should be addressed.
Sensors 2026, 26(12), 3686; https://doi.org/10.3390/s26123686 (registering DOI)
Submission received: 16 April 2026
/
Revised: 28 May 2026
/
Accepted: 4 June 2026
/
Published: 9 June 2026
Abstract
Building instance segmentation in remote sensing imagery supports applications such as urban management, disaster assessment, 3D urban modeling, and land-cover monitoring. However, variations in building scale, dense spatial distribution, complex background textures, shadows, and occlusions make it difficult to balance segmentation accuracy, boundary recovery, inference efficiency, and deployment cost. This study establishes a unified multidimensional benchmark for remote sensing building instance segmentation. The primary benchmark evaluates mask-predicting instance segmentation models, including YOLOv8-seg, YOLOv11-seg, YOLO26-seg, and Mask R-CNN, under consistent training and evaluation settings. RT-DETR-l and RT-DETR-x are retained only as auxiliary detection-only Transformer baselines because they do not output instance masks in the implemented setting. The benchmark covers bounding-box detection, mask-based segmentation, inference efficiency, model complexity, training behavior, and qualitative visualization. To assess cross-dataset transferability and degradation-specific robustness beyond a single dataset, we further conduct zero-shot WHU-to-Inria testing, independent Inria training/testing with different initialization strategies, and controlled degradation tests involving shadow/occlusion and Gaussian blur. Results on WHU and Inria show that high-capacity YOLO-seg models are competitive among the evaluated mask-predicting models. Under the current experimental settings, YOLOv11x-seg achieves the highest or near-highest mask-based accuracy, whereas YOLOv11m-seg provides a favorable balance between accuracy, speed, and complexity. The zero-shot WHU-to-Inria test reveals a clear domain shift, while the Inria in-domain experiments indicate that high-capacity YOLO-seg models recover competitive performance after target-domain training. The controlled degradation tests show a smaller performance drop under shadow/occlusion than under Gaussian blur for YOLOv11x-seg. These findings provide benchmark-specific evidence for selecting remote sensing building instance segmentation models under accuracy-oriented and efficiency-oriented deployment requirements.
Share and Cite
MDPI and ACS Style
Chen, Z.; Xu, J.; Guan, D.; Zheng, X.; Li, Y.
A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation. Sensors 2026, 26, 3686.
https://doi.org/10.3390/s26123686
AMA Style
Chen Z, Xu J, Guan D, Zheng X, Li Y.
A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation. Sensors. 2026; 26(12):3686.
https://doi.org/10.3390/s26123686
Chicago/Turabian Style
Chen, Zhengsheng, Junjie Xu, Dongdong Guan, Xiaolong Zheng, and Yujie Li.
2026. "A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation" Sensors 26, no. 12: 3686.
https://doi.org/10.3390/s26123686
APA Style
Chen, Z., Xu, J., Guan, D., Zheng, X., & Li, Y.
(2026). A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation. Sensors, 26(12), 3686.
https://doi.org/10.3390/s26123686
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.