A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation

Chen, Zhengsheng; Xu, Junjie; Guan, Dongdong; Zheng, Xiaolong; Li, Yujie

doi:10.3390/s26123686

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation

by

Zhengsheng Chen

^*

,

Junjie Xu

,

Dongdong Guan

,

Xiaolong Zheng

and

Yujie Li

PLA Rocket Force University of Engineering, Xi’an 710025, China

^*

Author to whom correspondence should be addressed.

Sensors 2026, 26(12), 3686; https://doi.org/10.3390/s26123686 (registering DOI)

Submission received: 16 April 2026 / Revised: 28 May 2026 / Accepted: 4 June 2026 / Published: 9 June 2026

(This article belongs to the Section Remote Sensors)

Download Versions Notes

Abstract

Building instance segmentation in remote sensing imagery supports applications such as urban management, disaster assessment, 3D urban modeling, and land-cover monitoring. However, variations in building scale, dense spatial distribution, complex background textures, shadows, and occlusions make it difficult to balance segmentation accuracy, boundary recovery, inference efficiency, and deployment cost. This study establishes a unified multidimensional benchmark for remote sensing building instance segmentation. The primary benchmark evaluates mask-predicting instance segmentation models, including YOLOv8-seg, YOLOv11-seg, YOLO26-seg, and Mask R-CNN, under consistent training and evaluation settings. RT-DETR-l and RT-DETR-x are retained only as auxiliary detection-only Transformer baselines because they do not output instance masks in the implemented setting. The benchmark covers bounding-box detection, mask-based segmentation, inference efficiency, model complexity, training behavior, and qualitative visualization. To assess cross-dataset transferability and degradation-specific robustness beyond a single dataset, we further conduct zero-shot WHU-to-Inria testing, independent Inria training/testing with different initialization strategies, and controlled degradation tests involving shadow/occlusion and Gaussian blur. Results on WHU and Inria show that high-capacity YOLO-seg models are competitive among the evaluated mask-predicting models. Under the current experimental settings, YOLOv11x-seg achieves the highest or near-highest mask-based accuracy, whereas YOLOv11m-seg provides a favorable balance between accuracy, speed, and complexity. The zero-shot WHU-to-Inria test reveals a clear domain shift, while the Inria in-domain experiments indicate that high-capacity YOLO-seg models recover competitive performance after target-domain training. The controlled degradation tests show a smaller performance drop under shadow/occlusion than under Gaussian blur for YOLOv11x-seg. These findings provide benchmark-specific evidence for selecting remote sensing building instance segmentation models under accuracy-oriented and efficiency-oriented deployment requirements.

Keywords: YOLO; building segmentation; remote sensing; model evaluation

Share and Cite

MDPI and ACS Style

Chen, Z.; Xu, J.; Guan, D.; Zheng, X.; Li, Y. A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation. Sensors 2026, 26, 3686. https://doi.org/10.3390/s26123686

AMA Style

Chen Z, Xu J, Guan D, Zheng X, Li Y. A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation. Sensors. 2026; 26(12):3686. https://doi.org/10.3390/s26123686

Chicago/Turabian Style

Chen, Zhengsheng, Junjie Xu, Dongdong Guan, Xiaolong Zheng, and Yujie Li. 2026. "A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation" Sensors 26, no. 12: 3686. https://doi.org/10.3390/s26123686

APA Style

Chen, Z., Xu, J., Guan, D., Zheng, X., & Li, Y. (2026). A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation. Sensors, 26(12), 3686. https://doi.org/10.3390/s26123686

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Unified Multidimensional Benchmark and Multi-Dataset Evaluation of YOLO-Based Models for Remote Sensing Building Instance Segmentation

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI