Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing

Xu, Junjie; Chen, Zhengsheng; Zhang, Qinghua; Zhu, Mulei

doi:10.3390/rs18121955

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing

¹

PLA Rocket Force University of Engineering, Xi’an 710025, China

²

School of Geography, Nanjing Normal University, Nanjing 210023, China

³

School of Geoscience and Technology, SouthWest Petroleum University, Chengdu 610500, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2026, 18(12), 1955; https://doi.org/10.3390/rs18121955 (registering DOI)

Submission received: 27 April 2026 / Revised: 7 June 2026 / Accepted: 9 June 2026 / Published: 12 June 2026

(This article belongs to the Special Issue High-Resolution Remote Sensing Image Processing and Applications)

Download Versions Notes

Abstract

This paper addresses the problem that high pixel-level segmentation accuracy does not necessarily lead to geometrically compact building boundaries in vectorized outputs. A multi-task directional field learning framework is proposed based on U-Net with a ResNet-50 encoder. The framework introduces directional field supervision and a mask-field alignment loss to jointly optimize building region prediction and local boundary orientation consistency. In addition, a mild topological simplification procedure with a fixed small tolerance is applied to reduce residual staircase-like artifacts during vectorization. Experiments on the WHU building dataset at 0.2 m and 0.3 m spatial resolutions show that the proposed framework produces compact vector representations while maintaining high overlap relative to the raster reference annotations. In the 0.2 m setting, directional field learning improves Boundary IoU compared with the Baseline U-Net, whereas the complete pipeline slightly reduces Mask IoU and F1-score due to the additional simplification step. In the 0.3 m setting, the complete method does not consistently outperform several baselines in conventional pixel-level metrics, but it shows a favorable trade-off between polygon compactness and vector overlap under raster-reference evaluation. These results indicate that the proposed method is more suitable for geometry-aware vector reconstruction and vector simplification than for maximizing general semantic segmentation accuracy. In particular, the average number of polygon vertices is substantially reduced while Vector IoU remains approximately 90–92%. To further address the limitation of evaluating only on the WHU dataset, an additional in-domain validation experiment was conducted on the JAX dataset, which contains more complex building appearances and scene variations. The results show that the proposed Directional Field + Mild DP pipeline consistently reduces polygon complexity on the JAX dataset while maintaining competitive vector overlap. The central objective of the proposed framework is not only to improve mask-level building extraction, but also to enhance boundary-oriented vector reconstruction by learning local boundary-direction consistency and reducing raster-induced polygonal redundancy.

Keywords: U-Net; building extraction; remote sensing; semantic segmentation; directional field; deep learning

Share and Cite

MDPI and ACS Style

Xu, J.; Chen, Z.; Zhang, Q.; Zhu, M. Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sens. 2026, 18, 1955. https://doi.org/10.3390/rs18121955

AMA Style

Xu J, Chen Z, Zhang Q, Zhu M. Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sensing. 2026; 18(12):1955. https://doi.org/10.3390/rs18121955

Chicago/Turabian Style

Xu, Junjie, Zhengsheng Chen, Qinghua Zhang, and Mulei Zhu. 2026. "Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing" Remote Sensing 18, no. 12: 1955. https://doi.org/10.3390/rs18121955

APA Style

Xu, J., Chen, Z., Zhang, Q., & Zhu, M. (2026). Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sensing, 18(12), 1955. https://doi.org/10.3390/rs18121955

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI