Next Article in Journal
Preliminary Feasibility of a Single-Channel Nighttime Cloud Detection in Artificially Lit Regions Using Ground Light Source Observations from VIIRS/DNB Images
Previous Article in Journal
Retrieval of All-Sky Land Surface Temperature from MERSI-II/FY-3D Data
Previous Article in Special Issue
BuildFunc-MoE: An Adaptive Multimodal Mixture-of-Experts Network for Fine-Grained Building Function Identification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing

1
PLA Rocket Force University of Engineering, Xi’an 710025, China
2
School of Geography, Nanjing Normal University, Nanjing 210023, China
3
School of Geoscience and Technology, SouthWest Petroleum University, Chengdu 610500, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(12), 1955; https://doi.org/10.3390/rs18121955 (registering DOI)
Submission received: 27 April 2026 / Revised: 7 June 2026 / Accepted: 9 June 2026 / Published: 12 June 2026
(This article belongs to the Special Issue High-Resolution Remote Sensing Image Processing and Applications)

Abstract

This paper addresses the problem that high pixel-level segmentation accuracy does not necessarily lead to geometrically compact building boundaries in vectorized outputs. A multi-task directional field learning framework is proposed based on U-Net with a ResNet-50 encoder. The framework introduces directional field supervision and a mask-field alignment loss to jointly optimize building region prediction and local boundary orientation consistency. In addition, a mild topological simplification procedure with a fixed small tolerance is applied to reduce residual staircase-like artifacts during vectorization. Experiments on the WHU building dataset at 0.2 m and 0.3 m spatial resolutions show that the proposed framework produces compact vector representations while maintaining high overlap relative to the raster reference annotations. In the 0.2 m setting, directional field learning improves Boundary IoU compared with the Baseline U-Net, whereas the complete pipeline slightly reduces Mask IoU and F1-score due to the additional simplification step. In the 0.3 m setting, the complete method does not consistently outperform several baselines in conventional pixel-level metrics, but it shows a favorable trade-off between polygon compactness and vector overlap under raster-reference evaluation. These results indicate that the proposed method is more suitable for geometry-aware vector reconstruction and vector simplification than for maximizing general semantic segmentation accuracy. In particular, the average number of polygon vertices is substantially reduced while Vector IoU remains approximately 90–92%. To further address the limitation of evaluating only on the WHU dataset, an additional in-domain validation experiment was conducted on the JAX dataset, which contains more complex building appearances and scene variations. The results show that the proposed Directional Field + Mild DP pipeline consistently reduces polygon complexity on the JAX dataset while maintaining competitive vector overlap. The central objective of the proposed framework is not only to improve mask-level building extraction, but also to enhance boundary-oriented vector reconstruction by learning local boundary-direction consistency and reducing raster-induced polygonal redundancy.
Keywords: U-Net; building extraction; remote sensing; semantic segmentation; directional field; deep learning U-Net; building extraction; remote sensing; semantic segmentation; directional field; deep learning

Share and Cite

MDPI and ACS Style

Xu, J.; Chen, Z.; Zhang, Q.; Zhu, M. Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sens. 2026, 18, 1955. https://doi.org/10.3390/rs18121955

AMA Style

Xu J, Chen Z, Zhang Q, Zhu M. Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sensing. 2026; 18(12):1955. https://doi.org/10.3390/rs18121955

Chicago/Turabian Style

Xu, Junjie, Zhengsheng Chen, Qinghua Zhang, and Mulei Zhu. 2026. "Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing" Remote Sensing 18, no. 12: 1955. https://doi.org/10.3390/rs18121955

APA Style

Xu, J., Chen, Z., Zhang, Q., & Zhu, M. (2026). Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sensing, 18(12), 1955. https://doi.org/10.3390/rs18121955

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop