This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing
by
Junjie Xu
Junjie Xu 1,
Zhengsheng Chen
Zhengsheng Chen 1,*
,
Qinghua Zhang
Qinghua Zhang 2
and
Mulei Zhu
Mulei Zhu 3
1
PLA Rocket Force University of Engineering, Xi’an 710025, China
2
School of Geography, Nanjing Normal University, Nanjing 210023, China
3
School of Geoscience and Technology, SouthWest Petroleum University, Chengdu 610500, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(12), 1955; https://doi.org/10.3390/rs18121955 (registering DOI)
Submission received: 27 April 2026
/
Revised: 7 June 2026
/
Accepted: 9 June 2026
/
Published: 12 June 2026
Abstract
This paper addresses the problem that high pixel-level segmentation accuracy does not necessarily lead to geometrically compact building boundaries in vectorized outputs. A multi-task directional field learning framework is proposed based on U-Net with a ResNet-50 encoder. The framework introduces directional field supervision and a mask-field alignment loss to jointly optimize building region prediction and local boundary orientation consistency. In addition, a mild topological simplification procedure with a fixed small tolerance is applied to reduce residual staircase-like artifacts during vectorization. Experiments on the WHU building dataset at 0.2 m and 0.3 m spatial resolutions show that the proposed framework produces compact vector representations while maintaining high overlap relative to the raster reference annotations. In the 0.2 m setting, directional field learning improves Boundary IoU compared with the Baseline U-Net, whereas the complete pipeline slightly reduces Mask IoU and F1-score due to the additional simplification step. In the 0.3 m setting, the complete method does not consistently outperform several baselines in conventional pixel-level metrics, but it shows a favorable trade-off between polygon compactness and vector overlap under raster-reference evaluation. These results indicate that the proposed method is more suitable for geometry-aware vector reconstruction and vector simplification than for maximizing general semantic segmentation accuracy. In particular, the average number of polygon vertices is substantially reduced while Vector IoU remains approximately 90–92%. To further address the limitation of evaluating only on the WHU dataset, an additional in-domain validation experiment was conducted on the JAX dataset, which contains more complex building appearances and scene variations. The results show that the proposed Directional Field + Mild DP pipeline consistently reduces polygon complexity on the JAX dataset while maintaining competitive vector overlap. The central objective of the proposed framework is not only to improve mask-level building extraction, but also to enhance boundary-oriented vector reconstruction by learning local boundary-direction consistency and reducing raster-induced polygonal redundancy.
Share and Cite
MDPI and ACS Style
Xu, J.; Chen, Z.; Zhang, Q.; Zhu, M.
Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sens. 2026, 18, 1955.
https://doi.org/10.3390/rs18121955
AMA Style
Xu J, Chen Z, Zhang Q, Zhu M.
Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sensing. 2026; 18(12):1955.
https://doi.org/10.3390/rs18121955
Chicago/Turabian Style
Xu, Junjie, Zhengsheng Chen, Qinghua Zhang, and Mulei Zhu.
2026. "Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing" Remote Sensing 18, no. 12: 1955.
https://doi.org/10.3390/rs18121955
APA Style
Xu, J., Chen, Z., Zhang, Q., & Zhu, M.
(2026). Multi-Task Directional Field Learning for Geometry-Aware Building Extraction and Simplified Vector Reconstruction in High-Resolution Remote Sensing. Remote Sensing, 18(12), 1955.
https://doi.org/10.3390/rs18121955
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.