MBA-Former: A Boundary-Aware Transformer for Synergistic Multi-Modal Representation in Pine Wilt Disease Detection from High-Resolution Satellite Imagery
Abstract
1. Introduction
2. Materials
2.1. Overview of the Study Area
2.2. Experimental Data Acquisition and Preprocessing
2.2.1. High Resolution Satellite Remote Sensing Imagery
2.2.2. High-Resolution UAV Remote Sensing Imagery
3. Methods
3.1. Construction of the Standardized Semantic Segmentation Dataset
3.1.1. Feature Enhancement
3.1.2. Sample Dataset Generation and Partitioning
3.2. The MBA-Former Model
3.2.1. Multi-Modal Gating Fusion Module (MFM)
3.2.2. Backbone Network
3.2.3. Boundary-Aware (BA) Module
3.2.4. The Joint Loss Function
3.3. Evaluation Metrics
4. Results
4.1. Experimental Parameter Setup and Model Training
4.2. Ablation Study
4.2.1. Comparative Analysis of Different Feature Combinations
4.2.2. Validation of the Effectiveness of Independent Innovation Modules
4.3. Comparative Analysis of Performance with SOTA Models
4.4. Visual Analysis
4.4.1. Mitigating Feature Confusion via the MFM
4.4.2. Refining Contours via the BA Module
4.4.3. Comprehensive Visual Comparison
5. Discussion
5.1. The Importance of Synergistic Feature Representation and Boundary Refinement
5.2. Performance Comparison with Previous Research
5.3. Limitations and Future Perspectives
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Ye, J.; Wu, X. Progress in pine wilt disease research. China For. Pest Dis. 2022, 41, 1–10. [Google Scholar] [CrossRef]
- Chen, F.; Li, M. Comprehensive strategies for the prevention and control of pine wilt disease in China: A review and future directions. J. For. Res. 2024, 36, 9. [Google Scholar] [CrossRef]
- Zhang, C.; Sun, H.; Chen, Y.; Liu, B.; Feng, F.; Zhou, T. 2025 national occurrence of major forestry pests and trend forecast for 2026. China For. Pest Dis. 2026, 45, 69–74. (In Chinese) [Google Scholar] [CrossRef]
- Hussain, T.; Aslam, A.; Ozair, M.; Tasneem, F.; Gómez-Aguilar, J.F. Dynamical aspects of pine wilt disease and control measures. Chaos Solitons Fractals 2021, 145, 110764. [Google Scholar] [CrossRef]
- Yousefi, S.; Haghighian, F.; Jahromi, M.N.; Pourghasemi, H.R. Pest-infected oak trees identification using remote sensing–based classification algorithms. In Computers in Earth and Environmental Sciences; Elsevier: Amsterdam, The Netherlands, 2022; pp. 363–376. [Google Scholar] [CrossRef]
- Xu, H.C.; Luo, Y.Q.; Zhang, Q. Changes in water content, pigments, and antioxidant enzyme activities in pine needles of Pinus thunbergii and Pinus massoniana affected by pine wood nematode. Sci. Silvae Sin. 2012, 11, 140–143. [Google Scholar]
- Song, Q.; Xiang, R.; Qin, L.; Yuan, L.; Tian, H.; Zhu, P.; Liu, W.; Yang, W.; Qu, Y.; Zhou, J. Analysis of spectral reflectance parameters and moisture content in pine needles. Sci. Technol. Innov. 2018, 3, 26–28. [Google Scholar] [CrossRef]
- Yu, R.; Luo, Y.; Zhou, Q.; Zhang, X.; Wu, D.; Ren, L. A machine learning algorithm to detect pine wilt disease using UAV-based hyperspectral imagery and LiDAR data at the tree level. Int. J. Appl. Earth Obs. Geoinf. 2021, 101, 102363. [Google Scholar] [CrossRef]
- Xu, H.; Luo, Y.; Zhang, T.; Shi, Y. Spectral feature variations of pine needles at different infection stages following natural infestation by pine wilt nematode. Spectrosc. Spectr. Anal. 2011, 31, 1352–1356. [Google Scholar] [CrossRef]
- Li, N.; Huo, L.; Zhang, X. Classification of pine wilt disease at different infection stages by diagnostic hyperspectral bands. Ecol. Indic. 2022, 142, 109198. [Google Scholar] [CrossRef]
- Wu, B.; Liang, A.; Zhang, H.; Zhu, T.; Zou, Z.; Yang, D.; Tang, W.; Li, J.; Su, J. Application of conventional UAV-based high-throughput object detection to the early diagnosis of pine wilt disease by deep learning. For. Ecol. Manag. 2021, 486, 118986. [Google Scholar] [CrossRef]
- Sun, Z.; Wang, Y.; Pan, L.; Xie, Y.; Zhang, B.; Liang, R.; Sun, Y. Pine wilt disease detection in high-resolution UAV images using object-oriented classification. J. For. Res. 2022, 33, 1377–1389. [Google Scholar] [CrossRef]
- Li, J.; Wang, X.; Zhao, H.; Hu, X.; Zhong, Y. Detecting pine wilt disease at the pixel level from high spatial and spectral resolution UAV-borne imagery in complex forest landscapes using deep one-class classification. Int. J. Appl. Earth Obs. Geoinf. 2022, 112, 102947. [Google Scholar] [CrossRef]
- Luo, J.; Fan, J.; Huang, S.; Wu, S.; Zhang, F.; Li, X. Semi-supervised learning techniques for detection of dead pine trees with UAV imagery for pine wilt disease control. Int. J. Remote Sens. 2025, 46, 575–605. [Google Scholar] [CrossRef]
- Wang, J.; Jin, L.; Wang, F.; Zhou, H.; Lin, H. Hierarchical attention and feature enhancement network for multi-scale small targets in pine wilt disease. Comput. Electron. Agric. 2025, 239, 111037. [Google Scholar] [CrossRef]
- Cai, P.; Chen, G.; Yang, H.; Li, X.; Zhu, K.; Wang, T.; Liao, P.; Han, M.; Gong, Y.; Wang, Q.; et al. Detecting individual plants infected with pine wilt disease using drones and satellite imagery: A case study in Xianning, China. Remote Sens. 2023, 15, 2671. [Google Scholar] [CrossRef]
- Wang, G.; Chai, G.; Aierken, N.; Chen, L.; Wang, J.; Qian, Z.; Lei, L.; Zhang, X. Monitoring forest damage caused by pine wilt disease with remote sensing technologies: A review. IEEE Geosci. Remote Sens. Mag. 2025, 14, 286–305. [Google Scholar] [CrossRef]
- Kim, S.R.; Lee, W.K.; Lim, C.H.; Kim, M.; Kafatos, M.C.; Lee, S.H.; Lee, S.S. Hyperspectral analysis of pine wilt disease to determine an optimal detection index. Forests 2018, 9, 115. [Google Scholar] [CrossRef]
- Olegario, T.V.; Baldovino, R.G.; Bugtai, N.T. A decision tree-based classification of diseased pine and oak trees using satellite imagery. In Proceedings of the 2020 IEEE 12th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM), Manila, Philippines, 3–6 December 2020; pp. 1–4. [Google Scholar] [CrossRef]
- Ma, Y.; Zhao, N.; Wang, H.; Zhang, H. Monitoring study of pine wilt disease in Pinus massoniana based on BJ-2 satellite. Agric. Technol. 2021, 41, 76–79. (In Chinese) [Google Scholar] [CrossRef]
- Qin, L.; Meng, X.; Zhang, S.; Xue, Y.; Liu, X.; Xing, P. Application evaluation of Beijing-3 satellite data in remote sensing monitoring of pine wilt disease. For. Resour. Manag. 2022, 4, 126–133. (In Chinese) [Google Scholar] [CrossRef]
- Wang, X.; Hu, Z.; Shi, S.; Hou, M.; Xu, L.; Zhang, X. A deep learning method for optimizing semantic segmentation accuracy of remote sensing images based on improved UNet. Sci. Rep. 2023, 13, 7600. [Google Scholar] [CrossRef]
- Bolyn, C.; Lejeune, P.; Michez, A.; Latte, N. Mapping tree species proportions from satellite imagery using spectral–spatial deep learning. Remote Sens. Environ. 2022, 280, 113205. [Google Scholar] [CrossRef]
- Saputra, M.R.U.; Bhaswara, I.D.; Nasution, B.I.; Ern, M.A.L.; Husna, N.L.R.; Witra, T.; Feliren, V.; Owen, J.R.; Kemp, D.; Lechner, A.M. Multi-modal deep learning approaches to semantic segmentation of mining footprints with multispectral satellite imagery. Remote Sens. Environ. 2025, 318, 114584. [Google Scholar] [CrossRef]
- Grondin, V.; Fortin, J.M.; Pomerleau, F.; Giguère, P. Tree detection and diameter estimation based on deep learning. Forestry 2023, 96, 264–276. [Google Scholar] [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015: 18th International Conference, Munich, Germany, 5–9 October 2015; Springer: Cham, Switzerland, 2015; pp. 234–241. [Google Scholar] [CrossRef]
- Chen, L.C.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A.L. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 40, 834–848. [Google Scholar] [CrossRef]
- Zhou, H.; Yuan, X.; Zhou, H.; Shen, H.; Ma, L.; Sun, L.; Fang, G.; Sun, H. Surveillance of pine wilt disease by high resolution satellite. J. For. Res. 2022, 33, 1401–1408. [Google Scholar] [CrossRef]
- Huang, J.; Lu, X.; Chen, L.; Sun, H.; Wang, S.; Fang, G. Accurate identification of pine wood nematode disease with a deep convolution neural network. Remote Sens. 2022, 14, 913. [Google Scholar] [CrossRef]
- Ye, W.; Lao, J.; Liu, Y.; Chang, C.C.; Zhang, Z.; Li, H.; Zhou, H. Pine pest detection using remote sensing satellite images combined with a multi-scale attention-UNet model. Ecol. Inform. 2022, 72, 101906. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 11–17 October 2021; pp. 10012–10022. [Google Scholar]
- Yang, B.; Wang, Z.; Guo, J.; Guo, L.; Liang, Q.; Zeng, Q.; Zhao, R.; Wang, J.; Li, C. Identifying Plant Disease and Severity from Leaves: A Deep Multitask Learning Framework Using Triple-Branch Swin Transformer and Deep Supervision. Comput. Electron. Agric. 2023, 209, 107809. [Google Scholar] [CrossRef]
- Amin, S.U.; Jung, Y.; Fayaz, M.; Kim, B.; Seo, S. Enhancing pine wilt disease detection with synthetic data and external attention-based transformers. Eng. Appl. Artif. Intell. 2025, 159, 111655. [Google Scholar] [CrossRef]
- Zhou, X.; Wang, H.; Chen, C.; Nagy, G.; Jancso, T.; Huang, H. Detection of growth change of young forest based on UAV RGB images at single-tree level. Forests 2023, 14, 141. [Google Scholar] [CrossRef]
- Datt, B. Visible/near infrared reflectance and chlorophyll content in Eucalyptus leaves. Int. J. Remote Sens. 1999, 20, 2741–2759. [Google Scholar] [CrossRef]
- Kurniati, F.T.; Manongga, D.H.; Sediyono, E.; Prasetyo, S.Y.J.; Huizen, R.R. GLCM-based feature combination for extraction model optimization in object detection using machine learning. arXiv 2024, arXiv:2404.04578. [Google Scholar] [CrossRef]
- Kang, X.; Duan, P.; Li, J.; Li, S. Efficient swin transformer for remote sensing image super-resolution. IEEE Trans. Image Process. 2024, 33, 6367–6379. [Google Scholar] [CrossRef] [PubMed]
- Mao, A.; Mohri, M.; Zhong, Y. Cross-entropy loss functions: Theoretical analysis and applications. In Proceedings of the International Conference on Machine Learning, Honolulu, HI, USA, 23–29 July 2023; pp. 23803–23828. [Google Scholar]
- Wang, J.; Chen, T.; Zheng, L.; Tie, J.; Zhang, Y.; Chen, P.; Luo, Z.; Song, Q. A multi-scale remote sensing semantic segmentation model with boundary enhancement based on UNetFormer. Sci. Rep. 2025, 15, 14737. [Google Scholar] [CrossRef] [PubMed]
- Lu, Y.; Huang, Y.; Sun, S.; Zhang, T.; Zhang, X.; Fei, S.; Chen, V. M2fnet: Multi-modal forest monitoring network on large-scale virtual dataset. In Proceedings of the 2024 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW), Orlando, FL, USA, 16–21 March 2024; pp. 539–543. [Google Scholar] [CrossRef]
- Cao, Y.; Coops, N.C.; Murray, B.A.; Sinclair, I.; Geordie, R.M. M3FNet: Multi-modal multi-temporal multi-scale data fusion network for tree species composition mapping. ISPRS J. Photogramm. Remote Sens. 2026, 231, 797–814. [Google Scholar] [CrossRef]
- Park, J.; Sim, W.; Lee, J. Detection of trees with pine wilt disease using object-based classification method. J. For. Environ. Sci. 2016, 32, 384–391. [Google Scholar] [CrossRef][Green Version]
- Wang, J.; Zhao, J.; Sun, H.; Lu, X.; Huang, J.; Wang, S.; Fang, G. Satellite remote sensing identification of discolored standing trees for pine wilt disease based on semi-supervised deep learning. Remote Sens. 2022, 14, 5936. [Google Scholar] [CrossRef]
- Henrich, J.; van Delden, J.; Seidel, D.; Kneib, T.; Ecker, A.S. TreeLearn: A deep learning method for segmenting individual trees from ground-based LiDAR forest point clouds. Ecol. Inform. 2024, 84, 102888. [Google Scholar] [CrossRef]
- Pierrat, Z.A.; Magney, T.; Maguire, A.; Brissette, L.; Doughty, R.; Bowling, D.R.; Logan, B.; Parazoo, N.; Frankenberg, C.; Stutz, J. Seasonal timing of fluorescence and photosynthetic yields at needle and canopy scales in evergreen needleleaf forests. Ecology 2024, 105, e4402. [Google Scholar] [CrossRef]










| Province | Number of Images | Spatial Resolution | Cloud Cover | Acquisition Date | Coverage Area (km2) |
|---|---|---|---|---|---|
| Hunan | 2 | 0.8 m | <5% | 28 September 2023 | 793.86 |
| Zhejiang | 1 | 0.8 m | <5% | 21 October 2023 | 402.35 |
| Liaoning | 3 | 0.8 m | <5% | 5 September 2022 | 1215.77 |
| Dataset | Total Samples (Before Data Balancing) | Positive Samples | Negative Samples | Final Samples (After Data Augmentation) |
|---|---|---|---|---|
| Training set | 15,004 | 1384 | 13,620 | 19,156 |
| Validation set | 1876 | 169 | 1707 | 1876 |
| Test set | 1876 | 195 | 1681 | 1876 |
| Total | 18,756 | 1748 | 17,008 | 22,908 |
| Parameter | Configuration Value |
|---|---|
| Optimizer | AdamW |
| Initial Learning Rate | 3 × 10−4 |
| Learning Rate Scheduler | Cosine Annealing |
| Batch Size | 64 |
| Total Epochs | 200 |
| Weight Decay | 1 × 10−5 |
| Input Patch Size | 256 × 256 pixels |
| Data Normalization | Z-score standardization |
| Boundary Loss Weight (α) | 0.7 |
| Case | Input Feature Composition | Input Channels | mIoU (%) | F1-Score (%) | Boundary F1 (%) |
|---|---|---|---|---|---|
| 1 | Base (Spectral Only) | 4 | 78.53 | 86.95 | 74.28 |
| 2 | Base + VIs | 6 | 80.81 | 88.31 | 77.15 |
| 3 | Base + Tex | 6 | 79.92 | 87.80 | 76.53 |
| 4 | Base + VIs + Tex (Full) | 8 | 81.74 | 89.68 | 78.62 |
| ID | Model Configuration | MFM | BA | mIoU (%) | Infested Pine IoU (%) | Boundary F1 (%) |
|---|---|---|---|---|---|---|
| a | Swin-Unet V2 | 78.86 | 74.03 | 74.16 | ||
| b | MFM only | ✓ | 80.92 | 75.79 | 75.98 | |
| c | BA only | ✓ | 80.15 | 74.61 | 78.23 | |
| d | MBA-Former (Full) | ✓ | ✓ | 81.74 | 77.58 | 78.62 |
| Model | Backbone | Model Type | mIoU (%) | Infested Pine IoU (%) | Others IoU (%) | Boundary F1 (%) |
|---|---|---|---|---|---|---|
| U-Net | Simple CNN | CNN-based | 73.58 ± 0.52 | 69.15 ± 0.61 | 75.42 ± 0.48 | 68.46 ± 0.55 |
| DeepLabV3+ | ResNet-50 | CNN-based | 75.05 ± 0.45 | 71.28 ± 0.53 | 76.81 ± 0.44 | 70.73 ± 0.49 |
| ConvNeXt-Unet | ConvNeXt-T | Modern CNN | 78.23 ± 0.41 | 73.14 ± 0.47 | 80.15 ± 0.39 | 73.49 ± 0.42 |
| Swin-Unet V2 | Swin-T V2 | Transformer-based | 78.86 ± 0.38 | 74.03 ± 0.42 | 80.92 ± 0.35 | 74.16 ± 0.39 |
| MBA-Former (Ours) | Swin-T V2 | Transformer-based (Enhanced) | 81.74 ± 0.32 | 77.58 ± 0.35 | 83.15 ± 0.28 | 78.62 ± 0.31 |
| Model | Params (M) | FLOPs (G) | Inference Time (ms/patch) | mIoU (%) |
|---|---|---|---|---|
| U-Net | 14.7 | 32.5 | 18 | 73.58 |
| DeepLabV3+ | 41.2 | 75.3 | 42 | 75.05 |
| ConvNeXt-Unet | 28.5 | 40.1 | 29 | 78.23 |
| Swin-Unet V2 | 27.2 | 45.1 | 31 | 78.86 |
| MBA-Former (Ours) | 31.5 | 52.4 | 35 | 81.74 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Hou, R.; Zhou, Y.; Wang, Y.; Huang, Z.; Yao, J.; Jiao, Q.; Huang, W.; Zhang, B. MBA-Former: A Boundary-Aware Transformer for Synergistic Multi-Modal Representation in Pine Wilt Disease Detection from High-Resolution Satellite Imagery. Forests 2026, 17, 517. https://doi.org/10.3390/f17050517
Hou R, Zhou Y, Wang Y, Huang Z, Yao J, Jiao Q, Huang W, Zhang B. MBA-Former: A Boundary-Aware Transformer for Synergistic Multi-Modal Representation in Pine Wilt Disease Detection from High-Resolution Satellite Imagery. Forests. 2026; 17(5):517. https://doi.org/10.3390/f17050517
Chicago/Turabian StyleHou, Rui, Yantao Zhou, Ying Wang, Zhiquan Huang, Jing Yao, Quanjun Jiao, Wenjiang Huang, and Biyao Zhang. 2026. "MBA-Former: A Boundary-Aware Transformer for Synergistic Multi-Modal Representation in Pine Wilt Disease Detection from High-Resolution Satellite Imagery" Forests 17, no. 5: 517. https://doi.org/10.3390/f17050517
APA StyleHou, R., Zhou, Y., Wang, Y., Huang, Z., Yao, J., Jiao, Q., Huang, W., & Zhang, B. (2026). MBA-Former: A Boundary-Aware Transformer for Synergistic Multi-Modal Representation in Pine Wilt Disease Detection from High-Resolution Satellite Imagery. Forests, 17(5), 517. https://doi.org/10.3390/f17050517

