DCA-DeepLab: Dual-Coordinate Attention DeepLab with Adaptive Focal Loss for Cotton Growth Semantic Segmentation from UAV Remote Sensing Images
Highlights
- A UAV cotton growth dataset covering four classes (three foreground growth levels and the background) was constructed for fine-grained segmentation evaluation.
- DCA-DeepLab improves cotton growth segmentation by modelling row and column field structure, adaptively fusing features and addressing class imbalance.
- Encoding directional crop row priors benefits fine-grained UAV agricultural segmentation.
- The proposed framework provides a practical basis for field-level cotton growth monitoring and precision management.
Abstract
1. Introduction
- 1.
- We introduce a Dual-Coordinate Attention Gating (DCAG) module that decouples horizontal and vertical encoding to inject the row-and-column planting prior, addressing the structural anisotropy of mechanised cotton fields.
- 2.
- We design a Multi-Scale Attention-Guided Modulated Feature Merging (MSAM-MFM) module that uses multi-scale spatial attention to adaptively fuse semantic and detail features, preserving the boundaries between adjacent growth classes.
- 3.
- We develop an adaptive pixel-level modulated focal loss (APMFL) that pairs an adaptive class weight with a per-pixel confidence modulator to focus learning on minority and hard pixels.
- 4.
- We construct a high-resolution UAV cotton dataset acquired during the flowering and boll-forming stage at the Manas Hui’er Farm and the Changji Huaxing Farm in Xinjiang. On the cotton growth dataset, DCA-DeepLab achieved the highest overall mIoU among eleven representative CNN-, transformer-, and Mamba-based baselines. On the public LoveDA benchmark, it also achieved the highest overall mIoU among the compared methods. The method maintains a computational budget comparable to the DeepLabv3+ baseline.
2. Materials and Methods
2.1. Study Area and UAV Data Acquisition
2.2. Cotton Growth Dataset Construction
2.2.1. Annotation Protocol
2.2.2. Image Partitioning and Augmentation
2.3. DCA-DeepLab
2.3.1. Overall Architecture
2.3.2. Dual-Coordinate Attention Gating
2.3.3. Multi-Scale Attention-Guided Modulated Feature Merging
2.3.4. Adaptive Pixel-Level Modulated Focal Loss
2.4. Experimental Set-Up
3. Results
3.1. Sensitivity to the Initial Learning Rate
3.2. Comparison with State-of-the-Art Methods
3.2.1. Cotton Growth Dataset
3.2.2. Computational Complexity
3.2.3. Cross-Domain Evaluation on LoveDA
3.3. Qualitative Analysis
3.3.1. Segmentation Visualisation
3.3.2. Confusion Matrix
3.4. Ablation Study
3.4.1. Module-Level Ablation
3.4.2. Comparison of Attention Mechanisms
3.4.3. Comparison of Loss Functions
3.4.4. Sensitivity to the Focal Factor
4. Discussion
4.1. Comparison with Existing Approaches
4.2. Why DCA-DeepLab Works on UAV Cotton Imagery
4.3. Implications for UAV-Based Precision Agriculture
4.4. Limitations and Future Work
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Yu, Z.; Yang, Y. Carbon footprint of global cotton production. Resour. Environ. Sustain. 2025, 20, 100214. [Google Scholar] [CrossRef]
- Food and Agriculture Organization of the United Nations. Cotton Market Review. 2025. Available online: https://www.fao.org/markets-and-trade/food-and-agricultural-markets-analysis-FAMA/cotton/en (accessed on 22 February 2026).
- U.S. Department of Agriculture, Foreign Agricultural Service. Production, Supply and Distribution (PSD) Database: Cotton. Database, 2025. In Data Release Synchronized with May 2026 Cotton Report; U.S. Department of Agriculture, Foreign Agricultural Service: Washington, DC, USA, 2026. [Google Scholar]
- Mao, S.; Li, P.; Cheng, S.; Zhang, Z.; Li, X.; Tian, L.; Gao, L.; Wang, Z. Achievements, contributions, experience and prospects of the development of Xinjiang oasis cotton: Written on the 70th anniversary of the founding of the Xinjiang Uygur Autonomous Region. China Cotton 2025, 52, 1–17. (In Chinese) [Google Scholar]
- National Bureau of Statistics of China. Announcement of the National Bureau of Statistics on Cotton Production in 2025; National Bureau of Statistics of China: Beijing, China, 2025. (In Chinese)
- Abdullah, H.M.; Islam, M.; Islam, M.S.; Sen, S.; Tuhin, A.K.; Arman, S.E.; Hasan, M.M. Cotton seedling monitoring and growth stage classification integrating deep learning and feature engineering. Smart Agric. Technol. 2025, 12, 101619. [Google Scholar] [CrossRef]
- Quddus, M.A.; Chowdhury, S.; Jasper, W.; Das, L. Enhancing cotton yield prediction with robust deep neural network-based framework. Eur. J. Agron. 2025, 170, 127750. [Google Scholar] [CrossRef]
- Guo, S.; Liu, T.; Han, Y.; Wang, G.; Du, W.; Wu, F.; Li, Y.; Feng, L. Changes in within-boll yield components explain cotton yield and quality variation across planting dates under a double cropping system of cotton-wheat. Field Crop. Res. 2023, 293, 108853. [Google Scholar] [CrossRef]
- Bista, M.K.; Kodadinne Narayana, N.; Chakravaram, A.; Pieralisi, B.; Dhillon, J.; Reddy, K.R.; Bheemanahalli, R. Intensifying heat stress impacts cotton flowering and boll development efficiency. BMC Plant Biol. 2025, 25, 984. [Google Scholar] [CrossRef] [PubMed]
- Bai, Z.; Dong, B.; Fan, J.; Kefauver, S.C.; Araus, J.L.; Zhang, F.; Yin, F. Cotton yield estimation based on vegetation indices from UAV RGB imagery. Trans. Chin. Soc. Agric. Mach. 2025, 56, 182–192. (In Chinese) [Google Scholar]
- Zhang, R.; Wu, X.; Li, J.; Zhao, P.; Zhang, Q.; Wuri, L.; Zhang, D.; Zhang, Z.; Yang, L. A bibliometric review of deep learning in crop monitoring: Trends, challenges, and future perspectives. Front. Artif. Intell. 2025, 8, 1636898. [Google Scholar] [CrossRef]
- Vaz, C.M.P.; Ferreira, E.J.; Speranza, E.A.; Franchini, J.C.; Naime, J.d.M.; Inamasu, R.Y.; Lopes, I.d.O.N.; das Chagas, S.; Schelp, M.X.; Vecchi, L.; et al. Cotton yield map prediction using Sentinel-2 satellite imagery in the Brazilian Cerrado production system. AgriEngineering 2025, 7, 390. [Google Scholar] [CrossRef]
- Xue, S.; Shi, H.; Li, X.; Yan, J.; Wang, W.; Miao, Q.; Yan, Y.; Hou, C.; Zhao, Y.; Li, X. Crop classification mapping in multi-cloud regions: An integrated approach using GF-SG time series reconstruction and 3D deep convolutional networks. Sci. Remote Sens. 2026, 13, 100361. [Google Scholar] [CrossRef]
- Zhang, S.; Wang, X.; Lin, H.; Dong, Y.; Qiang, Z. A review of the application of UAV multispectral remote sensing technology in precision agriculture. Smart Agric. Technol. 2025, 12, 101406. [Google Scholar] [CrossRef]
- Yao, W.; Liu, C.; Liu, Y.; Zheng, Q.; Wang, J.; Yu, H.; Chen, C.; Guo, S. Unmanned aerial vehicle payload technology applications in agriculture and other low-altitude scenarios: A review. Front. Plant Sci. 2025, 16, 1721484. [Google Scholar] [CrossRef]
- Bazrafkan, A.; Igathinathane, C.; Bandillo, N.; Flores, P. Optimizing integration techniques for UAS and satellite image data in precision agriculture: A review. Front. Remote Sens. 2025, 6, 1622884. [Google Scholar] [CrossRef]
- Zhang, L.; Wang, A.; Zhang, H.; Zhu, Q.; Zhang, H.; Sun, W.; Niu, Y. Estimating leaf chlorophyll content of winter wheat from UAV multispectral images using machine learning algorithms under different species, growth stages, and nitrogen stress conditions. Agriculture 2024, 14, 1064. [Google Scholar] [CrossRef]
- Ji, J.; Li, N.; Cui, H.; Li, Y.; Zhao, X.; Zhang, H.; Ma, H. Study on monitoring SPAD values for multispatial spatial vertical scales of summer maize based on UAV multispectral remote sensing. Agriculture 2023, 13, 1004. [Google Scholar] [CrossRef]
- Tang, Y.; Zhou, Y.; Cheng, M.; Sun, C. Comprehensive growth index (CGI): A comprehensive indicator from UAV-observed data for winter wheat growth status monitoring. Agronomy 2023, 13, 2883. [Google Scholar] [CrossRef]
- Gu, H.; Xue, C.; Wang, G.; Lan, Y.; Wang, H.; Song, C. UAV-based multispectral inversion of integrated cotton growth. Agronomy 2024, 14, 2903. [Google Scholar] [CrossRef]
- Zhang, D.; Qi, H.; Guo, X.; Sun, H.; Min, J.; Li, S.; Hou, L.; Lv, L. Integration of UAV multispectral remote sensing and random forest for full-growth stage monitoring of wheat dynamics. Agriculture 2025, 15, 353. [Google Scholar] [CrossRef]
- Zarbakhsh, S.; Fakhrzad, F.; Rajkovic, D.; Niedbała, G.; Piekutowska, M. Approaches and challenges in machine learning for monitoring agricultural products and predicting plant physiological responses to biotic and abiotic stresses. Curr. Plant Biol. 2025, 43, 100535. [Google Scholar] [CrossRef]
- Cai, W.; Lu, K.; Fan, M.; Liu, C.; Huang, W.; Chen, J.; Wu, Z.; Xu, C.; Ma, X.; Tan, S. Rice growth-stage recognition based on improved YOLOv8 with UAV imagery. Agronomy 2024, 14, 2751. [Google Scholar] [CrossRef]
- Xia, M.; Chen, X.; Tian, X.; Wen, H.; Zhao, Y.; Liu, H.; Liu, W.; Zheng, Y. Lightweight deep learning for real-time cotton monitoring: UAV-based defoliation and boll-opening rate assessment. Agriculture 2025, 15, 2095. [Google Scholar] [CrossRef]
- Li, Y.; Yang, W.; Lu, Z.; Shi, H. YH-RTYO: An end-to-end object detection method for crop growth anomaly detection in UAV scenarios. PeerJ Comput. Sci. 2024, 10, e2477. [Google Scholar] [CrossRef]
- Rana, A.; Vaidya, P. YOLO-based deep learning framework for real-time multi-class plant health monitoring in precision agriculture. Sci. Rep. 2026, 16, 197. [Google Scholar] [CrossRef] [PubMed]
- Reji, J.; Nidamanuri, R.R. Deep learning-based prediction of plant height and crown area of vegetable crops using LiDAR point cloud. Sci. Rep. 2024, 14, 14903. [Google Scholar]
- Zhao, D.; Yang, G.; Xu, T.; Yu, F.; Zhang, C.; Cheng, Z.; Ren, L.; Yang, H. Dynamic maize true leaf area index retrieval with KGCNN and TL and integrated 3D radiative transfer modeling for crop phenotyping. Plant Phenomics 2025, 7, 100004. [Google Scholar] [CrossRef]
- Zeng, F.; Wang, R.; Jiang, Y.; Liu, Z.; Ding, Y.; Dong, W.; Xu, C.; Zhang, D.; Wang, J. Growth monitoring of rapeseed seedlings in multiple growth stages based on low-altitude remote sensing and semantic segmentation. Comput. Electron. Agric. 2025, 232, 110135. [Google Scholar] [CrossRef]
- Zhan, Y.; Zhou, Y.; Bai, G.; Ge, Y. Bagging improves the performance of deep learning-based semantic segmentation with limited labeled images: A case study of crop segmentation for high-throughput plant phenotyping. Sensors 2024, 24, 3420. [Google Scholar] [CrossRef]
- Chen, L.C.; Zhu, Y.; Papandreou, G.; Schroff, F.; Adam, H. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 833–851. [Google Scholar]
- Xie, E.; Wang, W.; Yu, Z.; Anandkumar, A.; Alvarez, J.M.; Luo, P. SegFormer: Simple and efficient design for semantic segmentation with transformers. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Virtual, 6–14 December 2021; Volume 34, pp. 12077–12090. [Google Scholar]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021; pp. 9992–10002. [Google Scholar] [CrossRef]
- Kirillov, A.; Mintun, E.; Ravi, N.; Mao, H.; Rolland, C.; Gustafson, L.; Xiao, T.; Whitehead, S.; Berg, A.C.; Lo, W.Y.; et al. Segment Anything. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 1–6 October 2023; pp. 3992–4003. [Google Scholar] [CrossRef]
- Zhang, J.; Li, Y.; Yang, X.; Jiang, R.; Zhang, L. RSAM-Seg: A SAM-based model with prior knowledge integration for remote sensing image semantic segmentation. Remote Sens. 2025, 17, 590. [Google Scholar] [CrossRef]
- Kheddar, H.; Habchi, Y.; Ghanem, M.C.; Hemis, M.; Niyato, D. Recent Advances in Transformer and Large Language Models for UAV Applications. arXiv 2025, arXiv:2508.11834. [Google Scholar] [CrossRef]
- Guan, X.; Liu, M.; Cao, S.; Jiang, J. Phenology-aware transformer for semantic segmentation of non-food crops from multi-source remote sensing time series. Remote Sens. 2025, 17, 2346. [Google Scholar] [CrossRef]
- Valicharla, S.K.; Karimzadeh, R.; Li, X.; Park, Y.L. Transformer-based semantic segmentation of Japanese knotweed in high-resolution UAV imagery using Twins-SVT. Information 2025, 16, 741. [Google Scholar] [CrossRef]
- Liu, H.; Meng, L.; Zhang, X.; Ustin, S.; Ning, D.; Sun, S. Cotton yield estimation model based on time-series Landsat imagery. Trans. Chin. Soc. Agric. Eng. 2015, 31, 215–220. (In Chinese) [Google Scholar]
- Feng, M.; Su, Y.; Lin, T.; Yu, X.; Song, Y.; Jin, X. High-throughput cotton yield estimation based on UAV multi-source remote sensing data and machine learning. Trans. Chin. Soc. Agric. Mach. 2025, 56, 169–179. (In Chinese) [Google Scholar]
- Li, Y.; Guo, S.; Jia, S.; Yan, Y.; Jia, H.; Zhang, W. Quantifying the effects of UAV flight altitude on the multispectral monitoring accuracy of soil moisture and maize phenotypic parameters. Agronomy 2025, 15, 2137. [Google Scholar] [CrossRef]
- Stamenković, Z.; Kešelj, K.; Kostić, M.; Aćin, V.; Tekić, D.; Ivanišević, M.; Novaković, T. Assessing the impact of UAV flight altitudes on the accuracy of multispectral indices. Contemp. Agric. 2024, 73, 157–164. [Google Scholar] [CrossRef]
- Hu, Z.; Fan, S.; Li, Y.; Tang, Q.; Bao, L.; Zhang, S.; Sarsen, G.; Guo, R.; Wang, L.; Zhang, N.; et al. Estimating stratified biomass in cotton fields using UAV multispectral remote sensing and machine learning. Drones 2025, 9, 186. [Google Scholar] [CrossRef]
- Ding, J.; Ji, M.; Alkahtani, J.; Li, H.; Liu, Y.; Zhou, F.; Zhao, Z.; Dong, S.; Chen, Y.; Zhang, X.; et al. Influence of population biomass accumulation during different growth periods on agronomic traits and cotton yield. Agronomy 2024, 14, 2625. [Google Scholar] [CrossRef]
- Hand, C.; Snider, J.L.; Roberts, P.M. Cotton Growth Monitoring and PGR Management; Circular 1244; University of Georgia Cooperative Extension: Athens, GA, USA, 2022. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Deng, J.; Dong, W.; Socher, R.; Li, L.J.; Li, K.; Fei-Fei, L. ImageNet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami, FL, USA, 20–25 June 2009; pp. 248–255. [Google Scholar]
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 7132–7141. [Google Scholar]
- Wang, J.; Zheng, Z.; Ma, A.; Lu, X.; Zhong, Y. LoveDA: A remote sensing land-cover dataset for domain adaptive semantic segmentation. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Virtual, 6–14 December 2021; pp. 23968–23980. [Google Scholar]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
- Gu, Z.; Cheng, J.; Fu, H.; Zhou, K.; Hao, H.; Zhao, Y.; Zhang, T.; Gao, S.; Liu, J. CE-Net: Context encoder network for 2D medical image segmentation. IEEE Trans. Med. Imaging 2019, 38, 2281–2292. [Google Scholar] [CrossRef]
- Yang, M.; Yu, K.; Zhang, C.; Li, Z.; Yang, K. DenseASPP for semantic segmentation in street scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 3684–3692. [Google Scholar]
- Strudel, R.; Garcia, R.; Laptev, I.; Schmid, C. Segmenter: Transformer for semantic segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021; pp. 7262–7272. [Google Scholar]
- Chen, J.; Mei, J.; Li, X.; Lu, Y.; Yu, Q.; Wei, Q.; Luo, X.; Xie, Y.; Adeli, E.; Wang, Y.; et al. TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers. Med. Image Anal. 2024, 97, 103280. [Google Scholar] [CrossRef] [PubMed]
- Ma, X.; Zhang, X.; Pun, M.O. RS3Mamba: Visual state space model for remote sensing image semantic segmentation. IEEE Geosci. Remote Sens. Lett. 2024, 21, 6011405. [Google Scholar] [CrossRef]
- Zheng, J.; Shao, A.; Yan, Y.; Wu, J.; Zhang, M. Remote sensing semantic segmentation via boundary supervision-aided multiscale channelwise cross attention network. In IEEE Transactions on Geoscience and Remote Sensing; IEEE: New York, NY, USA, 2023; Volume 61, p. 4405814. [Google Scholar]
- Zeng, Q.; Zhou, J.; Tao, J.; Chen, L.; Niu, X.; Zhang, Y. Multiscale Global Context Network for semantic segmentation of high-resolution remote sensing images. In IEEE Transactions on Geoscience and Remote Sensing; IEEE: New York, NY, USA, 2024; Volume 62, p. 5622913. [Google Scholar]
- Li, J.; Wei, Y.; Wei, T.; He, W. A comprehensive deep-learning framework for fine-grained farmland mapping from high-resolution images. In IEEE Transactions on Geoscience and Remote Sensing; IEEE: New York, NY, USA, 2025; Volume 63, p. 5601215. [Google Scholar]
- Huang, W.; Deng, F.; Liu, H.; Ding, M.; Yao, Q. Multiscale semantic segmentation of remote sensing images based on edge optimization. In IEEE Transactions on Geoscience and Remote Sensing; IEEE: New York, NY, USA, 2025; Volume 63, p. 5616813. [Google Scholar]
- Zhou, Q.; Qiang, Y.; Mo, Y.; Wu, X.; Latecki, L.J. BANet: Boundary-assistant encoder-decoder network for semantic segmentation. IEEE Trans. Intell. Transp. Syst. 2022, 23, 25259–25270. [Google Scholar] [CrossRef]
- Zhao, H.; Shi, J.; Qi, X.; Wang, X.; Jia, J. Pyramid Scene Parsing Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 2881–2890. [Google Scholar]
- Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Hou, Q.; Zhou, D.; Feng, J. Coordinate Attention for efficient mobile network design. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 13708–13717. [Google Scholar]
- Ho, J.; Kalchbrenner, N.; Weissenborn, D.; Salimans, T. Axial Attention in multidimensional transformers. arXiv 2019, arXiv:1912.12180. [Google Scholar] [CrossRef]
- Huang, Z.; Wang, X.; Wei, Y.; Huang, L.; Shi, H.; Liu, W.; Huang, T.S. CCNet: Criss-cross attention for semantic segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 603–612. [Google Scholar]
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2999–3007. [Google Scholar]
- Salehi, S.S.M.; Erdogmus, D.; Gholipour, A. Tversky Loss Function for Image Segmentation Using 3D Fully Convolutional Deep Networks. In Proceedings of the International Workshop on Machine Learning in Medical Imaging (MLMI), Quebec City, QC, Canada, 10 September 2017; pp. 379–387. [Google Scholar]
- Berman, M.; Triki, A.R.; Blaschko, M.B. The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 4413–4421. [Google Scholar]
- Gao, M.; Lu, T.; Wang, L. Crop mapping based on Sentinel-2 images using semantic segmentation model of attention mechanism. Sensors 2023, 23, 7008. [Google Scholar] [CrossRef]
- Wang, G.; Chen, J.; Mo, L.; Wu, P.; Yi, X. Lightweight land cover classification via semantic segmentation of remote sensing imagery and analysis of influencing factors. Front. Environ. Sci. 2024, 12, 1329517. [Google Scholar] [CrossRef]
- de Silva, R.; Cielniak, G.; Gao, J. Vision based crop row navigation under varying field conditions in arable fields. Comput. Electron. Agric. 2024, 217, 108581. [Google Scholar] [CrossRef]
- Chen, X.; Zhang, H.; Wong, C.U.I. Dynamic Monitoring and Precision Fertilization Decision System for Agricultural Soil Nutrients Using UAV Remote Sensing and GIS. Agriculture 2025, 15, 1627. [Google Scholar] [CrossRef]
- Yang, X.; Chen, J.; Lu, X.; Liu, H.; Liu, Y.; Bai, X.; Qian, L.; Zhang, Z. Advances in UAV Remote Sensing for Monitoring Crop Water and Nutrient Status: Modeling Methods, Influencing Factors, and Challenges. Plants 2025, 14, 2544. [Google Scholar] [CrossRef] [PubMed]
- Deng, Y.; Chen, G.; Tang, B.; Duan, X.; Zuo, L.; Zhao, H. Study on Class Imbalance in Land Use Classification for Soil Erosion in Dry–Hot Valley Regions. Remote Sens. 2025, 17, 1628. [Google Scholar] [CrossRef]
- Nadeem, N.; Asad, M.H.; Bais, A. CCUDA-SS: Cross-Crop Unsupervised Domain Adaptation for Semantic Segmentation. In IEEE Transactions on AgriFood Electronics; IEEE: New York, NY, USA, 2026. [Google Scholar] [CrossRef]
- Blekanov, I.; Molin, A.; Zhang, D.; Mitrofanov, E.; Mitrofanova, O.; Li, Y. Monitoring of grain crops nitrogen status from UAV multispectral images coupled with deep learning approaches. Comput. Electron. Agric. 2023, 212, 108047. [Google Scholar] [CrossRef]













| Class | Criteria |
|---|---|
| Background | Bare soil, inter-row gaps or non-vegetated areas with no cotton canopy present; canopy coverage ; NDVI typically . |
| Vigorous | Dense canopy with continuous rows and columns; bare soil between rows almost fully covered; canopy coverage ; plant height cm (typically 85–95); main stem nodes (typically 18–20); daily growth – cm/d; ≥9 (typically 9–11) fruit branches per plant; first fruit branch at ≤7th node (typically 6–7); ≥8 (typically 8–10) bolls per plant; NDVI . |
| Moderate | Canopy coverage incomplete but rows clearly visible; some bare soil between rows; canopy coverage ; plant height 70–85 cm; main stem nodes 15–18; daily growth – cm/d; 6–9 fruit branches; first fruit branch at the 7–8th node; 5–8 bolls per plant; NDVI –. |
| Sparse | Sparse canopy with broken rows and large bare soil exposure; canopy coverage ; plant height cm; main stem nodes ; daily growth cm/d; ≤6 fruit branches; first fruit branch at the ≥8th node (typically 8–9); ≤5 bolls per plant; NDVI . |
| Model | Year | Acc (%) | mIoU (%) | Background | Vigorous | Moderate | Sparse |
|---|---|---|---|---|---|---|---|
| U-Net [50] | 2015 | 68.28 | 47.20 | 69.54 | 28.01 | 61.57 | 29.66 |
| CENet [51] | 2019 | 67.26 | 46.43 | 69.61 | 29.64 | 58.29 | 28.17 |
| DenseASPP [52] | 2018 | 70.64 | 48.85 | 70.95 | 32.85 | 61.09 | 30.52 |
| Segmenter [53] | 2021 | 70.73 | 49.90 | 72.17 | 35.61 | 61.22 | 30.58 |
| RS3Mamba [55] | 2024 | 70.31 | 49.13 | 71.93 | 34.05 | 60.54 | 29.99 |
| MCCANet [56] | 2023 | 71.12 | 50.37 | 72.38 | 37.03 | 61.02 | 31.07 |
| MSGCNet [57] | 2024 | 71.25 | 50.49 | 72.52 | 37.18 | 61.05 | 31.20 |
| DBBANet [58] | 2025 | 70.87 | 49.95 | 72.36 | 36.17 | 60.93 | 30.34 |
| MSEONet [59] | 2025 | 71.49 | 50.64 | 72.43 | 37.90 | 61.12 | 31.11 |
| TransUNet [54] | 2024 | 70.98 | 50.12 | 71.93 | 36.84 | 60.76 | 30.95 |
| DeepLabv3+ [31] | 2018 | 70.36 | 49.84 | 72.96 | 36.47 | 59.48 | 30.45 |
| DCA-DeepLab (Ours) | – | 72.13 | 51.74 | 72.78 | 39.98 | 61.85 | 32.36 |
| Model | Acc (%) | mIoU (%) | Params (M) | FLOPs (G) | FPS |
|---|---|---|---|---|---|
| U-Net | 68.28 | 47.20 | 31.2 | 94.7 | 147.9 |
| CENet | 67.26 | 46.43 | 29.6 | 89.3 | 187.3 |
| DenseASPP | 70.64 | 48.85 | 57.9 | 176.4 | 95.7 |
| Segmenter | 70.73 | 49.90 | 86.3 | 212.8 | 116.1 |
| RS3Mamba | 70.31 | 49.13 | 41.7 | 126.1 | 44.9 |
| MCCANet | 71.12 | 50.37 | 48.6 | 149.2 | 85.4 |
| MSGCNet | 71.25 | 50.49 | 50.8 | 154.6 | 83.6 |
| DBBANet | 70.87 | 49.95 | 47.3 | 146.8 | 123.8 |
| MSEONet | 71.49 | 50.64 | 52.9 | 159.4 | 64.3 |
| TransUNet | 70.98 | 50.12 | 104.7 | 281.6 | 62.5 |
| DeepLabv3+ | 70.36 | 49.84 | 59.4 | 177.9 | 104.6 |
| DCA-DeepLab (Ours) | 72.13 | 51.74 | 61.7 | 184.6 | 68.6 |
| Model | Year | mIoU (%) | Background | Building | Road | Water | Barren | Forest | Agriculture |
|---|---|---|---|---|---|---|---|---|---|
| BANet [60] | 2022 | 50.15 | 53.94 | 62.14 | 51.33 | 64.59 | 27.07 | 43.86 | 48.12 |
| PSPNet [61] | 2017 | 48.31 | 44.40 | 52.13 | 53.52 | 76.50 | 9.73 | 44.07 | 57.85 |
| U-Net [50] | 2015 | 47.84 | 43.06 | 52.74 | 52.78 | 73.08 | 10.33 | 43.05 | 59.87 |
| Segmenter [53] | 2021 | 47.11 | 37.99 | 50.68 | 48.72 | 77.41 | 13.32 | 43.47 | 58.21 |
| RS3Mamba [55] | 2024 | 46.90 | 39.72 | 58.75 | 57.92 | 61.00 | 37.24 | 39.67 | 33.98 |
| TransUNet [54] | 2024 | 48.87 | 43.05 | 56.12 | 53.71 | 78.04 | 9.35 | 44.92 | 56.91 |
| MSEONet [59] | 2025 | 50.66 | 45.22 | 55.22 | 53.72 | 78.45 | 15.65 | 46.50 | 59.84 |
| DBBANet [58] | 2025 | 49.95 | 45.21 | 55.40 | 53.04 | 77.81 | 15.17 | 45.26 | 57.75 |
| MCCANet [56] | 2023 | 48.09 | 40.80 | 52.92 | 52.98 | 77.09 | 16.81 | 41.32 | 54.72 |
| DeepLabv3+ [31] | 2018 | 47.62 | 42.97 | 50.88 | 52.02 | 74.36 | 10.40 | 44.21 | 58.53 |
| DCA-DeepLab (Ours) | – | 51.71 | 44.92 | 54.18 | 56.32 | 71.96 | 31.34 | 44.58 | 58.70 |
| Model | MSAM-MFM | DCAG | Acc (%) | mIoU (%) | Background | Vigorous | Moderate | Sparse |
|---|---|---|---|---|---|---|---|---|
| 1 | × | × | 70.52 | 50.19 | 72.26 | 38.10 | 59.36 | 31.02 |
| 2 | ✔ | × | 71.78 | 50.81 | 71.47 | 38.48 | 61.57 | 31.71 |
| 3 | × | ✔ | 71.65 | 51.13 | 72.43 | 38.96 | 61.26 | 31.88 |
| 4 | ✔ | ✔ | 72.13 | 51.74 | 72.78 | 39.98 | 61.85 | 32.36 |
| Model | Acc (%) | mIoU (%) | Background | Vigorous | Moderate | Sparse |
|---|---|---|---|---|---|---|
| SE | 70.89 | 50.44 | 72.34 | 38.28 | 59.89 | 31.24 |
| CBAM | 71.07 | 50.58 | 72.35 | 38.44 | 60.12 | 31.39 |
| CA | 71.18 | 50.71 | 72.38 | 38.52 | 60.43 | 31.51 |
| Axial | 71.38 | 50.89 | 72.41 | 38.70 | 60.78 | 31.65 |
| CCA | 71.42 | 50.95 | 72.37 | 38.76 | 60.94 | 31.72 |
| DCAG (Ours) | 71.65 | 51.13 | 72.43 | 38.96 | 61.26 | 31.88 |
| Loss | mIoU (%) | Background | Vigorous | Moderate | Sparse |
|---|---|---|---|---|---|
| CE | 48.98 | 72.70 | 37.19 | 57.02 | 28.99 |
| Focal | 49.43 | 72.11 | 37.89 | 57.68 | 30.05 |
| Tversky | 50.84 | 72.96 | 37.97 | 60.98 | 31.45 |
| Lovász | 50.66 | 72.85 | 37.53 | 60.68 | 31.57 |
| APMFL (Ours) | 51.74 | 72.78 | 39.98 | 61.85 | 32.36 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Jia, L.; Gao, J.; Li, Z.; Shi, H.; Zhu, J. DCA-DeepLab: Dual-Coordinate Attention DeepLab with Adaptive Focal Loss for Cotton Growth Semantic Segmentation from UAV Remote Sensing Images. Drones 2026, 10, 456. https://doi.org/10.3390/drones10060456
Jia L, Gao J, Li Z, Shi H, Zhu J. DCA-DeepLab: Dual-Coordinate Attention DeepLab with Adaptive Focal Loss for Cotton Growth Semantic Segmentation from UAV Remote Sensing Images. Drones. 2026; 10(6):456. https://doi.org/10.3390/drones10060456
Chicago/Turabian StyleJia, Liruizhi, Jiazhan Gao, Zuolong Li, Heng Shi, and Jihong Zhu. 2026. "DCA-DeepLab: Dual-Coordinate Attention DeepLab with Adaptive Focal Loss for Cotton Growth Semantic Segmentation from UAV Remote Sensing Images" Drones 10, no. 6: 456. https://doi.org/10.3390/drones10060456
APA StyleJia, L., Gao, J., Li, Z., Shi, H., & Zhu, J. (2026). DCA-DeepLab: Dual-Coordinate Attention DeepLab with Adaptive Focal Loss for Cotton Growth Semantic Segmentation from UAV Remote Sensing Images. Drones, 10(6), 456. https://doi.org/10.3390/drones10060456

