Improved Mask R-CNN Multimodal Framework for Simultaneous Soil Horizon Delineation, Soil Group Identification and SOM Prediction from Soil Profile Images
Abstract
1. Introduction
2. Materials and Methods
2.1. Study Area and Soil Profiles
2.2. Labeling Schemes
2.3. Data Augmentation
2.4. The Improved Mask R-CNN Model Architecture
2.5. Loss Function
2.6. Model Training
3. Results and Discussion
3.1. Contributions of Data Augmentation and Transfer Learning in Soil Horizon Segmentation Models
3.2. Model Training Performance and Evaluation
3.3. Performance Evaluation of the Soil-Attribute Branch
3.4. Quantitative and Qualitative Evaluation of Labeling Schemes for Soil Horizon Segmentation
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Hartemink, A.; Zhang, Y.; Bockheim, J.; Curi, N.; Silva, S.; Grauer-Gray, J.; Lowe, D.J.; Krasilnikov, P. Soil horizon variation: A review. Adv. Agron. 2020, 160, 125–185. [Google Scholar]
- Raiesi, F. A minimum data set and soil quality index to quantify the effect of land use conversion on soil quality and degradation in native rangelands of upland arid and semiarid regions. Ecol. Indic. 2017, 75, 307–320. [Google Scholar] [CrossRef]
- Mirzaee, S.; Ghorbani-Dashtaki, S.; Mohammadi, J.; Asadi, H.; Asadzadeh, F. Spatial variability of soil organic matter using remote sensing data. Catena 2016, 145, 118–127. [Google Scholar] [CrossRef]
- Lehmann, J.; Bossio, D.A.; Kögel-Knabner, I.; Rillig, M.C. The concept and future prospects of soil health. Nat. Rev. Earth Environ. 2020, 1, 544–553. [Google Scholar] [CrossRef]
- Jiang, Z.-D.; Owens, P.R.; Zhang, C.-L.; Brye, K.R.; Weindorf, D.C.; Adhikari, K.; Sun, Z.-X.; Sun, F.-J.; Wang, Q.-B. Towards a dynamic soil survey: Identifying and delineating soil horizons in-situ using deep learning. Geoderma 2021, 401, 115341. [Google Scholar] [CrossRef]
- Zhang, Y.; Hartemink, A.E. Soil horizon delineation using vis-NIR and pXRF data. Catena 2019, 180, 298–308. [Google Scholar] [CrossRef]
- Sun, F.; Bakr, N.; Dang, T.; Pham, V.; Weindorf, D.C.; Jiang, Z.; Li, H.; Wang, Q.-B. Enhanced soil profile visualization using portable X-ray fluorescence (PXRF) spectrometry. Geoderma 2020, 358, 113997. [Google Scholar] [CrossRef]
- Zhang, Y.; Hartemink, A.E. A method for automated soil horizon delineation using digital images. Geoderma 2019, 343, 97–115. [Google Scholar] [CrossRef]
- Yang, R.; Chen, J.; Wang, J.; Liu, S. Toward field soil surveys: Identifying and delineating soil diagnostic horizons based on deep learning and RGB image. Agronomy 2022, 12, 2664. [Google Scholar] [CrossRef]
- Naeimi, M.; Porwal, V.; Scott, S.; Krzic, M.; Daggupati, P.; Vasava, H.; Saurette, D.; Biswas, A.; Roul, A.; Biswas, A. Deep metric learning for soil organic matter prediction: A novel similarity-based approach using smartphone-captured images. Comput. Electron. Agric. 2025, 237, 110728. [Google Scholar] [CrossRef]
- Chen, Y.; Liu, K.; Xin, Y.; Zhao, X. Soil image segmentation based on mask R-CNN. In Proceedings of the 2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE), Guangzhou, China, 6–8 January 2023; pp. 507–510. [Google Scholar]
- Viscarra Rossel, R.; Webster, R. Discrimination of Australian soil horizons and classes from their visible–near infrared spectra. Eur. J. Soil Sci. 2011, 62, 637–647. [Google Scholar] [CrossRef]
- Yosinski, J.; Clune, J.; Bengio, Y.; Lipson, H. How transferable are features in deep neural networks? Adv. Neural Inf. Process. Syst. 2014, 27. [Google Scholar] [CrossRef]
- Pan, S.J.; Yang, Q. A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 2009, 22, 1345–1359. [Google Scholar] [CrossRef]
- Bayoudh, K.; Knani, R.; Hamdaoui, F.; Mtibaa, A. A survey on deep multimodal learning for computer vision: Advances, trends, applications, and datasets. Vis. Comput. 2022, 38, 2939–2970. [Google Scholar] [CrossRef]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2961–2969. [Google Scholar]
- Lu, L.; Wang, J. High-resolution mapping of soil texture at various depths in Anhui Province, China. Earth Sci. Inform. 2025, 18, 174. [Google Scholar] [CrossRef]
- Yabuki, T.; Matsumura, Y.; Nakatani, Y. Evaluation of pedodiversity and land use diversity in terms of the Shannon entropy. arXiv 2009, arXiv:0905.2821. [Google Scholar] [CrossRef]
- Ditzler, C.; Scheffe, K.; Monger, H. Soil Survey Manual. In USDA Handbook 18; United States Department of Agriculture: Washington, DC, USA, 2017. [Google Scholar]
- Jahn, R.; Blume, H.-P.; Asio, V.; Spaargaren, O.; Schad, P. Guidelines for Soil Description; FAO: Rome, Italy, 2006. [Google Scholar]
- NY/T 1121.6–2006; Soil Testing-Part 6: Method for Determination of Soil Organic Matter. Ministry of Agriculture of the People’s Republic of China: Beijing, China, 2006.
- Pribyl, D.W. A critical review of the conventional SOC to SOM conversion factor. Geoderma 2010, 156, 75–83. [Google Scholar] [CrossRef]
- GB/T 17296–2009; Classification and Codes for Chinese Soil. Standardization Administration of the People’s Republic of China: Beijing, China, 2009.
- Russell, B.C.; Torralba, A.; Murphy, K.P.; Freeman, W.T. LabelMe: A database and web-based tool for image annotation. Int. J. Comput. Vis. 2008, 77, 157–173. [Google Scholar] [CrossRef]
- Ahmed, S.F.; Alam, M.S.B.; Hassan, M.; Rozbu, M.R.; Ishtiak, T.; Rafa, N.; Mofijur, M.; Shawkat Ali, A.; Gandomi, A.H. Deep learning modelling techniques: Current progress, applications, advantages, and challenges. Artif. Intell. Rev. 2023, 56, 13521–13617. [Google Scholar] [CrossRef]
- Wong, S.C.; Gatt, A.; Stamatescu, V.; McDonnell, M.D. Understanding data augmentation for classification: When to warp? In Proceedings of the 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA), Gold Coast, Australia, 30 November–2 December 2016; pp. 1–6. [Google Scholar]
- Pham, V.; Weindorf, D.C.; Dang, T. Soil profile analysis using interactive visualizations, machine learning, and deep learning. Comput. Electron. Agric. 2021, 191, 106539. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Lin, T.-Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2117–2125. [Google Scholar]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
- Birodkar, V.; Lu, Z.; Li, S.; Rathod, V.; Huang, J. The surprising impact of mask-head architecture on novel class segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Virtual, 11–17 October 2021; pp. 7015–7025. [Google Scholar]
- Shorten, C.; Khoshgoftaar, T.M. A survey on image data augmentation for deep learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef]
- Zhong, S.; Han, Z.; Du, J.; Ci, E.; Ni, J.; Xie, D.; Wei, C. Relationships between the lithology of purple rocks and the pedogenesis of purple soils in the Sichuan Basin, China. Sci. Rep. 2019, 9, 13272. [Google Scholar] [CrossRef] [PubMed]
- Weidong, L.; Baret, F.; Xingfa, G.; Qingxi, T.; Lanfen, Z.; Bing, Z. Relating soil surface moisture to reflectance. Remote Sens. Environ. 2002, 81, 238–246. [Google Scholar] [CrossRef]
- Guo, Z.; Zhang, Z.; Zhou, H.; Wang, D.; Peng, X. The effect of 34-year continuous fertilization on the SOC physical fractions and its chemical composition in a Vertisol. Sci. Rep. 2019, 9, 2505. [Google Scholar] [CrossRef]
- Lin, L.; Gao, Z.; Liu, X.; Sun, Y. A new method for multicolor determination of organic matter in moist soil. Catena 2021, 207, 105611. [Google Scholar] [CrossRef]
- Qiu, N.-X.; Xie, X.-L.; Guan, L.; Li, A.-B.; Liu, J.; Liu, M.; Zhao, Y.-G. Eliminating the influence of free iron oxides on the prediction of organic matter in red soils using Vis-NIR reflectance spectroscopy. Geoderma 2025, 464, 117639. [Google Scholar] [CrossRef]
- Gozukara, G.; Hartemink, A.E.; Zhang, Y. Illumination levels affect the prediction of soil organic carbon using smartphone-based digital images. Comput. Electron. Agric. 2023, 204, 107524. [Google Scholar] [CrossRef]
- Islam, M.; Glocker, B. Spatially varying label smoothing: Capturing uncertainty from expert annotations. In Proceedings of the International Conference on Information Processing in Medical Imaging, Virtual Event, 28–30 June 2021; pp. 677–688. [Google Scholar]
- Kim, D.; Tsai, Y.-H.; Suh, Y.; Faraki, M.; Garg, S.; Chandraker, M.; Han, B. Learning semantic segmentation from multiple datasets with label shifts. In Proceedings of the European Conference on Computer Vision, Tel Aviv, Israel, 23–27 October 2022; pp. 20–36. [Google Scholar]
- Reiß, S.; Seibold, C.; Freytag, A.; Rodner, E.; Stiefelhagen, R. Every annotation counts: Multi-label deep supervision for medical image segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 9532–9542. [Google Scholar]
- Marmanis, D.; Schindler, K.; Wegner, J.D.; Galliani, S.; Datcu, M.; Stilla, U. Classification with an edge: Improving semantic image segmentation with boundary detection. ISPRS J. Photogramm. Remote Sens. 2018, 135, 158–172. [Google Scholar] [CrossRef]
- Ding, H.; Jiang, X.; Liu, A.Q.; Thalmann, N.M.; Wang, G. Boundary-aware feature propagation for scene segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27–28 October 2019; pp. 6819–6829. [Google Scholar]




| Section | Category | Count (n) | Percentage (%) |
|---|---|---|---|
| Soil group | Purple soils | 19 | 4.21 |
| Limestone soils | 20 | 4.43 | |
| Red soils | 65 | 14.4 | |
| Skeletal soils | 35 | 7.76 | |
| Yellow-brown earths | 45 | 9.98 | |
| Paddy soils | 101 | 22.39 | |
| Shajiang black soils | 70 | 15.52 | |
| Fluvo-aquic soils | 55 | 12.21 | |
| Yellow-brown soils | 28 | 6.21 | |
| Yellow soils | 13 | 2.89 | |
| Total | 451 | 100 | |
| Horizon occurrence | A | 446 | 98.89 |
| B | 347 | 76.94 | |
| C | 293 | 64.97 | |
| AB | 306 | 67.85 | |
| AC | 104 | 23.06 | |
| BC | 168 | 37.25 |
| Soil Horizon Segmentation | Soil Group Classification | SOM Prediction | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Precision | Recall | F1 | Seg mAP | Accuracy | F1_Group | R2 | MAE | RMSE | |
| NDA-Scheme 1 | 0.694 | 0.545 | 0.611 | 0.666 | - | - | - | - | - |
| NDA-Scheme 2 | 0.543 | 0.397 | 0.459 | 0.440 | - | - | - | - | - |
| NDA-Scheme 3 | 0.627 | 0.464 | 0.533 | 0.525 | - | - | - | - | |
| Scheme 1 | 0.901 | 0.919 | 0.910 | 0.854 | 0.689 | 0.624 | 0.514 | 0.253 | 0.367 |
| Scheme 2 | 0.896 | 0.826 | 0.860 | 0.802 | 0.674 | 0.601 | 0.486 | 0.274 | 0.389 |
| Scheme 3 | 0.925 | 0.933 | 0.929 | 0.918 | 0.717 | 0.677 | 0.565 | 0.227 | 0.327 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, Q.; Fang, G.; Zhang, N.; Pei, C.; Wu, S.; Yang, M.; Shen, J.; Yu, K.; Shi, X.; Sun, W.; et al. Improved Mask R-CNN Multimodal Framework for Simultaneous Soil Horizon Delineation, Soil Group Identification and SOM Prediction from Soil Profile Images. Soil Syst. 2026, 10, 39. https://doi.org/10.3390/soilsystems10030039
Liu Q, Fang G, Zhang N, Pei C, Wu S, Yang M, Shen J, Yu K, Shi X, Sun W, et al. Improved Mask R-CNN Multimodal Framework for Simultaneous Soil Horizon Delineation, Soil Group Identification and SOM Prediction from Soil Profile Images. Soil Systems. 2026; 10(3):39. https://doi.org/10.3390/soilsystems10030039
Chicago/Turabian StyleLiu, Qi, Guodong Fang, Naichi Zhang, Chenhao Pei, Song Wu, Min Yang, Jie Shen, Kai Yu, Xuezheng Shi, Weixia Sun, and et al. 2026. "Improved Mask R-CNN Multimodal Framework for Simultaneous Soil Horizon Delineation, Soil Group Identification and SOM Prediction from Soil Profile Images" Soil Systems 10, no. 3: 39. https://doi.org/10.3390/soilsystems10030039
APA StyleLiu, Q., Fang, G., Zhang, N., Pei, C., Wu, S., Yang, M., Shen, J., Yu, K., Shi, X., Sun, W., Liu, J., Liu, C., & Wang, Y. (2026). Improved Mask R-CNN Multimodal Framework for Simultaneous Soil Horizon Delineation, Soil Group Identification and SOM Prediction from Soil Profile Images. Soil Systems, 10(3), 39. https://doi.org/10.3390/soilsystems10030039

