CIDR-MobileNet: A Monocular Pseudo-Depth and Cross-Modal Feature Fusion Approach for Chili Pepper Above-Ground Biomass Estimation
Abstract
1. Introduction
- (1)
- Pseudo-depth information is generated from a single RGB image using the Depth Anything V2 model, providing low-cost 3D structural features and overcoming the limitation of traditional methods that rely on additional depth sensors.
- (2)
- A cross-modal feature interaction fusion (CFIF) module is designed to achieve deep semantic interaction between RGB and pseudo-depth features via a bidirectional attention mechanism, enhancing multimodal feature complementarity.
- (3)
- A multi-branch distribution-based regression head (MDBR-Head) is constructed to formulate biomass prediction as a distribution estimation problem, improving model robustness and uncertainty awareness.
- (4)
- A ranking loss is introduced to enforce ordinal constraints among predicted biomass values, strengthening the modelling of relative relationships, while maintaining a lightweight design suitable for edge-device deployment.
2. Materials and Methods
2.1. Dataset Description
Data Acquisition Method
2.2. Dataset Construction and Annotation
2.2.1. Dataset Size
2.2.2. Annotation Procedure
- (1)
- Spatial binding: During image acquisition, a tag carrying a unique code for the sampling unit was placed within the camera’s field of view. The spatial correspondence between the image and the sampling unit was established through visual markers, achieving a one-image-one-code precise binding.
- (2)
- Temporal binding: Image acquisition and biomass weighing were completed within the same time window, and the start and end times of both operations were recorded using timestamps. This approach avoided the influence of plant growth dynamics on the mapping relationship and ensured temporal consistency.
2.3. Pseudo-Depth Map Generation
2.3.1. Method Origin
2.3.2. Generation Workflow
- (1)
- Input preprocessing: The acquired RGB images (maintaining their original resolution) were pixel-normalised to comply with the model input specifications.
- (2)
- Feature extraction and depth regression: The model extracted multi-scale visual features using an encoder, and after decoder-based fusion, predicted relative depth values at the pixel level.
- (3)
- Output mapping: A grayscale pseudo-depth map with a resolution identical to that of the input RGB image was generated, in which the grayscale intensity of each pixel represents the relative height and spatial topological relationships of the plant canopy.
2.4. Overall Network Architecture
2.4.1. Dual-Branch Structure
2.4.2. Feature Extraction Backbone Network
2.4.3. Overall Workflow
2.5. CFIF Feature Fusion Module
2.5.1. Cross-Feature Interactive Fusion (CFIF) Module
2.5.2. Fusion Mechanism
- (1)
- Explicit Interaction Layer
- (2)
- Adaptive Fusion Layer
- (3)
- Residual Topology
2.6. Multi-Branch Distribution Regression Head (MDBR-Head)
2.6.1. Architecture of the Regression Head
2.6.2. Multi-Granularity Feature Decoupling Head
- (1)
- Global Semantic Branch
- (2)
- Local Structural Branch
- (3)
- Distribution Regression Branch
2.6.3. Dynamic Adaptive Fusion Mechanism
2.7. Pairwise Ranking Loss
2.7.1. Random Pairwise Ranking Modelling Mechanism
2.7.2. Definition of the Ranking Loss
2.7.3. Stage-Wise Training Strategy
3. Results
3.1. Experimental Setup and Evaluation Metrics
3.1.1. Experimental Environment and Hyperparameter Settings
3.1.2. Evaluation Metrics
3.2. Multi-Modal Effectiveness Analysis
3.3. Comparative Experiments
3.4. Ablation Experiments
3.4.1. Effectiveness of the Cross-Modal Feature Interaction Fusion Module (CFIF)
3.4.2. Contribution of the Multi-Branch Distribution Regression Head (MDBR-Head)
3.4.3. Optimisation Effect of the Ranking Loss
3.5. Visualisation Analysis
3.5.1. Training Convergence and Stability Analysis
3.5.2. Analysis of Prediction Fitting Ability
3.5.3. Residual Error Distribution Analysis
3.5.4. Model Stability Evaluation Based on K-Fold Cross-Validation
4. Discussion
4.1. Synergistic Enhancement of Model Performance by Core Modules
4.2. Distribution Regression vs. Point Estimation
4.3. Application Scenarios and Comparison with Existing Solutions
4.4. Statistical Significance
4.5. Limitations and Future Work
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| AGB | above-ground biomass |
| NDVI | Normalised Difference Vegetation Index |
| NDRE | Normalised Difference Red Edge |
| CNN | convolutional neural network |
| UAV | unmanned aerial vehicle |
| UAS | unmanned aerial system |
| LiDAR | Light Detection and Ranging |
| MVS | multi-view stereo |
| CFIF | cross-feature interactive fusion |
| GAP | global average pooling |
| ReLU | Rectified Linear Unit |
| BN | batch normalisation |
| MDBR | multi-branch distribution regression |
| MAE | mean absolute error |
| MAPE | mean absolute percentage error |
| RMSE | root mean square error |
References
- Karim, K.M.R.; Rafii, M.Y.; Misran, A.B.; Ismail, M.F.B.; Harun, A.R.; Khan, M.M.H.; Chowdhury, M.F.N. Current and Prospective Strategies in the Varietal Improvement of Chilli (Capsicum annuum L.) Specially Heterosis Breeding. Agronomy 2021, 11, 2217. [Google Scholar] [CrossRef]
- Liu, Y.; Feng, H.; Yue, J.; Fan, Y.; Jin, X.; Song, X.; Yang, H.; Yang, G. Estimation of Potato Above-Ground Biomass Based on Vegetation Indices and Green-Edge Parameters Obtained from UAVs. Remote Sens. 2022, 14, 5323. [Google Scholar] [CrossRef]
- Smith, E.N.; van Aalst, M.; Tosens, T.; Niinemets, Ü.; Stich, B.; Morosinotto, T.; Alboresi, A.; Erb, T.J.; Gómez-Coronado, P.A.; Tolleter, D.; et al. Improving Photosynthetic Efficiency toward Food Security: Strategies, Advances, and Perspectives. Mol. Plant 2023, 16, 1547–1563. [Google Scholar] [CrossRef] [PubMed]
- Tai, S.; Tang, Z.; Li, B.; Wang, S.; Guo, X. Intelligent Recognition and Automated Production of Chili Peppers: A Review Addressing Varietal Diversity and Technological Requirements. Agriculture 2025, 15, 1200. [Google Scholar] [CrossRef]
- Buxbaum, N.; Lieth, J.H.; Earles, M. Non-destructive Plant Biomass Monitoring with High Spatio-Temporal Resolution via Proximal RGB-D Imagery and End-to-End Deep Learning. Front. Plant Sci. 2022, 13, 758818. [Google Scholar] [CrossRef] [PubMed]
- Munser, L.; Sathyanarayanan, K.K.; Raecke, J.; Mansour, M.M.; Uland, M.E.; Streif, S. Precise and Continuous Biomass Measurement for Plant Growth Using a Low-Cost Sensor Setup. Sensors 2025, 25, 4770. [Google Scholar] [CrossRef] [PubMed]
- Wang, J.; Chen, C.; Wang, J.; Yao, Z.; Wang, Y.; Zhao, Y.; Sun, Y.; Wu, F.; Han, D.; Yang, G.; et al. NDVI Estimation Throughout the Whole Growth Period of Multi-Crops Using RGB Images and Deep Learning. Agronomy 2025, 15, 63. [Google Scholar] [CrossRef]
- Cardenas-Gallegos, J.S.; Lacerda, L.N.; Severns, P.M.; Peduzzi, A.; Klimeš, P.; Ferrarezi, R.S. Advancing Biomass Estimation in Hydroponic Lettuce Using RGB-Depth Imaging and Morphometric Descriptors with Machine Learning. Comput. Electron. Agric. 2025, 234, 110299. [Google Scholar] [CrossRef]
- Dhawi, F.; Ghafoor, A.; Almousa, N.; Ali, S.; Alqanbar, S. Predictive Modelling Employing Machine Learning, Convolutional Neural Networks (CNNs), and Smartphone RGB Images for Non-Destructive Biomass Estimation of Pearl Millet (Pennisetum glaucum). Front. Plant Sci. 2025, 16, 1594728. [Google Scholar] [CrossRef] [PubMed]
- Carlier, A.; Dandrifosse, S.; Dumont, B.; Mercatoris, B. Comparing CNNs and PLSr for Estimating Wheat Organs Biophysical Variables Using Proximal Sensing. Front. Plant Sci. 2023, 14, 1204791. [Google Scholar] [CrossRef] [PubMed]
- Schreiber, L.V.; Atkinson Amorim, J.G.; Guimarães, L.; Motta Matos, D.; Maciel da Costa, C.; Parraga, A. Above-Ground Biomass Wheat Estimation: Deep Learning with UAV-Based RGB Images. Appl. Artif. Intell. 2022, 36, 2055392. [Google Scholar] [CrossRef]
- Okada, M.; Barras, C.; Toda, Y.; Hamazaki, K.; Ohmori, Y.; Yamasaki, Y.; Takahashi, H.; Takanashi, H.; Tsuda, M.; Hirai, M.Y.; et al. High-Throughput Phenotyping of Soybean Biomass: Conventional Trait Estimation and Novel Latent Feature Extraction Using UAV Remote Sensing and Deep Learning Models. Plant Phenomics 2024, 6, 0244. [Google Scholar] [CrossRef] [PubMed]
- Jin, Z.; Hong, W.; Wang, Y.; Jiang, C.; Zhang, B.; Sun, Z.; Liu, S.; Lv, C. A Transformer-Based Symmetric Diffusion Segmentation Network for Wheat Growth Monitoring and Yield Counting. Agriculture 2025, 15, 670. [Google Scholar] [CrossRef]
- Bazrafkan, A.; Igathinathane, C.; Bandillo, N.; Flores, P. Optimizing Integration Techniques for UAS and Satellite Image Data in Precision Agriculture—A Review. Front. Remote Sens. 2025, 6, 1622884. [Google Scholar] [CrossRef]
- Liu, Y.; Feng, H.; Yue, J.; Fan, Y.; Bian, M.; Ma, Y.; Jin, X.; Song, X.; Yang, G. Estimating Potato Above-Ground Biomass by Using Integrated Unmanned Aerial System-Based Optical, Structural, and Textural Canopy Measurements. Comput. Electron. Agric. 2023, 213, 108229. [Google Scholar] [CrossRef]
- Castro, W.; Marcato Junior, J.; Polidoro, C.; Osco, L.P.; Gonçalves, W.; Rodrigues, L.; Santos, M.; Jank, L.; Barrios, S.; Valle, C.; et al. Deep Learning Applied to Phenotyping of Biomass in Forages with UAV-Based RGB Imagery. Sensors 2020, 20, 4802. [Google Scholar] [CrossRef] [PubMed]
- Zhu, S.; Zhang, W.; Yang, T.; Wu, F.; Jiang, Y.; Yang, G.; Zain, M.; Zhao, Y.; Yao, Z.; Liu, T.; et al. Combining 2D Image and Point Cloud Deep Learning to Predict Wheat Above Ground Biomass. Precis. Agric. 2024, 25, 3139–3166. [Google Scholar] [CrossRef]
- Deng, L.; Fu, B.; Wu, Y.; He, H.; Sun, W.; Jia, M.; Deng, T.; Fan, D. Comparison of 2D and 3D Vegetation Species Mapping in Three Natural Scenarios Using UAV-LiDAR Point Clouds and Improved Deep Learning Methods. Int. J. Appl. Earth Obs. Geoinf. 2023, 125, 103588. [Google Scholar] [CrossRef]
- Chang, J.; Walsh, J.J.; Mangina, E.; Ruan, J.; Negrão, S. AI-Driven 3D Point Cloud Analysis in Plant Phenotyping: A Systematic Review. Smart Agric. Technol. 2026, 13, 101869. [Google Scholar] [CrossRef]
- Li, Y.; Liang, Z.; Liu, B.; Yin, L.; Wan, F.; Qian, W.; Qiao, X. Applications of 3D Reconstruction Techniques in Crop Canopy Phenotyping: A Review. Agronomy 2025, 15, 2518. [Google Scholar] [CrossRef]
- Li, S.; Cui, Z.; Yang, J.; Wang, B. A Review of Optical-Based Three-Dimensional Reconstruction and Multi-Source Fusion for Plant Phenotyping. Sensors 2025, 25, 3401. [Google Scholar] [CrossRef] [PubMed]
- Wu, Z.; He, N.; Wu, Y.; Guo, X.; Wen, W. Point Cloud Data-Driven Methods for Estimating Maize Leaf Biomass. Smart Agric. 2026, 8, 156–166. [Google Scholar] [CrossRef]
- Wang, F.; Jia, W.; Guo, H.; Zhang, X.; Li, D.; Li, Z.; Sun, Y. Point Cloud-Based Crown Volume Improves Tree Biomass Estimation: Evaluating Different Crown Volume Extraction Algorithms. Comput. Electron. Agric. 2024, 225, 109288. [Google Scholar] [CrossRef]
- Verma, M.K.; Yadav, M. 3D LiDAR-Based Techniques and Cost-Effective Measures for Precision Agriculture: A Review. Rev. Int. Geomat. 2025, 34, 855–879. [Google Scholar] [CrossRef]
- Rosell-Polo, J.R.; Gregorio, E.; Gené, J.; Llorens, J.; Torrent, X.; Arnó, J.; Escolà, A. Kinect v2 Sensor-Based Mobile Terrestrial Laser Scanner for Agricultural Outdoor Applications. IEEE/ASME Trans. Mechatron. 2017, 22, 2420–2427. [Google Scholar] [CrossRef]
- Liu, X.; Xu, K.; Shinoda, R.; Santo, H.; Okura, F. Masks-to-Skeleton: Multi-View Mask-Based Tree Skeleton Extraction with 3D Gaussian Splatting. Sensors 2025, 25, 4354. [Google Scholar] [CrossRef] [PubMed]
- Liu, Z.; Liu, J.; Zuo, X.; Hu, M. Multi-Scale Iterative Refinement Network for RGB-D Salient Object Detection. Eng. Appl. Artif. Intell. 2021, 106, 104473. [Google Scholar] [CrossRef]
- Chen, H.; Zhou, H.; Zhang, Y.; Lin, Z.; Deng, Y. Dissecting RGB-D Learning for Improved Multi-Modal Fusion. IEEE Trans. Image Process. 2026, 35, 1846–1857. [Google Scholar] [CrossRef] [PubMed]
- Yang, L.; Kang, B.; Huang, Z.; Xu, X.; Feng, J.; Zhao, H. Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2024; pp. 10371–10381. [Google Scholar] [CrossRef]
- Yang, L.; Kang, B.; Huang, Z.; Zhao, Z.; Xu, X.; Feng, J.; Zhao, H. Depth Anything V2. Adv. Neural Inf. Process. Syst. 2024, 37, 21875–21911. [Google Scholar] [CrossRef]
- Shang, Y.; Sun, G.; Zhang, H. DepthCL-Seg: Dual-Stream Feature Fusion for Green Fruit Instance Segmentation Based on Monocular Depth. Agriculture 2026, 16, 283. [Google Scholar] [CrossRef]
- Li, Y.; Song, J.; Chen, C.; Liu, X. Learning Modality Complementarity for RGB-D Salient Object Detection via Dynamic Neural Network. Electronics 2026, 15, 1361. [Google Scholar] [CrossRef]
- Yang, Y.; Huang, N.; Zhang, Q.; Han, J. Mitigating Fusion Bias for RGB-D Salient Object Detection. Pattern Recognit. 2026, 171, 112135. [Google Scholar] [CrossRef]
- Gonçalves, F.M.F.; Pedronette, D.C.G.; da Silva Torres, R. Regression by Re-Ranking. Pattern Recognit. 2023, 140, 109577. [Google Scholar] [CrossRef]
- Sun, T.; Sheldon, D.; O’Connor, B. A Probabilistic Approach for Learning with Label Proportions Applied to the US Presidential Election. In 2017 IEEE International Conference on Data Mining (ICDM); IEEE: New York, NY, USA, 2017; pp. 445–454. [Google Scholar] [CrossRef]
- Paul, A.; Machavaram, R. Evolution of Sweet Pepper (Capsicum annuum) Harvesting: From Traditional Practices to Mechanization and Robotics. Curr. Res. Agric. Sci. 2025, 2, 100047. [Google Scholar] [CrossRef]
- Han, D.; Wang, C.; Zhang, H.; Pang, H.; Wang, X.; Chen, X.; Wen, X. Advances in Mechanized Harvesting Technologies and Equipment for Chili Peppers. Agriculture 2025, 15, 1129. [Google Scholar] [CrossRef]
- Cao, S.; Xu, B.; Zhou, W.; Zhou, L.; Zhang, J.; Zheng, Y.; Hu, W.; Han, Z.; Lu, H. The blessing of Depth Anything: An almost unsupervised approach to crop segmentation with depth-informed pseudo labeling. Plant Phenomics 2025, 7, 100005. [Google Scholar] [CrossRef] [PubMed]
- Papić, V.; Bugarin, N.; Marin, I.; Gotovac, S.; Gugić, J. Monocular depth estimation driven canopy segmentation for enhanced determination of vegetation indices in olive grove monitoring. Remote Sens. 2025, 17, 3245. [Google Scholar] [CrossRef]
- Zhuo, Y.; You, F. 3D plant phenotyping from a single image: Learning fine-scale organ morphology with monocular depth estimation. Comput. Electron. Agric. 2025, 239, 110925. [Google Scholar] [CrossRef]
- Howard, A.; Sandler, M.; Chu, G.; Chen, L.-C.; Chen, B.; Tan, M.; Wang, W.; Weyand, T.; Pang, R.; Vasudevan, V.; et al. Searching for MobileNetV3. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2019; pp. 1314–1324. [Google Scholar] [CrossRef]
- Howard, A.G.; Zhu, M.; Chen, B.; Kalenichenko, D.; Wang, W.; Weyand, T.; Andreetto, M.; Adam, H. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv 2017, arXiv:1704.04861. [Google Scholar] [CrossRef]
- Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.-C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2018; pp. 4510–4520. [Google Scholar] [CrossRef]
- Sosa-Herrera, J.A.; Alvarez-Jarquin, N.; Cid-Garcia, N.M.; López-Araujo, D.J.; Vallejo-Pérez, M.R. Automated Health Estimation of Capsicum annuum L. Crops by Means of Deep Learning and RGB Aerial Images. Remote Sens. 2022, 14, 4943. [Google Scholar] [CrossRef]
- Li, H.; Shi, D.; Shi, H.; Li, M.; Diao, M. A Stage-Aware Cascaded Detection–Segmentation Framework for Leaf Phenotyping and Leaf Dry Biomass Estimation of Pepper Seedlings. Plants 2026, 15, 1912. [Google Scholar] [CrossRef] [PubMed]
- Zhang, R.; Peng, J.; Chen, H.; Peng, H.; Wang, Y.; Jiang, P. Enhancing Aboveground Biomass Prediction through Integration of the SCDR Paradigm into the U-Like Hierarchical Residual Fusion Model. Sensors 2024, 24, 2464. [Google Scholar] [CrossRef] [PubMed]
- Niccoli, N.; Seidenari, L.; Greco, I.; Rovero, F. Benchmark on Monocular Metric Depth Estimation in Wildlife Setting. In Image Analysis and Processing—ICIAP 2025 Workshops; Rodolà, E., Galasso, F., Masi, I., Eds.; Springer Nature: Cham, Switzerland, 2026; pp. 529–539. [Google Scholar] [CrossRef]









| Dataset | Total Samples (N) | Target Variable Range (g) | Training/Validation Split (10-fold) | Mean ± Std of Biomass (g) |
|---|---|---|---|---|
| Dataset Description | 275 | 271–6451 | 90%/10% | 2724.51 ± 1683.95 |
| Parameter | Setting |
|---|---|
| Optimizer | AdamW |
| Initial learning rate | 0.0001 |
| Batch size | 16 |
| Training epochs | 100 |
| Weight decay | 0.0001 |
| Input size | 224 × 224 |
| Loss function | Dynamic loss (Huber Loss for the first 10 epochs; thereafter Huber Loss + 0.3 × Ranking Loss) |
| K-fold cross-validation | 10 folds |
| Random seed | 42 |
| Models | MAE (g) | MAPE (%) | RMSE (g) | R2 (95% CI) | Parameters (M) | Size (M) |
|---|---|---|---|---|---|---|
| RGB_Only | 241.12 | 19.90 | 296.77 | 0.943 [0.912, 0.974] | 0.928 | 0.928 |
| Depth_Only | 311.64 | 17.78 | 403.71 | 0.923 [0.881, 0.966] | 0.928 | 0.928 |
| Dual | 273.64 | 14.86 | 345.58 | 0.955 [0.917, 0.993] | 1.855 | 1.855 |
| Models | MAE (g) | MAPE (%) | RMSE (g) | R2 (95% CI) |
|---|---|---|---|---|
| ShuffleNetV2 | 294.09 | 19.52 | 367.48 | 0.934 [0.890, 0.978] |
| GhostNetV3 Small | 290.24 | 19.74 | 366.63 | 0.933 [0.895, 0.970] |
| MobileNetV3_Large | 309.10 | 24.71 | 380.73 | 0.934 [0.892, 0.976] |
| EfficientNet-B0 | 238.52 | 13.24 | 307.40 | 0.956 [0.922, 0.990] |
| MobileViT-S | 237.40 | 16.88 | 293.05 | 0.955 [0.922, 0.988] |
| Tiny ViT 5M | 277.93 | 19.21 | 357.40 | 0.938 [0.901, 0.974] |
| RepViT-0.9 | 175.71 | 11.95 | 226.71 | 0.968 [0.956, 0.979] |
| MobileNetV3 _Small (Baseline) | 273.64 | 14.86 | 345.58 | 0.955 [0.917, 0.993] |
| CIDR-MobileNet (Ours) | 174.56 | 9.56 | 230.74 | 0.972 [0.945, 0.998] |
| Models | Parameters (M) | Size (M) | GFLOPs | CPU Latency (ms) | Training Time |
|---|---|---|---|---|---|
| ShuffleNetV2 | 3.29 | 12.91 | 0.36 | 18.27 | 1 h 56 m 42 s |
| GhostNetV3 Small | 4.23 | 18.10 | 0.57 | 102.36 | 2 h 11 m 32 s |
| MobileNetV3_Large | 5.95 | 22.87 | 0.44 | 19.66 | 2 h 30 m 46 s |
| EfficientNet-B0 | 8.02 | 30.90 | 0.12 | 40.47 | 2 h 19 m 23 s |
| MobileViT-S | 9.88 | 37.77 | 2.88 | 40.92 | 2 h 35 m 8 s |
| Tiny ViT 5M | 10.10 | 38.98 | 1.69 | 36.47 | 3 h 24 m 45 s |
| RepViT-0.9 | 13.15 | 50.23 | 3.57 | 53.28 | 2 h 32 m 48 s |
| MobileNetV3 _Small (Baseline) | 1.86 | 7.17 | 0.12 | 10.86 | 2 h 1 m 8 s |
| CIDR-MobileNet (Ours) | 3.28 | 12.59 | 0.18 | 10.56 | 2 h 2 m 51 s |
| CFIF | MDBR-Head | Ranking Loss | MAE (g) | MAPE (%) | RMSE (g) | R2 |
|---|---|---|---|---|---|---|
| × | × | × | 273.64 | 14.86 | 345.58 | 0.955 |
| √ | × | × | 234.37 | 11.69 | 304.40 | 0.956 |
| × | √ | × | 214.68 | 11.33 | 285.01 | 0.959 |
| × | × | √ | 219.82 | 12.28 | 283.66 | 0.957 |
| √ | √ | × | 220.27 | 12.57 | 277.59 | 0.962 |
| × | √ | √ | 208.18 | 10.22 | 266.83 | 0.961 |
| √ | × | √ | 225.84 | 13.08 | 277.86 | 0.963 |
| √ | √ | √ | 174.56 | 9.56 | 230.74 | 0.972 |
| Comparison | MAE | MAPE | RMSE | R2 |
|---|---|---|---|---|
| CIDR-MobileNet vs. MobileNetV3_Small (Baseline) | <0.001 | <0.001 | <0.001 | <0.001 |
| CIDR-MobileNet vs. RepViT-0.9 | 0.012 | 0.018 | <0.001 | 0.009 |
| CIDR-MobileNet vs. EfficientNet-B0 | <0.001 | <0.001 | <0.001 | <0.001 |
| CIDR-MobileNet vs. MobileViT-S | <0.001 | <0.001 | <0.001 | 0.002 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Wang, Y.; Deng, J.; Yang, L.; Ruan, S.; Wang, W.; Hu, W.; Jiang, P. CIDR-MobileNet: A Monocular Pseudo-Depth and Cross-Modal Feature Fusion Approach for Chili Pepper Above-Ground Biomass Estimation. Agriculture 2026, 16, 1457. https://doi.org/10.3390/agriculture16131457
Wang Y, Deng J, Yang L, Ruan S, Wang W, Hu W, Jiang P. CIDR-MobileNet: A Monocular Pseudo-Depth and Cross-Modal Feature Fusion Approach for Chili Pepper Above-Ground Biomass Estimation. Agriculture. 2026; 16(13):1457. https://doi.org/10.3390/agriculture16131457
Chicago/Turabian StyleWang, Yi, Jingtao Deng, Lin Yang, Shangjing Ruan, Weijie Wang, Wenwu Hu, and Ping Jiang. 2026. "CIDR-MobileNet: A Monocular Pseudo-Depth and Cross-Modal Feature Fusion Approach for Chili Pepper Above-Ground Biomass Estimation" Agriculture 16, no. 13: 1457. https://doi.org/10.3390/agriculture16131457
APA StyleWang, Y., Deng, J., Yang, L., Ruan, S., Wang, W., Hu, W., & Jiang, P. (2026). CIDR-MobileNet: A Monocular Pseudo-Depth and Cross-Modal Feature Fusion Approach for Chili Pepper Above-Ground Biomass Estimation. Agriculture, 16(13), 1457. https://doi.org/10.3390/agriculture16131457

