Machine Learning Approaches for Assessing Avocado Alternate Bearing Using Sentinel-2 and Climate Variables—A Case Study in Limpopo, South Africa
Highlights
- Alternate bearing in avocado can be effectively assessed and predicted using a combination of Sentinel-2 vegetation indices and key climatic variables (VPD, Tmin, Tmax, precipitation) during the flowering period.
- The TabPFN model outperformed other machine learning algorithms (Accuracy = 0.88; AUC = 0.95) due to its ability to capture nonlinear relationships among phenological, spectral, and climatic factors.
- Early prediction of “on” and “off” years enables improved orchard management, optimized harvest planning, and better alignment of market supply with production potential.
- Integration of remote sensing and climate data provides a scalable framework for stabilizing avocado yield and supporting sustainable orchard management under variable climatic conditions.
Abstract
1. Introduction
- The development of a high resolution, remotely sensed framework that integrates Sentinel-2 spectral indices (VIs and FIs) with climate variables to characterize canopy dynamics associated with alternate bearing.
- The application and comparison of multiple ML algorithms to detect and classify AB behavior in commercial avocado orchards.
- A demonstration of the feasibility of remote sensing-driven AB detection in a data-limited African production context, addressing a longstanding knowledge gap.
- The provision of a scalable, data-driven approach that supports improved orchard-level management and long-term production planning in major avocado-growing regions.
2. Materials and Methods
2.1. Study Area
2.2. Avocado Phenology, Historical Yield, and Alternate Bearing
- Years with yield greater than or equal to the median were labeled as “on” year.
- Years with yield less than the median were labeled as “off” year.
2.3. Sentinel 2 Data Acquisition and Spectral Indices
2.3.1. Vegetation and Flowering Indices for Bearing Status Classification
2.3.2. Savitzky–Golay Smoothing
2.4. Climate Data Acquisition
2.5. Model Development
2.5.1. Multi-Source Data Fusion
2.5.2. Feature Engineering of Vegetation and Flowering Indices as Well as Climate Variables
- Peak Bloom Stage (August–September)—Maximum values of FIs and minimum values of VIs were extracted, corresponding to the stage of highest flower intensity and lowest vegetative dominance in the study area [29].
- Early Fruit Drop (7–8 weeks after peak flowering)—Minimum values of FIs and maximum values of VIs were computed, reflecting the period when abscission processes are most pronounced and vegetative recovery is underway.
- Temporal Gradient—The rate of change between the two above stages was calculated to capture sharp declines in FIs or distinct peaks in VIs, serving as strong indicators of “on” or “off” years.
2.5.3. Machine Learning Model Algorithms
- Random Forest (RF): RF is an ensemble classifier that constructs multiple decision trees through bootstrap aggregation [63]. Predictions are derived via majority voting across trees, providing resilience against overfitting and robustness in handling noisy, multicollinear datasets. For this study, the number of trees (n_estimators), maximum tree depth, and minimum samples per split were optimized using cross-validation.
- Extreme Gradient Boosting (XGBoost): XGBoost implements gradient boosting with enhanced computational efficiency and regularization [42]. It builds trees sequentially, where each subsequent tree reduces the residual errors of the ensemble. Critical hyperparameters included learning rate, maximum tree depth, subsample fraction, and number of boosting iterations.
- Categorical Boosting (CatBoost): CatBoost extends gradient boosting by incorporating ordered boosting to mitigate overfitting and reduce prediction shift [43]. While originally designed for categorical feature handling, in this study it was applied exclusively to continuous predictors. Hyperparameters such as learning rate, tree depth, and number of iterations were tuned using grid search.
- Light Gradient Boosting Machine (LightGBM): LightGBM employs histogram-based feature binning and a leaf-wise growth strategy with depth constraints [44]. These optimizations accelerate training while reducing memory usage. Tuning parameters included number of leaves, maximum depth, feature fraction, and learning rate.
- Tabular Prior-Data Fitted Network (TabPFN): TabPFN is a transformer-based neural network trained on millions of synthetic datasets, approximating Bayesian inference for tabular data classification [45]. Unlike conventional algorithms, TabPFN requires minimal parameter adjustment and leverages prior knowledge to achieve strong generalization. In this study, the pretrained TabPFN model was directly applied without additional tuning. The core architecture consists of a multi-layer transformer encoder with self-attention mechanisms that enable the model to infer complex interactions between tabular features. TabPFN is trained using a prior-data-fitted strategy, where the network learns to approximate Bayesian posterior predictions from a very large corpus of synthetically generated classification tasks. This meta-training paradigm equips the network with strong inductive biases for small-to-medium tabular datasets, reducing the need for dataset-specific optimization. TabPFN is particularly well suited for the AB classification problem because the dataset contains heterogeneous spectral, climatic, and phenological predictors with potentially nonlinear interactions, and the model’s attention-based architecture can efficiently capture these relationships. Furthermore, its Bayesian-like inference enables robust generalization even under limited sample conditions, which is advantageous for orchard-level agricultural studies.
2.5.4. Training and Validation Strategy
Leave-One-Year-Out (LOYO) Cross-Validation
Hyperparameter Tuning of Machine Learning Models
2.5.5. Model Evaluation Metrics
- Accuracy: Accuracy measures the overall correctness of the model, defined as the ratio of correctly predicted observations to the total number of observations:
- 2.
- Precision: Precision quantifies the proportion of positive predictions that are actually correct. It is especially important when the cost of false positives is high.
- 3.
- Recall (Sensitivity or True Positive Rate): Recall indicates the proportion of actual positive cases that were correctly identified by the model:
- 4.
- F1-Score: The F1-score is the harmonic mean of precision and recall and is a balanced metric for evaluating classification performance when classes are imbalanced:
- 5.
- Matthews Correlation Coefficient (MCC): The Matthews Correlation Coefficient (MCC) is a comprehensive statistical metric that evaluates the quality of binary classifications by considering true and false positives and negatives. It is defined as follows:
- 6.
- Receiver Operating Characteristic (ROC) Curve and Area Under the Curve (AUC): The ROC curve plots the true positive rate (recall) against the false positive rate across various threshold settings. The AUC quantifies the model’s ability to distinguish between classes:
- An AUC of 1.0 indicates perfect classification.
- An AUC of 0.5 suggests no discriminative power.
2.5.6. Model Interpretation
2.5.7. Computational Environment
3. Results
3.1. Temporal Dynamics of Vegetation and Flowering Indices
3.2. Climate Variables and Their Influence
3.3. Model Performance for Alternate Bearing Classification
3.4. Temporal Stabiligy of Models
3.5. Confusion Matrix Analysis
3.6. Feature Importance and Variable Contribution
3.7. Block-Level Alternate Bearing Map
4. Discussion
4.1. Phenological Drivers and Physiological Basis of Alternate Bearing
4.2. Behavior of Flowering and Vegetation Indices Across Seasons
4.3. Spectral Indicators of Canopy Physiology and Their Role in AB
4.4. Climatic Controls on Flowering and Yield Formation
4.5. Machine Learning Model Performance and Interpretations
4.6. Practical Implications and Operational Relevance
4.7. Synthesis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- FAOSTAT. Food and Agriculture Organization of the United Nations: Crops and Livestock Products. Available online: https://www.fao.org/faostat/en/ (accessed on 12 July 2025).
- Schwartz, M.; Maldonado, Y.; Luchsinger, L.; Lizana, L.A.; Kern, W. Competitive Peruvian and Chilean avocado export profile. Acta Hortic. 2018, 1194, 1079–1084. [Google Scholar] [CrossRef]
- World’s TopExport. Avocado Exports by Country. Available online: https://www.worldstopexports.com/avocados-exports-by-country/ (accessed on 25 July 2025).
- Kephe, P.N.; Siewe, L.C.; Lekalakala, R.G.; Kwabena Ayisi, K.; Petja, B.M. Optimizing smallholder farmers’ productivity through crop selection, targeting and prioritization framework in the Limpopo and Free State provinces, South Africa. Front. Sustain. Food Syst. 2022, 6, 738267. [Google Scholar] [CrossRef]
- Zwane, S.; Ferrer, S.R. Competitiveness analysis of the South African avocado industry. Agrekon 2024, 63, 277–302. [Google Scholar] [CrossRef]
- Wolstenholme, B.N. Alternate bearing in Avocado: An Overview. 2010. Available online: http://www.avocadosource.com/papers/southafrica_papers/wolstenholmenigel2010.pdf (accessed on 5 August 2025).
- Lovatt, C.; Zheng, Y.; Khuong, T.; Campisi-Pinto, S.; Crowley, D.; Rolshausen, P. Yield characteristics of ‘Hass’ avocado trees under California growing conditions. In Proceedings of the VIII World Avocado Congress, Lima, Peru, 13–18 September 2015; pp. 13–18. [Google Scholar]
- Goldschmidt, E.E.; Sadka, A. Yield alternation: Horticulture, physiology, molecular biology, and evolution. Hortic. Rev. 2021, 48, 363–418. [Google Scholar]
- Smith, H.M.; Samach, A. Constraints to obtaining consistent annual yields in perennial tree crops. I: Heavy fruit load dominates over vegetative growth. Plant Sci. 2013, 207, 158–167. [Google Scholar] [CrossRef]
- Ali, H.; Abbas, A.; Rehman, A. Alternate bearing in fruit plants. Biol. Agric. Sci. Res. J. 2022, 2. [Google Scholar] [CrossRef]
- Jangid, R.; Kumar, A.; Masu, M.M.; Kanade, N.; Pant, D. Alternate Bearing in Fruit Crops: Causes and Control Measures. Asian J. Agric. Hortic. Res. 2023, 10, 10–19. [Google Scholar] [CrossRef]
- Iturrieta, R.A. First Things First: Matching an Alternate Bearing Model to Confirmed Field Phenotypes of Avocado (Persea americana, Mill.). Ph.D. Thesis, University of California, Riverside, CA, USA, 2017. [Google Scholar]
- Lovatt, C. Eliminating alternate bearing of the ‘Hass’ avocado. In Proceedings of the California Avocado Research Symposium, Riverside, CA, USA, 30 October 2004; pp. 127–142. [Google Scholar]
- Liakos, K.G.; Busato, P.; Moshou, D.; Pearson, S.; Bochtis, D. Machine learning in agriculture: A review. Sensors 2018, 18, 2674. [Google Scholar] [CrossRef]
- Kamilaris, A.; Prenafeta-Boldú, F.X. Deep learning in agriculture: A survey. Comput. Electron. Agric. 2018, 147, 70–90. [Google Scholar] [CrossRef]
- Robson, A.; Rahman, M.M.; Muir, J. Using Worldview Satellite Imagery to Map Yield in Avocado (Persea americana): A Case Study in Bundaberg, Australia. Remote Sens. 2017, 9, 1223. [Google Scholar] [CrossRef]
- Rahman, M.M.; Robson, A.; Brinkhoff, J. Potential of Time-Series Sentinel 2 Data for Monitoring Avocado Crop Phenology. Remote Sens. 2022, 14, 5942. [Google Scholar] [CrossRef]
- Drusch, M.; Del Bello, U.; Carlier, S.; Colin, O.; Fernandez, V.; Gascon, F.; Hoersch, B.; Isola, C.; Laberinti, P.; Martimort, P. Sentinel-2: ESA’s optical high-resolution mission for GMES operational services. Remote Sens. Environ. 2012, 120, 25–36. [Google Scholar] [CrossRef]
- Rouse, J.W.; Haas, R.H.; Schell, J.A.; Deering, D.W. Monitoring vegetation systems in the Great Plains with ERTS. In Proceedings of the Third Earth Resources Technology Satellite-1 Symposium, Volume I: Technical Presentations, NASA SP-351, Washington, DC, USA, 1 January 1974; pp. 309–317. [Google Scholar]
- Rahman, M.M.; Robson, A.J. A Novel Approach for Sugarcane Yield Prediction Using Landsat Time Series Imagery: A Case Study on Bundaberg Region. Adv. Remote Sens. 2016, 5, 93–102. [Google Scholar] [CrossRef]
- Jiang, Z.; Huete, A.R.; Didan, K.; Miura, T. Development of a two-band enhanced vegetation index without a blue band. Remote Sens. Environ. 2008, 112, 3833–3845. [Google Scholar] [CrossRef]
- Delegido, J.; Verrelst, J.; Alonso, L.; Moreno, J. Evaluation of sentinel-2 red-edge bands for empirical estimation of green LAI and chlorophyll content. Sensors 2011, 11, 7063–7081. [Google Scholar] [CrossRef]
- Immitzer, M.; Vuolo, F.; Atzberger, C. First experience with Sentinel-2 data for crop and tree species classifications in central Europe. Remote Sens. 2016, 8, 166. [Google Scholar] [CrossRef]
- Lin, S.; Li, J.; Liu, Q.; Li, L.; Zhao, J.; Yu, W. Evaluating the effectiveness of using vegetation indices based on red-edge reflectance from Sentinel-2 to estimate gross primary productivity. Remote Sens. 2019, 11, 1303. [Google Scholar] [CrossRef]
- Garner, L.C.; Lovatt, C.J. The relationship between flower and fruit abscission and alternate bearing of ‘Hass’ avocado. J. Am. Soc. Hortic. Sci. 2008, 133, 3–10. [Google Scholar] [CrossRef]
- Afsar, M.M.; Iqbal, M.S.; Bakhshi, A.D.; Hussain, E.; Iqbal, J. MangiSpectra: A Multivariate Phenological Analysis Framework Leveraging UAV Imagery and LSTM for Tree Health and Yield Estimation in Mango Orchards. Remote Sens. 2025, 17, 703. [Google Scholar] [CrossRef]
- Sulik, J.J.; Long, D.S. Spectral indices for yellow canola flowers. Int. J. Remote Sens. 2015, 36, 2751–2765. [Google Scholar] [CrossRef]
- Salazar-García, S.; Lord, E.M.; Lovatt, C.J. Inflorescence and flower development of the ‘Hass’ avocado (Persea americana Mill.) during “on” and “off” crop years. J. Am. Soc. Hortic. Sci. 1998, 123, 537–544. [Google Scholar] [CrossRef]
- Randela, M.Q. Climate Change and Avocado Production: A Case Study of the Limpopo Province of South Africa. Master’s Thesis, University of Pretoria, Pretoria, South Africa, 2018. [Google Scholar]
- Howden, M.; Newett, S.; Deuter, P. Climate change-risks and opportunities for the avocado industry. In Proceedings of the New Zealand and Australian Avocado Grower’s Conference, Tauranga, New Zealand, 20–22 September 2005; pp. 1–28. [Google Scholar]
- Anguiano, C.; Alcántar, R.; Toledo, B.; Tapia, L.; Vidales-Fernández, J. Soil and climate characterization of the avocado-producing area of Michoacán, Mexico. In Proceedings of the VI World Avocado Congress, Viña Del Mar, Chile, 12–16 November 2007; Available online: https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.avocadosource.com/WAC6/en/Resumen/3c-112.pdf&ved=2ahUKEwjWz-S8t6WRAxU2r1YBHeIDE04QFnoECBcQAQ&usg=AOvVaw2VnLHwGhNionq9Pmf9VRcr (accessed on 12 July 2025).
- Domínguez, A.; García-Martín, A.; Moreno, E.; González, E.; Paniagua, L.L.; Allendes, G. Identifying Optimal Zones for Avocado (Persea americana Mill) Cultivation in Iberian Peninsula: A Climate Suitability Analysis. Land 2024, 13, 1290. [Google Scholar] [CrossRef]
- Ramírez-Gil, J.G.; Henao-Rojas, J.C.; Morales-Osorio, J.G. Mitigation of the adverse effects of the El Niño (El Niño, La Niña) Southern Oscillation (ENSO) phenomenon and the most important diseases in avocado cv. Hass crops. Plants 2020, 9, 790. [Google Scholar] [CrossRef]
- Gafni, E. Effect of Extreme Temperature Regimes and Different Pollinators on the Fertilization and Fruit-Set Processes in Avocado. Master’s Thesis, Hebrew University of Jerusalem, Jerusalem, Israel, 1984. [Google Scholar]
- Acosta-Rangel, A.; Li, R.; Mauk, P.; Santiago, L.; Lovatt, C.J. Effects of temperature, soil moisture and light intensity on the temporal pattern of floral gene expression and flowering of avocado buds (Persea americana cv. Hass). Sci. Hortic. 2021, 280, 109940. [Google Scholar] [CrossRef]
- Sedgley, M.; Grant, W.J.R. Effect of low temperatures during flowering on floral cycle and pollen tube growth in nine avocado cultivars. Sci. Hortic. 1983, 18, 207–213. [Google Scholar] [CrossRef]
- Erazo-Mesa, E.; Ramírez-Gil, J.G.; Sánchez, A.E. Avocado cv. Hass Needs Water Irrigation in Tropical Precipitation Regime: Evidence from Colombia. Water 2021, 13, 1942. [Google Scholar] [CrossRef]
- Brinkhoff, J.; Robson, A.J. Block-level macadamia yield forecasting using spatio-temporal datasets. Agric. For. Meteorol. 2021, 303, 108369. [Google Scholar] [CrossRef]
- Torgbor, B.A.; Rahman, M.M.; Brinkhoff, J.; Sinha, P.; Robson, A. Integrating Remote Sensing and Weather Variables for Mango Yield Prediction Using a Machine Learning Approach. Remote Sens. 2023, 15, 3075. [Google Scholar] [CrossRef]
- Rahman, M.M.; Robson, A.; Bristow, M. Exploring the Potential of High Resolution WorldView-3 Imagery for Estimating Yield of Mango. Remote Sens. 2018, 10, 1866. [Google Scholar] [CrossRef]
- Jeong, J.H.; Resop, J.P.; Mueller, N.D.; Fleisher, D.H.; Yun, K.; Butler, E.E.; Timlin, D.J.; Shim, K.-M.; Gerber, J.S.; Reddy, V.R.; et al. Random Forests for Global and Regional Crop Yield Predictions. PLoS ONE 2016, 11, e0156571. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Dorogush, A.V.; Ershov, V.; Gulin, A. CatBoost: Gradient boosting with categorical features support. arXiv 2018, arXiv:1810.11363. [Google Scholar] [CrossRef]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Hollmann, N.; Müller, S.G.; Eggensperger, K.; Hutter, F. TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second. In Proceedings of the 10 th International Conference on Learning Representations, (ICLR2022), Virtual, 25–29 April 2022. [Google Scholar]
- Blanco, V.; Blaya-Ros, P.J.; Castillo, C.; Soto-Vallés, F.; Torres-Sánchez, R.; Domingo, R. Potential of UAS-based remote sensing for estimating tree water status and yield in sweet cherry trees. Remote Sens. 2020, 12, 2359. [Google Scholar] [CrossRef]
- Lazare, S.; Zipori, I.; Cohen, Y.; Haberman, A.; Goldshtein, E.; Ron, Y.; Rotschild, R.; Dag, A. Jojoba pruning: New practices to rejuvenate the plant, improve yield and reduce alternate bearing. Sci. Hortic. 2021, 277, 109793. [Google Scholar] [CrossRef]
- Bernardes, T.; Moreira, M.A.; Adami, M.; Rudorff, B.F.T. Monitoring biennial bearing effect on coffee yield using modis remote sensing imagery. Remote Sens. 2012, 4, 2492–2509. [Google Scholar] [CrossRef]
- Myeni, L.; Mahleba, N.; Mazibuko, S.; Moeletsi, M.E.; Ayisi, K.; Tsubo, M. Accessibility and utilization of climate information services for decision-making in smallholder farming: Insights from Limpopo Province, South Africa. Environ. Dev. 2024, 51, 101020. [Google Scholar] [CrossRef]
- Bunce, B. Municipal case study: Greater Tzaneen Local Municipality, Limpopo. In GTAC/CBPEP/EU Project on Employment-Intensive Rural Land Reform in South Africa: Policies, Programmes and Capacities; GTAC, 2020; Available online: https://uwcscholar.uwc.ac.za/items/32320e09-800c-4269-b10e-8ec60f2295e8 (accessed on 12 July 2025).
- Kotze, J. Phases of seasonal growth of the avocado tree. Res. Rep. S. Afr. Avocado Grow. Assoc. 1979, 3, 14–16. [Google Scholar]
- Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
- Gitelson, A.A.; Kaufman, Y.J.; Merzlyak, M.N. Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sens. Environ. 1996, 58, 289–298. [Google Scholar] [CrossRef]
- Barnes, E.; Clarke, T.; Richards, S.; Colaizzi, P.; Haberland, J.; Kostrzewski, M.; Waller, P.; Choi, C.; Riley, E.; Thompson, T.; et al. Coincident detection of crop water stress, nitrogen status and canopy density using ground-based multispectral data. In Proceedings of the Fifth International Conference on Precision Agriculture, Madison, WI, USA, 16–19 July 2000; pp. 16–19. [Google Scholar]
- Gitelson, A.A.; Gritz, Y.; Merzlyak, M.N. Relationships between leaf chlorophyll content and spectral reflectance and algorithms for non-destructive chlorophyll assessment in higher plant leaves. J. Plant Physiol. 2003, 160, 271–282. [Google Scholar] [CrossRef]
- Gao, B.C. NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sens. Environ. 1996, 58, 257–266. [Google Scholar] [CrossRef]
- Fernando, H.; Ha, T.; Attanayake, A.; Benaragama, D.; Nketia, K.A.; Kanmi-Obembe, O.; Shirtliffe, S.J. High-Resolution Flowering Index for Canola Yield Modelling. Remote Sens. 2022, 14, 4464. [Google Scholar] [CrossRef]
- Savitzky, A.; Golay, M.J. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar] [CrossRef]
- Abatzoglou, J.T.; Dobrowski, S.Z.; Parks, S.A.; Hegewisch, K.C. TerraClimate, a high-resolution global dataset of monthly climate and climatic water balance from 1958–2015. Sci. Data 2018, 5, 170191. [Google Scholar] [CrossRef]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
- Li, J.; Zhu, Q.; Wu, Q.; Fan, Z. A novel oversampling technique for class-imbalanced learning based on SMOTE and natural neighbors. Inf. Sci. 2021, 565, 438–455. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.e.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. the Journal of machine Learning research. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Sokolova, M.; Lapalme, G. A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 2009, 45, 427–437. [Google Scholar] [CrossRef]
- Powers, D.M.W. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv 2011, arXiv:2010.16061. [Google Scholar]
- Saito, T.; Rehmsmeier, M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE 2015, 10, e0118432. [Google Scholar] [CrossRef]
- Chicco, D.; Jurman, G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genom. 2020, 21, 6. [Google Scholar] [CrossRef]
- Bradley, A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997, 30, 1145–1159. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. arXiv 2017, arXiv:1705.07874. [Google Scholar] [CrossRef]
- Monselise, S.P.; Goldschmidt, E.E. Alternate bearing in fruit trees. Hortic. Rev. 1982, 4, 128–173. [Google Scholar]
- Whiley, A.W. Crop management. In The Avocado Botany, Production and Uses, 1st ed.; Whiley, A.W., Schaffer, B., Wolstenholme, B.N., Eds.; CABI Publishing: Wallingford, UK, 2002; Volume 1, pp. 231–258. [Google Scholar] [CrossRef]
- Whiley, A.W.; Rasmussen, T.S.; Saranah, J.B.; Wolstenholme, B.N. Delayed harvest effects on yield, fruit size and starch cycling in avocado (Persea americana Mill.) in subtropical environments. I. the early-maturing cv. Fuerte. Sci. Hortic. 1996, 66, 23–34. [Google Scholar] [CrossRef]
- Silber, A.; Naor, A.; Cohen, H.; Bar-Noy, Y.; Yechieli, N.; Levi, M.; Noy, M.; Peres, M.; Duari, D.; Narkis, K.; et al. Irrigation of ‘Hass’ avocado: Effects of constant vs. temporary water stress. Irrig. Sci. 2019, 37, 451–460. [Google Scholar] [CrossRef]
- Sommaruga, R.; Eldridge, H.M. Avocado Production: Water Footprint and Socio-economic Implications. EuroChoices 2021, 20, 48–53. [Google Scholar] [CrossRef]
- Lavee, S. Biennial bearing in olive (Olea europaea). Ann. Ser. Hist. Nat. 2007, 17, 101–112. [Google Scholar]
- Goldschmidt, E.E. Fifty Years of Citrus Developmental Research: A Perspective. HortScience 2013, 48, 820–824. [Google Scholar] [CrossRef]


















| Index | Description | Sentinel 2 Formula | Purpose | References |
|---|---|---|---|---|
| NDVI | Normalized difference vegetation index | Canopy vigor and biomass | [19] | |
| GNDVI | Green normalized difference vegetation index | Canopy vigor and biomass | [53] | |
| NDRE | Normalized difference red edge index | Chlorophyll content and photosynthetic activity | [54] | |
| CIG | Chlorophyll Index Green | Canopy chlorophyll content | [55] | |
| CIRE | Chlorophyll Index Red Edge | Canopy chlorophyll content | [55] | |
| EVI2 | Enhance Vegetation Index 2 | High biomass minimizing soil and atmosphere influences | [21] | |
| LSWI | Land Surface Water Index | Water content in vegetation | [56] | |
| WYI | Weighted yellowness index | Flowering detection (yellow reflectance) | [26] | |
| NDYI | Normalized Difference Yellowness Index | Flower pigment contrast | [57] | |
| MTYI | Mango tree yellowness index | Tree flowering index | [26] |
| Model | Parameter | Value |
|---|---|---|
| Random Forest (RF) | n_estimators | 500 |
| max_depth | 4 | |
| min_samples_split | 20 | |
| XGBoost | n_estimators | 100 |
| learning_rate | 0.1 | |
| max_depth | 4 | |
| CatBoost | iterations | 600 |
| learning_rate | 0.05 | |
| depth | 6 | |
| LightGBM | n_estimators | 200 |
| learning_rate | 0.05 | |
| max_depth | 6 | |
| TabPFN | configuration | Default pretrained model (no tuning) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Rahman, M.M.; Robson, A.; Bekker, T. Machine Learning Approaches for Assessing Avocado Alternate Bearing Using Sentinel-2 and Climate Variables—A Case Study in Limpopo, South Africa. Remote Sens. 2025, 17, 3935. https://doi.org/10.3390/rs17243935
Rahman MM, Robson A, Bekker T. Machine Learning Approaches for Assessing Avocado Alternate Bearing Using Sentinel-2 and Climate Variables—A Case Study in Limpopo, South Africa. Remote Sensing. 2025; 17(24):3935. https://doi.org/10.3390/rs17243935
Chicago/Turabian StyleRahman, Muhammad Moshiur, Andrew Robson, and Theo Bekker. 2025. "Machine Learning Approaches for Assessing Avocado Alternate Bearing Using Sentinel-2 and Climate Variables—A Case Study in Limpopo, South Africa" Remote Sensing 17, no. 24: 3935. https://doi.org/10.3390/rs17243935
APA StyleRahman, M. M., Robson, A., & Bekker, T. (2025). Machine Learning Approaches for Assessing Avocado Alternate Bearing Using Sentinel-2 and Climate Variables—A Case Study in Limpopo, South Africa. Remote Sensing, 17(24), 3935. https://doi.org/10.3390/rs17243935

