Article

Estimation and Classification of Coffee Plant Water Potential Using Spectral Reflectance and Machine Learning Techniques

by
Deyvis Cabrini Teixeira Delfino
1,
Danton Diego Ferreira
2,*,
Margarete Marin Lordelo Volpato
3,
Vânia Aparecida Silva
3,
Renan Teixeira Delfino
4,
Christiano Sousa Machado de Matos
3 and
Meline de Oliveira Santos
3
1
Department of Electrical Engineering, Federal University of Juiz de Fora, Juiz de Fora 36036-900, MG, Brazil
2
Department of Automatics, Federal University of Lavras, Lavras 37203-202, MG, Brazil
3
Agricultural Research Company of Minas Gerais (EPAMIG), Lavras 37203-202, MG, Brazil
4
Department of Water Resources, Federal University of Lavras, Lavras 37203-202, MG, Brazil
*
Author to whom correspondence should be addressed.
Biophysica 2025, 5(4), 60; https://doi.org/10.3390/biophysica5040060
Submission received: 29 August 2025 / Revised: 25 November 2025 / Accepted: 26 November 2025 / Published: 4 December 2025

Abstract

Water potential is an important indicator used to study water relations in plants, as it reflects the level of hydration in their tissues. Different numerical variables describe plant properties and can be acquired from leaf reflectance. The objective of this study was to estimate water potential in coffee plants using spectral variables. To this end, a range of wavelengths that provided analytical flexibility was used, and machine learning techniques were then employed to build data-driven models. The dataset contains spectral characteristics (wavelength reflectance) of coffee plants, collected with the CI-710 Mini-Leaf Spectrometer, as well as the water potential of each plant, measured with a Scholander pressure chamber. The dataset was divided into two crop management groups: irrigated and rainfed. Four machine learning techniques were implemented: Multi-Layer Perceptron (MLP), Decision Tree, Random Forest, and K-Nearest Neighbor (KNN). The implementation of machine learning techniques followed two distinct strategies: regression and classification. The results indicate that the decision tree-based model demonstrated superior performance under irrigated conditions for regression tasks. In contrast, the KNN technique achieved the best performance for classification. Under rainfed conditions, the MLP model outperformed the other techniques for regression, while the Random Forest method exhibited the highest accuracy in classification tasks. While no hardware prototype was developed, the machine learning-based methods presented here suggest a possible pathway toward future intelligent, user-friendly, and accessible sensing technologies for coffee plantations.

1. Introduction

Ranked fifth among the most exported plant-based products by Brazil, coffee is one of the commodities that has significantly contributed to the expansion of agribusiness exports in 2023, according to a report by the Ministry of Agriculture, Livestock, and Supply (MAPA).
According to the United States Department of Agriculture (USDA), Brazil is the largest producer of Arabica coffee (Coffea arabica) and ranks second in Robusta coffee (Coffea canephora) production, popularly known as Conilon variety, behind only Vietnam. Globally, Brazil occupies the first position in the ranking, considering both types (Arabica and Canephora), thus becoming the largest coffee producer in the world (United States Department of Agriculture Foreign Agricultural Service June 2023 Coffee: World Markets and Trade (https://fas.usda.gov/data/coffee-world-markets-and-trade-06222023), accessed on 22 September 2025).
In an increasingly demanding consumer market, in addition to the volume of sacks produced, the quality of coffee beans is crucial to secure a larger market share. One way to ensure the high quality of the product is through understanding the plant’s water relations, aiming to keep it consistently hydrated, thus ensuring a final product of high quality.
With the growing demand in both national and international consumer markets, maintaining production requires ensuring adequate hydration of the plant, which is essential for delivering a high-quality final product. The traditional way to measure plant hydration is through a pressure chamber, known as a Scholander Chamber, in which the water potential is determined from leaf samples subjected to different pressure levels. However, this measurement method is time-consuming, must be performed at a specific time (between 4:00 and 5:00 a.m.), requires specialized labor, is destructive, and may pose a risk to the operator. Due to these limitations, alternative methods for indirectly measuring plant water conditions have been proposed, based on spectral signatures [1]. Although water potential is a physiologically robust indicator of plant hydration, it remains impractical and laborious for farmers, as measurements require pre-dawn sampling, destructive leaf excision, and specialized pressure-chamber equipment operated under strict safety conditions. For this reason, more accessible proxies are commonly preferred in applied or commercial settings. Importantly, however, a variety of hydration indices are physically interrelated with Ψ, meaning that alternative metrics often reflect the same underlying physiological state. Indicators such as relative water content (RWC), gravimetric water content (WC) or mass loss, dielectric or impedance-based measurements, and low-field NMR parameters represent complementary facets of plant hydration. These measures are inherently connected: pressure-volume relations link RWC directly with Ψ_W, while progressive water loss simultaneously reduces water content and alters dielectric behavior and LF-NMR signatures.
Consequently, a monotonic correspondence between Ψ_W and these alternative indicators is generally expected, even though each quantifies a different dimension of water status (amount vs. potential vs. mobility) [2].
Spectral signature analysis can provide various information regarding different aspects related to plant health [3,4]. These aspects are studied by experts in the field to ensure the relevance of the information. Thus, certain reflectances in the spectral signature have a relationship with the plant’s water status, which may be linear or nonlinear to varying degrees, depending on the wavelength of the spectral signature. Thus, it is expected that artificial intelligence-based models may be used in an attempt to estimate plant characteristics indirectly.
In the study reported in [5], the aim was to determine the effect of water stress on maize (Zea mays L.) using spectral indices, chlorophyll readings, and consequently, evaluate reflectance spectra. Similarly, in the study of [6], coffee plants in irrigated and non-irrigated crops had their spectral indices measured and used to determine the water conditions of the coffee plants applying artificial neural networks and decision tree algorithms, obtaining simple and efficient predictive models. Unlike these, ref. [7] used geostatistical analysis (via semivariograms) to investigate the spatial dependence of the leaf water potential. The study reported in [8] evaluated the relationship between water potential and coffee canopy temperature obtained by a thermal camera mounted on a remotely piloted aircraft. Results showed that a decrease in WP was associated with stomatal closure and reduced stomatal conductance, leading to an increase in canopy temperature due to water deficit.
Current studies estimating coffee plant water potential face several limitations, including restricted spectral resolution of low-cost sensors, sensitivity to environmental conditions, limited accuracy at the leaf scale, coarse and infrequent satellite observations, and poor model generalization across regions. These challenges indicate that water potential estimation in coffee remains an open research area, encouraging the development of more robust and scalable approaches.
In order to explore a different approach from the works of [5,6], the current study did not address spectral indices. Spectral indices, despite their widespread use, have limitations that can affect the accuracy of water potential estimation. The primary limitation lies in their reductionist nature, as they condense the complexity of the reflectance spectrum into a single value. This simplification can obscure relevant information about the interaction of electromagnetic radiation with the leaf, especially in situations of moderate to severe water stress [9]. In addition, the indices are calculated from specific spectral bands, focusing on predetermined characteristics, which can lead to the loss of relevant information, which inevitably occurs when the captured window is restricted [10]. According to [10,11], the analysis of a larger window of the reflectance spectrum offers a more holistic and detailed view of the interactions between electromagnetic radiation and the leaf.
In contrast to previous studies based on spectral indices, this study employed full-spectrum data in combination with multiple models to predict the water potential of coffee plants, enabling a more detailed and comprehensive analysis of the interactions between electromagnetic radiation and leaf reflectance. Additionally, it explored which specific wavelength or range of wavelengths was best suited for inferring the water potential of coffee plants.
The present study addresses the implementation of four machine learning techniques to estimate and classify the water potential of coffee plants: Multi-Layer Perceptron (MLP), Decision Tree, Random Forest, and K-Nearest Neighbor (KNN). Using these techniques for regression and classification tasks is valuable due to their diverse learning mechanisms, which allow for robust performance across varying data structures and complexities [12,13]. A Multi-Layer Perceptron (MLP) is a type of artificial neural network composed of an input layer, one or more hidden layers, and an output layer, where each layer consists of interconnected neurons that use non-linear activation functions to model complex relationships in data [14]. According to the Universal Approximation Theorem, an MLP can approximate any continuous function to an arbitrary degree of accuracy with sufficient hidden neurons, making it highly versatile for modeling complex, non-linear relationships in data [14]. Decision trees, in turn, are tree-like structures composed of interconnected nodes: each internal node tests an input attribute against a decision threshold and determines the next descendant node [15]. They are computationally simple in the operating phase and more interpretable than neural networks, which are often regarded as black-box models. Random Forest is an ensemble technique widely recognized in the literature for its ability to increase model complexity by incorporating new data while maintaining strong generalization performance. Ensemble methods consist of a collection of classifiers; in the case of Random Forest, it utilizes a set of decision trees that determine the final prediction through a majority voting process [16].
Finally, the K-Nearest Neighbors (KNN) is a simple, instance-based learning algorithm that classifies data points based on the majority label of their nearest neighbors, offering advantages such as ease of implementation, flexibility, and effectiveness in handling non-linear data distributions. As a regressor, KNN predicts continuous values by averaging the outcomes of its nearest neighbors, offering advantages such as simplicity, non-parametric nature, and the ability to model complex, non-linear relationships without requiring explicit assumptions about the data [17].
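The neighbor-averaging behavior of a KNN regressor described above can be sketched in a few lines. This is an illustrative toy example (the study itself used MATLAB implementations), not the authors' code:

```python
import numpy as np

def knn_regress(X_train, y_train, x_query, k=3):
    """Predict a continuous target as the mean of the k nearest
    neighbors under Euclidean distance (minimal illustration)."""
    dists = np.linalg.norm(X_train - x_query, axis=1)
    nearest = np.argsort(dists)[:k]  # indices of the k closest samples
    return y_train[nearest].mean()

# Toy 1-D example: target is roughly 2*x
X = np.array([[1.0], [2.0], [3.0], [4.0], [5.0]])
y = np.array([2.1, 3.9, 6.2, 8.0, 9.9])
pred = knn_regress(X, y, np.array([3.1]), k=3)
print(pred)  # mean of the targets at x = 2, 3, 4
```

Because the prediction is a local average, no explicit training phase or parametric assumption about the data is needed, which is the property highlighted in the text.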
The remainder of the paper is organized as follows. The next section presents the methodology employed, describing the database used and the steps taken to design the proposed models. Section 3 presents the results and discussion. Finally, Section 4 draws conclusions and gives directions for future work.

2. Materials and Methods

This section presents the database details, the data pre-processing, the design of the models and the metrics used for performance evaluation.

2.1. Database

The data were collected in an experiment set up on a private rural property in the municipality of Diamantina, located in the northern region of the state of Minas Gerais, Brazil. The experimental design was a randomized complete block with split-strip plots, consisting of 10 Coffea arabica genotypes provided by the coffee breeding program of EPAMIG (Empresa de Pesquisa Agropecuária de Minas Gerais), two irrigation systems, and four blocks. Each experimental plot comprised six plants, and the experiment was conducted at a spacing of 3.7 m × 0.6 m. Two types of management of coffee plants were considered: irrigated and rainfed.
The irrigation system was gravity-driven drip, with drippers delivering 2.3 L h⁻¹ and spaced every 75 cm. The drip lines were supplied by a 75 mm PVC pipe, and water was sourced from a local reservoir. The emitter flow rate was 2.30 L h⁻¹ m⁻² (equivalent to 2.3 mm h⁻¹), and irrigation was applied for approximately two hours per day. Irrigation was managed to replace crop evapotranspiration, as measured by a local automatic weather station.
The experiment, including both irrigated and rainfed systems, was established in March 2014, when the plants were 24 months old. In the irrigated system, plants continued to receive water through the drip system. In contrast, under rainfed management, the irrigation hoses were disconnected in the corresponding plots to impose water deficit, and coffee plants relied exclusively on natural precipitation for hydration, with no artificial irrigation applied. Fertilization in the irrigated treatments was performed via fertigation, whereas in the rainfed treatments, fertilization was carried out conventionally by surface application beneath the coffee canopy, following recommended guidelines.
To capture the effect of seasonal climate variations, data were collected when the plants were 25 months old (April 2014), 26 months (May 2014), 30 months (September 2014), 34 months (January 2015), 36 months (March 2015), 39 months (June 2015), 41 months (August 2015), 42 months (September 2015), 44 months (November 2015), 48 months (March 2016), 53 months (August 2016), and 54 months (September 2016). The database was provided by the field research team of EPAMIG and presents spectral characteristics and the corresponding water potential of each coffee plant. Leaf samples were collected from the same marked plants across all sampling periods, with each genotype corresponding to a single, permanently identified plant. Repeated measurements therefore represent longitudinal data from the same genotypic individual, ensuring that temporal variations reflect physiological or environmental effects rather than genotypic differences. Each genotype (i.e., marked plant) was treated as a biological replicate in the statistical analyses to account for the repeated-measures design. Leaves from the third or fourth pair of plagiotropic branches in the middle third of the coffee plant, on the sun-exposed side facing east, were collected between 3:00 and 5:00 a.m.
Pre-dawn water potential (Ψ_W) measurements were made using a Scholander-type pressure chamber (PMS Instruments Plant Moisture Model 1000, PMS Instrument Company, Albany, OR, USA). The leaves collected in the field were inserted into the chamber using an appropriately sized gasket, and pressure was then applied until exudation occurred from the cut leaf petiole.
Leaf reflectance spectra were measured using a CI-710 miniaturized leaf spectrometer (CID Bio-Science, Camas, WA, USA), which illuminates the adaxial surface of the coffee leaf with blue LED light and an incandescent lamp, providing output across the visible to near-infrared range (400–1000 nm). Spectral data were analyzed using SpectraSnap software (version 1.1.3.150, CID Bio-Science, Camas, WA, USA) with an integration time of 900 ms. Measurements were conducted following preliminary calibration of the device according to the manufacturer’s instructions.
The database was composed of 437 samples of irrigated coffee and 445 of rainfed. Each sample consisted of 2863 attributes, which were defined by the collection date, genotype/cultivar number, the repetition for the corresponding genotype/cultivar, and the reflectance sequence corresponding to the wavelength range from 400 to 950 nm. Although the device is nominally specified to operate within the 400–1000 nm range, its spectral response exhibits decreased precision and increased noise beyond approximately 950 nm. Consequently, the analyses were restricted to the 400–950 nm region to ensure data quality and reproducibility. Furthermore, each sample from both databases had the corresponding water potential (Ψ_W), measured with a Scholander Pressure Chamber.

2.2. Pre-Processing

For the preprocessing stage, we adopted the median filtering method developed by [18], which aims to smooth impulsive noise in digital signals and images [19]. The median filter operates over a window of N samples, which are arranged in ascending order; the median is the value located precisely in the middle of the sorted window, and the filter replaces the “problematic” value with this median. In this work, a fourth-order median filter was implemented.
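A minimal sliding-window median filter can be sketched as below. This is an illustrative Python version (the study used MATLAB), and the edge handling shown here (clipping the window to the signal bounds) is one common convention, not necessarily the one used by the authors:

```python
import numpy as np

def median_filter(signal, order=4):
    """Sliding-window median filter. `order` controls the window size;
    near the edges the window is clipped to the signal bounds."""
    out = np.empty_like(signal, dtype=float)
    half = order // 2
    for i in range(len(signal)):
        lo, hi = max(0, i - half), min(len(signal), i + half + 1)
        out[i] = np.median(signal[lo:hi])  # replace sample with window median
    return out

# An impulsive spike (9.0) is smoothed while the slow trend is preserved
x = np.array([1.0, 1.1, 9.0, 1.2, 1.3, 1.4])
filtered = median_filter(x, order=4)
print(filtered)
```

The spike at index 2 is replaced by the median of its neighborhood, which is exactly the impulsive-noise suppression property exploited here for reflectance spectra.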
Subsequently, the dataset was normalized using a scaling method to the range [0, 1], according to Equation (1).
P_n = (P − P_min) / (P_max − P_min),
where P_n corresponds to the normalized value, and P, P_min, and P_max represent the original, minimum, and maximum values, respectively [20].
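Applied column-wise to a feature matrix, the scaling of Equation (1) can be written as (illustrative sketch, with made-up reflectance values):

```python
import numpy as np

def minmax_scale(p):
    """Scale each feature (column) to [0, 1] following Equation (1):
    P_n = (P - P_min) / (P_max - P_min)."""
    p = np.asarray(p, dtype=float)
    p_min, p_max = p.min(axis=0), p.max(axis=0)
    return (p - p_min) / (p_max - p_min)

# Hypothetical reflectance values for two spectral bands
refl = np.array([[0.12, 0.40],
                 [0.30, 0.55],
                 [0.48, 0.70]])
scaled = minmax_scale(refl)
print(scaled)  # each column now spans exactly [0, 1]
```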
An important aspect of preprocessing in pattern recognition and regression methods is feature selection. For this stage, Pearson’s correlation coefficient was used, which measures the degree of linear correlation between two variables. This coefficient, typically represented by ρ, takes values only between −1 and 1. Table 1 displays the interpretation of the Pearson coefficient (ρ) values [21,22].
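Correlation-based filtering of this kind can be sketched as follows. The 0.30 cutoff mirrors the irrigated-condition threshold reported later in the paper; the data here are synthetic, and thresholding on the absolute value of ρ is an assumption of this sketch:

```python
import numpy as np

def select_by_pearson(X, y, threshold=0.30):
    """Keep feature columns whose absolute Pearson correlation with
    the target variable meets the threshold."""
    rho = np.array([np.corrcoef(X[:, j], y)[0, 1] for j in range(X.shape[1])])
    keep = np.abs(rho) >= threshold
    return X[:, keep], np.flatnonzero(keep), rho

rng = np.random.default_rng(0)
y = rng.normal(size=200)                               # synthetic target
X = np.column_stack([y + 0.5 * rng.normal(size=200),   # correlated feature
                     rng.normal(size=200)])            # pure-noise feature
_, kept, rho = select_by_pearson(X, y, threshold=0.30)
print(kept)  # only the correlated column survives
```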

2.3. Model Design

Following data normalization and the selection of the most relevant features, four machine learning techniques—Multi-Layer Perceptron (MLP), Decision Tree, Random Forest, and K-Nearest Neighbor (KNN)—were implemented to estimate the water potential of coffee plants. Additionally, the regression problem was converted into a classification task by segmenting the water potential values into discrete classes. The classes were defined according to the work of [23] and are shown in Table 2.
The datasets corresponding to irrigated and rainfed management systems differ due to varying levels of water stress. The rainfed management dataset exhibits water potential values ranging from −0.25 MPa to −6.60 MPa, covering all values listed in Table 2. Consequently, when classes are assigned to this dataset, the samples under rainfed conditions are distributed across five distinct classes. In contrast, the irrigated management dataset, characterized by lower water stress, shows water potential values ranging from −0.20 MPa to −2.40 MPa, covering only the first three classes.
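The conversion of continuous Ψ_W values into discrete classes amounts to simple binning. The cut points below are illustrative placeholders only; the actual class boundaries are those of Table 2 (from [23]) and are not reproduced here:

```python
import numpy as np

# ILLUSTRATIVE placeholder cut points (MPa magnitude) -- NOT the actual
# Table 2 thresholds from [23].
edges = np.array([0.5, 1.5, 2.5, 4.0])

def water_potential_class(psi_mpa):
    """Map a (negative) water potential value to a class label 1..5,
    where class 1 is least stressed and class 5 most stressed."""
    return int(np.searchsorted(edges, abs(psi_mpa), side="right")) + 1

# Range endpoints from the two datasets fall into classes 1, 3, and 5
classes = [water_potential_class(p) for p in (-0.25, -2.40, -6.60)]
print(classes)
```

With these placeholder edges, the irrigated range (−0.20 to −2.40 MPa) spans only the first three classes while the rainfed range (−0.25 to −6.60 MPa) reaches class 5, matching the class coverage described above.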
The number of samples per class for both datasets (rainfed and irrigated conditions) is presented in Table 3. Note that both datasets have imbalanced classes, which makes the classification problem more complex. Imbalanced classes pose challenges for pattern recognition by biasing models towards the majority class, often leading to reduced accuracy and poor performance on minority class predictions. To deal with this issue, we applied the Synthetic Minority Over-sampling Technique (SMOTE) [24] to create synthetic data; however, the results showed a decrease in classification performance with the use of SMOTE (see Appendix A). This reduction was likely due to the synthetic samples not fully capturing the spectral variability of the real data, which introduced noise and reduced the models’ generalization ability. Thus, we decided not to use SMOTE and instead performed a stratified division of the data into training and testing sets. The division was conducted using the cross-validation technique [25] with five folds: in each iteration, one fold is used for testing and the remaining four for training. The methods were implemented in MATLAB (version 2011). To optimize the classifiers and predictors, the hyperparameters were adjusted to identify the most parsimonious models. Consequently, each model was run 30 times, resulting in a total of 150 executions for each machine learning method.
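The stratified fold assignment (the role played by MATLAB's cvpartition in the study) can be sketched in Python as follows; the per-class sample counts are the irrigated-condition values quoted later in the paper:

```python
import numpy as np

def stratified_folds(labels, k=5, seed=None):
    """Assign each sample to one of k folds so that class proportions
    are preserved across folds (stratified k-fold partitioning)."""
    rng = np.random.default_rng(seed)
    fold = np.empty(len(labels), dtype=int)
    for c in np.unique(labels):
        idx = np.flatnonzero(labels == c)
        rng.shuffle(idx)
        fold[idx] = np.arange(len(idx)) % k  # deal class members round-robin
    return fold

# Irrigated-condition class counts: 103, 167, and 80 samples
y = np.array([1] * 103 + [2] * 167 + [3] * 80)
fold = stratified_folds(y, k=5, seed=42)
for f in range(5):
    print(f, np.bincount(y[fold == f], minlength=4)[1:])  # per-class counts
```

Each of the five folds receives a nearly equal share of every class, so minority classes are represented in every training and testing split.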
Furthermore, classes 1, 4, and 5 have fewer samples compared to the others in the rainfed condition dataset (see Table 3). This may hinder the learning and generalization process of some classifiers that require more samples to converge. The KNN classifier is suitable for small datasets, since it does not require explicit training [12]. It relies on the distances between data points, which means its performance can be competitive with limited data if the feature space is well-defined. Also, decision trees are interpretable and perform well on small datasets. They can easily fit the data, even with complex relationships, without requiring large amounts of training data [12].
Figure 1 illustrates the steps of the design of the proposed approaches. Note that the preprocessing stage (data normalization and feature selection) follows the same procedure for both regression and classification approaches.

2.4. Evaluation and Performance Metrics

For performance evaluation, confusion matrices were used for the classification approaches. In addition, the balanced accuracy was utilized. It is a performance evaluation metric particularly useful when the classes in a classification problem are imbalanced. Equation (2) shows how to calculate the balanced accuracy.
BA = (1/N) · Σ_{i=1}^{N} [ TP_i / (TP_i + FN_i) ],
where N is the number of classes in the dataset, and TP_i and FN_i refer to the true positives and false negatives of class i, respectively. Balanced accuracy aims to reduce bias caused by class imbalance by averaging the individual accuracies of each class, giving equal weight to all classes regardless of their size.
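Computed from a confusion matrix, balanced accuracy is simply the mean per-class recall. A small sketch with a made-up imbalanced two-class matrix:

```python
import numpy as np

def balanced_accuracy(cm):
    """Balanced accuracy from a confusion matrix (rows = true classes):
    the mean of the per-class recalls TP_i / (TP_i + FN_i)."""
    cm = np.asarray(cm, dtype=float)
    recalls = np.diag(cm) / cm.sum(axis=1)
    return recalls.mean()

# Imbalanced toy example: plain accuracy is (90 + 5) / 110 ~ 86%,
# but balanced accuracy exposes the weak minority class.
cm = np.array([[90, 10],
               [ 5,  5]])
ba = balanced_accuracy(cm)
print(ba)  # (0.90 + 0.50) / 2 = 0.70
```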
For the regression approaches, the root mean squared error (RMSE) was used (see Equation (3)). It is a standard metric used to evaluate the accuracy of a model by measuring the differences between predicted and observed values. Since errors are squared before averaging, RMSE places greater weight on larger errors. A lower RMSE indicates better model performance, with perfect predictions yielding an RMSE of 0 [26,27].
RMSE = √( (1/n) Σ_{i=1}^{n} (y_i − ŷ_i)² ),
where ŷ_i is the estimated value of y_i (the observed value).
The coefficient of determination was also employed as a metric for the regression approaches, as shown in Equation (4). It is typically a number between 0 and 1 that measures how well a model predicts an outcome and can be understood as the percentage of data variation explained by the model. Therefore, the higher the R², the more explanatory the model is, meaning it fits the data better [28,29].
R² = 1 − [ Σ_i (y_i − ŷ_i)² ] / [ Σ_i (y_i − ȳ)² ],
where ȳ is the mean value of the observations.
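Equations (3) and (4) translate directly into code. The water potential values below are made up for illustration:

```python
import numpy as np

def rmse(y, y_hat):
    """Root mean squared error, Equation (3)."""
    y, y_hat = np.asarray(y, float), np.asarray(y_hat, float)
    return np.sqrt(np.mean((y - y_hat) ** 2))

def r_squared(y, y_hat):
    """Coefficient of determination, Equation (4)."""
    y, y_hat = np.asarray(y, float), np.asarray(y_hat, float)
    ss_res = np.sum((y - y_hat) ** 2)          # residual sum of squares
    ss_tot = np.sum((y - y.mean()) ** 2)       # total sum of squares
    return 1.0 - ss_res / ss_tot

# Hypothetical observed vs. estimated water potentials (MPa)
y_obs = np.array([-0.4, -0.9, -1.5, -2.1])
y_est = np.array([-0.5, -0.8, -1.6, -2.0])
print(rmse(y_obs, y_est), r_squared(y_obs, y_est))
```

A perfect predictor gives RMSE = 0 and R² = 1; here every estimate is off by 0.1 MPa, so RMSE = 0.1 while R² stays high because the errors are small relative to the spread of the observations.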

3. Results

The Pearson correlation coefficient ( ρ ) was calculated to assess the relationship between the attributes and the target variable (water potential). For data under irrigated conditions, a threshold of ρ = 0.30 was applied, where attributes with ρ < 0.30 were deemed irrelevant for the model. This threshold reduced the number of attributes from 2863 dimensions to 22. For data under rainfed conditions, a higher threshold of ρ = 0.47 , indicating a moderate correlation, was used. In this case, the number of attributes was reduced from 2863 dimensions to 20. The thresholds for both irrigated and rainfed conditions were determined experimentally, with the primary goal of identifying more parsimonious models. To this end, we varied the correlation threshold from ρ = 0.20 to ρ = 0.80 . The selected values ( ρ = 0.30 for irrigated and ρ = 0.47 for rainfed) corresponded to the models that achieved the best trade-off between accuracy and complexity.
Finally, the ten most relevant attributes for both conditions were selected. They are shown in Table 4 and Table 5, corresponding to the irrigated and rainfed conditions, respectively.
The results in Table 4 indicate that reflectance values near the 780 nm wavelength are the most significant under irrigated conditions. This finding is consistent with previous studies [30,31], which report that the spectral signature of vegetative targets—particularly hydrated green leaves—exhibits high reflectance in the Near-Infrared (NIR) range (700–1300 nm). In the infrared region (720–1100 nm), these interactions are primarily associated with mesophyll structure and variations in leaf water content [32,33]. This occurs because liquid water strongly absorbs solar radiation in the NIR (720–1000 nm) and Shortwave Infrared (SWIR, 1000–2500 nm) bands. In a study of spring wheat, hyperspectral data revealed that the 780 nm and 1750 nm bands were sensitive to water-related parameters, with both wavelengths showing strong positive correlations with soil water potential, soil relative moisture, and canopy water content [30].
Under rainfed conditions, however, wavelengths around 690 nm were identified as the most prominent (Table 5), highlighting them as critical bands for analysis. In the visible spectrum, wavelengths between 650 nm and 700 nm correspond to the red region of light. This observation agrees with existing literature [30,31], which reports that coffee plants grown under rainfed conditions tend to exhibit higher reflectance in this range. Healthy vegetation, by contrast, strongly absorbs light in this region due to chlorophyll a, which exhibits peak absorption at approximately 680 nm. This correlation supports the observed results, as coffee leaves under rainfed conditions often display a brownish or reddish hue.
Drought stress significantly affects several physiological parameters in more sensitive coffee genotypes, reducing relative leaf water content, net assimilation rate, stomatal conductance, and chlorophyll pigments compared to well-watered conditions [31]. Although the NIR and red bands are not directly correlated with water content, they are linked to chlorophyll concentration, green leaf biomass, and photosynthetic activity—parameters that are negatively affected by drought stress [34,35]. Consistent with our observations, in spring wheat, VIS and SWIR reflectance were reported to increase with decreasing soil water content, whereas NIR reflectance increased with higher soil water content [30].
After the execution of preprocessing and feature selection steps, the machine learning models (MLP, Decision Tree, Random Forest, and KNN) were implemented considering all selected features as input variables. After that, the classification and regression models were designed considering only the ten and five most relevant features.

3.1. Irrigated Condition

3.1.1. Results for the Regression Models

The RMSE and R 2 values were computed for all executions. For each fold (in the context of the k-fold cross validation), the best result was selected among the 30 executions. The results of mean ( μ ) and standard deviation ( σ ), considering the 5 folds of the k-fold cross validation, are displayed in Table 6, for the selected 22 features, and for the 10 and 5 most relevant features. A stratified k-fold cross-validation procedure was implemented using the cvpartition function of MATLAB with the k-fold parameter set to five, ensuring that the proportional representation of classes was maintained across all folds. This approach minimizes potential bias arising from unequal class distributions during model training and testing. However, no fixed random seed was applied, meaning that fold assignment was based on the software’s default randomization routine. Consequently, while the procedure is fully reproducible within the same computational environment and configuration, minor variations may occur if the randomization process is reinitialized.
Analyzing the results from Table 6, it can be observed that the Decision Tree achieved the best result among the three attribute options, with a mean root mean squared error (μ_RMSE) of 0.3884 ± 0.0299 and a coefficient of determination (μ_R²) of 0.6313 ± 0.0569, corresponding to the model with the 5 most relevant attributes for the regression method. The best-performing decision tree among the five folds presented a simple structure, consisting of 7 levels and a total of 30 nodes: the root node, 9 internal nodes, and 20 leaf nodes. Figure 2 illustrates the predicted values in the best-performing fold for the decision tree model compared to the ideal line (blue line); the farther an estimated point lies from this line, the greater its error. A noticeable dispersion of the data around the ideal line is observed, with the largest errors occurring for water potential values beyond −2.5. For this model, the achieved RMSE was 0.3354 and the coefficient of determination (R²) was 0.7259.
To compare the performance of the four machine learning models across the three feature sets (22, 10, and 5 features), a one-way ANOVA was applied to evaluate whether the mean prediction errors varied among the different configurations. When RMSE was used as the response metric, the ANOVA indicated a significant overall effect ( p = 0.0419 ), demonstrating that at least one model–feature combination performed differently from the others. The subsequent Tukey HSD test identified a single statistically significant contrast: the decision tree with 5 features (DT 5 features) differed from the KNN with 22 features (KNN 22 features). As illustrated in Figure 3, these two configurations represent opposite extremes in the distribution of RMSE values, while the remaining approaches show largely overlapping confidence intervals, indicating similar error magnitudes.
For the R 2 metric, the results revealed a consistent pattern. The ANOVA detected a significant global effect ( p = 0.0242 ), and the Tukey HSD test again found only one significant pairwise difference—between DT 5 features and KNN 22 features. The decision tree with 5 features achieved higher R 2 values, whereas the KNN with 22 features showed the lowest performance. The other model–feature combinations displayed similar predictive capacity, with no statistically distinguishable differences according to Tukey’s test.
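The ANOVA-plus-Tukey procedure used above can be sketched with SciPy (tukey_hsd requires SciPy ≥ 1.8). The per-fold RMSE samples below are synthetic stand-ins, not the paper's values; only the structure of the test mirrors the analysis:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
# Hypothetical per-fold RMSE values for three configurations:
# a strong model (like DT 5 features), an intermediate one, and a
# weak one (like KNN 22 features).
dt5   = rng.normal(0.39, 0.03, size=5)
mid   = rng.normal(0.42, 0.03, size=5)
knn22 = rng.normal(0.50, 0.03, size=5)

f_stat, p = stats.f_oneway(dt5, mid, knn22)   # global one-way ANOVA
print(f"ANOVA p = {p:.4f}")
res = stats.tukey_hsd(dt5, mid, knn22)        # pairwise follow-up test
print(res.pvalue[0, 2])                       # strong-vs-weak contrast
```

As in the paper, a significant global ANOVA only licenses the pairwise Tukey comparisons; here the extreme pair is the one expected to differ.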

3.1.2. Results for the Classification Models

For the classification procedure, the balanced accuracy metric was used, as it provides a more realistic assessment of the performance of Machine Learning models when dealing with imbalanced datasets. The results for the irrigated condition are displayed in Table 7 in terms of the number of features used as input for each model. The hyperparameters used for each model are presented in Appendix B. These results comprise the mean (μ) and standard deviation (σ) values of the 5 folds (k-fold cross validation) used. It is observed that the KNN classifier applied to the five most relevant features achieved the best BA values (67.73 ± 3.48). Among the five folds evaluated, the best performance was achieved for fold 5, with a balanced accuracy of nearly 73%. For this result, five neighbors and the Hamming distance were used to calculate the proximity between data points. The Inverse Function was applied for distance weighting, where the influence of neighbors on the classification of a new point is weighted inversely to their distance—closer neighbors have greater influence on the decision [36]. The confusion matrix presented in Table 8 refers to fold 5. It indicates some confusion between classes, but accuracy remains above 70% for all classes, which leads to a global accuracy of 73.56%. The class imbalance in the training dataset (103, 167, and 80 samples for classes 1, 2, and 3, respectively) adversely impacted the model’s performance, as it hindered the model’s ability to learn effectively. This imbalance led to lower performance for Class 3, which had fewer training samples available.
Table 9 presents the per-class metrics for the irrigated condition corresponding to the results shown in Table 8 (confusion matrix for the 5 most relevant features of fold 5, KNN classifier, irrigated condition). These results reveal heterogeneous performance across the three classes. Class 1 exhibits balanced behavior, with precision, recall, and F1-score all at 73.1%, indicating that the model identifies samples from this class consistently, without notable bias toward false positives or false negatives. Class 2 presents a slightly different pattern: while its recall (75.6%) is the highest among all classes, meaning the model successfully retrieves most of the samples belonging to this class, its precision is comparatively lower (70.5%). This discrepancy suggests a tendency toward false positives for Class 2, which ultimately reduces its F1-score (72.9%). Class 3 achieves the highest precision (82.4%), showing that predictions for this class are highly reliable when the model assigns that label. However, the recall for Class 3 drops to 70.0%, indicating that a non-negligible portion of true Class 3 samples are misclassified. Consequently, its F1-score (75.7%) reflects this imbalance.
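The per-class figures in Table 9 follow mechanically from the confusion matrix in Table 8. A short sketch (Python is used here for illustration; the study's computations were done in MATLAB) reproduces them:

```python
def per_class_metrics(cm):
    """Precision, recall, and F1 per class from a square confusion matrix
    (rows = true class, columns = predicted class)."""
    n = len(cm)
    out = []
    for c in range(n):
        tp = cm[c][c]
        col = sum(cm[r][c] for r in range(n))  # all predictions of class c
        row = sum(cm[c])                       # all true samples of class c
        p = tp / col if col else 0.0
        r = tp / row if row else 0.0
        f1 = 2 * p * r / (p + r) if p + r else 0.0
        out.append((p, r, f1))
    return out

# Confusion matrix counts from Table 8 (fold 5, KNN, irrigated condition)
cm = [[19, 7, 0],
      [7, 31, 3],
      [0, 6, 14]]
metrics = per_class_metrics(cm)
```

Rounding each value to one decimal place recovers exactly the percentages reported in Table 9 (e.g., Class 3: precision 14/17 ≈ 82.4%, recall 14/20 = 70.0%, F1 ≈ 75.7%).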

3.2. Rainfed Condition

3.2.1. Results for the Regression Models

Similarly to the samples under irrigated conditions, the machine learning techniques were applied to the rainfed dataset, considering the 5, 10, and 20 most relevant attributes as input features. Table 10 displays the mean (μ) and standard deviation (σ) values of RMSE and R². The MLP method presented the best results considering 20 attributes, with an RMSE of 1.0569 ± 0.1246 and an R² of 0.5441 ± 0.1099.
Figure 4 compares the actual and estimated data for the best fold of the MLP model. Small errors can be observed for water potential values between −2.0 and 0 MPa, while other values show an underestimation. The parameters obtained after the iterations comprise a neural network with a single hidden layer, featuring 3 neurons and the sigmoid activation function. For this model, RMSE = 0.7841 and R² = 0.7690 were found.
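For reference, the two regression metrics reported throughout this section can be computed as follows (an illustrative Python sketch with our own variable names; the study used MATLAB):

```python
def rmse(y_true, y_pred):
    """Root-mean-square error between actual and estimated values."""
    n = len(y_true)
    return (sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n) ** 0.5

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot
```

A perfect estimator yields RMSE = 0 and R² = 1; R² falls toward zero (or below) as the residual sum of squares approaches the total variance of the measured water potentials.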
To assess the comparative performance of the machine learning models under rainfed conditions, a one-way ANOVA was applied to evaluate whether the mean prediction errors differed among the combinations of algorithms and feature sets (5, 10, and 20 features). For the RMSE metric, the ANOVA did not detect a significant global effect (p = 0.7039), indicating that the evaluated configurations exhibit statistically indistinguishable prediction errors. Consistent with this outcome, the Tukey HSD test revealed no significant pairwise contrasts, as illustrated in Figure 5, where the confidence intervals of the model–feature combinations show substantial overlap.
A similar pattern was observed for the R² metric. The ANOVA again indicated the absence of significant differences across models and feature sets (p = 0.6258), and the Tukey HSD comparisons confirmed that none of the pairwise contrasts reached statistical significance. As shown in Figure 5, the R² values are tightly grouped, and the confidence intervals widely intersect, demonstrating that the predictive performance of the models is comparable under rainfed conditions.
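The one-way ANOVA used above reduces to comparing between-group and within-group variance over the per-fold scores of each model–feature combination. The following is a minimal illustrative Python sketch of the F statistic (not the analysis code used in the study); in practice the p-value would be read from the F(df_b, df_w) distribution, e.g., via `scipy.stats.f.sf`.

```python
def one_way_anova_f(groups):
    """F statistic of a one-way ANOVA over lists of per-fold scores.
    Returns (F, df_between, df_within)."""
    k = len(groups)                        # number of model-feature combos
    n = sum(len(g) for g in groups)        # total number of folds
    grand = sum(sum(g) for g in groups) / n
    # variance explained by group membership vs. residual variance
    ss_between = sum(len(g) * (sum(g) / len(g) - grand) ** 2 for g in groups)
    ss_within = sum(sum((x - sum(g) / len(g)) ** 2 for x in g) for g in groups)
    df_b, df_w = k - 1, n - k
    return (ss_between / df_b) / (ss_within / df_w), df_b, df_w
```

On the toy groups [1, 2, 3], [2, 3, 4], and [3, 4, 5], the between- and within-group sums of squares are both 6, giving F = 3.0 with (2, 6) degrees of freedom.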

3.2.2. Results for the Classification Models

For the classification procedure, Table 11 displays the mean (μ) and standard deviation (σ) values of the balanced accuracies achieved by the implemented methods, considering three different sets of the most relevant attributes for rainfed conditions. The hyperparameters used for each model are presented in Appendix B. The balanced accuracy results did not exceed 47.55%, indicating a high level of confusion between classes and suggesting that the classifiers struggled to learn the patterns. Among the classifiers, the Random Forest exhibited the best results, achieving a balanced accuracy of 47.55 ± 2.41% with the 20 most relevant attributes as inputs. The Random Forest was parameterized with the AdaBoostM2 method, 38 iterations, a minimum leaf size of 27, a maximum number of splits of 4, and the deviance split criterion (Table A4).
Table 12 displays the confusion matrix for the best-performing Random Forest fold, for which a global accuracy of 52.81% was found. The lowest accuracies were achieved for classes 1 and 4. Class 1 was frequently misclassified as class 2, while class 4 was primarily misclassified as classes 3 and 5. Acceptable classification accuracies were found for classes 2, 3, and 5. Converting class labels to water potential values according to Table 2, the classes corresponding to values greater than −0.5 MPa, between −2.5 and −3.5 MPa, and less than −3.5 MPa are underrepresented compared to the others. The class imbalance in the training dataset (53, 90, 112, 49, and 52 samples for classes 1, 2, 3, 4, and 5, respectively) adversely impacted the model's performance, as it hindered the model's ability to learn effectively. This imbalance led to lower performance for classes 1 and 4, which had fewer training samples available.
Table 13 presents the per-class metrics for the rainfed condition corresponding to the results shown in Table 12 (confusion matrix for the 20 most relevant features of fold 2, Random Forest classifier, rainfed condition). These results reveal substantial variability in model performance across the five classes, indicating that the rainfed scenario poses greater classification challenges than the irrigated condition. Class 1 shows the largest imbalance between precision (66.7%) and recall (28.6%), resulting in a low F1-score of 40.0%. This pattern suggests that, although the model is relatively conservative and often correct when predicting Class 1, it fails to capture the majority of true Class 1 samples, leading to a high rate of false negatives. Class 2 demonstrates almost the opposite behavior: its precision is modest (41.9%), but recall reaches 59.1%, resulting in an F1-score of 49.1%. The model retrieves many actual Class 2 samples, yet the low precision indicates substantial confusion with other classes, reflecting a higher incidence of false positives. Class 3 exhibits the most balanced and robust performance, with precision and recall both at 64.3%, yielding the highest F1-score (64.3%). This result indicates that the model distinguishes this class more effectively than the others under rainfed conditions. For Class 4, the model achieves moderate precision (50.0%) but very low recall (23.1%), producing the lowest F1-score (31.6%). This again points to a tendency to miss a significant portion of true Class 4 samples, suggesting that the model has difficulty characterizing the features associated with this class. Finally, Class 5 shows an interesting contrast: although its precision (50.0%) is only moderate, its recall is the highest among all classes (75.0%), leading to an F1-score of 60.0%. This indicates that the model successfully identifies most instances of Class 5, but at the cost of misclassifying samples from other classes as Class 5.

3.3. Discussion

In our study, Pearson's correlation coefficient (ρ) was calculated to select the evaluated attributes most related to water potential. The results in Table 4 indicate that, for irrigated conditions, reflectances near the 780 nm wavelength are the most significant. This finding aligns with previous studies [37,38], which suggest that the spectral signature of vegetative targets, particularly hydrated green leaves, exhibits high reflectance in the near-infrared (NIR) range (700–1300 nm). In the infrared region (720 to 1100 nm), the interactions are related to the mesophyll structure and the variation in the amount of water [32,33]. For the rainfed condition, however, wavelengths around 690 nm were the most prominent (Table 5), identifying them as critical bands for analysis. This also aligns with the literature [37,38], which reports that coffee plants grown under rainfed conditions tend to exhibit higher reflectance in this range, whereas healthy vegetation strongly absorbs it due to chlorophyll. In the visible spectrum, wavelengths between 650 nm and 700 nm correspond to the red region, which is consistent with the observed results, as the leaves of coffee plants under rainfed conditions often display a brownish or reddish hue. Classification and regression models were then designed considering only the most relevant features for each condition (irrigated and rainfed).
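The feature-selection step amounts to ranking attributes by the absolute Pearson correlation with the measured water potential. A minimal illustrative sketch follows (Python; the dictionary of features is a toy stand-in for the reflectance bands, not the study's data):

```python
def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def rank_features(features, target):
    """Rank attributes by |rho| against the target, most relevant first."""
    scored = [(abs(pearson(col, target)), name) for name, col in features.items()]
    return [name for score, name in sorted(scored, reverse=True)]

# toy example: feature "a" tracks the target perfectly, "b" does not
features = {"a": [1.0, 2.0, 3.0, 4.0], "b": [1.0, -1.0, 1.0, -1.0]}
target = [2.0, 4.0, 6.0, 8.0]
ranking = rank_features(features, target)
```

With the thresholds of Table 1, |ρ| ≥ 0.70 would mark a band as strongly correlated with water potential; in the toy example above, "a" ranks first with |ρ| = 1.0.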
The machine learning methods used for estimating coffee leaf water potential differ in complexity, interpretability, and robustness. MLPs can model complex nonlinear relationships but require large datasets, careful tuning, and are less interpretable. Decision trees are simple and interpretable but prone to overfitting and may not capture subtle spectral patterns. Random forests improve accuracy and robustness by combining multiple trees, though they are less interpretable and computationally heavier. k-Nearest Neighbors is a simple, non-parametric approach but suffers in high-dimensional spectral spaces and with noisy data. Overall, each method presents trade-offs between predictive performance, interpretability, and computational cost, highlighting the need to select the approach based on data characteristics and study objectives. Importantly, the proposed models can be retrained efficiently when new cultivars or sensors are introduced, since they are based on standard machine learning algorithms with relatively low computational cost. In practice, retraining requires only the acquisition of new spectral and physiological data under the desired conditions, which can be processed using the same pipeline described in this study.
In summary, the proposed classifiers and estimators demonstrated superior performance when applied to irrigated coffee data. However, the non-uniform variation of values across the wavelength ranges posed challenges for the classifiers and estimators. Using more advanced oversampling or undersampling techniques, along with collecting additional data, could improve the results and should be explored in future work.
Decision tree and MLP techniques achieved the best performance for irrigated and rainfed data, respectively, when using the estimation method. However, performance may have been affected by the limited number of samples with water potential values below −2.5 MPa.
For the classification method, two distinct techniques demonstrated the best performance: Random Forest for rainfed data and K-Nearest Neighbors (KNN) for irrigated data. An imbalance in the data set was observed, which affected the results. This issue can be mitigated by employing oversampling algorithms along with collecting additional data, which are planned to be used in future studies.
Despite the significant relevance of leaf water potential and its association with spectral indices, there is a notable gap in the literature on studies aimed at estimating and classifying water potential in coffee plants without relying on complex direct measurements. This gap limits the availability of comparative benchmarks in the field.
Using global accuracy as the evaluation metric for the classification method, the results of this study were less favorable than those reported in our previous work [6], which utilized spectral indices derived from wavelengths obtained through field-based spectral measurements. However, estimation via spectral signatures remains important, as direct measurement of water potential involves labor-intensive and technically demanding procedures.
By providing detailed information on plant water status, the spectral signature-based methods explored in this study indicate a potential pathway toward the future development of intelligent, accessible, real-time sensing systems for coffee plantations. While no hardware prototype or cost evaluation was conducted, such systems could ultimately support more precise irrigation strategies and enhanced crop monitoring.

4. Conclusions

This study presented a novel approach for estimating and classifying the water potential of coffee plants using full-spectrum leaf reflectance data combined with machine learning techniques, avoiding reliance on spectral indices or labor-intensive direct measurements. By implementing four methods—Multi-Layer Perceptron (MLP), Decision Tree, Random Forest, and K-Nearest Neighbors (KNN)—and applying both regression and classification strategies, we demonstrated that different algorithms perform optimally under distinct crop management conditions. Specifically, Decision Tree and MLP models achieved the highest accuracy for irrigated and rainfed plants, respectively, while Random Forest and KNN were superior for classification tasks.
Future work should focus on addressing data limitations, such as imbalanced water potential ranges and non-uniform spectral variation, through the collection of additional measurements and the application of advanced oversampling or undersampling techniques. Moreover, exploring hyperspectral datasets and more sophisticated machine learning architectures may further enhance predictive accuracy and generalizability across different cultivars, environmental conditions, and management systems.
Regarding the instrumentation, the spectrometer used in this study costs on the order of USD 20,000. One of our ongoing objectives is to adapt the proposed methodology to low-cost sensing systems that can reproduce the same analytical principles while achieving a substantial cost reduction (approximately 80%). Future developments will focus on implementing simplified optical sensors or miniaturized spectrometers to enable wider application in both research and field contexts.

Author Contributions

Conceptualization, D.D.F., M.M.L.V. and V.A.S.; methodology, D.D.F. and D.C.T.D.; software, D.C.T.D. and R.T.D.; validation, C.S.M.d.M. and M.d.O.S.; formal analysis, D.D.F. and M.M.L.V.; investigation, V.A.S.; resources, C.S.M.d.M. and M.d.O.S.; data curation, C.S.M.d.M. and M.d.O.S.; writing—original draft preparation, D.C.T.D.; writing—review and editing, D.D.F., M.M.L.V. and V.A.S.; visualization, D.D.F. and D.C.T.D.; supervision, D.D.F. and M.M.L.V.; project administration, M.M.L.V. and V.A.S.; funding acquisition, M.M.L.V. and V.A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research and the APC were funded by Fundação de Amparo à Pesquisa do Estado de Minas Gerais, grant number APQ-04470-23.

Data Availability Statement

The datasets and MATLAB codes used in this study are available at http://www.aia.ufla.br/home/filesdatasets/, accessed on 27 November 2025.

Acknowledgments

We would like to express our gratitude primarily to the researchers from the Graduate Program at the Federal University of Lavras and the researchers from EPAMIG. Their support, along with the backing of the Coffee Research Consortium, EMBRAPA, CNPq, INCT-Coffee, Fapemig, and Capes, was instrumental in the completion of this project.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Analysis with SMOTE

In order to enhance the performance of the machine learning techniques, the Synthetic Minority Over-sampling Technique (SMOTE) algorithm [24] was employed. This method generates synthetic samples to mitigate class imbalance by augmenting the minority classes. Table A1 presents the class distributions after generating synthetic samples using the SMOTE algorithm for the irrigated condition. Table A2 presents the SMOTE balanced accuracy achieved (for the best fold of k-fold cross validation) for the irrigated condition.
Table A1. Data distribution after applying the SMOTE algorithm, irrigated condition.

Classification | Total Data | Class 1 | Class 2 | Class 3
Raw Training Data | 350 | 103 | 167 | 80
Synthetic Training Data | 178 | 61 | 0 | 117
Total Training Data | 528 | 161 | 167 | 197
Test Data | 87 | 26 | 41 | 20
The comparison between classification results using SMOTE and the approaches without synthetic data generation for the irrigated condition (Section 3.1.2) indicates that SMOTE did not consistently improve model performance. In the majority of comparisons, the upper limit of the accuracy obtained without SMOTE exceeded the best accuracy achieved with SMOTE. Notably, the KNN model showed a substantial degradation in performance when SMOTE was applied, dropping from an average accuracy of approximately 67% to around 55–58%. This suggests that the synthetic samples generated by SMOTE, especially for KNN, likely introduced noise or misrepresented class boundaries. The only significant improvement occurred for the Random Forest model with 22 features, for which SMOTE yielded an accuracy of 71.00%, exceeding the upper accuracy limit of 66.91% obtained without SMOTE.
Table A2. SMOTE balanced accuracy for the irrigated condition in percentage (%).

Technique | 5 Features | 10 Features | 22 Features
Neural Network | 62.94 | 58.98 | 65.81
Decision Tree | 63.08 | 64.33 | 61.27
Random Forest | 62.52 | 62.13 | 71.00
KNN | 57.97 | 56.26 | 55.05
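The core of SMOTE [24] is linear interpolation between a minority-class sample and one of its k nearest minority-class neighbors. The sketch below (illustrative Python, not the implementation used in this study) shows that idea; function and variable names are our own.

```python
import random

def smote_like(minority, n_new, k=3, seed=0):
    """Generate n_new synthetic samples by interpolating between a random
    minority-class sample and one of its k nearest minority neighbors,
    which is the core idea of SMOTE (Chawla et al., 2002)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        # k nearest neighbors of x within the minority class (excluding x)
        neighbours = sorted((m for m in minority if m != x),
                            key=lambda m: sum((a - b) ** 2 for a, b in zip(x, m)))[:k]
        nb = rng.choice(neighbours)
        u = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(tuple(a + u * (b - a) for a, b in zip(x, nb)))
    return synthetic
```

Because each synthetic point lies on a segment between two real minority samples, it stays inside the region occupied by that class; as the comparison above shows, however, such interpolated points can still blur class boundaries for distance-based learners like KNN.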

Appendix B. Optimized Parameters

Table A3 and Table A4 present the hyperparameters used for each classifier and for each number of input features, for the irrigated and rainfed conditions, respectively. These hyperparameters were found using MATLAB's hyperparameter optimization facilities (the HyperparameterOptimizationOptions name-value argument of the model-fitting functions).
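MATLAB's built-in optimizer uses Bayesian optimization; conceptually, though, hyperparameter selection amounts to searching a parameter space for the configuration with the best cross-validated score. The following Python sketch illustrates the idea with an exhaustive grid and a purely hypothetical toy objective (the parameter names echo Tables A3 and A4, but the scoring function is invented for illustration):

```python
import itertools

def grid_search(param_grid, evaluate):
    """Exhaustive search over a hyperparameter grid. `evaluate` must return
    a (cross-validated) score for one parameter combination; higher is
    better. Returns the best combination and its score."""
    best_params, best_score = None, float("-inf")
    keys = sorted(param_grid)
    for values in itertools.product(*(param_grid[k] for k in keys)):
        params = dict(zip(keys, values))
        score = evaluate(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score

# toy objective: pretends accuracy peaks at NumNeighbors = 23 with 'hamming'
grid = {"NumNeighbors": [5, 14, 23, 60],
        "Distance": ["hamming", "jaccard", "cosine"]}
def toy_eval(p):
    return -abs(p["NumNeighbors"] - 23) + (1 if p["Distance"] == "hamming" else 0)

best, score = grid_search(grid, toy_eval)
```

Bayesian optimization replaces the exhaustive loop with a surrogate model that proposes promising configurations, which matters when each evaluation requires a full k-fold cross-validation run.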
Table A3. Hyperparameters used by the classifiers in the irrigated condition.

Feature Set | Technique | Parameter | Value
05 Features | Neural Network | Activations | relu
 | | Lambda | 8.5764 × 10⁻⁶
 | | LayerSizes | 2 109 53
 | Decision Tree | MinLeafSize | 11
 | | MaxNumSplits | 234
 | | SplitCriterion | deviance
 | | NumVariablesToSample | 5
 | Random Forest | Method | AdaBoostM2
 | | NumLearningCycles | 10
 | | LearnRate | 0.20432
 | | MinLeafSize | 10
 | | MaxNumSplits | 342
 | | SplitCriterion | twoing
 | KNN | NumNeighbors | 23
 | | Distance | hamming
10 Features | Neural Network | Activations | tanh
 | | Lambda | 0.00025248
 | | LayerSizes | 2 19
 | Decision Tree | MinLeafSize | 13
 | | MaxNumSplits | 18
 | | SplitCriterion | deviance
 | | NumVariablesToSample | 10
 | Random Forest | Method | AdaBoostM2
 | | NumLearningCycles | 11
 | | LearnRate | 0.074121
 | | MinLeafSize | 2
 | | MaxNumSplits | 12
 | | SplitCriterion | twoing
 | KNN | NumNeighbors | 22
 | | Distance | jaccard
22 Features | Neural Network | Activations | sigmoid
 | | Lambda | 0.00051691
 | | LayerSizes | 15 1
 | Decision Tree | MinLeafSize | 12
 | | MaxNumSplits | 330
 | | SplitCriterion | gdi
 | | NumVariablesToSample | 20
 | Random Forest | Method | RUSBoost
 | | NumLearningCycles | 31
 | | LearnRate | 0.022617
 | | MinLeafSize | 13
 | | MaxNumSplits | 14
 | | SplitCriterion | twoing
 | KNN | NumNeighbors | 14
 | | Distance | jaccard
Table A4. Hyperparameters used by the classifiers in the rainfed condition.

Feature Set | Technique | Parameter | Value
05 Features | Neural Network | Activations | none
 | | Lambda | 5.7363 × 10⁻⁷
 | | LayerSizes | 39
 | Decision Tree | MinLeafSize | 2
 | | MaxNumSplits | 22
 | | SplitCriterion | twoing
 | | NumVariablesToSample | 5
 | Random Forest | Method | AdaBoostM2
 | | NumLearningCycles | 295
 | | LearnRate | 0.36024
 | | MinLeafSize | 6
 | | MaxNumSplits | 5
 | | SplitCriterion | deviance
 | KNN | NumNeighbors | 60
 | | Distance | chebychev
10 Features | Neural Network | Activations | none
 | | Lambda | 2.3528 × 10⁻⁶
 | | LayerSizes | 40 8
 | Decision Tree | MinLeafSize | 6
 | | MaxNumSplits | 5
 | | SplitCriterion | twoing
 | | NumVariablesToSample | 10
 | Random Forest | Method | Bag
 | | NumLearningCycles | 16
 | | LearnRate | NaN
 | | MinLeafSize | 1
 | | MaxNumSplits | 39
 | | SplitCriterion | deviance
 | KNN | NumNeighbors | 9
 | | Distance | chebychev
20 Features | Neural Network | Activations | none
 | | Lambda | 5.4017 × 10⁻⁸
 | | LayerSizes | 46 47
 | Decision Tree | MinLeafSize | 10
 | | MaxNumSplits | 108
 | | SplitCriterion | gdi
 | | NumVariablesToSample | 21
 | Random Forest | Method | AdaBoostM2
 | | NumLearningCycles | 38
 | | LearnRate | 0.95278
 | | MinLeafSize | 27
 | | MaxNumSplits | 4
 | | SplitCriterion | deviance
 | KNN | NumNeighbors | 23
 | | Distance | cosine

References

  1. Zhang, C.; Pattey, E.; Liu, J.; Cai, H.; Shang, J.; Dong, T. Retrieving leaf and canopy water content of winter wheat using vegetation water indices. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 112–126. [Google Scholar] [CrossRef]
  2. Makraki, T.; Tsaniklidis, G.; Papadimitriou, D.M.; Taheri-Garavand, A.; Fanourakis, D. Non-Destructive Monitoring of Postharvest Hydration in Cucumber Fruit Using Visible-Light Color Analysis and Machine-Learning Models. Horticulturae 2025, 11, 1283. [Google Scholar] [CrossRef]
  3. Tayade, R.; Yoon, J.; Lay, L.; Khan, A.L.; Yoon, Y.; Kim, Y. Utilization of spectral indices for high-throughput phenotyping. Plants 2022, 11, 1712. [Google Scholar] [CrossRef]
  4. Muraoka, H.; Noda, H.M.; Nagai, S.; Motohka, T.; Saitoh, T.M.; Nasahara, K.N.; Saigusa, N. Spectral vegetation indices as the indicator of canopy photosynthetic productivity in a deciduous broadleaf forest. J. Plant Ecol. 2013, 6, 393–407. [Google Scholar] [CrossRef]
  5. Genc, L.; Inalpulat, M.; Kizi, U.; Mirik, M.; Smith, S.E.; Mendes, M. Determination of water stress with spectral reflectance on sweet corn (Zea mays L.) using classification tree (CT) analysis. Zemdirbyste-Agriculture 2013, 100, 81–90. [Google Scholar] [CrossRef]
  6. Nunes, P.H.; Pierangeli, E.V.; Santos, M.O.; Silveira, H.R.O.; de Matos, C.S.M.; Pereira, A.B.; Alves, H.M.R.; Volpato, M.M.L.; Silva, V.A.; Ferreira, D.D. Predicting coffee water potential from spectral reflectance indices with neural networks. Smart Agric. Technol. 2023, 4, 100213. [Google Scholar] [CrossRef]
  7. Sthéfany Airane, d.S.; Gabriel Araújo, e.S.F.; Vanessa, C.F.; Margarete Marin, L.V.; Marley, L.M.; Vânia, A.S. Evaluation of the water conditions in coffee plantations using RPA. AgriEngineering 2023, 5, 65. [Google Scholar]
  8. Santos, L.M.d.; Ferraz, G.A.e.S.; Carvalho, M.A.d.F.; Campos, A.A.V.; Neto, P.M.; Xavier, L.A.G.; Mattia, A.; Becciolini, V.; Rossi, G. A spatial analysis of coffee plant temperature and its relationship with water potential and stomatal conductance using a thermal camera embedded in a remotely piloted aircraft. Agronomy 2024, 14, 2414. [Google Scholar] [CrossRef]
  9. Polivova, M.; Brook, A. Detailed investigation of spectral vegetation indices for fine field-scale phenotyping. In Vegetation Index and Dynamics; IntechOpen: London, UK, 2021. [Google Scholar]
  10. Dao, P.D.; He, Y.; Proctor, C. Plant drought impact detection using ultra-high spatial resolution hyperspectral images and machine learning. Int. J. Appl. Earth Obs. Geoinf. 2021, 102, 102364. [Google Scholar] [CrossRef]
  11. Asaari, M.S.M.; Mishra, P.; Mertens, S.; Dhondt, S.; Inzé, D.; Wuyts, N.; Scheunders, P. Close-range hyperspectral image analysis for the early detection of stress responses in individual plants in a high-throughput phenotyping platform. ISPRS J. Photogramm. Remote Sens. 2018, 138, 121–138. [Google Scholar] [CrossRef]
  12. Theodoridis, S.; Koutroumbas, K. Pattern Recognition, 4th ed.; Academic Press: Cambridge, MA, USA, 2009. [Google Scholar]
  13. Chen, Y.; Song, L.; Liu, Y.; Yang, L.; Li, D. A review of the artificial neural network models for water quality prediction. Appl. Sci. 2020, 10, 5776. [Google Scholar] [CrossRef]
  14. Haykin, S. Neural Networks and Learning Machines, 3rd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2008. [Google Scholar]
  15. Witten, I.H.; Frank, E.; Hall, M.A. Data Mining: Practical Machine Learning Tools and Techniques; Morgan Kaufmann Publishers: Burlington, MA, USA, 2011. [Google Scholar]
  16. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  17. Suyal, M.; Goyal, P. A review on analysis of K-nearest neighbor classification machine learning algorithms based on supervised learning. Int. J. Eng. Trends Technol. 2022, 70, 43–48. [Google Scholar] [CrossRef]
  18. Tukey, J.W. Exploratory Data Analysis; Pearson: New York, NY, USA, 1975. [Google Scholar]
  19. Pratt, W.K. Digital Image Processing: Piks Inside; John Wiley & Sons: New York, NY, USA, 2001. [Google Scholar]
  20. Watt, A. Database Design; BCcampus: Victoria, BC, Canada, 2014. [Google Scholar]
  21. Ding, C.; Peng, H. Minimum redundancy feature selection from microarray gene expression data. Comput. Syst. Bioinform. 2003, 3, 523–528. [Google Scholar]
  22. Darbellay, G.A.; Vajda, I. Estimation of the information by an adaptive partitioning of the observation space. IEEE Trans. Inf. Theory 1999, 45, 1315–1321. [Google Scholar] [CrossRef]
  23. Silva, V.A.; Volpato, M.M.L.; Figueiredo, V.C.; Pereira, A.B.; de Matos, C.S.M.; Santos, M.O. Impacto do déficit hídrico e temperaturas elevadas sobre o estado hídrico do cafeeiro nas regiões Sul e Cerrado de Minas Gerais. J. Epamig 2021, 35, 1–5. [Google Scholar]
  24. Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  25. Ojala, M.; Garriga, G.C. Permutation tests for studying classifier performance. IEEE Int. Conf. Data Min. 2009, 9, 908–913. [Google Scholar]
  26. Hodson, T.O. Root-mean-square error (RMSE) or mean absolute error (MAE): When to use them or not. Geosci. Model Dev. 2022, 15, 5481–5487. [Google Scholar] [CrossRef]
  27. Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE)?—Arguments against avoiding RMSE in the literature. Geosci. Model Dev. 2014, 7, 1247–1250. [Google Scholar] [CrossRef]
  28. Saunders, L.J.; Russell, R.A.; Crabb, D.P. The Coefficient of Determination: What Determines a Useful R2 Statistic? Investig. Ophthalmol. Vis. Sci. 2012, 53, 6830–6832. [Google Scholar]
  29. Nagelkerke, N.J.D. A note on a general definition of the coefficient of determination. Biometrika 1991, 78, 691–692. [Google Scholar] [CrossRef]
  30. Wang, X.; Zhao, C.; Guo, N.; Li, Y.; Jian, S.; Yu, K. Determining the Canopy Water Stress for Spring Wheat Using Canopy Hyperspectral Reflectance Data in Loess Plateau Semiarid Regions. Spectrosc. Lett. 2015, 48, 492–498. [Google Scholar] [CrossRef]
  31. Chekol, H.; Warkineh, B.; Shimber, T.; Mierek-Adamska, A.; Dąbrowska, G.B.; Degu, A. Drought Stress Responses in Arabica Coffee Genotypes: Physiological and Metabolic Insights. Plants 2024, 13, 828. [Google Scholar] [CrossRef] [PubMed]
  32. Bowker, D.E.; Davis, R.E.; Myrick, D.L.; Jones, W.T. Spectral Reflectance of Natural Targets for Use in Remote Sensing Studies; NASA: Hampton, VA, USA, 1985. [Google Scholar]
  33. Dulam, F.; Solanki, P.; Yadav, M.; Pandey, K.V.; Rajput, R.; Kumar, M.; Srishtty, S. Applications of remote sensing in horticulture: A review. Plant Arch. 2025, 25, 2552–2561. [Google Scholar]
  34. Hunt, E.R.; Rock, B.N. Detection of changes in leaf water content using near- and middle-infrared reflectances. Remote Sens. Environ. 1989, 30, 43–54. [Google Scholar]
  35. da Silva, P.C.; Ribeiro Junior, W.Q.; Ramos, M.L.G.; Lopes, M.F.; Santana, C.C.; Casari, R.A.d.C.N.; Brasileiro, L.d.O.; Veiga, A.D.; Rocha, O.C.; Malaquias, J.V.; et al. Multispectral Images for Drought Stress Evaluation of Arabica Coffee Genotypes Under Different Irrigation Regimes. Sensors 2024, 24, 7271. [Google Scholar] [CrossRef]
  36. Dudani, S.A. The Distance-Weighted k-Nearest-Neighbor Rule. IEEE Trans. Syst. Man Cybern. 1976, 6, 325–327. [Google Scholar]
  37. Carter, G.A. Primary and secondary effects of water content on the spectral reflectance of leaves. Am. J. Bot. 1991, 78, 916–924. [Google Scholar]
  38. Gerhards, M.; Schlerf, M.; Mallick, K.; Udelhoven, T. Challenges and future perspectives of multi-/hyperspectral thermal infrared remote sensing for crop water-stress detection. Remote Sens. Agric. Veg. 2019, 11, 1240. [Google Scholar]
Figure 1. Flow chart of the design of the proposed approaches.
Figure 2. Actual vs. estimated data (red asterisks) for the best-performing fold of the decision tree model, irrigated condition. The blue line represents the ideal 1:1 relationship between the estimated and the actual values.
Figure 3. Tukey HSD test for the regression models in the irrigated condition considering RMSE and R 2 as metrics. The blue and red colors highlight the methods whose mean performance differs significantly from the others according to Tukey’s test.
Figure 4. Actual vs. estimated data (red asterisks) for the best-performing fold of the MLP model, rainfed condition. The blue line represents the ideal 1:1 relationship between the estimated and the actual values.
Figure 5. Tukey HSD test for the regression models in the rainfed condition considering RMSE and R 2 as metrics.
Table 1. Interpretation of correlation coefficient values (ρ).

Value of |ρ| | Interpretation
0.00 to 0.19 | Very weak correlation
0.20 to 0.39 | Weak correlation
0.40 to 0.69 | Moderate correlation
0.70 to 0.89 | Strong correlation
0.90 to 1.00 | Very strong correlation
Table 2. Classes and ranges of water potential.

Water Potential Values (Ψw) (MPa) | Class
Ψw up to −0.5 MPa | 1
Ψw between −0.5 and −1.4 MPa | 2
Ψw between −1.5 and −2.4 MPa | 3
Ψw between −2.5 and −3.5 MPa | 4
Ψw less than −3.5 MPa | 5
Table 3. Number of samples per class.

Class | Rainfed (Number of Samples) | Irrigated (Number of Samples)
1 | 67 | 129
2 | 112 | 208
3 | 140 | 100
4 | 62 | 0
5 | 64 | 0
Table 4. The 10 most relevant attributes for irrigated conditions.

Ranking | Attribute Designation
1 | Collection month
2 | Reflectance for λ = 780 nm
3 | Reflectance for λ = 785 nm
4 | Collection year
5 | Reflectance for λ = 783 nm
6 | Reflectance for λ = 781 nm
7 | Reflectance for λ = 779 nm
8 | Reflectance for λ = 784 nm
9 | Reflectance for λ = 782 nm
10 | Reflectance for λ = 779 nm
Table 5. The 10 most relevant attributes for rainfed conditions.

Ranking | Attribute Designation
1 | Collection month
2 | Reflectance for λ = 691 nm
3 | Reflectance for λ = 694 nm
4 | Reflectance for λ = 693 nm
5 | Reflectance for λ = 698 nm
6 | Reflectance for λ = 690 nm
7 | Reflectance for λ = 688 nm
8 | Reflectance for λ = 695 nm
9 | Reflectance for λ = 692 nm
10 | Reflectance for λ = 689 nm
Table 6. Achieved RMSE and R² values for the irrigated condition in terms of μ ± σ.

22 features:
Method | μ_RMSE ± σ_RMSE | μ_R² ± σ_R²
MLP | 0.4189 ± 0.0400 | 0.5733 ± 0.0781
Decision Tree | 0.4160 ± 0.0284 | 0.5787 ± 0.0684
Random Forest | 0.4249 ± 0.0363 | 0.5604 ± 0.0687
KNN | 0.4812 ± 0.0373 | 0.4353 ± 0.0788

10 features:
Method | μ_RMSE ± σ_RMSE | μ_R² ± σ_R²
MLP | 0.4309 ± 0.0487 | 0.5457 ± 0.0933
Decision Tree | 0.4063 ± 0.0365 | 0.5965 ± 0.0752
Random Forest | 0.4325 ± 0.0222 | 0.5482 ± 0.0555
KNN | 0.4605 ± 0.0443 | 0.4867 ± 0.0860

5 features:
Method | μ_RMSE ± σ_RMSE | μ_R² ± σ_R²
MLP | 0.4205 ± 0.0519 | 0.5697 ± 0.0954
Decision Tree | 0.3884 ± 0.0299 | 0.6313 ± 0.0569
Random Forest | 0.4256 ± 0.0449 | 0.5564 ± 0.0837
KNN | 0.4522 ± 0.0324 | 0.5098 ± 0.0663
Table 7. Classification results in terms of balanced accuracy (μ_BA ± σ_BA) for the irrigated condition in %.

Method | 22 Features | 10 Features | 5 Features
MLP | 63.37 ± 3.98 | 61.91 ± 5.35 | 66.76 ± 4.56
Decision Tree | 64.39 ± 1.36 | 65.39 ± 5.26 | 64.59 ± 2.61
Random Forest | 64.40 ± 2.51 | 66.99 ± 4.58 | 65.60 ± 4.83
KNN | 66.76 ± 6.50 | 66.77 ± 5.52 | 67.73 ± 3.48
Table 8. Confusion matrix for the 5 most relevant features of fold 5, for the KNN classifier under the irrigated condition (global accuracy: 73.56%). Rows give the true class; percentages are row-wise.

True Class | Class 1 | Class 2 | Class 3 | True Total
Class 1 | 19 (73.1%) | 7 (26.9%) | 0 (0%) | 26 (100.0%)
Class 2 | 7 (17.1%) | 31 (75.6%) | 3 (7.3%) | 41 (100.0%)
Class 3 | 0 (0%) | 6 (30.0%) | 14 (70.0%) | 20 (100.0%)
Table 9. Per-class metrics for the irrigated condition, in percentage (%).

Class | Precision | Recall | F1-Score
1 | 73.1 | 73.1 | 73.1
2 | 70.5 | 75.6 | 72.9
3 | 82.4 | 70.0 | 75.7
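The per-class metrics in Table 9 follow directly from the confusion matrix in Table 8: precision divides each diagonal count by its column (predicted) total, recall by its row (true) total, and F1 is their harmonic mean. A minimal NumPy sketch, using the Table 8 counts:

```python
import numpy as np

# Confusion matrix from Table 8 (rows = true class, columns = predicted class)
cm = np.array([[19,  7,  0],
               [ 7, 31,  3],
               [ 0,  6, 14]])

tp = np.diag(cm).astype(float)
precision = tp / cm.sum(axis=0)   # column sums = predicted totals
recall    = tp / cm.sum(axis=1)   # row sums = true totals
f1 = 2 * precision * recall / (precision + recall)

for c, (p, r, f) in enumerate(zip(precision, recall, f1), start=1):
    print(f"Class {c}: P={p:.1%}  R={r:.1%}  F1={f:.1%}")
# Reproduces Table 9, e.g. Class 3: P=82.4%  R=70.0%  F1=75.7%
```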
Table 10. Achieved RMSE and R² values for the rainfed condition in terms of μ ± σ.

20 features
Method | μ_RMSE ± σ_RMSE | μ_R² ± σ_R²
MLP | 1.0569 ± 0.1246 | 0.5441 ± 0.1099
Decision Tree | 1.0959 ± 0.1334 | 0.5169 ± 0.1153
Random Forest | 1.0982 ± 0.1196 | 0.5117 ± 0.1045
KNN | 1.1476 ± 0.1197 | 0.4728 ± 0.0952

10 features
Method | μ_RMSE ± σ_RMSE | μ_R² ± σ_R²
MLP | 1.0887 ± 0.1423 | 0.5129 ± 0.1101
Decision Tree | 1.0938 ± 0.1399 | 0.5141 ± 0.1216
Random Forest | 1.1361 ± 0.1287 | 0.4783 ± 0.0999
KNN | 1.1577 ± 0.1255 | 0.4725 ± 0.0913

5 features
Method | μ_RMSE ± σ_RMSE | μ_R² ± σ_R²
MLP | 1.1160 ± 0.1157 | 0.4917 ± 0.1061
Decision Tree | 1.0840 ± 0.1388 | 0.5250 ± 0.1182
Random Forest | 1.0927 ± 0.1126 | 0.5173 ± 0.0930
KNN | 1.1229 ± 0.1278 | 0.5018 ± 0.0946
Table 11. Classification results in terms of balanced accuracy (μ_BA ± σ_BA) for the rainfed condition, in %.

Method | 20 Features | 10 Features | 5 Features
MLP | 45.38 ± 5.84 | 45.69 ± 4.99 | 45.86 ± 2.46
Decision Tree | 46.17 ± 3.49 | 46.59 ± 1.26 | 43.94 ± 3.53
Random Forest | 47.55 ± 2.41 | 46.10 ± 1.92 | 46.17 ± 4.57
KNN | 43.14 ± 2.95 | 46.65 ± 4.90 | 42.72 ± 3.67
Table 12. Confusion matrix for the 20 most relevant features of fold 2, for the Random Forest classifier under the rainfed condition (overall accuracy: 52.81%). Rows give the true class; percentages are row-wise.

True Class | Class 1 | Class 2 | Class 3 | Class 4 | Class 5 | True Total
Class 1 | 4 (28.6%) | 10 (71.4%) | 0 (0%) | 0 (0%) | 0 (0%) | 14 (100.0%)
Class 2 | 2 (9.1%) | 13 (59.1%) | 3 (13.6%) | 0 (0%) | 4 (18.2%) | 22 (100.0%)
Class 3 | 0 (0%) | 7 (25.0%) | 18 (64.3%) | 1 (3.6%) | 2 (7.1%) | 28 (100.0%)
Class 4 | 0 (0%) | 0 (0%) | 7 (53.8%) | 3 (23.1%) | 3 (23.1%) | 13 (100.0%)
Class 5 | 0 (0%) | 1 (8.3%) | 0 (0%) | 2 (16.7%) | 9 (75.0%) | 12 (100.0%)
Table 13. Per-class metrics for the rainfed condition, in percentage (%).

Class | Precision | Recall | F1-Score
1 | 66.7 | 28.6 | 40.0
2 | 41.9 | 59.1 | 49.1
3 | 64.3 | 64.3 | 64.3
4 | 50.0 | 23.1 | 31.6
5 | 50.0 | 75.0 | 60.0
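The two accuracy figures reported for the rainfed fold in Table 12 can likewise be checked from its counts: overall accuracy is the trace of the confusion matrix over the total sample count, while balanced accuracy is the unweighted mean of the per-class recalls, so it is not inflated by the larger classes. A short NumPy sketch using the Table 12 counts:

```python
import numpy as np

# Confusion matrix from Table 12 (rows = true class, columns = predicted class)
cm = np.array([[ 4, 10,  0, 0, 0],
               [ 2, 13,  3, 0, 4],
               [ 0,  7, 18, 1, 2],
               [ 0,  0,  7, 3, 3],
               [ 0,  1,  0, 2, 9]])

overall = np.trace(cm) / cm.sum()                 # fraction of correct predictions
per_class_recall = np.diag(cm) / cm.sum(axis=1)   # row-normalized diagonal
balanced = per_class_recall.mean()                # mean recall over the 5 classes

print(f"overall = {overall:.2%}, balanced = {balanced:.2%}")
```

This reproduces the 52.81% overall accuracy stated in the Table 12 caption; the balanced accuracy for this single fold is lower, reflecting the weak recall on the minority classes 1 and 4 visible in Table 13.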
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Delfino, D.C.T.; Ferreira, D.D.; Volpato, M.M.L.; Silva, V.A.; Delfino, R.T.; de Matos, C.S.M.; Santos, M.d.O. Estimation and Classification of Coffee Plant Water Potential Using Spectral Reflectance and Machine Learning Techniques. Biophysica 2025, 5, 60. https://doi.org/10.3390/biophysica5040060
