Estimating Tea Plant Physiological Parameters Using Unmanned Aerial Vehicle Imagery and Machine Learning Algorithms

Zhuang, Zhong-Han; Tsai, Hui-Ping; Chen, Chung-I

doi:10.3390/s25071966

Open AccessArticle

Estimating Tea Plant Physiological Parameters Using Unmanned Aerial Vehicle Imagery and Machine Learning Algorithms

by

Zhong-Han Zhuang

^1,2

,

Hui-Ping Tsai

^1,2,3,4,*

and

Chung-I Chen

⁵

¹

Department of Civil Engineering, National Chung Hsing University, Taichung 402, Taiwan

²

Innovation and Development Center of Sustainable Agriculture, National Chung Hsing University, Taichung 402, Taiwan

³

i-Center for Advanced Science and Technology (i-CAST), National Chung Hsing University, Taichung 402, Taiwan

⁴

Smart Multidisciplinary Agriculture Research and Technology Center, National Chung Hsing University, Taichung 402, Taiwan

⁵

Department of Forestry, National Pingtung University of Science and Technology, Pingtung 912, Taiwan

^*

Author to whom correspondence should be addressed.

Sensors 2025, 25(7), 1966; https://doi.org/10.3390/s25071966

Submission received: 27 February 2025 / Revised: 17 March 2025 / Accepted: 19 March 2025 / Published: 21 March 2025

(This article belongs to the Special Issue Application of UAV and Sensing in Precision Agriculture)

Download

Browse Figures

Versions Notes

Abstract

:

Tea (Camellia sinensis L.) holds agricultural economic value and forestry carbon sequestration potential, with Taiwan’s annual tea production exceeding TWD 7 billion. However, climate change-induced stressors threaten tea plant growth, photosynthesis, yield, and quality, necessitating an accurate real-time monitoring system to enhance plantation management and production stability. This study surveys tea plantations at low, mid-, and high elevations in Nantou County, central Taiwan, collecting data from 21 fields using conventional farming methods (CFMs), which emphasize intensive management, and agroecological farming methods (AFMs), which prioritize environmental sustainability. This study integrates leaf area index (LAI), photochemical reflectance index (PRI), and quantum yield of photosystem II (ΦPSII) data with unmanned aerial vehicles (UAV)-derived visible-light and multispectral imagery to compute color indices (CIs) and multispectral indices (MIs). Using feature ranking methods, an optimized dataset was developed, and the predictive performance of eight regression algorithms was assessed for estimating tea plant physiological parameters. The results indicate that LAI was generally lower in AFMs, suggesting reduced leaf growth density and potential yield differences. However, PRI and ΦPSII values revealed greater environmental adaptability and potential long-term ecological benefits in AFMs compared to CFMs. Among regression models, MIs provided greater stability for tea plant physiological parameters, whereas feature ranking methods had minimal impact on accuracy. XGBoost outperformed all models in predicting parameters, achieving optimal results for (1) LAI: R² = 0.716, RMSE = 1.01, MAE = 0.683, (2) PRI: R² = 0.643, RMSE = 0.013, MAE = 0.009, and (3) ΦPSII: R² = 0.920, RMSE = 0.048, MAE = 0.013. Overall, we highlight the effectiveness of integrating gradient boosting models with multispectral data to capture tea plant physiological characteristics. This study develops generalizable predictive models for tea plant physiological parameter estimation and advances non-contact crop physiological monitoring for tea plantation management, providing a scientific foundation for precision agriculture applications.

Keywords:

unmanned aerial vehicles; tea plants; machine learning; leaf area index; photochemical reflectance index; quantum yield of photosystem II

1. Introduction

Tea plants (Camellia sinensis L.) hold economic value in agriculture and possess the characteristics of an evergreen perennial in forestry. Beyond their potential for atmospheric CO₂ sequestration [1,2,3], Tea-based beverages are rich in bioactive compounds, particularly polyphenols, which provide notable health benefits [4,5]. In Taiwan, annual tea production reaches 12,000 metric tons, with a total market value exceeding TWD 7 billion, highlighting the tea ecosystem’s sustainability and its crucial role in the agricultural economy. According to projections from the Taiwan Climate Change Projection Information and Adaptation Knowledge Platform (TCCIP), based on Intergovernmental Panel on Climate Change (IPCC) AR6 scenarios, Taiwan is expected to experience greater seasonal rainfall variability, along with an increased risk of spring droughts. Changes in precipitation and temperature will likely impact yield and distribution or raise the risk of wildfires [6,7,8]. The growing frequency of extreme weather events and water shortages will further stress photosynthetic efficiency in tea plants [9] and increase growth risks [10], ultimately affecting tea leaf quality [11]. Tea plants are predominantly cultivated in tropical, subtropical, and temperate regions at elevations below 3000 m [12]. Their bud growth rate is regulated by a combination of environmental factors and genetic traits [13]. Consequently, enhancing tea plant resilience to climate stress requires accurate, real-time physiological monitoring to enable early intervention strategies.

Photosynthesis drives the terrestrial carbon cycle and serves as a key indicator of crop growth, productivity [14], and adaptation to environmental changes. This study focuses on three physiological indices of tea plants: (1) leaf area index (LAI), (2) photochemical reflectance index (PRI), and (3) quantum yield of photosystem II (ΦPSII). LAI quantifies total leaf area, offering insights into canopy density [15], crop yield, and production efficiency [16], while also serving as a key regulator of photosynthesis, respiration, and rainfall interception [17,18,19]. PRI is closely related to photosynthetic efficiency [20] and a plant’s response to environmental stress [21]. PRI primarily reflects energy partitioning in the light-harvesting process, with typical values ranging from −0.3 to 0.3, where higher values indicate greater light use efficiency, while lower values suggest activation of photoprotective mechanisms. ΦPSII is a key indicator of photochemical efficiency, commonly used to assess photosynthetic performance and quantify the effects of environmental stress on light reactions [22]. ΦPSII values typically range from 0 to 0.85, with higher values indicating greater PSII activity, while values below 0.2 suggest that photosynthesis is inhibited.

Traditional crop monitoring relies on manual field inspections and point-based instrument sampling, which offer high accuracy and direct insights into vegetation structure, physiological status, and photosynthetic efficiency in situ. However, these methods are time-consuming and spatially limited [23]. As research demands increase, remote sensing technologies offer advantages for large-scale agricultural monitoring, encouraging a shift toward integrating ground-based measurements with remote sensing approaches. By using high-precision field measurements as reference values and leveraging the spatial coverage of remote sensing imagery, researchers can overcome the limitations of traditional methods [24]. Among these technologies, unmanned aerial vehicles (UAVs) provide rapid, non-destructive, and large-scale crop monitoring capabilities, offering essential spectral, high-spatial, and temporal resolution data for precision agriculture [25,26,27,28]. UAVs can be equipped with visible, multispectral, hyperspectral, and thermal sensors, capturing spectral imagery to extract key crop characteristics for monitoring field conditions [29,30].

For LAI estimation, previous studies have combined spectral information, texture features, and canopy structure [31,32,33]. Gong et al. [34] successfully developed a rice LAI estimation model using destructive sampling, UAV-derived visible imagery, and canopy height data, achieving a root mean square error (RMSE) below 1.1. Similarly, Ochiai et al. [35] used visible and multispectral imagery to estimate sweet potato LAI, showing that removing background noise and selecting optimal image features improved estimation accuracy, achieving an R² of 0.887 using partial least squares regression. For PRI estimation, challenges arise due to the mismatch between the narrowband reflectance wavelengths required for PRI calculation and the available spectral bands of multispectral sensors [36]. However, previous research has demonstrated that field-measured PRI values are significantly correlated with chlorophyll content, carotenoid content, and the carotenoid-to-chlorophyll ratio [37]. As a result, alternative chlorophyll-sensitive indices have been proposed, such as the green chlorophyll index (GCI), chlorophyll vegetation index (CVI), and green normalized difference vegetation index (GNDVI), which exhibit moderate correlations with chlorophyll and carotenoid content [38]. Notably, GNDVI has shown greater sensitivity to chlorophyll concentration compared to the commonly used normalized difference vegetation index (NDVI) [39]. Estimating ΦPSII generally relies on hyperspectral sensors capable of capturing sun-induced chlorophyll fluorescence (SIF) signals, which are weak fluorescence emissions from photosynthetic pigments [40]. These emissions span the 650–800 nm range, with fluorescence peaks at 685 nm and 740 nm [41]. Sims and Gamon [42] further noted that visible wavelengths are primarily influenced by leaf surface structure, whereas near-infrared reflectance provides better insights into leaf internal structure and physical properties.

Despite the significant potential of UAVs in agricultural management and crop parameter estimation, challenges remain in applying spectral techniques for tea plant monitoring. Therefore, this study focuses on tea plantations at low, mid-, and high elevations in Nantou County, central Taiwan, collecting in situ LAI, PRI, and ΦPSII measurements from tea fields managed under the conventional farming method (CFM) and the agroecological farming method (AFM). Visible and multispectral UAV imagery was also acquired to develop a dataset of image-derived indices for constructing tea plant physiological parameter prediction models. The specific objectives of this study are (1) to investigate the effects of elevation and farming methods on tea plant physiological status through long-term in situ measurements; (2) to optimize feature selection by ranking image indices based on their relationships with tea plant physiological parameters; and (3) to evaluate the accuracy of eight regression models in predicting tea plant physiological parameters and further analyze model performance under different season, elevation, and farming method conditions, validating their applicability across diverse environmental conditions.

2. Materials and Methods

2.1. Overview

The overall workflow of this study is illustrated in Figure 1. It consists of three main steps: (1) data acquisition and preprocessing, (2) feature engineering, and (3) modeling and evaluation. In the data acquisition and preprocessing stage, physiological parameters, including leaf area index (LAI), photochemical reflectance index (PRI), and quantum yield of photosystem II (ΦPSII), were collected in the fields, while UAV flight missions were conducted to capture aerial imagery. The acquired images underwent post-processing, which included orthomosaic generation, image cropping, extraction of tea tree regions, and computation of image-based indices, including color indices (CIs) and multispectral indices (MIs). In the feature engineering stage, important features were ranked, and the dataset was divided into training and testing subsets. Finally, in the modeling and evaluation step, eight statistical and machine learning algorithms were applied to develop predictive models, with hyperparameter optimization performed to enhance model performance. The accuracy of the models was assessed, leading to the construction of predictive models for estimating tea tree physiological parameters.

2.2. Experimental Site and Design

The experimental tea plantations are located in Nantou County, central Taiwan, a key tea-growing region that accounts for approximately 60% of Taiwan’s total tea production, exceeding 7400 metric tons annually. To ensure the generalizability of this study, tea plantations at three different elevations were selected, encompassing both conventional farming methods (CFMs) and agroecological farming methods (AFMs). Regarding farming practices, the CFM involves more frequent human intervention, typically including regular weed removal, fertilization, and irrigation to ensure that tea plants receive a stable and sufficient supply of nutrients. In contrast, AFM emphasizes sustainable agricultural practices that promote ecological conservation, including reducing chemical fertilizer usage and enhancing environmental sustainability. The field survey was conducted from July 2021 to August 2024, with each experimental site surveyed once per month. Data collection was carried out sequentially based on elevation, meaning that all tea plantations within a given elevation were surveyed for one full year before transitioning to the next, ensuring that complete annual observational data were obtained for all sites at each elevation. This study focused on three elevation ranges (Figure 2). At low elevations (0~500 m) in Mingjian Township (120°37′34″ E~120°39′24″ E, 23°50′25″ N~23°52′39″ N), the predominant cultivar is Sijichun, with eight tea plantations selected for this study. At mid-elevations (500~1000 m) in Lugu Township (120°45′49″ E~120°47′20″ E, 23°43′32″ N~23°44′53″ N), the study area includes six tea plantations growing Chin-Shin-Dapan and Jinxuan. At high elevations (1000~1500 m) in Ren’ai Township (121°4′46″ E~121°7′5″ E, 23°58′56″ N~23°59′34″ N), seven tea plantations were selected, primarily cultivating Chin-Shin-Oolong. Additionally, Jinxuan, known for its strong environmental adaptability, was included in 1 to 2 plantations at each elevation. The climate of the study area is classified as subtropical to temperate, with an annual mean temperature ranging from 18 to 24 °C and an average annual precipitation of 1700 to 2600 mm. The orthomosaic images of each tea plantation are provided in Figure A1, while the basic information for each field is shown in Table A1.

2.3. Data Acquisition

Field data collection was divided into two main components: (1) physiological measurements of tea plants and (2) UAV imagery acquisition. The physiological measurements included the leaf area index (LAI) (Figure 3a), photochemical reflectance index (PRI) (Figure 3b), and chlorophyll fluorescence parameters (Figure 3c), while UAV imagery was collected using visible-light (Figure 3d) and multispectral sensors (Figure 3e). To ensure data consistency and spatial correspondence, physiological parameters and images were collected on the same day. In each sample site, three to five rectangular experimental zones with dimensions of 2 m in length and width were established, referred to as designed square zones (DSZs), which served as corresponding areas for physiological data measurements and image analysis (Figure 3f).

2.3.1. Tea Plant Physiological Parameter Acquisition

The leaf area index (LAI) is defined as the total leaf area per unit of projected ground area [43]. It reflects the density of crop leaf area and serves as a crucial parameter for assessing canopy coverage, growth, and yield potential [44]. In this study, LAI measurements for tea plants were conducted using the LAI-2200C Plant Canopy Analyzer (Li-Cor, Inc., Lincoln, NE, USA). The measurement process involved analyzing incident light from five different fields of view (FOVs). First, a reference measurement was taken in an open area to establish baseline light intensity, followed by a measurement beneath the crop canopy to assess transmitted light. The LAI value for each DSZ was computed using software algorithms.

The photochemical reflectance index (PRI) is a remote sensing method for assessing photosynthetic efficiency based on reflectance measurements [45,46]. In this study, PRI was measured using the PlantPen PRI 200 (Photon Systems Instruments Ltd., Brno, Czech Republic). Leaf clips were used to record spectral reflectance values from tea leaves, with 10 measurements taken per DSZ, and the average value was used as the PRI for that DSZ. The index is calculated from the reflectance at 531 nm and 570 nm using the following formula:

P R I = \frac{R_{531} - R_{570}}{R_{531} + R_{570}}

(1)

R_{531}

and

R_{570}

represent the green reflectance at 531 nm and the yellow reflectance at 570 nm, respectively [47]. The green band is closely related to the de-epoxidation state of the xanthophyll cycle, which plays a key role in regulating excess light energy dissipation [48,49]. Additionally, long-term variations in PRI values are influenced by changes in the size of constitutive pigment pools within leaves [50], reflecting the plant’s regulatory mechanisms for long-term environmental adaptation. Under environmental stress conditions, plants activate the xanthophyll cycle to dissipate excess light energy, thereby reducing photoinhibition damage [47]. Furthermore, PRI has been used to monitor crop water stress [51,52] and to assess photosynthetic efficiency at both leaf and canopy scales [53].

This study monitored chlorophyll fluorescence parameters and calculated the quantum yield of PSII (ΦPSII), which is an important parameter reflecting the photochemical processes in plants. By measuring the linear electron transport rate, ΦPSII describes the proportion of absorbed light energy utilized by photosystem II (PSII) for photochemical reactions [54,55]. The MINI-PAM-II photosynthesis yield analyzer (MINI-PAM-II, Walz, Germany) was used for measurements, with ΦPSII calculated using the following formula:

Φ P S I I = \frac{F_{m}^{'} - F_{s}}{{F_{m}}^{'}}

(2)

Here,

F_{m}^{'}

represents the maximum fluorescence yield of the leaf under light conditions, while

F_{s}

denotes the steady-state fluorescence yield. To ensure data representativeness, measurements were taken from 10 tea leaves per DSZ, with a mean value used for analysis.

During the experiment, a small portion of data was missing due to occasional instrument malfunctions. In the preprocessing step, all missing values were carefully removed before analysis. A total of 876 LAI values, 881 PRI values, and 827 ΦPSII values were collected for analysis.

2.3.2. UAV Image Acquisition

In this study, a DJI Phantom 4 Pro and a DJI Phantom 4 Multispectral (DJI, Shenzhen, China) were used to capture visible-light and multispectral data, respectively. The experimental design achieved centimeter-level resolution, enabling leaf-level assessments. For visible-light UAV flight missions, the flight altitude was set at 30 m. If the terrain variation within a site exceeded 20 m, a multi-altitude flight strategy was applied to ensure both the quality and completeness of the orthomosaic images. For multispectral UAV flights, an altitude of 10 m above the tea canopy was maintained to match the spatial resolution of visible-light images within each DSZ. The image processing workflow included camera calibration, geographic coordinate correction, and radiometric correction (applied only to multispectral images), ultimately generating orthomosaic images. The final image resolution was approximately 0.8 cm/pixel.

2.4. Image Processing

2.4.1. Canopy Part Segmentation

High-resolution UAV imagery captures fine details but also includes non-target features such as soil, weeds, and irrigation structures [56]. The 21 tea plantations surveyed were managed by different tea farmers, resulting in diverse landscape characteristics across sites. To integrate UAV imagery from tea plantations across elevations and farming methods while reducing spectral influence from non-tea areas and improving future operational efficiency, this study developed an image-processing-based tea canopy classification method (Figure 4). This method first applies simple linear iterative clustering (SLIC) [57] for superpixel segmentation of DSZ images. Mean values of selected image indices are then calculated for each superpixel, and binarization thresholds are empirically adjusted. Experts manually review the results to ensure accuracy, segmenting images into target objects (tea plants) and non-target objects (soil, weeds, and artificial structures). This classification method offers four key advantages: (1) It eliminates the need for large labeled datasets, maintains controllable classification accuracy, and reduces data preparation time. (2) SLIC segmentation reduces shadow and gap interference within tea canopies, minimizing salt-and-pepper noise often caused by simple binarization. (3) It adapts to varying light conditions, enabling robust differentiation between vegetation and non-vegetation regions. (4) It ensures consistency across time-series data, allowing the same hyperparameters for classification within the same site across different time periods, thus supporting long-term monitoring and analysis.

The SLIC method is a K-means clustering technique that segments images into superpixels based on Lab color similarity and spatial pixel coordinates. The algorithm initially distributes cluster centers and iteratively updates only neighboring pixels, allowing cluster centers to converge in regions with similar color and spatial characteristics. Since SLIC operates locally, it is well-suited for high-spatial-resolution imagery, preserving important boundary information within the image. In this study, SLIC parameters were adjusted based on image characteristics, including the number of superpixels and the compactness factor. The number of superpixels was site-specific, while the compactness factor, which controls the shape of superpixels, was set to 20 for visible-light images and 0.05 for multispectral images. After segmentation, Otsu’s binarization method was applied using specific vegetation indices for classification. For visible-light images, the red–green–blue vegetation index (RGBVI) and the visible atmospherically resistant index (VARI) were used, while for multispectral images, the enhanced vegetation index (EVI) and the ratio vegetation index (RVI) were selected. Threshold values were fine-tuned for each site, with upper and lower limits set to exclude non-tea areas. When spectral reflectance made differentiation challenging, manual adjustments were incorporated to enhance segmentation accuracy.

2.4.2. Calculation of Color Indices and Multispectral Indices

The index calculation for each DSZ image was based on the tea canopy segmentation results, meaning that only pixels identified as tea plants were used for index computation. The mean value of these indices within the tea plant regions was taken to represent the index value for each DSZ. For visible-light images, which consist of red, green, and blue bands, a total of 19 color indices (CIs) were computed. For multispectral images, which include five spectral bands—blue, green, red, red edge, and near-infrared (NIR)—a total of 50 multispectral indices (MIs) were derived. The names and formulas for the CIs and MIs are provided in Table A2.

2.5. Feature Ranking Methods

This study employed three feature ranking methods to analyze linearity, correlation and redundancy, and trend similarity, followed by ranking based on their results. These methods included: (1) Pearson correlation analysis (PCA), (2) Minimum Redundancy Maximum Relevance (MRMR) [58], and (3) Gray Relational Analysis (GRA). By applying the above-mentioned algorithms to assess the correlation between image indices and the target variables, the explanatory power of each index was determined, allowing for the assessment of the indices. Features were then sequentially added to the model based on their ranking to identify the optimal combination of independent variables for achieving the highest prediction accuracy.

2.5.1. Pearson Correlation Analysis

Pearson correlation analysis (PCA) evaluates the linear relationship between two variables, with results expressed as the correlation coefficient (r). In this study, feature ranking was performed based on the absolute value of r. The formula for calculating r is as follows:

r = \frac{\sum (x_{i} - \bar{x}) (y_{i} - \bar{y})}{\sqrt{\sum {(x_{i} - \bar{x})}^{2}} \sqrt{\sum {(y_{i} - \bar{y})}^{2}}}

(3)

where

x_{i}

and

y_{i}

represent the ith values of variables x and y, respectively, and

\bar{x}

and

\bar{y}

denote their respective mean values.

2.5.2. Minimum Redundancy Maximum Relevance

Minimum redundancy maximum relevance (MRMR) simultaneously considers both the relevance of features to the target variable and the redundancy among features, aiming to select a feature subset with high relevance and low redundancy. MRMR is based on mutual information, which quantifies the dependency between two variables.

MRMR evaluates feature selection through two key components: (1) Max-Relevance and (2) Min-Redundancy. The core algorithm optimizes feature selection by maximizing the relevance of features to the target variable while minimizing redundancy among the selected features. The objective function is defined as follows:

\max_{x_{j} \in X - S_{m - 1}} [I (x_{j}, y) - \frac{1}{m - 1} \sum_{x_{i} \in S_{m - 1}} I (x_{j}, x_{i})]

(4)

where

I (x_{j}, y)

represents the mutual information value between the candidate feature

x_{j}

and the target variable y, with higher values indicating greater relevance, and

\sum_{x_{i} \in S_{m - 1}} I (x_{j}, x_{i}) / (m - 1)

represents the mean mutual information between the candidate feature, (x_j), and all features in the already-selected subset,

S_{m - 1}

, with lower values indicating less redundancy.

2.5.3. Gray Relational Analysis

Gray relational analysis (GRA) evaluates the trend similarity between features and target variables [23]. Since temporal variations may influence the relationships among indices, GRA can effectively identify key factors affecting a system [59]. GRA normalizes the dataset to eliminate numerical range differences and uses the gray relational coefficient (GRC) to quantify the similarity between each feature and the target variable, calculated as follows:

γ_{0 i} (k) = \frac{∆_{m i n} + ρ ∆_{m a x}}{∆_{0 i} (k) + ρ ∆_{m a x}}

(5)

where

γ_{0 i} (k)

is the GRC of the ith feature at the kth data point,

∆_{m i n}

and

∆_{m a x}

are the minimum and maximum absolute differences among all sequences,

∆_{0 i} (k)

represents the absolute difference between the ith feature and the target variable at the kth point, and

ρ

is the distinguishing coefficient, which regulates the influence of the minimum and maximum differences on the correlation coefficient and is typically set to 0.5. The gray correlation degree (GCD) is computed to quantify the overall influence of a feature on the target variable and is calculated as follows:

γ_{i} = \frac{1}{n} \sum_{k = 1}^{n} γ_{0 i} (k)

(6)

where

γ_{i}

is the average GRC between the ith feature and the target variable, and n is the number of data points for the ith feature. Finally, features are ranked based on their GCD values, allowing for the identification of key features that most significantly influence the target variable [60].

2.6. Model Training and Evaluation Metrics

This study employed eight regression algorithms to explore the relationship between tea plant image indices and physiological parameters, including polynomial regression (PR), partial least squares regression (PLSR), lasso regression (LR), ridge regression (RR), decision tree regression (DTR), random forest regression (RFR), eXtreme gradient boosting (XGBoost), and the light gradient boosting machine (LightGBM). The hyperparameters adjusted for each model in this study are summarized in Table 1.

Polynomial regression (PR) is an extension of linear regression that incorporates higher-order terms of independent variables to fit nonlinear relationships, allowing the model to capture curved relationships among variables. Partial least squares regression (PLSR) integrates the strengths of principal component analysis, canonical correlation analysis, and multiple linear regression [61]. PLSR is particularly effective for handling multicollinearity among independent variables while reducing data dimensionality [62,63,64]. Lasso regression (LR) and ridge regression (RR) are regularized linear regression models but differ in their regularization approaches. Lasso regression applies an L1 regularization term in the loss function, forcing the coefficients of less important variables to shrink to zero [65,66], thereby performing automatic feature selection [67]. In contrast, ridge regression employs an L2 regularization term, which minimizes the loss function to handle multicollinear regression data, preventing overfitting when a large number of predictors are included [68]. The regularization strength for both models is controlled by the α parameter—higher values retain only the most influential variables, while lower values reduce regularization, increasing the risk of overfitting [69]. Decision tree regression (DTR) is a tree-structured model that simplifies complex decision-making through binary splits, progressively partitioning data into smaller subsets [70,71]. As decision trees are directly constructed from training samples, pruning is required to improve model generalization. Random forest regression (RFR) is an ensemble learning method based on decision trees [72] and is suitable for nonlinear and high-dimensional data. By constructing multiple randomized, decorrelated decision trees and averaging their results [73], RFR improves prediction performance while reducing overfitting using a bagging ensemble strategy [64,68,74]. eXtreme Gradient Boosting (XGBoost) is an enhanced gradient boosting algorithm [75] that divides the dataset into multiple subsets, trains base learners, and then aggregates their weighted predictions [74]. XGBoost runs efficiently and randomly selects feature indices to reduce overfitting risks. Light gradient boosting machine (LightGBM) is a high-efficiency ensemble learning algorithm built on the gradient boosting decision tree framework [76]. LightGBM employs a histogram-based decision tree algorithm, which accelerates node-splitting calculations by grouping features. Additionally, a leaf-wise growth strategy is used, splitting the leaf node with the highest gain, enabling the model to better capture nonlinear relationships in the data.

The dataset was trained and validated using three-fold cross-validation to reduce the risk of model overfitting. Model performance was evaluated using the coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE). The formula is as follows:

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i}^{m} - y_{i}^{p})}^{2}}{\sum_{i = 1}^{n} {(y_{i}^{m} - y_{i}^{a})}^{2}}

(7)

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i}^{m} - y_{i}^{p})}^{2}}

(8)

M A E = \frac{1}{n} \sum_{i = 1}^{n} |y_{i}^{m} - y_{i}^{p}|

(9)

where i represents the sample index, and n denotes the total number of samples,

y_{i}^{m}

refers to the measured physiological parameter value of the ith sample,

y_{i}^{p}

represents the predicted physiological parameter value of the ith sample, and

y_{i}^{a}

denotes the average of the measured physiological parameter values in the dataset.

3. Results

3.1. Effects of Elevation Distribution and Farming Method Differences on Tea Plant Physiology

Table 2 presents the statistical data for three physiological parameters under two farming methods. The results indicate that the leaf area index (LAI) had a higher mean value under conventional farming (mean = 4.878) and exhibited a relatively stable data distribution (CV = 0.349). This stability may be associated with the management practices in conventional farming, which promote more uniform tea plant growth and reduce competition from weeds or other vegetation. In contrast, the mean quantum yield of PSII (ΦPSII) was higher under agroecological farming (mean = 0.4586), suggesting a potential advantage of this method in enhancing photosynthetic efficiency.

Figure 5 illustrates the distribution of the three physiological indices across low, mid-, and high elevations, comparing the measurement results between conventional and agroecological farming, along with the results of statistical significance testing. T-tests (* indicating significant differences between CFM and AFM at each elevation) showed that LAI differed significantly only at mid-elevation (p < 0.05), while PRI exhibited significant differences at both low and mid-elevations (p < 0.05). ΦPSII showed a significant difference only at low elevation (p < 0.05). To further analyze the variability across different elevations, a one-way analysis of variance (ANOVA) was performed. If the assumption of homogeneity of variance was not met, a Welch test was applied [77], with differences presented using the letter grouping method. The results indicated that under conventional farming, the three physiological parameters showed significant differences between low and mid-elevations, as well as between low and high elevations, while no significant differences were observed between mid- and high elevations. This result may be related to pruning practices in low-elevation plantations, where lower branches and older leaves are removed for management purposes. In contrast, higher-elevation plantations tend to have lower management intensity due to their larger area, allowing for the retention of larger canopy leaves, which could contribute to the overall higher LAI values. Under agroecological farming, both LAI and PRI exhibited significant differences across all three elevations, while ΦPSII did not show significant differences.

3.2. Feature Ranking of Image Indices

Figure 6 presents the heatmap of importance scores for three physiological parameters (LAI, PRI, and ΦPSII) based on three feature ranking methods, separately for two image datasets (CIs and MIs). The results show that compared to CIs, the importance scores of MIs are generally higher, indicating that multispectral images provide greater discriminative power and higher applicability in assessing plant physiological status.

3.3. Comparison of Regression Model Accuracy for Tea Plant Physiological Parameter Estimation

This study employed eight regression models to evaluate the predictive accuracy of CIs and MIs for estimating three physiological parameters (Figure 7, Figure 8 and Figure 9). Overall, model accuracy improved with an increase in the number of independent variables. However, the PR model exhibited unstable performance, with fluctuating accuracy values, indicating that simple nonlinear models have limitations in capturing the relationship between image indices and tea plant physiological parameters. In contrast, XGBoost, LightGBM, RFR, and DTR demonstrated slightly better accuracy than PLSR and the two regularized regression models (LR and RR), with XGBoost achieving the highest regression performance. The improvement in regression accuracy with XGBoost varied across different models. Compared to PR and PLSR, the accuracy gain ranged from 0 to 0.919 and 0.01 to 0.857, respectively. For regularized models (LR and RR), the improvement ranged from 0.01 to 0.849, while for decision tree-based models (DTR and RFR), the improvement was between 0.02 and 0.499. When compared to LightGBM, the accuracy gain was 0.006 to 0.375. Additionally, in the PR, PLSR, LR, and RR models, CIs struggled to effectively capture physiological parameter variations, while MIs often exhibited accuracy saturation. Although LightGBM and XGBoost performed comparably in most cases, XGBoost tended to capture physiological variations more efficiently, allowing it to achieve slightly better accuracy than LightGBM with fewer independent variables. This suggests that, given the same training and validation dataset, gradient boosting-based models demonstrate a higher capability in capturing the nonlinear relationships between tea plant physiological parameters and image indices. This can be attributed to the iterative optimization process of gradient boosting, which systematically reduces errors by compensating for uncaptured features and effectively accounting for interactions among variables.

This suggests that gradient boosting-based models are more effective at capturing the nonlinear relationships between tea plant physiological parameters and image indices. Regarding feature ranking methods, the differences in mean accuracy across the three methods were less than 0.099, indicating no significant differences. However, PCA-based ranking resulted in slightly lower prediction accuracy. The MRMR ranking method yielded more stable accuracy for CIs, while GRA-based ranking produced the best results for MIs. A comparison between the two types of image indices showed that using MIs generally resulted in higher prediction accuracy than using CIs. Specifically, this improvement was observed across all three physiological parameters, with LAI, PRI, and ΦPSII showing increases of approximately 15–115%, 98–179%, and 0–96%, respectively, when using MIs instead of CIs. These findings indicate that combining MIs with gradient boosting models provides greater suitability for capturing the nonlinear relationships of tea plant physiological parameters, while the choice of feature ranking method has a relatively minor impact on regression accuracy.

This study compares the performance of CIs and MIs in predicting three physiological parameters and evaluates regression accuracy using different feature ranking methods. The results indicate that the variation in model accuracy across different feature ranking methods was less than 0.04, suggesting that the chosen feature ranking approaches had no significant impact on regression accuracy in this study. For LAI regression performance, both CIs and MIs achieved an R² exceeding 0.59, with RMSE ranging from 1.049 to 1.214 and MAE between 0.632 and 0.759. In PRI regression, the predictive power of CIs was weak (R² = 0.28), whereas MIs improved R² to 0.643, demonstrating the superior explanatory capability of multispectral data for PRI. At this stage, RMSE was 0.013, and MAE was 0.009, with MAE lower than the standard deviation of PRI under both farming methods (CFM: 0.0196, AFM: 0.0251), indicating that the prediction error was within an acceptable range. For ΦPSII prediction, both CIs and MIs achieved the highest predictive accuracy, with R² exceeding 0.90, RMSE ranging from 0.048 to 0.052, and MAE between 0.013 and 0.024, indicating stable model performance. The optimal predictive results for the three physiological parameters using CIs and MIs indicate that the prediction accuracy of LAI and PRI improved by approximately 16.4% and 118.2%, respectively, with MIs, while RMSE decreased by 12.8% and 28.1%. For ΦPSII, the results indicate that the accuracy achieved using CIs and MIs was comparable. These findings suggest that MIs are more effective in capturing variations in tea plant physiological parameters in this study. Additionally, among all prediction results, only PRI prediction using CIs achieved the highest accuracy with the RFR model, while all other predictions achieved optimal accuracy using XGBoost. This highlights that gradient boosting regression models exhibit superior performance in predicting tea plant physiological parameters. The findings indicate that the models used in this study could explain 70–90% of the variance in LAI and ΦPSII data. While PRI predictions exhibited lower accuracy (approximately 65%), this level of precision remains within the acceptable range for crop physiological parameter modeling applications (Table 3).

This study further explored the relationship between the number of independent variables and regression accuracy, using the highest regression accuracy as a reference and setting 95% of this accuracy as the threshold to iteratively determine the number of independent variables required to meet this criterion. This approach reduces the number of independent variables while maintaining comparable accuracy. The results indicate that optimal accuracy is generally achieved when using more than half or even all variables, with GRA-based ranking performing best (see Table 4). In the MIs application, the highest accuracy for LAI and PRI was 0.716 and 0.643, respectively, with PRI showing a significant improvement of 0.359 compared to CIs. In contrast, the accuracy difference for ΦPSII between the two image datasets was minimal (0.001). When applying the 95% confidence interval as the accuracy threshold, the overall accuracy decreased slightly by approximately 0.014 to 0.046, but the number of required independent variables was significantly reduced. The number of independent variables for LAI and ΦPSII decreased by 32 and 19, respectively, while PRI required only 2 fewer variables. These findings suggest that image indices are more effective for predicting LAI and ΦPSII, whereas PRI may require additional information to accurately reflect its variations.

3.4. Effects of Elevation and Seasonal Conditions on Prediction Accuracy

Figure 10 illustrates the prediction errors of the best-performing regression model for estimating physiological parameters across different seasons. The results indicate that for LAI predictions, the conventional farming dataset exhibited a wider range of prediction errors (RMSE: 1.029–1.776), while the agroecological farming dataset contained more outliers. In terms of elevation, the high-elevation datasets from both farming methods showed the highest prediction accuracy across all seasons (RMSE: 0.021–0.107). At mid- and low-elevation plantations, the agroecological farming dataset exhibited smaller prediction errors (RMSE: 0.507–1.240), possibly due to lower intervention management strategies, which made plants more susceptible to field conditions and resulted in more extreme values. Additionally, predictions for conventional farming tended to be underestimated, which may be attributed to higher plant density and uniform growth, causing actual LAI values to be relatively high. Conversely, the overestimation observed in agroecological farming could be related to sparser plant distribution. Seasonal variations also influenced prediction accuracy, with higher prediction errors in summer and autumn, likely due to harvesting activities during the growing season, which made LAI estimation more complex.

For PRI predictions, the highest prediction errors occurred at mid-elevation plantations (RMSE: 0.013–0.026), whereas high-elevation plantations had the lowest prediction errors (RMSE: 0.003–0.015). Regarding farming methods, conventional farming exhibited relatively stable prediction errors across elevations and seasons (RMSE: 0.010–0.017). In contrast, for agroecological farming, low-elevation predictions were generally underestimated, while mid-elevation predictions showed higher errors in spring and winter (RMSE: 0.023–0.026) and lower errors in summer and autumn (RMSE: 0.015–0.018).

For ΦPSII predictions, the estimated values were slightly overestimated in most plantations. The agroecological farming dataset generally exhibited lower prediction errors (RMSE: 0.022–0.084), with autumn showing the highest errors among all seasons. In contrast, prediction errors for conventional farming (RMSE: 0.030–0.130) decreased with increasing elevation. These findings highlight the complex interactions between elevation, farming method, and seasonal effects on the prediction accuracy of tea plant physiological parameters.

4. Discussion

4.1. Differences in Tea Plant Physiological Parameters Across Elevations and Farming Methods

This study examines the physiological variations and differences in tea plants across elevations and two farming methods. Previous research has primarily focused on the impact of different farming methods on soil properties and yield differences [78,79], suggesting that these variations may result from a combination of geographical factors, crop types, and farming methods [80,81]. However, the physiological status of crops during their growth stages is a key determinant of yield and quality, yet it has received relatively less attention. Therefore, this study analyzes the physiological parameters of tea plants grown using CFMs and AFMs to explore the effects and differences.

The results indicate that at all elevations, LAI values were consistently higher in CFM than in AFM, with a significant difference observed only at mid-elevation. This phenomenon aligns with findings that tea yield under AFM conditions ranged from 20% to 80% of that under CFM conditions. Similarly, Schärer et al. [82] reported that winter wheat grown using organic farming had significantly lower LAI than that grown using conventional farming. Petcu et al. [83] further speculated that this discrepancy might be attributed to lower water and nutrient availability in organic farming systems, as organic fertilizers release nutrients more gradually, leading to relatively lower soil nitrogen content [84]. Olsen and Weiner [85] found that LAI is significantly correlated with nitrogen availability, and when nutrient supply is insufficient, leaf growth may be suppressed, directly impacting LAI performance. Although PRI and ΦPSII did not show significant differences between the two farming methods, values were generally higher in agroecological farming, suggesting that tea plants in AFMs may have advantages in physiological function and light energy utilization efficiency. Furthermore, tea plants grown using AFM exhibited greater physiological hardening effects in response to challenging environmental conditions than those grown using CFM. This suggests that under future extreme weather conditions, AFM helps maintain tea plant physiological health and stress resilience, offering greater long-term sustainability under global climate change conditions. Chen et al. [86] supported this perspective, showing that stomatal conductance in conventionally farmed tea plants was consistently lower than that in soil-cultured plants across all seasons, leading to reduced photosynthetic capacity and increased susceptibility to water stress. Over time, the sustained application of soil-based cultivation practices may improve tea plant resilience to climate change, particularly in ecosystems with irregular rainfall distribution. Overall, the results suggest that differences in tea plant physiological parameters between the two farming methods were relatively limited. However, although the yield under agroecological farming conditions is lower than that of conventional farming, it contains higher concentrations of antioxidant compounds, suggesting potential health benefits [84]. Additionally, the use of organic fertilizers significantly enhances tea quality and benefits soil fertility [87]. Furthermore, this study found that agroecological farming demonstrates advantages in environmental adaptability and light use efficiency, indicating that tea plants grown using agroecological farming are more resilient to environmental changes.

4.2. Effects of Feature Ranking Methods and Image Indices on Prediction Models

Feature selection is an effective approach for handling high-dimensional datasets, enabling the construction of a streamlined dataset that prioritizes variables with the greatest influence on the target variable [88]. The results indicate that the rankings produced by the three feature ranking methods varied, which is likely attributable to differences in algorithmic efficiency and performance characteristics [89]. This study also found that feature ranking methods had a limited impact on improving the accuracy of LR and RR models, a trend similarly observed by Shahsavari et al. [90]. This is primarily because regularization algorithms inherently balance feature importance through penalty terms, making the effect of pre-selection relatively insignificant for these models.

Furthermore, this study revealed that when using MIs to estimate physiological parameters, GRA-based feature ranking consistently yielded the highest regression accuracy. However, GRA has limitations when the relationship between independent and dependent variables is weak, making it less effective in distinguishing feature importance [91]. For instance, when CIs were used as independent variables, LAI and ΦPSII achieved the best performance using MRMR and Pearson correlation-based rankings, respectively. Overall, the impact of feature ranking methods on model accuracy was relatively minor in this study, whereas reducing the number of independent variables improved computational efficiency. Future research should consider the application when selecting appropriate feature selection methods, ensuring that the chosen approach effectively identifies key features and evaluates their influence on prediction accuracy.

4.3. Applicability Analysis of Regression Models

This study compared the performance of eight regression models to evaluate their applicability in predicting tea plant physiological parameters. The results indicate that the PR model exhibited lower accuracy and greater fluctuations, likely due to its sensitivity to multicollinearity and its limited ability to effectively model the relationship between image indices and crop physiological parameters [92]. In contrast, machine learning algorithms demonstrated superior performance in establishing relationships between image indices and crop physiological parameters, making them a more suitable choice for physiological parameter estimation [93]. When MIs were used for estimating physiological parameters, LAI prediction accuracy improved by approximately 0.095, with the largest improvement observed for PRI (0.359), while ΦPSII was less affected, highlighting the advantages of multispectral data in capturing crop physiological characteristics. Similarly, Zhang et al. [94] reported that multivariate regression models using multispectral data outperformed those based on visible-light data.

Our results also show that increasing the number of selected features generally improves model accuracy, but the magnitude of improvement is limited. To address this, we compared the number of features required for optimal accuracy versus a 5% reduction in accuracy. For LAI and ΦPSII, reducing accuracy by 5% resulted in a 60–90% reduction in the number of required features, providing advantages in feature computation and model processing time. Duro et al. [95] similarly found that removing 30–60% of irrelevant variables improved model interpretability while reducing accuracy by only 0.5%. Additionally, model performance metrics indicate the suitability of a method rather than its superiority. While an R² value closer to 1 suggests higher accuracy, for high-variability systems, an R² range of 0.7–0.9 is generally acceptable [96]. In wheat LAI prediction, Wu et al. [97] demonstrated that using a small set of UAV-derived features combined with multiple linear regression yielded efficient predictions (R² = 0.679, RMSE = 1.231), while data fusion from multiple sensors further improved accuracy (R² = 0.815, RMSE = 1.023). Wittstruck et al. [98] successfully predicted winter wheat LAI (R² = 0.83, RMSE = 0.41) using UAV-derived visible-light data and plant height information. Similarly, Li et al. [99] applied visible UAV imagery, integrating CIs and texture features to estimate wheat LAI, achieving higher regression accuracy (R² = 0.730, RMSE = 0.691) than models using individual features.

The results of this study show that while MIs significantly improved PRI prediction accuracy, their underlying physiological characteristics remain incompletely understood. Meanwhile, LAI and ΦPSII predictions reached acceptable accuracy levels. The best-performing model across all three physiological parameters was XGBoost, highlighting its capability to handle nonlinear relationships and complex feature interactions [100,101]. This demonstrates the effectiveness of XGBoost in achieving a strong generalization ability for predicting tea plant physiological parameters, providing a scientific foundation for future agricultural applications and model improvements.

5. Conclusions

This study utilized UAV-derived visible and multispectral imagery to estimate and analyze three physiological indices of tea plants (LAI, PRI, and ΦPSII). We evaluated color indices (CIs) and multispectral indices (MIs) and ranked their importance using Pearson correlation analysis (PCA), minimum redundancy maximum relevance (MRMR), and gray relational analysis (GRA) to develop predictive models for tea plant physiological parameters. Statistical analysis of tea plant physiological parameters revealed that while tea plants grown using agroecological farming methods (AFMs) did not show an advantage in external characteristics, this difference was indirectly reflected in yield variations between conventional farming methods (CFMs) and AFMs. However, physiological parameters related to light use efficiency and environmental adaptability suggest that tea plants grown using AFMs exhibit slightly higher resilience to environmental conditions, potentially offering greater long-term ecological benefits. The feature ranking results indicate that GRA achieved the best performance, while MIs exhibited a more robust explanatory capability for tea plant physiological parameters compared to CIs derived from visible-light imagery. Regarding regression model performance, XGBoost demonstrated superior predictive accuracy across all three physiological parameters. Based on the model configurations, the best-performing combinations for each physiological parameter were: (1) LAI: MI-GRA-XGBoost (R² = 0.716, RMSE = 1.01, MAE = 0.683), (2) PRI: MI-GRA-XGBoost (R² = 0.643, RMSE = 0.013, MAE = 0.009), and (3) ΦPSII: CI-PCA-XGBoost (R² = 0.920, RMSE = 0.048, MAE = 0.013). These findings demonstrate the advantage of combining multispectral data with gradient boosting models for effectively capturing the complex physiological characteristics of tea plants.

This study framework focuses on developing a generalizable predictive model for tea plant physiological parameters to assess the feasibility of remote sensing-based monitoring methods for broader applications in tea plantations. The core objective is to evaluate the strategy of reducing the number of independent variables by allowing a slight trade-off in accuracy within an acceptable prediction error range while assessing the performance of various basic regression algorithms. Therefore, future research may refer to the feature selection approach used in this study to identify optimal features and further explore ensemble models or deep learning techniques to enhance predictive performance. The findings validate the potential of integrating multispectral data with feature ranking methods for predicting tea plant physiological parameters, providing a scientific reference for data-driven management and applications in precision agriculture.

Author Contributions

Conceptualization, Z.-H.Z., H.-P.T. and C.-I.C.; methodology, Z.-H.Z. and H.-P.T.; software, Z.-H.Z. and H.-P.T.; validation, Z.-H.Z., H.-P.T. and C.-I.C.; formal analysis, Z.-H.Z. and H.-P.T.; investi-gation, Z.-H.Z.; resources, H.-P.T.; data curation, Z.-H.Z. and C.-I.C.; writing—original draft preparation, Z.-H.Z.; writing—review and editing, H.-P.T. and C.-I.C.; visualization, Z.-H.Z. and H.-P.T.; supervision, H.-P.T. and C.-I.C.; project administration, H.-P.T.; funding acquisition, H.-P.T. and C.-I.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially financially supported by the “Innovation and Development Center of Sustainable Agriculture” from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan. Additionally, support was provided by the National Science and Technology Council under projects 111-2121-M-005-002-, 112-2321-B-005-007-, 112-2634-F-005-002-, 112-2119-M-005-001-, 112-2121-M-005-003-, 112-2621-M-002-005-MY3, 113-2321-B-005-005-, 113-2121-M-005-005-, and 113-2119-M-005-001-.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are included in the article. Further inquiries can be made to the corresponding author upon request.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Figure A1. Orthomosaic images for each tea plantation.

Table A1. Basic information for each tea plantation.

Low elev.	Field	CLS001	CLS002		CLS003		CLS004			CLJ001		ALS001			ALS002			ALS003
	Area	3399	1142		1616		1110			6499		1651			2697			3065
	Slope	1.74	3.04		4.12		1.96			4.17		1.74			1.36			4.19
	Elev.	329	279		388		324			371		315			348			288
Mid-elev.	Field	CMC001		CMJ001		CMJ002			AMC001				AMJ001			AMJ002
	Area	1131		1658		1911			1131				1658			1911
	Slope	1.04		16.87		3.70			1.04				16.87			3.70
	Elev.	565		826~833		924~926			565				826~833			924~926
High elev.	Field	CHO001	CHO002		CHO003			AHO001			AHO002			AHO003			AHJ001
	Area	28,749	5122		16,138			3412			10,744			4805			2082
	Slope	42.35	36.83		24.35			18.19			35.65			20.63			27.25
	Elev.	1510~1572	1406~1451		1381~1414			1523~1536			1476~1508			1452~1467			1471~1478

Field naming: The first letter: “C” represents conventional farming tea plantations, while “A” represents agroecological farming tea plantations. The second letter: “L” means low elevation, “M” means mid-elevation, and “H” means high elevation. The third letter: “S” denotes Sijichun, “J” denotes Jinxuan, “C” denotes Chin-Shin-Dapan, and “O” denotes Chin-Shin-Oolong. Uniy: Area (m²); Slope (%); Elev. (m).

Table A2. Names and formulae of color indices (CIs) and multispectral indices (MIs).

Color Indices (CIs)
Normalized Blue	$b = \frac{B}{R + G + B}$
Normalized Green	$g = \frac{G}{R + G + B}$
Normalized Red	$r = \frac{R}{R + G + B}$
Color Index of Vegetation	$C I V E = 0.441 \times r - 0.811 \times g + 0.3856 \times b + 18.78745$
Excess Blue Vegetation Index	$E x B = 1.4 \times b - g$
Excess Green Vegetation Index	$E x G = 2 \times g - r - b$
Excess Red Vegetation Index	$E x R = 1.4 \times r - g$
Excess Green Minus Excess Red Index	$E x G R = E x G - E x R$
Green Leaf Index
Green–Red Vegetation Index	$G L I = \frac{2 \times G - R - B}{2 \times G + R + B}$
Color Intensity Index	$I N T = \frac{R + G + B}{3}$
Kawashima Index	$I K A W = \frac{R - B}{R + B}$
Principal Component Analysis Index	$I P C A = 0.994 \times \|R - B\| + 0.961 \times \|G - B\| + 0.914 \times \|G - R\|$
Modified Green–Red Vegetation Index	$M G R V I = \frac{G^{2} - R^{2}}{G^{2} + R^{2}}$
Red–Green–Blue Vegetation Index	$R G B V I = \frac{G^{2} - B \times R}{G^{2} + B \times R}$
Hue	$θ = {c o s}^{- 1} \{\frac{\frac{1}{2} [(R - G) + (R - B)]}{\sqrt{{(R - G)}^{2} + (R - B) (G - B)}}\}$ $H U E = \{\begin{matrix} θ \\ 360 ° - θ \end{matrix} \begin{matrix} i f \\ i f \end{matrix} \begin{matrix} B \leq G \\ B > G \end{matrix}\}$
Saturation	$S A T = 1 - \frac{3}{R + G + B} m i n (R, G, B)$
Value	$V A L = \frac{1}{3} (R + G + B)$
Multispectral indices (MIs)
Anthocyanin Reflectance Index	$A R I = \frac{(R E - G)}{(G \times R E)}$
Atmospherically Resistant Vegetation Index	$A R V I = \frac{[N I R - (R - 2 \times (B - R))]}{[N I R + (R - 2 \times (B - R))]}$
Blue Normalized Difference Vegetation Index	$B N D V I = \frac{N I R - B}{N I R + B}$
Chlorophyll Index Red Edge	$C I R E = \frac{N I R}{R E} - 1$
Chlorophyll Vegetation Index	$C V I = \frac{(N I R \times R)}{(G^{2})}$
Difference Vegetation Index	$D V I = N I R - R$
Enhanced Normalized Difference Vegetation Index	$E N D V I = \frac{[(N I R + G) - 2 \times B]}{[(N I R + G) + 2 \times B]}$
Enhanced Vegetation Index	$E V I = \frac{2.5 \times (N I R - R)}{(N I R + 6 \times R - 7.5 \times B + 1)}$
Enhanced Vegetation Index 2	$E V I 2 = \frac{2.5 \times (N I R - R)}{(N I R + 2.4 \times R + 1)}$
Green–Blue Normalized Difference Vegetation Index	$G B N D V I = \frac{N I R - (G + B)}{N I R + (G + B)}$
Green Atmospherically Resistant Index	$G A R I = \frac{N I R}{G} - 1$
Green Difference Vegetation Index	$G D V I = N I R - G$
Green Leaf Index	$G L I = \frac{2 \times G - R - B}{2 \times G + R + B}$
Green Normalized Difference Vegetation Index	$G N D V I = \frac{(N I R - G)}{(N I R + G)}$
Green Optimized Soil-Adjusted Vegetation Index	$G O S A V I = \frac{G - R}{G + R + 0.16}$
Green–Red Normalized Difference Vegetation Index	$G R N D V I = \frac{N I R - (G + R)}{N I R + (G + R)}$
Green–Red Vegetation Index	$G R V I = \frac{(G - R)}{(G + R)}$
Green Soil-Adjusted Vegetation Index	$G S V A I = \frac{1.5 \times (N I R - G)}{N I R + G + 0.5}$
Green Vegetation Index	$G V I = \frac{N I R}{G}$
Infrared Percentage Vegetation Index	$I P V I = \frac{N I R}{N I R + R}$
Leaf Chlorophyll Index	$L C I = \frac{N I R - R E}{N I R + R}$
Modified Chlorophyll Absorption in Reflectance Index	$M C A R I = (R E - R) - 0.2 \times (R E - G) \times \frac{R E}{R}$
Modified Normalized Difference Blue Index	$m N D B l u e = \frac{(B - R E)}{(B + N I R)}$
Modified Nonlinear Vegetation Index	$M N L I \frac{1.5 \times ({N I R}^{2} - R)}{({N I R}^{2} + R + 0.5)}$
Modified Soil-Adjusted Vegetation Index 2	$M S A V I 2 = \frac{2 \times N I R + 1 - \sqrt{{(2 \times N I R + 1)}^{2} - 8 \times (N I R - R)}}{2}$
Modified Simple Ratio	$M S R = \frac{\frac{N I R}{R} - 1}{\sqrt{\frac{N I R}{R}} + 1}$
Red Edge Modified Simple Ratio	$M S R R E = \frac{\frac{N I R}{R E} - 1}{\sqrt{\frac{N I R}{R E}} - 1}$
Modified Red Edge Simple Ratio	$m S R = \frac{N I R - B}{R E - B}$
Modified Triangular Vegetation Index 1	$M T V I 1 = 1.2 \times [1.2 \times (N I R - G) - 2.5 \times (R - G)]$
Modified Triangular Vegetation Index	$M T V I 2 = \frac{1.5 \times [1.2 \times (N I R - G) - 2.5 \times (R - G)]}{\sqrt{{(2 \times N I R + 1)}^{2} - (6 \times N I R - 5 \sqrt{R}) - 0.5}}$
Normalized Difference Red Edge Index	$N D R E = \frac{N I R - R E}{N I R + R E}$
Normalized Difference Red Edge/Red	$N D R E R = \frac{R E - R}{R E + R}$
Normalized Difference Vegetation Index	$N D V I = \frac{N I R - R}{N I R + R}$
Normalized Green Intensity	$N G I = \frac{G}{R + G + B}$
Nonlinear Vegetation Index	$N L I = \frac{{N I R}^{2} - R}{{N I R}^{2} + R}$
Normalized Red–Blue Difference Index	$N R B D I = \frac{G - B}{G + B}$
Optimized Soil-Adjusted Vegetation Index	$O S A V I = \frac{1.16 \times (N I R - R)}{(N I R + R + 0.16)}$
Pan Normalized Difference Vegetation Index	$P N D V I = \frac{N I R - (G + R + B)}{N I R + (G + R + B)}$
Plant Senescence Reflectance Index	$P S R I = \frac{R - G}{R E}$
Red–Blue Normalized Difference Vegetation Index	$R B N D V I = \frac{N I R - (R + B)}{N I R + (R + B)}$
Renormalized Difference Vegetation Index	$R D V I = \frac{N I R - R}{\sqrt{N I R + R}}$
Red–Green–Blue Vegetation Indices	$R G B V I = \frac{G^{2} - B \times R}{G^{2} + B \times R}$
Red–Green Index	$R G I = \frac{R}{G}$
Ratio Vegetation Index	$R V I = \frac{N I R}{R}$
Soil-Adjusted Vegetation Index	$S A V I = \frac{1.5 \times (N I R - R)}{(N I R + R + 0.5)}$
Structure Insensitive Pigment Index	$S I P I = \frac{(N I R - B)}{(N I R - R)}$
Spectral Polygon Vegetation Index	$S P V I = 0.4 \times (3.7 \times (N I R - R) - 1.2 \times \|G - R\|)$
Transformed Chlorophyll Absorption in Reflectance Index	$T C A R I = \frac{3 \times [(R E - R) - 0.2 \times (R E - G) \times (R E / R)]}{[1.16 \times (N I R - R) / N I R + R + 0.16]}$
Transformed Difference Vegetation Index	$T D V I = \frac{1.5 \times (N I R - R)}{\sqrt{{N I R}^{2} + R + 0.5}}$
Triangular Vegetation Index	$T V I = 60 \times (N I R - G) - 100 \times (R - G)$

References

Li, S.; Wu, X.; Xue, H.; Gu, B.; Cheng, H.; Zeng, J.; Peng, C.; Ge, Y.; Chang, J. Quantifying carbon storage for tea plantations in China. Agric. Ecosyst. Environ. 2011, 141, 390–398. [Google Scholar]
Pramanik, P.; Phukan, M. Potential of tea plants in carbon sequestration in North-East India. Environ. Monit. Assess. 2020, 192, 211. [Google Scholar] [PubMed]
Chettri, V.; Ghosh, C. Tea Gardens, A Potential Carbon-Sink for Climate Change Mitigation. Curr. Agric. Res. J. 2023, 11, 695–704. [Google Scholar]
Debnath, B.; Haldar, D.; Purkait, M.K. Potential and sustainable utilization of tea waste: A review on present status and future trends. J. Environ. Chem. Eng. 2021, 9, 106179. [Google Scholar] [CrossRef]
Pan, S.Y.; Nie, Q.; Tai, H.C.; Song, X.L.; Tong, Y.F.; Zhang, L.J.F.; Wu, X.-W.; Lin, Z.-H.; Zhang, Y.-Y.; Ye, D.-Y.; et al. Tea and tea drinking: China’s outstanding contributions to the mankind. Chin. Med. 2022, 17, 27. [Google Scholar] [CrossRef]
Eitzinger, A.; Läderach, P.; Quiroga, A.; Pantoja, A.; Gordon, J. Future Climate Scenarios for Kenya’s Tea Growing Areas; International Center for Tropical Agriculture: Palmira, Colombia; Consultative Group on International Agricultural Research: Montpellier, France, 2011. [Google Scholar]
Zhong, C.; Cheng, S.; Kasoar, M.; Arcucci, R. Reduced-order digital twin and latent data assimilation for global wildfire prediction. Nat. Hazards Earth Syst. Sci. 2023, 23, 1755–1768. [Google Scholar]
Xu, Z.; Li, J.; Cheng, S.; Rui, X.; Zhao, Y.; He, H.; Xu, L. Wildfire risk prediction: A review. arXiv 2024, arXiv:2405.01607. [Google Scholar]
Hussain, S.; Ulhassan, Z.; Brestic, M.; Zivcak, M.; Zhou, W.; Allakhverdiev, S.I.; Yang, X.; Safdar, M.E.; Yang, W.; Liu, W. Photosynthesis research under climate change. Photosynth. Res. 2021, 150, 5–19. [Google Scholar]
RM. Climate Change and the Tea Industry. Assignment: Climate Change Challenge. 2016. Available online: https://d3.harvard.edu/platform-rctom/submission/climate-change-and-the-tea-industry/ (accessed on 21 December 2024).
Ahmed, S.; Stepp, J.R.; Orians, C.; Griffin, T.; Matyas, C.; Robbat, A.; Cash, S.; Xue, D.; Long, C.; Unachukwu, U.; et al. Effects of extreme climate events on tea (Camellia sinensis) functional quality validate indigenous farmer knowledge and sensory preferences in tropical China. PLoS ONE 2014, 9, e109126. [Google Scholar]
Jayasinghe, S.L.; Kumar, L. Climate change may imperil tea production in the four major tea producers according to climate prediction models. Agronomy 2020, 10, 1536. [Google Scholar] [CrossRef]
De Costa, W.A.; Mohotti, A.J.; Wijeratne, M.A. Ecophysiology of tea. Braz. J. Plant Physiol. 2007, 19, 299–332. [Google Scholar]
Calzadilla, P.I.; Carvalho FE, L.; Gomez, R.; Neto, M.L.; Signorelli, S. Assessing photosynthesis in plant systems: A cornerstone to aid in the selection of resistant and productive crops. Environ. Exp. Bot. 2022, 201, 104950. [Google Scholar]
Chen, J.M.; Ju, W.; Ciais, P.; Viovy, N.; Liu, R.; Liu, Y.; Lu, X. Vegetation structural change since 1981 significantly enhanced the terrestrial carbon sink. Nat. Commun. 2019, 10, 4259. [Google Scholar] [PubMed]
Li, X.; Liu, Q.; Yang, R.; Zhang, H.; Zhang, J.; Cai, E. The design and implementation of the leaf area index sensor. Sensors 2015, 15, 6250–6269. [Google Scholar] [CrossRef] [PubMed]
Koetz, B.; Baret, F.; Poilvé, H.; Hill, J. Use of coupled canopy structure dynamic and radiative transfer models to estimate biophysical canopy characteristics. Remote Sens. Environ. 2005, 95, 115–124. [Google Scholar]
Asner, G.P.; Scurlock, J.M.; Hicke, J.A. Global synthesis of leaf area index observations: Implications for ecological and remote sensing studies. Glob. Ecol. Biogeogr. 2003, 12, 191–205. [Google Scholar]
Cao, X.; Zhou, Z.; Chen, X.; Shao, W.; Wang, Z. Improving leaf area index simulation of IBIS model and its effect on water carbon and energy—A case study in Changbai Mountain broadleaved forest of China. Ecol. Model. 2015, 303, 97–104. [Google Scholar]
Trotter, G.M.; Whitehead, D.; Pinkney, E.J. The photochemical reflectance index as a measure of photosynthetic light use efficiency for plants with varying foliar nitrogen contents. Int. J. Remote Sens. 2002, 23, 1207–1212. [Google Scholar]
Lu, Y.; Zhu, X. Response of mangrove carbon fluxes to drought stress detected by photochemical reflectance index. Remote Sens. 2021, 13, 4053. [Google Scholar] [CrossRef]
He, H.; Yang, R.; Jia, B.; Chen, L.; Fan, H.; Cui, J.; Yang, D.; Li, M.; Ma, F.Y. Rice photosynthetic productivity and PSII photochemistry under nonflooded irrigation. Sci. World J. 2014, 2014, 839658. [Google Scholar]
Hasan, U.; Sawut, M.; Chen, S. Estimating the leaf area index of winter wheat based on unmanned aerial vehicle RGB-image parameters. Sustainability 2019, 11, 6829. [Google Scholar] [CrossRef]
Atzberger, C. Advances in remote sensing of agriculture: Context description, existing operational monitoring systems and major information needs. Remote Sens. 2013, 5, 949–981. [Google Scholar] [CrossRef]
Lelong, C.C.; Burger, P.; Jubelin, G.; Roux, B.; Labbé, S.; Baret, F. Assessment of unmanned aerial vehicles imagery for quantitative monitoring of wheat crop in small plots. Sensors 2008, 8, 3557–3585. [Google Scholar] [CrossRef] [PubMed]
Sun, Q.; Sun, L.; Shu, M.; Gu, X.; Yang, G.; Zhou, L. Monitoring maize lodging grades via unmanned aerial vehicle multispectral image. Plant Phenomics 2019, 2019, 5704154. [Google Scholar] [PubMed]
Jiang, Q.; Huang, Z.; Xu, G.; Su, Y. MIoP-NMS: Perfecting crops target detection and counting in dense occlusion from high-resolution UAV imagery. Smart Agric. Technol. 2023, 4, 100226. [Google Scholar]
Sishodia, R.P.; Ray, R.L.; Singh, S.K. Applications of Remote Sensing in Precision Agriculture: A Review. Remote Sens. 2020, 12, 3136. [Google Scholar] [CrossRef]
Tsouros, D.C.; Bibi, S.; Sarigiannidis, P.G. A review on UAV-based applications for precision agriculture. Information 2019, 10, 349. [Google Scholar] [CrossRef]
Velusamy, P.; Rajendran, S.; Mahendran, R.K.; Naseer, S.; Shafiq, M.; Choi, J.G. Unmanned Aerial Vehicles (UAV) in precision agriculture: Applications and challenges. Energies 2021, 15, 217. [Google Scholar] [CrossRef]
Li, S.; Yuan, F.; Ata-UI-Karim, S.T.; Zheng, H.; Cheng, T.; Liu, X.; Tian, Y.; Zhu, Y.; Cao, W.; Cao, Q. Combining color indices and textures of UAV-based digital imagery for rice LAI estimation. Remote Sens. 2019, 11, 1763. [Google Scholar] [CrossRef]
Qiao, L.; Zhao, R.; Tang, W.; An, L.; Sun, H.; Li, M.; Wang, N.; Liu, Y.; Liu, G. Estimating maize LAI by exploring deep features of vegetation index map from UAV multispectral images. Field Crops Res. 2022, 289, 108739. [Google Scholar]
Duan, B.; Liu, Y.; Gong, Y.; Peng, Y.; Wu, X.; Zhu, R.; Fang, S. Remote estimation of rice LAI based on Fourier spectrum texture from UAV image. Plant Methods 2019, 15, 124. [Google Scholar] [CrossRef] [PubMed]
Gong, Y.; Yang, K.; Lin, Z.; Fang, S.; Wu, X.; Zhu, R.; Peng, Y. Remote estimation of leaf area index (LAI) with unmanned aerial vehicle (UAV) imaging for different rice cultivars throughout the entire growing season. Plant Methods 2021, 17, 88. [Google Scholar] [PubMed]
Ochiai, S.; Kamada, E.; Sugiura, R. Comparative analysis of RGB and multispectral UAV image data for leaf area index estimation of sweet potato. Smart Agric. Technol. 2024, 9, 100579. [Google Scholar]
Na, S.I.; Park, C.W.; So, K.H.; Ahn, H.Y.; Lee, K.D. Photochemical Reflectance Index (PRI) mapping using drone-based hyperspectral image for evaluation of crop stress and its application to multispectral Imagery. Korean J. Remote Sens. 2019, 35, 637–647. [Google Scholar]
Garrity, S.R.; Eitel, J.U.; Vierling, L.A. Disentangling the relationships between plant pigments and the photochemical reflectance index reveals a new approach for remote estimation of carotenoid content. Remote Sens. Environ. 2011, 115, 628–635. [Google Scholar]
Clemente, A.A.; Maciel, G.M.; Siquieroli AC, S.; de Araujo Gallis, R.B.; Pereira, L.M.; Duarte, J.G. High-throughput phenotyping to detect anthocyanins, chlorophylls, and carotenoids in red lettuce germplasm. Int. J. Appl. Earth Obs. Geoinf. 2021, 103, 102533. [Google Scholar]
Candiago, S.; Remondino, F.; De Giglio, M.; Dubbini, M.; Gattelli, M. Evaluating multispectral images and vegetation indices for precision farming applications from UAV images. Remote Sens. 2015, 7, 4026–4047. [Google Scholar] [CrossRef]
Tubuxin, B.; Rahimzadeh-Bajgiran, P.; Ginnan, Y.; Hosoi, F.; Omasa, K. Estimating chlorophyll content and photochemical yield of photosystem II (ΦPSII) using solar-induced chlorophyll fluorescence measurements at different growing stages of attached leaves. J. Exp. Bot. 2015, 66, 5595–5603. [Google Scholar]
Zou, T.; Zhang, J. A new fluorescence quantum yield efficiency retrieval method to simulate chlorophyll fluorescence under natural conditions. Remote Sens. 2020, 12, 4053. [Google Scholar] [CrossRef]
Sims, D.A.; Gamon, J.A. Relationships between leaf pigment content and spectral reflectance across a wide range of species, leaf structures and developmental stages. Remote Sens. Environ. 2002, 81, 337–354. [Google Scholar]
Barclay, H.J. Convert the total leaf area to the projected leaf area in lodgepole pine and Douglas-fir. Tree Physiol. 1998, 18, 185–193. [Google Scholar] [CrossRef] [PubMed]
Yang, K.; Gong, Y.; Fang, S.; Duan, B.; Yuan, N.; Peng, Y.; Wu, X.; Zhu, R. Combining Spectral and Texture Features of UAV Images for the Remote Estimation of Rice LAI throughout the Entire Growing Season. Remote Sens. 2021, 13, 3001. [Google Scholar] [CrossRef]
Gamon, J.A. The Dynamic 531-Nanometer A Reflectance Si qlnal: A Survey of Twenty Angiosperm Species. In Proceedings of the Photosynthetic Responses to the Environment, Honolulu, HI, USA, 24–27 August 1992; American Society of Plant Physiologists: Rockville, MD, USA, 1993; p. 172. [Google Scholar]
Peñuelas, J.; Filella, I.; Gamon, J.A. Assessment of photosynthetic radiation-use efficiency with spectral reflectance. New Phytol. 1995, 131, 291–296. [Google Scholar] [CrossRef]
Sukhova, E.; Sukhov, V. Relation of photochemical reflectance indices based on different wavelengths to the parameters of light reactions in photosystems I and II in pea plants. Remote Sens. 2020, 12, 1312. [Google Scholar] [CrossRef]
Gamon, J.A.; Field, C.B.; Bilger, W.; Björkman, O.; Fredeen, A.L.; Peñuelas, J. Remote sensing of the xanthophyll cycle and chlorophyll fluorescence in sunflower leaves and canopies. Oecologia 1990, 85, 1–7. [Google Scholar] [CrossRef]
Nakamura, Y.; Tsujimoto, K.; Ogawa, T.; Noda, H.M.; Hikosaka, K. Correction of photochemical reflectance index (PRI) by optical indices to predict non-photochemical quenching (NPQ) across various species. Remote Sens. Environ. 2024, 305, 114062. [Google Scholar] [CrossRef]
Zhang, C.; Filella, I.; Garbulsky, M.F.; Peñuelas, J. Affecting factors and recent improvements of the photochemical reflectance index (PRI) for remotely sensing foliar, canopy and ecosystemic radiation-use efficiencies. Remote Sens. 2016, 8, 677. [Google Scholar] [CrossRef]
Thénot, F.; Méthy, M.; Winkel, T. The Photochemical Reflectance Index (PRI) as a water-stress index. Int. J. Remote Sens. 2002, 23, 5135–5139. [Google Scholar] [CrossRef]
Winkel, T.; Méthy, M.; Thénot, F. Radiation use efficiency, chlorophyll fluorescence, and reflectance indices associated with ontogenic changes in water-limited Chenopodium quinoa leaves. Photosynthetica 2002, 40, 227–232. [Google Scholar] [CrossRef]
Garbulsky, M.F.; Peñuelas, J.; Gamon, J.; Inoue, Y.; Filella, I. The photochemical reflectance index (PRI) and the remote sensing of leaf, canopy and ecosystem radiation use efficiencies: A review and meta-analysis. Remote Sens. Environ. 2011, 115, 281–297. [Google Scholar] [CrossRef]
Genty, B.; Briantais, J.M.; Baker, N.R. The relationship between the quantum yield of photosynthetic electron transport and quenching of chlorophyll fluorescence. Biochim. Biophys. Acta (BBA)-Gen. Subj. 1989, 990, 87–92. [Google Scholar] [CrossRef]
Maxwell, K.; Johnson, G.N. Chlorophyll fluorescence—A practical guide. J. Exp. Bot. 2000, 51, 659–668. [Google Scholar] [PubMed]
Zhuang, Z.H.; Tsai, H.P.; Chen, C.I.; Yang, M.D. Subtropical region tea tree LAI estimation integrating vegetation indices and texture features derived from UAV multispectral images. Smart Agric. Technol. 2024, 9, 100650. [Google Scholar]
Achanta, R.; Shaji, A.; Smith, K.; Lucchi, A.; Fua, P.; Süsstrunk, S. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 2274–2282. [Google Scholar] [CrossRef]
Peng, H.; Long, F.; Ding, C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1226–1238. [Google Scholar]
Bai, J.; Zhou, Z.; Zou, Y.; Pulatov, B.; Siddique, K.H. Watershed drought and ecosystem services: Spatiotemporal characteristics and gray relational analysis. ISPRS Int. J. Geo-Inf. 2021, 10, 43. [Google Scholar] [CrossRef]
Aixiang, T. Grey Relation Analysis on Agriculture Energy Consumption and Its Affecting Factors. In Proceedings of the 2010 International Conference on Digital Manufacturing & Automation, Changcha, China, 18–20 December 2010; IEEE: Piscataway, NJ, USA, 2010; Volume 1, pp. 786–789. [Google Scholar]
Tao, H.; Feng, H.; Xu, L.; Miao, M.; Long, H.; Yue, J.; Li, Z.; Yang, G.; Yang, X.; Fan, L. Estimation of crop growth parameters using UAV-based hyperspectral remote sensing data. Sensors 2020, 20, 1296. [Google Scholar] [CrossRef]
Luo, D.; Gao, Y.; Wang, Y.; Shi, Y.; Chen, S.; Ding, Z.; Fan, K. Using UAV image data to monitor the effects of different nitrogen application rates on tea quality. J. Sci. Food Agric. 2022, 102, 1540–1549. [Google Scholar]
Wold, S.; Sjöström, M.; Eriksson, L. PLS-regression: A basic tool of chemometrics. Chemom. Intell. Lab. Syst. 2001, 58, 109–130. [Google Scholar]
Zhu, C.; Ding, J.; Zhang, Z.; Wang, Z. Exploring the potential of UAV hyperspectral image for estimating soil salinity: Effects of optimal band combination algorithm and random forest. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2022, 279, 121416. [Google Scholar]
Shafiee, S.; Lied, L.M.; Burud, I.; Dieseth, J.A.; Alsheikh, M.; Lillemo, M. Sequential forward selection and support vector regression in comparison to LASSO regression for spring wheat yield prediction based on UAV imagery. Comput. Electron. Agric. 2021, 183, 106036. [Google Scholar]
Qun’ou, J.; Lidan, X.; Siyang, S.; Meilin, W.; Huijie, X. Retrieval model for total nitrogen concentration based on UAV hyper spectral remote sensing data and machine learning algorithms—A case study in the Miyun Reservoir, China. Ecol. Indic. 2021, 124, 107356. [Google Scholar]
Zhang, Y.; Jiang, Y.; Xu, B.; Yang, G.; Feng, H.; Yang, X.; Yang, H.; Liu, C.; Cheng, Z.; Feng, Z. Study on the Estimation of Leaf Area Index in Rice Based on UAV RGB and Multispectral Data. Remote Sens. 2024, 16, 3049. [Google Scholar] [CrossRef]
Sun, C.; Feng, L.; Zhang, Z.; Ma, Y.; Crosby, T.; Naber, M.; Wang, Y. Prediction of end-of-season tuber yield and tuber set in potatoes using in-season UAV-based hyperspectral imagery and machine learning. Sensors 2020, 20, 5293. [Google Scholar] [CrossRef]
Zhai, W.; Li, C.; Cheng, Q.; Mao, B.; Li, Z.; Li, Y.; Ding, F.; Qin, S.; Fei, S.; Chen, Z. Enhancing wheat above-ground biomass estimation using UAV RGB images and machine learning: Multi-feature combinations, flight height, and algorithm implications. Remote Sens. 2023, 15, 3653. [Google Scholar] [CrossRef]
Xu, M.; Watanachaturaporn, P.; Varshney, P.K.; Arora, M.K. Decision tree regression for soft classification of remote sensing data. Remote Sens. Environ. 2005, 97, 322–336. [Google Scholar]
Balogun, A.L.; Tella, A. Modelling and investigating the impacts of climatic variables on ozone concentration in Malaysia using correlation analysis with random forest, decision tree regression, linear regression, and support vector regression. Chemosphere 2022, 299, 134250. [Google Scholar]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar]
Liu, Y.; Liu, S.; Li, J.; Guo, X.; Wang, S.; Lu, J. Estimating biomass of winter oilseed rape using vegetation indices and texture metrics derived from UAV multispectral images. Comput. Electron. Agric. 2019, 166, 105026. [Google Scholar]
Yang, X.; Yang, R.; Ye, Y.; Yuan, Z.; Wang, D.; Hua, K. Winter wheat SPAD estimation from UAV hyperspectral data using cluster-regression methods. Int. J. Appl. Earth Obs. Geoinf. 2021, 105, 102618. [Google Scholar]
Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
Xiaosong, Z.; Qiangfu, Z. Stock prediction using optimized LightGBM based on cost awareness. In Proceedings of the 2021 5th IEEE International Conference on Cybernetics (CYBCONF), Sendai, Japan, 8–10 June 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 107–113. [Google Scholar]
Welch, B.L. On the comparison of several mean values: An alternative approach. Biometrika 1951, 38, 330–336. [Google Scholar] [CrossRef]
Reganold, J.P.; Elliott, L.F.; Unger, Y.L. Long-term effects of organic and conventional farming on soil erosion. Nature 1987, 330, 370–372. [Google Scholar] [CrossRef]
Schrama, M.; De Haan, J.J.; Kroonen, M.; Verstegen, H.; Van der Putten, W.H. Crop yield gap and stability in organic and conventional farming systems. Agric. Ecosyst. Environ. 2018, 256, 123–130. [Google Scholar] [CrossRef]
De Ponti, T.; Rijk, B.; Van Ittersum, M.K. The crop yield gap between organic and conventional agriculture. Agric. Syst. 2012, 108, 1–9. [Google Scholar] [CrossRef]
Wortman, S.E.; Galusha, T.D.; Mason, S.C.; Francis, C.A. Soil fertility and crop yields in long-term organic and conventional cropping systems in Eastern Nebraska. Renew. Agric. Food Syst. 2012, 27, 200–216. [Google Scholar]
Schärer, M.L.; Dietrich, L.; Kundel, D.; Mäder, P.; Kahmen, A. Reduced plant water use can explain higher soil moisture in organic compared to conventional farming systems. Agric. Ecosyst. Environ. 2022, 332, 107915. [Google Scholar]
Petcu, E.; Toncea, I.; Mustăţea, P.; Petcu, V. Effect of organic and conventional farming systems on some physiological indicators of winter wheat. Org. Farming 2009, 2010, 131–135. [Google Scholar]
Han, W.Y.; Wang, D.H.; Fu, S.W.; Ahmed, S. Tea from organic production has higher functional quality characteristics compared with tea from conventional management systems in China. Biol. Agric. Hortic. 2018, 34, 120–131. [Google Scholar] [CrossRef]
Olsen, J.; Weiner, J. The influence of Triticum aestivum density, sowing pattern and nitrogen fertilization on leaf area index and its spatial variation. Basic Appl. Ecol. 2007, 8, 252–257. [Google Scholar]
Chen, C.I.; Lin, K.H.; Huang, M.Y.; Yang, C.K.; Lin, Y.H.; Hsueh, M.L.; Lee, L.-H.; Lin, S.-R.; Wang, C.W. Gas exchange and chlorophyll fluorescence responses of Camellia sinensis grown under various cultivations in different seasons. Bot. Stud. 2024, 65, 10. [Google Scholar]
Liu, W.; Cui, S.; Wu, L.; Qi, W.; Chen, J.; Ye, Z.; Ma, J.; Liu, D. Effects of bio-organic fertilizer on soil fertility, yield, and quality of tea. J. Soil Sci. Plant Nutr. 2023, 23, 5109–5121. [Google Scholar] [CrossRef]
Li, J.; Cheng, K.; Wang, S.; Morstatter, F.; Trevino, R.P.; Tang, J.; Liu, H. Feature selection: A data perspective. ACM Comput. Surv. (CSUR) 2017, 50, 1–45. [Google Scholar] [CrossRef]
Zhu, Y.; Liu, K.; Liu, L.; Myint, S.W.; Wang, S.; Liu, H.; He, Z. Exploring the potential of worldview-2 red-edge band-based vegetation indices for estimation of mangrove leaf area index with machine learning algorithms. Remote Sens. 2017, 9, 1060. [Google Scholar] [CrossRef]
Shahsavari, M.; Mohammadi, V.; Alizadeh, B.; Alizadeh, H. Application of machine learning algorithms and feature selection in rapeseed (Brassica napus L.) breeding for seed yield. Plant Methods 2023, 19, 57. [Google Scholar] [CrossRef]
Fang, S.; Yao, X.; Zhang, J.; Han, M. Grey correlation analysis on travel modes and their influence factors. Procedia Eng. 2017, 174, 347–352. [Google Scholar] [CrossRef]
Shao, G.; Han, W.; Zhang, H.; Zhang, L.; Wang, Y.; Zhang, Y. Prediction of maize crop coefficient from UAV multisensor remote sensing using machine learning methods. Agric. Water Manag. 2023, 276, 108064. [Google Scholar] [CrossRef]
Chen, Z.; Jia, K.; Xiao, C.; Wei, D.; Zhao, X.; Lan, J.; Wei, X.; Yao, Y.; Wang, B.; Sun, Y.; et al. Leaf area index estimation algorithm for GF-5 hyperspectral data based on different feature selection and machine learning methods. Remote Sens. 2020, 12, 2110. [Google Scholar] [CrossRef]
Zhang, C.; Yi, Y.; Wang, L.; Zhang, X.; Chen, S.; Su, Z.; Zhang, S.; Xue, Y. Estimation of the Bio-Parameters of Winter Wheat by Combining Feature Selection with Machine Learning Using Multi-Temporal Unmanned Aerial Vehicle Multispectral Images. Remote Sens. 2024, 16, 469. [Google Scholar] [CrossRef]
Duro, D.C.; Franklin, S.E.; Dubé, M.G. Multi-scale object-based image analysis and feature selection of multi-sensor earth observation imagery using random forests. Int. J. Remote Sens. 2012, 33, 4502–4526. [Google Scholar] [CrossRef]
Nagy, A.; Szabó, A.; Elbeltagi, A.; Nxumalo, G.S.; Bódi, E.B.; Tamás, J. Hyperspectral indices data fusion-based machine learning enhanced by MRMR algorithm for estimating maize chlorophyll content. Front. Plant Sci. 2024, 15, 1419316. [Google Scholar] [CrossRef]
Wu, S.; Deng, L.; Guo, L.; Wu, Y. Wheat leaf area index prediction using data fusion based on high-resolution unmanned aerial vehicle imagery. Plant Methods 2022, 18, 68. [Google Scholar] [PubMed]
Wittstruck, L.; Jarmer, T.; Trautz, D.; Waske, B. Estimating LAI from winter wheat using UAV data and CNNs. IEEE Geosci. Remote Sens. Lett. 2022, 19, 1–5. [Google Scholar]
Li, H.; Yan, X.; Su, P.; Su, Y.; Li, J.; Xu, Z.; Gao, C.; Zhao, Y.; Feng, M.; Shafiq, F.; et al. Estimation of winter wheat LAI based on color indices and texture features of RGB images taken by UAV. J. Sci. Food Agric. 2025, 105, 189–200. [Google Scholar] [PubMed]
Liu, S.; Zeng, W.; Wu, L.; Lei, G.; Chen, H.; Gaiser, T.; Srivastava, A.K. Simulating the leaf area index of rice from multispectral images. Remote Sens. 2021, 13, 3663. [Google Scholar] [CrossRef]
Zhang, J.; Cheng, T.; Guo, W.; Xu, X.; Qiao, H.; Xie, Y.; Ma, X. Leaf area index estimation model for UAV image hyperspectral data based on wavelength variable selection and machine learning methods. Plant Methods 2021, 17, 49. [Google Scholar]

Figure 1. Research flow.

Figure 2. Geographical distribution map of the study plantations. (a) Geographic location and elevation of Nantou County. (b) Ren’ai Township. (c) Mingjian Township. (d) Lugu Township.

Figure 3. Field measurement instruments and designed square zone (DSZ). (a) LAI-2200C Plant Canopy Analyzer. (b) PlantPen PRI 200. (c) MINI-PAM-II photosynthesis yield analyzer. (d) DJI Phantom 4 Pro. (e) DJI Phantom 4 Multispectral. (f) Designed square zone.

Figure 4. Workflow for tea canopy image classification.

Figure 5. Violin plots and significance analysis of (a) LAI, (b) PRI, and (c) ΦPSII distributions. *: Significant difference between the two farming methods (p < 0.05); letters (a, b, c) denote significant differences among the three elevation levels within the same farming method (p < 0.05).

Figure 6. Heatmap of image indices importance based on (a) Pearson correlation, (b) minimum redundancy maximum relevance, and (c) gray relational analysis.

Figure 7. Regression accuracy of LAI using color (a–c) and multispectral indices (d–f) with three feature ranking methods.

Figure 8. Regression accuracy of PRI using color (a–c) and multispectral indices (d–f) with three feature ranking methods.

Figure 9. Regression accuracy of ΦPSII using color (a–c) and multispectral indices (d–f) with three feature ranking methods.

Figure 10. Boxplots of prediction errors for three physiological parameters across different elevations (low, mid, and high) and seasons (spring, summer, autumn, and winter) under conventional (a,c,e) and agroecological (b,d,f) farming methods.

Table 1. Hyperparameters adjusted for each model.

Model	Hyperparameter
Polynomial Regression	Polynomial Terms: 2nd degree
Partial Least Squares Regression	n_components: from 2 to N based on the input features
Lasso Regression	Regularization strength (α): 0, 0.01, 0.1, 1, 10, 100
Ridge Regression	Regularization strength (α): 0, 0.01, 0.1, 1, 10, 100
Decision Tree Regression	max_depth: 4~100 min_samples_split: 5~50, increasing in increments of 5
Random Forest Regression	n_estimators: 50~150, increasing in increments of 25 max_depth: 3~50 min_samples_split: 5~50, increasing in increments of 5
eXtreme Gradient Boosting	max_depth: 3~25 learning_rate: 0.001, 0.005, 0.01, 0.05, 0.1 n_estimators: 50~150, increasing in increments of 25
Light Gradient Boosting Machine	num_leaves: 50~150, increasing in increments of 10 learning_rate: 0.001, 0.005, 0.01, 0.05, 0.1 n_estimators: 50~150, increasing in increments of 25

Table 2. Summary of descriptive statistics for LAI, PRI, and ΦPSII under two farming methods.

Parameter	Farming Method	Number	Min	Mean	Max	StDev	CV
LAI	CFM	499	0.500	4.878	10.370	1.704	0.349
LAI	AFM	377	0.130	4.058	9.810	2.035	0.501
PRI	CFM	503	−0.0371	0.0228	0.0766	0.0196	0.860
PRI	AFM	378	−0.0770	0.0201	0.0596	0.0251	1.249
ΦPSII	CFM	477	0.0729	0.4121	0.8277	0.1671	0.405
ΦPSII	AFM	350	0.0819	0.4586	0.8648	0.1717	0.374

Table 3. Accuracy of three physiological parameters under different image indices (CIs and MIs) and feature ranking methods.

Parameter	Index	Feature Ranking	R²	RMSE	MAE	Model
LAI	CI	PCA	0.59	1.214	0.759	XGBoost
		MRMR	0.599	1.2	0.752
		GRA	0.591	1.212	0.751
	MI	PCA	0.687	1.06	0.676	XGBoost
		MRMR	0.691	1.053	0.669
		GRA	0.694	1.049	0.632
PRI	CI	PCA	0.281	0.019	0.014	RFR
		MRMR	0.284	0.019	0.014
		GRA	0.284	0.019	0.014
	MI	PCA	0.603	0.014	0.009	XGBoost
		MRMR	0.607	0.014	0.009
		GRA	0.643	0.013	0.009
ΦPSII	CI	PCA	0.92	0.048	0.013	XGBoost
		MRMR	0.919	0.049	0.016
		GRA	0.915	0.05	0.014
	MI	PCA	0.909	0.052	0.021	XGBoost
		MRMR	0.913	0.05	0.024
		GRA	0.919	0.049	0.015

The bolded words in the table indicate the best results for this physiological parameter in CI/MI.

Table 4. Comparison of regression models, feature ranking methods, and accuracy metrics (best accuracy and 95% confidence interval) for tea plant physiological parameters.

Parameter	Indices	Feature Ranking	Accuracy ¹	Model	# Variables ¹ (Difference)
LAI	CI	MRMR	0.599/0.569	XGBoost	14/5 (9)
LAI	MI	GRA	0.716/0.680	XGBoost	43/11 (32)
PRI	CI	GRA	0.284/0.270	RFR	19/17 (2)
PRI	MI	GRA	0.643/0.611	XGBoost	55/53 (2)
ΦPSII	CI	PCA	0.920/0.874	XGBoost	22/3 (19)
ΦPSII	MI	GRA	0.919/0.873	XGBoost	36/3 (33)

¹ Best accuracy/95% confidence interval lower bound. The bolded words in the table indicate the best results for this physiological parameter in CI/MI.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhuang, Z.-H.; Tsai, H.-P.; Chen, C.-I. Estimating Tea Plant Physiological Parameters Using Unmanned Aerial Vehicle Imagery and Machine Learning Algorithms. Sensors 2025, 25, 1966. https://doi.org/10.3390/s25071966

AMA Style

Zhuang Z-H, Tsai H-P, Chen C-I. Estimating Tea Plant Physiological Parameters Using Unmanned Aerial Vehicle Imagery and Machine Learning Algorithms. Sensors. 2025; 25(7):1966. https://doi.org/10.3390/s25071966

Chicago/Turabian Style

Zhuang, Zhong-Han, Hui-Ping Tsai, and Chung-I Chen. 2025. "Estimating Tea Plant Physiological Parameters Using Unmanned Aerial Vehicle Imagery and Machine Learning Algorithms" Sensors 25, no. 7: 1966. https://doi.org/10.3390/s25071966

APA Style

Zhuang, Z.-H., Tsai, H.-P., & Chen, C.-I. (2025). Estimating Tea Plant Physiological Parameters Using Unmanned Aerial Vehicle Imagery and Machine Learning Algorithms. Sensors, 25(7), 1966. https://doi.org/10.3390/s25071966

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Estimating Tea Plant Physiological Parameters Using Unmanned Aerial Vehicle Imagery and Machine Learning Algorithms

Abstract

1. Introduction

2. Materials and Methods

2.1. Overview

2.2. Experimental Site and Design

2.3. Data Acquisition

2.3.1. Tea Plant Physiological Parameter Acquisition

2.3.2. UAV Image Acquisition

2.4. Image Processing

2.4.1. Canopy Part Segmentation

2.4.2. Calculation of Color Indices and Multispectral Indices

2.5. Feature Ranking Methods

2.5.1. Pearson Correlation Analysis

2.5.2. Minimum Redundancy Maximum Relevance

2.5.3. Gray Relational Analysis

2.6. Model Training and Evaluation Metrics

3. Results

3.1. Effects of Elevation Distribution and Farming Method Differences on Tea Plant Physiology

3.2. Feature Ranking of Image Indices

3.3. Comparison of Regression Model Accuracy for Tea Plant Physiological Parameter Estimation

3.4. Effects of Elevation and Seasonal Conditions on Prediction Accuracy

4. Discussion

4.1. Differences in Tea Plant Physiological Parameters Across Elevations and Farming Methods

4.2. Effects of Feature Ranking Methods and Image Indices on Prediction Models

4.3. Applicability Analysis of Regression Models

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI