1. Introduction
Water stress is one of the main abiotic stresses limiting crop production and quality [
1,
2,
3,
4,
5,
6]. It is important to know the level of water stress in a crop, in order to schedule irrigation [
7,
8]. As the canopy temperature is closely linked to stomatal behavior, particularly stomatal conductance, and stomatal closure reduces transpiration and heat loss, leading to increased leaf temperature [
9,
10,
11], the canopy temperature (Tc) is an indicator of crop water stress, and the reduction in canopy temperature relative to the ambient environment reflects the ability of transpiration to cool the plant’s leaves. In practice, continuous measurement of canopy temperature is a useful tool to identify crops’ water stress [
12,
13,
14] and indicate the irrigation needed to optimize growth and yield in crops [
15,
16], and it can be used to study drought and heat stress tolerance [
17,
18]. Furthermore, canopy temperature can also allow for more accurate estimates of the consequences of heat stress on the crop and its yield compared to air temperature [
19,
20,
21,
22], since canopy temperature can deviate from air temperature under field conditions because of the interplay among plant traits, plant water availability, air temperature and humidity, solar radiation, wind velocity, and the ensuing canopy microclimate [
23,
24].
There are different ways to model and simulate canopy temperatures. Biophysics-based crop models have components to simulate canopy temperature, such as the rice sterility model [
25], the wheat simulation model SIRIUS for predicting leaf appearance in wheat [
26,
27], the generic STICS model for crop simulation with water and nitrogen balance [
28], APSIM for farming system modeling [
29], and direct simulation of canopy temperature based on Monin–Obukhov similarity theory [
30]. However, due to the complicated biophysical processes, these models face challenges in quantifying the complex factors that affect canopy temperature because of large variations in crop types, locations, and management regimes, as well as significant exogenous information of farm/location-specific qualities that are often not available [
30]. In addition, these physical-based models often simplify the relationship between canopy temperature and weather variables (particularly with the use of only air temperature) or require ancillary measurements to reliably estimate the model parameters (such as heat transfer resistance in the energy balance approach) [
31].
Data-driven models are popular alternatives in modeling practice [
24]. Several models have previously been used to predict canopy temperature from the more easily obtained weather variables, including the Brown and Zeiher day-and-night model [
32], the Georgia day–night models [
33], the Georgia 24 h models [
33], and the generalized model proposed in [
34,
35]. All of these models were applied to cotton [
33]. All of these models are in linear form, i.e., constant coefficients of weather variables in multiple linear regression. Shao et al. [
14] overcame the limitation of the linear setting by proposing the so-called PeriodiCT model, which outperforms all of the abovementioned models for cotton.
Various deep learning models have recently been reported for long-term high-frequency time-series forecasts like 15 min canopy temperature [
36], but they demand a large amount of training time series, and only a few of them have simple mechanisms to include covariates available in the future [
37]. Furthermore, more sophisticated machine learning and statistical models need a lot of training data, as well as expertise in setting up the hyperparameters in order to reach their optimal forecast performance [
38]. As the canopy temperature interacts with environment differently at various development stages [
39], the requirement of large training data often becomes unfeasible because the model parameters can be different over different development stages. Therefore, it is interesting to know whether the simpler machine learning model PeriodiCT suffers from large amounts of training data (i.e., whether a smaller, simpler amount of training data would be sufficient to train the model parameters). Furthermore, it is also interesting to know whether a trained PeriodiCT model can be used over different fields and across different seasons.
One of the practical uses of canopy temperature is for irrigation scheduling. With affordable canopy temperature sensors, the use of continuous canopy temperature measurements for irrigation scheduling is made even more accurate by using the Biologically Identified Optimal Temperature Interactive Console (BIOTIC), which was developed in [
40], whose algorithm has been updated to the stress time temperature (STT) threshold [
40]. In Australia, the potential use of the BIOTIC for furrow flood irrigation systems in cotton has been supported by the use of an accumulative STT [
15,
16].
Note that water is a precious natural resource and is costly to farmers, and better decision-making is required on irrigation scheduling [
41]. In order for farmers to use the BIOTIC and SST to guide their irrigation scheduling, particularly in the case of furrow flood irrigation systems, there is a need to predict canopy temperature data in the near future (1 to 7 days in advance) to determine when irrigation is potentially needed. For example, Australian cotton growers need to plan when and how to attain the water moats for irrigations before they are needed, because some farms need to order water from large reservoirs, which can be some distance away from the farms [
42,
43], so as to allow the water to be released from reservoirs so that it arrives at the farms on time.
The following considerations need to be addressed for implementing the forecasting algorithms in practice: Firstly, for each sensor, one can only use the historical data to train the model parameters for predicting the future canopy temperature. Secondly, as the number of observation days of data can be quite small in the early season and the model can vary at different crop development stages, one is interested in knowing how many days are needed to reliably estimate the model parameters. Thirdly, there may only be a limited number of sensors in a field, and one may be interested in knowing whether the trained model for one season can be used to predict canopy temperature across fields with the same crop variety but with no canopy temperature sensors available. Fourthly, in cases where no observations are available to train the model (such as at the start of a crop season), one may be interested in knowing whether the trained model from the previous season can be used to predict the canopy temperature in the current season. Finally, it is possible that some weather variables are not available, and it is useful to know how the model performs if some weather variables are missing.
To assist in the implementation of the PeriodiCT model from economic and practical perspectives in term of forecasting, this work aims to assess the performance of our PeriodiCT model against all of the above issues, by answering the following questions: (1) How many data (days, as sample size) are required in model training to achieve reasonable predictability? (2) How well does the model predict into the future? (3) How well does the trained model from one location predict the canopy temperature at another location where there is a similar crop variety? (4) How well does the trained model from one season predict the canopy temperature in another season in the same field with the same crop variety? The full model, using all predictors (air temperature Ta, vapor pressure VP, wind speed U, and solar radiation S), and all sub-models without S are also assessed and compared with the performance of corresponding benchmarks using leave-one-out cross-validation procedures.
2. Materials and Methods
2.1. Cotton Experiment Used to Test PeriodiCT
To understand the impact of sample size (the number of days used for model training), we used canopy temperature data and weather data collected from the same 5 sensors as used in Ref. [
14], with sensor identification (ID) = S1303, S1304, S1305, S1306, and S1309. A threshold temperature of 28 °C was also used as the threshold to indicate the point at which the biological optimum is exceeded during the day [
41]. In this study, at no time did temperatures exceed 28 °C for 5.25 h for the day where yield could be impacted [
15,
16].
To assess the applicability of the PeriodiCT model across a field with the same crop variety, several fields in the cotton region around Narrabri are investigated.
The fields and sensors are summarized in
Table 1, including Emerald (commercial farm in the Central Highlands Region of Queensland), Moree (commercial farm, Gywdir Valley), and ACRI (Namoi Valley) in Narrabri, New South Wales, over the 2014/2015 and/or 2015/2016 cropping seasons.
Figure 1 presents a map of Australia with indication of the study fields. The Central Highlands Region has a subtropical climate with hot and humid summers, with average temperatures ranging from 20 to 28 °C; and mild, dry winters, with average temperatures ranging from 9 to 20 °C. Rainfall is most common during the summer months (December–February). Narrabri, with latitude from −30.24741288° S to −29.24741288° S and longitude from 149.818732° E to 150.818732° E, located in the northwest of New South Wales, Australia, is a major center for cotton as well as red meat production. The field sites are characterized as semi-arid, with average annual rainfall of around 657 mm. The area has mild winters and hot summers, with average summer temperatures ranging from 20 to 39 °C and average winter temperatures ranging from 0 to 20 °C. Rainfall is summer-dominant, with two-thirds of yearly rainfall occurring during the cotton season of September to February (Emerald) and October to March (Moree and ACRI). The soils in the study areas are uniform grey cracking clay (USDA Soil Taxonomy: Typic Haplustert) and alkaline, with a high clay fraction and high background fertility, making them suitable for various crops’ growth. Irrigated cotton farming is one of the major agricultural activities. All of the experiments used a randomized complete block design with three replicates. The experimental blocks were 180 m by 16 or 20 rows at ACRI, 550 m by 16 or 20 rows at Emerald, and 1200 m by 24 rows at Moree. The canopy temperature scheduled plots were irrigated when the estimated STT of 37 h was reached. The traditional irrigation scheduling treatments at Emerald and Moree used combinations of soil moisture meters (capacitance probes), farmers’ experience, and intuition. All approaches used were intended to minimize stress.
To assess the applicability of the PeriodiCT model across years, the experimental data from Emerald over the 2014–2015 and 2015–2016 seasons were used as follows: the data collected in 2014–2015 were used to train the, models and then the trained models were used to predict the canopy temperature over the 2015–2016 season by using the observed weather data.
In the experiments, the canopy temperatures were measured by commercial thermal temperature sensors installed in the fields, directly facing the cotton leaves. Regular adjustments were conducted to ensure that the sensors were above the cotton plants. Given that, like other models, including the Brown and Zeiher model [
32] and Georgia model [
33], the PeriodiCT model was developed for irrigated crops such as cotton [
44], all of the experiments and treatments took place under fully watered situations.
2.2. PeriodiCT: Canopy Temperature Prediction Models
Based on the measured canopy temperature data, a data-driven model was constructed to predict the canopy temperature by using weather variables. The PeriodiCT model for this purpose is given in the regression form as follows:
where
t represents the Julian time (in the form of 24 h),
Tc(
t) is the canopy temperature estimated by the model in °C at time t,
Ta(
t) is the air temperature at time t in °C,
ea(t) is the vapor pressure at time
t in kPa, U(
t) is the wind speed at time t in ms
−1, and
S(
t) is the solar radiation at time
t in Wm
−2.
is a periodic function of t with a 24 h period, with values between 0 and 24 h. The regression coefficients
,
,
,
, and
are the functions of
to be estimated. The coefficient functions are estimated by using the procedure in Ref. [
13] through
The advantage of PeriodiCT is that varying coefficients capture the need for parameter values to change over time within a day, and the periodicity constraint ensures that the parameter change should smoothly evolve over time [
45]. For each experiment and treatment, the regression coefficient functions are trained by using measured canopy temperatures and corresponding weather variables.
In addition to the full PeriodiCT model using all four predictors, we also considered the sub-models, as in Ref. [
44]. For the ease of notation, we used digital numbers to name the sub-models, with 1 representing air temperature, 2 vapor pressure, 3 wind speed, and 4 solar radiation. For example, sub-model 123 represents the sub-model using air temperature (as 1), vapor pressure (as 2), and wind speed (as 3).
2.3. Assessment Criteria
To assess the performance of the PeriodiCT model on the canopy temperature prediction with different lengths of historical data, one needs to train the model by using a dataset with known weather and canopy temperature data, and then apply this model to predict unknown canopy temperatures using only known corresponding weather data in the following days. The cross-validation approach was used in the assessment [
46]. To do this, a consecutive
k days of the experiment dataset (called training data) were used to train the model parameters by using both the weather and canopy temperature observations, and then the trained model was applied to the next
l days (called validation or testing data, with lead time
l) to obtain the predicted canopy temperature, using only the weather observations. We evaluated the performance for all of the combinations of
k from 1 to 15 previous days and
l from 1 to 14 future days. The predicted canopy temperatures were compared with the corresponding observations (which are indeed known in the validation data). For each pair of (
k,
l), this training and validation process was performed for all possible combinations of a given experimental dataset. We named this process forward (
k,
l) cross-validation.
The temperature difference
ETC, absolute difference
DTC (both in percentage), root-mean-square error
RMSETC, and the Nash–Sutcliffe coefficient of efficiency
NSTC were used to measure the model’s performance in canopy temperature prediction [
47]. The percentage of relative temperature difference, the percentage of absolute difference, and the root-mean-square error are defined by
and
respectively. The coefficient of efficiency describes how well the forecasts compare to the measurements and is defined by
where
Tci is the measured canopy temperature at time
i,
is the mean canopy temperature of measurement in the summation,
is the forecasted canopy temperature at time
i by the forward (
k, l) cross-validation testing process, and
n is the number of data used in the summation. That is,
is forecasted by the model trained by using the data from the (
i−
k−
l+1)th to (
i−
l)th days and weather data from the
ith day (so the
ith day has the lead time
l in the setting). The smaller the
ETC,
DTC, or
RMSETC is, the better the model’s predictability is. The closer the value of
NSTC is to 1, the more successfully the model forecasts.
To evaluate the model’s ability to detect crop stress status, where stress time (ST) accumulates once the canopy temperature reaches above a pre-defined threshold (which is 28 °C in our case studies; see [
40]), the following four categories were defined for testing reliability: a forecast was said to be (1) a correct negative when both the forecast and measurement were below the threshold, (2) a correct hit if both the forecast and measurement were above the threshold, (3) a miss if the forecast was below the threshold while the measurement was above the threshold, and (4) a false alarm if the forecast was above the threshold while the measurement was below the threshold. By counting the number of measurement/forecast pairs for a model, a contingency table can be formed, as shown in
Table 2.
The accuracy was then defined as follows:
where “
Total” is the total number of measurements (or pairs in the contingency table; see
Table 2 as an example) [
48].
To assess the performance of the PeriodiCT model on the canopy temperature prediction in a field over time, one needs to train the model by using the dataset from the sensor in one growing season with known weather and canopy temperature data, and then apply the trained model to predict unknown canopy temperature from the same sensor in the growing season of interest with only known, corresponding weather data at the same sensor in the whole growing season. We named this assessment one-to-one predictability assessment across sensors. There are n × (n − 1) pairs of assessments for a treatment with n sensors (n > 1). As there are often more than two sensors, the pairs of assessment are often quite numerous. Alternatively, one can assess the predictability based on one sensor by using the trained model based on the rest of the sensors within the corresponding treatment of an experiment. We named this latter assessment multiple-to-one predictability assessment across sensors. The same assessment criteria of ETC, DTC, RMSETC, and NSTC were used to evaluate the model performance, as before, but by letting be the forecasted temperature at time i over the whole growing season of interest.
To assess the performance of the PeriodiCT model on the canopy temperature prediction across a field, one needs to train the model by using a dataset from one or more sensors with known weather and canopy temperature data and then apply this model to predict the unknown canopy temperature of the other sensor(s) using only known, corresponding weather data from these sensors. We named this assessment one-to-one predictability assessment across seasons. There are n × m pairs of assessments for a treatment with n sensors in one season and m sensors in another season. Alternatively, one can assess the predictability based on one sensor by using the trained model based on all of the sensors in the other season. We named this latter assessment multiple-to-one predictability assessment across seasons. Again, the same assessment criteria of ETC, DTC, RMSETC, and NSTC were used to evaluate the model performance, as before, but by letting be the forecasted canopy temperature for the selected sensor at time i over the whole growing season.
Furthermore, the predictabilities of trained models in different scenarios were assessed in comparison with the so-called leave-one-out cross-validation test. As this leave-one-out cross-validation gives the best criterion values, we called its criterion values the benchmarks.
The assessments were undertaken using a hindcast approach, where actual measured values of canopy temperature and weather variables were used. The assessments were also undertaken not only for the full model, where all weather variables were assumed to be available, but also for the sub-models in cases where only some weather variables were available. The assessments for sub-models are also important in real forecasts, because forecasted weather is always subject to uncertainty, and the use of predictors with low skill may reduce the accuracy of canopy temperature forecasts. However, we did not consider the forecast uncertainty in our assessment, because the assessment results are largely affected by the ability of the weather models. This is why we used the hindcast approach. The assessments were based on the data from fully watered furrow-flood-irrigated cotton in Australia.
4. Discussion
This work assessed the performance of the PeriodiCT model in terms of the sample size requirements in model training, the predictability into the future, and the applicability of the models trained on different sensors within the same field and across seasons, as well as the performances of the sub-models.
There are other data-driven models that may be applicable to canopy temperature modeling. For example, statistical autoregressive models such as ARIMA (Autoregressive Integrated Moving Average; see Ref. [
49]) and LSTM (Long Short-Term Memory; see Ref. [
50]) use only the past canopy temperature as the input, which has not been used for canopy temperature but has been used for canopy covers [
51]. Other machine learning models have also been proposed for different crops, including the random forest model [
52]. Various deep learning models have recently been reported for long-term high-frequency time-series forecasts like 15-min canopy temperature [
36], but they demand a large amount of training time series, and only a few of them have simple mechanisms to include covariates that become available in the future [
37]. It will be an interesting research topic to compare these models with similar objectives to those used in the current work.
As data-driven models are flexible enough to include more data as predictors, future research could use other possible hydroclimatic variables. For example, the traditional and dynamic treatments were scheduled by a combination of neutron moisture probe readings and forecasted evapotranspiration. The traditional schedules had irrigation targets based on soil moisture deficits of ~50, 55, and 60 mm, and the water applied per irrigation event reflected these deficit values. As canopy temperature is partially regulated by evapotranspiration through energy consumption at the canopy surface and can be used to infer changes in stomatal regulation and vegetation water stress [
27,
53], evapotranspiration could be potentially included as a predictor. Furthermore, vapor pressure deficit, instead of vapor pressure, could be a potential factor to consider [
54].
Similar to other data-driven models, PeriodiCT needs to be trained. The model training and the order of the predictors’ contributions can vary in different regions under different environments and climates. As we can see from the cross-season results in this work, the solar radiation could not be used in the next season, because the solar radiation pattern changed dramatically, meaning that the climate patterns were quite different in these two seasons. It was found that humidity and cloud cover (which are directly linked to solar radiation) are important factors in semi-arid environments [
55].
In this work, 28 °C was used as the threshold for the accuracy calculation, as it was tested for cotton fields in Australia. However, accuracy could be tested by using different thresholds. The threshold method was used in another study [
56]. There are other indices to assess crops’ water stress, such as the crop water stress index (CWSI) to guide irrigation [
12,
35,
57,
58] and the plant stress index (PSI) for the same purpose [
59].
It should be noted that this assessment was conducted by using the true weather observations as predictors. In real applications, weather forecasts will be used (except for the cross-sensor case, which can be viewed as real-time prediction and gap-filling), and these weather variables will be subject to uncertainty. It will be important to assess the model’s performance in real forecasting, which of course depends heavily on the weather forecasting model.
5. Conclusions
A periodic coefficient model named PeriodiCT was proposed to predict canopy temperature by using weather variables, aiming to assist in irrigation scheduling and planning. In order for farmers to confidently use this model, it is important to assess the model’s performance in different situations, including the sample size requirement to train the model, as the model may vary at different stages of crop development; the applicability of the trained model to other fields, which can be important for reducing the cost of the sensor installation; and across different seasons, as a trained model may be unavailable at the early stage of a new season. It is also important to know the model’s performance if some weather variables are unavailable or unreliable, such that only a sub-model can be used. This work aimed to address all of these concerns.
In the assessment, three criteria were used to assess the performance, including relative error, coefficient of efficiency, and accuracy, with a threshold value of 28 °C, which is the value to calculate the water stress time in irrigation scheduling. The performances were assessed in comparison with the benchmark criterion values when the leave-one-out cross-validation was used for the whole crop season, because the leave-one-out cross-validation gives the best criterion values.
By using selected experimental data from Australia and comparing the results with the benchmark values using the leave-one-out cross-validation procedure, and from conducting analysis on the full models and sub-models with different numbers of weather variables, which were added in order of importance (air temperature, vapor pressure, wind speed, and solar radiation), we can conclude the following:
(1) For the sample size requirement, the model can be trained well by using the data for only 5 days (days in), performing quite well in prediction in comparison with the benchmark values, and the predictability remains stable and shows only marginal improvement as the number of days ahead (days out) increases. Even the models trained by using only one day’s data (days in = 1) can achieve reasonable predictability. The solar radiation has a very marginal contribution to the predictability, followed by the vapor pressure and wind speed, which in fact have very similar contributions to the predictability.
(2) When a model trained using data from one sensor was applied to predict the canopy temperature for another sensor in the field, the predictability performed comparatively well in comparison with the benchmark, except for the experiment in Moree, where the cross-sensor predictabilities were lower than the benchmark. The reason for the low predictability in Moree may be the inferior sensor calibration before its installation in the field. To compare the models in terms of cross-sensor performance, the importance of the predictors is in the order of air temperature, vapor pressure, wind speed, and solar radiation, while the vapor pressure and wind speed have similar contributions and solar radiation has only marginal contributions.
(3) When the models trained by using data from one season are applied to predict the canopy temperature in the next season, the full model performs badly, with some extremely large prediction values (even when the Georgia 24 h model is used), and therefore is not recommended. The predictabilities of the sub-models decrease in comparison with the benchmarks, but they still perform reasonably well. It is interesting to see that the use of wind speed decreases the predictability.
(4) In all assessments, the air temperature is always the most important predictor, which dominates the prediction performance of PeriodiCT. By taking the reliability of predictors into account in real forecasting, we recommend the use of vapor pressure as the second most important predictor, if available, followed by wind speed if it is useful. However, solar radiation should not be used in real forecasting, as it is less reliable in weather forecasting and has only a marginal contribution to canopy temperature forecasting.
Overall, the prediction model PeriodiCT for canopy temperature, using weather variables as predictors, performs reasonably well with low data availability within a sensor and across sensors. However, for our Australian experiment, the solar radiation cannot be used, and the wind speed is not recommended when a model trained in one season is used for prediction in the next season. The assessment provided important guidance for the use of PeriodiCT in practice, adding further confidence for farmers to adopt the model in their management, particularly in irrigation scheduling.