1. Introduction
Various atmospheric phenomena in Brazil cause significant societal impacts; however, frost is considered one of the most detrimental to the country’s economy, particularly in sectors related to food production. In years with a high frost incidence, there is a marked decline in agricultural yields, which leads to a rise in prices due to product scarcity. Classic examples of reduced production and subsequent price increases have been extensively documented in studies on coffee crops (Margolis [
1]; Hewitt [
2]; Moricochi et al. [
3]), wheat (Junges et al. [
4]; Melo and Moro [
5]), and corn (Tsunechiro and Miura [
6]), among others.
The term frost is technically defined as the formation of ice crystals on exposed surfaces, either by freezing dew or by the phase transition from vapor to ice (Blanc et al. [
7]; Bettencourt [
8]; Mota [
9]; Cunha [
10]). However, this term is also used colloquially to characterize meteorological events that cause damage to various plant crops. The literature has no consensus regarding the definition of frost from a meteorological perspective. Consequently, several definitions can be found, including the following: (a) an air temperature less than or equal to 0 °C measured in a shelter at a height between
and
m (Hogg [
11,
12]; Lawrence [
13]); (b) an air temperature below 0 °C without specifying the type and height of the shelter (Raposo [
14]; Hewett [
15]); and (c) a surface temperature below 0 °C (Cunha [
16]).
Several methods (both passive and active) exist to minimize damage caused by frost (Snyder and Melor [
17]); however, some of these methods can be expensive and require advance preparation time to implement. In this context, significant efforts have been applied to developing tools that can predict the occurrence of frost events in advance.
In recent years, machine learning methods research has been widely used for frost prediction. Diedrichs et al. [
18] developed a component for an IoT-enabled frost prediction system, where they used machine learning algorithms trained by previous readings of temperature and humidity sensors to predict future temperatures. Ding et al. [
19] propose the construction of predictive models using the support vector machine approach to capture possible causal relationships between several environmental factors and frost. Fuentes et al. [
20] propose a neural network model, based on backpropagation, to predict the minimum air temperature of the following day from meteorological data using air temperature, relative humidity, radiation, precipitation, and wind direction and speed to detect the occurrence of radiative frost events. Another area of research using applied machine learning for frost prediction is the study from Maqsood et al. [
21]. The authors present a 24 h weather forecast in southern Saskatchewan, Canada from a set of artificial neural networks, all trained with temperature, relative humidity, and wind speed data. Lira et al. [
22] utilized a spatio-temporal neural network architecture, achieving advancements compared to existing state-of-the-art methods for frost prediction. Similarly, Talsma et al. [
23] explored the performance of two distinct neural network models: a fully connected network and a convolution-based model, by benchmarking them against a Random Forest algorithm. Further contributions to this field include the work of Talsma et al. [
23] and Wassan et al. [
24], both of whom applied convolutional models for frost prediction, showcasing the growing importance of deep learning approaches in addressing this challenge.
In recent studies, Rozante et al. [
25] developed a frost index capable of predicting the possibility of frosts occurring five days in advance for three regions located in the south/southeast of Brazil, and part of Argentina, Uruguay, and Paraguay. This index is obtained from multivariate statistical techniques applied to meteorological variables predicted by a regional model of high spatial and temporal resolution. According to the authors, a comparison between the forecasts of the regional model and the index indicated significant improvements by the index for all regions and forecasts analyzed. Rozante and co-authors [
26] also presented a frost prediction system by using a multi-layer perception neural network. It uses two optimization stochastic gradient descent schemes for the learning process and was applied in the South Region, Brazil.
The present study proposes the use of a methodology based on machine learning for the prediction of frosts in the south and southeast regions of Brazil, and some countries that include Uruguay, Paraguay, northern Argentina, and southeast Bolivia.
The machine learning model developed in this study was compared with the frost index proposed by Rozante et al. [
25], which is currently operational at the National Institute for Space Research (INPE: Instituto Nacional de Pesquisas Espaciais, Brazil). A key innovation of this research is the implementation of a committee machine [
27], which integrates multiple machine learning algorithms to improve prediction accuracy. Two inputs are considered: the frost index computed by Rozante et al. [
25], and a second index derived from a deep learning approach. The AdaBoostClassifier is employed as the voting committee, combining the strengths of both models to enhance the robustness and reliability of frost forecasts.
The paper is structured as follows:
Section 2 provides a brief description of the dataset used in the research, the study area of interest, the experimental setup, the machine learning algorithms, and the evaluation metrics.
Section 3 is dedicated to presenting the results, showcasing the key findings through figures, tables, and statistical analyses that illustrate the performance of the models under various conditions.
Section 4 provides a detailed discussion of the results and concludes with a summary of the main contributions of the study.
2. Materials and Methods
Rozante et al. [
25] define favorable situations for the occurrence of frost in two distinct classes: firstly, the current meteorological conditions; and secondly, conditions such as terrain exposure, proximity to forests, latitude, and altitude. In terms of atmospheric conditions, it is important to note the following: low temperature, clear sky, light winds, high atmospheric pressure, and low humidity.
To classify the atmospheric conditions, five predicted meteorological attributes extracted from the Eta regional meteorological model were used: temperature and relative humidity at 2 m, wind speed at 10 m, mean pressure at sea level, and cloudiness. The attributes were extracted after 24 h forecasting period by Eta model, and the time of minimum temperatures from meteorological ground stations was used to select the Eta model meteorological attributes to compute the frost index with the Rozante et al. strategy [
25], and by using the TensorFlow deep learning approach.
There is no universal definition to characterize the occurrence of frost. Technically, frost is described as the formation of ice crystals on surfaces, either through the freezing of dew or the direct phase transition of water vapor into ice. However, the criteria for identifying frost vary depending on the context. In meteorology, some authors define frost as the occurrence of temperatures equal to or below 0 °C in a standard meteorological shelter, while others consider any air temperature below 0 °C, without specifying the type or height of the shelter. In agriculture, the characterization of frost is heavily dependent on the type of plant and its phenological stage, as each species has varying levels of tolerance to low temperatures. According to da Mota (1987) [
9], leaf temperature thresholds for causing plant damage range from approximately −6 °C for more resistant plants to 0 °C for more sensitive ones. These leaf temperatures correspond to air temperatures between 0 °C and 6 °C when measured in a meteorological shelter, as described by [
28]. Therefore, the chosen temperature range reflects the interaction between meteorological conditions and the potential damage to plants. For our studies and analyses, we adopted the criterion of minimum observed temperatures (
) ≤ 6 °C. This approach aligned our methodology with the evidence reported in the literature and facilitated direct comparisons with previous studies, such as those presented by [
25]. By doing so, we ensured a consistent and scientifically grounded framework for characterizing frost, addressing both agricultural impacts and the robustness of the adopted criteria.
The methodology for frost prediction presented here is general and uses some local conditions, for instance, topography altitude, and latitude. The case study area for frost forecasting using the machine learning approach herein corresponds to the south and southeast of Brazil and some countries that include Uruguay, Paraguay and part of Argentina and Bolivia. The study area and topography are illustrated in
Figure 1. The first region (R1) corresponds to northern Argentina, Uruguay, Paraguay and southeast Bolivia. The second region (R2) encompasses the entire southern region of Brazil, covering the three states: Rio Grande do Sul, Santa Catarina, and Paraná. Finally, the third region (R3) includes the states of São Paulo, Mato Grosso do Sul, Rio de Janeiro, and south of Minas Gerais. These regions encompass diverse landscapes. The Pampas plains in Uruguay and northern Argentina (R1) are characterized by flat, low-lying terrain that is prone to frost formation. The R2 region, located in southern Brazil, features a combination of flat terrain and areas with moderate relief, with altitudes ranging from low to medium. Additionally, this region is known for its extensive agricultural areas, such as the plains and fields of Rio Grande do Sul, which are particularly susceptible to frost formation during periods of low temperatures. The highlands of southeastern Brazil (R3) consist of mountainous areas and valleys, where the topography significantly influences temperature variations and frost occurrence.
2.1. Data
The data used in this study were obtained from the Eta model, which has been operational at the INPE since 1996. The Eta model, originally developed at the University of Belgrade [
29,
30], is a limited-area atmospheric model that utilizes the Arakawa E grid [
31] and a terrain-following vertical coordinate system (
), making it well suited for regions with complex topography. The model covers the entire South American continent and surrounding oceanic areas. For this study, operational forecasts with a spatial resolution of 15 km and 50 vertical levels were employed. The model’s initial and lateral boundary conditions were provided by the analyses and forecasts of the Global Forecast System (GFS).
The analysis of frost patterns was conducted using two distinct data time series. The first dataset consisted of observed minimum temperature (Tobs) measurements collected from conventional meteorological stations distributed by the Global Telecommunication System (GTS) and provided by the National Institute of Meteorology (INMet). The second dataset comprised hourly numerical forecasts obtained from integrations of the regional Eta model. This model was initialized with conditions at 0000 and 1200 UTC, featuring a horizontal resolution of 15 km and 50 vertical levels, as described in Mesinger et al., Black et al., and Chou et al. [
29,
30,
32].
The data collected for the frost prediction experiments was 6 years (2012 to 2017). For the calibration of the model, data were used in the period (2012 to 2016) and for the validation of the index, 2017 was selected.
2.2. IG-Frost Index
As already mentioned, Rozante and co-authors [
25] established a frost index
IG (in Portuguese, Índice de Geada) for a region in South America, to indicate the occurrence or not of frosts, from meteorological variables associated to this event. Five meteorological variables (temperature (T), humidity (H), sea level pressure (P), wind (V), and cloudiness (N)) as predicted from the Eta limited area meteorological model are recorded for the IG calculation.
Averages—indicated by the operator
—and standard deviations of the five variables—Equations (
1) and (
2), respectively—were computed only for observed cases of frost:
where
, or
P as predicted by the Eta model;
denotes the grid points nearest to the positions of the weather stations;
n is the number of days with frost observations;
h is the predicted times (24 h); and
expresses the standard deviations for each variable.
Finally, the IG is computed as a weighted linear combination of the five variables, averages, and standard deviations:
where
u indicates the type of meteorological variable, and
are the weights. The calibration for the IG is described by Rozante et al. [
25], where a set of thresholds
is determined for each grid point and forecast hour for detecting a frost event:
Threshold parameters
depend on the (latitude, longitude) coordinates, the prediction time cycle, and other processes.
2.3. Neural Network
TensorFlow is a robust, open-source framework designed for the development and deployment of advanced machine learning algorithms [
33]. It is applied as a high-level interface for the definition of complex models and as a scalable system optimized for executing computations on large datasets. Initially developed by the Google Brain team in 2011, TensorFlow was engineered to facilitate the exploration and application of large-scale deep neural networks, enabling both cutting-edge research and integration into a wide range of Google products.
TensorFlow is highly versatile, implementing a wide range of machine learning algorithms, particularly deep neural networks. It has been employed across diverse fields within computer science, as well as other disciplines, such as speech recognition, computer vision, robotics, natural language processing, and computational biology. The framework API and reference implementation were made publicly available in November 2015 under the Apache 2.0 license, with access provided at.
Through the utilization of TensorFlow, users can design diverse neural network architectures, which are typically organized with an input layer, one or more hidden layers, and an output layer (
Figure 2). In addition to the number of layers, several parameters must be configured, such as the number of units in the hidden layers, the activation functions for each layer, the initial weights between connections, and the optimization algorithms used during training. These hyperparameters play a crucial role in determining the model’s overall performance.
The Google Colaboratory [
34]—
CoLab —was used for prototyping the machine learning models. This platform is a product from Google Research that allows anybody to write and execute arbitrary Python code through the browser and is especially well suited to machine learning, data analysis, and education. More technically,
CoLab is a hosted Jupyter notebook service that requires no setup to use, while providing access free of charge to computing resources including GPUs [
34].
Figure 2 illustrates the topology of an artificial neural network, with an input layer with eight neurons, two hidden layers with four neurons each, and an output layer with a single neuron.
2.4. Voting Committee
Voting committees (VC) are a type of machine committee [
27], which is a model trained to decide on the best forecast among an ensemble of models. The primary goal of a machine committee is to improve the overall prediction accuracy by combining the strengths of multiple individual models. Each model in the ensemble contributes to the final decision, typically through a voting mechanism. The VC model is trained to determine which forecast’s first index must be considered when there is a divergence between IG and TF forecasts. In this research, the AdaBoostClassifier implementation available in the Scikit-learn Python 1.6.1 module [
35,
36] was used.
The definition of boosting uses the principle that a very accurate prediction can be produced by the combination of several inaccurate models. The general boosting idea is to develop the classifier ensemble incrementally, adding one classifier at a time. The classifier that joins the ensemble at one step is trained on a dataset selectively sampled from the training dataset. The sampling distribution starts from uniform and is updated for each new classifier. The likelihood of the objects being misclassified at the previous step is increased so that they have a higher chance of entering the training sample of the next classifier. The algorithm is called AdaBoost in [
37], which comes from ADAptive BOOSTing [
38].
An AdaBoost classifier [
37] is a meta-estimator that starts by fitting a classifier on the original dataset. It then fits additional copies of the classifier on the same dataset but adjusts the weights of incorrectly classified instances so that subsequent classifiers focus more on the difficult cases.
2.5. Evaluation Metrics
A statistical evaluation was performed using the indices presented in
Table 1. The distribution of observed and predicted cases for positive and negative events is shown in
Table 2, which is used to calculate evaluation indices, such as CSI (Critical Success Index), POD (Probability of Detection), SR (Success Ratio), FAR (False Alarm Ratio), PC (Proportion Correct), and BIAS.
These metrics are commonly used in meteorology to assess the performance of forecast models, providing insights into their strengths and weaknesses in predicting specific events.
2.6. Description of Experiments
Experiments were performed using a meteorological dataset from 2012 to 2017. The dataset consists of several meteorological variables extracted from 24 h forecasts from the Eta Model, such as temperature, pressure, wind speed, cloud cover, humidity, and topography (height above sea level). Also, the observed temperature at several stations in the study area was used to define frost and non-frost events. These experiments consisted of creating frost forecast models using three different approaches: frost index, TensorFlow, and voting committee.
To perform the analysis, the dataset was divided into three regions, as illustrated in
Figure 1. Two time periods were defined for the training and testing phases, corresponding to 2012–2016 and 2017, respectively. The TensorFlow model was trained using data from the 2012–2016 period, with the model configuration detailed in
Table 3. Subsequently, the trained TensorFlow model was applied to the test dataset (2017) to compute the statistical metrics shown in
Table 1.
The voting committee (VC) model was also trained using data from 2012 to 2016. This model uses the same input attributes as the TensorFlow (TF) and frost index (IG) approaches but incorporates the outputs of both models as additional input features. The VC model was specifically designed to improve forecasts in cases where the IG and TF predictions diverged. The trained VC model was then tested on the 2017 dataset, focusing on instances where divergences between IG and TF forecasts occurred. The statistical indices obtained from this application are presented in
Table 1.
The total number of frost and no-frost occurrences is presented in
Figure 3. Generally, the number of frost occurrences is lower than the number of no-frost occurrences. Additionally, it is observed that the training and validation period (2012 to 2016) contains a significantly higher number of frost cases compared to the test period (2017).
A comparison of the 2017 results from the TensorFlow and voting committee models was performed against the frost index approach described by Rozante et al. [
25]. Furthermore, the 24 h model trained with the 2012–2016 dataset was used in a case study to generate a 72 h forecast for 21 May 2018, enabling comparison with a similar case study presented in Rozante et al.
Table 3 presents the hyperparameters and other characteristics of the TensorFlow model.
4. Discussion and Conclusions
Two methodologies for frost prediction were developed in this study: one leveraging deep learning through the TensorFlow (TF) platform and another based on the selection of two frost indexes estimated by the IG approach (see reference [
25]). When these two methodologies converge in their predictions, the forecaster gains greater confidence in disseminating frost warnings. In instances of divergence, a machine learning-based voting committee is employed, reducing subjectivity by selecting the most accurate prediction based on a consensus-driven approach.
The TensorFlow model demonstrated notable performance across several frost prediction cases, particularly on 18 and 20 July 2017, where it achieved a high accuracy, correctly classifying 113 of 115 cases on July 18 and 86 of 105 cases on 20 July. Furthermore, on 24 July 2017, the model successfully identified a frost event in region R1 that was not captured by the IG model. However, its performance varied across regions, with less satisfactory results observed in region R3 on 22 July, where accuracy was comparatively lower. These findings highlight the model’s ability to capture complex patterns in favorable conditions while underscoring the need for refinement to address challenges in regions with more complex climate dynamics.
In summary, the voting committee approach plays a crucial role in refining forecasts and reducing the number of misses, particularly in cases where individual methodologies provide divergent predictions. By integrating the strengths of both the TensorFlow and IG-based approaches, the voting committee enhances the overall forecast reliability and supports more accurate decision-making.
The integration of these methodologies provides a robust and flexible framework for frost prediction. The deep learning model excels in capturing intricate patterns in meteorological data, while the IG-based index contributes interpretability and domain knowledge. By combining these strengths, the voting committee enhances reliability and supports more informed decision-making in operational forecasting.
The methodologies presented in this study, while developed with a focus on South America, demonstrate potential for global applicability. The framework can be adaptable to various climatic and environmental conditions, making it a valuable tool for frost forecasting and risk management in diverse regions worldwide.
Future work could focus on extending the applicability of the voting committee to longer forecast periods and exploring additional machine-learning techniques to further improve prediction accuracy and reliability.