Next Article in Journal
Research on the Synergistic Mechanism of Maize–Soybean Rotation and Bio-Organic Fertiliser in Cold Regions
Previous Article in Journal
Soil-Available Nitrogen and Phosphorus and Their Temporal Stability in the Tibetan Grasslands
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting Heat Treatment Duration for Pest Control Using Machine Learning on a Large-Scale Dataset

by
Stavros Rossos
1,
Paraskevi Agrafioti
2,
Vasilis Sotiroudas
3,
Christos G. Athanassiou
2 and
Efstathios Kaloudis
1,*
1
Computer Simulation, Genomics and Data Analysis Laboratory, Department of Food Science and Nutrition, University of the Aegean, 81400 Myrina, Greece
2
Laboratory of Entomology and Agricultural Zoology, Department of Agriculture, Crop Production and Rural Environment, University of Thessaly, Phytokou Str., 38446 Volos, Greece
3
Agrospecom Ltd., Kalochori, 57009 Thessaloniki, Greece
*
Author to whom correspondence should be addressed.
Agronomy 2025, 15(5), 1254; https://doi.org/10.3390/agronomy15051254
Submission received: 27 March 2025 / Revised: 14 May 2025 / Accepted: 16 May 2025 / Published: 21 May 2025
(This article belongs to the Section Pest and Disease Management)

Abstract

:
Pest control in industrial buildings, such as silos and storage facilities, is critical for maintaining food safety and economic stability. Traditional methods like fumigation face challenges, including insect resistance and environmental concerns, prompting the need for alternative approaches. Heat treatments have emerged as an effective and eco-friendly solution, but optimizing their duration and efficiency remains a challenge. This study leverages machine learning (ML) to predict the duration of heat treatments required for effective pest control in various industrial buildings. Using a dataset of 1423 heat treatment time series collected from IoT devices, we applied exploratory data analysis (EDA) and ML models, including random forest, XGBoost, ridge regression, and support vector regression (SVR), to predict the time needed to reach 50 °C, a critical threshold for pest mortality. Results revealed significant variations in treatment effectiveness based on building type, geographical location, and ambient temperature. XGBoost and random forest models outperformed others, achieving high predictive accuracy. The findings highlight the importance of tailored heat treatment protocols and the potential of data-driven approaches to optimize pest control strategies, reduce energy consumption, and improve operational efficiency in industrial settings. This study underscores the value of integrating IoT and ML for real-time monitoring and adaptive control in pest management.

1. Introduction

Pest control in empty industrial buildings, silos, and similar structures is crucial for preventing disruptions in supply chains. Infestations pose significant threats to food safety, economic stability, and regulatory compliance. Pests can introduce pathogens to where the food is processed or stored, posing health risks [1], while pests such as Tribolium spp. and Sitophilus spp., commonly addressed as flour beetles and grain weevils, can cause substantial loss to quality and quantity in stored grains, which can impact food security [2]. These infestations undermine the safety and quality of food products, leading to significant economic impacts. For instance, insect infestations can lead to damages or destroyed products, resulting in significant financial losses to producers and retailors [3]. Moreover, the presence of pests in food premises can lead to formal actions by food authorities, and stringent pest control measures are essential to prevent the spread of invasive species through international trade, ensuring compliance with global pest control standards [1,4].
Fumigation with phosphine is a traditional and widely used method to control pests in grain storage, but it comes with significant drawbacks [5]. Insects start developing resistance to chemical pesticides, rendering them ineffective over time. Resistance to Bacillus thuringiensis (bt) toxins has been developed by the diamondback moth (Plutella xylostella) and tobacco budworm (Heliothis virescens) [6]. Similarly, many stored product insect populations (Rhyzopertha dominica and, specifically in Greece, Oryzaephilus surinamensis) have developed resistance to phosphine throughout the world, a commonly used fumigant, complicating its effectiveness [5,7,8,9,10,11,12,13]. Extensive use of chemical pesticides leads to environmental pollution and poses a human health hazard [14]. Furthermore, indiscriminate use of chemicals disrupts ecosystems by reducing biodiversity among natural predators, harming non-target species, and overall disrupting food webs in ecosystems [15]. Due to chemicals’ negative health impact and environmental effects, many chemical fumigants face stringent regulatory restrictions. For example, methyl bromide has been banned in several countries, although it is still used in several quarantine applications [14]. Methyl bromide also has a complex application process; effective fumigation requires precise application and monitoring, which is logistically challenging. For instance, constructing gas-tight enclosures and ensuring proper aeration post-fumigation are critical steps that require expertise and resources. Aeriation of the treated facility (e.g., silo, warehouse) is essential after fumigation to ensure that residual fumigants are safely vented and the site is safe for re-entry.
Heat treatments are increasingly recognized as effective and environmentally friendly alternatives to chemical pesticides for managing stored product insects in several structures, such as flour mills, rice mills, or feed mills [16,17]. Heat treatment is based on the application of high temperatures for a relatively short time for the control of different stored product insects without affecting product quality [18]. There are different types of heat treatments that are effective in different applications; for instance, there is the dry heat and forced hot air, the first of which utilizes high temperatures and is effective in flour mills and grain-processing facilities [19,20], and the other involves circulating hot air to maintain uniform temperatures and is commonly used in post-harvest treatments for fruits and vegetables [21,22]. Sterilization of plant media is crucial in horticulture and greenhouse systems, as it prevents the spread of soil-borne pathogens and ensures plant health. Amongst the available techniques, steam and vapor heat are commonly used. Steam is applied to sterilize plant media to control fungal and bacterial pathogens effectively as well as plant-parasitic nematodes [23]. Vapor heat is typically used to disinfest commodities like dates, ensuring pest control without chemical residue [24]. While these methods are more commonly used in post-harvest fruits and horticulture contexts, the underlying principles of heat-based pest control are applicable to industrial settings. Additionally, there are hot water treatments, which are more efficient than hot air treatments and are often used in post-harvest treatments of fruits and also certain plants because of the benefit of increasing rooting, budding, or shelf life [22]. Lastly, radio frequency (RF) heating uses electromagnetic energy to heat commodities, and it is effective for disinfesting legumes and walnuts, though uniformity remains a challenge [25].
There are various applications of heat treatments in industrial facilities. Firstly, there are the flour mills, which use electric heaters and have shown efficacy in controlling pests like the confused flour beetle, Tribolium confusum Jacquelin du Val (Coleoptera: Tenebrionidae), and the rice weevil, Sitophilus oryze (L.) (Coleoptera: Curculionidae) [26,27]. The treatment duration is still a matter of some debate; some studies suggest that a few minutes of elevated temperature is sufficient to disinfest a building completely [28], while others posit that a treatment duration of 24–36 h is typically necessary for complete pest control [20]. Heat treatments are successfully used to control insect pests in stored grains in grain-processing facilities [26,29,30]. Temperature distribution and the use of circulating fans are critical for effective pest control [31]. Also, heat treatments have been successfully applied in Sicilian mills, demonstrating both technical and economic feasibility [32,33].
Heat treatments are valued for reducing environmental impact and leaving no chemical residues [21,32]. They are effective against a broad spectrum of pests, including insects [18,27], nematodes, and plant pathogens [22,23]. A key distinction between heat treatments and chemical fumigation lies in their application methods. Heat treatments are typically applied to empty storage and processing facilities, targeting residual pests in structural components and equipment. In contrast, chemical fumigants such as phosphine are generally used to treat the commodity itself, penetrating the product to eliminate internal infestations while leaving the surrounding equipment untreated [12]. Despite their advantages, heat treatments face notable challenges, primarily the high energy demands associated with heating large buildings. Energy consumption is significantly influenced by the structure and insulation properties of the facility [34]. Moreover, ensuring uniform temperature distribution is essential for achieving effective and consistent pest control outcomes [22,35].
Heat treatments can be monitored and made more efficient using sensors or IoT devices. For example, an IoT system was developed to monitor and regulate temperature in a clean room, achieving a data accuracy rate of approximately 99% [36]. This system includes sensors for dynamic temperature and air conditioning, which can be adapted for monitoring heat treatments in warehouses or silos. The integration of advanced automation and sensor systems is crucial for optimizing thermal treatments. Sensors play a pivotal role in monitoring and regulating parameters such as temperature, pressure, relative humidity, and flow rates, enabling real-time data collection and immediate adjustments to maintain optimal conditions [12,37,38,39]. Specifically, Sakka et al., 2022 deployed temperature wireless sensors and monitored the temperature change during the treatment [27]. In their study, electric and gas heaters were also included, as well as floor fans that were electronically controlled and automatically changed to provide hot air to attain the desired air temperature in the ambient space (>50 °C). The use of IoT (Internet of Things) and wireless sensor networks for industrial automation, including temperature control, enables remote monitoring and control of industrial devices, reducing complexity and costs [40].
Data analysis and machine learning (ML) can play a crucial role in optimizing heat treatment parameters. Exploratory data analysis (EDA) is crucial for thoroughly exploring and analyzing datasets, which is a preliminary step before applying ML models. It helps in identifying key features and understanding complex patterns and dependencies within the data, leading to more accurate predictions [41]. ML models such as random forest, support vector machines, decision tree, and artificial neural network have been employed to predict and optimize heat treatment parameters. These models help in understanding complex patterns and dependencies within the data, leading to more accurate predictions [41,42,43].
Thermal imaging combined with ML algorithms is used for early detection and automated identification of pests and diseases [44]. Various ML techniques, including supervised learning, unsupervised learning, and deep learning, are employed for pest and disease detection. ML techniques help in classifying diseases, detecting anomalies, and improving crop health monitoring, while some of them help in establishing the probability of pest presence and aid in decision making [45,46]. Studies have shown that sensors can be deployed and collect data on gas concentration, temperature, and exposure time that would provide information on treatment effectiveness and help guide the way to improve efficacy [38,47]. AI systems integrated with sensors optimize irrigation, fertilization, and pest management plans. These systems can trigger automatic responses based on real-time data, improving crop yields and reducing resource use [48].
ML models like RF and SVM are used to predict the temperature of heat-carrying agents in heating systems, which are essential for optimizing energy efficiency and diagnosing non-typical processes [42]. Also, ML methods are applied to predict these parameters in heat exchangers, surpassing the accuracy of traditional empirical correlations [43]. While no studies have specifically applied ML to heat treatment in empty silos or empty industrial storage facilities, the principles of ML and data analysis used in other heat treatment systems can be adapted to this context. For instance, predicting the duration or temperature required for effective heat treatments in silos can benefit from similar ML techniques, enabling more precise control and optimization of the process. This study uses data derived from Agrospecom, a pest control company in Greece that collaborates with Centaur [49] and uses IoT devices to monitor heat treatments for pest control.
The primary aim of this study is to address a gap in the literature by gathering and analyzing time series and general heat treatment data for pest control. By leveraging machine learning as a tool, we aim to develop a predictive model that can accurately estimate the duration of each treatment as a function of structural and environmental variables. This approach will not only help optimize heat treatments, reducing energy consumption, but also provide a deeper understanding of the factors influencing the effectiveness of heat treatments in industrial settings. Machine learning serves to achieve these objectives, enabling data-driven insights and more efficient pest control strategies.

2. Materials and Methods

2.1. Data Collection

The dataset used in this study was collected from heat treatment processes conducted in empty industrial buildings and silos over a 2-year span (2022 and 2023). The data were obtained from IoT devices deployed by Agrospecom, Thessaloniki, Greece, accessible through the Internet of Crops platform developed by Centaur, Volos, Greece. The dataset comprised 1423 heat treatments, containing 6503 time series records. Each record included measurements of temperature and time. Data records also included a brief description of the process (e.g., building type), location (latitude, longitude), average temperature, and start time. Derived data, including the month and exact building type, were extracted for each heat treatment. Additionally, time series analysis was performed to determine whether each treatment reached 40 °C and 50 °C (t_40 °C, t_50 °C), the time it took to reach these temperatures, the duration above these thresholds, and whether the temperature exceeded 60 °C (overshot), as Figure 1 explains.
The industrial facilities involved in this study varied significantly in size, structure, and material composition. These included flat storage warehouses, upright cylindrical silos, ISPM-15 compliant wooden crates, and concrete storage rooms. Such structural differences affect the thermal properties of the buildings, with materials like concrete and metal influencing the rate of heat retention and dissipation. The facilities ranged widely in scale, from small structures ranging from 33 to 66 m3 to large industrial facilities like flour mills and flat storage warehouses, with volumes reaching up to approximately 30,000 m3. Typically, heaters are positioned at floor level or near air inlets, and hot air is circulated using high-capacity fans to achieve uniform temperature distribution. Mapping airflow paths is crucial in practice, as uneven heating can lead to cold spots where pests may survive. These structural and operational factors contribute to the variation in heat treatment effectiveness and are reflected in the predictive models developed in this study.
Although the dataset used in this study does not include pre-treatment pest counts, it is important to note that in real-world applications, pest population assessments are commonly performed before heat treatment. Methods such as pitfall traps, probe traps, and visual inspections are used to detect key pests like Tribolium spp. and Sitophilus spp., providing critical information for treatment planning and success evaluation.

2.2. Data Preprocessing

The raw data were preprocessed to ensure quality and consistency before analysis. All preprocessing and analysis were conducted using Python (v. 3.12.4), a programming language widely used in scientific computing and data analysis due to its simplicity, readability, and extensive ecosystem of libraries [50]. The data were structured into a Pandas Dataframe. Pandas is a powerful library for data manipulation and analysis, particularly suited for handling structured data [51]. To incorporate environmental factors, the Open-Meteo API [52] was used to fetch the average daily temperature for each treatment based on its location and date.
The preprocessing steps began with handling missing values, which were identified and removed to maintain data accuracy and realism. This step ensured that only complete and reliable records were used for analysis. Sensors with unrealistically low durations before reaching 40 °C (less than 12 min) were also removed, as they were deemed unreliable for analysis. Each treatment was categorized according to building type (industrial storage, containers, ipsm15, rooms), and unrecognized building types were classified as “Others” and excluded from further analysis. To account for the circularity of time, the start time and month were encoded to cyclical variables using sine and cosine transformations. Time-based features, such as hours of the day or months of the year, are inherently cyclical, meaning that the end of cycle (e.g., 23:59) is adjacent to the beginning (e.g., 00:00). However, many machine learning models treat features as linear, which can lead to incorrect assumptions about the relationships between adjacent time points. For example, the model might interpret 23:59 and 00:00 as being far apart, even though they are only one minute apart in reality. To address this, we transformed the start time and month into two new features, one representing the sine of the time and the other representing the cosine of the time. For start time, the hour of the day (ranging from 0 to 23) was converted into an angle (ranging from 0 to 2π) and then transformed using sine and cosine functions. The months of the year (ranging from 1 to 12) were similarly converted into an angle and transformed. Geographical clustering was performed using the DBSCAN algorithm from the scikit-learn library [53], which grouped treatments based on latitude and longitude, allowing for the identification of regional patterns in heat treatment effectiveness. Previous studies have shown that susceptibility to heat varies across insect life stages, with eggs and pupae often being more tolerant than larvae or adults. Effective mortality typically requires several minutes of exposure above 50 °C (t_abv_50 °C), with Fields [28] reporting that a duration above 50 °C (t_abv_50 °C) of a few minutes is required for complete pest mortality; for this purpose, the threshold of 4.8 min (0.08 h) was chosen. Finally, categorical data, such as building type, were encoded using the “get_dummies” function in pandas [51], creating a separate dataset suitable for machine learning models that require numerical input.

2.3. Exploratory Data Analysis (EDA)

Exploratory data analysis was conducted to uncover patterns and relationships in the dataset. A geographical heatmap was created using the Folium library to visualize the distribution of heat treatments across different locations. The heatmap highlighted clusters of treatments and their density, providing insights into regional trends. Temporal analysis was performed to calculate the average duration until 50 °C by month, and the average temperature of major geographical clusters (representing mostly Greek cities) was analyzed monthly to identify seasonal trends. The Pearson correlation coefficient, calculated using the scipy library [54], was used to assess the relationship between the average duration until 50 °C and the monthly average temperature. Statistical tests, including one-way ANOVA [55], were performed to evaluate differences in treatment effectiveness across building categories. Post hoc analysis was conducted using Tukey’s test to identify specific group differences.

2.4. Machine Learning Model Preparation, Evaluation, and Refinement

The dataset was further modified to make it suitable for machine learning (ML) training. The parameters used as input features for the ML models included environmental temperature, which was derived from the average temperature during the heat treatment using the Open-Meteo API. To account for the cyclical nature of the time-based data, the start time and month were encoded as cyclical variables using sine and cosine transformations, resulting in features such as “time sin”, “time cos”, “month sin”, and “month cos”. Additionally, the building type (e.g., industrial storage, containers, rooms) was one-hot encoded to represent categorical data numerically, while the geographical location of each treatment was represented by clusters generated using the DBSCAN algorithm, replacing raw latitude and longitude coordinates. The target variable for the ML models was the duration until the temperature reached 50 °C (t_till_50 °C), a critical threshold for effective pest control. ML tasks were performed using the scikit-learn library [53].
The dataset was split into training (80%) and testing (20%) sets to ensure robust model evaluation. The target variable was set as the duration until the temperature reaches 50 °C. The data were scaled to ensure compatibility with the support vector regressor (SVR) model. Scaling was not required for other models, such as random forest regressor, XGBoost regressor, and ridge regressor, which are capable of handling non-linear relationships without preprocessing. Regression models were chosen because the goal was to predict numerical values (duration) rather than probabilities or binary outcomes. The selected models included random forest regressor [53], an ensemble method known for its robustness and ability to handle non-linear relationships, XGBoost regressor [56], a gradient boosting algorithm renowned for its performance in regression tasks, ridge regressor [53], a linear model with L2 regularization to prevent overfitting, and support vector regression (SVR) [53], a kernel-based method effective for capturing complex, non-linear relationships.
Grid search with cross-validation was used to optimize hyperparameters for each model, ensuring optimal performance. Grid search is an exhaustive search technique that evaluates all possible combinations of hyperparameters within a predefined search space. For each combination, the model is trained and evaluated using k-fold cross-validation, where the dataset is split into k subsets (folds) and the model is trained on k-1 folds while being validated on the remaining fold. This process is repeated k times, with each fold serving as the validation set once. The performance metric (negative mean absolute error) is averaged across all folds to provide a robust estimate of the model’s generalization ability. The combination of hyperparameters that yields the best performance is selected as the optimal configuration. In this study, a parameter grid was defined to specify the range of hyperparameters to be explored (e.g., learning rate, number of trees, maximum depth of trees, minimum samples required at leaf node, etc.).
The grid search was implemented using the GridSearchCV class from the scikit-learn library [53], with 5-fold cross-validation and negative mean absolute error (MAE) as the scoring metric. The negative MAE was used because scikit-learn optimizes for higher scores, and minimizing MAE is equivalent to maximizing its negative value. After fitting the grid search, the best hyperparameters and corresponding scores were identified for each model.
The performance of each model was evaluated using metrics such as mean absolute error (MAE), mean squared error (MSE), and R2 score. Visualizations, including actual vs. predicted values and residual plots, were generated using Matplotlib to visually assess model performance. These visualizations provided insights into the models’ ability to capture the underlying patterns in the data and identify any systematic errors.
To further refine the models, the two best-performing models (based on MAE, MSE, and R2 scores) were retrained on data subsets to specific building categories (e.g., industrial storage, containers, rooms). This approach aimed to improve model performance by capturing category-specific patterns.

3. Results

3.1. Statistical Analysis

The effectiveness of heat treatments was evaluated across different building categories using one-way ANOVA, which revealed significant differences in the duration until 50 °C (t_50 °C) among groups (F-statistic = 352.98, p-value < 0.001). Post hoc analysis using Tukey’s test further identified specific pairwise differences between categories as shown in Table 1. For instance, treatments in industrial storages required significantly longer durations to reach 50 °C compared to containers (mean difference = 6.54 h, p-adj < 0.001) and rooms (mean difference = 2.08 h, p-adj < 0.001). Conversely, no significant differences were observed between containers and housing (mean difference = 0.11 h, p-adj = 1) or between IPSM15 and rooms (mean difference = −0.12 h, p-adj = 0.999). These results suggest that building type significantly influences the time required to achieve heat treatment.

3.2. Temporal and Geographical Trends

Temporal analysis revealed seasonal variations in the average duration until 50 °C and mean environmental temperature across months. The mean t_50 °C and the environmental mean temperature, visualized in a histogram as shown in Figure 2 and Figure 3, each showed a seasonal pattern, with higher temperatures and lower durations during summer months and lower temperatures and longer durations during winter. This trend was consistent across geographical clusters, as indicated by the geographical heatmap in Figure 3, which highlights the distribution of treatments across different regions. The geographical heatmap revealed dense clusters of treatments in urban areas, such as Athens and Thessaloniki, with fewer treatments in rural regions. The heatmap (Figure 4) reflects Agrospecom’s business activities, which are concentrated in regions with high industrial activity and significant grain production, such as Thessaly and northern Greece. It is important to note that the heatmap represents only Agrospecom’s treatments and not all heat treatments in Greece. The observed clusters highlight the company’s focus on areas with high demand for pest control services, particularly in industrial and agricultural settings.
A Pearson correlation analysis was conducted to assess the relationship between the mean t_50 °C and the mean environmental temperature by month to verify the significance of Figure 2 and Figure 3. The analysis yielded a moderate negative correlation (r = −0.65, p-value = 0.02), indicating that higher ambient temperatures are associated with shorter duration to reach 50 °C. This finding underscores the influence of environmental conditions on heat treatment efficiency.

3.3. Machine Learning Model Performance

Several machine learning models were trained to predict the duration until 50 °C based on building type, geographical cluster, and environmental factors. The models included random forest regressor, XGBoost regressor, ridge regressor, and support vector regression (SVR). Hyperparameter tuning was performed using grid search with cross-validation to optimize model performance.
The XGBoost regressor achieved the best overall performance, with a mean absolute error (MAE) of 0.50 h, a mean squared error (MSE) of 2.07, and an R2 score of 0.81. This indicates that the model was able to accurately predict the duration until 50 °C, capturing 81% of the variance in the target variable. The random forest regressor also performed well, with an MAE of 0.47 h, MSE of 2.13, and R2 score of 0.80. Both tree-based models outperformed the ridge regressor (MAE = 1.19, R2 = 0.44) and SVR (MAE = 0.63, R2 = 0.73), highlighting the superiority of ensemble methods for this task. The relatively poor performance of ridge regression suggests that the relationships between the predictors and the target variable are non-linear, which tree-based models are better equipped to handle.
To further refine the models, the top-performing models (Random Forest and XGBoost) were retrained on data subsets corresponding to specific building categories. As shown in Table 2, both models demonstrated exceptional performance for IPSM15, with XGBoost achieving an MAE of 0.58 h, MSE of 0.92, and R2 score of 1.00 and random forest achieving an MAE of 0.60 h, MSE of 1.06, and R2 score of 1.00. Across all building categories, random forest consistently outperformed XGBoost in overall scores, highlighting its robustness and ability to capture complex patterns in the data. However, both models underperformed in the rooms category, with random forest achieving an MAE of 1.46 h and an R2 Score of 0.55, and XGBoost achieving of MAE of 1.48 h and an R2 score of 0.57. This underperformance is likely due to the limited number of treatments recorded for the rooms category, which may have restricted the models’ ability to learn robust patterns.
The performance of the models was further evaluated using visualizations. Visual analysis of actual vs. predicted values and residual plots provided insights into the predictive accuracy and potential biases of each model. The XGBoost model demonstrated strong predictive performance, with predicted values closely aligning with the actual values, as shown in Figure 5. The red dashed line, representing the ideal fit, indicates that most predictions are accurate. The residual plot for XGBoost (Figure 6) further confirms this, with residuals randomly distributed around zero, suggesting no systematic bias in the model’s predictions. Similar behavior was observed for the RF model, indicating robust performance for both ensemble methods.
In contrast, the ridge regression model exhibited systematic bias, as evident from the deviation of predicted values from the ideal fit line. This bias suggests that the model consistently under-predicts t_50 °C in certain ranges, which may limit its reliability for accurate predictions, while overpredicting medium to higher t_50 °C values (Figure 6). The increasing variance observed in Figure 7 at higher predicted treatment durations suggests that model performance diminishes for longer heating events. This effect is likely due to heteroscedasticity in the dataset, where larger or more complex structures—typically associated with longer treatment times—exhibit greater variability in thermal behavior. Structural features such as insulation quality, air leakage, and internal airflow dynamics, which are not fully captured by the general “building type” variable, may account for this variation. On the other hand, the SVR model did not show any significant patterns or biases in its predictions. While the model’s performance was consistent, it did not stand out in terms of predictive accuracy compared to XGBoost and RF.

4. Discussion

The findings of this study provide valuable insights into the factors influencing the effectiveness of heat treatments for pest control in industrial buildings and silos. By analyzing a large dataset of heat treatment processes, we identified significant variations in treatment effectiveness across different building types, geographical regions, and environmental conditions. These results have important implications for optimizing heat treatment protocols and improving pest management strategies in real-world conditions.
One of the key findings of this study is the significant influence of building type on the duration required to reach 50 °C, a critical threshold for effective pest control. Treatments in industrial storages required significantly longer durations compared to containers and rooms, as revealed by the one-way ANOVA and Tukey’s test. This variation can be attributed to differences in building size, insulation, and structural materials, which affect heat distribution and retention [32,34]. For example, Jian et al., 2012 underlined the difficulties associated with uniformly heating large-scale flour mills, emphasizing that areas with thicker walls or inadequate insulation frequently operate as “thermal sinks”, slowing down the heating process and creating refuges where pests can survive. Moreover, the complexity of the internal equipment and stored commodities in such facilities contributes to temperature stratification and impedes uniformity [31]. Several published studies have emphasized the importance of achieving uniform heat distribution in a treated facility and identified that the internal fluctuations of the temperature profile are the major factor contributing to insect survival [28,29]. These findings indicate that specialized heat treatment techniques, such as strategic heater placement and real-time temperature monitoring, are required for larger structures to ensure lethal temperatures are maintained throughout the facility.
The analysis also revealed a moderate negative correlation (r = −0.65, p-value = 0.02) between the average duration until 50 °C and the mean temperature by month. This relationship has been extensively documented in prior studies, in which higher ambient temperatures during summer months resulted in shorter heating periods and lower energy consumption [18,27,31]. For instance, Mahroof et al., 2003 showed that ambient temperatures above 25 °C considerably improved the effectiveness of heat treatments in food-processing facilities, reducing the risk of treatment failure due to incomplete heating [29]. The results of the present study are in accordance with the previous studies, where higher ambient temperatures, particularly during warmer months, facilitate faster heating and reduce the time required to achieve effective pest control. This finding emphasizes the role of environmental conditions in heat treatment performance. However, the seasonal variability of heat treatments also underscores the need for adaptive treatment protocols that account for fluctuations in ambient temperature. While ambient temperature was incorporated into the model, relative humidity was not included due to inconsistent availability across the dataset. Humidity can influence both pest mortality and heat distribution, and its integration into future models could enhance predictive performance and ecological validity. Additionally, while the model captures structural and environmental variation, it does not account for differences in thermal tolerance among pest species. Since different insects exhibit varying degrees of heat resistance, incorporating species-specific biological parameters in future studies could improve the accuracy and applicability of treatment duration prediction.
The geographical heatmap highlighted regional variations in treatment frequency, with dense clusters of treatments in urban areas and fewer treatments in rural regions. This pattern likely reflects the distribution of industrial activity, as urban areas tend to have higher concentrations of food-processing and storage facilities. Gerken and Campbell (2022) observed similar patterns, finding that the pest management treatments are common in locations with a dense network of storage-processing facilities [3]. The clustering of treatments suggests that regional factors, such as climate and infrastructure, may influence the implementation and effectiveness of heat treatments.
The machine learning models developed in this study demonstrated strong predictive performance, particularly the random forest regressor and XGBoost regressor. These models outperformed linear methods like ridge regression, highlighting the non-linear nature of the relationships between predictors and the target variable. The success of tree-based models can be attributed to their ability to capture complex interactions between building type, geographical cluster, and environmental factors. While the models employed in this study are considered classical, their application to real-time prediction of industrial heat treatment duration represents a novel use case. These models offer the advantage of interpretability and operational readiness, making them particularly suitable for industry adoption. The results demonstrate that even with established algorithms, significant performance can be achieved when paired with structured real-world data. The integration of IoT monitoring systems, as demonstrated by Agrafioti et al., 2019 and Sakka et al., 2022, highlighted that these predictive models provide real-time data from several points within each facility [18,27]. The exceptional performance of category-specific models further underscores the importance of tailoring heat treatment protocols to specific building types and conditions.
The results of this study have practical implications for pest control in industrial settings. Firstly, the significant variation in treatment effectiveness across building types highlights the need for customized treatment protocols that account for structural differences. For example, industrial storages may require longer treatment durations or additional heating sources to achieve uniform temperatures. Secondly, the influence of ambient temperature suggests that treatment schedules should be adjusted seasonally to optimize efficiency. Finally, the success of machine learning models in predicting treatment duration demonstrates the potential of data-driven approaches to improve pest control strategies. By integrating IoT devices and predictive models, industrial facilities can monitor and optimize heat treatments in real time, reducing energy consumption and operational costs.
The observed impact of building type on treatment duration likely stems from underlying physical and structural differences among facility types. For example, flat warehouses with large open volumes may experience slower heating due to air stratification and poor circulation, while upright silos heat more rapidly but may lose heat through exposed metal walls. Similarly, materials such as concrete offer better thermal mass and insulation compared to steel or wood, influencing both heat retention and energy requirements. These structural and material characteristics play a critical role in determining the dynamics of heat distribution during treatment, and practitioners should consider them when designing or adjusting thermal protocols.
Despite these promising outcomes, certain limitations should be acknowledged. The dataset analyzed in that study was geographically limited to industrial facilities in Greece, which may limit the generalizability of the findings to other regions with different climate conditions and building structures. Future research should expand the geographical scope to include diverse industrial settings. Moreover, while the building type and ambient temperature were examined, some other factors, such as pest species, infestation levels, and building occupancy patterns, could be incorporated to further improve model accuracy. Furthermore, the machine learning models were trained on historical data, which may not fully capture the dynamic nature of heat treatment processes. Future studies should focus on incorporating real-time monitoring and adaptive control systems to enhance treatment effectiveness [42,43]. Finally, although this study uses treatment duration as an indicator of operational efficiency, a direct comparison of energy consumption between machine learning-guided protocols and conventional methods was not conducted. Future studies could address this gap by linking model predictions with energy use metrics to quantify potential cost savings and sustainability benefits.

5. Conclusions

This study shows that building structures, climatic circumstances, and data-driven approaches play an important role in optimizing heat treatments for pest control in stored products. Furthermore, treatment efficiency varies significantly between facility types, highlighting the importance of specific protocols that take structural aspects like size and insulation into account. Furthermore, ambient temperature and seasonal fluctuations were shown to affect heating efficiency, implying that treatment regimens should be strategically altered to maximize energy savings and pest control performance. The successful implementation of machine learning models, when paired with real-time IoT monitoring, offers a potential foundation for improving treatment precision, reducing operational costs, and minimizing the environmental impact.

Author Contributions

Conceptualization, V.S. and E.K.; data curation, S.R.; investigation, V.S.; methodology, S.R., P.A., C.G.A., and E.K.; resources, V.S.; software, S.R.; supervision, E.K.; visualization, S.R.; writing—original draft, S.R.; writing—review and editing, P.A., C.G.A., and E.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Restrictions apply to the availability of these data.

Conflicts of Interest

Author Vasilis Sotiroudas was employed by the company Agrospecom. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Elson, R. Control of infestation. In Hobbs’ Food Poisoning and Food Hygiene, 7th ed.; CRC Press: Boca Raton, FL, USA, 2007; pp. 264–274. [Google Scholar] [CrossRef]
  2. Yang, H.; Li, Y.; Xin, L.; Teng, S.W.; Pang, S.; Zhao, H.; Cao, Y.; Zhou, X. MCSNet+: Enhanced Convolutional Neural Network for Detection and Classification of Tribolium and Sitophilus Sibling Species in Actual Wheat Storage Environments. Foods 2023, 12, 3653. [Google Scholar] [CrossRef] [PubMed]
  3. Gerken, A.R.; Campbell, J.F. Spatial and Temporal Variation in Stored-Product Insect Pest Distributions and Implications for Pest Management in Processing and Storage Facilities. Ann. Entomol. Soc. Am. 2022, 115, 239–252. [Google Scholar] [CrossRef]
  4. Duarte, S.; Magro, A.; Barros, G.; Carvalho, M.O. Stored products insects in Portugal–New data and overview. J. Stored Prod. Res. 2024, 105, 102230. [Google Scholar] [CrossRef]
  5. Nayak, M.K.; Daglish, G.J.; Phillips, T.W.; Ebert, P.R. Resistance to the Fumigant Phosphine and Its Management in Insect Pests of Stored Products: A Global Perspective. Annu. Rev. Entomol. 2020, 65, 333–350. [Google Scholar] [CrossRef]
  6. Borovsky, D. Trypsin modulating oostatic factor for developing resistant crops. In Insecticides Design Using Advanced Technologies; Springer: Berlin/Heidelberg, Germany, 2007; Volume 9783540469070, pp. 135–149. [Google Scholar] [CrossRef]
  7. Lorini, I.; Collins, P.J.; Daglish, G.J.; Nayak, M.K.; Pavic, H. Detection and characterisation of strong resistance to phosphine in Brazilian Rhyzopertha dominica (F.) (Coleoptera: Bostrychidae). Pest Manag. Sci. 2007, 63, 358–364. [Google Scholar] [CrossRef] [PubMed]
  8. Kaur, R.; Subbarayalu, M.; Jagadeesan, R.; Daglish, G.J.; Nayak, M.K.; Naik, H.R.; Ramasamy, S.; Subramanian, C.; Ebert, P.R.; Schlipalius, D.I. Phosphine resistance in India is characterised by a dihydrolipoamide dehydrogenase variant that is otherwise unobserved in eukaryotes. Heredity 2015, 115, 188–194. [Google Scholar] [CrossRef]
  9. Sağlam, Ö.; Edde, P.A.; Phillips, T.W. Resistance of Lasioderma serricorne (Coleoptera: Anobiidae) to Fumigation with Phosphine. J. Econ. Entomol. 2015, 108, 2489–2495. [Google Scholar] [CrossRef] [PubMed]
  10. Aulicky, R.; Stejskal, V.; Frydova, B.; Athanassiou, C.G. Susceptibility of Two Strains of the Confused Flour Beetle (Coleoptera: Tenebrionidae) Following Phosphine Structural Mill Fumigation: Effects of Concentration, Temperature, and Flour Deposits. J. Econ. Entomol. 2015, 108, 2823–2830. [Google Scholar] [CrossRef]
  11. Afful, E.; Elliott, B.; Nayak, M.K.; Phillips, T.W. Phosphine Resistance in North American Field Populations of the Lesser Grain Borer, Rhyzopertha dominica (Coleoptera: Bostrichidae). J. Econ. Entomol. 2018, 111, 463–469. [Google Scholar] [CrossRef]
  12. Agrafioti, P.; Athanassiou, C.G.; Nayak, M.K. Detection of phosphine resistance in major stored-product insects in Greece and evaluation of a field resistance test kit. J. Stored Prod. Res. 2019, 82, 40–47. [Google Scholar] [CrossRef]
  13. Agrafioti, P.; Kaloudis, E.; Kateris, D.; Athanassiou, C.G. Assessment of Phosphine Resistance in Major Stored-Product Insects in Greece Using Two Diagnostic Protocols. Insects 2024, 15, 802. [Google Scholar] [CrossRef]
  14. Yahia, E.M. Responses of some fruits to gas and heat stress. Acta Hortic. 2006, 712 II, 635–641. [Google Scholar] [CrossRef]
  15. Mohapatra, S.; Padhi, J.; Singh, S. Enhancing yield and economic benefits through sustainable pest management in Okra cultivation. Sci. Rep. 2024, 14, 22220. [Google Scholar] [CrossRef]
  16. Dean, G.A. Heat as a Means of Controlling Mill Insects. J. Econ. Entomol. 1911, 4, 142–161. [Google Scholar] [CrossRef]
  17. Dean, G.A. Further Data on Heat as a Means of Controlling Mill Insects. J. Econ. Entomol. 1913, 6, 40–55. [Google Scholar] [CrossRef]
  18. Agrafioti, P.; Athanassiou, C.G.; Subramanyam, B. Efficacy of heat treatment on phosphine resistant and susceptible populations of stored product insects. J. Stored Prod. Res. 2019, 81, 100–106. [Google Scholar] [CrossRef]
  19. Hansen, J.D.; Johnson, J.A.; Winter, D.A. History and use of heat in pest control: A review. Int. J. Pest Manag. 2011, 57, 267–289. [Google Scholar] [CrossRef]
  20. Campolo, O.; Verdone, M.; Laudani, F.; Malacrinò, A.; Chiera, E.; Palmeri, V. Response of four stored products insects to a structural heat treatment in a flour mill. J. Stored Prod. Res. 2013, 54, 54–58. [Google Scholar] [CrossRef]
  21. Tang, J.; Mitcham, E.; Wang, S.; Lurie, S. Heat Treatments for Postharvest Pest Control: Theory and Practice; CAB International: Wallingford, UK, 2007; p. 349. [Google Scholar]
  22. Huang, Z.; Yan, R.; Li, R.; Zhu, H.; Ling, B.; Wang, S. Finite element analysis on fruit temperature fields based on postharvest disinfestations with hot air and water treatments. Nongye Gongcheng XuebaoTransactions Chin. Soc. Agric. Eng. 2014, 30, 252–259. [Google Scholar] [CrossRef]
  23. Hara, A.H. Heat as a sustainable postharvest disinfestation treatment for export horticultural crops. Acta Hortic. 2013, 973, 45–54. [Google Scholar] [CrossRef]
  24. Finkelman, S.; Navarro, S.; Rindner, M.; Dias, R. Use of heat for disinfestation and control of insects in dates: Laboratory and field trials. Phytoparasitica 2006, 34, 37–48. [Google Scholar] [CrossRef]
  25. Wang, S.; Monzon, M.; Johnson, J.A.; Mitcham, E.J.; Tang, J. Industrial-scale radio frequency treatments for insect control in walnuts. I: Heating uniformity and energy efficiency. Postharvest Biol. Technol. 2007, 45, 240–246. [Google Scholar] [CrossRef]
  26. Boina, D.; Subramanyam, B. Relative Susceptibility of Tribolium confusum Life Stages Exposed to Elevated Temperatures. J. Econ. Entomol. 2004, 97, 2168–2173. [Google Scholar] [CrossRef] [PubMed]
  27. Sakka, M.K.; Jagadeesan, R.; Nayak, M.K.; Athanassiou, C.G. Insecticidal effect of heat treatment in commercial flour and rice mills for the control of phosphine-resistant insect pests. J. Stored Prod. Res. 2022, 99, 102023. [Google Scholar] [CrossRef]
  28. Fields, P.G. The control of stored-product insects and mites with extreme temperatures. J. Stored Prod. Res. 1992, 28, 89–118. [Google Scholar] [CrossRef]
  29. Mahroof, R.; Subramanyam, B.; Throne, J.E.; Menon, A. Time-Mortality Relationships for Tribolium castaneum (Coleoptera: Tenebrionidae) Life Stages Exposed to Elevated Temperatures. J. Econ. Entomol. 2003, 96, 1345–1351. [Google Scholar] [CrossRef]
  30. Yu, C.; Subramanyam, B.; Flinn, P.W.; Gwirtz, J.A. Susceptibility of Lasioderma serricorne (Coleoptera: Anobiidae) Life Stages to Elevated Temperatures Used During Structural Heat Treatments. J. Econ. Entomol. 2011, 104, 317–324. [Google Scholar] [CrossRef]
  31. Jian, F.; Fields, P.G.; Jayas, D.S.; White, N.D.G.; Loganathan, M. Measured and Predicted Temperatures in a Grain Processing Building under Heat Treatment—1. Temperature Profiles during Heat Treatment. Can. Biosyst. Eng./Genie Biosyst. Can. 2012, 54, 3.1–3.8. [Google Scholar] [CrossRef]
  32. Suma, P.; Chinnici, G.; La Pergola, A.; Russo, A.; Bella, S.; Pecorino, B.; Pappalardo, G. Assessing the Technical Effectiveness and Economic Feasibility of Pest Management Through Structural Heat Treatment: An Entomological and Economic Analysis in Four Mills in Sicily (Italy). J. Econ. Entomol. 2019, 112, 957–962. [Google Scholar] [CrossRef]
  33. Pappalardo, G.; Chinnici, G.; Pecorino, B. Assessing the economic feasibility of high heat treatment, using evidence obtained from pasta factories in Sicily (Italy). J. Clean. Prod. 2017, 142, 2435–2445. [Google Scholar] [CrossRef]
  34. Strano, L.; Tomaselli, G. Analysis of the building system of four mills and their suitability for heat treatment pest disinfestation. J. Agric. Eng. 2013, 44, 742–749. [Google Scholar] [CrossRef]
  35. Huang, Z.; Zhu, H.; Wang, S. Computer simulation of radio frequency heating for insect control in dried soybeans. ASABE Eng. Sustain. Future 2014, 1, 369–385. [Google Scholar] [CrossRef]
  36. Amangeldy, B.; Tasmurzayev, N.; Mansurova, M.; Imanbek, B.; Sarsembayeva, T. Design and Development of IoT Based Medical Cleanroom. Commun. Comput. Inf. Sci. 2023, 1864, 459–469. [Google Scholar] [CrossRef]
  37. Buratto, W.G.; Muniz, R.N.; Nied, A.; Barros, C.F.D.O.; Cardoso, R.; Gonzalez, G.V. A Review of Automation and Sensors: Parameter Control of Thermal Treatments for Electrical Power Generation. Sensors 2024, 24, 967. [Google Scholar] [CrossRef]
  38. Agrafioti, P.; Kaloudis, E.; Bantas, S.; Sotiroudas, V.; Athanassiou, C.G. Modeling the distribution of phosphine and insect mortality in cylindrical grain silos with Computational Fluid Dynamics: Validation with field trials. Comput. Electron. Agric. 2020, 173, 105383. [Google Scholar] [CrossRef]
  39. Agrafioti, P.; Sotiroudas, V.; Kaloudis, E.; Bantas, S.; Athanassiou, C.G. Real time monitoring of phosphine and insect mortality in different storage facilities. J. Stored Prod. Res. 2020, 89, 101726. [Google Scholar] [CrossRef]
  40. Mir, A.; Khachane, A. Sensing Harmful Gases in Industries Using IOT and WSN. In Proceedings of the 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), Pune, India, 16–18 August 2018. [Google Scholar] [CrossRef]
  41. Leni, D.; Karudin, A.; Abbas, M.R.; Sharma, J.K.; Adriansyah, A. Optimizing stainless steel tensile strength analysis: Through data exploration and machine learning design with streamlit. EUREKA Phys. Eng. 2024, 2024, 73–88. [Google Scholar] [CrossRef]
  42. Perekrest, A.; Chenchevoi, V.; Chencheva, O.; Kovalenko, A.; Kushch-Zhyrko, M.; Kalizhanova, A.; Amirgaliyev, Y. Prediction model of public houses’ heating systems: A comparison of support vector machine method and random forest method. Inform. Autom. Pomiary Gospod. Ochr. Srodowiska 2022, 12, 34–39. [Google Scholar] [CrossRef]
  43. Galicia, E.S.; Matamoros, A.H.; Miyara, A. Prediction of heat transfer coefficient and pressure drop of flow boiling and condensation using machine learning. In Proceedings of the 9th European Thermal Sciences Conference (Eurotherm 2024), Bled, Slovenia, 10–13 June 2024; Volume 2766. [Google Scholar] [CrossRef]
  44. Mirzaev, K.-G.; Kiourt, C. Machine Learning and Thermal Imaging in Precision Agriculture. Lect. Notes Netw. Syst. 2024, 1093, 168–187. [Google Scholar] [CrossRef]
  45. Amulothu, D.V.R.T.; Rodge, R.R.; Hasan, W.; Gupta, S. Machine learning for pest and disease detection in crops. In Agriculture 4.0: Smart Farming with IoT and Artificial Intelligence; Routledge: Abingdon-on-Thames, UK, 2024; pp. 111–132. [Google Scholar]
  46. Lasso, E.; Corrales, J.C. Towards an alert system for coffee diseases and pests in a smart farming approach based on semi-supervised learning and graph similarity. Adv. Intell. Syst. Comput. 2018, 687, 111–123. [Google Scholar] [CrossRef]
  47. Brabec, D.; Campbell, J.; Arthur, F.; Casada, M.; Tilley, D.; Bantas, S. Evaluation of Wireless Phosphine Sensors for Monitoring Fumigation Gas in Wheat Stored in Farm Bins. Insects 2019, 10, 121. [Google Scholar] [CrossRef]
  48. Grant, S.; Castaneda, J.; Samouei, H.; Tabei, F.; Parker, D.; Askarian, B. A Novel AI Method for Efficient and Sustainable Tomato Farming. In Proceedings of the 2023 ASABE Annual International Meeting, Omaha, NE, USA, 9–12 July 2023. [Google Scholar] [CrossRef]
  49. Centaur. Centaur-Build. Glob. Post-Harvest Qual. Chain. Available online: https://centaur.ag (accessed on 3 May 2025).
  50. Van Rossum, G.; Drake, F.L. (Eds.) The Python Language Reference, Release 3.0.1 [Repr.]. In Python Documentation Manual; Python Software Foundation: Hampton, NH, USA, 2010. [Google Scholar]
  51. The Pandas Development Team. Pandas-Dev/Pandas: Pandas. 2024. Available online: https://pandas.pydata.org/about/citing.html (accessed on 3 May 2025).
  52. Zippenfenig, P. Open-Meteo.com Weather API 2024. Available online: https://open-meteo.com/en/license (accessed on 3 May 2025).
  53. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  54. Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]
  55. Stahle, L.; Wold, S. Analysis of variance (ANOVA). Chemom. Intell. Lab. Syst. 1989, 6, 259–272. [Google Scholar] [CrossRef]
  56. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: San Francisco, CA, USA, 2016; pp. 785–794. [Google Scholar] [CrossRef]
Figure 1. Indicative plot of temperature over time during a heat treatment process. The plot illustrates the temperature profile of a typical heat treatment, with annotations for key parameters: time to reach 50 °C (t_50 °C) (4.00 h), duration above 40 °C (t_abv_40 °C) (13.00 h), duration above 50 °C (t_abv_50 °C) (10.00 h), and overshooting of 60 °C (6.00 h). The red dashed line indicates the 50 °C threshold, essential for effective pest control, while the orange dashed line marks the 60 °C threshold, highlighting potential inefficiencies or risks.
Figure 1. Indicative plot of temperature over time during a heat treatment process. The plot illustrates the temperature profile of a typical heat treatment, with annotations for key parameters: time to reach 50 °C (t_50 °C) (4.00 h), duration above 40 °C (t_abv_40 °C) (13.00 h), duration above 50 °C (t_abv_50 °C) (10.00 h), and overshooting of 60 °C (6.00 h). The red dashed line indicates the 50 °C threshold, essential for effective pest control, while the orange dashed line marks the 60 °C threshold, highlighting potential inefficiencies or risks.
Agronomy 15 01254 g001
Figure 2. The average duration required to reach 50 °C for each month, ordered chronologically from January to December. The results indicate a clear seasonal trend, with shorter durations during warmer months (e.g., July and August) and longer durations during the colder months (e.g., January and February). This trend aligns with the influence of ambient temperature on heat treatment efficiency, as higher external temperatures facilitate faster heating.
Figure 2. The average duration required to reach 50 °C for each month, ordered chronologically from January to December. The results indicate a clear seasonal trend, with shorter durations during warmer months (e.g., July and August) and longer durations during the colder months (e.g., January and February). This trend aligns with the influence of ambient temperature on heat treatment efficiency, as higher external temperatures facilitate faster heating.
Agronomy 15 01254 g002
Figure 3. The average environmental temperature for each month, ordered chronologically from January to December. The data reveal a typical seasonal pattern, with higher temperatures during the summer months (e.g., July and August) and lower temperatures during winter months (e.g., January and February). This pattern correlates with the observed variations in heat treatment duration, as shown in Figure 2, highlighting the impact of environmental conditions on heat treatment performance.
Figure 3. The average environmental temperature for each month, ordered chronologically from January to December. The data reveal a typical seasonal pattern, with higher temperatures during the summer months (e.g., July and August) and lower temperatures during winter months (e.g., January and February). This pattern correlates with the observed variations in heat treatment duration, as shown in Figure 2, highlighting the impact of environmental conditions on heat treatment performance.
Agronomy 15 01254 g003
Figure 4. The heatmap visualizes the distribution, ranging from blue (low) to red (high), of heat treatments conducted by Agrospecom across Greece (Ελλάς). Intense clusters are observed in regions with high population density and industrial activity, such as Athens and Thessaloniki, as well as in areas with significant grain production, such as Thessaly and northern Greece. These patterns reflect the concentration of industrial facilities and agricultural storage sites in these regions, highlighting the importance of localized pest control strategies.
Figure 4. The heatmap visualizes the distribution, ranging from blue (low) to red (high), of heat treatments conducted by Agrospecom across Greece (Ελλάς). Intense clusters are observed in regions with high population density and industrial activity, such as Athens and Thessaloniki, as well as in areas with significant grain production, such as Thessaly and northern Greece. These patterns reflect the concentration of industrial facilities and agricultural storage sites in these regions, highlighting the importance of localized pest control strategies.
Agronomy 15 01254 g004
Figure 5. Comparison of the actual and predicted values of the t_50 °C (time it takes to reach 50 °C in hours) for the XGBoost model. The red dashed line represents the ideal fit, where predicted values match actual values. The scatter plot shows the model’s predictions, with most points closely aligned with the ideal fit, indicating strong predictive performance. Similar behavior was observed for the RF model.
Figure 5. Comparison of the actual and predicted values of the t_50 °C (time it takes to reach 50 °C in hours) for the XGBoost model. The red dashed line represents the ideal fit, where predicted values match actual values. The scatter plot shows the model’s predictions, with most points closely aligned with the ideal fit, indicating strong predictive performance. Similar behavior was observed for the RF model.
Agronomy 15 01254 g005
Figure 6. The residuals (difference between actual and predicted values) for the t_50 °C (time it takes to reach 50 °C in hours) predicted by the XGBoost model. The residuals are randomly distributed around zero, indicating no systematic bias in the model’s predictions. Similar behavior was observed for the RF model.
Figure 6. The residuals (difference between actual and predicted values) for the t_50 °C (time it takes to reach 50 °C in hours) predicted by the XGBoost model. The residuals are randomly distributed around zero, indicating no systematic bias in the model’s predictions. Similar behavior was observed for the RF model.
Agronomy 15 01254 g006
Figure 7. Residual plot comparing actual versus predicted values for t_50 °C across different prediction ranges. The scattered blue points represent residual values (difference between actual and predicted), while the red dotted line indicates zero residual. The distribution pattern shows greater variance at higher predicted values, suggesting model performance varies with the magnitude of predictions.
Figure 7. Residual plot comparing actual versus predicted values for t_50 °C across different prediction ranges. The scattered blue points represent residual values (difference between actual and predicted), while the red dotted line indicates zero residual. The distribution pattern shows greater variance at higher predicted values, suggesting model performance varies with the magnitude of predictions.
Agronomy 15 01254 g007
Table 1. Tukey’s test results for differences in t_50 °C across building categories. The table presents the results of Tukey’s post hoc test, which compares the mean duration until the temperature reaches 50 °C (t_50 °C) across different building categories. The columns show the pairwise comparisons (group1 vs. group2), the mean difference (meandiff), the adjusted p-value (p-adj), the lower and upper bounds of the 95% confidence interval, and whether the difference is statistically significant (reject).
Table 1. Tukey’s test results for differences in t_50 °C across building categories. The table presents the results of Tukey’s post hoc test, which compares the mean duration until the temperature reaches 50 °C (t_50 °C) across different building categories. The columns show the pairwise comparisons (group1 vs. group2), the mean difference (meandiff), the adjusted p-value (p-adj), the lower and upper bounds of the 95% confidence interval, and whether the difference is statistically significant (reject).
Group1Group2Meandiffp-AdjLowerUpperReject *
0CONTAINERHOUSING0.11151−3.71773.9407FALSE
1CONTAINERINDUSTRIAL_STORAGES6.536905.99777.0761TRUE
2CONTAINERIPSM154.582603.90275.2624TRUE
3CONTAINERROOMS4.459303.22935.6892TRUE
4HOUSINGINDUSTRIAL_STORAGES6.42540.00012.565710.2851TRUE
5HOUSINGIPSM154.4710.01450.58928.3529TRUE
6HOUSINGROOMS4.34770.02610.33298.3626TRUE
7INDUSTRIAL_STORAGESIPSM15−1.95430−2.7891−1.1196TRUE
8INDUSTRIAL_STORAGESROOMS−2.07760.0002−3.3995−0.7558TRUE
9IPSM15ROOMS−0.12330.9992−1.50851.2619FALSE
* Significant differences (p < 0.05) are highlighted, with TRUE indicating a significant difference and FALSE indicating no significant difference.
Table 2. Model performance by building category. The table summarizes the performance of random forest and XGBoost models for specific building categories, including containers, industrial storages, IPSM15, and rooms. Metrics include mean absolute error in hours (MAE), mean squared error (MSE), and R2 score. The models achieved exceptional performance for IPSM15 and containers but underperformed for Rooms, likely due to the limited number of treatments recorded for this category.
Table 2. Model performance by building category. The table summarizes the performance of random forest and XGBoost models for specific building categories, including containers, industrial storages, IPSM15, and rooms. Metrics include mean absolute error in hours (MAE), mean squared error (MSE), and R2 score. The models achieved exceptional performance for IPSM15 and containers but underperformed for Rooms, likely due to the limited number of treatments recorded for this category.
Building TypeMAE (Hours)MSER2
Random ForestContainer0.100.050.87
Storage2.4311.480.70
IPSM150.601.061.00
Rooms1.464.700.55
XGBoostContainer0.170.070.80
Storage2.3611.400.70
IPSM150.580.921
Rooms1.484.470.57
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Rossos, S.; Agrafioti, P.; Sotiroudas, V.; Athanassiou, C.G.; Kaloudis, E. Predicting Heat Treatment Duration for Pest Control Using Machine Learning on a Large-Scale Dataset. Agronomy 2025, 15, 1254. https://doi.org/10.3390/agronomy15051254

AMA Style

Rossos S, Agrafioti P, Sotiroudas V, Athanassiou CG, Kaloudis E. Predicting Heat Treatment Duration for Pest Control Using Machine Learning on a Large-Scale Dataset. Agronomy. 2025; 15(5):1254. https://doi.org/10.3390/agronomy15051254

Chicago/Turabian Style

Rossos, Stavros, Paraskevi Agrafioti, Vasilis Sotiroudas, Christos G. Athanassiou, and Efstathios Kaloudis. 2025. "Predicting Heat Treatment Duration for Pest Control Using Machine Learning on a Large-Scale Dataset" Agronomy 15, no. 5: 1254. https://doi.org/10.3390/agronomy15051254

APA Style

Rossos, S., Agrafioti, P., Sotiroudas, V., Athanassiou, C. G., & Kaloudis, E. (2025). Predicting Heat Treatment Duration for Pest Control Using Machine Learning on a Large-Scale Dataset. Agronomy, 15(5), 1254. https://doi.org/10.3390/agronomy15051254

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop