Next Article in Journal
Longitudinal Analysis of Teacher Technology Acceptance and Its Relationship to Resource Viewing and Academic Performance of College Students during the COVID-19 Pandemic
Next Article in Special Issue
Determining the Factors Affecting the Boiling Heat Transfer Coefficient of Sintered Coated Porous Surfaces
Previous Article in Journal
Understanding the Transformation to a Knowledge-Based Health Bioeconomy: Exploring Dynamics Linked to Preventive Medicine in Kenya
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Classification of a Greenhouse Environment for a Rose Crop Based on AI-Based Surrogate Models

1
Institute of Communication Engineering, National Tsing Hua University, Hsinchu 300044, Taiwan
2
Department of Power Mechanical Engineering, National Tsing Hua University, No. 101, Section 2, Guangfu Road, East District, Hsinchu 300044, Taiwan
3
Institute of Computer Science and Information Technology, The Women University, Multan 154-8533, Pakistan
4
Department of Mechanical Engineering, National Yang Ming Chiao Tung University, 1001 University Road, Hsinchu 300093, Taiwan
5
Department of Agricultural Engineering, Bahauddin Zakariya University, Bosan Road, Multan 60800, Pakistan
6
Department of Physics, College of Khurma University College, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
*
Authors to whom correspondence should be addressed.
Sustainability 2021, 13(21), 12166; https://doi.org/10.3390/su132112166
Submission received: 3 October 2021 / Revised: 25 October 2021 / Accepted: 29 October 2021 / Published: 4 November 2021
(This article belongs to the Special Issue Sustainable Agricultural Engineering Technologies and Applications)

Abstract

:
A precise microclimate control for dynamic climate changes in greenhouses allows the industry and researchers to develop a simple, robust, reliable, and intelligent model. Accordingly, the objective of this investigation was to develop a method that can accurately define the most suitable environment in the greenhouse for an optimal yield of roses. Herein, an optimal and highly accurate BO-DNN surrogate model was developed (based on 300 experimental data points) for a quick and reliable classification of the rose yield environment considering some of the most influential variables including soil humidity, temperature and humidity of air, CO2 concentration, and light intensity (lux) into its architecture. Initially, two BO techniques (GP and GBRT) are used for the tuning process of the hyper-parameters (such as learning rate, batch size, number of dense nodes, number of dense neurons, number of input nodes, activation function, etc.). After that, an optimal and simple combination of the hyper-parameters was selected to develop a DNN algorithm based on 300 data points, which was further used to classify the rose yield environment (the rose yield environments were classified into four classes such as soil without water, correct environment, too hot, and very cold environments). The very high accuracy of the proposed surrogate model (0.98) originated from the introduction of the most vital soil and meteorological parameters as the inputs of the model. The proposed method can help in identifying intelligent greenhouse environments for efficient crop yields.

1. Introduction and Motivation

1.1. Introduction

Climate change throughout the globe is affecting agricultural production due to increasing temperatures, fluctuating precipitation patterns, and rising corban dioxide concentrations in the atmosphere. In these changing environmental conditions, greenhouse crop cultivation is preferred compared with open field growing. The cultivation of crops in the greenhouse prolongs the agricultural growing season, protect yields against weather variations, offers a reliable growing ecosystem, and thus maximizes productivity. Thus, it is essential to adopt precision agriculture techniques in order to maintain the ideal environmental parameters such as humidity, carbon dioxide, and temperature along with soil moisture and nutrients in accordance with the crop growth cycle [1,2]. Exposure to uneven environmental factors produces stress, disease, or even a fall in the crops, resulting in substantial financial losses to growers [3]. Greenhouse weather control mechanisms need to consider multivariate and non-linear systems with variables greatly dependent on the external environment and the design of the greenhouse [4,5], even though the greenhouse cannot be independently controlled. Thus, developing a precise climate model in a greenhouse is an essential approach to control these dynamic climate changes and attain proficient climate management.
Greenhouse environment models can be developed on either the physical laws driving ecological cycles, or the interpretation of data obtained from such processes. With the development of high-performance computational systems, several analytical models [6,7,8] have been developed. Yet, this methodology may produce inconsistent outcomes when applied to true environmental conditions due to the complexity of these models and the frequent need for calculation and the approximation of unmeasurable parameters, for example, water vapor pressure, biological factors, rate of photosynthesis, soil heat flux density, and other factors [9]. On the contrary, due to the advancement of existing computational strategies, deep leaning prediction models based on big data [10] are being progressively applied to several fields. ANN models are incredible predicting tools [11,12] because of their capabilities to model systems without making assumptions [13] and to evaluate nonlinear systems. The most significant benefits of deep learning models over several classes of nonlinear models is that ANN models can approximate a vast group of functions with a high level of precision [14]. This approach delivers swift and reliable results for precision agriculture applications, namely, the climate estimation of greenhouses [15], the growth of plants, and the detection of stress compared to existing physical models [6,7].
For the generation and collection of data, smart greenhouses are equipped with IoTs, wireless sensor networks (WSNs), and actuators [16]. Sensors sense the atmosphere in the greenhouse and measure temperature, light intensity, humidity, CO2 levels, pressure, etc. If any irregularity is detected in the environmental conditions of the greenhouse, the ANN-based central control station directs actuators to execute required actions such as watering the crops, increasing or decreasing the light intensity, opening and closing windows, etc.
Besides, an appropriate comprehension of the variations of different parameters in the greenhouse climate related to the requirements of the particular crop at various development phases needs more consideration. As rose plants are susceptible to large variations in temperature, light, and humidity, the cultivation of greenhouse roses in geographical areas with environmental conditions that are not satisfactorily near the base prerequisites will encompass added risks and costs of production [17]. Exclusively relying upon the parameter measurement data from sensors is insufficient to obtain solid harvests in the greenhouse. Having a profound learning model for forecasting the future air parameters will assist in keeping up with the climate [18]. For instance, having the predicted values of temperature, CO2, and humidity assist in maintaining the flower size and a high yield, and can prevent the growth of pests that harm the rose plants. Additionally, predicting greenhouse climate changes will help in the event of sensor breakdown and will reduce the energy utilization in the greenhouse [3].

1.2. Aims and Motivation

Roses are amongst some of the most highly marketed flowers globally and have ruled the flower market since the 1990s owing to their year-long availability and the ever-increasing demand in beauty products and from the decoration industry. Roses are a functional food product similar to barley and other crops [19]. Natural environmental conditions are not always optimum to achieve the growing demand of crop requirements [20,21]. Extreme weather conditions such as exposure to direct sun, hail, biotic, and abiotic stresses can critically damage the product quality and yield [22]. Therefore, greenhouses are increasingly being used, since they can adjust the interior environmental parameters through artificial lights, aeration, and heating and ventilation systems [23]. Thus, crop growing cycles can be designed based on market demands. The environmental parameters required for the appropriate growth of roses are relative humidity, CO2 concentration, soil humidity, air temperature, light intensity, and the electrical conductivity of soil (see Figure 1).
There are several analytical models for the interpretation of the data collected from wireless sensor networks or IoTs, but these models may produce inconsistent outcomes when applied to true environmental conditions due to the high complexity of these models and the frequent need for calculation and the approximation of unmeasurable parameters. Based on the aforementioned discussion, it is extremely crucial to develop a method particularly for AI-based methods [24] that can accurately define the most suitable environment in the greenhouses for rose yield production. This is because the AI-based methods have gained lot of success in agriculture during recent years in relation to crop yield production, detection, precision agriculture, and so on [25,26,27,28,29].
To the best of the authors’ knowledge, only a single study is available in the literature regarding the use of AI in the rose’s greenhouse environment. This study presents the ANN and ANFIS methods to forecast the risk level for pests in the rose greenhouse [30]. Other than this, no study is available in the literature on this subject. The present study is the first of its kind in classifying the greenhouse environment for rose crops based on AI-based surrogate models. The proposed models are deep neural networks based on the optimal set of hyper-parameters defined by the Bayesian optimization scheme. AI-based surrogate models can be a reliable, simple, and robust solution. For instance, Bayesian optimization (BO) techniques such as the Gaussian process (GP) and Gradient boosting (GBRT) can be employed to provide optimal hyper-parameters to be integrated with deep neural networks (DNN). In line with this, the objective of this study was to develop an optimal and highly accurate BO-DNN surrogate model (based on 300 experimental data points) for a quick and reliable classification of the rose yield environment considering some of the most influential variables including soil humidity, the temperature and humidity of air, CO2 concentration, and light intensity (lux) into its architecture. The rose yield environments (outputs) are classified into four classes such as soil without water, correct environment, too hot, and very cold environments. Initially, two BO techniques (GP and GBRT) were used for the tuning process of the hyper-parameters (such as learning rate, batch size, number of dense nodes, number of dense neurons, number of input nodes, activation function, etc.). The most accurate set of hyper-parameters was selected to build the DNN model based on 300 data points, which was further used to classify the rose yield environment. The very high accuracy of the proposed surrogate model originates from the introduction of the most vital soil and meteorological parameters as the inputs of the model.

2. Materials and Methods

2.1. Data Collection

A total of 300 experimental data points from various sensors regarding soil humidity, light intensity, temperature, air humidity, and CO2 concentration for 04 different classes of greenhouse rose yield environments were taken from the open literature [31]. The data were acquired by an autonomous robot integrating the sensors including soil humidity, light intensity, temperature, air humidity, and CO2 concentration. Table 1 shows that a wide range of experimental data have been included in this study to discuss greenhouse rose yield environments.

2.2. Data Visualization

The experimental data have been visualized in terms of heat maps, correlation charts, pairs, and violin plots. The heat map and correlation chart represent the relationship between input and output features while the data distribution has been visualized by pairs, violins, and distplot. In addition, the data density for each class has been shown. A heat map showing the correlation between the input and output variables Figure 2. The dependency of the various input variables on the output parameters can be visualized by using a correlation chart as provided Figure 3. The data distribution of the input and output parameters including the soil humidity, air temperature and humidity, CO2 concentration, lux (light intensity), and class (output) is represented by a pair plot (see Figure 4). A clearer picture of the experimental data distribution of various input features with respect to the only output parameter, class, is highlighted in the violin plots (see Figure 5).
Herein, the experimental data were distributed into 04 different classes (namely class 0, class 1, class 2, and class 3). The total number of data points for each class is illustrated in Figure 6.
The density of each input parameter’s acquired data is presented bydistplot (see Figure 7). The distplot illustrates the data distribution of each parameter in terms of density distribution.

2.3. Bayesian Optimization Integrated with a Deep Neural Network Algorithm

Algorithms of two different Bayesian optimization schemes, namely, Gaussian process regression (GPR) and Gradient boosting regression trees (GBRT) integrated with the deep neural network are illustrated in Figure 8.

3. Results and Discussion

In this section, the range of the considered hyper-parameters is first provided followed by the tuning processes of two different Bayesian optimization schemes (GP and GBRT). Furthermore, the way that the maximum convergence was achieved is explained. In addition, the optimal combination of the hyper parameters is chosen. The chosen optimal hyper-parameters are then employed to develop a deep neural network model, which is then used to classify the greenhouse environments for rose yields. Moreover, the classification accuracy of the developed model in terms of a confusion matrix and an accuracy table is presented. The details of the input features and their impact on the model’s classification accuracy is evaluated in the sensitivity analysis section. Other than that, individual impact of each input variable on the model’s classification accuracy is evaluated. More discussions are presented in the subsequent sections.

3.1. Optimization of the Hyper-Parameters

The considered hyper-parameters were tuned by using two different Bayesian optimization schemes (GP and GBRT). The selected hyper-parameters include the learning rate, Adam decay, input nodes, dense layers, dense nodes, batch size, and activation function. The range of all the investigated hyper-parameters for the tuning process is given in Table 2.
The range of the considered hyper-parameters along with the hyper-parameter tuning process by the GBRT and GPR algorithms is depicted in Figure 9 and Figure 10, respectively. It is worth mentioning that the blue and orange regions represent the strong and weak dependence of the variable, respectively, while the asterisk sign points towards the optimal point. For further analysis, the GPR algorithm was considered. Detailed information on the finally selected architecture of the optimal model (GPR) is tabulated in Table 3. From the Figure 10, it can be observed that the ‘tanh’ activation function provided optimal results compared to the Softmax, sigmoid, and ReLU. A comparison between the suitability of these activation functions is provided in Figure 11.
Convergence plots for both optimization schemes such as GP and GBRT provide a clear picture of the way the error was minimized. The initial convergence was reached very fast because the number of input parameters and the amount of training data affected the convergence rate, and in this study the model was evaluated for 300 experimental data points containing five input parameters. For instance, in Figure 12, it can be clearly noticed that the convergence error for both the GP and GBRT algorithms was minimized at the fourth call.

3.2. Training and Developing the Deep Neural Network

The experimental data were distributed into training (80%) and testing (20%) datasets. The training and validation losses of the developed model are depicted in Figure 13. The total number of iterations was kept up to 80. Apparently, both of the losses were minimized until the 36th iteration, so the training process was stopped. This shows that the training process was computationally economical and quick.
Figure 14 illustrates the classification performance of the developed model for each class of the environment. Apparently, the developed model was able to accurately classify 59 out of 60 environments for various classes. This explains how well the model performs for different greenhouse environments within the tested range. The classification accuracy of the selected surrogate model is presented in Table 4.
The performance of the developed model is highlighted in terms of precision, recall, and F1-score. Precision and recall are the fraction of the relevant instances among the retrieved instances and the fraction of the relevant instances that were retrieved. Both precision and recall are therefore based on relevance. The values of precision and recall from Table 4 show that the proposed model had a high classification efficiency for the rose’s greenhouse environment. The F1-score from Table 4 also indicates the perfect precision and recall of the optimal surrogate model. In addition, the overall accuracy of the model along with the macro and weighted averages are described as well. The final model could perform the classification task with an overall accuracy of 0.98.

3.3. Sensitivity Analysis

The individual impact of each input variable on the model’s output (i.e., classification of the greenhouse rose yield environments) is portrayed using the SHAP library. More particularly, the ways in which the various input features such as soil humidity, temperature, air humidity, light intensity, and CO2 concentration affected the model’s classification accuracy are shown in Figure 15. It can be clearly seen that the sensitivity of the different features was not the same for various classes. However, some of the factors were sensitive for all the classes. For example, the most influential factor for each class was the soil humidity followed by the temperature. Regarding class 1 (correct environment), the feature with the most impact was air humidity followed by soil humidity, temperature, light intensity, and CO2 concentration.
A tabulated performance comparison of the various developed models is illustrated in Table 5. In the original model, all five input variables were considered for classification while in the rest of the models, each single variable was dropped and rest of the four variables were used to classify the rose yield environment. It is obvious that there was no noticeable impact of dropping a single (any of the variable at a time) variable on the classification accuracy of the models. All of the developed models were able to perform the classification task with an overall accuracy of 0.98.

4. Conclusions

In the current study, surrogate models were developed that can accurately define the most suitable environment in greenhouses for rose yield production. In this regard, Bayesian optimization (BO) techniques such as the Gaussian process (GP) and Gradient boosting (GBRT) were employed to provide optimal hyper-parameters to be integrated with deep neural networks (DNN).
-
The optimal set of hyper-parameters includes the learning rate (0.000416), the number of hidden layers (10), the number of neurons in each hidden layer (265), the activation function (tanh), batch size (36), Adam decay (0.007963), and number of iterations (80).
-
An optimal and highly accurate BO-DNN surrogate model (based on 300 experimental data points) was developed for a quick and reliable classification of the rose yield environment considering the most influential variables including soil humidity, temperature and humidity of air, CO2 concentration, and light intensity (lux) into its architecture.
-
The proposed surrogate models can accurately classify the rose yield environments (classified into four classes such as soil without water, correct environment, too hot, and very cold environments).
-
The developed model can classify different roses yield environments with an overall accuracy of 0.98. The very high accuracy of the proposed surrogate models originates from the inclusion of the most influential parameters as the inputs of the model.
-
This study provides an easy, quick, reliable, and intelligent method to identify and perform corrective measures to improve the quality of the roses. With the proposed method, greenhouse environments can be evaluated and selected for an efficient crop yield of roses and other vegetables and fruits.

Author Contributions

Conceptualization, S.A.B. and N.-F.H.; methodology, S.A.B., I.H., F.B., U.S.; software, S.A.B., I.H., F.B.; validation, A.S.A., K.H.M.; formal analysis, S.A.B., M.S.; investigation, S.A.B., U.S., N.-F.H.; resources, U.S., S.A.B.; data curation, S.A.B., N.-F.H., I.H., U.S.; writing—original draft preparation, S.A.B., U.S., N.-F.H., F.B.; writing—review and editing, S.A.B., U.S., A.S.A., M.S., K.H.M., F.B.; visualization, S.A.B., U.S., I.H.; supervision, N.-F.H.; project administration, N.-F.H., M.S., U.S.; funding acquisition, N.-F.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Science and Technology (MOST) of Taiwan Grant Number: MOST109-2923-E-007-006. The APC was funded by N.-F.H.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to acknowledge the financial support of Taif University Researchers Supporting Project number (TURSP-2020/189), Taif University, Taif, Saudi Arabia. The authors are also indebted to the financial support from the Ministry of Science and Technology, Taiwan under contracts MOST109-2923-E-007-006.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shamshiri, R.; Ismail, W.I.W. A review of greenhouse climate control and automation systems in tropical regions. J. Agric. Sci. Appl. 2013, 2, 176–183. [Google Scholar] [CrossRef]
  2. Shamshiri, R.R.; Hameed, I.A.; Balasundram, S.K.; Ahmad, D.; Weltzien, C.; Yamin, M. Fundamental research on unmanned aerial vehicles to support precision agriculture in oil palm plantations. In Agricultural Robots—Fundamentals and Application; Intech Open: London, UK, 2018; pp. 91–116. [Google Scholar]
  3. Jung, D.-H.; Kim, H.S.; Jhin, C.; Kim, H.-J.; Park, S.H. Time-serial analysis of deep neural network models for prediction of climatic conditions inside a greenhouse. Comput. Electron. Agric. 2020, 173, 105402. [Google Scholar] [CrossRef]
  4. El Ghoumari, M.; Tantau, H.-J.; Serrano, J. Non-linear constrained MPC: Real-time implementation of greenhouse air temperature control. Comput. Electron. Agric. 2005, 49, 345–356. [Google Scholar] [CrossRef]
  5. Seginer, I.; McClendon, R. Methods for optimal control of the greenhouse environment. Trans. ASAE 1992, 4, 1299–1307. [Google Scholar] [CrossRef]
  6. Reddy, M.N.; Rao, N. GIS Based Decision Support Systems in Agriculture; National Academy of Agricultural Research Management: Rajendranagar, India, 1995; pp. 1–11.
  7. Kaiwartya, O.; Abdullah, A.H.; Cao, Y.; Raw, R.S.; Kumar, S.; Lobiyal, D.K.; Isnin, I.F.; Liu, X.; Shah, R.R. T-MQM: Testbed-based multi-metric quality measurement of sensor deployment for precision agriculture—A case study. IEEE Sens. J. 2016, 16, 8649–8664. [Google Scholar] [CrossRef]
  8. Sarmah, K.; Deka, C.; Sharma, U.; Sarma, R. Role of GIS based technologies in sustainable agriculture resource planning & management using spatial decision support approach. Int. J. Innov. Res. Eng. Manag. 2018, 5, 30–34. [Google Scholar]
  9. Pawlowski, A.; Sánchez-Molina, J.; Guzmán, J.; Rodríguez, F.; Dormido, S. Evaluation of event-based irrigation system control scheme for tomato crops in greenhouses. Agric. Water Manag. 2017, 183, 16–25. [Google Scholar] [CrossRef]
  10. Bhat, S.A.; Huang, N.-F. Big Data and AI Revolution in Precision Agriculture: Survey and Challenges. IEEE Access 2021, 9, 110209–110222. [Google Scholar] [CrossRef]
  11. Sajjad, U.; Hussain, I.; Hamid, K.; Bhat, S.A.; Ali, H.M.; Wang, C.-C. A deep learning method for estimating the boiling heat transfer coefficient of porous surfaces. J. Therm. Anal. Calorim. 2021, 145, 191–1913. [Google Scholar] [CrossRef]
  12. Hamid, K.; Sajjad, U.; Yang, K.S.; Wu, S.-K.; Wang, C.-C. Assessment of an energy efficient closed loop heat pump dryer for high moisture contents materials: An experimental investigation and AI based modelling. Energy 2022, 238, 121819. [Google Scholar] [CrossRef]
  13. Asfahan, H.M.; Sajjad, U.; Sultan, M.; Hussain, I.; Hamid, K.; Ali, M.; Wang, C.-C.; Shamshiri, R.R.; Khan, M.U. Artificial intelligence for the prediction of the thermal performance of evaporative cooling systems. Energies 2021, 14, 3946. [Google Scholar] [CrossRef]
  14. Sajjad, U.; Hussain, I.; Wang, C.-C. A high-fidelity approach to correlate the nucleate pool boiling data of roughened surfaces. Int. J. Multiph. Flow 2021, 142, 103719. [Google Scholar] [CrossRef]
  15. Nasrollahi, H.; Ahmadi, F.; Ebadollahi, M.; Nobar, S.N.; Amidpour, M. The greenhouse technology in different climate conditions: A comprehensive energy-saving analysis. Sustain. Energy Technol. Assess. 2021, 47, 101455. [Google Scholar]
  16. Shamshiri, R.R.; Hameed, I.A.; Thorp, K.R.; Balasundram, S.K.; Shafian, S.; Fatemieh, M.; Sultan, M.; Mahns, B.; Samiei, S. Greenhouse Automation Using Wireless Sensors and IoT Instruments Integrated with Artificial Intelligence; InTech Open: London, UK, 2021. [Google Scholar]
  17. Bheemanahalli, R.; Gajanayake, B.; Lokhande, S.; Singh, K.; Seepaul, R.; Collins, P.; Reddy, K.R. Physiological and pollen-based screening of shrub roses for hot and drought environments. Sci. Hortic. 2021, 282, 110062. [Google Scholar] [CrossRef]
  18. van Klompenburg, T.; Kassahun, A.; Catal, C. Crop yield prediction using machine learning: A systematic literature review. Comput. Electron. Agric. 2020, 177, 105709. [Google Scholar] [CrossRef]
  19. Zeng, Y.; Pu, X.; Du, J.; Yang, X.; Li, X.; Mandal, M.; Nabi, S.; Yang, T.; Yang, J. Molecular mechanism of functional ingredients in barley to combat human chronic diseases. Oxid. Med. Cell. Longev. 2020, 2020, 3836172. [Google Scholar] [CrossRef]
  20. Erazo, M.; Rivas, D.; Pérez, M.; Galarza, O.; Bautista, V.; Huerta, M.; Rojo, J.L. Design and implementation of a wireless sensor network for rose greenhouses monitoring. In Proceedings of the IEEE 2015 6th International Conference on Automation, Robotics and Applications (ICARA), Queenstown, New Zealand, 17–19 February 2015; pp. 256–261. [Google Scholar]
  21. Plaut, Z.; Zieslin, N. Productivity of greenhouse roses following changes in soil moisture and soil air regimes. Sci. Hortic. 1974, 2, 137–143. [Google Scholar] [CrossRef]
  22. Shamshiri, R.R.; Jones, J.W.; Thorp, K.R.; Ahmad, D.; Man, H.C.; Taheri, S. Review of optimum temperature, humidity, and vapour pressure deficit for microclimate evaluation and control in greenhouse cultivation of tomato: A review. Int. Agrophys. 2018, 32, 287–302. [Google Scholar] [CrossRef]
  23. Shamshiri, R.R.; Kalantari, F.; Ting, K.; Thorp, K.R.; Hameed, I.A.; Weltzien, C.; Ahmad, D.; Shad, Z.M. Advances in greenhouse automation and controlled environment agriculture: A transition to plant factories and urban agriculture. Int. J. Agric. Biol. Eng. 2018, 11, 1–22. [Google Scholar] [CrossRef]
  24. Zhu, N.; Liu, X.; Liu, Z.; Hu, K.; Wang, Y.; Tan, J.; Huang, M.; Zhu, Q.; Ji, X.; Jiang, Y. Deep learning for smart agriculture: Concepts, tools, applications, and opportunities. Int. J. Agric. Biol. Eng. 2018, 11, 32–44. [Google Scholar] [CrossRef]
  25. Zheng, Y.-Y.; Kong, J.-L.; Jin, X.-B.; Wang, X.-Y.; Su, T.-L.; Zuo, M. CropDeep: The crop vision dataset for deep-learning-based classification and detection in precision agriculture. Sensors 2019, 19, 1058. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Elavarasan, D.; Vincent, P.D. Crop yield prediction using deep reinforcement learning model for sustainable agrarian applications. IEEE Access 2020, 8, 86886–86901. [Google Scholar] [CrossRef]
  27. Mohan, P.; Patil, K.K. Deep learning based weighted SOM to forecast weather and crop prediction for agriculture application. Int. J. Intell. Eng. Syst. 2018, 11, 167–176. [Google Scholar] [CrossRef]
  28. Altikat, S. Prediction of CO2 emission from greenhouse to atmosphere with artificial neural networks and deep learning neural networks. Int. J. Environ. Sci. Technol. 2021, 18, 3169–3178. [Google Scholar] [CrossRef]
  29. Mekonnen, Y.; Namuduri, S.; Burton, L.; Sarwat, A.; Bhansali, S. Machine learning techniques in wireless sensor network based precision agriculture. J. Electrochem. Soc. 2019, 167, 037522. [Google Scholar] [CrossRef]
  30. Tay, A.; Lafont, F.; Balmat, J.-F. Forecasting pest risk level in roses greenhouse: Adaptive neuro-fuzzy inference system vs artificial neural networks. Inf. Process. Agric. 2020, 8, 368–397. [Google Scholar] [CrossRef]
  31. Wilmer Champutiz, P.R.-M.; Fuentes, E.; Peluffo, D. Roses Greenhouse Cultivation Database Repository (RosesGreenhDB). In IEEE Dataport; IEEE: New York, NY, USA, 2019. [Google Scholar]
Figure 1. Parameters affecting the greenhouse rose yield environment.
Figure 1. Parameters affecting the greenhouse rose yield environment.
Sustainability 13 12166 g001
Figure 2. Heat map showing the correlation between the input and output variables.
Figure 2. Heat map showing the correlation between the input and output variables.
Sustainability 13 12166 g002
Figure 3. Dependency of the input variables on the output parameter.
Figure 3. Dependency of the input variables on the output parameter.
Sustainability 13 12166 g003
Figure 4. Representation of the data distribution via a pair plot.
Figure 4. Representation of the data distribution via a pair plot.
Sustainability 13 12166 g004
Figure 5. Violin plots for the experimental data distribution of (a) light, (b) temperature, (c) CO2 concentration, (d) air humidity, and (e) soil humidity with respect to the class.
Figure 5. Violin plots for the experimental data distribution of (a) light, (b) temperature, (c) CO2 concentration, (d) air humidity, and (e) soil humidity with respect to the class.
Sustainability 13 12166 g005aSustainability 13 12166 g005b
Figure 6. The experimental data distribution for each class.
Figure 6. The experimental data distribution for each class.
Sustainability 13 12166 g006
Figure 7. Illustration of the data distribution of each parameter in terms of density distribution.
Figure 7. Illustration of the data distribution of each parameter in terms of density distribution.
Sustainability 13 12166 g007
Figure 8. Bayesian optimization integrated in the deep neural network (a) Gaussian process regression and (b) Gradient boosting regression trees.
Figure 8. Bayesian optimization integrated in the deep neural network (a) Gaussian process regression and (b) Gradient boosting regression trees.
Sustainability 13 12166 g008
Figure 9. The tuning process of the hyper-parameters in GBRT.
Figure 9. The tuning process of the hyper-parameters in GBRT.
Sustainability 13 12166 g009
Figure 10. The tuning process of the hyper-parameters in GPR.
Figure 10. The tuning process of the hyper-parameters in GPR.
Sustainability 13 12166 g010
Figure 11. A performance comparison of the selected activation functions.
Figure 11. A performance comparison of the selected activation functions.
Sustainability 13 12166 g011
Figure 12. Convergence plots for the GPR and GBRT algorithms.
Figure 12. Convergence plots for the GPR and GBRT algorithms.
Sustainability 13 12166 g012
Figure 13. Training and validation losses during training.
Figure 13. Training and validation losses during training.
Sustainability 13 12166 g013
Figure 14. Classification performance of the developed model.
Figure 14. Classification performance of the developed model.
Sustainability 13 12166 g014
Figure 15. Individual impact of input variables on the model’s output.
Figure 15. Individual impact of input variables on the model’s output.
Sustainability 13 12166 g015
Table 1. Investigated parameters and their data range.
Table 1. Investigated parameters and their data range.
ParameterData Range
Soil humidity (kPa)124–821
Light intensity (lux)0–54612.5
Temperature (°C)15.9–40.2
Air humidity (%)39.2–96.9
CO2 concentration (ppm)34–243
EnvironmentClass 0, 1, 2, and 3
Table 2. Range of hyper parameters.
Table 2. Range of hyper parameters.
Hyper ParameterInvestigated Range
Learning rate0.0001–0.1
Adam decay0.000001–0.01
Input nodes1–5
Dense layers1–10
Dense nodes1–500
Batch size1–100
Activation functionSoftmax, Sigmoid, ReLU, tanh
Table 3. Selected architecture of the optimal model.
Table 3. Selected architecture of the optimal model.
Optimization MethodGaussian Process
Learning rate0.000416
No. of hidden layers10
No. of neurons in input layer5
No. of neurons in each hidden layer265
Activation functiontanh
Batch size36
Adam decay0.007963
No. of neurons in output layer4
No. of iterations80
Table 4. Classification accuracy description of the selected surrogate model.
Table 4. Classification accuracy description of the selected surrogate model.
PrecisionRecallF1-Score
01.001.001.00
10.801.000.89
21.001.001.00
31.000.940.97
Accuracy 0.98
Macro Avg.0.950.980.96
Weighted Avg.0.990.980.98
Table 5. Performance comparison of the various developed models.
Table 5. Performance comparison of the various developed models.
Model No.Input FeaturesConfusion Heat MapOverall Accuracy
1soil humidity
light intensity
temperature
air humidity
CO2 concentration
Sustainability 13 12166 i0010.98
2light intensity
temperature
air humidity
CO2 concentration
Sustainability 13 12166 i0020.98
3soil humidity
temperature
air humidity
CO2 concentration
Sustainability 13 12166 i0030.98
4soil humidity
light intensity
air humidity
CO2 concentration
Sustainability 13 12166 i0040.98
5soil humidity
light intensity
temperature
CO2 concentration
Sustainability 13 12166 i0050.98
6soil humidity
light intensity
temperature
air humidity
Sustainability 13 12166 i0060.98
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bhat, S.A.; Huang, N.-F.; Hussain, I.; Bibi, F.; Sajjad, U.; Sultan, M.; Alsubaie, A.S.; Mahmoud, K.H. On the Classification of a Greenhouse Environment for a Rose Crop Based on AI-Based Surrogate Models. Sustainability 2021, 13, 12166. https://doi.org/10.3390/su132112166

AMA Style

Bhat SA, Huang N-F, Hussain I, Bibi F, Sajjad U, Sultan M, Alsubaie AS, Mahmoud KH. On the Classification of a Greenhouse Environment for a Rose Crop Based on AI-Based Surrogate Models. Sustainability. 2021; 13(21):12166. https://doi.org/10.3390/su132112166

Chicago/Turabian Style

Bhat, Showkat Ahmad, Nen-Fu Huang, Imtiyaz Hussain, Farzana Bibi, Uzair Sajjad, Muhammad Sultan, Abdullah Saad Alsubaie, and Khaled H. Mahmoud. 2021. "On the Classification of a Greenhouse Environment for a Rose Crop Based on AI-Based Surrogate Models" Sustainability 13, no. 21: 12166. https://doi.org/10.3390/su132112166

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop