Next Article in Journal
Analysis of Split-System Air Conditioner Faults through Electrical Measurement Data
Previous Article in Journal
TM–IoV: A First-of-Its-Kind Multilabeled Trust Parameter Dataset for Evaluating Trust in the Internet of Vehicles
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Data Descriptor

Experimental Data in a Greenhouse with and without Cultivation of Stringless Blue Lake Beans

by
Sebastian-Camilo Vanegas-Ayala
1,2,*,
Julio Barón-Velandia
1,
Oscar-Mauricio Garcia-Chavez
2,
Adrian Romero-Palencia
2 and
Daniel-David Leal-Lara
1,2
1
Doctorate in Engineering, Faculty of Engineering, Universidad Distrital Francisco José de Caldas, Bogotá 111611, Colombia
2
Computer and Systems Engineering Program, Faculty of Engineering and Basic Sciences, Fundación Universitaria Los Libertadores, Bogotá 111221, Colombia
*
Author to whom correspondence should be addressed.
Data 2024, 9(9), 105; https://doi.org/10.3390/data9090105
Submission received: 16 July 2024 / Revised: 16 August 2024 / Accepted: 2 September 2024 / Published: 4 September 2024

Abstract

Greenhouse cultivation is one of the current strategies to address the challenges of food production, sustainability, and food quality. Similarly, the use of technological tools to automate greenhouse environments through a set of sensors and actuators allows for the control and improvement of processes within this environment. This document presents data collected from the sensors and actuators of two identical greenhouse environments, one with the cultivation of stringless blue lake beans and the other without cultivation. The aim is that this dataset will provide a broader characterization of the behavior of climatic variables inside greenhouse environments and how they are impacted by control actions, subsequently contributing to the development of new research on implementations of or improvements to control, supervision, management, and automation actions in greenhouse environments.

1. Summary

Greenhouse agriculture is gaining greater relevance due to increasing food demands and the need for sustainability, and technology has emerged as a fundamental ally in this context [1]. The integration of automated systems in greenhouses represents an innovative solution that promises to improve crop efficiency through the precise management of environmental conditions [2,3].
Automation in greenhouses, supported by sensors and microcontrollers, enables continuous data collection that contributes to the efficient distribution of resources and improvements in crop yields [4,5]. The cultivation of stringless blue lake beans in a microcontroller-controlled environment allowed studying interactions between automated greenhouses and short-cycle crops. The data were collected in two greenhouse environments managed with a microcontroller in Bogotá, Colombia, located at 2559 m above sea level.
The first environment corresponded to a greenhouse where beans are cultivated, while the second environment was a greenhouse operating without any type of cultivation. Both environments were equipped with a set of sensors and actuators for their operation, such as sensors for internal relative humidity, internal temperature, ground humidity, light intensity, CO2 concentration, and luminosity, as well as actuator systems for ventilation, irrigation, and heating.
With these data, the aim is to contribute to developing intelligent crop management strategies, focusing on maximizing efficiency and productivity in automated greenhouses. By detailing the behavior of the environment in each scenario, this study contributes to the state of the art in precision agriculture, providing a solid foundation for future research and practical applications. This study is expected to offer a basis for implementing and adjusting technologies in precision agriculture, especially in urban contexts where space and resources could be limited.

2. Data Description

Two microcontroller-managed greenhouse environments are presented: one with the cultivation of stringless blue lake beans (greenhouse_dataset_cc.csv) with 99,957 records, and another operating without cultivation (greenhouse_dataset_sc.csv) with 118,233 records. The measurements from the system’s sensors and actuators were recorded every minute over a three-month period, from 23 May to 11 September 2021.

2.1. Geographical and Seasonal Environment

The greenhouses are located in Bogotá, Colombia, at 2559 m above sea level. Bogotá is situated in the Eastern Andean range of Colombia, in a flat savanna surrounded by mountains. It experiences a highland subtropical climate characterized by moderate temperatures throughout the year, ranging between 7 °C and 19 °C, with an approximate annual average temperature of 14 °C [6].
The precipitation pattern in Bogotá is influenced by the trade winds and the Intertropical Convergence Zone (ITCZ). The city experiences two rainy seasons (April–May and October–November) and two dry seasons (December–March and June–September). The average annual precipitation is around 800 mm, with irregular distribution throughout the year [7].

2.2. Plant Selection

The stringless blue lake bean variety was selected to be cultivated in the greenhouse environment. This cultivation is characterized by a short growth cycle, meaning that harvests are obtained within short periods of time, in this case, within a maximum period of three months. This allowed a shorter time window for data collection, encompassing the behavior of the controlled environment throughout the entire life cycle of the cultivation.
In selecting this cultivation, the uncontrolled environmental characteristics provided by the geographical area were considered, such as the direct light requirements, which for this variety amount to 6 to 8 h of direct sunlight per day. Additionally, the selection ensured scenarios for the activation of the system actuators throughout the life cycle of the cultivation. Therefore, a variety was chosen that is not overly sensitive to climatic changes and is relatively easy to cultivate [8], prioritizing the characterization of the greenhouse environment.

2.3. Range of Values

  • created_at: Date of creation of the registry.
  • hum: Internal relative humidity, range of 0 to 100 % .
  • temp: Internal temperature, temperature range of −40 °C to 80 °C.
  • light_intensity: UV index of 0 to 11 for the UVA (315 nm a 400 nm) and UVB (280 nm a 315 nm) bands.
  • luminosity: Luminosity, range of 188 to 88,000 Lux.
  • ground_humidity_per: Ground humidity, range of 1 to 1023 V transformed from 0 to 100%.
  • co2_ppm: CO2 concentration, range from 0 to 10,000 ppm.
  • act_fan: Ventilation activation (0 off, 1 on).
  • act_solenoid_valve: Irrigation system activation (0 off, 1 on).
  • act_heating: Activation of the heating system (0 off, 1 on).

2.4. Descriptive Statistics

Descriptive statistics are presented for the two environments managed by microcontrollers: the count of values, the average, the standardized measure of the deviation of the variable from the mean, and the value of each quartile, including the minimum and maximum values. Table 1 shows the descriptive statistics for the environment with cultivation of stringless blue lake beans, for each of the nine variables analyzed.
Similarly, Table 2 shows the descriptive statistics for the environment without cultivation.
When comparing the statistics of the data from both environments, it was found that
  • The average values of humidity, temperature, light intensity, ground humidity, and CO2 concentration were higher in the dataset with cultivation.
  • The average value of ground humidity was the sensed variable that showed the greatest difference in values between the two environments.
  • Actuators were only activated in the cultivated environment, showing a tendency for their state to be off.

2.5. Null Values

A description of null values for the datasets, due to network interference during data communication, communication channel congestion in the microcontroller, or power outages in the environments is presented in Table 3. Similarly, missing values were also recorded for the time period due to the absence of measurements for the environments with and without cultivation, and considering the subtraction of duplicate date records, in addition to the null values.
The occurrence of null values for the actuators in the environment without cultivation was noted, as, although the data collected were from actuators in the off state, they still reported data and also showed missing values due to interference in the communication channel or for the reasons mentioned above.

2.6. Outlier Data

The description of outlier data for the two datasets, for the recorded climatic variables, is presented in Figure 1 for the dataset with cultivation and Figure 2 for the dataset without cultivation, with the following findings:
  • hum: The environment with cultivation presented more outlier data.
  • temp: The distribution in the environment without cultivation was more centralized.
  • light_intensity: In the environment without cultivation, the outlier data were much further from the median compared to the environment with cultivation.
  • luminosity: The distribution of outlier data was similar in both environments.
  • ground_humidity_per: The environment without cultivation presented a greater number of outlier values than the environment with cultivation.
  • co2_ppm: The environment without cultivation presented a greater number of outlier values, but they were closer to the median.
The outliers recorded were mostly within the possible range of values that occur in the geographical location where the greenhouses are located, so their appearance represented a distribution that was affected by environmental changes characteristic of the region.
Figure 1 is subdivided into humidity variables in Figure 1a, temperature in Figure 1b, light intensity in Figure 1c, luminosity in Figure 1d, ground humidity in Figure 1e, and CO2 in Figure 1f.
Similarly, Figure 2 for the environment without cultivation, is subdivided into humidity variables in Figure 2a, temperature in Figure 2b, light intensity in Figure 2c, luminosity in Figure 2d, ground humidity in Figure 2e, and CO2 in Figure 2f.

2.7. Data Dispersion

Data dispersion over time is shown in Figure 3, Figure 4 and Figure 5, along with the state of the ventilation, irrigation, and heating actuators, respectively, for the environment with cultivation, providing a perspective on how the activation or deactivation of the actuators influenced the behavior of the system variables over time.
Figure 3, Figure 4 and Figure 5 are subdivided into humidity variables in Figure 3a, Figure 4a and Figure 5a; temperature in Figure 3b, Figure 4b and Figure 5b; light intensity in Figure 3c, Figure 4c and Figure 5c; luminosity in Figure 3d, Figure 4d and Figure 5d; ground humidity in Figure 3e, Figure 4e and Figure 5e; and CO2 in Figure 3f, Figure 4f and Figure 5f. These graphs provide a representation of the interaction between the actuators and the system variables over time, facilitating the identification of patterns, trends, and possible causal relationships in the collected data.
In Figure 6 dispersion diagrams are used to explore the relationship between the system variables over time. As observed and related in Table 3, there were time periods without measurements or with missing values. Notably, it is possible to observe that the time periods with data allowed for the characterization of the behavior of humidity variables in Figure 6a, temperature in Figure 6b, light intensity in Figure 6c, luminosity in Figure 6d, ground humidity in Figure 6e, and CO2 in Figure 6f.
In the datasets, a greater similarity is evident in the distributions of the temperature and luminosity variables, with differences in the dispersion of all variables and in the presence of limit values.

2.8. Normality

The results of applying the Shapiro–Wilk test to each of the fields in the datasets are shown in Table 4.
Taking into account a statistic close to 1, the results from Table 4 indicate that the variables temperature, ground humidity in the cultivation environment, and relative humidity and ambient temperature in the non-cultivation environment have a low probability of rejecting the null hypothesis that the sample results from a normal distribution.

2.9. Symmetry and Kurtosis

Skewness and kurtosis values were calculated for the variables in both the cultivation and non-cultivation environments, as shown in Table 5.
As shown in Table 5, the variables light intensity, CO2 concentration, irrigation, and heating actuators had high values in kurtosis for the environment with cultivation, as well as light intensity and CO2 concentration in the environment without cultivation. This indicates more extreme outlier values than a normal distribution, with longer tails. On the other hand, the temperature and soil humidity variables had skewness values closer to zero for the environment with cultivation, as did the relative humidity and ambient temperature variables in the environment without cultivation, which corresponds to the findings of the normality test.
A visualization of the skewness from the frequency plot for the environments is shown in Figure 7 subdivided into the environment with cultivation in blue for the humidity variables in Figure 7a, temperature in Figure 7c, light intensity in Figure 7e, luminosity in Figure 7g, ground humidity in Figure 7i, and CO2 in Figure 7k; and in the environment without cultivation, in purple, for the variables of humidity in Figure 7b, temperature in Figure 7d, light intensity in Figure 7f, luminosity in Figure 7h, ground humidity in Figure 7j, and CO2 in Figure 7l.
Figure 7 shows that the presence of cultivation significantly affected the distribution of ambient and ground humidity, this is evidenced by the fact that the humidity data for the environment with cultivation are mostly distributed around values close to 80 or 100, unlike the environment without cultivation, where the values are uniformly concentrated around 70, forming a normal distribution. Similarly, the soil humidity in the environment with cultivation is distributed within a range of 10 to 90, while in the environment without cultivation, it is concentrated in two distinct groups: one primarily between 0 and 10, and another between 55 and 65.

2.10. Correlation Matrix

The correlation of variables in the environment with cultivation is shown in Table 6, where the presence of cultivation increased the correlation values among most environmental variables, for 10 of the correlations. In contrast to the correlation matrix in Table 7, the correlations are generally weaker, indicating that cultivation affected the interactions among environmental variables.
By comparing Table 6 and Table 7, and subtracting the absolute values of their correlations, excluding the diagonals, we find that in only 5 out of the 15 correlations, the environment without cultivation shows higher correlation values. These cases involve the correlations of luminosity with humidity, temperature, and ground humidity; humidity with soil humidity; and CO2 concentration with light intensity.

2.11. Effect of Actuators

In Table 8, it is shown how the actuators ventilation, irrigation, and heating affected the environmental variables in the environment with cultivation, relating the mean values and standard deviation of each variable filtered by the occurrence of the corresponding combination of actuators in their off (0) or on (1) states. The combination of all actuators being on is not recorded in Table 8 because this only occurred in one record of the dataset.
Table 8 presents changes in the variables resulting from the activation of the actuators, highlighting some representative cases for each variable:
  • Humidity: A maximum average variation of 17.96% with the activation of ventilation and heating.
  • Temperature: A maximum average variation of 4.96 degrees with the activation of the irrigation system and heating.
  • Light Intensity: A maximum average variation of 0.03 UV indices with the activation of the irrigation system and heating. It should be noted that this value is influenced by changes in natural light.
  • Luminosity: A maximum average variation of 447.3 lux with the activation of the irrigation system and heating. Similarly, this value is affected by changes in natural light.
  • Ground humidity: A minimum average variation of 0.69% and 0.59% with the activation of the ventilation–heating and ventilation–irrigation systems, respectively.
  • CO2 concentration: A maximum average variation of 1541.43 ppm with the activation of the irrigation and heating systems.

3. Methods

The recording of values was conducted from 23 May to 11 September 2021, at one-minute intervals through data transmission over the Internet. Two identical greenhouse environments were used, each equipped with a set of sensors and actuators for operation. Among the sensors were an internal relative humidity sensor, an internal temperature sensor, a ground humidity sensor, a light intensity sensor, a CO2 concentration sensor, and a luminosity sensor. Additionally, the actuators included a ventilation system, a solenoid valve irrigation system, and a heating system.
These devices allowed monitoring and recording of key environmental variables such as relative humidity, temperature, ground humidity, light intensity, CO2 concentration, and luminosity. Additionally, the actuators provided the capability to control factors such as ventilation, irrigation, and heating within the greenhouses. The control devices were only used in one of the greenhouse environments, while in the other environment, stringless blue lake beans were cultivated.
Both environments communicated with the network through wireless connection. They were placed in parallel positions under equal lighting, ventilation, and natural heating conditions, reflecting the climatic conditions of Bogotá D.C. Similarly, both had fresh soil for cultivation at the start of data recording.
For the data description, the following activities were conducted:
  • Loading of the two datasets.
  • Descriptive statistics of each dataset.
  • Identification of null and missing values over the time period.
  • Identification of outliers using a Boxplot, where quartile values were represented along with a line at the median, considering outliers as those that exceeded 1.5 * I Q R , where I Q R = Q 3 Q 1 [9].
  • Data visualization with a dispersion diagram.
  • Shapiro–Wilk normality test, excluding null values. [10].
  • Calculation of skewness using Pearson’s method, excluding null values [11].
  • Calculation of kurtosis using Fisher–Pearson coefficient, excluding null values [12].
  • Visualization of skewness from frequency plots.
  • Calculation of correlations among sensed variables using Pearson’s method [13].
  • Calculation of the effect of actuators on variables in the environment with cultivation, grouping the average statistic for each combination of actuators and using the combination with all actuators off as reference.

4. User Notes

Some considerations are presented about use of the dataset, taking advantage of its characteristics:
  • Time structure: This dataset presents a time series structure with a granularity of one minute. It is recommended to use time series analysis techniques to explore patterns, trends, and seasonalities.
  • Treatment of missing data: The presence of null values requires consideration when using the entire dataset. It is suggested to evaluate methods for the treatment of missing values or, if required, the use of sections where they are not present in the majority. If you choose to use a section of the dataset, it is recommended to either remove records with null values or replace them using interpolation based on adjacent values or nearest neighbor imputation algorithms, such as KNN. Regression algorithms are not recommended, as the variables may exhibit dynamic, non-linear, and distributed parameter characteristics. Additionally, the dataset allows the application and testing of models that can adapt to the presence of missing values; in this case, no imputation is required.
  • Outliers: To address outliers, it is recommended to apply an elimination method, retaining only those that fall within the range of possible values for the analyzed region of Bogotá, Colombia. Retention should be based on whether the values have either similar values or close neighbors in the scatter diagram.
  • Non-normal distributions: The results of the Shapiro–Wilk test indicated that several variables did not follow a normal distribution. This aspect should be considered when selecting appropriate statistical methods, possibly opting for non-parametric approaches when necessary.
  • Correlation analysis: The correlation matrices revealed complex interactions between variables, suggesting the need for modeling approaches that can capture these interdependencies.
  • Impact of actuators: An effect of the actuators on the environmental variables was evident. This aspect deserves detailed analysis, potentially through intervention models or interrupted time series analysis.
  • Geographic and climatic contextualization: The data were collected in specific geographical and climatic conditions of Bogotá, Colombia, that is, the Neotropical zone. It is recommended to consider these factors when interpreting the results or when comparing with studies in different contexts.
  • Comparison of environments: The differences observed between environments with and without cultivation offer opportunities for comparative analysis. It is suggested to explore statistical methods to quantify and characterize these differences. Some of the differences described in the text are evident in the descriptive statistics, such as the average ground humidity; the differentiation in the presence of outliers in the climatic variables; the distribution and normality values, which were influenced by the actuators in the environment with cultivation; and the stronger correlations observed in the environment with cultivation.

Author Contributions

Conceptualization, S.-C.V.-A. and J.B.-V.; methodology, S.-C.V.-A.; software, O.-M.G.-C.; validation, S.-C.V.-A., O.-M.G.-C., and A.R.-P.; formal analysis, D.-D.L.-L.; investigation, S.-C.V.-A. and D.-D.L.-L.; resources, J.B.-V.; data curation, A.R.-P.; writing—original draft preparation, O.-M.G.-C. and A.R.-P.; writing—review and editing, S.-C.V.-A.; visualization, D.-D.L.-L.; supervision, J.B.-V.; project administration, S.-C.V.-A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original data presented in this data descriptor are openly available in the Zenodo repository: https://doi.org/10.5281/zenodo.12175649 (accessed on 20 June 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Vatistas, C.; Avgoustaki, D.D.; Bartzanas, T. A Systematic Literature Review on Controlled-Environment Agriculture: How Vertical Farms and Greenhouses Can Influence the Sustainability and Footprint of Urban Microclimate with Local Food Production. Atmosphere 2022, 13, 1258. [Google Scholar] [CrossRef]
  2. Maraveas, C.; Karavas, C.S.; Loukatos, D.; Bartzanas, T.; Arvanitis, K.G.; Symeonaki, E. Agricultural Greenhouses: Resource Management Technologies and Perspectives for Zero Greenhouse Gas Emissions. Agriculture 2023, 13, 1464. [Google Scholar] [CrossRef]
  3. Koukounaras, A. Advanced Greenhouse Horticulture: New Technologies and Cultivation Practices. Horticulturae 2021, 7, 1. [Google Scholar] [CrossRef]
  4. Zhao, X.; Han, Y.; Lewlomphaisarl, U.; Wang, H.; Hua, J.; Wang, X.; Kang, M. Parallel Control of Greenhouse Climate with a Transferable Prediction Model. IEEE J. Radio Freq. Identif. 2022, 6, 857–861. [Google Scholar] [CrossRef]
  5. Ullah, I.; Fayaz, M.; Aman, M.; Kim, D. Toward Autonomous Farming—A Novel Scheme Based on Learning to Prediction and Optimization for Smart Greenhouse Environment Control. IEEE Internet Things J. 2022, 9, 25300–25323. [Google Scholar] [CrossRef]
  6. Instituto de Hidrología, Meteorología y Estudios Ambientales. Características Climatológicas de Ciudades Principales y Municipios Turísticos. 2024. Available online: http://www.ideam.gov.co/documents/21021/418894/Características+de+Ciudades+Principales+y+Municipios+Turísticos.pdf/c3ca90c8-1072-434a-a235-91baee8c73fc (accessed on 21 June 2024).
  7. Instituto de Hidrología, Meteorología y Estudios Ambientales; Fondo de previsión y Atención de Emergencias. Estudio de la Caracterización Climática de Bogotá y Cuenca Alta del rí o Tunjuelo. 2024. Available online: http://www.ideam.gov.co/documents/21021/21135/CARACTERIZACION+CLIMATICA+BOGOTA.pdf/d7e42ed8-a6ef-4a62-b38f-f36f58db29aa (accessed on 21 June 2024).
  8. Ferry-Morse. Bean, Blue Lake Stringless Pole Organic Seeds. 2024. Available online: https://ferrymorse.com/products/bean-blue-lake-stringless-pole-organic-seeds (accessed on 16 August 2024).
  9. NumFOCUS Inc. DataFrame Boxplot. 2024. Available online: https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.boxplot.html (accessed on 21 June 2024).
  10. The SciPy Community. SciPy Shapiro-Wilk. 2024. Available online: https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.shapiro.html (accessed on 21 June 2024).
  11. The SciPy Community. SciPy Skew. 2024. Available online: https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.skew.html (accessed on 21 June 2024).
  12. The SciPy Community. SciPy Kurtosis. 2024. Available online: https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.kurtosis.html (accessed on 21 June 2024).
  13. NumFOCUS Inc. DataFrame Correlation. 2024. Available online: https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.corr.html (accessed on 21 June 2024).
Figure 1. Box plot of the variables for the environment with cultivation.
Figure 1. Box plot of the variables for the environment with cultivation.
Data 09 00105 g001
Figure 2. Boxplot of the variables for the environment without cultivation.
Figure 2. Boxplot of the variables for the environment without cultivation.
Data 09 00105 g002
Figure 3. Dispersion over time in the environment with cultivation, indicating the activation state of the ventilation actuator.
Figure 3. Dispersion over time in the environment with cultivation, indicating the activation state of the ventilation actuator.
Data 09 00105 g003
Figure 4. Dispersion over time in the environment with cultivation, indicating the activation state of the irrigation actuator.
Figure 4. Dispersion over time in the environment with cultivation, indicating the activation state of the irrigation actuator.
Data 09 00105 g004
Figure 5. Dispersion over time in the environment with cultivation, indicating the activation state of the heating actuator.
Figure 5. Dispersion over time in the environment with cultivation, indicating the activation state of the heating actuator.
Data 09 00105 g005
Figure 6. Dispersion of field data in the greenhouse environment without cultivation over time.
Figure 6. Dispersion of field data in the greenhouse environment without cultivation over time.
Data 09 00105 g006
Figure 7. Statistical histogram of the variables for environments with and without cultivation.
Figure 7. Statistical histogram of the variables for environments with and without cultivation.
Data 09 00105 g007
Table 1. Descriptive statistics of environment with cultivation.
Table 1. Descriptive statistics of environment with cultivation.
StatisticHumTempLight IntensityLuminosityGround Humidity perco2 ppmAct FanAct Solenoid ValveAct Heating
count97,919.0097,919.0098,051.0092,900.0098,051.0098,051.0067,409.0067,409.0067,409.00
mean88.9223.170.03433.4666.87455.210.150.030.11
std13.454.120.04607.6711.44396.090.360.180.31
min14.2014.80−0.02−59.2733.000.000.000.000.00
25%82.2019.400.000.1660.00242.000.000.000.00
50%94.2024.500.0078.6771.00388.000.000.000.00
75%99.9025.600.06794.6976.00508.000.000.000.00
max99.9039.500.282645.4489.0020,865.001.001.001.00
Table 2. Descriptive statistics of environment without cultivation.
Table 2. Descriptive statistics of environment without cultivation.
StatisticHumTempLight IntensityLuminosityGround Humidity perco2 ppmAct FanAct Solenoid ValveAct Heating
count115,779.00115,779.00115,781.00110,877.00115,781.00115,781.0086,533.086,533.086,533.0
mean72.5221.110.02460.8817.00438.110.00.00.0
std14.554.440.06630.0422.19250.730.00.00.0
min37.6013.60−1.31−0.140.005.000.00.00.0
25%61.6017.300.000.166.00354.000.00.00.0
50%70.1020.000.0069.616.00388.000.00.00.0
75%80.6024.600.02882.527.00424.000.00.00.0
max99.9035.201.142734.9970.002365.000.00.00.0
Table 3. Null and missing values.
Table 3. Null and missing values.
EnvironmentFieldNull ValuesTotal Minutes without Record
With cultivationhum203861,838
temp203861,838
light_intensity190661,706
luminosity705766,857
ground_humidity_per190661,706
co2_ppm190661,706
act_fan32,54892,348
act_solenoid_valve32,54892,348
act_heating32,54892,348
Without cultivationhum245443,964
temp245443,964
light_intensity245243,962
luminosity735648,866
ground_humidity_per245243,962
co2_ppm245243,962
act_fan31,70073,210
act_solenoid_valve31,70073,210
act_heating31,70073,210
Table 4. Results of the normality test.
Table 4. Results of the normality test.
EnvironmentFieldTest Statisticp_Value
With cultivationhum0.7938 1.5728 × 10 132
temp0.9629 2.0756 × 10 86
light_intensity0.7392 1.1788 × 10 139
luminosity0.7463 1.7774 × 10 137
ground_humidity_per0.9336 6.1109 × 10 101
co2_ppm0.6630 1.2617 × 10 147
act_fan0.4284 2.3772 × 10 154
act_solenoid_valve0.1681 1.2863 × 10 166
act_heating0.3584 4.4510 × 10 158
Without cultivationhum0.9395 6.4584 × 10 102
temp0.9347 6.4742 × 10 104
light_intensity0.3538 2.0204 × 10 173
luminosity0.7563 1.3339 × 10 140
ground_humidity_per0.5408 6.3678 × 10 162
co2_ppm0.6145 3.4676 × 10 156
Table 5. Data symmetry and kurtosis.
Table 5. Data symmetry and kurtosis.
EnvironmentFieldKurtosisSkewness
With cultivationhum5.2232−1.4442
temp2.60660.2257
light_intensity6.35081.7282
luminosity3.75521.3444
ground_humidity_per2.6737−0.6325
co2_ppm95.49954.8878
act_fan4.78201.9447
act_solenoid_valve28.79995.2726
act_heating7.33812.5176
Without cultivationhum2.38880.5172
temp2.26880.5855
light_intensity117.13989.0983
luminosity3.39111.2333
ground_humidity_per3.14791.4569
co2_ppm21.42473.7241
Table 6. Variable’s correlations in environment with cultivation.
Table 6. Variable’s correlations in environment with cultivation.
HumTempLight IntensityLuminosityGround Humidity perco2 ppmAct FanAct Solenoid ValveAct Heating
hum1.000000−0.604911−0.072303−0.2474870.1385300.145315−0.1954370.049326−0.127851
temp−0.6049111.0000000.4300250.4375720.200001−0.1024520.011895−0.006932−0.087940
light intensity−0.0723030.4300251.0000000.8540610.2271820.0109020.0592700.009809−0.119948
luminosity−0.2474870.4375720.8540611.000000−0.030065−0.0570600.1157310.014779−0.154853
ground humidity per0.1385300.2000010.227182−0.0300651.0000000.253412−0.1147180.0343520.067306
co2 ppm0.145315−0.1024520.010902−0.0570600.2534121.000000−0.1764360.3688960.181121
act fan−0.1954370.0118950.0592700.115731−0.114718−0.1764361.0000000.017318−0.071265
act solenoid valve0.049326−0.0069320.0098090.0147790.0343520.3688960.0173181.000000−0.017410
act heating−0.127851−0.087940−0.119948−0.1548530.0673060.181121−0.071265−0.0174101.000000
Table 7. Variable’s correlations in environment without cultivation.
Table 7. Variable’s correlations in environment without cultivation.
HumTempLight IntensityLuminosityGround Humidity perco2 ppm
hum1.000000−0.515352−0.000974−0.2905870.6932760.106558
temp−0.5153521.0000000.2921490.6552300.0653930.019724
light intensity−0.0009740.2921491.0000000.5350520.2156640.012362
luminosity−0.2905870.6552300.5350521.000000−0.0351610.021030
ground humidity per0.6932760.0653930.215664−0.0351611.0000000.144275
co2 ppm0.1065580.0197240.0123620.0210300.1442751.000000
Table 8. Effect of actuators on the variables.
Table 8. Effect of actuators on the variables.
Fan, Solenoid_Valve, HeatingSample SizeFieldMeanStandard DeviationAffectation
0, 0, 043,675hum90.5112.47N/A
temp23.264.11N/A
light_intensity0.030.04N/A
luminosity447.46608.91N/A
ground_humidity_per67.1511.44N/A
co2_ppm432.12310.24N/A
0, 0, 16316hum84.6712.71−5.84
temp22.033.96−1.23
light_intensity0.010.03−0.02
luminosity169.80435.96−277.66
ground_humidity_per69.3610.712.21
co2_ppm660.41575.39228.29
0, 1, 01513hum94.827.494.31
temp23.003.98−0.26
light_intensity0.030.030
luminosity505.69630.0658.23
ground_humidity_per69.647.792.49
co2_ppm1313.10747.01880.98
0, 1, 1168hum95.702.645.19
temp18.300.36−4.96
light_intensity0.000.00−0.03
luminosity0.160.00−447.3
ground_humidity_per69.798.262.64
co2_ppm1973.551376.091541.43
1, 0, 08370hum83.3715.94−7.14
temp23.124.19−0.14
light_intensity0.030.040
luminosity626.90668.83179.44
ground_humidity_per63.5012.41−3.65
co2_ppm271.18168.62−160.94
1, 0, 1535hum72.5513.96−17.96
temp23.873.560.61
light_intensity0.020.04−0.01
luminosity340.10667.55−107.36
ground_humidity_per66.4614.14−0.69
co2_ppm315.01118.23−117.11
1, 1, 0358hum81.7112.65−8.8
temp24.813.761.55
light_intensity0.040.040.01
luminosity665.01658.24217.55
ground_humidity_per66.567.45−0.59
co2_ppm734.28376.26302.16
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Vanegas-Ayala, S.-C.; Barón-Velandia, J.; Garcia-Chavez, O.-M.; Romero-Palencia, A.; Leal-Lara, D.-D. Experimental Data in a Greenhouse with and without Cultivation of Stringless Blue Lake Beans. Data 2024, 9, 105. https://doi.org/10.3390/data9090105

AMA Style

Vanegas-Ayala S-C, Barón-Velandia J, Garcia-Chavez O-M, Romero-Palencia A, Leal-Lara D-D. Experimental Data in a Greenhouse with and without Cultivation of Stringless Blue Lake Beans. Data. 2024; 9(9):105. https://doi.org/10.3390/data9090105

Chicago/Turabian Style

Vanegas-Ayala, Sebastian-Camilo, Julio Barón-Velandia, Oscar-Mauricio Garcia-Chavez, Adrian Romero-Palencia, and Daniel-David Leal-Lara. 2024. "Experimental Data in a Greenhouse with and without Cultivation of Stringless Blue Lake Beans" Data 9, no. 9: 105. https://doi.org/10.3390/data9090105

APA Style

Vanegas-Ayala, S.-C., Barón-Velandia, J., Garcia-Chavez, O.-M., Romero-Palencia, A., & Leal-Lara, D.-D. (2024). Experimental Data in a Greenhouse with and without Cultivation of Stringless Blue Lake Beans. Data, 9(9), 105. https://doi.org/10.3390/data9090105

Article Metrics

Back to TopTop