Proximal Monitoring of CO2 Dynamics in Indoor Smart Farming: A Deep Learning and Image-Sensor Fusion Approach

Lee, Seunghun; Kim, Bora; Cheon, Sang-Gyu; Lee, Jae Won

doi:10.3390/su172310838

Open AccessArticle

Proximal Monitoring of CO₂ Dynamics in Indoor Smart Farming: A Deep Learning and Image-Sensor Fusion Approach

¹

Division of Mechanical Engineering, Korea Maritime & Ocean University, 727 Taejong-ro, Yeongdo-gu, Busan 49112, Republic of Korea

²

PANASIA Co., Ltd., 55 Mieumsandan3-ro, Gangseo-gu, Busan 46747, Republic of Korea

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Sustainability 2025, 17(23), 10838; https://doi.org/10.3390/su172310838

Submission received: 14 October 2025 / Revised: 1 December 2025 / Accepted: 2 December 2025 / Published: 3 December 2025

(This article belongs to the Section Sustainable Agriculture)

Download

Browse Figures

Versions Notes

Abstract

In controlled environment agriculture (CEA), CO₂ enrichment can promote photosynthesis while simultaneously reducing evapotranspiration, but the optimal settings vary depending on crop type, growth stage, and microclimate. This study presents a near-field remote sensing framework that fuses RGB image features with environmental variables to predict the CO₂ uptake/respiration dynamics of five leafy vegetables grown in a hydroponic culture system and evaluate their impact on resource efficiency under CO₂ control. A hybrid deep model incorporating You Only Look Once version 11 (YOLOv11) and a Residual Network with 50 layers (ResNet50) extracts growth-related visual cues and integrates them with tabular features (CO₂, temperature, and light conditions) to predict chamber CO₂ dynamics. Performance was evaluated by Mean Absolute Error (MAE)/Mean Squared Error (MSE) on withheld data, and the system-level impacts on water use (ET), pumping energy, and relative yield were analyzed using a conventional greenhouse model. The model exhibited high accuracy (MAE = 0.95; MSE = 1.62). Scenario analysis results showed that increasing ambient CO₂ concentration from 400 to 1200 ppm reduced modeled water demand by approximately 11%, increased modeled yield by approximately 9%, and resulted in a corresponding reduction in pumping energy per unit area. Unlike conventional single-crop, table-based approaches, this study demonstrates multi-crop generalization and image-environment fusion for CO₂ dynamic prediction, establishing proximity sensing as a viable decision-making layer for CEA. While yield/ET results were simulated rather than measured in long-term trials, and leaf area normalization was not available, the proposed framework provides a viable path for data-driven CO₂ control in indoor farms by linking image-based monitoring with operational optimization.

Keywords:

CO₂ capture; CO₂ enrichment control; absorption; indoor; energy consumption; low CO₂ concentration

1. Introduction

Climate change and environmental issues have emerged as critical challenges for humanity. The rapid acceleration of industrialization and urbanization has significantly increased greenhouse gas emissions, particularly carbon dioxide (CO₂), leading to global warming and extreme climate variations. These changes have resulted in severe consequences, including ecosystem disruption, rising sea levels, and an increase in the frequency of natural disasters, all of which pose direct threats to human survival [1,2,3].

To address these challenges, the International Energy Agency (IEA) has proposed the “Net Zero by 2050” initiative, which aims to achieve carbon neutrality by 2050 and limit the global temperature increase to within 1.5 °C [4]. One such strategy, Bioenergy with Carbon Capture and Storage/Utilization (BECCS/U), has garnered attention for its potential to both reduce atmospheric CO₂ and enhance biomass-based productivity in agricultural field [5,6]. Within this framework, the use of CO₂ enrichment in controlled environments-such as smart farms-offers a promising pathway to simultaneously promote plant growth and sequester carbon, making it highly relevant for sustainable agriculture and climate mitigation.

Photosynthesis, a vital process for plant growth and development, involves the absorption of CO₂ and the synthesis of carbohydrates. Numerous studies have sought to optimize photosynthesis by considering environmental conditions and various stages of plant growth. For example, Bhargava et al. [7] examined the effects of CO₂ concentration and light intensity on plant nutrition, while Chen et al. [8] optimized environmental factors such as temperature and illumination to enhance growth rates. Additionally, Jung et al. [9] developed a model to analyze the environmental factors influencing photosynthesis in Romaine lettuce. However, these studies often relied on traditional analytical methods, which struggled to account for the variability inherent in plant growth data.

Recent advancements in artificial intelligence (AI) have provided more robust tools for addressing variability in photosynthesis predictions. Ying et al. [10] developed a deep neural network (DNN) to predict photosynthetic rates based on light intensity, CO₂ concentration, and temperature. Zhang et al. [11] employed machine learning models, such as XGBoost (eXtreme Gradient Boosting) and support vector machines (SVMs), using tabular data (e.g., leaf area, length, and width) to forecast photosynthesis rates. While effective, these approaches were limited to structured data formats and single-crop predictions. Kaneko et al. [12] introduced a hybrid artificial neural network (ANN) model by preprocessing environmental data and integrating it with plant-specific information. Meanwhile, Niu et al. [13] utilized a backpropagation neural network (BPNN) model optimized with the Incremental Constructive Extreme Learning Machine (IELM) for photosynthesis prediction. Additionally, Zhang et al. [14] combined particle swarm optimization with DNNs to create a PSO-BP model for predicting photosynthesis rates based on fluorescence properties and environmental factors. However, significant limitations persist. Current models struggle to generalize across various crops, and most studies overlook image data, which has considerable potential for capturing growth variability and enhancing prediction accuracy.

To overcome these challenges, this study proposes a hybrid deep learning framework that integrates structured environmental data (e.g., CO₂ concentration, light conditions, temperature) with unstructured image data to predict short-term CO₂ uptake and release dynamics. Unlike existing approaches that focus solely on single crops or tabular inputs, this study generalizes to five leafy vegetables and leverages both visual and environmental cues. Combining CNN and YOLOv11-based image segmentation, it extracts plant-specific features (e.g., size, shape, and color patterns) and combines these with sensor data to generate accurate and time-resolved CO₂ dynamics predictions (Figure 1). Importantly, this prediction model is extended to evaluate operational impacts, specifically water use efficiency and yield responses, under various CO₂ control strategies. This step is crucial in indoor agricultural systems, where irrigation and pumping are energy-intensive. The analysis in the present study suggests that optimizing CO₂ concentration can simultaneously reduce water use and increase crop yields, contributing to energy savings and sustainable crop production. By linking computer vision, sensor fusion, and plant physiology, this research highlights the role of proximity remote sensing as a scalable tool for monitoring and optimizing CO₂-related processes in smart agricultural settings.

2. Materials and Methods

This section describes the methodology for quantifying CO₂ absorption and emission during the photosynthetic and respiratory processes of five leafy vegetable species. A deep learning model was utilized to predict the optimal CO₂ concentration at each growth stage, with the goal of maximizing photosynthetic efficiency and productivity. Additionally, the study investigates variations in crop yield in response to changes in water consumption, employing evapotranspiration models.

2.1. Material Preparation and Characterization

The leafy vegetable species selected for this study were red skirt lettuce Lactuca sativa L. var. crispa ‘Red Skirt’), green skirt lettuce (L. sativa L. var. crispa ‘Green Skirt’), lettuce red (L. sativa L. var. acephala ‘Lettuce Red’), bok choy (Brassica rapa L. subsp. chinensis ‘Bok Choy’), and romaine lettuce (L. sativa L. var. longifolia ‘Romaine’) all obtained from ASIA SEED Co. (Seoul, Korea) These species were selected to represent two major families in leafy green production, Asteraceae (Lactuca spp.) and Brassicaceae (Brassica rapa), and to include varieties with different leaf pigmentation. All selected species utilize the C3 photosynthetic pathway. A high-purity CO₂ gas (99.999%, KyungDong Gas, Busan, Korea) was used to regulate CO₂ concentration within enclosed chambers. Seedlings were grown hydroponically for 21 days post-germination. For the experiment, plants were transplanted into the experimental chamber when they had developed 5–6 true leaves. This point of transplanting was designated as Day 1 of the experiment. Environmental conditions were continuously monitored using CO₂ and temperature sensors. Additionally, high-resolution images of the plants were captured to document growth patterns and morphological changes over time. The camera used for this experiment was a Samsung S5KHM3 (Samsung Electronics, Suwon, Korea), with a resolution of 108 megapixels and an aperture of f/1.8. A single camera was used throughout the entire imaging process. A single camera was employed throughout the entire imaging process and was positioned at the center of the top surface of the chamber. During image capture, the lighting provided by the LED grow lights, which were used for plant growth, served as the only light source, ensuring consistency with the growth conditions.

2.2. Experimental Method

The selected plant species were cultivated in a hydroponic system under controlled environmental conditions (25–30 °C, 60–70% relative humidity). Following germination, the plants were exposed to 100 W·m⁻² LED lighting to promote growth. To ensure consistent nutrient availability, the hydroponic solution was replenished every three days. The experimental setup for measuring CO₂ concentration is illustrated in Figure 2. It is important to note that the CO₂ chamber used in this study was designed with fully opaque black walls and external blackout curtains to completely eliminate incidental light during both the LED-illuminated phase and the dark phase. Because this light-shielding structure prevents interior photographs from capturing the chamber layout or sensor configuration, real images do not provide meaningful technical information; therefore, a schematic representation is provided instead. Figure 2 presents a detailed schematic that clearly illustrates the internal configuration of the enclosed chamber, including the CO₂ inlet for concentration control, the environmental control unit, the arrangement of LED modules that define illumination conditions, the overhead camera mount, and the placement of CO₂ and temperature sensors.

For the CO₂ absorption experiment, eight plants of the same species were placed in a sealed chamber, with external air introduced to stabilize temperature and CO₂ levels. The CO₂ concentration inside the chamber was initially elevated to 1200 ppm. Photosynthetic activity was then stimulated by activating the light source, and CO₂ absorption was monitored until the concentration decreased to 400 ppm. Immediately following the absorption phase, a CO₂ emission experiment was conducted. During this phase, the light source was turned off, and blackout curtains were employed to create a dark environment, simulating nighttime conditions and facilitating plant respiration. CO₂ emissions were measured until the concentration returned to 1200 ppm. This rapid cycling protocol was designed to efficiently generate a dynamic dataset for model training. A total of five consecutive cycles were performed over a 24 h period for each measurement day. The CO₂ concentration curves presented in the results represent the data from a single, representative cycle performed after an initial stabilization period, rather than an average of multiple cycles. Images were captured once per measurement cycle. Specifically, images were taken at the beginning of each light phase, when the CO₂ concentration was at 1200 ppm, to correlate the plant’s morphological state with the subsequent CO₂ absorption dynamics. Quantitative data on leaf area and color changes were extracted from the images and analyzed for correlations with CO₂ absorption and emission rates. Furthermore, it is crucial to clarify that the CO₂ exchange data represent the total values for the entire plant biomass in the chamber and are not normalized by leaf area. As the quantitative leaf area data were not systematically recorded, direct comparisons of the magnitude of CO₂ uptake between species should be made with caution, as they are influenced by the overall plant size. This reveals rich and dynamic CO₂ patterns, which are primarily used to train and validate deep learning prediction models.

2.3. Methodology for Hybrid Prediction Model

A predictive model for CO₂ absorption in crops was developed by integrating tabular data with the output of a hybrid deep learning model that combines ResNet50 and YOLOv11. The model was implemented in Python 3.9, a widely adopted language in machine learning research due to its extensive ecosystem of optimized libraries (TensorFlow 2.10, Keras 2.10, Scikit-learn for deep learning 1.2; NumPy 1.23, SciPy 1.10 for numerical computation) and robust GPU acceleration support through CUDA 11.8 integration. Python’s interpretability and rapid prototyping capabilities facilitated iterative model development and hyperparameter tuning. The implementation utilized an NVIDIA RTX 3090 4-way Ti GPU cluster for accelerated training and inference. Key machine learning and deep learning frameworks, including TensorFlow 2.10, Keras 2.10, Scikit-learn 1.2, and Ultralytics 8.0 [15] were employed in the model development process.

The dataset used for training consisted of image data obtained from CO₂ absorption experiments. The training dataset consisted of 225 original images collected from five crop species at three growth stages: Days 1, 6, and 10 post-transplanting (DPT). Images were captured at 1050 × 1400 × 3 pixel resolution under CO₂ concentrations of 400–1200 ppm. Data augmentation (rotation ±20°, flips, spatial translations) was applied with an augmentation factor of 298×, yielding 67,050 total images. The dataset was split into training (70%, 46,935 images), validation (15%, 10,057 images), and test (15%, 10,058 images) sets. During the data preprocessing stage, data augmentation techniques were applied [16]. In this process, various transformations were implemented while ensuring that the modifications did not extend beyond the original crop images. Specifically, rotation within a 20° range, vertical and horizontal flipping, and slight positional shifts in all directions were applied. Through data augmentation, the dataset size was increased by a factor of 298. Before training, data augmentation techniques were applied to enhance the model’s robustness. The dataset was then divided into training, validation, and test sets in a 7:1.5:1.5 ratio to ensure a balanced evaluation of the model [17].

2.3.1. YOLOv11 Transfer Learning

To enable automated crop recognition and physical feature extraction, the YOLOv11 instance segmentation model was adapted through transfer learning. The model was initialized with weights pretrained on the COCO dataset (80 object categories) [18], providing robust low-level feature detection (edges, textures, shapes) that generalizes across visual domains. Fine-tuning specialized the model for five crop species (RSL, GSL, LR, BC, RL) using selective layer freezing: backbone convolutional layers retained COCO-learned weights for general feature extraction, while detection and segmentation heads were retrained on crop-specific data to learn species-distinctive characteristics. Training employed stochastic gradient descent (momentum = 0.9) with initial learning rate 0.01 (cosine annealing decay), batch size 16, converging after approximately 100–150 epochs. The fine-tuned model outputs instance segmentation masks for each crop, enabling extraction of physical features: leaf area (cm²), plant count, red pigmentation ratio (%), and green pigmentation ratio (%). These metrics quantify biomass and physiological characteristics directly related to photosynthetic capacity.

2.3.2. Hybrid Neural Network Architecture for CO₂ Prediction

The CO₂ prediction model integrates three distinct data streams through a fully connected neural network architecture (schematically illustrated in Figure 1; detailed architecture in Figure 3). The first stream consists of physical features extracted from YOLOv11, comprising four quantitative metrics: leaf area, plant count, red pigmentation ratio, and green pigmentation ratio. These features explicitly characterize crop biomass and chlorophyll content. The second stream incorporates hierarchical visual features from ResNet50, a convolutional neural network pretrained on ImageNet. Feature maps are extracted from the final convolutional layer, yielding 2048-dimensional representations that encode complex visual patterns such as leaf texture, plant architecture, and subtle color variations not explicitly quantifiable through simple metrics. The third stream captures tabular environmental data through seven features: CO₂ concentration (ppm), temperature (°C), light status (binary: dark or illuminated), and crop species identity (one-hot encoded across five categories). The concatenated input vector of 2059 dimensions is processed through three sequential fully connected layers with 512, 256, and 128 nodes, respectively, each employing ReLU activation and dropout regularization (rate = 0.3) for the first two layers. The final output layer consists of a single node with linear activation to predict chamber CO₂ concentration change (ppm). Model training employed the Adam optimizer with an initial learning rate of 0.001 and exponential decay, minimizing mean squared error loss with a batch size of 32, which was selected through optimization trials comparing batch sizes of 16, 32, and 64. Early stopping with 300-epoch patience was implemented, ultimately selecting model weights from epoch 100 where validation loss reached its minimum (0.95 ppm MAE). This architecture leverages complementary information: explicit physical measurements provide interpretable features directly linked to photosynthetic processes, while deep visual features capture subtle growth-stage cues that enhance prediction robustness across different crop species and developmental stages.

2.3.3. Model Training and Performance Evaluation

To quantify the model’s predictive accuracy, Mean Absolute Error (MAE) and Mean Squared Error (MSE) were employed as the primary evaluation metrics. MAE measures the average magnitude of the errors between predicted and observed values, while MSE gives higher weight to larger errors. MSE was used as the loss function to guide model optimization during training, and MAE was used to assess the final validation performance. The metrics are defined in Equations (1) and (2), where

\hat{Y_{i}}

and

Y_{i}

represent the predicted and observed values, respectively, and n is the total number of samples.

M A E = \frac{1}{n} \sum_{i = 1}^{n} |\hat{Y_{i}} - Y_{i}|

(1)

M S E = \frac{1}{n} \sum_{i = 1}^{n} (\hat{Y_{i}} - Y_{i})^{2}

(2)

2.4. Analysis of Water Consumption, Energy Use, and Relative Yield

To analyze the impact of CO₂ enrichment on resource efficiency, crop evapotranspiration, associated energy consumption, and relative changes in crop yield were estimated.

2.4.1. Evapotranspiration and Water Consumption Modeling

Evapotranspiration (ET) was estimated using the Stanghellini models, as described in Equation (3) [19]. These models are particularly suited for enclosed environments, such as greenhouses, rather than open-field conditions [20]. The specific input parameters and environmental conditions used for the Stanghellini equation are detailed in Table 1.

E T = \frac{δ R_{n} (\frac{2 L A I ρ_{a} C_{a}}{r_{c}}) V P D}{γ (1 + \frac{δ}{γ} + \frac{r_{i}}{r_{e}})}

(3)

The internal crop resistance is a critical factor in ET estimation, as it varies directly with CO₂ concentration levels. To investigate the effect of CO₂ uptake on evapotranspiration, CO₂ concentrations were varied from 400 ppm to 1200 ppm for analysis.

Evapotranspiration flux (ET) is expressed in [W m⁻²], representing the rate of water vapor transfer from the crop surface to the atmosphere. The parameter δ represents the slope of the saturation curve of the psychrometric chart, expressed in [Pa °C⁻¹]. Similarly, R_n denotes the net radiation, given in [W m⁻²]. The leaf area index (LAI) is a dimensionless parameter that quantifies the leaf surface area per unit ground area, influencing transpiration and photosynthesis rates. In this study, LAI was fixed at 4.4 because the objective was to isolate the physiological effect of CO₂ induced changes in internal crop resistance (r_i). Since LAI was not measured dynamically, and varying it without measured data could introduce structural errors. Since the immediate response of C3 plants to elevated CO₂ is stomatal closure reducing transpiration, fixing LAI allowed the analysis to focus on the CO₂–r_i relationship. The parameters ρ_a and C_a represent the air density [kg m⁻³] and the specific heat capacity of air [J kg⁻¹ °C⁻¹], respectively. The vapor pressure deficit (VPD) is expressed in [Pa], while the psychometric constant (γ) is given in [Pa °C⁻¹]. The internal crop resistance and external crop resistance (r_e) are both measured in [s m⁻¹]. The internal leaf resistance can be approximated using the microclimate parameters of the greenhouse by applying Stanghellini’s equation, as presented in Equation (4).

r_{i} = r_{m i n} r_{i} (I_{s}) r_{i} (T_{0}) r_{i} (C_{{C O}_{2}}) r_{i} (P_{s} - P_{a})

(4)

The parameters influencing

r_{i}

include

r_{m i n}

, the minimum internal resistance;

I_{s}

[W m⁻²] the solar radiation;

T_{0}

[°C], the leaf surface temperature;

C_{{C O}_{2}}

, the CO₂ concentration; and

P_{s}

and

P_{a}

[kPa], which represent the saturation vapor pressure and actual vapor pressure, respectively.

C_{{C O}_{2}}

is a key parameter in this study and its effect on internal resistance is described by Equation (5).

r_{i} (C_{{C O}_{2}}) = 1 + C_{1} {(C_{{C O}_{2}} - 200)}^{2}

(5)

In this study, the parameter

C_{1}

and the minimum internal resistance (

r_{m i n}

) are adopted from Stanghellini’s research, as presented in Table 2.

By calculating Equation (3) using the derived parameters, the crop’s evapotranspiration can be estimated, enabling the quantification of water consumption during the growth period. Additionally, leaf conductance is defined as the reciprocal of internal resistance (

r_{i}

).

2.4.2. Energy Consumption Modeling

To quantify the energy savings from reduced water consumption, the power required for groundwater pumping was calculated using Equation (6), as agricultural water in Korea primarily relies on groundwater.

E = \frac{9.8 \times W_{l i f t} \times W_{m a s s}}{3.6 \times 10^{6} \times ρ}

(6)

The parameters used in this equation are summarized in Table 3.

2.4.3. Relative Yield Estimation

It is important to clarify that the yield values presented in this study were not obtained from direct physical harvesting of the crops. Instead, they are simulated estimates of relative yield changes, calculated using the empirical model as shown in Equation (7) [21]. This model predicts that a decrease in leaf conductance, the reciprocal of internal resistance, reduces the uptake of airborne pollutants through stomata, which can lead to improvements in yield. Specifically, when leaf conductance decreases by x%, the absorption of pollutants, such as ozone (O₃) and sulfur dioxide (SO₂), both measured in ppm, into leaf tissues also decreases by x%. To quantify the impact of pollutant absorption on crop yield, the following predictive models were used [21]. To reflect conditions in Korea, average O₃ and SO₂ concentrations of 0.09 ppm and 0.008 ppm, respectively, were applied in the model based on monitoring data from Air Korea [22,23].

Y = 534.5 - 3988.6 [O_{3}] - 479.7 [S O_{2}] + 2661 [O_{3}] [S O_{2}] + 10960 {[O_{3}]}^{2}

(7)

To analyze the effects of varying CO₂ concentrations, internal leaf resistance was calculated for CO₂ levels ranging from 400 to 1200 ppm. As CO₂ concentrations increased, internal leaf resistance also rose, while its reciprocal, leaf conductance, decreased. The reduction in leaf conductance at CO₂ concentrations of 600, 800, 1000, and 1200 ppm, relative to the baseline of 400 ppm, was computed using Equations (2) and (3). The average decline in leaf conductance across these conditions was subsequently utilized to estimate potential yield increases. While the empirical model has been extensively validated in greenhouse agriculture with typical accuracy of ±10–15%, our specific predictions require confirmation through controlled cultivation trials [21]. We propose a randomized complete block design with three treatments replicated across 3–5 growth cycles comparing ambient CO₂ (~400 ppm), fixed enrichment (1200 ppm), and variable enrichment (1000–1100 ppm during early growth, 1150–1200 ppm during late growth). Direct measurements should include fresh weight at harvest to validate the 9.3% yield increase, cumulative water consumption to validate 11.1% savings, total energy costs, and temporal LAI dynamics. Statistical validation via ANOVA with α = 0.05 significance threshold would compare treatment effects. Trials should be conducted in operational smart farms to assess real-world performance under spatial heterogeneity and environmental fluctuations, quantifying return on investment and identifying any deviations from model predictions.

3. Results and Discussion

This study investigates the effects of CO₂ concentration on the growth characteristics, water consumption, and CO₂ absorption of five leafy vegetable species cultivated under smart farming conditions in Korea. Experiments were conducted in a sealed chamber with CO₂ concentrations ranging from 400 to 1200 ppm. Based on the collected experimental data, a hybrid CNN model was developed to predict the CO₂ absorption and emission rates of the crops. Additionally, the study presents a system-level analysis of crop growth, water consumption, and CO₂ utilization using this predictive model.

3.1. Experimental Analysis of CO₂ Concentration Changes

To investigate the relationship between CO₂ concentration and crop physiology, variations in CO₂ absorption and emission were analyzed at different time points post-transplanting. The experiment considered key environmental factors, including CO₂ concentration, temperature, light availability, crop species, and leaf characteristics. Figure 4 illustrates the CO₂ concentration dynamics for five crop species during a representative measurement cycle, showing rapid absorption in the light phase and subsequent emission in the dark phase. A key observation is that the total CO₂ respired during the dark phase nearly equaled the amount assimilated during the light phase. It is important to note that this near-zero net carbon gain observed within a single cycle is a limitation of the short-cycle experimental design, which was specifically designed to generate model training data, and should not be interpreted as the plant’s overall carbon balance under a typical diurnal growth cycle.

Figure 5 presents the CO₂ absorption dynamics for the five crop species, measured at DPT 1, 6, and 10 to examine temporal variations in photosynthetic activity. The data represent the change in absolute CO₂ concentration within the chamber and is not normalized by leaf area. For green skirt lettuce and romaine lettuce (Figure 5a,b), the rate of CO₂ absorption markedly increased from DPT 1 to DPT 10. This trend provides strong qualitative evidence that as the plants developed, their total leaf area expanded, thereby enhancing their overall photosynthetic capacity, even though the specific leaf area was not quantified. In contrast, for bok choy (Figure 5c), the CO₂ absorption trend remained similar between Day 6 and Day 10. This plateau effect was likely attributed to growth saturation, suggesting that by Day 6, the plant’s leaf area had reached a point where further expansion was limited, causing CO₂ absorption rates to stabilize.

The red-pigmented species, red skirt lettuce and lettuce red (Figure 5d,e), exhibited different characteristics. To investigate the impact of pigmentation, we analyzed the proportion of red-pigmented areas using the image segmentation capabilities of our model, with the results presented in Figure 6 [24].For red skirt lettuce (Figure 6a), the proportion of red pigmentation increased from 2.8% at Day 1 to 38.2% at Day 10, which correlated with a slowdown in CO₂ absorption between Day 6 and Day 10. Conversely, for lettuce red (Figure 6b), red pigmentation increased to 41.4% by Day 10, while CO₂ absorption continued to rise throughout the period. These differing trends suggest a complex relationship between anthocyanin accumulation (red pigmentation) and photosynthetic activity that may be species-specific.

3.2. Performance of the Trained Model

This study proposes a model for predicting CO₂ concentration in crops by integrating the outputs of a hybrid CNN model with tabular data. The hybrid model combines YOLOv11 and ResNet50 to improve prediction accuracy. The crop image data collected during the experiments were used to fine-tune YOLOv11, an instance segmentation model. The trained model effectively identified and highlighted the research crops, including green skirt lettuce, romaine lettuce, bok choy, red skirt lettuce, and lettuce red (Figure 7). The features extracted from the segmented masks, such as crop size and color ratio, served as input variables for the predictive model.

Additionally, ResNet50, a deep neural network comprising 50 layers, was utilized to extract hierarchical features including shape, texture, and color from the images [17]. The resulting feature maps, shown in Figure 8, provided valuable input for further analysis. It represented the original data in Figure 8a. The middle images illustrated the feature extraction process, where Figure 8b corresponded to the extraction of low-level features. At this step, the initial convolutional layers of ResNet50 detected and extracted low-level features such as edges, textures, and contrasts. This process was commonly referred to as edge detection, which played a crucial role in learning patterns within CNN architectures. In Figure 8c, as the network depth increased, the model learned high-level features, allowing it to recognize plant shapes and structural patterns. This step facilitated the classification of crops based on leaf structure and morphological characteristics. It also presented a binarized contour image in Figure 8d, demonstrating the segmentation process utilizing feature maps extracted by ResNet50. This technique enhanced plant contour detection, allowing for the analysis of plant growth status and the extraction of specific structural features.

The combination of YOLOv11 for instance segmentation and ResNet50 for feature extraction facilitated the precise identification of crop regions while providing additional data on size and color ratio. These extracted features were subsequently integrated to enhance the model’s predictive performance. Furthermore, the image-based data were combined with tabular data containing environmental parameters that influence CO₂ absorption, as well as hierarchical and physical crop characteristics. The final integrated dataset served as input for the neural network model, enabling the prediction of CO₂ absorption rates with high accuracy.

Furthermore, the extracted data were combined with tabular data, which included environmental factors affecting water CO₂ absorption, as well as hierarchical and physical features. This integrated dataset was then used as input for the neural network model, to predict the CO₂ absorption rates of the crops.

The model’s hyperparameters were optimized, and the final model was trained with a batch size of 32. As shown in Figure 9, training was halted at 100 epochs using an early stopping protocol to prevent overfitting. The performance of this optimized model was evaluated on the test set, achieving a Mean Absolute Error (MAE) of 0.95 and a Mean Squared Error (MSE) of 1.62. This indicates a high level of predictive accuracy across the five different crop types and their growth stages. The corresponding training and validation loss curves demonstrate that the validation loss reached its minimum at approximately 100 epochs, after which it began to rise while the training loss continued to decrease indicating the onset of overfitting. Therefore, the model corresponding to the 100th epoch was selected as the optimal model for further analysis. Unlike previous studies that primarily focused on a single crop type, the proposed model successfully predicted CO₂ absorption patterns across five different crops and their respective growth stages. The following results compare the model’s predictions with experimental measurements to validate its performance across various crop conditions.

The predictive performance of the deep learning model was validated by comparing its predicted CO₂ concentrations with the experimentally measured values on DPT 1, as shown in Figure 10. Under these conditions, the order of CO₂ absorption was observed as follows: green skirt lettuce > bok choy > lettuce red > romaine lettuce > red skirt lettuce. During the early growth stage, CO₂ concentrations ranged from 1000 to 1100 ppm, and from 1150 to 1200 ppm just before the late growth stage. These results indicate that the model successfully captured both crop-specific and time-dependent variations in CO₂ uptake. In addition, texture-based feature extraction from leaf images revealed that crops with reddish leaf coloration tended to exhibit slower growth and delayed CO₂ absorption. This observation supports the variation in uptake patterns, which can be attributed to phenotypic differences in leaf development during early growth.

3.3. Energy Conversion Results

To conduct the analysis using the empirical crop model, experimental data were collected under controlled smart farm conditions representative of Korea’s atmospheric environment. As previously described, average O₃ and SO₂ concentrations in Korea were applied in the model using one month of monitoring data from Air Korea, with values of 0.09 ppm and 0.008 ppm, respectively [22,23]. Figure 11 shows that as CO₂ concentration increased from 400 to 1200 ppm, leaf conductance decreased by 36.4%, while crop yield increased by 9.27%.

By applying the calculated internal resistance and other collected parameters, annual evapotranspiration per unit area across CO₂ concentrations was determined. Figure 12 indicates that water required for evapotranspiration decreased by approximately 11%.

To quantify the energy savings associated with reduced evapotranspiration, the energy required for agricultural water use was calculated. In Korea, agricultural water primarily relies on groundwater [25]. The power consumption required for groundwater pumping was determined using (7) [26]. In Table 3, W_lift represents total dynamic head in meters, and η denotes pump efficiency, typically set at 40% [27]. Groundwater depth was estimated at approximately 33.7 m. Annual energy requirements per unit area were 0.43 kWh at 400 ppm CO₂ and decreased to 0.38 kWh at 1200 ppm, representing an 11% reduction in energy consumption.

It presents the variations in crop growth, water consumption, and CO₂ absorption for each method, as summarized in Table 4 and Figure 13. The developed model was utilized to maintain the CO₂ concentration within the range of 1000 to 1200 ppm during the growth stages, thereby optimizing the CO₂ concentration process. Continuing with this methodology, the optimization of CO₂ concentration resulted in increased photosynthetic activity, reduced leaf stomatal conductance, and consequently, a decrease in the CO₂ injection required to maintain the optimized concentration [28]. As a result, water consumption was reduced, and crop growth was enhanced.

4. Limitations and Future Directions

Several limitations should be acknowledged. The controlled environment employed fixed light intensity (100 W LED), temperature (25–30 °C), and CO₂ range (400–1200 ppm), limiting extrapolation to variable field conditions. Chamber-scale experiments (8 plants) do not replicate commercial-scale spatial heterogeneity. The model was trained exclusively on five leafy vegetable species grown hydroponically; generalization to other crop types requires additional validation. CO₂ measurements represent chamber-level integrated exchange, and LAI was held constant (4.4) due to equipment constraints, potentially underestimating temporal growth dynamics. Yield improvements (9.3%) and water savings (11.1%) are simulation-based estimates requiring validation through harvest trials. Despite these limitations, this study demonstrates that vision-based multi-species CO₂ monitoring is feasible without invasive measurements, providing a validated framework for future pilot-scale implementation. Priority future work includes expanding crop diversity, testing variable environmental conditions, implementing continuous LAI measurement, and conducting harvest validation trials.

5. Conclusions

This study established a vision-based framework for non-invasive, multi-species CO₂ monitoring in controlled environment agriculture, addressing the critical need for automated crop-responsive CO₂ management without species-specific recalibration. Three key findings demonstrate practical feasibility.

First, the hybrid deep learning model integrating YOLOv11 segmentation, ResNet50 features, and environmental data achieved accurate CO₂ prediction across five leafy vegetable species (MAE = 0.95 ppm, MSE = 1.62), eliminating the need for invasive gas exchange measurements.

Second, variable CO₂ enrichment optimized through this monitoring approach yielded 7.4% greater cumulative CO₂ absorption compared to fixed enrichment, translating to projected 9.3% yield improvements and 11.1% water savings through reduced evapotranspiration.

Third, energy analysis demonstrated net positive returns, with reduced water pumping requirements offsetting CO₂ generation costs. The proposed system is scalable, non-destructive, and compatible with existing smart farm infrastructure, enabling real-time optimization of CO₂ supplementation for enhanced resource efficiency. While harvest-level validation is necessary to confirm simulated yield benefits, the validated monitoring framework provides growers with a practical tool for implementing data-driven CO₂ management strategies that improve both crop productivity and environmental sustainability in indoor agriculture.

Author Contributions

Conceptualization, S.L. and J.W.L.; methodology, S.L.; software, S.L.; validation, S.L., B.K. and S.-G.C.; formal analysis, S.L.; investigation, S.L.; resources, S.-G.C. and J.W.L.; data curation, S.L.; writing—original draft preparation, S.L.; writing—review and editing, B.K. and J.W.L.; visualization, S.L. and B.K.; supervision, J.W.L.; project administration, J.W.L.; funding acquisition, J.W.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Korea Institute for Advancement of Technology (KIAT) through grant funded by the Korea Government (MOTIE). (RS-2024-00424595, Regional Residency Program for Cultivating Advanced Research Talent in Next-Generation Marine Mobility Industry Innovation). It was also supported by the National Research Foundation of Korea (NRF) grant funded by the Korea Government(MSIT) (RS-2025-00523335), and by the Korea Maritime and Ocean University Research Fund in 2023.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

Author Sang-Gyu Cheon was employed by the company PANASIA Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Lindsey, R.; Dahlman, L. Climate Change: Global Temperature; National Oceanic and Atmospheric Administration (NOAA): Washington, WA, USA, 2020. [Google Scholar]
Lan, X.; Tans, P.; Thoning, K. Trends in Globally-Averaged CO₂ Determined from NOAA Global Monitoring Laboratory Measurements. 2023. Available online: https://gml.noaa.gov/ccgg/trends/global.html?doi=10.15138/9n0h-zh07 (accessed on 1 December 2025).
Fawzy, S.; Osman, A.I.; Doran, J.; Rooney, D.W. Strategies for Mitigation of Climate Change: A Review. Environ. Chem. Lett. 2020, 18, 2069–2094. [Google Scholar] [CrossRef]
Bouckaert, S.; Pales, A.F.; McGlade, C.; Remme, U.; Wanner, B.; Varro, L.; D’Ambrosio, D.; Spencer, T. Net Zero by 2050—A Roadmap for the Global Energy Sector; International Energy Agency: Paris, France, 2021. [Google Scholar]
Gielen, D.; Gorini, R.; Wagner, N.; Leme, R.; Gutierrez, L.; Prakash, G.; Asmelash, E.; Janeiro, L.; Gallina, G.; Vale, G. Global Energy Transformation: A Roadmap to 2050; International Renewable Energy Agency (IRENA): Masdar City, United Arab Emirates, 2019. [Google Scholar]
Zhai, P.; Pörtner, H.O.; Roberts, D.; Skea, J.; Shukla, P.R.; Pirani, A.; Moufouma-Okia, W.; Péan, C.; Pidcock, R.; Connors, S. Global Warming of 1.5 °C: IPCC Special Report on Impacts of Global Warming of 1.5 °C above Pre-Industrial Levels in Context of Strengthening Response to Climate Change, Sustainable Development, and Efforts to Eradicate Poverty; Cambridge University Press: Cambridge, UK, 2022. [Google Scholar]
Bhargava, S.; Mitra, S. Elevated Atmospheric CO₂ and the Future of Crop Plants. Plant Breed. 2021, 140, 1–11. [Google Scholar] [CrossRef]
Chen, D.; Zhang, J.; Sun, Z.; Zhang, Z.; Hu, J. Multi-Objective Optimal Regulation Model and System Based on Whole Plant Photosynthesis and Light Use Efficiency of Lettuce. Comput. Electron. Agric. 2023, 206, 107617. [Google Scholar] [CrossRef]
Jung, D.H.; Kim, D.; Yoon, H.I.; Moon, T.W.; Park, K.S.; Son, J.E. Modeling the Canopy Photosynthetic Rate of Romaine Lettuce (Lactuca sativa L.) Grown in a Plant Factory at Varying CO₂ Concentrations and Growth Stages. Hortic. Environ. Biotechnol. 2016, 57, 487–492. [Google Scholar] [CrossRef]
Qu, Y.; Clausen, A.; Jørgensen, B.N. Application of Deep Neural Network on Net Photosynthesis Modeling. In Proceedings of the IEEE International Conference on Industrial Informatics (INDIN), Palma de Mallorca, Spain, 21–23 July 2021; Institute of Electrical and Electronics Engineers Inc.: New York, NY, USA, 2021; Volume 2021-July. [Google Scholar]
Zhang, X.Y.; Huang, Z.; Su, X.; Siu, A.; Song, Y.; Zhang, D.; Fang, Q. Machine Learning Models for Net Photosynthetic Rate Prediction Using Poplar Leaf Phenotype Data. PLoS ONE 2020, 15, e0228645. [Google Scholar] [CrossRef] [PubMed]
Kaneko, T.; Nomura, K.; Yasutake, D.; Iwao, T.; Okayasu, T.; Ozaki, Y.; Mori, M.; Hirota, T.; Kitano, M. A Canopy Photosynthesis Model Based on a Highly Generalizable Artificial Neural Network Incorporated with a Mechanistic Understanding of Single-Leaf Photosynthesis. Agric. For. Meteorol. 2022, 323, 109036. [Google Scholar] [CrossRef]
Niu, Y.; Lyu, H.; Liu, X.; Zhang, M.; Li, H. Photosynthesis Prediction and Light Spectra Optimization of Greenhouse Tomato Based on Response of Red–Blue Ratio. Sci. Hortic. 2023, 318, 112065. [Google Scholar] [CrossRef]
Zhang, P.; Zhang, Z.; Li, B.; Zhang, H.; Hu, J.; Zhao, J. Photosynthetic Rate Prediction Model of Newborn Leaves Verified by Core Fluorescence Parameters. Sci. Rep. 2020, 10, 3013. [Google Scholar] [CrossRef] [PubMed]
Jocher, G.; Qiu, J. Ultralytics YOLO11. 2024. Available online: https://github.com/ultralytics/ultralytics (accessed on 1 December 2025).
Shorten, C.; Khoshgoftaar, T.M. A Survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef]
Wang, Y.; Ke, Z.; He, Z.; Chen, X.; Zhang, Y.; Xie, P.; Li, T.; Zhou, J.; Li, F.; Yang, C.; et al. Real-Time Burn Depth Assessment Using Artificial Networks: A Large-Scale, Multicentre Study. Burns 2020, 46, 1829–1838. [Google Scholar] [CrossRef] [PubMed]
Lin, T.-Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Dollár, P.; Zitnick, C.L. LNCS 8693-Microsoft COCO: Common Objects in Context; Springer International Publishing: Cham, Switzerland, 2014. [Google Scholar]
Stanghellini, C. Transpriation of Greenhouse Crops an Aid to Climate Management; Institute of Agricultural Engineering (IMAG): Wageningen, The Netherland, 1987. [Google Scholar]
Villarreal-Guerrero, F.; Kacira, M.; Fitz-Rodríguez, E.; Kubota, C.; Giacomelli, G.A.; Linker, R.; Arbel, A. Comparison of Three Evapotranspiration Models for a Greenhouse Cooling Strategy with Natural Ventilation and Variable High Pressure Fogging. Sci. Hortic. 2012, 134, 210–221. [Google Scholar] [CrossRef]
Allen, L.H. Plant Responses to Rising Carbon Dioxide and Potential Interactions with Air Pollutants. J. Environ. Qual. 1990, 19, 15–34. [Google Scholar] [CrossRef]
Air Korea. O₃ concentration comparison data (ItemCode 10003). Available online: https://www.airkorea.or.kr (accessed on 1 December 2025).
Air Korea. SO₂ concentration comparison data (ItemCode 10001). Available online: https://www.airkorea.or.kr (accessed on 1 December 2025).
Ma, G.; Yue, X. An Improved Whale Optimization Algorithm Based on Multilevel Threshold Image Segmentation Using the Otsu Method. Eng. Appl. Artif. Intell. 2022, 113, 104960. [Google Scholar] [CrossRef]
Karunakalage, A.; Lee, J.Y.; Daqiq, M.T.; Cha, J.; Jang, J.; Kannaujiya, S. Characterization of Groundwater Drought and Understanding of Climatic Impact on Groundwater Resources in Korea. J. Hydrol. 2024, 634, 131014. [Google Scholar] [CrossRef]
Mishra, V.; Asoka, A.; Vatta, K.; Lall, U. Groundwater Depletion and Associated CO₂ Emissions in India. Earths Future 2018, 6, 1672–1681. [Google Scholar] [CrossRef]
Khara, D.S.; Ghuman, R.S. Efficiency Concerns of Groundwater Irrigation in Green Revolution States of India: Data Envelopment Analysis (DEA) Approach. Water Econ. Policy 2023, 9, 2240004. [Google Scholar] [CrossRef]
Mortensen, L.M. Review: CO₂ Enrichment in Greenhouses. Crop Responses. Sci. Hortic. 1987, 33, 1–25. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of study showing the AI framework for predicting CO₂ absorption and the experimental setup for CO₂ enrichment.

Figure 2. Experimental setup of the enclosed plant growth chamber with CO₂ enrichment and environmental control.

Figure 3. Structure of the ResNet50-based neural network for CO₂ absorption prediction. The red box highlights the bottleneck block architecture (1 × 1→3 × 3→1 × 1 convolutions with 128, 128, and 512 filters), which efficiently extracts hierarchical visual features through channel dimensionality reduction and expansion.

Figure 4. CO₂ absorption and emission in crops over time. (a) Temporal variation in CO₂ absorption across different crop species. (b) CO₂ emission rates of crop species based on experimental measurements. (RSL, Red Skirt Lettuce; RL, Lettuce Red; BC, Bok Choy; GSL, Green Skirt Lettuce; RML, Romaine Lettuce).

Figure 5. CO₂ absorption patterns of different crop species at 1, 6, and 10 days post-transplanting (DPT): (a) green skirt lettuce, (b) romaine lettuce, (c) bok choy, (d) red skirt lettuce, and (e) lettuce red. The data represent the change in absolute CO₂ concentration within the chamber and is not normalized by leaf area.

Figure 6. ImageJ 1.54-based analysis of red pigmentation in red skirt lettuce and lettuce red at different growth stages (Day 1, Day 6, and Day 10). (a) Red skirt lettuce: Red pigmentation increased from 2.8% (Day 1) to 27.1% (Day 6), and further to 38.2% (Day 10), with a slowdown in CO₂ absorption observed on Day 10; (b) Lettuce red: Red pigmentation increased from 8.7% (Day 1) to 35.8% (Day 6), and further to 41.4% (Day 10), with a slowdown in CO₂ absorption observed on Day 10.

Figure 7. Finetuned YOLOv11 image segmentation model (a) green skirt lettuce; (b) romaine lettuce; (c) bok choy; (d) red skirt lettuce; (e) lettuce red.

Figure 8. Feature extraction and edge detection using ResNet-50 for leaf structure analysis. (a) Original image of the plant sample. (b) Low-level feature map showing edge and texture detection through the initial convolutional layer of ResNet 50. (c) High-level feature map representing abstract morphological patterns learned from deeper layers of the network. (d) Binarized contour map demonstrating leaf segmentation based on hierarchical feature extraction.

Figure 9. Training and validation loss curves across 1500 epochs for the batch size 32 model.

Figure 10. Comparison of experimental measurements and deep learning predictions of CO₂ concentration on Day 1. The panels represent different lettuce types: (a) green skirt lettuce; (b) romaine lettuce; (c) bok choy; (d) red skirt lettuce; (e) lettuce red.

Figure 11. Relative changes in estimated yield and leaf conductance at different CO₂ concentrations. The 400 ppm concentration serves as the 0% baseline for comparison.

Figure 12. Effect of CO₂ concentration on crop evapotranspiration.

Figure 13. Comparison of crop growth factors between constant (1200 ppm) and optimized CO₂ injection strategies.

Table 1. Input parameter values and environmental conditions for the Stanghellini Equation.

Symbol	Value	Unit
LAI	4.4
$ρ_{a}$	1.2	kg m⁻³
$C_{a}$	1004	J kg⁻¹ °C⁻¹
VPD	520	Pa
r_e	185	s m⁻¹
I	0.005	m
$λ_{a}$	0.02	W m⁻¹ k⁻¹
${R H}_{m e a n}$	70%	%
γ	6.69	Pa °C⁻¹
$P_{a t m}$	101,325	Pa

Table 2. Parameter values for internal resistance calculation.

Symbol	Value
$C_{1}$	$6.1 \times 10^{- 7}$
$r_{m i n}$ (s m⁻¹)	82

Table 3. Parameter values for the energy equation related to groundwater pumping for evapotranspiration.

Symbol	Value	Unit
$W_{l i f t}$	33.7	m
η	40%	%

Table 4. Comparison of positive effects according to injection methods.

Injection Method	Production	Water Consumption	CO₂ Consumption
Non-injection	0%	0%	0%
1200 ppm Injection	9.27%	−11.18%	100%
Optimized Injection	9.82%	−12.10%	92%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lee, S.; Kim, B.; Cheon, S.-G.; Lee, J.W. Proximal Monitoring of CO₂ Dynamics in Indoor Smart Farming: A Deep Learning and Image-Sensor Fusion Approach. Sustainability 2025, 17, 10838. https://doi.org/10.3390/su172310838

AMA Style

Lee S, Kim B, Cheon S-G, Lee JW. Proximal Monitoring of CO₂ Dynamics in Indoor Smart Farming: A Deep Learning and Image-Sensor Fusion Approach. Sustainability. 2025; 17(23):10838. https://doi.org/10.3390/su172310838

Chicago/Turabian Style

Lee, Seunghun, Bora Kim, Sang-Gyu Cheon, and Jae Won Lee. 2025. "Proximal Monitoring of CO₂ Dynamics in Indoor Smart Farming: A Deep Learning and Image-Sensor Fusion Approach" Sustainability 17, no. 23: 10838. https://doi.org/10.3390/su172310838

APA Style

Lee, S., Kim, B., Cheon, S.-G., & Lee, J. W. (2025). Proximal Monitoring of CO₂ Dynamics in Indoor Smart Farming: A Deep Learning and Image-Sensor Fusion Approach. Sustainability, 17(23), 10838. https://doi.org/10.3390/su172310838

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Proximal Monitoring of CO₂ Dynamics in Indoor Smart Farming: A Deep Learning and Image-Sensor Fusion Approach

Abstract

1. Introduction

2. Materials and Methods