Indoor Air Quality in Cob Buildings: In Situ Studies and Artificial Neural Network Modeling

Abstract: Knowledge of indoor air quality (IAQ) in cob buildings during the first few months following their delivery is of vital importance in preventing occupants' health problems. The present research focuses on evaluating IAQ in cob buildings through a prototype built in Normandy, France. To achieve this, the prototype was equipped with a set of sensors to monitor various parameters that determine indoor and outdoor air quality. These parameters include relative humidity (RH), carbon dioxide (CO2), nitrogen dioxide (NO2), ozone (O3), particulate matter (PM1 and PM10), and volatile organic compounds (VOCs). The experimental results indicate that, overall, there is good indoor air quality in the prototype building. However, there are some noteworthy findings, including high indoor RH and occasional spikes in CO2, PM1, PM10, and VOC concentrations. The high RH is believed to result from the ongoing drying process of the cob walls, while the peaks in pollutants are likely attributable to human presence and the deterioration of the earthen floor. To ensure consistently good air quality, this study recommends the use of a properly sized Controlled Mechanical Ventilation system. Additionally, this study explored IAQ in the cob building from a numerical perspective. A Long Short-Term Memory (LSTM) model was developed and trained to predict pollutant concentrations inside the building. A validation test was conducted on the CO2 concentration data collected on-site, and the results indicated that the LSTM model accurately predicted the evolution of CO2 concentration within the prototype building over an extended period.


Introduction
Indoor air quality (IAQ) is a critical concern for public health, especially in urban areas where people spend the majority of their time indoors [1,2]. Exposure to indoor air pollution can have significant impacts on health and productivity. Several factors can influence IAQ within buildings, including outdoor pollution, ventilation systems, furniture, and human activities. To address this issue, effective methods and tools are needed to evaluate IAQ and ensure the well-being of building occupants [3,4]. Research has highlighted the importance of IAQ standards and certifications. Studies have compared IAQ requirements set by various building standards and found that pollutants such as formaldehyde, benzene, carbon dioxide, particulate matter, and radon are commonly considered in IAQ standards [5]. The need for health-level standards to address epidemic prevention has also been emphasized.
The effectiveness of green buildings in improving IAQ and reducing energy consumption has been studied. Green office buildings with one-to-three-star ratings were found to have superior IAQ compared to ordinary buildings, with lower concentrations of carbon dioxide and particulate matter [6]. However, the concentration of total Volatile Organic Compounds (VOCs) was higher in green buildings. These findings underscore the importance of assessing IAQ in green buildings to evaluate their impact on indoor air quality and energy consumption. IAQ requirements have been reviewed in various green building certifications worldwide, with VOCs, formaldehyde, and carbon dioxide being the most frequently considered indoor air pollutants [7]. Emission source control, ventilation, and indoor air measurement are the primary pathways used in green building schemes for IAQ management.
In the United States, IAQ in Leadership in Energy & Environmental Design (LEED)-certified green buildings is evaluated annually to ensure that indoor air pollutants remain within safe limits [8]. The assessment of such buildings has revealed several advantages in terms of air quality compared to conventional buildings. While the green building concept incorporates measures such as increased fresh air supply and the use of safe materials to address IAQ during the design phase, there is a scarcity of experimental data confirming IAQ improvements during the operational phase.
Regarding earthen architecture, few studies have dealt with the effect of such vernacular techniques on a building's IAQ under living conditions. One noteworthy feature of earthen materials is that they do not contain Volatile Organic Compounds (VOCs), in stark contrast to many other construction materials such as paints, adhesives, and sealants. These VOCs can significantly impact health and pose risks to building occupants throughout their lifetime [9]. Also, indoor air quality is directly linked to indoor relative humidity levels [10]; therefore, the ability of earth to passively regulate moisture levels could be beneficial for the occupants' health. Moreover, clay particles present in these materials can help remove pollutants due to their passive removal properties [11].
Despite the many advantages of cob construction, including hygrothermal comfort and low environmental impact, indoor air quality in this type of building has not been sufficiently studied, especially during the first few months following building delivery. Many studies have focused on other building types, such as concrete or timber structures, leaving the potential benefits or risks of cob construction relatively unknown. The present work attempts to fill this gap by collecting indoor air pollutant concentrations and investigating the IAQ of a cob building during the first few months following its delivery.
Elsewhere, advancements in digital technologies have led to the development of computational methods and tools for assessing IAQ. Computational Fluid Dynamics (CFD) simulations can provide insights into airflow patterns and pollutant dispersion within buildings. Machine learning and deep learning models can predict IAQ parameters and optimize heating, ventilation and air-conditioning (HVAC) strategies for energy management. Artificial Intelligence-based techniques have been discussed for applications in IAQ, including energy forecasting, occupancy comfort prediction, occupancy detection, and fault detection [12]. However, selecting the most appropriate machine learning and deep learning models remains a challenge due to the wide range of algorithms used in building performance studies. Further developments and guidelines are needed to encourage best practices in model selection.
In some cases, sensors have been integrated into buildings to gather IAQ data and control HVAC systems to improve indoor conditions [13]. Artificial neural networks trained on monitored data have been used to regulate ventilation rates through IoT communication protocols.
In response to the pressing need for enhanced IAQ in buildings, several studies have explored strategies and technologies to ensure healthier indoor environments. Li et al. [14] introduced an approach that combines CFD with a Back Propagation Neural Network (BPNN) integrated with a Particle Swarm Optimizer (PSO) algorithm. This method rapidly predicts and optimizes IAQ conditions while requiring only a limited number of CFD simulations. The BPNN-PSO algorithm outperformed existing state-of-the-art IAQ control methods: it reduced indoor air pollutant concentrations by as much as 6.44% while cutting computational costs by up to 23.53%.
The research proposed by Ahtesham Bakht et al. [15] presents a promising approach to predictive IAQ management through the use of deep learning techniques. While conventional deterministic methods for forecasting IAQ often require extensive calculations and specialized domain knowledge, deep learning-based methods have demonstrated excellent performance with significantly reduced computational requirements in recent studies. The authors introduce a hybrid deep learning framework which integrates multiple deep learning models, including a Convolutional Neural Network, Long Short-Term Memory (LSTM), and a Deep Neural Network. This framework is specifically designed to capture temporal patterns and informative features from indoor and outdoor air quality parameters, surpassing the capabilities of standalone deep learning models. The study focuses on forecasting PM10 and PM2.5 concentrations, and the hybrid framework proved effective in predicting future pollutant levels from historical air quality data.
Another notable study is the one reported by B. Lagesse et al. [16], which shed light on the significant gap in monitoring and regulating levels of the particulate matter PM2.5 within large office buildings. To address this issue, the authors designed statistical models using diverse modeling methods, including multiple linear regression, artificial neural networks, and LSTM. These models were trained using environmental and meteorological parameters as predictive factors to estimate PM2.5 concentrations in well-mixed indoor air. The LSTM model emerged as the best performer in terms of root mean square error, and it estimated PM2.5 levels accurately even in the absence of ambient PM2.5 data.
In summary, this research not only addresses a critical gap in IAQ research by investigating cob buildings but also showcases the applicability of advanced deep learning models to predictive IAQ management in such buildings. The insights gained from this study can contribute to a better understanding of the environmental and health implications of cob construction. They can also provide valuable guidance for optimizing IAQ in these buildings during the first months following their delivery, enhancing occupants' comfort, health, and well-being.

Methodology
After reviewing the relevant recent literature, we developed an approach outlining our research procedures. After the last lift of the studied prototype building was raised in the 33rd week of 2021, the indoor and outdoor plasters and the earthen floor were applied, and the carpentry was installed. The CobBauge building was delivered in May 2022. The IAQ stations were then installed, and data were recovered from 15 September 2022 onwards. The conceptual study strategy for this research is divided into four steps, as shown in Figure 1. Each step consists of several phases, all of which are covered in detail in the following sections. A summary of these phases is as follows:
1. Building construction and instrumentation: this phase provides a description of the building composition and the installation of air quality sensors.
2. Data collection: this phase describes the data recovery method.
3. Data preprocessing: for numerical treatment, the data were preprocessed by cleaning them, removing outliers, and aggregating them on an hourly basis.
4. Model selection: we evaluated the performance of the Long Short-Term Memory (LSTM) model. LSTM is a type of recurrent neural network that can capture temporal dependencies in the data.
5. Model estimation: setting all the parameters required for the algorithm's execution constitutes the fifth step before evaluating the model.
6. Model evaluation: we assessed the performance of the models on the test set using various metrics such as the coefficient of determination (R-squared). We also visualized the predictions of the models and compared them with the actual data.
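The preprocessing phase (cleaning, outlier removal, hourly aggregation) can be illustrated with a minimal sketch. The paper does not state which cleaning or outlier rule was applied, so the sensor-range filter, the median-absolute-deviation rule, the threshold `k`, and the function name `preprocess` are all assumptions made for illustration only.

```python
from collections import defaultdict
from datetime import datetime, timedelta
from statistics import mean, median

def preprocess(records, valid_range, k=5.0):
    """Clean 10-min readings and aggregate them to hourly means.

    records: list of (datetime, value or None) pairs.
    valid_range: (low, high) measuring range of the sensor.
    k: hypothetical outlier threshold in scaled MAD units (an assumption).
    """
    low, high = valid_range
    # 1. Cleaning: drop missing readings and values outside the sensor range.
    clean = [(t, v) for t, v in records if v is not None and low <= v <= high]
    # 2. Outlier removal: median-absolute-deviation rule (assumed, not from the paper).
    values = [v for _, v in clean]
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad > 0:
        clean = [(t, v) for t, v in clean if abs(v - med) <= k * 1.4826 * mad]
    # 3. Hourly aggregation: average all readings that fall within the same hour.
    buckets = defaultdict(list)
    for t, v in clean:
        buckets[t.replace(minute=0, second=0, microsecond=0)].append(v)
    return {hour: mean(vals) for hour, vals in sorted(buckets.items())}

# Illustrative (made-up) CO2 readings in ppm, one every 10 minutes.
start = datetime(2022, 9, 15, 0, 0)
readings = [410, 412, None, 415, 9999, 411, 420, 422, 421, 423, 419, 4000]
records = [(start + timedelta(minutes=10 * i), v) for i, v in enumerate(readings)]
hourly = preprocess(records, valid_range=(0, 5000))
```

Note that a purely statistical rule such as this one would also discard genuine occupancy-related peaks, so in practice the threshold would need to be chosen with the physical interpretation of the spikes in mind.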
Buildings 2023, 13, x FOR PEER REVIEW 4 of 19

Prototype Building Description
A cob building prototype was constructed within the territory of the Cotentin and Bessin Marshes Regional Natural Park. This prototype building features an internal floor area of 13 m² and a total surface area of approximately 20 m², with an earthen-based floor (see Figure 2). The construction of the prototype building employs a double-walling method, where cob and light earth naturally adhere to form a single wall. The typical wall thicknesses range from 50 to 70 cm. Specifically, the walls of this prototype are 50 cm thick on the south and west facades, and 70 cm thick on the east and north facades. On the north and east facades, the walls consist of 40 cm of cob (load-bearing layer) and 30 cm of light earth (insulating layer), whereas on the south and west facades they consist of 25 cm of cob and 25 cm of light earth. The light earth layer is continuous on the outside of all walls to ensure better insulation performance. The walls were built in several lifts, each approximately 70 cm in height.

Indoor and Outdoor Air Quality Instrumentation
The CobBauge prototype building, designed for tertiary use, was equipped with instrumentation and subjected to monitoring. In this regard, two NEMo XT air quality stations from Ethera-labs were installed: one inside the prototype and the second one outside, as illustrated in Figure 3. These stations measure various air quality parameters, including carbon dioxide (CO2), nitrogen dioxide (NO2), ozone (O3), volatile organic compounds (VOCs), fine particles (PM1, PM2.5, PM4, PM10), and relative humidity (RH). They also measure temperature and pressure. For carbon dioxide, the detection method is non-dispersive infrared absorption spectroscopy (NDIR), with a measuring range from 0 to 5000 ppm, a resolution of 1 ppm, and an uncertainty of ±30 ppm or ±3% of the measured value. VOCs are detected using an electrochemical method, with a measuring range from 20 to 5 ppm, a resolution of 1 ppb, and an uncertainty of ±20 ppb. The VOCs detected include formaldehyde and gases containing 1 to 4 carbon atoms (aldehydes, alcohols, etc.).
The temperature range covered by the monitoring system is from −55 to 125 °C, with a precision of ±2 °C. Relative humidity can be measured within the range of 5 to 95% with a precision of ±3% (between 11 and 89%) and ±7% outside this range.
For nitrogen dioxide, an electrochemical method is employed, with a measuring range from 1 to 17 ppm and an uncertainty of ±15 ppb. Similarly, ozone is detected using an electrochemical method, with a measuring range from 1 to 7600 ppb and an uncertainty of ±15 ppb.
Data recorded by the various sensors constituting the air quality stations were collected at 10 min intervals between 15 September 2022 and 15 February 2023.

Artificial Neural Network Modelling
The literature frequently employs artificial neural networks (ANN) as data-driven models for analysing building heating efficiency [17]. ANN models are derived from a simplified representation of biological neurons, which are translated into numerical forms. These neurons are expressed as mathematical entities capable of processing information [18]. The typical architecture of ANN models (as depicted in Figure 4) consists of three primary components:
• Synapses, which are connected to weights.
• A "summing junction" processing element that combines the weighted synaptic inputs and adjusts them by incorporating a bias variable ($b_k$).
• An activation function that governs the strength of the signal exiting the neuron.

The neuron k can be described by the following equations:
$$u_k = \sum_{j=1}^{m} w_{kj}\, x_j \tag{1}$$
$$v_k = u_k + b_k \tag{2}$$
$$y_k = \varphi(v_k) \tag{3}$$
where $x_j$ are the m input signals, $w_{kj}$ are the synaptic weights of neuron k, $u_k$ is the linear yield of the original signal, and $v_k$ is the weighted sum of the input signal modified using the bias $b_k$.
The output signal of the neuron k is determined by the activation function $\varphi$. The final output signal ($y_k$), represented by Equation (3), is obtained by applying $\varphi$ to the weighted and bias-corrected inputs. In LSTM networks, the sigmoid function acts as a "gate" that determines which information is allowed into the cell state.
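The neuron model described above can be sketched in a few lines. This is a minimal illustration, assuming a sigmoid activation; the function names and the numerical values are hypothetical and not taken from the study.

```python
import math

def sigmoid(v):
    """Logistic activation, mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))

def neuron_output(inputs, weights, bias, activation=sigmoid):
    """Output y_k of a single neuron k."""
    u_k = sum(w * x for w, x in zip(weights, inputs))  # weighted sum of the inputs
    v_k = u_k + bias                                   # bias-corrected sum
    return activation(v_k)                             # signal leaving the neuron

# Illustrative values: the weighted inputs cancel out, so v_k = 0 and y_k = 0.5.
y = neuron_output([1.0, 2.0], [0.5, -0.25], bias=0.0)
```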
There are several types of neural network models used in indoor air quality prediction and analysis, such as ANN, CNN, and LSTM. The LSTM model is often preferred for time series data due to its ability to retain long-term dependencies in sequential data [19,20], which can be important in predicting air quality as it often involves complex temporal patterns influenced by various factors. The LSTM's architecture (Figure 5a,b), with its memory cells and ability to learn from sequences, can capture longer-term dependencies and nuances in time-series data, making it suitable for capturing the dynamics of air quality changes over time.
The LSTM model incorporates three key gates: the input gate, the forget gate, and the output gate. These gates govern the flow of information through the cell state by determining what information is added, what is forgotten, and what is output at each time step. The LSTM model, as described in reference [14], is defined by the following equations:
$$f_t = \sigma(W_f x_t + U_f h_{t-1} + b_f)$$
$$X_t = \sigma(W_X x_t + U_X h_{t-1} + b_X)$$
$$Y_t = \sigma(W_Y x_t + U_Y h_{t-1} + b_Y)$$
$$\tilde{c}_t = \tanh(W_c x_t + U_c h_{t-1} + b_c)$$
$$c_t = f_t \odot c_{t-1} + X_t \odot \tilde{c}_t$$
$$h_t = Y_t \odot \tanh(c_t)$$
where $x_t$ is the input at time step t, $\sigma$ is the sigmoid function, and $\odot$ denotes element-wise multiplication. The LSTM algorithm generates internal variables such as $f_t$, $X_t$, and $\tilde{c}_t$, which are used to compute the cell state $c_t$ and the hidden state $h_t$ within the hidden layer. These equations must be recalculated for each time step, as the formulae apply to a single cycle: if the time series comprises three time steps, the equations are solved three times. The weight matrices ($W_f$, $W_X$, $W_Y$, $W_c$, $U_f$, $U_X$, $U_Y$, $U_c$) and biases ($b_f$, $b_X$, $b_Y$, $b_c$) used in the model are constant and not time-dependent. Therefore, the same matrices are used to compute results for every time step.
The artificial neural network model is applied in three distinct stages. The first stage involves training the model to minimize the error function by adjusting the weight factors through a statistical comparison between experimental and simulation outputs. Subsequently, the model's results are assessed for their applicability using the determination coefficient (R²) and the Root-Mean-Square Error (RMSE):
$$R^2 = 1 - \frac{\sum_{i=1}^{n} (y_i - \hat{y}_i)^2}{\sum_{i=1}^{n} (y_i - \bar{y})^2}$$
$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2}$$
where $y_i$ represents the observed values of the dependent variable, $\hat{y}_i$ represents the values predicted by the model, n is the number of data points, and $\bar{y}$ is the mean of the observed values.
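For reference, the two evaluation metrics can be computed as in the following sketch, which uses the usual definitions of the coefficient of determination and the root-mean-square error. The function names and the sample values are illustrative assumptions, not data from the study.

```python
import math

def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((y - p) ** 2 for y, p in zip(observed, predicted))
    ss_tot = sum((y - mean_obs) ** 2 for y in observed)
    return 1.0 - ss_res / ss_tot

def rmse(observed, predicted):
    """Root-mean-square error between observed and predicted values."""
    n = len(observed)
    return math.sqrt(sum((y - p) ** 2 for y, p in zip(observed, predicted)) / n)

# Made-up CO2 concentrations (ppm), used only to exercise the functions.
observed = [400.0, 420.0, 440.0, 460.0]
predicted = [410.0, 410.0, 450.0, 450.0]
r2_value = r_squared(observed, predicted)
rmse_value = rmse(observed, predicted)
```

With these sample values every residual has a magnitude of 10 ppm, so the RMSE is 10 ppm and R² is 0.8.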
The LSTM model predicts future outcomes by analyzing historical data sequences. It learns patterns and trends from the input data and produces forecasts for upcoming time periods. The model takes the most recent data points as input and leverages previously acquired patterns to predict the outcome for the subsequent time step. This iterative process can be applied for as many future time steps as necessary. LSTM is widely employed in various applications, including language modeling, time series forecasting, and speech recognition.
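The recurrence described in this section, in which the same weights are reused at every time step, can be sketched with the standard LSTM update in plain Python. The mapping of the symbols f, X, and Y to the forget, input, and output gates is an assumption based on the usual LSTM formulation, and the parameter dictionary layout and function names are hypothetical.

```python
import math

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def affine(W, x, U, h, b):
    """Return W x + U h + b for list-of-lists matrices and list vectors."""
    return [
        sum(w * xi for w, xi in zip(w_row, x))
        + sum(u * hi for u, hi in zip(u_row, h))
        + b_i
        for w_row, u_row, b_i in zip(W, U, b)
    ]

def lstm_step(x_t, h_prev, c_prev, p):
    """One LSTM cycle: returns the new hidden state and cell state."""
    f_t = [sigmoid(v) for v in affine(p["Wf"], x_t, p["Uf"], h_prev, p["bf"])]  # forget gate
    X_t = [sigmoid(v) for v in affine(p["WX"], x_t, p["UX"], h_prev, p["bX"])]  # input gate
    Y_t = [sigmoid(v) for v in affine(p["WY"], x_t, p["UY"], h_prev, p["bY"])]  # output gate
    c_tilde = [math.tanh(v) for v in affine(p["Wc"], x_t, p["Uc"], h_prev, p["bc"])]
    c_t = [f * c + i * g for f, c, i, g in zip(f_t, c_prev, X_t, c_tilde)]
    h_t = [o * math.tanh(c) for o, c in zip(Y_t, c_t)]
    return h_t, c_t

def run_sequence(sequence, params, hidden_size):
    """Apply lstm_step once per time step, reusing the same weights each time."""
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    for x_t in sequence:
        h, c = lstm_step(x_t, h, c, params)
    return h, c

# Toy parameters (one input, one hidden unit): zero weights, candidate bias of 1.
zero = [[0.0]]
params = {"Wf": zero, "WX": zero, "WY": zero, "Wc": zero,
          "Uf": zero, "UX": zero, "UY": zero, "Uc": zero,
          "bf": [0.0], "bX": [0.0], "bY": [0.0], "bc": [1.0]}
h_final, c_final = run_sequence([[0.4], [0.5], [0.6]], params, hidden_size=1)
```

With three time steps, `lstm_step` is evaluated three times. In this toy setting all gates equal 0.5, so the cell state after three steps is 0.875·tanh(1).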
Before being used as input features, the lagged time series data underwent preprocessing, including data normalization. The purpose of normalization was to bring the input features to a common scale, facilitating the model's training process. The data were then partitioned into a training set (80%) and a testing set (20%). This division ensured that the model was trained on one subset of the data and tested on another, independent subset to evaluate its performance.
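The preparation of the model inputs can be illustrated as follows. The paper does not specify the normalization method, the number of lags, or whether the split was chronological, so min-max scaling fitted on the training portion, two lags, and a chronological 80/20 split are assumptions of this sketch, as are the function names.

```python
def min_max_scale(values, low, high):
    """Scale values so that low maps to 0 and high maps to 1."""
    return [(v - low) / (high - low) for v in values]

def make_lagged_samples(series, n_lags):
    """Each input holds n_lags consecutive values; the target is the next value."""
    n_samples = len(series) - n_lags
    inputs = [series[i:i + n_lags] for i in range(n_samples)]
    targets = [series[i + n_lags] for i in range(n_samples)]
    return inputs, targets

def chronological_split(inputs, targets, train_fraction=0.8):
    """Keep the earliest samples for training and the latest for testing."""
    cut = int(len(inputs) * train_fraction)
    return (inputs[:cut], targets[:cut]), (inputs[cut:], targets[cut:])

# Made-up hourly CO2 values (ppm), for illustration only.
series = [400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510]
# Fit the scaling bounds on the first 80% of the raw series to avoid using test data.
fit_part = series[:int(len(series) * 0.8)]
scaled = min_max_scale(series, min(fit_part), max(fit_part))
inputs, targets = make_lagged_samples(scaled, n_lags=2)
(train_x, train_y), (test_x, test_y) = chronological_split(inputs, targets)
```

Fitting the scaling bounds on the training portion only is a common precaution against information leaking from the test set; test values may then fall slightly outside the [0, 1] interval.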

Experimental Results
The experimental results reveal that the outdoor air temperature fluctuates between approximately −1 and 24 °C, with pressure ranging from about 980 to 1050 mbar. Relative humidity varies between approximately 60 and 95%. Under these external conditions, the indoor air temperature of the building remains stable at around 20 °C even before the heating system is activated (started on 14 October). Figure 6 illustrates that the indoor temperature consistently remains within the comfort level throughout the entire study period. This suggests that this prototype house is not prone to significant overheating risks, thanks to the behavior of its structural building materials during heatwaves and its location.
In terms of indoor relative humidity, Figure 6a indicates that RH oscillates around 80% (with occasional peaks) until November 30th. After this date, it decreases notably and fluctuates between 35 and 60%. This decrease can be attributed to the activation of a dehumidifier.
An indoor relative humidity approaching 80% is relatively high and suggests that the moisture balance is not fully achieved. This is likely due to the prototype cob walls not being completely dry. Such conditions may create a potential breeding ground for mold and bacteria if not addressed properly. It is generally accepted that the optimal range for indoor RH falls between 30 and 60% [21,22]. In this study, this range was achieved by installing a dehumidifier starting from 15 November. However, in real occupied buildings, an efficient ventilation system must also be designed to ensure adequate indoor RH [23].
Figure 6 also indicates that indoor pressure follows a similar trend to the outdoor pressure. This is noteworthy, as excessively low indoor pressure can potentially lead to the infiltration of outdoor pollutants into the indoor environment, while excessively high indoor pressure may cause the backdraft of combustion products and other indoor pollutants [24].

• CO2
In the outdoor environment, the levels of carbon dioxide vary between approximately 400 and 5000 ppm (5000 ppm being the detection limit of the air quality station), as shown in Figure 7. These CO2 levels tend to be higher during the night and in the morning. It is worth noting that, in addition to photosynthesis, the increase in humidity during these periods can affect the measuring station. Therefore, absolute CO2 values that appear significantly different from those observed in less humid conditions should be interpreted with caution. As the day progresses, photosynthesis occurs, and CO2 levels tend to approach approximately 400 ppm.
Indoor carbon dioxide levels primarily range between 400 and 500 ppm throughout the testing period, with occasional spikes primarily linked to human presence within the building. These temporary spikes rapidly subside once the occupancy ceases. For instance, on October 14th, a spike reaching 3300 ppm is observed due to the presence of four individuals in the prototype building throughout the morning. To prevent such spikes, an efficient ventilation system tailored to occupancy patterns should be designed and implemented [23]. Elevated indoor CO2 levels can lead to drowsiness, headaches, and other health issues.
Additionally, the measured CO2 levels are generally lower indoors across the entire considered period. In conclusion, Figure 7 indicates that indoor CO2 concentration remains mostly below the standard limit of 1000 ppm recommended by European guidelines [25]. A description of the indoor CO2 concentration is given in Table 1.
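A descriptive summary of this kind, together with the share of readings that respect the 1000 ppm limit, could be obtained as in the following sketch. The function name, the set of statistics, and the sample values are illustrative assumptions; they do not reproduce Table 1 or the measured data.

```python
from statistics import mean, median

def summarize_co2(values_ppm, limit_ppm=1000.0):
    """Basic descriptive statistics and share of readings at or below the limit."""
    below = sum(1 for v in values_ppm if v <= limit_ppm)
    return {
        "min": min(values_ppm),
        "median": median(values_ppm),
        "mean": mean(values_ppm),
        "max": max(values_ppm),
        "share_below_limit": below / len(values_ppm),
    }

# Made-up hourly indoor CO2 values (ppm) containing one occupancy-like spike.
sample = [420.0, 450.0, 480.0, 3300.0, 900.0, 430.0, 440.0, 460.0]
summary = summarize_co2(sample)
```

The median is far less sensitive than the mean to short occupancy spikes, which makes it a useful complement when describing background indoor levels.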

• VOCs
Construction products that contain organic materials, such as cob construction with a high content of vegetable fibers, can emit volatile organic compounds. The presence of these compounds in a building often results in odors that can be irritating. The perception of unpleasant odors tends to increase as temperatures rise due to solar radiation or during warmer months, especially when building ventilation is limited [26]. Apart from product emissions, several other factors, including relative humidity, air exchange rates, and temperature, can influence the levels of VOCs inside buildings [27]. VOCs are a common indoor air pollutant that can have adverse health effects [28], making their control critically important.
In the present study, outdoor VOC levels ranged from approximately 5 to 50 ppb but were primarily between 10 and 20 ppb. A spike was observed during the initial hours of measurement, which resulted from the transportation of the air quality stations in a car from the office to the experimentation site. This spike rapidly declined once the stations were installed on-site on 15 September.
In the indoor environment, VOC concentrations remained close to zero until October 14th, when four individuals were active in the prototype building for half a day. After that date, VOC levels remained quite low until December 8th, as shown in Figure 8. Following that date, a noticeable increase in VOC levels can be observed, likely attributable to the reduction in indoor relative humidity, as depicted in Figure 6. The maximum concentration of VOCs is reached several dozen days later, showing a delayed response to the decrease in indoor humidity. Moreover, the higher VOC concentration persists at an elevated level [28].
Subsequently, the level of volatile organic compounds fluctuated between approximately 0 and 30 ppb, approaching the range of outdoor values. The low levels of VOCs observed during the experiment suggest that there may be no significant indoor sources of VOCs, or that any potential sources are effectively controlled or removed by natural ventilation [29].

• Particulate matter
The levels of particulate matter (PM1 and PM10) vary between approximately 0 and 10 µg/m³, with occasional spikes. These spikes in particulate matter concentration may be attributed to local sources of pollution in the vicinity of the monitoring site. For instance, the spike observed on 29 November 2022 could be linked to nearby activities such as agricultural operations, increased traffic on neighboring roads, or industrial activities, especially considering the presence of a materials preparation plant nearby. Two significant peaks observed indoors, on 16 October 2022 and 24 January 2023, are associated with human presence. These peaks resulted from the movement of individuals on the earthen floor, which was already damaged, causing the release of substantial quantities of fine particles.
In general, the concentration of particulate matter tends to be lower indoors than outdoors. However, the trends are similar, although with a slight time delay, as indicated in Figures 9 and 10.
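The time delay between outdoor and indoor particulate trends can be estimated by a lagged correlation. The sketch below is only an illustration under stated assumptions: both series are sampled at the same regular interval, the values are synthetic, and the helper names `pearson` and `best_lag` are hypothetical rather than part of the study's processing chain.

```python
from statistics import mean
from typing import Dict, List, Tuple


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient of two equally long series."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den if den else 0.0


def best_lag(
    outdoor: List[float], indoor: List[float], max_lag: int
) -> Tuple[int, Dict[int, float]]:
    """Return the lag (in samples) at which the indoor series correlates
    best with the earlier outdoor series, plus all lag scores."""
    scores = {}
    for lag in range(max_lag + 1):
        x = outdoor[: len(outdoor) - lag]   # outdoor values at time t
        y = indoor[lag:]                    # indoor values at time t + lag
        scores[lag] = pearson(x, y)
    return max(scores, key=scores.get), scores


# Synthetic example: indoor follows outdoor, halved and delayed by 2 samples.
outdoor_pm = [2.0, 3.0, 8.0, 4.0, 2.0, 3.0, 9.0, 5.0, 2.0, 2.0, 3.0, 2.0]
indoor_pm = [1.0, 1.0] + [0.5 * v for v in outdoor_pm[:-2]]

lag, scores = best_lag(outdoor_pm, indoor_pm, max_lag=4)
print(lag, round(scores[lag], 3))
```

Multiplying the returned lag by the sampling interval gives the delay in time units. Indoor peaks caused by occupancy would need to be excluded first, since they are unrelated to the outdoor signal.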

• NO2 and O3
Nitrogen dioxide is a chemical compound known to have adverse health effects, particularly on individuals with pre-existing respiratory conditions [30].
In the current study, the outdoor NO2 level ranged between approximately 10 and 30 ppb, with occasional excursions spanning approximately 0 to 55 ppb. Ozone levels in the outdoor environment primarily fluctuated between approximately 0 and 40 ppb.
Indoors, the NO2 level varied between 0 and 20 ppb but mainly oscillated between 5 and 15 ppb from 15 September to 30 November. After 30 November, NO2 levels became negligible, except for a brief period between 5 and 8 February, which coincided with the switch-off of the heating system and the presence of a dehumidifier, as illustrated in Figure 11.
In contrast, indoor O3 concentrations were negligible until 30 November, except for occasional peaks associated with human presence in the building. Human presence typically involves the opening of doors and a decrease in relative humidity. After 30 November, indoor ozone content varied between approximately 0 and 10 ppb, as shown in Figure 12.
These figures highlight that indoor levels of both NO2 and O3 are generally lower than those observed outdoors. Furthermore, the operation of the dehumidifier had a noticeable impact on indoor air quality, affecting the levels of NO2 and O3. Importantly, these indoor levels fall within the recommended guidelines set by various health organizations. Depending on ventilation rates, mean indoor O3 content can range from 0 to 60 ppb. The accepted European levels for NO2 and O3 over a 1 h period are equal to 100 ppb [31][32][33].
The experimental results indicate that indoor air quality is marked by elevated relative humidity, possibly attributable to the continuous release of moisture from the cob walls into the indoor environment, even 13 months after the last lift was implemented [34]. Additionally, the levels of carbon dioxide, volatile organic compounds, particulate matter, nitrogen dioxide, and ozone appear to be lower indoors than outdoors, although the impact of the damaged earthen floor and of the dehumidifier's operation has been noted. Moreover, occasional spikes in these parameters, resulting from occupancy and indoor environmental variations, have been observed. However, for buildings operating in conditions different from those considered here, the absolute RH and pollutant values reported in the present study should be treated with care. In such cases, attention should be paid to the composition and preparation of the mixtures, the implementation method and conditions, the weather conditions, and the use scenarios. These findings underscore the importance of monitoring and managing indoor air quality in cob buildings to ensure occupant comfort and health.

Numerical Results
In Figure 13a, we compare the in situ CO2 measurements obtained from the air quality station with the CO2 evolution predicted by the LSTM model. Additionally, a regression with an R2 value of 0.984 is calculated and a Pearson correlation map is included (see Figure 14). An RMSE value of 15.94 is also calculated. The high R2 value indicates that the model explains a significant proportion of the variance in the data. Figure 13a demonstrates the validation of the LSTM model, which accurately detected the peak in CO2 concentration that occurred on 14 October. Figure 13b displays the predicted CO2 data beyond the testing period. For validation, the LSTM took a sequence of past CO2 concentration data as input and used it to learn patterns and trends. Once trained, the model generated predictions for future CO2 concentrations by incorporating the most recent CO2 concentration data point and applying the previously learned patterns to predict the next time step. This process could be repeated for as many future time steps as needed.
To predict upcoming CO2 concentrations beyond the testing period, the LSTM employed the same approach as in the validation process. It considered the most recent CO2 concentration data point and used the previously learned patterns to predict future CO2 concentrations for as many time steps as desired. This allowed for reliable predictions of future CO2 concentrations based on historical sequence data.
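For readers who wish to reproduce this kind of validation on their own data, the two reported metrics can be computed as in the minimal sketch below. It assumes that R2 denotes the coefficient of determination between measured and predicted values; the text does not state which definition was used, so this is an assumption. The sample values are hypothetical.

```python
import math
from typing import List


def rmse(observed: List[float], predicted: List[float]) -> float:
    """Root mean square error between measurements and predictions."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def r_squared(observed: List[float], predicted: List[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Hypothetical measured and predicted CO2 values (ppm).
measured = [400.0, 450.0, 500.0, 550.0]
modelled = [410.0, 440.0, 510.0, 540.0]

error = rmse(measured, modelled)
score = r_squared(measured, modelled)
print(error, score)
```

The RMSE is expressed in the same unit as the input series, which makes it directly comparable with the measured concentration range.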
According to the LSTM predictions, CO2 concentrations inside the building are expected to fluctuate between 510 and 530 ppm in the future, consistently remaining within a favorable range. Furthermore, the ability of the LSTM to forecast future CO2 concentrations from historical data can be valuable for building managers and facility operators seeking to optimize indoor air quality, ventilation, and energy efficiency.
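The recursive forecasting procedure described in this section, in which each prediction is fed back as the most recent data point, can be sketched independently of the network itself. In the example below, the one-step predictor is a deliberately simple stand-in (the mean of the input window), not the trained LSTM. The names `recursive_forecast` and `mean_of_window`, the window length, and the input values are all hypothetical.

```python
from collections import deque
from typing import Callable, List, Sequence


def recursive_forecast(
    history: Sequence[float],
    one_step_model: Callable[[List[float]], float],
    window: int,
    horizon: int,
) -> List[float]:
    """Predict `horizon` future values by repeatedly applying a one-step
    model and feeding each prediction back into the input window."""
    buffer = deque(history[-window:], maxlen=window)
    forecast = []
    for _ in range(horizon):
        next_value = one_step_model(list(buffer))
        forecast.append(next_value)
        buffer.append(next_value)   # oldest value is dropped automatically
    return forecast


def mean_of_window(window_values: List[float]) -> float:
    """Stand-in one-step predictor; a trained model would replace this."""
    return sum(window_values) / len(window_values)


# Hypothetical recent CO2 readings (ppm).
history = [500.0, 520.0, 510.0, 530.0]
forecast = recursive_forecast(history, mean_of_window, window=2, horizon=3)
print(forecast)
```

Because every step consumes earlier predictions, errors can accumulate as the horizon grows, so long-range forecasts produced this way are best checked against new measurements when they become available.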

Conclusions
The present study has provided valuable insights into the indoor air quality (IAQ) of a cob prototype building through a combination of experimental measurements and numerical analysis. This research addresses a gap in the existing body of knowledge regarding the IAQ of cob buildings and contributes to a better understanding of the key pollutants that impact occupants' health. This research was particularly focused on the IAQ in a cob building during the first few months following its delivery. The in situ measurements conducted during this study revealed several important findings regarding the indoor ambience:

• Air temperature was maintained within the human comfort zone.
• Relative humidity (RH) levels were relatively high, approaching 80%, which may be attributed to the characteristics of cob construction. The cob walls are presumed to be still wet, their practical water content not yet having been reached [35]. This may call for the use of a drying fan over a longer period. Alternatively, prefabricating cob before its implementation makes sense.
• Carbon dioxide (CO2) levels remained between 400 and 500 ppm throughout the testing period, with occasional spikes associated with human presence that quickly dissipated. The indoor CO2 concentration generally fell below the standard limit of 1000 ppm recommended by European guidelines.
• Low levels of volatile organic compounds (VOCs) were observed indoors, suggesting the absence of significant indoor sources of VOCs and the potential effectiveness of natural ventilation.
• Particulate matter (PM1 and PM10): human presence and walking on a damaged earthen floor raised significant quantities of fine particles. Otherwise, indoor concentrations followed trends similar to outdoor levels, with a slight time delay.
• Nitrogen dioxide (NO2) and ozone (O3) levels were generally lower than those measured outdoors. The operation of the dehumidifier had an impact on indoor NO2 and O3 levels, but these remained within the recommended guidelines (100 ppb).
Overall, the majority of the studied parameters suggest that cob buildings can provide good IAQ even during the first few months following their delivery. However, challenges such as fluctuating indoor RH and occasional pollutant peaks need to be addressed to mitigate potential risks to human health. To achieve consistently good IAQ, an appropriate mechanical ventilation system may be necessary. In addition, the eventual degradation of earthen floors and its impact on IAQ should be considered when designing such buildings. The use of the Long Short-Term Memory model for predicting pollutant concentrations may be a valuable tool for building managers and facility operators aiming to optimize IAQ.
Further research could explore IAQ in a completely dry cob building under use. Other studies could also consider other soils and vegetal straws, as well as other regions and climates.

Figure 1. Conceptual study plan. Indoor (a) and outdoor ambiences (b) with the air quality stations (red squares).

Figure 2. Global view of the CobBauge prototype building.

Figure 4. Simplified representation of an artificial neural network model.
and biases (b_f, b_X, b_Y, b_c) utilized in the model are constant and not time-dependent. Therefore, the same matrices are used to compute results for multiple time steps.
Buildings 2023, 13, x FOR PEER REVIEW 7 of 19

Figure 5. Structure of Recurrent Neural Network with hidden neurons (left) and an example of a long term memory cell's usual architecture (right).

Figure 6. Indoor (a) and outdoor (b) temperature, pressure, and relative humidity of the cob prototype building.

Figure 7. CobBauge prototype building indoor and outdoor carbon dioxide content.

Figure 8. CobBauge prototype building indoor and outdoor volatile organic compounds.

Figure 11. CobBauge prototype building indoor and outdoor nitrogen dioxide level.

Figure 12. CobBauge prototype building indoor and outdoor ozone level.