Occupancy Estimation in Academic Laboratory: A CO2-Based Algorithm Incorporating Temporal Features for 1–16 Occupants

Kańtoch, Eliasz; Augustyniak, Piotr

doi:10.3390/electronics14071377

Open AccessFeature PaperArticle

Occupancy Estimation in Academic Laboratory: A CO₂-Based Algorithm Incorporating Temporal Features for 1–16 Occupants

by

Eliasz Kańtoch

and

Piotr Augustyniak

^*

AGH University of Krakow, al. Adama Mickiewicza 30, 30-059 Kraków, Poland

^*

Author to whom correspondence should be addressed.

Electronics 2025, 14(7), 1377; https://doi.org/10.3390/electronics14071377

Submission received: 27 February 2025 / Revised: 21 March 2025 / Accepted: 26 March 2025 / Published: 29 March 2025

(This article belongs to the Special Issue Human-Computer Interactions in E-health)

Download

Browse Figures

Versions Notes

Abstract

:

Private, non-intrusive presence detection methods contribute to various applications, from occupancy monitoring to energy optimization and security. This study presents a deep learning approach for predicting occupancy patterns using CO₂ sensor data and temporal features, derived from a year-long dataset (18 September 2023–21 November 2024) collected via the Smart Indoor Air Quality Monitor. We created a dataset of 19,189 samples of CO₂ levels (0–5000 ppm) with timestamps. A sequential neural network with three fully connected layers was implemented in TensorFlow. The developed model demonstrated the feasibility of predicting occupancy based on CO₂ data and temporal features with an accuracy of 0.97 and an F1-score of 0.92. Model visualization was performed using heatmaps. Its advantages include low computational requirements, cost-effective sensors, an IoT-enabled interface, and scalability. However, the study is limited to a university laboratory with a capacity of 1–16 occupants, which may impact its generalizability to other settings. These findings highlight the utility of CO₂ levels and temporal features for occupancy estimation in laboratory conditions and contribute a unique, long-term multimodal dataset to the research community.

Keywords:

CO₂ sensor; indoor occupancy estimation; deep learning; IoT-enabled monitoring; environmental data; internet of things

1. Introduction

Occupancy detection methods that are non-invasive, seamlessly integrated into existing infrastructures, and privacy-assuring are important for numerous applications, ranging from energy management to enclosed spaces monitoring. These techniques enable identification of human presence in designated spaces, providing the opportunity to optimize energy consumption based on occupancy patterns [1]. Moreover, they support efficient monitoring of laboratory and clinical room usage, ensuring that resources are allocated optimally. Importantly, by analyzing temporal occupancy patterns, such systems can also detect anomalous behavior that may indicate potential security or safety concerns.

In their recent research article, Mena-Martinez et al. [2] identified personal privacy as a major problem in the deployment of direct indoor occupancy estimation methods, including video-based approaches. Motion detection and security alarm systems have traditionally relied on passive infrared (PIR) sensors, which detect the infrared radiation emitted by objects based on the pyroelectric effect [3]. These sensors offer several advantages, including low power consumption, a compact form factor, low cost, and ease of installation. Although PIR sensors are effective at detecting motion, they are limited in scenarios where individuals or objects remain stationary within their field of view, and are unable to estimate the number of occupants [3].

Human respiration produces carbon dioxide as a metabolic byproduct. According to measurements from the NOAA Global Monitoring Laboratory, the global average carbon dioxide concentration was 420.37 ppm in November 2024 [4]. In indoor settings, exhaled CO₂ mixes with the surrounding air, and its concentration depends on the number of occupants, room volume, and ventilation efficiency.

Persily et al. [5] examined human carbon dioxide generation in relation to indoor air quality and building ventilation based on metabolic and physiological principles. They explored methods for estimating CO₂ generation from building occupants and presented an approach leveraging basal metabolic rate and level of physical activity. The authors concluded that future research should focus on validating and quantifying the accuracy of CO₂ generation estimates using activity schedules. Furthermore, they highlighted the need to investigate these estimates as a function of group size.

According to [6], humans inhale air containing 21% O₂ and exhale air containing 16% O₂ and approximately 5% CO₂. Initial adverse effects, including headache, irritability, and confusion, result from increased CO₂ concentrations starting at 1000 ppm and become more dominant with further increases. A concentration of 20,000 ppm can lead to loss of consciousness and asphyxiation [6].

The concept of detecting humans in indoor spaces based on carbon dioxide levels is not new and has been investigated by various researchers through various simulation models and academic prototypes.

Arief-Ang et al. [7] proposed a CO₂-based time-series model to predict indoor human occupancy, achieving 94.4% accuracy in an academic room (up to four people) and 77.8% accuracy in a cinema theater (up to 300 people). The dataset consisted of measurements over 13 days (Scenario 1) and 23 days (Scenario 2). Notably, the authors utilized sensors from the same vendor as those employed in this research.

This work aimed to develop an indoor occupancy estimation model using CO₂ sensor data and temporal features, which traditional thresholding or simple statistical methods might fail to detect effectively. The model was developed using a multi-modal original dataset of environmental parameters including relative humidity, temperature, sound energy and CO₂ collected in a university laboratory over a one-year period. The development of the model adhered to the following criteria: privacy-preserving estimation, ease of installation within existing building and laboratory infrastructure, presence of an Internet of Things interface, low computational complexity, short retraining and deployment cycles for subsequent model versions (for specific time windows), and scalability across an entire department.

To the best of our knowledge, our work is the first to report such a large dataset of recordings, consisting of over one year of measurements. Moreover, other authors have not published datasets to enable replication of their studies. In their 2022 systematic review, ‘Measuring Indoor Occupancy through Environmental Sensors: A Systematic Review on Sensor Deployment’, published in Sensors, Alma Rosa Mena et al. [8] concluded that only 11 studies (11.82%) report the availability of their datasets, with these datasets being publicly accessible for download and experimentation by others. With our approach, we aim to contribute to the scientific community by creating a dedicated webpage for this research, featuring an API to access the proposed model and the corresponding dataset.

The rest of the paper is organized as follows. Section 2 presents related works. Section 3 describes the sensors, the experimental dataset, and the proposed occupancy-estimation algorithm. In Section 4, we describe the experimental setup and results. Discussion is presented in Section 5. Finally, conclusions are given in Section 6.

2. Background Related Works

In a comprehensive systematic review of indoor occupancy measurement using environmental sensors, Mena et al. [8] observed that most studies were conducted in office environments (61.29%). The dominant sensor type was CO₂ (90.32%), though temperature and relative humidity sensors were also used (4.30%). Data fusion techniques were employed in 27.95% of studies. The most common approaches were based on parameter and feature combination. Machine learning algorithms, particularly Support Vector Machines (20.43%), Random Forests (15.05%), and Artificial Neural Networks (12.90%), were most popular. Principal conclusions from that state of the art are that environmental sensor data, including CO₂, paired with advanced data analytics can determine the correlation of environmental features with human presence in specific indoor environments [9,10,11,12]. D. Cali et al. [13] proposed an algorithm for the detection of occupant presence based on the mass balance equation.

Previous studies have focused on the verification of CO₂-based occupancy monitoring in estimating energy consumption, e.g., Yixuan Wei et al. [1] estimated occupancy levels using a blind system identification (BSI) based on CO₂ concentration and fresh-air system electricity consumption to predict air conditioning (AC) system electricity consumption. They showed that knowledge of the occupancy number can improve accuracy in predicting energy consumption using a feed-forward neural network (FFNN). This shows promise in the development of intelligent systems for building energy management.

Developing indoor occupancy models is significantly challenged by the time-consuming and labor-intensive process of collecting recording datasets. In [7], Arief-Ang et al. observed that real world data analysis is superior to simulated models due to many factors.

Measuring CO₂ concentration is a challenging task. Several methods can be employed to address this challenge. Photoacoustic spectroscopy sensors detect pressure waves generated by CO₂ molecules absorbing modulated light. Non-dispersive infrared (NDIR) sensors operate on the principle that CO₂ molecules absorb mid-infrared wavelengths around 4.26 µm [14]. Electrochemical sensors measure the gas concentration by measuring a change in electrical properties (resistance, capacitance, electric potential) induced by absorption of gas. Another method that is both rapid and suitable for real-time monitoring and measurement is laser absorption spectroscopy. This technique employs laser light tuned to the absorption wavelengths of CO₂ to measure its concentration in real time. For laboratory analysis, gas chromatography is commonly used. In chemical adsorption, CO₂ is captured through a chemical reaction with a sorbent, and the amount absorbed is subsequently measured. However, this approach has the drawback of being slower and less suited for real-time applications. Metal oxide sensors detect CO₂ by monitoring changes in the material’s conductivity, which result from the adsorption or desorption of gas on the surface of the metal oxide. This phenomenon was first demonstrated using zinc oxide [15]. However, as Fine et al. [15] observed, a significant challenge with these sensors is their difficulty in reliably measuring CO₂ within specific concentration ranges. Liu et al. [16] highlighted the potential of carbon-based composites for developing CO₂ sensors. In their work, they demonstrated a low-cost and effective chemiresistive CO₂ sensor based on a composite of functionalized carbon nanotubes (f-CNTs) and polyethyleneimine (PEI). Generally, NDIR sensors are most popular for measuring CO₂ concentrations, because they offer a good balance of accuracy, stability, and long-term reliability [14]. In Table 1, we compare and contrast key methods for occupancy detection, highlighting their advantages, limitations, accuracy, and key references.

3. Methods

In this section, we describe the proposed methodology for CO₂-based occupancy estimation, including the description and characterization of the measurement device, data collection procedures, and the proposed occupancy-estimation algorithm.

3.1. Sensors and Experimental Dataset

We chose the Smart Indoor Air Quality Monitor (SIAQM) developed by Netatmo to build our experimental dataset because it provide APIs and developer documentation to access recorded data. This approach has also been used by other researchers. SIAQM is powered by a USB wall adapter and connected to the Internet. SIAQM samples data every 5 min. It can measure key parameters of the indoor environment, including relative humidity (0 to 100%, accuracy ± 3%), temperature (0 °C to 50 °C, accuracy ± 0.3 °C), sound energy (35 dB to 120 dB), and CO₂ (0 to 5000 ppm, accuracy ± 100 ppm to 1000 ppm and ± 10% for measurements greater than 1000 ppm) [19]. We used Netatmo API to write Python scrips that enabled the recording and processing of experimental data, as well as export of the data to a CSV file for further processing and analysis.

We installed the device in a university laboratory that typically holds 1 to 15 participants plus 1 academic teacher. The device was used to collect data from 18 September 2023 to 21 November 2024. We chose to place the sensor 1.2 m above the floor, similar to the works of other researchers (who often placed sensors at a height of about 100 cm [8]) in a well-ventilated network rack that was designed to promote efficient airflow. A photograph of the laboratory, highlighting the sensor location with a red box, is shown in Figure 1.

Finally, we created a dataset containing 19189 datapoints with available environmental data and timestamps. The created dataset and developed model are available on our research group website, with the link available on request for other researchers if they want to reproduce this study or implement and use the developed algorithm in their laboratory [20].

3.2. Occupancy-Estimation Algorithm Using CO₂ Levels and Temporal Features

We proposed an algorithm to predict room occupancy based on carbon dioxide (CO₂) concentrations and temporal features derived from timestamped environmental data from SIAQM. We implemented the algorithm in Python 3.11.5 using TensorFlow.

Our processing and analysis pipeline was divided into the following major steps: data preprocessing, feature engineering and extraction, and deep learning techniques to model occupancy as a binary classification problem. The occupancy-estimation algorithm flowchart is presented in Figure 2.

In the first step, we loaded and preprocessed the dataset from the API to a CSV file to ensure numerical integrity by converting CO₂ values to 32-bit integers. We extracted temporal features from date-time stamps, including fractional hour (Hour), integer hour (Hour_int), and day-of-week indices (DayOfWeek).

We observed that the occupancy of the laboratory is generally periodic, starting from 8 am and ending in the evening on weekdays, while remaining empty on weekends, so we computed the following cyclical features:

SinHour = sin(2π·Hour/24)

(1)

CosHour = cos(2π·Hour/24)

(2)

This encoding would enable the deep learning model to recognize and learn any patterns in occupancy that might be influenced by the day of the week. This helped us to encode time-of-day in a periodic manner. We used one-hot encoding technique to encode the day-of-week value into seven binary columns (Day_0 to Day_6). We used this one-hot encoding technique because it effectively captures the temporal patterns of weekly room occupancy observed in our dataset. This approach allows the model to recognize recurring occupancy trends on specific weekdays. By using one-hot encoding, the model can better distinguish variations in occupancy patterns across different days, improving prediction accuracy. The final feature matrix X combines CO₂ levels with the mentioned above temporal descriptors, yielding a 10-dimensional input per sample. Occupancy labels (y) were manually assigned.

We experimented with different artificial neural networks architectures and configuration parameters. Finally, we proposed a sequential neural network with three fully connected layers: an input layer with 10 neurons (matching feature dimensionality), a hidden layer (ReLU activation), and a single-output layer (sigmoid activation) for binary classification. We used the Adam optimizer and binary cross-entropy loss. The model was trained over 20 epochs using a batch size of 16. The dataset was split chronologically (80% training, 20% testing) to preserve temporal dependencies.

4. Experimental Results

We developed two model variants: a short-term model and a long-term model. The long-term model was trained on the entire collected dataset, while the short-term model used data from the period 18 October 2023 to 18 December 2023. This specific period was chosen due to the high quality of data and the regular weekly occupancy of the laboratory.

The performance of the proposed neural network model for occupancy detection based on CO₂ sensor data was evaluated using a comprehensive set of metrics: Accuracy, Precision, Recall, F1-score, and AUC-ROC [21].

Accuracy = (TP + TN)/(TP + TN + FP + FN)

(3)

Precision = TP/(TP + FP)

(4)

Recall = TP/(TP + FN)

(5)

F1-Score = 2 × (Precision × Recall)/(Precision + Recall)

(6)

where:

True Positives (TP): The model correctly predicts the positive class (room is occupied).

True Negatives (TN): The model correctly predicts the negative class (room is unoccupied).

False Positives (FP): The model incorrectly predicts the positive class (predicts that the room is occupied when it is not).

False Negatives (FN): The model incorrectly predicts the negative class (predicts that the room is unoccupied when it is occupied).

Table 2 presents the classification metrics for the short-term (B) and long-term (A) models. Figure 3, Figure 4 and Figure 5 show occupancy trends based on CO₂ levels (model A). Figure 6, Figure 7 and Figure 8 show heatmaps to compare, contrast, and monitor model performance (model B).

5. Discussion

The experimental results demonstrate the effectiveness of the proposed deep learning model in predicting occupancy based on CO₂ sensor data and temporal features. As presented in Table 1, the classification performance is notably high, with an accuracy of 0.97, an F1-score of 0.92, and an AUC-ROC of 0.99, indicating strong discriminatory power. The precision score of 1.00 suggests the model exhibits no false positives, making it highly reliable for detecting occupied states. However, the recall value of 0.86 implies some missed occupancy detections, which may be attributed to variations in CO₂ levels that do not fully capture the presence of individuals. These results indicate the feasibility of leveraging CO₂ concentration and temporal patterns for occupancy detection while highlighting potential areas for refinement, such as enhancing recall through additional feature engineering or adaptive thresholding techniques.

In Figure 3, mean CO₂ levels exhibit a marked increase during office hours (09:00–18:00), with a gradual decline until 23:00, reflecting the expected work schedule of the laboratory. Figure 4 further confirms this, showing that Tuesdays and Wednesdays are the busiest days, whereas Mondays and Fridays see over 50% lower occupancy. The weekend patterns suggest minimal laboratory usage, as indicated by stable and low CO₂ levels. Figure 5 reveals peak occupancy between 11:00–14:00, with the highest CO₂ concentrations observed at 12:00. The sharp variations at this time likely result from increased people flow and the inertia of CO₂ accumulation from the morning period. Our measurements were not disturbed by the ventilation and air conditioning systems. The lowest recorded CO₂ levels occur at 07:00, reflecting the unoccupied state of the laboratory before work hours.

Figure 6, Figure 7 and Figure 8 provide further validation of the model’s accuracy in predicting occupancy. Figure 6 shows that actual occupancy is concentrated between 09:00 and 18:00, aligning with typical work hours. Figure 7 demonstrates that the predicted occupancy rate closely matches the actual occupancy trends, highlighting the model’s reliability. Figure 8 reveals minor discrepancies, particularly on Wednesday at 14:00 and Thursday at 20:00, though overall deviations remain minimal. These findings reinforce the robustness of the model while suggesting areas for improvement in capturing subtle occupancy variations during specific time slots.

We investigated two model variants: a short-term model and a long-term model. We found that developing a reliable model was not possible with a dataset smaller than two months. In this specific case, the short-term model outperformed the long-term model, primarily due to the regular weekly occupancy of the laboratory. This was shown in Figure 9 and Figure 10. The data also suggest that, in some cases, short-term predictions can be more accurate than long-term predictions due to holidays, vacations, and breaks that disrupt the weekly occupancy cycle. However, in terms of overall results, the short-term model is prone to bias. For example, we observed periods with lower occupancy on Fridays (Figure 11) that disrupt the model’s predictions, which may not hold true in other contexts, such as the summer semester. In such cases, the stability of the long-term model is preferable. Future research could explore hybrid models that integrate the advantages of both short-term and long-term approaches.

A primary limitation of our research is the lack of investigation surrounding the opening of windows and doors, which would significantly influence both the results and model performance. Ventilation typically occurs every 90 min, during 15 min breaks. Academic teachers generally do not open windows during classes due to excessive noise from the street. Our findings revealed that indoor air quality in the laboratory was often suboptimal, particularly after extended class sessions. In addition, the data labeling process was prone to errors. Furthermore, the model does not account for the inertia of CO₂ level measurements, which further affects the outcomes. Although heatmaps generally indicate higher occupancy levels during later hours compared to morning hours, this trend may be influenced by the measurement inertia.

An alternative solution to the presented approach for occupancy estimation is camera-based, as it directly captures the presence of individuals [22,23]. However, significant drawbacks make it less practical in certain scenarios. These include higher energy requirements, the need for precise camera placement, susceptibility to lighting interference, and privacy concerns. Unlike the CO₂-based approach presented in this study, which preserves privacy, camera-based methods are not feasible for privacy-sensitive applications.

Nikolaos Schizas et al. [24], in their survey, explored a Tiny Machine Learning (TinyML) paradigm in the areas of hardware, software, and algorithms designed to process sensor data on ultra-low-power devices. They identified key obstacles and outlined future directions for TinyML research. They provided a comprehensive list of existing TinyML-based toolkits for training in various TinyML application areas. The TinyML concept could be adopted in this work, as the design of our proposed algorithm can be adapted to machine learning-supported processing devices. However, this approach requires adjusting the entire processing pipeline to accommodate the limitations and constraints of such a target device. Our system is designed with a fully automated workflow for scalability: we developed a daemon that continuously monitors the API status and updates the database file automatically. Our proposed server-based architecture supports rapid deployment; new model versions can be deployed on our processing server within minutes, enabling on-the-fly retraining.

Future research could focus on incorporating additional environmental parameters, such as humidity and temperature, to enhance predictive accuracy, as well as deploying the model across diverse settings to assess its generalizability. This approach not only advances occupancy estimation but also contributes to broader applications in building management, security, and energy optimization.

6. Conclusions

In this study, we developed and validated a deep learning-based model for indoor occupancy estimation using CO₂ sensor data and temporal features based on a year-long dataset collected from a university laboratory. The proposed algorithm captured periodic occupancy patterns through cyclical time encodings and one-hot day-of-week representations, integrated with CO₂ measurements, achieving robust performance across a comprehensive set of metrics. The model’s design prioritizes privacy, scalability, and low computational complexity, making it suitable for integration into existing IoT-enabled building infrastructures with minimal retraining overhead.

By making the dataset and model accessible through our research group’s website, this work lays a foundation for further exploration and refinement by the scientific community. Future work may explore the integration of additional environmental variables and more complex temporal modeling to enhance predictive robustness across diverse settings.

Author Contributions

E.K. contributed 90% to the work, while P.A. contributed 10% to the work. Conceptualization, E.K.; methodology, E.K.; software, E.K.; validation, E.K.; formal analysis, E.K.; investigation, E.K. and P.A.; resources, E.K. and P.A; data curation, E.K.; writing—original draft preparation, E.K.; writing—review and editing, E.K. and P.A.; visualization, E.K.; supervision, E.K. and P.A.; project administration, E.K.; funding acquisition, E.K. and P.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by AGH under grant no. 16.16.120.773.

Data Availability Statement

The developed model and the dataset supporting reported results can be found at http://www.wearables.agh.edu.pl/occupancysense/ (accessed on 1 February 2025).

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Wei, Y.; Xia, L.; Pan, S.; Wu, J.; Zhang, X.; Han, M.; Zhang, W.; Xie, J.; Li, Q. Prediction of occupancy level and energy consumption in office building using blind system identification and neural networks. Appl. Energy 2019, 240, 276–294. [Google Scholar] [CrossRef]
Mena-Martinez, A.; Delgado, M.D.; Alvarado-Uribe, J.; Ceballos, H.G. A Real-Life Evaluation of Supervised and Semi-Supervised Machine Learning Approaches for Indirect Estimation of Indoor Occupancy. IEEE Access 2024, 12, 118673–118693. [Google Scholar] [CrossRef]
Wu, L.; Wang, Y.; Liu, H. Occupancy Detection and Localization by Monitoring Nonlinear Energy Flow of a Shuttered Passive Infrared Sensor. IEEE Sens. J. 2018, 18, 8656–8666. [Google Scholar] [CrossRef]
Lan, X.; Tans, P.; Thoning, K.W. Trends in Globally-Averaged CO₂ Determined from NOAA Global Monitoring Laboratory measurements. Version Friday. 13:14:06 MST. Available online: https://gml.noaa.gov/ccgg/trends/global.html?doi=10.15138/9n0h-zh07 (accessed on 7 February 2025).
Persily, A.; de Jonge, L. Carbon dioxide generation rates for building occupants. Indoor Air 2017, 27, 868–879. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Pleil, J.D.; Wallace, M.A.G.; Davis, M.D.; Matty, C.M. The physics of human breathing: Flow, timing, volume, and pressure parameters for normal, on-demand, and ventilator respiration. J. Breath Res. 2021, 15, 042002. [Google Scholar] [CrossRef] [PubMed]
Arief-Ang, I.B.; Salim, F.D.; Hamilton, M. CD-HOC: Indoor Human Occupancy Counting using Carbon Dioxide Sensor Data. arXiv 2017, arXiv:1706.05286. [Google Scholar]
Mena, A.R.; Ceballos, H.G.; Alvarado-Uribe, J. Measuring Indoor Occupancy through Environmental Sensors: A Systematic Review on Sensor Deployment. Sensors 2022, 22, 3770. [Google Scholar] [CrossRef] [PubMed]
Huang, Q.; Syndicus, M.; Frisch, J.; van Treeck, C. Spatial features of CO₂ for occupancy detection in a naturally ventilated school building. Indoor Environ. 2024, 1, 100018. [Google Scholar] [CrossRef]
Blazevic, M.; von Straussenburg, A.F.A.; Riehle, D.M. Sensing the Unseen—Using CO₂ as a Key Indicator for Occupancy Detection in Smart Collaboration Spaces. Procedia Comput. Sci. 2024, 239, 1312–1319. [Google Scholar] [CrossRef]
Liang, X.; Shim, J.; Anderton, O.; Song, D. Low-cost data-driven estimation of indoor occupancy based on carbon dioxide (CO₂) concentration: A multi-scenario case study. J. Build. Eng. 2023, 82, 108180. [Google Scholar] [CrossRef]
Xie, L.; Dai, L.; Saidani, T.; Shutaywi, M.; Innab, N.; Deebani, W.; Wang, L. Intelligent detection of office occupancy using hybrid data-mining. Energy Build. 2024, 322, 114690. [Google Scholar] [CrossRef]
Calì, D.; Matthes, P.; Huchtemann, K.; Streblow, R.; Müller, D. CO₂ based occupancy detection algorithm: Experimental analysis and validation for office and residential buildings. Build. Environ. 2015, 86, 39–49. [Google Scholar] [CrossRef]
Jia, X.; Roels, J.; Baets, R.; Roelkens, G. A Miniaturised, Fully Integrated NDIR CO₂ Sensor On-Chip. Sensors 2021, 21, 5347. [Google Scholar] [CrossRef] [PubMed]
Fine, G.F.; Cavanagh, L.M.; Afonja, A.; Binions, R. Metal Oxide Semi-Conductor Gas Sensors in Environmental Monitoring. Sensors 2010, 10, 5469–5502. [Google Scholar] [CrossRef] [PubMed]
Liu, T.; Baggett, R.; Lang, K.; Padilla, D.J.; Patel, R.J.; Berry, J.; Eldredge, R.L.; Robledo, C.J.; Bowen, W.; Landorf, C.W.; et al. Functionalized carbon nanotubes enabled flexible and scalable CO₂ sensors. Carbon Trends 2023, 12, 100291. [Google Scholar] [CrossRef]
Choi, H.; Um, C.Y.; Kang, K.; Kim, H.; Kim, T. Review of vision-based occupant information sensing systems for occupant-centric control. Build. Environ. 2021, 203, 108064. [Google Scholar] [CrossRef]
Shokrollahi, A.; Persson, J.A.; Malekian, R.; Sarkheyli-Hägele, A.; Karlsson, F. Passive Infrared Sensor-Based Occupancy Monitoring in Smart Buildings: A Review of Methodologies and Machine Learning Approaches. Sensors 2024, 24, 1533. [Google Scholar] [CrossRef] [PubMed]
Netatmo. Available online: https://www.netatmo.com/smart-indoor-air-quality-monitor?srsltid=AfmBOor1VdFLmKhrjt-fJFXz2VVJkRlgyAfEv9A0fCc35Pm5Ahu1nr0Z (accessed on 24 February 2025).
AGH Wearables. Available online: http://www.wearables.agh.edu.pl/occupancysense/ (accessed on 24 February 2025).
Machine Learning Metrics. Available online: https://developers.google.com/machine-learning/crash-course/classification/accuracy-precision-recall?hl=pl (accessed on 24 February 2025).
Cokbas, M.; Pyltsov, V.; Zolkos, J.; Gevelber, M.; Konrad, J. A comparison of occupancy-sensing and energy-saving performance: CO₂ sensors versus fisheye cameras. Energy Build. 2024, 321, 114652. [Google Scholar] [CrossRef]
Wei, S.; Tien, P.W.; Chow, T.W.; Wu, Y.; Calautit, J.K. Deep learning and computer vision based occupancy CO₂ level prediction for demand-controlled ventilation (DCV). J. Build. Eng. 2022, 56, 104715. [Google Scholar] [CrossRef]
Schizas, N.; Karras, A.; Karras, C.; Sioutas, S. TinyML for Ultra-Low Power AI and Large Scale IoT Deployments: A Systematic Review. Futur. Internet 2022, 14, 363. [Google Scholar] [CrossRef]

Figure 1. Experimental laboratory set-up.

Figure 2. Occupancy-estimation algorithm flowchart.

Figure 3. Mean CO₂ levels by day of week and hour (long-term).

Figure 4. CO₂ levels by day of week (long-term).

Figure 5. CO₂ levels by hour of day (long-term).

Figure 6. Actual occupancy rate by day of week and hour (long-term).

Figure 7. Predicted occupancy rate by day of week and hour (long-term).

Figure 8. Difference (predicted vs. actual) in occupancy rate (long-term).

Figure 9. Predicted occupancy rate by day of week and hour (short-term).

Figure 10. Difference (predicted vs. actual) in occupancy rate (short-term).

Figure 11. CO₂ levels by day of week (short-term).

Table 1. Occupancy detection methods.

Method	Description	Advantages	Limitations	Accuracy	Key Reference
Visual data	Use of cameras and image processing software	Fast response time, high accuracy	High computational cost, limited view area, affected by lighting conditions, non private	Median accuracy 95% for presence	Haneul Choi et al. [17]
Environmental sensors	Measurement of CO₂, temperature, humidity, etc.	Low cost, non-invasive, privacy preserving	Slow response time, sensitive to ventilation and air flow systems, low computational cost	Accuracy between 88.7% and 97.1%	Mena et al. [8]
Infrared-based	PIR Sensors	Low cost, non-invasive, popular in security applications	Not suitable for stationary measurements, less accurate in crowded settings, low computational cost	Accuracy between 87.5 and 99.6%	Shokrollahi et al. [18]
Proposed approach	Measurement of CO₂ and temporal features (long term)	Low cost, non-invasive, ease of installation within existing building and laboratory infrastructure, Internet of Things interface, original training dataset, privacy-preserving	Slow response time, sensitive to ventilation and air flow systems, low computational cost	Accuracy 97%, recall 86%	-

Table 2. Classification Metrics.

Metric	Model A (Long-Term)	Model B (Short-Term)
Accuracy	0.97	0.99
Precision	1.00	0.98
Recall	0.86	0.96
F1-Score	0.92	0.97
AUC-ROC	0.99	0.99

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kańtoch, E.; Augustyniak, P. Occupancy Estimation in Academic Laboratory: A CO₂-Based Algorithm Incorporating Temporal Features for 1–16 Occupants. Electronics 2025, 14, 1377. https://doi.org/10.3390/electronics14071377

AMA Style

Kańtoch E, Augustyniak P. Occupancy Estimation in Academic Laboratory: A CO₂-Based Algorithm Incorporating Temporal Features for 1–16 Occupants. Electronics. 2025; 14(7):1377. https://doi.org/10.3390/electronics14071377

Chicago/Turabian Style

Kańtoch, Eliasz, and Piotr Augustyniak. 2025. "Occupancy Estimation in Academic Laboratory: A CO₂-Based Algorithm Incorporating Temporal Features for 1–16 Occupants" Electronics 14, no. 7: 1377. https://doi.org/10.3390/electronics14071377

APA Style

Kańtoch, E., & Augustyniak, P. (2025). Occupancy Estimation in Academic Laboratory: A CO₂-Based Algorithm Incorporating Temporal Features for 1–16 Occupants. Electronics, 14(7), 1377. https://doi.org/10.3390/electronics14071377

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Occupancy Estimation in Academic Laboratory: A CO₂-Based Algorithm Incorporating Temporal Features for 1–16 Occupants

Abstract

1. Introduction

2. Background Related Works

3. Methods

3.1. Sensors and Experimental Dataset

3.2. Occupancy-Estimation Algorithm Using CO₂ Levels and Temporal Features

4. Experimental Results

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Occupancy Estimation in Academic Laboratory: A CO2-Based Algorithm Incorporating Temporal Features for 1–16 Occupants

Abstract

1. Introduction

2. Background Related Works

3. Methods

3.1. Sensors and Experimental Dataset

3.2. Occupancy-Estimation Algorithm Using CO2 Levels and Temporal Features

4. Experimental Results

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Occupancy Estimation in Academic Laboratory: A CO₂-Based Algorithm Incorporating Temporal Features for 1–16 Occupants

3.2. Occupancy-Estimation Algorithm Using CO₂ Levels and Temporal Features