Next Article in Journal
A Comprehensive Analysis of Trapezius Muscle EMG Activity in Relation to Stress and Meditation
Next Article in Special Issue
RETRACTED: Utilizing Generative Adversarial Networks for Acne Dataset Generation in Dermatology
Previous Article in Journal
Design and Modelling of an Induction Heating Coil to Investigate the Thermal Response of Magnetic Nanoparticles for Hyperthermia Applications
Previous Article in Special Issue
Predictive Analysis of Endoscope Demand in Otolaryngology Outpatient Settings
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quantifying Inhaled Concentrations of Particulate Matter, Carbon Dioxide, Nitrogen Dioxide, and Nitric Oxide Using Observed Biometric Responses with Machine Learning

Department of Physics, The University of Texas at Dallas, Richardson, TX 75080, USA
*
Author to whom correspondence should be addressed.
BioMedInformatics 2024, 4(2), 1019-1046; https://doi.org/10.3390/biomedinformatics4020057
Submission received: 9 February 2024 / Revised: 18 March 2024 / Accepted: 25 March 2024 / Published: 3 April 2024
(This article belongs to the Special Issue Feature Papers in Applied Biomedical Data Science)

Abstract

:
Introduction: Air pollution has numerous impacts on human health on a variety of time scales. Pollutants such as particulate matter—PM1 and PM2.5, carbon dioxide (CO2), nitrogen dioxide (NO2), and nitric oxide (NO) are exemplars of the wider human exposome. In this study, we adopted a unique approach by utilizing the responses of human autonomic systems to gauge the abundance of pollutants in inhaled air. Objective: To investigate how the human body autonomically responds to inhaled pollutants in microenvironments, including PM1, PM2.5, CO2, NO2, and NO, on small temporal and spatial scales by making use of biometric observations of the human autonomic response. To test the accuracy in predicting the concentrations of these pollutants using biological measurements of the participants. Methodology: Two experimental approaches having a similar methodology that employs a biometric suite to capture the physiological responses of cyclists were compared, and multiple sensors were used to measure the pollutants in the air surrounding them. Machine learning algorithms were used to estimate the levels of these pollutants and decipher the body’s automatic reactions to them. Results: We observed high precision in predicting PM1, PM2.5, and CO2 using a limited set of biometrics measured from the participants, as indicated with the coefficient of determination (R2) between the estimated and true values of these pollutants of 0.99, 0.96, and 0.98, respectively. Although the predictions for NO2 and NO were reliable at lower concentrations, which was observed qualitatively, the precision varied throughout the data range. Skin temperature, heart rate, and respiration rate were the common physiological responses that were the most influential in predicting the concentration of these pollutants. Conclusion: Biometric measurements can be used to estimate air quality components such as PM1, PM2.5, and CO2 with high degrees of accuracy and can also be used to decipher the effect of these pollutants on the human body using machine learning techniques. The results for NO2 and NO suggest a requirement to improve our models with more comprehensive data collection or advanced machine learning techniques to improve the results for these two pollutants.

1. Introduction

This study employed a novel approach to gauge the levels of pollutants found in inhaled air using autonomic human responses as discerned using a suite of biometric sensors. The environmental and social context has a significant impact on human well-being. The issue of air pollution is of particular concern, as reported by the World Health Organization’s findings that both outdoor and indoor pollution contribute to more than 7 million premature deaths each year [1]. Air pollution can come from various sources, including natural events such as wildfires and volcanic eruptions, as well as human activities such as vehicle emissions, industrial processes, and the operation of coal-fueled power plants.
The air quality standards established by the U.S. Environmental Protection Agency under the Clean Air Act include six pollutants. These include particulate matter (PM), carbon monoxide (CO), ground-level ozone, nitrogen dioxide (NO2), sulfur dioxide (SO2), and lead [2]. Some of the other pollutants include carbon dioxide (CO2) and volatile organic compounds. Particulate matter refers to minuscule solid or liquid particles that are present in the air and are categorized on the basis of their aerodynamic diameter. They include PM1.0, PM2.5, and PM10 with an aerodynamic diameter of less than 1 μ m, 2.5 μ m, and 10 μ m, respectively. With the small size of PM2.5, these particulates can penetrate deeply into the lungs and bloodstream, creating adverse health effects related to the respiratory system [3], increased mortality [4], heart disease [5], inflammatory responses, and adverse birth-related effects [6]. The pollutants we considered are exemplars of the wider human exposome [7,8,9], which refers to the comprehensive accumulation of all environmental exposures that an individual encounters throughout their lifetime, including chemicals and biological agents. The exposome encompasses exposures to both gases and particulates, and appropriate care should be taken to include the often ignored ultrafine particulates [10].
Guidelines on the recommended levels of exposure to pollutants provided by the World Health Organization (WHO) [11] and the Environmental Protection Agency (EPA) [12] contain only two designations: short-term exposures (an average of over 24 h) and long-term exposures (a 1-year average). Brief daily encounters, such as passing a construction site, walking on a busy road, or even working in poorly ventilated indoor spaces, can expose individuals to levels higher than the recommended guidelines. The size of airborne PM has a major influence on how far it can penetrate the lungs, which in turn affects human health. The WHO acknowledges that PM with diameters below 2.5 μ m (PM2.5) has a significant disease burden on human health [11,13], while larger particles, although less likely to reach the alveoli, can still cause health problems by irritating the eyes, nose, and throat [11]. Therefore, research efforts focused on prolonged exposure to poor air quality, including airborne particles of varying sizes, are of particular importance when considering long-term health.
The area of respiratory health receives significant attention due to the high incidence of poor air quality caused by factors such as smoke, vehicle emissions, and dust. Prolonged exposure to these sources, all of which produce PM of varying sizes, can affect long-term health, including physiological, psychological, and neurological functioning. For example, consider the following.
  • Inflammation: Exposure to air pollution can cause inflammation in the brain, which can cause cognitive impairment [14,15].
  • Oxidative stress: Exposure to air pollution can increase oxidative stress, leading to cell damage and cognitive impairment [14,16].
  • Reduced oxygen supply: Air pollution can reduce the amount of oxygen available to the body, which can lead to fatigue, decreased endurance, and impaired cognitive function [17,18,19,20].
  • Increased respiratory effort: Air pollution can increase the effort required to breathe, leading to reduced exercise capacity and decreased performance [18,21,22].
  • Neurotransmitter disruption: Exposure to environmental pollutants such as lead, mercury, and polychlorinated biphenyls (PCBs) can alter neurotransmitter function and cause cognitive problems [23,24].
  • Epigenetic modifications: Exposure to environmental pollutants can lead to changes in DNA methylation and other epigenetic changes, which can contribute to cognitive problems [25,26,27,28].
  • The breakdown of the blood–brain barrier: Exposure to air pollution can disrupt the blood–brain barrier, allowing pollutants to enter the brain and cause neurological damage [16].
  • Neurotoxicity: Exposure to certain environmental pollutants, such as lead, mercury, and polychlorinated biphenyls (PCBs), can be neurotoxic and affect the nervous system [24,29].
CO2 exposure has been associated with cognitive problems [30,31,32] and physiological changes in lung and cardiovascular function [33]. Long-term exposure to NO2, which is a gaseous pollutant, has been associated with cardiovascular disease, lung cancer, and respiratory problems, modifying the severity of asthma [34,35,36,37]. The inhalation of regulated nitric oxide (NO) under controlled conditions and medications that produce nitric oxide have a wide range of therapeutic uses, such as cardiopulmonary conditions [38,39]. On the other hand, NO, when inhaled in excess amounts, can react with oxygen to form NO2 in the lungs, creating lung problems [39,40]. A higher concentration of NO is considered toxic, although limited studies have been performed on the direct effects of NO inhalation.
In this study, we combined data sets obtained from two different experimental paradigms and provided an overview of our previous work in which biometric data from participants were used to estimate and understand the effects of inhaled ambient PM2.5 [41], CO2 [42], and NO2 [43] on the human body using machine learning models and now including PM1 and NO in the study as well. While long-term exposure to air pollution can result in plenty of health-related effects, as mentioned before, short-term exposure to air pollution also has immediate effects on the human body, bringing physiological changes immediately. In this study, we examined the autonomous responses on small temporal (∼2 s) and spatial (∼2 m) scales of the five mentioned pollutants within microenvironments. To comprehensively capture cognitive and physiological changes brought upon by air pollution, we made use of several sensors to measure as many biological measurements as possible, which included skin temperature, respiration rate, blood oxygen saturation (SpO2), heart rate, the galvanic skin response (GSR), the pupil diameter of the left eye, the pupil diameter of the right eye, the distance between the pupils, and the measurement of electrical activity in the brain and heart using electroencephalography (EEG) and electrocardiogram (ECG), respectively.
Since the relations between and among variables are not always linear or functional, we made use of machine learning algorithms to perform regression for nonlinear, non-parametric, multidimensional data. The use of machine learning models has been shown to estimate ambient PM with high degrees of precision, especially PM2.5 [44,45,46]. By simultaneously measuring biological parameters and air quality components, we examined the interaction between the body and the environment while also testing the accuracy of estimating pollutants using machine learning techniques.

2. Materials and Methods

The core methodology in the study of these pollutants from two different experimental paradigms was essentially the same; in it, several biometric data (or biometric variables, predictor variables, or cognitive and physiological changes) of participants were collected simultaneously using a biometric suite when a participant was cycling, while other sensors simultaneously measured the ambient air pollutants (or target variable).

2.1. Experimental Paradigms

Figure 1 shows the two experimental setup scenarios for the simultaneous measurement of biometric variables and ambient air pollutants. Figure 1a shows the scenario in which a participant wore a biometric suite to capture autonomous changes in the body when cycling a static bike indoors, while sensors placed nearby measured ambient PM2.5 and PM1. Similarly, Figure 1b shows the procedure for data collection in which a participant, equipped with the same biometric suite to measure biometric variables, was cycling outdoors with an electric car behind them that was equipped with several different sensors measuring ambient CO2, NO2, NO, PM1, and PM2.5 simultaneously. An electric car was used so that the sensors placed in the trunk of the car did not measure the pollutants produced by the car but only those of the ambient air.
Two of the experimental paradigms in this study shared some similarities and differences as well. Table 1 shows some of the similarities between the two experimental paradigms.
The differences between the two experimental paradigms are shown in Table 2.

2.2. Data Collection

The process of data collection for both of the experimental paradigms involved the simultaneous measurement of biometric data using the same biometric suite and environmental data (or the target variable or pollutant). Several biometric variables were measured, among which the ones that were considered for the study are presented in Table 3.
The EEG data were collected using a Cognionics EEG headset consisting of 64 electrodes following the 10–10 nomenclature system [47] (https://www.cgxsystems.com/mobile-128, accessed on 16 January 2024) with a sampling rate of 500 Hz. Among the rest of the physiological responses (or non-EEG variables) the ECG, GSR, SpO 2 , respiration rate, skin temperature, and heart rate were measured using a Cognionics AIM Generation 2 device (https://www.cgxsystems.com/auxiliary-input-module-gen2, accessed on 16 January 2024) with a sampling rate of 500 Hz. The Tobii Pro Glasses 2 (https://www.tobii.com/products/discontinued/tobii-pro-glasses-2, accessed 16 January 2024) provided several pupillometric measurements at a sampling rate of 100 Hz, but the ones that were considered were the pupil diameter of the left eye, the pupil diameter of the right eye, and the distance between the pupils.
The data obtained from each of the 64 electrodes (or channels) of the EEG headset were received as a time series of voltages. These voltages were measured with respect to a virtual reference that was averaged from all the channels. The voltage time series could be transformed from the time domain to the frequency domain. One of the ways to do so is the Welch method [48], which was implemented using scipy [49]. The transformation thus identified a power spectrum density ( V 2 / Hz ) in the Y-axis and frequency in the X-axis. The frequency could be divided into five frequency bands named delta, theta, alpha, beta, and gamma, each representing a different brain state. With the data obtained from each of the 64 electrodes, transforming each into a frequency domain and dividing each frequency into five frequency bands provided a total of 320 biometric variables from the EEG headset.
From the three measured pupillometric variables, other variables such as the average pupil diameter of the two pupils, the difference between pupil diameters of the left and right eyes, and the absolute value of the difference between the pupil diameters were calculated, providing extra features that were considered.
Before data collection for the study began, in each of the experiments, baseline biometric measurements were made for two minutes with the participants’ eyes closed and their eyes open. The biometric suite was placed on the participants so that it had little effect on their physiological responses.
CO2 measurement was performed using the LI-COR LI-850 device (https://www.licor.com/env/support/LI-850/topics/description.html#Onlineresources, accessed 21 January 2024) with a sampling rate of 0.5 Hz (twice every second). The measurement of NO2 and NO was carried out using the Model 405 nm NO2/NO/ NO x Monitor from 2B technologies (https://2btech.io/items/other-monitors/model-405-nm-no2-no-nox-monitor/, accessed 21 January 2024) with a sampling rate of 0.2 Hz (once every 5 s), and the measurement of PM1 and PM2.5 was carried out using the Fidas Frog device (https://www.palas.de/en/product/fidasfrog, accessed 21 January 2024) with a sampling of 1 Hz (once every 1 s). The measurement of biometric data was stopped when cycling was stopped and collected again when cycling was resumed.
At times, the precision of the data captured via biometric sensors can be compromised due to their movement, resulting in the possibility of no values being recorded. Furthermore, the devices also have different sampling rates. Therefore, the data were cleaned and down-sampled to 1 s for CO2, 5 s for NO2, 5 s for NO, and 1 s for PM 2.5 . The total number of biometric variables used and the number of data records collected for each pollutant are shown in Table 4.
The data collection for CO2, NO2, and NO was carried out on three separate days: Of 26 May, 9 June, and 10 June 2021, accurate data for CO2 readings were collected only on 9 June 2021 and 10 June 2021 with 2 trials on each day. Accurate data for NO2 and NO were received on all 3 days with 2 trials on each day. Data collection for PM1 and PM2.5 took place on October 21 2021, January 14 2022, January 27 2022, and February 9 2022 with different participants on each day.
The data obtained for NO2 and NO from the measuring device were filtered to include only records that passed multiple quality criteria. These quality criteria included (a) the cell flow rate of the sample gas between (1400 to 1600) cc/min, (b) the ozone flow rate between (60 to 80) cc/min, (c) the cell photodiode voltage (PDV) of at least 0.6 Volts, and (d) the PDV ozone generator of at least 0.1 Volts.

2.3. Data Analysis and Developing a Machine Learning Model

After the construction of the four datasets, each consisting of biometric variables as the input features and one output target variable, we sought to estimate the inhaled pollutant concentrations. Each target variable was estimated separately using random forests [50] for multidimensional, nonlinear regression using the ensemble Random Forest Regressor package from scikit-learn (version 1.0.2) [51] in Python (version 3.11.1). All models were trained using 80% of the data, and the remaining 20% were used as an independent test set. The determination coefficient (R2) and the root mean square error (RMSE) were calculated between the true values of the pollutant and the estimated values of the pollutant to quantify the goodness of fit. Scatter plots, quantile–quantile plots, and time series plots of the actual and estimated pollutant values were also plotted for a qualitative analysis of the goodness of fit.
Each of the scatter diagrams was overlaid with a 1:1 black line to indicate how far the prediction was from the true values with data points with an exact prediction lying on the 1:1 line. A quantile–quantile plot for each of the four machine learning models was drawn and overlaid with percentiles to indicate where in the distribution the data points deviated from the actual values with data points that had an exact prediction lying on the red 1:1 line.
To identify the effectiveness of biometric variables in predicting the target variable, the SHAP values (SHapley Additive exPlanations) [52,53] of the SHAP library (version 0.41.0) were used to rank the predictor variables in descending order. The SHAP values for variables below the ninth order were found to be small and, thus, less effective in making the prediction. Since the data were mostly nonlinear, the top 9 of those variables in the predictor ranking were then used, and a 10 × 10 mutual information matrix including the pollutant to be estimated was calculated using a package from scikit-learn [51] to identify the nonlinear relationship between the variables. These mutual information values were greater than zero, with higher values indicating a stronger relationship and zero values indicating that the two variables were independent of each other.

3. Results

Among the several biometric variables that were measured in this study, some of the readings were not easily measurable, and the devices were expensive as well, for example, the EEG and Tobii Pro glasses 2. Other variables, such as skin temperature, SpO 2 , heart rate, respiration rate, GSR, and ECG, could be measured relatively easily and were also inexpensive. Therefore, the study was classified into two parts; first, we considered all the biometric features that were either measured or calculated, and second, we considered biometric variables that could be easily measured and were accessible.

3.1. Using all Features

Table 5 shows the coefficient of determination (R2) and RMSE between the true values of the pollutant and the estimated values of the pollutant in the training set and the testing set for each pollutant using the Ensemble Random Forest Regressor package from scikit-learn and considering all biometric variables with default hyperparameters.
Table 5 shows that the train R2 for all pollutants was nearly 1, and the train RMSE was also low, which was expected since this part of the data set was used in the machine learning model for learning. The independent test R2 for PM1, CO2, and PM2.5 was also almost 1, and the RMSE was also small, indicating that the performance and generalization of the machine learning model in estimating PM1, CO2, and PM2.5 was very good. For pollutants such as NO2 and NO, for which we had far fewer data records, the performance was not as good, with low test R2 values. One possible explanation for the result is that there were not enough training examples, explained detailly in Section 3.1.2 and Section 3.1.3 for NO2 and NO respectively. The R2 values and the RMSE values for all these pollutants can change according to the way the data are shuffled. For PM1, PM2.5, and CO2, these values remained close to each other because there was an abundance of data points over a range of values. However, for NO2 and NO, these values changed to some extent, depending on how the data were shuffled, especially considering the large number of predictor variables for a relatively small data set. When the algorithm was run five times, the average R2 between true and estimated values of NO2 in the training set and the test set was 0.91 and 0.12, respectively, and the average RMSE between the true and estimated values of NO2 was 3.11 ppb and 8.56 ppb in the training set and test set, respectively. Similarly, for NO, when the algorithm was run five times, the average R2 value in the training and testing set was 0.93 and 0.21, respectively, and the average RMSE was 5.23 ppb and 11.62 ppb, respectively.
To test whether the precision of the estimation could be improved further, we optimized the hyperparameters of the random forest algorithm. Two of the hyperparameters were optimized, including (a) n_estimators (the total number of trees), and (b) max_features (the number of features considered at each split). To find the best combination of these two parameters, scikit-Learn’s GridSearchCV was used. A set of integers of these two parameters was provided as an input to the GridSearchCV function, which then ran all the possible combinations with K-fold cross-validation. With the provided set of n_estimators (say m), a set of max_estimators (say n), and K- fold cross-validation, the model was trained a total of m × n × k times, and GridSearchCV returned the best possible combination that had the lowest mean squared error. Since the default value of the n_estimators parameter is 100, we provided n_estimators with a set of values that were either less than 100 or greater than or equal to 100. Also, the default value of max_features=1.0, and it considered features equal to the total number of features when considering the best split. Thus, a set of integers less than the total number of features was provided to the max_features parameter. The number of features considered at each split was then equal to the number in the set of values provided to the max_features parameter.
Table 6 shows the best combination of the n_estimators and the max_features parameter for each of the pollutants. Since the best value of the parameter was between the set of values provided to the GridSearchCV function, other values did not need to be tested.
After the hyperparameters of the random forest algorithm were optimized, the model was then used to test the accuracy of the prediction. Table 7 shows the quantification of the estimation of the pollutants with optimized hyperparameters.
When comparing Table 6 and Table 7, we observe that the results were almost similar when the hyperparameters were optimized. For NO, the results seem to have decreased. Since Pearson’s correlation coefficient is highly susceptible to outliers, and the data are shuffled each time the algorithm is run, this disparity was expected. We ran the algorithm five times for NO2 and NO. The average RMSE between the true and estimated values of NO2 was found to be 3.26 ppb and 6.45 ppb in the training and testing set, respectively, while the R2 was found to be 0.92 and 0.19. Similarly, the average RMSE between the true and estimated values of NO was found to be 5.01 ppb and 12.83 ppb in the training and testing set, respectively, while the R2 was found to be 0.93 and 0.15.

3.1.1. Carbon Dioxide

For the study of CO2, a total of 329 biometric input variables were taken into account, including the 320 variables from the EEG data, and the remaining variables included the following: ECG, respiration rate, SpO2, heart rate, GSR, skin temperature, pupil distance, the average pupil diameter, and the absolute value of the difference between pupil diameters. Figure 2a shows a SHAP value beeswarm plot of the top nine features in descending order to indicate the biometric variables that were the most influential in estimating CO2. Figure 2b shows a mutual information matrix consisting of the nine variables with the nine highest SHAP values and CO2.
These SHAP values for CO2 on the X-axis are expressed in ppm units. As indicated by the SHAP values, the average diameter of the pupil, the GSR, and skin temperature were among the top physiological responses that were the most effective in predicting CO2. The order of these variables can change, depending on how the data are shuffled, especially when the SHAP values are close to each other—for example, for features in order numbers 5, 6, and 7. The plot also indicates that higher values of the average pupil diameter tend to decrease the prediction, while lower values tend to increase the prediction, as large portions of SHAP values for the average pupil diameter are negative and positive, respectively.
In addition to the fact that the diameter of the pupil changes, depending on the light entering it, the diameter of the pupil has been associated with cognitive ability [54]; as mentioned before, CO2 intake is linked to cognitive issues [30,31,32] as well. The GSR sensor measures the response to sweat, and sweating can be caused by physical tasks such as cycling. CO2 inhalation can cause sweating when the concentration is 6 to 10% [55]. Other biometric variables include the respiration rate, heart rate, and skin temperature, and considering that exposure to CO2 can cause physiological changes in lung and cardiovascular function [33], it was expected that these variables would be affected by CO2 intake.
Similarly, the EEG variables included T7, FT10, and AF8 electrodes with frequency band delta, beta, and delta bands, respectively. According to the 10–10 system of nomenclature [47], electrodes with odd numbers are on the left side, and those with even numbers are on the right side. The T7 electrode is above the temporal lobe, which is associated with speech and short-term memory [56]. The FT10 electrode is located between the frontal and temporal lobes. The SHAP value of the AF8 electrode was very low, and therefore, all variables below the order had a smaller SHAP value and provided a small contribution to CO2 prediction.
The mutual information matrix shows that inhaling CO2 has a high nonlinear relationship with the GSR, skin temperature, ECG, respiration rate, and heart rate, indicating the several changes brought about via CO2 intake. Similarly, these biometrics are also mutually related to each other, as GSR has high mutual information with the average pupil diameter, skin temperature, ECG, and heart rate, while skin temperature has high mutual information with ECG, the ECG with the respiration rate, and the respiration rate with the heart rate.
The scatter diagram and the quantile–quantile plot for CO2 are shown in Figure 3. The scatter diagram in Figure 3a shows that not only most of the data points in the training set but also, more importantly, those in the testing set lie very close to the black 1:1 line, indicating that the predictions are close to each other for a large portion of the dataset.
The quantile–quantile plot in Figure 3b also shows that, for most of the distribution, the data points lie close to the red 1:1 line. The quantiles in the distribution deviate for values between 700 and 800 ppm, and one of the possible reasons could be the scarcity of data points in this range of value, which is also depicted in the scatter diagram.

3.1.2. Nitrogen Dioxide

The 329 variables that were considered for the study of NO2 include the 320 EEG variables, ECG, respiration rate, SpO2, heart rate, GSR, skin temperature, average pupil diameter, pupil distance, and difference in pupil diameter.
In the case of NO2, the estimate was not as good, as indicated by the value of R2 and RMSE between the true and estimated values of NO2 in Table 5. However, Figure 4a shows the SHAP value beeswarm plot of the top nine biometric features that were the most influential in estimating NO2. Figure 4b shows the mutual information matrix of the top nine features chosen by SHAP values and NO2.
The SHAP values, in this case on the X-axis, are in units of ppb. The SHAP value of the ECG and skin temperature is relatively higher than the other variables, so these variables do not tend to change order. However, the ordering of the rest of the variables can change, depending on how the data are shuffled, as the SHAP values are close to each other, especially at the lower end of the order. The plot also shows that lower values of skin temperature tend to decrease the prediction, while higher values tend to increase the prediction. As long and short-term exposure to NO2 is associated with cardiovascular disease [57], it is likely that the ECG is one of the main variables. Since the inhalation of a higher concentration of NO2 causes inflammation of the airways, changes in the respiration rate, skin temperature, and sweating are also likely to affect the GSR sensor.
Other variables include EEG ones. The F7 electrode is one of the main EEG variables. The SHAP value of the F7-gamma variable and the following two variables are low compared to the rest of the variables, indicating their low effectiveness in estimating NO2.
The mutual information matrix in Figure 4b shows that NO2 has high mutual information with the ECG, skin temperature, heart rate, and GSR, which was again to be expected, as the SHAP values for these variables were high. The matrix also shows that the ECG has higher mutual information with skin temperature, respiration rate, heart rate, and GSR, while skin temperature has higher mutual information with the respiration rate, heart rate, and GSR, the respiration rate has higher mutual information with the heart rate and GSR, and the heart rate has higher mutual information with the GSR. This is similar to what is seen in the mutual information matrix in Figure 2b, indicating that the variables are mutually related to each other.
The scatter plot and the quantile–quantile plot for NO2 are shown in Figure 5a,b, respectively. The scatter diagram in Figure 5a shows that the lower values of the data points lie close to the black 1:1 line, where there is an abundance of data. The quantile–quantile graph in Figure 5b shows that around 90% of the data are less than 20 ppb, where the quantile–quantile graph is close to the red 1:1 line. As the values of NO2 increase, the number of data points is low; this could possibly have caused the number of data points to deviate from the 1:1 black and red line for higher NO2 values, as there is a very small number of data points from which the machine learning model can learn.

3.1.3. Nitric Oxide

In the NO study, the same 329 variables used in NO2 were considered. Similar to the case of NO2, the estimation of NO using biometrics did not appear to be very accurate, as indicated by R2 and the RMSE values between the true and estimated values of NO in Table 5.
The SHAP value beeswarm plot in Figure 6a shows that physiological responses such as skin temperature, average pupil diameter, and ECG are among the main biometric variables in estimating NO. The unit of the SHAP value on the X-axis here is ppb. The plot also shows that higher skin temperature values tend to lower the prediction, while lower values tend to increase the prediction. Since the inhalation of NO when reacted with oxygen can create NO2, skin temperature and ECG were possibly affected, and they were also common variables in NO2. There appears to be a large number of EEG variables as well. The PO7 electrode is located between the parietal and occipital lobes on the left side of the brain. The gamma band that seems common is dominant in tasks that involve concentration [58]. Other biometric variables, such as Fp2-gamma, Fpz-beta, and those below them, have low SHAP values and, thus, provide less of a contribution to NO estimation.
The mutual information matrix in Figure 6b shows that, among the predictor variables, NO has high mutual information with skin temperature and ECG. Skin temperature and ECG also have high mutual information with each other.
Figure 7 shows the scatter plot and the quantile–quantile plot of the true values of NO compared to the estimated values of NO.
The overall structure of the scatter diagram and the quantile–quantile graph for NO looks similar to that of NO2, with smaller values of NO lying close to the corresponding 1:1 line, where there is an abundance of data points. The quantile–quantile plot in Figure 7b shows that more than 90% of the data are below around 20 ppb. As the values of NO get larger, these data points tend to deviate from the 1:1 line; one of the possible reasons could be the scarcity of data points in the region.

3.1.4. PM1

Since, in the static bike ride experimental setup in which the measurement of PM1 and PM2.5 was performed, the T7 electrode of the EEG headset did not give any readings, the number of biometric variables was reduced to 322. These biometric variables include the 315 variables of the EEG headset, respiration rate, SpO2, heart rate, skin temperature, the average pupil diameter, pupil distance, and the difference in pupil diameter. As shown in Table 5, the PM1 performance was the highest with an R2 value of 0.99 and the lowest RMSE of 0.07 μ g/m3 in the test set.
The SHAP value beeswarm plot in Figure 8a shows that skin temperature, pupil distance, and heart rate were among the main features that were the most influential in estimating PM1. Skin temperature and the distance of the pupil were also important variables in the estimation of PM1 when a single participant was used for the study [46]. The distance of the pupils, which indicates the vergence of the eyes, has been associated with the attention load [59]. A series of EEG variables were amongst the top variables in which the SHAP value of the Cz-delta variable was low, and all the variables below it were even lower, very close to zero. Thus, removing these features from the study would have little effect in estimating PM1.
The mutual information matrix in Figure 8b shows that PM1 has high mutual information with physiological responses such as pupil distance, heart rate, and respiration rate, and it also shows that the physiological responses are indeed mutually related to each other.
The scatter diagram and quantile–quantile graph with true PM1 values on the X-axis and estimated PM1 values on the Y-axis are shown in Figure 9a,b, respectively.
Both the scatter plot and the quantile–quantile plot show that the data points are very close to the 1:1 line of the corresponding graph, indicating that the prediction is the most accurate and precise among all the pollutants.

3.1.5. PM2.5

The biometric variables that were considered in the study of PM2.5 were the same as those of PM1. As shown in Table 5, the estimation of PM2.5 was highly accurate, as indicated by an R2 value between the true and estimated values of PM2.5 in both the training and the test set, which was almost 1. The RMSE is also one of the lowest among all the pollutants.
The SHAP value beeswarm plot in Figure 10a shows that the physiological responses that were the most effective in estimating PM2.5 included skin temperature, pupil distance, average pupil diameter, and heart rate, three of which are common to that of PM1. The SHAP values in the X-axis here are in μ g/m3. The inflammatory response created via the higher concentration of PM2.5 can possibly cause changes in skin temperature and heart rate. Furthermore, PM2.5 also causes adverse health effects on the respiratory system [3], and heart problems [5] could be the reason why the heart rate is one of the most important variables. The size of the pupils has been associated with cognitive ability [54].
Several EEG variables are on the list of the top nine variables. The FT8 electrode is located on the right side of the brain between the frontal and temporal lobes. The CP4-gamma variable has a low SHAP value with variables below the order of even lower SHAP values, indicating that the elimination of these variables will have little effect on the prediction of PM2.5.
The mutual information matrix in Figure 10b shows that there is some sort of non-linear relationship between PM2.5 and skin temperature, pupil distance, and heart rate, which was again to be expected, as the SHAP values for these variables were also high. Just as in the case of other pollutants for which physiological changes were related to each other, such was the case here with skin temperature, pupil distance, and heart rate mutually being related to each other.
Figure 11a,b show the scatter plot and the quantile–quantile plot of the true values of PM2.5 versus the estimated values of PM2.5. The scatter plot and the quantile–quantile plot in Figure 11a and b, respectively, show that most of the data points lie in the 1:1 line of the corresponding graph. This shows that, for most of the data set, the estimate was close to the true PM2.5 values.
A time series graph of the three gaseous pollutants is shown in Figure 12. The background of the time series plot is shaded with different colors, depending on different trials, and the trials have been separated with vertical black lines.
Figure 13 shows the time series plot of the true values of PM1 and PM2.5 overlaid with the estimated values of PM1 and PM2.5, respectively. Figure 12 and Figure 13 show that the true values of the pollutant were close to the estimated values of the pollutant for most of the data set.

3.2. Using Easily Measurable Variables

Now, let us focus on just the subset of biometric variables that can be easily measured using affordable sensors, for example, the respiration rate, SpO2, heart rate, GSR, and skin temperature. All models using the reduced number of input features were trained with the same ensemble random forest regression algorithm from scikit-learn with optimized hyperparameters. Two of the hyperparameters that were optimized are max_features and n_estimators, and the best combination was found using GridSearchCV.
Table 8 shows the set of n_estimators and max_features provided to the GridSearchCV function and the best optimized parameters as well after the model was trained with 3-fold cross-validation.
Table 9 shows the results of R2 and RMSE between the true values and the estimated values of the pollutant with the corresponding number of biometric variables used to estimate the pollutant.
The comparison of Table 9 with Table 5 shows that the test R2 and RMSE for PM1, PM2.5, and CO2 were very close to each other. Similar results can be seen for NO2 and NO as well, for which the test R2 was better, and the test RMSEs were very close to each other. The numbers for NO2 and NO can change to some extent based on how the data are shuffled but with little disparity since the number of dimensions has now been significantly reduced. When the algorithm was run five times, the average R2 between the actual and estimated values of NO2 in the training and the test was 0.92 and 0.30, respectively, while that of NO was 0.95 and 0.55, respectively. Similarly, when the algorithm was run five times, the average RMSE between the true and estimated values of NO in the training and test set was 2.65 ppb and 7.81 ppb, while that of NO was 3.92 ppb and 9.23 ppb, respectively. The mentioned values clearly indicate that the performance when the number of variables was reduced had increased for NO2 and NO.
The biometric variables that have been considered for CO2, NO2, and NO now include the GSR, skin temperature, respiration rate, heart rate, and SpO2, while those for PM1 and PM2.5 include skin temperature, heart rate, respiration rate, and SpO2. A SHAP value beeswarm plot, scatter plot, and quantile–quantile plot of the gaseous pollutants estimated using the reduced number of variables is shown in Figure 14.
The orderings of the variables in the SHAP value were similar to each other for all gaseous pollutants. SpO2 seems to be the lowest among all pollutants, and the elimination of this variable could have a small effect on the results. Since the number of dimensions has now been significantly reduced, the ordering will remain almost similar when the data are shuffled.
The scatter plot of CO2 is similar to that when all variables were considered. As the R2 value has increased for NO2 and NO, the data points in the scatter plot are closer to the 1:1 line. Similarly, the structure of each quantile–quantile plot is similar for all gaseous pollutants when compared to the process in which all variables were considered.
Figure 15 shows the SHAP value beeswarm plot, scatter plot, and quantile–quantile plot when estimating PM1 and PM2.5 using only four biometric variables. The beeswarm plot in Figure 15a,d shows that skin temperature remains the main variable for estimating PM1 and PM2.5 with a very high SHAP value compared to the other variables. The overall structure of the scatter plot and the quantile–quantile plot of PM1 and PM2.5 also remains similar with a large portion of the data set close to the 1:1 black line and the 1:1 red line, respectively.
The time series plot with the reduced number of biometric variables to estimate CO2, NO2, and NO is shown in Figure 16. The time series plot with a reduced number of features to estimate PM1 and PM2.5 is shown in Figure 17.
Figure 16a shows that the difference between the true values and the estimated values of CO2 is now smaller, as the true values and estimated values are much closer to each other compared to the time series when all features were considered. Similarly, the time series plot for NO2 and NO is also similar to that when all variables were considered.
The time series plot of PM1 and PM2.5 in Figure 17 shows that the true values and estimated values are close and similar to those when all features were considered.

4. Discussion

The human body is a sensing system in itself, and it reacts to environmental variables and changes in them such as temperature, humidity, and air quality. It was previously shown that autonomous physiological and cognitive responses that result from the inhalation of particulate matter on a small temporal and spatial scale can be used to estimate PM1 and PM2.5 using machine learning models with very high accuracy [46] in a study that was limited to a single participant. The inclusion of multiple participants in the experimental static bike ride paradigm in which the measurement of PM1 and PM2.5 was performed shows that the methodology that was implemented on a single participant can be extended to multiple participants as well, producing even better results for PM1 and PM2.5 with an R2 value of nearly 1 and a very low RMSE, as shown in Table 9. In fact, the results show that a few biometric variables are good enough to estimate PM1 and PM2.5 with similar results.
The time series plot of PM1 and PM2.5 in Figure 17a,b shows that their true values are very close to the estimated values for the majority of the data set without any significant differences, which explains their smallest RMSE among all pollutants. This supports the conclusion made previously [46] that two of the possible reasons why these estimates are highly accurate and precise could be (a) that these particulate matter are abundant and mix well with the ambient environment, thus having a higher probability of being inhaled by the participant and entering the sensors placed nearby and (b), with the minute size of PM2.5, that these particulates, when inhaled, can reach deep into the lungs and bloodstream, creating many negative health effects [3,5,6], thereby impacting the human body to a large extent.
Air quality components include not only particulate matter but also gaseous pollutants such as CO2, NO2, and NO, which were included in this study. The methodology that was implemented to estimate and understand autonomous responses in the human body can be used for gaseous pollutants such as CO2 as well. The R2 value, which is nearly 1, between the true and estimated values of CO2 in the test set using a small number of biometrics supports this claim, as shown in Table 9. Making the model simpler by considering a small number of biometrics also appears to have reduced the RMSE between the true and estimated values of CO2, which can be seen clearly by comparing the time series in Figure 12a and Figure 16a. Given the several physiological changes brought about by inhaling CO2, such as changes in lung and cardiovascular function [33], cognitive issues [54], sweating [55], and the inflammation of airways, these autonomous responses can indeed be used to predict the concentration of CO2 with high accuracy.
The results of estimating NO2 and NO for the entire range of data were not very accurate, as indicated by the value of R2 and RMSE between the true and estimated values of the corresponding gas shown in Table 9. However, the scatter diagram of these two gases in Figure 14e,h and the quantile–quantile plot of both of these gases in Figure 14f,i indicate that the prediction is reliable to some extent for lower values of the gas when there is a higher concentration of data, as the data points in the plot are close to their corresponding 1:1 line, respectively. As the number of data points decreases for higher values of these two gases, the data points in the scatter plot and the quantile–quantile plot deviate from their corresponding 1:1 line, with one possible reason being the very small number of data points for the machine learning model to learn from in this region of data. Moreover, Pearson’s correlation coefficient (R) is highly susceptible to outliers when few data points deviate from the 1:1 line can largely affect the value of R. This could have possibly reduced the precision when the entire data set was considered for study. This claim is supported by the scatter diagram in Figure 14b and the quantile–quantile plot in Figure 14c of CO2, where the data points deviate from the corresponding 1:1 line between 700 ppm and 800 ppm, one possible reason being the scarcity of data points in that region of data for the machine learning model to learn from and then to be tested on an independent test set. Improvements to the result in future work can possibly be made with either more expansive data collection or better machine learning models that can learn with a limited set of data to better minimize the error and then be tested in an independent test set.
Another possible reason for the results concerning NO and NO2 not being highly accurate could also be that the autonomous responses during the process of data collection were dominated by PM1 particles. As shown in the time series graph in Figure 16b,c, the concentration of NO2 and NO was under 80 ppb and 90 ppb, respectively, with occasional high concentration. The concentration of PM1 particles during these trials was between 0.708 and 7.655 μ g/m3. As mentioned before, since these minute particles, when inhaled, can pass through the nose and reach deep into the lungs and bloodstream, the immediate changes in the body were, thus, most likely dominated by these PM1 particles for which the estimation of PM1 using biometric variables was very high with the R2 between the true and estimated values being 0.91 [46].
The result for all these air quality components shows that a small number of biometric variables used to estimate these pollutants provide similar and, in some cases, better results. In fact, the results are significantly better for NO and NO2. Reducing the number of dimensions in a small data set, thus, seems to be more efficient in predicting the concentration, rather than a large number of input features. This aligns with Occam’s razor principle suggesting that a simpler model usually generalizes well. Moreover, the reduction of the number of variables, that is, reducing the number of dimensions, was a necessity, given the small number of data sets compared to the large number of biometric variables for which data were collected.
There were a few limitations to this study that can possibly be removed in future work. One of them was the collection of data from a single participant for CO2, NO2, and NO. Multiple trials have been conducted to mitigate the issue. Future work can include more extensive data collection from multiple participants to provide further confirmation. However, the data collection in this experimental paradigm was performed on multiple days, with multiple trials under different environmental conditions; the results are, thus, likely to hold in a variety of environmental situations, probably except for situations with extreme weather. Moreover, due to the experimental paradigm involving a static bike ride, in which the study was conducted using multiple participants, and measurements of PM1 and PM2.5 were obtained, the results will also likely hold over a variety of populations. The other limitation of the study involved readings from some of the electrodes in the EEG headset that could be distorted due to activities such as blinking, head movement, swallowing, jaw clenching, neck movement, and tongue movement, which are frequent when a participant is cycling. This results in a lot of noise in the data that can be removed, but these activities are frequent, and the procedure can significantly reduce the number of data records. However, the results show that the removal of EEG data as biometric variables also yields similar results.
The methodology used in this study presents a unique application of machine learning. The use of biological measurements as input features for machine learning models can predict the concentration of air quality components such as PM1, PM2.5, and CO2 with high degrees of accuracy. We can, thus, know the quality of air in microenvironments just by using a small set of biological measurements. Furthermore, with the use of predictor ranking, we can observe which biological parameter is most affected by these air quality components. Since the study was conducted outdoors, the participants were inhaling a mixture of varying pollutants. In order to study the direct effects of these pollutants, participants could be placed in a closed chamber with autonomous responses examined by artificially varying just one of the pollutants. A study can be conducted using just an EEG headset and observing how different areas of the brain can be affected when various components of air quality are inhaled. Future work can also conduct studies concerning other pollutants, such as lead, carbon monoxide, and volatile organic compounds.
Since this study was conducted on different days under different environmental conditions, confounding variables in the experimental setup were expected. For example, the ambient temperature can affect skin temperature and the GSR sensor as well. Future work can measure these environmental variables and identify these variables via causal analysis.

Author Contributions

Methodology, D.J.L., S.T. and T.L.; Software, S.R., S.T. and A.F.; formal analysis, S.R. and D.J.L.; data curation, S.T., D.J.L., L.O.H.W., A.F., T.L., M.L., J.S., A.A., J.W. and P.M.H.D.; writing—original draft preparation, S.R.; writing—review and editing, S.R., D.J.L., J.W. and L.O.H.W.; visualization, S.R.; supervision, D.J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the following grants: the US Army (Dense Urban Environment Dosimetry for Actionable Information and Recording Exposure, U.S. Army Medical Research Acquisition Activity, BAA CDMRP Grant Log #BA170483), EPA 16th Annual P3 Awards Grant Number 83996501, entitled Machine Learning-Calculated Low-Cost Sensing, The Texas National Security Network Excellence Fund Award for Environmental Sensing Security Sentinels, and the SOFWERX Award for Machine Learning for Robotic Team and NSF Award OAC-2115094.

Institutional Review Board Statement

All experimental protocols were approved by the University of Texas at Dallas Institutional Review Board.

Informed Consent Statement

Informed consent was obtained from all the participants.

Data Availability Statement

The codes and data that were used to produce the results are publicly available at: https://github.com/mi3nts/Estimate-Inhaled-PM-and-Gases (accessed on 9 February 2024). The data set for PM1 and NO is also available through Zenodo: https://zenodo.org/records/10639498 (accessed on 9 February 2024).

Acknowledgments

The authors acknowledge the OIT-Cyberinfrastructure Research Computing group at the University of Texas at Dallas and the TRECIS CC* Cyberteam (NSF 2019135) for providing HPC resources that were used in this study: https://utdallas.edu/oit/departments/circ/, (accessed on 9 February 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
PMParticulate matter
EEGElectroencephalogram
ECGElectrocardiogram
GSRGalvanic skin response
SpO2Blood oxygen saturation
RMSERoot mean square error

References

  1. WHO. 7 Million Premature Deaths Annually Linked to Air Pollution. 2024. Available online: http://www.who.int/mediacentre/news/releases/2014/air-pollution/en/ (accessed on 29 August 2016).
  2. U.S. Environmental Protection Agency. Air Quality Management Process. 2023. Available online: https://www.epa.gov/airquality-management-process/managing-air-quality-air-pollutant-types (accessed on 14 January 2024).
  3. Xing, Y.F.; Xu, Y.H.; Shi, M.H.; Lian, Y.X. The impact of PM2.5 on the human respiratory system. J. Thorac. Dis. 2016, 8, E69. [Google Scholar] [PubMed]
  4. Franklin, M.; Koutrakis, P.; Schwartz, J. The role of particle composition on the association between PM2.5 and mortality. Epidemiology 2008, 19, 680. [Google Scholar] [CrossRef] [PubMed]
  5. Thangavel, P.; Park, D.; Lee, Y.C. Recent Insights into Particulate Matter (PM2.5)-Mediated Toxicity in Humans: An Overview. Int. J. Environ. Res. Public Health 2022, 19, 7511. [Google Scholar] [CrossRef] [PubMed]
  6. Kloog, I.; Melly, S.J.; Ridgway, W.L.; Coull, B.A.; Schwartz, J. Using new satellite based exposure methods to study the association between pregnancy PM2.5 exposure, premature birth and birth weight in Massachusetts. Environ. Health 2012, 11, 40. [Google Scholar] [CrossRef]
  7. Li, Z.; Xiong, J. A dynamic inventory database for assessing age-, gender-, and route-specific chronic internal exposure to chemicals in support of human exposome research. J. Environ. Manag. 2023, 339, 117867. [Google Scholar] [CrossRef] [PubMed]
  8. Gu, Y.; Peach, J.T.; Warth, B. Sample preparation strategies for mass spectrometry analysis in human exposome research: Current status and future perspectives. TrAC Trends Anal. Chem. 2023, 166, 117151. [Google Scholar] [CrossRef]
  9. Hartung, T. A call for a Human Exposome Project. Altern. Anim. Exp. 2022, 40 1, 4–33. [Google Scholar] [CrossRef]
  10. Erdely, A.D. 157 Keynote: Understanding Exposure, Hazard Identification, and Human Health Effects: How Ultrafine/Nano Particle Toxicology Influenced Occupational Safety and Health. Ann. Work. Expo. Health 2023, 67, i40. [Google Scholar] [CrossRef]
  11. World Health Organization. Air Quality Guidelines. Guidelines for Particulate Matter (PM2.5 and PM10). 2021. Available online: https://www.who.int/news/item/04-04-2022-billions-of-people-still-breathe-unhealthy-air-new-who-data (accessed on 4 February 2021).
  12. United States Environmental Protection Agency. National Ambient Air Quality Standards (NAAQS) for Particulate Matter. Standards for PM2.5 and PM10. 2023. Available online: https://www.epa.gov/pm-pollution/national-ambient-air-qualitystandards-naaqs-pm (accessed on 4 February 2021).
  13. Uwak, I.; Olson, N.; Fuentes, A.; Moriarty, M.; Pulczinski, J.; Lam, J.; Xu, X.; Taylor, B.D.; Taiwo, S.; Koehler, K.; et al. Application of the navigation guide systematic review methodology to evaluate prenatal exposure to particulate matter air pollution and infant birth weight. Environ. Int. 2021, 148, 106378. [Google Scholar] [CrossRef]
  14. Block, M.L.; Calderon-Garciduenas, L. Air pollution: Mechanisms of neuroinflammation and CNS disease. Trends Neurosci. 2009, 32, 506–516. [Google Scholar] [CrossRef]
  15. Levesque, S.; Surace, M.J.; McDonald, J.; Block, M.L. Air pollution & the brain: Subchronic diesel exhaust exposure causes neuroinflammation and elevates early markers of neurodegenerative disease. J. Neuroinflammation 2011, 8, 105. [Google Scholar] [PubMed]
  16. Calderón-Garcidueñas, L.; Mora-Tiscareño, A.; Ontiveros, E.; Gómez-Garza, G.; Barragán-Mejía, G.; Broadway, J.; Chapman, S.; Valencia-Salazar, G.; Jewells, V.; Maronpot, R.R.; et al. Air pollution, cognitive deficits and brain abnormalities: A pilot study with children and dogs. Brain Cogn. 2008, 68, 117–127. [Google Scholar] [CrossRef] [PubMed]
  17. Goldberg, M.S.; Wheeler, A.J.; Burnett, R.T.; Mayo, N.E.; Valois, M.; Brophy, J.M.; Giannetti, N. Physiological and perceived health effects from daily changes in air pollution and weather among persons with heart failure: A panel study. J. Expo. Sci. Environ. Epidemiol. 2014, 25, 187–199. [Google Scholar] [CrossRef] [PubMed]
  18. Hahad, O.; Kuntic, M.; Frenis, K.; Chowdhury, S.; Lelieveld, J.; Lieb, K.; Daiber, A.; Muenzel, T. Physical Activity in Polluted Air—Net Benefit or Harm to Cardiovascular Health? A Comprehensive Review. Antioxidants 2021, 10, 1787. [Google Scholar] [CrossRef] [PubMed]
  19. Koulova, A.; Frishman, W.H. Air pollution exposure as a risk factor for cardiovascular disease morbidity and mortality. Cardiol. Rev. 2014, 22, 30–36. [Google Scholar] [CrossRef] [PubMed]
  20. Mudway, I.; Kelly, F. Ozone and the lung: A sensitive issue. Mol. Asp. Med. 2000, 21, 1–48. [Google Scholar] [CrossRef] [PubMed]
  21. Giorgini, P.; Rubenfire, M.; Das, R.; Bard, R.L.; Gracik, T.; Wang, L.; Morishita, M.; Jackson, E.A.; Ferri, C.; Brook, R.D. Abstract 13344: Higher Ambient Fine Particulate Matter Air Pollution and Temperature Levels Adversely Impact Cardiopulmonary Exercise Performance Among Patients Beginning Cardiac Rehabilitation. Circulation 2014, 130, A13344. [Google Scholar] [CrossRef]
  22. Chen, H.; Feng, J. PO-002 Relationship between Air Pollution and College Students’ Stamina. Exerc. Biochem. Rev. 2018, 1. [Google Scholar] [CrossRef]
  23. Rossignol, D.A.; Genuis, S.J.; Frye, R.E. Environmental toxicants and autism spectrum disorders: A systematic review. Transl. Psychiatry 2014, 4, e360. [Google Scholar] [CrossRef]
  24. Grandjean, P.; Landrigan, P.J. Developmental neurotoxicity of industrial chemicals. Lancet 2006, 368, 2167–2178. [Google Scholar] [CrossRef]
  25. Baccarelli, A.; Wright, R.O.; Bollati, V.; Tarantini, L.; Litonjua, A.A.; Suh, H.H.; Zanobetti, A.; Sparrow, D.; Vokonas, P.S.; Schwartz, J. Rapid DNA Methylation Changes after Exposure to Traffic Particles. Am. J. Respir. Crit. Care Med. 2009, 179, 572–578. [Google Scholar] [CrossRef] [PubMed]
  26. Del Real, Á.; Santurtún, A.; Zarrabeitia, M.T. Epigenetic related changes on air quality. Environ. Res. 2021, 197, 111155. [Google Scholar] [CrossRef] [PubMed]
  27. Kistler, M. Monitoring of Volatile Organic Compounds in Mouse Breath as a New Tool for Metabolic Phenotyping. Ph.D. Thesis, Technische Universität München, München, Germany, 2016. [Google Scholar]
  28. Wang, J.G.; Chen, S.; Zhu, M.; Miao, C.; Song, Y.; He, H. Particulate Matter and Respiratory Diseases: How Far Have We Gone? J. Pulm. Respir. Med. 2018, 8, 465. [Google Scholar]
  29. Carpenter, D.O. Polychlorinated biphenyls (PCBs): Routes of exposure and effects on human health. Rev. Environ. Health 2006, 21, 1–24. [Google Scholar] [CrossRef]
  30. Hutter, H.P.; Haluza, D.; Piegler, K.; Hohenblum, P.; Fröhlich, M.; Scharf, S.; Uhl, M.; Damberger, B.; Tappler, P.; Kundi, M.; et al. Semivolatile compounds in schools and their influence on cognitive performance of children. Int. J. Occup. Med. Environ. Health 2013, 26, 628–635. [Google Scholar] [CrossRef] [PubMed]
  31. Lowe, R.J.; Huebner, G.M.; Oreszczyn, T. Possible future impacts of elevated levels of atmospheric CO2 on human cognitive performance and on the design and operation of ventilation systems in buildings. Build. Serv. Eng. Res. Technol. 2018, 39, 698–711. [Google Scholar] [CrossRef]
  32. Satish, U.; Mendell, M.J.; Shekhar, K.; Hotchi, T.; Sullivan, D.; Streufert, S.; Fisk, W.J. Is CO2 an indoor pollutant? Direct effects of low-to-moderate CO2 concentrations on human decision-making performance. Environ. Health Perspect. 2012, 120, 1671–1677. [Google Scholar] [CrossRef] [PubMed]
  33. Zhang, X.; Zhang, T.; Luo, G.; Sun, J.; Zhao, C.; Xie, J.; Liu, J.; Zhang, N. Effects of exposure to carbon dioxide and human bioeffluents on sleep quality and physiological responses. Build. Environ. 2023, 238, 110382. [Google Scholar] [CrossRef]
  34. Atkinson, R.W.; Butland, B.K.; Anderson, H.R.; Maynard, R.L. Long-term Concentrations of Nitrogen Dioxide and Mortality: A Meta-analysis of Cohort Studies. Epidemiology 2018, 29, 460–472. [Google Scholar] [CrossRef]
  35. Huang, S.; Li, H.; Wang, M.; Qian, Y.; Steenland, K.; Caudle, W.M.; Liu, Y.; Sarnat, J.; Papatheodorou, S.; Shi, L. Long-term exposure to nitrogen dioxide and mortality: A systematic review and meta-analysis. Sci. Total Environ. 2021, 776, 145968. [Google Scholar] [CrossRef]
  36. Samoli, E. Short-term effects of nitrogen dioxide on mortality: An analysis within the APHEA project. Eur. Respir. J. 2006, 27, 1129–1138. [Google Scholar] [CrossRef]
  37. Breysse, P.N.; Diette, G.B.; Matsui, E.C.; Butz, A.M.; Hansel, N.N.; McCormack, M.C. Indoor air pollution and asthma in children. Proc. Am. Thorac. Soc. 2010, 7, 102–106. [Google Scholar] [CrossRef] [PubMed]
  38. Yu, B.; Ichinose, F.; Bloch, D.B.; Zapol, W.M. Inhaled nitric oxide. Br. J. Pharmacol. 2019, 176, 246–255. [Google Scholar] [CrossRef]
  39. Witek, J.; Lakhkar, A.D. Nitric Oxide. In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2023. [Google Scholar]
  40. Miller, O.; Celermajer, D.; Deanfield, J.; Macrae, D. Guidelines for the safe administration of inhaled nitric oxide. Arch. Dis. Child. Fetal Neonatal Ed. 1994, 70, F47. [Google Scholar] [CrossRef]
  41. Fernando, B.A.; Talebi, S.; Wijeratne, L.O.H.; Waczak, J.; Sooriyaarachchi, V.; Lary, D.; Sadler, J.; Lary, T.; Lary, M.; Aker, A. Gauging Size Resolved Ambient Particulate Matter Concentration Solely Using Biometric Observations: A Machine Learning and Causal Approach; Zenodo: Geneva, Switzerland, 2023. [Google Scholar] [CrossRef]
  42. Ruwali, S.; Fernando, B.A.; Talebi, S.; Wijeratne, L.O.H.; Waczak, J.; Lary, D.; Sadler, J.; Lary, T.; Lary, M.; Aker, A. Gauging Ambient Environmental Carbon Dioxide Concentration Solely Using Biometric Observations: A Machine Learning Approach; Zenodo: Geneva, Switzerland, 2023. [Google Scholar] [CrossRef]
  43. Ruwali, S.; Fernando, B.A.; Talebi, S.; Wijeratne, L.O.H.; Waczak, J.; Lary, D.; Sadler, J.; Lary, T.; Lary, M.; Aker, A. Estimating Inhaled Nitrogen Dioxide from the Human Biometric Response; Zenodo: Geneva, Switzerland, 2023. [Google Scholar] [CrossRef]
  44. Lary, D.J.; Faruque, F.S.; Malakar, N.; Moore, A.; Roscoe, B.; Adams, Z.L.; Eggelston, Y. Estimating the global abundance of ground level presence of particulate matter (PM2.5). Geospat. Health 2014, 8, 611. [Google Scholar] [CrossRef] [PubMed]
  45. Wijeratne, L.O.; Kiv, D.R.; Aker, A.R.; Talebi, S.; Lary, D.J. Using Machine Learning for the Calibration of Airborne Particulate Sensors. Sensors 2020, 20, 99. [Google Scholar] [CrossRef]
  46. Talebi, S.; Lary, D.J.; Wijeratne, L.O.H.; Fernando, B.; Lary, T.; Lary, M.; Sadler, J.; Sridhar, A.; Waczak, J.; Aker, A.; et al. Decoding Physical and Cognitive Impacts of Particulate Matter Concentrations at Ultra-Fine Scales. Sensors 2022, 22, 4240. [Google Scholar] [CrossRef]
  47. Acharya, J.N.; Hani, A.J.; Cheek, J.; Thirumala, P.; Tsuchida, T.N. American clinical neurophysiology society guideline 2: Guidelines for standard electrode position nomenclature. Neurodiagnostic J. 2016, 56, 245–252. [Google Scholar] [CrossRef] [PubMed]
  48. Welch, P. The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms. IEEE Trans. Audio Electroacoust. 1967, 15, 70–73. [Google Scholar] [CrossRef]
  49. Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef] [PubMed]
  50. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  51. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  52. Lundberg, S.M.; Lee, S.I. A Unified Approach to Interpreting Model Predictions. In Advances in Neural Information Processing Systems; Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
  53. Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.I. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2020, 2, 2522–5839. [Google Scholar] [CrossRef] [PubMed]
  54. Tsukahara, J.S.; Harrison, T.L.; Engle, R.W. The relationship between baseline pupil size and intelligence. Cogn. Psychol. 2016, 91, 109–123. [Google Scholar] [CrossRef] [PubMed]
  55. Kulkarni, S.G.; Mehendale, H.M. Carbon Dioxide. In Encyclopedia of Toxicology, 2nd ed.; Wexler, P., Ed.; Elsevier: New York, NY, USA, 2005; pp. 419–420. [Google Scholar] [CrossRef]
  56. Jawabri, K.H.; Sharma, S. Physiology, Cerebral Cortex Functions. In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2023. [Google Scholar]
  57. Faustini, A.; Rapp, R.; Forastiere, F. Nitrogen dioxide and mortality: Review and meta-analysis of long-term studies. Eur. Respir. J. 2014, 44, 744–753. [Google Scholar] [CrossRef]
  58. Abhang, P.A.; Gawali, B.W.; Mehrotra, S.C. Chapter 2—Technological Basics of EEG Recording and Operation of Apparatus. In Introduction to EEG- and Speech-Based Emotion Recognition; Abhang, P.A., Gawali, B.W., Mehrotra, S.C., Eds.; Academic Press: Cambridge, MA, USA, 2016; pp. 19–50. [Google Scholar] [CrossRef]
  59. Sole Puig, M.; Pallarés, J.M.; Perez Zapata, L.; Puigcerver, L.; Cañete, J.; Supèr, H. Attentional selection accompanied by eye vergence as revealed by event-related brain potentials. PLoS ONE 2016, 11, e0167646. [Google Scholar] [CrossRef]
Figure 1. Two of the experimental paradigms for biometric and environmental data collection in which a participant is wearing the same biometric suite for biometric data collection. (a) Each of the participants rode a static bike with sensors placed nearby to measure ambient PM 2.5 and PM1. (b) The participants in the study rode a bicycle followed by an electric car measuring environmental CO2, NO2, and NO among other environmental variables. Source: Figure 4 from [46].
Figure 1. Two of the experimental paradigms for biometric and environmental data collection in which a participant is wearing the same biometric suite for biometric data collection. (a) Each of the participants rode a static bike with sensors placed nearby to measure ambient PM 2.5 and PM1. (b) The participants in the study rode a bicycle followed by an electric car measuring environmental CO2, NO2, and NO among other environmental variables. Source: Figure 4 from [46].
Biomedinformatics 04 00057 g001
Figure 2. (a) A SHAP value beeswarm plot of the top nine features in descending order for estimating the inhaled CO2. (b) A mutual information matrix consisting of the top nine biometric variables that were the most influential in the prediction of CO2 and the target variable, CO2.
Figure 2. (a) A SHAP value beeswarm plot of the top nine features in descending order for estimating the inhaled CO2. (b) A mutual information matrix consisting of the top nine biometric variables that were the most influential in the prediction of CO2 and the target variable, CO2.
Biomedinformatics 04 00057 g002
Figure 3. (a) Scatter diagram of the true values of CO2 against the estimated values of CO2 with a black 1:1 line overlaid. (b) Quantile–quantile plot of the true values of CO2 against the estimated values of CO2 with a red 1:1 line overlaid.
Figure 3. (a) Scatter diagram of the true values of CO2 against the estimated values of CO2 with a black 1:1 line overlaid. (b) Quantile–quantile plot of the true values of CO2 against the estimated values of CO2 with a red 1:1 line overlaid.
Biomedinformatics 04 00057 g003
Figure 4. (a) A SHAP value beeswarm plot of the top nine features in descending order that were useful in estimating NO2. (b) Mutual information matrix consisting of the top nine biometric variables that were the most influential in predicting NO2 and the target variable, NO2.
Figure 4. (a) A SHAP value beeswarm plot of the top nine features in descending order that were useful in estimating NO2. (b) Mutual information matrix consisting of the top nine biometric variables that were the most influential in predicting NO2 and the target variable, NO2.
Biomedinformatics 04 00057 g004
Figure 5. (a) Scatter diagram of the true values of NO2 against the estimated values of NO2 with a black 1:1 line overlaid. (b) Quantile–quantile plot of the true values of NO2 against the estimated values of NO2 with a red 1:1 line overlaid.
Figure 5. (a) Scatter diagram of the true values of NO2 against the estimated values of NO2 with a black 1:1 line overlaid. (b) Quantile–quantile plot of the true values of NO2 against the estimated values of NO2 with a red 1:1 line overlaid.
Biomedinformatics 04 00057 g005
Figure 6. (a) A SHAP value beeswarm plot of the top nine features in descending order that were useful in estimating NO. (b) Mutual information matrix consisting of the top nine biometric variables that were the most influential in predicting NO and the target variable, NO.
Figure 6. (a) A SHAP value beeswarm plot of the top nine features in descending order that were useful in estimating NO. (b) Mutual information matrix consisting of the top nine biometric variables that were the most influential in predicting NO and the target variable, NO.
Biomedinformatics 04 00057 g006
Figure 7. (a) Scatter diagram of the true values of NO against the estimated values of NO with a black 1:1 line overlaid. (b) Quantile–quantile graph of the true values of NO against the estimated values of NO with a red 1:1 line overlaid.
Figure 7. (a) Scatter diagram of the true values of NO against the estimated values of NO with a black 1:1 line overlaid. (b) Quantile–quantile graph of the true values of NO against the estimated values of NO with a red 1:1 line overlaid.
Biomedinformatics 04 00057 g007
Figure 8. (a) A SHAP value beeswarm plot of the top nine features in descending order that are useful in estimating PM1. (b) Mutual information matrix consisting of the top nine biometric variables that were the most influential in predicting PM1 and the target variable, PM1.
Figure 8. (a) A SHAP value beeswarm plot of the top nine features in descending order that are useful in estimating PM1. (b) Mutual information matrix consisting of the top nine biometric variables that were the most influential in predicting PM1 and the target variable, PM1.
Biomedinformatics 04 00057 g008
Figure 9. (a) Scatter diagram of the true values of PM1 against the estimated values of PM1 with a black 1:1 line overlaid. (b) Quantile–quantile plot of the true values of PM1 against the estimated values of PM1 with a red 1:1 line overlaid.
Figure 9. (a) Scatter diagram of the true values of PM1 against the estimated values of PM1 with a black 1:1 line overlaid. (b) Quantile–quantile plot of the true values of PM1 against the estimated values of PM1 with a red 1:1 line overlaid.
Biomedinformatics 04 00057 g009
Figure 10. (a) A SHAP value beeswarm plot of the top nine features in descending order that were useful in estimating PM2.5. (b) Mutual information matrix consisting of the top nine biometric variables that were the most influential in predicting PM2.5 and the target variable, PM2.5.
Figure 10. (a) A SHAP value beeswarm plot of the top nine features in descending order that were useful in estimating PM2.5. (b) Mutual information matrix consisting of the top nine biometric variables that were the most influential in predicting PM2.5 and the target variable, PM2.5.
Biomedinformatics 04 00057 g010
Figure 11. (a) Scatter diagram of the true values of PM2.5 against the estimated values of PM2.5 with a black 1:1 line overlaid. (b) Quantile-quantile plot of the true values of PM2.5 against the estimated values of PM2.5 with a red 1:1 line overlaid.
Figure 11. (a) Scatter diagram of the true values of PM2.5 against the estimated values of PM2.5 with a black 1:1 line overlaid. (b) Quantile-quantile plot of the true values of PM2.5 against the estimated values of PM2.5 with a red 1:1 line overlaid.
Biomedinformatics 04 00057 g011
Figure 12. Time series plot of the true values of gaseous pollutants overlaid with estimated values of the pollutants for (a) CO2, (b) NO2, and (c) NO.
Figure 12. Time series plot of the true values of gaseous pollutants overlaid with estimated values of the pollutants for (a) CO2, (b) NO2, and (c) NO.
Biomedinformatics 04 00057 g012
Figure 13. Time series plot of the true values of the pollutant overlaid with estimated values of the pollutant for (a) PM1 and (b) PM2.5.
Figure 13. Time series plot of the true values of the pollutant overlaid with estimated values of the pollutant for (a) PM1 and (b) PM2.5.
Biomedinformatics 04 00057 g013
Figure 14. Top features and performance graphs using a reduced number of features: (ac) to estimate inhaled CO2, (df) to estimate inhaled NO2, and (gi) to estimate inhaled NO.
Figure 14. Top features and performance graphs using a reduced number of features: (ac) to estimate inhaled CO2, (df) to estimate inhaled NO2, and (gi) to estimate inhaled NO.
Biomedinformatics 04 00057 g014
Figure 15. Top features and performance graphs using a reduced number of features: (ac) to estimate inhaled PM1 and (df) to estimate inhaled PM2.5.
Figure 15. Top features and performance graphs using a reduced number of features: (ac) to estimate inhaled PM1 and (df) to estimate inhaled PM2.5.
Biomedinformatics 04 00057 g015
Figure 16. Time series plot of the true values of pollutants overlaid with the estimated values of pollutants using a reduced number of variables for (a) CO2, (b) NO2, and (c) NO.
Figure 16. Time series plot of the true values of pollutants overlaid with the estimated values of pollutants using a reduced number of variables for (a) CO2, (b) NO2, and (c) NO.
Biomedinformatics 04 00057 g016aBiomedinformatics 04 00057 g016b
Figure 17. Time series plot of the true values of the pollutant overlaid with estimated values of the pollutant using a reduced number of variables for (a) PM1 and (b) PM2.5.
Figure 17. Time series plot of the true values of the pollutant overlaid with estimated values of the pollutant using a reduced number of variables for (a) PM1 and (b) PM2.5.
Biomedinformatics 04 00057 g017aBiomedinformatics 04 00057 g017b
Table 1. Similarities between the two experimental paradigms.
Table 1. Similarities between the two experimental paradigms.
Similarities
Use of the same biometric suite to measure biometric variables.
Pollutants were measured using sensors that were in close proximity to the participant.
Machine learning models were used to estimate the inhaled pollutants and examine the autonomous responses in the human body.
Table 2. Differences between the two experimental paradigms.
Table 2. Differences between the two experimental paradigms.
Bike in MotionStatic Bike Ride
A single participant for data collection.Multiple participants for data collection.
The participants rode a bike on multiple tracks.The participants rode a stationary bike.
The location of the data collection was outdoors in Breckenridge Park in Richardson.The location of the data collection was indoors inside the WSTC building at the University of Texas at Dallas, Richardson.
In this study, the measurement of ambient CO2, NO2, and NO as an environmental variable was considered.The measurement of PM1 and PM2.5 as an environmental variable was considered.
Data collection was carried out in 2021.Data collection took place in 2021 and 2022.
All of the 64 electrodes on the EEG headset were working.The T7 electrode of the EEG headset was not working.
Table 3. List of biometrics measured in both experiments.
Table 3. List of biometrics measured in both experiments.
Biometric VariableUnitsLocation of the Sensor
Electroencephalography (EEG)Volt (V)A headset
Electrocardiography (ECG)Volt (V)Upper part of chest
Galvanic Skin Response (GSR)MicroSiemens ( μ Siemens)Upper back
Oxygen Saturation ( SpO 2 )Percentage (%)Left ear
Respiration rateBreathing rate per minute (brpm)Same device used to measure GSR
Skin temperature°CRight temple
Heart rateBeats per minute (bpm)Same device used to measure SpO 2
Pupil diameter of both eyesMillimeter (mm)Eye tracking glasses
Distance between pupilsMillimeter (mm)The same eye tracking glasses
Table 4. Collection of data for five pollutants.
Table 4. Collection of data for five pollutants.
PollutantTotal Number of BiometricsDays of Data CollectionNumber of TrialsData Records in Each TrialTotal Number of Data Records
CO232924710, 696, 673, 2382317
PM2.532244298, 239, 528, 3181383
PM132244298, 239, 528, 3181383
NO232936136, 23, 126, 120, 132, 45582
NO3293681, 15, 96, 88, 98, 32410
Table 5. Quantification of the estimation of the pollutant using all features with default hyperparameters of the random forest algorithm.
Table 5. Quantification of the estimation of the pollutant using all features with default hyperparameters of the random forest algorithm.
PollutantTrain R2Test R2Train RMSETest RMSENumber of Biometrics Inputs
PM10.990.990.03 μ g/m30.07 μ g/m3322
CO20.990.979.89 ppm21.63 ppm329
PM2.50.990.970.14 μ g/m30.37 μ g/m3322
NO0.960.455.27 ppb11.50 ppb329
NO20.910.122.95 ppb8.93 ppb329
Table 6. Set of hyperparameters provided as inputs to GridSearchCV and the best possible parameter when considering all available features.
Table 6. Set of hyperparameters provided as inputs to GridSearchCV and the best possible parameter when considering all available features.
PollutantSet of n_estimators Set of max_features Folds for Cross-ValidationTotal Number of TrainingOptimized Parameter
PM180, 90, 100, 110, 120250, 275, 300345110, 275
CO280, 90, 100, 110, 120250, 275, 300, 325360110, 300
PM2.5100, 110, 120, 140150, 180, 200, 250, 275360120, 200
NO80, 90, 100, 110, 120250, 275, 300, 325360110, 275
NO280, 90, 100, 110, 120180, 200, 240, 25036090, 200
Table 7. Quantification of the estimation of the pollutants using all features with optimized hyperparameters of the random forest algorithm.
Table 7. Quantification of the estimation of the pollutants using all features with optimized hyperparameters of the random forest algorithm.
PollutantTrain R2Test R2Train RMSETest RMSENumber of Biometrics Inputs
PM10.990.990.03 μ g/m30.07 μ g/m3322
CO20.990.9810.37 ppm21.61 ppm329
PM2.50.990.970.15 μ g/m30.37 μ g/m3322
NO0.960.375.33 ppb9.29 ppb329
NO20.910.112.95 ppb10.08 ppb329
Table 8. Set of hyperparameters provided as inputs to GridSearchCV and the best possible parameter when considering reduced number of features.
Table 8. Set of hyperparameters provided as inputs to GridSearchCV and the best possible parameter when considering reduced number of features.
PollutantSet of n_estimators Set of max_features Folds for Cross-ValidationTotal Number of TrainingOptimized Parameter
PM180, 90, 100, 110, 120, 1502, 3, 4354120, 3
CO280, 90, 100, 110, 120, 1502, 4, 5354100, 4
PM2.580, 90, 100, 110, 120, 150, 1802, 3, 4363150, 3
NO60, 70, 80, 90, 100, 110, 1202, 4, 536390, 4
NO280, 90, 100, 110, 120, 1502, 3, 4354110, 2
Table 9. Quantification of the estimation of the pollutant using reduced number of variables with optimized hyperparameters of the random forest algorithm.
Table 9. Quantification of the estimation of the pollutant using reduced number of variables with optimized hyperparameters of the random forest algorithm.
PollutantTrain R2Test R2Train RMSETest RMSENumber of Biometrics Used
PM10.990.990.03 μ g/m30.07 μ g/m34
CO20.990.987.62 ppm18.55 ppm5
PM2.50.990.960.16 μ g/m30.39 μ g/m34
NO0.980.533.38 ppb12.70 ppb5
NO20.930.242.39 ppb10.38 ppb5
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ruwali, S.; Talebi, S.; Fernando, A.; Wijeratne, L.O.H.; Waczak, J.; Dewage, P.M.H.; Lary, D.J.; Sadler, J.; Lary, T.; Lary, M.; et al. Quantifying Inhaled Concentrations of Particulate Matter, Carbon Dioxide, Nitrogen Dioxide, and Nitric Oxide Using Observed Biometric Responses with Machine Learning. BioMedInformatics 2024, 4, 1019-1046. https://doi.org/10.3390/biomedinformatics4020057

AMA Style

Ruwali S, Talebi S, Fernando A, Wijeratne LOH, Waczak J, Dewage PMH, Lary DJ, Sadler J, Lary T, Lary M, et al. Quantifying Inhaled Concentrations of Particulate Matter, Carbon Dioxide, Nitrogen Dioxide, and Nitric Oxide Using Observed Biometric Responses with Machine Learning. BioMedInformatics. 2024; 4(2):1019-1046. https://doi.org/10.3390/biomedinformatics4020057

Chicago/Turabian Style

Ruwali, Shisir, Shawhin Talebi, Ashen Fernando, Lakitha O. H. Wijeratne, John Waczak, Prabuddha M. H. Dewage, David J. Lary, John Sadler, Tatiana Lary, Matthew Lary, and et al. 2024. "Quantifying Inhaled Concentrations of Particulate Matter, Carbon Dioxide, Nitrogen Dioxide, and Nitric Oxide Using Observed Biometric Responses with Machine Learning" BioMedInformatics 4, no. 2: 1019-1046. https://doi.org/10.3390/biomedinformatics4020057

APA Style

Ruwali, S., Talebi, S., Fernando, A., Wijeratne, L. O. H., Waczak, J., Dewage, P. M. H., Lary, D. J., Sadler, J., Lary, T., Lary, M., & Aker, A. (2024). Quantifying Inhaled Concentrations of Particulate Matter, Carbon Dioxide, Nitrogen Dioxide, and Nitric Oxide Using Observed Biometric Responses with Machine Learning. BioMedInformatics, 4(2), 1019-1046. https://doi.org/10.3390/biomedinformatics4020057

Article Metrics

Back to TopTop