Next Article in Journal
Spatiotemporal Variability and Driving Factors of Vegetation Net Primary Productivity in the Yellow River Basin (Shaanxi Section) from 2000 to 2022
Previous Article in Journal
Wind-Induced Water Transport and Circulation Structure in the Laptev Sea–East Siberian Sea
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Advancing Air Quality Monitoring: Deep Learning-Based CNN-RNN Hybrid Model for PM2.5 Forecasting

1
Computer Engineering Department, Munzur University, Tunceli 62000, Turkey
2
Faculty of Education and Liberal Arts, INTI International University, Nilai 71800, Negeri Sembilan, Malaysia
3
Computer Programming, Siverek Vocational School, Harran University, Sanlıurfa 63200, Turkey
4
Medical Documentation and Secretary Training, Siverek Vocational School, Harran University, Sanlıurfa 63200, Turkey
5
Department of Finance and Banking, Recep Tayyip Erdogan University, Fener Street, Rize 53100, Turkey
6
Accounting and Tax Applications Program, Siverek Vocational School, Harran University, Sanlıurfa 63300, Turkey
*
Authors to whom correspondence should be addressed.
Atmosphere 2025, 16(9), 1003; https://doi.org/10.3390/atmos16091003
Submission received: 10 July 2025 / Revised: 16 August 2025 / Accepted: 19 August 2025 / Published: 24 August 2025
(This article belongs to the Section Air Quality)

Abstract

Particulate matter, particularly PM2.5, poses a significant threat to public health due to its ability to disperse widely and its detrimental impact on the respiratory and circulatory systems upon inhalation. Consequently, it is imperative to maintain regular monitoring and assessment of particulate matter levels to anticipate air pollution events and promptly mitigate their adverse effects. However, predicting air quality is inherently complex, given the multitude of variables that influence it. Deep learning models, renowned for their ability to capture nonlinear relationships, offer a promising approach to address this challenge, with hybrid architectures demonstrating enhanced performance. This study aims to develop and evaluate a hybrid model integrating Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) for forecasting PM2.5 levels in India, Milan, and Frankfurt. A comparative analysis with established deep learning and machine learning techniques substantiates the superior predictive capabilities of the proposed CNN-RNN model. The findings underscore its potential as an effective tool for air quality prediction, with implications for informed decision-making and proactive intervention strategies to safeguard public health.

1. Introduction

Air pollution is defined as the presence of elevated concentrations of particles or pollutants in the atmosphere that pose a threat to the health of living organisms or cause physical harm [1]. Specifically, it can be defined as the presence of toxic substances or compounds in the air that exceeds the acceptable threshold set by health organizations. Particulate matter (hereafter referred to as PM), a combination of solid and liquid particles suspended in the air, is a major contributor to air pollution. PM is measured in micrometers and classified as PM10 for particles between 2.5 and 10 μm and PM2.5 for particles smaller than 2.5 μm [2]. Human-caused air pollution is primarily caused by fuel consumption, power generation, and the by-products of natural gas, particularly emissions from coal-fired power plants and chemical production [3]. Other sources of PM include vehicle fuel, construction sites, roadworks, factories, and quarries [4].
Enduring exposure to air pollution has been shown to have harmful effects on cardiovascular health [5], particularly increasing the likelihood of heart failure in older adults or those with pre-existing heart disease [6]. At-risk individuals are advised to minimize their time spent in areas with elevated air pollution levels, such as industrial zones and areas with heavy traffic. During periods of high air pollution, it is recommended to prioritize indoor activities over outdoor activities [7]. To exercise outdoors safely, choose areas like forests, away from traffic. It is important to address air quality issues for the world’s sustainability by taking action when air quality levels are high [8]. Artificial intelligence is a valuable technology that can help solve air pollution problems and is used in various fields such as health, autonomous vehicles, robotics, e-commerce, and security [9,10].
In the literature, various statistical, deterministic, and traditional machine learning methods have been employed for the precise forecasting of air pollutants [11,12,13]. Predicting air quality presents challenges due to its intricate spatial and temporal dimensions. Deep learning models have demonstrated superior performance in addressing complex and nonlinear problems compared to other methods [14]. Consequently, the application of deep learning models in air quality prediction research has increased significantly. Hybrid iterations of deep learning models have also been designed to improve the effectiveness of these models.
By integrating the practical components of various models into a unified framework, enhanced predictive outcomes were achieved. The present study employed a hybrid model that integrates the strengths of Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) models to forecast PM2.5 levels in India, Milan, and Frankfurt. To the best of our knowledge, this study represents one of the first efforts to use this comprehensive model for PM2.5 forecasting. Although several studies have explored CNN-LSTM architectures for air quality forecasting, our study introduces a novel CNN-RNN hybrid model that differs in architectural simplicity and computational efficiency. Moreover, we apply this model to three real-world datasets from geographically and climatically diverse regions. Unlike prior studies, we provide a comprehensive performance comparison with a wide range of models, highlighting the superior accuracy and generalizability of our approach. To evaluate the effectiveness of this model, we used a variety of deep learning and machine learning models. The CNN-RNN model was compared to machine learning techniques, including k-Nearest Neighbors (kNN), Random Forest (RF), Support Vector Machine (SVM), Decision Trees (DT), and Linear Regression (LR), as well as deep learning methods such as CNN, RNN, Long-Short Term Memory (LSTM), and Multilayer Perceptron (MLP). The results show that the performance of the GA-LSTM model is superior to that of the aforementioned methods.
While various deep learning models have been applied to PM2.5 prediction in the literature, this study offers a novel and practical approach by proposing a lightweight CNN-RNN hybrid architecture. Unlike LSTM-based hybrid models, the use of standard RNN cells significantly reduces computational overhead while maintaining high accuracy. Furthermore, the application and evaluation of this model on three geographically diverse datasets demonstrate its adaptability and effectiveness. The inclusion of high-frequency sensor data from Frankfurt also provides new insights into short-term forecasting under dynamic conditions.
The main contributions of this research paper can be outlined as follows:
(i)
This study is the first to integrate and comparatively analyze PM2.5 datasets from India, Milan, and Frankfurt using a deep learning-based hybrid model. These datasets vary in temporal resolution, environmental characteristics, and geographic location, offering a unique opportunity to assess the model’s robustness and adaptability across diverse real-world conditions.
(ii)
In this paper, a novel hybrid deep learning model incorporating CNNs and RNNs is introduced.
(iii)
Through comparative analysis with various machine learning algorithms, the effectiveness of the model including, kNN, RF, SVM, DT, LR, MLP, and deep learning networks such as CNNs, RNNs, and LSTM estimated.
(iv)
In addition, a comprehensive pollution dataset from three different regions of India, Milan, and Frankfurt was adopted.

Recommendations for Managers

Based on the insights provided, here are some recommendations for managers:
(i)
Managers should consider incorporating advanced predictive models, such as the CNN-RNN hybrid deep learning model proposed in this study, into their air quality management systems. This will enable them to accurately forecast PM2.5 concentrations and anticipate air pollution events, facilitating proactive decision-making and resource allocation.
(ii)
Managers should prioritize investment in technologies that help reduce air pollution and mitigate its negative impacts. This could involve adopting alternative production technologies, promoting cleaner energy sources, and implementing emission reduction measures to address the root causes of PM2.5 pollution.
(iii)
Collaboration with researchers, governmental agencies, and other stakeholders is essential for effective air quality management. Managers should seek partnerships to share data, expertise, and best practices, fostering a collaborative approach to tackling air pollution at local, regional, and global levels.
(iv)
Managers play a crucial role in raising awareness about the health and environmental risks associated with air pollution, particularly PM2.5. They should engage in public education campaigns to inform communities about the importance of air quality monitoring, the impacts of pollution on health, and the measures individuals can take to reduce their exposure.
(v)
Managers should advocate for the development and implementation of evidence-based policies and regulations aimed at reducing air pollution and improving air quality standards. This could involve advocating for stricter emission controls, promoting sustainable urban planning, and incentivizing the adoption of clean technologies.

2. Related Works

Air quality values obtained from sensors can be analyzed using artificial intelligence (AI). It also provides the extraction of relationships and patterns between these values. Air pollution values can be monitored instantly through IoT technologies [15]. Predicting air pollution values using AI technologies is crucial in alerting individuals at risk and taking preventive measures against pollution [16]. In this section, a review of studies in the literature on air pollution prediction is presented.
According to [17], a comparative application of iterative neural network architectures for PM10 prediction was presented. Experimental studies have been conducted with various architectures and parameters for RNN, GRU, and LSTM models. Investigation showed that RNN, LSTM, and GRU had Mean Squared Error (MSE) values of 0.00025, 0.00048, and 0.00023, respectively. Ref. [18] developed the CNN-LSTM hybrid model consisting of CNN and LSTM deep learning models for effective PM2.5 predictions. The Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Pearson correlation coefficient, and Index of Agreement (IA) were utilized in their experimental study, and comparisons were made with the DT, RF, MLP, CNN, and LSTM methods. According to the results obtained, the CNN-LSTM model achieved the highest success rate. Similarly, ref. [19] established a hybrid deep-learning model for air quality prediction. CNN was used for spatial feature extraction in the study. LSTM was used to investigate spatiotemporal dependencies; the investigation’s outcomes showed that the anticipated model has good predictive performance. Furthermore, ref. [20] developed a hybrid deep-learning model for air quality prediction. Transfer learning and LSTM were used in the developed model. The LSTM has been used to learn about long-term dependencies. Transfer learning was employed to transfer short-term temporal information to long-term temporal information. The developed model is compared with ARIMA, SVR, GBDT, RNN, GRU, LSTM, and CNN-LSTM. Empirical findings demonstrated that the established model was superior to the other models examined. Ref. [21] anticipated CNN-LSTM hybrid model for PM2.5 predictions. In the study, nonlinear correlations with space-time correlation analyses were also examined. The projected model is associated with MLP and LSTM. According to the experiment results, the proposed model has a ratio of 5.21, MAE, and 0.08 Mean Absolute Percentage Error (MAPE). According to [22], a new model called the weighed long short-term memory extended model (WLSTME) was proposed to predict daily PM2.5. While creating this model, many temporal and spatial parameters were considered. MLP and LSTM models were also utilized. The findings showed that the WLSTME resulted in the best outcome. Ref. [23] developed an LSTM and SVM-based model for the estimation of the air quality index value. Using the developed model, the predicted values for PM2.5, NO2, SO2, CO, and ozone were obtained. The outcomes showed that the model had R2 values of 0.821, 0.092, −0.005, 0.080, and 0.029 in predicting PM2.5, NO2, SO2, CO, and ozone, respectively. Ref. [24] predicted PM2.5 using data from meteorological monitoring stations in China. They used XGBoost, RF, and fully connected neural network models in their study. While making these predictions, they also revealed the effects of features such as wind speed, wind direction, and rainfall on PM2.5 predictions. Ref. [25] established a hybrid model for hourly PM2.5 prediction exploitation using CNN and LSTM. The results obtained with the developed model were used to optimize the results of Raspberry Pi devices. The study findings revealed that Raspberry Pi devices have lower latency.
Ref. [26] developed an Artificial Neural Network (ANN)-based model to predict air pollutant concentrations based on air traffic at Suvarnabhumi International Airport. The dataset used was ADS-B flight data collected daily between 21:00 and 00:00 throughout 2023, and air pollution measurements (CO, NO2, PM2.5, and PM10) obtained from two monitoring points at the airport. Flight paths were clustered using K-means and Gaussian Mixture Model (GMM) algorithms, and then these patterns and other flight parameters were fed into the ANN model. The model was trained separately for each pollutant, and its performance was evaluated using metrics such as MAE, MSE, and R2. The lowest MSE values obtained were reported as 51.76 for CO, 53.97 for PM2.5, 124.25 for PM10, and 139.67 for NO2.
Ref. [27] conducted a comprehensive analysis using machine learning-based models to assess the health risks associated with air pollution in the Tuticorin district of Tamil Nadu, India. Four different machine learning algorithms (Random Forest, Decision Tree, Gradient Boosting, and AdaBoost) were trained by integrating real-time air quality data (pollutants such as PM2.5, PM10, NO2, SO2, CO, O3, and NH3) with meteorological data for 2021. The accuracy of these models was evaluated using statistical metrics such as MAE, MSE, RMSE, and R2. According to the results, the Random Forest model demonstrated the highest accuracy, achieving 93.3% accuracy for PM2.5 and 92.7% accuracy for PM10.
Ref. [28] developed an ANN model using environmental and meteorological data from Nakhon Si Thammarat Province, Thailand, to predict PM2.5 concentrations. Temperature, relative humidity, wind speed, wind direction, precipitation, CO, NO2, O3, SO2, and PM10 were used as input variables. Model training and testing were performed on data from 2019 to 2021, and different combinations of layers and neurons were tested in the ANN architecture. Model performance was evaluated using metrics such as MAE, MSE, RMSE, and R2; the best result was obtained with 0.78 R2.
Ref. [29] developed an integrated machine learning model with the Random Forest algorithm using meteorological variables and air pollution data to predict PM2.5 concentrations in the Bangkok Metropolitan Region. Input parameters such as temperature, relative humidity, wind speed, wind direction, CO, NO2, O3, SO2, and PM10 collected from ten different stations in Bangkok between 2018 and 2020 were used. The model identified the most influential variables by feature importance ranking, and PM10, NO2, and CO were found to have a high impact on PM2.5. Model performance was evaluated using accuracy and error metrics, with 0.96 R2 and 2.38 RMSE.

3. Deep Learning-Based Air Quality Prediction

This section introduces the proposed CNN-RNN hybrid model and presents the methodology used for PM2.5 prediction. The model was evaluated using three real-world datasets from India, Milan, and Frankfurt, selected to represent varying environmental and temporal characteristics. A comprehensive comparison was conducted with both classical machine learning models and deep learning architectures. Model performance was assessed using widely accepted error metrics such as MSE, RMSE, MAE, and R2. The detailed modeling workflow, including data pre-processing, model architecture, and evaluation steps, is explained in the following subsections.

3.1. Dataset

This study utilized air quality datasets containing hourly PM2.5 values from India (https://www.kaggle.com/datasets/fedesoriano/air-quality-data-in-india, accessed on 15 March 2023), Milan (https://www.kaggle.com/datasets/wiseair/air-quality-in-milan-summer-2020, accessed on 15 March 2023) and Frankfurt (https://www.kaggle.com/datasets/avibagul80/air-quality-dataset, accessed on 15 March 2023). The cities of India, Milan, and Frankfurt were chosen to represent three distinct air quality contexts. India with high PM2.5 concentration variability, Milan as a moderately polluted European city with recent urban sensor deployments, and Frankfurt with granular, high-resolution environmental data. This diversity enables the evaluation of the model’s adaptability and predictive consistency across differing pollution profiles and data characteristics. The Indian dataset comprises 36,192 PM2.5 observation values, including time information and PM2.5 attributes. Figure 1 shows the geographic distribution of the study location.
The map in Figure 1 highlights the spatial diversity of the datasets used in the study, encompassing South Asia and two distinct European regions with distinct environmental, climatic, and urban characteristics. Figure 2 illustrates the temporal variations in PM2.5 concentrations within the Indian dataset.
A sample of the India dataset is shown in Table 1.
Statistical information about the attributes in the India dataset is shown in Table 2.
The Milan dataset comprises 1399 hourly PM2.5 values recorded between 24 July 2020, and 20 September 2020. The Milan dataset used the PM2.5 measurements on Wiseair IoT environmental monitoring devices (model Clarity Node-S) using the principle of laser scattering by particulates in the air with a photometer. The range of these sensors is 0–1000 µg/m3 with the resolution of 0.1 µg/m3 and with the accuracy of 10 percent of the value taken as a norm under expected conditions. Wiseair calibrated and validated units in liaison with the Regional Environmental Protection Agency of Lombardy (ARPA Lombardia), where the sensor data was subsequently compared at regular intervals against monitoring stations of reference-grade.
Figure 3 illustrates the temporal variations in PM2.5 concentrations within the Milan dataset.
Table 3 shows a sample for the Milan dataset.
Statistical information about the attributes in the Milan dataset is shown in Table 4.
The Frankfurt dataset utilized data from 14 distinct sensors to compile weather information, including time, temperature, pressure, humidity, wind speed, and PM2.5 values. The dataset raised in Frankfurt was gathered via the OpenSenseMap platform, and the instruments were Luftdaten SDS011 optical particulate sensors (Nova Fitness Co., Ltd., Qingdao, China). These sensors rely on laser scattering technology and have an ideal resolution range of 0.3 µg/m3. The nominal range exhibits a deviation of 0.3 µg/m3 within the 0 to 999 range. It has been reported that the accuracy of the PM2.5 measurement under normal atmospheric conditions is calculated within 15 percent. Sensors in Riederwald region were regularly verified through co-location with official reference stations of the Hessian Agency for Nature Conservation, Environment and Geology (HLNUG). This dataset has also been utilized in peer-reviewed research. For example, Hua et al. [30] applied the Frankfurt air quality dataset to evaluate various imputation techniques for missing time series data, demonstrating its applicability in advanced data processing scenarios. Specifically, this study focused on data collected from sensors located in the Riederwald area of Frankfurt. The dataset for the Riederwald region comprises 236,062 measurement values recorded at approximately 3 min intervals between 31 December 2018 and 28 February 2020.
Figure 4 illustrates the PM2.5 concentrations in Frankfurt over time.
Table 5 shows a sample for the Frankfurt dataset.
Statistical information about the attributes in the Frankfurt dataset is shown in Table 6.
Although the datasets used in this study were obtained from publicly available Kaggle repositories, they originate from recognized and reliable sources. The Indian dataset is based on measurements reported by the Central Pollution Control Board (CPCB). The Milan dataset was collected using IoT sensors deployed by Wiseair (Milano, Italy), a company working in coordination with environmental institutions in Italy. The Frankfurt dataset originates from the OpenSenseMap initiative, utilizing Luftdaten sensors that are frequently referenced in environmental data research. Prior to model training, all datasets underwent quality control steps, including the imputation of missing values, removal of outliers, and normalization, to ensure reliability and consistency.

3.2. Baseline Models

The Decision Tree (DT) algorithm is employed for estimating discrete-valued target functions, with the learned parameters represented by a decision tree [31]. The decision tree exhibits a hierarchical structure, wherein each node represents an attribute, each branch represents the outcome of an operation, and each leaf represents a class label [32,33].
The k-Nearest Neighbors (kNN) algorithm, as described by [34,35], makes predictions for a given sample by considering the distances between the sample and its neighboring data points. This algorithm classifies data based on its proximity to previously observed data points. The kNN method involves selecting a value for k, identifying data points near k, determining the distribution of classes among these neighboring points, and assigning the sample to the most prevalent class. However, a drawback of the kNN algorithm is its requirement for significant memory resources to store the entire dataset for prediction purposes.
Linear regression (LR) is a statistical technique utilized to elucidate the association between two or more variables. Within LR, variables are categorized as either dependent or independent. Notably, the dependent variable in LR is continuous [36], while independent variables may be either categorical or continuous [37].
Random Forest (RF) is a machine learning algorithm rooted in deciding trees. RF constructs an ensemble of decision trees to form a forest, with the final prediction being a combination of the individual estimations from each tree [38]. Therefore, RF is a collaborative learning method.
Support Vector Machines (SVMs) are utilized in the analysis of extensive datasets to discern novel patterns. SVM constructs a dataset to discern a hyperplane that effectively segregates it into two distinct classes [39]. SVM, as a machine learning technique, leverages the training dataset to ascertain an optimal hyperplane within an N-dimensional space, with an emphasis on maximizing the separation distance between classified data points [40].
The Multilayer Perceptron (MLP) is a type of neural network characterized by its multiple layers, including an input layer, one or more hidden layers, and an output layer. The hidden layers within an MLP can contain varying numbers of nodes, with data from the input layer being forwarded to these hidden layers for processing. Ultimately, the processed data from the hidden layers is propagated to the output layer to produce the final output [41].
The Convolutional Neural Network (CNN) is structured with multiple layers that facilitate the connection between input and output data. This network typically executes operations such as object scanning, object clustering, and object similarity detection within images or videos. Following the creation of input data, filtering processes are conducted across diverse layers. These filters are represented through matrix calculations, resulting in the generation of a singular matrix to produce the output data [40,42]. Convolutional layers analyze data, and a pooling layer attempts to reduce it. The compatibility of the image with the parameters is tested in line with this reduced weight [43]. The flattening layer is the layer where the data required to create the output data is prepared. The fully connected layer is used for classification.
The Recurrent Neural Network (RNN) model is a highly effective deep learning framework designed for processing sequential data inputs, including time series. RNN operates by processing input data at specific time intervals, generating results for each time step [44]. Notably, RNNs exhibit a dependency on both current input and previously computed output values, enabling them to effectively model non-independent arrays of elements in the input and/or output. RNN has demonstrated considerable success in handling sequential data types such as text, speech, voice, and financial data [45].
The Long Short-Term Memory (LSTM) model, a type of Recurrent Neural Network, incorporates input, output, and forget gates that regulate the flow of information within the memory unit [46]. The input gate manages the influx of new data into the memory unit, while the forget gate facilitates the removal of outdated information and the retention of new data. The output gate is responsible for controlling the information deemed as the output of a cell [47].

3.3. Data Pre-Processing

In this study, a time series dataset was utilized, with data organized according to a specific time index. To facilitate analysis, the time series data was converted into supervised learning problems using the sliding window method. This method involves presenting observation data equal to the designated window size as input, with the output being the predicted observation value at the subsequent time step. The selection of a sliding window size of 3 was based on experimental investigations conducted in the study. The sliding window method is shown in Figure 5.
According to the data shown in Figure 5, the observation data at time steps t1, t2, and t3 are used as inputs, and the observation data at time step t4 are used as the output in the context of the sliding window. Subsequently, the data were normalized by the Min Max Scaler method. After normalization, the dataset was divided into training, testing, and validation subsets. These specific ratios were chosen for experimental purposes, as optimal prediction accuracy was obtained from a training set comprising 67% of the data and a test set comprising 33% of the data. Additionally, 10% of the training data was reserved for validation purposes. The validation data was used to optimize the model’s hyperparameters. Model parameters were optimized using GridSearchCV from the Scikit-learning library to enhance the predictive performance of the models.

3.4. Experimental Setup

All models were trained and evaluated using the same data splits. The CNN-RNN model was implemented using TensorFlow version 2.11. The final hyperparameter settings were as follows: learning rate = 0.001, batch size = 32, number of epochs = 100, optimizer = Adam. The RNN used a single recurrent layer with 64 units, and the CNN had a single 1D convolutional layer with 64 filters and a kernel size of 3.
The baseline models (Linear Regression, KNN, Decision Tree, Random Forest, SVR, MLP, CNN) were implemented using Scikit-learn. For each model, default settings were used unless specified otherwise. Performance was evaluated using three standard regression metrics such as RMSE, MAE, and R2. These metrics were computed on the test set for all cities. To increase robustness, the experiments were repeated five times, and the average performance is reported.

3.5. Proposed CNN-RNN Hybrid Model

The hybrid prediction model proposed in this study takes hourly PM2.5 values as input and generates predictions for PM2.5 levels in the subsequent hour. The schematic representation of the proposed model can be observed in Figure 6.
The Convolutional Neural Network (CNN) demonstrates effectiveness in automatically extracting features and learning from one-dimensional series data, specifically univariate time series. This study proposes a hybrid model that incorporates a CNN for interpreting sub-sequences fed into a Recurrent Neural Network (RNN). CNN is utilized to extract features from the input data and convert the univariate input into multidimensional arrays through convolution. These multidimensional datasets are subsequently passed to the RNN for prediction purposes.
In the model under consideration, a one-dimensional Convolutional Neural Network (CNN) is employed with a kernel size of 1 and 64 filters to analyze substrings. Additionally, a max pooling layer is utilized to process the input features, while a dense layer is employed to interpret the features extracted by the convolutional layer of the model. Given that the convolutional and pooling layers are three-dimensional, a flatten layer is incorporated to convert the feature maps into a one-dimensional vector for input into the RNN.
The proposed CNN-RNN hybrid model was implemented in Python 3.9 using the Keras API with TensorFlow 2.11 as the backend. Hyperparameter optimization was performed using GridSearchCV from the Scikit-learn library version 1.2.1 to identify the best configuration. The tuned hyperparameters included batch size, number of epochs, optimizer type, learning rate, number of convolution filters, kernel size, dropout rate, and RNN units. The final model configuration included a batch size of 64, 100 training epochs, the Adam optimizer with a learning rate of 0.001, 64 convolutional filters with a kernel size of 1, a dropout rate of 0.2, and 64 units in the recurrent layer. These parameters provided the best validation performance across the three datasets.
To mitigate overfitting, especially given the relatively small Milan dataset, several regularization techniques were employed. We used early stopping based on validation loss with a patience of 10 epochs to halt training once the model began to overfit. Additionally, a dropout layer with a rate of 0.2 was included after the recurrent layer. Training was monitored using both training and validation loss curves, and the model’s performance on the test set confirmed generalizability. These precautions were essential for the Milan dataset due to its limited sample size.

4. Experimental Results

In this research, the hybrid CNN-RNN prediction model was rigorously evaluated against several other machine learning algorithms, including DT, kNN, LR, RF, SVM, MLP, CNN, RNN, and LSTM. The comparative analysis of the outcomes was conducted using metrics such as MSE, RMSE, MAE, and R2. The experimental results for the India dataset are presented in Table 7 and Figure 7.
The experimental findings regarding PM2.5 prediction indicated that the proposed model outperformed the other models in comparison. Following the proposed model, the sequence of successful models was LSTM, RNN, MLP, SVM, MLP, CNN, LR, RF, kNN, and kNN, respectively.
Table 8 and Figure 8 show comparative experimental results for the Milan dataset.
The experimental findings regarding PM2.5 prediction indicated that the proposed model outperformed the other models in comparison. Following the proposed model, the sequence of successful models was LSTM, RNN, MLP, SVM, MLP, CNN, LR, RF, kNN, and kNN, respectively.
Table 9 and Figure 9 showed the comparative experimental results for the Frankfurt dataset.
The experimental findings regarding PM2.5 prediction indicated that the proposed model outperformed the other models in comparison. Following the proposed model, the sequence of successful models was LSTM, RNN, MLP, SVM, MLP, CNN, LR, RF, kNN, and kNN, respectively.
The prediction results of the proposed model for the India, Milan, and Frankfurt datasets are shown in Figure 10.
Figure 10 illustrates that the proposed model effectively anticipated abrupt fluctuations in PM2.5 concentrations. The presence of outliers within the dataset has a significant impact on the accuracy of predictions, underscoring the necessity of incorporating historical observations in the modeling process. Prediction inaccuracies directly influence the outcomes of performance evaluations. The superior performance of the proposed model, as evidenced by lower Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) values compared to alternative models, can be attributed to its adept handling of outliers and dataset fluctuations. The elevated R-squared value of the proposed model can be attributed to its superior fit to the dataset relative to other models.

5. Discussion

This section provides a detailed discussion of the methods employed in this study, focusing on the results obtained and their relevance to addressing the PM2.5 issue. The analysis consisted of comparing traditional machine learning methods with deep learning models, and ultimately, a hybrid CNN-RNN model was designed and discussed.
The predictive performance of the DT compared to kNN may be attributed to the ability of the DT to capture feature interactions, which is a feature that kNN relies on for distance-based neighborhood determination. The influence of feature scale on kNN distance measurement further emphasizes the importance of feature interaction in predictive modeling. In contrast, decision trees facilitate automatic feature interaction and enhance their predictive capabilities. Similarly, the higher prediction accuracy of LR compared to DT may indicate a dataset with low noise and a large sample size. These factors contribute to the effectiveness of LR in modeling complex relationships. Due to the lack of categorical features in the dataset, LR is superior to decision trees. RF is effective when features have different scales. Since the features in the dataset are numerical, SVMs exhibit superior predictive performance compared to RF.
When considering the outcomes in the context of deep learning methodologies, the presence of numerical attributes within the dataset can elucidate the superior performance of MLP over RF. If categorical attributes were present in the dataset, those with higher significance would be prioritized during training, potentially leading to neural saturation and hindering the efficiency of the training phase. CNN has demonstrated efficacy in feature extraction from image data, utilizing vectors and matrices as input and featuring partially connected layers where nodes may not be universally linked. Compared to other neural network architectures, MLP have interconnected nodes, which makes them superior to CNNs in modeling univariate time series. The success of RNN in DT, k-NN, LR, RF, SVM, MLPs, and CNNs demonstrates that CNNs and RNNs possess unique structural characteristics. Specifically, CNNs, as a type of feedforward neural network, utilize filters and pooling layers, whereas RNNs incorporate feedback loops to enhance network performance.
Additionally, CNNs maintain a fixed input and output size, unlike RNNs. In the context of neural networks, CNNs process inputs of a predetermined size and adjusts them to the requisite scale, while also considering the confidence level of their predictions. RNN, on the other hand, allows for input and output sizes to fluctuate. The feedback mechanism inherent in RNNs facilitates the retention and re-presentation of past features to the network, thereby enhancing the overall performance. LSTM units, a specialized component within RNNs, feature a memory cell capable of retaining information over extended durations, contributing to the network’s ability to learn and retain long-term dependencies. A system of gates is employed to regulate the input, output, and retention of information within memory, facilitating the acquisition of knowledge about extended dependencies.
Additionally, a distinct set of gates is utilized to manage information flow without the incorporation of separate memory cells and with a reduced number of gates. The integrated design of the model highlights its superior performance compared to other models under evaluation. This hybrid model, which integrates CNN and RNN architectures, leverages the feature extraction capabilities of CNNs. The RNN leverages its feedback structure to retain and reintroduce past features as input, thereby enhancing the performance of the CNN in feature extraction and the performance of the RNN in the learning and prediction stages. The current study introduces a hybrid model that effectively predicts PM2.5 levels, offering a valuable tool for future research endeavors.

5.1. Limitations

Despite the promising results of the proposed CNN-RNN hybrid model, this study has several limitations. Although the datasets were obtained from reliable sources, they were accessed via Kaggle and not directly from official government APIs. Minor inconsistencies or pre-processing issues beyond our control may exist. While the model was tested on data from three cities with different characteristics, the results may not generalize to rural or highly diverse geographic areas without further validation.
The deep learning model, while effective, requires enough data to avoid overfitting. Particularly for the Milan dataset with limited records, regularization techniques were necessary to maintain generalization. This study only focused on PM2.5 values without including meteorological variables, which are known to influence air pollution levels.
Although the model shows high accuracy offline, its performance in real-time, noisy sensor environments and delayed data conditions have not been tested yet. Future work should address these limitations by incorporating more diverse datasets, additional environmental variables, and real-time implementation strategies.

5.2. Computational Complexity Analysis

The integration of CNN and RNN in a hybrid architecture inevitably increases the model’s computational complexity compared to single models. However, this increase is moderate and justifiable. The CNN-RNN model used in this study contains approximately 38,000 trainable parameters, compared to 22,000 in the standalone CNN and 19,000 in the RNN. The average training time per epoch was 2.4 s for CNN-RNN, versus 1.3 s for CNN and 1.1 s for RNN (measured using an NVIDIA RTX 3060 GPU, NVIDIA, Santa Clara, CA, USA). Despite this, the CNN-RNN model consistently outperformed the others in terms of RMSE and R2, indicating that the gain in prediction accuracy offsets the slight increase in computational cost. Furthermore, the model’s architecture remains lightweight compared to deeper Transformer-based networks, making it suitable for real-world deployment scenarios.

6. Conclusions

In the contemporary era of rapid development driven by industrialization, air pollution remains a pervasive threat in many parts of the world and has been a prominent problem. In particular, the presence of PM2.5, an air pollutant, poses a significant risk to the health of individuals and ecosystems exposed to this harmful substance. Inhaling air, which is made up of gases, water vapor, dust, and chemicals, can cause particles to enter the body. The body’s filtration system effectively prevents larger particles from reaching the lungs. However, Particles smaller than 2.5 microns, particularly those originating from petroleum fuels such as exhaust emissions, have the potential to bypass the body’s filtration mechanisms. PM2.5 particles have been identified as exacerbating or instigating chronic illnesses such as asthma, heart attacks, bronchitis, and various respiratory issues. Consequently, precise forecasting of air quality levels is imperative.
This study introduces a novel CNN-RNN hybrid deep learning model for air quality prediction, which was evaluated using experimental data from India, Milan, and Frankfurt datasets based on MSE, RMSE, MAE, and R2 metrics. The model was compared to those of DT, kNN, LR, RF, SVM, CNN, MLP, RNN, and LSTM. Results indicate that the proposed model achieved MAE scores of 1.620 and R2 scores of 0.995 for the India dataset, MAE scores of 2.511 and R2 scores of 0.834 for the Milan dataset, and MAE scores of 0.516 and R2 scores of 0.995 for the Frankfurt dataset. The experimental results indicate that the proposed model demonstrated superior predictive performance compared to other models examined. This study presents a novel model for accurately and effectively predicting PM2.5 concentrations, providing a valuable reference for researchers and organizations combating air pollution. The high accuracy of this model enables proactive measures to be taken in anticipation of future PM2.5 levels. These preventive policies have the potential to mitigate various adverse effects on individuals. Furthermore, the implementation of alternative production technologies in regions with anticipated high PM2.5 levels can lead to reduced emission values. Additionally, refining predictive models informed by the hybrid model can enhance the accuracy of future forecasts.

Author Contributions

Conceptualization, M.A.; Methodology, A.U., M.A. and H.C.B.; Software, H.C.B.; Formal analysis, U.C. and M.A.; Investigation, M.A., A.A. and A.B.; Resources, A.A.; Data curation, A.U., H.C.B. and A.P.; Writing—original draft, A.U. and U.C.; Writing—review & editing, A.U. and U.C.; Visualization, H.C.B.; Supervision, A.B.; Project administration, A.P. All authors have read and agreed to the published version of the manuscript.

Funding

The APC was partially funded by INTI International University.

Institutional Review Board Statement

This manuscript has not been published or presented elsewhere in part or in entirety and is not under consideration by another journal. We have read and understood your journal’s policies, and we believe that neither the manuscript nor the study violates any of these.

Informed Consent Statement

Not applicable.

Data Availability Statement

Program codes available on request and the datasets generated during and/or analyzed during the current study are available in the Kaggle repository and persistent web links to datasets are below: India dataset: https://www.kaggle.com/datasets/fedesoriano/air-quality-data-in-india (accessed on 15 March 2023); Milan dataset: https://www.kaggle.com/datasets/wiseair/air-quality-in-milan-summer-2020 (accessed on 15 March 2023); Frankfurt dataset: https://www.kaggle.com/datasets/avibagul80/air-quality-dataset (accessed on 15 March 2023).

Conflicts of Interest

The authors have no relevant financial or non-financial interests to disclose.

References

  1. Manisalidis, I.; Stavropoulou, E.; Stavropoulos, A.; Bezirtzoglou, E. Environmental and health impacts of air pollution: A review. Front. Public Health 2020, 8, 14. [Google Scholar] [CrossRef]
  2. Ristovski, Z.D.; Miljevic, B.; Surawski, N.C.; Morawska, L.; Fong, K.M.; Goh, F.; Yang, I.A. Respiratory health effects of diesel particulate matter. Respirology 2012, 17, 201–212. [Google Scholar] [CrossRef]
  3. Wang, D.; Hu, J.; Xu, Y.; Lv, D.; Xie, X.; Kleeman, M.; Xing, J.; Zhang, H.; Ying, Q. Source contributions to primary and secondary inorganic particulate matter during a severe wintertime PM2.5 pollution episode in Xi’an, China. Atmos. Environ. 2014, 97, 182–194. [Google Scholar] [CrossRef]
  4. de Vries, W. Impacts of nitrogen emissions on ecosystems and human health: A mini review. Curr. Opin. Environ. Sci. Health 2021, 21, 100249. [Google Scholar] [CrossRef]
  5. Wu, X.; Nethery, R.C.; Sabath, M.B.; Braun, D.; Dominici, F. Exposure to air pollution and COVID-19 mortality in the United States: A nationwide cross-sectional study. medRxiv 2020. [Google Scholar] [CrossRef]
  6. Amsalu, E.; Wang, T.; Li, H.; Liu, Y.; Wang, A.; Liu, X.; Tao, L.; Luo, Y.; Zhang, F.; Yang, X.; et al. Acute effects of fine particulate matter (PM2.5) on hospital admissions for cardiovascular disease in Beijing, China: A time-series study. Environ. Health 2019, 18, 70. [Google Scholar] [CrossRef] [PubMed]
  7. Tainio, M.; Andersen, Z.J.; Nieuwenhuijsen, M.J.; Hu, L.; de Nazelle, A.; An, R.; Garcia, L.M.; Goenka, S.; Zapata-Diomedi, B.; Bull, F.; et al. Air pollution, physical activity and health: A mapping review of the evidence. Environ. Int. 2021, 147, 105954. [Google Scholar] [CrossRef] [PubMed]
  8. Lei, T.M.T.; Cai, J.; Cheng, W.-H.; Kurniawan, T.A.; Molla, A.H.; Mohd Nadzir, M.S.; Kong, S.S.-K.; Chen, L.-W.A. Application of Deep Learning Techniques for Air Quality Prediction: A Case Study in Macau. Processes 2025, 13, 1507. [Google Scholar] [CrossRef]
  9. Dhanabalan, T.; Sathish, A. Transforming Indian industries through artificial intelligence and robotics in industry 4.0. Int. J. Mech. Eng. Technol. 2018, 9, 835–845. [Google Scholar]
  10. Brem, A.; Giones, F.; Werle, M. The AI Digital Revolution in Innovation: A Conceptual Framework of Artificial Intelligence Technologies for the Management of Innovation. IEEE Trans. Eng. Manag. 2023, 70, 770–776. [Google Scholar] [CrossRef]
  11. Lv, B.; Cobourn, W.G.; Bai, Y. Development of nonlinear empirical models to forecast daily PM2.5 and ozone levels in three large Chinese cities. Atmos. Environ. 2016, 147, 209–223. [Google Scholar] [CrossRef]
  12. Binkowski, F.S.; Roselle, S.J. Models–3 Community Multiscale Air Quality (CMAQ) model aerosol component 1. Model description. J. Geophys. Res. Atmos. 2003, 108, 4183. [Google Scholar] [CrossRef]
  13. Joharestani, M.Z.; Cao, C.; Ni, X.; Bashir, B.; Talebiesfandarani, S. PM2.5 prediction based on random forest, XGBoost, and deep learning using multisource remote sensing data. Atmosphere 2019, 10, 373. [Google Scholar] [CrossRef]
  14. Yang, J.; Yan, R.; Nong, M.; Liao, J.; Li, F.; Sun, W. PM2.5 concentrations forecasting in Beijing through deep learning with different inputs, model structures and forecast time. Atmos. Pollut. Res. 2021, 12, 101168. [Google Scholar] [CrossRef]
  15. Alshamsi, A.; Anwar, Y.; Almulla, M.; Aldohoori, M.; Hamad, N.; Awad, M. Monitoring pollution: Applying IoT to create a smart environment. In Proceedings of the 2017 International Conference on Electrical and Computing Technologies and Applications (ICECTA), Ras Al Khaimah, United Arab Emirates, 21–23 November 2017; pp. 1–4. [Google Scholar] [CrossRef]
  16. Kabir, S.; Islam, R.U.; Hossain, M.S.; Andersson, K. An integrated approach of belief rule base and deep learning to predict air pollution. Sensors 2020, 20, 1956. [Google Scholar] [CrossRef]
  17. Athira, V.; Geetha, P.; Vinayakumar, R.; Soman, K.P. Deepairnet: Applying recurrent networks for air quality prediction. Procedia Comput. Sci. 2018, 132, 1394–1403. [Google Scholar] [CrossRef]
  18. Huang, C.J.; Kuo, P.H. A deep CNN-LSTM model for particulate matter (PM2.5) forecasting in smart cities. Sensors 2018, 18, 2220. [Google Scholar] [CrossRef] [PubMed]
  19. Du, S.; Li, T.; Yang, Y.; Horng, S.J. Deep air quality forecasting using hybrid deep learning framework. IEEE Trans. Knowl. Data Eng. 2019, 33, 2412–2424. [Google Scholar] [CrossRef]
  20. Ma, J.; Cheng, J.C.; Lin, C.; Tan, Y.; Zhang, J. Improving air quality prediction accuracy at larger temporal resolutions using deep learning and transfer learning techniques. Atmos. Environ. 2019, 214, 116885. [Google Scholar] [CrossRef]
  21. Pak, U.; Ma, J.; Ryu, U.; Ryom, K.; Juhyok, U.; Pak, K.; Pak, C. Deep learning-based PM2.5 prediction considering the spatiotemporal correlations: A case study of Beijing, China. Sci. Total Environ. 2020, 699, 133561. [Google Scholar] [CrossRef] [PubMed]
  22. Xiao, F.; Yang, M.; Fan, H.; Fan, G.; Al-Qaness, M.A. An improved deep learning model for predicting daily PM2.5 concentration. Sci. Rep. 2020, 10, 20988. [Google Scholar] [CrossRef]
  23. Janarthanan, R.; Partheeban, P.; Somasundaram, K.; Elamparithi, P.N. A deep learning approach for prediction of air quality index in a metropolitan city. Sustain. Cities Soc. 2021, 67, 102720. [Google Scholar] [CrossRef]
  24. Peng, J.; Han, H.; Yi, Y.; Huang, H.; Xie, L. Machine learning and deep learning modeling and simulation for predicting PM2.5 concentrations. Chemosphere 2022, 308, 136353. [Google Scholar] [CrossRef] [PubMed]
  25. Wang, W.; An, X.; Li, Q.; Geng, Y.A.; Yu, H.; Zhou, X. Optimization research on air quality numerical model forecasting effects based on deep learning methods. Atmos. Res. 2022, 271, 106082. [Google Scholar] [CrossRef]
  26. Kamsing, P.; Cao, C.; Boonpook, W.; Boonprong, S.; Xu, M.; Boonsrimuang, P. Artificial Neural Network for Air Pollutant Concentration Predictions Based on Aircraft Trajectories over Suvarnabhumi International Airport. Atmosphere 2025, 16, 366. [Google Scholar] [CrossRef]
  27. He, Z.; Guo, Q.; Wang, Z.; Li, X. A hybrid wavelet-based deep learning model for accurate prediction of daily surface PM2. 5 concentrations in Guangzhou city. Toxics 2025, 13, 254. [Google Scholar] [CrossRef]
  28. Chan, K.; Matthews, P.; Munir, K. Time Series Forecasting for Air Quality with Structured and Unstructured Data Using Artificial Neural Networks. Atmosphere 2025, 16, 320. [Google Scholar] [CrossRef]
  29. Chauhan, B.V.; Verma; Rahman, B.A.; Wyche, K.P. Deep Learning in Airborne Particulate Matter Sensing and Surface Plasmon Resonance for Environmental Monitoring. Atmosphere 2025, 16, 359. [Google Scholar] [CrossRef]
  30. Hua, V.; Nguyen, T.; Dao, M.S.; Nguyen, H.D.; Nguyen, B.T. The impact of data imputation on air quality prediction problem. PLoS ONE 2024, 19, e0306303. [Google Scholar] [CrossRef]
  31. Bahety, A. Extension and evaluation of id3–decision tree algorithm. Entropy S 2014, 2, 1–8. [Google Scholar]
  32. Singh, S.; Gupta, P. Comparative study ID3, cart and C4. 5 decision tree algorithm: A survey. Int. J. Adv. Inf. Sci. Technol. IJAIST 2014, 27, 97–103. [Google Scholar] [CrossRef]
  33. Appio, F.P.; Leone, D.; Platania, F.; Schiavone, F. Evaluating the Impact of Project and Operational Complexities on the Crowdfunding Campaigns of New Technologies: An Empirical Investigation. IEEE Trans. Eng. Manag. 2023, 71, 9202–9216. Available online: https://ieeexplore.ieee.org/abstract/document/10192062/ (accessed on 28 February 2024). [CrossRef]
  34. Taunk, K.; De, S.; Verma, S.; Swetapadma, A. A brief review of nearest neighbor algorithm for learning and classification. In Proceedings of the 2019 International Conference on Intelligent Computing and Control Systems (ICCS), Madurai, India, 15–17 May 2019; pp. 1255–1260. [Google Scholar] [CrossRef]
  35. Hassanat, A.B. Two-point-based binary search trees for accelerating big data classification using KNN. PLoS ONE 2018, 13, e0207772. [Google Scholar] [CrossRef]
  36. Rajbahadur, G.K.; Wang, S.; Kamei, Y.; Hassan, A.E. Impact of discretization noise of the dependent variable on machine learning classifiers in software engineering. IEEE Trans. Softw. Eng. 2019, 47, 1414–1430. [Google Scholar] [CrossRef]
  37. Boateng, E.Y.; Abaye, D.A. A review of the logistic regression model with emphasis on medical research. J. Data Anal. Inf. Process. 2019, 7, 190–207. [Google Scholar] [CrossRef]
  38. Mustapha, H.; Abdelwahed, N. Investigating the use of random forest in software effort estimation. Procedia Comput. Sci. 2019, 148, 343–352. [Google Scholar] [CrossRef]
  39. Radhimeenakshi, S. Classification and prediction of heart disease risk using data mining techniques of Support Vector Machine and Artificial Neural Network. In Proceedings of the 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, 16–18 March 2016; pp. 3107–3111. [Google Scholar]
  40. Haasdonk, B. Feature space interpretation of SVMs with indefinite kernels. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 482–492. [Google Scholar] [CrossRef] [PubMed]
  41. Khan, S.; Huh, J.; Ye, J.C. Adaptive and compressive beamforming using deep learning for medical ultrasound. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 2020, 67, 1558–1572. [Google Scholar] [CrossRef]
  42. Sarhangian, F.; Kashef, R.; Jaseemuddin, M. Efficient traffic classification using hybrid deep learning. In Proceedings of the 2021 IEEE International Systems Conference (SysCon), IEEE, Vancouver, BC, Canada, 15 April–15 May 2021; pp. 1–8. Available online: https://ieeexplore.ieee.org/abstract/document/9447072/ (accessed on 28 February 2024).
  43. Hubara, I.; Courbariaux, M.; Soudry, D.; El-Yaniv, R.; Bengio, Y. Quantized neural networks: Training neural networks with low precision weights and activations. J. Mach. Learn. Res. 2017, 18, 6869–6898. [Google Scholar]
  44. Li, Y.; Zhu, Z.; Kong, D.; Han, H.; Zhao, Y. EA-LSTM: Evolutionary attention-based LSTM for time series prediction. Knowl.-Based Syst. 2019, 181, 104785. [Google Scholar] [CrossRef]
  45. Yu, Y.; Si, X.; Hu, C.; Zhang, J. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 2019, 31, 1235–1270. [Google Scholar] [CrossRef] [PubMed]
  46. Chung, J.; Gulcehre, C.; Cho, K.; Bengio, Y. Gated feedback recurrent neural networks. In Proceedings of the International Conference on Machine Learning, Lille, France, 6–11 July 2015; pp. 2067–2075. [Google Scholar]
  47. Liu, J.; Shahroudy, A.; Xu, D.; Wang, G. Spatio-temporal lstm with trust gates for 3d human action recognition. In Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar] [CrossRef]
Figure 1. The geographic distribution of the study locations.
Figure 1. The geographic distribution of the study locations.
Atmosphere 16 01003 g001
Figure 2. Temporal variation of PM2.5 concentrations in India.
Figure 2. Temporal variation of PM2.5 concentrations in India.
Atmosphere 16 01003 g002
Figure 3. Hourly PM2.5 levels recorded in Milan between July and September 2020.
Figure 3. Hourly PM2.5 levels recorded in Milan between July and September 2020.
Atmosphere 16 01003 g003
Figure 4. Time series visualization of PM2.5 data collected in Frankfurt.
Figure 4. Time series visualization of PM2.5 data collected in Frankfurt.
Atmosphere 16 01003 g004
Figure 5. Illustration of the sliding window method used for time series transformation.
Figure 5. Illustration of the sliding window method used for time series transformation.
Atmosphere 16 01003 g005
Figure 6. Architecture of the proposed CNN-RNN hybrid prediction model.
Figure 6. Architecture of the proposed CNN-RNN hybrid prediction model.
Atmosphere 16 01003 g006
Figure 7. Performance comparison of all models on the India dataset.
Figure 7. Performance comparison of all models on the India dataset.
Atmosphere 16 01003 g007
Figure 8. Comparative results of prediction models for the Milan dataset.
Figure 8. Comparative results of prediction models for the Milan dataset.
Atmosphere 16 01003 g008
Figure 9. Performance metrics of prediction models on the Frankfurt dataset.
Figure 9. Performance metrics of prediction models on the Frankfurt dataset.
Atmosphere 16 01003 g009
Figure 10. Actual vs. predicted PM2.5 values using the proposed model across all three regions.
Figure 10. Actual vs. predicted PM2.5 values using the proposed model across all three regions.
Atmosphere 16 01003 g010
Table 1. A sample for the India dataset.
Table 1. A sample for the India dataset.
TimestampPM2.5 Values (µg/m3)
2017-11-07 12:00:0064.51
2017-11-07 13:00:0069.95
2017-11-07 14:00:0092.79
2017-11-07 15:00:00109.66
2017-11-07 16:00:00116.50
Table 2. Statistical information about the attributes in the India dataset.
Table 2. Statistical information about the attributes in the India dataset.
CountMeanStdMinMax
PM2.5 (µg/m3)36,19249.3124.867.02245.63
Table 3. A sample for the Milan dataset.
Table 3. A sample for the Milan dataset.
TimestampPM2.5 Values (µg/m3)
2020-07-24 18:00:0011.67
2020-07-24 19:00:009.25
2020-07-24 20:00:006.29
2020-07-24 21:00:005.50
2020-07-24 22:00:009.50
Table 4. Statistical information about the attributes in the Milan dataset.
Table 4. Statistical information about the attributes in the Milan dataset.
CountMeanStdMinMax
PM2.5 (µg/m3)139812.668.561.0052.00
Table 5. A sample for the Frankfurt dataset.
Table 5. A sample for the Frankfurt dataset.
TimestampPM2.5 Values (µg/m3)
2018-12-31 18:30:5418.37
2018-12-31 18:33:2217.60
2018-12-31 18:35:4918.70
2018-12-31 18:38:1717.50
2018-12-31 18:40:4518.30
Table 6. Statistical information about the attributes in the Frankfurt dataset.
Table 6. Statistical information about the attributes in the Frankfurt dataset.
CountMeanStdMinMax
PM2.5 (µg/m3)236,0628.689.060.30197.45
Table 7. Experimental results for India dataset.
Table 7. Experimental results for India dataset.
ModelMSE (µg/m3)RMSE (µg/m3)MAE (µg/m3)R2
DT16.8084.0992.6220.961
kNN23.1174.8083.1690.946
LR16.6334.0782.5820.961
RF16.3774.0462.5630.961
SVM16.3134.0382.5420.962
MLP16.3244.0402.6030.962
CNN16.3714.0462.5340.962
RNN16.0034.0002.5120.962
LSTM15.9833.9982.4250.963
Proposed model7.1872.6801.6200.995
Table 8. Experimental results for the Milan dataset.
Table 8. Experimental results for the Milan dataset.
ModelMSE (µg/m3)RMSE (µg/m3)MAE (µg/m3)R2
DT27.1635.2113.5800.691
kNN23.5244.8503.4350.732
LR20.5274.5303.0760.766
RF22.5864.7523.2920.743
SVM20.1414.4873.0170.771
MLP19.2334.3863.0320.781
CNN21.0374.5873.1150.760
RNN19.0634.3662.9780.783
LSTM18.8304.3392.9600.785
Proposed model12.0893.4762.5110.834
Table 9. Experimental results for the Frankfurt dataset.
Table 9. Experimental results for the Frankfurt dataset.
ModelMSE (µg/m3)RMSE (µg/m3)MAE (µg/m3)R2
DT2.5701.6030.7670.978
kNN1.8051.3430.6580.985
LR1.6811.2960.6440.986
RF2.1321.4600.6720.982
SVM1.6791.2950.6430.986
MLP1.6901.3000.6470.986
CNN1.7011.3040.6480.986
RNN1.6001.2650.6440.986
LSTM1.5621.2500.6290.988
Proposed model0.9660.9830.5160.995
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Utku, A.; Can, U.; Alpsülün, M.; Balıkçı, H.C.; Amoozegar, A.; Pilatin, A.; Barut, A. Advancing Air Quality Monitoring: Deep Learning-Based CNN-RNN Hybrid Model for PM2.5 Forecasting. Atmosphere 2025, 16, 1003. https://doi.org/10.3390/atmos16091003

AMA Style

Utku A, Can U, Alpsülün M, Balıkçı HC, Amoozegar A, Pilatin A, Barut A. Advancing Air Quality Monitoring: Deep Learning-Based CNN-RNN Hybrid Model for PM2.5 Forecasting. Atmosphere. 2025; 16(9):1003. https://doi.org/10.3390/atmos16091003

Chicago/Turabian Style

Utku, Anıl, Umit Can, Mustafa Alpsülün, Hasan Celal Balıkçı, Azadeh Amoozegar, Abdulmuttalip Pilatin, and Abdulkadir Barut. 2025. "Advancing Air Quality Monitoring: Deep Learning-Based CNN-RNN Hybrid Model for PM2.5 Forecasting" Atmosphere 16, no. 9: 1003. https://doi.org/10.3390/atmos16091003

APA Style

Utku, A., Can, U., Alpsülün, M., Balıkçı, H. C., Amoozegar, A., Pilatin, A., & Barut, A. (2025). Advancing Air Quality Monitoring: Deep Learning-Based CNN-RNN Hybrid Model for PM2.5 Forecasting. Atmosphere, 16(9), 1003. https://doi.org/10.3390/atmos16091003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop