Next Article in Journal
Factors Associated with Single-Use and Co-Use of Tobacco and Alcohol: A Multinomial Modeling Approach
Previous Article in Journal
Combined Effects of Aircraft, Rail, and Road Traffic Noise on Total Noise Annoyance—A Cross-Sectional Study in Innsbruck
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Air Quality Early-Warning System Based on Artificial Intelligence

1
College of Atmospheric Sciences, Lanzhou University, Lanzhou 730000, China
2
School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
3
School of Management, Lanzhou University, Lanzhou 730000, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2019, 16(19), 3505; https://doi.org/10.3390/ijerph16193505
Submission received: 10 August 2019 / Revised: 13 September 2019 / Accepted: 16 September 2019 / Published: 20 September 2019

Abstract

:
The problem of air pollution is a persistent issue for mankind and becoming increasingly serious in recent years, which has drawn worldwide attention. Establishing a scientific and effective air quality early-warning system is really significant and important. Regretfully, previous research didn’t thoroughly explore not only air pollutant prediction but also air quality evaluation, and relevant research work is still scarce, especially in China. Therefore, a novel air quality early-warning system composed of prediction and evaluation was developed in this study. Firstly, the advanced data preprocessing technology Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (ICEEMDAN) combined with the powerful swarm intelligence algorithm Whale Optimization Algorithm (WOA) and the efficient artificial neural network Extreme Learning Machine (ELM) formed the prediction model. Then the predictive results were further analyzed by the method of fuzzy comprehensive evaluation, which offered intuitive air quality information and corresponding measures. The proposed system was tested in the Jing-Jin-Ji region of China, a representative research area in the world, and the daily concentration data of six main air pollutants in Beijing, Tianjin, and Shijiazhuang for two years were used to validate the accuracy and efficiency. The results show that the prediction model is superior to other benchmark models in pollutant concentration prediction and the evaluation model is satisfactory in air quality level reporting compared with the actual status. Therefore, the proposed system is believed to play an important role in air pollution control and smart city construction all over the world in the future.

Graphical Abstract

1. Introduction

Air is one of the most basic elements for human survival and good air quality is necessary for human health. Unfortunately, air pollution has become a global problem, which has aroused widespread concern from scholars, governments and the public. Some studies have found that exposure to air pollutants is associated with the occurrence of many diseases such as respiratory disease, cardiovascular disease and even cancer, contributing to as many as 4–9 million human deaths per year globally [1,2]. The situation in China is also grim. With the rapid development of industrialization and urbanization, more and more fossil fuels are being burned, which results in increasing emissions of sulphur, nitrogen and particulate matter, causing deteriorating air quality and frequent hazy weather. As the “Capital Economic Circle” and future world-class urban agglomeration, influenced by adverse geographical and meteorological conditions along with industrial structure, the Jing-Jin-Ji region has become one of the most heavily polluted areas, with frequent long duration, wide range and severe degree regional pollution events. To solve this serious problem, researchers have done a lot of work, including air pollutant prediction and air quality evaluation.
Numerous forecasting models have been proposed, mainly for pollutant concentration. According to their principles, these forecasting models can be divided into three categories: statistic forecasting models, numerical forecasting models and machine learning models.
Statistic forecasting models have been widely used in air quality forecasting from the early days because of their simplicity and rapidity, and they still have value in application and research up to now. They can predict pollutant concentrations in the future only by studying the relationship between pollutant concentration and meteorological factors from past records without information about pollution sources. Common statistical models include the multiple linear regression model (MLR) [3], autoregressive integrated moving average model (ARIMA) [4], grey model (GM) and Markov model. For example, Elbayoumi et al. [5] used MLR to predict the annual indoor concentrations of PM2.5 and PM10 by analyzing the meteorological variables (wind speed, temperature and relative humidity) collected from 12 natural ventilation systems. Jian et al. [6] used ARIMA to study the effects of meteorological factors on the concentrations of ultrafine particles and PM10 in Hangzhou under heavy traffic conditions. A first-order variable grey differential equation model was proposed by Pai et al. [7] to predict the hourly PM concentration in Banqiao, Taiwan. A Hidden Markov Model (HMMS) was used to predict daily average PM2.5 concentrations [8]. Although these statistic forecasting models (linear method) have been widely used in PM concentration prediction (non-linear process), their accuracy is largely limited by their linear mapping ability. Most of the air pollutant time series in the real world are non-linear and irregular, so statistic forecasting model may not be suitable for these data.
Since the 1990s, with the development of computer technology and the abundance of air pollution data, numerical forecasting models have been greatly developed and are currently in the third generation. Based on the idea of “One Atmosphere”, they realize two-way coupling between atmospheric dynamics and atmospheric chemistry which can simulate atmospheric physical and chemical processes on different scales and therefore predict the concentrations of different air pollutants [9]. Numerical forecasting model usually consist of meteorological modules, emission modules and chemical modules following this principle that weather or climate modules provide meteorological background fields which drive the chemical transport modules. At present, common numerical forecasting models include the U.S. Models-3 and WRF-Chem, Polyphemus from France as well as Nested Air Quality Prediction Modeling System (NAQPMS) from China [10,11,12]. Although numerical forecasting models are helpful to reveal the mechanism of pollution processes, their accuracy, especially in severe air pollution incidents, is greatly limited by some difficulties such as inaccurate atmospheric boundary layer simulation schemes, insufficient emission inventory of pollution sources and limited knowledge of atmospheric physical and chemical process. Furthermore, they require a lot of computing time.
Machine learning belongs to the field of artificial intelligence. The arrival of the big data era has brought unprecedented opportunities for the development of machine learning. Machine learning has excellent performance in regression and classification problems, and it is usually recognized as one of the most powerful tools in pollutant prediction for its high robustness and fault tolerance. Therefore, there are increasingly studies on pollutant concentration prediction with machine learning models. For example, support vector machine (SVM) [13] and artificial neural network (ANN) [14] are commonly selected. Paschalidou et.al [15] used a radial basis function (RBF) and multilayer perceptron (MLP) to predict hourly concentrations of PM2.5 in Cyprus. Wu et al. [16] acquired predictions of PM10 concentrations using a general regression neural network (GRNN).
Pollutant concentration data are too abstract for the public to understand, and people are eager for simplified and intuitive information to quickly understand the state of ambient air, which means air quality evaluation is indispensable. When it comes to methods of air quality evaluation, the most commonly used method is the air quality index (AQI) originally proposed by the US Environmental Protection Agency (EPA). AQI is widely used worldwide, while the standards vary among countries. China’s standard comes from “Technical Regulation on Air Quality Index (on trial) (HJ 633-2012)” issued by the Ministry of Environmental Protection. It considers a variety of pollutants including PM2.5, PM10, NO2, SO2, CO, O3. However, as with all environmental quality evaluations, there are ambiguities in air quality evaluation due to the vagueness of evaluation factors, criteria and objects, etc., which makes it difficult to justify the use of sharp boundaries in classification schemes, so the air quality index method has some limitations, for example, a slight increase or decrease of pollutant data near a boundary value will change the evaluation level. Such fuzziness has led many researchers to seek advanced evaluation methods [17], for instance, fuzzy mathematics. Fuzzy mathematics is proved to be a useful tool for air quality evaluation [18,19], and many air quality indicators based on fuzziness [20,21,22,23] are proposed.
Individual prediction or evaluation is not enough to help us cope with air pollution, so an integrated and complete system is expected to play a greater value. Some early-warning systems including prediction and evaluation have been gradually proposed. The problem of air pollution in China has attracted increasing attention, but there are relatively few in-depth and targeted studies in air quality early warning based on artificial intelligence. Consideration of pollutants which affect air quality should be as comprehensive as possible, but some studies only focus on single pollutant, mainly PM. Although the selection of experimental sites is of importance, some scholars don’t give sufficient reasons such as purpose and significance for their choices. The selection of algorithms and pollutant concentration limits in air quality evaluation also remain to be discussed. Therefore, developing an accurate and robust air quality early-warning system has become an urgent need of society. It is hoped to provide not only air quality information comprehensively and objectively, but also necessary preventive measures for citizens to avoid hazards, and even help relevant departments to better control air pollution and minimize negative impacts.
Based on the above analyses, this paper proposes a novel air quality early-warning system composed of prediction and evaluation. The prediction part took advantage of advanced improved complete ensemble empirical mode decomposition with adaptive noise (ICEEMDAN) and combined whale optimization algorithm (WOA) with extreme learning machine (ELM). The three methods have been proved to be effective in air pollutant forecasting [24,25,26]. Fuzzy comprehensive evaluation (FCE) based on fuzzy mathematics was conducted subsequently.
Generally speaking, the contributions of this paper are as follows:
  • A complete air quality early-warning system was established and achieved good results in the Jing-Jin-Ji region where air pollution problems are of great concern.
  • A novel hybrid prediction model ICEEMDAN-WOA-ELM was proposed for the main air pollutants in Beijing, Tianjin and Shijiazhuang. ICEEMDAN and WOA are confirmed to greatly improve the prediction ability of ELM through comparison.
  • The predictive results can be transformed into corresponding air quality levels by fuzzy comprehensive evaluation, which means citizens without professional knowledge of atmospheric science can easily understand the current air quality and get scientific advices to avoid air pollution.
  • The air quality early-warning system is feasible and practical in air pollution treatment, which can not only protect the public from air pollution but also offer services for government decision-making on environmental protection.
The rest of this paper is organized as follows: Section 2 briefly introduces the methodologies adopted in this paper. Empirical research is given in Section 3, along with the description of experiment sites, data, evaluation criteria and so forth. Section 4 gives the conclusions.

2. Methodology

2.1. The Proposed Air Quality Early-Warning System

In this section, the air quality early-warning system whose core is the hybrid ICEEMDAN-WOA-ELM-FCE model is introduced in detail. The flow diagram consisted of four steps, presented in Figure 1.
  • Step 1: Pollutant concentration data are usually chaotic time series, requiring denoising technology to eliminate the influences of outliers and improve the prediction accuracy. ICEEMDAN is used to process the original data into several IMFs from high frequency to low frequency, which contain different characteristics of the original data.
  • Step 2: The ELM optimized by WOA is applied to build a predictor for each IMF. The WOA algorithm is used to obtain the best parameters of ELM to establish a forecasting model which is not only fast but also accurate. All the predictive results of IMFs are synthesized and the final predictive result is obtained. The optimized ELM model is used to forecast the concentrations of six major air pollutants in Beijing, Tianjin and Shijiazhuang, which will be the key information for the evaluation model.
  • Step 3: Fuzzy comprehensive evaluation can convert the predictive results into air quality levels scientifically and objectively, providing crucial information for further research and analysis.
  • Step 4: The air quality information can be applied to guide people’s daily lives. Different colors are assigned to different levels, so air quality information can be easily understood. In addition, brief but practical guidance corresponding to levels can be offered to the public against air pollution. Scientific and precise results also serve the government decision-making on environmental protection. Generally, the proposed air quality early-warning system will play a key role in future air pollution prevention.
In this section, all individual methods belonging to the air quality early-warning system are described in detail, including ICEEMDAN, WOA, ELM and FCE.

2.2. Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (ICEEMDAN)

The empirical mode decomposition (EMD) [27] is a widely used method to analyze non-linear and non-stationary data. Compared with the traditional decomposition algorithm, Fourier transform or wavelet transform which are more applicable to stationary and linear data, EMD is adaptive and highly efficient. Original data can be expressed as a sum of intrinsic mode functions (IMFs) and a final monotonic trend by EMD, but oscillations may be produced with different scales in one mode or with same scale in different modes which called “mode mixing”. The ensemble empirical mode decomposition (EEMD) [28] is proposed to address this problem by adding Gaussian white noise to the original signal, but the added noise can’t be completely neutralized and different noisy copies of the signal may produce different number of modes. The complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) [29] provides accurate reconstruction of the original signal, better spectral separation of the mode and computational efficiency, achieving huge improvements on EEMD. Furtherly, the ICEEMDAN [30] improves some aspects of CEEMDAN involving residual noise, “spurious mode” and so forth, becoming the latest decomposition method of EMD family. In this study, considering the non-stationary and non-linear characteristics of pollutant concentration, ICEEEMDAN was used as a data preprocessing method to better dig out the rules behind the pollutant data and serve the prediction later. The main steps of ICCEMDAN are summarized as follows:
(1) Calculate local means of I realizations x ( i ) = x + β 0 E 1 ( w ( i ) ) by EMD to get the first residue r 1 = M ( x ( i ) ) , where w ( i ) ( i = 1 , , I ) is a realization of white Gaussian noise with zero mean unit variance, E k ( . ) is the operator that produces the k th mode obtained by EMD and M ( . ) is the operator that generates the local mean of the applied signal.
(2) Calculate the first mode d 1 ˇ = x r 1 at the first stage ( k = 1 ) .
(3) Calculate the second residue as the average of local means of the realizations r 1 + β 1 E 2 ( w ( i ) ) and define the second mode: d 2 ˜ = r 1 r 2 = r 1 M ( r 1 + β 1 E 2 ( w ( i ) ) ) .
(4) For k = 3 , K calculate the k th residue r k = M ( r k 1 + β k 1 E k ( w ( i ) ) ) .
(5) Calculate the k th mode d k ˜ = r k + 1 r k .
(6) Return to step 4 for next k .

2.3. Whale Optimization Algorithm (WOA)

Inspired by the bubble-net hunting strategy which corresponds to the social behavior of humpback whales, a nature-inspired meta-heuristic optimization algorithm called WOA [31] was proposed in 2016. Tested with 29 mathematical benchmark functions and six structural engineering problems in exploration, exploitation, local optima avoidance and convergence behavior, WOA was proved to be highly competitive compared to the state-of-art meta-heuristic algorithms as well as conventional methods. The mathematical model of WOA is illustrated as follows [31].

2.3.1. Encircling Prey

Humpback whales can identify and encircle the location of their prey. After defining the best search agent, other search agents will try to move to the best location. This behavior is expressed by the following mathematical formulas:
D = | C · X * ( t ) X ( t ) |
X ( t + 1 ) = X * ( t ) A · D
where t is the current iteration, X * is the best position, X denotes the position vector, · is an element-by element multiplication, and A and C are coefficient vectors which can be calculated by the following equations:
A = 2 a · r a
C = 2 · r
where r is a random vector between 0 and 1, and a is linearly reduced from 2 to 0 in the iteration process.

2.3.2. Bubble-Net Attacking Method (Exploitation Phase)

Humpback whales usually attack their prey using the bubble-net strategy and two approaches are designed:
(1) Shrinking encircling mechanism.
This behavior is realized by reducing the value of a in Equation (3). Setting random A values in [−1,1], the new position can be obtained between the original position and the current position of the best agent.
(2) Spiral updating position
A spiral equation is established between whales and prey to simulate the helix-shaped movements of humpback whales:
X ( t + 1 ) = D · e b l · cos ( 2 π l ) + X * ( t )
where D = | X * ( t ) X ( t ) | is the distance between the i th whale and the best position obtained so far, b is a constant to define the logarithmic spiral, l is a random number between −1 and 1, and · is an element-by-element multiplication. WOA assumes that there is a 50% probability of choosing shrinking encircling mechanism or the spiral model to update the position of whales in the optimization process. The algorithm is defined as follows:
X ( t + 1 ) = { X * ( t ) A · D i f   p < 0.5 D · e b l · cos ( 2 π l ) + X * ( t )   i f   p 0.5
where p is a random number between 0 and 1.

2.3.3. Search for Prey (Exploration Phase)

Humpback whales can randomly search for prey according to the position of each other. In the exploration phase, we can update the location of a search agent based on a randomly selected search agent, rather than the best search agent found so far. This mechanism emphasizes exploration, allowing the WOA algorithm to perform a global search. This mathematical model is expressed as follows:
D = | C · X r a n d X |
X ( t + 1 ) = X r a n d A · D
where X r a n d is a random location vector selected from the current population.
The WOA algorithm (Algorithm 1) starts with a set of random solutions. In each iteration, the search agent updates its location based on the randomly selected search agent or the best solution obtained so far. A random search agent is selected when | A | > 1, and the best solution is selected when | A | < 1. According to p value, WOA can switch between spiral and circular movement. The WOA algorithm is terminated when it satisfies the termination criterion. The pseudo code of the WOA algorithm is represented as follows:
Algorithm 1 WOA
Input: Maximum number of iterations I t e r M a x , Fitness function F i , Current iteration number t,
   A random number l between −1 and 1, A constant number b .
1: Initialize the whales population X i ( i = 1 , 2 , 3 , , n )
2: for each search agent do
3:  Calculate the fitness function F i
4: end for
5: X * = the best search agent
6: while t < I t e r M a x do
7:   for each search agent do
8:   Update a , A , C , l and p
9:   if p < 0.5 then
10:     if | A | < 1 then
11:      Update the position of search agent using Eq(2);
12:     elseIf | A | 1
13:      Select a random search agent X r a n d ;
14:      Update position of search agent using Eq(8);
15:     end if
16:    elseIf p > 0.5
17:     Update the position with spiral Eq(5);
18:    end if
19:  end for
20:  Check if any search agent goes beyond the search space and amend it;
21:  for each search agent do
22:    Calculate the fitness function F i
23:  end for
24:  Update X * if there is a better solution;
25:   t = t + 1
26: end while
27: return X *

2.4. Extreme Learning Machine (ELM)

ELM [32] is a simple and extremely fast learning algorithm of single-hidden layer feedforward neural networks (SLFN). ELM randomly assign input weights and hidden layer biases (thresholds) without adjustment in the training process, which leads to thousands of times faster than traditional feedforward network learning algorithms and better generalization performance in most artificial and real benchmark problems. The structure of single-hidden layer feedforward neural network is shown in Figure 2.
For N independent samples ( x i , t i ) , x i = [ x i 1 , x i 2 , , x i n ] T R n and t i = [ t i 1 , t i 2 , t i m ] T R m . SLFN can be expressed as [32]:
i = 1 L β i g ( w i · x j + b i ) = o j , j = 1 , 2 , , N
where w i = [ w i 1 , w i 2 , , w i n ] T is the weight vector between the input layer neurons and the i th hidden layer neuron, b i is the threshold of the i th hidden layer neuron, g ( x ) is the activation function, and β i = [ β i 1 , β i 2 , , β i m ] T is the weight vector between the i th hidden layer neuron and the output layer neurons. Formula (9) can be expressed as:
H β = T
where H is the output matrix of the hidden layer, β is the weight vector between the hidden layer neurons and the output layer neurons, T is the expected output of network, represented as follows [32]:
H = [ g ( w 1 · x 1 + b 1 ) g ( w L · x 1 + b L ) g ( w 1 · x N + b 1 ) g ( w L · x N + b L ) ] N × L
The number of required hidden layer neurons L N when activation function g is infinitely differentiable. Its solution is:
β ^ = H + T
where H + is the Moore-Penrose generalized inverse of H .
ELM can generate w and b randomly before training and calculate β only by determining L and g ( x ) . Generally, the ELM algorithm has the following steps:
(1) Determine the number of neurons L in the hidden layer, and randomly set the connection weight w between the input layer and the hidden layer and the threshold b of hidden layer neurons.
(2) An infinitely differentiable function g ( x ) is selected as the activation function of the hidden layer neurons, and then the output matrix H of the hidden layer is calculated.
(3) Calculate the weight of the output layer: β ^ = H + T .

2.5. Fuzzy Comprehensive Evaluation (FCE)

Environmental quality is a huge and ambiguous system with a large number of uncertain factors. Fuzzy mathematics [18] can effectively solve the influences of ambiguity of evaluation boundary and monitoring error on evaluation. Using membership function to represent air quality level can eliminate subjective and artificial factors in classification, objectively reflecting regional air quality. The concrete steps of fuzzy comprehensive evaluation are as follows:
(1) Establish the factor set
A factor set is a set of elements that affect the evaluation object, usually represented by U = { u 1 , u 2 , , u m } . It is well known that different pollutants can cause different hazards to human health, so these parameters should be treated separately. Therefore, six main pollutants are selected as air quality parameters in this project:
U = { u 1 , u 2 , , u 6 } = { P M 2.5 , P M 10 , N O 2 , S O 2 , C O , O 3 }
(2) Set up the evaluation set
Because this research is carried out in China, air pollutant concentration limits from “Technical Regulation on Air Quality Index (on trial) (HJ 633-2012)” of China have a reference value. On account of the lack of values of O3 (8 h) beyond the fifth level, we have a decision that the evaluation set comprises five levels: V = { v 1 , v 2 , v 5 } = { I ,   II ,   III ,   IV ,   V } and the corresponding air quality categories are “Excellent, Good, Moderate, Poor, Hazardous”. The air quality levels and corresponding concentration limits of different pollutants are given in Table 1.
(1) Establish fuzzy matrix
The fuzzy matrix can be expressed by the matrix R , where R i j is the membership degree of factor u i aiming at the comment v j :
R = ( r i j ) m × n = [ r 11 r 12 r 21 r 22 r 1 n r 2 n r m 1 r i 2 r m n ]
The membership function can calculate the membership degree of pollutant concentration to the evaluation grade. There are many membership functions such as halved trapezoidal distribution function, Gauss membership function, triangular membership function, etc. In this study, the halved trapezoidal distribution function [33] which has often been used in air quality evaluation is selected and details are presented as follows:
r i j = { ( 1 u i ( j + 1 ) x i ) / ( u i ( j + 1 ) u i j ) 0     x i u i j u i j < x i u i ( j + 1 ) x i > u i ( j + 1 )      j = 1 r i j = { ( x i u i ( j 1 ) ) / ( u i j u i ( j 1 ) ) ( u i ( j + 1 ) x i ) / ( u i ( j + 1 ) u i j ) 0     u i ( j 1 ) x i u i j u i j < x i u i ( j + 1 ) x i > u i ( j + 1 )      j = 2 , 3 , 4 r i j = { 0 ( x i u i ( j 1 ) ) / ( u i j u i ( j 1 ) ) 1     x i u i ( j 1 ) u i ( j 1 ) < x i u i j x i > u i j      j = 5
(2) Determine the factor weights
The weight of a factor is an index to measure the relative degree of a pollutant impact on air quality. The multi-scale weighting method is commonly used in the fuzzy evaluation of environment quality, therefore the weight of pollution factor can be obtained by Equation (16):
w i = [ u i k / ( 1 n j = 1 n v i j ) ] / i = 1 m [ u i k / ( 1 n j = 1 n v i j ) ]
(3) Evaluation result
By synthesizing the weight vector and the fuzzy matrix with the appropriate operator, the final result of the fuzzy comprehensive evaluation can be obtained. The Zadeh operator M ( , ) is commonly used as a solution, therefore it is adopted here:
B = W R = ( w 1 , w 2 , w m )   ( , )   [ r 11 r 12 r 21 r 22 r 1 n r 2 n r m 1 r m 2 r m n ]
According to the principle of maximum membership degree, the maximum value of B is the result of fuzzy comprehensive evaluation of air quality.

3. Experimental Results and Analysis

In this section, in order to evaluate the performance of proposed air quality early-warning system, three datasets from three cities (Beijing, Tianjin, Shijiazhuang) in China were used in case studies (the simple map of the study areas is displayed in Figure 3. The main reasons for the choice are: (1) Jing-Jin-Ji region is a Beijing-centered world-class urban agglomeration which has a developed economy and important strategic position. It covers 13 cities, 110 million people and 218,000 km2 of land area, so air pollution is really of concern here. (2) In this region, the heavy industrial structure, dense population and limited environmental capacity have led to frequent haze events which cause serious troubles to people’s normal life and social development. At the moment, how to balance economic development and environmental protection is urgent and it is hoped our system will be beneficial for air pollution control. (3) Influenced by meteorological conditions, pollutant emissions and transport, secondary transformation of particulate matter, synthetical effect of nature and human, the air pollution is extremely complex and prominent here. This problem not only seriously endangers human health and economic development, but also has impacts on climate and environment change. Therefore, relevant research conducted in this region is representative and referential for air pollution control of other metropolis in the world.

3.1. Dataset Description

Datasets used in this study were from the Ministry of Ecology Environment of China including daily concentration of six main air pollutants in three cities from 1 September 2016 to 30 September 2018. For missing data, the nearby mean was used as the missing data. The sample size of one pollutant in one city was 760, which were divided into training set (699) from 1 September 2016 to 31 July 2018 and testing set (61) from 1 August 2018 to 30 September 2018. The characteristics of the whole samples including maximum (Max), minimum (Min), mean (Mean) and standard deviation (Std.) are shown in Table 2.

3.2. Evaluation Criteria

To evaluate the performance of the proposed system in forecast, a set of four criteria [34] are applied: Mean absolute error (MAE), Root mean square error (RMSE), Mean absolute percentage error (MAPE) and Theil’s inequality coefficient (TIC). MAE reflects the difference between the predicted and actual value. RMSE reflects the extent of the difference between the predicted and actual values. MAPE is an index to measure the forecasting accuracy of a model in statistics. TIC is an indicator used to measure the predictive capability of a model. For all criteria, the smaller the value is, the better predictive performance the model has.
  • Mean absolute error (MAE):
    M A E = 1 N i = 1 N | F i ^ F i |
  • Root mean square error (RMSE):
    R M S E = 1 N i = 1 N ( F i ^ F i ) 2
  • Mean absolute percentage error (MAPE):
    M A P E = 1 N I = 1 N | F i ^ F i F i |
  • Theil’s inequality coefficient (TIC):
    T I C = 1 N i = 1 N ( F i ^ F i ) 2 1 N i = 1 N F i ^ 2 + 1 N i = 1 N F i 2
    where N is the number of data, F i ^ and F i are the predicted and actual value at time i , respectively.

3.3. Diebold-Mariano (D-M) Test

The Diebold-Mariano test [35] is a hypothesis test that is employed to evaluate the significance of the performance of proposed model compared with other models. The hypothesis test is defined as follows:
H 0 : E [ l ( ε n + t 1 ) ] = E [ l ( ε n + t 2 ) ]
H 1 : E [ l ( ε n + t 1 ) ] E [ l ( ε n + t 2 ) ]
where l is the loss function, ε n + t 1 and ε n + t 2 are the forecast errors of two forecasting models. Each forecast accuracy is evaluated by an appropriate loss function, and the commonly used loss function is the MAE (Equation (18)) function [36]. For given significance level, the null hypothesis indicates that there is no significant difference between proposed model and comparison model in predictive performance.
The statistical function of the DM test is as follows:
D M = t = 1 T ( l ( ε n + t 1 ) l ( ε n + t 2 ) ) / T s 2 / T s 2
where s 2 is the estimate of variance of D i = l ( ε n + t 1 ) l ( ε n + t 2 ) . The null hypothesis is that the two prediction models have the same predictive accuracy. The DM statistic converges to the standard normal distribution N ( 0 , 1 ) , and the null hypothesis will be rejected if | D M | > Z α / 2 . Z α / 2 denotes the critical z -value of the standard normal distribution, and α is the significance level.

3.4. Case Studies

In this paper, case studies were carried out to measure the performance of forecasting model. Single model and hybrid model including ARIMA [37], GRNN [38], ELM [26], GA-ELM, WOA-ELM and EEMD-WOA-ELM were used as benchmarks to assess the proposed hybrid model. The experiment was first conducted in Beijing to verify the predictive performance of the model in details, and then experiments in Tianjin and Shijiazhuang were used to prove universality. If the proposed model outperforms other models in all case studies, we can certainly draw the conclusion that the proposed model has not only high accuracy but also universal applicability in different environments. Meanwhile, the model was assessed by the statistical test based on DM test. Furthermore, the trial-and-error method was used to determine the best experimental parameters which are listed in Table 3.

3.4.1. Case Study One: Beijing

The daily concentrations of six air pollutants from 1 September 2016 to 31 September 2018 in Beijing were employed to verify the forecasting performance of the proposed hybrid model. Daily pollutant concentrations of two months from 1 August 2018 to 31 September 2018 were predicted and compared with actual data. Figure 4 shows the predictive results and Figure 5 shows the daily relative errors (relative error = (predicted value − actual value)/actual value). In addition, four performance indicators are calculated and given in Table 4. At the same time, Table 4 also shows the predictive effectiveness of ARIMA, GRNN, ELM, GA-ELM, WOA-ELM, EEMD-WOA-ELM and ICCEMDAN-WOA-ELM as comparison, and the bold values represent the best values for each criterion. It is evident that ICEEMDAN-WOA-ELM model has the most excellent performance among all models. Its predictive results are very closer to actual values than other models. Influenced by many factors, though relative errors of PM2.5 and PM10 are larger than that of other pollutants for the highly nonlinear and non-stationary characteristics, the proposed model is more satisfactory.
Based on the information in Figure 4, Figure 5 and Table 4, it is clear that the proposed model obtains the best results for all evaluation indicators. Therefore, we can conclude that the proposed ICEEMDAN-WOA-ELM model is superior to benchmark models in the prediction of air pollutant concentrations. More comparative analyses are presented as follows:
(1) As one of time series forecasting models, ARIMA is superior to single artificial intelligence models in accuracy. The four indexes (MAE, RMSE, MAPE, TIC) of ARIMA are almost better than those of single artificial intelligence models, which is attributed to the high volatility and irregularity of air pollutant concentration data. The results show that single artificial intelligence models cannot meet the requirements of air pollutant prediction which means it is urgent to develop a hybrid model to improve the predictive performance.
(2) From the comparison between ELM and GA-ELM as well as WOA-ELM, we can conclude that optimization algorithms can really help neural network model improve performance. The ELM optimized by GA or WOA provides better predictive results for six air pollutants. For example, in PM2.5 forecast, the MAE, RMSE, MAPE, TIC are 17.0523, 21.7668, 125.3109, 0.2562 and 16.1554, 21.1993, 116.7680, 0.2531 for GA-ELM and WOA-ELM respectively, with moderate improvements compared with 19.1736, 23.3620, 143.0254, 0.2694 of ELM. From the comparative indicators, we can also see that WOA has better optimization capability as it can avoid local optima and maintain fast convergence. Compared with other optimization algorithms, WOA not only is simple, flexible and effective, but also can achieve a balance between exploration and exploitation.
(3) It can be clearly seen that the data preprocessing algorithm has brought a great improvement to the neural network model. Compared with other models, EEMD-WOA-ELM and ICEEMDAN-WOA-ELM are so outstanding in prediction, which fully proves the concept “decomposition and integration” or “divide and conquer” to be effective for establishing a robust air pollutant prediction model. It is obvious that model with ICEEMDAN performs better than the counterpart with EEMD in any cases. For instance, four metrics are 14.1866, 17.2348, 13.9759, 0.0665 and 21.1571, 24.5963, 21.8809, 0.0938, respectively, for O3. The results show that the decomposition method can greatly reduce predictive errors of model, and moreover, ICEEMDAN is superior to other decomposition methods in data decomposition.
Through the above analyses, MAE, RMSE, MAPE and TIC were used to prove the proposed ICEEMDAN-WOA-ELM hybrid model is obviously superior to all considered benchmark models for its higher accuracy and stability. Compared with single model, all models based on “decomposition and integration” framework have better predictive effectiveness, which shows this framework can effectively improve the model performance. The proposed forecasting model fits all the data of air pollutants with high volatility and irregularity, so it is qualified as the prediction part of air quality early-warning system.

3.4.2. Case Study Two: Tianjin and Shijiazhuang

In order to verify the predictive and universal capabilities of the proposed model, the daily air pollutant concentration data of Tianjin and Shijiazhuang (from 1 September 2016 to 30 September 2018) were also used in case studies. The main purpose is to test the generality of model under different environments. The predictive results are shown in Figure 6, Figure 7, Figure 8 and Figure 9 and Table 5 and Table 6. In Table 5 and Table 6, bold values represent the best values for each criterion among all models. From predictive results, we can see at a glance that the proposed model has better predictive results and predicted values are much closer to real values.
These experiments lead to the same conclusion as case study one that the hybrid model ICEEMDAN-WOA-ELM is superior to all listed benchmark models. For example, for the PM10 forecast in Tianjin and Shijiazhuang, four metric values (MAE, RMSE, MAPE, TIC) of the proposed hybrid model are 3.9662, 5.3299, 7.8186, 0.0471 and 6.3997, 8.9303, 8.9955, 0.0613, respectively, which are much lower than that of other models, and the same is true for other pollutants. Overall, different denoising methods and optimization algorithms lead to the large gaps in predictive performance. Obviously, the model has great accuracy, stability, applicability and can be well adapted to various pollutants under different environments.
Four typical evaluation indicators (MAE, RMSE, MAPE and TIC) were used to measure the performance of all models. The hybrid model ICEEMDAN-WOA-ELM performs best for it has the greatest evaluation criteria. Through these experiments, we can reasonably draw following conclusions: data preprocessing algorithm and optimization algorithm can significantly improve the predictive performance of the model. The proposed model with excellent predictive performance will be the bedrock of establishing air quality early-warning system. In addition, for their universality, these methods can be combined with some basic models to meet the needs of other research fields.

3.4.3. Diebold-Mariano Test

In this section, Diebold-Mariano test was used to examine the effectiveness of the proposed hybrid model. DM test is employed to test under which circumstance an experiment will enable us to reject null hypothesis at a given significance level. The detailed description of DM test is presented in Section 3.3. The null hypothesis (Equation (22)) here is that there is no significant difference between the two models. Table 7 shows the DM test statistic value based on the MAE (Equation (18)) function. The DM values from all models are greater than the upper limits at the 1% significance level, which reflects that the proposed hybrid model significantly outperforms other comparison models.

3.5. Fuzzy Comprehensive Evaluation of Air Quality

In this section, predicted data of September 2018 were used for fuzzy comprehensive evaluation and further analysis. This work could visualize the predicted results of three cities. Limited by the length of paper, we had to take only the results of twenty days in September 2018 as examples which included evaluation results based on predicted and actual value as comparison to get the accuracy of the model in level forecast. Firstly, according to the methodology of fuzzy comprehensive evaluation described in Section 2.4, the evaluation set V = {I, II, III, IV, V} was established. Secondly, the membership degree of each factor to each evaluation level was calculated by the membership degree formula, and the fuzzy matrix R was established. Thirdly, the weight of pollution factor value calculated by multi-scale weighting method was an index to measure the relative degree of environmental hazards which greatly affected the evaluation result. Finally, according to the fuzzy matrix and weight index, the membership degree of evaluation level and air quality level were obtained. Evaluation results of Beijing are shown in Table 8. Taking the result of one day (01/09/2018) as an example, the probability of air quality as “I” is 0.3759, and the probability of “II”, “III”, “IV” and “V” are 0.3409, 0, 0 and 0 respectively. According to the principle of maximum membership degree, the comprehensive evaluation level of air quality should be “I” and the corresponding category is “Excellent”.
The consistency ratio of the two results in Beijing is 27/30 (90%), which not only shows the high accuracy of the proposed hybrid model in level forecast but also indirectly proves predicted concentration data are so accurate that they can fully satisfy the need of air quality early warning. Using the same algorithm, the fuzzy comprehensive evaluation of air quality for Tianjin and Shijiazhuang were conducted and the results are shown in Table 9 and Table 10. Overall, the evaluation results are basically same in these two cities. The consistency rates in Tianjin and Shijiazhuang are 26/30 (87%) and 30/30 (100%), respectively.
Therefore, the evaluation method can effectively link the pollutant concentration prediction with air quality early warning. Nevertheless, precise predictions of pollutant concentration and air quality level are not enough, because achievements of scientific research are expected to truly serve the society. Further work can be performed based on former research which means more intuitive air quality information can be released and public alarms can be issued. Therefore, an air pollution early-warning handbook was compiled and details are shown in Table 11. This work can not only guide people’s daily activities against air pollution but also provide decision-making support for government such as evaluate whether the air quality of a city meets the criteria or which temporary but mandatory measures should be taken to address potential air pollution problems.
Tips: six major air pollutants include PM2.5, PM10, NO2, SO2, CO and O3. It is necessary to know their characteristics:
  • PM2.5: namely fine particulate matter, particle size less than or equal to 2.5 μm, it has smaller size, larger area, stronger activity, easier to attach toxic and harmful substances, longer residence time and transportation distance in atmosphere which mean more harmful to human health and air quality than PM10, can enter bronchioles and alveoli causing cardiopulmonary disease and even lung cancer.
  • PM10: namely inhalable particulate matter, particle size less than or equal to 10 μm, can reduce the atmospheric visibility, enter upper respiratory tract causing respiratory disease.
  • NO2: rufous and irritating odor, can promote acid rain and ozone, damage respiratory tract.
  • SO2: colorless and irritating odor, can be oxidized into sulfuric acid mist (acid rain) or sulfate aerosol, cause respiratory diseases and cancer.
  • CO: colorless and tasteless, mainly from uncompleted combustion, cause suffocation even death.
  • O3: light blue with special odor, major constituent of photochemical smog, damage human mucosa and respiratory tract.

4. Conclusions

Air pollution is a long-standing problem that plagues the whole world, seriously harming human health, social development and natural environment. In order to solve this problem, a great deal of manpower and material resources have been invested, but unfortunately the results are not satisfactory enough. There is always a way and the rapid development of artificial intelligence in recent years has brought new hope for air pollution control. This proposed air quality early-warning system is hoped to play a key role in future for its accuracy and effectiveness. This system mainly consists of two parts: prediction model and evaluation model.
In order to establish the prediction model, ELM, which is famous for accuracy and robustness, was employed. Taking ELM as the core, a hybrid model ICEEMDAN-WOA-ELM was proposed. Firstly, according to the theory of “decomposition and integration”, the original time series of pollutant concentration were decomposed into IMFs by decomposition algorithm (ICEEMDAN). Secondly, the ELM optimized by WOA was used to predict each IMF. Finally, all the predictive results were combined to get the final predictive result. In this study, six main air pollutants PM2.5, PM10, NO2, SO2, CO and O3 in Beijing, Tianjin and Shijiazhuang were chosen. This proposed prediction model was used to predict air pollutant concentrations and compare with the six benchmark models including ARMA, GRNN, ELM, GA-ELM, WOA-ELM and EEMD-WOA-ELM. The simulation results showed that the proposed ICEEMDAN-WOA-ELM model was superior to other models and ICEEMDAN decomposition algorithm along with WOA optimization algorithm played important roles in improving the prediction accuracy of neural network.
In addition to prediction of air pollutant concentration, air quality evaluation was an indispensable part of the air early warning system. For the sake of understanding the future state of air, air quality was evaluated with the above predicted data by fuzzy comprehensive evaluation. The evaluation results were satisfactory enough compared with the actual status, which means our proposed evaluation model can meet the requirement of early warning. Furthermore, air pollution early-warning handbook was compiled to provide the public with intuitive air quality information and reasonable measures.
The combination of air pollutant prediction and air quality evaluation lays a solid foundation for the establishment and implementation of air quality early-warning system. The proposed system can offer us accurate air pollutant concentration prediction, correct air quality evaluation, reasonable countermeasures and scientific decision-making support, which means it will become a sharp weapon for air pollution control and even smart city construction in future.

Author Contributions

Software, Z.Q. and H.L.; validation, X.M. and H.L.; formal analysis, X.M.; investigation, X.M.; data curation, X.M. and H.L.; writing—original draft preparation, X.M. and H.L; writing—review and editing, X.M., H.L., Z.Q., and L.Z.; supervision, L.Z.; project administration, L.Z.; funding acquisition, L.Z.

Funding

This research was funded by the National Natural Science Foundation of China (grant number 91744311).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kan, H.; Li, X.; Jin, L. Air pollution: A global problem needs local fixes. Nature 2019, 570, 437–439. [Google Scholar]
  2. Gardiner, B. Choked: Life and Breath in the Age of Air Pollution; University of Chicago Press: Chicago, IL, USA, 2019. [Google Scholar]
  3. Zhang, H.; Zhang, W.; Palazoglu, A.; Sun, W. Prediction of ozone levels using a hidden markov model (HMM) with gamma distribution. Atmos. Environ. 2012, 62, 64–73. [Google Scholar] [CrossRef]
  4. Konovalov, I.B.; Beekmann, M.; Meleux, F.; Dutot, A.; Foret, G. Combining deterministic and statistical approaches for PM10 forecasting in europe. Atmos. Environ. 2009, 43, 6425–6434. [Google Scholar] [CrossRef]
  5. Maher Elbayoumi, M.; Ramli, N.A.; Yusof, N.F.F.M.; Yahaya, A.S.B.; Al Madhoun, W.; Ul-Saufie, A.Z. Multivariate methods for indoor PM10 and PM2.5 modelling in naturally ventilated schools buildings. Atmos. Environ. 2014, 94, 11–21. [Google Scholar] [CrossRef]
  6. Jian, L.; Zhao, Y.; Zhu, Y.-P.; Zhang, M.-B.; Bertolatti, D. An application of ARIMA model to predict submicron particle concentrations from meteorological factors at a busy roadside in Hangzhou, China. Sci. Total Environ. 2012, 426, 336–345. [Google Scholar] [CrossRef] [PubMed]
  7. Pai, T.-Y.; Ho, C.-L.; Chen, S.-W.; Lo, H.-M.; Sung, P.-J.; Lin, S.-W.; Lai, W.-J.; Tseng, S.-C.; Ciou, S.-P.; Kuo, J.-K.; et al. Using seven types of GM (1, 1) model to forecast hourly particulate matter concentration in Banciao city of Taiwan. Water Air Soil Pollut. 2011, 217, 25–33. [Google Scholar] [CrossRef]
  8. Sun, W.; Zhang, H.; Palazoglu, A.; Singh, A.; Zhang, W.; Liu, S. Prediction of 24-hour-average PM2.5 concentrations using a hidden Markov model with different emission distributions in Northern California. Sci. Total Environ. 2013, 443, 93–103. [Google Scholar] [CrossRef] [PubMed]
  9. Byun, D.; Schere, K.L. Review of the governing equations, computational algorithms, and other components of the Models-3 Community Multiscale Air Quality (CMAQ) modeling system. Appl. Mech. Rev. 2006, 59, 51–77. [Google Scholar] [CrossRef]
  10. Liu, S.; Zhu, C.; Tian, H.; Wang, Y.; Zhang, K.; Wu, B.; Liu, X.; Hao, Y.; Liu, W.; Bai, X.; et al. Spatiotemporal Variations of Ambient Concentrations of Trace Elements in a Highly Polluted Region of China. J. Geophys. Res. Atmos. 2019, 124, 4186–4202. [Google Scholar] [CrossRef]
  11. Yuan, T.; Chen, S.; Huang, J.; Zhang, X.; Luo, Y.; Ma, X.; Zhang, G. Sensitivity of simulating a dust storm over Central Asia to different dust schemes using the WRF-Chem model. Atmos. Environ. 2019, 207, 16–29. [Google Scholar] [CrossRef]
  12. Chen, X.; Yang, W.; Wang, Z.; Li, J.; Hu, M.; An, J.; Wu, Q.; Wang, Z.; Chen, H.; Wei, Y.; et al. Improving new particle formation simulation by coupling a volatility-basis set (VBS) organic aerosol module in NAQPMS+ APM. Atmos. Environ. 2019, 204, 1–11. [Google Scholar] [CrossRef]
  13. Sánchez, A.S.; Nieto, P.J.G.; Fernández, P.R.; del Coz Díaz, J.J.; Iglesias-Rodríguez, F.J. Application of an SVM-based regression model to the air quality study at local scale in the Avilés urban area (Spain). Math. Comput. Model. 2011, 54, 1453–1466. [Google Scholar] [CrossRef]
  14. Metaxiotis, K.; Kagiannas, A.; Askounis, D.; Psarras, J. Artificial intelligence in short term electric load forecasting: A state-of-the-art survey for the researcher. Energy Convers. Manag. 2003, 44, 1525–1534. [Google Scholar] [CrossRef]
  15. Paschalidou, A.K.; Karakitsios, S.; Kleanthous, S.; Kassomenos, P.A. Forecasting hourly PM10 concentration in Cyprus through artificial neural networks and multiple regression models: Implications to local environmental management. Environ. Sci. Pollut. Res. 2011, 18, 316–327. [Google Scholar] [CrossRef] [PubMed]
  16. Wu, S.; Feng, Q.; Du, Y.; Li, X. Artificial Neural Network Models for Daily PM10 Air Pollution Index Prediction in the Urban Area of Wuhan, China. Environ. Eng. Sci. 2011, 28, 357–363. [Google Scholar] [CrossRef]
  17. Fisher, B. Fuzzy environmental decision-making: Applications to air pollution. Atmos. Environ. 2003, 37, 1865–1877. [Google Scholar] [CrossRef]
  18. Klir, G.J.; Yuan, B. Fuzzy Sets and Fuzzy Logic: Theory and Applications; Prentice Hall: Upper Saddle River, NJ, USA, 1995; p. 563. [Google Scholar]
  19. Akkaya, G.; Turanoğlu, B.; Öztaş, S. An integrated fuzzy AHP and fuzzy MOORA approach to the problem of industrial engineering sector choosing. Expert Syst. Appl. 2015, 42, 9565–9573. [Google Scholar] [CrossRef]
  20. Yang, T.; Yang, X. Fuzzy comprehensive assessment, fuzzy clustering analysis and its application for urban traffic environment quality evaluation. Transp. Res. Part. D Transp. Environ. 1998, 3, 51–57. [Google Scholar]
  21. Chang, N.-B.; Chen, H.-W.; Ning, S.-K. Identification of river water quality using the Fuzzy Synthetic Evaluation approach. J. Environ. Manag. 2001, 63, 293–305. [Google Scholar] [CrossRef] [Green Version]
  22. Debnath, J.; Majumder, D.; Biswas, A. Air quality assessment using weighted interval type-2 fuzzy inference system. Ecol. Inform. 2018, 46, 133–146. [Google Scholar] [CrossRef]
  23. Bai, L.; Wang, J.; Ma, X.; Lu, H. Air Pollution Forecasts: An Overview. Int. J. Environ. Res. Public Health 2018, 15, 780. [Google Scholar] [CrossRef] [PubMed]
  24. Wang, D.; Wei, S.; Luo, H.; Yue, C.; Grunder, O. A novel hybrid model for air quality index forecasting based on two-phase decomposition technique and modified extreme learning machine. Sci. Total Environ. 2017, 580, 719–733. [Google Scholar] [CrossRef] [PubMed]
  25. Zhao, F.; Li, W. A Combined Model Based on Feature Selection and WOA for PM2.5 Concentration Forecasting. Atmosphere 2019, 10, 223. [Google Scholar] [CrossRef]
  26. Li, Y.; Jiang, P.; She, Q.; Lin, G. Research on air pollutant concentration prediction method based on self-adaptive neuro-fuzzy weighted extreme learning machine. Environ. Pollut. 2018, 241, 1115–1127. [Google Scholar] [CrossRef] [PubMed]
  27. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.-C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 1998, 454, 903–995. [Google Scholar] [CrossRef]
  28. Wu, Z.; Huang, N.E. Ensemble empirical mode decomposition: A noise-assisted data analysis method. Adv. Adapt. Data Anal. 2009, 1, 1–41. [Google Scholar] [CrossRef]
  29. Yeh, J.-R.; Shieh, J.-S.; Huang, N.E. Complementary ensemble empirical mode decomposition: A novel noise enhanced data analysis method. Adv. Adapt. Data Anal. 2010, 2, 135–156. [Google Scholar] [CrossRef]
  30. Colominas, M.A.; Schlotthauer, G.; Torres, M.E. Improved complete ensemble EMD: A suitable tool for biomedical signal processing. Biomed. Signal. Process. Control. 2014, 14, 19–29. [Google Scholar] [CrossRef]
  31. Mirjalili, S.; Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 2016, 95, 51–67. [Google Scholar] [CrossRef]
  32. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
  33. Liu, L.; Zhou, J.; An, X.; Zhang, Y.; Yang, L. Using fuzzy theory and information entropy for water quality assessment in Three Gorges region, China. Expert Syst. Appl. 2010, 37, 2517–2521. [Google Scholar] [CrossRef]
  34. Luo, H.; Wang, D.; Yue, C.; Liu, Y.; Guo, H. Research and application of a novel hybrid decomposition-ensemble learning paradigm with error correction for daily PM10 forecasting. Atmos. Res. 2017, 201, 34–45. [Google Scholar] [CrossRef]
  35. Diebold, F.X.; Mariano, R. Comparing predictive accuracy. J. Bus. Econ. Statist. 1995, 13, 134–144. [Google Scholar]
  36. Xiao, L.; Wang, J.; Hou, R.; Wu, J. A combined model based on data pre-analysis and weight coefficients optimization for electrical load forecasting. Energy 2015, 82, 524–549. [Google Scholar] [CrossRef]
  37. Rahman, N.H.A.; Lee, M.H.; Latif, M.T. Artificial neural networks and fuzzy time series forecasting: An application to air quality. Qual. Quant. 2015, 49, 2633–2647. [Google Scholar] [CrossRef]
  38. Zhou, Q.; Jiang, H.; Wang, J.; Zhou, J. A hybrid model for PM2.5 forecasting based on ensemble empirical mode decomposition and a general regression neural network. Sci. Total Environ. 2014, 496, 264–274. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flow diagram of the proposed system.
Figure 1. Flow diagram of the proposed system.
Ijerph 16 03505 g001
Figure 2. Single hidden layer feedforward neural network.
Figure 2. Single hidden layer feedforward neural network.
Ijerph 16 03505 g002
Figure 3. Locations and climatic conditions of the study areas.
Figure 3. Locations and climatic conditions of the study areas.
Ijerph 16 03505 g003
Figure 4. Predictive results (Beijing).
Figure 4. Predictive results (Beijing).
Ijerph 16 03505 g004
Figure 5. Daily relative error (Beijing).
Figure 5. Daily relative error (Beijing).
Ijerph 16 03505 g005
Figure 6. Predictive effectiveness (Tianjin).
Figure 6. Predictive effectiveness (Tianjin).
Ijerph 16 03505 g006
Figure 7. Daily relative error (Tianjin).
Figure 7. Daily relative error (Tianjin).
Ijerph 16 03505 g007
Figure 8. Predictive results (Shijiazhuang).
Figure 8. Predictive results (Shijiazhuang).
Ijerph 16 03505 g008
Figure 9. Daily relative error (Shijiazhuang).
Figure 9. Daily relative error (Shijiazhuang).
Ijerph 16 03505 g009
Table 1. The air quality level and corresponding concentration limit (units: μg/m3, CO (mg/m3)).
Table 1. The air quality level and corresponding concentration limit (units: μg/m3, CO (mg/m3)).
LevelCategoryPM2.5PM10NO2NO2NO2O3
IExcellent355040502100
IIGood75150801504160
IIIModerate11525018047514215
IVPoor15035028080024265
VHazardous250420565160036800
Table 2. The statistical properties of air pollutant concentration.
Table 2. The statistical properties of air pollutant concentration.
CityPollutant Concentration ((μg/m3), CO (mg/m3))
IndicatorPM2.5PM10NO2SO2COO3
BeijingMax454840155848311
Min57720.23
Mean61.289.845.57.21.098.4
Std.57.572.822.27.20.863.3
TianjinMax290931132899282
Min8111420.33
Mean62.397.348.615.31.3105.6
Std.47.868.321.611.50.861.5
ShijiazhuangMax62187018315310297
Min12221350.36
Mean91.5160.853.131.41.5106.1
Std.82.0118.024.224.11.168.2
Table 3. Experimental parameters.
Table 3. Experimental parameters.
ParameterPM2.5PM10NO2SO2COO3
Input variable448383
Number of search agents101010101010
MaxIter of WOA200200200200200200
MaxIter of ICEEMDAN100010001000100010001000
MaxIter: maximum iteration; WOA: Whale Optimization Algorithm; ICEEMDAN: Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise.
Table 4. Predictive effectiveness (Beijing).
Table 4. Predictive effectiveness (Beijing).
PollutantCriterionARIMAGRNNELMGA-ELMWOA-ELMEEMD-WOA-ELMICEEMDAN-WOA-ELM
PM2.5MAE14.600319.583819.173617.052316.155411.20745.9228
RMSE18.560523.738223.362021.766821.199315.08017.7563
MAPE89.4611155.6999143.0254125.3109116.768060.340328.8783
TIC0.24450.27720.26940.25620.25310.20190.1038
PM10MAE17.059125.078820.877319.344918.078311.73586.8349
RMSE21.479529.088026.448325.297923.690414.40548.6609
MAPE47.618283.189467.827963.452457.300430.258516.8055
TIC0.20110.23640.21690.21070.20320.13750.0830
NO2MAE0.75941.44531.37931.07070.76430.47340.2982
RMSE0.96771.56971.52071.34471.02220.58260.3717
MAPE27.788860.480556.831939.468928.033216.14349.9484
TIC0.15800.21640.20980.19980.16170.10060.0631
SO2MAE6.686411.81169.95587.66647.35615.66063.2131
RMSE8.630213.411611.39119.57998.92387.84104.1274
MAPE22.129346.298738.742328.575627.134118.409410.7605
TIC0.13620.18180.15960.14050.13150.12030.0632
COMAE0.18560.26480.24650.21050.19520.09720.0740
RMSE0.23900.29900.29160.24760.24040.11670.0915
MAPE30.89450.839850.326337.566534.680515.349010.3210
TIC0.14990.17700.17420.15090.14740.07370.0586
O3MAE0.18560.26480.24650.21050.19520.09720.0740
RMSE0.23900.29900.29160.24760.24040.11670.0915
MAPE30.89450.839850.326337.566534.680515.349010.3210
TIC0.14990.17700.17420.15090.14740.07370.0586
ARIMA: Autoregressive Integrated Moving Average; GRNN: Generalized Regression Neural Network; ELM: Extreme Learning Machine; GA: Genetic Algorithm; WOA: Whale Optimization Algorithm; EEMD: Ensemble Empirical Mode Decomposition; ICEEMDAN: improved complete ensemble empirical mode decomposition with adaptive noise; MAE: Mean absolute error; RMSE: Root mean square error; MAPE: Mean absolute percentage error; TIC: Theil’s inequality coefficient. Bold values represent the best values for each criterion among all models.
Table 5. Predictive effectiveness (Tianjin).
Table 5. Predictive effectiveness (Tianjin).
PollutantCriterionARIMAGRNNELMGA-ELMWOA-ELMEEMD-WOA-ELMICCEMDAN-WOA-ELM
PM2.5MAE11.139523.960018.341215.665313.18649.40363.6645
RMSE14.045926.742221.081818.233816.487712.60584.6569
MAPE54.9751136.5288106.161388.489972.350146.523714.3900
TIC0.20050.29820.25460.22760.21310.17430.0669
PM10MAE12.767324.989820.474217.883716.36967.45943.9662
RMSE15.904328.580224.167122.388020.54789.53075.3299
MAPE30.355265.063552.641645.189042.639814.66177.8186
TIC0.13880.21060.18370.17330.16240.08440.0471
NO2MAE2.18362.59982.52022.31062.17311.25110.6417
RMSE2.74343.14923.00932.83002.06741.64990.7914
MAPE30.111944.156142.945037.742535.500618.37439.2402
TIC0.17360.17650.16810.16250.15280.09790.0478
SO2MAE6.39058.56737.87097.80247.56935.00083.4196
RMSE8.642710.96839.73839.82999.39286.36934.3845
MAPE19.400027.138825.862825.222925.243715.865010.9658
TIC0.12680.15400.13400.13640.12890.08940.0619
COMAE0.16900.21240.18620.18300.18320.10710.0613
RMSE0.21270.26450.23210.22870.22440.12560.0834
MAPE19.559427.964023.866122.698722.491212.06266.4375
TIC0.10510.12350.11050.10900.10780.06240.0413
O3MAE36.985545.53137.697636.309333.806723.406616.9133
RMSE46.963556.358948.652348.388543.763229.564922.2439
MAPE39.876742.499239.766938.329934.763621.414714.7261
TIC0.16060.19990.16880.16890.15310.10290.0779
Bold values represent the best values for each criterion among all models.
Table 6. Predictive effectiveness (Shijiazhuang).
Table 6. Predictive effectiveness (Shijiazhuang).
PollutantCriterionARIMAGRNNELMGA-ELMWOA-ELMEEMD-WOA-LMICCEMDAN-WOA-ELM
PM2.5MAE10.485921.716421.492112.761512.82176.72213.6732
RMSE15.327224.755124.650316.963316.83408.84845.5817
MAPE30.353880.513074.768741.251639.636720.41929.6500
TIC0.19440.25020.25140.19520.19560.10960.0708
PM10MAE16.077532.369531.674123.559722.387210.08356.3997
RMSE23.452035.656535.811929.345927.543413.75318.9303
MAPE25.040457.278354.053339.177437.260214.73428.9955
TIC0.16010.20500.20530.17720.16890.09390.0613
NO2MAE2.55054.23603.71393.61672.55861.43410.8704
RMSE3.32754.74594.31044.23343.34931.74691.0678
MAPE20.217540.347935.738134.704021.725411.66196.8435
TIC0.12630.15750.14640.14450.12120.06470.0402
SO2MAE7.01098.40428.05166.92426.65804.25022.9950
RMSE8.570510.14759.60718.50938.39865.01933.3823
MAPE21.164331.520328.583524.044523.385713.75259.2431
TIC0.11700.12790.12270.11110.10870.06620.0448
COMAE0.15370.23200.17700.17160.16330.09920.0469
RMSE0.21320.26210.23400.23840.22310.14770.0606
MAPE18.915630.001323.516322.459721.493213.69735.8451
TIC0.12550.14210.12890.13300.12510.08360.0351
O3MAE33.259836.619234.48633.133632.271718.017113.3396
RMSE41.693344.607342.28941.240140.119922.460917.9202
MAPE36.729535.148636.667135.913334.224018.264912.8219
TIC0.15310.16630.15700.15080.14850.08420.0675
Bold values represent the best values for each criterion among all models.
Table 7. Diebold-Mariano test of seven models.
Table 7. Diebold-Mariano test of seven models.
ModelDM Value
ARIMA4.490165 *
GRNN6.575409 *
ELM6.162329 *
GA-ELM5.093331 *
WOA-ELM4.742791 *
EEMD-WOA-ELM3.587006 *
ICEEMDAN-WOA-ELM-
* Denotes the 1% significance level.
Table 8. Air quality evaluation results of Beijing.
Table 8. Air quality evaluation results of Beijing.
DatePredicted ValueActual Value
IIIIIIIVVLevelIIIIIIIVVLevel
2018/9/10.37590.3409000I0.28790.3823000II
2018/9/20.34200.0630000I0.38050.2167000I
2018/9/30.39960000I0.35300000I
2018/9/40.55860.1814000I0.50550000I
2018/9/50.52500.1205000I0.48390.2333000I
2018/9/60.47110000I0.41050000I
2018/9/70.50590000I0.51250000I
2018/9/80.48550000I0.46750000I
2018/9/90.40090.4009000I0.39620.3962000I
2018/9/100.33670.3367000I0.30670.3067000I
2018/9/110.33630.3363000I0.33770.3377000I
2018/9/120.30940.33980.003500II0.27440.38850.072700II
2018/9/130.18920.36490.223100II0.21230.34180.181800II
2018/9/140.25790.38740.262000II0.21740.37500.389400III
2018/9/150.34590.1011000I0.38400000I
2018/9/160.46000000I0.45320000I
2018/9/170.30940.0540000I0.32670.0250000I
2018/9/180.26730.2636000I0.27220.2500000I
2018/9/190.28670.2427000I0.27550.2749000I
2018/9/200.31290.0907000I0.29540.2954000I
Table 9. Air quality evaluation results of Tianjin.
Table 9. Air quality evaluation results of Tianjin.
DatePredicted ValueActual Value
IIIIIIIVVLevelIIIIIIIVVLevel
2018/9/10.37420.4024000II0.38730.3000000I
2018/9/20.34290.2484000I0.33690.2500000I
2018/9/30.38990.3899000I0.32860.0833000I
2018/9/40.41650.4944000II0.49330.1167000I
2018/9/50.43250.4325000I0.41600.4160000I
2018/9/60.38550.0981000I0.34480.1900000I
2018/9/70.41380000I0.43720000I
2018/9/80.39910000I0.39700000I
2018/9/90.41370.3976000I0.38330.4446000II
2018/9/100.21170.4074000II0.20500.41090.127300II
2018/9/110.21550.41490.194900II0.20900.43660.272700II
2018/9/120.23290.39030.288900II0.24230.37960.109100II
2018/9/130.27480.35100.349500II0.26240.34270.309100II
2018/9/140.31180.3118000I0.21740.32610.072700II
2018/9/150.30530.0750000I0.31110.0800000I
2018/9/160.27260.1214000I0.33420.0750000I
2018/9/170.24360.2089000I0.28070.2512000I
2018/9/180.33210.3321000I0.29240.2250000I
2018/9/190.32630.4014000II0.18780.4159000II
2018/9/200.32110.2638000I0.30700.2000000I
Table 10. Air quality evaluation results of Shijiazhuang.
Table 10. Air quality evaluation results of Shijiazhuang.
DatePredicted ValueActual Value
IIIIIIIVVLevelIIIIIIIVVLevel
2018/9/10.37600.3359000I0.40520.4000000I
2018/9/20.21130.4302000II0.23260.42480.200000II
2018/9/30.36020.1608000I0.37490.2000000I
2018/9/40.38980.0818000I0.40830.0500000I
2018/9/50.34640.0377000I0.34530.1833000I
2018/9/60.33770.1469000I0.31750.2300000I
2018/9/70.39800000I0.40720000I
2018/9/80.40980.2360000I0.38830.2333000I
2018/9/90.29820.3537000II0.25000.3614000II
2018/9/100.26020.3283000II0.26540.32430.036400II
2018/9/110.27290.3297000II0.28210.3100000II
2018/9/120.31500.3306000II0.26100.3448000II
2018/9/130.27040.2704000I0.25430.2400000I
2018/9/140.22240.3135000II0.18210.32270.175000II
2018/9/150.24770.3391000II0.32450.3619000II
2018/9/160.28300.2830000I0.28420.2842000I
2018/9/170.28580.2087000I0.29980.2998000I
2018/9/180.30470.2141000I0.26220.2168000I
2018/9/190.31700.2195000I0.32910.2131000I
2018/9/200.25970.2129000I0.24510.2197000I
Table 11. Air pollution early-warning handbook.
Table 11. Air pollution early-warning handbook.
LevelCategoryColorConditionMeasure
Iexcellentgreensatisfactory air qualityOutdoor activities are suitable for all people.
IIgoodblueacceptable air qualityThe very few abnormally sensitive people should reduce outdoor activities.
IIImoderateyellowmild pollution is unhealthy to sensitive peopleSensitive people including children, the elderly and patients with respiratory tract, cardiovascular and cerebrovascular diseases should reduce outdoor activities. Public transportation is recommended for travel.
IVpoorredmoderate pollution is unhealthy to all peopleSensitive people should avoid outdoor activities which also need to be reduced by general people. Prefer public transportation and reduce construction and traffic dust.
Vhazardouspurpleheavy pollution is hazardous to all peopleBesides above measures, road flushing and cleaning, suspension of large-scale open-air activities, outdoor personnel wear masks are all needed.

Share and Cite

MDPI and ACS Style

Mo, X.; Zhang, L.; Li, H.; Qu, Z. A Novel Air Quality Early-Warning System Based on Artificial Intelligence. Int. J. Environ. Res. Public Health 2019, 16, 3505. https://doi.org/10.3390/ijerph16193505

AMA Style

Mo X, Zhang L, Li H, Qu Z. A Novel Air Quality Early-Warning System Based on Artificial Intelligence. International Journal of Environmental Research and Public Health. 2019; 16(19):3505. https://doi.org/10.3390/ijerph16193505

Chicago/Turabian Style

Mo, Xinyue, Lei Zhang, Huan Li, and Zongxi Qu. 2019. "A Novel Air Quality Early-Warning System Based on Artificial Intelligence" International Journal of Environmental Research and Public Health 16, no. 19: 3505. https://doi.org/10.3390/ijerph16193505

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop