Next Article in Journal
Codesign and Feasibility Testing of a Tool to Evaluate Overweight and Obesity Apps
Next Article in Special Issue
Particulate Matter Emission Factors for Dairy Facilities and Cattle Feedlots during Summertime in Texas
Previous Article in Journal
SARS-CoV-2 Circulation in the School Setting: A Systematic Review and Meta-Analysis
Previous Article in Special Issue
Living Lab Experience in Turin: Lifestyles and Exposure to Black Carbon
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling and Predicting Pulmonary Tuberculosis Incidence and Its Association with Air Pollution and Meteorological Factors Using an ARIMAX Model: An Ecological Study in Ningbo of China

1
School of Medicine, Ningbo University, 818 Fenghua Road, Ningbo 315211, China
2
Center for Health Economics, School of Economics, University of Nottingham Ningbo China, 199 Taikang East Road, Ningbo 315100, China
3
Institute of Tuberculosis Prevention and Control, Ningbo Municipal Center for Disease Control and Prevention, 237 Yongfeng Road, Ningbo 315010, China
4
Department of Occupational and Environmental Health Sciences, School of Public Health, Peking University, 38 Xueyuan Road, Beijing 100191, China
5
Institute of Occupational Health and Radiation Protection, Zhejiang Provincial Center for Disease Control and Prevention, 3399 Binshen Road, Hangzhou 310051, China
*
Authors to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(9), 5385; https://doi.org/10.3390/ijerph19095385
Submission received: 20 March 2022 / Revised: 23 April 2022 / Accepted: 25 April 2022 / Published: 28 April 2022

Abstract

:
The autoregressive integrated moving average with exogenous regressors (ARIMAX) modeling studies of pulmonary tuberculosis (PTB) are still rare. This study aims to explore whether incorporating air pollution and meteorological factors can improve the performance of a time series model in predicting PTB. We collected the monthly incidence of PTB, records of six air pollutants and six meteorological factors in Ningbo of China from January 2015 to December 2019. Then, we constructed the ARIMA, univariate ARIMAX, and multivariate ARIMAX models. The ARIMAX model incorporated ambient factors, while the ARIMA model did not. After prewhitening, the cross-correlation analysis showed that PTB incidence was related to air pollution and meteorological factors with a lag effect. Air pollution and meteorological factors also had a correlation. We found that the multivariate ARIMAX model incorporating both the ozone with 0-month lag and the atmospheric pressure with 11-month lag had the best performance for predicting the incidence of PTB in 2019, with the lowest fitted mean absolute percentage error (MAPE) of 2.9097% and test MAPE of 9.2643%. However, ARIMAX has limited improvement in prediction accuracy compared with the ARIMA model. Our study also suggests the role of protecting the environment and reducing pollutants in controlling PTB and other infectious diseases.

1. Introduction

Tuberculosis (TB) is a chronic infectious disease caused by Mycobacterium tuberculosis, and it often affects the lungs. WHO proposed an End TB Strategy in 2014, with the targets to reduce TB deaths by 95% and to cut incident cases by 90% between 2015 and 2035 [1]. The cumulative reduction in TB incidence from 2015 to 2020 was 11%, just over half-way to the 2020 milestone of the strategy [2]. To achieve this ambitious goal, accurate prediction of disease trends, as well as related factors, is of great importance.
The autoregressive integrated moving average (ARIMA) model, also known as the Box–Jenkins model, is a commonly used model in time series analysis. Despite its effectiveness to study time series, the ARIMA model applies only to one variable. The autoregressive integrated moving average with exogenous regressors (ARIMAX) model, however, adds other variables related to the target series as input variables to improve the prediction accuracy. Unlike the ARIMA model, the autoregressive integrated moving average with exogenous regressors (ARIMAX) model adds other variables related to the target series as input variables to improve the prediction accuracy. Several ARIMAX studies suggested that weather parameters such as temperature, rainfall, humidity, wind speed, and air pollutants may influence the occurrence of disease [3,4,5,6,7]. ARIMAX modeling studies of tuberculosis are still rare. A time series study in Jiangsu, China, showed that the prediction accuracy of the ARIMA model for PTB was improved by adding monthly PM2.5 with 0-month lag as an external variable [8]. Another time series study in eastern China also indicated that the predictive performance of the ARIMA model was improved after incorporating meteorological factors [9].
Several studies have proposed the biological mechanisms linking air pollution exposure and the risk of PTB. For example, PM2.5 exposure disrupted the synthesis and secretion of inflammatory cytokines including interferon (IFN)-γ, tumor necrosis factor (TNF)-α, and interleukin (IL)-10 and impaired key anti-mycobacterial T cell immune functions [10]. In addition, blocking the IL-10 pathway and downregulating TNF-α by CO in lung macrophages may promote the reactivation of PTB [11]. Therefore, we predicted a positive correlation between pollutant concentration and tuberculosis incidence. The possible link between PTB and meteorological factors may be attributable to the following reasons. The risk of Mycobacterium tuberculosis transmission appears to be the greatest during the cold winter, particularly in overcrowded and poorly ventilated settings [12]. Less humidity leads to the evaporation of droplets, reduces their size, and escalates their ability to travel further, which increases the possibility of transmission [13]. The atmosphere pressure has an indirect negative effect on the incidence of PTB. Low-level air rises under low-pressure conditions, and surface pollutants diffuse vertically into the air, resulting in increased air pollution. Therefore, we predicted a negative correlation between meteorological factors and tuberculosis incidence [14].
To our knowledge, few studies used the ARIMAX model to incorporate both the air pollution and meteorological factors to predict PTB. Thus, in the current study, we performed a time series analysis in Ningbo, China, and applied ARIMA models (ARIMA, univariate ARIMAX and multivariate ARIMAX model) to explore whether the inclusion of air pollution and meteorological factors can improve the performance of prediction modeling.

2. Materials and Methods

2.1. Study Site and Data Collection

As a city in Zhejiang province located along the eastern coast of China, Ningbo covers an area of 98,000 thousand square kilometers. It governed 10 counties and had a permanent population of 9.4 million at the end of 2020. All newly diagnosed TB cases are registered in an online Tuberculosis Management Information System (TBIMS), which is operated by the Center for Disease Control and Prevention (CDC) of China. We extracted monthly incidence of PTB from January 2015 to December 2019 as the study subjects. Population data were obtained from the Ningbo Statistical Yearbook. We used the monthly incidence from January 2015 to December 2018 as the model-construction datasets and incidence from January 2019 to December 2019 as the validation datasets.
The monthly average concentrations of the ambient air pollutants including nitrogen dioxide (NO2), carbon monoxide (CO), sulfur dioxide (SO2), ozone (O3), particulate matter 2.5 μm in diameter (PM2.5), and particulate matter 10μm in diameter (PM10) for the same period were obtained from the Ningbo Environment Monitoring Center. Meteorological factors included monthly average temperature (MAT, °C), monthly average highest temperature (MAHT, °C), monthly average lowest temperature (MALT, °C), monthly average relative humidity (MAH, %), monthly average atmospheric pressure (MAP, hPa), and monthly average wind speed (MAS, m/s) and data were obtained from the Ningbo Meteorological Bureau.

2.2. Construction of the ARIMA Model

Following Li et al. [9], we constructed a seasonal ARIMA model, which was expressed as ARIMA (p, d, q) (P, D, Q)s. The variable p is the order of the autoregression (AR) process, q is the number of moving average (MA) terms, d represents the differencing process to form a stationary times series, and P, D, and Q are the seasonal orders of the AR, differencing, and MA processes, respectively [3]. Additionally, s denotes the seasonal period. The number of PTB incidence predicted at time t (Yt) was determined by the formula: Y t = θ q B Θ Q B S a t Φ P B S φ p B ( 1 B ) d ( 1 B S ) D , where θq(B) is the operator of the moving-average model, ΘQ(BS) is the operator of the seasonal-moving average model, φp(B) is the operator of the auto-regressive model, ΦP(BS) is the operator of the seasonal autoregressive model, (1−B)d is the component of the ordinary differences, (1−BS)D is the component of the seasonal differences, at is white noise, and Yt is the predicted variable [9]. Based on the monthly incidence of PTB, we used the auto. arima ( ) function in R software (the R Core Team, Vienna, Austria) to undertake automatic ARIMA model selection and fitting according to Bayesian information criterion (BIC). BIC takes into account the number of observations and has a larger penalty compared with Akaike Information Criterion (AIC) [15]. When the number of observations is too large, it can effectively prevent the model from being too complicated and over-fitting. The model with the lowest BIC is defined as the optimal model. The Ljung–Box test was used to test whether the model residual sequence showed auto-correlation. Finally, we selected the optimal ARIMA model to predict PTB incidence in 2019 and mean absolute percentage error (MAPE) was used to model validation.

2.3. Cross-Correlation Analysis

Due to strong auto-correlations in the data, correlations of the time series of the monthly incidence of PTB with air pollution and meteorological factors were difficult to identify. In this study, a prewhitening process was applied to the data among the multiple exogenous regressors [3]. Prewhitening is used to avoid common trends between incidence and ambient factors [16]. We calculated the cross-correlation function (CCF) between the residual series from ARIMA model of the monthly incidence and ambient factors to identify the significant time lags. Based on the results from previous studies as well as biological and epidemiological plausibility, within a lag length of 12, we selected the lag periods with positive and significant values for the next step of the analysis [17].

2.4. Construction of the ARIMAX Model

The lag value of the statistically significant factors identified by the cross-correlation analysis was incorporated as exogenous regressors into the ARIMAX model constructed above. The ARIMAX model is described by equation: Y t = θ q B Θ Q B S a t Φ P B S φ p B ( 1 B ) d ( 1 B S ) D + X , where X represents the exogenous regressor, which can be univariate or multivariate [9,16]. The other parameters are as described in the ARIMA model above. In this study, we first took a single lag period of risk factor in the univariate ARIMAX model. The coefficients of the model were estimated using the maximum likelihood method. Fitted MAPE was used to measure the performance of the ARIMA and the ARIMAX model in predicting PTB incidence from 2016 to 2018, and test MAPE was used to measure the performance in 2019. We determined the suitable univariate ARIMAX models according to four criteria: (a) the BIC value smaller than the optimal ARIMA model; (b) the coefficients of the regression term all significant (p < 0.05); (c) the fitted MAPE value smaller than the optimal ARIMA model; and (d) the test MAPE value smaller than the optimal ARIMA model.
For the multivariate ARIMAX analysis, the incorporated factors were selected from the factors in the suitable univariate ARIMAX models. The suitable multivariate ARIMAX models were also determined by four criteria above. The optimal ARIMAX model should have the lowest BIC, fitted MAPE, and test MAPE values.

2.5. Statistical Software

We used the packages of “forecast”, “Stats”, and “ggplot2” of R 3.5.1 (the R Core Team, Vienna, Austria) (https://www.r-project.org/, accessed on 8 Janunary 2022) to estimate the ARIMA and ARIMAX models and present data visualization. The R code can be found in Supplementary File S1. The significance level was set at 0.05.

3. Results

3.1. Descriptive Analysis

Descriptive statistics of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019 are listed in Table 1. Time series plots of these are shown in Figure 1. The annual PTB notification rates from 2015 to 2019 of Ningbo were 48.20/100,000, 45.81/100,000, 45.99/10,000, 46.92/100,000, and 44.14/100,000, respectively. The monthly incidence series plot showed a seasonal fluctuation. The peak incidence mainly occurred in March, April, and May, and the trough was more common in November, December, and February. The mean concentrations of PM2.5, PM10, SO2, CO, NO2, O3, were 35.79 μg/m3, 57.42 μg/m3, 10.90 μg/m3, 0.78 mg/m3, 37.47 μg/m3, and 95.26 μg/m3, respectively, compared with the World Health Organization air quality guideline levels of 5 μg/m3, 15 μg/m3, 40 μg/m3, 4 mg/m3, 10 μg/m3, and 60 μg/m3, respectively [18].

3.2. ARIMA Model

The optimal ARIMA model constructed by monthly incidence of PTB from 2015 to 2018 according to BIC was the ARIMA (0,0,0)(1,1,0)12 model with a BIC of 18.2228. The optimal ARIMA model for PM2.5, PM10, SO2, CO, NO2, O3, MAT, MAHT, MALT, MAH, MAP, and MAS from 2015 to 2018 is listed in Table 2. The Ljung–Box test indicated that residual sequence from all models did not significantly depart from a white noise sequence (p > 0.05).

3.3. Cross-Correlation Analysis

3.3.1. CCF between the PTB and Ambient Factors

After prewhitening, we calculated the cross-correlation function (CCF) coefficients between the PTB and each ambient factor at different lag periods. Table 3 shows that the PTB was positively correlated with the PM2.5 (3-month lag), PM10 (3-month lag), and CO (9-month lag) and negatively correlated with O3 (0-month lag), MAT (1, 3-month lag), MALT (1,3-month lag), MAH (7-month lag), and MAP (2, 11-month lag) (p < 0.05).

3.3.2. CCF between the Air Pollutants and Meteorological Factors

We also calculated the CCF coefficients between prewhitened air pollutants concentration and meteorological factors residual series at lag 0 between 2015 and 2018. Table 4 shows that PM2.5 and PM10 were significant negatively correlated with MAT, MALT, and MAH; SO2 was significant negatively correlated with MALT and MAH; NO2 was significant negatively correlated with MALT, MAH, and MAS; O3 was significant negatively correlated with MAH, but significant positively correlated with MAP.

3.4. Univariate and Multivariate ARIMAX Analyses

Based on the cross-correlation analysis, we tested the univariate ARIMAX model by incorporating different lag periods of risk factors as exogenous regressors based on the ARIMA (0,0,0)(1,1,0)12 model from PTB time series. Because different ambient factors have different lag periods, from 1 to 11 months, in order to compare the performance of different univariate ARIMAX models and the optimal ARIMA model, it is necessary to make the sequence length of modeling consistent. Therefore, the monthly incidence of PTB and environmental factors with different lag periods from January 2016 to December 2018 were selected to construct the ARIMAX model and the modeling times were all 36 months.
As shown in Table 5, the ARIMAX (0,0,0)(1,1,0)12 model with the O3 (0-month lag) and ARIMAX (0,0,0)(1,1,0)12 model with the MAP (11-month lag) is the optimal univariate ARIMAX model in terms of the four criteria being considered. The BIC value, fitted and test MAPE of the two univariate ARIMAX models were all smaller than the ARIMA model, and the coefficients of the regression term of these two univariate ARIMAX models were all significant (p < 0.05). Then, we tested the multivariate ARIMAX model by incorporating both the O3 (0-month lag) and the MAP (11-month lag) as exogenous regressors based on the ARIMA (0,0,0)(1,1,0)12 model, and the coefficients of the regression term of the multivariate ARIMAX model were all significant (p < 0.05). Compared with the optimal ARIMA and two suitable univariate ARIMAX models, the multivariate ARIMAX model was the best fitting model, with the lowest BIC (8.1092), fitted MAPE (2.9097%), and test MAPE (9.2643%). Finally, the actual incidence, fitted and predicted incidences of PTB using ARIMA (0,0,0)(1,1,0)12 and ARIMA (0,0,0)(1,1,0)12 with O3 (0-month lag) and MAP (11-month lag) in Ningbo were shown in Figure 2.

4. Discussion

In this study, we developed and evaluated time series models to characterize the ambient factors of pulmonary tuberculosis transmission in Ningbo to inform control measures. To our knowledge, this is the first time series study to construct ARIMAX models to explore the role of both air pollution and meteorological factors in predicting PTB. We found that both air pollution and meteorological factors were associated with the incidence of tuberculosis in Ningbo with a lag effect. Additionally, the multivariate ARIMAX model that included both the ozone at lag 0 and the monthly mean atmospheric pressure at lag 11 had a better predicting performance than the inclusion of one variable. This modeling technique can be a useful tool for planning control interventions and could be implemented during routine tuberculosis surveillance in Ningbo.
Although preventive measures have made great progress, the prevention and treatment of tuberculosis still involves enormous challenges, such as increased drug resistance [19,20], dual infection of tuberculosis and AIDS [21], and increased migrant population [22]. Furthermore, urban air quality is an important potential factor in the contribution of tuberculosis infection. There is a growing body of evidence suggesting an association between air pollution exposure and PTB incidence. However, available evidence on the association of air pollution and PTB risk is inconsistent [23]. Our study found a significantly positive correlation of particulate matter with an aerodynamic diameter ≤2.5 μm (PM2.5) or ≤10 μm (PM10) with PTB incidence, which is consistent with other studies [24,25]. Moreover, we also found that CO was positively correlated with PTB incidence, while O3 was negatively correlated with PTB incidence, which is consistent with other studies [11,24]. Limited evidence from vitro studies has suggested that the survival rate of mice intravenously inoculated with Mycobacterium tuberculosis after intravenous injection of dissolved ozone was significantly higher than that of mice not treated with ozone [26]. The association between PTB and air pollution exposure varied across regions, which may be partially attributed to pollutant concentration or analytic methods. Our study site is a coastal city of South China, the concentrations of air pollutants in our study (PM2.5: 35.79 ± 14.73 μg/m3, SO2: 10.90 ± 3.97 μg/m3, NO2: 37.47 ± 12.25 μg/m3) were much lower than those in mega cities such as Beijing (PM2.5: 105.10 ± 80.90 μg/m3, SO2: 48.60 ± 49.10 μg/m3, NO2: 64.20 ± 25.70 μg/m3) [27]. Evidence from the area with lower air pollution levels will be important to strengthen the basis for policy making [28].
Meteorological factors, including MAT, MALT, MAH, and MAP, had a negative effect on PTB with a lag effect in our study, which are largely consistent with the findings in previous studies [14,29]. Climate change affects tuberculosis through diverse pathways: changes in climatic factors such as temperature, humidity, and precipitation influence host response through alterations in vitamin D distribution, ultraviolet radiation, malnutrition, and other risk factors [30]. Our research also found that PM2.5 and PM10 were significant negatively correlated with MAT, MALT, and MAH, and O3 was significant negatively correlated with MAH, but significant positively correlated with MAP. The results of our study support previous findings that climatic factors could affect the incidence of tuberculosis by indirectly regulating urban air quality [14].
Our study used incidence of PTB from 2016 to 2018 and delayed ambient factors modeling to predict TB incidence in 2019 and compare it with the actual incidence. Cross-validating to calculate test MAPE could avoid the over-fitting of the model caused by incorporating ambient factors. For example, ARIMA (0,0,0)(1,1,0)12 + MAT (lag1) has the smallest fitted MAPE of 2.3205%, but its test MAPE is as high as 12.0531%, which is an obvious over-fitting phenomenon. The fitted and test MAPE of ARIMA (0,0,0)(1,1,0)12 + O3 (lag0) + MAP (lag11) are both smaller than the optimal ARIMA model, so over-fitting is effectively avoided. Through ARIMA model prewhitening, we found a lag correlation between the residual sequence of ambient factors and the residual sequence of tuberculosis incidence, which suggests that it is possible to control tuberculosis and other infectious diseases by protecting the environment and reducing pollutants.
Admittedly, there were limitations in our study. Firstly, because there was a lag relationship between ambient factors and the incidence of PTB, we could accurately predict the incidence in these future lag periods by using the ARIMAX model. For example, the lag time of MAP is 11 months, so we can accurately predict the incidence of the next 11 months. However, there was no lag relationship between O3 and PTB in this study. Therefore, using O3 to predict tuberculosis in the same month does not produce significant result. In addition, the ARIMA model can only extract the seasonal and long-term trends of ambient factors and tuberculosis incidence. This model cannot extract and predict the information of residual sequences. Thus, it is invalid to use the ARIMA model to predict future ambient factors and then incorporate them into the ARIMAX model. Because we are temporarily unable to accurately predict the future ambient factors, we can only rely on factors such as MAP to accurately predict the incidence of PTB in the next 11 months. It is our future direction to improve the accuracy of the ARIMAX model in predicting long-term incidence of PTB by finding a more advanced model to predict the future ambient factors. Secondly, the ARIMAX model can only identify a correlation between PTB incidence and ambient factors but cannot identify causal relationships. Thirdly, data from fixed monitoring stations assume that all participants in the target area were at the same level of air pollution exposure, and personal exposure data were not collected [31].

5. Conclusions

Ambient air pollutants and meteorological factors were associated with monthly incidence of PTB with a lag effect. Meteorological factors may affect the incidence of PTB by indirectly regulating urban air quality. The multivariate ARIMAX model that included both the ozone with a 0-month lag and the monthly average atmospheric pressure with an 11-month lag had the best performance to predict the incidence of PTB. However, ARIMAX has limited improvement in prediction accuracy compared with the ARIMA model. This modeling technique can be a useful tool for planning control interventions and could be implemented during routine tuberculosis surveillance in Ningbo. When the peak of tuberculosis and low atmosphere pressure arise, health education on tuberculosis should be strengthened to remind people to seek medical treatment as soon as symptoms appear.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijerph19095385/s1, File S1: R code.

Author Contributions

Conceptualization, Y.-P.C. and T.-F.H.; methodology, Y.-P.C. and Z.-Q.X.; software, G.-X.S. and Z.-Q.X.; validation, Y.C. and T.-F.H.; formal analysis, J.H. and G.-X.L.; data curation, T.-F.H. and Y.C.; writing—original draft preparation, Y.-P.C.; writing—review and editing, T.-F.H. and L.-F.L.; visualization, Y.-P.C.; supervision, T.-F.H. and G.-X.L.; project administration, T.-F.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Medical Technology Program Foundation of Zhejiang [No. 2021KY334; No. 2022KY1189], the Project of Zhejiang Public Welfare Fund [No. LGFLGF19H260010], the Zhejiang Medical Key Discipline [No. 07-013], and the Ningbo Health Branding Subject Fund [No. PPXK2018-10].

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used and analyzed in the current study are not publicly available because restrictions apply to the availability of the data. Data are, however, available from the corresponding author by reasonable requests, and with permission from the Ningbo Municipal CDC.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. WHO. The End TB Strategy. 2014. Available online: https://www.who.int/teams/global-tubercul-osis-programme/the-end-tb-strategy (accessed on 18 January 2022).
  2. WHO. Global Tuberculosis Report 2021. 2021. Available online: https://www.who.int/publications/digital/global-tuberculosis-report-2021 (accessed on 18 January 2022).
  3. Jing, Q.L.; Cheng, Q.; Marshall, J.M.; Hu, W.B.; Yang, Z.C.; Lu, J.H. Imported cases and minimum temperature drive dengue transmission in Guangzhou, China: Evidence from ARIMAX model. Epidemiol. Infect. 2018, 146, 1226–1235. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. N’Gattia, A.K.; Coulibaly, D.; Nzussouo, N.T.; Kadjo, H.A.; Chérif, D.; Traoré, Y.; Kouakou, B.K.; Kouassi, P.D.; Ekra, K.D.; Dagnan, N.S.; et al. Effects of climatological parameters in modeling and forecasting seasonal influenza transmission in Abidjan, Cote d’Ivoire. BMC Public Health 2016, 16, 972. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Chadsuthi, S.; Modchang, C.; Lenbury, Y.; Iamsirithaworn, S.; Triampo, W. Modeling seasonal leptospirosis transmission and its association with rainfall and temperature in Thailand using time-series and ARIMAX analyses. Asian Pac. J. Trop. Med. 2012, 5, 539–546. [Google Scholar] [CrossRef] [Green Version]
  6. Chadsuthi, S.; Iamsirithaworn, S.; Triampo, W.; Modchang, C. Modeling Seasonal Influenza Transmission and Its Association with Climate Factors in Thailand Using Time-Series and ARIMAX Analyses. Comput. Math. Methods Med. 2015, 2015, 436495. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Gharbi, M.; Quenel, P.; Gustave, J.; Cassadou, S.; La Ruche, G.; Girdary, L.; Marrama, L. Time series analysis of dengue incidence in Guadeloupe, French West Indies: Forecasting models using climate variables as predictors. BMC Infect. Dis. 2011, 11, 166. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Ding, Z.Q.; Li, Y.X.; Wang, X.M.; Li, H.L.; Cai, Y.L.; Wang, B.X.; Wang, K.; Wang, W.M. The impact of air pollution on the transmission of pulmonary tuberculosis. Math. Biosci. Eng. 2020, 17, 4317–4327. [Google Scholar] [CrossRef]
  9. Li, Z.Q.; Pan, H.Q.; Liu, Q.; Song, H.; Wang, J.M. Comparing the performance of time series models with or without meteorological factors in predicting incident pulmonary tuberculosis in eastern China. Infect. Dis. Poverty 2020, 9, 151. [Google Scholar] [CrossRef]
  10. Ibironke, O.; Carranza, C.; Sarkar, S.; Torres, M.; Choi, H.T.; Nwoko, J.; Black, K.; Quintana-Belmares, R.; Osornio-Vargas, Á.; Ohman-Strickland, P.; et al. Urban Air Pollution Particulates Suppress Human T-Cell Responses to Mycobacterium Tuberculosis. Int. J. Environ. Res. Public Health 2019, 16, 4112. [Google Scholar] [CrossRef] [Green Version]
  11. Liu, Y.; Cui, L.; Hou, L.; Yu, C.; Tao, N.; Liu, J.; Li, Y.; Zhou, C.; Yang, G.; Li, H. Ambient Air Pollution Exposures and Newly Diagnosed Pulmonary Tuberculosis in Jinan, China: A Time Series Study. Sci. Rep. 2018, 8, 17411. [Google Scholar] [CrossRef]
  12. Fares, A. Seasonality of tuberculosis. J. Glob. Infect. Dis. 2011, 3, 46–55. [Google Scholar] [CrossRef]
  13. Liu, L.; Wei, J.; Li, Y.; Ooi, A. Evaporation and dispersion of respiratory droplets from coughing. Indoor Air 2017, 27, 179–190. [Google Scholar] [CrossRef] [PubMed]
  14. Zhang, C.Y.; Zhang, A. Climate and air pollution alter incidence of tuberculosis in Beijing, China. Ann. Epidemiol. 2019, 37, 71–76. [Google Scholar] [CrossRef] [PubMed]
  15. Vrieze, S.I. Model selection and psychological theory: A discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). Psychol. Methods 2012, 17, 228–243. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Box, G.; Jenkins, G. Time Series Analysis: Forecasting and Control, 4th ed.; China Machine Press: Peking, China, 2011; pp. 291–292. [Google Scholar]
  17. Behr, M.A.; Edelstein, P.H.; Ramakrishnan, L. Revisiting the timetable of tuberculosis. BMJ 2018, 362, k2738. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. WHO. WHO Global Air Quality Guidelines: Particulate Matter (PM2.5 and PM10), Ozone, Nitrogen Dioxide, Sulfur Dioxide and Carbon Monoxide. Available online: https://www.who.int/publications/i/item/9789240034228 (accessed on 18 January 2022).
  19. Wang, N.; Ma, Y.; Liu, Y.H.; Du, J.; Zhang, H.; Xie, S.H.; Zhu, K.; Lyu, X.Y.; Shu, W.; Wang, H.H.; et al. Risk of Treatment Failure in Patients with Drug-susceptible Pulmonary Tuberculosis in China. Biomed. Environ. Sci. 2016, 29, 612–617. [Google Scholar] [CrossRef]
  20. Shao, Y.; Yang, D.; Xu, W.; Lu, W.; Song, H.; Dai, Y.; Shen, H.; Wang, J. Epidemiology of anti-tuberculosis drug resistance in a Chinese population: Current situation and challenges ahead. BMC Public Health 2011, 11, 110. [Google Scholar] [CrossRef] [Green Version]
  21. Culqui-Lévano, D.R.; Rodriguez-Valín, E.; Donado-Campos, J.M. Analysis of extrapulmonary tuberculosis in Spain: 2007–2012 National Study. Enferm. Infecc. Microbiol. Clin. 2017, 35, 82–87. [Google Scholar] [CrossRef]
  22. Ködmön, C.; Zucs, P.; van der Werf, M.J. Migration-related tuberculosis: Epidemiology and characteristics of tuberculosis cases originating outside the European Union and European Economic Area, 2007 to 2013. Euro Surveill. 2016, 21, 30164. [Google Scholar] [CrossRef] [Green Version]
  23. Xiang, K.; Xu, Z.; Hu, Y.Q.; He, Y.S.; Dan, Y.L.; Wu, Q.; Fang, X.H.; Pan, H.F. Association between ambient air pollution and tuberculosis risk: A systematic review and meta-analysis. Chemosphere 2021, 277, 130342. [Google Scholar] [CrossRef]
  24. Huang, K.; Ding, K.; Yang, X.J.; Hu, C.Y.; Jiang, W.; Hua, X.G.; Liu, J.; Cao, J.Y.; Zhang, T.; Kan, X.H.; et al. Association between short-term exposure to ambient air pollutants and the risk of tuberculosis outpatient visits: A time-series study in Hefei, China. Environ. Res. 2020, 184, 109343. [Google Scholar] [CrossRef]
  25. Kim, H.; Yu, S.; Choi, H. Effects of particulate air pollution on tuberculosis development in seven major cities of Korea from 2010 to 2016: Methodological considerations involving long-term exposure and time lag. Epidemiol. Health 2020, 42, e2020012. [Google Scholar] [CrossRef] [PubMed]
  26. Belianin, I.I.; Nikolaeva, G.M.; Martynova, L.P. Action of dissolved ozone on mycobacterium tuberculosis and alveolar macrophages in experimental tuberculosis. Probl. Tuberk. 1997, 1997, 56–59. [Google Scholar]
  27. Guo, Y.; Li, S.; Tian, Z.; Pan, X.; Zhang, J.; Williams, G. The burden of air pollution on years of life lost in Beijing, China, 2004–2008: Retrospective regression analysis of daily deaths. BMJ 2013, 347, f7139. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Huang, J.; Li, G.; Qian, X.; Xu, G.; Zhao, Y.; Huang, J.; Liu, Q.; He, T.; Guo, X. The burden of ischemic heart disease related to ambient air pollution exposure in a coastal city in South China. Environ. Res. 2018, 164, 255–261. [Google Scholar] [CrossRef]
  29. Bie, S.; Hu, X.; Zhang, H.; Wang, K.; Dou, Z. Influential factors and spatial-temporal distribution of tuberculosis in mainland China. Sci. Rep. 2021, 11, 6274. [Google Scholar] [CrossRef]
  30. Maharjan, B.; Gopali, R.S.; Zhang, Y. A scoping review on climate change and tuberculosis. Int. J. Biometeorol. 2021, 65, 1579–1595. [Google Scholar] [CrossRef]
  31. Popovic, I.; Soares Magalhaes, R.J.; Ge, E.; Marks, G.B.; Dong, G.H.; Wei, X.; Knibbs, L.D. A systematic literature review and critical appraisal of epidemiological studies on outdoor air pollution and tuberculosis outcomes. Environ. Res. 2019, 170, 33–45. [Google Scholar] [CrossRef]
Figure 1. Time series of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019.
Figure 1. Time series of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019.
Ijerph 19 05385 g001
Figure 2. Actual incidence, fitted and predicted incidences of ARIMA (0,0,0) (1,1,0)12 and ARIMA (0,0,0) (1,1,0)12 with O3 (0-month lag) and MAP (11-month lag) in Ningbo.
Figure 2. Actual incidence, fitted and predicted incidences of ARIMA (0,0,0) (1,1,0)12 and ARIMA (0,0,0) (1,1,0)12 with O3 (0-month lag) and MAP (11-month lag) in Ningbo.
Ijerph 19 05385 g002
Table 1. Descriptive statistics of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019.
Table 1. Descriptive statistics of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019.
VariablesMeanS.D.MinP25P50P75MAXIQR
Monthly incidence of PTB3.850.502.773.483.804.175.030.69
Air pollutants concentration
PM2.5 (μg/m3)35.7914.7314.3524.3032.6443.9581.4219.65
PM10 (μg/m3)57.4221.5124.4841.6252.5569.57121.1627.95
SO2 (μg/m3)10.903.975.1308.18210.3612.66526.9004.483
CO (mg/m3)0.780.150.530.670.750.851.250.18
NO2 (μg/m3)37.4712.2514.6828.7635.3646.1964.5217.43
O3 (μg/m3)95.2626.7931.6871.86104.70113.32143.5541.46
Meteorological factors
MAT (°C)17.797.855.3210.6218.2124.6630.5714.04
MAHT (°C)21.887.978.6114.6623.2528.2435.9713.58
MALT (°C)14.538.012.836.3514.2022.0626.7415.70
MAH (%)76.705.2363.4073.4977.7480.7886.707.29
MAP (hPa)1015.587.771002.691009.2051015.891022.411027.1915.18
MAS (m/s)21.364.7210.6718.6421.7525.1832.816.54
Notes: IQR = P75P25; MAT: monthly average temperature; MAHT: monthly average highest temperature; MALT: monthly average lowest temperature; MAH: monthly average relative humidity; MAP: monthly average atmospheric pressure; MAS: monthly average wind speed.
Table 2. The optimal ARIMA model for monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2018.
Table 2. The optimal ARIMA model for monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2018.
VariablesModelBICLjung–Box Test
X-Squaredp-Value
Monthly incidence of PTBARIMA(0,0,0)(1,1,0)1218.22282.18330.1395
Air pollutants concentration
PM2.5 (μg/m3)ARIMA (1,0,0)(0,1,0)12266.63360.73670.3907
PM10 (μg/m3)ARIMA (1,0,0)(0,1,0)12298.67340.68800.4068
SO2 (μg/m3)ARIMA (1,0,0)(0,1,0)12161.63010.53410.4649
CO (mg/m3)ARIMA (1,0,0)(1,1,0)12−74.64221.34170.2467
NO2 (μg/m3)ARIMA (1,0,0)(0,1,1)12238.2070.71070.3992
O3 (μg/m3)ARIMA (0,0,1)(1,1,0)12297.42980.00110.9736
Meteorological factors
MAT (°C)ARIMA (0,0,0)(0,1,1)12126.29392.25530.1332
MAHT (°C)ARIMA (1,0,0)(0,1,1)12148.70690.12100.7280
MALT (°C)ARIMA (0,0,0)(0,1,0)12139.38572.59960.1069
MAH (%)ARIMA (0,0,0)(1,1,0)12224.07590.00090.9757
MAP (hPa)ARIMA (0,0,0)(1,1,0)12250.43390.25650.6125
MAS (m/s)ARIMA (0,1,1)262.49650.48510.4861
Table 3. CCF coefficients between the prewhitened ambient factors residuals series with different time lags and the prewhitened PTB incidence residuals series from 2015 to 2018.
Table 3. CCF coefficients between the prewhitened ambient factors residuals series with different time lags and the prewhitened PTB incidence residuals series from 2015 to 2018.
FactorsLag Periods (Months)
0123456789101112
PM2.50.0400.145−0.1250.353 *0.0280.124−0.0180.164−0.1750.1510.013−0.1220.112
PM100.0680.166−0.1140.343 *0.0410.076−0.0490.209−0.1720.190−0.007−0.1100.077
SO20.2020.069−0.1270.1760.125−0.161−0.0340.151−0.2110.0310.103−0.2700.145
CO−0.0130.075−0.028−0.086−0.1110.100−0.0360.115−0.0720.291 *−0.0830.1650.000
NO20.034−0.100−0.1060.0440.0730.013−0.1640.008−0.2090.278−0.075−0.0640.073
O3−0.321 *−0.135−0.279−0.085−0.0140.026−0.0640.0720.1110.0810.113−0.1510.216
MAT−0.206−0.406 *−0.102−0.316 *−0.002−0.0810.0340.0930.022−0.126−0.0950.1460.124
MAHT0.093−0.215−0.040−0.1710.155−0.0140.1480.2410.1620.080−0.0290.1770.174
MALT−0.198−0.310 *−0.040−0.423 *−0.094−0.085−0.077−0.158−0.018−0.1330.0140.1490.122
MAH−0.136−0.016−0.019−0.240−0.2380.073−0.053−0.307 *−0.074−0.273−0.036−0.085−0.157
MAP0.029−0.136−0.299 *−0.0300.105−0.252−0.1510.0660.053−0.2480.027−0.335 *−0.066
MAS−0.0910.125−0.112−0.205−0.169−0.0980.0680.0450.091−0.1990.038−0.053−0.240
Note: *, p < 0.05.
Table 4. CCF coefficients between the prewhitened air pollutants concentration and meteorological factors residuals series at lag 0 between 2015 and 2018.
Table 4. CCF coefficients between the prewhitened air pollutants concentration and meteorological factors residuals series at lag 0 between 2015 and 2018.
Air PollutantsMeteorological Factors
MATMAHTMALTMAHMAPMAS
PM2.5−0.317 *−0.154−0.527 *−0.477 *0.022−0.289 *
PM10−0.319 *−0.136−0.586 *−0.605 *0.066−0.248
SO2−0.249−0.059−0.457 *−0.529 *0.428 *−0.098
CO−0.249−0.097−0.115−0.059−0.199−0.150
NO2−0.272−0.201−0.308 *−0.400 *−0.016−0.437 *
O30.0480.122−0.161−0.403 *0.318 *0.027
Note: *, p < 0.05.
Table 5. Summary of the fitted parameters of the optimal ARIMA, univariate ARIMAX, and the multivariate ARIMAX model analysis in Ningbo, 2016–2018.
Table 5. Summary of the fitted parameters of the optimal ARIMA, univariate ARIMAX, and the multivariate ARIMAX model analysis in Ningbo, 2016–2018.
ModelBICMAPE(%)Risk Factors
FittedTestVarsCoefS.E.Tp-Value
(1) ARIMA(0,0,0)(1,1,0)129.03763.326910.6693sar1−0.58290.19073.05660.0021 *
(2) ARIMA(0,0,0)(1,1,0)12+PM2.5(lag3) d12.16823.374810.5262 dsar1−0.54580.27202.00640.0264 *
PM2.5(lag3)0.00190.00870.21470.4156
(3) ARIMA(0,0,0)(1,1,0)12+PM10(lag3) d12.06623.383310.4819 dsar1−0.53550.24602.17670.0183 *
PM10(lag3)0.00180.00480.38090.3528
(4) ARIMA(0,0,0)(1,1,0)12+CO(lag9) d11.19003.2306 c9.5569 dsar1−0.58430.19033.07070.0021 *
PM10(lag3)0.00420.00411.02210.1570
(5) ARIMA(0,0,0)(1,1,0)12+O3(lag0) abcd8.2634 a3.0226 c9.7944 dsar1−0.64180.17173.73740.0003 *
O3(lag0)−0.00610.00292.07510.0228 * b
(6) ARIMA(0,0,0)(1,1,0)12+MAT(lag1) abc−1.4075 a2.3205c12.0531sar1−0.56730.19782.86790.0035 *
MAT(lag1)−0.13170.03104.2421<0.0001 * b
(7) ARIMA(0,0,0)(1,1,0)12+MAT(lag3)10.52443.386011.0464sar1−0.51460.21692.37300.0117 *
MAT(lag3)−0.05950.04431.34100.0944
(8) ARIMA(0,0,0)(1,1,0)12+MALT(lag1) abc7.8580 a2.8932 c12.4087sar1−0.45830.26701.71650.0476 *
MALT(lag1)−0.06970.03571.94970.0297 * b
(9) ARIMA(0,0,0)(1,1,0)12+MALT(lag3) abc8.0334 a3.2171 c11.4344sar1−0.50350.22172.27090.0148 *
MALT(lag3)−0.07360.03472.11980.0207 * b
(10) ARIMA(0,0,0)(1,1,0)12+MAH(lag7) cd10.23653.1969 c10.4618 dsar1−0.63370.17323.65870.0004 *
MAH(lag7)−0.01570.01071.46250.0764
(11) ARIMA(0,0,0)(1,1,0)12+MAP(lag2) c9.95903.1324 c10.9889sar1−0.67330.16074.1897<0.0001
MAP(lag2)−0.00950.00591.60230.0592
(12) ARIMA(0,0,0)(1,1,0)12+MAP(lag11) abcd8.4574 a3.2063 c10.0108 dsar1−0.49400.21862.26010.0152 *
MAP(lag11)−0.01250.00602.06800.0232 * b
(13) ARIMA(0,0,0)(1,1,0)12+ O3(lag0)+MAP(lag11) abcd8.1092 a2.9097 c9.2643 dsar1−0.56080.20182.77910.0045 *
O3(lag0)−0.00540.00281.93830.0306 * b
MAP(lag11)−0.01150.00591.94610.0301 * b
Notes: Fitted: fitted results; Test: test results; BIC: Bayesian information criterion; MAPE: mean absolute percentage error; Coef: coefficient of risk factors; lag: time lag of risk factors; S.E.: standard error; T: t statistic; sar1: seasonal AR (1); a: meet criteria (a): the BIC value smaller than the optimal ARIMA model; b: meet criteria (b): the coefficients of the regression term all significant (p < 0.05); c: meet criteria (c): the fitted MAPE value smaller than the optimal ARIMA model; d: meet criteria (d): the test MAPE value smaller than the optimal ARIMA model; *: p < 0.05.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chen, Y.-P.; Liu, L.-F.; Che, Y.; Huang, J.; Li, G.-X.; Sang, G.-X.; Xuan, Z.-Q.; He, T.-F. Modeling and Predicting Pulmonary Tuberculosis Incidence and Its Association with Air Pollution and Meteorological Factors Using an ARIMAX Model: An Ecological Study in Ningbo of China. Int. J. Environ. Res. Public Health 2022, 19, 5385. https://doi.org/10.3390/ijerph19095385

AMA Style

Chen Y-P, Liu L-F, Che Y, Huang J, Li G-X, Sang G-X, Xuan Z-Q, He T-F. Modeling and Predicting Pulmonary Tuberculosis Incidence and Its Association with Air Pollution and Meteorological Factors Using an ARIMAX Model: An Ecological Study in Ningbo of China. International Journal of Environmental Research and Public Health. 2022; 19(9):5385. https://doi.org/10.3390/ijerph19095385

Chicago/Turabian Style

Chen, Yun-Peng, Le-Fan Liu, Yang Che, Jing Huang, Guo-Xing Li, Guo-Xin Sang, Zhi-Qiang Xuan, and Tian-Feng He. 2022. "Modeling and Predicting Pulmonary Tuberculosis Incidence and Its Association with Air Pollution and Meteorological Factors Using an ARIMAX Model: An Ecological Study in Ningbo of China" International Journal of Environmental Research and Public Health 19, no. 9: 5385. https://doi.org/10.3390/ijerph19095385

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop