Next Article in Journal
A Rule Extraction Study from SVM on Sentiment Analysis
Next Article in Special Issue
A Multi-Modality Deep Network for Cold-Start Recommendation
Previous Article in Journal
Reimaging Research Methodology as Data Science
Article Menu

Export Article

Open AccessArticle
Big Data Cogn. Comput. 2018, 2(1), 5; doi:10.3390/bdcc2010005

A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization

Department of Computer Science, University of Iowa, Iowa City, IA 52242, USA
Department of Occupational and Environmental Health, University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA
Department of Management Sciences, University of Iowa, Iowa City, IA 52242, USA
Author to whom correspondence should be addressed.
Received: 28 December 2017 / Revised: 16 February 2018 / Accepted: 19 February 2018 / Published: 24 February 2018
(This article belongs to the Special Issue Learning with Big Data: Scalable Algorithms and Novel Applications)
View Full-Text   |   Download PDF [595 KB, uploaded 25 February 2018]   |  


In this paper, we tackle air quality forecasting by using machine learning approaches to predict the hourly concentration of air pollutants (e.g., ozone, particle matter ( PM 2.5 ) and sulfur dioxide). Machine learning, as one of the most popular techniques, is able to efficiently train a model on big data by using large-scale optimization algorithms. Although there exist some works applying machine learning to air quality prediction, most of the prior studies are restricted to several-year data and simply train standard regression models (linear or nonlinear) to predict the hourly air pollution concentration. In this work, we propose refined models to predict the hourly air pollution concentration on the basis of meteorological data of previous days by formulating the prediction over 24 h as a multi-task learning (MTL) problem. This enables us to select a good model with different regularization techniques. We propose a useful regularization by enforcing the prediction models of consecutive hours to be close to each other and compare it with several typical regularizations for MTL, including standard Frobenius norm regularization, nuclear norm regularization, and 2 , 1 -norm regularization. Our experiments have showed that the proposed parameter-reducing formulations and consecutive-hour-related regularizations achieve better performance than existing standard regression models and existing regularizations. View Full-Text
Keywords: air pollutant prediction; multi-task learning; regularization; analytical solution air pollutant prediction; multi-task learning; regularization; analytical solution

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Scifeed alert for new publications

Never miss any articles matching your research from any publisher
  • Get alerts for new papers matching your research
  • Find out the new papers from selected authors
  • Updated daily for 49'000+ journals and 6000+ publishers
  • Define your Scifeed now

SciFeed Share & Cite This Article

MDPI and ACS Style

Zhu, D.; Cai, C.; Yang, T.; Zhou, X. A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization. Big Data Cogn. Comput. 2018, 2, 5.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Metrics

Article Access Statistics



[Return to top]
Big Data Cogn. Comput. EISSN 2504-2289 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top