A Data-Based Framework for Identifying a Source Location of a Contaminant Spill in a River System with Random Measurement Errors
Abstract
:1. Introduction
2. Background
2.1. Problem Description
2.2. Data Acquisition with Simulation
3. Methods
3.1. Overall Description of the Proposed Framework
the discretized time index at which each sensor returns an observation; | |
the location index of the sensor that raises the first alarm; | |
the point of time when the first alarm is on by the sensor at | |
the point of time when the first alarm is off by the sensor at | |
the concentration level reported by the sensor at at time (i.e., ; | |
a pre-specified window length for spill detection; | |
a lag parameter pre-designated by users to determine ; | |
a nonlinear regression model for in the time period [ | |
a vector representing the curvature characteristics of ; | |
the set of candidate spill locations that can be identified only by the sensor at and | |
the random forest model corresponding to the sensor at |
3.2. Spill Detection
Constructing CUSUM monitoring statistics |
Set and . |
Whiledo |
. |
. |
end while |
3.3. Data Preprocessing
3.4. Source Identification
4. Case Study
4.1. Simulation Setup for the Targeted River System
4.2. Experimental Setup
4.3. Results
5. Conclusions
Author Contributions
Acknowledgments
Conflicts of Interest
References
- Aral, M.M.; Guan, J. Genetic algorithms in search of groundwater pollution sources. In Advances in Groundwater Pollution Control and Remediation, 2nd ed.; Aral, M.M., Ed.; Springer: Dordrecht, The Netherlands, 1996; Volume 9, pp. 347–369. ISBN 978-94-009-0205-3. [Google Scholar]
- Aral, M.M.; Guan, J.; Maslia, M.L. Identification of contaminant source location and release history in aquifers. J. Hydrol. Eng. 2001, 6, 225–234. [Google Scholar] [CrossRef]
- Gorelick, S.M.; Evans, B.; Remson, I. Identifying sources of groundwater pollution: An optimization approach. Water Resour. Res. 1983, 19, 779–790. [Google Scholar] [CrossRef]
- Singh, R.M.; Datta, B. Identification of groundwater pollution sources using GA-based linked simulation optimization model. J. Hydrol. Eng. 2006, 11, 101–109. [Google Scholar] [CrossRef]
- Sun, A.Y.; Painter, S.L.; Wittmeyer, G.W. A robust approach for iterative contaminant source location and release history recovery. J. Contam. Hydrol. 2006, 88, 181–196. [Google Scholar] [CrossRef] [PubMed]
- Neupauer, R.M.; Lin, R. Identifying sources of a conservative groundwater contaminant using backward probabilities conditioned on measured concentrations. Water Resour. Res. 2006, 42. [Google Scholar] [CrossRef] [Green Version]
- Neupauer, R.M.; Wilson, J.L. Numerical implementation of a backward probabilistic model of ground water contamination. Groundwater 2004, 42, 175–189. [Google Scholar] [CrossRef]
- Sun, A.Y. A robust geostatistical approach to contaminant source identification. Water Resour. Res. 2007, 43. [Google Scholar] [CrossRef] [Green Version]
- Singh, R.M.; Datta, B. Artificial neural network modeling for identification of unknown pollution sources in groundwater with partially missing concentration observation data. Water Resour. Manag. 2007, 21, 557–572. [Google Scholar] [CrossRef]
- Singh, R.M.; Datta, B.; Jain, A. Identification of unknown groundwater pollution sources using artificial neural networks. J. Water Resour. Plan. Manag. 2004, 130, 506–514. [Google Scholar] [CrossRef]
- Srivastava, D.; Singh, R.M. Breakthrough curves characterization and identification of an unknown pollution source in groundwater system using an artificial neural network (ANN). Environ. Forensics 2014, 15, 175–189. [Google Scholar] [CrossRef]
- Boano, F.; Revelli, R.; Ridolfi, L. Source identification in river pollution problems: A geostatistical approach. Water Resour. Res. 2005, 41. [Google Scholar] [CrossRef]
- Chen, Y.; Zhao, K.; Wu, Y.; Gao, S.; Cao, W.; Bo, Y.; Shang, Z.; Wu, J.; Zhou, F. Spatio-temporal patterns and source identification of water pollution in lake taihu (China). Water 2016, 8, 86. [Google Scholar] [CrossRef]
- Ghane, A.; Mazaheri, M.; Samani, J.M.V. Location and release time identification of pollution point source in river networks based on the backward probability method. J. Environ. Manag. 2016, 180, 164–171. [Google Scholar] [CrossRef] [PubMed]
- Telci, I.T.; Aral, M.M. Contaminant source location identification in river networks using water quality monitoring systems for exposure analysis. Water Qual. Expo. Health 2011, 2, 205–218. [Google Scholar] [CrossRef]
- Lee, Y.J.; Park, C.; Lee, M.L. Identification of a contaminant source location in a river system using random forest model. Water 2018, 10, 391. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Kim, S.-H.; Aral, M.M.; Eun, Y.; Park, J.J.; Park, C. Impact of sensor measurement error on sensor positioning in water quality monitoring networks. Stoch. Environ. Res. Risk Assess. 2017, 31, 743–756. [Google Scholar] [CrossRef]
- Montgomery, D.C. Introduction to Statistical Quality Control; Wiley: Hoboken, NJ, USA, 2009; ISBN 047151988X. [Google Scholar]
- Kim, S.-H.; Alexopoulos, C.; Tsui, K.L.; Wilson, J.R. A distribution-free tabular CUSUM chart for autocorrelated data. IIE Trans. 2007, 39, 317–330. [Google Scholar] [CrossRef]
- Rossman, L.A. Storm Water Management Model User’s Manual, Version 5.0; U.S. Environmental Protection Agency: Cincinnati, OH, USA, 2004.
- Cleveland, W.S. Robust locally weighted regression and smoothing scatterplots. J. Am. Stat. Assoc. 1979, 74, 829–836. [Google Scholar] [CrossRef]
- Ho, T.K. The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Mech. Intell. 1998, 20, 832–844. [Google Scholar] [CrossRef]
- Park, C.; Telci, I.T.; Kim, S.-H.; Aral, M.M. Designing an optimal water quality monitoring network for river systems using constrained discrete optimization via simulation. Eng. Optim. 2014, 46, 107–129. [Google Scholar] [CrossRef]
- Telci, I.T.; Nam, K.; Guan, J.; Aral, M.M. Optimal water quality monitoring network design for river systems. J. Environ. Manag. 2009, 90, 2987–2998. [Google Scholar] [CrossRef] [PubMed]
Sensor Location | Random Forest Model | Set of Candidate Spill Locations | % of OOB Error (L, L) | % of OOB Error (H, H) |
---|---|---|---|---|
9 | 29.34 | 33.84 | ||
19 | 21.08 | 26.68 | ||
26 | 4.49 | 7.43 | ||
33 | 4.46 | 9.83 | ||
46 | 30.82 | 35.17 | ||
53 | 4.4 | 10.11 |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kim, J.H.; Lee, M.L.; Park, C. A Data-Based Framework for Identifying a Source Location of a Contaminant Spill in a River System with Random Measurement Errors. Sensors 2019, 19, 3378. https://doi.org/10.3390/s19153378
Kim JH, Lee ML, Park C. A Data-Based Framework for Identifying a Source Location of a Contaminant Spill in a River System with Random Measurement Errors. Sensors. 2019; 19(15):3378. https://doi.org/10.3390/s19153378
Chicago/Turabian StyleKim, Jun Hyeong, Mi Lim Lee, and Chuljin Park. 2019. "A Data-Based Framework for Identifying a Source Location of a Contaminant Spill in a River System with Random Measurement Errors" Sensors 19, no. 15: 3378. https://doi.org/10.3390/s19153378
APA StyleKim, J. H., Lee, M. L., & Park, C. (2019). A Data-Based Framework for Identifying a Source Location of a Contaminant Spill in a River System with Random Measurement Errors. Sensors, 19(15), 3378. https://doi.org/10.3390/s19153378