1. Introduction
Statistical tools are considered important elements in water quality analysis. Multivariate statistical techniques (MSTs) represent a very interesting approach as many works prove, followed by concrete case studies like in Wang et al. [
1], where discriminant analysis, principal component analysis, and factor analysis have been involved. Among the multivariate statistical methods, factor analysis and cluster analysis are applied by Ismail et al. [
2] to identify the principal factors and pollution sources that affect the water quality of the Danube River and to evaluate the spatial and temporal similarities or dissimilarities among the sites for sampling and monitoring periods. The quality of water can be evaluated by analyzing its physical, chemical, and biological characteristics. The use of cluster analysis, multivariate statistical methods, principal component analysis, and factor analysis for the interpretation and analysis of water quality data have been presented in [
3]. To identify the sources of pollution and to obtain a better understanding of changes in the quality of surface water, these statistical techniques are useful. The use of multivariate statistical methods in the monitoring data gives information about the chemical structure of water bodies. Also, in Hue et al. [
4], the assessment of spatial variability and the determination of the main contamination in surface water quality using multivariate statistical analysis techniques, considering principal component analysis and cluster analysis, has been performed.
Water management, involving monitoring techniques that accurately assess the water quality and determine the effects of pollution using multivariate statistical methods and water quality, is proposed to evaluate the processes controlling water chemical composition and the overall water quality status in da Silva et al. [
5], where a holistic approach for the water quality conditions and related effects has been performed. Once again, the multivariate statistical techniques to assess the quality of surface waters are used by Dawood et al. [
6] by studying the temporal and spatial water quality variability to reveal the characteristics and pollution indicators. Water evaluation via multivariate statistical techniques was also proposed in [
7,
8,
9,
10,
11,
12] followed by study cases that, actually, justify the statistical approaches, and it is sustained using these mathematical approaches.
Multivariate statistical techniques combined with water quality identification index represent one of the most useful combinations that realize accurate monitoring and the assessment of water quality, i.e., the spatiotemporal analysis of water quality, as in works by [
13,
14,
15] for surface water quality and also for groundwater in the case of [
15]. MSTs have also been successfully applied to solve similar issues in [
16,
17,
18].
However, not only MSTs represent the statistical studies for waters, especially surface waters. Many other statistical methods and models can be remarkably cited in this kind of analysis, as in the papers of Mohammed et al. [
19], Bărbulescu et al. [
20], Schreiber et al. [
21], Kovrov et al. [
22], Antonopoulos et al. [
23], Majerek et al. [
24], Stričević et al. [
25], Khouri et al. [
26], Bhat et al. [
27], and Le et al. [
28].
Statistical tools in the analysis of water quality to process the collected data have been used and capitalized in papers, as that of Andrejiová et al. [
29] and Wang et al. [
30], book chapters by Fu et al. [
31], and more extended works, as that of Helsel and Hirsch [
32], Bal [
33], and Hajigholizadeh [
34].
The author of this work used statistical methods in environmental monitoring, especially for atmospheric evaluation in [
35,
36,
37,
38,
39,
40,
41] and control of water pollutants, in [
42,
43,
44]. As specific methods applied in cited works, we can highlight the demonstrations that the analyzed data are under statistical control and use the most involved model of time series, with a lot of important properties of this powerful instrument. Statistical control can be used for monitoring the degree of water pollution in a considered zone. The aim of the usage of the time series properties is to understand the driving forces and internal structures that produce the observed data, to organize the data into a model, and to involve them in forecasting, monitoring, or even feedback and feedforward control. The most important difference between modeling data via time series methods and using the process monitoring methods is that, in time series analysis, the data collected over time may present internal structure such as autocorrelation, trend, or seasonal variation.
In this paper, data, methods, and models from previous papers [
42,
43,
44] are analyzed and discussed in order to deepen and systematize appropriate instruments and procedures to control the pollution of surface waters in or near a big town. The data collected for certain pollutants in two periods of time and analyzed in the above three cited papers, together with the statistical methods used there, have been discussed and compared in order to highlight mathematical models that can govern such phenomena, to capitalize on some statistical instruments that can be used to control such processes. Also, the approach in this study is able to be replicated, so it can be adapted accordingly in similar situations, other data representing a new study, and, therefore, the methods and models used here can be in a certain way generalized and exemplified.
2. Materials and Methods Analysis
The monitored pollutants for surface waters in and near Bucharest are heavy metals such as lead, monitored between March 2007 and November 2009; mercury, for which the concentration of Hg2+ ions within the analyzed lakes was monitored daily during the same period, except for winter time, when the lakes were dried or frozen; and concentration of ions NO₂⁻, NO₃⁻, and PO₄³⁻ in surface waters from Bucharest during the period 2013–2016. The concentrations of these last ions are an important indicator for the eutrophication degree of lakes.
The monitored lakes are presented in the following:
Lakes over the Colentina River: Străuleşti, Griviţa, Băneasa, Herăstrău, Floreasca, Tei, Colentina, and Pantelimon;
Lakes inside some parks: Tineretului, Circului, Cişmigiu, and Titan;
Morii Lake, organized on Dâmboviţa River, having the purpose to regularize its flow.
The location of these lakes on the map of Bucharest is illustrated in
Figure 1, while their important characteristics are displayed in
Table 1.
The riverbed of the Dâmboviţa and Colentina Rivers is formed with sand and gravel, and the banks of the river are covered with concrete slabs, while the lakes from parks are arranged on swampy zones.
The climate in Bucharest is temperate continental, and topoclimate is lacustrine. The monthly and annual averages of temperature in Bucharest, at Filaret station, are presented in
Table 2.
The values for the monthly averages of rainfall in the lake areas measured in two meteorological stations, at Băneasa and Dragomireşti Vale, are presented in
Table 3.
The analyzed lakes are artificial, and they are the result of water stream barrage. They appeared from regularization needs or in recreation areas. The industrial pollution has been significantly reduced since most of industrial units in this area are closed. So, an increasing pollution source is provided by buildings, usually chaotically placed or organized in chains of residential districts, with rather precarious systematization and sewage. The lake areas still remain frequent waste deposit sites, from small productive units or from population. The lakes from the parks are less exposed to pollution.
2.1. Heavy Metals Data
The main pollution source with Pb
2+ and Hg
2+ is represented by particulate matter (PM) from the atmosphere, which falls on the free surface of water [
47,
48,
49,
50]. As a result, the quality of waters in Bucharest area, Colentina River, and surrounding lakes, is inadequate, due to the wastewater discharge upstream the river from industrial units and domestic sewage. The main pollution trend is represented by high organic content and heavy metals.
The analyses for the concentration of Pb2+ and Hg2+ ions in surface waters of Bucharest area and the identification of their pollution sources are presented in the following:
The concentration of ions of Pb2+ and Hg2+ in the lakes analyzed has been daily monitored during the period March 2007–November 2009, except for wintertime, when they were dried or frozen.
The water sampling for each lake has been performed as follows (according to standard procedures):
Daily at one meter far from the bank, in at least 5 places in the same spatial-temporal conditions;
In half liter recipients of single-use, made of P polyethylene, specially produced for water sampling;
The samples were analyzed daily in specialized laboratory by using AAnalyst 700 Atomic Absorption Spectrometer, Perkin Elmer, Waltham, USA and a Direct Mercury analyzer, DMA 80, Milestone, Shelton, USA for lead and mercury, respectively.
2.2. Nitrates and Phosphates Data
Data resulting from concentration monitoring of ions NO₂⁻, NO₃⁻, PO₄³ ⁻, and Ptot in surface waters from Bucharest during the 2013–2016 period are presented and analyzed.
A measure of the degree of the eutrophication of the lakes is represented by the concentrations of these ions. The surface waters which are monitored are that of 10 lakes resulting from Colentina River systematizing (
Figure 2). The sources of pollution with nitrates and phosphates are coming from economic activities around lakes, urban transport, and the initial pollution of Colentina River.
A direction of data analysis aims to establish whether there is a significant statistical difference between concentrations of these pollutants in surface waters analyzed.
4. Discussion
The daily registration of concentrations for lead and mercury in surface waters forms a univariate time series that consists of observations that have been recorded sequentially over equal time intervals. Such time series have periodicity that represents sinusoidal fluctuations over entire weeks and seasonality over the year. The seasonal pattern is consistent with a wave sine model. The autocorrelation function has been used to obtain the autocorrelation structure property of the series.
In the case of the first set of data (lead emissions), which shows a trend and periodicity, monitoring diagrams are built based only on concentration fluctuations obtained after the removal of trend and periodicity from the initial data [
35].
The results represented in
Figure 20 and also the range diagram from
Figure 21 prove that the experimental measurements can be statistically monitored with a diagram of Shewhart type. The diagram
R of the range (amplitudes) of fluctuation variations confirms that these are within the limits of statistical control of the process.
Monitoring and statistics for the concentrations of lead ions in surface waters from Bucharest zone showed the following aspects:
The maximum admitted limit of Pb2+ concentration is sometimes overcome in several lakes exposed to traffic pollution or waste disposal;
The lakes situated inside the parks are less polluted;
The daily Pb2+ concentrations present a decreasing trend for all the studied cases, and they have a weekly periodic character;
Shewhart type monitoring diagrams X and R can be involved to monitor these data, but only for concentration fluctuations prepared by removing trend and periodicity from the initial data;
The surface water pollution with lead in the Bucharest area is quite small and caused mainly via PM from air as the result of the combustion processes, demolition, and rehabilitation of degraded buildings.
At present time, the reported level of global mercury contamination is growing, having various harmful effects on the atmosphere, aquatic, and terrestrial environment. From
Figure 22 results the increasing amount of mercury in the environment that follows the trend of industrial development. A big part of mercury compounds provided via anthropogenic activity is released into the atmosphere, whence they are transported and transferred on water surface and soil. The mercury present under various chemical forms is highly accumulated in sediments and living organisms in the aquatic medium, as it is illustrated in the image of mercury circuit around environmental factors in
Figure 23. In accordance with EPA studies [
55], mercury is bio-accumulated from the aquatic medium into food, and then it is bio-concentrated into human body (
Figure 24).
The main source of mercury uptake in the human body is represented by contaminated food, especially fish. Methylmercury can bio-concentrate to 1 part per million, a level found in some top predators like tuna, swordfish, and shark, that means a million-fold increase, in water containing 1 to 10 parts of mercury per trillion parts of water.
From the pie diagram shown in
Figure 25, the main pollution source with mercury of the atmosphere, and implicitly of surface water and soil, is represented by gases released from fuel combustion in thermal power plants and building heating, as can be seen.
In the chemical industry, in the fabrication of caustic soda, dyes, paper, some pesticides and fungicides, pharmaceutic products and disinfectants, etc., mercury is frequently used. These industries are potential pollution sources with mercury, both by means of their wastes and of the products used in other industries. The inappropriate use of organomercury fungicides in the various phases of agriculture works could cause the poisoning of birds and animals, the contamination of soil, and transfer into the aquifer.
Vehicles using petrol or diesel oil release burning gases containing heavy metals such as lead, mercury, and cadmium (this last heavy metal has been also sampled and analyzed in the laboratory, but, the obtained concentration values being too small, the continuation of its study did not show any importance). It is considered that the mercury generated by road traffic represents cca 3% of the total amount released into the atmosphere from various sources; it is included in the sector of ‘Area sources’ in
Figure 25.
Severe measures have been taken to limit the use of organic mercury compounds in the paper industry and agriculture, by restricting the discharge of waste or banning fishing in some polluted areas, as a result of the awareness of the potential dangers caused via the mercury pollution of the ecosystem.
The weekly doses temporarily admitted are 0.3 mg total Hg per person, out of which less than 0.2 mg can be methylmercury, CH3Hg+ (expressed in mercury), which corresponds to 0.005 mg and 0.0033 mg/kg body, in accordance with the recommendations of the Expert Committee on Food Additives of FAO/OMS. A concentration of approx. 0.02 µg/g methylmercury in blood and 0.04 µg/g in erythrocytes is accepted for a safety coefficient equal to 10. When these doses are overpassed, a series of harmful effects on human organisms are manifested: nervous system diseases, the diminishing abilities of learning, memory losses, DNA alterations, allergies, etc.
Mercury, together with other heavy metals as lead and cadmium, is a toxic pollutant able to cause irreversible damage to the human body.
The maximum admitted limit of mercury is not attained in most of the surface waters around the Bucharest area, except the Morii, Griviţa, Străuleşti, and Băneasa Lakes, where this limit is accidentally overpassed.
The monitored data on mercury concentration form a time series, which is not under statistic control since it is not stationary, having trend and periodicity. The distribution law is asymmetrical and the capability index, Cp, is less than 1.
Another time series based on concentration fluctuations was built, after the removal of trend and periodicity from the initial data, in order to obtain the statistical control. The novel time series is under statistical control because its distribution law is symmetrical and near the normal distribution, and, as a result, the 3σ rule is applied. The initial series of constitutive elements is recomposed, and then trend, periodicity, and fluctuations are changed to draw valuable conclusions on monitoring mercury concentrations in surface waters.
The highest mercury concentrations can be correlated with the intensity of road traffic, since the time variation is cyclic with a 7-day period, with smaller values during weekend and bigger values during the working days.
Eutrophication is the enrichment of water with nutrients. This process induces the growth of the biomass load and consequently may result in the depletion of oxygen in the water. The discharge of phosphate-containing fertilizers and sewage into aquatic systems is the main cause of eutrophication [
57]. The phosphate-containing detergents were an important source of eutrophication and, since the end of the last century, gradually were replaced with environmentally-friendly detergents. The existence of phosphorus compounds causes a severe reduction of the quality of surface waters. Phosphorus is a necessary nutrient for plants and is the limiting factor for plant growth in many freshwater ecosystems. In agriculture, phosphate fertilizers adhere tightly to soil and so are transported by the erosion of lakes. The extraction of phosphate into water is slow, hence the difficulty of reversing the effects of eutrophication. Mainly due to the use of agricultural fertilizer, the rate of phosphorus cycling on Earth has increased by four times. As a result, the primary limiting factor for eutrophication is phosphate. In eutrophication management, the main component is the control of phosphorus sources.
The nutrients contained in organic matter are converted into inorganic forms by microorganisms in water systems when plants die. This process is oxygen-consuming, and it reduces the concentration of oxygen dissolved in water. The depleted oxygen levels lead to a range of effects reducing biodiversity. Nutrients can be concentrated in an anoxic zone and may only be made available again in conditions of turbulent flow, mainly generated by human activity. The enhanced growth of aquatic vegetation perturbs the normal functioning of the ecosystem, producing a variety of problems such as the lack of oxygen necessary for biotope survival. The value and aesthetic enjoyment of lakes decrease, since the water becomes muddy. The rate at which nutrients enter ecosystems can be accelerated by human activities. Although eutrophication is commonly caused by human activities, it can also be a natural process in lakes. Climate change is critical in regulating the natural productivity of nutrients in lakes. The main difference between natural and anthropogenic eutrophication is that the natural process is very slow, occurring on geological time scales.
Nitrogen is another typically limiting nutrient, because the chemical forms of this chemical element are related to eutrophication. The additions of nitrogen compounds stimulate the growth of plants since they have high nitrogen requirements. The ecosystems that receive more nitrogen than the plants require can contribute to water eutrophication. High levels of atmospheric compounds of nitrogen, mainly due to the traffic intensities, can increase nitrogen availability in surface waters.
Anyway, phosphorus fertilizers are generally much less soluble than nitrogen fertilizers. They are leached from the soil at a much slower rate than nitrogen. Therefore, phosphorus is much more important as a limiting nutrient in aquatic systems. The trophic state of lakes is related to phosphorus levels in water [
58,
59].
This article presents studies conducted between 2014 and 2016, regarding the eutrophication possibility of some lakes around Bucharest. The results prove that if current environmental conditions are maintained, there is no danger of eutrophication of these lakes. The obtained experimental data form a stationary time series with a uniform pattern. Two forecast methods have been applied that have yielded satisfactory results.