Investigation of the Wastewater Treatment Plant Processes Efficiency Using Statistical Tools

Abstract: The paper presents modelling of wastewater treatment plant (WWTP) operational efficiency using a two-stage method based on selected probability distributions and the Monte Carlo method. Calculations were carried out in terms of sewage susceptibility to biodegradation. Pollutant indicators in raw sewage and in sewage after mechanical and biological treatment were analysed: BOD5, COD, total suspended solids (TSS), total nitrogen (TN) and total phosphorus (TP). The compatibility of theoretical and empirical distributions was assessed using the Anderson-Darling test. The best-fitted statistical distributions were selected using the Akaike criterion. Of all the candidate distributions, the Gaussian mixture model (GMM) proved to be the best-fitted. The simulation results showed that the statistical tools used in this paper describe the changes of pollutant indicators correctly. The proposed calculation method can therefore be an effective tool for predicting the course of subsequent sewage treatment stages. Modelling results can be used to make a reliable assessment of sewage susceptibility to biodegradation expressed by the BOD5/COD, BOD5/TN and BOD5/TP ratios. New data generated this way can be helpful for assessing WWTP operation and for preparing different possible scenarios for its operation.


Introduction
Systematic control of the sewage quality at the outflow from a wastewater treatment plant (WWTP) is one of the elements of its proper operation. Control of the pollutant indicators in the sewage discharged into the natural environment makes it possible to assess the efficiency and reliability of WWTP operation. In addition, investigation of sewage quality after mechanical and biological treatment processes allows us to assess whether WWTP facilities or technological lines function properly. The subsequent stages of sewage treatment at mechanical and biological WWTPs have a clearly defined purpose: the elimination of a specific group of pollutants to the required level. For example, according to Dadrasnia et al. [1], at the mechanical treatment stage, in addition to a 50-60% reduction of total suspended solids, a 20-30% reduction of organic impurities expressed by BOD5 may be achieved. After the removal of mainly solid impurities in the mechanical line facilities of a WWTP, sewage undergoes biological processes. However, these give the required effect only if the right conditions for nitrification, denitrification and dephosphatation processes are ensured. The efficiency of these processes depends on the technical solutions of the WWTP facilities and the treatment technologies used. Therefore, the implementation of a model based on a multi-distribution analysis for the investigation of pollutant reduction after mechanical and biological treatment is indicated as a novelty aspect of this study.

Sustainability 2020, 12, x FOR PEER REVIEW 3 of 17

Case Study
The research was carried out at one of the WWTPs located in Cracow (Poland). The analysed WWTP receives sewage from both industrial plants and individual users; it uses two-stage sewage treatment along with full thermal utilization of sewage sludge. The hydraulic capacity of the biological line facilities is 328,000 m³/d and the average daily flow is 165,000 m³/d; the plant serves a population of 780,000. Figure 1 shows the location of the WWTP along with a simplified scheme of the mechanical and biological technological lines. Mechanical sewage treatment takes place first on coarse and fine screens, then in three horizontal grit chambers and in four preliminary settling tanks. In the next stage, sewage is subjected to biological treatment in five bioreactors operating in activated sludge technology. Before effluent from each of the five biological reactors is discharged into the environment, it flows through the secondary settling tanks, where activated sludge is separated and the effluent is clarified. The receiver of the treated sewage is the Drwina River (a tributary of the Vistula River) (Figure 1).

Methodology
The basis of the study was the observational data series of the pollutant indicators, which consisted of 44 elements, both for raw sewage and for sewage after mechanical treatment and biological treatment. These included biochemical five-day oxygen demand (BOD5), chemical oxygen demand (COD), total suspended solids (TSS), total nitrogen (TN) and total phosphorus (TP). The study was carried out in several stages: preliminary statistical data analysis, selection of the best-fitted theoretical distributions to the empirical distributions, modelling of the pollutant indicator values in raw sewage and in sewage after mechanical treatment and biological treatment along with the analysis of the accuracy of the two-stage treatment process and the assessment of sewage susceptibility to biodegradability.

Preliminary Statistical Data Analysis
As part of the preliminary analysis of the observational data series of pollutant indicators in raw sewage and in sewage after mechanical and biological treatment, the following descriptive statistics were determined: location measures (minimum, maximum, mean) and dispersion measures (standard deviation $S$, coefficient of variation $C_s$).
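The descriptive statistics listed above can be illustrated with a minimal sketch. It assumes the sample standard deviation is used for $S$ and that $C_s$ is the ratio of $S$ to the mean; the function name `describe` and the concentration values are hypothetical and are not the plant's data.

```python
import statistics

def describe(series):
    """Return location and dispersion measures for one pollutant indicator series."""
    mean = statistics.fmean(series)
    std = statistics.stdev(series)  # sample standard deviation S (assumed convention)
    return {
        "min": min(series),
        "max": max(series),
        "mean": mean,
        "S": std,
        "Cs": std / mean,  # coefficient of variation (dimensionless)
    }

# Hypothetical BOD5 concentrations in mg/L (illustrative only).
bod5_raw = [310.0, 420.0, 275.0, 390.0, 505.0, 330.0, 460.0, 295.0]
stats = describe(bod5_raw)
print({name: round(value, 2) for name, value in stats.items()})
```

The same function can be applied to each of the five indicators at each of the three treatment stages.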

Analysis of the Best-Fitted Theoretical Distributions
Based on the observational data series of pollutant indicators, an attempt was made to determine the best-fitted theoretical distributions for the empirical distributions. The following theoretical distributions were considered as candidate probability distributions: generalized extreme value (GEV), Gaussian mixture model (GMM), log-normal, normal, triangular and Weibull. The group of candidate distributions was selected primarily because these distributions are commonly used to describe continuous random variables. Because of their properties, they can be applied to the analysis of the operation of water and sewage facilities [19]. Moreover, besides the normal and Weibull distributions [16][17][18][19], applications of the other distributions in water and wastewater sciences can be found in the papers of Coronado-Hernández et al. [20] (GEV), Fan et al. [21] and Sun et al. [22] (GMM), Oliveira et al. [23] and Van Buren et al. [24] (log-normal) and Wang and Ran [25] (triangular).
GMM distribution:

$$f(x) = \sum_{k=1}^{K} \omega_k \, N_P(0_P, \Sigma_k)$$

where $K$ is the number of components, $\omega_k$ are weights such that $\sum_k \omega_k = 1$, and $N_P(0_P, \Sigma_k)$ is a multivariate Gaussian distribution.

Log-normal distribution:

$$f(x) = \frac{1}{x \sigma \sqrt{2\pi}} \exp\left(-\frac{(\ln x - \mu)^2}{2\sigma^2}\right)$$

Normal distribution:

$$f(x) = \frac{1}{\sigma \sqrt{2\pi}} \exp\left(-\frac{(x - \mu)^2}{2\sigma^2}\right)$$

where $\sigma^2$ is the variance and $\mu$ is the mean (for the log-normal distribution, of $\ln x$).

Triangular distribution:

$$f(x) = \begin{cases} \dfrac{2(x - a)}{(b - a)(m - a)}, & a \le x \le m \\ \dfrac{2(b - x)}{(b - a)(b - m)}, & m < x \le b \\ 0, & \text{otherwise} \end{cases}$$

where $a$ is the lower limit, $m$ is the mode and $b$ is the upper limit.

Weibull distribution:

$$f(x) = \frac{\kappa}{\alpha} \left(\frac{x}{\alpha}\right)^{\kappa - 1} \exp\left[-\left(\frac{x}{\alpha}\right)^{\kappa}\right], \quad x \ge 0$$

where $\kappa$ is the shape parameter and $\alpha$ is the scale parameter.

The compatibility of the theoretical distributions with the empirical distributions of pollutant indicators was assessed using the Anderson-Darling (A-D) test. The A-D statistic is sensitive over the whole range of the distribution, so differences between distributions, if any, are more likely to be detected. It therefore gives a better compatibility assessment than other tests. The null hypothesis $H_0$ of this test states that the theoretical distribution is compatible with the empirical distribution, while the alternative hypothesis $H_1$ states that it is not. The calculations were carried out for a significance level of α = 0.05. The Anderson-Darling statistic is described by the following dependence [32][33][34]:

$$A^2 = n \int_{-\infty}^{\infty} \frac{\left[F_n(x) - F(x)\right]^2}{F(x)\left[1 - F(x)\right]} \, dF(x)$$

where $n$ is the number of observations, $F$ is the theoretical cumulative distribution and $F_n$ is the empirical cumulative distribution. Because the critical values of the A-D statistic depend on the type of probability distribution being tested, the hypotheses were verified based on the p-values. If the obtained p-value was less than the significance level α = 0.05, the hypothesis $H_0$ was rejected in favour of the alternative hypothesis $H_1$.
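For an ordered sample $x_{(1)} \le \dots \le x_{(n)}$, the A-D statistic is usually evaluated with the computational form $A^2 = -n - \frac{1}{n}\sum_{i=1}^{n}(2i-1)\left[\ln F(x_{(i)}) + \ln\left(1 - F(x_{(n+1-i)})\right)\right]$. The sketch below is one possible way to compute it, assuming a normal candidate distribution fitted by the sample mean and standard deviation. The function names and the synthetic samples are hypothetical, and the p-value step used in the paper is not reproduced here.

```python
import math
import random
from statistics import NormalDist, fmean, stdev

def anderson_darling(sample, cdf):
    """A-D statistic A^2 of `sample` against a theoretical CDF."""
    xs = sorted(sample)
    n = len(xs)
    eps = 1e-12  # guard against log(0) in the extreme tails
    total = 0.0
    for i, x in enumerate(xs, start=1):
        f_low = min(max(cdf(x), eps), 1.0 - eps)           # F(x_(i))
        f_high = min(max(cdf(xs[n - i]), eps), 1.0 - eps)  # F(x_(n+1-i))
        total += (2 * i - 1) * (math.log(f_low) + math.log(1.0 - f_high))
    return -n - total / n

def ad_against_normal(sample):
    """Fit a normal distribution by moments and return A^2 for that fit."""
    fitted = NormalDist(fmean(sample), stdev(sample))
    return anderson_darling(sample, fitted.cdf)

rng = random.Random(0)
# Synthetic series (mg/L): one roughly normal, one strongly right-skewed.
normal_like = [rng.gauss(300.0, 40.0) for _ in range(200)]
skewed = [rng.expovariate(1.0 / 300.0) for _ in range(200)]
a2_normal = ad_against_normal(normal_like)
a2_skewed = ad_against_normal(skewed)
print(round(a2_normal, 3), round(a2_skewed, 3))
```

A larger value of $A^2$ indicates a poorer fit, so the skewed series yields a much larger statistic against the normal candidate than the normal-like series does.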

Selection of the Best-Fitted Theoretical Distributions
If the random variables supported more than one hypothesis about the empirical distribution (i.e., it was possible to describe them using more than one theoretical distribution), the best-fitted theoretical distribution was determined using the Akaike criterion (AIC) (Equation (10) [35]). The criterion is calculated from k, the number of distribution parameters; n, the sample size; and RMSE, the root mean square error. The best-fitted theoretical distribution for the empirical distribution of a random variable is the one for which the minimum AIC value is obtained.
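The selection rule can be sketched as follows. The exact form of Equation (10) is not reproduced here; the sketch assumes one common least-squares form, $\mathrm{AIC} = n \ln(\mathrm{RMSE}^2) + 2k$, which uses the same three quantities ($k$, $n$, RMSE). The function names and the RMSE values are hypothetical.

```python
import math

def aic_from_rmse(n, k, rmse):
    """Assumed least-squares AIC: n * ln(RMSE^2) + 2k (the paper's form may differ)."""
    return n * math.log(rmse ** 2) + 2 * k

def select_best(n, candidates):
    """candidates maps name -> (k, rmse); return the name with minimum AIC and all scores."""
    scores = {name: aic_from_rmse(n, k, rmse) for name, (k, rmse) in candidates.items()}
    return min(scores, key=scores.get), scores

# Hypothetical fit results; only distributions accepted by the A-D test belong here.
candidates = {
    "log-normal": (2, 10.0),
    "GEV": (3, 9.0),
    "GMM": (5, 8.9),  # two components: 2 means, 2 variances, 1 free weight
}
best, scores = select_best(44, candidates)
print(best, {name: round(value, 2) for name, value in scores.items()})
```

In this made-up case the five-parameter mixture has the lowest RMSE, but the penalty term $2k$ outweighs its small gain in fit, which is exactly the trade-off the criterion is meant to express.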

Modelling of the Pollutant Indicator Values
Modelling of the pollutant indicator values was performed using the Monte Carlo method. The essence of this method is to use a series of random numbers to build a sample from a hypothetical population. Based on this sample, it is possible to determine statistical estimators of the distribution parameters. In practice, this amounts to creating a stochastic model describing a specific phenomenon. The model is then run repeatedly using randomly generated variables (in compliance with the considered probability distribution), and the obtained results are analysed statistically. The Monte Carlo model can be described as follows. In order to determine a value $m$, a random variable $\xi$ is assumed such that $M\xi = m$ and the dispersion $D\xi$ is finite. For $N$ independent values of the random variable, $\xi_1, \xi_2, \ldots, \xi_N$, it can be determined, as Equation (11) shows:

$$\frac{1}{N} \sum_{i=1}^{N} \xi_i \approx m \tag{11}$$

The above estimate is known as the Monte Carlo estimate of the variable $m$. The accuracy of this method depends on the dispersion $D\xi$. In general, the Monte Carlo method does not define a single calculation algorithm, because the random variables can be modelled in different ways, for example:

$$\xi = g(\gamma_1, \gamma_2)$$

where $g$ relates to the method used and $\gamma_1$, $\gamma_2$ are simulated values.
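The estimate in Equation (11) can be illustrated with a short sketch. It relies on the standard result that the standard error of the sample mean is $\sqrt{D\xi / N}$, which is why the accuracy depends on the dispersion. The function name and the distribution parameters are hypothetical.

```python
import math
import random

def monte_carlo_mean(draw, n_draws):
    """Estimate m = M(xi) by the mean of N draws; also return its standard error."""
    values = [draw() for _ in range(n_draws)]
    estimate = sum(values) / n_draws
    variance = sum((v - estimate) ** 2 for v in values) / (n_draws - 1)
    return estimate, math.sqrt(variance / n_draws)

rng = random.Random(42)
# Hypothetical random variable: a concentration with mean 300 mg/L and std 50 mg/L.
draw = lambda: rng.gauss(300.0, 50.0)
for n_draws in (100, 10_000):
    estimate, std_error = monte_carlo_mean(draw, n_draws)
    print(n_draws, round(estimate, 2), round(std_error, 2))
```

Increasing $N$ by a factor of 100 reduces the standard error by roughly a factor of 10.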
In this paper, simulations were made for the best-fitted statistical distributions. In one simulation cycle, 365 random values of a pollutant indicator were generated. Each simulation cycle was repeated 100 times, and 24 pollutant indicator values were then randomly selected from each of the 100 repetitions. The presented methodology was used for all five pollutant indicators considered in this study and for the three sewage treatment stages (raw sewage, mechanically treated sewage, biologically treated sewage).
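The simulation protocol described above (365 values per cycle, 100 cycles, 24 values drawn from each cycle) could be sketched as follows. The two-component Gaussian mixture and its parameters are hypothetical and not fitted to the plant's data. Sampling the 24 values without replacement and truncating negative draws to zero are assumptions of this sketch, not details confirmed by the paper.

```python
import random

def sample_gmm(rng, weights, means, sigmas):
    """Draw one value from a univariate Gaussian mixture."""
    k = rng.choices(range(len(weights)), weights=weights)[0]
    return rng.gauss(means[k], sigmas[k])

def simulate(rng, draw, days=365, cycles=100, picks=24):
    """Run `cycles` cycles of `days` values and keep `picks` random values from each."""
    selected = []
    for _ in range(cycles):
        cycle = [draw(rng) for _ in range(days)]
        selected.append(rng.sample(cycle, picks))  # without replacement (assumed)
    return selected

rng = random.Random(2020)
# Hypothetical mixture for one indicator (mg/L); negative draws are truncated to zero.
draw = lambda r: max(0.0, sample_gmm(r, [0.7, 0.3], [300.0, 450.0], [40.0, 60.0]))
result = simulate(rng, draw)
values = [v for cycle in result for v in cycle]
mean_value = sum(values) / len(values)
print(len(result), len(values), round(min(values), 1), round(mean_value, 1), round(max(values), 1))
```

The pooled values give the simulated minimum, mean and maximum for one indicator at one treatment stage; the same procedure would be repeated for each indicator and stage.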

Simulation of the Sewage Quality Changes in Subsequent Treatment Stages
The results obtained by modelling were used to determine the predicted minimum, maximum and mean values of BOD5, COD, total suspended solids (TSS), total nitrogen (TN) and total phosphorus (TP) in raw sewage and in sewage after mechanical and biological treatment. The percentage reduction of each pollutant indicator in the subsequent treatment stages was also determined. Additionally, based on the Polish regulations [36], the quality of the effluent was compared with the quality required for sewage discharged to the natural environment.
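The percentage reduction between stages is a simple ratio, shown here as a short sketch using the mean simulated TSS concentrations quoted in the results of this paper (329.0, 197.0 and 11.3 mg/L). The function name is hypothetical.

```python
def percent_reduction(inflow, outflow):
    """Percentage reduction of a pollutant indicator between two treatment stages."""
    return 100.0 * (inflow - outflow) / inflow

# Mean simulated TSS concentrations (mg/L) quoted in the results of this paper.
tss_raw, tss_mechanical, tss_biological = 329.0, 197.0, 11.3
mechanical = percent_reduction(tss_raw, tss_mechanical)
overall = percent_reduction(tss_raw, tss_biological)
print(round(mechanical, 1), round(overall, 1))
```

These values reproduce the reported reductions of about 40% after mechanical treatment and nearly 97% after the full two-stage treatment.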

Simulation of Sewage Susceptibility to Biodegradability
Simulated values of the pollutant indicators in raw sewage, in mechanically treated sewage and in biologically treated sewage enabled us to assess sewage susceptibility to biodegradation at each treatment stage. For this purpose, the BOD5/COD (organic matter decomposition), BOD5/TN (nitrogen decomposition) and BOD5/TP (phosphorus decomposition) indexes were used. Biodegradability measures presented in the literature by Tomašić and Zelić [5] were used to assess the sewage susceptibility to the decomposition of organic compounds. The efficiency of nitrogen compound decomposition was assessed in relation to the optimum values of the BOD5/TN ratio provided by Jansen et al. [7]. In turn, as the measure of sewage susceptibility to effective dephosphatation, the minimum required BOD5/TP ratio presented by Jansen et al. [7] and Cooper et al. [8] was used.
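The three indexes can be computed directly from the simulated concentrations. In the minimal sketch below, the concentrations are hypothetical and were chosen only to reproduce the raw-sewage ratios reported in this paper (0.5, 5 and about 45). The only threshold used is the minimum BOD5/TP ratio of 20 quoted in the paper; the function and constant names are hypothetical.

```python
def biodegradability_ratios(bod5, cod, tn, tp):
    """Return the three susceptibility indexes used in the paper."""
    return {"BOD5/COD": bod5 / cod, "BOD5/TN": bod5 / tn, "BOD5/TP": bod5 / tp}

# Hypothetical raw-sewage concentrations (mg/L), chosen to give ratios of 0.5, 5 and 45.
ratios = biodegradability_ratios(bod5=450.0, cod=900.0, tn=90.0, tp=10.0)

# Minimum BOD5/TP ratio for effective dephosphatation, as quoted in the paper.
MIN_BOD5_TP = 20.0
supports_p_removal = ratios["BOD5/TP"] >= MIN_BOD5_TP
print(ratios, supports_p_removal)
```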

Preliminary Statistical Data Analysis
As a part of the preliminary data analysis, descriptive statistics including location and dispersion measures were determined for the observational data series of pollutant indicators in raw sewage and in sewage after mechanical and biological treatment. The results of the analysis are presented in Table 1. Based on the results summarized in Table 1, it was found that for BOD5, the differences between the extreme values (minimum and maximum) were 84%, 72% and 94% for raw sewage, sewage after mechanical treatment and sewage after biological treatment, respectively. In the case of COD, these differences were similar. For total suspended solids (TSS), the differences between the minimum and maximum values were clearly greater: 91%, 90% and 98%, respectively. In the case of biogenic compounds, the differences between the extreme concentrations in raw sewage and in sewage after mechanical and biological treatment were 71%, 70% and 80% (total nitrogen) and 83%, 77% and 96% (total phosphorus). It should be emphasized that the quality of raw sewage flowing into the analysed WWTP did not differ from the typical composition of municipal sewage in Southern Poland, as indicated by Kaczor [37]. For both TN and TP, the mean concentrations in mechanically treated sewage were greater than in raw sewage. This can be explained by the fact that raw sewage may contain uncrushed suspended solids carrying nitrogen and phosphorus compounds. Over a longer period of time, these compounds could have been released into the sewage, increasing the concentration of biogenic compounds [38]. The calculated values of the coefficient of variation $C_s$ indicate average variability of the composition of raw sewage and of sewage after mechanical treatment.
In turn, in the case of the sewage after biological treatment, the pollutant indicators' variability was much greater. This is characteristic of municipal sewage, as is demonstrated by the research carried out by Chmielowski et al. [10], Bugajski et al. [16], Kaczor et al. [39] and Wąsik and Chmielowski [40].

Verification of the Theoretical Probability Distributions
The compatibility of the theoretical and empirical distributions of the pollutant indicators was verified for the following candidate distributions: generalized extreme value (GEV), Gaussian mixture model (GMM), log-normal, normal, triangular and Weibull. For this purpose, the Anderson-Darling (A-D) test at the significance level of α = 0.05 was used. If the p-value was less than the assumed significance level, the theoretical distribution was considered incompatible with the empirical distribution. The results of the analysis are presented in Table 2. As shown in Table 2, the observational data series of the oxygen pollutant indicators (BOD5 and COD) in raw sewage and in sewage after mechanical treatment can be described using all of the analysed theoretical distributions except the triangular distribution. In the case of sewage after biological treatment, the BOD5 and COD series additionally cannot be described by the normal and Weibull distributions. In the case of total suspended solids (TSS), the empirical distributions for raw sewage and for sewage after mechanical treatment were not compatible with the normal and triangular distributions. For sewage after biological treatment, the Weibull distribution was additionally excluded. Independent of the treatment stage, TSS can therefore be described using the remaining distributions (GEV, GMM and log-normal). The empirical distributions of TP in raw sewage cannot be described using the triangular distribution; for sewage after biological treatment, the normal and Weibull distributions were additionally excluded. In mechanically and biologically treated sewage, the total nitrogen observational data series can be described by all of the analysed functions. The analysis showed that in almost every case, the triangular distribution cannot be used to describe the observational data series of the pollutant indicators.
It should be emphasized that in the cases where the triangular distribution was found to be compatible with the empirical distribution, the obtained p-values were usually at the limit of acceptability of the hypothesis $H_0$. The inability to use the triangular distribution results from the properties of this function. As Pereira et al. [41] and Stein and Keblis [42] indicate, the triangular distribution is a homogeneous function characterized by a minimum value, a most likely value and a maximum value. Sewage treatment is a dynamic process that is affected by many factors changing over time, so the pollutant indicators in treated sewage are highly variable. Hence, it is difficult to specify the minimum, most frequent and maximum values that the triangular distribution requires. In addition, it should be emphasized that in reality, very few types of random variables can be described using the triangular distribution.

Selection of the Best-Fitted Probability Distribution
Because the previous results have shown that the observational data series of the pollutant indicators can be described using more than one theoretical distribution, the best-fitted distributions were selected using the Akaike criterion (AIC). Calculations were carried out only for those functions for which the A-D test confirmed the compatibility of the theoretical and empirical distributions. The function with the lowest AIC value was accepted as the best-fitted one. The results of the analysis are summarized in Table 3. Figures 2-6 show quantile-quantile (Q-Q) charts for the best-fitted statistical distributions.

Table 3 shows that for BOD5, the best-fitted theoretical distribution in raw sewage was the GEV distribution, while for sewage after mechanical and biological treatment, it was the GMM distribution. The GMM distribution was also the best for describing COD in raw sewage, mechanically treated sewage and biologically treated sewage. For total suspended solids, the GEV distribution best describes the TSS concentration in raw sewage and in sewage after biological treatment, while the GMM distribution turned out to be the best for mechanically treated sewage. For the biogenic compounds, both total nitrogen (TN) and total phosphorus (TP), the GMM distribution was the best-fitted function at each treatment stage. The obtained results show that, independent of the sewage treatment stage, the GMM distribution described the pollutant indicator values best in most cases (12 out of 15) (Table 3). Similar results were presented by Młyński et al. [19], who also showed that the GMM distribution is the best for describing pollutant indicator values. The possibility of using the GMM distribution for the control of WWTP operation has also been presented by Bouzenad and Ramdani [43] and Yu [44].

Analysis of Expected Sewage Quality Changes after Mechanical and Biological Treatment
Simulated values of BOD5 (Figure 7a) and COD (Figure 7b) indicate adequate efficiency of the organic pollutant removal processes. This shows that the operational efficiency of the analysed WWTP is typical of two-stage facilities. Mechanical treatment is expected to provide an average BOD5 and COD reduction of about 22%. In turn, as a result of mechanical and biological treatment, it is possible to achieve approximately 98% BOD5 reduction and almost 95% COD reduction (Table 4); these meet the Polish requirements [36] on minimum pollutant reduction. In addition, even the maximum values of COD after biological treatment (Figure 7b) are not expected to exceed the permissible level.

As the simulation results show, mechanical treatment is expected to provide a 40% reduction of total suspended solids (TSS), while the two-stage treatment as a whole can achieve nearly 97% TSS reduction (Table 4). The average concentration of total suspended solids changed from approximately 329.0 mg/L (raw sewage), through about 197.0 mg/L (mechanically treated sewage), to 11.3 mg/L in biologically treated sewage (Figure 7c). This indicates that the facilities of the WWTP technological line are properly adapted to the removal of total suspended solids.
Simulated values of total nitrogen (TN) and total phosphorus (TP) concentration in the subsequent sewage treatment stages (Figure 7d,e), like those of the other pollutant indicators, confirm that the statistical tools used are consistent with the efficiency of this type of facility. Mechanical treatment can reduce biogenic compounds by only a few to a dozen or so percent; biological processes are largely responsible for the removal of nitrogen and phosphorus compounds. As can be seen (Figure 7d), TN concentrations exceeding the permissible concentration (10.0 mg/L) defined by the regulations [36] may be periodically recorded in the sewage discharged from the considered WWTP. However, the percentage reduction of TN at the WWTP outflow (76.6%) (Table 4) meets the required TN reduction. TP concentrations exceeding the permissible concentration at the WWTP outflow can also be observed. Nevertheless, the simulated average phosphorus concentration in treated sewage (Figure 7e) is 50% less than the limit value of 1.0 mg/L [36].
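The comparison of simulated effluent concentrations with a permissible limit can be expressed as an exceedance share. The sketch below uses a hypothetical log-normal TP series, not the paper's simulated data, with parameters chosen so that the mean is close to 0.5 mg/L. Only the limit of 1.0 mg/L is taken from the text, and the function name is hypothetical.

```python
import random

def exceedance_share(values, limit):
    """Fraction of values strictly above a permissible concentration."""
    return sum(v > limit for v in values) / len(values)

rng = random.Random(7)
TP_LIMIT = 1.0  # mg/L, limit value quoted in the text
# Hypothetical simulated effluent TP series: log-normal with a median of about 0.45 mg/L.
tp_effluent = [rng.lognormvariate(-0.8, 0.5) for _ in range(2400)]
share = exceedance_share(tp_effluent, TP_LIMIT)
mean_tp = sum(tp_effluent) / len(tp_effluent)
print(round(mean_tp, 2), round(100 * share, 1))
```

A series can therefore have a mean well below the limit and still contain occasional exceedances, which is the situation described for TN and TP above.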
The results presented in this paper correspond with the results presented in other authors' papers, i.e., [10,11]. At the same time, it can be concluded that the model used correctly describes the changes in the sewage quality in the two-stage operated WWTP. Simulated values of the pollutant indicators in raw sewage and in sewage treated mechanically and biologically, as well as the percentage reduction, are typical for properly operated mechanical and biological WWTPs.

Assessment of Expected Sewage Susceptibility to Biodegradability
As the next stage of the study, based on the simulation results, sewage susceptibility to organic matter decomposition, expressed by the BOD5/COD ratio, was assessed. The obtained results indicate that the composition of both raw sewage and mechanically treated sewage will ensure a proper course of the biological processes of organic matter removal; this is due to the high value of the biodegradability index (BOD5/COD = 0.5) (Figure 8). With reference to the literature data presented by Henze et al. [45], it can be stated that for municipal sewage, a BOD5/COD ratio of 0.5 is the average value of this parameter. In turn, a BOD5/COD ratio of 0.2 after biological treatment indicates that the facilities of the WWTP biological treatment line ensure a high efficiency of organic matter removal.

Based on the obtained results, it can be stated that the mutual proportions between the content of organic carbon and nitrogen compounds will provide the right conditions for the proper course of the denitrification process (Figure 8). This is demonstrated by the BOD5/TN ratio, whose value is greater than the minimum recommended by Jansen et al. [7]. Similarly, in order to achieve effective removal of phosphorus compounds during the dephosphatation process, it is necessary to ensure a BOD5/TP ratio of at least 20; in this case, it has been achieved. As the treatment stages proceed, the BOD5/TN and BOD5/TP ratios are expected to decrease gradually. After biological treatment, these values will be significantly lower than those of the raw sewage. The results obtained for the sewage after biological treatment, i.e., BOD5/TN = 0.5 and BOD5/TP = 14.3, indicate the depletion of biodegradable substances and the cessation of further degradation of biogenic compounds. In addition, based on Henze et al. [45], it can be stated that in municipal sewage, a BOD5/TN of 5 (Figure 8) belongs to the average range of typical values. In turn, the calculated BOD5/TP value of about 45 (Figure 8) for raw sewage can definitely be considered high.
Because the obtained results are compatible with those presented, e.g., by Chmielowski et al. [10] and Młyńska et al. [11], the statistical tools used in this paper can be considered accurate for the simulation of pollutant indicators. The mutual proportions between pollutant indicators in subsequent treatment stages were similar to those presented by the authors of [10,11].

Summary and Conclusions
The novelty of this paper is a methodology for simulating WWTP effectiveness in terms of pollutant reduction in the subsequent stages of mechanical and biological treatment. In contrast to other research, the proposed methodology is based on a multi-distribution analysis. Based on the empirical data, a two-stage statistical analysis was conducted: first, the best-fitted probability distributions were selected from the group of candidate distributions (GEV, GMM, log-normal, normal, Weibull and triangular), and then the pollutant indicator values, including BOD5, COD, total suspended solids (TSS), total nitrogen (TN) and total phosphorus (TP), were simulated. The simulation was performed using the Monte Carlo method based on the selected theoretical distributions. The Anderson-Darling test showed that the triangular distribution should not be used to describe the observational data series of the pollutant indicators. Based on the Akaike criterion, the GMM distribution turned out to be the best-fitted statistical distribution.
The simulation results indicate that the statistical tools used in this paper accurately describe the changes of BOD5, COD, TSS, TN and TP values during sewage treatment. The results of modelling can be used to make a reliable assessment of sewage susceptibility to the biological decomposition of organic matter and biogenic compounds. This is evidenced by the calculated values of the BOD5/COD, BOD5/TN and BOD5/TP ratios, which correspond with the typical values for properly operated mechanical and biological WWTPs.
To summarize, the two-stage calculation method proposed in this paper can be an effective tool for predicting the course of WWTP processes, provided that an appropriate theoretical distribution of random variables is determined. It then becomes possible to generate new data while maintaining the existing correlation structures between variables. In terms of the application of the proposed model, the values generated this way can be helpful for assessing a WWTP's operation and for preparing different possible scenarios for its operation. The ability to predict sewage quality in subsequent treatment stages makes the operation of WWTP facilities easier and more effective.