Urban Floods: Linking the Overloading of a Storm Water Sewer System to Precipitation Parameters

: The lack and ine ﬃ ciency of urban drainage systems, as well as extreme precipitation, can lead to system overloading and, therefore, an urban pluvial ﬂood. The study brings insights into this phenomenon from the perspective of the statistical relationship between precipitation and ﬂooding parameters. The paper investigates the possibility of predicting sewer overloading based on the characteristics of the upcoming rain event using the Storm Water Management Model (SWMM) and statistical methods. Additionally, it examines the inﬂuence of precipitation resolution on the model sensitivity regarding ﬂoods. The study is set in a small urban catchment in Dresden (Germany) with a separated stormwater sewer system (SWSS). The ﬂood-event-based calibrated model runs with observed and designed heavy rain events of various sums, durations, and intensities. Afterward, the analysis focuses on precipitation and model overloading parameters (total ﬂood volume, maximum ﬂooding time and ﬂow rate, and maximum nodal water depth) with pairwise correlation and multi-linear regression (MLR). The results indicate that it is possible to deﬁne a certain threshold (or range) for a few precipitation characteristics, which could lead to an urban ﬂood, and ﬁtting MLR can noticeably improve the predictability of the SWSS overloading parameters. The study concludes that design and observed rain events should be considered separately and that the resolution of the precipitation data (1 / 5 / 10 min) does not play a signiﬁcant role in SWSS overloading.


Introduction
In 2018, more than half of the world's population lived in urban settlements, and by 2030, this number is projected to increase by 10%. Moreover, each third person will live in cities with at least half a million inhabitants [1]. Urbanization causes the disturbance of natural landscapes by the replacement of the vegetated surfaces with impermeable surfaces, leading to significant hydrologic changes in cities [2] such as increased surface runoff during heavy precipitation events, generating urban floods. There are several definitions of this phenomenon, i.e.,: • Urban floods are events that cause damage in small catchment areas of less than 100 km 2 (even less than 10 km 2 ). They are trigged by small-scale rain events with volumes far above design rainfall for the concerned hydrological structures [3].

•
Urban floods or pluvial flooding in urban areas is the result of high-intensity or prolonged heavy rainfall leading to overland flow and ponding. They can be produced due to the exceedance or blockage of sewer and drainage systems, or high water levels in receiving watercourses [4].
• Detect the potential SWSS overloading based on the precipitation forecast; • Identify the heavy precipitation characteristics with the highest prediction capacity for SWSS inundation volume, time, and rate; • Compare results from pairwise correlation and multi-linear regression (MLR) approaches in predicting SWSS overloading accurately.
The paper is divided into five parts. The introduction to the general topic is followed by an overview of the study site and data used (Section 2.1) and model build-up (Section 2.2). The methodology part presents a model calibration procedure (Section 2.3), explains how the overloading of the drainage system is implemented in the SWMM (Section 2.4), and describes the statistical methods for the post-processing of the results (Section 2.5). The results part contains the model set-up and calibration performance (Section 3.1), the outcome of the model runs with different scenarios (Section 3.2), the statistical analysis of the results with pairwise correlations (Section 3.3) and MLR (Section 3.4), and a discussion of the influence of precipitation resolution (Section 3.5). Finally, conclusions are drawn and outlooks are discussed in Section 4.

Data and Methods
This section describes the general methods and tools used to complete the previously listed tasks as well as a description of the study site and data specification. This includes data collection and processing, the set-up and calibration of the SWMM, and the statistical analysis of the results. Figure 1 presents the applied workflow. It starts with the collection and processing of terrain and land cover data, all possible information on the sewer system, and climate data. Then, the model is set-up in the SWMM and calibrated using observed time series and design/observed heavy precipitation events used for the model forcing. Finally, the output data are statistically analyzed with a focus on the SWSS overloading.
Hydrology 2020, 7, x FOR PEER REVIEW 3 of 26 • Identify the heavy precipitation characteristics with the highest prediction capacity for SWSS inundation volume, time, and rate; • Compare results from pairwise correlation and multi-linear regression (MLR) approaches in predicting SWSS overloading accurately. The paper is divided into five parts. The introduction to the general topic is followed by an overview of the study site and data used (Section 2.1) and model build-up (Section 2.2). The methodology part presents a model calibration procedure (Section 2.3), explains how the overloading of the drainage system is implemented in the SWMM (Section 2.4), and describes the statistical methods for the post-processing of the results (Section 2.5). The results part contains the model setup and calibration performance (Section 3.1), the outcome of the model runs with different scenarios (Section 3.2), the statistical analysis of the results with pairwise correlations (Section 3.3) and MLR (Section 3.4), and a discussion of the influence of precipitation resolution (Section 3.5). Finally, conclusions are drawn and outlooks are discussed in Section 5.

Data and Methods
This section describes the general methods and tools used to complete the previously listed tasks as well as a description of the study site and data specification. This includes data collection and processing, the set-up and calibration of the SWMM, and the statistical analysis of the results. Figure 1 presents the applied workflow. It starts with the collection and processing of terrain and land cover data, all possible information on the sewer system, and climate data. Then, the model is set-up in the SWMM and calibrated using observed time series and design/observed heavy precipitation events used for the model forcing. Finally, the output data are statistically analyzed with a focus on the SWSS overloading.

Study Area and Required Initial Data
The study site is located in the south-eastern part of Dresden, Germany (51.00277 0 , 13.8374 0 , Figure 2) and has a drainage area of 0.12 km 2 . The land cover shows a typical urban configuration with approximately 50% of impervious areas: buildings (block living, schools, and grocery stores), pavements, parking lots, and green (mainly grass) areas. The topography consists of mostly flat

Study Area and Required Initial Data
The study site is located in the south-eastern part of Dresden, Germany (51.00277 • , 13.8374 • , Figure 2) and has a drainage area of 0.12 km 2 . The land cover shows a typical urban configuration with approximately 50% of impervious areas: buildings (block living, schools, and grocery stores), pavements, parking lots, and green (mainly grass) areas. The topography consists of mostly flat slopes with a general drainage direction to the north-east. In this specific area, a separated urban drainage system was installed initially, meaning that stormwater and sewage flow in parallel, but separated, pipes. The SWSS is formed of approximately 2.7 km of pipes, connected by 91 manholes, receiving water through 69 gully pots and discharges water in the local creek, which then flows out to the Elbe.
Hydrology 2020, 7, x FOR PEER REVIEW  4 of 26 slopes with a general drainage direction to the north-east. In this specific area, a separated urban drainage system was installed initially, meaning that stormwater and sewage flow in parallel, but separated, pipes. The SWSS is formed of approximately 2.7 km of pipes, connected by 91 manholes, receiving water through 69 gully pots and discharges water in the local creek, which then flows out to the Elbe.  Table 1 presents the required data for the modeling in the SWMM and the used sources.  [28], Koordinierte Starkniederschlagsregionalisierung undauswertung (KOSTRA) des DWD for design precipitation [29] The specification of an SWSS should include at least a detailed geometry of the pipe system and connections, such as elevations, lengths, diameters, shapes, the materials of pipe sections, and junctions, which are usually obtained from construction/monitoring documentation or field surveys. For the study, SWSS data were produced and provided by the Urban Water Management Department of TU Dresden; however, a small field survey was additionally conducted to correct the positions of some manholes and locate all the gully-pots, since they were not presented on the  Table 1 presents the required data for the modeling in the SWMM and the used sources.  [28], Koordinierte Starkniederschlagsregionalisierung und -auswertung (KOSTRA) des DWD for design precipitation [29] The specification of an SWSS should include at least a detailed geometry of the pipe system and connections, such as elevations, lengths, diameters, shapes, the materials of pipe sections, and junctions, which are usually obtained from construction/monitoring documentation or field surveys. For the study, SWSS data were produced and provided by the Urban Water Management Department of TU Dresden; however, a small field survey was additionally conducted to correct the positions of some manholes and locate all the gully-pots, since they were not presented on the schemes.
Detailed information on the catchment is required to set-up the general boundaries of the SWSS, delineate sub-catchments, calculate drainage areas and terrain slopes, assign flow pathways to SWSS inlets, and determine the main catchment parameters (roughness, width, permeability, infiltration, Hydrology 2020, 7, 35 5 of 23 depression storages, etc.). These data can be retrieved from satellite images, remote sensing elevation data or topographic maps, and field surveys, and some parameters can be found in textbooks. For the present study, a digital terrain model (DTM) of one-meter resolution (produced and provided by the TU Dresden), OpenLayer QGIS plugin [30] (which incorporates Open Street Maps and satellite images (Bing and Google)), and results of a small field survey were used.
Climate data include those of precipitation and temperature. While the first are the critical data for the model forcing, the second are additional ones to estimate daily evaporation (Hargreaves' method). The main intention was to simulate heavy rain precipitation events. Therefore, assuming negligible evapotranspiration fluxes during extreme rain events, other climatic data were not considered. Precipitation data were obtained from the nearest meteorological stations (Figure 2) from the database of Deutscher Wetterdienst (DWD, German Meteorological Service) [28]. Furthermore, design precipitation events with various characteristics were constructed from the KOSTRA-DWD-2010R (Koordinierte Starkniederschlagsregionalisierung und -auswertung des DWD (coordinated heavy precipitation regionalization and evaluation)) dataset [29]. Distributed via gridded (resolution of approximately 67 km 2 ) rasters for the whole of Germany, this dataset consists of pre-calculated design heavy precipitation via block-rain (rectangular hyetograph). The user can choose specific values of precipitation sums (in mm) and intensities (l s −1 ha −1 ) depending on different event durations (5 min to 72 h) and RPs (1-100 years).

Model Build-Up
The initial SWSS data on pipes, manholes, and outlets were checked for apparent errors-i.e., misconnections, typos in diameters, and manhole position errors-and all the gully-pots found within the field survey were added (with an average depth of 1 m) and routed to the associated manholes. The fixed parameters for inlets and junctions include shape, elevation, and depth; those for pipes include shape, length, and diameter. Since there is no option in the SWMM to select variable node diameter, the nodal surface area of 0.44 m 2 was established as a weighted mean of the standard manhole and gully-pot chamber diameters installed in the studied SWSS. The application of a weighted rather than simple mean is explained by the different numbers of gully-pots (69) and manholes (91); thus, taking a simple mean value will give a lower value for the effective diameter. It is important to estimate this parameter with proper justification, since it has a direct effect on the inundation characteristics because it defines the nodal volume, which acts as a buffer before SWSS overloads.
Afterward, the sub-catchments of the three main presented land classes (streets/pavements, roofs, and vegetation) were delineated according to the DTM and satellite images and assigned to the SWSS inlets. The fixed parameters for the sub-catchments included area, mean width, terrain slope, imperviousness (80-100% for streets/pavements, 100% for roofs, 20% for vegetation), and infiltration parameters (capillary suction head: 100 mm; saturated conductivity: 10.9 mm h −1 ; the difference between porosity and initial moisture content: 10%) for the mean soil type in the area (sandy loam) used in Green-Ampt method [31]. The area-related characteristics were calculated in QGIS [32] ("Field calculator" and "Terrain analysis" tools), the permeability was assigned according to satellite images [25,26], and the field survey, and the soil hydraulic properties were taken from the SWMM recommendations [33].
The general SWMM parameters (different from defaults) included rainfall/runoff and flow routing modules, the dynamic wave routing model, a routing step of 1 sec, and a head convergence tolerance of 1 mm.
The SWSS and catchments (with necessary characteristics and parameters) were prepared with QGIS 3.4.15 [32]. The final transfer from "shp" files to "inp" (SWMM data format) was done using inpPINS [34]. An overview of the model is presented in Figure 3.

Model Calibration
Two major approaches are commonly used in the calibration of the hydrological modelcontinuous and (multi-)event matching. Since the study investigates modeling storm events and the overloading of the SWSS, a calibration with short heavy rain events instead of long time-series with dry weather or regular stormwater flow is more likely to provide more accurate results. This multievent approach is quite popular in practice for the same kind of studies and used by various researchers using different hydrological and hydraulic models [35][36][37][38]. It was already successfully applied in the nearby sub-catchment for studies of low impact development practices [39].
Depending on the availability of observation data, the model can be calibrated using various techniques and with different performance indicators. In general, there are many parameters available for validation in the SWMM: sub-catchment runoff, node/pipe water depth, pipe/node inlet/outlet discharge, and node flooding.
Conventionally, urban sewer system models are calibrated using the outlet pipe discharge (i.e., outfall to a local open channel or inlet to a wastewater treatment plant), hence by hydrograph comparison. Thus, to evaluate the goodness of the model results, several well-known performance criteria can be used ( Table 2): Nash-Sutcliffe Efficiency [40], Kling-Gupta Efficiency [41], and peak and volume errors.

Model Calibration
Two major approaches are commonly used in the calibration of the hydrological model-continuous and (multi-)event matching. Since the study investigates modeling storm events and the overloading of the SWSS, a calibration with short heavy rain events instead of long time-series with dry weather or regular stormwater flow is more likely to provide more accurate results. This multi-event approach is quite popular in practice for the same kind of studies and used by various researchers using different hydrological and hydraulic models [35][36][37][38]. It was already successfully applied in the nearby sub-catchment for studies of low impact development practices [39].
Depending on the availability of observation data, the model can be calibrated using various techniques and with different performance indicators. In general, there are many parameters available for validation in the SWMM: sub-catchment runoff, node/pipe water depth, pipe/node inlet/outlet discharge, and node flooding.
Conventionally, urban sewer system models are calibrated using the outlet pipe discharge (i.e., outfall to a local open channel or inlet to a wastewater treatment plant), hence by hydrograph comparison. Thus, to evaluate the goodness of the model results, several well-known performance criteria can be used ( Table 2): Nash-Sutcliffe Efficiency [40], Kling-Gupta Efficiency [41], and peak and volume errors.
is the Pearson correlation coefficient between the simulated and observed flow, β is the ratio between the mean simulated and mean observed flow, α is the ratio between the simulated and observed flow variance Peak errors (maximum discharge value and time) [−∞, +∞] PE = 0-a perfect match between modeled and observed event peak Event volume error [−∞, +∞] VE = 0-a perfect match betweem modeled and observed event volume The calibration procedure could be performed in automatic (by the definition of the cost function and iterations using preset boundary values for the chosen calibration parameters) or manual (changing parameters "by hand") mode. The selected mode depends on the model complexity and the number of changing parameters. For the particular case-study, manual mode was chosen due to the event-based calibration and the use of only a few parameters (pipe and surface roughness, contribution areas, and depression storages).

Overloading of the Stormwater Sewage System in SWMM
In the present study, an urban flood in an SWSS is considered as a process of drainage system overloading caused by heavy rain, which leads to the flooding of the surface. In the SWMM, this phenomenon is modeled as follows. Each non-outfall node is assigned a maximum allowable pressure head H max . It consists of a maximum free water surface elevation (i.e., manhole depth) plus an optional "surcharge" depth that allows for an additional pressure head (i.e., if a manhole has a solid cover without openings). Typically, when the new head H new at a node is computed by the iterative solution process, and it exceeds H max , it is set equal to H max , and the node becomes flooded. Then, the overflow rate Q ovfl is calculated as an average net flow rate (inflow−outflow) of the node over the current time step [27].
Therefore, the flooding parameters are available in the model summary for each flooding node: total flooded time, maximum flooding discharge, time of maximum flooding, and total flooding volume.
A critical remark to add is that once flooding occurs, afterward, this flooding volume is then lost from the system, as the SWMM does not allow overloading water routing.

Statistical Post-Processing of the Results
Two statistical approaches were used to study the relationship between precipitation and the SWSS overloading parameters: pairwise correlation and MLR.
The classical Pearson correlation test used as the first approach is a measure of the linear correlation between two variables.
MLR is a relatively powerful yet simple tool to increase the correlation coefficient of the dependent variable using non-clear individual relationships with predictors. The general formula for the MLR function is as follows: Hydrology 2020, 7, 35 where y is the dependent variable, x i is the predictor, a is the intercept, b i is the slope, and n is the number of predictors. This approach was used to find a better relationship between precipitation and the SWSS overloading parameters. However, to fit the MLR function properly, the data and the obtained regression should fit the relatively strict statistical assumptions [42] listed in Table 3. Table 3. Multi-linear model assumption testing.

Assumptions Test
Linear relationship and independent predictors Scatter plot, correlation matrix Symmetrical (normal) distribution Histogram ( Conventional statistical tests are used to test for the normality of the data (Shapiro-Wilk [43]), the autocorrelation of residuals (Durbin-Watson [44]), and the homoscedasticity of the variance (Breusch-Pegan [45]). The general interpretation of the tests comes down to a comparison of the test statistics' p-values to a conventional threshold value of 0.05 [46] to decide on the rejection (p < 0.05) or non-rejection (p > 0.05) of the test null hypothesis. The following null hypotheses denote the chosen tests (Shapiro-Wilk, Durbin-Watson, and Breusch-Pegan, respectively): a sample comes from a normally distributed population; there is no autocorrelation (at a lag of 1) in the residuals (prediction errors) according to a regression analysis; the variance of the errors according to a linear model is homoscedastic.
Additionally, data transformation methods were used to convert non-normally distributed data to an approximately normal distribution. The simple reason for this is that in fact, after transformation, one is able to "recycle" the knowledge about the analysis of the normally distributed data, so that one can apply standard analysis methods. An outstandingly important class of transformations is powers and logarithms. The so-called Box-Cox transformation is applied to determine an optimal transformation for this class [47]. The original formula looks like this: where: GM(x)-the geometric mean of the sample and λ-maximum of the log-likelihood function (assuming the transformed observations come from a normal distribution with mean µ and standard deviation σ, the log-likelihood function looks as follows log[L(λ, µ, σ)]). In this study, a simplified version of Box-Cox transformation was used, with the formula x i = x λ i and λ rounded to closest value with a 0.5 step (which, however, still remained within the 95% confidence interval).
Statistical analysis was conducted within the R environment [48] using standard and the following custom libraries: "lmtest" [49] and "MASS" [50].

Model Calibration
For the calibration of the model, almost two years (01.2017-10.2018) of irregular discharge measurements in the outflow to Lockwitzbach creek were available at a 1 min resolution (taken and provided by the Urban Water Management Department of TU Dresden). For the model forcing, meteorological input data of the closest station (Lockwitz, 1.6 km from the catchment) with a 5 min resolution were used. As the goal was to calibrate the model using the event-based principle specifically for high-flow conditions, available discharge time series were screened, and five flood events were extracted. To gain more accurate calibration parameters and because of the already too-small sample size, the authors decided to sacrifice the normal procedure of splitting data into calibration-validation sets and instead used all the events for calibration purposes. This trade-off indeed weakens the reliability calibration procedure and needs to be elaborated in possible future studies when longer observation time series will be available.
The calibration was performed manually, and the chosen fitting criteria were the following: peak discharge, time to peak, and total volume. The first two were met by adjusting the roughness of the conduits (0.0155 m 1/3 s −1 , accounting for the pipe material and conduits' current conditions) and catchments (0.016 m 1/3 s −1 for impervious and 0.03 m 1/3 s −1 for the pervious terrain). The recommended range of Manning's coefficients for steel and plastic pipes (whose mixture forms the studied SWSS) is 0.009-0.19 m 1/3 s −1 [27,51]. The ranges of the coefficients for the surface roughness were taken from [52]: 0.01-0.023 for impervious (smooth and rough asphalt, tar, and concrete) and 0.017-0.06 m 1/3 s −1 (smooth and rough packed soil, gravel, grass, and residential land use). Additionally, catchment depression storage parameters were used in the calibration: percentage of impervious area with no depression storage (50%) and depth of depression storage on pervious/impervious area (for vegetation: 5/1 mm; for streets: 5/2.6 mm; and 1.27 mm for roofs). Various estimations and formulas [33] suggest the following ranges for depression storage: in general, values of 0.1-11 mm are mentioned; specifically, with empirical formulas, 0.1-2.8 mm is suggested, and with case study estimations, -1.2 to 2.5 or 5 to 11 mm for impervious or pervious areas are suggested, respectively. Hence, parameters exhibit strong variation from study to study, and extensive preliminary work to obtain accurate estimates or to fit them within the exact the ranges of empirical formula outcomes or other researcher's values is unnecessary. Moreover, this parameter was stated as being sensible only for low-depth short storms [33]. Fitting the flood volume could be adjusted by rearranging the catchment boundaries, adding missing or deleting areas that are clearly not contributing to SWSS runoff. This can be achieved with the help of satellite images and consideration of the impressions from an on-site survey. In the study case, mainly pervious areas and roofs were deleted. Figure 4 shows the comparison of the measurements and the calibrated model output for the five events and the corresponding precipitation, and Table 4 presents the results for the performance criteria.
Hydrology 2020, 7, x FOR PEER REVIEW 9 of 26 For the calibration of the model, almost two years (01.2017-10.2018) of irregular discharge measurements in the outflow to Lockwitzbach creek were available at a 1 min resolution (taken and provided by the Urban Water Management Department of TU Dresden). For the model forcing, meteorological input data of the closest station (Lockwitz, 1.6 km from the catchment) with a 5 min resolution were used. As the goal was to calibrate the model using the event-based principle specifically for high-flow conditions, available discharge time series were screened, and five flood events were extracted. To gain more accurate calibration parameters and because of the already toosmall sample size, the authors decided to sacrifice the normal procedure of splitting data into calibration-validation sets and instead used all the events for calibration purposes. This trade-off indeed weakens the reliability calibration procedure and needs to be elaborated in possible future studies when longer observation time series will be available.
The calibration was performed manually, and the chosen fitting criteria were the following: peak discharge, time to peak, and total volume. The first two were met by adjusting the roughness of the conduits (0.0155 m 1/3 s −1 , accounting for the pipe material and conduits' current conditions) and catchments (0.016 m 1/3 s −1 for impervious and 0.03 m 1/3 s −1 for the pervious terrain). The recommended range of Manning's coefficients for steel and plastic pipes (whose mixture forms the studied SWSS) is 0.009-0.19 m 1/3 s −1 [27,51]. The ranges of the coefficients for the surface roughness were taken from [52]: 0.01-0.023 for impervious (smooth and rough asphalt, tar, and concrete) and 0.017-0.06 m 1/3 s −1 (smooth and rough packed soil, gravel, grass, and residential land use). Additionally, catchment depression storage parameters were used in the calibration: percentage of impervious area with no depression storage (50%) and depth of depression storage on pervious/impervious area (for vegetation: 5/1 mm; for streets: 5/2.6 mm; and 1.27 mm for roofs). Various estimations and formulas [33] suggest the following ranges for depression storage: in general, values of 0.1-11 mm are mentioned; specifically, with empirical formulas, 0.1-2.8 mm is suggested, and with case study estimations, -1.2 to 2.5 or 5 to 11 mm for impervious or pervious areas are suggested, respectively. Hence, parameters exhibit strong variation from study to study, and extensive preliminary work to obtain accurate estimates or to fit them within the exact the ranges of empirical formula outcomes or other researcher's values is unnecessary. Moreover, this parameter was stated as being sensible only for low-depth short storms [33]. Fitting the flood volume could be adjusted by rearranging the catchment boundaries, adding missing or deleting areas that are clearly not contributing to SWSS runoff. This can be achieved with the help of satellite images and consideration of the impressions from an on-site survey. In the study case, mainly pervious areas and roofs were deleted. Figure 4 shows the comparison of the measurements and the calibrated model output for the five events and the corresponding precipitation, and Table 4 presents the results for the performance criteria.   A relatively high agreement was achieved for four out of five events. The NSE values for these four events are around 0.5, and the KGE values, around 0.7. Regarding the lower performance of the event dating 22.06.2017, the issue with the low NSE values could be explained by the general problem of time shift for all the events in the range of 3-10 min. Several attempts to adjust timing were conducted, and the only case where the time-shift of the uprising limb was matched related to lowering the conduit roughness to 0.008-0.009 m 1/3 s −1 . However, firstly, these values do not look realistic for a relatively old pipe system (besides enormous velocities of 2-4 ms −1 ), and secondly, they resulted in the shortening of the flood duration almost twice while sharply increasing the maximum flow rate. The most probable background of this effect is the travel time of the rain cell, as there is a distance of 1.6 km between the study area and the meteorological station (resulting in propagation velocities of 3-10 ms −1 , which are reasonable, i.e., for convective cells [53,54]). Thus, the issue was considered as non-crucial for the study, since in this case, it will not affect the characteristics of SWSS overloading. That is why it is important to have a look at the KGE values (whose component decomposition will give a low correlation value but high bias and variability ratio agreement), which are all higher than −0.41 (benchmark of mean flow performance [55]).
The higher and positive volume errors (12-31%) gained for four out of the five events are potentially a bigger problem for the study set-up. However, as can be seen from the graphs, in most cases, these volumetric errors are the sum-up result of too long tails of modeled hydrographs, while the observed ones are much shorter. Nevertheless, as the main intention of the study was inundation research, the main interest lies in the upper part of hydrographs, where a better agreement was achieved. Generally speaking, these deviations could probably be explained by the different resolutions of the discharge and precipitation measurements (i.e., for 22.06.2017 and 18.07.2017, three and two clear peak inflows are observed, respectively, while only one is found in the precipitation measurements). The relative peak errors are much smaller (−9 ± 14%) and potentially result from the generalization of the initial model conditions (as the event-based calibration strategy does not normally imply a warm-up period like in conventional continuous calibration) or simplifications of the surface flow routing in the SWMM. The apparent overestimations and underestimations of the peak discharge and total volume are again the probable results of the different resolutions and heterogeneity of the rain cell, which is a known issue for SWSS modeling [56].

The Model Runs with Various Heavy Precipitation Scenarios
Data from the three nearest meteorological stations (Figure 1) were screened within the available time-period of 1998-2018 [28] to extract 35 heavily observed precipitation events, with a precipitation measurement interval of 1 min (the data from the nearest Lockwitz station are available only with a 5 min resolution and much shorter time period). To get an insight into the sensitivity of the model to different input data types, KOSTRA-DWD-2010R (see Section 2.1) was used and the following design rain events were extracted: RPs of 2, 20, and 100 years and durations of 5, 10, 120, and 2880 years. For each of the rain events, the following characteristics were derived: duration, sum, maximum intensity (I max ), mean intensity (I mean = sum/duration), form (K 1 = maximum intensity/sum), skewness (K 2 = time to peak/duration), and flatness (K 3 = maximum intensity/mean intensity). K 1 , K 2 , and K 3 were added to account for a pluviograph shape: K 1 indicates the steepness of the event (values close to 1 indicate a very steep event, as the maximum intensity and sum are equally big), K 2 marks whether the peak comes at the beginning or the end of the rain event, and K 3 describes how pronounced the peak of the rain is compared to the mean intensity of the event. Table 5 shows a summary of the scenarios, and a full list is presented in Appendix A. A subset of the observed rain events was made to cover a relatively wide range of the possible combinations of precipitation characteristics but, in general, starting from short light rains, which are unable to overload an SWSS (but are necessary to detect the overloading threshold), continuing with long/short events of moderate intensity, and, finally, finishing with long/short heavy precipitation with RPs of 20-100 years. A summary of the SWMM simulation results is presented in Table 6, and a full list with all the model outputs is presented in Appendix B. In case flooding occurred, the maximum flooding time (among all flooded nodes), discharge at the most pronounced junction, and total flooding volume in the whole SWSS were reported. If no flooding was observed, the relative node depth at the most loaded junction is documented (calculated as the ratio between the maximal water depth and total manhole depth).

Detection of "System Overloading" Precipitation Threshold
To find the precipitation threshold for SWSS overloading, a graphical solution is proposed. Using flooding values obtained from the SWMM simulations (i.e., flooding volume) and parameters of rain events (i.e., maximum intensity), the upper part of the graph (Figure 6) is defined. With the intercept of the X-and Y-axis representing the virtual terrain surface of the system (the elevation of the manhole top), the lower part of the graph defines non-surface-flooding conditions by the maximum water depth in the node (or relative nodal loading-for better scaling) during a rain event. Hence, the intercept will give the surface flooding threshold of the SWSS for a specific precipitation parameter.

Detection of "System Overloading" Precipitation Threshold
To find the precipitation threshold for SWSS overloading, a graphical solution is proposed.
Using flooding values obtained from the SWMM simulations (i.e., flooding volume) and parameters of rain events (i.e., maximum intensity), the upper part of the graph (Figure 6) is defined. With the intercept of the X-and Y-axis representing the virtual terrain surface of the system (the elevation of the manhole top), the lower part of the graph defines non-surface-flooding conditions by the maximum water depth in the node (or relative nodal loading-for better scaling) during a rain event. Hence, the intercept will give the surface flooding threshold of the SWSS for a specific precipitation parameter.

Detection of "System Overloading" Precipitation Threshold
To find the precipitation threshold for SWSS overloading, a graphical solution is proposed. Using flooding values obtained from the SWMM simulations (i.e., flooding volume) and parameters of rain events (i.e., maximum intensity), the upper part of the graph (Figure 6) is defined. With the intercept of the X-and Y-axis representing the virtual terrain surface of the system (the elevation of the manhole top), the lower part of the graph defines non-surface-flooding conditions by the maximum water depth in the node (or relative nodal loading-for better scaling) during a rain event. Hence, the intercept will give the surface flooding threshold of the SWSS for a specific precipitation parameter. Figure 6. Overview of the study project data workflow. Figure 6. Overview of the study project data workflow. Table 7 shows the Pearson's correlations calculated for all scenarios and excluding design events. Among all the precipitation parameters tested, a few of them (duration, sum, mean and max intensity, and K 1 ) depicted significant (p < 0.05) negative and positive correlations with inundation parameters. In summary, it could be stated that, in general, rain events with a shorter duration, higher maximum intensity, and total sum will cause higher SWSS flooding time, discharge and total volume, which is in agreement with the findings of other studies [15,16] (partly, since no studies were found to cover correlation with all the used precipitation characteristics). An exclusion of KOSTRA design rain leads to an improvement of the correlation, while block-typed and natural rain events lead to a different response of the SWSS. Since civil engineers usually use design rainfall for planning drainage systems and scientists stick to real observation data, there is no right answer as to which data type is better. However, the authors want to highlight the message that due to the significantly different relationship between the overloading and precipitation parameters, design and observed rain events should not be mixed together, to obtain better results. Table 7. Correlation between precipitation and SWSS-overloading parameters. Picking up the three most promising precipitation parameters (sum, mean, and max intensity), an attempt to find an SWSS-overloading threshold with a graphical approach was conducted (Figure 7). The results indicate a not very good relationship, since even if a linear function can be derived for the upper part of the graph, most often, the points from the bottom (the maximum node loading) have different relationships or/and are indistinct. Therefore, the possible ranges of the interception with the 0-ordinate axis representing the starting of SWSS flooding are relatively big. Nevertheless, for the maximum intensity, it is possible to detect the upper and lower values of the threshold for each flooding parameter (0.2 and 1 mm/min, respectively) and for the event sum and the mean intensity-at least at the upper boundary (60-90 mm and 0.6 mm/min, respectively).

Prediction of Overloading Parameters with Multi-Linear Regression
To increase the predictability of the flooding occurrence and its parameters in particular, a second approach was followed. The MLR was fitted to various combinations of the precipitation parameters to determine whether this method improved the correlation compared to the pairwise approach. Here, observed and designed rain events were not separated and acted as one dataset. The adjusted coefficient of determination (squared correlation coefficient corrected by the number of predictors) was used as an MLR selection performance measure. The following models were tested: one/two with original data (all predictors and reduced number if the fit was better) and two/three with transformed data (all predictors, best predictors with which MLR fits all statistical assumptions, and best predictors without assumption fitting). The reduction of predictors was based on the pairwise correlation analysis and the significance of a specific predictor's coefficient in the summary print of MLR.
At first, the data were tested against the MLR assumptions mentioned in Section 2.5. Approximate linearity could be confirmed with simple scatter plots (i.e., for the exemplary data in

Prediction of Overloading Parameters with Multi-Linear Regression
To increase the predictability of the flooding occurrence and its parameters in particular, a second approach was followed. The MLR was fitted to various combinations of the precipitation parameters to determine whether this method improved the correlation compared to the pairwise approach. Here, observed and designed rain events were not separated and acted as one dataset. The adjusted coefficient of determination (squared correlation coefficient corrected by the number of predictors) was used as an MLR selection performance measure. The following models were tested: one/two with original data (all predictors and reduced number if the fit was better) and two/three with transformed data (all predictors, best predictors with which MLR fits all statistical assumptions, and best predictors without assumption fitting). The reduction of predictors was based on the pairwise correlation analysis and the significance of a specific predictor's coefficient in the summary print of MLR.
At first, the data were tested against the MLR assumptions mentioned in Section 2.5. Approximate linearity could be confirmed with simple scatter plots (i.e., for the exemplary data in Figure 7). The independency of certain predictors within a certain pool is defined by parameters such as duration, sum, and intensity (which are the only ones that are strictly independent); however, a pair like sum and K 3 are also valid. The results of the normality test and histograms for the direct model output showed a skewness of the data distribution. Thus, a transformation was necessary. Table 8 shows the result of the Shapiro-Wilk test before and after Box-Cox transformation; additionally, the transformation parameters are depicted. It could be stated that for all the parameters (except duration), logarithmic and power (square root) transformation worked, and approximate normality was achieved. Other assumptions (normality and the absence of the autocorrelation of the model residuals as well as the homoscedasticity of the variance) had already been tested after the model fitting employing the Shapiro-Wilk, Durbin-Watson, and Breusch-Pagan tests and MLR diagnostic plots. A summary of the MLR model fit results and list of the used predictors are shown in Table 9, and the adjusted coefficients of determination and p-values of the applied tests are presented in Appendix C. While there were no problems with the residual's autocorrelation and homoscedasticity for any of the models, non-transformed data exhibited problems with normality. Finally, only the model with independent predictors and transformed data fitted all the assumptions. Overall, the coefficients of determination varied from 0.70 to 0.92, with better performance for the maximum overloading discharge and slightly worse performance for the total overloading volume. Total overloading volume for SWSS (10 6 l) All yes X X X V V 4 Duration + I max + I mean + K 1 yes Max nodal overloading flow rate (l/s) A graphical representation of the performance for all the tested MLRs and all three overloading parameters is presented with the scatter plots of those modeled versus those predicted with MLR values (Figure 8). It can be observed that none of the derived functions delivers a perfect match. In fact, there is a tendency to detect flooding when it was not found in the SWMM output and, vice versa, to underestimate the flooding parameters at high ranges. As expected from Table 9, the maximum nodal flow rate has much better predictability, while for the flooding time, it seems to be very tricky to achieve good results. Finally, it is clear that the obtained MLR works much better for the observed rather than for the designed rain events, especially at high ranges. Transforming the coefficient of determination to Pearson's correlation, one can finally determine whether MLR outperformed the pairwise correlation. Using only the MLRs that fit all the statistical assumptions, it can be stated that MLR could considerably improve the prediction of the flooding time and volume (Table 10) by approximately 20-40%. Transforming the coefficient of determination to Pearson's correlation, one can finally determine whether MLR outperformed the pairwise correlation. Using only the MLRs that fit all the statistical assumptions, it can be stated that MLR could considerably improve the prediction of the flooding time and volume (Table 10) by approximately 20-40%.

Influence of the Precipitation Resolution
To test the sensitivity of the model to overload depending on the input data quality, all the scenarios with observed precipitation were rerun with the up-scaled resolution of 5 and 10 minutes. The results indicate that an increase in the precipitation aggregation period yielded a decrease in the overloading volume and maximum flow rate and an increase in the maximum flooding time (Figure 9). However, ANOVA tests showed that the changes are significant (p < 0.0.5) only for the maximum flow rate. Additionally, simulation errors were compared: the surface runoff errors decreased significantly with an increase in resolution, while the flow routing errors remained almost at the same level. Overall, this means that even in the absence of highly resolved precipitation measurements, the modeling of SWSS overloading could give good results. These results agree with other studies, which show that precipitation aggregations do not lead to the same noticeable degree of changes in SWSS flow [57].
Total overloading volume for SWSS (10 6 l) 0.68 0.67 0.83 0.83 * One value is given for both cases as both MLRs use K2 or K3, which cannot be derived for design rain.

Influence of the Precipitation Resolution
To test the sensitivity of the model to overload depending on the input data quality, all the scenarios with observed precipitation were rerun with the up-scaled resolution of 5 and 10 minutes. The results indicate that an increase in the precipitation aggregation period yielded a decrease in the overloading volume and maximum flow rate and an increase in the maximum flooding time ( Figure  9). However, ANOVA tests showed that the changes are significant (p < 0.0.5) only for the maximum flow rate. Additionally, simulation errors were compared: the surface runoff errors decreased significantly with an increase in resolution, while the flow routing errors remained almost at the same level. Overall, this means that even in the absence of highly resolved precipitation measurements, the modeling of SWSS overloading could give good results. These results agree with other studies, which show that precipitation aggregations do not lead to the same noticeable degree of changes in SWSS flow [57].

Conclusions and Outlook
The presented study gives novel insights into the behavior of stormwater sewage systems in the urban areas concerning their overloading due to heavy rain events, using a numerical modeling approach and statistical post-processing of the results. Using a case study of a small catchment in Germany, an SWMM model was set up and calibrated. Several scenarios with observed and designed precipitation events were tested to study the correlation between rain characteristics and overloading to detect the flooding of the system and predict its parameters. In our opinion, the goals of this study

Conclusions and Outlook
The presented study gives novel insights into the behavior of stormwater sewage systems in the urban areas concerning their overloading due to heavy rain events, using a numerical modeling approach and statistical post-processing of the results. Using a case study of a small catchment in Germany, an SWMM model was set up and calibrated. Several scenarios with observed and designed precipitation events were tested to study the correlation between rain characteristics and overloading to detect the flooding of the system and predict its parameters. In our opinion, the goals of this study were reached, and the case study showed that the methods are applicable and most possibly transferable. The following conclusions can be drawn:

•
The prediction of SWSS overloading using rain forecasts with precipitation characteristics and a proposed graphical approach is, in general, possible. However, the relationship between the upper (surface flooding) and the lower (nodal flooding) parts is quite fuzzy for some precipitation parameters. For the studied SWSS, surface flooding most probably will start after rain with around 1/0.6 mm min −1 of maximum/mean intensity and a total event sum of more than 60 mm.

•
The total overloading volume and maximum overloading flow rate showed a higher Pearson's correlation with the maximum rain intensity (R = 0.67 and R = 0.93, respectively), and for the maximum flooding time, the total rain event sum worked better (R = 0.59).

•
MLR with the precipitation characteristics can significantly improve the predictability of the SWSS overloading parameters (with an increase in the Pearson's correlation coefficient up to 50%). This, however, could require additional data manipulations.

•
Observed and designed rain events behave differently in terms of SWSS overloading; thus, the analysis and results should be treated separately.

•
The use of the coarser precipitation resolution leads to a decrease in the SWSS overloading volume and maximum flow and increase in the flooding time (relative changes in median values by approximately 20-30%).
Besides scientific value, the suggested methods can potentially be applied to improve sewer system design for extreme scenarios or to establish simple early warning systems for SWSS inundation based on rain forecasts (without comprehensive operational modeling frameworks) and thus provide a quick response to support preparations for flooding and the mitigation of its consequences.
Finally, the authors would like to highlight possible future steps for extending the presented study in the field of extreme precipitation and the prediction of SWSS overloading:

•
Testing the approach on different catchments, and the extension of the event sample size to obtain more robust statistics; Funding: Open Access Funding by the Publication Fund of the TU Dresden.

Conflicts of Interest:
The authors declare that they have no conflict of interest.
Data Availability: The calibrated model, raw data on the SWSS, discharge measurements, DTM, and prepared heavy precipitation dataset used in the presented study can be provided upon request. The raw climatological data can be obtained from the DWD server [28]. The rasters with design heavy precipitation can be obtained from the KOSTRA-DWD-2010R server [29].
Appendix A