Long-Term Modelling of an Agricultural and Urban River Catchment with SWMM Upgraded by the Evapotranspiration Model UrbanEVA

Evapotranspiration (ET) has a decisive effect on groundwater recharge and thus also affects the base flow of the receiving water. This applies above all to low-lying areas with a low depth to groundwater (GW), as is often the case in the north German lowlands. In order to analyze this relation, a coupled rainfall-runoff and hydraulic stream model was set up using the software SWMM-UrbanEVA, a version of the software SWMM that was upgraded by a detailed ET module. A corresponding model was set up for the same site but with the conventional software SWMM to compare the water balance and hydrographs. The total amount of ET calculated with the SWMM software is 7% higher than that computed with the upgraded version in the period considered. Therefore, less water is available for soil infiltration and lateral groundwater flow to the stream. This generally leads to a slight underestimation of base flows, with the exception of a notably wet summer month when the base flows were highly overestimated. Nevertheless, the base flow hydrograph shows a good adaptation to observed values (MAE = 0.014 m3s−1, R = 0.88, NSE = 0.81) but gives worse results compared to SWMM-UrbanEVA. The latter is very well able to reflect the GW-fed base flow in the sample stream in average (MAE = 0.011 m3s−1) and in its dynamics (R = 0.93, NSE = 0.85). By applying the UrbanEVA upgrade, SWMM is applicable to model the seasonal dynamics of near-natural river basins.


Introduction
The storm water management model (SWMM) was originally developed for the simulation and evaluation of storm runoff and sewer hydraulics in urban areas [1]. Since the introduction of SWMM in 1971, it has been continuously improved and thus opens up more and more fields of application. One of the biggest advantages of SWMM is that it combines both a hydrological rainfall-runoff model with a hydrodynamic drainage model in one software. This makes the numerical calculation very effective and stable, as no external coupling is necessary. Besides the modelling of urban infrastructure, like pipes, pumps, flow dividers and storage units, etc., it also allows the modelling of open streams with natural cross sections. Furthermore, the rainfall runoff model enables reliable groundwater simulation, as it uses physical parameters to calculate storage and discharge. Comparable models, e.g., Hec-HMS, only represent the aquifer in a simplified way [2]. As versatile as SWMM is, it has one drawback. This consists in the fact that the latest SWMM version (5.1.015) calculates purely physical actual evaporation from different environmental compartments but does not account for the biological transpiration of plants. When simulating a single rain event, this is not relevant, since ET practically does not take place and therefore has no significant impact on discharges. Considering a long-term

Study Area and Monitoring Station
The study area is the catchment of the Schmarler stream system, located in the north-eastern German lowlands. It is part of the city of Rostock and its rural surroundings (see Figure 1). The stream network altogether is 35 km long, consisting of open segments, pipes, and culverts. The main stream discharges into the river Warnow, which drains into the Baltic Sea. In total, 65% of the 23-km 2 catchment area is occupied by vegetation, while approximately 34% of the area is partial impervious, which is due to urban use (residential area, traffic area, and industry/trade; see Table 1). The annual average precipitation is 600-700 mm, of which 350-575 mm evapotranspirates depending on the type of land use and water availability [14]. The topography is relatively flat, ranging from 29 to 0 m above mean sea level. In most parts of the area, the depth to GW is relatively low (<5 m), which causes a significant seasonal component of base flow in the river system.
In the area of the Schmarler stream station 2 + 400, a monitoring station for continuous measurement of water level and flow velocity was installed (see Figure 1) in order to calculate flow rates from the two parameters. To accomplish this, an ultrasonic doppler flow meter was used. Since the device only measures flow velocity in the central lamella, a calibration function based on regular comparative manual multi-point measurements was set-up to obtain the mean flow velocity of the complete cross section. Figure 2 shows the corrected data at the monitoring station "Autobahn" (AB). Due to random errors, the noise of the data is quite high. Therefore, the time series was smoothed using the 3-h moving median. The latter was processed to separate the base flows, based on the sliding interval method [15] by forming the minimum of 3-day periods, respectively ( Figure 3).  In the area of the Schmarler stream station 2 + 400, a monitoring station for continuous measurement of water level and flow velocity was installed (see Figure 1) in order to calculate flow rates from the two parameters. To accomplish this, an ultrasonic doppler flow meter was used. Since the device only measures flow velocity in the central lamella, a calibration function based on regular comparative manual multi-point measurements was set-up to obtain the mean flow velocity of the complete cross section. Figure 2 shows the corrected data at the monitoring station "Autobahn" (AB). Due to random errors, the noise of the data is quite high. Therefore, the time series was smoothed using the 3-h moving median. The latter was processed to separate the base flows, based on the sliding interval method [15] by forming the minimum of 3-day periods, respectively ( Figure 3).

SWMM
The Storm Water Management Model SWMM is an open source software used for the simulation of the surface/subsurface runoff from primarily urban areas. It was developed by the United States Environmental Protection Agency (EPA) and combines a hydrological rainfall-runoff model with a hydrodynamic drainage model. The former traditionally consists of four compartments: (1) the atmosphere, (2) the land surface, (3) the sub-surface, and (4) the GW compartment. The rainfall-runoff model is semi-distributed and operates on a number of subcatchments consisting of permeable and impermeable sub-areas. Furthermore, SWMM's version 5 introduces the low-impact development controls (LIDs), such as a bioretention cell or a green roof, used to calculate runoff, storage, and infiltration in a more detailed way [1,16]. Different calculation algorithms are available for the different compartments of the rainfall-runoff model; the ones used in this study are described below.

SWMM
The Storm Water Management Model SWMM is an open source software used for the simulation of the surface/subsurface runoff from primarily urban areas. It was developed by the United States Environmental Protection Agency (EPA) and combines a hydrological rainfall-runoff model with a hydrodynamic drainage model. The former traditionally consists of four compartments: (1) the atmosphere, (2) the land surface, (3) the sub-surface, and (4) the GW compartment. The rainfall-runoff model is semi-distributed and operates on a number of subcatchments consisting of permeable and impermeable sub-areas. Furthermore, SWMM's version 5 introduces the low-impact development controls (LIDs), such as a bioretention cell or a green roof, used to calculate runoff, storage, and infiltration in a more detailed way [1,16]. Different calculation algorithms are available for the different compartments of the rainfallrunoff model; the ones used in this study are described below.
The atmosphere module contains time series for precipitation and ET 0 . The latter can only be set globally for the entire model area while distinct precipitation time series can be defined for every subcatchment. Originally, evaporation can occur for standing water on subcatchment surfaces, for subsurface water in aquifers, for water held in storage units, and for open channel flow [1]. The land surface receives precipitation and generates surface runoff using the nonlinear reservoir routing method. For infiltration calculation in the sub-surface zone of the subcatchments, SWMM offers five optional methods. For this study, the Horton method was chosen. It assumes that the infiltration capacity is high at the beginning of a rain event (maximum infiltration rate) and then decreases exponentially with increasing water content in the soil until a state of equilibrium is reached (minimum infiltration rate). However, for infiltration within the LID, SWMM only offers the Green & Ampt Scheme. In SWMM, the aquifer can lose or transfer water respectively through deep percolation, ET, and lateral GW flow to the drainage network. The height of the water table varies with time depending on the rates of inflow and outflow. Lateral GW flow is represented through a user-defined power function of the changing water table of the aquifer and depth of water in the receiving node of the conveyance system [1]: A threshold set by the user makes it possible to determine from which water level lateral GW flow to the stream network will occur [16,17]. This also enables the simulation of tile drainage.
The hydrodynamic drainage model is subdivided into nodes and conduits and receives its water from the subcatchments (runoff from surface and GW interflow) at defined nodes. The drainage network may consist of open sections, pipes, culverts, and other control structures like pumps and weirs. Gravity flow within the conduit link is calculated using the one-dimensional Saint-Venant equation, allowing different options (kinematic/diffuse/dynamic wave). Here, the dynamic wave option was applied, which enables the simulation of channel storage, backwater effects, and entrance/exit losses [17,18].

SWMM-UrbanEVA
As an upgrade of the original SWMM version, SWMM-UrbanEVA was developed at the University of Applied Sciences Münster (Germany) to calculate ET more precisely. It was designed as part of the well-known LID module ( Figure 4) and contains an approach for calculating the actual ET of vegetated areas based on the potential grass reference ET (ET 0 ). The existing three-layer system of the traditional LID module (surface-soil-storage) serves to model the infiltration and percolation processes depending on soil parameters. It is supplemented by a new vegetation layer in which vegetation-specific properties, like the crop factor (K C ) [13] and the leaf area index (LAI), can be parameterized ( Figure 4).
The actual vegetation-specific ET (E STI,a ) is computed for every time step and comprises the sub processes of interception, transpiration, soil evaporation, and evaporation of free water surfaces: E I,a = actual rate of interception mmh −1 ; E T,a = actual rate of transpiration mmh −1 ; E S,a = actual rate of soil evaporation mmh −1 ; and E W,a = actual rate of free water surface mmh −1 .
The calculation of each component is based on the potential vegetation-specific ET (E STI,p ). The latter is obtained by multiplying Kc with the input ET 0 : E STI,p = vegetation − specific potential ET mmh −1 ; and ET 0 = grass reference evapotranspiration mmh −1 .  The actual vegetation-specific ET ( , ) is computed for every time step and comprises the sub processes of interception, transpiration, soil evaporation, and evaporation of free water surfaces: , = actual rate of interception (mmh ); , = actual rate of transpiration (mmh ); , = actual rate of soil evaporation (mmh ); and , = actual rate of free water surface (mmh ).
The calculation of each component is based on the potential vegetation-specific ET ( , ). The latter is obtained by multiplying Kc with the input ET0: , = vegetation − specific potential ET (mmh ); and = grass reference evapotranspiration (mmh ).
Interception and transpiration are energetically decoupled from soil evaporation via the vegetation-covered fraction (SCF) of the surface [19,20].
The interception height (I) is derived depending on the leaf area index according to the approach of [21]. The LAI reflects the ET-active surface in the annual cycle and determines the maximum storage capacity of the leaf canopy: Interception and transpiration are energetically decoupled from soil evaporation via the vegetationcovered fraction (SCF) of the surface [19,20].
The interception height (I) is derived depending on the leaf area index according to the approach of [21]. The LAI reflects the ET-active surface in the annual cycle and determines the maximum storage capacity of the leaf canopy: [11] recommends S L = 0.29 (mm); LAI doy = LAI corresponding to day of year (-) (here monthly resolution); SCF = vegetation covered fraction (-); and P = precipitation (mm).
Interception occurs only from the wetted part of the leaf and is determined on the one hand by the precipitation height at low precipitation rates and on the other hand by the maximum interception capacity (S max ) at high precipitation rates [21]. The potential interception height of the current time step i (S I ) depends on the previous time step i−1 (S I,i−1 ): Water 2020, 12, 3089 The actual intercept height S I,a is then derived depending on the interception capacity and the interception height of the current time step: To calculate the potential interception rate E I,p , the potential evapotranspiration E STI,p is reduced using the vegetation-covered fraction and the wetted part of the leaf A b [22,23]: in which: The actual interception rate E I,a finally results from the minimum fill level of the interception height and the potential evaporation rate: The part of the precipitation that does not intercept is returned to the input precipitation, which enters the following infiltration and drainage processes: P net = Precipitation subtracted by interception losses (mmh −1 ); and P = input precipitation (mmh −1 ).
Equivalent to the potential interception rate, the potential transpiration rate E T,p is projected onto the vegetation-covered area. In contrast to interception, transpiration takes place from the dry portion of the leaf (1−A b ). Furthermore, the actual interception evaporation rate E I,a is subtracted as an upstream process: Since the process of transpiration is fed from the soil reservoir, the calculation of the actual transpiration rate E T,a is done in dependence of the available soil water: θ = actual moisture content (-); θ WP = moisture content at wilting point(-); D = thickness of soil layer (mm); and ∆t = simulation time step.
Soil evaporation is computed for the uncovered area share (1−SCF), as it is assumed that evaporation underneath vegetation is negligible [24]: E S,p = potential soil evaporation mmh −1 .
Water 2020, 12, 3089 The actual soil evaporation depends on the water content of the soil and is expressed by the relative soil moisture W rel . The latter describes the proportion of available soil water within the range of the usable field capacity: W rel = relative soil moisture (-); and θ FK = moisture content at field capacity (-).
According to [25], the potential evaporation rate is already met before the water content in the soil reaches field capacity. For this reason, the potential evaporation rate is increased by the coefficient e s if the relative soil moisture falls below a certain threshold (see Equations (17) and (18)). According to the recommendations of [20,25], this threshold is set to 0.6: The actual soil evaporation E S,a is finally calculated from the minimum of the available soil water per time step and the potential soil evaporation rate: If ponding occurs, two options can be chosen: In the case that vegetation height is short and completely covered by ponding water, only evaporation from the free water surface occurs while the other ET processes are suspended: h pond = height of ponding surface water (mm).
In the case of a high stand, such as a forest, only soil evaporation stops while interception, transpiration and evaporation of the free water surface will continue.

Model Setup
Both the setup of the rainfall-runoff model and the setup of the stream model was realized on the basis of GIS data. VBA scripts were used to convert them to the text format used by SWMM. Water courses were provided as lines and marked with a hierarchical stream segment index, which enables the assignment of a stream segment to a subcatchment and vice versa. Stream cross sections were derived from a high-resolution digital elevation model (DEM) with a grid size of 0.2 m. Each subcatchment was shaped by intersecting the superficial subcatchments based on DEM analysis and land use polygons ( Figure 5). Soil type maps were used to derive average soil attributes for each subcatchment and LID control while the aquifer was treated as homogeneous sand across the model area but with different fill levels depending on the average ground elevation of the subcatchments. GW flow to the river only occurs if a defined threshold is exceeded by the water table. Here, it is 1.2 m below the average ground height, as in the case of a drainage pipe, which comes on stream when the surrounding soil is saturated. The unsealed area of a subcatchment was simulated using the LID controls, more precisely a bioretention cell with the upgraded ET calculation. Furthermore, an equal model was set up for comparison with the same LID controls but with the conventional evaporation calculation.
Water 2020, 12, x FOR PEER REVIEW 9 of 20 subcatchment was shaped by intersecting the superficial subcatchments based on DEM analysis and land use polygons ( Figure 5). Soil type maps were used to derive average soil attributes for each subcatchment and LID control while the aquifer was treated as homogeneous sand across the model area but with different fill levels depending on the average ground elevation of the subcatchments. GW flow to the river only occurs if a defined threshold is exceeded by the water table. Here, it is 1.2 m below the average ground height, as in the case of a drainage pipe, which comes on stream when the surrounding soil is saturated. The unsealed area of a subcatchment was simulated using the LID controls, more precisely a bioretention cell with the upgraded ET calculation. Furthermore, an equal model was set up for comparison with the same LID controls but with the conventional evaporation calculation. Important additional input parameters for SWMM-UrbanEVA are the crop factor and the leaf area index with its annual cycle ( Figure 6). In general, the largest leaf area indices are recorded in July and June and the smallest in December and January. The coniferous forest shows the smallest decrease in winter. Important additional input parameters for SWMM-UrbanEVA are the crop factor and the leaf area index with its annual cycle ( Figure 6). In general, the largest leaf area indices are recorded in July and June and the smallest in December and January. The coniferous forest shows the smallest decrease in winter.

Calibration and Error Measures
The goal of a model calibration is to determine unknown parameters by adapting the calculation result to observed conditions by varying these parameters. Basically, there are two options: an automated (e.g., Latin Hypercube Sampling) and a manual calibration procedure. In automatic calibration, parameters are adjusted automatically according to a predefined search scheme and numerical measures of the goodness of fit [26]. Within this framework, a large number of parameter combinations must be generated and tested; depending on the number of parameters to be calibrated, several hundred to thousands of simulation runs are necessary. The procedure is often used for either single events or comparatively less complex models with a short runtime (see [26,27]). With long simulation durations (in this study, 3 h for 20 months) and a large number of model parameters, automatic calibration is only of limited use. Especially for physically based models, a specific manual calibration can ensure that physically reasonable parameter combinations and value ranges are maintained. This is especially important if the detected parameter combinations are to be transferred to further similar but unobserved areas. Therefore, the manual calibration approach was chosen. In a first step, the sensitivities of all model parameters were tested with regard to their effect on stream flow. This was done by a graphical evaluation of the flow hydrographs. In a next step, the detected sensitive parameters were modified so that the difference between the calculated values from the measured values became minimal. In the process, only one model parameter per simulation run was varied within its plausible limits. The goodness of fit of the model was assessed using the error measures and performance criteria in Table 2.

Calibration and Error Measures
The goal of a model calibration is to determine unknown parameters by adapting the calculation result to observed conditions by varying these parameters. Basically, there are two options: an automated (e.g., Latin Hypercube Sampling) and a manual calibration procedure. In automatic calibration, parameters are adjusted automatically according to a predefined search scheme and numerical measures of the goodness of fit [26]. Within this framework, a large number of parameter combinations must be generated and tested; depending on the number of parameters to be calibrated, several hundred to thousands of simulation runs are necessary. The procedure is often used for either single events or comparatively less complex models with a short runtime (see [26,27]). With long simulation durations (in this study, ~3 h for 20 months) and a large number of model parameters, automatic calibration is only of limited use. Especially for physically based models, a specific manual calibration can ensure that physically reasonable parameter combinations and value ranges are maintained. This is especially important if the detected parameter combinations are to be transferred to further similar but unobserved areas. Therefore, the manual calibration approach was chosen. In a first step, the sensitivities of all model parameters were tested with regard to their effect on stream flow. This was done by a graphical evaluation of the flow hydrographs. In a next step, the detected sensitive parameters were modified so that the difference between the calculated values from the measured values became minimal. In the process, only one model parameter per simulation run was varied within its plausible limits. The goodness of fit of the model was assessed using the error measures and performance criteria in Table 2.
Nash Sutcliffe Efficiency Q calc = calculated flow, Q obs = observed flow; obs = measured value (observed); calc = calculated value; Indices: i = location, t = time, n = number of measurement data.
The volume error quantifies the total deviation in the period under consideration. The MAE indicates to what extent the simulated values deviate on average from the measured values while having the same units as the model output. R and NSE allow statements to be made about the adaption of the dynamics of the simulated flows, whereby the former tests the linear correlation and the latter allows an assessment of how well the simulated values agree with the measured ones. The disadvantage of the NSE is that it is very sensitive to outliers due to the squared differences between measured and simulated values [28]. The following table (Table 3) provides a classification to assess the goodness of fit for R and NSE.

Sensitivity Analyses
The following table shows the parameters with the highest sensitivity to stream flow and the effects of parameter changes. Parameters not listed in Table 4, such as soil properties or the degree of sealing, have a sensitive effect on stream flow, too. However, since they were derived from soil maps or satellite data, respectively, they vary widely in space and were considered to be fixed in the process of calibration.
Initial values for the vegetation coefficients were determined according to [13]. Since they depend on local climate conditions [31], they were slightly varied and adjusted in the calibration process to achieve the best simulation results ( Table 5).   The parameters under the heading "subcatchment characteristics" influence the peak height of the direct runoff while the other parameters have a direct or indirect effect on the GW level and thus on the GW inflows to the stream. Here, the role of ET under the heading "LID Controls" in Table 4 should be emphasized, as it affects the amount of deep percolating water. In particular, the LAI monthly coefficients together with the average LAI enable an intensified ET in summer (June + July, see Figure 6) controlled by the LAI and a reduced ET in winter as well as a transition phase in spring and autumn, respectively. The crop factor Kc is highly sensitive. Kc > 1 increases and Kc < 1 decreases ET compared to the input grass reference ET. LAI and Kc form the key parameters for influencing the annual cycle of base flows in the stream. Figure 7 shows the simulated hydrographs with SWMM and SWMM-UrbanEVA and the observed stream flows at the monitoring station. Accordingly, Table 6 expresses the model fit in numbers by error measures and performance criteria.

Calibration Results
The measured and as well the simulated graphs show that the stream flow is composed of direct runoff peak flows and a base flow component. The direct runoff peak discharges strongly depend on the intensity of the input precipitation, which can vary locally, especially during heavy rainfall events. Therefore, it is important to mention that the input precipitation was not measured directly in the model area but about 8 km south of it. However, in contrast to individual rain events, it is assumed that the long-time sums of precipitation in the model area are the same as at the measuring station. Figure 7 shows the simulated hydrographs with SWMM and SWMM-UrbanEVA and the observed stream flows at the monitoring station. Accordingly, Table 6 expresses the model fit in numbers by error measures and performance criteria.  The volume error of the total stream flow calculated with SWMM is quite low but decreases even more with SWMM-UrbanEVA. The same applies to the MAE. The R and NSE of the SWMM-UrbanEVA results indicate a good fit; in contrast to this, the total results obtained from the original SWMM version are worse, which is mainly reflected in the volume error and NSE. It should again be noted that the performance criteria are sensitive to non-matching peak flows, for example, if it has rained over the precipitation measurement station but not in the model area itself or vice versa. Since direct runoff peak flows are not significantly influenced by ET, they will not be considered further in this study. However, a model comparison based on error measures and performance criteria does not go far enough. Since evaporation has different effects in different seasons, a process-oriented consideration is necessary.

Water Balance
The following table (Table 7) lists important parameters of the water balance in the period from 1 December 2015 to 31 July 2017 (20 months). Continuity errors are smaller than 1%. The initial conditions and storage levels at the beginning of the simulation are the same for the SWMM and the SWMM-UrbanEVA model. The water balance variables of the subcatchments are summarized under the heading "Runoff Quantity". It includes both the runoff of sealed areas of the subcatchments and those of the permeable areas. The latter are modelled as LID modules (bioretention cells). The direct surface runoff (95 mm in both cases) primarily comes from paved areas, since hardly any surface runoff is formed within the LID modules (<1 mm in both cases). It can be seen that the ET in SWMM-UrbanEVA is smaller than that calculated with SWMM in the period of time selected, hence the proportion of water that is passed on to the GW zone ("Infiltration Loss") is larger. Therefore, more water is available to the GW flow, which is ultimately reflected in the GW inflow to the stream ("GW Inflow"). The latter makes up the largest volume share in the stream overall. Looking at the distribution of evaporation and infiltration loss within the LID modules, important differences between the two models and their different land use classes become apparent (Figure 8). With the official SWMM version, there are hardly any differences between the land use classes; the slight differences are only due to different soil types and different water availability depending on the placement in the terrain. The evaporation loss is about 900 mm while the infiltration loss is slightly more than 200 mm. In contrast, ET in SWMM-UrbanEVA varies from around 440 (water surfaces) to 990 (agricultural areas) and infiltration from 140 to 690 depending on the land use class. In addition, a strong linear relationship between ET and percolation to the GW zone can be determined in the model area, which is suggested by the correlation coefficient of 0.99. This again underlines the influence of ET on the percolation to GW and the resulting base flow component in the river. Looking at the distribution of evaporation and infiltration loss within the LID modules, important differences between the two models and their different land use classes become apparent ( Figure 8). With the official SWMM version, there are hardly any differences between the land use classes; the slight differences are only due to different soil types and different water availability depending on the placement in the terrain. The evaporation loss is about 900 mm while the infiltration loss is slightly more than 200 mm. In contrast, ET in SWMM-UrbanEVA varies from around 440 (water surfaces) to 990 (agricultural areas) and infiltration from 140 to 690 depending on the land use class. In addition, a strong linear relationship between ET and percolation to the GW zone can be determined in the model area, which is suggested by the correlation coefficient of 0.99. This again underlines the influence of ET on the percolation to GW and the resulting base flow component in the river.

Groundwater Table and Groundwater Flows to Stream
In the considered model area, GW inflow plays a decisive role with respect to the stream flow. These in turn depend on the GW level, more precisely on the difference between the GW level and the bottom of the receiving node of the hydraulic system, and on the values for A1 (constant) and B1 (exponent) used in the GW power function. The GW hydrographs (Figure 9) are now to be examined more closely using the example of an agricultural area and a mixed forest area.

Groundwater Table and Groundwater Flows to Stream
In the considered model area, GW inflow plays a decisive role with respect to the stream flow. These in turn depend on the GW level, more precisely on the difference between the GW level and the bottom of the receiving node of the hydraulic system, and on the values for A1 (constant) and B1 (exponent) used in the GW power function. The GW hydrographs (Figure 9) are now to be examined more closely using the example of an agricultural area and a mixed forest area. Figure 9b,c refer to an agricultural subcatchment with a crop factor of 1.5 (highest value) and a drainage threshold height of 5.89 m. In this case, the crop factor in SWMM-UrbanEVA increases the evaporation rates fundamentally and the leaf area index in turn decreases ET in winter and increases it in summer (especially in June/July). The effect under the arable land is that the GW level simulated with SWMM-UrbanEVA is slightly lower in winter and is at the threshold in a "normal" summer, such as 2016, so that no GW flow occurs. However, a significant difference between the two software versions is noticeable in wet July 2017. Due to the abundant rainfall, the GW level and flow simulated by SWMM is higher than even in winter. In SWMM-UrbanEVA, on the other hand, the precipitation is simply "swallowed" by ET so that it is not reflected in the GW level or flow.
The situation is different under a mixed forest with a crop factor of 0.8 (Figure 9d,e). Here, the GW levels and flows generated by SWMM-UrbanEVA are higher in January and February and almost identical in spring and summer-except for July 2017. In July 2017, the GW level and flow in SWMM-UrbanEVA rises already at the beginning of July and not only from the middle of the month as it is the case in the SWMM model. For the land use "forest", the same A1 and B1 parameters were applied as under an agricultural area. It was assumed that the forest is not crossed by drainage pipes but crossed by drainage ditches. However, the very high GW peak flows suggest that a more differentiated land-use-oriented approach to the parameters A1 and B1 would have been useful to reduce the GW flow dynamics under this forest land use class.  Figure 9b,c refer to an agricultural subcatchment with a crop factor of 1.5 (highest value) and a drainage threshold height of 5.89 m. In this case, the crop factor in SWMM-UrbanEVA increases the evaporation rates fundamentally and the leaf area index in turn decreases ET in winter and increases it in summer (especially in June/July). The effect under the arable land is that the GW level simulated with SWMM-UrbanEVA is slightly lower in winter and is at the threshold in a "normal" summer, such as 2016, so that no GW flow occurs. However, a significant difference between the two software versions is noticeable in wet July 2017. Due to the abundant rainfall, the GW level and flow simulated by SWMM is higher than even in winter. In SWMM-UrbanEVA, on the other hand, the precipitation is simply "swallowed" by ET so that it is not reflected in the GW level or flow.

Base Flow Separation
The situation is different under a mixed forest with a crop factor of 0.8 (Figure 9d,e). Here, the GW levels and flows generated by SWMM-UrbanEVA are higher in January and February and almost identical in spring and summer-except for July 2017. In July 2017, the GW level and flow in SWMM-UrbanEVA rises already at the beginning of July and not only from the middle of the month as it is the case in the SWMM model. For the land use "forest", the same A1 and B1 parameters were applied as under an agricultural area. It was assumed that the forest is not crossed by drainage pipes but crossed by drainage ditches. However, the very high GW peak flows suggest that a more differentiated land-use-oriented approach to the parameters A1 and B1 would have been useful to reduce the GW flow dynamics under this forest land use class.

Base Flow Separation
The more water evapotranspirates, the less is available for deep seepage and GW inflow to the stream. To show the differences in base flows, a hydrograph separation was performed based on a 3-day sliding interval. Figure 10 illustrates the separated base flow hydrographs of the measured, the SWMM, and the SWMM-UrbanEVA stream flow at the observation point AB. Phase of high base flows; SWMM-UrbanEVA base flows are higher than the observed ones while those generated with SWMM fit well. Here, the discrepancy can be explained by two possible reasons: According to [13], the crop factor is not constant but changes in dependence of three developmental stages with specific water demands: an initial start-up phase, an intermediate phase in which the highest crop factors (or crop coefficients) are recorded, and a final phase in which the factor decreases again. If this could be taken into account in SWMM-UrbanEVA, it would be possible to calibrate the model, especially the aquifer, differently to achieve an even better adaptation. Another reason could be incorrect leaf area indices, as these were derived from satellite data and this indirect measuring method can lead to underestimation; only spectral data are evaluated and leaves lying on top of each other might not be considered. Besides, the spatial resolution is rather low.  Phase of high base flows; SWMM-UrbanEVA base flows are higher than the observed ones while those generated with SWMM fit well. Here, the discrepancy can be explained by two possible reasons: According to [13], the crop factor is not constant but changes in dependence of three developmental stages with specific water demands: an initial start-up phase, an intermediate phase in which the highest crop factors (or crop coefficients) are recorded, and a final phase in which the factor decreases again. If this could be taken into account in SWMM-UrbanEVA, it would be possible to calibrate the model, especially the aquifer, differently to achieve an even better adaptation. Another reason could be incorrect leaf area indices, as these were derived from satellite data and this indirect measuring method can lead to underestimation; only spectral data are evaluated and leaves lying on top of each other might not be considered. Besides, the spatial resolution is rather low. In the last third of the month, in addition to the high pre-humidity, very strong rainfall occurs, which causes the GW level to rise and restart all drainage systems. In this phase, the largest deviations between SWMM-UrbanEVA and SWMM are registered. The SWMM base flows are extraordinarily high and therefore do not offer a realistic curve. The SWMM-UrbanEVA hydrograph adapts much better, but the peak value of the base flow is still too high during this period. This is probably due to wooded areas, which provide too high peak flows (see Table 8).  If the basic flows are considered without the direct runoff peak flows, the valuation criteria improve considerably. Table 8 shows that the base flows are very well represented by the SWMM-UrbanEVA model both on average and in their dynamics. The results produced with SWMM are nevertheless in the "good" range.

Conclusions
The present study was carried out to show the importance of ET and its influence on the base flows of a stream system in the north German lowlands. It was shown that for near natural landscapes, such as agricultural areas, the inclusion of land-use-dependent ET is indispensable for the calculation of water balances over a cross-seasonal period. However, this only applies to areas where the inflow of GW or drainage water to the stream or river plays an important role. Especially in relatively wet summer months, SWMM extremely overestimates the GW inflow because the stocks do not evapotranspirate enough rainwater.
In contrast to the original SWMM version, SWMM-UrbanEVA makes it possible to distinguish between different land use classes and their specific water demands, which is particularly important in small catchments, where the respective characteristics are more pronounced.
A suggestion for improvement can still be made here: Since the water demands of a plant behave differently in different growth stages, it would be useful to allow the input of a dynamic crop factor to achieve an even better adaptation. Nevertheless, single precipitation events (scenarios) can be calculated with SWMM as well as with SWMM-UrbanEVA since ET does not affect direct runoff peaks.
Summarizing, SWMM-UrbanEVA introduces a significant improvement towards process-oriented water balance modelling. However, the introduced additional processes (transpiration, interception, and evaporation) account for computation time. For the presented model area with its 626 subcatchments, 590 junction nodes, and 580 conduit links (including Rainfall-Runoff, Horton Infiltration, GW Flow, and dynamic wave flow routing), SWMM-UrbanEVA requires 2 h 50 min for 20 months, while SWMM requires only 20 min. The ratio is 1:8.5. The computing time of SWMM-UrbanEVA is approximately the same as that of comparable software (e.g., MIKE-SHE).
However, the ability to integrate urban drainage systems and near natural river basins in one single model puts the increased computing time in a different perspective. In other model environments, integrated modelling would require numeric coupling of different modules, often connected with serious numeric stability problems. Funding: This study was conducted within the framework of the Project PROSPER-RO, funded by BMBF, grant number 033L212. We acknowledge financial support by Deutsche Forschungsgemeinschaft and Universität Rostock within the funding program Open Access Publishing.