Temporal Downscaling of IDF Curves Applied to Future Performance of Local Stormwater Measures

: Low-impact development (LID) structures are combined with traditional measures to manage stormwater and cope with increased runoff rates originating from heavy urbanization and climate change. As the use of LIDs for climate adaptation increases, practitioners need more knowledge on LID performance in future climates for successful planning and implementation. In this study, temporal downscaling of regional climate projections for three cities in Norway is performed, using the concept of scale invariance to downscale the distribution of extreme precipitation from daily to sub-daily timescales. From this, local-scale intensity-duration-frequency (IDF) curves for future precipitation were obtained. Using climate projections of daily temporal resolution as input to water balance models and the obtained IDF relationships as input to event-based models allowed for assessing the retention capacity, peak ﬂow reduction potential and pollution control of three different types of LIDs: green roofs, bioretention cells, and detention basins. The downscaling resulted in large local variations in presumed increase of both precipitation amount and intensity, contradicting current design recommendations in Norway. Countrywide, a decrease in the overall LID performance was found, although some positive effects of temperature rises were detected. The study illustrated the importance of evapotranspiration- and inﬁltration-based processes in future stormwater management and how coupling of LID structures in series can signiﬁcantly reduce required detention volumes.


Introduction
Modern stormwater management is aimed at reducing the disadvantages caused by urbanization by maintaining or restoring the predevelopment site hydrology [1]. Urbanization, with an increase of impervious areas, and the introduction of piped systems for stormwater have reduced the evapotranspiration rates, reduced the rate of infiltration to native soil and increased the surface runoffs both with respect to volumes and peak flows. It is expected that the impacts of increased runoff rates will be further induced as a consequence of more frequent and intense precipitation events caused by climate change [2,3]. A stormwater management that counteracts the effects of both urbanization and climate change is needed.
To cope with urbanization, stormwater at-site control was introduced in the 1980s resulting in a large development of distributed stormwater measures [4]. The management was first solely aimed at reducing flooding by peak flow control but has shifted over the past few decades to an approach with multiple objectives related to mitigating the changes in urban hydrology, improving the water quality of receiving waters and delivery of multiple benefits [5]. This approach has been given several different terms like low-impact development (LID), best management practices (BMP), and sustainable urban drainage systems (SUDS) and will in this paper be referred to as LIDs.
Peak flow control has commonly been solved by the use of distributed (site scale) or centralized (catchment scale) detention volumes. Peak flow control can contribute to reduced surface flooding, reduced erosion in natural waterways and reduced capacity problems in downstream piped systems associated with cellar flooding and combined sewer overflows. Challenges to the fixed-flow regulations alone are that they do not counteract the reduced infiltration and evapotranspiration rates, remove pollutants or restore the downstream low-flow regimes, and in some cases, unfortunate superposition of hydrographs might increase peak flows on a large scale and increase combined sewer overflows [1,4,[6][7][8].
Volume control of stormwater runoff is important to restore predevelopment site hydrology and can also contribute to flood control, pollutant control and reduced sewer overflows [4,6,7,9]. The main processes involved in volume control are reduction of stormwater runoff volumes by evapotranspiration and infiltration, often achieved by introducing LID structures. Studies investigating implementation of LID structures on a catchment scale have found these to be efficient in restoring pre-development hydrological conditions and protecting the ecology in receiving waters for small and medium-size precipitation events, while for large design storms additional measures are needed to prevent flooding [8,[10][11][12]. Performance of LID measures are found to vary with several factors, such as type and design of measure, initial saturation [13], precipitation characteristics [14,15] and location in the watershed [16].
Despite the increased knowledge on the disadvantages of applying solely peak flow control regulation on stormwater management, many cities, among those, large Norwegian cities, are focusing the design on large design events, while volume control and pollution control are requested but not quantified in the regulations [17][18][19]. This has resulted in comprehensive establishment of detention basins, while the introduction of other stormwater measures focusing on volume and pollution control has been limited [4,7]. This trend is currently changing and there is a growing interest in a wider variety of stormwater measures and their suitability for local climates and conditions [20][21][22].
However, city growth and climate change will increase the future challenges associated with stormwater management [23,24] and the extended focus to volume and pollution control alone is not enough to obtain a sustainable stormwater management. Regardless of the type of stormwater measure, it must be fit to meet future climates and there is a need for more knowledge on how the different stormwater measures perform both alone and together, in present and future climate, to assure resilient stormwater systems for a future climate. Changes in precipitation patterns and temperature might both affect future design values, as well as altering the hydrological processes that comprise stormwater management. This implies that optimal solutions in present climate are not necessarily optimal in a future climate. To investigate this, knowledge on future climate conditions is necessary. This knowledge is, however, very limited at the local and temporal scale needed for such assessments and there is a gap between available climate information provided by climate research and the information demanded by end-users [25].
To date, the primary source to projections of the future climate, is output of global climate models (GCMs), which are models that simulate the climatic response to scenarios of greenhouse gas emissions (referred to as Representative Concentration Pathways (RCPs)). On a political scale, much information can be drawn from the output of these models, but due to the models' complexity and substantial computational need, the spatial and temporal resolutions of the output are too coarse for the model output to be directly applied to stormwater purposes [25].
To resolve this, techniques for downscaling GCM output to an applicable temporal and spatial resolution have emerged. They are usually grouped into two main categories: (1) dynamical downscaling and (2) statistical downscaling [25,26], where (1) involves nesting of a fine-gridded regional climate models within selected boundaries of the GCM output and (2) refers to establishing a statistical relation between large-scale and local-scale climate [27]. The main advantage of dynamical downscaling is that it is based on physical relations, while the main advantage of statistical approaches is that it is more easily applied and does not require expert knowledge of the climate systems nor extensive computer capacity. Recent advances in climate research and downscaling techniques have made fine-gridded output from regional climate models (RCMs) accessible through projects such as the Coordinated Downscaling Experiment-European Domain (EURO-CORDEX) [28]. The improved resolution provided by such regional climate projections is valuable in order to assess slow hydrological processes, such as evapotranspiration and infiltration, over a long period of time, but the temporal resolution of the projections is, however, still an obstacle when dealing with the rapid runoff experienced in urban areas. Hence, further temporal downscaling of regional climate projections is needed in order to conduct a holistic investigation of stormwater measures' performance in a future climate.
Existing approaches for temporal downscaling can be categorized into stochastic rainfall generation by point process theory, multifractal and cascade processes, and rainfall disaggregation [3]. Common design practice in Norway is usually based on design events, where the design precipitation intensity is chosen from IDF curves following local guidelines. IDF curves are constructed based on frequency analyses of observed precipitation and shows the relationship between the intensity and duration of extreme precipitation events for a range of return periods. Thus, methods for projecting future IDF statistics are of high end-user demand in Norway in order to be able to merge current design practice with considerations for the future. Some studies on temporal downscaling have focused explicitly on such, by using fractal theory and the concept of scale invariance to find a scaling relation between the distribution of extreme precipitation of various durations to project extreme rainfall for the future [29][30][31].
In Norway, one of these approaches [30] has been tested for weather stations in the cities of Bergen and Trondheim. In these studies, daily and sub-daily extremes were extracted from observational series as annual maxima (AMs), fitted to the generalized extreme value (GEV) distribution, and a scaling relationship was found between the GEV statistics of the daily and the sub-daily scales. In Bergen, the results of the scaling were found satisfying for durations as low as 15 min, but the bias increased significantly for lower durations [32]. In Trondheim, the Gumbel distribution (GEV 1) had to be assumed for the AMs due to unsatisfying fittings of the shape parameter [33]. The results suggest that further research on defining rainfall extremes on fine temporal scales and proper parameter estimation is needed in Norway to provide reliable climate projections that can be accounted for in planning of stormwater management [32,33].
The motivation of this study was to address the need for knowledge on long-term LID performance and bridging the gap between available and required climate information. To accomplish this, the methodology for temporal downscaling from regional climate projections to future IDF curves is further developed and performed for three Norwegian cities. The regional projections and IDF curves obtained are further used to investigate the efficiency of different stormwater measures and their suitability in the present and a future climate. Relevant regulations like peak flow control, volume runoff control and pollution control are applied to evaluate the performance of three stormwater measures according to the state-of-the-art holistic approach for stormwater management. The stormwater measures assessed are: (1) green roof, (2) bioretention cell, (3) detention basin. Both water balance models and event-based models are used such that all hydrological processes, rapid and slow, under climate change are explored. Specifically, this study seeks answer the following research questions: (1) What is the best suited method for constructing future IDF curves based on scaling laws; (2) Investigating how geographical variations will influence the performance of local stormwater measures in a future climate.
The structure of the succeeding sections of this paper reflects the two-folded research questions. Section 2, Materials and Methods, includes a description of the study sites (Section 2.1) and the input data to the assessments (Section 2.2), namely meteorological observations and regional climate projections. Subsequently, the temporal downscaling approach for projecting future IDF curves and construction of future design-events (Section 2.3) and the LID models used for assessing future performance (Sections 2.4-2.6) are described. Furthermore, the climate projections and results of the downscaling are presented and discussed in Sections 3.1 and 3.2, and their implications for future LID performance in Sections 3.3-3.5. The work is concluded in Section 4, along with venues and recommendations for future work.

Materials and Methods
The defined research questions were addressed by coupling frameworks for temporal downscaling of climate projections with LID performance assessments ( Figure 1). Observational data from three study sites in Norway was used for calculating extreme value statistics and establishing a scaling relationship between daily and sub-daily extreme precipitation (Step 1). The scaling relationship obtained was further applied to regional climate projections of daily precipitation for the same three sites (Step 2) to construct IDF curves of projected precipitation. Finally, the projected IDF curves and the projected daily precipitation series were used as input to design-event (DE) models and water balance (WB) models to assess the local stormwater measures' event-based and long-term performance in a future climate, respectively (Step 3).

Materials and Methods
The defined research questions were addressed by coupling frameworks for temporal downscaling of climate projections with LID performance assessments ( Figure 1). Observational data from three study sites in Norway was used for calculating extreme value statistics and establishing a scaling relationship between daily and sub-daily extreme precipitation (Step 1). The scaling relationship obtained was further applied to regional climate projections of daily precipitation for the same three sites (Step 2) to construct IDF curves of projected precipitation. Finally, the projected IDF curves and the projected daily precipitation series were used as input to design-event (DE) models and water balance (WB) models to assess the local stormwater measures' event-based and long-term performance in a future climate, respectively (Step 3).

Figure 1.
High-level description of the steps and data flow from temporal downscaling to performance assessment of local stormwater measures.

Study Sites
Norway is located in Scandinavia, Northern Europe, and characterized by its long and narrow shape, ranging from latitudes 57 to 71. This span of latitudes, along with an extensive coastline covering the western part of the country, contributes to large variations in local climate conditions, such as mild and wet coastal climates in the southwest and cold winters combined with hot summers, inland. Three cities have been selected for this study, in order to capture parts of the climate variability found in Norway. The selected cities, Bergen, Oslo, and Trondheim (Figure 2), are the three largest cities in the country. The climate in Bergen is characterized by mild temperatures and high amounts of annual rainfall, originating from a combination of frontal and orographic precipitation. In Trondheim and Oslo, the climates are more continental, with less precipitation and colder winters. For a more comprehensive description of the climates for the three selected cities, the reader is referred to Johannessen et al. (2017) [34].

Study Sites
Norway is located in Scandinavia, Northern Europe, and characterized by its long and narrow shape, ranging from latitudes 57 to 71. This span of latitudes, along with an extensive coastline covering the western part of the country, contributes to large variations in local climate conditions, such as mild and wet coastal climates in the southwest and cold winters combined with hot summers, inland. Three cities have been selected for this study, in order to capture parts of the climate variability found in Norway. The selected cities, Bergen, Oslo, and Trondheim (Figure 2), are the three largest cities in the country. The climate in Bergen is characterized by mild temperatures and high amounts of annual rainfall, originating from a combination of frontal and orographic precipitation. In Trondheim and Oslo, the climates are more continental, with less precipitation and colder winters. For a more comprehensive description of the climates for the three selected cities, the reader is referred to Johannessen et al. (2017) [34].

Metrorological Input Data
Observations of precipitation were collected for the three study sites Bergen, Oslo, and Trondheim. The data was obtained from the Norwegian Meteorological Institute through the web portal www.eklima.no [36]. The currently operating stations with the longest records of minute data was selected (Table 1). The climate projections used for temporal downscaling, were regional climate projections for Norway covering the time span 1971-2100 made available by the Norwegian Centre for Climate Services (NCCS). The data is from heron referred to as the NCCS data, and they originate from downscaling of 10 EURO-CORDEX GCM/RCM simulations by an empirical quantile mapping method [37]. The spatial resolution of the available data is 1x1 km and the temporal scale is daily, hence the need for temporal downscaling. All 10 simulations are available for RCP4.5 and RCP8.5, which refers to an intermediate level of emissions and high level of emissions, respectively. Since the future of human greenhouse gas emissions is unknown, the most conservative, worst case scenario was studied in this paper (RCP8.5). To capture some of the uncertainty associated with climate modelling, the full ensemble of the 10 simulations for RCP8.5 was used. The data used in this study included daily precipitation depth and daily average temperature.

Metrorological Input Data
Observations of precipitation were collected for the three study sites Bergen, Oslo, and Trondheim. The data was obtained from the Norwegian Meteorological Institute through the web portal www. eklima.no [36]. The currently operating stations with the longest records of minute data was selected (Table 1). The climate projections used for temporal downscaling, were regional climate projections for Norway covering the time span 1971-2100 made available by the Norwegian Centre for Climate Services (NCCS). The data is from heron referred to as the NCCS data, and they originate from downscaling of 10 EURO-CORDEX GCM/RCM simulations by an empirical quantile mapping method [37]. The spatial resolution of the available data is 1 × 1 km and the temporal scale is daily, hence the need for temporal downscaling. All 10 simulations are available for RCP4.5 and RCP8.5, which refers to an intermediate level of emissions and high level of emissions, respectively. Since the future of human greenhouse gas emissions is unknown, the most conservative, worst case scenario was studied in this paper (RCP8.5). To capture some of the uncertainty associated with climate modelling, the full ensemble of the 10 simulations for RCP8.5 was used. The data used in this study included daily precipitation depth and daily average temperature.

Temporal Downscaling
The observations of fine time-scale rainfall were used to establish a scaling relation between the statistical properties of extreme daily and sub-daily precipitation. Assuming that the scaling relation found between observations also holds for the future, the obtained scaling relation was then applied to the projected daily precipitation (NCCS data) for the three study sites such that projections of the statistical properties of extreme sub-daily precipitation for the future period 1971-2100 was obtained.
The temporal downscaling of regional climate projections was based on fractal theory and the principle of scale invariance, which can be used to link the statistical properties of rainfall of different scales [38,39]. Fractal theory suggests that the statistical properties of the rainfall process are scale-invariant, and this scale-invariant property can be expressed by: where α is the precipitation distribution parameter, D is the time scale, λ the scale factor, and β the scale exponent [3]. In a simple scaling case, the scaling exponent is constant and equal to the slope of the linear relation between β and D in a double logarithmic plot [39] and the scaling relation also holds for the distribution quantiles [38]. Equation (1) was used to scale the distribution of extreme rainfall. Rainfall extremes are commonly extracted from observed time series in one of two ways: (1) by extracting the most extreme event within a hydrological year (AM, or (2) by extracting all events above a predefined threshold (peak-over-threshold (POT)). In general, the AM approach helps selecting events that are independent and identically distributed and the simplicity of the method is an advantage. However, some extremes could be lost if other events than the maximum of a year exceeds the annual maxima of other years. Thus, the POT approach may give a more consistent definition of extremes [3] and might be preferred when the observed rainfall series are short. Extreme events extracted from observational time series by the POT approach follow a generalized Pareto distribution (GPD) [40]. The cumulative distribution function, F(z), for the POT extremes (z) are given by: where u is the defined threshold, σ the scale parameter, and ξ the shape parameter. When applying models of threshold excesses, proper threshold selection is important in order to balance bias and variance. The threshold must be high enough to ensure that the observations do not belong to the central part of the distribution (without risking too few excesses above the threshold), but low enough to reduce variance [40]. Common approaches for threshold selection are to set the threshold based on a percentile (e.g., 99th or 99.5th percentile), by visual inspection of mean residual life (MRL) plots or dispersion index plots prior to model fitting, or by looking for stability of parameters after fitting the model over a range of thresholds. Manually inspecting many plots and series will be impractical, and some methods for optimizing this process have emerged, such as, the combined-peak-over-threshold (CPOT) approach by Anagnostopoulou and Tolika [41].
In addition to threshold selection, the GPD parameters will be affected by choice of method for parameter estimation. For practical reasons, all GPDs was fitted by maximum likelihood estimation (MLE) in this study [32]. Furthermore, the behavior of the GPD is greatly dependent on the shape parameter. A global study of the GEV shape parameter, including 71 data series from Norwegian stations found that the shape parameter in Norway lies in the interval 0.028-0.156, with an average of 0.044 [42]. In this study, precipitation extremes for time scales 5 min, 10 min, 15 min, 30 min, 1 h, 3 h, 6 h, 12 h and 24 h were extracted from aggregated time series of observations using various threshold selection techniques: (1) the combined POT (CPOT) approach [41], and (2) 95th, 99th and 99.5th percentiles. To secure independent events, an independence criterion of 48 h between daily and 12 h events and 24 h between the remaining sub-daily durations was set, following the recommendation that the independence criteria should exceed the durations of the events [3]. Since it has been argued that [42,43], the fitting of the GPD to daily rainfall extremes was tested with a fixed shape parameter of 0.044, corresponding to the Norwegian average [42] for all of the three locations. When fitting the sub-daily GPDs, the daily shape parameter for the respective locations was used, as previous studies has indicated that this gives a more stable downscaling model [44]. The goodness of the fit between the empirical and theoretical distribution was assessed by a traditional Chi-squared test, where the p-value was computed for a Monte Carlo test [45]. The analyses were performed in the open-source software R, using built-in functions and the R package 'POT' [46].
From the resulting extreme precipitation distribution parameters, return levels used for constructing the IDFs were estimated based on the formula: where N is the return period, Z N is the N-year return level, n y the number of observations per year, and ζ u = Pr{X > u} [40]. In this study, the sample proportion of points exceeding u was used as an estimation for ζ u [40]. From this, the scaling exponent, β, was found by studying the relationship between the observed Z n and D in a double logarithmic plot. The scaling exponent obtained was then used to scale Z N of the projected daily precipitation (NCCS data) to lower durations in order to construct IDF curves for the future period 1971-2100. The projected IDF curves were used to create design events to be used as input to the event-based models of LID performance. Local stormwater measures for small catchments are, in Norway, commonly designed for peak flow control with storm durations corresponding to time of concentration, typically ranging from 5-10 min up to one hour [47]. Synthetic designed precipitation hyetographs based on local IDF-curves, incorporating intensities for several different durations, are recommended methods when designing based on IDF-data. This method requires knowledge on precipitation patterns [48]. Little work has been done on creating specific synthetic hyetographs for Norwegian short duration precipitation. For the purpose of this study, two synthetic design events were constructed representing the extremes, where the peak arrives on initial dry or wet conditions. Both were constructed based on local IDF-curves and a 20-year return period event with durations 10 and 60 min. The peak was constructed as a 10 min block rain starting after 5 min for the initial dry conditions and after 50 min for the initial wet conditions. Both alternatives were designed to include the 60 min precipitation depth. The design events were constructed from the present, future maximum and future minimum IDF-curves.

Low-Impact Development (LID) Performance Assessment
The effect of future climate scenarios on different stormwater measures have been investigated with respect to both long time series and short-term event performance under climate change. Three different measures, bioretention cells, extensive green roofs and detention basins, have been included in the study to represent important processes for stormwater management as infiltration, evapotranspiration and peak runoff reduction and delay ( Table 2). The long-term performance depends mainly on the processes of evapotranspiration and infiltration to native soil reducing volume loads on the downstream system, while filtration processes reduce the runoff off pollutants and contribute to improved quality of receiving water bodies. The short-term design event performance, on the other hand, depend mainly on the processes of detention resulting in peak reductions and delay. The long-term performance was based on the 30-year time-series of daily precipitation and temperature projections (NCCS data). Short-term event-based performance was investigated by applying a 20-year return period design event based on locally derived IDF-curves for the future obtained by downscaling the NCCS data.  The focus was on a site scale rather than on a catchment scale, based on an assumption, as suggested by Burns et al. [49], that urban stormwater management should emphasize the restoration or protection of natural hydrologic processes at small scales, with the aim of restoring natural flow regimes at larger scales downstream. For comparative reasons, all stormwater measures were tested on a site scale based on a catchment area of 1000 m 2 of roof or another impervious catchment area (e.g., parking lot). A detailed description of the methods follows.
The green roofs modelled in this study were based on a build-up used in field studies in four Norwegian sites; Trondheim, Bergen, Sandnes and Oslo [50]. The field studies had a build-up consisting of a 30 mm pre-grown, sedum-based vegetation mat over additional 50 mm of green roof substrate, a 25-75 mm plastic or polystyrene drainage layer and a 5 mm thick textile retention fabric as the bottommost layer. This build-up corresponded to a theoretical water storage capacity of 19-24 mm. The green roof water balance model was run with a maximum water storage capacity (S max ) of 25 mm. The denotation in parenthesis refer to the parameter abbreviations used in Equations (4)- (9). The roof made for the model exemplification was a typical commercial building with a 1000 m 2 flat roof. Slope was set according to minimum requirements in Norwegian building regulations of 1:40 (2.5%) [51] and the maximum distance to drainpipe was set to 10 m. The bioretention cell area (A B ) (50 m 2 ) was set to 5% of the catchment area (A C ) (1000 m 2 ) corresponding to common sizing recommendations [52]. The catchment was assumed to be 100% impervious illustrating a conventional roof or a parking lot. The maximum ponding depth was set to 15 cm, giving an above ground storage volume of 7.5 m 3 (V1 MAX ) [53].
The bioretention media depth was set to 75 cm [53,54] with an infiltration rate of 10 cm/h (Inf1) which is a minimum recommended value for cold climates based on investigations carried out by Paus et al. [55]. A drainage pipe was placed in the upper part of a drainage layer. The drainage layer depth was set to 25 cm and with a high porosity of 50% this constituted a sub-surface storage layer of 6.25 m 3 (V2 MAX ). Many costal Norwegian cities suffer from low infiltration capacities due to their costal location with a large extent of marine sediments. For this reason, a relatively low native soil infiltration rate of 0.5 cm/h (Inf2) was applied, corresponding to minimum values found from infiltration tests at 60 cm depth in loam, silt loam and silt clay loam soil in Oslo [56].

Daily Time-Step Models
Long-term behavior of green roofs was modelled to study future climates effects on volume flows of evapotranspiration and runoff, by the use of a continuous green roof water balance model as described in [34]. The model was based on daily time steps (t) and calculated green roof runoff (R) as a function of precipitation (P), actual storage (S), maximum storage capacity (S max ), crop coefficient (C crop ) and evapotranspiration (Equations (4) and (5)). Potential evapotranspiration (PET) estimates were based on the Oudin model (Equation (6)), which was found to be the simplified temperature (T) based model performing best for the Nordic climates [57]. PET estimates were coupled with a soil moisture extraction function and a crop coefficient to calculate actual evapotranspiration (AET) (Equation (7)).
The model applied in [34] was improved by calibrating the crop coefficient based on three years of continuous observations from field studies [50]. The model was calibrated was based on minimizing the objective function relative percentage difference (RPD) for three year of continuous data (2015-2017) and validated with the objective function Nash-Sutcliffe efficiency (NSE) for non-winter data [58]. Model results were calculated both annually and for the temperate season (defined as the period May through October) when most of the retention was expected to take place. The data was further divided into events divided by dry periods, defined as days with less than 1 mm precipitation. Maximum duration of dry periods was expressed by the 99.5% percentile of all observed dry periods and a drought incident was defined by less than 1 mm water stored in the end of a dry period. Available retention capacity at the beginning of a precipitation event was defined as the median observed retention from events with more than 5 mm precipitation, duration less than 3 days and where runoff occurred.
Long-term behavior of bioretention cells was modelled to study future climates effects on volume flows of infiltration, runoff and potential evapotranspiration. A bioretention cell water balance model (Equations (8) and (9)) with daily time-steps equivalent to the one used for green roofs were set up, with the same input time series of precipitation (P) and evapotranspiration (PET), as used in the green roof water balance model.
Calculations of overflow of the unit were based on an assumption that all precipitation arrived during a period of 1.7 h. The time period was found as the average time with precipitation on the days when the total precipitation amounts exceeded the above surface storage volume. Storage in the bioretention media was not included, and evapotranspiration estimates were set equal to the potential evapotranspiration only for time-steps with water available. This would probably lead to an underestimation of evapotranspiration, but due to the low catchment to bioretention surface area ratio, evapotranspiration would represent a small volume flow and the simplification was, therefore, considered to be sufficient for the purpose which was to evaluate changes in volume flow for future climates.
Initial values:

Design Event Models
Green roof detention performance in present and future climate was investigated using the green roof specific LID module in the Storm Water Management Model (SWMM version 5.1.012) [59]. The green roof was modelled as a subcatchment totally covered by a green roof and compared to a reference roof modelled as a 100% impervious subcatchment covered with a black bitumen liner. The model was run on an event basis. The initial saturation in the green roof model is important as it defines the available retention capacity when precipitation starts, and was set to 5 mm. This is an approximate conservative value based on observations in Bergen, Sandnes, Trondheim and Oslo [50], as can also be seen when compared to the results from the green roof water balance model. The SWMM green roof module and a comparative black bitumen reference roof were based on parameters derived from a multi-site calibration of the model with field data [60]. The highest intensity events included in the calibration data was a 10 min intensity of 0.83 mm/min and 60 min intensity of 0.36 mm/min. The RECARGA model developed by Wisconsin Department of Natural Resources was used to evaluate the performance of a bioretention facility [61]. The RECARGA is a 1D model including up to three soil layers and optional underdrains, simulating the water movement applying the Green-Ampt infiltration model, and the van Genuchten relationship for drainage between soil layers. Model output includes inflow, overflow, flow through underdrain and infiltration to native soil. Evapotranspiration is set to zero during this short-term event modelling.
Stormwater detention basins, commonly built as underground constructions in Norway, was investigated by the use of a simplified spreadsheet model using the rain envelope method commonly applied by stormwater engineers [17,19,62,63]. Required detention basin volumes were calculated based on a maximum rate of outflow for different durations of a box type design rain from local IDF curves, and the duration giving the largest volume was used for design. Outflow was set to a constant value of 20 l/s·ha corresponding to a maximum discharge limit to public piped systems applied by the City of Oslo [63]. Most orifice restrictors would give some variations in outflow rate depending on the water depth in the basin, but for the purpose of comparing required volumes for different future climate scenarios this simplified approach was considered to be sufficient.
The effect of installing green roof and/or bioretention cells prior to stormwater detention basins was investigated. The measures were put in series, e.g., the runoff profile from the green roof SWMM model was used as input to the RECARGA model of the bioretention-cell, and the runoff from REGARGA was used as input to the detention basin. Stormwater measures in series could have been modelled inside SWMM, which introduced the possibility to model LIDs in series between 2015 and 2016 [59,64]. Still, RECARGA was chosen for the bioretention cell modelling due to the authors' prior experiences, where this model showed acceptable performance compared to observed bioretention cell runoff in a similar climate [65].

Temporal Downscaling
In total, the GPD was fitted to 27 datasets of extreme events (3 locations × 9 durations), with four different threshold levels. The obtained parameters and results of the chi-squared test were stored for each fit. According to the statistics obtained for daily extremes (Table 3), the variation (σ) obtained could be characterized as stable, compared to the threshold level (µ) which varies greatly between percentiles 0.95 and 0.995. Using the CPOT approach for threshold selection results in threshold levels belonging to the lower range of the percentile-based thresholds. The results of the fitting of the sub-daily extremes by the same approach is summarized by the % sub-daily GPD fits that resulted in the p-value exceeding a significance level of α = 0.05. Based on the chi-squared test p-value, a threshold selection corresponding to the 99.5th percentile is superior to other threshold levels for all locations and sub-daily durations. However, the chi-squared test also shows that some unsatisfactory results were obtained, especially for sub-daily extremes in Oslo and Trondheim. This is likely to be explained by the presumed shape parameter, which was fixed at 0.044. Letting ξ vary in the range 0.028-0.156, as suggested by Ragulina and Reitan [42], and testing other methods for parameter estimation, could result in better parameter fits and should be further investigated. However, fixing the shape parameter (ξ = 0.044) and limiting the parameter estimation techniques to one (MLE), allowed us to study the sensitivity of the GPD fit to threshold selection. In further steps, the fixed shape parameter and the 99.5th percentile threshold level was kept.
Return levels for return periods T = (2, 5,10, 20, 30, 50, 100) years were estimated (Equation (3)) and plotted against duration in a double logarithmic plot (Figure 3). The scaling exponent, β, is calculated for each return period, and the average of these is used for further calculations. By inspecting the plots, a clear linear relation in the double logarithmic plot is found for Bergen and Trondheim (Figure 3a,c), indicating a simple scaling case with a constant scaling exponent for all durations. The return levels for Oslo, however, show a concave behavior (Figure 3b). The shift occurs at~60 min, suggesting a difference in scaling behavior for sub-hourly extreme precipitation. This is similar to results reported by e.g., Nguyen et al. [30] who also found a simple scaling behavior within the same two different time intervals when performing a similar study for Quebec, Canada.
Trondheim (Figure 3a,c), indicating a simple scaling case with a constant scaling exponent for all durations. The return levels for Oslo, however, show a concave behavior (Figure 3b). The shift occurs at ~60 minutes, suggesting a difference in scaling behavior for sub-hourly extreme precipitation. This is similar to results reported by e.g., Nguyen et al. [30] who also found a simple scaling behavior within the same two different time intervals when performing a similar study for Quebec, Canada.
The results of scaling the observed daily return levels are compared to the return levels estimated based on the GPD parameters ( Figure 4). Common for all locations is that the scaling models underestimate the return levels for the highest and the lowest durations. For Oslo and Trondheim, the scaling models are found to behave very well from a duration of 3h and down to ~30min and ~15min, respectively (Figure 4b and c). The scaling model for Bergen is seen to always overestimate or underestimate the return levels and the offset is worst for higher durations and higher return periods. However, there is a higher agreement between scaled return levels and return levels estimated by GPD parameters than what has been found in earlier studies for Bergen [32] and Trondheim [33] based on AM and GEV distribution selection.  The results of scaling the observed daily return levels are compared to the return levels estimated based on the GPD parameters ( Figure 4). Common for all locations is that the scaling models underestimate the return levels for the highest and the lowest durations. For Oslo and Trondheim, the scaling models are found to behave very well from a duration of 3 h and down to~30 min and 15 min, respectively (Figure 4b,c). The scaling model for Bergen is seen to always overestimate or underestimate the return levels and the offset is worst for higher durations and higher return periods. However, there is a higher agreement between scaled return levels and return levels estimated by GPD parameters than what has been found in earlier studies for Bergen [32] and Trondheim [33] based on AM and GEV distribution selection.

Projected Future Climate
Both average daily temperatures and precipitation increased for all (10) tested future scenarios. Temperature increase from present to the median off all future models were found to be quite similar, with an increase of 4.6 • C in Bergen (BER), 5.1 • C in Trondheim (TRD) and 5.6 • C in Oslo (OSL), while daily precipitation increased more in Bergen (2 mm) compared to Trondheim (0.6 mm) and Oslo (0.3 mm) ( Figure 5). The 10 projected models also produce a range of scenarios. For daily precipitation in Bergen, this range is especially high. Furthermore, it is found that the models that produce the max and min scenarios are not the same across locations. This highlights the need for and usefulness of an ensemble approach.

Projected Future Climate
Both average daily temperatures and precipitation increased for all (10) tested future scenarios. Temperature increase from present to the median off all future models were found to be quite similar, with an increase of 4.6 °C in Bergen (BER), 5.1 °C in Trondheim (TRD) and 5.6 °C in Oslo (OSL), while daily precipitation increased more in Bergen (2 mm) compared to Trondheim (0.6 mm) and Oslo (0.3 mm) ( Figure 5). The 10 projected models also produce a range of scenarios. For daily precipitation in Bergen, this range is especially high. Furthermore, it is found that the models that produce the max and min scenarios are not the same across locations. This highlights the need for and usefulness of an ensemble approach. Scaling the projected daily return levels, using the obtained scaling relations was performed to construct IDF curves also for future precipitation. The scaling was performed for all 10 future scenarios, such that an ensemble of return levels was obtained for each return period. Similar to the daily projections, the projected T = 20 year sub-daily precipitation intensities for Bergen result in the largest increase compared to observations ( Figure 6). For Bergen, an increase of return levels is projected to all scenarios, while some scenarios result in a decrease of return levels for Oslo and Trondheim. For the observed return levels, it was estimated that Oslo experiences the highest intensity of short duration rainfall and a shift between Oslo and Bergen occurs approximately around duration 3 h. In the projections, the intensity of short duration rainfall in Bergen is expected to exceed the corresponding intensities for Oslo for all durations, and no such shift occurs. Comparing Oslo to Trondheim, the relative difference between intensities stay roughly the same from observations to projections.
The relative difference between projected and observed return levels, addressed climate factor (CF), is of practical interest for stakeholders planning sustainable systems in Norway as it is often included in design guidelines. In Norwegian municipalities, it is common to operate with a climate factor in the range 1.2-1.5 that is added to the design precipitation intensity (collected from IDF curve). Usually, vulnerable infrastructure is designed for higher return periods, but the climate factor is kept constant. The results of this study contradict this practice (Figure 7). For estimated hourly climate factors, there is a large variation in estimated climate factors both across locations and return periods. For return periods, T, in the range 2-100 years, the climate factor is increasing, indicating Scaling the projected daily return levels, using the obtained scaling relations was performed to construct IDF curves also for future precipitation. The scaling was performed for all 10 future scenarios, such that an ensemble of return levels was obtained for each return period. Similar to the daily projections, the projected T = 20 year sub-daily precipitation intensities for Bergen result in the largest increase compared to observations ( Figure 6). For Bergen, an increase of return levels is projected to all scenarios, while some scenarios result in a decrease of return levels for Oslo and Trondheim. For the observed return levels, it was estimated that Oslo experiences the highest intensity of short duration rainfall and a shift between Oslo and Bergen occurs approximately around duration 3 h. In the projections, the intensity of short duration rainfall in Bergen is expected to exceed the corresponding intensities for Oslo for all durations, and no such shift occurs. Comparing Oslo to Trondheim, the relative difference between intensities stay roughly the same from observations to projections.

LID Performance: Volume and Pollution Control
Evapotranspiration is in this study represented by green roofs and bioretention cells, while other stormwater measures like rooftop disconnections, the use of vegetation and pervious surfaces also benefit from this process. The green roof water balance model was run with a crop coefficient of 0.9, based on the calibration results from all sites with |RPD| < 10% and NSE > 0.6. Green roofs contribute to a considerable reduction in annual runoff volumes by retention and subsequent evapotranspiration in present climate with 15% in Bergen, 26% in Trondheim and 35% in Oslo ( Figure  8a). The fractions increase slightly for most future scenarios in spite of the increased precipitation volumes, due to the increased temperatures' effect on evapotranspiration. The performance The relative difference between projected and observed return levels, addressed climate factor (CF), is of practical interest for stakeholders planning sustainable systems in Norway as it is often included in design guidelines. In Norwegian municipalities, it is common to operate with a climate factor in the range 1.2-1.5 that is added to the design precipitation intensity (collected from IDF curve). Usually, vulnerable infrastructure is designed for higher return periods, but the climate factor is kept constant. The results of this study contradict this practice (Figure 7). For estimated hourly climate factors, there is a large variation in estimated climate factors both across locations and return periods. For return periods, T, in the range 2-100 years, the climate factor is increasing, indicating that vulnerable infrastructure also should be design with a higher climate factor. According to the investigated projections, Bergen is expected to experience the largest increase in extreme hourly precipitation, while the climate factors for Oslo and Trondheim are significantly lower but showing the same increase with return periods as Bergen.

LID Performance: Volume and Pollution Control
Evapotranspiration is in this study represented by green roofs and bioretention cells, while other stormwater measures like rooftop disconnections, the use of vegetation and pervious surfaces also benefit from this process. The green roof water balance model was run with a crop coefficient of 0.9, based on the calibration results from all sites with |RPD| < 10% and NSE > 0.6. Green roofs contribute to a considerable reduction in annual runoff volumes by retention and subsequent evapotranspiration in present climate with 15% in Bergen, 26% in Trondheim and 35% in Oslo ( Figure   Figure 7. Climate factors (CF) of hourly precipitation intensity, calculated as the relative difference between observed and projected return levels for locations Bergen, Oslo and Trondheim.

LID Performance: Volume and Pollution Control
Evapotranspiration is in this study represented by green roofs and bioretention cells, while other stormwater measures like rooftop disconnections, the use of vegetation and pervious surfaces also benefit from this process. The green roof water balance model was run with a crop coefficient of 0.9, based on the calibration results from all sites with |RPD| < 10% and NSE > 0.6. Green roofs contribute to a considerable reduction in annual runoff volumes by retention and subsequent evapotranspiration in present climate with 15% in Bergen, 26% in Trondheim and 35% in Oslo (Figure 8a). The fractions increase slightly for most future scenarios in spite of the increased precipitation volumes, due to the increased temperatures' effect on evapotranspiration. The performance improved most for Oslo, due to the combined effect of temperatures and precipitation, which is most favorable for Oslo with the highest temperature increase and the lowest precipitation increase. This makes green roofs and other stormwater measures relying on evapotranspiration for stormwater removal, favorable solutions for present and future stormwater management with respect to volume reduction. These results are based on relatively cold and wet Norwegian climates and the findings are expected to be even more advantageous for warmer and drier climates. Median available retention capacity in the temperate season increases for all future scenarios at all locations (Figure 8b). This measure is especially of interest during the temperate season where small precipitation events can be retained totally or where larger precipitation events experiences an enhanced detention effect in the first part of larger precipitation events due to the initial retention taking place before runoff is initiated. Analysis of short-term extreme precipitation events (5-15 min duration) show that these events occur in the temperate season, where advantage can be taken of the available retention capacity of the green roofs. improved most for Oslo, due to the combined effect of temperatures and precipitation, which is most favorable for Oslo with the highest temperature increase and the lowest precipitation increase. This makes green roofs and other stormwater measures relying on evapotranspiration for stormwater removal, favorable solutions for present and future stormwater management with respect to volume reduction. These results are based on relatively cold and wet Norwegian climates and the findings are expected to be even more advantageous for warmer and drier climates. Median available retention capacity in the temperate season increases for all future scenarios at all locations ( Figure  8b). This measure is especially of interest during the temperate season where small precipitation events can be retained totally or where larger precipitation events experiences an enhanced detention effect in the first part of larger precipitation events due to the initial retention taking place before runoff is initiated. Analysis of short-term extreme precipitation events (5-15 min duration) show that these events occur in the temperate season, where advantage can be taken of the available retention capacity of the green roofs. Dry periods involve risk of vegetation drought. There is no clear trend in changes in duration of the extreme dry periods from present situation to future scenarios, but the increased evapotranspiration rates give an increased number of drought incidents for most future scenarios (Figure 9). This can be coped with by increasing the water storage capacities of the applied systems. The risk of increased future green roof drought was found to be largest for Oslo, increasing from 4 to as much as 19 incidents in 30 years for the highest future scenario, indicating that higher storage capacities than the chosen 25 mm should be considered when building extensive green roofs for a future climate in Oslo. Dry periods involve risk of vegetation drought. There is no clear trend in changes in duration of the extreme dry periods from present situation to future scenarios, but the increased evapotranspiration rates give an increased number of drought incidents for most future scenarios (Figure 9). This can be coped with by increasing the water storage capacities of the applied systems. The risk of increased future green roof drought was found to be largest for Oslo, increasing from 4 to as much as 19 incidents in 30 years for the highest future scenario, indicating that higher storage capacities than the chosen 25 mm should be considered when building extensive green roofs for a future climate in Oslo.
Infiltration to native soil is in this study represented by a bioretention cell in a soil with relatively low infiltration capacity, to illustrate the potential for this process even at less favorable conditions. Infiltration is also important for other stormwater measures like rooftop disconnections, infiltration trenches and all types of pervious surfaces. Bioretention cells contribute to a considerable reduction in annual runoff volumes by infiltration to native soil in present climate with 54% in Bergen, 83% in Trondheim and 85% in Oslo (Figure 10b). The fractions decrease slightly for all future scenarios due to increased precipitation amounts but can also in a future climate give a large contribution to runoff volume reductions. Bioretention cells are the only stormwater measure in this study that can remove pollutants from stormwater by filtration treatment, while this effect will not be present for volumes that overflow the unit's capacity. Volumes treated for pollution control was found to be 63% in Bergen and 88% in Trondheim and Oslo for the present climate, decreasing slightly for all future scenarios tested (Figure 10a). The Bergen site experiences more than twice the precipitation amounts compared to the two other sites and a higher bioretention cell-to-catchment-area ratio, than the 5% used in this study, should be applied to improve the general performance. Bioretention cell above ground storage area could also be increased to improve performance with respect to fraction of runoff treated and/or infiltrated [8]. Evapotranspiration was only found to represent a small fraction of the volume reduction in bioretention cells (0.6-1.4%) due to the small surface to catchment area ratio. Dry periods involve risk of vegetation drought. There is no clear trend in changes in duration of the extreme dry periods from present situation to future scenarios, but the increased evapotranspiration rates give an increased number of drought incidents for most future scenarios (Figure 9). This can be coped with by increasing the water storage capacities of the applied systems. The risk of increased future green roof drought was found to be largest for Oslo, increasing from 4 to as much as 19 incidents in 30 years for the highest future scenario, indicating that higher storage capacities than the chosen 25 mm should be considered when building extensive green roofs for a future climate in Oslo. Infiltration to native soil is in this study represented by a bioretention cell in a soil with relatively low infiltration capacity, to illustrate the potential for this process even at less favorable conditions. Infiltration is also important for other stormwater measures like rooftop disconnections, infiltration trenches and all types of pervious surfaces. Bioretention cells contribute to a considerable reduction in annual runoff volumes by infiltration to native soil in present climate with 54% in Bergen, 83% in Trondheim and 85% in Oslo (Figure 10b). The fractions decrease slightly for all future scenarios due to increased precipitation amounts but can also in a future climate give a large contribution to runoff volume reductions. Bioretention cells are the only stormwater measure in this study that can remove pollutants from stormwater by filtration treatment, while this effect will not be present for volumes that overflow the unit's capacity. Volumes treated for pollution control was found to be 63% in Bergen and 88% in Trondheim and Oslo for the present climate, decreasing slightly for all future scenarios tested (Figure 10a). The Bergen site experiences more than twice the precipitation amounts compared to the two other sites and a higher bioretention cell-to-catchment-area ratio, than the 5% used in this study, should be applied to improve the general performance. Bioretention cell above ground storage area could also be increased to improve performance with respect to fraction of runoff treated and/or infiltrated [8]. Evapotranspiration was only found to represent a small fraction of the volume reduction in bioretention cells (0.6-1.4%) due to the small surface to catchment area ratio.

LID Performance: Peak Flow Control
Peak flow control in terms of peak reduction and peak delay was limited for the modelled green infrastructure alone (green roofs and bioretention cells) for the tested 20-year 1h design events. The performance improved with reduced peak precipitation intensities/precipitation depth and when the highest peak intensity arrived early in the precipitation event (when the unit was not saturated yet)

LID Performance: Peak Flow Control
Peak flow control in terms of peak reduction and peak delay was limited for the modelled green infrastructure alone (green roofs and bioretention cells) for the tested 20-year 1 h design events. The performance improved with reduced peak precipitation intensities/precipitation depth and when the highest peak intensity arrived early in the precipitation event (when the unit was not saturated yet) (Figures 11 and 12). The SWMM green roof modelling parameters were calibrated with field data with smaller peak intensities than the highest intensities tested here, limiting the accuracy of the model. Still, field observations on peak reductions for the largest observed events (Figure 11a) resemble the model results where the peak arrived on initial dry conditions giving peak reduction above 55%. This indicates that green roofs can contribute to some extent to peak flow control events for high intensity events. However, with only three year of observations the worst case situations are probably not captured. For the costal climates studies these could typically be high-intensity precipitation events inside a longer low intensity event giving initial conditions comparable to the modelled initial wet situation. For the highest modelled peak intensities, and when the peak arrived late in the precipitation event (initial wet) very low peak reductions and peak delays were found. The most likely explanation for this was that the green roofs were almost totally saturated, and when the flow inn increased, the same amount of water would have to be pushed out almost immediately by the hydraulic pressure. Bioretention cells were overflowed at most tested scenarios resulting in low peak reduction and peak delays (Figure 12c). A substantial part of the precipitation (11-35%) could be infiltrated even at these large design events. Fractions overflowing the unit resulting in no pollution control were high for the largest design events (up to 69%). The highest future scenarios represent an increase in precipitation for all locations, accompanied by a reduced detention performance for green roofs and bioretention cells. Green infrastructure is typically designed to target the most frequent precipitation events and cannot stand alone as a peak flow control measure for large design events. However, these measures can give valuable contributions, and with alternative sizing, or in combination with other stormwater measures, they can give a substantial contribution to peak flow control of large design events, as also suggested by Dietz et al. [8] and Rosa et al. [66]. for high intensity events. However, with only three year of observations the worst case situations are probably not captured. For the costal climates studies these could typically be high-intensity precipitation events inside a longer low intensity event giving initial conditions comparable to the modelled initial wet situation. For the highest modelled peak intensities, and when the peak arrived late in the precipitation event (initial wet) very low peak reductions and peak delays were found. The most likely explanation for this was that the green roofs were almost totally saturated, and when the flow inn increased, the same amount of water would have to be pushed out almost immediately by the hydraulic pressure. Bioretention cells were overflowed at most tested scenarios resulting in low peak reduction and peak delays (Figure 12c). A substantial part of the precipitation (11-35%) could be infiltrated even at these large design events. Fractions overflowing the unit resulting in no pollution control were high for the largest design events (up to 69%). The highest future scenarios represent an increase in precipitation for all locations, accompanied by a reduced detention performance for green roofs and bioretention cells. Green infrastructure is typically designed to target the most frequent precipitation events and cannot stand alone as a peak flow control measure for large design events. However, these measures can give valuable contributions, and with alternative sizing, or in combination with other stormwater measures, they can give a substantial contribution to peak flow control of large design events, as also suggested by Dietz et al. [8] and Rosa et al. [66].  Detention basins can be designed to meet any given peak runoff threshold resulting in variable volume requirements. Required detention basin volumes for a runoff area of 1000 m 2 and a runoff threshold of 20 l/s·ha were almost three times as high in Oslo (29 m 3 ) and twice as high in Bergen (22 m 3 ) compared to Trondheim (11 m 3 ) for the present climate ( Figure 13a). All future scenarios increased the required detention basin volumes substantially compared to the present for Bergen, while smaller increases were found for most future scenarios in Oslo and Trondheim. The size increased by 50% for the maximum future scenario in Oslo, 100% in Trondheim and as much as 250% for the maximum future scenario in Bergen. The duration of the design event was longer for Bergen in present climate and increased even further for all future scenarios, indicating that the type of events being critical for stormwater design might change in the future from short duration events, commonly being used for design today, to longer duration events. Stricter runoff thresholds than applied here will result in longer durations of the design event and larger basin volumes. Detention basins can be designed to meet any given peak runoff threshold resulting in variable volume requirements. Required detention basin volumes for a runoff area of 1000 m 2 and a runoff threshold of 20 l/s·ha were almost three times as high in Oslo (29 m 3 ) and twice as high in Bergen (22 m 3 ) compared to Trondheim (11 m 3 ) for the present climate ( Figure 13a). All future scenarios increased the required detention basin volumes substantially compared to the present for Bergen, while smaller increases were found for most future scenarios in Oslo and Trondheim. The size increased by 50% for the maximum future scenario in Oslo, 100% in Trondheim and as much as 250% for the maximum future scenario in Bergen. The duration of the design event was longer for Bergen in present climate and increased even further for all future scenarios, indicating that the type of events being critical for stormwater design might change in the future from short duration events, commonly being used for design today, to longer duration events. Stricter runoff thresholds than applied here will result in longer durations of the design event and larger basin volumes.

LID Performance: Combined Measures
Detention basins are efficient in reducing peak runoffs at high-intensity events from single plots, while providing no stormwater treatment and little or no peak reduction at smaller more frequent events. Stormwater management based on detention basins alone require large volumes to handle a 20-year design event and combination of measures are preferable for volume and pollution control

LID Performance: Combined Measures
Detention basins are efficient in reducing peak runoffs at high-intensity events from single plots, while providing no stormwater treatment and little or no peak reduction at smaller more frequent events. Stormwater management based on detention basins alone require large volumes to handle a 20-year design event and combination of measures are preferable for volume and pollution control in addition to peak flow control.
The combined effect of different stormwater measures has been investigated for present and future design events, based on a 20-year return period 60-min duration design event, with a 10 min peak with the most unfavorable timing placed late in the precipitation event ( Figure 14). The same areas, stormwater measures and methods as applied earlier were used in this comparison. Largest required volumes were found when only using a detention basin alone. The introduction of an extensive green roof before the detention basin reduced the required volumes. A slightly higher volume reduction was found by using a bioretention cell before the detention basin. A combination of both an extensive green roof and a bioretention cell reduced the required detention basin volume substantially and, in some cases, totally removed the need for the detention basin. The same pattern in volume reduction was found when applying a design event based on the future maximum and minimum scenarios. Most tested future scenarios gave an increase in required detention basin volumes and the largest increases were found for Bergen. The individual investigation of detention basins for Bergen showed that this was the only site where rain durations larger than 60 min were dimensional, giving even larger volume needs than shown in this combined investigation. Runoff thresholds are commonly based on a predevelopment runoff pattern, and, due to the climatic conditions, this would probably be set higher in Bergen compared to the other sites, and not similar as in this exemplification. The method still illustrates the consequences of the future precipitation scenarios and the efficiency of combining stormwater measures. The investigated measures also show the potential for upgrading existing detention basins designed for the present climate, by adding green roofs on conventional rooftops and directing runoff through bioretention cells prior to the detention basins, to perform sufficiently also in a future climate. volumes and the largest increases were found for Bergen. The individual investigation of detention basins for Bergen showed that this was the only site where rain durations larger than 60 min were dimensional, giving even larger volume needs than shown in this combined investigation. Runoff thresholds are commonly based on a predevelopment runoff pattern, and, due to the climatic conditions, this would probably be set higher in Bergen compared to the other sites, and not similar as in this exemplification. The method still illustrates the consequences of the future precipitation scenarios and the efficiency of combining stormwater measures. The investigated measures also show the potential for upgrading existing detention basins designed for the present climate, by adding green roofs on conventional rooftops and directing runoff through bioretention cells prior to the detention basins, to perform sufficiently also in a future climate.

Conclusions
This study has assessed the performance of LID structures in present and a future climate for three different locations in Norway: the cities of Bergen, Oslo and Trondheim. The assessment was performed using existing regional climate projections of daily temperature and precipitation and scaling laws to construct future IDF curves, and an extensive modelling framework to investigate the full range of slow and rapid hydrological processes.
The climate projections indicated a substantial increase in average annual temperatures for all investigated future scenarios with median value increase ranging from 4.6-5.6 °C, while average daily precipitation increased most for Bergen with a median increase of 2.0 mm compared to 0.6 mm for Trondheim and 0.3 mm for Oslo.
The scaling of IDF relationships was performed on rainfall extremes extracted by the POT approach and fitted to the generalized Pareto distribution. A clear scaling relationship between daily and sub-daily extreme rainfall statistics was found, and the scaling model itself was assessed as improved from previous attempts to scale similar data sets. However, the GPD assumption did not

Conclusions
This study has assessed the performance of LID structures in present and a future climate for three different locations in Norway: the cities of Bergen, Oslo and Trondheim. The assessment was performed using existing regional climate projections of daily temperature and precipitation and scaling laws to construct future IDF curves, and an extensive modelling framework to investigate the full range of slow and rapid hydrological processes.
The climate projections indicated a substantial increase in average annual temperatures for all investigated future scenarios with median value increase ranging from 4.6-5.6 • C, while average daily precipitation increased most for Bergen with a median increase of 2.0 mm compared to 0.6 mm for Trondheim and 0.3 mm for Oslo.
The scaling of IDF relationships was performed on rainfall extremes extracted by the POT approach and fitted to the generalized Pareto distribution. A clear scaling relationship between daily and sub-daily extreme rainfall statistics was found, and the scaling model itself was assessed as improved from previous attempts to scale similar data sets. However, the GPD assumption did not hold for all the assessed extreme series. Future IDF curves constructed based on the obtained scaling relationships showed that there are large variations in projected intensity increase across locations, where projections for Bergen entail a much higher increase than Oslo and Trondheim. The implication of this is that local conditions should be considered in order to achieve optimal adaptation of stormwater management. Furthermore, climate factors were found to increase with return period, implying that an even higher safety requirement should be considered for stormwater design practices in vulnerable surroundings.
According to long-term water balance simulations, green roofs can contribute with reductions in annual stormwater runoff volumes by evapotranspiration in the range of 15-40% depending on local climate. Bioretention cells covering 5% of the catchment area with relatively low infiltration capacities in native soil (0.5 cm/h) were found to reduce stormwater runoff volumes by 54-85% and to filtrate as much as 63-88% for pollutant control. Future performance of green roofs was found to be comparable or improved due to increased evapotranspiration rates caused by increased temperatures, while future performance of bioretention cells was found to be slightly poorer due to increased precipitation amounts. The risk of green roof drought was found to increase in future scenarios, and can be reduced by increasing water storage capacities.
Detention basin volumes required to handle a 20-year event increased for almost all tested scenarios, but most for Bergen where the design events increased most in size. Green roofs and bioretention cells can contribute with peak reduction and peak delay for some of the tested 20-year return period, 1 h design events. However, for the largest and highest-intensity events or when the peak arrived on initial wet condition, little detention effect was found. Despite the poor performance alone on design events, green roofs and bioretention cells can make an important contribution to design events, reducing the required downstream detention basin volumes substantially if applied in series.
In this study, LID performance was assessed for selected climate scenarios. One major limitation of the study is the high level of uncertainty linked to climate projections. In addition, several assumptions were needed for the statistical analysis comprising the temporal downscaling, such as choice of independence criteria and threshold selection, introducing even more uncertainty. By coupling frameworks, this uncertainty was further propagated through the assessments of LID performance in water-balance and event-based models. The daily time step models used are continuous and will, therefore, capture the processes dependency of antecedent conditions, but can only be used to evaluate slow processes like evapotranspiration and infiltration. Assumptions had to be made to distribute the daily precipitation into the bioretention cell introducing uncertainties. However, future time series of higher time resolution were not available to make a more accurate model for long-term future performance. Performance during high intensity precipitation was based on synthetic design events created from IDF curves. This is a common design method for stormwater measures, but has the limitation of not including the antecedent conditions, which could be relevant for the performance of this type of LID measure. This was partly accounted for by introducing two different synthetic design events, where the peak arrived on different initial conditions.
No prediction of the future will be perfect, but assessing the uncertainty of projections is necessary to provide better decision support to practitioners. The inclusion of more scenarios, sensitivity analysis of GPD parameters, and testing the methodology on more data sets would be valuable for such an assessment. Evaluation of different stormwater measures and combinations are necessary for design of future resilient systems. In this study this was conducted with different tools, while a practitioner would prefer this to be integrated in one tool. Stormwater models have lately introduced modules for local stormwater measures, but more work is needed to improve these. Model outputs have to be tested further versus field observations to provide reliable material dependent model parameters. Long-term processes, e.g., evapotranspiration from green roofs, have to be represented more accurate in the model, and the possibility to model different combinations of stormwater measures both in series and in parallel, should be tested further.