Evaluation of Various Resolution DEMs in Flood Risk Assessment and Practical Rules for Flood Mapping in Data-Scarce Geospatial Areas: A Case Study in Thessaly, Greece

: Floods are lethal and destructive natural hazards. The Mediterranean, including Greece, has recently experienced many ﬂood events (e.g., Medicanes Zorbas and Ianos), while climate change results in more frequent and intense ﬂood events. Accurate ﬂood mapping in river areas is crucial for ﬂood risk assessment, planning mitigation measures, protecting existing infrastructure, and sustainable planning. The accuracy of results is affected by all simplifying assumptions concerning the conceptual and numerical model implemented and the quality of geospatial data used (Digital Terrain Models—DTMs). The current research investigates ﬂood modelling sensitivity against geospatial data accuracy using the following DTM resolutions in a mountainous river sub-basin of Thessaly’s Water District (Greece): (a) open 5 m and (b) 2 m data from Hellenic Cadastre (HC) and (c) 0.05 m data from an Unmanned Aerial Vehicle (UAV) topographical mission. RAS-Mapper and HEC-RAS are used for 1D (steady state) hydraulic simulation regarding a 1000-year return period. Results include ﬂood maps and cross section-speciﬁc ﬂow characteristics. They are analysed in a graphical ﬂood map-based empirical fashion, whereas a statistical analysis based on the correlation matrix and a more sophisticated Machine Learning analysis based on the interpretation of nonlinear relationships between input–output variables support and particularise the conclusions in a quantiﬁable manner.


Introduction
Floods are natural disasters that can have severe impacts on human lives, infrastructure, and the environment [1,2].Floods can occur due to various reasons, such as heavy rainfall, river overflow, coastal storm surges, or tsunamis.The response of mountain basins to intense rainfall is rapid, mainly due to large slopes, while precipitation is spatially and temporally variable [3].Mountain basin floods are often flashy [4], allowing limited time for warnings.Flash floods usually occur in mountain river catchments draining less than 1000 km 2 [5].They constitute a common, extremely dangerous natural hazard and they are responsible for many deaths [6,7].Their impacts on various socioeconomic activities are extremely diverse [8,9].Around 40% of flood-related deaths in Europe between 1950 and 2006 are linked to flash floods [10]; still, there is a lack of relevant data, especially reliable discharge estimates [4].The Mediterranean region is one of the most flood-prone areas in the world due to its unique geographic, climatic, and environmental conditions, with floods occurring on average every two years; floods are one of the most lethal and destructive natural hazards [11] there.Greece is also affected by floods that are mainly caused by heavy rainfall (e.g., flood in the city of Karditsa caused by Medicane Ianos [12]), river overflow, and flash floods (e.g., flood in Mandra with 24 fatalities caused by Medicane Numa-Zenon [13]).Greece has also experienced many flood events during the last decades [7].
One of the most effective ways to reduce the impacts of floods is to develop and implement flood management plans that include prevention, preparedness, response, and recovery measures.Flood mapping is a crucial tool for flood management planning as it enables the a priori identification of flood-prone areas, and the estimation of flood extent, flow depth and characteristics, and flood frequency [14].There are several challenges in modelling floods using informed modelling, such as data quality issues, uncertainty in input parameters, and the need for improved model calibration.Flood model parameters are of grey-box nature and their global use is not suggested but rather should be carefully adopted [13].Moreover, advanced modelling approaches supported by detailed spatial information are not always the answer.They are extremely computationally and data-greedy in order to overcome uncertainties [15].Most of the time, a compromise between simulation accuracy and time defines the simulation scheme/model used.Hence, 1D flood modelling can be implemented, especially in data-scarce areas.In such cases, open real-world datasets can improve flood modelling [12].The use of forensic hydrology, reconstructing flood events through field observations, hydrological and hydraulic modelling, and geomorphological analysis, is an established method to overcome the lack of data and provide valuable insights into past events and inform future flood risk management strategies [12].
An essential tool for successful flood risk management is accurate flood mapping.This requires the use of high resolution and accurate Digital Elevation Models (DEMs) [16].High resolution does not always guarantee DEM accuracy, especially when dense vegetation and canopy are involved and the mapping is based on orthophotos.In such cases, the accuracy also depends on the vegetation filtering techniques used.
Current research investigates flood modelling sensitivity against geospatial data accuracy, in a case study concerning a part of the mountainous Enipeas river basin of Thessaly's Water District (Greece).The methodology that is implemented in the current research is graphically presented step-by-step in Figure 1.In particular, the following DEMs for the study area concerning flood modelling (flood area) are tested: (a) open 5 m resolution DEM data (DEM_5 m) from the Hellenic Cadastre (HC; [17]), (b) open 2 m resolution DEM data (DEM_2 m) from the HC, and (c) sub-meter (0.05 m) resolution DEM data (DEM_0.05m) from a research team's own designated Unmanned Aerial Vehicle (UAV) topographical mission.The US Army Corps of Engineers' software [18] RAS-Mapper [19] and HEC-RAS 1D [20] are used for 1D (steady state) hydraulic simulations (Sims) regarding a 1000-year return period for the three different DEMs (DEM_5 m = Sim 1; DEM_2 m = Sim 2; DEM_0.05m = Sim 3).Results include 2D flood maps graphically presenting spatial flood extents and flow depths, as well as flow characteristics for every cross-section of the hydrographic network, such as Froude number, hydraulic radius, and flood extent.In the absence of an ideal terrestrial mapping mission using land-based topographical instruments of the studied hydrographic network, DEM_0.05m and the resulting hydraulic simulation (Sim 3) are assumed to be the "ground truth".Thus, the investigation focuses on the comparison of the results of the two open data-based simulations, Sim 1 and Sim 2, against the closer to the truth Sim 3 results.Graphical abstract of the methodology implemented to investigate the sensitivity of flood risk mapping via 1D hydraulic simulations vs. various DEM resolutions.Aim: conclude on study area features that render the use of more accurate but costly and time-consuming UAV mapping imperative and decide on the next best free alternative option in Greek reality [29,42].
The flood modelling results are analysed in a graphical flood map-based empirical fashion, whereas a statistical analysis, based on the correlation matrix, and a more sophisticated Machine Learning (ML) analysis, support and particularise the conclusions in a quantifiable manner; the ML-assisted analysis is based on the interpretation of the nonlinear relationships between input-output variables (i.e., DEM, sinuosity, slope vs. flow extents, flow depths, flow characteristics).The goal is to track the errors in the simulation results and trace them back to the initial features that generated them, in relation to the selected DEM; this way, one can conclude on the features that render the use of the more accurate, but costly and time-consuming, UAV mapping imperative, while deciding on the next best free alternative DEM (5 m or 2 m) option in Greek reality.
The practical aim of this paper is to produce practical rules for optimal hydraulic simulation of a river basin, in terms of minimization of in situ topographical mapping costs without compromising the hydraulic simulation accuracy.This way, one can decide on which hydrographic network sections (if any) of any river basin demand UAV or other accurate mapping or not and what the free alternative is.This requires a transparent and detailed presentation of the implemented and proposed methodology so that the conclusions can be generalized.

Materials and Methods
This section presents the study area and all stages of the implemented methodology, with their discrete steps, as presented in the graphical abstract (Figure 1), mentioning all data sources and methods along the way.Stage 1 (see Section 2.2) refers to data preprocessing, including procurement and processing/manipulation of topography, hydrology, geology, soil, land uses, and precipitation data.Stage 2 (see Section 2.3) refers to hydrological simulations, including calculation of hydrological parameters and production of hydrographs, required as input for the hydraulic simulations included in Stage 3. Stage 3 (see Section 2.4) produces 2D flood maps and cross-section-specific results that are analysed in Stage 4 (see Section 2.5) via three different approaches: (a) empirical, (b) statistical, and (c) interpretation of non-linear relationships using Machine Learning.

Study Area
The Thessaly Water District (EL08; [21]) includes two main river basins: (a) the Pinios river basin and (b) the Almyros-Pilion basin.The study area is located in the district's southern section, being part of the Pinios basin.It specifically lies in the north-west of Mount Othris, being a part of the mountainous river basin of Enipeas river (code GR00080004002203). Figure 2 presents the location of the study area, and the wider study area (where the hydrological simulations are conducted; shortly referred to as "hydro area") as a part of the Thessaly Water District and the Enipeas river basin.Figure 3 presents the specific study area where the hydraulic simulations are conducted (shortly referred to as "flood area"), as a part of the wider "hydro area".

Topographic Data (Step 1)
The geospatial data utilised in this research come from HC [17] and a private UAV mission.DEM_5 m is actually the "Digital Elevation Model-DEM-LSO (5 m)" dataset series, as presented by HC [22], which "is a 5 m pixel size grid compilation (1:5000 cadastral tile distribution), deriving from the Large Scale Orthophotos project.It is a homogenous systematic point grid which refers to terrain elevation and creates an Earth Elevation Model".RAS-Mapper [19] is used to convert DEM_5 m into DTM_5 m (research product P1a; see Figure 1); the respective map is provided as Appendix A File SM1 (see Appendix A for details).DEM_2 m is actually the "Digital Elevation Model-DEM-LSO25" dataset series as presented by HC [23], which "is a 2 m pixel size grid compilation (1:2500 cadastral tile distribution), for the entire country from airphotos taken between 2014 and 2016, deriving from the Large Scale Orthophotos 25 cm (LSO25) project.It is a homogenous systematic point grid which refers to terrain elevation and creates an Earth Elevation Model".RAS-Mapper [19] is used to convert DEM_2 m into DTM_2 m (P1b_flood for "flood area" and P1b_hydro for "hydro area"; see SM2a and SM2b, respectively).DEM_0.05m is produced by the research team's own designated UAV topographical mission.Structure-from-Motion (SfM) photogrammetry using photographs obtained by UAVs is increasingly being utilised for producing high resolution DEMs.The UAV used is WingtraOne GEN II.The flight took place on 3 March 2022 and lasted about 20 min to survey the "flood area" of approximately 1.05 km 2 .The DEMs are interpolated from point clouds that represent entire landscapes, including terrain, vegetation, and infrastructure [24].In the current research, the vegetation filtering is conducted with the standard method of Agisoft Metashape software application [25].RAS-Mapper [19] is used to convert DEM_0.05m into DTM_0.05m (P1c; see SM3).DTM_0.05 is contained within the boundaries of the UAV mapping (DEM_0.05m) where the hydraulic simulations are conducted (flood area), as presented in Figure 3.The geospatial data utilised in this research come from HC [17] and a private UAV mission.DEM_5 m is actually the "Digital Elevation Model-DEM-LSO (5 m)" dataset series, as presented by HC [22], which "is a 5 m pixel size grid compilation (1:5000 cadastral tile distribution), deriving from the Large Scale Orthophotos project.It is a homogenous systematic point grid which refers to terrain elevation and creates an Earth Elevation Model".RAS-Mapper [19] is used to convert DEM_5 m into DTM_5 m (research product P1a; see Figure 1); the respective map is provided as Supplementary Material File SM1 (see Appendix A for Supplementary Material details).DEM_2 m is actually the "Digital Elevation Model-DEM-LSO25" dataset series as presented by HC [23], which "is a 2 m pixel size grid compilation (1:2500 cadastral tile distribution), for the entire country from airphotos taken between 2014 and 2016, deriving from the Large Scale Orthophotos 25 cm (LSO25) project.It is a homogenous systematic point grid which refers to terrain elevation and creates an Earth Elevation Model".RAS-Mapper [19] is used to convert DEM_2 m into DTM_2 m (P1b_flood for "flood area" and P1b_hydro for "hydro area"; see SM2a and SM2b, respectively).DEM_0.05m is produced by the research team's own designated UAV topographical mission.Structure-from-Motion (SfM) photogrammetry using photographs obtained by UAVs is increasingly being utilised for producing high resolution DEMs.The UAV used

Hydrology-Related Data and Calculations (Step 2)
Using the software ArcGIS Pro v3.1.0[26], the "hydro" studied area is divided into sub-basins/catchments (P2a; see Figure 3 and SM4) based on DTM_2 m in conjunction with orthophotos derived from HC [23] and satellite images.The finer of the two resolutions available in the wider "hydro" study area DTM_2 m is used, as it allows for a finer representation of the low-and very low-slope areas of the study area, as well as various technical structures such as embankments.Figure 4 presents the "hydro" area, together with the hydrographic network.The latter derives from the River Basin Management Plans of the Water District of Thessaly [27], in conformation with the Water Framework Directive (2000/60/EC; [28]) and comprises the main stream sections of Enipeas hydrographic network (P2b; see SM5).
A more accurate stream centerline (part of the hydrographic network in the "flood" area) is also produced in this step (P2c and P2c_DTM; see SM6a,b), derived from the most accurate DTM_0.05 m.This will be used for the hydraulic simulation.Moreover, although not needed for the simulations, the respective stream centerlines derived from DTM_5 m (P2d and P2d_DTM; see SM7a,b) and DTM_2 m (P2e and P2e_DTM; see SM8a,b) are produced to be used for comparison and deduction of conclusions in the last section of the paper.All stream lines are produced using ArcGIS Pro.All of the geomorphological characteristics of the sub-catchments of the "hydro" area are presented in Table 1.Step 2 ultimately features use of the Giandotti methodology [29] for the calculation of the six concentration times (Table 1), one per sub-catchment (P2f; see SM9).tions available in the wider "hydro" study area DTM_2 m is used, as it allows for a finer representation of the low-and very low-slope areas of the study area, as well as various technical structures such as embankments.Figure 4 presents the "hydro" area, together with the hydrographic network.The latter derives from the River Basin Management Plans of the Water District of Thessaly [27], in conformation with the Water Framework Directive (2000/60/EC; [28]) and comprises the main stream sections of Enipeas hydrographic network (P2b; see SM5).The most reliable sources for geological/soil data and, consequently, hydrolithological data are the European Soil Data Center (ESDAC) [30][31][32][33], the Soil Map of Greece by the Greek Payment Authority of Common Agricultural Policy (OPEKEPE; [34]), and the River Basin Management Plan for the Water District of Thessaly (RBMP-EL08) [27].The main source of the soil data is OPEKEPE; the available separate soil map tiles are scanned and georeferenced on the "hydro" area using ArcGIS Pro (P3a; see SM10).The soil map does not cover the full extent of the study area.The missing data are drawn by the hydrolithological map provided by RBMP-EL08 [27].The available data from ESDAC successfully validate the other sources.The soil types of OPEKEPE are linked to the respective hydromorphy and are categorised into classes (A: high; B: moderate; C: low; D and E: very low infiltration rates), whereas the RBMP map also categorises the soil types concerning the hydrolithological characteristics.The resulting merged hydrolithological map is presented in Figure 5 (P3b; see SM11). .Hydrolithological map of the study area by merged OPEKEPE [34] and RBMP [27] data (see SM11).

Land Use Data (Step 4)
The determination of the land cover was based on CORINE land cover data [35].The land use map is presented in Figure 6 (P4; see SM12).All information regarding CΟRΙΝΕ land cover classes and the respective land cover area per sub-catchment are presented in SM14 (P6).[34] and RBMP [27] data (see SM11).

Land Use Data (Step 4)
The determination of the land cover was based on CORINE land cover data [35].The land use map is presented in Figure 6 (P4; see SM12).All information regarding CORINE land cover classes and the respective land cover area per sub-catchment are presented in SM14 (P6).

Precipitation Data Hyetograph Production (Step 5)
The only available meteorological stations in the study area (hydro) are those located at Anavra and Skopia, as presented in Figure 3.In order to be on the safe side and investigate the worst-case scenario regarding rainfall intensity, the higher-elevation Anavra station is selected; it always provides greater precipitation heights.Under these data-scarce conditions, following the methodology by Koutsoyiannis et al. [36], the Intensity Duration Frequency (IDF) curve was designed, using the proposed equation: where i is the max point rainfall intensity of duration t for a return period of T; θ and η are parameters to be estimated, with θ ≥ 0 (in time units) and 0 < η < 1; κ > 0 is the shape parameter; λ' > 0 is the scale parameter; and ψ' is the location parameter.

Precipitation Data Hyetograph Production (Step 5)
The only available meteorological stations in the study area (hydro) are those located at Anavra and Skopia, as presented in Figure 3.In order to be on the safe side and investigate the worst-case scenario regarding rainfall intensity, the higher-elevation Anavra station is selected; it always provides greater precipitation heights.Under these datascarce conditions, following the methodology by Koutsoyiannis et al. [36], the Intensity Duration Frequency (IDF) curve was designed, using the proposed equation: where i is the max point rainfall intensity of duration t for a return period of T; θ and η are parameters to be estimated, with θ ≥ 0 (in time units) and 0 < η < 1; κ > 0 is the shape parameter; λ' > 0 is the scale parameter; and ψ' is the location parameter.
The parameters proposed by the IDF Report v4 [37] (supporting documents of [27]) for the Anavra station and for a return period of 1000 years are presented in Table 2.The resulting function and IDF curve are presented in Figure 7.The parameters proposed by the IDF Report v4 [37] (supporting documents of [27]) for the Anavra station and for a return period of 1000 years are presented in Table 2.The resulting function and IDF curve are presented in Figure 7. Next, the point rainfall intensity is transformed to areal rainfall intensity using the respective Areal Reduction Factor (ARF) per sub-catchment, calculated by [36,37]: where A is the river basin area (km 2 ) and d is the rainfall duration (h).
The ARF values per sub-catchment are presented in Table 1 (sources: [38][39][40][41]).The respective design hyetographs per sub-catchment are produced (P5; see SM13) based on the IDF curve using the Alternate Block Method [42,43].Next, the point rainfall intensity is transformed to areal rainfall intensity using the respective Areal Reduction Factor (ARF) per sub-catchment, calculated by [36,37]: where A is the river basin area (km 2 ) and d is the rainfall duration (h).

Hydrological Simulation (Methodology Stage 2)
All current hydrological simulations are conducted using the Hydrologic Engineering Center-Hydrologic Modelling System (HEC-HMS), developed by the US Army Corps of Engineers [18].HEC-HMS is a widely used and established tool that can simulate all hydrological processes of a watershed, including precipitation, infiltration, evaporation, snowmelt, and runoff.It is used for the design and management of water resource infrastructure, such as dams, reservoirs, and water supply systems.The transformation of precipitation into runoff for every sub-catchment was conducted using the Soil Conservation Service-Curve Number (SCS-CN) [44] unit hydrograph method, whereas losses are estimated using the SCS-CN model.

Curve Numbers (Step 6)
SCS-CN is a simple, widely used, and efficient method for determining the amount of runoff from rainfall, even in a particular area.A Curve Number (CN) [42] expresses the percentage of precipitation that will runoff as a function of the area's hydrologic soil group, land use, treatment, and hydrologic condition.Based on Chow et al. (1988;[42]), Koutsoyiannis and Xanthopoulos (1999;[45]) provided updated CN values, used in current research.There are various land uses in every sub-catchment, hence the weighted Curve Number (composite) value CNc is calculated [46].The estimation process is presented in SM14, whereas the calculated CNc values are presented in Table 1.

Lag Time Estimation (Step 7)
Following the SCS methodology [44] and the HEC-HMS Technical Reference manual [47], the lag times for each sub-catchment are estimated (Table 1).Lag time refers to the delay between the occurrence of rainfall and the peak discharge of a river.It depends on size and shape of the catchment, soil type, and vegetation cover.

Hydrological Simulation (Methodology Stage 2)
All current hydrological simulations are conducted using the Hydrologic Engineering Center-Hydrologic Modelling System (HEC-HMS), developed by the US Army Corps of Engineers [18].HEC-HMS is a widely used and established tool that can simulate all hydrological processes of a watershed, including precipitation, infiltration, evaporation, snowmelt, and runoff.It is used for the design and management of water resource infrastructure, such as dams, reservoirs, and water supply systems.The transformation of precipitation into runoff for every sub-catchment was conducted using the Soil Conservation Service-Curve Number (SCS-CN) [44] unit hydrograph method, whereas losses are estimated using the SCS-CN model.

Curve Numbers (Step 6)
SCS-CN is a simple, widely used, and efficient method for determining the amount of runoff from rainfall, even in a particular area.A Curve Number (CN) [42] expresses the percentage of precipitation that will runoff as a function of the area's hydrologic soil group, land use, treatment, and hydrologic condition.Based on Chow et al. (1988;[42]), Koutsoyiannis and Xanthopoulos (1999;[45]) provided updated CN values, used in current research.There are various land uses in every sub-catchment, hence the weighted Curve Number (composite) value CNc is calculated [46].The estimation process is presented in SM14, whereas the calculated CNc values are presented in Table 1.

Lag Time Estimation (Step 7)
Following the SCS methodology [44] and the HEC-HMS Technical Reference manual [47], the lag times for each sub-catchment are estimated (Table 1).Lag time refers to the delay between the occurrence of rainfall and the peak discharge of a river.It depends on size and shape of the catchment, soil type, and vegetation cover.

Hydrograph Production (Step 8)
Following the SCS methodology [44], hydrographs are produced (P8; see SM15) using HEC-HMS, applying the well-established Muskingum river routing method to all reaches.Parameters such as reach length and slope are estimated using topographical data, namely DTM_2 m.The nine hydrographs refer to the three junctions and six sub-catchments created by the studied Enipeas basin (hydro area) model, as presented in Figure 8.All current hydraulic simulations are conducted using the Hydrologic Engineering Center-River Analysis System (HEC-RAS 1D) developed by the US Army Corps of Engineers [48].It is a powerful tool for hydraulic modelling, based on a 1D numerical system, able to simulate steady/unsteady flow conditions.HEC-RAS can be used for a variety of hydraulic analyses, including floodplain mapping, bridge and culvert design, dam safety evaluations, sediment transport studies, and water quality modelling.The model solves equations for conservation of mass and momentum to calculate water surface elevation, flow velocity, and other hydraulic characteristics.It exhibits a user-friendly interface and features tools such as RAS-Mapper [19], supporting a range of data input formats, including DEMs, topographic maps, and surveyed cross-sections and post-processing utilities for displaying and analysing simulation results.
Although there is only one shared hydrological simulation applied in the wider "hydro" area, there are actually three hydraulic simulations, namely 1D implementations of the HEC-RAS 1D model, corresponding to the three input DTMs applied in the "flood" area: (a) Sim 1 uses DTM_5 m, (b) Sim 2 uses DTM_2 m, and (c) Sim 3 uses DTM_0.05 m.All three Sims share the river centerlines (Step 2; products P2c, P2d, P2e), as well as the  All current hydraulic simulations are conducted using the Hydrologic Engineering Center-River Analysis System (HEC-RAS 1D) developed by the US Army Corps of Engineers [48].It is a powerful tool for hydraulic modelling, based on a 1D numerical system, able to simulate steady/unsteady flow conditions.HEC-RAS can be used for a variety of hydraulic analyses, including floodplain mapping, bridge and culvert design, dam safety evaluations, sediment transport studies, and water quality modelling.The model solves equations for conservation of mass and momentum to calculate water surface elevation, flow velocity, and other hydraulic characteristics.It exhibits a user-friendly interface and features tools such as RAS-Mapper [19], supporting a range of data input formats, including DEMs, topographic maps, and surveyed cross-sections and post-processing utilities for displaying and analysing simulation results.
Although there is only one shared hydrological simulation applied in the wider "hydro" area, there are actually three hydraulic simulations, namely 1D implementations of the HEC-RAS 1D model, corresponding to the three input DTMs applied in the "flood" area:  [42]) and the HEC-RAS hydraulic reference manual [48].The values were adjusted, as [42] suggests, based on river irregularities, variation in channel cross-section, obstructions, vegetation, and meandering.The contraction and expansion coefficients for all Sims are assumed to be equal to 0.1 and 0.3, respectively [48].The maximum flow (sic.)values in the respective HEC-HMS-produced hydrographs of junctions J2 (539.3 m 3 /s; see SM15, page 1) and J3 (556.1 m 3 /s; see SM15, page 3) are used as steady flow discharges in all hydraulic simulations (Figure 8).Concerning upstream and downstream boundary conditions, the energy grade line slope values are assumed to be equal to the riverbed slope values [48].
Step 9 in Stage 2 of the applied methodology features Sim 1, namely HEC-RAS 1D simulation of the hydraulic model based on DTM_5 m, producing the respective W_5 m flood extent map (P9a; see SM16), the respective y_5 m flow depths map (P9b; see SM17), and calculating the flow characteristics.The fc_5 m flow characteristics selected to be investigated for their involvement in the propagation of errors generated by the lower resolution DEMs are presented per cross-section in SM18, including meter mark, hydraulic radius, Froude number, and flood extent.
Step 10 in Stage 2 of the applied methodology features Sim 2, namely HEC-RAS 1D simulation of the hydraulic model based on DTM_2 m, producing the respective W_2 m flood extent map (P10a; see SM19) and the respective y_2 m flow depths map (P10b; see SM20), and calculating the fc_2 m flow characteristics (P10c; see SM18).
Step 11 in Stage 2 of the applied methodology features Sim 3, namely HEC-RAS 1D simulation of the hydraulic model based on DTM_0.05 m produces the respective W_0.05 m flood extent map (P11a; see SM21) and the y_0.05 m flow depths map (P11b; see SM22), and calculating the fc_0.05m flow characteristics (P11c; see SM18).The production of conclusions concerning the free DEMs and the UAV-produced DEM in relation to the hydraulic simulation results requires an initial conventional comparative analysis of the results of the three simulations (Sim 1-3).This demands the post-processing of the results, including graphical representation in the form of comparative maps (i.e., flood extend maps, flow depths maps, and maps with centerlines produced from the various DTMs), for the empirical but informed conclusion production.These are presented and discussed in the "Results and discussion" section (Section 3.1).
These initial conclusions must be supported by a more quantitative statistical analysis of cross-section-specific variables and results.Specifically, the correlation coefficient (Pearson product-moment correlation coefficient), a measure of linear association between two variables, is calculated between all involved input data and variables (Sim/DTM-specific or common for all Sims/DTMs) and results (flood extents, flow depths, flow characteristics).The calculated correlation matrix can be used to support the empirical conclusions of Step 12, but also help explore other patterns, namely the factors that magnify and propagate errors originating from lower resolution DEMs in various sections.
As the investigation process is actually a root cause analysis concerning the impact of the various DEM resolutions on the hydraulic simulation, a more sophisticated method to interpret the nonlinear dependencies between variables and results is needed.A welldocumented methodology from Machine Learning, Random Forest (RF) importance [49], is utilised.An interval (t_lower, t_upper) is defined, where the residuals are reasonably small, and RF is fitted on the residuals that are larger than t_upper and lower than t_lower, observing the mean node impurity of the forest as a feature importance proxy.The key insight here is to include a gaussian noise "dummy" variable, uncorrelated with the target variable, as an additional feature, which is known to have no impact on the target output.By using the importance of this variable as a baseline, conclusions can be drawn on the importance of the other variables that have larger importance compared to the "dummy" variable, also including the error bars.This implementation should overcome the theoretical weakness of the statistically limited dataset, due to the study of a geospatially data-scarce area.

Comparative Analysis Based on Produced Maps (Step 12)
The production of empirical conclusions is mainly based on the comparative analysis of flood extent and flow depth maps.Figure 9 presents the flood extents for all three Sims projected on the same map (see SM24).It includes the five identified river sections exhibiting different sinuosity ratios (SR) in order to investigate the possible correlation between SR and simulation errors in lower resolution Sims 1 and 2. For a finer, more detailed comparison of the flood extent maps' differences, comparative maps for all combinations are available as Appendix A (SM25: W_5 m vs. W_0.05m; SM26: W_2 m vs. W_0.05m; SM27: W_5 m vs. W_2 m).The red line, representing the UAV-produced flood extents of Sim 3 (W_0.05m) is assumed to be the "ground truth", namely the closest to the truth available flood extents (inundated area = 0.7 km 2 ).Although Sim 1 and Sim 2 flood extents (W_5 m and W_2 m, respectively) constitute reliable simulations and provide satisfactory approximations, there are many errors, as presented in Tables 3 and 4. Inundation areas produced by Sim 1 and Sim 2 are 0.836 km 2 and 0.896 km 2 , respectively.Most of the errors are overestimations rather than underestimations (Table 3), at least being on the safe side.As presented in Table 3, the overall flood extent error of free DEM-based Sims (1 and 2) vs. the "ground truth" (Sim 3), expressed as the total area (km 2 ) of overestimation and underestimation, is 70% larger for Sim 2 (0.242 km 2 or 27.01% of the respective inundation area) compared to Sim 1 (0.242 km 2 or 17.22% of the respective inundation area).This is rather counter-intuitive: W_5 m is closer to the "ground truth", whereas W_2 m exhibits extended overestimations (0.219 km 2 ).Moreover, the most alarming result is that, while Sim 1's error is 99.31% overestimations, and only 0.69% underestimations (area-wise), Sim 2 exhibits 0.023 km 2 underestimations, approximately 10% of all its errors (Table 3).Examples of serious underestimations are indicated on Figure 9, e.g., in river section 5 (SR = 2.2).These underestimation errors are possibly risky, as they are not on the safe side.Overall, it is observed that there seem to be more errors in the sinuous and especially meandering sections of the river.Particular consideration should be given in areas where tributary streams converge to the main river (see Figure 9); these areas can be mistakenly considered as of greater than the actual importance flood-wise, due to their simulated extensive flooding.
Figure 10 presents the flow depth maps for all Sims side-by-side, whereas Figure 11 presents the flow depth differences between Sim 1 and Sim 3, and Sim 2 and Sim 3, respectively.Whereas Sim 2's range of flow depths (0.002 m-11.474m) generally matches the respective range of Sim 3 (0.002 m-11.397m; Table 4), a closer inspection reveals an inconsistency in their spatial distribution (Figure 10).With that in mind, Sim 1, though generally underestimating the flow depths, is in principle closer to Sim 3, and hydraulically more accurate.This is apparent in Figure 10, as the higher flow depths are positioned on the river centerline in Sim 1 (just like Sim 3).This is not the case for Sim 2, where the flow seems inconsistent and does not follow the real river centerline.Sim 1's flow depth errors (compared to Sim 3 "ground truth") range from −9.89 m to +5.85 m, whereas Sim 2's errors range from −7.47 m to +11.47 m.Sim 1 generally tends to underestimate flow depths up to 33% more than Sim 2, compared to the "ground truth", whereas Sim 2 tends to overestimate them up to 96% more than Sim 1. Considering the spatial distribution of flow depth errors, Sim 1 overestimates flow depths in a smaller area than Sim 2, while underestimating them in a larger area (Figure 11).
In search of the root of the errors in flood extents and flow depths, the meandering section of the river is selected to be scrutinised, as it is observed to exhibit extreme differences.Figure 12a presents the elevation differences between DTM_5 m and DTM_0.05,whereas Figure 12c presents the resulting flood extents of Sim 1 and Sim 3 (W_5 m and W_0.05 m) together with the flow depth differences between Sim 1 (y_5 m) and Sim 3 (y_0.05m).In a similar fashion, Figure 12b presents DTM_5 m vs. DTM_0.05,whereas Figure 12d presents the resulting W_5 m and W_0.05 m, together with y_5 m vs. y_0.05m. Figure 13 presents the locations of the two selected cross-sections, featured in detail in Figure 14.section of the river is selected to be scrutinised, as it is observed to exhibit extreme differences.Figure 12a presents the elevation differences between DTM_5 m and DTM_0.05,whereas Figure 12c presents the resulting flood extents of Sim 1 and Sim 3 (W_5 m and W_0.05 m) together with the flow depth differences between Sim 1 (y_5 m) and Sim 3 (y_0.05m).In a similar fashion, Figure 12b presents DTM_5 m vs. DTM_0.05,whereas Figure 12d presents the resulting W_5 m and W_0.05 m, together with y_5 m vs. y_0.05m. Figure 13 presents the locations of the two selected cross-sections, featured in detail in Figure 14.It is apparent that DTM_5 m exhibits elevation underestimations in most of the surveyed area (Figure 12a).The areas where it overestimates elevation are limited and marginally inside the banks, never on the centerline, hence managing to capture the geometry of the cross-sections (Figure 14).On the other hand, DTM_2 m generally underestimates   It is apparent that DTM_5 m exhibits elevation underestimations in most of the surveyed area (Figure 12a).The areas where it overestimates elevation are limited and marginally inside the banks, never on the centerline, hence managing to capture the geometry of the cross-sections (Figure 14).On the other hand, DTM_2 m generally underestimates elevation and overestimates near and inside the banks, even on the centerline, hence being unable to capture the true geometry of the cross sections (Figure 14).This practically results in the altering of the river pathways, as explicitly delineated in Figure 15, that simultaneously presents the river centerlines, as automatically produced based on DTM_5 m, DTM_2 m, and DTM_0.05m, respectively.Whereas the DTM_5 m-derived centerline is a good approximation of the "ground truth" DTM_0.05m, the DTM_2 m-derived centerline exhibits serious deviations, especially along the meandering sections.These errors in the initial topography data propagate and are the root causes of the errors in flood extent and flow depth results (Figure 12c,d).This is also obvious in Figure 14, where the water surface elevations (flow depths; y) vary significantly.A closer look reveals a strong connection of the errors in DTMs with the mapping of dense vegetation and canopy, especially in DTM_2 m, despite the fact that its resolution is in principle higher than DTM_5 m.

Statistical Analysis Based on Correlation Matrix (Step 13)
The correlation matrix presenting correlation coefficient values between all involved input data and variables (Sim/DTM-specific or common for all Sims/DTMs) and results (flood extents, flow depths, flow characteristics) are presented in Figure 16.The values of interest are highlighted and discussed.
There is an almost linear relationship between riverbed elevations of all DTMs (in relation to the real centerline derived by DTM_0.05 m) and the cross-section ID (consecutive) numbers.The strong linearities are explained by the fact that rivers flow downhill and riverbeds exhibit a positive slope in the vast majority of their length.Hence, the larger the ID number of a cross-section, the lower the respective elevation.Although this is expected, the correlation coefficient values (Cc) are (stronger to weaker correlations) Cc_0.05 m = −0.99,Cc_5 m = −0.98,and Cc_2 m = −0.95.The small variations support the previous findings, indicating better riverbed elevation approximation (compared to the "ground truth" DTM_0.05m) by DTM_5 m, rather than DTM_2 m.
Another interesting correlation is the relationship between riverbed elevations (Z) and the respective cross-section-specific flood extents (top widths; W).Again, the "ground truth" correlation between Z_0.05 m and W_0.05 m is the highest in value (Cc_0.05m = +0.71),followed by Cc_5 m = 0.6 and finally Cc_2 m = 0.55.The absolute value of any of the aforementioned Cc is rather random.A finding worth mentioning is the variation in Cc that supports the claim that the closest fit between DTM_5 m and DTM_0.05m, compared to DTM_2 m, also results in a closest fit between W_5 m and W_0.05 m, compared to W_2 m.This pattern continues in the relationship between the hydraulic radius values (R), as well as Froude number (Fr), and the respective W for each Sim.Sim 1 results are closer to Sim 3 compared to Sim 2. Specifically, the variation of the impact of R on W concerning Sim 2 is extreme: while Cc_0.05 m = −0.

Statistical Analysis Based on Correlation Matrix (Step 13)
The correlation matrix presenting correlation coefficient values between all involved input data and variables (Sim/DTM-specific or common for all Sims/DTMs) and results (flood extents, flow depths, flow characteristics) are presented in Figure 16.The values of interest are highlighted and discussed.
There is an almost linear relationship between riverbed elevations of all DTMs (in relation to the real centerline derived by DTM_0.05 m) and the cross-section ID (consecutive) numbers.The strong linearities are explained by the fact that rivers flow downhill and riverbeds exhibit a positive slope in the vast majority of their length.Hence, the larger the ID number of a cross-section, the lower the respective elevation.Although this is expected, the correlation coefficient values (Cc) are (stronger to weaker correlations) Cc_0.05 m = −0.99,Cc_5 m = −0.98,and Cc_2 m = −0.95.The small variations support the previous findings, indicating better riverbed elevation approximation (compared to the "ground truth" DTM_0.05m) by DTM_5 m, rather than DTM_2 m.
Another interesting correlation is the relationship between riverbed elevations (Z) and the respective cross-section-specific flood extents (top widths; W).Again, the "ground truth" correlation between Z_0.05 m and W_0.05 m is the highest in value (Cc_0.05m = +0.71),followed by Cc_5 m = 0.6 and finally Cc_2 m = 0.55.The absolute value of any of the aforementioned Cc is rather random.A finding worth mentioning is the variation in Cc that supports the claim that the closest fit between DTM_5 m and DTM_0.05m, compared to DTM_2 m, also results in a closest fit between W_5 m and W_0.05 m, compared to W_2 m.This pattern continues in the relationship between the hydraulic radius values (R), as well as Froude number (Fr), and the respective W for each Sim.Sim 1 results are closer to Sim 3 compared to Sim 2. Specifically, the variation of the impact of R on W concerning Sim 2 is extreme: while Cc_0.05 m = −0.

Statistical Analysis Based on Correlation Matrix (Step 13)
The histograms of Figure 17 present the distribution of errors of Sim 1 and Sim 2 flood extent errors, compared to Sim 3. The distributions indicate larger errors for Sim 2 compared to Sim 1, and are skewed, also indicating nonlinearities in the error generation and propagation.This is why the method of feature importance calculation using Mean Decrease in Impurity (MDI) is implemented with Random Forest.After a series of tests, the interval (t_lower, t_upper) is empirically selected as (−10, +1), so that the remaining negative and positive value sets of the distribution are split equally.

Machine Learning for Interpretation of Nonlinear Relationships (Random Forests)
The histograms of Figure 17 present the distribution of errors of Sim 1 and Sim 2 flood extent errors, compared to Sim 3. The distributions indicate larger errors for Sim 2 compared to Sim 1, and are skewed, also indicating nonlinearities in the error generation and propagation.This is why the method of feature importance calculation using Mean Decrease in Impurity (MDI) is implemented with Random Forest.After a series of tests, the interval (t_lower, t_upper) is empirically selected as (−10, +1), so that the remaining negative and positive value sets of the distribution are split equally.Following the already presented methodology, using a random noise feature as the "dummy" variable as a means of comparison, the feature importance of the following variables and variable errors regarding their impact on flood extent error ΔW are tested: sinuosity (SR; fixed, see Figure 9), rolling sinuosity (SR/30 sections), error of riverbed elevations (ΔZ), error of hydraulic radii (ΔR), and error of Froude number values (ΔFr).As far as Sim 1 flood extent errors are concerned, SR, ΔR, and ΔFr seem to be equally important.This can be interpreted as follows: the DEM_5 m intrinsic errors propagate up to the flood extent results in the sections of increased sinuosity, driven especially by the resulting errors in the hydraulic radius and Froude number values calculation.On the other hand, Sim 2 flood extent errors' origin and root cause are different.The feature that stands out is the hydraulic radii of the cross-sections; their importance is far higher than the respective features related to Sim 1 errors.These results fully support the earlier conclusions drawn by the flood maps-based comparative analysis (Step 12) and the correlation matrix-based statistical analysis (Step 13).The hydraulic radii per cross-section are distorted in the main channel in Sim 2, due to the DEM_2 m production process failing to filter dense vegetation.Following the already presented methodology, using a random noise feature as the "dummy" variable as a means of comparison, the feature importance of the following variables and variable errors regarding their impact on flood extent error ∆W are tested: sinuosity (SR; fixed, see Figure 9), rolling sinuosity (SR/30 sections), error of riverbed elevations (∆Z), error of hydraulic radii (∆R), and error of Froude number values (∆Fr).Figure 18 presents the MDI for the aforementioned features.Specifically, Figure 18a  As far as Sim 1 flood extent errors are concerned, SR, ∆R, and ∆Fr seem to be equally important.This can be interpreted as follows: the DEM_5 m intrinsic errors propagate up to the flood extent results in the sections of increased sinuosity, driven especially by the resulting errors in the hydraulic radius and Froude number values calculation.On the other hand, Sim 2 flood extent errors' origin and root cause are different.The feature that stands out is the hydraulic radii of the cross-sections; their importance is far higher than the respective features related to Sim 1 errors.These results fully support the earlier conclusions drawn by the flood maps-based comparative analysis (Step 12) and the correlation matrixbased statistical analysis (Step 13).The hydraulic radii per cross-section are distorted in the main channel in Sim 2, due to the DEM_2 m production process failing to filter dense vegetation.

Conclusions
This paper investigates flood modelling sensitivity against two sets of open access geospatial elevation data (5 m and 2 m resolution, respectively), derived from the Hellenic Cadastre, and an own designated Unmanned Aerial Vehicle topographical mission (0.05 m resolution).A case study is used concerning a part of the mountainous Enipeas river basin of Thessaly's Water District (Greece).
The first step of the proposed methodology includes a flood maps-based comparative analysis so that experts can empirically draw conclusions on the specific studied river catchment.In the current case study, most of the flood extent errors are overestimations rather than underestimations, at least being on the safe side.Though counter-intuitive, the DEM_5 m-derived (Sim 1) flood extents are closer to the "ground truth", whereas DEM_2 m-derived (Sim 2) extents are extensively overestimated, while also exhibiting relatively alarmingly high underestimations, which are not on the safe side and can have potentially catastrophic implications if used for design purposes.The sections of increased sinuosity ratio, especially the meandering river sections, seem more prone to flood modelling errors.The same applies for junctions of the main channel with modelled, or not, tributary streams.Concerning flow depth results, Sim 1 generally underestimates them and is, in principle, closer to DEM_0.05 m-derived Sim 3, and hydraulically more accurate.The reason is that, although Sim 2 range of flow depths is generally correct, their spatial distribution is inconsistent, as is the flow that does not follow the real river centerline.

Conclusions
This paper investigates flood modelling sensitivity against two sets of open access geospatial elevation data (5 m and 2 m resolution, respectively), derived from the Hellenic Cadastre, and an own designated Unmanned Aerial Vehicle topographical mission (0.05 m resolution).A case study is used concerning a part of the mountainous Enipeas river basin of Thessaly's Water District (Greece).
The first step of the proposed methodology includes a flood maps-based comparative analysis so that experts can empirically draw conclusions on the specific studied river catchment.In the current case study, most of the flood extent errors are overestimations rather than underestimations, at least being on the safe side.Though counter-intuitive, the DEM_5 m-derived (Sim 1) flood extents are closer to the "ground truth", whereas DEM_2 m-derived (Sim 2) extents are extensively overestimated, while also exhibiting relatively alarmingly high underestimations, which are not on the safe side and can have potentially catastrophic implications if used for design purposes.The sections of increased sinuosity ratio, especially the meandering river sections, seem more prone to flood modelling errors.The same applies for junctions of the main channel with modelled, or not, tributary streams.Concerning flow depth results, Sim 1 generally underestimates them and is, in principle, closer to DEM_0.05 m-derived Sim 3, and hydraulically more accurate.The reason is that, although Sim 2 range of flow depths is generally correct, their spatial distribution is inconsistent, as is the flow that does not follow the real river centerline.
The root of the errors concerning flood extents and flow depths lies in the topography data used.DEM_5 m mostly underestimates elevation, but manages to capture the geometry of the cross-sections, whereas DEM_2 m generally underestimates, but overestimates inside the critical zone of the main channel, near and inside the banks, even on the centerline; it is unable to capture the true geometry of the cross sections, practically altering the river pathways.This is more intense in the meandering river sections.The map-based analysis indicates that the true cause is the inability of DEM_2 m to capture the elevation of the ground in areas of dense vegetation and canopy, usually being wider in the meandering river sections.In absence of detailed information concerning the surveying process followed by the Hellenic Cadastre, and specifically the classification and filtering of the vegetation (tree removal), one can only speculate on the true source of this error.On the other hand, DEM_5 m, though constituting a lower resolution product, seems more suitable for approximating the real terrain.
The flood modelling results are also analysed via a statistical analysis, based on the correlation matrix presenting linear relationships between input data variables (i.e., elevation, slope, sinuosity ratio) and cross section-specific results, including flow characteristics (i.e., Froude number, hydraulic radius), flood extents, and flow depths.The correlation results indicate strong linearities where expected (riverbed elevations vs. cross-section ID numbers), and weaker where expected (i.e., riverbed elevations and hydraulic radii and Froude number vs. flood extents).Nevertheless, the important finding the statistical analysis has to offer is the quantifiable proof of the superiority of DEM_5 m-derived Sim 1 compared to the DEM_2 m-derived Sim 2 results, which supports the preceding empirical analysis conclusions.This is suggested by the fact that correlations of the analysed variables and flood extent results constantly follow the classification (stronger-to-weaker) Sim 3 > Sim 1 > Sim 2. The simple comparison between the correlation of the cross section-specific flood extents of Sim 1-Sim 3 and Sim 2-Sim 3 also supports the argument.
As the conventional approaches fail to identify the nonlinear dependencies of the root cause analysis and error propagation tracking side of the research problem, the proposed methodology finally implements a more sophisticated Machine Learning (ML) analysis, specifically Random Forest importance.The ML approach results further support and solidify the earlier conclusions drawn by the flood maps-based comparative analysis and the correlation matrix-based statistical analysis.The failure of the DEM_2 m production process to map the terrain in areas of dense vegetation and wide canopy leads to unreal cross-section geometries and inserts critical errors in the respective hydraulic radii, really important at least in 1D hydraulic analyses.These errors further propagate to the flood extent results, as the RF importance approach robustly indicates.
As far as the general proposed methodology is concerned, for deciding the best available alternative DEM of an accurate but costly UAV-based or in situ ground surveybased DEM, no step is redundant.The flood map-based comparative analysis by experts is the main and key evaluation tool and cannot be replaced by a statistical or even a more sophisticated Machine Learning-based analysis.Machine Learning methods can interpret nonlinear dependencies but depend on the way they are implemented and are susceptible to parameter errors.Nevertheless, they provide further insight on the root and cause of the error and the propagation mechanism, while identifying additional error patterns.The proposed stages and steps should be implemented as an integrated methodology.
The conclusions of the current paper and related research can be summed up as steps of a suggested procedure for the optimal hydraulic simulation of a river basin, in terms of minimisation of in situ topographical mapping costs without compromising the hydraulic simulation accuracy: 1.
Approximate the real river centerline, as accurately as possible, utilising any available source and technique possible.A realistic approach would be the use of the most recent and high resolution open-source DEM available, in order to automatically produce an approximate river centerline, calibrated by recent satellite imagery (e.g., google earth) and orthophotos (e.g., Hellenic Cadastre in Greek reality), supported by in situ inspection if possible or necessary.

2.
In river areas exhibiting one or more of the following characteristics: (a) dense vegetation or/and wide canopy, (b) meandering sections, (c) junctions of the main channel with tributary streams, or/and generally (d) centerline-bank lines derived from the available DEM diverging from the real centerline-bank lines, proceed to UAV-based mapping (combined with an a-posteriori vegetation removal process) or in situ topographical ground surveying.

3.
In areas not belonging to the previous category, if more than one open-source DEM is available, the implemented and proposed methodology of this paper should be followed in order to select the one that inserts and propagates less errors in the hydraulic simulations.In Greek reality, just use the DEM_5 m by Hellenic Cadastre, as it is proven to be more efficient for, at least 1D, hydraulic simulations, rather than the higher resolution DEM_2 m. 4.
Future research should include more case studies of different terrain characteristics, such as sinuosity, lush and weak vegetation areas, etc., combined with real in situ topographical ground surveying and validated flood extents and flow depth measurements.The larger datasets will also provide further credibility to the Machine Learning-based analysis.In the proposed methodology, 2D hydraulic simulations (e.g., using HEC-RAS 2D) should be tested to check their sensitivity vs. the various spatial resolution DEMs.Finally, more sophisticated ML techniques, such as Gradient Boost Algorithms, Fuzzy Cognitive Maps, and Self Organizing Maps, can be used for root cause analysis.

SM10
P3a SM10-(P3a) soil data map A soil data map presenting the available soil data concerning the "hydro area", derived from [34].

Figure 1 .
Figure 1.Graphical abstract of the methodology implemented to investigate the sensitivity of flood risk mapping via 1D hydraulic simulations vs. various DEM resolutions.Aim: conclude on study area features that render the use of more accurate but costly and time-consuming UAV mapping imperative and decide on the next best free alternative option in Greek reality.

Figure 1 .
Figure 1.Graphical abstract of the methodology implemented to investigate the sensitivity of flood risk mapping via 1D hydraulic simulations vs. various DEM resolutions.Aim: conclude on study area features that render the use of more accurate but costly and time-consuming UAV mapping imperative and decide on the next best free alternative option in Greek reality[29,42].

Figure 2 .
Figure 2. (a) Map of Greece, (b) Thessaly Water District, and (c) Enipeas river basin and wider studied catchment (hydro area) with the studied hydrographic network.Figure 2. (a) Map of Greece, (b) Thessaly Water District, and (c) Enipeas river basin and wider studied catchment (hydro area) with the studied hydrographic network.

Figure 2 .
Figure 2. (a) Map of Greece, (b) Thessaly Water District, and (c) Enipeas river basin and wider studied catchment (hydro area) with the studied hydrographic network.Figure 2. (a) Map of Greece, (b) Thessaly Water District, and (c) Enipeas river basin and wider studied catchment (hydro area) with the studied hydrographic network.

Figure 3 .
Figure 3. Study area of Enipeas river basin where the hydraulic simulations are conducted (flood area) as a part of the wider study area where the hydrological simulations are conducted (hydro area), together with the hydrographic network, the sub-catchments, and available meteorological stations.2.2.Data Pre-Processing (Methodology Stage 1) 2.2.1.Topographic Data (Step 1)

Figure 3 .
Figure 3. Study area of Enipeas river basin where the hydraulic simulations are conducted (flood area) as a part of the wider study area where the hydrological simulations are conducted (hydro area), together with the hydrographic network, the sub-catchments, and available meteorological stations.

Figure 4 .
Figure 4. Digital Terrain Model of 2 m × 2 m resolution (DTM_2 m) of the study area ("hydro") used for the delineation of the sub-catchments and hydrographic network (see SM2a and SM5).

Figure 4 .
Figure 4. Digital Terrain Model of 2 m × 2 m resolution (DTM_2 m) of the study area ("hydro") used for the delineation of the sub-catchments and hydrographic network (see SM2a and SM5).

Figure 7 .
Figure 7.The IDF function and curve (point rainfall intensity vs. time) for the Anavra station for a return period of 1000 years.

Figure 7 .
Figure 7.The IDF function and curve (point rainfall intensity vs. time) for the Anavra station for a return period of 1000 years.

2. 3 . 3 .
Hydrograph Production (Step 8) Following the SCS methodology [44], hydrographs are produced (P8; see SM15) using HEC-HMS, applying the well-established Muskingum river routing method to all reaches.Parameters such as reach length and slope are estimated using topographical data, namely DTM_2 m.The nine hydrographs refer to the three junctions and six subcatchments created by the studied Enipeas basin (hydro area) model, as presented in Figure 8.

Figure 8 .
Figure 8. HEC-HMS hydrological model of the studied (hydro) area, featuring its six sub-catchments and three junctions, with detail of the calculated design discharges used in HEC-RAS.

2. 4 .
Hydraulic Simulation (Methodology Stage 3; Steps 9, 10, 11) (a) Sim 1 uses DTM_5 m, (b) Sim 2 uses DTM_2 m, and (c) Sim 3 uses DTM_0.05 m.All three Sims share the river centerlines (Step 2; products P2c, P2d, P2e), as well as the bank lines, produced by DTM_0.05 m, which is assumed to be the "ground truth".Sims 1-3 also share 270 cross-sections positioned in an interval of approximately 20 m along the river centerline (see SM23).For all Sims, two Manning coefficient values are used, one for the main channel (n = 0.08) and a different one for the overbanks (n = 0.07).The values were decided after observations of satellite images and orthophotos and in situ inspection, based on Chow et al. (1988;

2. 5 .
Post-Processing and Analysis (Methodology Stage 4) Stage 4 features the post-processing of the results of the three hydraulic simulations (Sim 1-3) via (a) empirical (Step 12) and (b) statistical (Step 13) analysis as well as (c) use of Machine Learning for the interpretation of non-linear relationships of variables vs. results (Step 14).

Figure 13 .
Figure 13.Exact locations of two selected cross-sections (172 and 196; see SM23) in the meandering part of the river in the "flood area", featured in detail in Figure 14.

Figure 14 .
Figure 14.The geometry of the three versions of (a) cross-section 172 and (b) cross-section 196; see SM23) as derived from DTM_5 m (blue), DTM_2 m (green), and DTM_0.05m (red).DTM_5 m captures the general geometry, whereas DTM_2 m fails to delineate the main channel.

Figure 14 .
Figure 14.The geometry of the three versions of (a) cross-section 172 and (b) cross-section 196; (see SM23) as derived from DTM_5 m (blue), DTM_2 m (green), and DTM_0.05m (red).DTM_5 m captures the general geometry, whereas DTM_2 m fails to delineate the main channel.

Hydrology 2023 , 29 Figure 15 .
Figure 15.River centerlines derived from the 3 DTMs projected on the same map (see SM30) for comparison purposes.All simulations are based on "ground truth" centerline (by DTM_0.05m).

Figure 15 .
Figure 15.River centerlines derived from the 3 DTMs projected on the same map (see SM30) for comparison purposes.All simulations are based on "ground truth" centerline (by DTM_0.05m).

29 Figure 16 .
Figure 16.Correlation matrix presenting correlation coefficient values between all involved input data and variables (Sim/DTM-specific or common for all Sims/DTMs) and results (flood extents, flow depths, flow characteristics).Values of interest are highlighted.

Figure 16 .
Figure 16.Correlation matrix presenting correlation coefficient values between all involved input data and variables (Sim/DTM-specific or common for all Sims/DTMs) and results (flood extents, flow depths, flow characteristics).Values of interest are highlighted.

Figure 17 .
Figure 17.Histograms of the errors of Sim 1 and Sim 2 vs. "ground truth" Sim 3, indicating skewed distributions and nonlinear dependencies.

Figure 17 .
Figure 17.Histograms of the errors of Sim 1 and Sim 2 vs. "ground truth" Sim 3, indicating skewed distributions and nonlinear dependencies.

Table 2 .
[36]meters used in the IDF curve equation[36], regarding the Anavra station and a 1000year return period.

Table 2 .
[36]meters used in the IDF curve equation[36], regarding the Anavra station and a 1000-year return period.

Table 4 .
Flood extent and flow depth statistics for the three simulations.
Figure10presents the flow depth maps for all Sims side-by-side, whereas Figure11

Table 4 .
Flood extent and flow depth statistics for the three simulations.