1. Introduction
The main problem analyzed in the present research is the reconstruction of river channel bathymetry on the basis of sparse measurements. It is assumed that measurements are taken in typical ways for river engineering, such as bathymetric surveys in selected cross-sections. The importance of this problem is growing, with increasing interest in and application of one-dimensional as well as two-dimensional river hydrodynamic models. The development of such a tool has to reflect the requirements and needs of the application area to model flow and other processes in rivers. One of the most important needs is the integration of hydrodynamic modeling with the GIS environment. Hence, the research done is presented in the form of a toolbox consisting of methods supporting the process of river bathymetry reconstruction. The toolbox is called RiverBox and it is fully integrated with one of the most popular GIS environments, ArcGIS, developed by the company Esri (Redlands, CA, USA).
The reconstruction of river bathymetry is not a new problem, but interest in this area has begun to grow in recent times. Two of the first papers were published by Tate et al. [
1] and Goff and Nordfjord [
2]. These works, like the majority of the literature in this field, are based on a channel-oriented coordinate system. This means that the coordinate system consists of longitudinal and transversal coordinates. The first is assigned to the river centerline or thalweg. The second is perpendicular to the first in most cases. Such an approach is an application of an older concept related to the generation of boundary-fitted numerical grids [
3]. The two mentioned publications used very simple approaches. The main idea of the former is based on linking the GIS techniques and the well-known hydrodynamic modeling package HEC-RAS (Hydrologic Engineering Center—River Analysis System, HEC, Davis, CA, USA). The fundamental element is an arbitrary determination of the river centerline. The cross-section shape does not depend on other spatial information, e.g., location of banks. Although it is quite a simple approach, even today it is used widely in such modeling systems as HEC-RAS and MIKE 11 (DHI, Hørsholm, Denmark). The procedure developed in the second paper uses more information for bed reconstruction. This involves the river centerline, the bank lines, and additional track lines. All the lines are determined arbitrarily. The interpolation of elevations is done along the track lines including the centerline and the banks. The whole procedure of track line preparation is time consuming.
These two approaches are improved and extended in the papers published by Merwade and co-authors [
4,
5]. The main idea presented there is based on the linking of the longitudinal coordinate with the stream thalweg. The problem is that the thalweg is not determined a priori. It has to be calculated iteratively with improvement of the interpolation. This approach may extend the computational time. The extension of this concept was published in Merwade’s next papers [
6,
7] and finally applied as an ArcGIS plug-in [
7,
8,
9,
10,
11] working with older versions of ArcGIS (up to 10.3).
The problem of river bed reconstruction has also been studied by other researchers. In the paper written by Legleiter and Kyriakidis [
12] a very interesting and fairly complex procedure for the transformation of geographic and channel-oriented coordinates is described. Schäppi et al. [
13] also presented an original approach to the problem of coordinate transformation. In most cases the longitudinal coordinate is determined first, and then the transversal coordinate is calculated. Schäppi and co-authors developed a scheme where this direction is reversed. Additionally, the coordinate lines do not need to be orthogonal. The concepts presented by Merwade et al. [
4] and Schäppi et al. [
13] were implemented by Bogusz et al. [
14] and compared with an interpolation procedure available in MIKE 11. As mentioned above, the latter approach is similar to the basic method described by Tate et al. [
1]. A slightly different concept was presented by Kardos [
15]. The combination of algebraic mesh, linear interpolation along mesh lines, and spatial inverse distance interpolation is applied there. The most modern concept based on the discussed ideas is presented by Caviedes-Voullième et al. [
16]. In this research the thalweg plays the role of the longitudinal axis. The procedure is based on generation of intermediate cross-sections and transformation of the elevations interpolated in a cross-section to the 2D plane.
For some time, more sophisticated approaches have been presented in scientific journals. For example, Farina et al. [
17] applied the entropy principle for reconstruction of the bed. In turn, Zhang et al. [
18] developed a specific spatial interpolation method for this problem. The method is known as shortest temporal distance. It uses specific metrics to adjust the whole procedure to the channel-oriented problem. Finally, Lai et al. [
19] applied concepts borrowed once again from Thompson et al. [
3]. They applied an elliptic equation to generate an orthogonal and boundary-fitted grid along the channel. All these three mentioned approaches are very interesting, but their resilience to data sparseness has not been tested. According to the present author’s expectations, the uncertainty of the data caused by sparse measurements may be the reason to avoid sophisticated approaches and focus on simpler techniques.
The main purpose of the present research is to develop a toolbox including methods supporting reconstruction of river bathymetry. The river bed bathymetry should be created as a Digital Elevation Model (DEM). It should be easily linked with a DEM of the surrounding area, completing this layer with appropriate information about bed elevations. The toolbox is prepared as a set of methods fully integrated with the ArcGIS environment. The Python 2.7 scripting language is used, because this programming environment is also integrated with the current version of ArcGIS (10.5.1). The main idea applied is interpolation of the river elevations in a channel-oriented coordinate system. The main concern is the accuracy and efficiency of the methods applied. The problem of computational speed is crucial in cases of fairly slow scripting languages. To overcome this drawback, a specific hybrid method linking longitudinal and spatial interpolations is proposed and tested.
2. Materials
2.1. Chosen Case Study
Two river reaches were chosen for the tests. The selection was done considering their lengths, variability of geometric properties, and availability of data. The analyzed channels are: (1) the reach of the Warta river near the town of Wronki in the western part of Poland and (2) the downstream part of the Ner river in the central part of Poland.
The Warta river is a right tributary of the Oder river, which flows along the border between Poland and Germany. The Warta is fourth in the list of the longest rivers in Poland. The length of the first reach chosen is fairly short. It measures approximately 7.6 km. The average width of the channel is 70 m. Along this short reach the width changes slightly. The average depth is about 1.5–1.6 m. In the town of Wronki the gauge station is installed. It enables monitoring of the water stages and assessment of discharges. The mean discharge in this part of the Warta river is 124.7 m
3/s. For the determination of inundation areas and flood hazard maps the extreme discharges representing 10-, 100- and 500-year floods were used. Their values are 560 m
3/s, 928 m
3/s, and 1211 m
3/s, respectively. In 2016 these data were bought from the Polish Institute of Meteorology and Water Management (IMGW—Polish
Instytut Meteorologii i Gospodarki Wodnej). All quoted hydrological characteristics are estimated on the basis of observations and data measured in the period 1971–2014. The whole reach is denoted with a blue line in
Figure 1. The pink area represents the inundation area for a 100-year flood.
The Ner river is the right tributary of the Warta river. Although this reach is about 19 km long, the average width of the channel is much smaller than before, and it equals about 20 m. The changes of the width along the channel vary from over 12 m to 30 m or more. The average depth is about 1 m. The only gauge station in this reach is located in the middle, in the town of Dąbie. This equipment enables observation of water stages and assessment of discharges. The average discharge is 10.93 m
3/s. The extreme values are as follows: for a 10-year flood 129.72 m
3/s, for a 100-year flood 191.73 m
3/s, and for a 500-year flood 230.93 m
3/s [
20]. All these values are assessed on the basis of data observed during the period 1961–2013. The modeled reach is presented as a blue line in
Figure 2. As before, the pink areas denote the flood inundation for a 100-year event.
2.2. Available Data for Bathymetry Reconstruction
There are three sets of data necessary for interpolation of the river bed. These are: (1) the basic digital elevation model (DEM); (2) the measurements of the river cross-sections; and (3) two polyline layers representing the river centerline and river banks. Only the last element has to be prepared manually by the user.
Two DEMs are applied in the presented research. The first represents the area surrounding the reach of the Warta river near the town of Wronki. The second is used for tests with the reach of the Ner river. Both of them are obtained from Head Office of Geodesy and Cartography (GUGiK—Polish Główny Urząd Geodezji i Kartografii). Their spatial and vertical resolutions are the same. The size of the cell is 1 m in each direction. The vertical resolution is 15 cm. Their file sizes are 467 MB and 1340 MB, respectively. The sizes of tables expressed as column times rows are 12,995 × 9421 in the first case and 25,713 × 13,965 in the second. Hence, the total areas are 122 km2 and 359 km2, respectively. The nonempty areas are smaller, equaling 78 km2 and 297 km2, respectively.
Example data are shown in
Figure 3. The DEM is shown in green-red colors and covered with a transparent hillshade layer. The black line represents the river centerline. The red lines are used for reconstruction of the river banks. The measurement points are denoted as green dots. The number of points is not small, but their location disables application of standard spatial interpolation methods, e.g., those available in ArcToolbox (Redlands, CA, USA).
At the current stage of toolbox development, the measurements of cross-sections may have one of three forms. The differences between these forms are hidden in the tables of attributes. Examples are shown in
Figure 4. The basic form consists of the points from GPS measurements. They are referred to here as GPS points. In such a case the table of attributes has to contain the number of measurement points and the elevation at each point. The number of points has the format “XS.pnt”, where “XS” is the number of cross-sections and “pnt” is the number of points. The names of the attributes are not known a priori. The user has to choose appropriate attributes (columns in
Figure 4) before the tool is run. The more developed format is called the ISOK format. It is referred to here as “ISOK points”. This format was used for measurements in the ISOK project in Poland, as per the IT system of the Country’s Protection Against Extreme Hazards (ISOK—Polish
Informatyczny System Osłony Kraju przed nadzwyczajnymi zagrożeniami). The EU Flood Directive was implemented as part of this system in Poland. In the case of the ISOK format the table of attributes includes the number of the point as “XS.pnt” and elevation as well. The names of the attributes are known a priori. These are “NR_PKT” (in Polish, meaning point number) and “RZEDNA” (in Polish, meaning elevation), respectively. The measurements in the ISOK project were also used for description of the hydro-structures’ shape. Hence, there is also information about the type of the point in the table of attributes. This column is denoted as “KOD_PKT” (in Polish, meaning code of points). The identification numbers located there may be used for selection of points before the interpolation. The additional attribute present in the ISOK points is “KOD_TERENU” (in Polish, meaning code of terrain). These values may be used further for determination of the roughness coefficients. The final and the simplest format is that of the XS points. In the table of attributes there is only information about the number of cross-sections and the elevation. Both attributes have to be indicated by the user.
Along the reach of the Warta river there are 11 cross-sections measured. The average distance between measurements is 756 m. The measurements near two bridges in the town of Wronki are spaced closer together. Their average distance apart is 52 m. On the other hand, the maximum distance between cross-sections is 1460 m. The average number of measurement points in a single cross-section is 136. The course of the river and its width give 0.66 km2 for bed interpolation. The average and the longest distance between interpolated cells and measurements are 160 m and 730 m, respectively.
The characteristics of the analyzed reach of the Ner river are slightly different. There are 35 measurement cross-sections along the reach. The average and maximum distances between measurements are 557 m and 2565 m, respectively. The difference between the average and the maximum is larger than in the previous case, which suggests greater non-uniformity of the measurements’ location. Near the structures, the distance between measurements is similar to the previous values. This time it is 59 m. The average number of measurement points per cross-section is much smaller, at nine points. Hence, the accuracy of the measurements is lower. Although the length of the reach is greater, the area for interpolation is smaller. In this case it is 0.41 km2. The reason for such differences in interpolation areas is the much smaller width of the channel in the second case. The estimated average distance between cells and measurements is smaller than before and equals 136 m, but the estimated maximum is 1283 m, which is about twice that in the previous case.
The last elements necessary for interpolation are two vector layers including polylines. The first should represent the river centerline. This layer does not include any specific attributes. The only requirement is that the direction of the line should be compatible with the direction of the flow in the river. The second layer should consist of the lines of the river banks—left and right, respectively. There should be a string attribute called “LineType”, where the type of the bank is denoted. The denotation should be “Left” or “Right”. The direction of lines should also be compatible with the direction of river flow.
2.3. Data for Validation Tests
The data used for direct interpolation in the two presented cases were measured as an element of the ISOK project, mentioned before. The field measurements of this project were taken in 2011–2012. The DEM obtained from GUGiK was created on the basis of LIDAR (Light Detection and Ranging) measurements made at the same time. To verify the results of the interpolation, other data are used. In the same reach, additional measurements were taken during 2015. Three cross-sections marked in Figure 9a were measured as preliminary work for design of the highway bridge located downstream of cross-section no. 4. The elevations were determined with a GPS device. Today, after 3 years, it is difficult to obtain information about this device and assess its accuracy. However, it is possible to compare measurements on the floodplains with the DEM obtained from CODGiK (Polish—Centralny Ośrodek Dokumentacji Geodezyjnej i Kartograficznej). The comparison of elevations was done at 59 points. The results showed that the mean error is about 31 cm and the maximum is 331 cm. The standard deviation is about 53 cm. The differences of greater than 50 cm between data from 2015 and the DEM used may be explained by the possible changes in terrain which could occur over the 3 years between two types of measurements. For example, there are differences in bank location, which may be the result of accidentally occurring bank instability caused by lateral erosion.
The measurements made in 2015 are used later for direct comparisons with interpolated bed elevations. The different times of the ISOK measurements and validation data may cause errors of another kind—those related to sediment processes and morphological changes of the bed. In fact, the period of 3 years is not long considering the scales of continuous sediment transport. Hence, the validation data may be good to assess overall correctness of the interpolation.
5. Conclusions
Today, the technology applied to collect terrain data is based on aerial LIDAR scanning. It is quite a simple method to obtain high-quality data. The resolution of the scanning is appropriate for the majority of applications. Unfortunately, this approach is not able to provide sufficient information about the river and stream bathymetry. This means the DEM created on the basis of the LIDAR scanning does not fit the requirements of hydrodynamic modeling. Additional measurements have to applied. In most cases, GPS technology is used alone or in combination with acoustic or laser scanning of the channel bottom. Such measurements are performed from the ground or from a boat. Because the measurements of the river bed are difficult and time-consuming, the number of such measurements is limited. In most cases the data provided have the form of point-elevation data distributed along selected river cross-sections. The location of the cross-sections is sparse. Finally, the hydrodynamic modeler obtains two sets of data which are not compatible considering the form, resolution, and accuracy. The presented methodology is a way to overcome this problem. The only robust approach is to complete the river bed measurements in such a way that they fit the resolution of the DEM.
In this paper a specific methodology for river bed reconstruction is proposed. It is based on the concept of a channel-oriented coordinate system and basic linear interpolation. Such a simple approach appears to be the best in the case of truly sparse data. The environment chosen for implementation is ArcGIS (version 10.5.1) with the scripting language Python 2.7. The efficiency of the algorithm in the sense of memory usage and computational time is of crucial concern in practical cases. Hence, the ideas presented were implemented in three different computational schemes. The first is based on the raster–array transformation specific to the tools available in the standard Python module attached to ArcGIS and called ArcPy. The results provided are acceptably accurate, as the validation tests show. However, this approach was found to be too slow and with too great memory loading. In the second scheme, the inefficient raster–array transformation is replaced by more robust raster–point transformation. It overcame the problem of memory usage, but the problem of excessive computational time was still important. The main concept of the third scheme is to link two kinds of methods. The first consists of spatial interpolation tools available in the standard Arc Toolbox distributed with ArcGIS. The second is an algorithm developed for bed interpolation in a channel-oriented coordinate system. Such a linkage proved efficient and effective. The loss of potential accuracy is slight, the memory usage is not huge, and the time of computations is much shorter. The third approach is recommended in practical applications.
The algorithm is prepared as a set of tools included in the ArcGIS toolbox called RiverBox. It is open source software. The web address for download is given in the text. The available tools could help in the preparation of the river bed for further hydrodynamic model simulations. Although the toolbox is completed, to properly support preparation of the model for river flow computations further tools have to be added, e.g., interpolation of bed cover information and linking with terrain cover data. An additional problem is transformation of the GIS data to formats valid for hydrodynamic systems, e.g., HEC-RAS, MIKE 11, and others. Such tools will be prepared in the future.