Hyporheic Process Restoration: Design and Performance of an Engineered Streambed

: Stream restoration designed specifically to enhance hyporheic processes has seldom been contemplated. To gain experience with hyporheic restoration, an engineered streambed was built using a gravel mixture formulated to mimic natural streambed composition, filling an over ‐ excavated channel to a minimum depth of 90 cm. Specially designed plunge ‐ pool structures, built with subsurface gravel extending down to 2.4 m, promoted greatly enhanced hyporheic circulation, path length, and residence time. Hyporheic process enhancement was verified using intra ‐ gravel temperature mapping to document the distribution and strength of upwelling and downwelling zones, computation of vertical water flux using diurnal streambed temperature patterns, estimation of hyporheic zone cross section using sodium chloride tracer studies, and repeat measurements of streambed sand content to document evolution of the engineered streambed over time. Results showed that vertical water flux in the vicinity of plunge ‐ pool structures was quite large, averaging 89 times the pre ‐ construction rate, and 17 times larger than maximum rates measured in a pristine stream in Idaho. Upwelling and downwelling strengths in the constructed channel were larger and more spatially diverse than in the control. Streambed sand content showed a variety of response over time, indicating that rapid return to an embedded, impermeable state is not occurring.


Introduction
A river's boundary does not end at the channel margins. Instead, the river system includes physical processes that extend laterally, into the riparian zone and floodplain, and vertically, into the substrate beneath the channel and floodplain [1][2][3]. This transitional zone between subterranean and surface aquatic environments, commonly referred to as the hyporheic zone [4], provides ecological functions/benefits that help sustain streambed and aquatic conditions. Among these functions are vertical water flux between the stream and subsurface, water temperature moderation, recycling of carbon, energy, and nutrients, natural attenuation of certain pollutants, a sink/source of sediment for the channel, and habitat for benthic and interstitial organisms [4]. The role of the hyporheic zone is increasingly recognized for its significance with regard to river management, conservation, and restoration [4][5][6][7][8][9], and as such, the restoration of hyporheic zone processes was one of the primary design goals of the City of Seattle's floodplain pilot projects.
Although improved hyporheic processes may be mentioned as a project goal [10], rarely do hyporheic processes drive restoration or design [11,12]. Usually, the argument is put forth that, by constructing a channel that mimics a natural alluvial morphology, and by reestablishing natural

Thornton Creek Watershed and Study Site Description
Thornton Creek drains nearly 29 km 2 (11.1 square miles) of urban Seattle and Shoreline, Washington, USA, flowing through two second order tributaries and a 2.1 km (1.3 mile) mainstem into Lake Washington, just north of the Lake Washington Ship Canal [38] (see Figure 1). The watershed lies entirely within Pleistocene glacial drift, consisting of till and outwash deposits overlying compacted glaciolacustrine clay, and ranges in elevation from about 150 m in its headwater areas to 2.5 m at its mouth (490 to 8.2 feet, respectively) [38].
Originally heavily forested, the watershed receives nearly 89 cm (35 inches) of precipitation annually, which falls primarily as rain between October and May [39]. Currently, roads, buildings, and other impervious surfaces comprise 59% of the watershed area [38], and most of the remainder has been converted to non-native vegetation such as lawns and landscaping with different runoff, soil stability, and erodibility properties than originally existed.
The stream channel has been straightened and its banks hardened over much of its length. In most places, the streambed has been simplified by channel incision, deliberate channelization, and removal of its structural framework of large wood, such that alluvial gravels and sands exist only in a thin veneer, 15-25 cm thick (6-10 inches), over compacted till or clay. Structural complexity, which formerly would have retained much deeper alluvial deposits, is largely missing. The pre-project streambed surface layer median (D50) grain sizes ranged from 15.7 to 58.2 mm (0.6-2.3 inches), and widespread embedded conditions reduce permeability to water movement.
The glacial-drift stratigraphy of the Thornton Creek watershed is conducive to development of springs where permeable sands overlie till or clay. This groundwater emerges on the edge of the ravines through which creek channel flows. Normally, this water feeds floodplain wetlands, or enters the stream channel along its banks in places where the channel lies adjacent to ravine walls.
Several species of salmonid fish are present. Cutthroat trout (Oncorhynchus clarki) are the most common, but coho salmon (O. kisutch), and on occasions, Chinook salmon (O. tshawytscha), sockeye salmon (O. nerka), and rainbow trout (O. mykiss) are seen [38]. Rarely has the anadromous form of rainbow trout, the steelhead, been documented [38].
Typical of many urban streams [40], Thornton Creek suffers from excessive stormwater runoff and poor water quality. As a consequence, average early spawner mortality for coho salmon in Thornton Creek is the highest (79%) among Seattle's creeks [38,39].
In order to address the effects of stormwater runoff, Seattle Public Utilities (SPU) embarked on two channel and floodplain reconstruction projects: The Kingfisher site, located on the South Fork of Thornton Creek, and the Forks Confluence site, located at the confluence of the South and North Forks of Thornton Creek (see Figure 1). The Kingfisher site has a drainage area of 6.6 km 2 (2.5 square miles) and the Forks Confluence site drains 9.8 km 2 (3.8 square miles).
Control sites were established immediately upstream of each of the restoration sites in sections of the channel that were morphologically similar to pre-restoration conditions. For the purposes of discussion in this paper, the restoration sites shall be referred to as the treatment sites. This paper will focus almost exclusively on the Kingfisher Site, as at this site the full suite of hyporheic restoration techniques were implemented, whereas at the Forks Confluence site, only a portion of the hyporheic restoration techniques were implemented. At both sites, the control reach was located as close as possible upstream from the treatment sites; at Kingfisher, the control reach is approximately 90 m (300 feet) upstream of the treatment reach.
The Kingfisher reach is a gaining reach, with significant amounts of groundwater entering the channel from the surrounding hillsides (particular the south side) in a mixture of surface flow at discrete locations and dispersed subsurface flow [41]. Figure 2 illustrates typical pre-construction site conditions in and adjacent to the channel at the Kingfisher Treatment site. Small seasonal wetlands and overbank sand deposits existed on top of compacted artificial fill, which covered the original (but highly disturbed) alluvial deposits. Impermeable compacted clay underlies the site, forming a barrier to vertical water movement and a partial control on channel incision.   of treatment reach (c) before, and (d) after, construction, taken from close to same vantage point (the "before" is taken closer to the south slope, on the right side of this photo).
Wherever possible, monitoring techniques were implemented both on the treatment and control reaches. Both the treatment and the control reaches have been heavily impacted by human activities, including complete removal of historic forest cover, removal of instream wood, channel straightening, installation of major sewer lines through and along the channel, encroachment by residential yards, bridges, and culverts. The control site is in a more natural condition than was the pre-project restored site, having a small but functioning floodplain, an established forest cover (albeit with numerous non-native species), small amounts of in-channel large wood, and a wider channel, being about 4.6-5.8 m (15-19 feet) in width. The pre-project treatment site had a channel width of only 2.4 m (8 feet), was lined with large rocks, and had no large wood, and the floodplain had been replaced by several feet of fill to accommodate the five houses, and their outbuildings, that were removed from the site. Of critical importance for this study, both the pre-project treatment site and the control site had experienced similar channel incision due to human activities resulting in extremely similar streambed conditions. Both treatment and control streambeds consisted of a thin veneer, 15-25 cm (6-10 inches) thick, of moderately embedded alluvial gravel over a dense, impermeable clay substrate. As such, hyporheic exchange and flow were extremely low [42].

Design of Hyporheic Restoration Features
Design of the hyporheic restoration features as a part of the overall stream restoration project was completed in 2013. Project construction occurred during the summer of 2014 and was essentially complete by October of that year. Monitoring of hyporheic processes began in the summer of 2015, after the constructed streambed had experienced one winter season of adjustment to the hydraulic conditions and sediment load, and then continued through October 2017.
Urban stream restoration typically consists of channel reconstruction with hardened elements, such as large-wood or rock structures, to control channel gradient and channel location. The constructed morphology, as much as is possible, is designed to mimic morphology and habitat features found in more pristine settings. Often, clean, highly permeable gravel mixtures are imported to improve streambed conditions at shallow depths of 15-30 cm (0.5-1 feet), but only directly along the channel alignment.
Key design features of both treatment sites consisted of construction of a longer, wider, more sinuous channel, with numerous large-wood structures and creation of an inset floodplain designed to engage at the 1.5-year annual peak discharge. A primary goal of this floodplain restoration project was to store and detain floodwater temporarily, reducing water velocities, moderating flood impacts downstream, and reactivating soil water storage and deeper infiltration.
A unique aspect of this restoration design was incorporation of an engineered streambed for improved hyporheic function. Flow of water into, and out of, the hyporheic zone was very small under pre-project conditions, on the order of 0.018-0.019 m/day (0.7 inch/day) [42], and hyporheic cross section was quite small, due to the thinness and embedded condition of the alluvial streambed [39]. By building a streambed specifically engineered to enhance hyporheic exchange, increase hyporheic residence times, and increase hyporheic volume, improvements to temperature and chemical water quality were anticipated.
Since this engineered streambed was innovative and experimental, an intensive effort was made to monitor its effectiveness in improving hyporheic function. This paper both describes the new design elements used in hyporheic restoration, as well as the monitoring protocol and subsequent results for both treatment and control reaches. The Kingfisher site was selected for this presentation since it has a thicker engineered streambed, and therefore was expected to exhibit a larger response, than the Forks Confluence site. A flowchart illustrating the project design process appears in Figure  4.

Over-Excavation and Backfilling with Clean Gravel
The primary hyporheic restoration action was installing a deep (90-240 cm or 3-8 feet), wide (up to 9.1 m or 30 feet) alluvial gravel corridor into which the active channel was constructed. The goal was to greatly increase the hyporheic cross section over pre-project conditions, and to make the hyporheic cross section significantly larger than is usually attained in urban restoration projects. At the Kingfisher site, a 90 cm (3 feet) thickness was specified, tapering to a smaller depth at the edges of the active channel. Meanwhile, average active channel width was increased from about 2.6 to 6.8 m (8.6 to 22.2 feet), which considerably increased the hyporheic cross section. The alluvial layer target width was 9.1 m (30 feet), varied to accommodate site constraints and was approximately twice the average bankfull channel width.
A critical design element was sizing of the gravel to maximize hyporheic function. The design specifications were set to approximate the surface gravel size at the control site as shown in Figure 5, but the fine materials (<2 mm) were omitted to maximize hydraulic conductivity. In the native (subsurface) material, this fine material fraction comprises approximately 15% of the volume. The goal was to create a mixture that would be similar enough to the original streambed alluvial material so that it would be maintained over time by natural scour, transport, and fill processes, and yet would not be occluded with fine sand and silt. A thin layer, 10-20 cm (4-12 inches) of native soil material was placed over the top of the alluvial layer outside of the boundary of the active channel to facilitate plant growth on the floodplain.

Plunge-Pool Structures
Plunge-pool structures, constructed with channel-spanning sill logs to create a 24 cm (0.8 feet) step in the longitudinal profile of both the water surface and streambed, are a common design feature in stream restoration projects [43]. This step size is consistent with criteria for juvenile salmonid fish passage. The basic habitat enhancement goals for this structure are to generate plunging flow to maintain a scour pool for deep-water fish rearing habitat, with a resulting riffle at the tail-out of the pool; the upstream end of the riffle is typically a preferential location for salmonid spawning. For Thornton Creek, plunge-pool structures were designed as illustrated in Figure 6 below, with a deeply over-excavated placement hole, as much as 2.4 m (8 feet) deep, with 90 cm (3 feet) of clean gravel mixture placed in the hole before placing the channel-spanning sill log and footer log. In addition, a layer of compacted sand/silt/crushed-gravel mixture was placed on top of this gravel foundation to create an occluding layer, ranging in thickness from 1.5 m (5 feet) upstream to 30 cm (1 foot) downstream, which forces downwelling water to travel deeply beneath the structure before emerging in the tail out zone of the pool downstream. This increases the residence time and path length for water in the downwelling zone on the upstream side, in response to the hydraulic head differential created by the sill logs. The plunge-pool structure drives local hyporheic circulation, on the scale of one habitat unit, but the circulation below the pool is much deeper, and has a longer path length and residence time, than that associated with a simple channel-spanning log. At the Kingfisher site, six plunge-pool structures were installed, one of which was studied intensively.
A primary habitat benefit of this enhanced hyporheic circulation is cooling of the water as the heat from the incoming water is transferred to the subsurface gravel, and to the cooler native material below. Because the water is forced down more deeply, it encounters a cooler mineral substrate and interacts with a greater thermal mass for heat exchange. This cooler water then emerges into the pool at the upstream end of the riffle, benefiting both juvenile salmon rearing in the pool, as well as those eggs laid in the riffle [44]. The cooler water maintains a lower-stress condition for the eggs during development, and will improve overall survival [45]. Though not explored in this study, it is anticipated that in winter, the hyporheic circulation will warm the incoming water, with heat transferring from the native material into the hyporheic water [42]. On the broadest scale, this enhanced hyporheic circulation acts as a thermal buffer, moderating diurnal and seasonal water temperatures through greater heat exchange with the native parent material. This local water flux is superimposed upon the reach-scale hyporheic circulation induced by the site-wide over-excavation and backfilling with gravel. That is, some fraction of the water, which percolates down into the hyporheic zone at the upstream end of the reach, will stay in the subsurface, flowing downstream until forced to re-emerge at the end of the treatment project site.

Subsurface Drains
In pre-project conditions, water from hillslope seeps percolated through the wet soil or flowed across the soil surface to enter the stream channel. To capture this clean, cool water prior to it being warmed through air exposure or used up by plants, in one location, a subsurface trench was filled with permeable gravel that routed directly to the stream channel under the floodplain surface to deliver the water directly into adjacent plunge-pools, where cooler temperature inflows would provide the greatest habitat benefit (Figure 7). The effects of subsurface drains would manifest in reduced instream temperatures where the subsurface drain enters the stream. The effects of individual subsurface drains were not studied, though they would have a localized effect similar to upwelling hyporheic water.

Hyporheic Logs
Hyporheic logs were installed as single logs buried in the streambed at the bottom of the gravel layer within the active channel, approximately perpendicular to the flow, and with 10 cm (4 inches) of gravel separating them from the native substrate. They were intended to force hyporheic flow to mix laterally and vertically and increase hyporheic residence time and flow path length. The individual effects of hyporheic logs were not studied, though they would have an overall influence in greater lateral mixing of hyporheic flow and increased hyporheic residence time.

Pocket-Water Logs
Pairs of logs were installed at several locations to stabilize streambed elevation and slope (grade control), generate small scour pools (pocket water), and encourage hyporheic exchange similar to a plunge-pool, but on a smaller (shallower) scale. Each log had 3046 cm (11.5 feet) of a coarse cobble mixture between it and the native substrate, which limited scour depth while allowing hyporheic flow. This cobble mixture was in turn covered with 15 cm (0.5 feet) of streambed gravel. Structures similar to these are frequently used in stream restoration projects [43] so they were not studied but are called out as they are part of the overall stream restoration project and integrate with other the hyporheic enhancement features. Figure 7 shows an overall plan view of the majority of the Kingfisher restoration project, particularly the elements that were evaluated as a part of this study.

Materials and Methods for Hyporheic Study
The physical condition and function of the hyporheic zone was monitored using five attributes, which are described below and summarized in Table 1. Techniques used to monitor each attribute differed in terms of their cost and effort, whether the outcome was precise and quantitative versus relative or qualitative, whether the scope was locally intensive versus spatially extensive, and their utility for detecting change over time. Taken together, this suite of techniques provides a basis for inferences about the degree to which hyporheic function has been achieved and its persistence. All tests were completed during stable, baseflow hydrological conditions, during dry weather when diurnal air temperature patterns were not changing in order to minimize confounding effects from changes in water discharge or weather-induced temperature shifts.

Measurement of Vertical Water Flux
Hyporheic exchange rate was quantified by estimating the vertical water flux at discrete points on the streambed from the 10-minute-interval temperature time series measured at the surface and at two depths in the gravel, 10 and 20 cm (4 and 8 inches). To calculate the vertical hyporheic flux in m 3 /m 2 /day (which becomes m/d) between two elevations (e.g., streambed surface and 10 cm below the surface), we used the method of Garuglio et al., 2013 [46,47], which allows calculation of vertical water flux from temperature time-series data alone, without knowing pieziometric head or hydraulic conductivity.
At the Kingfisher restoration site, plunge-pool structure No. 4 was selected for vertical water flux instrumentation, as it was located in the middle of the reach in an area with floodplains on both sides of the channel, away from the influence of valley sides, and was installed with a full width of hyporheic gravel. At each of five locations upstream and five downstream of plunge-pool No. 4, stainless-steel piezometers 38 mm (1.5 inches) in diameter were driven into the streambed to a depth of 61 cm (2 feet) with their tops capped and flush to the streambed surface. Piezometers were custom fabricated from stainless-steel water pipe by crimping shut the lower end to act as a drive point. Each piezometer was equipped with a ring of 6 mm (0.25 inch) screen holes surrounding temperature data loggers (TidbiT v2 water temperature data loggers, model UBTI-001, Onset Computer Corp., Bourne, MA, USA) held at precise depths of 10 cm and 20 cm beneath the streambed surface. Foam gaskets (made from short sections of water pipe insulation) prevented water from moving vertically between each datalogger with its piezometer. The piezometers were placed in a T-shaped array of five units upstream of the plunge-pool structure, allowing for both a cross-channel and a longitudinal transect to be created. Downstream of the structure, only one of the five units in the array could be placed outside of the channel midline due to a more complex streambed topography and more vigorous gravel deposition/scour patterns. Each of the two arrays was also equipped with one data logger to record surface water temperature. Temperatures were measured every 10 minutes. In addition to the 10 piezometers bracketing plunge-pool No. 4, a piezometer was installed at the upstream end of the reach, just upstream of the sill log of plunge-pool No. 1, and two were placed at the far downstream end, in the pool tail-out zone for plunge-pool No. 6 (see Figure 7). These are zones where, on a reach scale, water is expected to first enter and then to finally exit the hyporheic zone, respectively [44].
To calculate water flux from the temperature data, we determined the ratio of the diurnal temperature sine wave amplitudes and the amount of phase shift between the two temperature sine waves bracketing the depth interval chosen. Amplitudes and phases were determined from the temperature time series by fitting the data to a sine function using non-linear least squares (NLS) regression. Calculations were performed for five time periods of 3-10-day durations, with stable (unchanging) weather and relatively constant discharge, and which were close enough to a piezometer maintenance visit to assure accurate knowledge of any streambed deposition or scour over the piezometer top. Flux over each of three possible depth intervals was determined: 0-10 cm, 0-20 cm, and 10-20 cm. The shallowest (0-10 cm) interval is likely to be the most representative of vertical exchange, since the assumption of 1-dimensional (vertical) flow holds best near the streambed surface. Large discrepancies between flux calculated for different depth intervals at the same piezometer location can occur if significant lateral flux is present.
Since there was frequent sediment deposition in this reach, the streambed elevation on top of each piezometer was surveyed with a total station theodolite (Nikon model DTM-520, Nikon-Trimble Co., Ltd., Tokyo, Japan) accurate to 3 mm (0.1 inch), to determine the actual depth to each data logger, before servicing (downloading and resealing). These actual depths were used in the calculations.

Mapping of Upwelling and Downwelling Zones
The locations and relative strengths of upwelling and downwelling zones were mapped out on a reach-wide scale by means of a comparison between the surface water temperature and the temperature at a depth of 10 cm (4 inches) into the streambed [48]. To record the temperature at the appropriate depths, an array of approximately 60 plastic tubes, of 9 mm (0.4 inch) inner diameter, were inserted into the streambed in equally spaced (approximately) transects of two to six tubes each. Tubes were cut from rolled flexible polyethylene water pipe. Figure 8 shows tube and data-logger locations for the Kingfisher Treatment reach. The streambed temperature at the bottom of each tube was measured three times over the course of a few hours using a precision hand-held thermister probe accurate to 0.015 °C (model HH41 ultrahigh accuracy hand held thermistor thermometer, Omega Engineering, Inc., Norwalk, CT, USA). The surface water temperature was measured at four locations in the reach at 10-or 15-minute intervals using programmable temperature data loggers accurate to 0.21 °C (TidbiT v2 water temperature data loggers, model UBTI-001, Onset Computer Corp., Bourne, MA, USA), and surface water temperature at each transect of tubes estimated by spatial and temporal interpolation. The average difference in temperature between the surface water and the interstitial water 10 cm (4 inches) into the streambed is an indicator of relative upwelling versus downwelling, and the relative strength of that process [48]. The temperature differences between surface water and subsurface were normalized by subtracting each difference from the median of all the differences, and then the average of the three normalized differences for each tube was computed for comparison. These averages are reported in the results.
Since the four surface water temperature data loggers were placed at locations uniformly spaced along the reach, as shown in Figure 8, they were also used to examine the longitudinal temperature patterns in the reach. The test was conducted during warm, sunny weather, before the vegetation at the Kingfisher Treatment site had re-established any shade. The hypothesis is that, under these conditions, steady warming of surface water in the downstream direction would be expected in the absence of influx of cool subsurface water. A different or less uniform pattern of temperature change would be strong evidence for hyporheic mixing and/or substantial groundwater influence. This longitudinal temperature pattern will be interpreted in the results separately.

Immobile versus Mobile Cross Section Ratio
Sodium chloride (NaCl) tracer studies are well-established as a method to characterize hyporheic exchange processes, using the relative size of the estimated "immobile" (i.e., long residence time) water cross-sectional area compared with that of the surface water flow ("mobile," short residence time) cross-sectional area as a surrogate for the strength of hyporheic water flux processes [49].
In a tracer study, concentrated sodium chloride solution is injected into the stream at a constant rate, and the time series of electrical conductivity (a surrogate for chloride ion concentration) in the surface water is measured. Measurement at a point where the water is thoroughly mixed immediately downstream of the injection point constitutes the "input" pulse, and measurement at the far downstream end of the reach documents how this input pulse is modified by longitudinal dispersion and exchange and mixing with hyporheic and other relatively "immobile" water. Once the pulse of increased conductivity at the end of the reach has stabilized to a constant "plateau" value, the tracer injection is turned off, and the conductivity time series measurements continued as the conductivity declines towards the background level.
A one-dimensional dispersion-advection model with immobile storage elements (OTIS-P, [50]) is then calibrated to maximize fit of the measured conductivity pulse at the end of the reach to the model predictions, given the measured mixing zone input pulse. Calibration gives an estimate of the effective "immobile" and "mobile" zone water cross sections [50,51]. The ratio of these cross sections is then used to infer hyporheic function [51].
At the Kingfisher Treatment and Control sites, electrical conductivity was measured at 30second intervals using submersible data loggers (Hobo fresh water conductivity data logger, model U24-001, Onset Computer Corp., Bourne, MA, USA). Conductivity was normalized by subtracting the upstream background conductivity, which was fairly high (average 218-221 microsiemens/cm). Surface water discharge was 8.9 L/s (0.32 ft 3 /s) and 23.8 L/s (0.84 ft 3 /s) at the treatment and control sites, respectively. At the treatment site, tracer solution containing 205 g/L NaCl was injected at a rate of 0.920 L/min, with a 6-min interruption (76-84 min) due to a pump reservoir blockage. At the control site, 205 g/L NaCl solution was injected at a rate of 0.710 L/min.
Model calibration focused on optimizing model parameters to minimize sum of squared error over the time representing the rapidly increasing limb of the pulse, the plateau, and the rapidly decreasing portion of the trailing limb, but not the long-term recovery to background levels. This strategy is consistent with standard practice, given that the assumptions of a one-dimensional model approximation become less valid over longer time intervals [51].

Lateral Subsurface Hydraulic Gradient
Lateral hyporheic flow potential was estimated using an array of five floodplain monitoring wells installed outside the limits of the constructed streambed gravel, and stilling wells equipped with data loggers to record water surface levels within the channel (Hobo 13-foot fresh water level data logger, model U20-001-04, Onset Computer Corp., Bourne, MA, USA, measurement accuracy 3 mm). The wells were aligned in three distinct transects representing the subsurface hydraulic gradient perpendicular to in-channel flow. Two transects were defined upstream of plunge-pool structure No. 4 and one downstream of plunge-pool No. 4. Measurements of water surface elevation in the floodplain wells was done by hand on eight occasions during steady, baseflow conditions, allowing inferences about lateral subsurface flow throughout different seasons. Hand measurements were done with a custom-made (by the authors) electronic water surface detector attached to a stiff measuring tape, accurate to 1 mm.

Streambed Surface and Subsurface Sand Content
Streambed sediment texture (grain-size composition), through its effect on hydraulic conductivity and porosity, controls hyporheic processes. In particular, evolution of the streambed grain size composition over time, as the project adjusts to its hydraulic and sediment regime, will determine whether hyporheic processes in the engineered streambed persist. The pre-project streambed was highly embedded with fine sand, effectively cutting off water movement. Thus, one consideration of over-excavation and backfilling with clean gravel was the possibility that embedded conditions might reestablish themselves in the surface layers, blocking hyporheic exchange, and that this could potentially happen within only a year or two.
Sample sites were selected to be representative of active alluvial portions of the streambed. Where possible, these comprised pool tail-out zones or glides. To collect the sample for use in volumetric analysis, the site was first isolated with a 3-sided plywood shield sealed to the streambed to prevent flushing of fine sediment by water currents as the sample was excavated but open on the downstream side to allow easy access. First, the surface layer was excavated using gloved hands to the embedded depth of the larger particles comprising it. Finally, the subsurface layer was separately excavated beneath this level, to the depth necessary to obtain roughly the same amount of material as was taken for the surface layer. These methods are described in McNamara et al. [52] and Bunte and Abt [53].
Samples were taken to the laboratory, dried, sieved, and weighed according to standard methods [53], using sieve sizes based on a powers of two (Wentworth scale) sequence. Although a complete particle grain-size distribution was obtained by this process, detailed interpretation of the overall distribution was beyond the scope of this report. For inference purposes, comparisons were based on percent sand content by weight, with sand being defined as all grain sizes passing through the 2-mm sieve.
Sampling was performed in 2015 after the streambed had one winter season of post-construction adjustment, and then again in 2017 at the Kingfisher sites. At the Confluence sites, post-project sampling was performed in 2016 and 2017. For this study attribute, results from both the Kingfisher and Forks Confluence sites are reported in the results, to give a more comprehensive picture of sediment dynamics.

Results
Physical monitoring results are presented separately for each technique.

Vertical Water Flux
Prior to restoration, vertical (upward) water flux at the Kingfisher site was estimated at 0.019 m/d (0.7 in/d) or 2.20 × 10 −7 m/s, using a deeper piezometer placement (1.4 m or 4.5 feet) and a modeling approach that utilized both temperature and pieziometric head data [39,42]. After restoration, the site demonstrated an average downwelling of 1.69 m/d (5.5 ft/d) and maximum of 8.10 m/d (25.6 ft/d), an 89-fold increase over pre-project rates. The comparison in rates is indirect because two different methods were used; the thin veneer of alluvial substrate present during preproject conditions necessitated modeling vertical flux as an average over a much larger thicknesses (1.4 m), which included non-alluvial substrate beneath this thin veneer [39,42]. The vertical water flux values at Kingfisher after restoration are significantly greater than those reported in pristine streams (a maximum of 0.48 m/d or 1.6 ft/d, in Bear Valley Creek, Idaho, USA, reported by Gariglio et al. [46]).
Upwelling was monitored with only 1/20 of the intensity of sampling wells and ranged from none observed to as high as 6.59 m/d (21.6 ft/d). Most upwelling zones in the vicinity of plunge-pool No. 4 were located in areas of very dynamic streambed scour and fill, which were unsuitable for piezometer placement. The computed vertical water flux is summarized below in Table 2, which reports the maximum and average values for each layer and time period.  Floodplain well measurements (red and blue diamonds) are part of the discussion of lateral subsurface hydraulic gradient (Section 3.5, below) but are included in these figures to assist in interpretation of the piezometer analysis. Where the floodplain well has a higher water surface elevation than the nearby surface water stilling well, it is shown in blue (the stream is gaining water); where it is lower (stream losing water), it is shown in red. The size of the diamonds indicates the magnitude of gaining or losing during the time period being represented. In this time period, these wells show that lateral movement is generally from bottom to top across the channel (south to north) and is stronger on the downstream side of plunge-pool No. 4. This could be attributable to the effects of lateral subsurface water movement, which is generally bottom to top in this portion of the reach and would likely be carrying cooler water. However, it is difficult to explain why this influence would only affect one, and not all five, of the piezometers in that array.
The single piezometer at the upstream end of the reach is not shown in these figures, as it was inadvertently installed adjacent to a buried pipe which affected its temperature readings. Likewise, one of the two piezometers at the downstream end of the reach had to be abandoned due to persistent, deep scour caused by high velocity water from plunge-pool structure No. 6.
A complete listing of the computed vertical flux values and location coordinates in provided in spreadsheet format in the Suplementary Materials, Spreadsheet S1: Summary of computed vertical water flux.

Mapping of Upwelling and Downwelling Zones
Normalized differences between surface water and intra-gravel temperatures for the Kingfisher Treatment site are shown in Figure 10 below as circles, with the diameters scaled according to the temperature difference magnitude. The test was conducted during a period of stable, sunny weather, with constant baseflow discharge of about 0.017 m 3 /s (0.6 ft 3 /s). The positive normalized differences are zones of relative upwelling (shown as blue circles) while the negative normalized differences are zones of relative downwelling (shown as red circles). In this figure, the circle size in the legend represents a temperature difference of 1.0 °C. The maximum and minimum actual temperature differences (in degrees C, not normalized) were 2.89 °C (upwelling) and −0.03 °C (downwelling), but these numbers, like the size of the circles, only indicate relative tendencies, and are not meaningful in an absolute or quantitative sense. That is, a location mapped as "upwelling" might actually be physically downwelling, but at a weaker rate than those "downwelling" locations with temperature differences less than the overall median. The results indicate a large diversity of shallow hyporheic interactions across the study reach. Note that several transects showed relative downwelling zones on one side of the channel, with upwelling zones on the other. The middle portion of the reach, between plunge-pool No. 4 and No. 5, was an area of strong influence from lateral subsurface flow entering from the south (bottom edge of the channel in the figure). Although this subsurface flow feeds floodplain wetlands before continuing to flow subsurface to enter the stream channel, it still entered the channel cooler than the surface water during the time of year this test was conducted (8 June 2015). There were fairly consistent subsurface temperature differences from the south (cooler) to north (warmer) sides in this area. This lateral input of cool water probably explains why the upstream (left) side of plunge-pool No. 5 is mapped as an upwelling zone, even though this would be expected to be a downwelling area. Looking at the strength of the upwelling, as indicated by the larger circles on the downstream side, we can still conclude that the structure is functioning as expected, with water entering the streambed on the upstream side and reemerging downstream. Figure 11 illustrates the relative upwelling and downwelling found at the Kingfisher Control reach during identical discharge and weather conditions. Note the large apparent upwelling zones occurring in the vicinity of the alluvial fan of a tributary, which enters in the left upper side of the figure. Other than this zone, most of the points in the Kingfisher Control had weaker upwelling or downwelling tendencies than those observed in the restored (treatment) reach, and less diversity in relative intensity. The dots in the legend represent 1.0 °C temperature difference. Median and average (absolute) temperature differences between the surface and intra-gravel temperatures were larger in the treatment reach ( Figure 10) than in the control reach (Figure 11, although the maximum (2.57 °C) and minimum (−1.52 °C) were somewhat more extreme in control reach. Median and average (absolute) temperature differences in the treatment reach were larger, and the maximum and minimum more extreme, post-construction than in the pre-construction channel in 2006 [39].
This subdued intensity of hyporheic interchange at the control reach was expected, due to the limited thickness of the alluvial streambed in the control reach. Table 3 summarizes the intra-gravel temperature measurements, where temperature difference refers to (surface water)-(intra-gravel) temperature. Intra-gravel temperature was measured 10 cm below the streambed surface. The data reveal that the restored treatment site yielded a mean temperature difference 1.5 times greater than control and nearly 4 times greater than the pre-restoration site. Variability, as indicated by standard deviation (S.D.) of temperature differences, shows a similar pattern. The complete set of intragravel temperature differences is provided in spreadsheet format in the Supplementary Materials, Spreadsheet S2: Summary of surface-intragravel temperature differences.

Surface Water Longitudinal Temperature Pattern
This section presents a portion of the time series of surface water temperatures collected during the intra-gravel temperature difference study. Locations of temperature data loggers are shown in Figure 8, and the temperature measurements are shown in Figure 12. Cooling occurred during part of the day between T01 and T02, even though this was a completely open, unshaded portion of the reach during the study. The only explanation for this is some combination of hyporheic cooling due to plunge-pool No. 4, which lies between these two data loggers, and lateral input of cool subsurface water. By contrast, there was considerable solar heating evident between T02 and T03, which is similarly situated relative to groundwater influx, has similar slope and morphology, and is also without shade, but has no log sills to promote hyporheic flow.
Even more surprising is the significant (>1.5 °C) cooling that occurred between T03 and T04, the most downstream data logger. This portion of the reach is characterized by deep pools and sluggish water velocities, so in the absence of lateral subsurface flow or hyporheic cooling due to plunge-pool No. 5, we would expect substantial heating between these two points. The only suitable explanation for this significant temperature decrease is due to the cooling provided via hyporheic exchange as water passes under and through plunge-pool No. 5.  Figure 13 shows end-of-reach water electrical conductivity data compared to optimized OTIS-P model predictions for the Kingfisher Treatment site to illustrate the degree of model fit to data. The model optimization excluded data beyond 219 min in order to focus parameter optimization on the periods of rapid change, which included the rising limb, plateau, and steep portions of the falling limbs of the tracer pulse, but not the long gradual recovery to background levels. This strategy is consistent with standard procedures [50,51]. The test reach was 80.5 m long, with water discharge of approximately 0.0089 m 3 /s (0.32 ft 3 /s).  Table 4 summarizes the hydraulic conditions and optimized model parameters for the Kingfisher Treatment and Control site tracer tests. Estimates of model parameters are within the ranges expected. The discharge measured in the restoration site was only 37% as large as that measured immediately upstream in the control site (0.0089 vs 0.0238 m 3 /s or 0.32 vs 0.84 ft 3 /s). This is an indication of the large volume of water that flows subsurface in the restoration site.

Immobile versus Mobile Cross-section Ratio
Both tracer tests resulted in DamKohler number, DaI, values that are within the acceptable range of about 0.2-2.0, indicating that reliable parameter estimates can be obtained via model optimization [54]. The Kingfisher Treatment site had an I/M ratio of 3.26, which is 2.5 times as large as I/M for the corresponding control site. Both sites had noticeable amounts of stagnant water along their margins, as well as several constrictions and expansions in channel cross section that generated slackwater areas. According to the model, immobile cross sections were 1.095 and 0.266 m 2 , respectively, for the treatment and control sites. One potentially confounding issue is that the restoration site had an increased surface morphological complexity compared to control.
To ascertain the distinction between hyporheic function as separate from surface morphological complexity, it is observed that the control site had a predicted immobile cross section that was 1.33 times the predicted mobile cross section, and virtually all of this was surface marginal zone, as this reach had a very thin alluvial streambed (15-30 cm thick, or 6-12 inches) with little hyporheic component [39]. Thus, most of the 0.266 m 2 (2.9 ft 2 ) of immobile zone consists of channel margin area. If, by extrapolation, the channel margin area is represented by a similar percentage in the treatment site, we would expect a channel margin immobile zone of 0.366 × 1.33 = 0.446 m 2 (4.8 ft 2 ). The remainder of the total immobile cross section, 0.650 m 2 (7.0 ft 2 ), would then represent hyporheic zone, giving a ratio of hyporheic to mobile cross section of 1.93. Thus, the post-construction effective hyporheic cross section was almost two times larger than the mobile cross section, and about 3.2 times larger than the control reach mobile cross section.

Lateral Subsurface Hydraulic Gradient
Lateral hyporheic flow potential is illustrated through measurements of water surface elevations along the two upstream and one downstream lateral transects shown in Figure 14 The upstream transect closest to plunge-pool No. 4 (Figure 14a) shows the classic pattern of lateral hyporheic flow around a channel spanning step [55,56]. Subsurface flow moves laterally outward, away from the channel center at both sides. Likewise, the downstream transect (Figure 14c) shows how this water, after having briefly entered the floodplain, flows back into the channel from both sides. Superimposed on this local lateral hyporheic circulation, is the regional flow of subsurface water from the valley sides on the south in a northerly direction [41], encountering the right side of the stream channel (as facing downstream). This is the cause of higher surface water elevations on the right side of Figure 14a, and steeper piezometric gradients on the right side of the downstream transect, Figure 14c. The effect of this regional subsurface water flow is even more evident in Figure  14b, where the orientation of the transect is such that this regional flow overwhelms the local circulation, producing a situation of through flow from southwest to northeast. Moreover, during certain times of the year, notably, in August, when surface water is at its lowest, this regional trend overwhelms the local circulation along the downstream transect as well.

Streambed Surface and Subsurface Sand Content
Summary statistics for the surface and subsurface layers pre-project and post project for both Kingfisher and Confluence Forks sites are summarized in Table 5, with additional sample data and locations shown in Appendix A.
In order to investigate streambed evolution in terms of grain size, changes to the percentage of sand (grain sizes less than 2 mm) were computed. Data from all nine sample locations at Kingfisher and Forks Confluence are shown in Table 5, in order to provide context for the discussion of variability observed.  n/a n/a n/a n/a At Kingfisher, the median surface layer grain size, D50, ranged from 45.7 to 28.9 mm (1. Interpreting trends in streambed grain size evolution is difficult from only two points in time, given the apparent variability in grain size distribution and sand content, and given the episodic nature of sediment input to the stream system, to which the streambed must adjust. Generally, the dominant sources of coarse sediment and sand are the walls of the ravine immediately upstream from the Kingfisher Control site. Thus, it is not surprising that sand content generally increased in the surface and subsurface layers in the Kingfisher sites, which are closest to this source. An exception is the most upstream sample on the Kingfisher Treatment (KFT1), which diminished by 2% in sand content, but only in its subsurface layer. So, even in these sites closest to the source, it is possible for the natural processes of streambed scour and fill to clear fine sediment from streambed, as would happen in a more pristine watershed setting. It also bears mentioning that these changes in sand content are based on volumetric samples, and thus appear small due to the relatively greater influence of the larger gravel -sized particles on overall sample mass [53,57]. That is, the streambed surface visual appearance will change from slightly to moderately "sandy" over a fairly small range of sand percentage as measured in a volumetric sample [57]. Nevertheless, volumetric sampling is a more accurate way to quantify fine particle content than surface particle counts [57,58] when tracking processes of sediment deposition and transport, since substantial infilling of surface layer voids can occur before these fine sediments are evident in particle counts.
In order to more confidently answer the question of whether the engineered streambed is destined to steadily increase its sand content over time, it is useful to bring into this discussion the three sediment sampling sites on the South Fork, near the Forks Confluence restoration site, 1.7 km (1.1 miles) downstream from Kingfisher. These are shown in Table 5, in upstream to downstream order. Here, all three sites decreased their surface sand content over the time period from 2016-2017. This was in spite of the fact that, according to repeat channel cross-section surveys, the South Fork Confluence constructed site was gaining streambed sediment (aggrading) in some locations over this time, as it adjusted to its new hydraulic and sediment load conditions. Magnitudes of decrease in surface sand content were larger than at the control site, which also decreased its sand content in the subsurface layer.
In general, the surface layer responds more rapidly to pulses of sediment and peak flows than does the subsurface, which tends to better represent long-term average bedload size distribution [59]. However, the fact that all of the South Fork Confluence sites decreased in surface sand content is indicative that scour and fill processes that periodically mobilize, and clean, the streambed of excessive fine sediment can function in this stream system. This study time interval, 2015-2017 in the case of Kingfisher and 2016-2017 in the case of the South Fork Confluence, is too short to make definitive conclusions about long-term trends in streambed condition. The data collected to date indicate the restored engineered channel has been resilient, has continued to maintain its significant hyporheic performance, and has not become embedded despite continuous inputs of fine sediment.

Discussion
Watersheds and river systems are complex, even those urban streams that have been simplified by eliminating much of that complexity. Variability on small spatial and temporal scales is an inherent property of river systems, and therefore measurement uncertainty and complex responses are expected [35]. There are limits to what we can measure or predict. Specific metrics, which can be measured in a way that temporal and spatial variability are adequately known, may be prohibitively costly to undertake, or may require longer study duration than is practical. Evaluating the effectiveness of an individual restoration project is also made difficult by the vast difference in spatial scale between the project footprint (a few hectares) and the watershed itself (29 km 2 or 11.1 square miles), which establishes controls on hydrology, sediment dynamics, solutes, pollutants, and energy. The "signal" from a restoration treatment must overwhelm the background variability established by watershed-scale processes in order to be detected. In this situation, restoration effectiveness is best indicated not by any single attribute (such as stream temperature), but by a "weight of evidence" approach, in which a suite of attributes, taken together, provide evidence for detection of change. In this monitoring effort, five different attributes were measured. Each one was found to exhibit considerable variability in the results. However, taken together, they create a coherent picture of increased hyporheic processes, both in terms of flow and energy exchange.
Moreover, treating a restoration project as an experiment, in which the design includes a control, and a post-and pre-treatment period can be difficult. True control reaches, which resemble the treatment reach in every respect except restoration, may be impossible to find. The Kingfisher Control reach, though morphologically different in terms of cross section and floodplain occurrence, did have hyporheic conditions very similar to the pre-project treatment reach, including lateral subsurface water influx, and thus was a good control for the purposes of this study.
Hyporheic process restoration by means of an engineered streambed is very new to the community of stream restoration practitioners [11,12,14,36]. As such, it brings up three important questions. First, does it increase the magnitude and extent of hyporheic flow? Second, does it improve surface water quality? Finally, how long will it continue to function? It is to answer these questions that the weight of the evidence approach followed in this study is directed.

Vertical Water Flux
Vertical water flux measurements reveal that this hyporheic restoration design significantly increases the magnitude and extent of hyporheic process relative to both control sites and natural sites. This study documented improvements in the strength and diversity of upwelling and downwelling zones, which indicates improved hyporheic circulation. Vertical water flux was quantified, and found to be quite large, about 89 times as large, on average, as measured in the preproject channel, with a maximum 16 times as large as maximum values measured in a pristine stream in Idaho, which, presumably, has a fully functional, self-maintaining streambed.
The large variability of vertical flux over time, and over short spatial distances, could lead one to suspect methodological imprecision. However, Peter et al. [30] confirmed by direct observation that hyporheic pathways, and presumably fluxes, do change radically over short time periods. These authors conducted tests using visible die tracers to map specific hyporheic pathways through plungepool No. 4 as part of their study to quantify hyporheic residence times and identify potential chemical transformations in the streambed. They found that the exit point for water entering the same injection location changed substantially over time periods as short as a few weeks.
Although the method assumes that the hyporheic water flux in the streambed is vertical, and does not explicitly account for lateral movement, it has been shown to be fairly robust even when there are moderate amounts of lateral flow. In general, the shallower the layer (e.g., 0-10 cm as opposed to 0-20 cm), the more closely to vertical the flux will be, and the better the method fits its assumptions [46]. Lateral flux may confound vertical flux estimates, especially for deeper layers in streambed and or in locations proximate to subsurface flow inputs.

Mapping of Upwelling and Downwelling zones
The primary method to measure how well the restored hyporheic design benefits water quality is to quantify water temperature effects. Immediate effects on hyporheic process was observed by measuring the difference between the temperature of surface and subsurface water under stable, baseflow conditions in sunny weather. The restored reach had a temperature difference 1.5 times that of the control reach and 4 times that of the pre-restoration reach. The temperature difference is more significant than absolute temperature, as the temperature difference indicates the cooling potential of a system, where the absolute temperature simply reflects local, immediate ambient conditions. Any hyporheic mixing achieved on the project site with this increased temperature difference will yield an overall cooler surface water in summer conditions.
Though it was not measured here, it is also feasible that this temperature difference could operate in reverse during winter months, with the ability to warm incoming surface waters through mixing. This warming could potentially improve conditions for spawning and rearing during the cold season [42]. Overall, a system with a large intra-gravel to surface water temperature difference and robust hyporheic mixing can act as to moderate thermal changes within the stream seasonally, cooling the surface waters in the summer and warming surface waters in the winter.
This attribute does not give us a quantitative measure of vertical water flux but does provide the relative strength from one point to another, extensively, over the entire reach. The results indicated much stronger upwelling and downwelling within the restoration reach compared to the control reach, demonstrating the relative uplift in hyporheic function in the restored reach. Results also showed greater upwelling and downwelling strength in the treatment reach than had been observed in pre-project conditions.
The normalized temperature-difference measurements obtained through this method do not tell us the actual threshold between upwelling and downwelling, however. Theoretically, in the absence of significant reach-scale groundwater influx or efflux, local upwelling and downwelling should balance, and this method would give a realistic picture of the distribution of these opposing processes. However, in a reach such as Kingfisher Treatment or Control that gains substantial amounts of groundwater, it is conceivable that upwelling could occur everywhere, in which case this method would only indicate the relative strength of that upwelling.

Surface Water Longitudinal Temperature Pattern
Longitudinal surface water temperature measurements indicated consistent temperature decreases during warm, sunny conditions through the zone of channel occupied by plunge-pool No.

5.
On an established restoration site with full vegetation, this cooling would be attributable, at least in part, to shade from vegetation. However, this explanation is not sufficient here, as the site had only recently been replanted and there was no shade over the stream channel. In the absence of hyporheic water exchange or lateral groundwater input, we would expect steady increases in surface water temperature in the upstream to downstream direction, due to solar heating. Instead, the data loggers recorded a cooling as water flowed downstream, which can only be attributed to thermal exchange with the subsurface as driven by hyporheic flow under plunge-pool No. 5.

Immobile versus Mobile Cross-section Ratio
A critical element to determining how well the restored hyporheic design can restore water quality is by considering the retention time of the surface water versus the hyporheic water. Longer retention times can yield greater reduction of incoming chemicals through adsorption, biological, or chemical transformation. Using NaCl tracer studies, this study documented large improvements in hyporheic volume compared to the control reach, as represented by a more than 3-fold increase in immobile cross section found in the tracer test modeling. A larger hyporheic cross section (and volume) translates into slower subsurface flow, through hydraulic continuity, and thus into longer subsurface residence times.

Lateral Subsurface Hydraulic Gradient
Subsurface water elevations along a transect perpendicular to the stream channel is an indicator of potential for lateral or floodplain hyporheic flow. In this study, evidence for local hyporheic flow around a plunge-pool structure, latterly, into the floodplain was demonstrated, as well as a regional flow of subsurface water from south to north across the site. The greatest hydraulic gradient was evident in the winter when there is greater in-channel flow, though the pattern of lateral flow was fairly consistent between seasons. In low flow summer conditions, reach-scale conditions such as lateral inputs from groundwater can overwhelm local gradients produced by the plunge-pool structures.

Streambed Surface and Subsurface Sand Content
Determining whether the design will continue to perform successfully over longer time spans was evaluated by sampling changes in streambed sediment grain size composition, in particular, its sand content. Sampling of streambed sediment has demonstrated continued ability of the stream to maintain its engineered bed as loose, mobile, alluvial gravel, without becoming embedded with fine sand, to date. This condition is associated with good hyporheic function. The literature is vague regarding the amount of time needed to characterize a trend in evolution of streambed grain size composition (e.g., [53,58]), but judging from the fact that there were significant observed changes to some of the permanent cross sections used to monitor morphological changes in the control reaches over the period 2006-2017 [41], 10 years would not be an unreasonable estimate. Subsequent sediment composition sampling is recommended to document any future changes, which could be influenced by changes in incoming substrate size, quantities, and/or hydraulics due to changes in channel cross section.
In this, and other channel reconstruction projects, the exposed portions of large wood structures have a finite lifespan in their original designed configuration. Eventually, as the wood decays, and as the channel adjusts to sediment pulses, fallen trees, floods and other disturbances, it will attain a configuration that is self-maintaining over time, if allowed to do so. Wood remaining partly or wholly buried in the streambed (e.g., the hyporheic logs), however, is expected to have a very long lifespan, potentially on the order of a thousand years [60]. As the system retains incoming wood (falling trees and/or washed in logs) it likely will retain additional alluvium, burying the wood as it builds its channel and floodplain over time. This configuration will likely have a hyporheic zone that is much larger and higher functioning than the pre-project condition long into the future.

Relation to Other Studies and Observations
Using mass spectroscopy, Peter et al. [30] found significant reductions in the number and concentration of organic chemicals, some of which are potentially associated with coho mortality (as identified in [29]) both under both base flow and stormflow conditions through plunge-pool No. 4 at the Kingfisher site. For example, polypropylene glycols (PPGs) were reduced in concentration by 46%-100% in a hyporheic pathway with residence time of only 32 min and estimated path length 4.4 m (14.4 feet) beneath plunge-pool No. 4 (called hyporheic design element or HDE No. 4 by these authors). In a nearby hyporheic pathway of residence time 3.75 h and path length about 5.0 m (18.4 feet), the PPGs were reduced by 92%-100%. Reductions in chemical concentrations were found to be significantly larger in the hyporheic flow paths, even though these were short, than reductions in the surface flow paths. During stormflow conditions, less than 17% of the 1900 non-target chemicals (i.e., those chemicals analyzed as a group without knowing the exact identity of all of them) were reduced in the surface flow by more than 50%, compared to 59% and 78% reductions in the short and long hyporheic pathways, respectively. These researchers attribute the contaminant reduction to sorption by biofilms on hyporheic sand and gravel particles. At base flow, the majority of the flow was found to circulate through the hyporheic zone, about 20% at each of the six structures. During storm flow conditions, 20%-60% of the flow was found to circulate through the hyporheic zone, indicating substantial water quality treatment potential associated with the plunge-pool structures.
Researchers documenting the biological response to these projects also detected differences between constructed and control sites. They sampled biota from the hyporheic zone at a depth of 15-25 cm (6-10 inches) beneath the streambed and found greater macroinvertebrate density and taxa richness in the treatment than in the control reaches [37]. Even though there was substantial variability from 2015 to 2016 (i.e., one to two years, post construction), the treatment reach consistently was higher than the control in both of these metrics. They also documented increased microbial metabolic activity as indicated by carbon metabolism, and changes to microbial and invertebrate taxonomic structure.
Part of the biological response study involved experimental inoculation of the hyporheic zone with macro-and micro-invertebrates from a more pristine reference stream, the Cedar River, located in forested setting in the municipal watershed for the City of Seattle. Mesh baskets of gravel were placed on the Cedar River streambed for a period of several months, then taken to Thornton Creek and placed into vertical perforated pipes embedded into the engineered streambed. However, only small, transient changes to the microbial taxonomic community structure were observed, and no significant changes to macroinvertebrate density or structure [37].
In support of this result, during autumn 2018, Chinook salmon were found spawning in the restored Forks Confluence [61]. This is the first documented spawning in this area ever since spawning surveys began in 1999, and is attributed to the improved streambed condition (which was visibly apparent in 2018, four years post-construction), the wider, more complex channel morphology, and restored hyporheic function providing the needed conditions to match spawning requirements.

Future Prospects of Hyporheic Restoration
Significant water quality benefits have been documented on a local scale by evaluating in detail a handful of parameters on an individual plunge-pool structure among the six plunge-pool structures in the Kingfisher restoration site and eight on the Confluence restoration site. Water quality within a stream is degraded by diffuse interactions with impervious surfaces, unshaded open water, and exposure to environmental chemicals occurring throughout the watershed. The next step is to explore how to correlate the amount of stream and hyporheic restoration, which is required to provide significant water quality benefits to treat that incoming pollution from a watershed, and how this work can be integrated into a larger vision which also incorporates current input-water engineered treatment solutions such as vegetated filter strips and stormwater retention structures. In the broadest sense, it is worth considering whether stream and hyporheic restoration, when implemented at scale throughout a watershed, serve as a superior replacement to current engineering practices and provide water quality enhancements with combined habitat uplift benefiting instream fauna as well as riparian functions including improvements to air quality, avian habitat, and soil moisture retention.

Conclusions
This new approach to enhanced hyporheic design successfully restored hyporheic processes and yielded significant water quality improvements compared to control and pre-project conditions. The engineered streambed and large-wood structures central to this design have yielded vertical water flux averaging 89 times greater than pre-project conditions, and with a maximum 16 times greater than that found in a pristine watercourse. Mapping of upwelling and downwelling zones in the restored reach yielded a surface-water to intra-gravel temperature difference 1.5 times that of the control reach and 4 times that of the pre-restoration reach, demonstrating the ability of the restoration design to yield an improved hyporheic circulation in summer conditions. Surface water longitudinal temperature patterns documented greater than 1.5 degrees of cooling as the water passed through a plunge-pool structure in full sun in the middle of the day, which can only be attributed to hyporheic exchange driving flows subsurface and subsequent cooling. Effective hyporheic zone cross section, as estimated by the immobile to mobile cross-section ratio, improved by a factor of 3.
The engineered streambed has been resilient to the influence of its sediment load. Surface and subsurface sand content, an indicator of hyporheic performance, has shown no observable tendency for the streambed to revert to the embedded, impermeable state that existed before project construction, at least over 2-3 years of sampling. The design has thus proven capable of maintaining natural scour and fill processes which slow or reverse embedded conditions and may actually improve over time through natural stream evolution, including processes of large-wood recruitment and associated accumulation of more diverse and extensive alluvial streambed material.
Other researchers studying the water chemistry and biology have shown that this hyporheic restoration has also been effective at significantly reducing concentrations of incoming pollutants [29], and at enhancing both hyporheic microbial heterotrophic production and macroinvertebrate taxa diversity and richness [37].
Supplementary Materials: The following are available online at www.mdpi.com/xxx/s1: Spreadsheet S1: Summary of computed vertical water flux, and Spreadsheet S2: Summary of surface-intragravel temperature differences. Funding: This research was funded by the City of Seattle, contract No. 14-133-A. Table A1. Streambed surface layer grain size statistics for all 9 Thornton Creek sites (see Figure A1).

Site
Year  Table A2. Streambed subsurface layer grain size statistics for all 9 Thornton Creek sites (see Figure  A1).   Tables A1 and A2.