Field Calibration of TDR to Assess the Soil Moisture of Drained Peatland Surface Layers

The proper monitoring of soil moisture content is important to understand water-related processes in peatland ecosystems. Time domain reflectometry (TDR) is a popular method used for soil moisture content measurements, the applicability of which is still challenging in field studies due to requirements regarding the calibration curve which converts the dielectric constant into the soil moisture content. The main objective of this study was to develop a general calibration equation for the TDR method based on simultaneous field measurements of the dielectric constant and gravimetric water content in the surface layers of degraded peatlands. Data were collected during field campaigns conducted temporarily between the years 2006 and 2016 at the drained peatland Kuwasy located in the north-east area of Poland. Based on the data analysis, a two-slopes linear calibration equation was developed as a general broken-line model (GBLM). A site-specific calibration model (SSM-D) for the TDR method was obtained in the form of a two-slopes equation describing the relationship between the soil moisture content and the dielectric constant and introducing the bioindices as covariates relating to plant species biodiversity and the state of the habitats. The root mean squared error for the GBLM and SSM-D models were equal, respectively, at 0.04 and 0.035 cm3 cm−3.


Introduction
Accurate determination of soil moisture is important for various hydrological, agricultural and environmental applications.In hydrology, soil moisture content plays a major role in the water cycle by controlling hydrological fluxes, such as infiltration, runoff and groundwater recharge [1].In climate studies, soil moisture governs the energy flux exchange between the land surface and the atmosphere through its impact on evapotranspiration [2].Determination of the soil moisture content is very important for hydrological studies in peatland soils, the actual condition and preservation of which depends on water-related processes [3][4][5].Peatland soils occupy about 400 million hectares, which is about 3% of the world's land [6].Most of these soils are located in the northern part of the globe, including large areas of North America, Europe and Russia.It is estimated that peat soils contain about one-third of global soil organic carbon stock [7].The last GIS-based estimation of peatland areas in Europe shows that this type of ecosystems occupies approximately 593,727 km 2 and 46% of this area is classified as peatland where the peat forming process was stopped [8].Stopping the forming process was associated with artificial land drainage in response to agricultural, forestry and horticultural demands [9].The great majority of drained peatland is used as meadow and pasture, and only a limited percent as arable land [10].Drainage of peat soils leads to the formation of a new organic soil material in the surface layers known as muck (moorsh), which has a granular structure where the original plant remains are not recognizable and contains more mineral matter than peat [11].
Soils contain twice as much organic carbon than the amount of C in the atmosphere [12], therefore any changes in the terrestrial pool are usually associated with the impact of global warming [13,14].The importance of peatlands in global warming relates to the fact that 14% to 20% of these areas globally are currently used for agriculture, mainly as meadows or pastures, and oxidized soils of these ecosystems are a source of carbon, which has an influence on the global increase of CO 2 emissions [15].A literature review shows that there are three major factors used to quantify the amount of CO 2 emission from drained peat soils, i.e., soil temperature [16][17][18], soil moisture content status [17,19,20] and water table depth [21].The increase of soil temperature and optimal soil moisture status create favorable conditions for the increase of CO 2 efflux.Soil moisture content is one of the main hydrological factors to be assessed in the calibration and validation of activities in global soil moisture products (spaceborne), as well as in order to increase the understanding of water, energy and carbon flux exchanges at the active surface on a local scale [22].
Over the last four decades, the most popular in situ soil moisture measuring technique was based on measurements of the dielectric properties of the medium in time domain.This method is named Time Domain Reflectometry (TDR) and its first application in soil science was reported in 1980 [23].The theory behind this measurement technique is based on travel time analysis of the short electromagnetic step pulse which formed an electrical field along a waveguide (e.g., two parallel rods of a given length) surrounded by the soil [24].The time required to travel the length of the waveguide (there and back) is recorded and used to calculate the dielectric constant (K a ) of the medium [25].The attractiveness of the TDR method results from the simple non-destructive and quick measurement of the dielectric constant which can be transformed using the calibration curve to the volumetric soil moisture content.However, many TDR probes on the market are often not properly tested for the high dielectric permittivity values of peat soils and that the algorithms for the TDR waveform analysis might fail in that range [26].The relationship between the measured K a and moisture content can be derived either by using the dielectric mixing model [27,28] or by empirical equations [25].In peat soils, in most cases, the calibration curves are empirical equations obtained by fitting a mathematical model of the measured dielectric constant to the gravimetrically measured soil moisture content data.The majority of calibration equations for peat soils are third-order polynomials with respect to the raw measured dielectric constant [22,23,[29][30][31][32] or its transformed form as a square root [33].The polynomial calibration equations are very flexible and more efficient than the manufacturer default relationships (determined by developers of the TDR measurement systems) but they are appropriate as a site-or soil-specific equations which enables improved accuracy of soil moisture content estimation [34].A more general calibration equation for the organic materials can also be expressed in the form of a nonlinear relationship between the soil moisture content and the dielectric constant [22,35,36].Another popular group of empirical equations constitutes the linear relationship between the soil moisture content and the square root of the dielectric constant.Malicki et al. [37] developed an empirical model which incorporates soil bulk density as second covariate, in addition to the Ka 0.5 , thereby improving the moisture content predictions for a broad range of soils, also including organic soils.Oleszczuk et al. [38], using a similar data analysis technique, obtained a general empirical model describing soil moisture content as a function of the dielectric constant, bulk density and ash content for different top organic soil layers from the Biebrza River Valley.However, this type of calibration procedure, apart from the dielectric constant measurements, requires extra knowledge regarding other soil properties in order to predict its moisture content status.Despite the large number of calibration equations for determination of the soil moisture content using TDR, they were developed mostly under laboratory conditions and their use is limited to specific sites or dedicated soils.
Peat soils are characterized by soil volume changes due to soil moisture content changes.Peat soil shrinks during drying and its bulk density increases while wetting, causing soil swelling and a decrease in bulk density [36,[39][40][41][42]].In the majority of laboratory calibrations using TDR, bulk density changes are not considered as a factor influencing the dielectric constant.
Interdependence of peat properties and vegetation cover is often observed in peatlands [43].The linkage between the physical and chemical properties of soil and the state of plant cover is often described using diversity indices representing plant species [44] or using more complex environmental indicators [45].Biodiversity is a useful metric for quantifying the composition and abundances of the plant species [46] providing information about community structure and it may also be used as an indicator of the ecosystem's health [47].In Europe, the Shannon-Wiener index (H') is a commonly used bioindicator for ecological assessment of the quality of ecosystems [48].According to its value, the ecosystem's status can be classed from "Bad" (H' ≤1) to "High" (H' >4).In between there are three classes of ecosystem quality: "Poor" (1 < H' ≤ 2), "Moderate" (2 < H' ≤ 3) and "Good" (3 < H' ≤ 4).The species composition of grassland is considered a good indicator of habitat properties which are frequently quantified using species indicator values [49].Ellenberg indicator values (EIV) [45] are the most commonly used ecological indicators in Central Europe because they summarize complex environmental factors including: Groundwater level, soil moisture content, precipitation, humidity, etc. in one figure and they are calculated on the basis of complete species composition of plant communities [50].The system of indicators, as compiled by [45], considers vascular plant species with respect to: Light intensity (L), mean annual air temperature (T), continentality (K), substrate moisture (F), substrate reaction (R), substrate nutrient availability (N) and salt content of the substrate (S).Six of these indicators have a nine-point scale and 12 levels have been established for the F value.The present definition of the EIV system and the re-interpretation of the L indicator were presented in the study by [51].Despite the ordinal character of the EIV indicators [50], their average estimators of a vegetation record provide helpful environmental site information, which can substitute costly and time-consuming measurements [52][53][54].
Taking into account the importance and range of occurrence of drained peatland ecosystems in the northern hemisphere, we have tried to develop a TDR calibration model which enables the monitoring of soil moisture content being the most important factor controlling environmental processes at the active surface.We have assumed that it is possible to establish an accurate relationship between soil moisture content and soil dielectric properties based on the field measurements of both features.In light of the literature review, we have also supposed that the soil moisture content prediction using K a values can be improved at the site scale by introducing the bioindicators describing the biodiversity of plant species and the state of habitats into the calibration equation.

Study Area and Site Characteristics
The study area was located in the Middle Basin of the Biebrza River Valley (22  75 N) within the Kuwasy drainage sub-irrigation system [55].The first drainage system in this area was created between 1933 and 1939 using open channels.Then, the drainage sub-irrigation ditches were constructed between 1951 and 1960.Due to the high rate of peat soils subsidence they were deepened in between 1977 and 1988.The Kuwasy Channel is the main part of the land reclamation system (about 15 km long) which connects the Rajgrodzkie Lake (the main reservoir of water used for the irrigation of the Kuwasy peatlands) with the Rudzki Channel.In this study, six experimental sites characterized by different soil properties and vegetation patches were selected (Figure 1).The term "site" used in this study is defined as a part of the field quarter covered by representative and relatively homogenous vegetation patches.All the studied sites were used as meadows.Two sites were located near Biebrza Village (site I and II) and the soil profiles were developed from alder peat.Next, two profiles were situated near the Stoczek Stream (III, IV) where soil upper layers had evolved from sedge-reed peat deposits.The last two experimental sites (V, VI) were located at the border of the Kuwasy system near the Jegrznia River.The reed peat was the parent material of these soil profiles.The underlying peat soil layers (below 15 cm) in sites I and II were classified as sapric Histosols horizons whereas peats from the other soil profiles were classified as hemic Histosols [56].The upper layers (up to 15 cm) of the studied soils were formed as a result of long-term water table lowering.Land drainage resulted in the soil mucky process which in turn led to developing a new organic soil material with specific chemical, physical and biological properties [57].The surface layers are often referred to as muck (moorsh) material [58].

Field Measurements and Basic Soil Properties
The set of field data was collected during 23 field campaigns conducted between 2008 and 2016 for sites I, III-VI, and 12 field campaigns during the vegetation seasons between 2006 and 2007 for site II.During the measurement dates at the selected sites, a soil pit was dug to the depth limited by the ground water table position.The measurements were done in the top layers (0-15 cm) recognized as the surface soil layers dominated by moorsh material.The dielectric constant was measured in the soil profiles at three locations spaced horizontally by 30 cm at three different depths: 5, 7.5 and 12.5 cm.A slightly different measurement methodology was applied at site II, where measurements were conducted at a depth of 10 cm.In each measurement spot, the two rods of a TDR probe (10 cm long) were installed horizontally for recording the dielectric constant.The TDR probe was a part of the measurement system (field operator meter), which also comprises the pulse generator Water 2018, 10, 1842 5 of 22 module producing a Gaussian needle pulse, with rises/falls at about 200 ps [59].After the dielectric constant measurements, the soil profiles were sampled for laboratory determination of gravimetric soil water content and soil bulk density.Undisturbed soil samples were collected into horizontally-oriented plastic cores (5.4 cm diameter and 11.8 cm height, 270 cm 3 volume), coaxially inserted in the place of K a measurements.The collected soil cores were sealed with a rubber cover at both ends and packed into thermal plastic bags to avoid water loss through evaporation and stored in a cooler at 4 • C until laboratory tests were conducted.In laboratory conditions, each soil sample was weighed and was oven dried at 105 • C for 24 h after air drying.Soil bulk density was calculated by dividing the oven-dry sample weight by the core volume.Gravimetric moisture content was calculated for each sample and then converted to volumetric moisture content by multiplying the gravimetric moisture content by soil bulk density.In total, 278 measurements regarding the soil moisture content, soil bulk density and dielectric constants were obtained (the measurement data for each site are shown in Table 1).
Additionally, in the laboratory conditions, the average ash content (AC) for each soil depth was determined using the ignition method.For this reason, the disturbed sub-samples (the number of samples are shown in Table 1) were ignited in a muffle furnace at 550 • C for 4 h.The specific soil particle density (ρ p ) of the analyzed upper layers of six soil profiles was calculated assuming that an increase of ash content of about 1% causes an increase of the ρ p parameter of about 0.011 g cm −3 with respect to the nominal value equal to 1.451 g cm −3 [60,61].

Soil Water Retention Characteristics
The undisturbed soil samples (standard volume 100 cm 3 ) were collected in three replications from the top layers (at depth 7.5 cm) of the six studied peat-muck soil profiles in order to determine soil water retention characteristics (pF curves).The moisture content values in the range of predefined matric potential from 0.4 to 2 pF were determined on a sand table, whereas the amount of water retained at pF: 2.2, 2.5, 2.7, 3.4 and 4.2 was measured in pressure chambers [62].The pF curves for each soil profile are represented in this study by average values of moisture content at nine predefined soil matric potentials.

Botanical Composition of Plant Cover
For each of the selected sites, the detailed quantitative botanical composition of plant species was measured in 2008 before the first cut (end of May or beginning of June).The full developed plants were cut from four randomly selected areas of 1 m 2 , each located next to the place of the soil property measurements.The biomass samples were stored in paper bags and then the dry weight of each species was determined in laboratory conditions.Based on the gravimetric measurements, the mass percentage of the particular species as a total dry mass was calculated.The list of 70 vascular species in total was made for all selected sites can be seen in Table S1.The floristic type of the meadow community was determined based on the dominant species, which accounted for at least 30% of the biomass."Pure" types (domination of one species), "mixed" types (predomination of two species together constituting at least 30% of the biomass) were specified.To characterize the habitat conditions for all meadow communities, the weighted means of Ellenberg's indices [50,52] were calculated.The proportion (%) of vascular plants in biomass samples was multiplied by the Ellenberg index of this species, i.e., light-L (1-9); soil moisture-F (1-12); soil acidity-R (1-9); soil nitrogen-N (1-9) according to [45], and then the results of multiplication for all species of plant community were summed.The value obtained was divided by 100 and, in this way, the values of individual features for the whole community were obtained.Additionally, the species composition of each site was characterized using the biodiversity Shannon-Weiner's index [63,64].The calculation results of the Ellenberg indices (L, F, R, N) and Shannon-Weiner's indices (H') are presented in the Supplementary Materials shown in Table S2.

Data Analysis
Segmented linear regression was applied to establish the relationship between the volumetric soil moisture content (θ v ) and the dielectric constant (K a ) for all sites and the whole range of the analyzed dataset.This type of regression is a piecewise linear model, which describes the relationship between response and explanatory variables and is represented by at least two straight lines crossed in a breakpoint [65].In this study, it was assumed that the established model is continuous, including the breakpoints (Ψ).When there is only one breakpoint, the simplest piecewise regression model can be written as follows [66]: or where: β 0 is intercept, β 1 and β 1 + β 2 are the slopes of the segmented lines (β 2 can be interpreted as a difference in slopes), and e values are assumed to be independent errors with a zero mean and constant variance.The normality of e values has been tested using the Shapiro-Wilk test and the p-value statistic.The iterative techniques included in the R segmented package [67], was applied to identify the breakpoint (Ψ) and to fit the regression model into the collected data.The broken-line model (segmented regression) was used to establish the general broken-line model (GBLM) and site-specific broken-line model (SSBLM).The root mean square error (RMSE) was used as a measure of the differences between the moisture content predicted by a model(s) and the actually observed values.The RMSE statistic is frequently used to evaluate different calibration curves for the TDR method [25] or for comparisons of the empirical models dependent on different sets of explanatory variables [68].Additionally, the bias representing the mean difference between the expected (using the TDR calibration model) and the measured values of soil moisture content was determined, as indicated in [22].The t-test was used for comparison of the statistical difference between the data subsets.
The applicability of the bioindicators for soil moisture prediction was studied using a correlation matrix and linear regression.Moreover, we analyzed the influence of the EIV and H values on the hydro-physical properties of the mucky soil.
Based on the results, with respect to the field soil dielectric properties and bioindicators of the site, the model for soil moisture content determination for upper layers of degraded peatlands was proposed in the form of: Site-specific model discontinuous (SSM-D) and site-specific model with re-fitting of ψ r value (SSM-R).The calculations were made using the RWeka package [69].For this purpose, we used the M5Rules algorithm [70,71] belonging to the regression tree method.The basic assumption of this type of algorithm is dividing the entire set of samples by selecting one independent input variable among all and then performing the binary split into at least two child sets, which should increase Water 2018, 10, 1842 7 of 22 the purity of the data compared with its single parent node [72].The nodes which cannot be further partitioned are called leaves.After growing the tree and the pruning process (removing the leaf with insignificant influence on purity) the linear regression can be fitted for the growing leaf in order to improve the fit of the model [70,73].The tree construction process, as described above, provides an approximation of function specific for each final node which can lead to sharp discontinuities occurring between adjacent leaves and requires smoothing of the tree after the full set of rules has been produced [74].During the learning process, we applied a cross-validation scheme [75] to establish the SSM-D model.We used all six possible different combinations of the data from five sites for the training of the M5Rules algorithm, the remaining dataset was used for testing.Each analyzed dataset included continuous variables, i.e., soil moisture content (θ v ) and dielectric constant (K a ) measured for all investigated sites, as well as categorical covariates represented by bioindices (F, L, N, R, H').The M5Rules algorithm was run six times to fit the data and the best SSM-D model was selected using the RMSE value for testing of the data.After that the SSM-D was iteratively smoothed and the threshold value ψ r and θ vr was determined, finally resulting in SSM-R model determination.

Soil Properties and Botanical Composition of Plant Cover
Table 1 summarizes the basic soil properties represented by the measured value of soil bulk density and ash content for the studied mucky horizons (0-15 cm).The average soil particle density as well as the porosity of the analyzed top soil layers was calculated using the basic physical properties of the soil.
The higher values of this feature compared to peat soils can be attributed to the intensive cultivation practice and soil compaction.Despite cultivation differences of peat soils in European countries, the analyzed average soil bulk density values (ρ b ) in the upper layers were comparable with peat soils agriculturally cultivated in England [76], Sweden [77], Norway [78] and Germany [79].The ash content in the analyzed mucky soil was low and this results from the lack of temporary river flood conditions on the selected sites during moorsh soil development.The average porosity of the studied moorsh horizons was lower, approximately 0.05 to 0.10 cm 3 cm −3 compared to the underlying original peat deposits [40,42,61].
The mucking process therefore, apart from the basic physical property changes of the soil, seriously affected the soil moisture retention characteristics (Table 2).The presented pF curves of the mucky soils are characterized by a relatively low amount of gravitational water (the difference between porosity and soil moisture content at pF = 2.0) compared to the peat soils. 1 mean-arithmetic average (cm 3 cm −3 ). 2 SD-standard deviation (cm 3 cm −3 ). 3 CV-coefficient of variation (%).
Water 2018, 10, 1842 In the present study, these soil water retention indicators were in the range between 0.11 and 0.20 cm3 cm −3 , whereas in the study of [42], in the peat soils, the amount of gravitational water was higher and equal to approximately 0.34 cm 3 cm −3 .However, the soil moisture content at the permanent wilting point (pF = 4.2) of the studied mucky layers was relatively high and remains in the same range as in the peat soils [42].The obtained results of hydro-physical properties of the mucky layers show an increment of mineral compounds in the soil matrix compared to the peat soils, which indicates that the structure of this organic material dramatically changed due to drainage [80] and cultivation practice, and evolved from fibrous into a grainy or humic structure [61].
The plant communities have been classified in the five floristic types, two "pure" (II, V-VI sites) and three "mixed" (Table 3).I, V and VI sites were characterized by the predominant occurrence of low grasses mainly Festuca rubra, and at site VI also sedge (Carex panicea).The Shannon-Wiener index in the analyzed plant communities was moderate [48] and oscillates (apart from site VI) in the range from 2.09 to 2.43.VI site was characterized by low diversity indices mainly because of the lowest number of species (20 species) and the predominance of two main species (about 73% of biomass).The highest values of the EIV measures were noted for the II site.The floristic composition of plant communities of the studied sites indicates the diversity of ecological conditions of the habitats (Table 3) thereby confirming the assumptions formulated in the materials and methods section focusing on the biological diversity of the plant cover.

Analysis of Existing Relationships between Dielectric Constant and Soil Moisture Content in Organic Soils
The relationship between the measured soil moisture content and the dielectric constant in respect to the field site is presented in Figure 2. Data are illustrated together with the TDR calibration curves for peat soils available in the literature listed in Table 4.A simple graphical inspection of the presented features indicates that any of the applied calibration equations do not fit the whole range of our dataset.The best prediction of the TDR soil moisture content for moorsh soil from the Biebrza River Valley can be approximated using equations P4 and PS2 [22].
The RMSE for these models was nearly 0.06 cm 3 cm −3 .Despite the reasonably good results of the RMSE statistic, the mentioned equations gave a satisfactory prediction of soil moisture content at different ranges of dielectric constant changes.The most appropriate literature model P4 enabled a reasonably good approximation of the soil moisture content in the range of dielectric constant between 10 and 20.The highest overestimation of soil moisture content (of about 0.15 cm 3 cm −3 ) occurred when the dielectric constant was equal to 30. Figure 2 also shows the measured data of the soil moisture content and the dielectric constant and their fitting using the logarithmic model (NL_own) The following form of equation was obtained: Water 2018, 10, 1842 The calculated RMSE value was essentially lower (0.0452 cm 3 cm −3 ) in comparison to the P4 and PS2 models.However, Equation (4) overestimated the moisture content by as much as 0.15 cm 3 cm −3 in the middle range of the dielectric constant changes.
conditions of the habitats (Table 3) thereby confirming the assumptions formulated in the materials and methods section focusing on the biological diversity of the plant cover.

Analysis of Existing Relationships between Dielectric Constant and Soil Moisture Content in Organic Soils
The relationship between the measured soil moisture content and the dielectric constant in respect to the field site is presented in Figure 2. Data are illustrated together with the TDR calibration curves for peat soils available in the literature listed in Table 4.A simple graphical inspection of the presented features indicates that any of the applied calibration equations do not fit the whole range of our dataset.The best prediction of the TDR soil moisture content for moorsh soil from the Biebrza River Valley can be approximated using equations P4 and PS2 [22].4.
Table 4. Selected literature calibration models for time domain reflectometry (TDR) methods and root mean square error (RMSE) statistics with respect to all collected dataset, as well as for data of the "dry" and "wet" segment of the general broken-line model (GBLM).

General and Site-Specific Relationships between Dielectric Constant and Soil Moisture Content
Based on the graphical inspection and simple statistical analysis, we decided to select the continuous broken-line model as an approximation to describe the non-linear relationship between soil moisture content and dielectric constant of the mucky soil (for all measured data of θ v and K a ).For this purpose, the segmented linear regression was applied and the model parameters (Equations ( 1) and (3)) were estimated iteratively [67].The results of the statistical computation indicated that the established continuous general broken-line model (GBLM) included one threshold parameter (Ψ), and can be written as: After rearrangement the Equation ( 6) can be written in its final form as follows: The break-point (Ψ) for the analyzed dataset was determined for K a equal to 31.49(Figure 3).The 95% confidence interval for this parameter was wide and ranged from 27.22 to 35.76 (Figure 3).Residuals from the model (Equations ( 5) and ( 6), or ( 5) and ( 7)) were normally distributed around the average which was confirmed by the Shapiro-Wilk testing statistic (W = 0.9919) and the p-value (0.1302).The RMSE value for the GBLM equations equals to 0.0405 cm 3 cm −3 and is essentially lower than the estimates determined for the equations indicated in Table 4.This type of broken-line relationship is often used to assess the threshold value where the effect of the covariate changes.In the fields of hydrology and soil hydrology, the piecewise linear regressions (PLR) and the determined threshold values are used to explain the influence of the vegetation cover fraction on the reduction of runoff [84] and the erodibility coefficient [85].In the water study, the two-segmented broken-line equation is applied for predicting water electrical conductivity using a portable X-ray fluorescence meter [86].The meaning of a break-point in this study of the TDR calibration curve can be explained as the point depending on the soil water retention characteristic (horizontal lines on Figure 4).The threshold value (Ψ) with the confidence interval in general was included between horizontal lines which indicated the moisture contents at matric potential between pF values from 2.0 to 2.7.The average moisture content at break point (θ ψ = 0.6044 cm 3 cm −3 ) was approximately equal to soil moisture content at pF = 2.2 (θ v = 0.6020 cm 3 cm −3 ).The validity of the threshold value can be explained by bulk density changes (Figure 4).
Water 2018, 10, x FOR PEER REVIEW 10 of 21 average which was confirmed by the Shapiro-Wilk testing statistic (W = 0.9919) and the p-value (0.1302).The RMSE value for the GBLM equations equals to 0.0405 cm 3 cm −3 and is essentially lower than the estimates determined for the equations indicated in Table 4.This type of broken-line relationship is often used to assess the threshold value where the effect of the covariate changes.In the fields of hydrology and soil hydrology, the piecewise linear regressions (PLR) and the determined threshold values are used to explain the influence of the vegetation cover fraction on the reduction of runoff [84] and the erodibility coefficient [85].In the water study, the two-segmented broken-line equation is applied for predicting water electrical conductivity using a portable X-ray fluorescence meter [86].The meaning of a break-point in this study of the TDR calibration curve can be explained as the point depending on the soil water retention characteristic (horizontal lines on Figure 4).The threshold value (Ψ) with the confidence interval in general was included between horizontal lines which indicated the moisture contents at matric potential between pF values from 2.0 to 2.7.The average moisture content at break point (θψ = 0.6044 cm 3 cm −3 ) was approximately equal to soil moisture content at pF = 2.2 (θv = 0.6020 cm 3 cm −3 ).The validity of the threshold value can be explained by bulk density changes (Figure 4).The soil bulk density of the two segments was statistically different.For the "wet" range (segment wet above the threshold Ψ), the average bulk density was equal to 0.31 g cm −3 , whereas at the "dry" range (segment dry) its estimates are equal to 0.33 g cm −3 .It can be concluded that below the breakpoint of the θv(Ka) characteristic, the significant increase of bulk density indicates the essential soil volume changes.The field study conducted for similar soils confirmed that, apart from the soil moisture contents changes [87], the soil bulk density also changes in space and time [40].
Despite improvements in the GBLM model in comparison to the literature equations, the bias of the The soil bulk density of the two segments was statistically different.For the "wet" range (segment wet above the threshold Ψ), the average bulk density was equal to 0.31 g cm −3 , whereas at the "dry" range (segment dry) its estimates are equal to 0.33 g cm −3 .It can be concluded that below the breakpoint of the θ v (K a ) characteristic, the significant increase of bulk density indicates the essential soil volume changes.The field study conducted for similar soils confirmed that, apart from the soil moisture contents changes [87], the soil bulk density also changes in space and time [40].
Despite improvements in the GBLM model in comparison to the literature equations, the bias of the calibration equation depends on site-specific data measurements of θ v (K a ).The segmented model gave a weak prediction of the soil moisture content for site II (bias equal to approximately 0.044 cm 3 cm −3 ) as well as in the dry range of the soil moisture content in sites VI, V and IV.The analysis of the GBLM model limitations forced us to develop the site-specific broken-line model (SSBLM).For this purpose, the linear regression lines were fitted for the analyzed sites in respect to data below and above the threshold value (Figure 3).The obtained slopes (β1, β1 + β2) and intercepts (β0) enabled the calculation of the breakpoint value separately for the soil from the specific sites.The parameters of the SSBLM models are summarized in Table 5. Fitted lines below the threshold ("dry" range) were characterized by similar slope values (average value equal to 0.0136 and CV = 6.31%).In the "wet" range, slopes (β1 + β2) were generally flatter than in the "dry" range (β1) and relatively more variable (CV = 24%) mainly because of the lowest slope of the θv(Ka) relationship for site II.Except for this site, the slope parameters in the "wet" range were also not essentially different.The Ψ parameter represents a wide range of changes in the The analysis of the GBLM model limitations forced us to develop the site-specific broken-line model (SSBLM).For this purpose, the linear regression lines were fitted for the analyzed sites in respect to data below and above the threshold value (Figure 3).The obtained slopes (β 1 , β 1 + β 2 ) and intercepts (β 0 ) enabled the calculation of the breakpoint value separately for the soil from the specific sites.The parameters of the SSBLM models are summarized in Table 5.
Fitted lines below the threshold ("dry" range) were characterized by similar slope values (average value equal to 0.0136 and CV = 6.31%).In the "wet" range, slopes (β 1 + β 2 ) were generally flatter than in the "dry" range (β 1 ) and relatively more variable (CV = 24%) mainly because of the lowest slope of the θ v (K a ) relationship for site II.Except for this site, the slope parameters in the "wet" range were also not essentially different.The Ψ parameter represents a wide range of changes in the dielectric constant (Table 5) which directly influences the really wide scatter of the soil moisture content (θ Ψ ) estimated at break point (Ψ).Despite the low CV value, the moisture content at the break points (θ Ψ ) ranged from 0.50 to 0.70 cm 3 cm −3 which represents a relatively wide scatter for this type of soil.The comparison of θ Ψ values with the range of moisture content of pF curves indicated that the threshold values were included in the range of the soil matric potential approximately between pF = 2.0 (site II) to pF = 2.5 (site IV).  Figure 5 shows the measured data of the soil moisture content and the dielectric constant for the specific sites and their fitting using the segmented model (SSBLM) and the logarithmic equations (NL type).
Water 2018, 10, x FOR PEER REVIEW 12 of 21 dielectric constant (Table 5) which directly influences the really wide scatter of the soil moisture content (θΨ) estimated at break point (Ψ).Despite the low CV value, the moisture content at the break points (θΨ) ranged from 0.50 to 0.70 cm 3 cm −3 which represents a relatively wide scatter for this type of soil.The comparison of θΨ values with the range of moisture content of pF curves indicated that the threshold values were included in the range of the soil matric potential approximately between pF = 2.0 (site II) to pF = 2.5 (site IV).
Figure 5 shows the measured data of the soil moisture content and the dielectric constant for the specific sites and their fitting using the segmented model (SSBLM) and the logarithmic equations (NL type).The SSBLM model developed separately for the soils from different sites indicates the reasonably good results of fitting in all ranges of the θv(Ka) relationship changes.The RMSE values were varied depending on the site data.For the calibration curves from sites III, V and VI, the values did not exceed 0.03 cm 3 cm −3 .The RMSE values for the rest of the developed SSBLM equations were slightly higher and varied from 0.036 cm 3 cm −3 (site I) to 0.041 cm 3 cm −3 (site IV).It should be stressed that logarithmic type models developed for specific site were less accurate and the RMSE value was generally higher (of about 0.005 to 0.012 cm 3 cm −3 ) in comparison to the SSBLM models.The logarithmic models underestimated the soil moisture content by as much as 0.07 cm 3 cm −3 in the wet The SSBLM model developed separately for the soils from different sites indicates the reasonably good results of fitting in all ranges of the θ v (K a ) relationship changes.The RMSE values were varied depending on the site data.For the calibration curves from sites III, V and VI, the values did not exceed 0.03 cm 3 cm −3 .The RMSE values for the rest of the developed SSBLM equations were slightly higher and varied from 0.036 cm 3 cm −3 (site I) to 0.041 cm 3 cm −3 (site IV).It should be stressed that logarithmic type models developed for specific site were less accurate and the RMSE value was generally higher (of about 0.005 to 0.012 cm 3 cm −3 ) in comparison to the SSBLM models.The logarithmic models underestimated the soil moisture content by as much as 0.07 cm 3 cm −3 in the wet range of calibration curve.From this reason for further analysis, we applied broken-line model parameters as an approximation of the non-linear relationship between K a and θ v .

Applicability Analysis of the Bio-Indices in Soil Moisture Prediction and Hydro-Physical Properties of Mucky Soils
Feasibility analysis of the calibration model development for the moorsh moisture content prediction as a function of dielectric constant with the participation of the covariates represented by bioindices is illustrated using the correlation coefficients (Figure 6).
The broken-line model parameters (Table 5) were correlated with the bioindices (Table 3).The highest correlation coefficient has been noted between intercept (β 0 ) and indicator L of the Ellenberg scale.However, the β 0 also correlates with "productivity" of the ecosystem (N) as well as with the Shannon-Weiner index (H').The slope in the "dry" range of the calibration curves (β 1 ) was strongly but negatively correlated with "productivity" of the ecosystem (N) determined on the basis of the plant cover as well as positively correlated with the ash content (AC) of the analyzed soils.Nevertheless, both of the explained covariates (AC and N) were moderately and negatively correlated, which can suggest that the increase of the AC amount in the soil causes reduction of the "productivity" of the analyzed ecosystems.The slope in the "wet" range of the calibration curves (β 1 + β 2 ) can be approximated using the soil moisture substrate indicator (F), which in the case of our studied sites describes fresh soil with moderate wetness [51].Due to the low range of the F indicator changes and moderate variation of the β 1 + β 2 parameter, the prediction of the slope in the "wet" range of the calibration curve can be more complex than simple application of the linear relationship.The estimation of this parameter is especially important for the organic soils where soil moisture changes in the wet range strongly influence the soil degradation processes and physical soil properties.On the other hand, the β 1 + β 2 parameters were highly correlated with soil pF curve characteristics.From the data presented in Figure 6, it can be observed that these parameters were also negatively correlated with soil moisture contents measured at the predefined soil matric potentials ranging from pF = 1.5 to pF = 4.2.Moreover, it should be stressed that the moisture of the soil substrate (indicator F) was also highly correlated with the moisture content of the pF curves (Figure 6) which, in general, is in agreement with the statements made in [50].Based on the correlation analysis, it can be noted that the slope in the "wet" range of the field calibration curves can be fairly well determined using the moisture indicator of the Ellenberg scale, or more precisely using the soil moisture content at pF = 2 as a soil type (site) specific predictor.
The broken-line model parameter Ψ was correlated with bioindicators of soil cover, physical soil properties and soil water retention characteristic.The highest r value was determined between Ψ and the Shannon-Weiner index.This could suggest that the variety of the plant species, which are typical for the site location, influences the slope separation point of the TDR calibration curves.This was especially visible in the case of two sites: VI and IV.In the former, the Ψ value was the highest and soil cover is mainly occupied by two plant species (Festuca rubra and Carex panicea) which affected the lowest value of the H' index.On the other hand, at site IV the highest H' value was observed (2.4) which indicates a condition of greater biodiversity (with a predominance of dicotyledonous plants), however, the Ψ value was the lowest (18.29).Although the Ψ value is was highly correlated with the Shannon-Weiner index, its value can also be carefully approximated by physical soil properties.Our data indicate that the increase of Ψ value was related to the increase of average soil porosity and soil moisture status at pF = 2.5 and 2.7.The Ψ value was also highly correlated with the ratio between L and F value suggesting an importance of light and moisture bioindicators in determination of the threshold value of the TDR calibration curve.
characteristics.From the data presented in Figure 6, it can be observed that these parameters were also negatively correlated with soil moisture contents measured at the predefined soil matric potentials ranging from pF = 1.5 to pF = 4.2.Moreover, it should be stressed that the moisture of the soil substrate (indicator F) was also highly correlated with the moisture content of the pF curves (Figure 6) which, in general, is in agreement with the statements made in [50].Based on the correlation analysis, it can be noted that the slope in the "wet" range of the field calibration curves can be fairly well determined using the moisture indicator of the Ellenberg scale, or more precisely using the soil moisture content at pF = 2 as a soil type (site) specific predictor.The broken-line model parameter Ψ was correlated with bioindicators of soil cover, physical soil properties and soil water retention characteristic.The highest r value was determined between Ψ and the Shannon-Weiner index.This could suggest that the variety of the plant species, which are typical for the site location, influences the slope separation point of the TDR calibration curves.This was especially visible in the case of two sites: VI and IV.In the former, the Ψ value was the highest and soil cover is mainly occupied by two plant species (Festuca rubra and Carex panicea) which affected the lowest value of the H' index.On the other hand, at site IV the highest H' value was observed (2.4) which indicates a condition of greater biodiversity (with a predominance of dicotyledonous plants), however, the Ψ value was the lowest (18.29).Although the Ψ value is was highly correlated with the Shannon-Weiner index, its value can also be carefully approximated by physical soil properties.Our data indicate that the increase of Ψ value was related to the increase of average soil porosity and soil moisture status at pF = 2.5 and 2.7.The Ψ value was also highly correlated with the ratio between L and F value suggesting an importance of light and moisture bioindicators in determination of the threshold value of the TDR calibration curve.
Figure 7a-d shows simple regression lines allowing prediction of the broken-line model parameters (Equations ( 1) and (3)) using bioindices (L, F, N, H') determined on the basis of the measured plant species' composition for the selected sites.Based on the discussion presented in Section 3.4, we can state that the development of the general calibration curve for predicting moorsh moisture content using the TDR method for a specific Based on the discussion presented in Section 3.4, we can state that the development of the general calibration curve for predicting moorsh moisture content using the TDR method for a specific site is very complex.We can predict each of the broken-line parameters one by one using simple linear equations plotted in Figure 7a-d.On the other hand, the simplification of the segmented line approach indicates that, for general purposes, both slopes (β 1 , β 1 + β 2 ) can be averaged (Equations ( 5) and ( 7)), however, this requires finding out the threshold Ψ values.The quality of both attempts depends mainly on the prediction of the SSBLM parameters based on the bioindicators of the soil cover as the simplest explanatory variables.However, error in the estimation of the parameters affects the accuracy of the soil moisture prediction.Therefore, taking into account the temporal stability of the investigated ecosystems (supported by water supply and farmer practice), we developed a equation which in one approach includes the relationship between soil moisture content and dielectric constant and the bioindicators' influence.For this reason, we used the M5Rules algorithm belonging to regression tree methods (see description in Section 2.5).
Table 6 summarizes the cross-validation scheme.In total we trained six models SSM-D1, . . ., SSM-D6.The results of the model selection approach indicate that, depending on training dataset, the SSM-D models included different explanatory variables for soil moisture prediction.The performance of the cross-validation split models is described by RMSE train .The main independent variable in each training model, which binary split our dataset into two child sets, was dielectric constant (K a ).In the second stage of the cross-validation scheme we reduced the number of variables to the commonly appearing covariates in the training model stage.For this reason, different sets of explanatory variables were tested.Finally, we found the appropriate combination of the variables i.e., K a , F, L and H' (SM-set of model) in the model for which at each split of the data the RMSEcv values were close to the RMSE train (Table 6).Additionally, in Table 6 the RMSEcv values determined in cross-validation procedure performed for the GBLM model are also included.A last SSM-D model fitting was conducted using the mentioned combination of variables (SM) as single fit to all data.Fortunately, the resulted SSM-D model includes two nodes for which the linear regression models have been established (Figure 8).This final fitting resulted in a two-slopes calibration model (SSM-D) split by the parent node (K a = 30.845)similarly to the segmented regression model (Equations ( 5) and ( 7)).It should be stressed that the value of 30.845 is included in the confidence interval of the GBLM model.The slope of the line below the splitting point (dry range) was equal to 0.0127 (regression parameter at K a variable) and was also comparable with its value determined using segmented regression (GBLM).The position of this line along the soil moisture content axis (the value of intercept) was improved by the H' and L values (specific measures) of the site.Above the K a = 30.845,the slope was equal to 0.0075 (nearly the same as in GBLM), however, the position of the line can be determined using covariates represented by F, L, bioindicators.However, it should be stressed that the influence of the L value was marginal for soil moisture prediction (wet range) which results from the regression coefficient at this variable.
Following the statements in [74], the regression tree indicated discontinuities occurring between adjacent leaves.In the post-processing procedure, we iteratively found out the threshold value ψ r which enables establishing a continuous soil moisture content prediction model in the whole range of the dielectric constant changes (site-specific model with re-fitting of ψ r value, SSM-R).This is especially important for monitoring of this feature.The uncertainty of this kind of the procedure was tested using the RMSE value as a criterion of losing quality in the model compared to the SSM-D model.The ψ r values are in fact the re-established criterion of the splitting of the dataset which enables, again, application of the linear equations developed for both leaves (Figure 8).
The main conclusion from the modelling attempt is that the SSM-D model is structurally similar to the GBLM model (the splitting value K a = 30.845 is similar to the ψ value of the GBLM model) and this results in discontinuities occurring between adjacent leaves.However, we propose to use the SSM-D model in the case when the TDR user does not want to perform a refitting (SSM-R).In the SSM-R model approach, additionally in the post-processing procedure, the threshold value in the form of ψ r parameter is obtained.In this sense, the SSM-R model is the smoothed form of the SSM-D model and can be treated as a site-specific model similar to SSBLM.The inconveniences of the modelling approach as well as the importance of threshold value in mucky soils should be further investigated.Table 7 presents the RMSE value estimation according to the different calibration models for the TDR method.The results of the data presented in the table showed that the RMSE values for the proposed model (SSM-R) produce nearly the same error estimators as the SSM-D algorithm.Based on this observation, we can state that the model, which is graphically presented in Figure 8, can be used for continuous prediction of moorsh soil moisture content based on dielectric constant measurements and complex information about bioindices.The form of the proposed model is generally in agreement with the main results of this study which showed the possibility to establish a two-segmented broken-line calibration model (GBLM, Equations ( 5) and ( 7)) for the TDR method.Table 7 presents the RMSE value estimation according to the different calibration models for the TDR method.The results of the data presented in the table showed that the RMSE values for the proposed model (SSM-R) produce nearly the same error estimators as the SSM-D algorithm.Based on this observation, we can state that the model, which is graphically presented in Figure 8, can be used for continuous prediction of moorsh soil moisture content based on dielectric constant measurements and complex information about bioindices.The form of the proposed model is generally in agreement with the main results of this study which showed the possibility to establish a two-segmented broken-line calibration model (GBLM, Equations ( 5) and ( 7)) for the TDR method.

Conclusions
Monitoring of soil moisture content in peat soil is a crucial issue which enables study processes related to peat soil degradation, particularly in the vadose zone.The presented study indicated that simple regression models which relate the soil dielectric constant with moisture content did not properly address determination of soil moisture content in degraded peatlands in field conditions.This was especially apparent in the intermediate range of dielectric constant (from approximately 20 to 40), where the best existing empirical models presented in the literature overestimated the moisture content by as much as 0.15 cm 3 cm −3 .The main expectation from TDR users is to be able to obtain relatively accurate moisture content determination in field conditions.The results of our investigations showed that the proposed calibration curve (GBLM) composed of two segments meets those requirements with the acceptable error of soil moisture determination in field conditions (RMSE = 0.04 cm 3 cm −3 ).Despite the relatively good approximation of moisture content by the broken-line models (GBLM, SSBLM) developed during this study, these are only empirical equations, which should be subject to an interpretation derived from further investigation of the segmented line parameters.It is important to stress that in the case of the GBLM model, the existence of the two independent slopes separated by the threshold value ψ has been proved empirically by the statistically significant difference in soil bulk densities in both ranges of dielectric constant changes.We propose that further investigation should be focused on the assessment of the influence of soil air content on the slope of calibration equation in the dry range and assessment of the physical meaning of the threshold value ψ.
The conducted research also proved that, in site-specific calibration equations (SSBLM), the broken-line model parameters, in general, are correlated with the following bioindices: L, F, R, N values and biodiversity index (H').This means that EIV and H' indicators can be useful in the assessment of soil moisture content in degraded peatlands using the TDR technique.It should be stressed that an increase of slope value (β 1 + β 2 ) in the "wet" range of calibration equation was also correlated with a decrease of moisture content at pressure head corresponding to pF = 2.This tendency was also observed in the correlation between the F (substrate moisture) indicator and β 1 + β 2 slope.Moreover, it can be concluded that the F classifier of the habitats determined only on the basis of plant species' composition is also highly correlated with the soil water retention characteristic at the investigated sites.Therefore, the performed research confirmed that the plant cover of the habitat depends on the water-related properties of the soil peatlands.
The analysis of the linear equations included in the SSM-D and SSM-R models indicates that the soil moisture content, aside from dielectric constant, was also dependent on the biodiversity index (H') and light intensity (L) in the "dry" range of calibration curves.In the "wet" range of the calibration curve, the proposed model contains, apart from the variable (K a ), covariates represented by the following parameters: Substrate moisture (F), and light intensity (L).
The results of our study showed that bioindices, as well as their basic application, can also help in the identification of the specific soil condition which can improve our empirical knowledge of soil moisture regimes.Therefore, we hope the quantification of the EIV and H' values of degraded peatlands as simply measured categorical variables will enable a TDR user to make decisions on what type of calibration equation can be selected for soil moisture monitoring in this type of ecosystem.

Figure 2 .
Figure 2. Measured data of dielectric constant and soil moisture content for different sites together with own empirical relationship (NL_own) and relationships listed inTable4.

Figure 2 .
Figure 2. Measured data of dielectric constant and soil moisture content for different sites together with own empirical relationship (NL_own) and relationships listed inTable4.

Figure 3 .
Figure 3. Field measurements and fitted GBLM for the relationship between soil moisture content and dielectric constant.The horizontal dotted black lines indicate the soil moisture content at predefined pressure head.The red vertical line indicates the average value of the breaking point (ψ = 31.49)whereas the red dotted vertical line is the 95% confidence interval.

Figure 3 .
Figure 3. Field measurements and fitted GBLM for the relationship between soil moisture content and dielectric constant.The horizontal dotted black lines indicate the soil moisture content at predefined pressure head.The red vertical line indicates the average value of the breaking point (ψ = 31.49)whereas the red dotted vertical line is the 95% confidence interval.

Water 2018 ,
10, x FOR PEER REVIEW 11 of 21

Figure 4 .
Figure 4. Boxplot of soil bulk density for "dry" and "wet" segments of the measured dataset (dotted line indicates overall average soil bulk density).The scattered point indicates the soil bulk density value with respect to sites.The dotted horizontal line indicates the overall soil bulk density average.The diamond symbol indicates the average soil bulk density for each segment of the dataset.The red dots indicate the outliers.

Figure 4 .
Figure 4. Boxplot of soil bulk density for "dry" and "wet" segments of the measured dataset (dotted line indicates overall average soil bulk density).The scattered point indicates the soil bulk density value with respect to sites.The dotted horizontal line indicates the overall soil bulk density average.The diamond symbol indicates the average soil bulk density for each segment of the dataset.The red dots indicate the outliers.

Figure 5 .
Figure 5. Field measurements and fitted models of the relationship between the soil moisture content and dielectric constant for specific sites.(a) Site I; (b) site II; (c) site III; (d) site IV; (e) site V; (f) site VI.

Figure 5 .
Figure 5. Field measurements and fitted models of the relationship between the soil moisture content and dielectric constant for specific sites.(a) Site I; (b) site II; (c) site III; (d) site IV; (e) site V; (f) site VI.

Figure 6 .
Figure 6.Correlation graph of the analyzed variables.Arrows in green and red indicate statistically significant positive and negative correlations.Dark yellow arrows indicate not statistically significant correlations.The labelled cells indicate statistically significant correlation between different types of variables regarding the state of the habitats, SSBLM and soil hydro-physical properties.

Figure 6 .
Figure 6.Correlation graph of the analyzed variables.Arrows in green and red indicate statistically significant positive and negative correlations.Dark yellow arrows indicate not statistically significant correlations.The labelled cells indicate statistically significant correlation between different types of variables regarding the state of the habitats, SSBLM and soil hydro-physical properties.

Figure
Figure 7a-d shows simple regression lines allowing prediction of the broken-line model parameters (Equations (1) and (3)) using bioindices (L, F, N, H') determined on the basis of the measured plant species' composition for the selected sites.

Figure 7 .
Figure 7. Linear regression lines between parameters of the broken-line model and explanatory variables: (a) the Ellenberg index L -light vs. the intercept β0 of SSBLM; (b) the Ellenberg index N -productivity vs. the slope β1 of SSBLM; (c) the Ellenberg index F -moisture vs. the slope β1 + β2 of SSBLM; (d) the biodiversity index -H' vs. the breakpoint (Ψ) of SSBLM.

Figure 7 .
Figure 7. Linear regression lines between parameters of the broken-line model and explanatory variables: (a) the Ellenberg index L -light vs. the intercept β 0 of SSBLM; (b) the Ellenberg index Nproductivity vs. the slope β 1 of SSBLM; (c) the Ellenberg index F -moisture vs. the slope β 1 + β 2 of SSBLM; (d) the biodiversity index -H' vs. the breakpoint (Ψ) of SSBLM.

Water 2018 ,
10, x FOR PEER REVIEW 16 of 21form of ψr parameter is obtained.In this sense, the SSM-R model is the smoothed form of the SSM-D model and can be treated as a site-specific model similar to SSBLM.The inconveniences of the modelling approach as well as the importance of threshold value in mucky soils should be further investigated.

Figure 8 .
Figure 8. Diagram representing graphical explanation of the SSM-D and site-specific model with re-fitting of ψr value (SSM-R) models.

Figure 8 .
Figure 8. Diagram representing graphical explanation of the SSM-D and site-specific model with re-fitting of ψ r value (SSM-R) models.

Table 1 .
Basic soil properties of mucky layers.

Table 2 .
Average soil moisture content at predefined soil matric potentials pF.

Table 3 .
The habitat conditions and the Shannon-Weiner's index of the meadow communities in the examined sites (mean values).

Table 5 .
Site-specific broken-line model parameters (SSBLM) for the TDR method.

Table 5 .
Site-specific broken-line model parameters (SSBLM) for the TDR method.

Table 6 .
Summary of the cross-validation approach for establishing the site-specific model discontinuous (SSM-D) calibration model for the TDR method in mucky soils.

Table 7 .
Values of RMSE for the different types of TDR calibration models established in this study.

Table 7 .
Values of RMSE for the different types of TDR calibration models established in this study.