Determination of the Optimal Mounting Depth for Calculating Effective Soil Temperature at L-band: Maqu Case

Effective soil temperature T e f f is one of the basic parameters in passive microwave remote sensing of soil moisture. At present, dedicated satellite soil moisture monitoring missions use the L-band as the operating frequency. However, T e f f at the L-band is strongly affected by soil moisture and temperature profiles. Recently, a two-layer scheme and a corresponding multilayer form have been developed to accommodate such influences. In this study, the soil moisture/temperature data collected and simulated by the Noah land surface model across the Maqu Network are used to verify the newly developed schemes. There are two key findings. Firstly, the new two-layer scheme is able to assess which site provides relatively higher accuracy when estimating T e f f. It is found that, on average, nearly 20% of the T e f f signal cannot be captured by the Maqu Network, in the currently assumed common installation configuration. This knowledge is important, since the spatial averaged brightness temperature (a function of T e f f) is used to determine soil moisture. Secondly, the developed method has made it possible to identify that the optimal mounting depths for the observation pair are 5 cm and 20 cm for calculating T e f f at the center station in the Maqu Network. It has been suggested that the newly developed method can provide an objective way to configure an optimal soil moisture/temperature network and improve the representativeness of the existing networks regarding the calculation of T e f f .


Motivation
Soil moisture is one of the essential climate variables (ECVs) [1][2][3][4].On land, soil moisture is one of the few variables depicting the storage of water and heat that can be released several months later, as vegetation cover and snow do [5][6][7].Even though the total energy represented by soil moisture is not comparable to that of the sea surface temperature, it could still constitute the turning point in studies on land-atmosphere interaction, seasonal climate forecasting and numerical simulations [8][9][10].Soil moisture is also an indicator for drought and flood events, which are key factors in agriculture, hydrology, and ecology, among others.For decades, scientists have tried to generate global soil moisture data through in situ observations, numerical simulations and remote sensing.Of these, passive remote sensing from satellites using L-band (1.4 GHz) has been considered the most promising tool for creating a global map of soil moisture.Both the Soil Moisture and Ocean Salinity (SMOS, launched in 2009) and the Soil Moisture Active Passive (SMAP, launched in January 2015) missions use L-band as their operating channel [11,12].The accuracy of soil moisture retrieval at L-band depends on the method for calculating the effective soil temperature (T e f f ) and for determining the sampling depth (i.e., which layer of the soil column is detected by the radiometer [13,14]).At L-band, the sampling depth varies from 0 to several centimeters below the surface (e.g., for wet soil) [15]; this will directly affect the estimation of T e f f .How to accurately estimate T e f f is one of the basic questions in soil moisture remote sensing [16,17].To calibrate and validate the soil moisture measurements from SMOS and SMAP, in situ observations are needed.When installing soil moisture sensors, a key question is how to configure the mounting depth.Usually, the mounting depth is decided based on experience.To date, a mounting depth for the sensors that is most readily detectable by satellite instruments has not been identified.This study will try to identify the optimal mounting depth through analyzing the definition of T e f f , which is the only variable requiring soil state profile information in a microwave transfer model.In addition, a mathematical derivation will be developed to provide instruction on how to configure this mounting depth.

Background
The physical concept of effective soil temperature T e f f was developed to describe the emissive capacity of a soil column.According to the Rayleigh-Jeans approximation, in the microwave domain the emitted energy from the soil is proportional to the thermodynamic temperature [18].The brightness temperature T B is expressed as: where T B is the radiation intensity received by the passive microwave sensor fixed near the soil surface or on the satellite platforms while neglecting the attenuation of atmosphere.ε is the emissivity that is strongly related to soil moisture.T e f f is the effective temperature and is formulated by Wilheit [19] as: where α pxq " 4π λ ε 2 pxq{2 " ε 1 pxq ‰ 1 2 .Equation (2) states that T e f f at the soil surface is a super position of the intensities emitted at various depths within the soil.
An accurate computation of T e f f is thus critical for obtaining relevant values of soil emissivity from brightness temperature measurements.It follows that soil moisture can be retrieved from the estimates of soil emissivity [18].However, the soil moisture and soil temperature profile information is usually limited in a field experiment.The discrete observation points are installed empirically or thought to be continuous with constant vertical intervals.Recently, a new two-layer scheme (hereafter Lv's scheme) has been derived directly from Equation (2).This is expressed by Lv et al. [20] as: in which B 1 " ∆x ¨4π λ ¨ε2 2 ?
ε 1 , a parameter related to wavelength pλq, to soil moisture through the dielectric constant (ε 2 is the imaginary part and ε 1 is the real part), and to sampling depth p∆xq.The dielectric model used in this study is Mironov's model which has also been adopted by SMOS [21].The soil moisture/temperature (hereafter SM/ST) can be specified using the values obtained from observation networks (e.g., Maqu Network).λ will be a constant for the specific sensors, i.e., λ " 21 cm for SMOS and SMAP.This leaves the sampling depth (intervals) as only unknown in Equation (3).Equation (3) can be further developed into a complete multi-layer scheme as [20]: where the first, second and third part of the right hand side of the equation represent the 1st, ith and nth layer (e.g., ith represents the layers in the middle, while nth represents the last layer).When the first layer has been fixed in field sites, the accuracy Equations ( 3) and ( 4) can achieve in estimating T e f f depends on the determination of sampling depth (intervals) ∆x i (i for ith layer).Figure 1 illustrates the subscripts related to Equations ( 3) and ( 4).The dielectric model used in this study is Mironov's model which has also been adopted by SMOS [21].The soil moisture/temperature (hereafter SM/ST) can be specified using the values obtained from observation networks (e.g., Maqu Network).λ will be a constant for the specific sensors, i.e., 21 λ = cm for SMOS and SMAP.This leaves the sampling depth (intervals) as only unknown in Equation (3).Equation (3) can be further developed into a complete multi-layer scheme as [20]: where the first, second and third part of the right hand side of the equation represent the 1st, ith and nth layer (e.g., ith represents the layers in the middle, while nth represents the last layer).When the first layer has been fixed in field sites, the accuracy Equations ( 3) and ( 4) can achieve in estimating eff T depends on the determination of sampling depth (intervals) i x Δ ( i for i th layer).Figure 1 illustrates the subscripts related to Equations ( 3) and (4).In this study, the features of parameter B and how B is related to x Δ will be discussed mathematically in Section 2. Section 3 will discuss two aspects regarding the application of Lv's scheme: (1) the assessment of existing sites in the Maqu Network in terms of their representativeness for calculating eff T ; (2) how to determine the optimal mounting depth for the second layer when provided with a fixed first layer.The proposed approach is validated by using both observations and model outputs.The discussion and conclusion are found in Section 4.

Methodology and Data
In Section 2.1, a series of concepts for evaluating eff T derived from Lv's multi-layer scheme will be established.An evaluation method inferred from these concepts will be applied to the SM/ST field observation at Maqu Network in Section 3.1.Besides, a mathematical derivation of optimal mounting depth inferred from Lv's multi-layer scheme will be presented in Section 2.2.Since the field observation contains a maximum of 6 layers, a set of Noah simulation data, which is already successfully used in other citations, will be a supplement to involve more layers (37 layers).The true eff T is obtained by using Lv's multi-layer scheme.Then, all two-layer combinations will be In this study, the features of parameter B and how B is related to ∆x will be discussed mathematically in Section 2. Section 3 will discuss two aspects regarding the application of Lv's scheme: (1) the assessment of existing sites in the Maqu Network in terms of their representativeness for calculating T e f f ; (2) how to determine the optimal mounting depth for the second layer when provided with a fixed first layer.The proposed approach is validated by using both observations and model outputs.The discussion and conclusion are found in Section 4.

Methodology and Data
In Section 2.1, a series of concepts for evaluating T e f f derived from Lv's multi-layer scheme will be established.An evaluation method inferred from these concepts will be applied to the SM/ST field observation at Maqu Network in Section 3.1.Besides, a mathematical derivation of optimal mounting depth inferred from Lv's multi-layer scheme will be presented in Section 2.2.Since the field observation contains a maximum of 6 layers, a set of Noah simulation data, which is already successfully used in other citations, will be a supplement to involve more layers (37 layers).The true T e f f is obtained by using Lv's multi-layer scheme.Then, all two-layer combinations will be presented to generate T e f f with Lv's two-layer scheme.Among these combinations, the theory derived in Section 2.2 will be checked in Section 3.2.

Characteristics of Parameter B
Lv's scheme uses an exponential function to distribute the weight among different layers (see Equation ( 4)).The only parameter in this scheme is B, which is another form of α in Wilheit's scheme.A series of extended concepts used to analyze the characteristics of B are listed in Table 1.In fact, Lv's multilayer scheme, which has a general form of exponential terms, is more flexible than the two-layer scheme.To achieve either an assessment of the Maqu Network or a determination of the optimal mounting depth/combination, the multilayer scheme will be deployed.When excluding both the top and bottom layers in the multilayer model, the in-between layers (the ith layer) share a common expression: T i `1 ´e´B i ˘i´1 ś j"1 e ´Bj , with `1 ´e´B i ˘i´1 ś j"1 e ´Bj called the weight function (double underline in Equation ( 4)).Thus, the weight function could be further divided into two parts: the representative term `1 ´e´B i ˘, which is dealing with B at the ith layer, and the term which could be computed with B above the ith layer.e ´Bj determines how much weight from the ith layer can directly contribute to T e f f without any prior knowledge of the (i `1)th layer.In other words, if the (i `1)th layer exists, whatever the soil temperature at the (i `1)th layer is, the soil temperature at the (i `1)th layer has to be multiplied by i ś j"1 e ´Bj .Hence, i ś j"1 e ´Bj can be called the residual (single-underline in Equation ( 4)) of the ith layer.With these concepts in mind, it is possible to estimate the potential contributions by different layers when calculating T e f f , as discussed in Section 3.1.
In the following, the mathematical derivation of Lv's scheme will be implemented to derive the optimal mounting depth.In Equation (3), T 1 depicts the mean soil temperature of the first layer, with 1 ´e´B 1 forming its weight in the calculation of T e f f .The same can be said for the second layer, for T 2 and e ´B1 .It can be inferred that the result will only be optimal if the following conditions are met: (1) T 1 matches the layer-averaged soil temperature integrated from the surface to the sampling depth ∆x, which is used for calculating 1 ´e´B 1 (see Equation (3)); (2) T 2 matches the layer-averaged soil temperature integrated from the sampling depth ∆x to the infinite, as in Equation (2).
When the above two conditions are met, the weights calculated by Equation (3) are the most representative for the first and second layer, in terms of calculating T e f f .The representativeness (i.e., depth intervals) of T is clearly defined in numerical model outputs; for instance, 0 to 7 cm for the first layer, 7 to 28 cm for the second layer, 28 to 100 cm for the third layer, and 100 to 289 cm for the bottom layer in ERA-interim reanalysis data.The same concept exists for observation data like the Maqu Network.The question of which depth interval a SM/ST sensor could represent is critical, considering Conditions 1 and 2.
Table 2 explains the definition of parameters used in determining the optimal mounting depths.To keep consistent with the B 1 in Equation (3), B 1s and B 2s are proposed to stand for the corresponding B values at the mounting depth of ∆x 1s and ∆x 2s , where the sensors are installed.

Optimal Mounting Depth
Considering the integral exponential function ş e ´xdx "1 ´e´x , it could be deduced that for each layer in soil profile dx, its weight should be e ´B.In order to satisfy Condition 1, the first layer e ´B1s is supposed to be the mean value of e B ˇˇB 1 0 , e.g., e ´B1s " 1 This demonstrates that the weight e ´B1s , calculated by using SM/ST at sampling depth ∆x 1s , should match the layer-averaged e ´B.Here e ´B1 should be calculated by using the layer-averaged SM/ST with a layer thickness of ∆x.
For the second layer, to satisfy Condition 2, T 2 is assumed to match the layer-averaged soil temperature integrated from the interval rB 1 , 8s.The infinite is usually represented numerically by using a cutting-off error with a finite number (in this case: 8 « B 2 `cutting_o f f _error).Consequently, T 2 is here defined as the layer-averaged ST between rB 1 , B 2 s where B 1 ă B 2s ă B 2 .With a cutting-off error B 2 , the First Mean Value Theorem for Integration [22] can be applied to prove that there must be a value of B 2s that satisfies Condition 2. In fact, B 2 can be numerically defined without necessarily being used in the calculation of the subsequent derivation with the math skill introduced below: The above implies that the weight e ´B2s calculated by using the SM/ST at the sampling depth ∆x 2s should match the layer-averaged e ´B (e.g., e ´B2s " Therefore, for the two-layer scheme, the total weight is the sum of Since B 1s is determined by field observation (e.g., the first SM/ST sensor below the soil surface), Equation ( 5) contains three unknowns: B 1 ; B 2 and B 2s , but with two formulas.With the preset B 1s known, B 1 can be calculated by using the first expression in Equation ( 5).There are two remaining parameters, B 2 and B 2s , that need to be solved in Equation (5).
To meet Condition 2, T 2 should match the layer-averaged ST for the second layer, which means T 2 is equal to the ST at the sampling depth ∆x 2s (see Table 2), because at this sampling depth the weight e ´B2s is representative for the second layer.On the other hand, to be representative for the second layer, the B 2s (corresponding to ∆x 2s , Table 2) must be located between B 1 and B 2 .Since the First Mean Value Theorem for Integration is defined within a closed interval rB 1 , B 2 s, an optimal mounting depth of ∆x 2s (e.g., at rB 1 , 8s) has to be determined to enable its layer representativeness.The determination of an optimal mounting depth can be achieved by using the characteristic ratio B 2 ´B2s for the second layer.Expanding this characteristic ratio, considering Equation ( 5), leads to: The form of the function with this characteristic ratio is f pxq " e x ´x x .Therefore, mathematically, to obtain an optimal characteristic ratio, the first order derivative of f pxq needs to be equal to zero.There is also a physical meaning: in Figure 2a, as will be discussed, to meet Condition 1 and 2, the pink area has to be made equal to the purple area (i.e., for the first layer), and the green area to the grey area (i.e., for the second layer).As can be seen, if B 2s is situated closer to either B 1 or B 2 , the weight of e ´B2s will not represent the second layer.Under the precondition that the annual averaged soil temperature tends to be homogeneous throughout the soil profile [23][24][25], to enable the layer representativeness of B 2s , the difference between B 2 ´B2s and B 2s ´B1 has to be made as small as possible.Meanwhile, to enable the grey and the green area to be equal in size for the second layer in Figure 2a, the ratio should not be smaller than a constant, considering the shape of the f pxq " e x ´x x curve.To find a solution for f pxq that reduces the ratio as much as possible actually means the first order derivative of f pxq should be equal to zero.This leads to x " 1, for the current function form, which means In this way, B 2 does not need to be defined exactly (Table 2), with the cost of causing a cutting-off error.However, due to the infinite form of Equation ( 2), the cutting-off error determined by the approach indicated above cannot be 0, but can be infinitely approaching 0. With the relation obtained in Equation (7), Equation ( 5) may be rewritten as, where ε 1 is computed by the SM/ST data collected at the first layer in the field experiment (∆x 1s " 0.05m for the Maqu Network).With Equation ( 8), the determination of B 1 is easy in the first expression once B 1s is known.This then naturally leads to the determination of B 2s in the second expression.
Figure 2a explains the concept of Equation ( 8) in a schematic diagram, by showing the relationship between the parameter B and the residual e ´B in a two-layer T e f f model.If the present value B 1 " 1 is used, then B 1s and B 2s can be derived by using Equation (8).For each B value, a corresponding depth can be calculated by ε 1 , where SM/ST is measured for each layer at the sampling depth of ∆x i .Therefore, the explanation of physical depth ∆x is tied up with B. This relationship may be expressed as B " g p∆xq.According to Equation ( 8), if the first layer sensor is installed at ∆x 1s , this will correspond with B 1s , while the depth interval ∆x should correspond with B 1 (e.g., layer-averaged).Then, to meet Condition 1, the pink area should be equal to the purple area in Figure 2a.
After determining B 1 p∆xq, the remaining area (i.e., the green area) represents the weight given to the second layer, from which B 2s p∆x 2s q can be inferred.The ratio (e.g., B 1 {B 1s , B 2s {B 1s ) obtained by Equation ( 8) can be used to infer the optimal mounting depth, if the thickness of the first layer is fixed.For instance, if B 1s obtained from this ratio is 0.46, which corresponds to a sampling depth of ∆x 1s , then B 1 is assumed to equal 1.Furthermore, B 2s should equal 2, based on Equation (8), and this corresponds to another sampling depth, ∆x 2s .Figure 2a and Equation ( 8) both indicate the capability of Lv's scheme in identifying the optimal mounting depth.They are deemed the simplified form of the multilayer model described in Equation (4).
correspond with 1 B (e.g., layer-averaged).Then, to meet Condition 1, the pink area should be equal to the purple area in Figure 2a.8) can be used to infer the optimal mounting depth, if the thickness of the first layer is Figure 2b mathematically shows B 1s , B 1 and B 2s distributed in a "B vs. e ´B" domain, based on Equation (8).As discussed above, with a sensor positioned at a certain mounting depth, the "B vs. e ´B" domain can help to understand how the weight of the soil layer at that depth contributes to the calculation of T e f f .By assuming B 1s = 0, 1, 2, . . ., we can obtain the black line in Figure 2b.Furthermore, B 1 is a function of B 1s (B 1 " f pB 1s q) and B 2s is a function of B 1 .Therefore, with B 1s varying, B 1 and B 2s form curves, as shown in Figure 2b.It should be noted that the y-axis in Figure 2b indicates a special case: if there is only one layer left to represent the whole soil volume, then B 2s " 1 is the optimal option.In this case, the real optimal mounting depth (∆x 2s ) could be inferred from ε 1 , considering the SM/ST conditions for dielectric constant.It is coincidental with the concept of penetration depth or what is called the temperature sensing depth as that depth satisfies 1{e of Equation ( 2) [26,27].The distribution of curves in Figure 2b is generic, and the curves are characteristic and derived from Equation (8). Figure 2a can be seen as a single vertical cross-section of Figure 2b.With Figure 2b, it is possible to identify the optimal mounting depth ∆x i (i = 1, 2), as long as B 1s and B 2s computed from specified ST/SM profiles align with the black and the green reference lines, respectively.As long as computed B 1s and B 2s meet the reference lines, Equation ( 8) is satisfied.To sum up, the basic steps in determining the optimal sampling depth could be: (1) Using SM/ST observation at first layer (∆x 1 ) to calculate B 1s with B i " ∆x i ¨4π λ ¨ε2 2 ?
(2) Using B 1s to calculate B 1 and B 2s with Equation ( 8); (2) Determining the optimal sampling depth for the second layer with B i " ∆x i ¨4π ε 1 again.If a network already exists whose sensors are installed at several depths (∆x i ), then select a two-layer combination and calculate its B 1s and B 2s .The pair of B 1s and B 2s that fits Equation ( 8) is the optimal sampling depth.The application of Figure 2b to an existing network (Maqu Network) will be discussed in Section 3.2.

The Maqu Network
In 2008, a soil moisture monitoring network was set up at Maqu [28,29].Maqu is located in the source region of the Yellow River and the east margin of the Tibetan Plateau.The average elevation is around 3300 m above sea level.The vegetation consists of alpine shrublands and meadows, with grasses less than 1 meter in height with roots extending tens of centimeters in depth.An accumulated humus layer of around 10 cm is mixed with the soil.Shrubs and trees are scarce in this region, while desert dunes sometimes appear along the river.The network includes 20 sites with a variety of terrain and soil textures, such as silt loam and sandy loam.The observation depth is set to 5 cm/10 cm for most sites, but a few sites have enhanced layer settings of 20 cm, 40 cm and 80 cm, respectively.Three of these sites, C01, N01 and N02, are relatively close to each other, i.e., within 200 m.There is also a boundary layer meteorological tower at C01, which takes its own SM/ST measurements; this station is hereafter called the center station [30].The layer settings for the center station are 5 cm, 10 cm, 20 cm, 40 cm, 80 cm and 160 cm, with an additional infrared sensor for the skin temperature.Since its establishment, the Maqu Network has provided accurate soil parameters for retrieving soil moisture data from satellites at the L-band [30][31][32][33].The time series of SM/ST at center station were collected from May 2009 to April 2010.To avoid introducing the additional complexity of frozen soil, the calculation in Section 3.2 is based only on the time series where ST > 0. Additionally, to obtain T e f f time series via the Wilheit's scheme, the Noah Land surface model [28] was adopted to generate ST/SM profiles with 37 layers (0 to 500 cm) between 12 and 28 June 2010.

Evaluation of Installation Configuration
The spatial resolution of passive microwave radiometers on satellite platforms is coarse compared to an area, which could be represented by a single SM/ST site.For instance, SMOS produce brightness temperature observation data with a resolution of about 40 km on average [11].For the whole region covered by the Maqu Network, T e f f calculated from a single site is not representative enough considering the degree of spatial variability in soil moisture and texture.It is common practice to take the average value for all sites in the network and compare this to the brightness temperature observation produced.Usually, the arithmetical average is used.In this section, the B values (i.e., weights) for different layers at the center station will be calculated using Equation ( 4).The same method can be applied over the whole network.How to evaluate the reliability of a single site, in terms of accurately evaluating T e f f , will be discussed in this section.The center station will be used as an example first and then the discussion will extend to other sites in the Maqu Network.The consecutive mounting depths employed at the center station are 5, 10, 20, 40, 80 and 160 cm.
Figure 3 shows the time series for B values (e.g., the contribution of weight, Table 1) at different mounting depths (or layers) at center station.The time series is divided into three periods: April to November 2009; November 2009 to February 2010; and the period after February 2010.The valley-shaped curve (e.g., for the upper layers ď20 cm) for the second period might be explained by the soil being frozen (when soil temperature is below 0 ˝C).Frozen soil makes the SM/ST profile homogenous, which leads to similar B values for the upper layers (ď40 cm).Usually, the upper layer contributes most to the T e f f , as seen in the first period.However, this is not the case in the third period.The contribution from the 10 cm layer is dominant in the third period.This implies that a two-layer model, which considers only the 5 cm and 160 cm layers, for instance, may lead to significant errors (e.g., loss of signal from the 10 cm layer).As expected, B varies throughout the seasons [20].However, the variation is not induced by an inter-annual cycle, but corresponds strongly to soil moisture.Several rainfall events can be identified that increased soil moisture at 5 cm and subsequently the B values for that same layer.However, the other layers (10, 20, 40, 80, and 160 cm) demonstrate different characteristics and may even be opposite phases to those at 5 cm.The B values of the 5 cm and the 10 cm layer show opposing patterns (the blue and green lines for Period I and III in Figure 3).The reduction of B at 5 cm, for Period III, is caused by surface dryness and lack of rainfall (e.g., soil moisture is less than 0.05 cm 3 ¨cm ´3 at 5 cm).On the other hand, the mean soil moisture content at 10 cm at that time is around 0.35 cm 3 ¨cm ´3.
Figure 4 indicates the time series of the residual (see definition in Table 1) for all layers.During Period I, when calculating T e f f , about 50% of the weight comes from the soil layer 0 to 5 cm (blue line in Figure 3), while the other 50% originates from the soil layers below 5 cm (i.e., the residual, Figure 4).The soil layer at 10 cm will contribute about 30% of the weight in calculating T e f f (green line in Figure 3).The remaining 20% of the weight will be derived from the soil layers below 10 cm (red line and rest in Figure 4).Thus, the soil layers at 5 cm and 10 cm combined will contribute about 80% of the signal when estimating T e f f at the center station.If we include more layers in the calculation, more weight will be included.For example, if information from the 20 cm layer is also included, only the 5% of weight from the soil layers beneath the 20 cm layer will not be considered.The residual (see Table 1) for each layer, where residual refers to the percentage of T e f f signals from deeper layers contributing to the result, as the land surface is reached.
During Period II, the soil is frozen, which makes the soil column vertically homogenous.The dielectric behavior of frozen soil is similar to dry soil; hence, the penetrating depth increases, which means that the microwave emission reaching the surface may come from deeper soil layers, implying an increased residual for each soil layer.In Figure 4, it can be seen that in Period II all residuals are increased, compared to other periods.
During Period III, the weight from the first layer (i.e., 5 cm) may be greatly reduced, but here the 10 cm layer might contribute more than 60% (green line in Figure 3).Correspondingly, during this period, by taking 5 cm as the corresponding depth in the first layer, the residual of weight is greatly increased (e.g., to be about 80%, blue line in Figure 4).On the other hand, the residual of weight for the 10 cm layer drops dramatically, to less than 30%, since the inclusion of the 10 cm increases the upper layer's contribution in the calculation of T e f f .
To sum up, it can be inferred that considering only the weight contribution from the combination of 5 cm and 10 cm (Figures 3 and 4) is not enough to estimate T e f f accurately, as the total represented weight will be only about 80%.If the 20 cm weight is also taken into account, the total weight will be around 90%, even without considering the contribution from 40 cm, 80 cm, and 160 cm.Thus, the calculation of parameter B (i.e., the weight contribution or the residual weight) may be applied to assess how representative each site in the Maqu Network is when estimating T e f f .
It can be inferred that, in order to estimate T e f f as accurately as possible, the physical mounting depth should be designed to minimize the residual as much as possible.To provide a proof of concept calculation, the same mounting depth was used throughout the Maqu Network, with ∆x 1 " 5 cm for the first layer and ∆x 2 " 10 cm for the second layer.For each site, the residual of weight for the second layer ( 1 ś j"1 e ´Bj , the middle layer as in Table 1) is shown in blue bars, while green bars represent the third layer ( 2 ś j"1 e ´Bj , the deepest layer as in Table 1, Figure 5).The smaller residual signal in Figure 5 indicates the smaller error of each site in estimating T e f f .Figure 5 shows that the NST-04 site is relatively reliable, with an error in estimating T e f f of less than 10%.This means that, based on the 5 cm and 10 cm installation configuration, the NST-04 site can be used to calculate T e f f representatively, especially when compared to other sites in the Maqu Network (Figure 5).Similarly, the NST-09 site is found not to be suitable for calculating T e f f , as the residual weight from the other soil layers can be about 40%, thus contributing to 40% of the error when estimating T e f f .Therefore, the NST-09 site should be given less credit when looking for a spatial average.There are many ways to choose the representative sites based on statistics like the Optimal Selection of Alteration Information Extraction Method.However, for the L-band, the pixel could be tens of kilometers, which means that all sites within the Maqu Networks make a contribution to the final signal detected by SMOS/SMAP.None of the sites could be simply removed or given full credit (weight number = 1) because of the complexity of soil moisture profiles, etc.The arithmetical average is taken for this reason, usually because of a lack of physical criteria.The basic hypothesis here is that the uncertainty (residual) is related to the weight.No site is removed or perfect, rather it is given a relative reliability among all sites.To quantify this credit pCq, the use of the normalized residual signal is suggested, to be expressed as: where i represents the ith site, i = 1 to n (i.e., the total number of sites in a network); R max and R min represent the maximum and minimum residual among all sites.Then, for the Maqu Network, an averaged value of T e f f could be given by where T e f f represents T e f f calculated at the ith site with the 5 cm and 10 cm installation configuration.
It is important to use Equations ( 9) and ( 10) to obtain an area-averaged T e f f .The measurement configuration will be optimal only when the residual approaches 0. None of the sites in the Maqu Network is less than 0.2, while the larger values imply a deficiency of instrument installation in describing T e f f .Without using these two equations on average at least 20% of T e f f signals cannot be captured by the Maqu Network with this assumed sensor installation configuration (5 cm & 10 cm).This approach is generic and can be applied to other observation networks to evaluate their representativeness in estimating T e f f .measurement configuration will be optimal only when the residual approaches 0. None of the sites in the Maqu Network is less than 0.2, while the larger values imply a deficiency of instrument installation in describing eff T .Without using these two equations on average at least 20% of eff T signals cannot be captured by the Maqu Network with this assumed sensor installation configuration (5 cm & 10 cm).This approach is generic and can be applied to other observation networks to evaluate their representativeness in estimating eff T .
Figure 5.The residual for each site for the first two layers as shown in Figure 4.
Theoretically, according to Equation ( 2), eff T will vary daily and seasonally with the SM/ST profiles and the mounting depth will vary as well.Nevertheless, the mounting depths of the SM/ST sensors are fixed once they are installed.For most of the sites in the Maqu Network, 5 cm/10 cm is a default installation setting for measuring SM/ST.Whether the 5 cm/10 cm combination is optimal for estimating effective temperature has not yet been discussed.It is clear that there is no fixed depth configuration that can accommodate all variations in SM/ST throughout the year for obtaining a perfect estimation of eff T variation.However, according to the in situ installation configuration, it is possible to investigate what the optimal mounting depth is in order to approach the perfect estimate of eff T .A practical way is to find the optimal combination of sensor installations that will represent the mean SM/ST for that site.The center station will be used as an example of such an investigation.The same procedure can then be applied across the whole Maqu Network.

Test of the Optimal Mounting Depth
The section above discusses how the residual weight can be estimated by Lv's scheme.This section will describe how to determine the optimal mounting depth.Based on this discussion, the Theoretically, according to Equation (2), T e f f will vary daily and seasonally with the SM/ST profiles and the mounting depth will vary as well.Nevertheless, the mounting depths of the SM/ST sensors are fixed once they are installed.For most of the sites in the Maqu Network, 5 cm/10 cm is a default installation setting for measuring SM/ST.Whether the 5 cm/10 cm combination is optimal for estimating effective temperature has not yet been discussed.It is clear that there is no fixed depth configuration that can accommodate all variations in SM/ST throughout the year for obtaining a perfect estimation of T e f f variation.However, according to the in situ installation configuration, it is possible to investigate what the optimal mounting depth is in order to approach the perfect estimate of T e f f .A practical way is to find the optimal combination of sensor installations that will represent the mean SM/ST for that site.The center station will be used as an example of such an investigation.The same procedure can then be applied across the whole Maqu Network.

Test of the Optimal Mounting Depth
The section above discusses how the residual weight can be estimated by Lv's scheme.This section will describe how to determine the optimal mounting depth.Based on this discussion, the current installation configuration at the Maqu Network can be enhanced by adding sensors to the current configuration according Lv's scheme.
For the field experiment, it is important to have a method to arrive at the optimal installation configuration.The proposed approach in this study is as follows: firstly, a defined depth is given to the top layer (i.e., ∆x 1s ), e.g., two columns with black squares in Figure 6a, 5 cm and 10 cm; and secondly, using Equation ( 8), B 1 may be calculated and the depth for the second layer determined (i.e., ∆x 2s ).Two columns are shown to present two different cases of study: (1) the first layer at the Maqu Network is 5 cm (marked in text) and B 5 cm = 0.267 (y-axis) and assumes 20 cm/40 cm/80 cm/160 cm to be the second layer; 10 cm is not shown because ∆x ą 10 cm (Table 2) when 5 cm is the first layer; (2) the first layer at the Maqu network is 10 cm and B 0.1m = 1.268 and assumes 40 cm/80 cm/160 cm to be the second layer; 20 cm is not included for the same reason noted above.With the same technique, we acquired the positions of all other layers (indicated by asterisks with the depth information).Not all layers deeper than the assumed first layer could be the second layer because according to Equation ( 8), ∆x may be larger than the supposed second layer.For instance, when 5 cm is the first layer, ∆x " 11.8 cm which is bigger than 10 cm.It is not necessary to take 10 cm into consideration since most of its information could be represented by 5 cm.This approach has been utilized to plot Figure 2b.The existing sites already have a fixed ∆x 1 (therefore a fixed B 1s ) for the first layer.Thus, B 1 can be easily calculated here using Equation (8).The core concept is that the choice of depth combination (e.g., 5 cm and 10 cm, or 5 cm and 20 cm, etc.) reduces the residual to a minimum (e.g., ideally = 0) under the constraint of Equation (8).The data collected at the center station contain six layers of SM/ST observations, which will be used to show which two of these layers can best estimate T e f f .The similar approach can be applied at other sites in the Maqu Network.According to Equation ( 8), based on the installation depth and the associated mean SM/ST, the sensor installation configuration can be mapped on the "B vs e ´B" domain as well (Figure 6).The observations at 5 cm, 10 cm, 20 cm, 40 cm, 80 cm, and 160 cm depth at the center station were plotted in this way and pooled in Figure 6a.The B 1 curve (corresponding to ∆x) divides the soil column into two parts.The B 1s curve indicates that the ´B (y-axis) and residual for the second layer e ´B (x-axis) can be achieved if a sensor is installed at ∆x 1s , and the B 2s curve indicates ´B for ∆x 2s while the first layer determines e ´B. Figure 2b shows that the B 1s , B 1 and B 2s curves are derived directly from Equation (8), which is generic and independent of mounting depth and SM/ST information.This implies that if the sensor installation configuration is mapped in the "B vs e ´B" domain, it can have matching points with the B 1s and B 2s curves.Then the corresponding installation configuration can be deemed the optimal configuration (or the optimal mounting depths).In other words, if the sensors can be installed at mounting depths corresponding to the B 1s and B 2s lines, the residual calculated from such an installation configuration would be minimized and the mounting depths considered optimal.For the center station, two pairs of lines emerge (Figure 6a): 5 cm and 20 cm, as well as 10 cm and 40 cm.From Figure 6a it can be deduced that the second pair contributes less than 5% to the T e f f calculation (x-axis, e ´B1 ă 0.05).Therefore, the optimal mounting depths are 5 cm and 20 cm for the center station of the Maqu Network.The same strategy can be used at the other sites in the Maqu Network, which have sensors installed at 5 cm/10 cm/20 cm/40 cm or 5 cm/10 cm/20 cm /40 cm/80 cm (Figure 6b).Assuming Wilheit's scheme provides the true value, Table 3 compares the possible combinations of layer pairs at the center station for calculating T e f f .This confirms that the optimal mounting depths are 5 cm and 20 cm and that this combination performs best when estimating T e f f .The residual of the 5 cm and 10 cm combination is 0.311 which means 31.1% of the T e f f signal beneath is unknown and implies potential uncertainty.The same information is deduced from Figures 4 and 5, where the residuals of all sites are compared.Most combinations arrive at a small residual but only the 5 cm and 20 cm and the 10 cm and 40 cm combinations meet the B 2s ´B1 " 1 rule.It may be inferred that if the first layer is fixed at 10 cm, it is impossible to reach good agreement, since the representative term `1 ´e´B 1 ˘for the 10 cm layer is more than 95% with only 5% left for the second layer (i.e., the first layer is set too deep to represent the top layer information, which will lead to the underestimation of T e f f ).The SM/ST information near the surface is overwritten by the 10 cm.In this case, the accuracy of output from Lv's scheme cannot be guaranteed.To summarize, the 5 cm and 20 cm combination has proven to be the best combination used as input for Equation (3).More importantly, this realization can be deduced mathematically from Equation (8) as discussed in Section 2. To apply Wilheit's integral T e f f scheme, the minimum number of soil layers should be more than 20 [34].Therefore, an extra-intensive model investigation of SM/ST was done for the period from 12 to 18 June 2010 and was simulated using the Noah Land-surface model.The total number of layers used is 37, from the soil surface to 5 m deep.Of these layers, 25 are within the upper 1 meter, which guarantees that an accurate T e f f will be acquired as the reference value by applying Wilheit's integral equation.With these 37 layers, all combinations of two different layers are used to generate a T e f f time series by the method of exhaustion, and then the reference value is compared with RMSE, as shown in Figure 7.For instance, the blue line with squares stands for the case where the first layer is 0.25 cm.The first square from the left in this blue line is the combination of 0.25 and 0.5 cm, the second one is 0.25 and 1 cm, the third is 0.25 and 2 cm, etc.The solid line with squares stands for a fixed first layer mounting depth and the dash line with crosses for the RMSE variation along with the changing second layer mounting depth.It demonstrates that if the first mounting depth is shallow, the best T e f f estimation can only be achieved when B 2s ´B1 " 1 (for the lines from 0.25 cm to 5 cm in Figure 7).Conversely, RMSE will increase dramatically if the first mounting depth is very deep (regardless of the second mounting depth).For the period in Figure 7, the optimal combination of mounting depths is 3 cm and 14 cm because it reaches the lowest RMSE (the black line with black square stands for 3 cm and the 9th square counting from the left stands for 14 cm).Figure 7 also proves that with the appropriately configured mounting depth, T e f f accuracy can be improved by at least several Kelvins.As indicated earlier, this combination is suitable only for the simulated period.To find the optimal mounting depth at the center station for the whole year, the annual mean SM/ST values from the existing mounting layer are needed.For the Maqu Network, the same principle seen in Equation (8) shows that the second mounting depth should be around 20 to 25 cm (with different dielectric mixing models) instead of the current 10 cm, with a mean SM = 0.138 cm 3 {cm 3 and ST = 4.503 ˝C.In addition, the penetration depth/temperature sensing depth is also seen in Figure 7 because some lines (the lines with cross marks) could also reach nearly the minimum RMSE while B 1 (B 1s ) is fixed.In these lines, RMSE does not increase with B 2s .It means B 1s « 1 and the first mounting depth already accounts for most of the weight for calculating T e f f .Remote Sens. 2016, 8, 476 16 of 19 dielectric mixing models) instead of the current 10 cm, with a mean SM = 0.138 ⁄ and ST = 4.503 °C.In addition, the penetration depth/temperature sensing depth is also seen in Figure 7 because some lines (the lines with cross marks) could also reach nearly the minimum RMSE while

Discussion
Either the definition of residual or the relationship of optimal mounting depth derived in this study is based on Lv's soil effective temperature scheme and its parameter B (

Discussion
Either the definition of residual or the relationship of optimal mounting depth derived in this study is based on Lv's soil effective temperature scheme and its parameter B(B " ∆x ¨4π λ ¨ε2 2 ?ε 1 ).The essence of B is a thickness of medium where the energy is reduced to 1{e B of its original (emitted) value.This is in coincidence with the concept of the sensing or penetrating depth in remote sensing when B " 1.In this case, Wilheit's integral T e f f scheme could be expressed as T e f f " ş 8 0 T pτq de ´τ, where τ " B. The residual could be comprehended as the percentage of energy which is not captured by existing ST/SM observation when used directly in microwave remote sensing.Similarly, the optimal mounting depth is a description of the mounting depths of two sensors in view of an optical thickness that corresponds to the microwave wavelength.
The main difficulty of these two methods is due in part to the definition of parameter B which assumes the SM/ST is constant or has only a linear gradient within one fixed layer.This assumption is only an approximation and is not always true, especially when it is under extreme dry environment [35,36].The extent to which this assumption is satisfied depends on how many layers are involved.This is the same reason why Lv's multi-layer scheme approaches Wilheit's integral scheme as the number of layers increases.Furthermore, it is not surprising that in Figure 2b, if there is only one layer left then B 2s " 1 is the optimal mounting depth because 1{e is the penetration depth, also known as the soil temperature sensing depth.The soil temperature observed at the soil temperature sensing depth equals T e f f if the variation of ST is linear.
Potentially, the Lv scheme could be applied to the frozen/thaw study as well.Seen from Figure 3, the B character is strongly affected by the frozen/thaw events during the transfer period from Period I to Period II.However, the soil moisture detected by the sensors in field becomes vertical uniform values when the soil temperature drops below 0 ˝C.In this study, it seems the freeze/thaw signal could be captured by B parameter.It means the T e f f can capture the emission signal even during freezing/thawing process.For the same reason, with complex dielectric model which considers water-ice mixing case, T e f f calculation is possible and may be further applied to T B simulation at L-band over frozen ground.

Conclusions
As stated in the introduction, there are many networks around the world for calibrating and validating soil moisture retrieval from microwave satellite observations.The representativeness of these SM/ST observing networks for calculating T e f f and in particular, their in situ sensor configuration can be evaluated by the method developed in this study.The operational SMOS retrieval uses the ECMWF simulated global soil temperatures of the first layer and either the deepest or next-deepest layer, as the surface and deep temperature, respectively.Wigneron's two-layer T e f f scheme took this pair of soil temperatures as inputs with auxiliary information [11].Similarly, the SMAP retrieval assumes a homogenous soil temperature and soil moisture profile at 6:00 A.M. and 6:00 P.M. and uses the surface skin temperature and soil temperature at 0-10 cm from the MERRA-Land model as the surface and deep temperature, respectively [36,37].The arithmetical average of these two temperatures is taken as T e f f .This study illustrates how to use Lv's scheme in assessing the Maqu Network, in terms of identifying the optimal mounting depths for a representative calculation of T e f f .With better installation configurations, more accurate estimates of T e f f can be achieved and the soil moisture remote sensing product can be improved.
The main conclusions of this paper are: (1) Lv's scheme can be used to assess which site has relatively reliable and sufficient information for estimating T e f f accurately.Section 3.1 showed that on average 20% of the signal is lost with the default 5 cm/10 cm installation configuration.Relatively speaking, the NST-09 site should be given little credit and the NST-04 site more credit in calculating the spatial average T e f f value, as indicated by Equations ( 9) and ( 10); (2) Lv's scheme determined that the 5 cm/20 cm depth combination is the best configuration for the center station of the Maqu Network.
Both mathematically and physically, it is proven that the second layer should be installed at 20 cm instead of at 10 cm at the center station in the Maqu Network.A similar approach may be applied to determine the optimal installation configuration for other monitoring networks, which can contribute to a homogenized validation strategy [38] for satellite soil moisture products.By using the newly developed Lv's scheme, it will be feasible to determine which layer of soil moisture can be detected by the radiometer.This could be useful in improving the capacity of in situ monitoring networks to calibrate and validate satellite soil moisture observations over different climate zones.It can also be used to derive the actual soil depths of the retrieved soil moisture and estimate the related errors, thus providing additionally important information for applications (e.g., in data assimilation).

Figure 1 .
Figure 1.Schematic of subscripts related to individual soil layers notation.

Figure 1 .
Figure 1.Schematic of subscripts related to individual soil layers notation.

Figure 2 .
Figure 2. (a) Schematic diagram of sampling depth for λ " 21 cm.B 1s and B 2s correspond to the physical depths where the sensors shall be installed if B 1 " 1; (b) The extension of Figure 2a to any B 1 values.B 1s and B 2s are reference lines increasing with B 1 .

Figure 3 .
Figure 3.The soil moisture and rainfall time series at the center station (upper panel); the B values for the six layers using Lv's multilayer scheme (bottom panel).The time series is divided into three periods: April 2009-mid November 2009 (Period I); mid November 2009-March, 2010 (Period II); April, 2010 (Period III).

Figure 3 .
Figure 3.The soil moisture and rainfall time series at the center station (upper panel); the B values for the six layers using Lv's multilayer scheme (bottom panel).The time series is divided into three periods: April 2009-mid November 2009 (Period I); mid November 2009-March, 2010 (Period II); April, 2010 (Period III).

Figure 3 .
Figure 3.The soil moisture and rainfall time series at the center station (upper panel); the B values for the six layers using Lv's multilayer scheme (bottom panel).The time series is divided into three periods: April 2009-mid November 2009 (Period I); mid November 2009-March, 2010 (Period II); April, 2010 (Period III).

Figure 4 .
Figure 4.The residual (see Table1) for each layer, where residual refers to the percentage of eff T Figure 4.The residual (see Table1) for each layer, where residual refers to the percentage of eff T signals from deeper layers contributing to the result, as the land surface is reached.

Figure 4 .
Figure 4.The residual (see Table1) for each layer, where residual refers to the percentage of T e f f signals from deeper layers contributing to the result, as the land surface is reached.

Figure 5 .
Figure 5.The residual for each site for the first two layers as shown in Figure 4.

Figure 6 .
Figure 6.(a) An illustration of the ratios for 1 1 2 / / s s B B B at the center station when 5 cm and 10 cm are taken as the first layer in the Maqu Network.The shaded area depicts the − range used for Figure 6b; (b) The other sites in the Maqu Network with 5 cm as the first layer and assuming 20 cm/40 cm/80 cm to be the second layer.10 cm is not included for the same reason shown in the left column of Figure 6a.

Figure 6 .
Figure 6.(a) An illustration of the ratios for B 1s {B 1 {B 2s at the center station when 5 cm and 10 cm are taken as the first layer in the Maqu Network.The shaded area depicts the e ´B range used for Figure 6b; (b) The other sites in the Maqu Network with 5 cm as the first layer and assuming 20 cm/40 cm/80 cm to be the second layer.10 cm is not included for the same reason shown in the left column of Figure 6a.

1 B
( 1s B ) is fixed.In these lines, RMSE does not increase with 2s B .It means 1 1 s B ≈ and the first mounting depth already accounts for most of the weight for calculating eff T .

Figure 7 .
Figure 7. Test of the optimal mounting depth relationship with Noah 37 layers SM/ST simulation, 12-18 June 2010.The X-axis depicts 2 1 s B B − in accordance with Figure 2a during the computation, while the Y-axis is the root-mean-square-error (RMSE) against Wilheit's.The legend shows the first layer 1s x Δ in these combinations.
is not shown, since RMSE will then surpass the Y axis.

Figure 7 .
Figure 7. Test of the optimal mounting depth relationship with Noah 37 layers SM/ST simulation, 12-18 June 2010.The X-axis depicts B 2s ´B1 in accordance with Figure 2a during the computation, while the Y-axis is the root-mean-square-error (RMSE) against Wilheit's.The legend shows the first layer ∆x 1s in these combinations.∆x 1s ą 25 cm is not shown, since RMSE will then surpass the Y axis.

Table 1 .
The mathematical details regarding the concepts mentioned in this study.

Table 2 .
The explanation of the parameters used in determining the optimal mounting depths.

Table 3 .
Correlation coefficient and RMSE with available six-layer soil moisture/soil temperature profiles.