Remote Sensing Agricultural Soil Alkalinity and Salinity Modeling in the Cropping Season in a Spectral Endmember Space of Tm in Temperate Drylands, Minqin, China

This paper presents the potential of the four-image spectral endmember (EM) space comprising sand (SL), green vegetation (GV), saline land (SA), and dark materials (DA), unmixed from Landsat TM/ETM+ to map dryland agricultural soil alkalinity and salinity (i.e., soil alkalinity (pH) and soil electrical conductivity (EC)) in the shallow root zone (0–20 cm) using partial least squares regression (PLSR) and an artificial neural network (ANN). The results reveal that SA, SL, and GV fractions at the subpixel level, and land surface temperature (LST) are necessary independent variables for soil EC modeling in Minqin Oasis, a temperate-arid system in China. The R 2 (coefficient of determination) of the optimized parameters with the ANN model was 0.79, the root mean squared error (RMSE) was 0.13, and the ratio of prediction to deviation (RPD) was 1.95 when evaluated against all sampled data. In addition to the aforementioned four variables, the DA fraction and the recent historical SA fraction (SAH) in the spring dry season in 2008 were also helpful for soil pH modeling. The model performance is R 2 = 0.76, RMSE = 0.24, and RPD = 1.96 for all sampled data. In summary, the stable EMs and LST space of TM imagery with an ANN approach can generate near-real-time regional soil alkalinity and salinity estimations in the cropping period. This is the case even in the critical agronomic range (EC of 0–20 dS·m −1 and pH of 7–9) at which researchers and policy-makers require near-real-time crop management information.


Introduction
Soil alkalinity and salinity is one of the most serious causes of desertification [1,2], and it is a major environmental hazard in dryland systems [3][4][5].Under the increasing population pressure, soil sodification and salinization hazards will increase with expanded irrigation system use [3].Usually, soil alkalinity and salinity in the shallow root zone (0-20 cm) are major threats for crop emergence and seedling growth in dryland oases [6].Furthermore, the effects of alkalinity and salinity on plant productivity are not linear, and they interact with other stress factors (such as soil erosion, drought, and nutrient loss) in the root zone [7,8].Thus, a rapidly responding and cost-effective technique is required for the assessment of regional soil alkalinity and salinity dynamics and status in the root zone for adaptive crop management in a dryland oasis.
To date, a large variety of remote sensing techniques have been applied for identifying and monitoring salt-affected areas, including multispectral, microwave, hyperspectral, and high-spatial multispectral images [1].There is renewed interest in the use of broadband multispectral imagery for regional soil salt content prediction (e.g., [4,5,[9][10][11][12][13][14][15][16]).Generally, these studies include (1) the bands' selection and investigation of spectral indices, including salinity indices and vegetation indices (e.g., [12,13]); (2) the optimized combination of bands and spectral indices for modeling, such as [4,5,14]; and (3) the long-term soil salinity dynamic analysis, such as in References [15,16].Current studies focus mainly on surface saline spectral features in areas with little or no vegetation in the dry season.The results indicate that the various spectral reflectance combinations (indices) and methods performed inconsistently [11].Some multispectral studies have also combined soil environmental covariates in agricultural soil salinity assessment models (e.g., [10,17]).However, the contribution of individual covariates can be inconsistent [18], partly due to the mismatch of spatio-temporal resolution and the quality of ancillary data [10].In addition, the thermal band of TM imagery-as an important indicator for water-energy balance is rarely applied to the quantitative modeling of soil alkalinity and salinity.Therefore, it is necessary to fully explore the potential of current broadband remote sensing (e.g., the Landsat series) in modeling cropland alkalinity and salinity dynamics and status during the cropping period.
The generic SVD (i.e., Soils and substrates, Vegetation, and Dark surface endmembers) model has been developed in the spectral mixing spaces of TM/ETM+ [19,20], providing the relative proportion of SVD at the subpixel level.The SVD, however, is needed to calibrate in the different geographic/ecological areas for local basic cover types [21].The specific SVD four-endmember model (i.e., SL, sand land; SA, saline land; GV, green vegetation; and DA, dark materials) in the multidate TM imagery was validated and applied to intra-annual cover classification and inter-annual change detection in the temperate dryland system of Minqin, China [21,22].In the stable SVD space, land cover can be identified without the known training areas; moreover, EMs (spectral endmembers) are easily validated and converted into the typical biophysical attributes [23][24][25].The potential of SVD space unmixing from TM imagery has not yet been evaluated for agricultural soil alkalinity/salinity quantitative modeling in cropping periods.
Partial least squares regression (PLSR) and artificial neural networks (ANN) are both popular approaches for the quantification of such relations.Soil salinity and soil reflectance can be approximated by a linear function in the absence of the disturbance of other environmental variables, and thus, there is a large similarity between the two approaches, and PLSR has the advantage of relative simplicity compared to ANN [26].The advantage of ANN is its nonlinear quantification of complex processes at the regional scale, and it was successfully used to estimate the spatial distribution of soil salinity based on complex environmental parameters (terrain indices, distance to drains, and long-term groundwater observation data) with ETM+ data (vegetation index, the raw bands 3 and 5) [17].In addition, the stable and easy-to-interpret SVD can possibly reduce the shortcomings of ANN as a "black box" approach [27], and its requirement of large training samples [28].
Therefore, in this paper, we attempted SVD with broadband imagery (e.g., TM/ETM+) as a generic procedure to empirically retrieve regional soil alkalinity and salinity data in the shallow root zone (0-20 cm) in the cropping season.The specific objectives were (1) to investigate the ability of SVD and LST (land surface temperature) to describe the process between crop growth and solute transport in the shallow root zone during the crop-growing period; and (2) to assess the potential of PLSR and ANN in modeling agricultural soil salinity and alkalinity in the EMs and LST space.

Study Area and in Situ Measurements
The temperate-arid dryland system of Minqin County is a 2000-year-old desert oasis located in the northeastern region of the Hexi Corridor, on the lower reaches of the Shiyang River in Gansu Province, Northwest China (38 • 03 N to 39 • 28 N, 101 • 49 E to 104 • 12 E, Figure 1).As an inland dryland system at the bottom of the inland Shiyang River, playas have developed (i.e., dry salt lake indicated in Figure 1), at local depressions or in foot areas of local residual hills (i.e., salinized sandland in Figure 1).
Excessive and unreasonable irrigation with low-quality groundwater is the major cause of secondary sodification and salinization in the oasis [29,30].Based on the 2008 dryland use map [21], we sampled 38 field sites during 7-15 September 2010, in agricultural areas (Figure 1).Each site was a 30 × 30 m 2 plot around the center point, and was further divided into four subplots to sample soil in the shallow root zone (0-20 cm).EC (soil electrical conductivity) and pH (soil reaction) were used to indicate the regional salinity and alkalinity, respectively.The EC was determined in a 1:5 soil-water aqueous extract with a model DDS-307 conductivity meter (REX Instrument Factory, Shanghai, China).Soil pH was determined using a PHS-3C pH meter (REX Instrument Factory, Shanghai, China) by mixing 10 g soil with 25 mL 0.01 M Cacl2 solution following the ISSCAS standard methods [31].

Remote Sensing Data and Preprocessing
An outline of the data processing flow is shown in Figure 2. Two Landsat 5 TM scenes (path 131, row 33, and path 132, row 33) cover the study area, and the 16 day temporal resolution means that it is difficult to identify sufficient cloud-free TM images that correspond to phenological events.Furthermore, remote sensing data were not available for the dates on which field data were obtained.One image (131/33) was collected on 15 July 2010, and the other (132/33) on 23 August 2010 (about two months and one month in advance of the fieldwork dates, respectively), and they belong to the same phenology.The image (131/33) includes a few clouds in the northeastern areas, but these parts were not used for modeling.
The geometrically-rectified 2008 images [21] were used as references to co-register and correct the TM images in 2010, and the average half-pixel image-to-image co-registration accuracy was achieved.These images were atmospherically corrected using the FLAASH (Fast Line-of-sight Atmospheric Analysis of Hypercubes) module of the ENVI 4.8 software for radiance to reflectance conversion, according to the corresponding image information.
A fully constrained linear spectral mixing analysis (LSMA) with the 2008 EM spectra set from Sun and Liu [21] was performed based on 2010 images to generate the corresponding EM fractions(Figure 3a-d).A full description of the dimensional mixing space and EMs selection in the study area was given in Sun and Liu [21].The mean overall root mean squared error (RMSE) and its Based on the 2008 dryland use map [21], we sampled 38 field sites during 7-15 September 2010, in agricultural areas (Figure 1).Each site was a 30 × 30 m 2 plot around the center point, and was further divided into four subplots to sample soil in the shallow root zone (0-20 cm).EC (soil electrical conductivity) and pH (soil reaction) were used to indicate the regional salinity and alkalinity, respectively.The EC was determined in a 1:5 soil-water aqueous extract with a model DDS-307 conductivity meter (REX Instrument Factory, Shanghai, China).Soil pH was determined using a PHS-3C pH meter (REX Instrument Factory, Shanghai, China) by mixing 10 g soil with 25 mL 0.01 M Cacl 2 solution following the ISSCAS standard methods [31].

Remote Sensing Data and Preprocessing
An outline of the data processing flow is shown in Figure 2. Two Landsat 5 TM scenes (path 131, row 33, and path 132, row 33) cover the study area, and the 16 day temporal resolution means that it is difficult to identify sufficient cloud-free TM images that correspond to phenological events.Furthermore, remote sensing data were not available for the dates on which field data were obtained.One image (131/33) was collected on 15 July 2010, and the other (132/33) on 23 August 2010 (about two months and one month in advance of the fieldwork dates, respectively), and they belong to the same phenology.The image (131/33) includes a few clouds in the northeastern areas, but these parts were not used for modeling.
The geometrically-rectified 2008 images [21] were used as references to co-register and correct the TM images in 2010, and the average half-pixel image-to-image co-registration accuracy was achieved.These images were atmospherically corrected using the FLAASH (Fast Line-of-sight Atmospheric Analysis of Hypercubes) module of the ENVI 4.8 software for radiance to reflectance conversion, according to the corresponding image information.
A fully constrained linear spectral mixing analysis (LSMA) with the 2008 EM spectra set from Sun and Liu [21] was performed based on 2010 images to generate the corresponding EM fractions(Figure 3a-d).A full description of the dimensional mixing space and EMs selection in the study area was given in Sun and Liu [21].The mean overall root mean squared error (RMSE) and its standard deviation for the two scenes were 0.015 and 0.003, less than 0.02, respectively, suggesting a generally good fit according to the study of Wu and Murray [32].
LST was derived from thermal band of Landsat TM data.The digital grey values of the thermal band were first converted to physical measurements of at-sensor radiance following the Landsat User's Handbook [33].The radiance data were then transformed into radiant surface temperatures using the thermal calibration constants supplied by Mtakham and Barker [34], as follows: where Ts is the radiant surface temperature ( • C), and ) are the pre-launch calibration constants (607.76 and 1260.56,respectively, for TM).
According to Wukelic et al. [35] and Washburne [36], the satellite temperatures derived from Landsat thermal band data are good approximations (within 1-3 • C) of ground temperature values on clear days.Further calibration requires local soil and vegetation information to estimate emissivity [37,38].Therefore, we did not apply further complex corrections, and adopted Ts directly as the surrogate of LST (Figure 3e).standard deviation for the two scenes were 0.015 and 0.003, less than 0.02, respectively, suggesting a generally good fit according to the study of Wu and Murray [32].
LST was derived from thermal band of Landsat TM data.The digital grey values of the thermal band were first converted to physical measurements of at-sensor radiance following the Landsat User's Handbook [33].The radiance data were then transformed into radiant surface temperatures using the thermal calibration constants supplied by Mtakham and Barker [34], as follows: where Ts is the radiant surface temperature (°C), and K1 and K2 (m W•cm −2 •sr −1 •µm −1 ) are the pre-launch calibration constants (607.76 and 1260.56,respectively, for TM).
According to Wukelic et al. [35] and Washburne [36], the satellite temperatures derived from Landsat thermal band data are good approximations (within 1-3 °C) of ground temperature values on clear days.Further calibration requires local soil and vegetation information to estimate emissivity [37,38].Therefore, we did not apply further complex corrections, and adopted Ts directly as the surrogate of LST (Figure 3e).  1 The 2008 images are geometrically rectified in Reference [21]. 2 , 3 The 2008 EM spectra and historical SA fraction are derived from 2008 images in Reference [21].ANN: artificial neural network; EM: spectral endmember; LST: land surface temperature; PLSR: partial least squares regression; SA: saline land.
The SA fraction in May 2008 [21]-as the historical SA information representing the potential salt cover with the less/no crops in the dry season-was used to verify the site legacy effect on salt movement due to cultivation in the study area(Figure 3f).In this study, the four nearest-neighboring pixels to the sites were selected, and the EMs' abundance was averaged for use in linking with the field measurements.The LSTs with a 120 × 120 m pixel size in the TM5 imagery were extracted from the nearest-neighboring pixel to the site location.  The 2008 images are geometrically rectified in Reference [21]; 2,3 The 2008 EM spectra and historical SA fraction are derived from 2008 images in Reference [21].ANN: artificial neural network; EM: spectral endmember; LST: land surface temperature; PLSR: partial least squares regression; SA: saline land.
The SA fraction in May 2008 [21]-as the historical SA information representing the potential salt cover with the less/no crops in the dry season-was used to verify the site legacy effect on salt movement due to cultivation in the study area(Figure 3f).In this study, the four nearest-neighboring pixels to the sites were selected, and the EMs' abundance was averaged for use in linking with the field measurements.The LSTs with a 120 × 120 m pixel size in the TM5 imagery were extracted from the nearest-neighboring pixel to the site location.

SVD and LST for Solute Transport Process
Agricultural alkalization and salinization often appeared homogenous at the field patch level, through long-term irrigation and similar agricultural practices in the dryland oasis [39].In addition, the iterative leaching down and moving up of the cation and anion components with irrigation activities is the most important mechanism of secondary alkalization in dryland soil [40].Thereby, agricultural soil alkalinity and salinity in the root zone are dynamic properties associated with the processes of crop growing, solute transport, irrigation, and their interactions in many semi-arid and arid ecosystems.
Aside from salt's osmotic effect on crops with drought-like symptoms [41], soil alkalinity also causes degradation of the soil structure and soil dispersion [42], and thus reduces soil nutrients and water use efficiency for crops.The four-endmember SVD space describes the possible consequences or environmental condition if salts in the root zone impose stress at a sub-30 m resolution.SA is conceptually simple, referring to salt exposure in a pixel, thereby indicating alkalinity and salinity status, with little or no vegetation in the dry season condition.A simple correlation analysis was adopted to depict the variables' behaviors.

SVD and LST for Solute Transport Process
Agricultural alkalization and salinization often appeared homogenous at the field patch level, through long-term irrigation and similar agricultural practices in the dryland oasis [39].In addition, the iterative leaching down and moving up of the cation and anion components with irrigation activities is the most important mechanism of secondary alkalization in dryland soil [40].Thereby, agricultural soil alkalinity and salinity in the root zone are dynamic properties associated with the processes of crop growing, solute transport, irrigation, and their interactions in many semi-arid and arid ecosystems.
Aside from salt's osmotic effect on crops with drought-like symptoms [41], soil alkalinity also causes degradation of the soil structure and soil dispersion [42], and thus reduces soil nutrients and water use efficiency for crops.The four-endmember SVD space describes the possible consequences or environmental condition if salts in the root zone impose stress at a sub-30 m resolution.SA is conceptually simple, referring to salt exposure in a pixel, thereby indicating alkalinity and salinity status, with little or no vegetation in the dry season condition.A simple correlation analysis was adopted to depict the variables' behaviors.

PLSR and ANN Parameters and Calculation
To quantify PLSR and ANN performance in soil alkalinity and salinity modeling, we calculated diverse parameters between estimated values from models and ground measurements.The root mean squared error (RMSE) indicates absolute estimation errors, whereas the coefficient of determination (R 2 ) and ratio of prediction to deviation (RPD) represent the strength of the statistical correlation between measured and predicted values.A predictive model is accurate if R 2 and RPD values are higher than 0.91 and 2.5, respectively.An R 2 between 0.82 and 0.9 and an RPD higher than 2 indicate a good prediction, whereas the R 2 of an approximate prediction is considered to lie between 0.66 and 0.81 with an RPD of higher than 1.5.An R 2 between 0.5 and 0.65 indicates a poor relationship [26].
In this study, the PLSR modeling was performed using SAS 9.2 software.One-at-a-time cross-validation was used to test the predictive performance of the models with different extracted factors, and the model with the minimum root mean square predicted residual error (RMSEP) was chosen as the calibration model (SAS/STAT(R) 2010).A multiple-layer, feed-forward back-propagation network was used, including input, hidden, and output layers.This type of network generally provides better performance in comparison to other network types [43], and is the most popular neural network structure in ecological modeling and soil science [44,45].Using a tan-sigmoid function in the hidden layer allows us to approximate nonlinear relations between input and output layers [46].The range of the appropriate number of neurons in a hidden layer is from (2n1/2 + m) to (2n + 1), where m refers to the number of output nodes and n to the number of input nodes [47].Within the range, the number of neurons in the hidden layer was systematically changed during the process of training the networks, and the most appropriate neurons were selected based on the resulting performance.
The Neural Network Toolbox of MATLAB was used for ANN modeling.The sample dataset was randomly divided into three subsets through several attempts-one subset for training (26 samples), one for validation (six samples), and another for testing (six samples)-the EC statistics of each subset were as close to those of the whole samples as possible (Tables 1 and 2).The Levenberg-Marquardt algorithm was applied to the network training, and only the training data subset was used to adjust the network according to the given error.During the training, the error associated with the validation data subset was monitored.Training was halted when the validation error increased above a specified number of iterations (six times in this study) since the last time it decreased.To avoid the over-fitting problem, the "early stopping" approach was used to halt the training stop process.The testing data subset had no effect on training, and thus its simulation error provided an independent measure of the network performance during and after training.

The Basic Description of Independent and Dependent Variables in the Study Area
Table 2 shows that pH values from all of the 38 sample locations were >7.The highest soil pH was 9.89, and the mean value was 8.58, indicating a strong alkalization process.The soil maximum EC was 1.43 dS•m −1 (1:5 soil-water extraction), which converted to its saturated paste extract EC (i.e., 8.58 dS•m −1 ), and belonged to moderate salinization.However, the mean value of 0.37 dS•m −1 was non-saline soil in the shallow root zone.
Figure 3a-d lists the calculated fractions of the four EMs, indicating the amount and spatial distribution of surface characteristics.It is difficult to notice clouds on the GV fraction image (Figure 3c), whereas they are clear in the SL fraction (Figure 3a) and DA fraction (Figure 3d) images.The different dates of the neighbor scenes have some influence on the fraction values (e.g., the SL fraction image, Figure 3a).Figure 3e  Table 3 shows that the historical SA fraction (SAH) in the 2008 spring season had a 0.1 level significant positive correlation with the soil EC, and a 0.1 level negative correlation with the soil pH, respectively, in the 2010 crop season.In addition, the SAH had a 0.05 level positive correlation with the SA fraction in the 2010 crop season.These results validate SA in the dry season condition with little or no vegetation, and can indicate alkalinity and salinity information, and a certain legacy effect existing on salt movement in dryland agricultural soil.As a result of agricultural practices such as irrigation and/or crop shading, there is often no clear or only partial evidence of salt crust or similar features on the soil surface in the cropping season.As such, the SA fraction in 2010 had no significant correlation with the soil EC or pH.
Clearly, the hidden information can possibly be compensated from other EMs.For example, GV (indicating crop vigor) is the response of the soil quality and agricultural practices.The illuminated SL fraction provides some soil structure condition for solute transport in the soil profile.DA-the sensed surface brightness (determined by dry plant and crop residues, shade, soil dispersion)-also gives some environmental condition information.LST, as an important solute transportation driver, is also a response of the cropland environment.Thus, LST had a significant positive correlation (0.05 level) with the soil EC, and a 0.01 level significant correlation with SL and GV (Table 2).However, these basic cover fractions, due to representing parts of the condition of the cropland individually, were not significantly correlated with soil alkalinity and salinity (i.e., pH and EC) in the shallow root zone, respectively (Table 3).They should thus be integrated together for solute transport modeling.

PLSR Models for Soil Salinity and Alkalinity Estimation
Figure 4 indicates that a single factor provided the minimum RMSEP for pH estimation, and four factors for EC prediction.Table 4 provides the performance statistics of the developed PLSR calibration models, the standardized coefficients of predictor variables in each model, and the results of the bootstrap tests.For pH, only 11% of the variability was explained by the first factor, and the PLSR model of EC performed a little better, with 36% of the variability being explained by the first four latent variables.As the historical saline land fraction, SAH in the 2008 spring season was a notable variable in the pH model, with a relatively large and significant standardized coefficient at the 0.1 level (Table 4).In the EC model, only SA failed to pass the bootstrap tests at different significance levels, implying that SL, GV, DA, SAH, and LST all make some linear contribution to soil EC estimation (Table 4).The poor explanatory abilities of the PLSR-calibration models for pH and EC suggest that the linear model cannot accurately describe the complex relationship among these variables in crop growing.

PLSR Models for Soil Salinity and Alkalinity Estimation
Figure 4 indicates that a single factor provided the minimum RMSEP for pH estimation, and four factors for EC prediction.Table 4 provides the performance statistics of the developed PLSR calibration models, the standardized coefficients of predictor variables in each model, and the results of the bootstrap tests.For pH, only 11% of the variability was explained by the first factor, and the PLSR model of EC performed a little better, with 36% of the variability being explained by the first four latent variables.As the historical saline land fraction, SAH in the 2008 spring season was a notable variable in the pH model, with a relatively large and significant standardized coefficient at the 0.1 level (Table 4).In the EC model, only SA failed to pass the bootstrap tests at different significance levels, implying that SL, GV, DA, SAH, and LST all make some linear contribution to soil EC estimation (Table 4).The poor explanatory abilities of the PLSR-calibration models for pH and EC suggest that the linear model cannot accurately describe the complex relationship among these variables in crop growing.

Neural Network Models for Salinity and Alkalinity Estimation
First, all six predictor variables were sent into the input layer.Within the range of nodes recommended, the neuron number in the hidden layer was adjusted to improve the network structure for soil salinity and alkalinity estimation.Compared with the PLSR models, ANN models improved R 2 from 0.11 to 0.76 and from 0.36 to 0.66 for pH and EC, respectively.Simultaneously, RMSE values decreased from 0.46 to 0.24 and from 0.23 to 0.17 for pH and EC, respectively, as shown in Tables 4-6.
We then eliminated one variable from the predictor sets at a single observation time-based on its linear contribution in the PLSR models-to determine the optimized surface parameters and network structure.The elimination of any predictor variable from the pH ANN model reduced the prediction accuracy (Table 5).The model without LST decreased the most (R 2 from 0.76 to 0.68).The DA also played an important role in the nonlinear model (R 2 from 0.76 to 0.71, Table 5)."-" means the elimination of one or some predictor variables from ALL. R 2 and RMSE of the optimal hidden-layer neuron number for each predictor variables set are bolded.
However, the variable elimination can improve the prediction ability of ANN for soil EC.The model without DA increased to the greatest accuracy (R 2 from 0.66 to 0.74 and RMSE from 0.17 to 0.15), and the model excluding SAH came second (Table 6).For the five predictor variables (without DA) to produce the best nonlinear fit for EC, we continued to exclude each one in turn to determine whether there was any associated improvement in prediction accuracy.We found that the removal of any variable except SAH reduced the precision of the model fit (Table 6).Thus, the combination of SL, SA, GV, and LST (with the removal of DA and SAH) improved the most (R 2 from 0.66 to 0.79).There was no further improvement of the model fit from subsequent exclusion experiments.As shown for the estimation of pH, the elimination of LST from the SL, SA, GV, and LST combination also produced a sharp reduction of the EC prediction accuracy (R 2 decreased from 0.79 to 0.65 and RMSE increased from 0.13 to 0.17, Table 6).
The above results give the final optimal ANN for soil pH prediction in terms of R 2 (0.76) and RMSE (0.24) values when the input layer is composed of all the predictor variables and the hidden layer contains six neurons.For soil EC prediction, the network with SL, SA, GV, and LST in the input layer and nine neurons in the hidden layer achieved the highest R 2 (0.79) and the smallest RMSE (0.13) values.Figure 5 shows that both the pH and EC ANN models learn quickly, and their respective MSE (mean squared error) values decrease sharply with respect to the training samples to below 0.01 after only eight epochs.The MSE values of validation samples and test samples became stable only after 15 and 12 epochs for pH and EC ANNs, respectively.The R 2 and RMSE values of the training and test datasets shown in Table 7 indicate that the optimal models of pH and EC learned quite well and had good generalization abilities.However, the performance parameters associated with the validation dataset were poor, which may be due to the different data distributions of the validation dataset and the training dataset (e.g., minimum values and standard deviation, Table 1).For the whole dataset, the pH and EC ANN models both have the approximate prediction abilities (R 2 values higher than 0.66 and RPD values higher than 1.5), which is also supported by similar RMSE values (Table 7).

Soil Alkalinity and Salinity Mapping and Classification
The optimized ANN models for soil pH and EC predictions were applied to get the contents and classes of soil alkalinity and salinity, respectively, in the 2010 crop season (Figure 6).Commonly, salinity and alkalinity are discriminated following the U.S. Salinity Laboratory (1954) [48], which is highly cited by remote sensing scientists working on soil salinity classification.The predicted EC (1:5 soil-water extraction) was converted to its saturated paste extract EC using a rough multiplying coefficient of 6 [49].We identified five alkalinity classes according to the simulated pH (<7, non-alkaline; 7-8, slight; 8-8.5, moderate; 8.5-9, strong; >9, very strong, Figure 6c), and three salinity classes were determined using the saturated paste extract EC ranges in cropland areas (EC: 0-4, non-saline; 4-8, slight; 8-16, moderate; Figure 6d).Results indicated that 33.44% and 35.96% of the agricultural soil was under moderate and strong alkalization, respectively.Furthermore, 15.22% of the agricultural soil had very strong alkalization in the root zone (0-20 cm).In contrast, 91.93% of the agricultural soil was non-saline, and only 7.27% of the agricultural soil was subject to slight salinization (Table 8).

Agricultural Soil Alkalinity and Salinity Modeling in the Study Area
In this study, the optimized physically-based surface parameters can be identified by the ANN with a series of experiments.The LST, GV, SL, and SA together in both EC and pH models indicate that they can measure the complex cropland environment (in dryland oasis, sandy loam is the major soil) for solute transport processes in the cropping season.
Figure 7a shows the distribution of EC samples in the (LST, GV) feature space; clearly there was a linear trend when GV was below 0.3 in the (LST, GV) space, and thus the samples were divided into two subparts.One group with GV below 0.3 (21 sites) included the samples with the crops harvested, such as winter wheat fields, or those in the final growth phase with sparse green leaves.Figure 7b shows the environment under sparse or no green vegetation, where LST and SL determined the transport of salt towards the surface.When SL crosses over a threshold, the soil surface would quickly dry to form a crust sealing the surface, and thus there is reduced vertical transport of salt towards the soil surface.Therefore, standardized coefficients of LST and SL were 1.04 and −0.60, respectively, and they were linearly combined to depict the solute transportation with R 2 = 0.53.However, the other high-GV fraction groups (17 sites) had no explicitly linear relation between EC and the independent variables, partly because EC (low average content in the 0-20 cm in the study area) was not a major stress for crops (Tables 2 and 8), or the complex mixture between soil evaporation and crop transpiration drove salt accumulation.The mixture is common in most crop-growing periods; therefore, nonlinear models (such as ANN) can better simulate and predict the diverse patterns of EC and pH.LST, as an important energy indicator, can provide surface biophysical process information.However, the coarse spatial resolution influences the wide application of the TM thermal band, compared to the visible-infrared bands.ANN has a certain advantage for fusion of the different resolution parameters, and multi-scale modeling is still a major challenge for the different spatiotemporal resolution data.

Agricultural Soil Alkalinity and Salinity Modeling in the Study Area
In this study, the optimized physically-based surface parameters can be identified by the ANN with a series of experiments.The LST, GV, SL, and SA together in both EC and pH models indicate that they can measure the complex cropland environment (in dryland oasis, sandy loam is the major soil) for solute transport processes in the cropping season.
Figure 7a shows the distribution of EC samples in the (LST, GV) feature space; clearly there was a linear trend when GV was below 0.3 in the (LST, GV) space, and thus the samples were divided into two subparts.One group with GV below 0.3 (21 sites) included the samples with the crops harvested, such as winter wheat fields, or those in the final growth phase with sparse green leaves.Figure 7b shows the environment under sparse or no green vegetation, where LST and SL determined the transport of salt towards the surface.When SL crosses over a threshold, the soil surface would quickly dry to form a crust sealing the surface, and thus there is reduced vertical transport of salt towards the soil surface.Therefore, standardized coefficients of LST and SL were 1.04 and −0.60, respectively, and they were linearly combined to depict the solute transportation with R 2 = 0.53.However, the other high-GV fraction groups (17 sites) had no explicitly linear relation between EC and the independent variables, partly because EC (low average content in the 0-20 cm in the study area) was not a major stress for crops (Tables 2 and 8), or the complex mixture between soil evaporation and crop transpiration drove salt accumulation.The mixture is common in most crop-growing periods; therefore, nonlinear models (such as ANN) can better simulate and predict the diverse patterns of EC and pH.LST, as an important energy indicator, can provide surface biophysical process information.However, the coarse spatial resolution influences the wide application of the TM thermal band, compared to the visible-infrared bands.ANN has a certain advantage for fusion of the different resolution parameters, and multi-scale modeling is still a major challenge for the different spatiotemporal resolution data.Soil pH is determined by the excessive amount of sodium (Na + ) adsorbed at the cation exchange sites.Thus, with the low salt content in the study area, a strong secondary alkalization process still took place, in which the more soluble Na + ions could easily enrich in the root zone, whereas Ca 2+ was stabilized as gypsum and calcium carbonate in the subsoil by the iterative leaching down and moving up with irrigation activities and strong evapotranspiration processes.Thus, the contrast between historical SAH in the dry season and current sensed SA can reflect the redistribution information of cations, and thus SAH entered in the pH model prediction.Soil EC-indicating the degree of dissolving salts in the soil-always varies seasonally.However, SAH-revealing the site legacy effect on the movement of salts-did not enter the EC ANN model to avoid content redundancy.DA (the mixture of dark materials), which may provide some soil Soil pH is determined by the excessive amount of sodium (Na + ) adsorbed at the cation exchange sites.Thus, with the low salt content in the study area, a strong secondary alkalization process still took place, in which the more soluble Na + ions could easily enrich in the root zone, whereas Ca 2+ was stabilized as gypsum and calcium carbonate in the subsoil by the iterative leaching down and moving up with irrigation activities and strong evapotranspiration processes.Thus, the contrast between historical SAH in the dry season and current sensed SA can reflect the redistribution information of cations, and thus SAH entered in the pH model prediction.Soil EC-indicating the degree of dissolving salts in the soil-always varies seasonally.However, SAH-revealing the site legacy effect on the movement of salts-did not enter the EC ANN model to avoid content redundancy.DA (the mixture of dark materials), which may provide some soil dispersion information by alkalization process, was also kept in the pH ANN model.The use of SAH and DA in the pH ANN model indicates the difference between the alkalization and salinization processes in the study area.
The stored salts were leached down or out, and surface salinization was ameliorated in the oasis, since the integrated restoration project began in 2005 with about triple the surface water coming into the oasis, instead of groundwater for irrigation [22].The current study further indicates that an excessive amount of sodium (Na + ) can remain within the shallow root zone.The adequate residue retention, mulching, and no-tillage management can reduce soil evaporation, and thus decrease the vertical transport of sodium towards the surface.The 2009 field sampling (90 sites for agricultural land) indicated soil-conservation activities have been adopted; for example, 54.2% crop residue retention, 84.6% no-tillage, and 82.1% plastic mulch in the study area [21].In addition, deep ploughing and incorporation of calcium supplements (usually gypsum) into the soil are also necessary to leach the excess sodium after several years of continuous cropping.Therefore, the maps of the predicted pH and EC content and classes (Figure 6) can be used to help farmers adjust crop types and conduct remedial measures to reduce soil alkalinity.

Potential of TM SVD for Regional Agricultural Soil Salinity and Alkalinity Mapping in the Root Zone
The ability of remote sensing data to capture soil-plant relationships changes with salinity and alkalinity, according to geographical context (e.g., References [4,11]).Due to the relatively homogenous alkalinity and salinity with the larger field patch in the dryland oasis, the generic and flexible SVD of TM/ETM+ provides an effective scene model of solute transport for near-real-time dryland crop-salts management.In addition, this method is independent of non-remote sensing covariates, which are often associated with mismatch problems in soil alkalinity and salinity modeling at different scales (e.g., Reference [10]).
In this study, ANN, combined with EMs and LST spatial data, can approximately predict agricultural soil salinity and alkalinity in the shallow root zone (0-20 cm). Figure 5 indicates that the two performance curves of validation and testing deviated from each other, possibly due to some bias between the test and validation samples (Table 1).In this situation, Atzberger and others [50] provide a better learning or training method with a jackknife approach to ensure the stability of the results.This is possible even with a low sampling density of soil pH, EC, and even with a certain different date range between the ground measurements and satellite imagery recording.However, crop species have various rooting patterns, particularly in high growth periods when the major roots can reach over 30 cm in depth.In future studies, we will sample additional sites at 0.20 m depth increments down to 1.5 m, and adopt the recursive jackknife approach developed by Atzberger and others [50] to further optimize the developed approach at different soil depths in different dryland irrigation systems.
Compared to the ANN approach, solute transport modeling is an explicit process-based approach for the prediction of salt distribution and movement in the soil profile in space and time.However, solute modeling is often limited to the points for which field data can be collected [1]; SVD and LST of TM imagery can provide the necessary regional surface parameters for the dynamics of crop growth and solute transport as landscape processes for integrated modeling in a geographical information system environment.In light of this, the proposed integrated approach should be connected to "slow" external drivers resulting from processes at a wider scale (i.e., irrigation infrastructure, terrain and landscape function, land use, and climate change) relating to persistent changes in the dryland agricultural system.

Conclusions
Soil alkalinity and salinity is one of the most serious hazards for dryland irrigation agriculture.This paper demonstrated for the first time that SVD and LST can effectively describe the solute process in the shallow root zone during crop-growing periods with broadband imagery (e.g., TM/ETM+), and that an ANN can quantify their nonlinear relationship for near-real-time dryland crop management.
In the study area, the R 2 of the ANN pH and EC model performance was 0.76 and 0.79 for all sampled data, respectively.In recent years, with about triple the surface water instead of groundwater for irrigation, soil salinization has been ameliorated, and 91.93% of the agricultural soil was non-saline, whereas alkalization was the major threat, with 33.44%, 35.96%, and 15.22% of the agricultural soil under moderate, strong, and very strong alkalization in the root zone, respectively, in the July-August 2010 cropping period.The developed SVD-LST framework of TM imagery has the potential to monitor dryland alkalinity and salinity with good land management practices for crop production and environmental health.

Figure 1 .
Figure 1.The 2008 land use and 2010 field sites in the study area.

Figure 1 .
Figure 1.The 2008 land use and 2010 field sites in the study area.

Figure 2 .
Figure 2.Flowchart of methods used in the study.1The 2008 images are geometrically rectified in Reference[21].2 , 3 The 2008 EM spectra and historical SA fraction are derived from 2008 images in Reference[21].ANN: artificial neural network; EM: spectral endmember; LST: land surface temperature; PLSR: partial least squares regression; SA: saline land.
shows the spatial variability of LST in the study area; LST at the left 132/33 scene (on 23 August 2010) is generally lower than that at the right 131/33 scene (on 15 July 2010).The difference between the neighboring croplands, however, is small.Based on the local meteorological observations (38.63 • N, 103.08 • E; 1367.5 m elevation), the canopy temperature at the local time of the Landsat passing was 28.7 • C on 15 July and 27.4 • C on 23 August 2010, respectively.The corresponding estimated LSTs from TM were 31.8• C and 30.1 • C, respectively, both approximately 3 • C higher than the ground direct observation values.

Figure 5 .
Figure 5. Mean squared error (MSE) curve of (a) pH and (b) EC optimal ANN model at the phase of learning for training, validation, and test datasets, respectively.

N
= neuron number of hidden layer, Tra.= training, Val.= validation.The unit of RMSE for EC is dS•m -1 .

Figure 5 .
Figure 5. Mean squared error (MSE) curve of (a) pH and (b) EC optimal ANN model at the phase of learning for training, validation, and test datasets, respectively.

( a )
Predicted pH of agricultural land.(b) Predicted saturated paste extract EC (dS•m -1 ) of agricultural land.Remote Sens. 2016, 8, 714 13 of 18 (c) The alkalinity class of agricultural land.(d) Soil salinity classes of agricultural land.

Figure 6 .
Figure 6.pH and EC maps (a,b) estimated by the optimal ANN models applied to the agricultural land of study area, and their respective classes (c,d).

Figure 6 .
Figure 6.pH and EC maps (a,b) estimated by the optimal ANN models applied to the agricultural land of study area, and their respective classes (c,d).

Figure 7 .
Figure 7.The bubble diagram of EC with the related EMs and LST.

. 3 Figure 7 .
Figure 7.The bubble diagram of EC with the related EMs and LST.(a) The bubble diagram of EC with (LST, GV) pairs; (b) The bubble diagram of EC with (SL, LST) pairs under their GV below 0.3.

Table 1 .
pH and electrical conductivity (EC) descriptive statistics of the three subsets in the artificial neural networks.
1Std. is representative of standard deviation.

Table 3 .
Correlation coefficients of each pairwise variable at 38 sites.

Table 3 .
Correlation coefficients of each pairwise variable at 38 sites.

Table 4 .
Performance statistics, standardized coefficients, and bootstrap tests of PLSR calibration models.RPD: ratio of prediction to deviation.
pH Figure 4. Cross-validation results of PLSR.RMSEP: root mean square predicted residual error.

Table 4 .
Performance statistics, standardized coefficients, and bootstrap tests of PLSR calibration models.RPD: ratio of prediction to deviation.

Table 5 .
ANN model performances of the different predictor variable combinations for soil pH estimation.
N = neuron number in the ANN hidden layer, ALL = all the predictor variables (SL, SA, GV, DA, SAH, and LST).

Table 6 .
ANN model performances of the different predictor variable combinations for soil EC estimation.
N = neuron number in the ANN hidden layer, ALL = all the predictor variables (SL, SA, GV, DA, SAH, and LST)."-" means the elimination of one or some predictor variables from ALL. R 2 and RMSE of the optimal hidden-layer neuron number for each predictor variables set are bolded.Remote Sens. 2016, 8, 714 11 of 18

Table 7 .
Optimal ANN model performances and parameters for soil pH and EC estimations.

Table 7 .
Optimal ANN model performances and parameters for soil pH and EC estimations.

Table 8 .
The concurrent area of agricultural soil salinity and alkalinity class in the root zone (ha).
(a) Predicted pH of agricultural land.