Quantitative Evaluation of Soil Quality Using Principal Component Analysis: The Case Study of El-Fayoum Depression Egypt

Abstract: Soil quality assessment is the first step towards precision farming and agricultural management. In the present study, a multivariate analysis and geographical information system (GIS) were used to assess and map a soil quality index (SQI) in El-Fayoum depression in the Western Desert of Egypt. For this purpose, a total of 36 geo-referenced representative soil samples (0–0.6 m) were collected and analyzed according to standardized protocols. Principal component analysis (PCA) was used to reduce the dataset into new variables, to avoid multi-collinearity, and to determine relative weights (Wi) and soil indicators (Si), which were used to obtain the soil quality index (SQI). The zones of soil quality were determined using principal component scores and cluster analysis of soil properties. A soil quality index map was generated using a geostatistical approach based on ordinary kriging (OK) interpolation. The results show that the soil data can be classified into three clusters: Cluster I represents about 13.89% of soil samples, Cluster II represents about 16.67% of samples, and Cluster III represents the rest of the soil data (69.44% of samples). In addition, the simulation results of cluster analysis using the Monte Carlo method show satisfactory results for all clusters. The SQI results reveal that the study area is classified into three zones: very good, good, and fair soil quality. The areas categorized as very good and good quality occupy about 14.48% and 50.77% of the total surface investigated, and fair soil quality (mainly due to salinity and low soil nutrients) constitutes about 34.75%. As a whole, the results indicate that the joint use of PCA and GIS allows for an accurate and effective assessment of the SQI.


Introduction
Precision agriculture is based on the use of a set of techniques and technologies devised to assess the spatial variability of soil and plant properties to facilitate and optimize soil management, which often requires the use of several variables to support decision-making [1,2]. However, in some cases, numerous soil variables are required to assess soil quality. Because some of these variables can be redundant, the ability to identify key parameters/variables can reduce both the time and costs of in situ and laboratory analyses and optimize models and procedures for spatio-temporal soil assessment [3]. In this context, principal component analysis (PCA) is recognized as one of the most widely used methods for reducing the number of variables by identifying those that are most [...]
1. Removes correlated features that undermine the statistical significance of an independent variable [29];
2. Improves algorithm performance, which can be significantly degraded if too many features are present in models, and speeds up analyses [30];
3. Reduces overfitting: PCA helps in overcoming the overfitting issue by minimizing the number of variables in the investigated dataset [31];
4. Improves visualization: PCA transforms a high-dimensional dataset into a low-dimensional one while preserving the information content and making data visualization and exploration easier [32,33].
Sustainability 2021, 13, 1824
The main aim of the present study is to assess, characterize, and map the SQI using a multivariate analysis based on the joint use of PCA and GIS in El-Fayoum depression in the Western Desert of Egypt.

Study Area
The study area is located in El-Fayoum Governorate, Western Desert of Egypt. It is bounded by latitudes 29°15′-29°35′ N and longitudes 30°32′30″-30°52′30.59″ E. The study area is characterized by an elevation that reaches 23 m above sea level. The area is connected to the Nile River by the Hawara canal through the Bahr Yousef canal (Figure 1). The physiographic units of El-Fayoum depression include three main landscapes, i.e., lacustrine plain, fluvio-lacustrine plain, and alluvial plain [34]. The main landforms in the area are recent and old lake terraces, depressions, plains, and basins [35] with varying vegetation cover; therefore, the sensitivity to desertification differs widely in the study area [36]. The climate of the study area is characterized by a hot and dry summer with limited winter rainfall and bright sunshine throughout the year. The area has low annual rainfall of around 7.2 mm/year, and the mean minimum and maximum annual temperatures are 14.5 and 31.0 °C, respectively. The lowest evaporation rate (1.9 mm/day) is recorded in January, while the highest value (7.3 mm/day) is recorded in June [34].

Sampling and Soil Analysis
The soil samples were collected using GPS and a soil cylinder auger (Figure 1) at depths of 0-60 cm in 36 different locations. One mixed sample representing the soil of the root zone was collected at each location. The selected sites represent spatial changes in the study area, which is characterized by wide variation of physiographic features, such as lacustrine plain, fluvio-lacustrine plain, and alluvial plain [34]. The area is characterized by slope levels ranging between −15 and 45 m above sea level, and the change in slope has directly affected vegetation density and land suitability [37]. The soil classifications in the study area include Vertic Torrifluvents, Typic Haplocalcids, Typic Torrifluvents, Typic Haplogypsids, Typic Haplosalids, Typic Torripsamments, and Typic Haplargids [34].
The samples were air-dried, ground, and passed through a 2 mm sieve to prepare them for physical and chemical analyses according to standardized protocols described in [38][39][40]. The soil reaction (pH) of a 1:2.5 soil-to-water suspension was measured using a glass electrode [39]. The soil electrical conductivity was assessed in saturated soil paste extract (ECe) [39]. The Walkley and Black method was used to determine the soil organic matter [38,40]. Available nitrogen was determined by distillation using the micro-Kjeldahl method [40]. Available phosphorus was determined colorimetrically using the ascorbic acid method [40]. Available potassium was extracted with 1 N NH4OAc at pH 7 and was measured using a flame photometer [40]. The exchangeable sodium percentage (ESP) was computed based on the mathematical equation described by van Reeuwijk [38]. The sodium acetate method was used to measure CEC [38]. Soil particle size analyses were performed according to the international pipette method, and, based on the percentages of sand, silt, and clay, the soil texture was determined using the international texture triangle [38].

Statistical Analysis
Descriptive statistics of the studied soil characteristics include the minimum, maximum, arithmetic mean, and standard deviation, which were computed using SPSS version 25. The Shapiro-Wilk test was used to assess the normal distribution of the data. The Pearson correlation coefficient (r) was used to examine the linear relationships between the variables. XLSTAT software 2016 and SPSS version 25 were used to conduct the principal component analysis (PCA). PCA was used to reduce the dataset into new variables, which are called principal components (PCs), as well as to avoid multicollinearity between the original variables. These PCs explain most of the variation present in the original variables.
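The linear-relationship step described above can be illustrated with a minimal sketch of the Pearson correlation coefficient. The study computed it in SPSS; the function name `pearson_r` and the sample values below are hypothetical and serve only to show the arithmetic.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squares around the means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical values for illustration only (not the study's measurements).
clay = [25.0, 31.0, 40.0, 48.0, 62.0]
cec = [8.0, 14.0, 19.0, 27.0, 38.0]
r = pearson_r(clay, cec)
print(f"r = {r:.3f}")
```

A significance test on r (as reported at p < 0.05 in the study) would additionally require the t distribution with n − 2 degrees of freedom, which is not reproduced here.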

Soil Quality Index (SQI) Calculation and Mapping
The SQI was calculated using Equation (1) according to Cude [41]:

$$\mathrm{SQI} = \sum_{i=1}^{n} W_i S_i \tag{1}$$

where $W_i$ is the relative weight of each indicator and has values ranging between 0 and 1, $S_i$ is the value of each soil indicator, and $n$ is the number of indicators.
$W_i$ expresses the component score coefficient (CSC) that is obtained from the PCA results. Because the soil indicators have different scales and units, the $S_i$ values are standardized using Equation (2) [42]:

$$z = \frac{x - \bar{x}}{\sigma} \tag{2}$$

where $z$, $x$, $\bar{x}$, and $\sigma$ refer to the standardized value, the value of a soil indicator, the average of a soil indicator, and the standard deviation of a soil indicator, respectively. The SQI based on principal components (PCs) is then obtained from Equation (3), and the comprehensive SQI (CSQI) is computed using Equation (4). The CSQI, which is calculated using z scores, is transformed into a standard normal distribution (which has a mean of zero and a standard deviation of one) using Equation (5) [42], where $e$ and $z$ refer to the base of the natural logarithm, equal to approximately 2.718, and the CSQI computed using z scores, respectively. Aprisal, Bambang, and Harianti [13] reported that the soil quality could be classified into the following conditions: very good (0.8-1), good (0.6-0.79), fair (0.35-0.59), bad (0.20-0.34), and very bad (0-0.19).
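As a minimal sketch of the steps above, the following code standardizes an indicator as z = (x − mean) / standard deviation, forms a weighted sum of indicator scores (SQI = Σ W_i S_i), and assigns the quality class reported by Aprisal, Bambang, and Harianti [13]. The function names, the use of the sample standard deviation, and treating each class's lower bound as inclusive are assumptions; the PC-based SQI, the CSQI, and the transformation of Equation (5) are not reproduced.

```python
import statistics


def z_scores(values):
    """Standardize values as z = (x - mean) / standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample SD; the choice of SD is an assumption
    return [(v - mean) / sd for v in values]


def weighted_index(weights, indicators):
    """Weighted sum of indicator scores: sum of W_i * S_i."""
    if len(weights) != len(indicators):
        raise ValueError("weights and indicators must have the same length")
    return sum(w * s for w, s in zip(weights, indicators))


def classify_sqi(sqi):
    """Map an SQI in [0, 1] to the classes listed in the text."""
    if not 0.0 <= sqi <= 1.0:
        raise ValueError("SQI is expected to lie between 0 and 1")
    # Lower bounds of each class, checked from best to worst.
    for lower_bound, label in [(0.80, "very good"), (0.60, "good"),
                               (0.35, "fair"), (0.20, "bad")]:
        if sqi >= lower_bound:
            return label
    return "very bad"


# Mean SQI values reported for the three clusters in the study.
for value in (0.88, 0.67, 0.37):
    print(value, classify_sqi(value))
```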

Cluster Analysis
From the PC scores of soil samples, a cluster analysis was performed using k-means to categorize the observations into groups [43][44][45]. This analysis was applied to classify the soils into specific zones according to their properties. A one-way ANOVA test and Duncan multiple range (DMR) test were performed for comparisons between the different soil zones that were generated.
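The k-means step can be sketched as follows, assuming Lloyd's algorithm with Euclidean distance and user-supplied starting centroids. The study's software settings are not stated, and the two-dimensional PC scores below are hypothetical.

```python
import math


def kmeans(points, initial_centroids, max_iter=100):
    """Plain k-means (Lloyd's algorithm) on tuples of PC scores."""
    centroids = [tuple(c) for c in initial_centroids]
    labels = [None] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centroids[k] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


# Hypothetical (PC1, PC2) scores for seven samples, for illustration only.
scores = [(2.9, 0.1), (3.1, -0.2), (0.9, 1.0), (1.1, 1.2),
          (-1.0, -0.5), (-1.2, -0.3), (-0.8, -0.6)]
labels, centroids = kmeans(scores, [scores[0], scores[2], scores[4]])
shares = [100 * labels.count(k) / len(labels) for k in range(3)]
print(labels, [round(s, 2) for s in shares])
```

The `shares` list corresponds to the percentage of samples per cluster, the quantity the study reports for Clusters I to III.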
The cluster analysis results were also simulated using the Monte Carlo approach, one of the most popular and widely used methods for simulation and probabilistic analyses based on the generation of a large number of random samples. This step was adopted to confirm the clusters obtained from the previous analyses [46,47].

Geostatistical Analyses
The geostatistical approach was adopted to predict the values of variables in unsampled locations using the ordinary kriging (OK) method. Semivariograms of the soil parameters were generated using the average squared differences among all pairs (Equation (6)) [48]:

$$\gamma(h) = \frac{1}{2N(h)} \sum_{i=1}^{N(h)} \left[ Z(x_i) - Z(x_i + h) \right]^2 \tag{6}$$

where $\gamma(h)$ is the semivariance of the distance interval $h$, $N(h)$ is the number of pairs of the lag interval, $Z(x_i)$ is the measured sample value at point $x_i$, and $Z(x_i + h)$ is the measured sample value at position $x_i + h$.
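A minimal sketch of the empirical semivariance, γ(h) = Σ [Z(x_i) − Z(x_i + h)]² / (2 N(h)), is given below. The lag tolerance, the function name, and the sample coordinates and values are assumptions made for illustration; geostatistical software typically also bins by direction.

```python
import math


def empirical_semivariance(coords, values, lag, tolerance):
    """Semivariance for all pairs whose separation is within lag +/- tolerance."""
    squared_diffs = []
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            distance = math.dist(coords[i], coords[j])
            if abs(distance - lag) <= tolerance:
                squared_diffs.append((values[i] - values[j]) ** 2)
    if not squared_diffs:
        return None  # no pairs fall in this lag class
    # Half of the average squared difference over the N(h) pairs.
    return sum(squared_diffs) / (2 * len(squared_diffs))


# Hypothetical transect with unit spacing (not the study's data).
coords = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
values = [1.0, 2.0, 4.0, 7.0]
gamma_1 = empirical_semivariance(coords, values, lag=1.0, tolerance=0.1)
gamma_2 = empirical_semivariance(coords, values, lag=2.0, tolerance=0.1)
print(gamma_1, gamma_2)
```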
The best semivariogram models were selected based on the spatial dependence class (SDC), mean error (ME), root-mean-square error (RMSE), mean standardized error (MSE), root-mean-square standardized error (RMSSE), and average standard error (ASE). If the values of ME, MSE, and ASE are close to zero and the RMSSE is close to one, this indicates that the quality and suitability of the predicting model are high [49]. In addition, ratios of nugget to sill (SDC) of <0.25, 0.25-0.75, and >0.75 indicate strong, moderate, and weak spatial dependence, respectively [50].
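The nugget-to-sill rule quoted above can be written as a short helper. This is a sketch: the function name is hypothetical, `sill` is assumed to be the total sill (nugget plus partial sill), and the nugget and sill values in the example are illustrative.

```python
def spatial_dependence(nugget, sill):
    """Return the nugget-to-sill ratio and its spatial dependence class."""
    if sill <= 0:
        raise ValueError("sill must be positive")
    ratio = nugget / sill
    if ratio < 0.25:
        return ratio, "strong"
    if ratio <= 0.75:
        return ratio, "moderate"
    return ratio, "weak"


# Illustrative nugget and sill values (not taken from Table 7).
print(spatial_dependence(0.1, 1.0))
print(spatial_dependence(0.5, 1.0))
print(spatial_dependence(0.9, 1.0))
```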
A spatial distribution map of the soil quality index was generated using ordinary kriging interpolation in ArcGIS software version 10.2, where the kriging method was applied to predict the values of variables in unsampled locations and to interpolate the spatial soil properties using Equation (7) [51]:

$$Z^{*}(x_o) = \sum_{i=1}^{N} \lambda_i Z(X_i) \tag{7}$$

where $Z^{*}(x_o)$ is the estimated variable at location $x_o$, $Z(X_i)$ is the value of the inspected variable at location $X_i$, $\lambda_i$ is the statistical weight attributed to $Z(X_i)$ for a sample located near $x_o$, and $N$ is the number of observations in the neighborhood of the inspected point. The flowchart of the procedures used to determine the soil quality index in this study is shown in Figure 2.
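To illustrate how the kriging weights λ_i in the estimator Z*(x_o) = Σ λ_i Z(X_i) can be obtained, the sketch below solves the ordinary kriging system (weights constrained to sum to one through a Lagrange multiplier) for an exponential semivariogram. It is not the ArcGIS implementation; the variogram parameters, the function names, and the four sample points are assumptions chosen for illustration.

```python
import math


def exponential_variogram(h, nugget, partial_sill, range_param):
    """Exponential semivariogram model; gamma(0) is zero by definition."""
    if h == 0:
        return 0.0
    return nugget + partial_sill * (1.0 - math.exp(-h / range_param))


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular kriging system")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x


def ordinary_kriging(coords, values, target, variogram):
    """Return the kriging estimate at target and the weights lambda_i."""
    n = len(coords)
    # Rows 0..n-1: sum_j lambda_j * gamma(x_i, x_j) + mu = gamma(x_i, target).
    a = [[variogram(math.dist(coords[i], coords[j])) for j in range(n)] + [1.0]
         for i in range(n)]
    a.append([1.0] * n + [0.0])  # unbiasedness constraint: weights sum to one
    b = [variogram(math.dist(c, target)) for c in coords] + [1.0]
    weights = solve_linear_system(a, b)[:n]
    return sum(w * z for w, z in zip(weights, values)), weights


def variogram(h):
    # Hypothetical parameters: no nugget, unit partial sill, unit range.
    return exponential_variogram(h, 0.0, 1.0, 1.0)


# Hypothetical SQI values at the corners of a unit square.
coords = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
values = [0.4, 0.6, 0.7, 0.9]
estimate, weights = ordinary_kriging(coords, values, (0.5, 0.5), variogram)
print(round(estimate, 3), [round(w, 3) for w in weights])
```

With a zero nugget, the estimator is exact at sampled locations, which offers a simple sanity check on the implementation.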

Soil Characteristics of the Study Area
The soil characteristics of the study area are listed in Table 1. In particular, the pH values range from 7.09 to 8.65, with an average value of 7.86 ± 0.47, which indicates that the conditions of the study area are mildly to strongly alkaline [40]. The results indicate that the study area is characterized by moderate to high salinity soils, with ECe values varying from 0.87 to 20.33 dS m⁻¹ and an average value of 5.30 ± 5.05 dS m⁻¹ [52]. The CEC of the study area varies within a wide range, between 3.45 and 40.23 cmolc kg⁻¹ soil, with an average of 20.62 ± 8.79 cmolc kg⁻¹ soil.
The ESP values range from 1.86 to 17.13, with an average of 9.75 ± 3.67, which indicates that the area is not exposed to sodicity hazards [53]. The OM contents range from low to high in the study area, in agreement with [40], with an average of 0.69 ± 0.46%. The available N ranges between 1.33 mg kg⁻¹ (2.98 kg N ha⁻¹) and 61.6 mg kg⁻¹ (138 kg N ha⁻¹), with an average of 19.91 ± 17.42 mg kg⁻¹ (44.6 ± 39 kg N ha⁻¹), indicating that the nitrogen content in the area is low [40]. The available P content ranges from low (2.33 mg kg⁻¹; 12.0 kg P ha⁻¹) to high (19.84 mg kg⁻¹; 101 kg P ha⁻¹), with an average of 9.50 ± 4.51 mg kg⁻¹ (48.7 kg P ha⁻¹), and available K ranges from low (32.76 mg kg⁻¹; 88.1 kg K ha⁻¹) to high (734 mg kg⁻¹; 1972 kg K ha⁻¹), with an average of 183.5 ± 193 mg kg⁻¹ (493 ± 519 kg K ha⁻¹), which is classified as high according to [40]. Regarding soil texture, the proportions of silt, clay, and sand vary from 8.19 to 44.76%, 24.98 to 62.09%, and 12.98 to 55.95%, respectively.

Pearson Correlation Matrix, Bartlett's Test, and Kaiser-Meyer-Olkin (KMO) Test
The correlations between soil indicators are listed in Table 2. The soil pH has a statistically significant negative relationship (p < 0.05) with all other soil indicators except for silt content (which exhibits a significant positive relationship). Soil EC has significant positive correlations (p < 0.05) with N (r = 0.59), P (r = 0.43), ESP (r = 0.55), and clay (r = 0.35), while its correlations with K (r = 0.30), CEC (r = 0.23), and organic matter (r = 0.26) are positive but not significant. The results show that ECe has a significant negative relationship with pH and silt (p < 0.05). The soil organic matter has a significant positive correlation (p < 0.05) with CEC, available N, available P, available K, and clay, while it has a non-significant positive correlation (r = 0.24) with ESP and a non-significant negative correlation (r = −0.06) with silt. CEC is significantly positively correlated (p < 0.05) with available N, P, K, ESP, and clay, while it has a non-significant positive correlation with silt. Available N is significantly positively correlated (p < 0.05) with available P, available K, ESP, and clay, and it has a significant positive correlation with silt. Available P is significantly positively correlated (p < 0.05) with available K, ESP, and clay and negatively correlated with silt. Available K has a significant positive correlation (p < 0.05) with clay, while it has a non-significant positive correlation with ESP and a non-significant negative correlation with silt.
Soil pH affects other soil variables and controls the soil physical, chemical, and biological properties [54,55]; thus, pH demonstrates significant correlations with other properties. The negative correlation between EC and pH is largely dependent on the leaching process of major cations (Ca, Mg, Na, and K), as the reduction of these cations increases the pH and decreases EC and ESP.
This process is also accompanied by increased mineralization and dissociation processes of organic matter, which explains the negative relation [56]. The increased decomposition of OM at low pH values leads to increases in H+ ion content, soil CEC, and the availability of macronutrients (N, P, and K) [54,57]. Additionally, higher soil pH leads to increases in the mineralizable fractions of N and C ratios, where the bonds between clays and organic constituents are broken [58]. Clay content is associated with an increase in CEC and basic alkali cation adsorption, which is negatively correlated with pH [57]. There are positive correlations between soil EC, macronutrients (N, P, and K), base cations, and clay content; these results agree with those in [59]. Increased OM has a positive correlation with clay content, which leads to increases in CEC and N, P, and K contents; in addition, OM improves soil physical and chemical properties [60,61]. Negative correlations were identified between clay and silt. The reverse effects exerted by clay and silt on other soil properties mainly depend on the ratio in which they contribute to soil particle size distribution because the surface area and CEC of clay are greater than those of silt [57].
Table 3 shows the results of Bartlett's test of sphericity and the KMO test of sampling adequacy. The significance level of Bartlett's test of sphericity was <0.0001, and the observed chi-square value was 334.63, which is larger than the critical chi-square value of 61.66; therefore, the variables are not completely uncorrelated, and PCA is appropriate for the dataset [62]. The results show that the KMO value is greater than 0.6, which indicates that the sample size is suitable for assessing the factor structure, in agreement with Barrett and Morgan [63].
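For reference, the statistic behind Bartlett's test of sphericity can be written in its standard textbook form, where n is the number of samples, p is the number of variables, and |R| is the determinant of the correlation matrix. The value p = 10 used in the degrees-of-freedom example is an assumption, since the number of indicators entering the test is not stated in the text; under that assumption, the tabulated chi-square quantile at α = 0.05 is about 61.66, which is consistent with the critical value reported above.

```latex
\chi^2 = -\left[(n - 1) - \frac{2p + 5}{6}\right] \ln\lvert R \rvert,
\qquad
\mathrm{df} = \frac{p(p - 1)}{2}
% Assumed example: p = 10 gives df = 10 \cdot 9 / 2 = 45.
```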
According to the results of these tests, the variables are not completely uncorrelated; the variables included in the model can explain the phenomenon, and a principal component analysis is suitable [64][65][66].
The results of PCA are summarized in Table 4. The first three principal components (PCs) have eigenvalues greater than 1; therefore, these PCs were retained according to the method described by Kaiser [67], while the other PCs were excluded (Table 4 and Figure 3). The results show that the first three PCs explain 83.63% of the total variance. According to the factor loadings, the first PC, which explains 56.45% of the total variance, has high positive correlations with EC, OM, CEC, available N, P, and K, ESP, and clay, while the second PC, which explains 16.76% of the total variance, is strongly correlated with silt. The third PC explains 10.41% of the total variance and is correlated with ESP. The PCA biplot in Figure 4 shows both the PC scores of samples and the loadings of variables.
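The Kaiser retention rule described above (keep PCs with eigenvalues greater than 1) and the cumulative explained variance can be sketched as follows. The eigenvalues are hypothetical and were chosen only to resemble the pattern reported in Table 4; the function name is likewise an assumption.

```python
def retain_by_kaiser(eigenvalues):
    """Keep components with eigenvalue > 1 and report their variance shares."""
    total = sum(eigenvalues)
    retained = [
        (index + 1, value, 100.0 * value / total)  # (PC number, eigenvalue, % variance)
        for index, value in enumerate(eigenvalues)
        if value > 1.0
    ]
    cumulative = sum(share for _, _, share in retained)
    return retained, cumulative


# Hypothetical eigenvalues of a correlation matrix (not the values in Table 4).
eigenvalues = [5.6, 1.7, 1.1, 0.6, 0.4, 0.3, 0.15, 0.1, 0.04, 0.01]
retained, cumulative = retain_by_kaiser(eigenvalues)
for number, value, share in retained:
    print(f"PC{number}: eigenvalue = {value:.2f}, variance = {share:.2f}%")
print(f"cumulative variance = {cumulative:.2f}%")
```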
The soil quality index was generated from the results of PCA using the PC-based SQI equation, and the CSQI was then computed using Equation (4). The CSQI, which was computed using z scores, was transformed into a standard normal distribution using Equation (5). The results of CSQI are presented in Table 5 and Figure 5. The results reveal highly significant correlations between the different soil indicators and SQI.
Table 5 notes: (1) calculated according to standardized z scores; (2) the CSQI, which was computed using standardized z scores, was transformed into a standard normal distribution (which has a mean of zero and a standard deviation of one) using Equation (5).
Figure 4. PCA biplot (showing both PC scores of samples and loadings of variables).

Cluster Analysis (k-Means Clustering)
Clustering is an effective statistical approach to data analysis that can be used to classify a large number of variables into specific groups. Each group represents a specific class of soil quality. According to the PC scores of samples, the data were divided into three clusters (Table 6). Cluster I occupies about 13.89% of the total data, Cluster II occupies about 16.67%, and Cluster III occupies the rest of the data, which represents about 69.44%. The results of ANOVA show that a statistically significant difference exists between different clusters, mainly in the SQI.

Simulation of Cluster Analysis
The cluster analysis results were confirmed using Monte Carlo simulations based on 200 random values of the SQI for each of the three clusters (first, second, and third). Figure 6 shows the normal probability distribution, where the p value of the Anderson-Darling normality test is >0.05. The SQI simulation results are acceptable, with standard deviations of 0.03, 0.07, and 0.10 and mean values of 0.88, 0.67, and 0.37 for the first, second, and third cluster, respectively. The coefficient of variation (CV) was used to assure the quality of the cluster analysis [47]; the resulting CVs are 3.18%, 9.55%, and 26.25% of the mean values of the first, second, and third cluster, respectively. Additionally, the mean values are close to the median values of 0.88, 0.68, and 0.38 for the first, second, and third cluster, respectively. Therefore, the mean values of the obtained SQIs are representative of the most probable SQI values of this study area.
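A Monte Carlo check of this kind can be sketched as below, assuming that the 200 values per cluster are drawn from a normal distribution with the reported mean and standard deviation. The study does not describe its sampling procedure in detail, so the distribution, the seeds, and the function name are assumptions, and the simulated CVs will differ somewhat from the reported ones.

```python
import random
import statistics


def simulate_cluster_sqi(mean, sd, n=200, seed=0):
    """Draw n normally distributed SQI values and summarize them."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    draws = [rng.gauss(mean, sd) for _ in range(n)]
    sim_mean = statistics.fmean(draws)
    sim_sd = statistics.stdev(draws)
    return {
        "mean": sim_mean,
        "median": statistics.median(draws),
        "sd": sim_sd,
        "cv_percent": 100.0 * sim_sd / sim_mean,  # CV = SD / mean x 100
    }


# Means and standard deviations reported for the three clusters.
clusters = {"I": (0.88, 0.03), "II": (0.67, 0.07), "III": (0.37, 0.10)}
summary = {
    name: simulate_cluster_sqi(mean, sd, seed=index)
    for index, (name, (mean, sd)) in enumerate(clusters.items())
}
for name, stats in summary.items():
    print(name, {key: round(value, 3) for key, value in stats.items()})
```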

Mapping Soil Properties
The ordinary kriging interpolation method was used to estimate and map the unknown values of soil properties. The model's accuracy was confirmed for each soil property based on ME, RMSE, MSE, and RMSSE, as shown in Table 7. The results show that the exponential model is the most suitable for predicting the unknown values of most soil properties (CEC, ESP, Av.N, Av.P, clay, and silt), followed by the K-Bessel model for pH and OM. The tetraspherical model is the most suitable for ECe and Av.K. Finally, the spherical model is suitable for sand content. In addition, the results indicate that RMSSE is close to one and the MSE is close to zero for the selected soil properties; therefore, the selected models fit the data and are suitable for predicting the unsampled soil properties [68,69].
The results show that the spatial dependence (SD) is strong for all soil properties except for ESP and OM, for which SD is moderate and weak, respectively. A strong dependence may be attributable to natural factors, such as soil texture and terrain factors, while moderate and weak dependence may be due to other factors, such as inappropriate agricultural practices and agricultural management [70,71]. Figure 7 shows the spatial distribution maps of soil properties; pH varies from 7.06 to 9.28, and soil ECe ranges from low to high soil salinity (0.88-21 dS/m). This difference in soil salinity is a result of the activity of land degradation processes in the Fayoum depression, where inadequate drainage conditions increase salinity, which is also a common feature in the soils of the North Delta [72,73]. The results show that the study area has low soil OM contents, ranging between 0.4% and 1%. The results indicate that the area is poor in nutrient content, except for some spots in the north of the area that contain reasonable values of soil nutrients; the maximum values are 66, 18, and 860 for Av. N, P, and K, respectively (Figure 7).
Figure 7. Spatial distribution maps of soil properties affecting the SQI in the study area.

Mapping the Soil Quality Index
OK interpolation was used to interpolate the spatial variability of soil quality in the study area based on the results of CSQI, which was calculated using Equation (4). The results are shown in Table 5.
The SQI values range from 0.37 to 0.88. The SQI is classified into three quality zones according to Aprisal, Bambang, and Harianti [13], as shown in Figure 8 and Table 8. The soil is affected by its composition as well as the surrounding environmental and climatic conditions [74][75][76]. The first zone is characterized by a very good quality index and represents about 14.48% (70.52 × 10⁶ ha) of the total area. The soils of this zone are characterized by adequate values of all soil characteristics. The second zone is characterized by good soil quality: this class covers about 50.77% of the area (247.19 × 10⁶ ha). The third zone is fair (low quality) and covers about 34.75% (169.17 × 10⁶ ha). The soil pH is mildly alkaline in the first zone and strongly alkaline in the second and third zones. The status of available N and available P in the second and third zones is low and medium, respectively. Available K is classified as high in the second zone and medium in the third zone.
The organic matter, clay, EC, available N, available P, available K, and CEC are the most effective factors contributing to the SQI in the Fayoum depression [12,13]. The low values of these parameters lead to negative effects on the SQI [9]. The physical indicators (depth, bulk density, porosity, aggregate stability, texture, and compaction) affect the organization of the particles and pores, explaining their impacts on root growth, speed of plant emergence, and water infiltration [9].

Conclusions
The precise evaluation of soil quality is a very important issue for precision farming (in particular) and for the proper management of sustainable agricultural practices (in general). This evaluation facilitates the identification of the most suitable crops and the potential agricultural uses of the area. Soil quality is affected by agricultural practices and climatic conditions, which, in turn, affect the physical, chemical, and fertility properties of the soil. In this study, the nutrients and physical and chemical properties of the soil were used to assess the SQI in the El-Fayoum depression, in the Western Desert of Egypt. For this purpose, PCA was used jointly with GIS to capture, quantify, and map the soil quality index of the study area. The results showed that the first three PCs explained 83.63% of the total variance of soil data. In addition, the soil data were classified into three clusters: Cluster I represented about 13.89% of soil samples, Cluster II represented about 16.67%, and Cluster III represented the rest of the soil data, i.e., 69.44% of samples. The use of GIS to map soil properties immediately highlighted the changes and spatial variation in SQI from one place to another. The exponential model was the most suitable for predicting the unknown values of the majority of the soil properties (CEC, ESP, Av.N, Av.P, clay, and silt), followed by the K-Bessel model for pH and OM. The study area was classified into three zones based on the variation in CSQI values. These zones differed in both the number and type of the limiting factors that reduced the soil quality. In particular, zone 1 was characterized by significant improvement in the soil nutrients and chemical properties, whereas zones 2 and 3 were affected by a decrease in the soil's nutrient contents, in addition to an increase in soil salinity in zone 3.
The areas categorized as very good and good quality occupied about 14.48% and 50.77%, respectively, of the total surface investigated, and fair soil quality (mainly due to salinity and low soil nutrients) constituted about 34.75%. As a whole, the results reflect that the joint use of PCA and GIS allows for an accurate and effective assessment of the SQI.