Next Article in Journal
Developments in the Research Base on Reducing Exposure to Second-Hand Smoke
Next Article in Special Issue
Effectiveness of Best Management Practices to Reduce Phosphorus Loading to a Highly Eutrophic Lake
Previous Article in Journal
Changes in Urinary Hydrogen Peroxide and 8-Hydroxy-2′-Deoxyguanosine Levels after a Forest Walk: A Pilot Study
Previous Article in Special Issue
Correlation Analysis between Landscape Metrics and Water Quality under Multiple Scales
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Impacts of Land Use Patterns on Water Quality in a Trans-Boundary River Basin in Northeast China Based on Eco-Functional Regionalization

1
Beijing Key Laboratory of Water Resources & Environmental Engineering, China University of Geosciences (Beijing), Beijing 100083, China
2
Chinese Research Academy of Environmental Sciences, Beijing 100012, China
3
China National Environmental Monitoring Center, Beijing 100012, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2018, 15(9), 1872; https://doi.org/10.3390/ijerph15091872
Submission received: 11 July 2018 / Revised: 21 August 2018 / Accepted: 26 August 2018 / Published: 29 August 2018
(This article belongs to the Special Issue Non-Point Source Pollution and Environmental Assessment)

Abstract

:
The relationships between land use patterns and water quality in trans-boundary watersheds remain elusive due to the heterogeneous natural environment. We assess the impact of land use patterns on water quality at different eco-functional regions in the Songhua River basin during two hydrological seasons in 2016. The partial least square regression indicated that agricultural activities associated with most water quality pollutants in the region with a relative higher runoff depth and lower altitude. Intensive grazing had negative impacts on water quality in plain areas with low runoff depth. Forest was related negatively with degraded water quality in mountainous high flow region. Patch density and edge density had major impacts on water quality contaminants especially in mountainous high flow region; Contagion was related with non-point source pollutants in mountainous normal flow region; landscape shape index was an effective indicator for anions in some eco-regions in high flow season; Shannon’s diversity index contributed to degraded water quality in each eco-region, indicating the variation of landscape heterogeneity influenced water quality regardless of natural environment. The results provide a regional based approach of identifying the impact of land use patterns on water quality in order to improve water pollution control and land use management.

1. Introduction

Water quality integrates important geomorphic, hydrologic, and some of the biological processes of a watershed which make it one of the essential elements of a healthy watershed [1]. The deterioration of surface water quality is a considerable issue in river basin management throughout the world, which has become a serious threat to the chemical integrity of the aquatic environment. Surface water can be polluted by anthropogenic activities in two ways: (1) by point sources, such as sewage treatment discharge; and (2) by non-point sources such as overland runoff from urban and agricultural areas (buffer zones) [2]. Non-point source pollution is more difficult to verify than point sources due to the intricate and diffuse nature of the interactions between runoff and landscape [3]. Land use patterns have been deemed as a significant regulator of contaminants in surface flows and interflows, which make it a critical research topic for clarifying the correlations of surface water quality and non-point source pollutants [4]. Previous studies have reported about the relationship between water quality and the composition of land use types, such as cropland and urban, were related with stream pollutants positively, while forest and grasslands that were less influenced by anthropogenic activities had negative correlations [5,6]. The spatial configuration of landscapes in the watershed played an important part in identifying hydrological processes, natural habitats, energy flows, and nutrient cycles [7]. Thus, the variation of landscape is one of the main factors affecting non-point pollution. A large numbers of landscape metrics, for example the quantification of the landscape, have been developed to characterize landscape patterns and used for clarifying the linkages between the landscape and the water quality [7,8,9]. Previous research usually correlated surface water quality with either simple measures of land use types or landscape configuration in a watershed. However, either of them are not comprehensive enough to indicate the influences of land use management on water quality in a watershed. Therefore, conducting an analysis on the relationships between water quality indicators and both watershed land use types and landscape characteristics can produce a more comprehensive result. Generally, multivariate linear regression (MLR) has widely been used to quantify the relationships between land use patterns and water quality parameters, especially stepwise multiple regression [7,10,11]. However, classical regression approach presents several problems when analysing the relationships between land use/landscape metrics and water quality parameters. First, many land use types and landscape metrics are highly correlated, which produces redundancies and leads to inaccurate results. Additionally, sample size (the number of different study plots or individuals) should be larger than the number of predictors in order to assure the significance of the regression analysis [12]. Thus, the application of technique such as partial least-square regression (PLSR) can overcome the inherent limitations caused by classical multivariate regression when handling multi-collinear and noisy data [13]. PLSR is a method to analyse the response variable by using a set of independent variables having best predictive power [14]. The output of PLSR is a combination and generalization of the principal component analysis (PCA) technique and the multiple linear regression technique. PLSR can deal with variables that are highly collinear by explicitly assuming dependency among the variables and evaluating the underlying structures and is specifically suitable for cases in which the number of samples is less than the possible variables [15].
The Songhua River basin plays an important role in industry, agriculture and forestry. Intense anthropogenic activities such as agricultural production and urbanization have long been affecting the water quality of Songhua River. In order to obtain the goal of national water environment treatment, the regionalization of freshwater ecological functions has been carried out in the eight major watersheds in China including the Songhua River. The definition and method of water eco-functional regionalization was first proposed by Omernik [16]. In the past few decades, many countries have conducted their study on water eco-functional regionalization. Most states in the USA have already completed the IV level water eco-functional regionalization. In addition, EU has established the regionalization systems of different scales in the WFD, of which the macroscale eco-functional regionalization is based on the similar descriptions of typology, biology and ecology. China started to carry out the study on freshwater eco-functional regionalization during the Eleventh Five-year Plan with the support of National Major Science and Technology Program for Water Pollution Control and Treatment of China. It is of great importance to implement watershed management on the basis of eco-functional regionalization. Thus, the objectives of this study are as follows: (1) analyse the spatial and temporal distribution characteristics of the surface water quality in each eco-region; (2) quantify the relationships between land use/landscape characteristics and water quality parameters using PLSR in a sub-basin scale, and identify the main factors determining the water quality during normal flow and high-flow periods in each eco-region [17].

2. Materials and Methods

2.1. Study Area and Sampling Sites

The Songhua River Basin (41°42′~51°38′ N, 119°52′~129°31′ E) is located in the northeast of China and is one of China’s seven major river basins. This river basin occupies a large part of Heilongjiang Province, Jilin Province and the Northeastern part of Inner Mongolia Autonomous Region with an area of 55.7 × 104 km2. The Songhua River Basin is a trans-boundary watershed consists of three sub river basins: the Nenjiang River Basin in the North, the Second Songhua River Basin in the south, and the lower Songhua River Basin (the mainstream of Songhua River) in the northeast. Trans-boundary watershed is a watershed that crosses at least one political border, either border within a nation or an international boundary. The Nenjiang River originates on Yihehuli Mountain in the Great Khingan Mountains while the Second Songhua River originates at Tianchi Lake in the Changbai Mountains. Then, the two rivers travel all the way down and meet in Songyuan in Jilin Province. With a gentle slope and wide surface, the lower Songhua River carries the combined flow from Nenjiang and Second Songhua rivers, flowing northwestward 939 km before entering the Amur River.
The Songhua River Basin lies within a north temperate monsoon climate zone, and the temperature and rainfall varies significantly during the year, with the warmest month being July (20 °C~25 °C) and the coldest January (−20 °C). Annual precipitation averages 500 mm, showing a spatial tendency of higher in mountain area and lower in plain. 60~80% of the annual precipitation occurs from July to September, and only 5% occurs from December to February [18]. The Songhua River Basin is an important agricultural and industrial area in Northeast China. The dominant crops in the basin include soybean, corn, sorghum, and wheat. Industries mainly include petrochemical, machine manufacturing and paper making, distributed in the urban belt of main industrial cities like Harbin, Changchun, and Jilin along Songhua River [19].
In order to eliminate the impact of land use patterns on water quality covered up by heterogeneous natural environment elements, all 86 sampling sites were classified according to the level I water eco-functional regionalization of Songhua river basin. This regionalization grouped the 86 study sites into five different eco-regions: (1) mountainous normal flow (n = 10); (2) plain low-flow (n = 23); (3) hilly high flow (n = 32); (4) mountainous high flow (n = 12); (5) plain normal flow (n = 9). The five eco-regions were identified using a combination of climatic, hydrologic and topographic factors. The three factors referred to annual temperature, elevation and multi-average runoff depth respectively. Figure 1 shows the characteristics of the three factors. They were normalized and processed in ArcGis 10.2 (Esri, Redlands, CA, USA) by using the cell statistics and raster calculator in the spatial analysis toolset [20]. The main environmental characteristics of these five eco-regions were shown in Table 1.

2.2. Water Sampling and Analytical Methods

Water sampling was conducted monthly at 86 sites throughout the Songhua river basin in high-flow season (July), normal flow season (September) in 2016 and in icebound season (March) in 2017 (Figure 2).
Forty-three of those sample sites are part of the local government’s long term monitoring network (Table A1). Not all samples were collected during the icebound season as some of the streams were frozen to the bottom. The water samples at each sites were sampled in polyethylene bottles pre-rinsed three times with distilled water and kept below 4 °C for laboratory analysis. Three samples were collected at each sampling site and a total of 256 samples were collected. Twelve representative parameters were selected for measurement, which are important indicators of water contamination influenced by anthropogenic activities. The parameters included pH, electrical conductivity (EC, µs·cm−1), dissolved oxygen (DO, mg·L−1), chemical oxygen demand (COD, mg·L−1), permanganate index (CODMn, mg·L−1), ammonia nitrogen (NH3-N, mg·L−1), nitrate nitrogen (NO3-N, mg·L−1), total nitrogen (TN, mg·L−1), total phosphorus (TP, mg·L−1), fluoride (F, mg·L−1), chloride (Cl, mg·L−1), and sulfate ( SO 4 2 , mg·L−1). The values of pH, DO and EC were directly measured in situ with a multi-parameter water quality monitoring instrument (Thermo Fisher Scientific, Waltham, MA, USA). The values of other parameters mentioned in the paper were analyzed in the laboratory following the national standard methods [21]. Additionally, water samples except for fluoride, chloride and sulfate analysis were acidified with sulfuric acid to adjust the pH < 2.

2.3. Land Use and Landscape Metrics

The stream network and sub-basin boundaries were extracted from a digital elevation model (DEM, 30 m × 30 m data) using the hydrology toolset in ArcGIS 10.2. As a result, there were a total of 144 sub-basins formed in the study area, 53 sub-catchments were selected to study the relationships between land use patterns and water quality. Land use patterns were acknowledged to have little change within 5 years. Thus we chose the land use data of Songhua river basin in 2015, which was provided by Data Centre for Resource and Environmental Sciences, Chinese Academy of Sciences (RESDC) (http://www.resdc.cn). The land use types were classified into six categories: (1) agricultural land, mostly planted with corn, rice and soybean; (2) forest; (3) vegetated land; (4) water bodies, including rivers, reservoirs and ponds; (5) urban areas, including residential, commercial and industrial lands; and (6) unused land, including gravel, bare ground, and bare rock (Figure 3). The six land use types were commonly used in previous studies and should all be considered to better understand the land use pattern and its relationship with water quality in the Songhua River Basin [5,7,22]. In order to investigate the relationships between landscape configurations and water quality, the landscape metrics in 53 sub-catchments representing the patch size, shape, structure, and landscape diversity were chosen at the landscape levels (Table 2). They have been commonly used in previous studies in dealing with land use patterns in explaining water quality. In addition, these landscape metrics are important in understanding the ecological functioning and human perception in a landscape [8,23,24]. FRAGSTATS 4.0 (University of Massachusetts: Amherst, MA, USA) was used to calculate the landscape metrics based on land use data.

2.4. Statistical Analysis

All the water quality data were tested for normality by eco-regions using the Shapiro-Wilk test, since the number of sampling sites in each eco-region was below 50. Parameters not normally distributed were log transformed to increase variable normality [25]. One-way analysis of variance (ANOVA) with the post hoc Tukey’s test was used to compare water quality variations between different eco-regions and seasons at significance level of p < 0.05. Boxplots of all sampling sites in the five eco-regions for the 12 water quality parameters were performed to study their seasonal and spatial variability. ArcGIS 10.2 was used to map the distributions of concentration of each water quality parameter by spatial interpolation in order to better understand the variation of water quality among sites. Besides water quality, one way ANOVA with the post hoc Turkey’s test was used to compare the variance of landscape metrics between different eco-regions. Welch’s Anova test was also done in case the heterogeneity of variance appears. Both ANOVA test were done at the significance level of p < 0.05. The test of normality of the landscape metrics was done using the Shapiro-Wilk test initially.
The partial least square regression (PLSR) was used to explore the relationships between land use patterns and water quality variations in different eco-regions and identify the key predictors for degraded water quality. This was carried out by converting explanatory variables onto orthogonal ‘latent’ components, which stand for independent variables in a regression. The calculated latent components in PLSR maximize the covariance between the response and explanatory variables through the simultaneous decomposition of X and Y matrices of vectors. The PLSR models were performed in SIMCA-P [26]. Cross-validation was the criterion used to determine the minimum number of latent components needed to acquire the most predictive PLSR model.
Within SIMCA-P, Q2 (the fraction of the total variation of the dependent variables that can be predicted by a component) stands for the cross-validation of components, when Q2 is larger than 0.5, the model is expected to introduce good predictive ability, while indicating no significance when Q2 is smaller than 0.05. Q2 was computed using the equation below:
Q 2 = 1.0 P R E S S   / S S
where PRESS is the abbreviation for the predicted residual sum of squares, and SS stands for residual sum of squares.
In the PLSR modelling, the variable importance in projection (VIP) is a criterion of estimating which independent variables can elucidate the dependent variables most significantly. VIP is calculated by the following equation:
V I P j = { p h = 1   m k R 2 ( y k , t h ) w h j 2 / h = 1 m k R 2 ( y k , t h ) } 1 / 2
In Equation (2), p is the number of independent variables, m represents the number of components extracted from independent variables, k stands for the number of dependent variables, th represents the components of independent variables, R2(yk,th) represents the square of regression coefficients of yk and th, w h j 2 is the weight of independent variables contributing to component th.
It is generally recognized that the independent variables with VIP values above 1 are of great significance for dependent variables; variable with VIP values below 0.8 are of minor importance; it is of medium significance when VIP values are between 0.8 and 1. The regression coefficients indicate the direction and strength of the impact of each variable in the PLSR model [14]. In order to avoid over fitting which leads to low statistical significance PLSR models for each response factors, not all anthropogenic factors must be included in a PLSR model. Therefore, the following PLSR analysis procedure was followed to obtain a most predictive model. Firstly, for each given water quality parameter, all predictor variables were included in the model. Next, a series of new PLSR models were conducted in which each new PLSR process was implemented with a variable excluded in order to minimize the value difference between the explained variation in the response (R2) and the predictive ability of the model (Q2) This procedure was repeated until as few predictors were remained [27]. Finally, the PLSR model with the largest Q2 was chosen as the optimal model. A test of collinearity of the explanatory variables was done in prior of the application of PLSR method (Table A2).

3. Results

3.1. Characteristics of Water Quality

Spatial and seasonal variations of water quality parameters in five eco-regions were illustrated by box-plots (Figure 4). Interpolation maps were illustrated by ArcGis 10.2 in order to help better understanding the spatial variations of water contaminants (Figure A1). According to the one-way ANOVA, all variables showed significant spatial differences among the five eco-regions (p < 0.05). Most of the variables showed significant temporal differences between different seasons. Large pH values often occurred in high-flow period, except in Zone 1 where high pH was found in normal flow period. EC was found higher in icebound season in every eco-region, however there was no significant difference between high-flow and normal flow period. The low concentrations of DO were mostly observed in ice bound season in every eco region, except in Zone 4. The concentrations of COD showed obvious temporal differences only in Zone 2 and Zone 5, large values were found in high-flow period in Zone 2, while higher concentrations were observed in Zone 5 in normal flow periods. The concentrations of CODMn were mostly found higher in high-flow periods, whereas low concentrations were found in the ice bound season, and only Zone 5 presents large values in the mean flow period. The concentrations of NH3-N were higher in ice bound season in every eco-region except in Zone 1. NO3-N had large values in high-flow periods while it presented low values in normal flow periods. The concentrations of TN were higher in the ice-bound season in every eco-region except for Zone 1. The values of F didn’t show obvious temporal variations, while the concentrations of Cl and SO 4 2 varied in different periods among the five eco-regions.

3.2. Land Use and Landscape Characteristics at Sub-Basin Scale

Figure 5 showed the distribution of land use for each sampling sites at sub-basin scale in 2015. Arable land and forest were the dominant land use in Zone 1, ranging from 15.26% to 65.60% and 6.61% to 62.78% respectively. A relatively less proportion of urban land use was occurred in Zone 1, except for S7 which occupied 7.52% of the sub-basin. Arable land was the dominant land use in Zone 2, ranging from 14.13% to 86.28%. Zone 2 had a relatively larger proportion of urban land use with an average of 4.06% and a maximum of 9.68%. However, the ratio of forest area was relatively lower than other eco-regions with a maximum of 59.52%, while others were lower than 10%. Therefore Zone 2 was obviously disturbed by anthropogenic activities. Arable land was the dominant land use in Zone 3, ranging from 2.76% to 83.46%, with an average of 50.62%. Likewise, a significant amount of forest was discovered in some of the sub-basins, ranging from 1.41% to 93.86% with an average of 38.96%. The proportion of urban land use was the highest among the five zones, with an average of 5.16% and a maximum of 15.26%. A large proportion of forest was observed in Zone 4, with an average of 64.94% and a maximum of 90.46%. The land use ratio of arable land ranked only second to forest, with a maximum of 29.05%. The proportion of urban land use was lower than other eco-region with an average of 2.44%. The dominant land use in Zone 5 was arable land, ranging from 17.53% to 63.02%, with an average of 54.95%. The ratio of forest in most of the sub-basins was lower than 30%, with an average of 30%, the highest was 68.55% in S79. The average proportion of urban land use was 3.08%.
The dominant land use types in Songhua river basin were arable land and forest. It was observed that sub-basins with a larger ratio of urban land had a lesser ratio of grassland and forest but a larger ratio of arable land. This observation might indicate that urbanization on one hand had decreased the amount of forest and grassland to support more living residents in the city, on the other hand retained or even increased the amount of arable land for more food was needed.
According to the one-way ANOVA, most of the landscape metrics except for LSI and AI were showed significant variations among the five eco-region (p < 0.05) (Table 3). The highest value of PD (0.17/100 ha), LPI (78.77%), CONTAG (56.59%) and AI (81.29%) and the lowest value of PD (0.04/100 ha), SHDI (0.84), SHEI (0.32) and ED (3.63 m/ha) were recorded in Zone 3. The highest value of ED (10.76 m/ha), LSI (27.22), IJI (73.84%) and SHDI (2.04) and the lowest value of LPI (10.61%), LSI (2.76) and CONTAG (34.85%) were recorded in Zone 2. The highest value of SHEI (0.85) and the lowest values of IJI (41.82%) were recorded in Zone1. The lowest value of AI (0.52%) was observed in Zone 4.

3.3. Linkages between Water Quality Parameters and Land Use, Landscape Metrics in Each Eco-Regions

The PLSR approach was applied to quantify the relationship between water quality and land use/landscape metrics for each eco-region mainly in the high flow and normal flow seasons. The ice-bound season was not included due to the little surface runoff. The summary of each optimal PLSR models were provided in Table 4, including the R2 and Q2 of each model as well as the number of components extracted in each model was to reach the minimum difference between R2 and Q2 and a larger Q2. The value of Q2 should be larger than 0.5 to make the model predictive and significant. As Table 4 shows, most of the optimal models for water quality parameters extracted two components in both seasons, whereas certain models for water quality parameters extracted three components. In addition, individual models for water quality parameters only extracted one component in some eco-regions which leaded to low values of Q2. As we had observed, in Zone 1, Zone 2 and Zone 3, the optimal model for SO 4 2 in high flow seasons and the models for Cl in Zone 4 in both seasons, as well as the model for Cl in high flow season extracted only one component. This indicate that increasing the number of components to the PLSR models cannot continually improve the explained variance and leads to lower predictive ability (i.e., larger gap between model R2 and Q2 values), the subsequent components are not strongly correlated with the residuals of the predicted variable [12]. Therefore, the models for SO 4 2 in Zone 1, Zone 2 and Zone 3 in high-flow season and the models for Cl in Zone 4 in both seasons and in Zone 5 in high-flow season were of low significance and predictive power.
The regression coefficient (RC) and the variable importance for the projection (VIP) are intuitive and comprehensive expressions of the relative importance of the variables when indicating how important land use types and landscape metrics are to the specific water quality parameters. Table 5 and Table 6 illustrated the key variables (VIP > 1) of each optimal model with their regression coefficients (RCs) in high-flow and normal flow seasons respectively. Table A3 and Table A4 presented all the regression coefficients of all the explanatory variables in each optimal model. The PLSR weights can be used to better understand the quantitative relation between the explanatory variables and response, because they are linear combinations of the original variables that define the scores. It is another important symbol to indicate the importance of individual land use/landscape metrics to water quality parameters. The weight plots illustrated the key predictors (VIP > 1) of each optimal model and highlighted the predictors with the highest weights in each model (Figure 6).
In Zone 1, water quality variables were influenced mostly by landscape metrics during high-flow and normal flow seasons. During high-flow season, only pH, EC, DO and SO 4 2 were most influenced by land use metrics, of which the key variables with highest VIP values were WA UN, UR and AR respectively. The key variables with the highest VIP values for other water quality contaminants were mostly landscape metrics. ED was the most important predictor for COD, CODMn and TP, while SHDI contributed the most to NO3-N, TN and Cl. NH3-N appeared to be influenced the most by SHEI, whereas AI was the most predictive variables of F. During the normal flow season, the most important predictors for pH and EC were the same as the high-flow season, while DO was most influenced by FO instead of UR. The key variables with the highest VIP values for NH3-N and NO3-N were UR and FO respectively. ED was the most important predictor for CODMn and TP, while CONTAG contributed the most to COD. LPI, SHEI and IJI were the most important predictors for TN, F and Cl respectively.
In Zone 2, land use metrics predominated in the key variables with the highest VIP values in optimal models. During high-flow season, GR was the most important variables of the variations of DO, CODMn, NO3-N, F and Cl, while IJI contributed the most to COD and NH3-N. FO, UR, AR and UN were the most important predictors for pH, EC, TN and TP. During normal flow season, pH, DO and CODMn were all most influenced by GR, whereas UR contributed most to EC, COD, NO3-N and Cl. Landscape metrics such as IJI, LSI and LPI were observed as the most important indicators of the models (VIP > 1). Lower IJI contributed to higher NH3-N and TN; higher LPI dedicated to higher TP, while LSI was positively correlated with F, but negatively correlated with SO 4 2 .
In Zone 3, land use metrics contributed to more water quality variables than landscape metrics. During high-flow season, all water quality parameters were most influenced by AR, except for pH, CODMn and NO3-N. FO was the most important predictors for pH and CODMn, while LPI contributed the most to NO3-N. During normal flow season, AI and PD were observed as the most important predictors for CODMn and SO 4 2 , whereas other water quality parameters were most influenced by land use metrics.
In Zone 4, during high-flow season, pH and SO 4 2 were most influenced by GR, whereas WA contributed the most to EC and COD. FO was observed the most important predictor for DO and CODMn, and was positively related to DO and CODMn. UR, AR and UN contributed most to NO3-N, F and Cl respectively. The rest of the water quality parameters were most influenced by ED, which was the only landscape metrics with the highest VIP values. During the normal flow season, landscape metrics such as SHEI, ED and PD contributed more to water quality parameters than that in high-flow season. SHEI was the most important variable for pH and COD; ED contributed more to NH3-N and TP; NO3-N and TN was impacted most by PD. Other water quality parameters were contributed most by land use metrics. The most important predictor for DO and CODMn was FO which was the same as high-flow season. UR contributed the most to EC and SO 4 2 , while F was most influenced by GR.
In Zone 5, during the high-flow season, AR was the most significant predictor for DO, NO3-N and TP; CODMn and NH3-N were most impacted by UR and UN respectively. The rest of the water quality parameters were influenced by landscape metrics in most cases. SHDI contributed most to COD and TN, while F and SO 4 2 were most impacted by LSI; LPI and SHEI was the most significant predictors for pH and EC respectively. During the normal flow season, F was most influenced by LSI, whereas pH and Cl were contributed most by UR. FO and UN were the most significant predictors for CODMN and SO 4 2 respectively. Other water quality parameters were most influenced by AR.
It should be noted that all the predictors in the final optimal model were to some extent correlated with the specific water quality parameter. However only predictors with VIP values above 1 were considered to be of major importance. Overall, it was observed that the key variables with highest VIP values of the optimal models were spatially and temporally different. In addition, landscape metrics contributed more in Zone 2, Zone 3 and Zone 4, while land use metrics dominated in the other two eco-regions.

4. Discussion

4.1. Key Land Use Types Predicting Water Quality

Many studies have reported that agricultural and urban land uses contribute to degrading water quality in adjacent aquatic systems, whereas vegetated areas such as grassland and forest have a positive contribution to water quality [6,28]. The results of this study were generally consistent with such previous findings. However not all water quality parameters in each eco-region showed a strong relationship with agricultural land use and urban land use.
In the high-flow season, although arable land was not always the variable with the highest VIP values in the optimal models of water quality indicators, it was predictive and associated with most water quality variables in each eco-region mainly in Zone 3 and Zone 5. Arable land was positively correlated with organic matter and some of the nutrients, while it correlated negatively with DO. This indicated that arable land actually served as a source for pollution in these eco-regions. Previous studies had noted that arable land had a positive impact on degraded water quality due to agricultural activities, such as fertilizer and pesticide application as well as livestock raising, which were often the major non-point source [4]. Forest was found to be closely related to most water quality variables than other land use types in Zone 4. It was identified that forest was positively correlated with DO and CODMn during both seasons, while negatively correlated with NH3-N, TN and TP. It was unexpected that higher forest land use contributed to higher CODMn, which to some extent contradicted previous research. Therefore, a possible speculation of this phenomenon is that the increase of forest would cause the accumulation of refractory organic matter derived from decaying plant material, afterwards flows into the surface water with overland runoff [29,30,31]. In addition, the terrain in Zone 4 is steeper than that in other eco-region which means that higher slope exists in this region. Previous study claimed that with an increasing slope, higher water flow rates would contribute to soil erosion and to the rates of particulate matter that picks up pollutants [32]. Although the dominant land use type in Zone 2 was arable land, the land use type contributed the most to the water quality in this region came out to be grassland. A negative relationship was identified between grassland and CODMn, NO3-N, TP and F, while DO was found positively correlated with grassland, which indicated the fixation and absorption effects of grassland for pollutants in overland runoff [5]. However the concentrations of non-point source pollutants, such as CODMn and TN in Zone 2 were relatively high among the 5 eco-regions and was appeared to be related positively with arable land. This suggested that effect of grassland in decontaminating overland runoff has been weaken, even in upstream of the sub-basin. The main leading might be the looseness of surface soil structure caused by intensive grazing, which had been discovered frequently throughout our field survey. In addition, Zone 2 was known as the black soil area, the rich humus in the soil contributed to the increase of CODMn and TN in surface water, especially in rainy season.
In the normal flow season, arable land was no longer the variable associating with the most water quality parameters in Songhua River Basin. Forest and urban land use had become the main predictors for water quality in some eco-regions such as Zone 1 and Zone 4. This phenomenon should be attributed to the decrease of precipitation, which reduced the volume of overland runoff generated by arable land. Another reason might be the first flush effects in storm events, which made surface runoff pollutants entering rivers more apparently in high flow season than that in other seasons [33]. Thus, fewer non-point pollutants was transported into surface water in normal flow season. However, forest was known to have the ability of intercepting the degraded water and filtering out nutrients. Thus forest land uses became one of the most influential factors towards the concentration of nutrients. Urban land use revealed its contribution to COD and NH3-N indicated that the water quality in Zone 1 was polluted by possible point sources mainly from domestic and industrial sewage since these pollution sources mostly distributed in built-up areas [5,7,32]. TN was the only water quality parameter that was dominated by arable land in Zone 2, while others were contributed mostly by grassland and urban land use. Indicating that point sources were more likely to be the major source of water contamination compared to that in high-flow season as urban areas was mostly covered with impervious surface and the drainage was continually routed to wastewater treatment plants and then discharged to local rivers as point sources [34].

4.2. Key Landscape Metrics Predicting Water Quality

Anthropogenic activities not only influence the composition of land use, but also change the landscape pattern. Previous studied had indicated that non-point source pollution loading, including soil erosion, sediment, and nutrient runoff, in rural watersheds was closely relevant to the landscape structure [35]. For example, a previous study had reported that landscape metrics consistently explained a 65% to 86% of the total variation in nutrients and suspended sediment to stream [36]. Landscape pattern metrics have been developed through the use of spatial tools, in order to attain the goal of quantification land use patterns and understand spatial heterogeneity and landscape structure [37].
In this study, degraded water quality was positively related with landscape fragmentation metrics such as PD and ED and was negatively related with LPI in most eco-regions. Previous studies have reported that PD and ED showed positive relationships with TN, TP, COD, and BOD concentrations in reservoirs in South Korea which was dominated by vegetated areas [7]. That was consistent with what it was found in this study, especially in Zone 1 and Zone 4, which are both mountainous regions with large proportions of forest. The value of PD and ED increased when growing numbers of small patched land cover types appeared in the watershed. The degree of forest fragmentation could be reflected by PD and ED [13]. Therefore, highly fragmented forest might not function efficiently to increase the permeation and decrease overland runoff and erosion from agricultural and urbanized areas, consequently let pollutants, sediments and nutrients flowed into the surface water without efficient interception.
The CONTAG metric reflects the level of aggregation of land use types. Generally, a low CONTAG metric value means that land use types are highly fragmented, while a high value of CONTAG represents an aggregated landscape [38]. In this study, CONTAG was observed to be negatively related with non-point pollutants in the mountainous normal flow region (Zone 1) in both high-flow and normal flow seasons. Previous research had reported that in streams non-point pollutants are mainly derived from soil erosion and sediment yield [13]. Thus the CONTAG metric might be a factor contributing to soil erosion and sediment yield, especially in mountainous areas. In addition, intensive soil erosion occurred in the eastern part of the mountainous normal flow region. Thus, land use management in this region should focus on the aggregation of land use types in order to reduce in-stream non-point pollutants.
The AI metric represents the degree of physical connectedness and aggregation of land use within watersheds, and it is higher when land uses are more clustered and aggregated. In this study, the AI metric contributed to more water quality parameters in the normal flow season, and a negative relationship was identified between AI and water quality in two seasons. Therefore, degraded water quality usually occurs in watersheds with scattered land uses and plentiful land use patches.
The IJI metric represents the degree to which patch types are interspersed (not necessarily dispersed); lower values represent landscapes with poorly interspersed patch types (i.e., uneven distribution of patch type adjacencies), whereas higher values characterize landscapes in which the patch types are well distributed (i.e., equally adjacent to each other). The IJI metric was a more effective indicator for water quality parameters in high-flow season, which was positively correlated with COD, nutrients, F and Cl, while negatively correlated with DO. This suggested that human activities might disperse patches and therefore increase the potential of contaminants flowing into the river.
The LSI metric is an alternative to patch shape indices based on the average patch characteristics, which measures the perimeter-to-area ratio for landscape as a whole [38]. The higher the index indicates the more complex of the patches in the landscape. In this study, the LSI metric was an effective indicator for water quality variables of F, Cl and SO 4 2 in high-flow season in Zone 1, Zone 4 and Zone 5, and pH, EC and NH3-N were found positively related to LSI in normal flow season in Zone 1 and Zone 5, besides F and SO 4 2 . This suggested that the shape of the patches in the landscape contributed more to anions concentration. Higher LSI indicates more edges presented in a landscape, which leads to negative impacts on water quality.
The SHDI and SHEI are diversity metrics influenced by richness and evenness respectively. Richness refers to the present patch type’s number; evenness refers to the distribution of area among different types [38]. Larger values of SHDI and SHEI imply greater landscape diversity. Although SHDI and SHEI were not always the predictors with the highest VIP, they were both positively correlated with degraded water quality in the two seasons. The SHDI metric indicated more water quality variables in each eco-region than SHEI, except in Zone 3 and Zone 4 which only SHEI was observed contributing to water quality variables. This suggested that the degradation of water quality is likely to occur when the number of land use types increased and when different land use types are distributed homogenously.

4.3. Importance of an Eco-Functional Regionalization Land Use Pattern Impacts

The results of this study demonstrated that the key predictors and seasonal variations of the optimal models of each eco-region were considerably different (Table 4, Table 5 and Table 6). It was reported in previous researches that the relationship between land use pattern and water quality to some extent was influenced by regional differences because the gradients of anthropogenic land use were frequently overlapped on an underlying gradient in primitive characteristics of natural terrain [39]. For instance, urban areas often located in flat land instead of mountainous region because the former is more likely to develop rapidly. This is consistent with the results in our study that a relatively high proportion of urban areas generally appears in the plain catchments. Allan emphasized that anthropogenic land uses often covary with natural landscape, a factor that can result in overestimation of land use influence on river water quality. In this study, the five eco-regions were classified based on an analysis involving climatic, hydrologic and topographic factors. The relationships between land use patterns and water quality were analyzed without the disturbance of natural terrain since each eco-region has the similar topographic features. The results showed that the significant predictors of water quality parameters in each eco-region were apparently different. In addition, previous study had proposed that landscape heterogeneity within a large river basin might bring in model errors which implied the influences of agricultural lands in hilly watersheds might be covered up by strong influence of urban areas in the plain watersheds [11]. This indicated the importance of regional basis when exploring the relationship between land use patterns and stream water quality. Statistical models should be conducted on a relatively homogenous region in accordance to natural terrain, climatic and hydrological condition in order to get a reliable result. In addition, it also meets the requirement of the Chinese government that the watershed environment should be administrated based on the eco-functional regionalization.

4.4. Model Performance and Limitations

Multicollinearity among various land use/landscape metrics is one of the obstacles when establishing precise relationship between land use patterns and stream water quality. PLSR is of great advantage in this study because it is particularly conducive when the number of predictive indicators is similar to or higher than the number of observations and/or there is strong collinearity among the predictors [12]. Additionally, the PLSR methodology to some extent excludes the confounding relationships among both the independent and dependent variables and encourages a more impartial view of the contribution of land use patterns to stream water pollutants [27]. Therefore, this approach could be applied to the investigation of water contamination in other watersheds.
The method used in this study has some limitations. First, the water quality data during the low-flow and snowmelt period was not collected. Some researchers have claimed that a low-flow period should also be involved in the analysis, as farming activities such as sowing and fertilization occur frequently during this period, which causes negative impacts on river water quality [4,5]. As for the snowmelt period, some researchers take it into account when studying the watersheds located in cold-climate regions, as nutrient export produced by snowmelt runoff may differ significantly from runoff during storm events [25,40]. Thus, it is desirable to involve in situ sampling in low-flow and snowmelt period when investigating the relationships between land use patterns and water quality in the Songhua River basin in order to better understand the temporal variability in the watershed located in cold climate region and with intense agricultural activities. Second, some of the models were of low predictive power within the five eco-regions. such as the models (Q2 < 0.5) of SO 4 2 in Zone 1, Zone 2 and Zone 3 in high-flow season and the models of Cl in Zone 4 in both seasons and in Zone 5 in high-flow season. This indicated that the variations of SO 4 2 and Cl might not be exclusively explained by land use and landscape metrics. This was consistent with some previous studies claiming that road density and geology characteristics could also contribute to the variation of ions besides land use patterns [10,41]. Therefore, further researches are inevitable to include other non-land use variables to build empirical models in order to obtain a higher predictive accuracy and confidence level.

5. Conclusions

The results of this study demonstrated that water contamination of the Songhua River basin in China presented high temporal and spatial variations within the five eco-regions. A partial least square regression (PLSR) approach was applied in exploring the relationships between land use patterns and specific water pollutants during both high-flow and normal flow seasons on a watershed scale. The results suggested that in different eco-regions, water quality parameters were influenced by different land use/landscape metrics. Arable land was observed a predictive variable contributing to degraded water quality mainly in hilly high flow region (Zone 3) and plain normal flow region (Zone 5), indicating that agricultural activities were the main factor contributing to degraded water quality in regions with a relative higher runoff depth and lower altitude. Grassland influenced most water quality parameters in the plain low flow region (Zone 2), suggesting the effect of intensive grazing in plain areas. Forest was observed to be negatively related to degraded water quality in mountainous high flow regions (Zone 4), indicating the importance of forest in mountainous areas. Landscape metrics were predictive variables contributing to degraded water quality in each eco-region during both hydrological seasons. The water quality degradation in both seasons were positively related to landscape metrics of patch density (PD), edge density (ED), landscape shape index (IJI), interspersion juxtaposition index (LSI), Shannon’s diversity index (SHDI) and Shannon’s evenness index (SHEI), and negatively associated with largest patch index (LPI), contagion (CONTAG) and aggregation index (AI). Additionally, PD and ED were observed to have major impacts on water quality contaminants, particularly in mountainous high flow region (Zone 4), which indicated the fragmentation of forest may not function effectively to decrease overland runoff and consequently let pollutants flow into the river; the landscape metrics of CONTAG was observed to be related to non-point pollutants in mountainous normal flow region (Zone 1) which suggested the correlations between CONTAG and soil erosion and sediment yield; LSI was found an effective indicator for anions in Zone 1, Zone 4 and Zone 5 mainly in high flow season; SHDI metric indicated more water quality variables in each eco-region than SHEI, except in Zone 3 and Zone 4 which only SHEI was observed to contribute to water quality variables. Indicating the variation of landscape heterogeneity contributed to the degradation of water quality in all eco-regions. The results of our study suggested that natural environmental factors such as topography, climate and hydrology may have some impacts on how land use patterns correlate with water quality. Thus, analyzing the relationships between land use patterns and water quality on an eco-functional regionalization basis is considerable. Therefore, further studies are needed to explore how the characteristics of eco-regions affect the casual relationship between land use patterns and water quality.

Author Contributions

P.C. and F.M. conceived and designed the research; P.C. and L.Z. performed the research; M.J. and P.C. contributed to data analysis of this work; P.C., F.M., Y.W., L.Z., Q.Y. and M.J. wrote the paper and approved the final version to be published.

Funding

This research was funded by National Major Science and Technology Program for Water Pollution Control and Treatment of China, grant number (2015ZX07201-008).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. The details of all the sampling sites with WGS 84 coordinates.
Table A1. The details of all the sampling sites with WGS 84 coordinates.
Number of Sampling SitesWGS 84 Coordinates of Sampling SitesLong Term Monitoring Sites
EN
S1123.7850.55Y
S2123.4447.96Y
S3122.8047.75Y
S4123.4648.09Y
S5124.7149.49Y
S6125.1849.18N
S7125.1049.12Y
S8124.7548.91Y
S9125.5149.66N
S10125.4549.41N
S11122.2645.92Y
S12123.4346.78Y
S13123.683346.78333N
S14123.8646.296Y
S15123.916747.36667N
S16126.105448.52878N
S17125.957448.56711Y
S18124.5348.48Y
S19124.5548.23333N
S20124.565448.36458Y
S21125.8948.0044N
S22126.19348.04149Y
S23124.442147.5309Y
S24124.729445.72N
S25124.646945.43639Y
S26124.8345.16222Y
S27124.983345.48333N
S28125.745.335Y
S29126.538845.75979N
S30126.71845.93Y
S31126.7245.82Y
S32126.416146.14218Y
S33125.35743.938Y
S34125.3543.85Y
S35125.44544.60313N
S36125.688644.78805N
S37126.188744.67113N
S38126.065844.9Y
S39126.4844.4Y
S40126.4944.31N
S41126.913946.00074N
S42127.013445.40921N
S43127.064444.90426N
S44127.336946.89167N
S45128.147445.91833N
S46128.601947.03444N
S47128.858947.8175N
S48127.87347.04286Y
S49129.355848.63694N
S50129.439446.27222N
S51129.573346.00361N
S52129.5846.33N
S53129.583644.53089Y
S54129.638946.73056N
S55129.672244.76579N
S56130.158147.95028N
S57128.739445.81808Y
S58125.303544.85627Y
S59125.675844.76854Y
S60125.900543.80154Y
S61125.779144.08246Y
S62125.674943.56837Y
S63126.03343.30054Y
S64127.259943.63825Y
S65126.4344.05Y
S66126.6143.78Y
S67129.016144.06063N
S68128.737243.75558N
S69127.849547.11137N
S70126.844643.11292N
S71126.8643.12Y
S72126.125442.69658Y
S73127.2242.73Y
S74126.9843.12Y
S75128.0942.36N
S76127.763242.04037N
S77130.543346.01944Y
S78130.583947.28806N
S79129.913646.64694N
S80130.687847.03167N
S81131.748947.23389N
S82132.5147.7N
S83132.458147.66139N
S84132.4647.724Y
S85131.078947.67972N
S86130.93345.8141Y
“Y” means the sample site belongs to long-term monitoring program, while “N” means the sample site is newly setted in this study.
Table A2. Correlation matrix of the land use and landscape metrics and population density used in the PLSR analysis a.
Table A2. Correlation matrix of the land use and landscape metrics and population density used in the PLSR analysis a.
Metrics bARFOGRWAURUNPDLPIEDLSICONTAGIJISHDISHEIAI
AR1
FO−0.9031
GR−0.231−0.0481
WA0.219−0.4860.0451
UR0.621−0.527−0.3650.0961
UN0.015−0.2540.290−0.025−0.2201
PD0.310−0.4340.2390.1580.3320.2021
LPI−0.2520.395−0.216−0.280−0.161−0.215−0.4681
ED0.196−0.2780.284−0.0050.2060.1810.742−0.7101
LSI−0.0780.126−0.029−0.3830.0990.2010.048−0.2760.4341
CONTAG−0.1330.368−0.231−0.488−0.08−0.282−0.6300.828−0.732−0.1121
IJI0.011−0.2480.0980.547−0.0150.2830.386−0.5020.172−0.167−0.7081
SHDI0.227−0.3790.1810.2450.1940.2950.502−0.8720.7220.404−0.8310.5451
SHEI0.084−0.2980.1840.4700.0420.2320.505−0.6630.522−0.101−0.8560.6960.6691
AI0.0220.012−0.1610.045−0.021−0.042−0.4060.166−0.3930.1210.276−0.086−0.131−0.6201
a The bold-faced numerical values indicate a significant relationship at a level of p < 0.01. b Abbreviations for land use and landscape metrics are listed in Table 2.
Table A3. Values of regression coefficients from PLSR models describing the relationships between land use/landscape metrics and individual water quality parameters in high flow seasons in each eco-region.
Table A3. Values of regression coefficients from PLSR models describing the relationships between land use/landscape metrics and individual water quality parameters in high flow seasons in each eco-region.
pHECDOCODCODMnNH3NNO3NTNTPFCl   SO   4 2
Zone 1 (N = 10)
AR 0.173 0.165 0.209 0.2530.2070.107 −0.006 0.340
FO0.006−0.1460.054−0.199 −0.541 0.044
GR −0.138 −0.367−0.217−0.750 −0.393
WA0.645 0.064 −0.462
UR0.400 −0.045 0.098 0.378−0.323
UN 0.2220.029 0.278 0.2050.190 0.301
PD0.036 0.059 0.336 −0.094
LPI0.0870.0760.208−0.207−0.151 −0.045
ED −0.1730.2640.2460.186 0.736 0.307
LSI −0.6050.338 −0.206
CONTAG 0.170−0.171−0.172−0.201 −0.335 −0.523
IJI 0.184 −0.0760.052 0.7080.588
SHDI 0.122−0.097−0.052 0.2590.396−0.477 −0.697
SHEI−0.299 0.009−0.291−0.184 −0.790 −0.139
AI0.261 −0.289−0.2311.101 0.970 −0.153
Zone 2 (N = 23)
AR 0.1630.067 0.547 0.007——
FO−0.440 0.047 0.164 −0.034——
GR0.231 0.280−0.065−0.401−0.103−0.243 −0.245−0.206 ——
WA −0.106 −0.029 ——
UR 0.362−0.210 0.176 0.478——
UN 0.539 0.125 0.553 0.301——
PD0.079−0.017 0.080−0.191−0.033 0.2660.249 ——
LPI 0.053 −0.1640.254 0.228−0.146 ——
ED −0.105 −0.054 −0.068 −0.0010.341 0.247 0.307 ——
LSI −0.054 ——
CONTAG 0.059 0.033 ——
IJI0.183 0.315 0.081 0.220 −0.460 0.588 ——
SHDI0.023 −0.060 −0.113 0.079 −0.368 −0.357 ——
SHEI 0.052−0.044 −0.342 ——
AI −0.071 ——
Zone 3 (N = 32)
AG−0.195 0.180 −0.235 0.047 0.255 0.204 0.278 0.2420.266 ——
FO0.207 −0.152 0.190 −0.065 0.266 −0.371 −0.209 −0.275 −0.228−0.190 ——
GR −0.029 ——
WA −0.371 −0.153 ——
UR 0.3110.061 0.1150.444 ——
UN 0.181 0.085 ——
PD 0.053 ——
LPI−0.105 −0.073 0.169 −0.428−0.335 0.100 ——
ED0.062 −0.121 0.012 0.382 ——
LSI0.168 0.180 −0.111 0.202 ——
CONTAG 0.106 −0.076 0.123 0.003 0.097 ——
IJI −0.1790.188 0.224 −0.137−0.104 ——
SHDI0.022 −0.099 0.063 −0.044 0.041 −0.099 ——
SHEI −0.138 −0.239 −0.107 −0.127 ——
AI−0.048 −0.376
Zone 4 (N = 12)
AG −0.037 0.2250.2660.233
FO 0.067 0.2490.291 0.2990.388−0.140−0.189
GR0.758 0.179 0.168 0.322
WA 0.571 0.494 −0.1530.168
UR 0.216 0.224 0.444
UN0.3650.187 0.0040.391 0.268 0.585
PD 0.058 0.3470.2690.245 0.3490.367
LPI 0.086 −0.054
ED 0.057 0.3760.3190.1750.6380.468 0.382
LSI0.0980.222−0.2340.322 0.202
CONTAG 0.3660.014 −0.146−0.132 −0.184
IJI −0.202
SHDI0.3160.249 −0.171
SHEI −0.1210.3710.201 −0.246−0.215−0.167
AI 0.2200.2980.248 0.081−0.012 0.376
Zone 5 (N = 9)
AR 0.1000.2350.0970.097 0.289 0.267 ——
FO 0.024−0.023 ——
GR 0.1350.209−0.160 ——0.116
WA 0.092 0.131 0.056 0.327——
UR 0.3870.1260.293 0.4510.235 ——0.105
UN0.071 0.249 ——
PD −0.100−0.103 ——
LPI0.938 0.073 −0.137 −0.182−0.3140.5610.472——
ED −0.068 0.119 ——
LSI 0.1520.1130.410——0.202
CONTAG 0.1040.023 −0.012 ——
IJI0.154 ——0.133
SHDI0.235 0.2870.116 0.2870.7930.391 ——
SHEI 0.115 0.022−0.034 ——
AI −0.090 −0.133−0.016——
The key predictors with the VIP values above 1 in the optimal models are in bold; “——” means no valid model was found for this water quality variable; Abbreviation of land use/landscape variables are listed in Table 2.
Table A4. Values of regression coefficients from PLSR models describing the relationships between land use/landscape metrics and individual water quality parameters in normal flow seasons in each eco-region.
Table A4. Values of regression coefficients from PLSR models describing the relationships between land use/landscape metrics and individual water quality parameters in normal flow seasons in each eco-region.
pHECDOCODCODMnNH3NNO3NTNTPFCl   SO   4 2
Zone 1 (N = 10)
AR0.0850.167−0.216 −0.2970.3170.040 ——
FO−0.043−0.1590.211−0.128 −0.050−0.3340.339−0.023−0.075 ——
GR −0.1380.259 0.025——
WA0.655 0.2820.221−0.2370.138−0.392——
UR0.2060.081−0.3320.306 0.624−0.006 −0.0730.066 ——
UN 0.302−0.182 0.409——
PD −0.046−0.0360.0660.288−0.0090.077 0.0821.024——
LPI 0.1600.184−0.217 0.2510.431−0.100 ——
ED −0.089−0.1690.3220.373 −0.229−0.2150.269−0.220 ——
LSI−0.299−0.0790.076 −0.2990.132 0.0940.019 ——
CONTAG −0.289−0.290 0.1660.224−0.2560.2160.841——
IJI 0.235 −0.0130.0660.2450.1700.103 −0.0780.641——
SHDI0.2430.171 −0.055 0.235 0.0410.0970.962——
SHEI 0.098−0.184−0.120−0.214 −0.076 −0.272−0.904——
AI0.321 0.032−0.140−0.086−0.153−0.254−0.026 −0.253 ——
Zone 2 (N = 23)
AR 0.230 −0.0230.240
FO 0.076−0.1510.189
GR0.269 0.192−0.136−0.503−0.0730.165 −0.058−0.062
WA0.249−0.1900.169 −0.119 −0.203−0.039
UR 0.254−0.1980.3000.111−0.1110.169 −0.0980.2250.429
UN−0.397 −0.127 0.6940.0830.344 0.1440.249−0.238
PD−0.0280.163−0.057−0.113 −0.0080.1920.2010.043−0.265
LPI−0.186 −0.0990.020 0.1150.115 0.202−0.176
ED0.003 0.013−0.013−0.0550.0250.2010.2120.003 0.069
LSI 0.214 0.197 −0.28
CONTAG−0.0830.046−0.0580.035 0.056 0.0580.094
IJI −0.0480.168−0.161 −0.190 −0.367−0.157
SHDI0.174 0.085−0.069 −0.098−0.053 −0.087 −0.388−0.35
SHEI−0.2890.0560.046 −0.024−0.219 −0.073 0.210
AI0.575 0.1950.050 0.249 0.007 −0.407−0.259
Zone 3 (N = 32)
AR−0.1890.249−0.223 0.136 −0.0490.3160.2780.310
FO0.202−0.2690.201 −0.147 0.095−0.178−0.213−0.274−0.300
GR 0.587 0.582 0.1910.134
WA −0.190 −0.013 0.247
UR−0.123 −0.291 −0.259
UN 0.152 0.372 0.112
PD −0.181 −0.193 0.188 −0.181 −0.408
LPI−0.1040.009−0.1140.1530.168−0.024−0.180−0.037 −0.0200.120−0.107
ED0.054−0.065 0.014
LSI0.069
CONTAG−0.0820.135−0.0650.0070.0240.320 0.1690.100 0.112
IJI0.093−0.1890.117−0.234 −0.328−0.121
SHDI0.0780.0490.0870.025−0.0490.3070.144 0.042−0.0450.206
SHEI0.086−0.1110.0810.0380.017−0.291 −0.032−0.129 −0.107
AI−0.0290.018−0.039 0.119 0.015
Zone 4 (N = 12)
AR 0.062 0.0740.038 0.0040.062
FO −0.0360.064 −0.181−0.233 0.192−0.4260.069
GR−0.124−0.0200.1610.188 −0.359 −0.389
WA −0.240 −0.280 −0.198−0.102 0.320
UR 0.220 −0.1260.1250.0170.175 0.0230.040 0.189
UN−0.054 0.205 −0.160
PD0.0440.241−0.120.1300.2920.2870.2350.3490.391 −0.05
LPI0.026−0.0920.065 −0.057−0.053−0.018−0.065 −0.135
ED0.0280.232−0.0980.1120.2910.3310.2290.4350.448 −0.050
LSI 0.142−0.0450.194 −0.170 −0.153
CONTAG−0.027−0.1250.073−0.109−0.256−0.227−0.051−0.111−0.227−0.230 0.009
IJI 0.316 −0.203−0.224
SHDI−0.0560.057 0.057 0.014−0.007 0.149
SHEI0.234 −0.239−0.225 −0.150
AI−0.236 0.257 0.051
Zone 5 (N = 9)
AR 0.228−0.396−0.217 0.5990.7050.4950.361 −0.344
FO 0.033 −0.018−0.187−0.051 −0.1510.103−0.0230.178
GR −0.038 0.058 0.037−0.347−0.329−0.014−0.082−0.026
WA 0.057 0.015 0.078−0.049−0.058
UR−1.167 −0.4720.1000.168−0.202 0.135−0.065−0.2980.395−0.041
UN−1.715−0.204 0.225−0.469 −0.467 0.191−0.639
PD−0.420 0.0050.176−0.016 0.0380.022−0.083
LPI 0.1300.235−0.153 −0.084 −0.3290.0590.147
ED −0.039 0.0640.0790.030 0.146−0.016−0.1140.118
LSI 0.188 −0.195 0.1990.0910.0490.319
CONTAG 0.059 −0.002 −0.071 0.1030.0250.0640.121
IJI−0.4340.2250.098 −0.1560.0110.023−0.006 −0.026
SHDI 0.180−0.123 0.3700.3970.3470.0860.022
SHEI −0.056 −0.036 0.080 −0.229−0.023−0.035−0.232
AI 0.081 −0.058 −0.072 −0.0630.0350.101−0.011
The key predictors with the VIP values above 1 in the optimal models are in bold; “——” means no valid model was found for this water quality variable; Abbreviation of land use/landscape variables are listed in Table 2.
Figure A1. Spatial variations of water quality parameters during different seasons in Songhua River Basin.
Figure A1. Spatial variations of water quality parameters during different seasons in Songhua River Basin.
Ijerph 15 01872 g0a1aIjerph 15 01872 g0a1b

References

  1. Basnyat, P.; Teeter, L.D.; Flynn, K.M.; Lockaby, B.G. Relationships Between Landscape Characteristics and Nonpoint Source Pollution Inputs to Coastal Estuaries. Environ. Manag. 1999, 23, 539–549. [Google Scholar] [CrossRef]
  2. Sliva, L.; Williams, D.D. Buffer zone versus whole cathcment approaches to studying land use impact on river water quality. Water Res. 2001, 35, 3462–3472. [Google Scholar] [CrossRef]
  3. Giri, S.; Qiu, Z. Understanding the relationship of land uses and water quality in twenty first century: A review. J. Environ. Manag. 2016, 173, 41–48. [Google Scholar] [CrossRef] [PubMed]
  4. Chen, Q.; Mei, K.; Dahlgren, R.A.; Wang, T.; Gong, J.; Zhang, M. Impacts of land use and population density on seasonal surface water quality using a modified geographically weighted regression. Sci. Total Environ. 2016, 572, 450–466. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Bu, H.; Meng, W.; Zhang, Y.; Wan, J. Relationships between land use patterns and water quality in the Taizi River Basin, China. Ecol. Indic. 2014, 41, 187–197. [Google Scholar] [CrossRef]
  6. Teixeira, Z.; Teixeira, H.; Marques, J.C. Systematic processes of land use/land cover change to identify relevant driving forces: Implications on water quality. Sci. Total Environ. 2014, 470–471, 1320–1335. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Lee, S.-W.; Hwang, S.-J.; Lee, S.-B.; Hwang, H.-S.; Sung, H.-C. Landscape ecological approach to the relationships of land use patterns in watersheds to water quality characteristics. Landsc. Urban Plan. 2009, 92, 80–89. [Google Scholar] [CrossRef]
  8. Uuemaa, E.; Roosaare, J.; Mander, Ü. Scale dependence of landscape metrics and their indicatory value for nutrient and organic matter losses from catchments. Ecol. Indic. 2005, 5, 350–369. [Google Scholar] [CrossRef]
  9. Shen, Z.; Hou, X.; Li, W.; Aini, G.; Chen, L.; Gong, Y. Impact of landscape pattern at multiple spatial scales on water quality: A case study in a typical urbanised watershed in China. Ecol. Indic. 2015, 48, 417–427. [Google Scholar] [CrossRef]
  10. Dow, C.L.; Arscott, D.B.; Newbold, J.D. Relating major ions and nutrients to watershed conditions across a mixed-use water-supply watershed. J. N. Am. Benthol. Soc. 2006, 25, 887–991. [Google Scholar] [CrossRef]
  11. Ding, J.; Jiang, Y.; Liu, Q.; Hou, Z.; Liao, J.; Fu, L.; Peng, Q. Influences of the land use pattern on water quality in low-order streams of the Dongjiang River basin, China: A multi-scale analysis. Sci. Total Environ. 2016, 551–552, 205–216. [Google Scholar] [CrossRef] [PubMed]
  12. Carrascal, L.M.; Galván, I.; Gordo, O. Partial least squares regression as an alternative to current regression methods used in ecology. Oikos 2009, 118, 681–690. [Google Scholar] [CrossRef] [Green Version]
  13. Shi, Z.H.; Ai, L.; Li, X.; Huang, X.D.; Wu, G.L.; Liao, W. Partial least-squares regression for linking land-cover patterns to soil erosion and sediment yield in watersheds. J. Hydrol. 2013, 498, 165–176. [Google Scholar] [CrossRef]
  14. Abdi, H. Partial least squares regression and projection on latent structure regression (pls regression). Wiley Interdisci. Rev.: Comput. Stat. 2010, 2, 97–106. [Google Scholar] [CrossRef]
  15. Luedeling, E.; Gassner, A. Partial least squares regression for analyzing walnut phenology in California. Agric. For. Meteorol. 2012, 158–159, 43–52. [Google Scholar] [CrossRef]
  16. Omernik, J.M. Ecoregions of the conterminous United States. Ann. Assoc. Am. Geogr. 1987, 77, 118–125. [Google Scholar] [CrossRef]
  17. Li, F.; Zhang, G.; Xu, Y.J. Spatiotemporal variability of climate and streamflow in the Songhua River basin, northeast China. J. Hydrol. 2014, 514, 53–64. [Google Scholar] [CrossRef]
  18. Wang, Y.; Meng, F. Characteristics of Water Environmental Pollution in Songhua River, 1st ed.; Chemical Industry Press: Beijing, China, 2013; pp. 22–23. [Google Scholar]
  19. Meng, F.; Wang, Y.; Zhang, L.; Cheng, P.; Xue, H.; Meng, D. Organic pollutant types and concentration changes of the water from Songhua river, China, in 1975–2013. Water Air Soil Pollut. 2016, 227. [Google Scholar] [CrossRef]
  20. Zhou, Q. Level I and II Eco-Functional Regionalization of Songhua River Watershed; National Major Science and Technology Program for Water Pollution Control and Treatment of China (Grant No: 2008ZX07526-002); Nankai University: Tianjin, China, 2011. (In Chinese) [Google Scholar]
  21. National Environmental Protection Board. Standard Methods for the Examination of Water and Wastewater (Version 4); China Environmental Science Publish Press: Beijing, China, 2002. [Google Scholar]
  22. Kandler, M.; Blechinger, K.; Seidler, C.; Pavlu, V.; Sanda, M.; Dostal, T.; Krasa, J.; Vitvar, T.; Stich, M. Impact of land use on water quality in the upper Nisa catchment in the Czech Republic and in Germany. Sci. Total Environ. 2017, 586, 1316–1325. [Google Scholar] [CrossRef] [PubMed]
  23. Alberti, M.; Booth, D.; Hill, K.; Coburn, B.; Avolio, C.; Coe, S.; Spirandelli, D. The impact of urban patterns on aquatic ecosystems: An empirical analysis in Puget lowland sub-basins. Landsc. Urban Plan. 2007, 80, 345–361. [Google Scholar] [CrossRef]
  24. Lee, S.-W.; Ellis, C.D.; Kweon, B.-S.; Hong, S.-K. Relationship between landscape structure and neighborhood satisfaction in urbanized areas. Landsc. Urban Plan. 2008, 85, 60–70. [Google Scholar] [CrossRef]
  25. Rattan, K.J.; Corriveau, J.C.; Brua, R.B.; Culp, J.M.; Yates, A.G.; Chambers, P.A. Quantifying seasonal variation in total phosphorus and nitrogen from prairie streams in the Red River basin, Manitoba Canada. Sci. Total Environ. 2017, 575, 649–659. [Google Scholar] [CrossRef] [PubMed]
  26. User Guide to Simca-p 13.0. 2012. Available online: http://landing.umetrics.com/downloads-simca-online (accessed on 28 August 2018).
  27. Ai, L.; Shi, Z.H.; Yin, W.; Huang, X. Spatial and seasonal patterns in stream water contamination across mountainous watersheds: Linkage with landscape characteristics. J. Hydrol. 2015, 523, 398–408. [Google Scholar] [CrossRef]
  28. Zampella, R.A.; Procopio, N.A.; Lathrop, R.G.; Dow, C.L. Relationship of land-use/land-cover patterns and surface-water quality in the Mullica river basin. J. Am. Water Resour. Assoc. 2007, 43, 594–604. [Google Scholar] [CrossRef]
  29. Zhou, P.; Huang, J.; Pontius, R.G., Jr.; Hong, H. New insight into the correlations between land use and water quality in a coastal watershed of China: Does point source pollution weaken it? Sci. Total Environ. 2016, 543, 591–600. [Google Scholar] [CrossRef] [PubMed]
  30. Nguyen, H.V.; Hur, J. Tracing the sources of refractory dissolved organic matter in a large artificial lake using multiple analytical tools. Chemosphere 2011, 85, 782–789. [Google Scholar] [CrossRef] [PubMed]
  31. Lee, Y.; Hur, J.; Shin, K.H. Characterization and source identification of organic matter in view of land uses and heavy rainfall in the lake Shihwa, Korea. Mar. Pollut. Bull. 2014, 84, 322–329. [Google Scholar] [CrossRef] [PubMed]
  32. Yu, S.; Xu, Z.; Wu, W.; Zuo, D. Effect of land use types on stream water quality under seasonal variation and topographic characteristics in the Wei River basin, China. Ecol. Indic. 2016, 60, 202–212. [Google Scholar] [CrossRef]
  33. Zhao, J.; Lin, L.; Yang, K.; Liu, Q.; Qian, G. Influences of land use on water quality in a reticular river network area: A case study in Shanghai, China. Landsc. Urban Plan. 2015, 137, 20–29. [Google Scholar] [CrossRef]
  34. Ahearn, D.S.; Sheibley, R.W.; Dahlgren, R.A.; Anderson, M.; Johnson, J.; Tate, K.W. Land use and land cover influence on water quality in the last free-flowing river draining the western Sierra Nevada, California. J. Hydrol. 2005, 313, 234–247. [Google Scholar] [CrossRef]
  35. He, C.S.; Malcolm, S.B.; Dahlberg, K.A.; Fu, B.J. A conceptual framework for integrating hydrological and biological indicators into watershed management. Landsc. Urban Plan. 2000, 49, 25–34. [Google Scholar] [CrossRef] [Green Version]
  36. Jones, K.B.; Neale, A.C.; Nash, M.S.; Van Remortel, R.D.; Wickham, J.D.; Riitters, K.H.; O′Neill, R.V. Predicting nutrient and sediment loadings to streams from landscape metrics: A multiple watershed study from the United States Mid-Atlantic Region. Landsc. Ecol. 2001, 16, 301–312. [Google Scholar] [CrossRef]
  37. Griffth, J.A. Geographic techniques and recent applications of remote sensing. to landscape-water quality studies. Water Air Soil Pollut. 2002, 138, 181–197. [Google Scholar] [CrossRef]
  38. McGarigal, K.; Marks, B.J. Fragstats: Spatial Pattern Analysis Program for Quantifying Landscape Structure; General Technical Report PNW-GTR-351; USDA Forest Service, Pacific Northwest Research Station: Portland, OR, USA, 1995. [Google Scholar]
  39. Allan, J.D. Landscapes and Riverscapes: The Influence of Land Use on Stream Ecosystems. Annu. Rev. Ecol. Evol. Syst. 2004, 35, 257–284. [Google Scholar] [CrossRef] [Green Version]
  40. Cade-Menun, B.J.; Bell, G.; Baker-Ismail, S.; Fouli, Y.; Hodder, K.; McMartin, D.W.; PerezValdivia, C.; Wu, K. Nutrient loss from Saskatchewan cropland and pasture in spring snowmelt runoff. Can. J. Soil Sci. 2013, 93, 445–458. [Google Scholar] [CrossRef]
  41. Xie, X.S.; Berner, N.Z.; Stüben, D. A Gis-supported multivariate statistical analysis of relationships among stream water chemistry, geology and land use in Baden-Württemberg, Germany. Water Air Soil Pollut. 2005, 167, 39–57. [Google Scholar] [CrossRef]
Figure 1. Distribution of environmental characteristics of Songhua River Basin.
Figure 1. Distribution of environmental characteristics of Songhua River Basin.
Ijerph 15 01872 g001
Figure 2. Locations of sampling sites in Songhua River Basin.
Figure 2. Locations of sampling sites in Songhua River Basin.
Ijerph 15 01872 g002
Figure 3. Land use distribution in Songhua River Basin.
Figure 3. Land use distribution in Songhua River Basin.
Ijerph 15 01872 g003
Figure 4. Distribution patterns of water quality parameters during high-flow, normal flow and icebound seasons in different eco-regions of the Songhua River Basin, China (The box represented 25th and 75th percentiles; the small square represented mean; the line in box represented median; values above or below whiskers were outliers). Zone 1–5 refers to the five eco-region listed in Table 1.
Figure 4. Distribution patterns of water quality parameters during high-flow, normal flow and icebound seasons in different eco-regions of the Songhua River Basin, China (The box represented 25th and 75th percentiles; the small square represented mean; the line in box represented median; values above or below whiskers were outliers). Zone 1–5 refers to the five eco-region listed in Table 1.
Ijerph 15 01872 g004
Figure 5. Land use composition (%) in five eco-regions in Songhua River Basin, China.
Figure 5. Land use composition (%) in five eco-regions in Songhua River Basin, China.
Ijerph 15 01872 g005
Figure 6. (a) Weight plots of the first and second PLSR components for individual water quality parameter in the high-flow season, and (b) weight plots of the first and second PLSR components for individual water quality parameter in the normal flow season. Land use/landscape variables with the highest VIP values in each eco-region was in red and highlighted with boxes. Abbreviations for land use/landscape metrics are listed in Table 2.
Figure 6. (a) Weight plots of the first and second PLSR components for individual water quality parameter in the high-flow season, and (b) weight plots of the first and second PLSR components for individual water quality parameter in the normal flow season. Land use/landscape variables with the highest VIP values in each eco-region was in red and highlighted with boxes. Abbreviations for land use/landscape metrics are listed in Table 2.
Ijerph 15 01872 g006
Table 1. Average values of the main environmental variables which characterize the five eco-regions.
Table 1. Average values of the main environmental variables which characterize the five eco-regions.
CodeDefinitionLocationAltitude (m)Annual Mean Temperature (°C)Runoff Depth (mm)
Zone 1Mountainous Normal flowGreat Khingan Mountain559−0.26179
Zone 2Plain low-flowSongnen Plain1820.3645
Zone 3Hilly high-flowSecond Songhua and Songhua river mainstream3320.23303.77
Zone 4Mountainous high-flowChangbai Mountain6040.26341.59
Zone 5Plain normal flowSanjiang Plain1740.32177
Table 2. Land use and landscape metrics a used in this study.
Table 2. Land use and landscape metrics a used in this study.
Landscape Metrics (Abbreviation)Descriptions
Arable land (AR)Land use for crops cultivation, land mainly used for planting and beach cultivated more than three years (unit: %)
Forest (FO)Including growing arbor, shrub, bamboo, mangrove and other young afforested land.
Grassland (GR)Land use for herbaceous plant, coverage above 5% (unit: %)
Water (WA)Inland water area and land use for water conservancy facilities (unit: %)
Urban (UR)Residential area, industrial area and roads (unit: %)
Unused land (UN)Barren land, alkaline land, sand and waste land (unit: %)
Patch density (PD)Numbers of patches per unit area (number per 100 ha)
Largest patch index (LPI)Percentage of the landscape in the largest patch (unit: %)
Edge density (ED)Total length of all edge segments per hectare for the considered landscape (unit: m/ha)
Landscape shape index (LSI)Provides a standardized measure of total edge or edge density that adjusts for size of the landscape.
Contagion (CONTAG)Tendency of land use types to be aggregated (unit: %)
Interspersion and juxtaposition index (IJI)Based on patch adjacencies, not cell adjacencies like the contagion index.
Shannon’s diversity index (SHDI)Based on information theory; indicates the patch density in a landscape (unitless)
Shannon’s evenness index (SHEI)Minus the sum across all patch types, of the proportional abundance of each patch type multiplied by that proportion, divided by the logarithm of the number of patch types (unitless)
Aggregation index (AI)Number of like adjacencies involving the corresponding land use type, divided by the maximum possible number of like adjacencies involving the corresponding land use type (unit: %)
a landscape metrics are calculated by FRAGSTATS 4.0.
Table 3. Descriptive statistics of landscape metrics in each eco-region in the Songhua River Basin.
Table 3. Descriptive statistics of landscape metrics in each eco-region in the Songhua River Basin.
Landscape MetricsZone 1Zone 2Zone 3Zone 4Zone 5One-Way Anova
MeanMeanMeanMeanMeanp
PD (#/100 ha)0.110.120.100.0930.0860.002 **
LPI (%)29.9037.3843.8250.3925.300.007 **
ED (m/ha)8.196.856.976.867.210.013 *
LSI15.5714.0614.5116.5511.830.403
CONTAG (%)50.7951.7455.3356.5947.260.037 *
IJI (%)52.3858.8853.1154.0457.560.015 *
SHDI1.441.491.381.321.640.040 *
SHEI0.570.580.510.520.630.024 *
AI (%)52.0260.9763.8954.2762.530.136
Abbreviation of landscape metrics are listed in Table 2. * means p < 0.05, ** means p < 0.01.
Table 4. Results from Partial Least Square regression analysis for water quality parameters in each eco-region.
Table 4. Results from Partial Least Square regression analysis for water quality parameters in each eco-region.
Season Zone 1Zone 2Zone 3Zone 4Zone 5
YR2Q2ComponentR2Q2ComponentR2Q2ComponentR2Q2ComponentR2Q2Component
High-flowpH0.530.5120.580.5620.540.5220.580.5620.540.522
EC0.530.5120.730.5830.720.6220.730.5830.720.622
DO0.880.6030.720.6930.730.6020.720.6930.730.602
COD0.660.5720.730.6220.760.5120.730.6220.760.512
CODMN0.630.6820.630.5120.640.5820.630.5120.640.582
NH3N0.680.5120.650.6220.700.6920.650.6220.700.692
NO3N0.710.6520.630.5720.750.5920.630.5720.750.592
TN0.660.5120.720.7020.710.5020.720.7020.710.502
TP0.730.6630.710.7120.770.5120.710.7120.770.512
F0.680.4230.690.5820.810.6630.690.5820.810.663
Cl0.680.5530.490.1110.540.3210.490.1110.540.321
SO 4 2 0.520.2420.750.7330.640.5220.750.7330.640.522
Normal flowpH0.700.5220.590.5120.640.6030.590.5120.640.603
EC0.600.5020.820.5220.840.6320.820.5220.840.632
DO0.760.5920.790.5720.640.5920.790.5720.640.592
COD0.780.5920.610.5920.610.5520.610.5920.610.552
CODMN0.630.5320.630.5220.680.6320.630.5220.680.632
NH3N0.640.5820.790.6020.600.1320.790.6020.600.132
NO3N0.770.6930.720.2120.930.5920.720.2120.930.592
TN0.750.6820.800.6020.760.6020.800.6020.760.602
TP0.780.8020.730.7820.890.5330.730.7820.890.533
F0.790.6420.730.6920.640.4830.730.6920.640.483
Cl0.870.7430.470.2410.840.5930.470.2410.840.593
SO 4 2 0.680.5420.730.6520.610.6030.730.6520.610.603
Table 5. The relative importance of the key variables in the optimal models in high-flow season.
Table 5. The relative importance of the key variables in the optimal models in high-flow season.
YSignificant Predictors (A)
Zone 1Zone 2Zone 3Zone 4Zone 5
pHWA (0.645), UR (0.400), SHEI (−0.299)FO (−0.440), PD (0.079), IJI (0.183)AR (−0.195), FO (0.207)GR (0.758), UN (−0.365)UN (−0.071), LPI (0.938)
ECGR (−0.138), UN (0.222), SHDI (0.122)UR (0.362)AR (0.180), FO (0.152), IJI (−0.179)WA (−0.571), CONTAG (0.366), AI (−0.220)AR (0.100), SHEI (−0.115)
DOFO (0.054), UR (−0.045), LPI (0.208)GR (0.280), UR (−0.210), IJI (0.315)AR (−0.235), FO (0.190), IJI (0.188)FO (0.067), GR (0.179), PD (−0.058)AR (−0.235), UR (−0.387), LPI (−0.073)
CODLPI (−0.207), ED (0.264), CONTAG (−0.171)PD (0.080), IJI (0.081), AI (−0.071)AR (0.047), UN (0.181), PD (0.053)WA (−0.494), SHEI (−0.201), AI (−0.248)AR (−0.097), UR (0.126), SHDI (0.116)
CODMnAR (0.209), ED (0.322), CONTAG (−0.172)AR (0.163), GR (−0.401), UN (0.539)FO (0.266), WA (−0.371), SHEI (−0.138)FO (0.249), PD (0.347), ED (0.376)FO (−0.024), UR (0.293)
NH3NGR (−0.367), SHEI (−0.291), AI (−0.289)LPI (−0.164), IJI (0.220), SHDI (0.079)AR (0.255), FO (−0.371), LPI (−0.428)FO (−0.291), PD (0.269), ED (0.376)UN (0.249)
NO3NSHDI (0.259), AI (−0.231)GR (−0.243), LPI (0.254), ED (0.341)UR (0.311), LPI (−0.335)UR (0.224), PD (0.245), ED (0.175)AR (0.289), GR (−0.135), SHDI (0.287)
TNAR (0.253), GR (−0.750), SHDI (0.396)AR (0.547), PD (0.266), IJI (−0.460)AR (0.204), FO (−0.209)FO (−0.299), ED (0.638)GR (−0.209), UR (0.451), SHDI (0.793)
TPAR (0.207), ED (0.736), CONTAG (−0.335)GR (−0.245), UN (0.553), SHDI (−0.357)AR (0.278), FO (−0.275), IJI (−0.137)FO (−0.388), PD (0.349), ED (0.468)AR (0.267), LPI (−0.561), SHDI (0.391)
FSHEI (−0.790), AI (0.970)GR (−0.206), LPI (−0.146)AR (0.242), FO (−0.228), UR (0.115)AR (0.225), GR (−0.168)WA (0.327), LPI (0.472), LSI (0.410)
ClED (0.307), IJI (0.588), SHDI (−0.697)UR (0.478), IJI (0.588)AR (0.266), FO (−0.190), UR (0.444)UR (0.444), UN (0.585), LSI (0.202)——
SO 4 2 AR (0.340), WA (−0.462), LSI (−0.206)————GR (−0.322), WA (0.168)GR (−0.116), LSI (0.202)
Y means the response variables in the PLSR models; A means the regression coefficient; the key variables with the highest VIP values in the optimal models are in bold; “——”means no valid model was found for this water quality variable; Abbreviation of land use/landscape variables are listed in Table 2.
Table 6. The relative importance of the key variables in the optimal models in normal flow season.
Table 6. The relative importance of the key variables in the optimal models in normal flow season.
YSignificant Predictors (A)
Zone 1Zone 2Zone 3Zone 4Zone 5
pHWA (0.655), UR (0.206), LSI (−0.299)GR (0.269), SHDI (0.174), AI (0.575)AR (−0.189), FO (0.202)SHEI (0.234), AI (−0.236)UR (−1.167), UN (−1.715)
ECGR (−0.138), UN (0.302), SHDI (0.171)WA (−0.190), UR (0.254), PD (0.163)AR (249), FO (−0.269), IJI (−0.189)WA (−0.240), UR (0.220), PD (0.241)AR (0.228), LSI (0.188), IJI (0.225)
DOAR (−0.216), FO (0.211), UR (−0.332)GR (0.192), UR (−0.198), IJI (0.168)AR (−0.223), FO (0.201), IJI (0.117)FO (0.064), GR (0.161), PD (−0.124)AR (−0.396), UR (−0.472), LPI (0.235)
CODUR (0.306), ED (0.322), CONTAG (−0.289)UR (0.300), AI (0.195)UN (0.372), SHEI (0.038)WA (−0.280), SHEI (−0.225), AI (0.257)AR (−0.217), UR (0.100), SHDI (−0.123)
CODMnED (0.373), CONTAG (−0.290)GR (−0.503), UN (0.694)LPI (0.168), SHEI (0.017), AI (0.119)FO (0.181), PD (0.292), ED (0.291)FO (−0.187), UR (0.168)
NH3NUR (0.624), SHEI (−0.214), AI (−0.153)LPI (0.115), IJI (−0.190), SHDI (−0.098)GR (0.587), LPI (−0.024)FO (−0.233), PD (0.287), ED (0.331)UR (−0.202), UN (0.225), LSI (−0.195)
NO3NFO (−0.334), SHDI (0.235), AI (−0.254)UR (0.169), LPI (0.115), ED (0.201)UR (−0.291), PD (0.188)UR (0.175), PD (0.235), ED (0.229)AR (0.599), GR (−0.347), SHDI (0.370)
TNFO (−0.339), LPI (0.224), CONTAG (0.224)AR (0.230), PD (0.201), IJI (−0.367)AR (0.316), GR (0.582)FO (−0.192), PD (0.349), ED (0.435)AR (0.705), GR (−0.329), SHDI (0.397)
TPAR (0.317), ED (0.269), CONTAG (−0.256)UR (0.144), LPI (0.202), SHDI (−0.087)AR (0.278), FO (−0.213), IJI (−0.121)FO (−0.426), PD (0.391), ED (0.448)AR (0.495), LPI (−0.329), SHDI (0.347)
FPD (0.082), SHEI (−0.272), AI (−0.253)UR (0.225), UN (0.249), LSI (0.197)AR (0.310), FO (−0.274), UR (−0.259)UR (0.040), GR (−0.359)AR (0.361), LPI (0.059), LSI (0.091)
ClUN (0.409), IJI (0.641), SHDI (0.962)UR (0.429), SHDI (−0.388), AI (−0.407)FO (−0.300), ——UR (0.395), LSI (0.049), AI (0.101)
SO 4 2 ——LSI (−0.278), SHDI (−0.351), AI (−0.259)WA (0.247), PD (−0.408)GR (−0.389), WA (0.320), UR (0.189)UR (−0.041), UN (−0.639), LSI (0.319)
Y means the response variables in the PLSR models; A means the regression coefficient; the key variables with the highest VIP values in the optimal models are in bold; “——”means no valid model was found for this water quality variable; Abbreviation of land use/landscape variables are listed in Table 2.

Share and Cite

MDPI and ACS Style

Cheng, P.; Meng, F.; Wang, Y.; Zhang, L.; Yang, Q.; Jiang, M. The Impacts of Land Use Patterns on Water Quality in a Trans-Boundary River Basin in Northeast China Based on Eco-Functional Regionalization. Int. J. Environ. Res. Public Health 2018, 15, 1872. https://doi.org/10.3390/ijerph15091872

AMA Style

Cheng P, Meng F, Wang Y, Zhang L, Yang Q, Jiang M. The Impacts of Land Use Patterns on Water Quality in a Trans-Boundary River Basin in Northeast China Based on Eco-Functional Regionalization. International Journal of Environmental Research and Public Health. 2018; 15(9):1872. https://doi.org/10.3390/ijerph15091872

Chicago/Turabian Style

Cheng, Peixuan, Fansheng Meng, Yeyao Wang, Lingsong Zhang, Qi Yang, and Mingcen Jiang. 2018. "The Impacts of Land Use Patterns on Water Quality in a Trans-Boundary River Basin in Northeast China Based on Eco-Functional Regionalization" International Journal of Environmental Research and Public Health 15, no. 9: 1872. https://doi.org/10.3390/ijerph15091872

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop