1. Introduction
By 2050, worldwide energy and water consumption is forecasted to increase by 80% and 55% (compared with the level in the year 2000), respectively [
1]. The stress on resource consumption comes from increasing demand, which is driven by the growing population as well as by shifts in anthropogenic processes [
2,
3,
4,
5,
6]. Urbanization is considered one of the most vital anthropogenic alterations [
7], and urban patterns are often used to dictate the concentration and distribution of such alterations. Urban patterns can be defined by a city’s area, aggregation, spatial metrics [
8], and coupling with anthropogenic activities, which, in turn, influence regional resource consumption through a complex mechanism [
9,
10,
11]. Given that the altering pattern and pace of different anthropogenic activities are identified to be distinct from one another [
12], then incorporate with the shifting urban pattern, resource consumption should be evaluated in a spatially and temporally focused manner to deliver a full picture.
Contemporary studies largely draw on how socio-economic variables, which can be expressed in terms of anthropogenic activity, spatial pattern, meso-economic structure, or combined effects on behavior, affect the flow of resource consumption and production [
13,
14,
15]. These variables may include ones with socio-demographic characteristics (e.g., income level, GDP, population, household size, density) or may include built environment attributes (e.g., urban zoning, housing unit, dwelling age, floor area) [
11,
15,
16,
17,
18,
19,
20]. Various social-economic contexts, such as socio-demographic and the built-environment, shape the specific spatial cluster attributes for a given region (township or city) and form different resource consumption patterns [
16,
21]. For example, variables of building types and layout, consumption behavior, and human lifestyle have been evaluated to identify spatial characteristics in recent years [
22,
23]. Thus, the complex spatial and temporal resource consumption facing urban transitions can be analyzed with a large quantity of spatial data of socio-economic variables [
24,
25]. Additionally, the consumption intensities of resource usage are dissimilar in various enterprises and in human activities under different sectors [
15], and as such, they should be evaluated separately. The relationship between sectoral social-economic contexts and spatially distributed resource consumption is less investigated, and the correlations are expected to be more intense for a region undergoing urban transformation.
Various analytical models can be used to analyze the flow of energy and water consumption. Conventional input–output analysis (IOA) and life cycle assessment (LCA) are static models that are unable to capture urban spatial processes [
17,
26]. Regional or city-scale analysis is developed by IOA and material flow assessment (MFA) with approximations (e.g., by downscaling national flows based on population share, or share in GDP), and the assumptions of consumption intensity are often borrowed at the national level. IOA, LCA, and MFA frequently emphasize illustrated disaggregated and intersectoral resource flows, yet the spatial drivers and underlying causes of such flows are less discussed or ignored. However, spatially aggregated models are made possible with these static models coupling with intersectional resource flows and spatial data sets [
27] or through simulation [
28]. Nevertheless, data availability and quality, assumptions, or spatial averaging for these models may lead to different results with estimated uncertainty, as the interactions among the constituting elements are not spatially and temporally stationary [
11,
29]. In contrast, the analytical approach of spatial econometrics has been used to better understand the overall context and to evaluate resource consumption by incorporating socio-economic variables in a spatial-temporal manner [
20,
26,
30]. In this study, a large quantity and detailed spatial open-source data available from the Taiwanese government are used to overcome the listed difficulties for truthful simulation.
Spatial econometric models are devised to indicate the spatial dynamics of resource consumption as a comprehensive representation of urban physical and social processes by highlighting the autocorrelation (spatial dependency), heterogeneity (spatial non-stationarity), and weighting (spatial correction) of explanatory variables. Geographically weighted regression (GWR) and ordinary least-squares (OLS) are the most commonly applied models. OLS is a stationary regression analysis, whereas GWR extends the function by adding a geographical location parameter and accounts for spatial heterogeneity through calibration [
31,
32,
33]. Until now, GWR analysis has been extensively used in the fields of urban planning [
34,
35,
36], environmental science [
37,
38,
39], ecosystem [
40,
41,
42] as well as sociology [
43]. In this study, GWR with modified procedures is applied to illustrate the non-stationary nature of resource consumption in spatial structures and to outline the spatial effect of its driving variables.
For a city that undergoes major urban transformation such as massive urban renewal or municipality merger, identification of the temporal-spatial changes of socio-economic variables becomes crucial in estimating resource consumption. Additionally, the illustration of the spatial dynamics of resource consumption over a timespan is essential in order to formulate resource utilization planning to enhance resource efficiency in metropolises. This study extends current GWR approaches by considering the non-stationary spatial and temporal variation of socio-economic factors to predict water and energy consumption at township-level (second level in geographical classification) through a series of procedures. Figures of a consolidated city-county (the unified administrative body of Taichung city is used as the case study) were extracted to explore dramatic changes in socio-economic variables. To diminish the bias of long-term changes and variance in non-linear regression analysis, local indicators of spatial association (LISA) was applied to identify local clusters and to screen a better bandwidth, which is needed in GWR analysis [
32,
38,
44,
45]. Multicollinearity was removed with OLS. The above procedures are used to improve prediction accuracy of GWR.
A GWR analysis was then used to assess the spatial effects of the selected variables in a benchmark year and further to predict the resource consumption trends based on the spatial coefficient of these variables. Moreover, socio-economic variables and resource consumption are grouped into domestic, industrial, and agricultural sectors in order to analyze sectoral consumption in terms of spatial distribution and to understand how the variations in variables contribute to sectoral consumption. The findings would be of great benefit for policymaker to develop new policies or to improve the effectiveness of existing programs regarding urban activities and resource planning.
The four objectives of the current study are (a) to evaluate the spatial-temporal characteristics of socio-economic variables and to develop cluster maps for a city undergoing mass urban transformation; (b) to analyze the coefficients of water and energy consumption of the driving variables at township-level; (c) to compare the predicted results with actual data for validation; and (d) to forecast the changes in domestic water and energy consumption to provide insight for policymaking.
2. Materials and Methods
2.1. The Study Site
Taichung City consists of 29 townships and was used as the study site. Former Taichung City and Taichung County were merged to form a larger Taichung City in 2010, and it is now the second largest metropolitan area in Taiwan. A total of 70% of the precision machinery, machine tools, and parts manufacturers in Taiwan have manufacturing facilities in Taichung. Production from the local industrial clusters comprises 64.1% of the local GDP, and 35.9% of total outputs come from the service sector, with insignificant agricultural and mining activities. The total production output has increased 21.9% from 2011 to 2016, indicating that Taichung City underwent urban transformation after 2010.
The energy and water consumption datasets as well as social and economic data were taken from the Taiwan Power Company (TPC), the Water Resources Agency (WRA), and the Ministry of the Interior (MOI), respectively. Data for the consumed energy and water were available for 625 villages at village-level (third level in geographical classification) with 1020 datasets from 2015 to 2017, and socio-economic variables were released at township-level (second level).
To understand the spatial effects of socio-economic variables on resource consumption, this study first analyzes the changes of three main variables from 2011 to 2015 (two variables were excluded with missing data), and the geostatistical patterns of five socio-economic variable are identified at township-level for 2015, as summarized in
Supplementary Table S1. The limitation of the available open-source data based on geographical classification [
46] and the geographic changes of resource consumption is that more than 30 datasets need to be used to generate cluster maps by means of LISA analysis. The energy and water consumption at village-level is available for the year 2015 and afterwards. The benchmark year of 2015 was selected, as it represents the earliest available village-level figures. However, data regarding socio-economic variables are available at township-level for 29 towns and were excluded from LISA analysis, as the results are only reliable if the input feature class contains at least 30 features.
2.2. Study Framework and Methodology
This study develops a comprehensive framework to analyze the spatial-temporal changes of regional water and energy consumption with respect to influencing factors in urban transformation, as shown in
Figure 1. First, the spatial-temporal changes of the socio-economic variables for Taichung City from 2011 to 2015 were outlined to demonstrate non-stationary spatial urban transformation. To improve the estimation accuracy, the spatial cluster characteristics and the impacts of multicollinearity are required to be identified. Cluster characteristic changes of water and energy consumption at township-level from 2015 to 2017 were presented in terms of local spatial autocorrelation and heterogeneity with LISA. Additionally, applicable bandwidth was specified in the process. These are used to provide a preliminary assessment of the non-stationarity characteristics of the dependent and independent GWR variables.
Then, spatial varying energy and water consumption derived from the identified socio-economic variables were evaluated with OLS analysis and spatial regression analysis (GWR) in the benchmark year 2015. In addition, the GWR analysis results were further refined by prescreening the variance inflation factor (VIF) during the OLS analysis (also in the third step). The estimated performance of GWR was then evaluated by comparing the estimated figures with actual data from 2016 and 2017. Finally, resource consumption forecasting for the domestic sector for the years 2030 and 2050 was provided to offer useful information in decisions for public policy.
2.3. Spatial Autocorrelation and Heterogeneity
The cluster characteristics are emphasized with the use of Moran’s I and LISA to prescreen the spatial autocorrelation effects [
32,
36,
39,
43,
45,
47,
48]. LISA is modified from Moran’s I to indicate the cluster for each location [
49], and was included to provide the local spatial autocorrelation.
Ii values from the LISA index are used to convert Z-score and local G-statistic to reveal the statistical significance and cluster trends. Here, the 90, 95, or 99 percent confidence levels were used. Additionally, the values with Z > 2.58 indicate a positive autocorrelation in the observations at the confidence coefficient of 99%. When Z is zero, it indicates spatial randomness.
The LISA formulas are shown as follows:
where
is an attribute (energy or water consumption) for feature
i;
is the average value of all of the attribute values in the study area;
wij is the spatial weight between feature
i and
j;
S is the variance of
;
Z(
Ii) is the expected value of LISA;
Ii for feature
i;
Gi* measures the high/low clustering for the attribute values (such as
Z,
Ii, or
) in the study site.
When Ii is positive, it shows similar values to a variable are found in its neighboring locations. Positive Ii determines the locations of statistically significant clusters, known as hotspots (HH or LL cluster). A negative value exists when dissimilar values in neighboring districts are reported, indicating HL or LH cluster. Then, a cluster map of four zones can be labeled based on the Z-score and the p-value of LISA, namely high–high (HH), low–high (LH), low–low (LL), and high–low (HL).
2.4. Spatial Regression Analysis
GWR is a local form of linear regression incorporating spatial non-stationarity in regression analysis [
31]. The ability to visually represent the varying strength of spatial relationship between the dependent and independent variables is one benefit of GWR. In this study, GWR was used to analyze the spatial relationships between energy and water consumption and driving factors in urban transformation. The proposed approach is also beneficial for predicting hotspots for regional resource consumption by characterizing the spatial-temporal changes of the socio-economic context at township-level. The important outcomes from GWR analysis are derived from the individual regression output: every spatial dataset has its regression equation with an R
2 value (R
2 and adjusted R
2), the predicted dependent variable, coefficient, intercept for each independent variable, residual value, and condition number. The R
2 value represents the variance for a dependent variable, and the adjusted R
2 value presents the better fitting results by checking Akaike information criterion (AIC). The predicted value (fitted y) is estimated by GWR; and the residual values are obtained by subtracting the fitted y values from the observed y values. The formula is as follows:
where
yj represents the dependent variable;
uj and
vj are the coordinates for location
j;
β0 (
uj,
vj) represents the intercept for location
j; and
βi (
uj,
vj) is the local estimated coefficient for the independent variable
xij;
εj presents the random spatial error coefficient.
GWR coefficients are used to disclose the strong or weak correlations between the independent variables and the dependent variables, and there is a concern for possible multicollinearity [
50]. Local collinearity may lead to unreliable coefficient estimates and should be treated with caution [
32,
50,
51]. A condition number larger than 30 indicates that GWR estimation may be unreliable due to the presence of multicollinearity. Also, GWR is sensitive to kernel bandwidth, and the optimal bandwidth can be chosen by measuring the minimized AIC in GWR [
32,
38,
40,
52] or through LISA. Multicollinearity can be quantified with the help of VIF in OLS prescreening, as it may cause misleading results for regression coefficients and standard errors. Most of the independent variables with VIF > 7.5 are then excluded in GWR analysis.
3. Results and Discussion
In Taichung City, water and energy consumption datasets are available at village-level, and most of the socio-economic variables are only available at township-level. The study subdivides these variables into three classified sectors of 29 districts (township-level) for LISA, GWR, and OLS. The benchmark year of 2015 is used as the benchmark year to estimate performance valuation and predication.
3.1. Spatial-Temporal Characteristics of Energy and Water Consumption
Based on previous papers, resources consumption is influenced by socio-economic variables, both spatially and temporally. There are three main socio-economic variables that have been selected to outline the magnitude of urban transformation in terms of relative ratios from 2011 to 2015 since Taichung City was merged in 2010. Other variables with missing data from open-source information such as building attributes were excluded from the analysis. Taichung City experienced obvious spatial variations after being merged, as exhibited in
Figure 2 and
Supplementary Table S2. Red, orange, and yellow-colored regions indicate the townships with significant changes. The variations are found to be noticeable when considering household income (
Figure 2b) and the number of commercial and industrial facilities (
Figure 2c), while the population remains modest. The spatial patterns at each township state the spatial non-stationarity of these socio-economic variables for Taichung City during the evaluated period.
The geostatistic results of five socio-economic variables (along with building attributes) are summarized in
Supplementary Table S1 for 29 townships to reveal the spatial characteristics. Different geostatistic types are identified, population is reported with lognormal distribution pattern, and the number of enterprises is presented with a geometric distribution pattern. These divergent geostatistic patterns of the variables largely affect spatial characteristics, such as autocorrelation and heterogeneity, for a given region.
The energy and water consumption determined with 1020 datasets for 625 villages were used in LISA analysis to reveal the spatial distribution for the years 2015, 2016, and 2017. As the map patterns of 2016 are similar to those of 2015, they are excluded in the following section.
Figure 3 reveals noticeable spatial changes for energy consumption, while the clustering effects are less obvious in terms of water consumption. Areas colored in red and dark blue represent HH and LL clusters, which correspond to significant clusters of high and low values, respectively. Areas shown in orange and light blue indicate HL and LH clusters, respectively. In addition, areas with no significant changes are uncolored and are excluded to diminish the spatial error term. The results indicate that energy and water consumption is varied spatially and non-stationarily over time. For example, the HH clusters for energy consumption in 2017 have expanded extensively compared to in 2015, especially in the southern area of Taichung City.
Spatial variations of socio-economic variables and the clustering of resource consumption rates are considered to provide spatial-temporal description for the variables. HH clusters of energy and water consumption are situated (
Figure 3a,c) for Xitun District, which is also the region with the highest average household income, suggesting that the level of resource consumption may be affected by household income. Additionally, other regions with HH clusters for energy consumption in 2017, such as Taiping, Wufeng, Wuri, and Dali District, are observed with non-stationary shifts in serval socio-economic variables, particularly in the number of enterprises. These findings are consistent with previous studies that showed energy usage is positively correlated with population and income level [
16,
53] and that building attributes [
54,
55], especially for built-up areas per unit, significantly affects resource consumption. However, the results are not able to conclude to what extent each variable contributes to the fluctuating resource consumption or if any spatial lag or time lag are experienced. Here, the non-stationary characteristics of spatial variation that potentially affect socio-economic variables and resource consumption for 29 townships are outlined.
3.2. Spatial Regression Analysis between Resource Consumption and Socio-Economic Variables
GWR was employed to investigate the relationship between resource consumption and socio-economic variables in order to identify the main driving force for energy and water consumption patterns for the benchmark year 2015. Seven socio-economic variables were selected based on previous studies, including the number of enterprises, average household incomes, population, household units, and building attributes (floor area and various building structure). A OLS analysis screening was conducted to reduce multicollinearity and bias prior to GWR analysis. Five out of seven independent variables were selected with low multicollinearity (VIF < 7.5 or with significant coefficients) for GWR analysis, as shown in
Table 1.
Table 2 and
Table 3 summarize the average energy and water consumption coefficients as determined with GWR analysis. By comparing AIC and local R
2 from OLS and GWR analysis, GWR is confirmed to have better predictive results than OLS, which can be reflected by higher adjusted R
2 values, lower AIC figures and lower residual values. The local R
2 values are relatively high for domestic energy and water consumption. Similarly, a higher spatial error term (
ε) for GWR means a higher heterogeneity of the dependent and independent variables (higher residual value), indicating a larger gap of the fit for the GWR prediction outcome with regard to the actual data. In that sense, the resource consumption of the industrial and agricultural sectors was found to have relatively weaker correlations with socio-economic variables than those of the domestic sector. The overestimated zones with higher residual values are displayed as orange and red regions in
Figure 4 and
Figure 5. Overall, the residual magnitude in industrial energy consumption can be confirmed to be one thousand times larger than that of water consumption. The number of overestimated zones and residual values for the agricultural sector outnumber those of the industrial sector, particularly in the northern parts of Taichung City. Hence, downscaling resource consumption for the industrial and agricultural sectors and detailed data sets are essential to reduce spatial errors (
ε) in the future.
The GWR results indicate that domestic resource consumption is positively driven by the number of enterprises, average income, population, and steel concrete structured buildings (SC). Much larger coefficient values are found for domestic energy consumption than they are for domestic water consumption for these four variables, indicating a stronger relationship between socio-economic context and domestic energy consumption.
The results reveal that the number of enterprises, population, and SC are positively correlated with both energy and water consumption, while average income and light steel-structured building (LS) are negatively correlated. This outcome is consistent with the current industry development trend that the light industry (commonly facilitated with LS) consumes less resources compared to other manufacturing activities. Additionally, the negative income phenomenon indirectly causes the changes in terms of human behavior and types of work [
56].
The spatial distributions of the socio-economic variables and domestic resource consumption at township-level in 2015 are presented in
Figure 6. The analysis of industrial and agricultural resource consumption has been excluded, as the correlations with socio-economic variables are relatively weak. The regions with higher coefficients are indicated in red, as those with lower coefficients are shown in green. The number of enterprises and average household income are positively correlated with resource consumption in the eastern (Heping District) and southern regions (Taiping, Wufeng, Dali, and Xinshe District) of Taichung City. A stronger correlation for the number of enterprises is observed, as the quantity of red zones representing the number of enterprises outperforms the regions representing average income. The more diversified outcomes are found in the population variable and SC for different townships. Greater population growth is reported in the eastern regions, which show lower relatedness to energy and water consumption. In contrast, weak population and SC growth are observed in Dajia, Daan, Qingshui, Wuqi, and Longjing District, but these two variables are revealed to be significantly correlated to resource consumption.
3.3. Estimate Performance of GWR Analysis
As the local R
2 values estimated in the industrial and agricultural sectors in 2015 are too low to be reliable, this study focuses on comparing the GWR estimations to the actual 2016 and 2017 data to validate the estimated performance for domestic energy and water consumption. The results are shown in
Table 4 and
Figure 7.
The ratios in
Table 4 are calculated with the prediction value as the numerator and actual data as the denominator for 29 townships. Ratios with a value larger than one indicate that these are overestimated predictions. In general, the water consumption prediction results are close to the actual data, except for Daan and Heping District, meaning that the estimated correlation coefficients between socio-economic variables and water consumption are confirmed to be well-grounded.
Conversely, most energy prediction figures are overestimated in 2016 and 2017. The differences between the predicted and actual data can be partially clarified by the energy-saving strategies implemented by the Taiwanese government in recent years [
57]. These downscaling energy-saving behaviors may be difficult to capture and present in GWR analysis. Hence, in-depth variable selection and the variation evaluation of individual socio-economic variables collected over time may be required to reduce the estimating error for the overestimated energy consumption. For example, policy implementation and other macroeconomic variables may lead to behavior changes or even energy substitution [
58].
Nevertheless, GWR is still a useful tool for exploring the spatial regression among various socio-economic influencing variables. The inclusion of quantified data sets characterizing resource usage behavior from different social groups is suggested in order to modify the spatial regression formula in GWR analysis in the future.
3.4. GWR Prediction of Domestic Energy and Water Consumption in 2030 and 2050
Predictions of socio-economic variables at township-level in 2030 and 2050 are estimated based on the actual data from 2011 to 2015 to forecast the future domestic energy and water consumption with GWR analysis. The domestic energy and water consumption of 29 townships demonstrate similar divergence patterns to one another. As the identical socio-economic variables are applied in both consumption projections, the expected trends and relative ratios of these two consumptions in 29 townships should be close to one another with slight variance. In that case, the conversion of domestic water and energy resource consumption into one index may be considered as a simplified estimation. Once water consumption is projected based on the reliable method and data sets, the energy consumption can be easily estimated with the defined index and vice versa. The projected results are summarized in
Table 5.
If past trends of socio-economic variables continue, the overall resource consumption is forecasted to increase 1.13 times in 2030 and 1.84 times in 2050 compared to the 2015 level. The forecasted resource consumption for each township is different in order to provide a sound and representative outcome. The highest growth rate goes to N. District, which has 2.40 and 15.62 relative ratios, while the relative ratios for Heping District in both 2030 and 2050 have remained unchanged.
As a whole, the influence of the number of enterprises outperforms other variables in terms of how they affect domestic resource consumption greatly, with higher GWR coefficients and higher relative ratios, as revealed in Dongshi, N., and Waipu District (
Figure 8). On the other hand, slight increase in the relative ratios of resource consumption in 2030 and 2050 are reported in Xitun, Dali, and Tanzi District, with strong increase in projected population and average income.
The projected results for the resource consumption and socio-economic variables of the 29 townships can be used as the evidence in regional policymaking. Hotspot indication allows the local and central governments to regulate or construct appropriate infrastructure or/and adaptive consumption strategies as the regions evolve overtime. Take N. District as an example, it is projected to have higher domestic resource consumption demands along with significant coefficients for socio-economic variables. The local council should use the projected demand against the current infrastructure capacity to identify the capacity gap or to implement resource-efficient strategies.
4. Conclusions
For a city that undergoes major urban transformation, identification of the temporal-spatial changes of the socio-economic variables becomes crucial in estimating resource consumption. This study presents a series of evaluation procedures to identify spatially and temporally varying socio-economic factors and then to estimate and predict the spatial non-stationarity effects of urban resource consumption. GWR analysis with modified procedures was used as both an exploratory and predictive tool to describe the spatial heterogeneity of association between resource consumption and five socio-economic variables for Taichung city. OLS was applied to select variables without the presence of multicollinearity, and LISA was used to reveal local clusters and to provide the optimal bandwidth for GWR. These procedures were used to achieve the minimum amount of errors and to improve the R2 value. GWR showed higher predictive power when compared to conventional linear models such as OLS with the assumption of homogeneity. The prediction performance was validated by comparing the estimated and actual data in 2016 and 2017. The predictive power of GWR was confirmed to be reliable, especially for water consumption.
The results further the understanding of energy and water consumption in the context of urban transformation as well as yield insight into resource utilization planning and sustainable policy implementation. In general, a strong correlation between number of enterprises and resource consumption was revealed, followed by population and average income level (depending on the target sectors). The regions of HH clusters were often the hotspots that experience rapid increases in the number of commercial and industrial enterprises. Noticeable sectoral differences were observed in terms of energy consumption. Positive correlations were reported between all socio-economic variables and domestic resource consumption, while both positive and negative correlations were reported for the industrial, and agricultural sectors. In sum, resource consumption of the industrial and agriculture sectors was found to have relatively weaker correlations with socio-economic variables than those of the domestic sector, indicating that the commonly used socio-economic drivers were appropriate in predicting domestic resource consumption but may not be relevant in evaluating industrial and agricultural resource consumption. A possible explanation is that industrial and agricultural consumption behaviors are more diversified depending on business activities, building structure, and characteristics. The selected socio-economic variables might not be able to reflect the usage patterns. Moreover, it is suggested that downscaling human energy-saving behaviors be included to modify the results for future research. The uncertainty of influential effects of individuals on resource consumption should also be considered, such as population. The overestimation of energy consumption derived from population growth (compared with other variables) can be partially explained by changing human behaviors over time. For example, users may be more aware of resource conservation and mitigation. Finally, the effect of spatial lag and time lag issues on the variables in terms resource consumption was unable to be located and identified in this study. Incorporation of lag effect is recommended for future research in identifying precise temporal variability of resource consumption.
As for the predication outcome, the changing pattern and coefficient of socio-economic variables are crucial in the projected resource consumption levels. The overall resource consumption is projected to increase 1.84 times in 2050 compared to the 2015 levels. The N. District was reported to have the highest relative ratio of 14.62, which is mainly due to the significant increase in the number of enterprises over time.
Consequently, through the different trends of socio-economic conditions and the spatial influence for each independent variable with the GWR results, this study predicts the hotspots for regional resource consumption based on the changes characterizing the socio-economic patterns of a city undergoing urban transformation. The conversion of domestic energy and water resource consumption into one index should be considered to simplify the estimation process and to provide a useful message for future research.