Examining Spatial Heterogeneity Effects of Landscape and Environment on the Residential Location Choice of the Highly Educated Population in Guangzhou, China

: The residential location choice of the highly educated population is an important consideration to construct a livable city. While landscape and environment are important factors, few studies have deeply analyzed the spatial heterogeneity effects of landscape and environment on the residential location choices of a highly educated population. Taking Guangzhou as the sample, we built a livability-oriented conceptual framework of landscape and environment, and constructed datasets for highly educated population proportion, landscape, and environment factors, and other influencing factors for Guangzhou’s 1364 communities. Global regression and geographically weighted regression (GWR) models are used for analysis. The GWR model is more effective than the global regression model. We found spatial heterogeneity in the strength and direction of the relationship between the highly educated population proportion and landscape and environment. We find that landscape and environment exert spatial heterogeneity effects on the residential location choice of the highly educated population in Guangzhou. The conclusions will be of reference value to further understand how the spatial limitations of landscape and environment affect residential location choices. This study will help city managers formulate spatially differentiated environment improvement policies, thereby increasing the city’s sustainable development capabilities. educated population (defined as junior high school education and below) using the OLS model. Results show that NPLE and ANLE have a significant (0.01 significance level) negative relationship with PHEP. This shows that the obvious difference in the influence of landscape and environment factors on the residential location choices between the highly educated population and low educated population. For every 1% increase in the NPLE (ANLE) score, the proportion of low educated population decreases by 0.1241% (0.1701%).


Introduction
A city is a place of residence, which is both, its main function and the core of its sustainable urban development. In "Transforming our World: The 2030 Agenda for Sustainable Development," 17 goals were proposed. Goal number 11 mentions the building of "sustainable cities and human settlements." In academic research on the area of residence, the choice of residential location has always been a core subject. However, due to systemic complexity and data constraints, the academic community is yet to reach a unified and cogent conclusion [1]. The characteristics and factors influencing residential location choice vary among different demographics [2]. The highly educated population, which forms a city's valuable human resource and the basis for its innovative and sustainable development, has become the target of contention for various cities [3,4]. Therefore, an in-depth understanding of the characteristics and mechanisms of the highly educated population's choice of residential location not only helps in attracting talents, but also provides an important reference point for improving the competitiveness and sustainable development of cities.
According to existing research, factors affecting choice of residential location include distance from the city center [5], accessibility to workplace [6], convenience of public transportation [7,8], public urban facilities [9,10], crime rate [11], and built environment [12][13][14][15]. Landscape and environment, as important components of the built environment, exert an impact on the choice of residential location that should not be ignored. Wolpert [16] believes location choice is derived from environmental perception. He proposed basic concepts and research perspectives such as "behavioral space" and "location utility," introducing environmental awareness into the analysis framework of location decision-making, thus, laying a theoretical foundation for research in this field. Therefore, landscape and environment are important factors that cannot be ignored in the choice of residential location [17,18].
There has been empirical research corroborating the impact of landscape and environment on the choice of residential location. Such an impact is divided into two forms: a) residents tending to move closer to positive landscapes and environments [17][18][19][20][21] and b) residents tending to stay away from negative landscapes and environments [22,23]. For the former, studies have found parks [24], trees [18], waterfronts [17,25,26], and landmark buildings [27,28] to be attractive to residents. Meanwhile, for the latter, studies have identified transportation infrastructure (such as airports, highways, and railways) [29,30] and municipal infrastructure (such as high-voltage corridors, substations, sewage treatment plants, landfill sites, and signal towers) [31,32], factories [33,34], and logistics centers and wholesale markets [34] to be repelling elements in residential location choices.
Demographics with different educational backgrounds have different preferences and show different factors influencing residential location choice [35,36]. The highly educated population, with a higher income and more elevated social status, show greater need for landscape and environment. Studies have found that landscape and environment exert a significant impact on the housing choices of highly educated people [3], which warrants the study of such factors. From the perspective of human needs, the choice of residential location tends to be based on convenience first, and then on landscape and environment, which serve to satisfy higher-level living needs.
Although some studies have investigated the influence of landscape and environment, at a citylevel, the spatial distribution of the choice of residential location among the highly educated population has shown marked differences versus other demographics, and a significant spatial heterogeneity is visible [37,38]. Therefore, landscape and environment may not be universally applicable in influencing the choice of residential location among the highly educated population anywhere in the city. However, when this group of residents chooses a location, landscape and environment may become a consideration only in some areas and not in others, resulting in spatial heterogeneity in the impact of landscape and environment on such choices.
As a first-tier city in China with a robust economy, Guangzhou has attracted highly educated people as a place of employment. The city has a noticeably heterogeneous social space [39][40][41], and there is significant spatial heterogeneity in the distribution of population by education levels [42]. In addition, Guangzhou's urban landscape and environment are diverse and complex, with significant variances [39,43,44], making it a suitable city for a case study.
This study attempts to explore spatial heterogeneity in relationship to landscape and environment in the context of the distribution of a city's highly educated population. Specifically, using Guangzhou as the case in point, this study explores the areas whose landscape and environment have a significant impact on the choice of residential location among the highly educated population; and areas with minimal impact. The study thus explores the direction and intensity of the impact of landscape and environment on the residential location of the highly educated population. Finally, it explores the spatially heterogeneous distribution characteristics of this influence.
To answer these questions, we constructed a cross-sectional data set of the proportion of highly educated population (PHEP), landscape, and environment factors, and six other influencing factors in 1364 communities in Guangzhou. Based on a global-level verification of the impact of Guangzhou's landscape and environment on the location choice of the highly educated population, geographically weighted regression (GWR) technology was used to analyze the spatial heterogeneity regarding the direction and degree of influence of 1364 communities. This study, unlike previous studies, focuses on the spatial variability of the impact of landscape and environment on the choice of residential location of a highly educated population. The research will be of reference value to further understand how the spatial limitations of landscape and environment affect the residential location choices. It will help city managers formulate spatially differentiated environment improvement policies, thereby increasing the city's sustainable development capabilities.
The rest of the paper is organized as follows: Section 2 includes the conceptual framework, factor indicator construction, data, and methodology. Section 3 analyzes the research results, including the spatial heterogeneity pattern of the distribution of highly educated population as well as landscape and environment in Guangzhou, and examines the degree and direction of the global influence of landscape and environment on the distribution of highly educated population as well as the spatial heterogeneity of such influence. Section 4 provides the conclusions and policy implications.

Study Area
Guangzhou is one of China's major and "first-tier" cities. As a superstar city, Guangzhou has been the destination for employment and residence of talents from both home and abroad. More importantly, there are significant differences in housing conditions and social space in Guangzhou. The Guangzhou Metropolitan Area in China was, therefore, selected as the area under study ( Figure  1). It is located within the Guangzhou Ring Expressway-Guangzhou-Gaoming Expressway-Guangzhou administrative boundary, as the city's functional area, and is smaller than the administrative area of Guangzhou. The research area covers 1364 community neighborhood committees, spanning 1409 square kilometers. The area has a population of 8.28 million (a total of 7.73 million people aged six years and above), of which those holding a bachelor's degree or above amounted to 1.09 million, accounting for 14.12% of the population aged six years and over (based on data from the sixth census held in 2010). According to Guangzhou's urban construction and development characteristics, this area can be roughly divided into four regional functional categories: old area, core area, urban district area, and suburban area, and the corresponding numbers of communities belonging to these areas are 238, 312, 504, and 307, respectively. Although the area under study only accounts for 18.95% of the Guangzhou municipality, it hosts 65.19% of Guangzhou's population. The Pearl River, New City located in the core area is the central business district (CBD) of Guangzhou, with the International Finance Center Building being the CBD center.

Conceptual Framework of Livability-Oriented Landscape and Environment
The concept of landscape and environment includes multiple aspects and varies according to different research backgrounds. Based on the understanding of residential location choice, this study proposes a conceptual framework of livability-oriented landscape and environment. This conceptual framework has two basic perspectives: (1) to be near positive landscape and environment, and (2) to avoid negative landscape and environment. We selected three indicators, namely parks, waterfronts, and famous landmarks, to evaluate positive landscape and environment, and selected three other indicators, namely municipal facilities, factory, and logistics and wholesale center, to evaluate negative landscape and environment ( Figure 2). Elaborations of the two perspectives are as follows: 1. Near positive landscape and environment (NPLE): This comprises of parks, waterfronts, and famous landmarks which enhance the landscape and environment. Parks [24,45,46] and rivers or lakes [25,26,47] not only create a pleasurable landscape but also improve the environment (such as purifying the air) and boost the microclimate (such as reducing the urban heat island effect) [48][49][50]. As places of daily leisure, these elements have a significant impact on residents' choice of location [21]. A famous landmark is the symbol of a city and may take the form of important landmark buildings, areas, and scenic spots; for example, Beijing's Tiananmen Square, Oriental Pearl Tower in Shanghai, West Lake in Hangzhou, and Pearl River New Town in Guangzhou. As important landscape sites and symbols of urban centers, landmarks have aesthetic and landscape values [27], and their surrounding areas often have a well-built environment and a refined urban atmosphere, making them a large attraction for residents [28].
In theory, a highly educated population is more inclined to live in areas close to a positive landscape and environment. 2. Avoid negative landscape and environment (ANLE): This comprises municipal facilities, factories, and logistics and wholesale centers, which are likely to exert a negative impact on the landscape and environment [51]. Municipal facilities include airports, train stations, coach stations, highways (or elevated roads), railways, gas stations, funeral homes, substations, sewage treatment plants, garbage disposal sites, signal transmission towers, high-voltage corridors, and more. Municipal facilities constitute negative urban landscape regarding visual and psychological aspects. Pollution is observed in noise, radiation, air quality, odor, hygiene, and safety risks. Some case studies have demonstrated the negative impact of these facilities on housing choices (or housing prices) [22,23,29,31,32]. A factory may also have negative impacts on the surroundings, such as noise and air pollution, so the industrial built environment leads to negative landscape features [33,34]. Further, logistics centers and professional wholesale markets tend to gather many freight vehicles around them, which can cause traffic congestion and generate noise and air pollution. Moreovre, the large and complex flow of people in logistics centers and specialized wholesale markets is likely to have a negative impact on urban safety [34]. In theory, the highly educated population often chooses to live away from negative and closer to positive landscapes and environments.

Research Design
This study designed a conceptual framework to understand specific areas where landscape and environment have a significant impact on the choice of housing location of the highly educated population, to study the spatial heterogeneity of landscape and environment and the direction and intensity thereof. The analysis process is as follows. First, using data from 1364 communities in the Guangzhou Metropolitan Area, a model of the residential location of highly educated population is built from the perspective of the landscape and environmental characteristics, building characteristics, location characteristics, and social characteristics. Second, we used the ordinary least squares (OLS) model to study the global influencing factors and directions that explain the choice of residential location of Guangzhou's highly educated population. Third, we used the geographically weighted regression (GWR) model to analyze the spatial heterogeneity of the effects of landscape and environment on the location choice of the highly educated population. Specifically, these include the significant-, direction-, and intensity differences in the influence of landscape and environment on the choice of residential location in different regions. Finally, the results are explained ( Figure 3). The segment of the population that has a bachelor's degree or above is defined as the highly educated population. The proportionate percentage (%) of the highly educated population among those aged six and above is used to represent the residential attractiveness of the community to the highly educated demographic group. The higher the percentage, the higher the chances of the highly educated population tending to live in the community; this serves as the dependent variable in this study. The two constituent elements of landscape and environment, which have been explained in the co conceptual framework, are the explanatory variables for the residential location of the highly educated population. Further, "building characteristics", "location characteristics", and "social characteristics" are the controlling factors, and are composed of a series of indicators as follows:

•
Building characteristics: Building age (BAGE), building area per household (BAREA), and housing facilities (HF) were chosen as the three factors reflecting "building characteristics [52][53][54][55][56]". The highly educated population with a higher income is more inclined to live in housing with a newer building, larger building area, and more complete housing facilities. • Location characteristic: Location is an important factor to consider when choosing a residential location [57]. It is found here from the three perspectives of daily life convenience (DLC), public services accessibility (PSA), and CBD accessibility. Among them, DLC is jointly evaluated by subway transport-, business service-, and office accessibility. PSA consists of basic education-, medical services-, and cultural and physical activity accessibility. Empirical studies have proved the positive impact of these indicators on housing prices from the perspective of characteristic prices [58][59][60], as they are important factors in choosing residential locations [61][62][63][64][65]. • Social characteristics: Population density (POPD) and employment rate (ER) are two main factors reflecting "social characteristics." POPD demonstrates the degree of residential congestion in the community. ER is an important indicator of community security and attraction.
Higher unemployment will lead to higher crime rates [66]. Theoretically, the choice of residential location of a highly educated population is influenced by POPD and ER.
The detailed definitions, as well as evaluation standards or calculation methods of the above factors, are shown in Table 1. For composite variables (for instance, NPLE, ANLE, HF, DLC, PSA), the factor analysis is used to calculate the weight of indicators. Scores of the variables mentioned above are calculated by weighted summation. Proportion of highly educated population among those aged six years and above expressed as a percentage (%)

Explanatory Variables-Landscape and Environment
Near positive landscape and environment (NPLE)

Waterfront accessibility
Within 200 m of a river (lake): (9) 200-400 m of river (lake): (7) 400-800 m of river (lake): (3) Beyond 800 m of river (lake): (1) The base score is 9 and is reduced by 1 for each aversive facility that the community is located near, based on the metrics defined here.

Avoid factories
Positive standard deviation value examination of the kernel density distribution of factory divided into five grades: community located at 3 sd (1), 2-3 sd (3), 1-2 sd (5), mean to 1 sd (7), and outside the mean (9) Avoid logistics and wholesale centers Positive standard deviation value examination of the kernel density distribution of logistics and wholesale center divided into five grades: Community located at 3 sd (1), 2-3 sd (3), 1-2 sd (5), meanto 1 sd (7), and outside the mean (9)  (7), and after 2000 (9). The proportion of each type of household in the community is calculated and multiplied by the corresponding score before adding up (the same calculation method is used for the following building characteristics indicator, BAREA) Housing facilities (HF)

Control Variables-Location Characteristics
Daily life convenience (DLC)

Data and Data Sources
Data for the PHEP, building characteristics (BAGE, BAREA, and HF), and social characteristics (POPD and ER) are collected from the sixth population census of Guangzhou municipality. NPLE data (Park, Waterfront, and Famous landmark) are derived from "Outline of the Guangzhou Urban Master Plan (2011-2020)" and are obtained by draft vectorization. ANLE and location characteristics (DLC, PSA, and CBDA) data are derived from the 2012 points of interest (POI) database in Guangzhou.

Global Regression Analysis
A global regression model was used to find the global factors of differences in the residential locations of Guangzhou's highly educated population. This was done to verify whether the aforementioned selected factor indicators were reasonable and to examine whether they had an accurate impact on the choice of residential locations of highly educated population. For the study of factors affecting regional differences, the ordinary least squares (OLS) methods are commonly used.
We use the OLS model to study the global influencing factors and influence directions that explain the choice of residential location of Guangzhou's highly educated population.
OLS is a linear model used to study the linear relationship between dependent and independent variables. It is based on the assumption that variables are independent of each other and ignores their spatial information. The OLS model can be expressed as: β0 is a constant term; and ε is the error of the model. The data of dependent and independent variables in this paper are processed in the form of logarithmic standardization.

Geographically Weighted Regression
A global multiple regression model requires only one equation for all data [67] and assumes that this statistical relationship remains consistent everywhere. This kind of statistical relationship will change locally, with different spatial locations. The GWR model allows for local spatial changes in the relationship between independent and dependent variables throughout the space [68,69]. The GWR model we use is similar to OLS, but the OLS method is extended by embedding observed geographic locations into the model. Each parameter is estimated in space, and the parameters change by spatial location [70], which greatly improves the fit of the model. The GWR model can be expressed as: u where (us, vs) is the geographic location coordinate of community s. Consequently, β0 (us, vs) is a constant term of the regression model of the community s, with βi (us, vs) being the local regression coefficient of the i th variable in community s, which varies with the location, and ε is the error term. Each local βi (us, vs) is used to estimate its neighboring space observations. The weighted least squares method is used to estimate the elastic coefficient at any point (us, vs) in the study area. The estimated value is expressed as follows: where X is the independent variable matrix and T is the matrix transpose operation. W (us, vs) represents a spatial weight matrix, which is composed of the monotonically decreasing function value of the geographical distance between point s and its surrounding observation points. The points closer to the regression coordinates play a greater role in parameter estimation and may take different functions/forms. We use the Gauss kernel function to calculate the weights between communities.
where Wsk represents the Euclidean distance between the position of community s and community k, and b is the optimal bandwidth. Wsk is a continuous monotone decreasing function of dsk., such that when dsk = 0, Wsk = 1. Due to the uneven distribution under observation, communities in Guangzhou's old-and coreareas are smaller and denser than those in suburban areas. Therefore, where the community is sparsely distributed, the bandwidth distance will become larger, while in the densely distributed communities, the bandwidth distance will become smaller. To obtain the best bandwidth, we use the adjusted value of the Akaike information criterion (AICC) [71]. AICC is expressed as follows: where σ is the maximum likelihood estimate of the variance of the random error, and tr(S) denotes the trace of the S matrix.

Spatial Heterogeneity in the Residential Location Choice of Highly Educated Population
Descriptive cluster statistics of the PHEP are shown in Figure 4. A natural breaks (Jenks) method is used as the division thresholds corresponding to five numerical ranges, i.e., 0-7.81%, 7.81-18.10%, 18.10-30.77%, 30.77-51.93%, and 51.93-91.99%, the corresponding number of communities are 655, 409, 201, 76, and 23, respectively. Figure 5 shows the spatial heterogeneity of Guangzhou's highly educated population drawn by ArcGIS 10.0. Areas with a PHEP higher than 18.10% are mainly distributed in the core area, the northeast fan-shaped region of the urban district, and the "university city" in the suburbs. Among these, the area with a PHEP exceeding 51.93% is where the universities are located. In general, the core area has the highest PHEP, nearly 22.62%, while the PHEP in urban districts is 12.94%, and that in the suburbs is 10.83%. The old area has the lowest PHEP at just 9.84%.

Spatial Heterogeneity of Landscape and Environment
Using ArcGIS 10.0, a quantile method is used to divide NPLE and ANLE into four grades, ranging from high to low, respectively. A higher score indicates a better landscape and environment. Regarding NPLE, the Pearl River, the new central axis of the core area, and the university city of the suburban areas achieve a higher score. In contrast, regarding ANLE, the core area generally scores higher, indicating that the area is less affected by negative landscape and environment. Several lowscoring communities exist in other areas. Overall, there are differences in the spatial distribution patterns of NPLE and ANLE ( Figure 6).

The Effects of Landscape and Environment on the Residential Location Choice of Highly Educated Population Using the Global Regression Model
It is necessary to analyze how landscape and environment affect the overall situation and the extent of their influence on the choice of residential locations of the highly educated population. Currently, regression analysis is the most widely used method in the field of residential location choice. Among global regression methods, the OLS method is most commonly used to estimate the significance and direction of the independent variable's influence on the dependent variable. We use GWR4.0 software to run the OLS model. A collinearity test shows that there is no collinearity among the ten factors. The R 2 and AIC value of OLS are found to be 0.5755 and 3307.51, respectively. OLS regression results show that eight of the ten factors discussed in this study have a significant (0.05 significance level) positive relationship with PHEP, consistent with theoretical expectations ( Table 2). The two landscape and environment factors (NPLE and ANLE) comply with theoretical expectations and are significant (0.01 significance level). For every 1% increase in the NPLE (ANLE) score, the PHEP increases by 0.3438% (0.3341%). This shows that globally, landscape and environment have a significant impact on the highly educated population's choice of residential location.
Regarding controlling factors, BAREA, HF, DLC, PSA, and CBDA also have a significant positive impact on the residential location choice of the highly educated population, which is consistent with theoretical expectations. In contrast, the effect of ER is not significant. Compared to architectural-and social characteristics, landscape and environment-, and location characteristics are more important factors influencing the choice of residential locations of highly educated population. In exchange for better landscape and environment, the highly educated population is willing to accept older housing and higher population density.
To compare the difference of residential location choice between the population with high education and low education, we analyze factors influencing residential location choices of low educated population (defined as junior high school education and below) using the OLS model. Results show that NPLE and ANLE have a significant (0.01 significance level) negative relationship with PHEP. This shows that the obvious difference in the influence of landscape and environment factors on the residential location choices between the highly educated population and low educated population. For every 1% increase in the NPLE (ANLE) score, the proportion of low educated population decreases by 0.1241% (0.1701%).

Impact of the Spatial Heterogeneity of Landscape and Environment on the Residential Location Choice of Highly Educated Population Based on the GWR Model
The spatial distribution of the PHEP in Guangzhou is characterized by heterogeneity. Therefore, it is necessary to further use the GWR model for local spatial regression. The GWR model can account for local spatial changes in the relationship of landscape and environment with Guangzhou's PHEP throughout the space. It can isolate smaller and relatively homogeneous local areas from the large heterogeneous areas, reducing the impact of spatial heterogeneity. The local regression includes only nearby communities and assigns weights to selected communities based on their respective spatial distance from the target community [71]. We use GWR4.0 software to run the GWR model and choose spherical coordinates as locational variables. The model type is Gaussian. Fixed Gaussian is used to geographic kernel type. The method for optimal bandwidth search is calculated using a golden section search algorithm. The criterion for optimal bandwidth is used to AICc. The best bandwidth calculated is 0.014. Table 3 shows the statistical values of the GWR local coefficients of the model. The GWR's R 2 , Log-likelihood, and AIC are 0.8872, −739.02, and 2597.86, respectively. These indicate that the model has acceptable overall goodness of fit, and can better explain the relationship between the residential location choice and the determinants taken into consideration by the highly educated population than the global regression model (OLS). The ten variables are included in the model on the premise that AIC is minimized and noncollinear. AIC and ANOVA analyses (F-test) both showed statistical significance. The spatial distribution map of local R 2 affecting the PHEP is shown in Figure 7. The local R 2 , ranging from 0.5513 to 1, reflects the comprehensive impact of the ten variables on PHEP. A natural breaks (Jenks) method is used as the division Local R 2 to five numerical ranges. In general, while the Local R 2 in the old area and its surrounding areas are relatively low, the suburban area has the highest R 2 (with R 2 in most communities exceeding 0.9023). This shows that these factors have a more significant impact on the housing choices of the highly educated population in suburban areas. A GWR model was established to explore the correlation between PHEP and various factors, and a pseudo-t-test (p < 0.05) was performed to test the statistical significance of the parameters ( Table  4). The results reveal a significant positive correlation between NPLE and PHEP in 47.80% of the communities, and a significant positive correlation between ANLE and PHEP in 44.28% of the communities (consistent with theoretical expectations). While these two values are lower than BAREA (84.90%), CBDA (51.83%) and DLC (47.87%), they are higher than POPD (31.74%), HF (27.57%), ER (22.65%), PSA (16.35%), and BAGE (5.65%). The direction, intensity, and significance of spatial distribution in the correlation of landscape and environment (NPLE and ANLE) with PHEP are shown in Figure 8. A quantile method is used to divide estimated coefficients into five grades. Insignificant areas are represented in gray. Regarding NPLE, the areas where NPLE and PHEP have a significantly positive correlation are mainly distributed in the old area, the southern part of the urban district (near the old area or the core area), and parts of the suburban area. Regarding ANLE, the areas with a significant positive correlation between ANLE and PHEP are distributed in the old area (although the coefficient is relatively low), north of the core area, and north of the urban district (near Baiyun Mountain), showing a decentralized distributive pattern in the suburbs. It can, therefore, observed that there is spatial heterogeneity in the degree of influence of landscape and environment on the highly educated population's choice of residential location. Communities with a significant positive relationship between landscape and environment and choice of residential location among the highly educated population show a spatially concentrated distribution.
The proportions of communities with significant positive correlations between landscape and environment and PHEP in the four perimeter areas (old area, core area, urban district, and suburban area) were calculated ( Table 5). The proportion of communities with a significant positive correlation between landscape and environment and PHEP was the highest for the old area, with NPLE and ANLE reaching 99.58% and 94.96%, respectively. Suburban areas had the lowest proportion percentage, with NPLE and ANLE achieving only 27.04% and 16.61%, respectively. It can be seen that the influence of landscape and environment on the highly educated population's choice of residential location is mainly reflected in the old area, and is least perceptible in the suburban area.

Discussions
The highly educated population often has higher socioeconomic status, and high-quality landscape and environment are scarce resources of the city. Therefore, the mechanisms of the effects of landscape and environment on the residential location choice are as follows: From the perspective of housing affordability, the better the landscape and environment, the higher the "threshold" of purchasing or renting. Through space competition or price competition, the population with higher education is more likely to have the property rights or rental rights of communities with better landscape and environment. From the perspective of the population's residential choice behavior, when all other housing conditions are the same, people are more inclined to choose the community with a high-quality landscape and environment. However, due to the limited number of such communities, only the population with higher education (with higher socio-economic status) is more likely to occupy them. From the perspective of human needs, the landscape and environment are higher-level living factors beyond the basic living needs. Compared with a low educated population (often only focusing on basic living factors), the highly educated population usually has higher requirements on landscape and environment. It is worth noting that there is significant spatial heterogeneity in the educational background structure of urban population and landscape and environmental conditions and, therefore, the degree of the mechanism is bound to show spatial heterogeneity.
This research constructs a comprehensive system of factors (including positive and negative landscape and environment) for location choice by adopting a livability angle, combining landscape and environment, architectural characteristics, and location characteristics to verify whether landscape and environment exert an impact and spatial heterogeneity of such an impact. The research, based on the global regression model OLS, proves the theoretical hypothesis that landscape and environment have a significant impact on the residential location choice of highly educated population. When choosing a location for their residence, highly educated people prefer positive landscapes and environment (parks, waterfronts, and famous landmarks) and not negative landscape and environment (municipal facilities, factories, logistics centers, and professional wholesale markets). The findings are consistent with previous studies. Further, the empirical analysis based on GWR explores the regional differences in the impact of landscape and environment on the highly educated population's choice of residential location in more detail than in previous studies (which are based on global studies). It focuses on analyzing the spatial heterogeneity of this impact relationship, ignored in previous studies. The significance of the discovery is that the conclusions of this study will be of reference value to understand the spatial limitations of landscape and environment on the residential location choices of the highly educated population. This provides a reference for the researchers to make the different spatial policies of urban landscape and environment improvement. Based on these findings, the existing research (based on global research) can make further progress from the perspective of spatial differentiation. By observing the relationship between landscape and environment and talent attraction, this study serves as a reference point for policymaking regarding the construction of livable and innovative cities. It offers theoretical support for sustainable urban development.

Conclusions
This study establishes a research framework for the highly educated population's choice of residential locations based on urban landscape and environment. It sets up a dataset of the PHEP and its influencing factors in 1364 communities in Guangzhou. GWR technology is used to analyze the spatial heterogeneity in the direction and degree of the influence of landscape and environment on the highly educated population's residential location choices. The research conclusions are summarized as follows.
Studies show a high degree of spatial heterogeneity in the distribution of the highly educated population in Guangzhou. Generally, the core area has the highest PHEP, while the old area has the lowest. Overall, the landscape and environment in the core areas are of the highest quality. However, the spatial patterns of NPLE and ANLE are not the same in Guangzhou.
A global regression model (OLS) was used to verify the relationship between the two aspects of landscape and environment (NPLE and ANLE) and the distribution of the highly educated population, and the significance thereof. The results show that NPLE and ANLE have a significant impact on the choice of residential location of the highly educated population, and the direction of influence is consistent with theoretical expectations. For every 1% increase in the NPLE (ANLE) score, the PHEP increases by 0.3438% (0.3341%).
The spatial heterogeneity of the relationship between landscape and environment and the PHEP was analyzed using the GWR method. Results show that GWR has better model performance than OLS, with larger R 2 and lower AIC. The effects of both NPLE and ANLE on PHEP show spatial heterogeneity. NPLE and ANLE have a positive and significant impact on the choice of residential location for the highly educated population in 47.80% and 44.28% of the communities, respectively. The areas with considerable influences show spatial clustering in distribution and are mainly located in the old area. In the suburban area, such an influence is weaker.

Policy Implications
The conclusions of this study highlight the importance of landscape and environment in attracting highly educated people. Moreover, the impact of landscape and environment on highly educated people varies according to spatial location. Therefore, in the formulation of policies and action plans for urban landscape shaping and environmental improvement, attention should be paid to spatial differences within cities. It is recommended that a spatially differentiated policy be adopted, and urban renewal and environment improvement plans are implemented in a differentiated manner as follows.
First, improve the urban restoration of the old area and the landscape environment thereof. The results of this study indicate that landscape and environment have the most significant impact on the highly educated population's choice of residential location in the old area; meanwhile, the old area hosts the lowest PHEP. Therefore, enhancing the landscape and environment of the old area will attract highly educated people. Suggested measures to improve the landscape and environment are as follows: 1. As the old area is densely built, resulting in a poorly built environment, it is recommended that more public space be made available therein by making full use of the land area for increasing green space. 2. The old area has a higher concentration of specialized wholesale markets, causing traffic congestion and noise pollution. These markets should be relocated to places with convenient transportation in the suburbs. In their place, urban green spaces, squares, creative industrial parks, science and technology parks, cultural exhibition venues, or urban public event venues should be built to enhance the landscape of the old area and improve environmental conditions.
3. It is recommended that more efforts be made to renovate municipal infrastructure in the old area to reduce the negative impact of the polluting municipal facilities on the environment of the old area.
Second, reinforce the construction of high-quality landscape and environment facilities in the suburb. The results of this study show that the quality of the suburban landscapes and the environment has the lowest score, and the PHEP in the suburb is exceedingly low. Therefore, it is necessary to reserve more space for the construction of suburban parks, public event venues, cultural event facilities, and landmark buildings. Thus, the accessibility of high-quality landscape and environment elements could be improved, and the residential attractiveness of the suburb to the highly educated population would also increase, relieving the pressure and living costs in the core area. Additionally, it is necessary to re-arrange polluting municipal and transportation facilities, keeping them farther from residential areas to reduce their impact on the landscape and environment of the residential areas.
Third, the quality of the existing landscape and environmental facilities should be further improved for the core area and urban district. They should be developed into livable areas through high-level planning and construction aimed at creating a world-class environment to attract global talent.

Limitations and Future Research
We use classical GWR to study the spatial heterogeneity of influencing factors. However, all factors may not have the effect of heterogeneity in space, and some of them having small changes in space can be ignored, such as the building area. The influence intensity of these factors may be consistent throughout the study; that is, not all variables can be estimated in the same way in GWR. Considering this limitation of GWR, in the next research, we will introduce the mixed GWR model to analyze the spatial heterogeneity of specific influencing factors, to get precise results.
From the perspective of the selection of influencing factors, ten factors (composed of 22 indicators) are considered in this study. However, theoretically, several factors can affect the residential choice of highly educated people. This paper does not cover all the factors because of data acquisition limitations. In future studies, the number of variables can be further increased to improve the model fit.