1. Introduction
Tourism services, facilities, or amenities are playing an increasing role in economic growth and social development in many regions. Tourism and leisure activities are not only a manifestation of urbanism that can improve the happiness of the urban way of life, but are also an important driver of the consumption-oriented economy [
1,
2].
In the literature of tourism studies, the concept of “space” matters in the understanding of the pattern of tourism resources and its impact on or connection with local society [
3]. The notion of “tourism space” was first proposed by Oppermann [
4] to discuss the spatial organization of tourism resources and infrastructure in a way that can help facilitate the economic transition of developing countries. In the past two decades or so, scholars in tourism studies and land management have put great emphasis on investigating the spatial layout and patterns of tourism resources and how it can form an essential understanding of the planning of tourism and land use [
5]. In particular, recent attention has been paid to the spatial difference and unbalanced development of tourism [
6], the impacts of transportation on tourism resource distribution [
7], and tourism’s spatial coupling with the local economy [
8], urbanization level [
9], and ecological system [
10]. Some other scholars are interested in revealing how the spatial distribution of tourism resources or facilities can optimize tourism’s economic impact on local society. For example, Law et al.’s [
11] research suggests that the distribution of hotels and travel agencies serves as an important intermediary that provides hospitality and distribution channels to facilitate the development of tourism. Santana-Jiménez et al. [
12] use population density to examine whether tourism services and facilities are overcrowded in a tourism area so as to provide a policy reference for reconciling tourism with the population and other supply-side factors. In other words, the distribution of tourism services needs to be consistent with their supporting system, including population, transportation, and hospitality facilities.
Notwithstanding the growing literature exploring the spatial patterns of tourism and its relationship with regional development, there are still two lines of inquiry that warrant further examination. First, the arrival of the era of big data has provided new avenues for the spatial analysis of tourism beyond traditional data sources like statistical yearbooks. In this paper, we use point of interest (POI) data to achieve a more accurate analysis of tourism space by appropriating POI’s advantages of comprehensive coverage, high identification accuracy, and easy data accessibility [
13]. In this sense, POI data can help us improve the spatial accuracy of spatial econometric models. Compared with most spatial econometric studies of tourism that normally rely on city- or provincial-level panel data, this paper uses panel POI data of a 1 km
2 grid to re-evaluate the influencing factors of tourism space.
Second, less research has paid attention to the coupling relationship of tourism space with local socio-economic indicators such as GDP (gross domestic product) and population density, despite there being some research on tourism’s coupling with environmental sustainability [
14]. The concept of coupling, originating from physics, refers to the phenomenon in which two or more systems establish a collective relationship through a range of interactions [
15]. The degree of coupling serves as an index to investigate the extent of mutual influence and interaction between these systems. When the components of systems are interdependent, coordinated, and mutually promoted, a relationship of coordinated development emerges, indicating a high degree of coupling. The value of coupling directly reflects the level of association/consistency between the two systems. For instance, Gal, Gal, and Hadas [
16] delve into the growth of rural tourism in the Israeli agricultural sector, exploring the potential synergies between rural tourism and farming, as well as how tourism may influence the transformation of agricultural processes. Akama [
17] identifies that the coordinated development of urbanization and tourism is an important indicator for urban planning in terms of satisfying the economic, social, and cultural needs of local communities. However, one difficulty in examining the coupling of tourism with GDP and population is identifying areas with different types of high coupling degrees, including (1) high–high congregation with high coupling between two variables; (2) low–low congregation with high coupling between two variables. In this paper, we achieve this through combining the degree of coupling with the local Moran’s I index to identify the areas where tourism has a good interaction with GDP or population and, thus, can be viewed as suitable for tourism development.
This paper therefore advances existing research on the spatial analysis of tourism by looking at the tourism space in the Guangdong–Hong Kong–Macao Greater Bay Area (GBA), one of the most dynamic and economically powerful regions in China. In recent years, tourism planning has been a crucial aspect to facilitate the integration of GBA development. Therefore, researching the spatial patterns and driving factors of tourism space and its coupling with the regional economy and population can provide useful policy insights for tourism planning in the GBA. By using the advantages of POI data, this paper focuses on three research questions related to the tourism space of the GBA: (1) How do we describe the spatial patterns of tourism services in the GBA through POI data? (2) What are the driving forces that are associated with the spatial distribution of tourism services in the GBA? (3) How do we identity the specific areas that are suitable for tourism development by examining the index of the coupling degree? In other words, how can we measure the extent to which tourism services are coupled with other socio-economic factors of development such as GDP and population?
  3. Methodology
This paper used a variety of methods to systematically analyze the spatial dynamics of urban tourism in the GBA of China. First, a series of methods including average nearest neighbor, standard deviation ellipse, the imbalance index, the geographical concentration index, Kernel density estimation, and the bivariate local Moran’s I to comprehensively map out the spatial distribution characteristics of tourism services. Then, we utilized different models of regression, including the ordinary least squares model (OLS), spatial lag model (SLM), and spatial error model (SEM), to discern what influences the spatial patterns of tourism services. Finally, we used the coupling degree to identify the areas in which the distribution of tourism services was coupled with the local economy and population.
  3.1. Average Nearest Neighbor (ANN)
The average nearest neighbor (ANN) method involves calculating the average distances between the centers of elements and the center positions of neighboring elements. The average nearest neighbor index represents the ratio of the average observed distance to the expected distance and can be addressed in the following form:
The average nearest neighbor ratio is calculated as:
        where the distance between 
 and its closest point of mass is the average of the actual measured elements:
 is the average distance of the random distribution of the elements:
In the above equation,  is the distance between element  and its nearest neighboring elements,  is the number of elements in the region, and A is the area of all element envelopes.
If the  < 1, the distribution is agglomerative; if the  > 1, the overall distribution is random.
  3.2. Standard Deviation Ellipse (SDE)
The standard deviation ellipse (SDE) was proposed by WertilyFifer in 1926 [
22], and the spatial distribution characteristics of POI elements are measured by four parameters: the long axis, short axis, rotation angle, and area. The long axis of the ellipse indicates the spatial distribution direction of the traffic facility point data, and the short axis indicates the spatial distribution range of the traffic facility point data:
        where 
 and 
 are the coordinates of the traffic facility point elements; 
 and 
 denote the calculated ellipse centers; 
 is the number of all traffic facility point elements in the study area; 
 and denotes 
 the mean center position of the traffic facility point elements; 
, 
 denote the standard deviation of the long and short axes of the ellipse respectively; and 
 is the azimuth of the ellipse.
  3.3. Imbalance Index (S)
The imbalance index (
) can reflect the balanced distribution of POI points [
23] in each city of the GBA, calculated by the formula:
		
		where 
n denotes the total number of POIs of each type, and 
Yi denotes the cumulative percentage of the number of POIs of each prefecture-level city to the i-th position of the total number of POIs of the GBA in descending order.
  3.4. Geographical Concentration Index (G)
The geographical concentration index (G) is mainly used to measure the concentration of the spatial distribution of geographical elements and is a common indicator in geographic studies [
23]. It indicates the degree of concentration of the studied elements in the regional space, and the smaller the value, the lower the concentration, and vice versa. Its value is between 0 and 1. The higher its value, the higher the degree of geographical concentration of the industry.
        
        where the larger the value of 
, the higher the degree of concentration, assuming the average distribution of POIs is 
; if 
, it means that the traditional villages are concentrated, and vice versa, they are scattered.
  3.5. Kernel Density Estimation (KDE) Method
The kernel density estimation (KDE) method is employed to estimate the density of traffic facility point elements by using moving cells [
24]. Its basic principles include defining a circular domain with a fixed search radius, ensuring that the circle domain encompasses each traffic facility point; determining the size of the output raster based on the required density accuracy; and calculating the density contribution of traffic facility points to each raster within the circle domain. The density contribution of each traffic facility point to every raster within the circle domain is accumulated, and the density value of each raster is assigned to output the density value of each raster.
Kernel density analysis enables the derivation of the distribution pattern of tourism spaces in the Guangdong–Hong Kong–Macao Bay Area, as represented by the following formula:
        where 
 is the bandwidth; 
 is the number of point-like elements of traffic facilities in the study area; 
 is called the kernel function; and 
 denotes the distance from the estimated point of traffic facilities to the sample 
.
  3.6. Bivariate Local Moran’s I
The concept of bivariate spatial correlation is complex and often misinterpreted. It is commonly perceived as the correlation between one variable and the spatial lag of another variable, as originally implemented in the precursor of GeoDa [
25]. However, this approach fails to consider the inherent correlation between the two variables. More precisely, the bivariate spatial correlation is between xi and 
, but does not take into account the correlation between 
 and 
, i.e., between the two variables at the same location.
As a result, this statistic is often interpreted incorrectly, as it may overestimate the spatial aspect of the correlation, which could actually be primarily attributed to the in-place correlation. Essentially, this notion of bivariate spatial correlation measures the degree to which the value for a given variable at a specific location is correlated with its neighbors for a different variable.
As in the univariate Moran scatter plot, the interest is in the slope of the linear fit. This yields a Moran’s I-like statistic as:
        where 
 is the element of the spatial weight matrix showing the proximity of the 
 region and the 
 region, and 
, 
 are the two variables studied.
The Moran scatter plot classifies spatial associations into four categories, corresponding to the location of the points in the four quadrants of the plot. These categories are known as high–high, low–low, low–high, and high–low, relative to the mean, which is positioned at the center of the graph. The Moran scatter diagram provides a visual representation of spatial associations, grouping them into these four categories based on the positions of the points within the quadrants. This distribution pattern can be displayed on a graph, also referred to as a bivariate Lisa clustering map.
  3.7. Ordinary Least Squares Model (OLS), Spatial Lag Model (SLM), and Spatial Error Model (SEM)
Considering the heterogeneity and agglomeration in the spatial distribution of service industries, such as the restaurant industry, a spatial regression model was selected for the analysis to account for the spatial distribution of variables and disturbance terms [
26]. In this paper, we employed the OLS model, spatial lag model, and spatial error model to analyze the factors influencing the spatially differentiated agglomeration of service industries, particularly the catering industry.
- The OLS model is a linear regression model. This model is the most basic and commonly used regression model to determine the closeness to the dependent variable by establishing a numerical model or fitting means to correlate two or more variables, generally in the form of:
         - 
        where  -  is the explained variable;  -  is the explanatory variable;  -  is the intercept parameter;  -  is the slope parameter; and  -  is the random error vector. 
- The spatial lag model ( SLM- ): This model describes the spatial correlation between the dependent variables, i.e., whether the neighboring explanatory variables affect the local explanatory variables through a spatial transmission mechanism, and mainly explores whether the variables have spillover effects in a region, in the general form of
         - 
        where  -  is the explained variable;  -  is the spatial regression coefficient;  -  is the  -  spatial weight matrix;  -  is the spatial lagged explained variable of the spatial weight matrix  - ; x is the  -  exogenous explanatory variable matrix;  -  reflects the effect of the explanatory variable on the explained variable; and  -  is the random error vector. 
- The spatial error model ( SEM- ): This model is a regression model that sets a spatial autocorrelation term on the error term in the model, i.e., whether the neighboring error term affects the local explanatory variables through a spatial transmission mechanism, and is applicable to the case where the interaction between regions differs depending on the relative locations in which they are located, and the general form of the model is:
         - 
        where  -  is the explanatory variable;  -  is the  -  matrix of exogenous explanatory variables;  -  reflects the coefficient of the degree of influence of the explanatory variable  -  on the explanatory variable  - ;  -  is the random error vector;  -  is the spatial error coefficient of the vector of explanatory variables;  -  is the  -  spatial weight matrix;  -  is the random error vector of normal distribution. 
  3.8. Coupling Degree and Coupling Coordination Analysis
Coupling degree is a physical concept utilized to describe the extent of mutual influence and interrelation between systems or elements [
27]. On the other hand, the degree of coordination is employed to depict the extent of favorable interaction and coordinated development between systems or elements, thereby reflecting the sustainability of benign interaction and correlation. The concept of coupling coordination degree is built upon both ideas and serves to characterize the degree of mutual influence and coordination between systems or elements. It not only indicates the strength of interconnectedness between systems but also reflects the level of coordination, whether it is favorable or unfavorable, among these systems [
28]. The specific formula for calculating the coupling coordination degree is shown as follows:
		
- 1.
- The data normalization process was completed in the previous section, so it is not repeated here. 
- 2.
- Calculate the weight of the value of the  - th indicator for the  - th sample:  
- 3.
- Calculate the information entropy of the indicators:
         
- 4.
- Calculate the information entropy redundancy:
         
- 5.
- Determine the indicator weights:
         
- 6.
- Calculate the combined score of each system:
         
- 7.
- Calculate the coupling degree and coupling coordination:
         - 
        where m is the number of evaluation samples, n is the number of indicators, and  ,-  is generally taken as 0.5. 
  3.9. Variable Selection for the Regression Model
The spatial distribution pattern of urban tourism may be affected by multiscale and multidimensional factors. Combining the existing theories and research results, the influencing factors were divided into four categories: demographic factors, economic factors, transportation factors, and facilities of hospitality.
- Demographic factors: As the population size of an area grows, the level of consumption demand and diversification of tourism also increases. Residential areas are positively correlated with tourism density [ 29- ], and the distribution of tourism services often changes in accordance with the spatial changes in population distribution. Small-scale tourism tends to be located in densely populated areas, residential areas, and areas which travelers can conveniently access. 
- Economic factors: Areas with a higher level of regional economic development have better facilities; intensive economic activities; a spatial convergence of human, logistic, and information flows; and, thus, higher tourism density. However, this conventional idea needs to be re-examined through micro-scale data. In this sense, this paper adopts one-square-kilometer GDP as the spatial unit of panel data, while most of the literature adopts city- and district-level data of GDP [ 30- ]. 
- Transportation factors: An improvement in public transportation capacity can bring a higher level of convenience to travelers and greater foot traffic and consumption to the area [ 31- ]. Thus, the distribution of tourism services tends to be in areas with better transportation accessibility. In this paper, three variables, namely, parking lots, subway stations, and bus stops, were selected to analyze the impact of different levels of transportation accessibility on the distribution of tourism. 
- Facilities of hospitality: Accommodation services and travel agencies can provide convenient conditions for the tourism industry, and the higher the abundance of both, the greater the diversity of choices for tourists, forming a positive driver for the concentration of foot traffic [ 32- ]. The variables and specific index design are shown in  Table 2- . 
  4. Results
  4.1. Descriptive Analysis of the Spatial Distribution of Tourism in the GBA
The results obtained from the average nearest neighbor index indicate that all four types of tourism services exhibit nearest neighbor ratios lower than 1, demonstrating clustering characteristics supported by Z and 
p values. This paper, therefore, categorizes the 2019 POIs into three levels: catering services and shopping services represent the most aggregated spaces, followed by sports and leisure services distributing in a more clustered tier. The scenic spots, in contrast to the other three types, display the lowest degree of clustering. These findings are presented in the ANN table (
Table 3).
Furthermore, by combining the outcomes of the standard deviation ellipse analysis (
Figure 2, 
Table 4 and 
Table 5), the following becomes evident:
The center of the standard deviation ellipse for tourism in the Guangdong–Hong Kong–Macao Greater Bay Area primarily lies to the right of the Nansha Wetland in Guangzhou, exhibiting a noticeable trend of southeastward distribution. The aggregation area follows a rough “southeast-northwest” orientation, covering most regions within Guangdong, Dongguan, Foshan, Shenzhen, Zhuhai, Hong Kong, and Macao.
The overall spatial distribution pattern of tourism services in the GBA is characterized by a denser concentration within the ellipse and sparser distribution outside the ellipse. Furthermore, the calculation of the imbalance index S reveals that the data of various POIs are unevenly distributed within the GBA. Meanwhile, the geographical concentration index G and G0, derived from the above equation, demonstrates that each G is greater than G0, indicating a high level of POI data concentration, corroborating the results of the analysis of nearest neighbor ratio.
  4.2. Kernel Density Analysis of Tourism Service in the GBA
In order to identify the specific area of agglomeration of tourism services in the GBA, the degree of spatial agglomeration of different types of tourism POI was examined using kernel density analysis (see 
Figure 3).
Catering and shopping services show a similar pattern of large-scale and highly concentrated clustering within the downtown areas of the cities in the GBA. Specifically, there are three main cores of tourism services: (1) Guangzhou’s downtown areas of Tianhe, Yuexiu, and Liwan; (2) the eastern part of Zhuhai–Macao clustering; (3) Shenzhen and Hong Kong clustering. The peripheral areas between these cores display a discrete distribution of tourism services. Additionally, non-core regions like the downtown area of Huizhou are anticipated to emerge as a new core area.
As for sports and leisure services, they are currently clustered at low density in Guangzhou City’s Huangpu and Nansha Districts, as well as in Foshan City’s Shunde District. A continuous development trend is expected in Guangzhou’s Tianhe and Huangpu Districts, along with Shunde District, in the future. Scenic services generally cluster in bands, forming several medium- and high-density centers in Guangzhou, Shenzhen, and Hong Kong. Compared with other tourism categories, scenic spots highly aggregate within the downtown areas of Guangzhou, Shenzhen, and Hong Kong.
  4.3. Bivariate Local Moran’s Index and Cluster Analysis
The bivariate local Moran’s index is used to test whether the distribution of tourism services has a spatial correlation with its neighbors for a different variable. (Here, we only tested tourism and GDP as an example.) As in 
Table 6, the bivariate local Moran’s index revealed a significant spatial correlation between the four types of POI services and 1 km
2 GDP in the overall region. However, the Lisa cluster analysis exposed distinct clustering characteristics for each POI type and helps us identify different types of spatial correlations between tourism services and GDP (see 
Figure 4). For example, as for 
Figure 4a, the high–high aggregation means the highly dense distribution of catering services is spatially correlated with its neighboring area with a high level of GDP. In other words, catering services tend to concentrate in areas with a high level of GDP. Notwithstanding this spatial relationship, the bivariate local Moran’s index cannot help us discern whether GDP is a driving force that influences the distribution of tourism services. This requires us to use spatial regression models and the coupling degree index to further detect this spatial relationship, which we will elaborate on later.
Like catering services, other tourism service categories of shopping, scenic spots, leisure, and sports services share similar characteristics of spatial correlation. However, in the periphery of the GBA, these services belong to low–low aggregation, with rare instances of other aggregation patterns observed. Unlike the kernel density estimation map, the high–high aggregation areas extend across municipal boundaries in the economic field, indicating significant development potential in the Pearl River Delta region. The non-significant areas surround the core high–high aggregation areas, indicating that these types of areas are discrete, with no obvious spatial bivariate correlation. They are randomly distributed, and there is a random pattern of aggregation of high and low values within each area, which does not effectively indicate the direction of development.
  4.4. Discussion on the Driving Forces of Tourism Space in the Guangdong–Hong Kong–Macao Greater Bay Area
  4.4.1. Overall Examination
The maximum likelihood estimation method was employed to analyze the factors influencing the spatial distribution of tourism services, using the OLS model, spatial lag model, and spatial error model (
Table 7). The mean variance inflation factor was calculated to be 1.77, with a maximum value of 2.76, indicating no significant issues of multicollinearity in the models. To facilitate a quantitative comparison of the influence of different factors on tourism service distribution, each independent variable was standardized to account for extreme differences. The explanatory power of all the models exceeded 72.0%. After comparing log likelihood values and variable significance across the three models, it was observed that the spatial error model exhibited the fewest large likelihoods and better significance for each variable. As a result, the spatial error model was selected as the benchmark model to analyze the influencing factors.
Regarding demographic factors, the model results show that population size has a significant positive effect on tourism services distribution. In terms of economic factors, this study shows a weak negative correlation between tourism services and the level of regional GDP at a network scale of one square kilometer.
Regarding transportation factors, the study found that superior transportation conditions promote tourism agglomeration, but the extent of the effect varies by the mode of transportation. The number of parking lots reflects the scale of tourism services accessible by automobile, which is significantly and positively correlated with tourism service density. Additionally, bus stops also show a significant positive relationship with tourism service density, as convenient public transportation helps to increase tourism service density. However, according to the spatial error model analysis, subway stations exhibit a negative correlation with tourism service density.
Regarding the facilities of hospitality, the Guangdong–Hong Kong–Macao Greater Bay Area has formed a spatial structure with the clustering of tourism facilities, clustering of tourism industry enterprises, and clustering of talents and employment. It is worth noting that both travel agencies and accommodation services have a positive effect on the distribution of tourism services.
  4.4.2. Categorical Examination
To further explore the variability of the spatial distribution of different categories of tourism services, namely catering, scenic spots, shopping, and sports and leisure, they were extracted as important components for analyzing and comparing influencing factors (
Table 8 and 
Table 9).
It can be observed that the spatial distribution of the three types of tourism services (catering, shopping, and sports and leisure) exhibited similarities, with the models explaining over 64.0% of the spatial distribution for these categories. However, the R-square value for attraction services was relatively lower. As shown in the table below, all four categories of tourism services were positively influenced by factors such as population distribution, the number of parking lots, bus stops, travel agencies, and accommodation services. However, the strength of the influence varied among the different categories. For instance, population and travel agency density had the highest contribution coefficient to sports and leisure service density, and only the subway station contribution coefficient for sports and leisure services was positive, while for the other three categories, the subway station contribution coefficient was negative. On the other hand, parking lots, bus stops, and accommodation service density had the highest contribution to catering service density, indicating a significant positive driving effect of accommodation services on catering services, which was closely related to people’s dining and accommodation needs. Furthermore, GDP only showed a positive influence on attraction and sports and leisure services, while it was negatively correlated with the other two categories of services.
  4.5. Analysis of the Coupling of Tourism Space with Economy and Population
The comprehensive evaluation model was employed to calculate the comprehensive evaluation value of GDP, population, and tourism services for each sample. Subsequently, correlation analysis was conducted in SPSS to assess the relationship between GDP, population, and tourism services. The Pearson correlation coefficient between GDP and tourism services was found to be 0.447, while the Pearson correlation coefficient between population and tourism services was 0.579. Both correlations were significant at the 0.01 level, indicating a high degree of synchronization between GDP, population, and tourism services.
Further data processing was performed according to the chosen theory and research method, resulting in the determination of the coupling degree of GDP, population, and tourism services at the 1 km
2 level, represented as images in ArcGIS (
Figure 5). The levels of coupling degree were classified as follows (
Table 10 and 
Table 11).
Analysis of the results indicates that the coupling effect of population and tourism services surpassed that of GDP and tourism services. The high-value areas of GDP and tourism service coupling were primarily concentrated in Guangzhou, Foshan, Zhuhai, Dongguan, and Shenzhen, signifying strong interdependence and mutual influence in these regions. Notably, the regions with a high coupling degree displayed dense clustering, such as the Liwan, Yuexiu, Haizhu, and Baiyun Districts in Guangzhou, as well as the Nanhai, Chancheng, Shunde Districts, among others. Additionally, large portions of Dongguan and most areas of Shenzhen exhibited high coupling degrees. However, in the fringe areas of the Guangdong–Hong Kong–Macao Greater Bay Area, connectivity was poor, resulting in low-value gathering areas for the coupling of GDP and tourism services.
  5. Discussion
With the help of ArcGIS, SPSS, and GeoDa, this paper analyzed the tourism spatial pattern characteristics of cities in the Guangdong–Hong Kong–Macao Bay Area by using a variety of methods and reveals some new findings that research based on traditional census statistics failed to discover. In what follows, we further respond to the three research questions outlined in the beginning of this paper.
  5.1. The Spatial Patterns of Tourism
The nearest neighbor ratio of tourism space of the GBA in 2019 was 0.140, which shows that the tourism service facilities in the GBA exhibited noticeable spatial aggregation. Catering and shopping services had a much higher degree of spatial aggregation than that of sports and leisure services.
Combined with the imbalance index S, the geographic concentration index G, and the local Moran’s index, it was found that the distribution of different types of POI data was imbalanced and clustered at the same time. This also manifested in the analysis of kernel density: (1) Catering, shopping, and sports and leisure services were spatially clustered in groups within the cities’ downtown areas where population concentrated. It was also illustrated in the regression model that population density positively correlated with the density of tourism services. (2) The distribution of scenic spots was not fully consistent with that of catering, shopping, and sports and leisure services, as many historical heritage sites or zoos are normally located in the old town or the suburb of the cities. Yet, all categories of tourism services tended to correlate with variables of economic level and population density in the grid of 1 km2. In conclusion, the distribution of tourism services is highly dependent on the economy and population density.
  5.2. Driving Forces of Tourism Space
This paper also examined what factors may influence the spatial distribution of tourism services in the GBA. Regarding demographic factors, the model results show that population density has a significant positive effect on the distribution of tourism services. Yet, the positive correlation between population and tourism services is often not singular but mutually reinforced. As Rapport’s research suggests, the consumption of tourism services relies on the high density of the population, while tourism services themselves can also be the amenities that become an important determinant of where people choose to live [
32].
Regarding economic factors, this study found a weak and negative correlation between tourism services and regional GDP at the scale of 1 
. This finding is in contrast with conventional studies based on city- or provincial-level census data, which show a positive correlation between GDP and tourism services. This is attributed to the micro-scale spatial panel data (1 km
2) of GDP that we used in this paper. The development of the tourism economy or services may be spatially exclusive with other economic sectors and industries like manufacturing, technological industries, and finance in the spatial unit of 1 km
2. Moreover, some scenic spots or areas do not necessarily have significant “economic spillover” effects on local development [
33]. This, therefore, requires us to further examine the coupling relationship of tourism and GDP, which will be elaborated on in this section.
The effect of transportation on tourism services is diverse. The models show that areas with a high number of parking lots tend to have a higher density of tourism services. From the spatial error model, the effect of bus stops is significant, while the effect of metro stops is not significant. Most of the current suburban areas are still outside the reach of the metro, and some metro stations may be located in non-tourist areas of the city or have poor connectivity to major tourist attractions. Yet, the density of metro stations is positively related to sports and leisure services, probably because it can improve people’s accessibility to stadiums, parks, and other sports and leisure facilities [
34].
Regarding hospitality facilities, the GBA has now formed a spatial structure with a concentration of tourism facilities, industry enterprises, and talent employment [
35,
36,
37]. Travel agents and accommodation services have a positive impact on the distribution of tourism services, as both can provide hospitality support and guidance for tourism services. They can provide tourists with convenient access to dining, leisure, shopping, and attractions, and this convenience can lead to the aggregation of tourism services [
34,
38]. In sum, we identify GDP, transportation, and hospitality facilities in a grid of 1 km
2 as three important drivers for the distribution of tourism services. Because of the limitation of data sources, other factors which were not included in this paper, such as the GDP of the service industry and the number of visitors, may also influence the distribution of tourism services. Future research can use more comprehensive variables to examine the factors influencing the distribution of tourism services.
  5.3. The Coupling of Tourism with GDP and Population
As the economy and population are crucial factors influencing the distribution of tourism services, it is important to evaluate the extent to which tourism services are coordinated with the index of GDP and population. The coupling degree, therefore, offers us a method to classify the areas in which tourism services have good interactions with GDP and population.
In general, the coupling effect of tourism services with population is better than the coupling effect of tourism services with GDP. In this sense, population density is fundamental for consumption-oriented tourism services. A high coupling degree between tourism services and population (see 
Figure 5b) indicates that (1) population and tourism can mutually facilitate each other; (2) the distribution of tourism services or amenities is consistent with the distribution of population and, therefore, is reasonable.
We also classified the areas in which tourism services had a high degree of coupling with GDP. It can be seen that the high-value areas with a coupling between GDP and tourism services are principally located in Guangzhou, Foshan, Zhuhai, Dongguan, and Shenzhen, indicating that tourism and GDP are highly dependent on each other. However, areas with a high degree of coupling do not necessarily indicate that tourism services can be a driving force of GDP; as we discussed before, tourism economy may be exclusive with other economic sectors on a scale of 1 km
2. Yet, if we combine and overlap the areas with high–high aggregation measured using the local Moran’s index and the areas with a high coupling degree between tourism services and GDP, we can clearly identify the areas where tourism services are not only highly developed but also have a positive interaction with local GDP (see 
Figure 6a,b). We also offer the methodological process for identifying hotspots for tourism development in 
Figure 7. In this sense, such areas can be considered as suitable for the development of tourism because tourism can promote GDP growth, bring higher flows of people, and, therefore, create “spillover efforts” on the local economy. This growth relationship is bidirectional because such areas are both highly coupled and highly interactive and, therefore, are important areas for urban and tourism planning.
  6. Conclusions and Policy Engagement
This study analyzed the spatial distribution and agglomeration of tourism services in China’s Great Bay Area and explored the coupling of tourism space with the local economy and population. The major research findings can be summarized as follows: (1) Tourism services exhibited noticeable spatial aggregation within the cities’ metropolitan areas. Yet, catering and shopping services had a much higher degree of spatial aggregation than that of sports and leisure services and scenic spots. (2) Through 1 km2 scale panel data analysis, it was found that population density, transportation, and hospitality facilities were positively associated with the distribution of tourism services, but tourism services may be spatially exclusive with other economic sectors like industries and, therefore, show a negative association with local GDP. (3) We identified the areas in which tourism services developed in tandem with the local economy and population density, and consider them as hotspots suitable for the development of tourism.
This paper also offers important policy implications for tourism development in the GBA. First, the planning of tourism services and facilities should be consistent with population density as it is an important supply-side indicator of the tourism market. Tourism services should also be supported by transportation and hospitality facilities, as they influence the movement and stasis of tourism flows. This is particularly important for some less developed areas that intend to use tourism as an economic facilitator: tourism needs to be developed in a holistic sense with the support of economic and transportation systems. Second, policymakers should pay special attention to the areas with a high degree of coupling between tourism, the economy, and population density in order to maximize the spatial–economic effects of tourism. As visualized in 
Figure 6, which shows hotspot areas, tourism investment and the planning of tourism facilities should look towards these areas so that the socio-economic benefits of tourism can be optimized.
However, this paper also has some limitations. First, the POI data can capture the category and number of tourism services but fail to reflect the quality of the specific spatial unit. For example, different scenic spots may have quite different capacities for attracting visitors and have different economic outputs. Second, the variables of transportation in the regression model can be improved by measuring the degree of accessibility that POI data cannot simply address. In this sense, future research should use multiple sources of data to address the limitations of POI data so as to more comprehensively cover both the quality and quantity of tourism services.