Nanjing’s Intracity Tourism Flow Network Using Cellular Signaling Data: A Comparative Analysis of Residents and Non-Local Tourists

: With the rapid development of transportation and modern communication technology, “tourism ﬂow” plays an important role in shaping tourism’s spatial structure. In order to explore the impact of an urban tourism ﬂow network on tourism’s spatial structure, this study summarizes the structural characteristics of the tourism ﬂow networks of 43 scenic spots in Nanjing from three aspects—tourism ﬂow network connection, node centrality, and communities—using cellular signaling data and the social network analysis method. A comparative analysis revealed the tourism ﬂow network structures of residents and non-local tourists. Our ﬁndings indicated four points. Firstly, the overall network connectivity was relatively good. Core city nodes displayed high spatial concentration and connection strength. However, suburban nodes delivered poor performance. Secondly, popular nodes were intimately connected, although there were no “bridging” nodes. Lesser-known nodes were marginalized, resulting in severe node polarization. Thirdly, regarding the network community structure, the spatial boundary between communities was relatively clear; the communities within the core city were more closely connected, with some parts encompassing suburban nodes. Most suburban communities were attached to the communities in the core area, with individual nodes existing independently. Fourthly, the primary difference in the tourism ﬂow network structures between residents and non-local tourists was that the nodes for residents manifested a more balanced connection strength and node centrality. Core communities encompassed more nodes with more extensive coverage. Conversely, the nodes for non-local tourists showed wide discrepancies in connection strength and node centrality. Furthermore, core communities were small in scale with clear boundaries.


Introduction
Since the 1960s, due to the continued developments in modern science and technology (including computer and network information technology, advanced transportation, modern communications, globalization, and informatization), global networking has become a significant development tendency. Against this background, by integrating the Marxist theory of globalization, information theory, and postmodern space theory, sociologist Manuel Castells proposed a novel social research theory-the space of flows theory-revealing a new perspective on the organizational logic of the modern social system [1]. Under its influence, "space of flows" has become a research hotspot in the geography domain, and has inspired many research topics, including information flow, traffic flow, knowledge flow, culture flow, and technology flow.
Tourism space is the projection of tourist activities in space. Regarding tourism destinations, tourists display the characteristic of mobility [2]. Therefore, the concept of tourism flow appeared when the spatial structures of tourism, at urban and regional levels, the changes in external environment, on user check-in patterns [28]. In addition, due to the current global spread of COVID-19, many scholars have combined the geo-tagged data of social media with epidemic prevention needs to establish a reference model to predict the infection risk of social interaction and travel between residents and tourists [29,30].
Considering the lack of existing research, this study highlights three aspects of innovation. Firstly, it highlights the combination of the application of big data and research on tourism flows within cities at the medium and micro scales. Due to the limited precision of big data, the demand for washing and screening users' phone signaling data within a city is higher; therefore, the existing research is mainly from a regional perspective, focusing on the tourist flow between cities and the relationship between the cities' tourism resources [31]. This study carried out data processing on a variety of filter condition experiments, and the processing results, multiple times, meaning the relatively accurate basic data could be used to analyze the spatial structure of tourism flow within the city. Secondly, it highlights the combination of the social network analysis method and traditional space theory. Traditional space theory emphasizes static material space expression; however, this study used the social network analysis method to connect human activities with spatial structures, which can reflect the functional connection in urban space more accurately. Thirdly, it analyses the differences between local residents and tourists in a tourism network structure. The differences in tourism flow network structures, caused by different sources of tourists (local and non-local, age structure differences, consumption level differences, etc.) have been ignored in the existing research. This study attempts to conduct a comparative analysis of the tourism flow network that is formed by local residents and tourists, and to excavate the differences between them.
Therefore, in order to further explore the impact of an urban tourism flow network on tourism's spatial structure, this study uses the social network analysis method and cellular signaling data. From the perspective of tourists' spatiotemporal behavior, and using a summary of the overall characteristics of the tourism flow network structure of Nanjing city proper, this paper differentiates between resident and non-local tourism flow network structures. In addition to addressing the deficiency in the existing research, regarding the application of big data and the absence of research scale, it provides a scientific basis for the differentiated organization of tourism space and tourism routes, urban infrastructure, transportation planning, and tourism social management.

Research Districts
Nanjing city has a long history and culture, and unique natural landscapes, making it a major scenic tourist destination at the national level. In 2015, the number of domestic tourists reached 99.9266 million, ranking it among the top tourist cities in the country. According to the 2018 monthly "Report on the Platform for Operation Monitoring of Smart Tourism Big Data in Nanjing", issued by the Nanjing Municipal Bureau of Culture and Tourism, the number of tourists visiting scenic spots with grades of 2A or above reached 91.8% (referring to China's "Classification and Evaluation of Tourist Scenic Spot Quality Grade", the scenic spot classification includes five grades; 5A is the highest grade for scenic spots). This study took Nanjing city proper as its spatial scope, selecting all three A-grade or above scenic spots in the city as the specific research districts, including 43 popular scenic spots (see Figure 1), for tabulating the statistics of tourist spatial behaviors.

Data Sources
This study performed two statistical analyses on different temporal and spatial scales, using cellular signaling data, to examine the phenomenon of tourism flow resulting from the spatial displacement of tourist crowds. This method overcame the predicament of the insufficient temporal and spatial accuracy of tourist behavior, encountered in previous research adopting the traditional analytical method of regional perspective. The data in this study came mainly from the cellular signaling data (user ID, number attribution, geographical position of triggering base station, and triggering moment) provided by Nanjing Mobile. The concrete selection steps are outlined as follows. First, data collection appearing in Nanjing city proper on four statutory holidays in 2015, viz., November 14,15,21,and 27. Second, tourist monitoring at related scenic spots in Nanjing city, referencing average length of visit for each scenic spot in the city. Data were selected from at least two scenic spots within the research scope (see the above 44 scenic spots) that tourists visited for over an hour. These two scenic spots were treated as the spatial departure and destination points. Third, on this basis, per number attribution, the data of local and non-local users were identified. Based on the length of evening visits (staying for more than 3 consecutive hours between 24:00 and 07:00) on working days, versus day visits during working hours (staying for more than 4 consecutive hours between 07:00 and 19:00), the residences and workplaces of local users were identified. Based on the time spent on handling business in the daytime (staying for more than 2 consecutive hours between 08:00 and 19:00), the destinations of non-local users on business travel were identified, which were later excluded. In the end, from the 2.365 million active users (who appeared at least twice on the four statutory holidays), the data of 1.212 million local users and 108,000 non-local users were identified and used as the foundational data for this comparative study on tourism flow structures of local and non-local users.
Of the sampled users, Nanjing Mobile's market share was 64.7%, meaning the above identification results were tantamount to a large sample. Based on the permanent resident population of approximately 8 million in Nanjing's 6th population census, the sample of identified local users equaled 23.4% of the total population. Considering the average daily reception of 270,000 non-local tourists in Nanjing, the amount of non-local users in the sample was 10%. Such samples are far larger than those in traffic and manual questionnaire surveys. No statistical data were available to examine the tourist spots identification results in terms of identification accuracy. However, on the four holidays, the correlation coefficients between the daily distribution of any two recreation areas all reached over 0.8, indicating that the recreation areas' identification results were stable and should also be credible, thereby bearing overall representativeness.

Research Methods
The transfer and diffusion of tourists between tourism destinations produces a certain connection between each destination, giving rise to a dynamic evolutionary system. The social network analysis method can precisely depict the various relationships within a system, from the perspective of macro-structural relations. For this reason, has been used reliably in recent tourism studies. Therefore, this study, with the aid of the social network analysis software UCINET [32], analyzed three aspects-namely, tourism flow network connection, node centrality, and network communities. In addition, the overall characteristics of Nanjing's tourism flow network structure were summarized visually using the ArcGIS digital technology platform. On this basis, this study further compared the differences in the characteristics of tourism flow network structures, between local residents and non-local tourists. The concrete research methods are shown in Figure 2.

Methods for Analyzing Tourism Flow Network Connections
Based on social network analysis, the spatial departure statistics and tourist destination points in users' tourist activities were collected at each scenic spot, per their spatial positions. The scenic spots were set as network nodes, and visiting tourists were sequentially linked to form a network. By collecting the flow volume statistics of each node, we obtained the spatial concentration of each node in the entire tourism flow network to represent the out-degree centrality. Next, an asymmetric adjacency matrix was constructed, the multi-valued matrix was converted to a binary matrix, and the ArcGIS software analyzed the strength of network connection between nodes.

Methods for Analyzing Tourism Flow Network Connections
Based on social network analysis, the spatial departure statistics and tourist destination points in users' tourist activities were collected at each scenic spot, per their spatial positions. The scenic spots were set as network nodes, and visiting tourists were sequentially linked to form a network. By collecting the flow volume statistics of each node, we obtained the spatial concentration of each node in the entire tourism flow network to represent the out-degree centrality. Next, an asymmetric adjacency matrix was constructed, the multi-valued matrix was converted to a binary matrix, and the ArcGIS software analyzed the strength of network connection between nodes.

Methods for Analyzing Node Centrality in Tourism Flow Networks
The centrality of nodes is a significant indicator in social network analysis. Through the depiction of the centrality of different standards, the value and status of a node's existence can be reflected in tourism flow network structures. Three forms of centrality have been selected for analysis in this study: • In-degree centrality (popularity and attractiveness) In-degree centrality measures the popularity of a certain node, representing the extent of a scenic node's clustering ability. This study selected the in-degree centrality and standardized it to the [0,1] interval. The calculating formula of the standard in-degree centrality CRDi of node i is as equation: • Eigenvector centrality (latent attractiveness) This depends on the direct relation of a node with its adjacent point and represents the connectivity level of tourism nodes. Regarding adjacent matrix A, the score Xi of the relative centrality of node i should exist:

Methods for Analyzing Node Centrality in Tourism Flow Networks
The centrality of nodes is a significant indicator in social network analysis. Through the depiction of the centrality of different standards, the value and status of a node's existence can be reflected in tourism flow network structures. Three forms of centrality have been selected for analysis in this study:

•
In-degree centrality (popularity and attractiveness) In-degree centrality measures the popularity of a certain node, representing the extent of a scenic node's clustering ability. This study selected the in-degree centrality and standardized it to the [0,1] interval. The calculating formula of the standard in-degree centrality C RDi of node i is as equation: • Eigenvector centrality (latent attractiveness) This depends on the direct relation of a node with its adjacent point and represents the connectivity level of tourism nodes. Regarding adjacent matrix A, the score X i of the relative centrality of node i should exist: Rewriting the expression gives the eigenvector calculating equation: Solving and standardizing the matrix gives the calculating equation of the standard eigenvector centrality C Rβ : In the equation, α is the standardized constant, λ is the eigenvalue corresponding to the first eigenvector, which determines the importance of the adjacent point to centrality, and I is the identity matrix.

•
Betweenness centrality (controlling ability of mediation) This measures the ability of a node to control the movement of tourists between node pairs in the tourism flow network, manifesting the controlling ability of a node, or its network mediating and moderating effects.
Assuming that the number for the shortest path between node j and node k is g jk , and the number for the shortest path between node j and node k that must pass through i is g jk(i) , then the probability of i situating in the shortest path between j and k can indicate the betweenness centrality of node i. The calculating formula of the standard betweenness centrality is as calculating equation: Based on the above standard centrality, centrality was divided into three classes (high, middle, and low) through natural breaks, in order to ensure that the internal difference within the same class was minimized, while the differences between different classes was maximized. This paper combined the characteristics of node centrality (in-degree centrality, eigenvector centrality, and betweenness centrality) (see Table 1), and constructed an evaluation system for the node status in the tourism network through the (matrix) model. Table 1. Combination relationship of three types of node centrality in the tourism network.

Low In-Degree Centrality Low Eigenvector Centrality Low Betweenness Centrality
High in-degree centrality -Nodes in more popular areas, although in single tourism routes.
Nodes themselves are popular, although they are usually treated as the departure and destination points on tourism routes.

High eigenvector centrality
Exist in the areas around nodes of high popularity. -Nodes and their adjacent nodes are located in areas of highly intensive activity, and possess interactive diversity.

High betweenness centrality
Although less known, such nodes are located where visitors are highly likely to pass on their way to access other nodes.
Rare type. Nodes in a monopoly position when connecting with marginal nodes, and are areas of highly intensive activity.

Methods for Analyzing the Communities in the Tourism Flow Network
Community analysis is another important focus of research in social network analysis. The communities that are demarcated through nodes, and their connections, can reflect the extent of independence and popularity change of network tourism routes. This paper used the CONCOR method of the UCINET analysis software to perform a cohesive subgroup analysis, measuring the strength of node connections in flow volume. The entire network was divided into several sub-networks with powerful internal connections. Through multiple iterations, a correlation coefficient matrix was created. The higher the numerical value of the density matrix, the closer the connection between subgroups. It has guiding significance regarding tourists in their selection of combinations of tourism nodes, and designation of tourism routes. Furthermore, through the "core-edge" model, we further determined the status of tourism nodes in the overall network, and summarized and analyzed the structural model of Nanjing's tourism flow network.

Spatial Concentration of the Tourism Flow Network
According to the research methods stated in Section 2.3.1, of the 43 scenic spots comprising the spatial scope, we selected 10,252 pairs of nodes with vectors. We collected the statistics of each node's visit volume. Then, the nodes were divided into five classes, according to natural breaks. The larger the nodes, the higher their spatial concentration. The spatial concentration of the overall-local-non-local networks (see Figure 3) was thus derived, representing their out-degree centrality. The results showed the following. displayed a characteristic of "relatively consistent overall spatial concentration, with slight differences existing in individual nodes."

Analysis of the Connection Strength of the Tourism Flow Network
Based on the above conclusion, this study further analyzed the strength of the network connection between nodes. The lines between tourism nodes represent the volume of tourism flows, with the thickness of the lines indicating the volume size, and the arrows signifying the direction of the flows. The connection strength was obtained from the overall-local-non-local networks (see Figure 4). Our findings showed the following.
Generally, over 85% of Nanjing's node connections were concentrated in the core city area. Fundamentally, the node connections gradually weakened as the distance from the core city area increased. Most of the suburban nodes showed a unidirectional inflow. Such connections resulted from the spillover from the core city nodes. The top five node pairs As a whole, the overall connectivity of Nanjing's tourism flow network was relatively good. No nodes were completely isolated. From the entire city proper, the network density was not high. However, the nodes in the core city were highly popular, with flows exhibiting the clustering feature. Nodes whose spatial concentration was in the first class (Zhongshan Scenic Area, Confucius Temple, Xuanwu Lake, Presidential Place (1912 District), Nanjing University Gulou District, and Hongshan Forest Zoo) were all located in the core city. Their spatial concentrations were far higher than those in the second class, thus demonstrating their absolute core status. Additionally, all of the nodes whose spatial concentration ranked in the first three classes were in the core city. Most of the 14 nodes with the lowest class spatial concentration were in the suburbs outside the core city (including Gaochun International Cittaslow Tranquil, Pingshan Forest Park, Jinniu Lake Scenic Area, Fangshan Scenic Area, Dajinshan Scenic Area, Tiansheng Bridge Scenic Area, etc.), except for the Nanjing Yangtze River Bridge and Meixian Xincun Memorial Hall, which were in the core city. Generally, the spatial concentration of Nanjing's tourism flow showed a network characteristic of "exceptionally high spatial concentration for core city nodes, low spatial concentration for suburban nodes, and outstanding performance of core nodes".
By comparing the nodes' spatial concentrations formed by the tourism behavior of residents and non-local tourists, we discovered that the local network visits to scenic spots manifested higher densities than non-local tourists, although both displayed the characteristic of high spatial concentration for the core city network, and low spatial concentration for the suburban network. The spatial concentration of residents and nonlocal tourists related to core city nodes was generally similar. Residents displayed higher spatial concentration than non-local tourists, in terms of humanistic scenic spots (e.g., the Nanjing Museum, Nanjing University Gulou District, and Yihe Road Republican Architecture Complex). In comparison with non-local tourists, residents showcased higher and more comprehensive spatial concentration regarding suburban scenic spots. For nonlocal tourists, the nodes of Pingshan Forest Park, Fangshan Scenic Area, and Gaochun International Cittaslow Tranquil were isolated. In sum, both groups, residents and non-local tourists, displayed a characteristic of "relatively consistent overall spatial concentration, with slight differences existing in individual nodes".

Analysis of the Connection Strength of the Tourism Flow Network
Based on the above conclusion, this study further analyzed the strength of the network connection between nodes. The lines between tourism nodes represent the volume of tourism flows, with the thickness of the lines indicating the volume size, and the arrows signifying the direction of the flows. The connection strength was obtained from the overall-local-non-local networks (see Figure 4). Our findings showed the following.
Generally, over 85% of Nanjing's node connections were concentrated in the core city area. Fundamentally, the node connections gradually weakened as the distance from the core city area increased. Most of the suburban nodes showed a unidirectional inflow. Such connections resulted from the spillover from the core city nodes. The top five node pairs in the network connections (Zhongshan Scenic Area-Confucius Temple, Zhongshan Scenic Area-Xuanwu Lake, Beiji Ge Park (Jiming Temple)-Xuanwu Lake, Hongshan Forest Zoo-Xuanwu Lake, and XuanwuLake-Hongshan Forest Park) were all bidirectionally connected. Among them, Zhongshan Scenic Area, Confucius Temple, and Xuanwu Lake formed strong connections with two or more nodes and, thus, served as the absolute core in the network connection. Most of the second class node pairs formed from the first class node connection with other nodes, and so on. Thus a "tree" network structure manifests in Nanjing city's tourism flow network, where nodes with high connection strength spill over progressively to those with low connection strength. On the whole, the connection strength of Nanjing's tourism flow showed a "high-strength connection between nodes in the core city area, with a progressive decrease towards the periphery spatially".
Comparing the node connection strength that was formed by residents and non-local tourist behaviors, we found that both exhibited high connection strength in the core city, and low connection strength in the suburbs. However, the nodes that residents visited showed a higher overall connection strength than those of non-local tourists. The connection strength of the nodes visited by residents displayed a relatively balanced spatial distribution, and more diverse types. Comparatively, the connection strength of the nodes visited by nonlocal tourists showed a more concentrated spatial distribution, with blank spaces appearing in the network connection of some suburb nodes. Additionally, the spatial concentration tended to fall on higher-rated and better-known node connections. Resident-visited nodes that fell in the first grade of connection strength included the following four pair nodes: Xuanwu Lake-Zhongshan Scenic Area, Xuanwu Lake-Beijing Ge Park (Jiming Temple), Xuanwu Lake-Hongshan Forest Zoo, and Old East Gate-Confucius Temple; non-local resident-visited nodes with first grade connection strengths include the following two node pairs: Xuanwu Lake-Zhongshan Scenic Area and Confucius Temple-Zhongshan Scenic Area. This shows that marked differences existed between these two groups. In terms of tourism route selection, residents attached greater priority to proximity in spatial location, whereas non-local tourists tended to choose well-known scenic spots. Generally, concerning node connection strength, "the nodes residents visited displayed a spatial balance in connection strength and diversification of types, while the nodes visited by non-local tourists exhibited a spatial concentration and inclination toward highly-rated nodes. The two groups showed distinct differences in tourism route selection".
Comparing the node connection strength that was formed by residents and non-local tourist behaviors, we found that both exhibited high connection strength in the core city, and low connection strength in the suburbs. However, the nodes that residents visited showed a higher overall connection strength than those of non-local tourists. The connection strength of the nodes visited by residents displayed a relatively balanced spatial distribution, and more diverse types. Comparatively, the connection strength of the nodes visited by non-local tourists showed a more concentrated spatial distribution, with blank spaces appearing in the network connection of some suburb nodes. Additionally, the spatial concentration tended to fall on higher-rated and better-known node connections. Resident-visited nodes that fell in the first grade of connection strength included the following four pair nodes: Xuanwu Lake-Zhongshan Scenic Area, Xuanwu Lake-Beijing Ge Park (Jiming Temple), Xuanwu Lake-Hongshan Forest Zoo, and Old East Gate-Confucius Temple; non-local resident-visited nodes with first grade connection strengths include the following two node pairs: Xuanwu Lake-Zhongshan Scenic Area and Confucius Temple-Zhongshan Scenic Area. This shows that marked differences existed between these two groups. In terms of tourism route selection, residents attached greater priority to proximity in spatial location, whereas non-local tourists tended to choose well-known scenic spots. Generally, concerning node connection strength, "the nodes residents visited displayed a spatial balance in connection strength and diversification of types, while the nodes visited by non-local tourists exhibited a spatial concentration and inclination toward highly-rated nodes. The two groups showed distinct differences in tourism route selection."

Node Centrality Analysis
Centrality is a quantitative statistic of node power, as seen in the analytical methods for tourism flow networks in Section 2.3.2. Centrality can depict the values of nodes in tourism networks. The calculations in this paper produced the differentiated measure values of the three types of centrality under different tourism flow network contexts-namely, overall, local, and non-local (see Table 2). The natural break classification categorized them into three levels. The internal difference within the same level was minimized, while the differences between different classes was maximized. A spatial visualization was also performed (see Figures 5-7). • Popularity and Attraction of Network Nodes (In-degree Centrality) As shown in Figure 5, the popularity and attraction of the core city nodes were far higher than the suburban nodes. A total of 90% of the core city nodes were within the high and medium levels of popularity and attraction, with a relatively small range. As for the suburban nodes, their popularity and attractiveness were low, with only a small number at the high and medium levels of popularity and attractiveness.
A comparison of the popularity and attractiveness of the nodes visited by residents and non-local tourists showed that, in both groups, core city nodes were more popular than suburban nodes. The most popular nodes included Xuanwu Lake, Zhongshan Scenic Area, Confucius Temple, and Nanjing University Gulou District. The nodes that attracted residents were more balanced in their spatial distribution (particularly in the core city), with many of the nodes falling within the high and middle classes. By comparison, the nodes that attracted non-local tourists were more concentrated, and the number of popular nodes was far lower than that of the popular nodes visited by residents. and medium levels of popularity and attraction, with a relatively small range. As for the suburban nodes, their popularity and attractiveness were low, with only a small number at the high and medium levels of popularity and attractiveness. A comparison of the popularity and attractiveness of the nodes visited by residents and non-local tourists showed that, in both groups, core city nodes were more popular than suburban nodes. The most popular nodes included Xuanwu Lake, Zhongshan Scenic Area, Confucius Temple, and Nanjing University Gulou District. The nodes that attracted residents were more balanced in their spatial distribution (particularly in the core city), with many of the nodes falling within the high and middle classes. By comparison, the nodes that attracted non-local tourists were more concentrated, and the number of popular nodes was far lower than that of the popular nodes visited by residents.
(a) (b) Figure 5. Classification map of the in-degree centrality of nodes under three types of network context (overall, resident, and non-local tourist): (a) entire city of Nanjing; (b) core city of Nanjing.
• Latent Attraction of Network Nodes (Eigenvector Centrality) As indicated in Figure 6, the intensity of the activities around the core city nodes was far higher than in the suburban nodes. Six nodes were located in centers of intense activity (Zhongshan Scenic Area, Xuanwu Lake, Hongshan Forest Zoo, Confucius Temple, Nan- • Latent Attraction of Network Nodes (Eigenvector Centrality) As indicated in Figure 6, the intensity of the activities around the core city nodes was far higher than in the suburban nodes. Six nodes were located in centers of intense activity (Zhongshan Scenic Area, Xuanwu Lake, Hongshan Forest Zoo, Confucius Temple, Nanjing University Gulou District, and Yihe Road Republican Architectural Complex). Qixiashan Scenic Area, in the suburbs, had a high-level latent attraction, which resulted from the high intensity in the core city nodes.
Comparing the latent attraction of the nodes that were visited by residents and nonlocal tourists, our results showed that both bore similarities in the core city area. The nodes that were visited by residents achieved a higher rate of latent attraction than those that were visited by non-local tourists. In the suburbs, the nodes in the non-local tourist network displayed a higher level of latent attraction than those that were visited by residents. The findings show that residents had more flexible and varied selections of tourism nodes, while non-local tourists selected popular nodes as often as possible. • Mediating and Controlling Power of Network Nodes (Betweenness centrality) As shown in Figure 7, there was an unclear relationship between the mediating and controlling power of the nodes in the city proper and their spatial locations. Almost all of the nodes were at medium and low levels, with Zhongshan Scenic Area being the only essential node. This indicates that all of Nanjing city's nodes have weak mediating and controlling power, meaning each node was relatively independent, or mostly combinations of nodes, with no nodes being necessary. Thus, the network structure was rather loose, with no significantly popular tourism routes (links whose node number was greater than two).
Similarities were found in the comparison of the mediating and controlling power of the nodes visited by residents and non-local tourists. The nodes that non-local tourists selected had slightly stronger mediating and controlling power than those that were selected by residents. This shows that the combinations of nodes in the non-local tourism flow network were relatively uniform and stable, while those in the resident networks were more flexible and varied.

•
Mediating and Controlling Power of Network Nodes (Betweenness centrality) As shown in Figure 7, there was an unclear relationship between the mediating and controlling power of the nodes in the city proper and their spatial locations. Almost all of the nodes were at medium and low levels, with Zhongshan Scenic Area being the only essential node. This indicates that all of Nanjing city's nodes have weak mediating and controlling power, meaning each node was relatively independent, or mostly combinations of nodes, with no nodes being necessary. Thus, the network structure was rather loose, with no significantly popular tourism routes (links whose node number was greater than two).
Similarities were found in the comparison of the mediating and controlling power of the nodes visited by residents and non-local tourists. The nodes that non-local tourists selected had slightly stronger mediating and controlling power than those that were selected by residents. This shows that the combinations of nodes in the non-local tourism flow network were relatively uniform and stable, while those in the resident networks were more flexible and varied.

Evaluation of Node Status based on Node Centrality
A correlation exists between all of the types of tourism node centrality. Therefore, these three types of node centrality were arranged and combined to create a more comprehensive status evaluation system. Figure 8 depicts a coordinate system of differences in centrality combinations. In this system, axes X, Y, and Z represent in-degree centrality, eigenvector centrality, and betweenness centrality, respectively, and each divides into high, medium, and low levels. If a node had a combination of high and medium levels, or medium and low levels, the range was one. However, if it consisted of all three levels, or only high and low levels, the range was two. When all of the three centrality type levels were identical, there was no gradation. When all of the three node centrality types were low, there was always an edge node in the tourism network. When all of the three node centrality types were at the medium level, this meant that the node was still in balanced development. When the three node centrality types were high from beginning to end, it was a popular core node. This coordinate system arranged and combined the three different centrality type levels (high, medium, and low) to highlight tourism significance.
All of the nodes populated the coordinate system according to the three centrality combination types depicted in the node status evaluation system (see Figure 9). Three nodes with low levels of eigenvalue centrality were excluded from the diagram to improve readability. In the overall table, the more common combinations were: (1) middle in-degree, middle eigenvalue, and low betweenness-nodes had a certain popularity, as did the adjacent points, although it is very likely that they were the tourist's starting and destination points; (2) high in-degree, medium eigenvalue, and low betweenness-nodes were popular, and the adjacent points also had a certain level of popularity, being situated in the core position of non-core communities; (3) high in-degree, high eigenvalue, and medium betweenness-nodes and adjacent points were popular, situated at the "bridging" position between edge nodes and the core district, or between small communities.

Evaluation of Node Status Based on Node Centrality
A correlation exists between all of the types of tourism node centrality. Therefore, these three types of node centrality were arranged and combined to create a more comprehensive status evaluation system. Figure 8 depicts a coordinate system of differences in centrality combinations. In this system, axes X, Y, and Z represent in-degree centrality, eigenvector centrality, and betweenness centrality, respectively, and each divides into high, medium, and low levels. If a node had a combination of high and medium levels, or medium and low levels, the range was one. However, if it consisted of all three levels, or only high and low levels, the range was two. When all of the three centrality type levels were identical, there was no gradation. When all of the three node centrality types were low, there was always an edge node in the tourism network. When all of the three node centrality types were at the medium level, this meant that the node was still in balanced development. When the three node centrality types were high from beginning to end, it was a popular core node. This coordinate system arranged and combined the three different centrality type levels (high, medium, and low) to highlight tourism significance.
All of the nodes populated the coordinate system according to the three centrality combination types depicted in the node status evaluation system (see Figure 9). Three nodes with low levels of eigenvalue centrality were excluded from the diagram to improve readability. In the overall table, the more common combinations were: (1) middle indegree, middle eigenvalue, and low betweenness-nodes had a certain popularity, as did the adjacent points, although it is very likely that they were the tourist's starting and destination points; (2) high in-degree, medium eigenvalue, and low betweenness-nodes were popular, and the adjacent points also had a certain level of popularity, being situated in the core position of non-core communities; (3) high in-degree, high eigenvalue, and medium betweenness-nodes and adjacent points were popular, situated at the "bridging" position between edge nodes and the core district, or between small communities.  This study further compared the resident and non-local tourist networks (see Figure  10). We revealed and ranked the resident network's first three combinations: (1) medium in-degree, medium eigenvalue, and low betweenness; (2) medium in-degree, high eigenvalue, and medium betweenness; (3) high in-degree, medium eigenvalue, and low betweenness. This showed "high in-degree centrality, generally moderate eigenvalue centrality, and seriously polarized betweenness centrality." The results indicate that the popular nodes that were approved by residents were more diversified. Some of the popular nodes showed close connections and balanced development, while others existed independently. The first three combinations in the ranking of non-local tourist networks were: (1) medium in-degree, medium eigenvalue, and low betweenness; (2) low in-degree, medium eigenvalue, and low betweenness; (3) medium in-degree, medium eigenvalue, and medium betweenness-manifesting the characteristic of "overall low in-degree centrality, higher eigenvalue centrality, and seriously polarized betweenness centrality." Compared to those visited by residents, the popular nodes that were approved by non-local tourists  This study further compared the resident and non-local tourist networks (see Figure  10). We revealed and ranked the resident network's first three combinations: (1) medium in-degree, medium eigenvalue, and low betweenness; (2) medium in-degree, high eigenvalue, and medium betweenness; (3) high in-degree, medium eigenvalue, and low betweenness. This showed "high in-degree centrality, generally moderate eigenvalue centrality, and seriously polarized betweenness centrality." The results indicate that the popular nodes that were approved by residents were more diversified. Some of the popular nodes showed close connections and balanced development, while others existed independently. The first three combinations in the ranking of non-local tourist networks were: (1) medium in-degree, medium eigenvalue, and low betweenness; (2) low in-degree, medium eigenvalue, and low betweenness; (3) medium in-degree, medium eigenvalue, and medium betweenness-manifesting the characteristic of "overall low in-degree centrality, higher eigenvalue centrality, and seriously polarized betweenness centrality." Compared to those visited by residents, the popular nodes that were approved by non-local tourists This study further compared the resident and non-local tourist networks (see Figure 10). We revealed and ranked the resident network's first three combinations: (1) medium indegree, medium eigenvalue, and low betweenness; (2) medium in-degree, high eigenvalue, and medium betweenness; (3) high in-degree, medium eigenvalue, and low betweenness. This showed "high in-degree centrality, generally moderate eigenvalue centrality, and seriously polarized betweenness centrality". The results indicate that the popular nodes that were approved by residents were more diversified. Some of the popular nodes showed close connections and balanced development, while others existed independently. The first three combinations in the ranking of non-local tourist networks were: (1) medium in-degree, medium eigenvalue, and low betweenness; (2) low in-degree, medium eigenvalue, and low betweenness; (3) medium in-degree, medium eigenvalue, and medium betweenness-manifesting the characteristic of "overall low in-degree centrality, higher eigenvalue centrality, and seriously polarized betweenness centrality". Compared to those visited by residents, the popular nodes that were approved by non-local tourists were relatively concentrated and stable, and mostly in a state of moderate popularity or marginalization. were relatively concentrated and stable, and mostly in a state of moderate popularity or marginalization.
(a) (b) Figure 10. Comparison of the node status evaluation in resident networks and non-local tourist networks: (a) resident networks; (b) non-local tourist networks.

Cohesive Subgroup Analysis of the Tourism Flow Network
The CONCOR method in the UNICET software was used to perform a cohesive subgroup analysis. We calculated the coefficients of each row (or column) in the matrix, with the final results shown in Table 3. The higher the numerical value of the density matrix, the closer the subgroup connection.
The results of the cohesive subgroup analysis revealed the substructures within the tourism flows, and show more tourist route combinations. In the cohesive subgroup density matrix of the overall tourism flow network in Nanjing city (see Table 3), eight substructures formed, each with varying closeness of flow connections between the subgroups. Subgroup 5, 6, and 8 showed the most frequent interactions between internal node

Cohesive Subgroup Analysis of the Tourism Flow Network
The CONCOR method in the UNICET software was used to perform a cohesive subgroup analysis. We calculated the coefficients of each row (or column) in the matrix, with the final results shown in Table 3. The higher the numerical value of the density matrix, the closer the subgroup connection.
The results of the cohesive subgroup analysis revealed the substructures within the tourism flows, and show more tourist route combinations. In the cohesive subgroup density matrix of the overall tourism flow network in Nanjing city (see Table 3), eight substructures formed, each with varying closeness of flow connections between the subgroups. Subgroup 5, 6, and 8 showed the most frequent interactions between internal node members. Subgroup 7 showed a higher frequency in the internal interaction between its subgroup nodes and their external connections. In this light, combination marketing, or combined tourism tickets, may appeal to these two subgroups. The nodes in subgroup 1 and 3 were mainly dependent on their connections with other subgroups. These node types may be treated as additional products to the above subgroups. Subgroup 2 and 4 were relatively independent, with comparatively weak internal and external connections, and can be branded separately as tourism products with unique features.  Based on the analysis of the eight cohesive subgroups in the overall tourism flow network, these subgroups fall into four classifications: (1) endogenous agglomeration type-AVG1 (degree of internal connection > AVG, and AVG2 (degree of external connection) ≤ AVG, manifesting as the subgroups converging inward, powerful connections between internal individual members, and weak connections outside of the subgroups; (2) internal-external balance type-AVG1 (degree of internal connection) > AVG, and AVG2 (degree of external connection) > AVG, manifesting as the subgroups having strong internal and external connections; (3) externally attached type-AVG1 (degree of internal connection) ≤ AVG, and AVG2 (degree of external connection) > AVG, manifesting as weak connections between members and internal subgroups, although forming a strong connection with one or several external subgroups; (4) individual independence type-AVG1 (degree of internal connection) ≤ AVG, and AVG2 (degree of external connection) ≤ AVG, with members within subgroups having relatively weak internal and external connections, and failing to form obvious network connections with other subgroups and individuals (or forming a connection with individual subgroups).
Likewise, the local and non-local tourism flow networks could be classified according to this criterion, which produced our results for the analysis of the tourism flow networks of the three types of subgroup (overall, local, non-local) (see Table 4, Figure 11).
Overall, in Nanjing city, all of the tourism nodes with a high visit volume (tourism brand image) were within the endogenous agglomeration and internal-external balance types, thereby becoming the absolute cores of these two kinds of subgroups. Spatially, they connected to several adjacent nodes, with well-defined edges to the subgroups, and all were within the core city of Nanjing. Furthermore, some nodes that were on the periphery of the core city existed by attaching to these two kinds of subgroups. There were also some relatively independent, small-scale node groups, with only a single flow path. These results align with the study on the all-for-one tourism policy of cities. From the residents' point of view, more nodes were classified as endogenous agglomeration and internal-external balance types. These form an even greater agglomeration network in the tourism flow, breaking down the spatial boundary of the core city, and absorbing more suburban nodes. Furthermore, the subgroups of externally attached and individual independence types had relatively few nodes, and showed the tendency of gradual absorption by the subgroups with a high agglomerative nature.
From the perspective of non-local tourists, no subgroup emerged from the endogenous agglomeration type. Additionally, almost all of the subgroups of the internal-external balance type were within the core city. Such subgroups were formed by core nodes and a few adjacent nodes, and were, therefore, subject to spatial limitations. More than half of the nodes were situated within the subgroups of the externally attached and individual independence types, indicating that only the core nodes in tourism flow network structures performed prominently, whereas the overall connection was insufficient and fragmented.

"Core-Edge" Model
Based on the binarization results of the communities in the tourism flow networks that were extracted using the "core-edge" model of UNICET, the results of the overall tourism flow in Nanjing indicate the following (see Table 5, Figure 12). The core district members included four nodes-namely, Confucius Temple, Presidential Palace (1912 District), Xuanwu Lake, and Zhongshan Scenic Area. The remaining 39 nodes were at the district's edge. Regarding the correlation degree, core district members reached 0.78,

"Core-Edge" Model
Based on the binarization results of the communities in the tourism flow networks that were extracted using the "core-edge" model of UNICET, the results of the overall tourism flow in Nanjing indicate the following (see Table 5, Figure 12). The core district members included four nodes-namely, Confucius Temple, Presidential Palace (1912 District), Xuanwu Lake, and Zhongshan Scenic Area. The remaining 39 nodes were at the district's edge. Regarding the correlation degree, core district members reached 0.78, while district edge members were only 0.08, indicating obvious structural stratification in the tourism network in Nanjing. Furthermore, the correlation degree between core members and edge members reached 0.29, indicating that a connection between core and district edge was also relatively close. Regarding spatial distribution, the core district nodes were all located within the core city. Spatially, a diminished connection density was observed between the district edge and the core district nodes, from the core to the periphery. Therefore, aside from actively developing the core tourism districts, it is necessary to simultaneously enhance the overall tourism competitiveness of Nanjing city through positive cultivation, systematic and active expansion, and linkage to edge tourism districts.
By comparing the "core-edge" structures that were formed by resident and nonlocal tourist tourism flow networks (see Table 5), this study found that the respective core district members had slightly different compositions. The core district that was formed by resident tourism flow included Zhongshan Scenic Area, Xuanwu Lake, Beiji Ge Park (Jiming Temple), Confucius Temple, and Hongshan Forest Zoo. The non-local tourist district included Zhongshan Scenic Area, Confucius Temple, Xuanwu Lake, and Presidential Palace (1912 District). These findings show that the approval rate of Beiji Ge Park (Jiming Temple) and Hongshan Forest Zoo was only high within Nanjing, while the Presidential Place (1912 District) was generally a check-in spot for non-local tourists. In terms of correlation degree, the core districts that were formed by the non-local tourist tourism flow had a higher degree of internal correlation than those that were formed by residents, although residents had a higher degree of internal correlation for the edge districts, and between the core members and edge members. This shows that the node combinations for non-local tourists were relatively stable, and their polarization was more serious. On the other hand, resident tourism routes have begun to develop in a diversified and individualized direction, with some niche scenic spots beginning to enter their horizons. In terms of spatial distribution, the core districts that were formed by non-local tourists in the tourism flow network showed a concentrated and contiguous distribution. Meanwhile, residents have just begun to break through the spatial contiguity, manifesting a spatial form of separation between the two groups. Therefore, the differences between the tourist groups should be considered when a tourism development strategy is formulated for differentiation, individualization, and diversification.  Figure 12. "Core-edge" model spatial distribution under three types of network context (overall, resident and, non-local tourist): (a) overall "core-edge" model of core city; (b) resident "core-edge" model of core city; (c) non-local "core-edge" model of core city.

Discussion
This study summarized the structural characteristics of the tourism flow network of 43 scenic spots in Nanjing city from three aspects: tourism flow network connection, node centrality, and communities. A comparative analysis revealed the tourism flow net structures that were formed by resident and non-local tourist behavior. The results are presented in Table 6. Table 6. Summary of tourism flow network structures under three context types (overall, resident and, non-local tourist).

Perspective
Overall Resident Non-Local Tourist

Network connection
Relatively good overall connectivity, and high spatial concentration and connection strength of core city nodes. Poor performance of suburban nodes.
High spatial concentration and connection strength of core city nodes. Relatively balanced on the whole, with emergent tendency of a gradually increasing connection between core city Spatial concentration and connection strength of core city nodes far higher than suburbs. Close connection between nodes with high spatial concentration Figure 12. "Core-edge" model spatial distribution under three types of network context (overall, resident and, non-local tourist): (a) overall "core-edge" model of core city; (b) resident "core-edge" model of core city; (c) non-local "core-edge" model of core city.

Discussion
This study summarized the structural characteristics of the tourism flow network of 43 scenic spots in Nanjing city from three aspects: tourism flow network connection, node centrality, and communities. A comparative analysis revealed the tourism flow net structures that were formed by resident and non-local tourist behavior. The results are presented in Table 6. Table 6. Summary of tourism flow network structures under three context types (overall, resident and, non-local tourist).

Analytical Perspective
Overall Resident Non-Local Tourist

Network connection
Relatively good overall connectivity, and high spatial concentration and connection strength of core city nodes. Poor performance of suburban nodes.
High spatial concentration and connection strength of core city nodes. Relatively balanced on the whole, with emergent tendency of a gradually increasing connection between core city and suburban nodes.
Spatial concentration and connection strength of core city nodes far higher than suburbs. Close connection between nodes with high spatial concentration and insufficient external connection.

Node centrality
Close mutual connection between popular nodes, no "bridging" nodes, less known nodes were marginalized, and relatively serious phenomenon of node polarization.
Connection between popular nodes relatively strong, with "bridging" nodes existing between them and marginal nodes. Influence of some popular nodes beginning to radiate outward to lesser-known nodes, and in a state of balanced development.
Concentration of popular nodes with very close internal connection, showing endogenous characteristics, weak external influence, and marginal nodes isolation.

Communities
Relatively clear spatial boundary between communities. The communities within the core city more closely connected, with some parts of the core absorbing suburban nodes. Most suburban communities attached to the communities of the core city, while individual nodes existed independently.
Obscure spatial boundary between communities. Community organization beginning to break down the spatial boundary of core districts, expanding to the whole region in a wedge shape. Attachment-type and independence-type communities gradually reduced.
Clear spatial boundary between communities. Communities in core city closely connected. Suburban communities marginalized and existing independently.
The tourism network structure in Nanjing city exhibits the characteristics of single center (core city), core-edge differentiation, and spatial agglomeration and diffusion. Furthermore, differences exist between the resident and non-local tourist characteristics. What are the reasons for the formation of such a tourism flow network structure in Nanjing? Generally speaking, at the urban scale, traffic flow is the main carrier of tourism flow. Therefore, traffic accessibility and travel mode must be the most directly influencing factors. Nanjing's road and rail traffic, for example, is radiated outward, from a central city layout; the density inside the core city road network is much higher than in other areas, meaning that the good traffic accessibility attracts tourists from all parts of the city. This directly caused the single center tourist flow network structure, which is located in the core city center. At the same time, due to the low accessibility of public transportation in the outer suburbs' scenic spots, road trips have become the main means of arrival. Additionally, local residents are more qualified to use this means of travel, which results in the tourism flow network of local residents virtually covering the whole city, while non-local tourists are mostly concentrated in the inner city of the central city.
Moreover, these differences are due to the combined influence of scenic spot characteristics and tourists forming the tourism flow network structure. Tourist spot characteristics (in addition to traffic factors) concretely involve the locations of scenic spots, grade, type differences (historical-cultural, natural landscape, indoors versus outdoors), and popularity (Internet heat). Furthermore, tourist characteristics include tourist source locations (local and non-local), residence, travel time (travel season, length of commute), travel motives, knowledge of and affection for destinations, information sources, revisit rate, and travel modes.
In light of the aforementioned conclusion, this study suggests that we should grasp the characteristics of the overall structure and nodes of the Nanjing city tourism flow network, and develop the entire region in unison. Efforts could include the following: (1) bringing nodes to maturity to form core tourism subdistricts; (2) energizing developing nodes; (3) devising specialized and themed tourism routes; (4) developing individualized ways of traveling for relatively independent nodes; (5) focusing on group differences among tourists and precisely positioning the audience groups for scenic spots; (6) executing accurate publicity.
The result of this study is the network structure of tourist flow represented by people flow. Therefore, it cannot reflect all of the characteristics of spatial structure, and different conclusions may be drawn by using other data, such as information flow, logistics and traffic flow. Secondly, the research uses mobile phone signaling data as the basic data, which is less authoritative than traditional official statistics; however, the characteristics of tourism flow networks are difficult to be realized using traditional data analysis. Although the data itself, and the processing process, will produce errors, the identification results were tested to ensure the randomness of the sampling, and can reflect the overall characteristics and rules. In addition, due to the length limit, this study focused on grasping the overall structural characteristics of the tourism flow network in Nanjing city and does not elaborate on the strategies used. Future studies should discuss in-depth methods to optimize tourism flow networks.