Next Article in Journal
Analyzing Spatiotemporal Patterns of Cultivated Land by Integrating Aggregation Degree and Omnidirectional Connectivity: A Case Study of Daqing City, China
Previous Article in Journal
Overview of “Landscape Research and Design for Urban and Peri-Urban Environments—2nd Edition”
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Inter-Provincial Similarities and Differences in Image Perception of High-Quality Tourism Destinations in China

1
Key Laboratory of Regional Sustainable Development Modeling, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
2
College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China
3
School of Architecture, Tsinghua University, Beijing 100084, China
*
Author to whom correspondence should be addressed.
Land 2025, 14(10), 1999; https://doi.org/10.3390/land14101999
Submission received: 9 September 2025 / Revised: 28 September 2025 / Accepted: 3 October 2025 / Published: 5 October 2025

Abstract

With the rapid development of China’s tourism industry, the homogenization of regional tourism images has become a growing concern. To address this, this study quantifies the similarities and differences in tourism image perception across China’s 31 provinces, focusing on 350 5A-level destinations, analyzing 757,046 tourist reviews collected from Ctrip.com in 2024. Using a three-dimensional framework (cognitive, affective, and overall image), we analyze social media data through natural language processing, random forest regression, and social network analysis. Key findings include the following: (1) most comments are positive, with Jiangsu and Chongqing showing high cognitive image similarity but low overall similarity; (2) cognitive image significantly impacts affective image, especially through unique tourism resources; (3) an inter-provincial similarity–difference matrix reveals significant perceptual differences among provinces. This study provides a novel methodological approach for multidimensional image evaluation and offers crucial empirical insights for regional policy-making aimed at optimizing land and tourism resource allocation, balancing regional disparities, and promoting sustainable land use and development across China.

1. Introduction

In recent years, China’s tourism industry has experienced rapid growth, establishing itself as one of the largest tourism markets globally. This trend mirrors the development in many countries where tourism plays a significant role in national economies, such as Cyprus and Malta, highlighting the universal importance of sustainable destination management. With continuous socioeconomic progress and rising living standards, tourism—often referred to as a “happiness industry”—has become an integral part of daily life. To ensure the sustainable development of the sector, the Chinese government has implemented a series of policies promoting high-quality tourism development. Notably, the 14th Five-Year Plan for Tourism Development explicitly calls for the transformation of the tourism industry towards a more sustainable and high-quality model, emphasizing optimized resource allocation, service enhancement, and spatial planning. This strategic direction aligns with global efforts to balance tourism growth with environmental and cultural preservation, providing a critical context for upgrading China’s tourism sector.
In China, tourism destinations are officially classified into five quality levels, from A to AAAAA (5A), based on rigorous criteria such as service quality and visitor appeal (mct.gov.cn). For the purpose of this study, we operationally define ‘high-quality tourism destinations’ as these national 5A-level sites. This operationalization is well justified for two primary reasons. First, these sites represent the pinnacle of tourism excellence in the country and serve as key drivers of regional tourism economies [1,2]. Second, due to their significant scale, comprehensive facilities, and immense influence, they often function as standalone destinations in their own right, attracting large volumes of both national and international tourists. Accordingly, to ensure a comprehensive and nationally representative analysis, our study includes 350 officially designated 5A-level destinations. However, the increasing number of 5A-level tourism destinations has led to concerns over homogenization, with many destinations exhibiting similar characteristics in terms of tourism resources and services. This not only diminishes the uniqueness of individual destinations but also reduces the overall diversity of the tourism experience [3].
Understanding the interplay between similarity and difference in tourism image perception is critical for sustainable regional development. On one hand, excessive similarity can lead to market saturation, limit tourist choices, and result in inefficient resource allocation [4]. On the other hand, stark differentiation, while enhancing individual competitiveness, may hinder regional branding and collaboration. Therefore, striking a strategic balance is essential for both coordinating regional tourism efforts and catering to diverse tourist preferences [5]. However, current tourism management is predominantly confined within provincial boundaries, creating a major obstacle to such coordination. Conducting large-scale research on cross-provincial tourism destinations is therefore essential for understanding inter-provincial relationships and promoting regional coordination [6]. By systematically quantifying similarities and differences in image perception, policymakers and tourism stakeholders can facilitate greater integration of tourism industries, optimize resource allocation, and foster balanced development [7]. This approach not only strengthens tourism branding and enhances regional image positioning but also removes spatial barriers, expands tourists’ choices, and promotes new forms of tourism, such as cross-provincial self-driving tours [8].
Image perception research is predominantly grounded in the “cognitive–affective” theory from psychology [9]. Baloglu first applied this framework to tourism destination image studies, proposing that image perception is a multidimensional construct comprising cognitive, affective, and overall dimensions. The cognitive image pertains to tourists’ functional and knowledge-based perceptions, while the affective image reflects their emotional responses to a destination. The overall image is a holistic synthesis of these two dimensions [10]. Particularly, the cognitive image exerts a significant influence on the affective image, because emotions are a key driver of tourist satisfaction, loyalty, and decision-making [11]. By examining the contribution of cognition to emotional perceptions, researchers and practitioners can gain valuable insights into the psychological mechanisms that underpin tourists’ attitudes and behaviors [12]. However, most existing studies have analyzed image perceptions from either a single or dual-dimensional perspective, lacking an integrated framework that fully explores the interrelationships between cognitive, affective, and overall image perceptions. Therefore, building on the dimensions of cognitive, affective, and overall perceptions, the introduction of research on the relationship between cognition and affective holds significant theoretical importance. It not only advances the study of tourism image perception but also expands the scope of the “cognitive–affective” theory.
The proliferation of user-generated content on online review platforms offers an unprecedentedly rich data source for understanding genuine tourist perceptions [13]. This paradigm shift in data availability calls for advanced computational methods. This study, therefore, leverages an integrated framework combining natural language processing (NLP) to analyze textual data, random forest regression to model the complex relationships between image dimensions, and social network analysis (SNA) to map inter-provincial similarities and differences [14,15,16]. This computational approach enables a more objective and nuanced analysis of tourism image on a large scale than has been possible with traditional methods.
Therefore, this study aims to address the aforementioned research gaps. The specific objectives are threefold: (1) investigate the multidimensional structure of tourism image perceptions (cognitive, affective, and overall) across China’s provinces; (2) explore the cognitive–affective relationship in tourism image perception, revealing how cognitive evaluations shape affective responses; (3) construct and analyze an inter-provincial similarity–difference matrix for high-quality tourism destinations. Theoretically, this study expands the tourism image perception theory by integrating the “cognitive–affective–overall image” three-dimensional framework through big data methods. Practically, this study establishes an inter-provincial tourism destination similarity–difference matrix, offering empirical evidence for regional tourism resource integration and high-quality development. It supports China’s tourism industry in advancing regional collaboration and achieving high-quality development.

2. Data and Methods

2.1. Research Framework

Employing Python 3.8-based analytical tools, the research processes online review data from 350 5A-level tourism destinations across 31 provincial administrative regions in China, excluding Hong Kong, Macau, and Taiwan (Figure 1). Utilizing SnowNLP, Latent Dirichlet Allocation (LDA), Jieba word segmentation, and keyword extraction, the investigation explores tourists’ perceptions of 5A-level tourism destination images within the cognitive–affective–overall framework. Then, random forest regression is employed to examine the relationship between cognitive and affective images. By integrating the aforementioned affective, cognitive, overall, and cognitive-to-affective image, a comprehensive similarity matrix is derived. This matrix emphasizes the static and local similarities among provinces. Meanwhile, the study utilizes the comprehensive similarity to derive the network relationships of image perception among provinces through SNA, employing edge weights to represent dynamic and global similarities. Finally, the inter-provincial similarity–difference matrix is generated by integrating the comprehensive similarity matrix and the network edge weight matrix.

2.2. Data

For this study, we collected tourist review data from Ctrip [17], one of China’s largest and most widely used online travel platforms. The data was programmatically crawled using Python, targeting all 350 5A-level tourism destinations across 31 provinces (excluding Hong Kong, Macao, and Taiwan). The platform’s extensive user base and large volume of user-generated content make it a highly valuable and representative data source for tourism perception studies [18]. It is important to note that these destinations are specific, officially designated scenic areas, not entire cities or regions. The detailed list is provided in Table S1, and their geographical distribution is shown in Figure 2. The list of 5A-level tourism destinations was obtained from the official directories published by provincial departments of culture and tourism (the statistical cutoff date is primarily concentrated in February 2024). It should be noted that there is not a one-to-one correspondence between 5A-level tourism destinations and those listed on the Ctrip platform. It is possible for one 5A-level destination to correspond to two Ctrip destinations and for several 5A-level destinations to correspond to one Ctrip destination. We have made every effort to collect the relevant data, but there are still nine destinations for which data is missing. Data collection was conducted from 28 November 2024 to 4 December 2024, covering a review period of ten years (16 October 2015 to 4 December 2024). In total, 757,046 original reviews were collected. To ensure data quality, the acquired reviews underwent a rigorous prepossessing procedure. This involved removing irrelevant columns; cleaning comment content by eliminating HTML tags, URL links, special symbols, and emoticons; as well as deleting empty or duplicate entries. After prepossessing, a total of 720,760 valid reviews remained for analysis. In this study, https://www.chiplot.online/(accessed on 4 December 2024) was utilized for chart creation.

2.3. Natural Language Processing

2.3.1. Sentiment Analysis

This study employs the SnowNLP library to assess the sentiment of the collected reviews. SnowNLP is an NLP tool based on Bayesian modeling, widely utilized for sentiment analysis in Chinese text, which aligns well with the objectives of this study [19]. For each review, the sentiment score is computed on a scale from 0 to 1, where 0 represents a highly negative sentiment and 1 represents a highly positive sentiment. Based on these scores, reviews are classified into three categories: positive (score > 0.75), neutral (0.25 ≤ score ≤ 0.75), and negative (score < 0.25). Additionally, the total count of reviews in each sentiment category is recorded, and an overall sentiment average is calculated.

2.3.2. Keyword Extraction

To extract keywords from the review data, this study utilizes Jieba for Chinese word segmentation and the Term Frequency–Inverse Document Frequency (TF-IDF) algorithm for keyword weighting [5,20]. The TF-IDF algorithm was specifically chosen because it effectively identifies words that are not only frequent within a review but also distinctive to it, which is crucial for capturing the unique characteristics of each destination’s image. For each review, the top five weighted keywords were extracted.
A keyword frequency table is then generated for each province by counting the occurrences of each keyword. The top 50 keywords for each province are manually categorized into six groups: tourist destinations, tourism facilities, tourism resources, tourists, affective vocabulary, and others. This classification aids researchers in focusing on the most relevant keywords. The detailed formulas for the TF-IDF calculation are available in the Supplementary Materials (Section S1).

2.3.3. Topic Modeling

To identify latent thematic structures in the reviews, this study applies the LDA topic modeling method [21]. LDA is suitable for research that requires automatic topic extraction and the discovery of hidden semantic structures from large volumes of text [22]. A crucial step is determining the optimal number of topics. To guide this, we trained models with 2 to 10 topics on the Beijing dataset, evaluating each with the topic coherence score (C_v). The analysis revealed two notable peaks: a significant initial peak at five topics (C_v = 0.4501) and the global maximum at eight topics. Although the eight-topic model scored slightly higher, it produced more granular and overlapping themes. For the purpose of a focused and streamlined provincial-level analysis, we selected the five-topic solution. This choice captures the most essential, distinct, and managerially relevant themes, representing the best balance between model coherence and thematic clarity for the scope of our study. This approach enables multi-processing, lexical prepossessing, word frequency matrix construction, topic extraction, and label generation. LDA facilitates the identification of overarching themes within the dataset while allowing for detailed keyword analysis at a micro-level, thereby refining the extracted thematic content.

2.4. Random Forest Regression

To analyze the relationship between keywords and sentiment scores, this study employs random forest regression, a machine learning algorithm capable of capturing complex, non-linear relationships through ensemble learning [23]. This model is particularly effective in handling high-dimensional and sparse textual data while assessing the influence of individual variables on sentiment scores [24]. Compared to traditional linear regression models, random forest provides superior predictive accuracy and adaptability to intricate data patterns.
The modeling process followed a standard machine learning workflow. First, the textual keyword data was pre-processed and transformed into a numerical feature matrix. The dataset was then partitioned into training and testing sets to build and validate the model. Subsequently, the random forest model was trained, and its hyperparameters were optimized to maximize performance. Finally, the model’s predictive accuracy was evaluated using Mean Squared Error (MSE) and R2 scores, and the feature importance values were extracted to identify the most influential keywords affecting sentiment.
All technical implementation details, including the specific Python libraries and functions used, the vectorization parameters, the dataset split ratio, and the hyperparameter search space, are documented in the Supplementary Materials (Section S2).

2.5. Cosine Similarity

This study employs Cosine Similarity to measure the similarity of different variables across provinces. Cosine Similarity is a widely used metric that evaluates the angle between two vectors, effectively quantifying the degree of similarity between provincial feature sets [25]. The calculation is as follows:
Given two vectors A and B (which represent the feature vectors of two provinces), the Cosine Similarity is defined as
C o s i n e S i m ( A , B ) = A B A B
where
A B is the dot product of vectors A and B, calculated as
A B = i = 1 n A i B i
A and B are the Euclidean norms of the vectors A and B, respectively, calculated as
A = i = 1 n A i 2
B = i = 1 n B i 2
A similarity value closer to 1 indicates higher similarity between two provinces, whereas values closer to 0 indicate greater differences.

2.6. A Network Analysis of Inter-Provincial Similarities and Differences in High-Quality Tourism Destinations

To systematically analyze inter-provincial similarities and differences in high-quality tourism destinations, this study focuses on image perception among provinces. Initially, a mean analysis is conducted on affective, cognitive, overall, and cognitive-to-affective image similarities to derive a comprehensive image similarity matrix for each province.
Subsequently, SNA is employed to construct a network based on image similarity [26]. In this network, nodes represent individual provinces, while edges represent the strength of the connection between two nodes. Edge weights not only convey networked expression of similarity but are also influenced by the overall structure of the network. For instance, if a province exhibits high similarity with multiple other provinces, its centrality within the network (such as degree centrality or betweenness centrality) may be elevated, thereby impacting the entire network’s architecture.
Finally, based on the comprehensive similarity matrix and the edge weight matrix derived from SNA, the inter-provincial similarity–difference matrix is formed. The natural breaks method is employed to categorize inter-provincial relationships into five distinct classes:
Category I: High similarity–high edge weight, high similarity–medium edge weight, medium similarity–high edge weight.
Category II: Medium similarity–medium edge weight.
Category III: Low similarity–medium edge weight, low similarity–high edge weight.
Category IV: Medium similarity–low edge weight, high similarity–low edge weight.
Category V: Low similarity–low edge weight.
The integration of similarity and edge weights offers the advantage of capturing both the static local similarities and the dynamic global similarities among provinces, providing a more comprehensive analytical perspective. This approach allows for the meticulous categorization of inter-provincial relationships, facilitating the formulation of precise strategies.

3. Results

3.1. Affective Image Analysis

The affective tone of comments for high-quality tourism destinations in most provinces is predominantly positive, with the proportion of positive comments generally exceeding 70% (Figure 3). Among these, Shanghai (85.06%), Hainan (84.96%), and Ningxia (84.79%) report the highest proportions of positive comments, while Guizhou has the lowest (68.16%). Provinces with higher proportions of negative comments include Guizhou (19.24%), Yunnan (15.73%), and Shanxi (15.69%), while Ningxia exhibits the lowest proportion of negative comments (6.92%) (Table 1). In terms of average scores, Guizhou (0.735) falls within the neutral range, while all other provinces score positively. Ningxia has the highest average score (0.878), followed by Hainan (0.877), Shanghai (0.875), and Beijing (0.874).

3.2. Cognitive Image Analysis

Cognitive image similarity among provinces is generally low (Figure 4a). However, Jiangsu, Chongqing, Hubei, and Henan demonstrate relatively high cognitive image similarity. Shanghai, in contrast, shows no cognitive image similarity with other provinces, suggesting substantial differences in its tourism experiences compared to other regions. The top 50 keywords for each province were categorized into six groups: tourist destinations, tourism facilities, tourism resources, tourists, affective vocabulary, and others (Table S2). Among these, tourism resources are the most diverse, reflecting tourists’ varied preferences for destinations and resources across different provinces. Regarding tourism facilities, tourists show strong recognition of convenient transportation options such as “sightseeing bus,” “cable car,” “electric cart,” and “boardwalk,” as well as supporting amenities like “tour guide,” “commentary,” and “commentator.” In terms of tourists, provinces like Shanghai, Jilin, Ningxia, Shandong, Guangdong, Henan, and Liaoning emphasize the importance of child-friendly features in tourism destinations. From the affective vocabulary, it is evident that tourists across provinces generally hold positive evaluations of tourism. However, cognitive differences exist among provinces concerning tourism resources, facilities, and affective vocabulary, with smaller differences in the perception of tourism resources (Figure 5).

3.3. Overall Image Analysis

The overall image similarity across most provinces is relatively low (Figure 4b). Provinces like Shanghai, Anhui, Guangxi, Zhejiang, and Guizhou show no overall image similarity with other regions. However, higher similarity is observed between Liaoning and Heilongjiang, Hainan and Inner Mongolia, and Hainan and Liaoning, suggesting shared characteristics in tourism resources or cultural features. The tourism labels of each province exhibit high diversity in overall image (Table 2). For example, natural landscapes include Yunnan’s “hot springs,” Xinjiang’s “grand canyon/Keketuohai,” and Qinghai’s “saltwater lakes.” Historical culture examples include Beijing’s “Hall of Prayer for Good Harvests/ancient China” and Shandong’s “Confucius Mansion.” Religious and spiritual culture is represented by Xizang’s “monasteries/Tibetan Buddhism” and “devotion.” Modern cities and service facilities are exemplified by Shanghai’s “the Bund/Lujiazui” and “education/science popularization.” These labels reveal unique characteristics of tourism resources across provinces and highlight tourists’ diverse needs for natural, cultural, and service-related experiences, offering a strong foundation for targeted marketing and resource optimization.

3.4. Analysis of the Relationship Between Cognitive and Affective Image

The results of the random forest regression model reveal the influence of cognitive images on the affective image of tourism destinations (Table 3). Specific iconic attractions in certain provinces significantly impact affective images, such as “Lijiang” in Yunnan and “Pingyao” in Shanxi. Additionally, general terms like “scenic area,” “scenery,” and “attractions” also contribute substantially to affective images. Furthermore, regional cultural and tourism characteristics influence affective cognition. For example, Tianjin’s “jianbing guozi” (a local snack) symbolizes the personalized impact of cognitive images on affective images. The frequent appearance of the term “ID card” in comments about Shanghai and Beijing warrants attention from scenic area managers. Conversely, negative terms such as “queuing” appear in comments from over 20 provinces, highlighting overcrowding issues in high-quality tourism destinations. Notably, Shanghai, besides “queuing,” also features terms like “too many people,” which strongly affect tourists’ affective perceptions. The relationship between cognitive and affective images is relatively low in similarity across provinces, with the most significant contrast observed between Shanxi and Shaanxi (Figure 4c).

3.5. Inter-Provincial Similarities and Differences in High-Quality Tourism Destinations

This section explores the similarities and differences among high-quality tourism destinations across provinces through the analysis of a comprehensive similarity matrix, network edge weight matrix, and inter-provincial similarity–difference matrix. First, the comprehensive similarity matrix illustrates all the four similarities in image perception among provinces (Figure 6a). The results show that the similarity between most provinces is relatively low (>0.2). Notably, the similarity between Jiangsu, Henan, Hubei, and Chongqing exceeds 0.5, indicating high consistency in image perception. Second, the analysis of network edge weights unveils the dynamic and global similarities among various provinces (Figure 6b). Most edge weights between provinces range from 0.2 to 0.6. Provinces such as Jilin and Beijing, Shaanxi and Jilin, Gansu and Jilin, Gansu and Shaanxi, and Gansu and Beijing exhibit values greater than 0.5. In contrast, provinces like Guizhou and Xinjiang, and Guizhou and Guangxi show lower edge weights.
Finally, by combining the comprehensive similarity matrix and the network edge weight matrix, an inter-provincial similarity–difference matrix is formed (Figure 6c). The analysis indicates that high-quality tourism destinations’ image perceptions in China exhibit significant differences, with 276 pairs in Category V, 84 pairs in Category IV, 84 pairs in Category III, and 20 pairs in Category II. Only Beijing and Shaanxi fall into Category I. Category I includes provinces with high or middle comprehensive similarity and strong network connections (high or middle edge weights). Category II consists of provinces with middle comprehensive similarity and intermediate network connections (middle edge weights). Category III encompasses provinces with low comprehensive similarity but moderate-to-strong network connections (middle-to-high edge weights). Category IV involves provinces with high or middle comprehensive similarity but weak network connections (low edge weights). Finally, Category V comprises provinces with low comprehensive similarity and weak network connections (low edge weights). This classification provides a systematic understanding of inter-provincial image perception of high-quality tourism destinations dynamics, supporting targeted strategies for regional development and collaboration.

4. Discussion

4.1. Interpreting the Patterns of High-Quality Tourism Destination Images: Homogeneity in Diversity

Our analysis reveals a complex geography of tourism image perception in China, characterized by both distinctiveness and convergence. While the predominantly positive affective image across most provinces indicates a general satisfaction with high-quality destinations, the underlying cognitive patterns show significant variations that warrant deeper discussion [27].
For instance, Guizhou’s relatively lower affective image, despite its rich cultural and natural assets, highlights a potential gap between destination resource quality and the perceived service experience. This finding aligns with a body of literature arguing that tangible service elements are critical in translating destination attributes into positive tourist emotions [28]. Our result thus confirms that even for resource-rich destinations, perceived service deficiencies can significantly dampen affective responses [29]. Conversely, Shanghai’s distinct urban tourism experience is consistent with established research on mega-city tourism that identifies modernity, commercial culture, and multiculturalism as key perceptual drivers [30,31]. Our study extends this literature by quantitatively demonstrating how this unique cognitive profile creates a distinct overall image at a national comparative scale, reinforcing the idea that a well-rounded combination of resources, rather than a single core attraction, enhances destination image [32].
Perhaps most intriguingly, the similarities observed between geographically distant regions, such as Hainan and Northeast China, challenge a purely static, geography-based view of image formation. This pattern can be largely attributed to the well-documented phenomenon of seasonal migration in China [33], a trend also fueled by developments in niche markets like ice–snow tourism [34]. Our finding contributes a new dimension to this research by showing that these mobility patterns do not merely reflect travel behavior but actively construct a shared cognitive image across disparate regions. This suggests that social networks and tourist flows are powerful, dynamic forces in shaping destination perception, a notion that echoes recent calls for a more relational understanding of place branding [35].

4.2. Deconstructing the Cognitive–Affective Linkage: Insights from Machine Learning

Our use of random forest regression offers a nuanced view of the intricate relationship between cognitive and affective images, a central theme in tourism image research [36,37]. The model’s relatively low R2 value, while indicating the inherent unpredictability of human emotion, is a finding consistent with other studies in computational social science that attempt to model sentiment from complex, user-generated textual data [38,39]. As such, the model’s primary value lies not in its predictive precision but in its explanatory power to identify significant influencing factors.
One of the key insights is the high feature importance of highly specific, iconic cultural symbols. The prominence of a local snack in shaping a destination’s affective image, for example, provides strong empirical support for the growing body of literature on gastronomy tourism and symbolic consumption [40]. Our finding extends this research by demonstrating that the cognitive–affective link is not just driven by general attributes but can be powerfully anchored to micro-level, tangible cultural icons. This suggests that symbolic consumption plays a more granular and emotionally resonant role in destination image formation than previously understood.
Furthermore, the significant negative impact of operational keywords related to service friction (e.g., queuing) on affective scores confirms a large body of service management literature identifying such friction points as key drivers of customer dissatisfaction [41,42]. The primary contribution of our study here is methodological: we demonstrate the use of a machine learning approach to quantify these effects from unstructured user-generated data at a large scale. This offers a new, scalable method for continuously monitoring and diagnosing sources of negative affect in the tourist experience, moving beyond traditional survey-based methods.

4.3. Implications for Tourism Image Theory

Collectively, our findings have several important implications for the theoretical understanding of tourism image. First, by showing how inter-provincial destinations can create image homogeneity, we propose that destination image models should incorporate network effects, moving beyond the traditional focus on a destination’s isolated, intrinsic attributes. This calls for a more dynamic and relational perspective on how regional image systems are formed and evolve.
Second, our machine learning approach reveals the granular nature of the cognitive–affective link [37]. This suggests that the theory could be refined by developing a typology of cognitive cues and examining their differential impacts on emotion. For instance, the emotional resonance of an iconic cultural symbol may function very differently from that of a functional service attribute.
Conceptually, our study contributes by framing destination image not as a static property of a place but as a relational and emergent outcome of tourist–destination interactions, inter-regional mobility, and the symbolic meanings attached to micro-level cultural elements. This perspective enriches the classic cognitive–affective framework [28] by adding layers of social network dynamics and symbolic interactionism.

5. Recommendations for Inter-Provincial Tourism Management and Development Under the Similarity–Difference Matrix

Comprehensive image similarity measures the degree of resemblance between two provinces as high-quality tourism destinations in terms of tourists’ affective, cognitive, and overall perceptions, as well as the cognitive influence on affective image perception. It reflects the commonalities between pairs of provinces. Network edge weights gauge the strength of connections between two provinces within the network, indicating their level of interaction and significance in the global network context. Integrating the two metrics balances local similarity and global network structure, providing a comprehensive analytical perspective. This integration not only advances the scientific and practical value of the research but also offers robust support for tourism collaboration and resource optimization.
Category I (High Similarity–High Edge Weight, High Similarity–Medium Edge Weight, Medium Similarity–High Edge Weight): Strong Synergistic Relationship, Suitable for Deep Collaboration
This relationship indicates significant commonalities and a solid foundation for close cooperation between the provinces. For instance, Beijing and Shaanxi, as cultural and historical tourism destinations, can deepen their collaboration through joint promotion of cultural heritage and the development of cross-provincial tourism routes, thereby enhancing their appeal to international tourists. Specific strategies include jointly building an international tourism brand, optimizing cross-provincial tourism service experience, and developing in-depth cultural tourism products.
Category II (Medium Similarity–Medium Edge Weight): Moderate Synergistic Relationship, Suitable for Gradual Promotion of Collaboration
This relationship suggests potential for cooperation, though it requires further development. For example, Shaanxi and Sichuan could gradually strengthen their collaboration through joint marketing initiatives (e.g., culinary tours) and resource sharing. Specific strategies include highlighting regional characteristics and launching joint marketing, promoting resource sharing and improving tourist mobility, and seeking policy support and improving infrastructure.
Category III (Medium Similarity–Low Edge Weight, High Similarity–Low Edge Weight): Potential Synergistic Relationship, Requiring Enhanced Collaboration
These inter-provincial relationships, despite their high comprehensive similarity, exhibit limited connectivity with other provinces within the network. This relationship indicates significant resource complementary between them, rather than direct competition. For instance, Fujian (coastal) and Jiangxi (mountainous) could jointly develop themed tourism routes, such as “Mountain–Sea Journeys,” leveraging their respective resource advantages to attract tourists. Specific strategies include developing complementary themed tourism routes, innovating collaboration models to enhance appeal, and promoting public–private partnerships to ensure project sustainability.
Category IV (Low Similarity–Medium Edge Weight, Low Similarity–High Edge Weight): Complementary Relationship, Suitable for Differentiated Collaboration
These provinces exhibit low comprehensive similarity but high edge weights, indicating untapped collaborative potential. For example, Jiangsu and Zhejiang could gradually strengthen their cooperation through joint promotional activities and pilot projects. Specific strategies include enhancing interaction and launching joint promotions, initiating pilot projects to explore new models, and leveraging existing tourist resources to extend travel experiences.
Category V (Low Similarity–Low Edge Weight): Independent Relationship, Suitable for Autonomous Development
This relationship suggests that the tourism markets of these provinces are relatively independent, with limited potential for collaboration. However, each province possesses unique tourism resources and development potential. For instance, Yunnan (adventure tourism) and Guangxi (wellness tourism) can focus on developing niche tourism markets with differentiated positioning. Specific strategies include deepening niche markets and building unique brands, establishing initial collaborative links and exploring potential synergies, and formulating long-term development plans to gradually enhance competitiveness.
These strategies not only promote the collaborative development of inter-provincial tourism destinations but also optimize regional tourism resources, leading to higher development levels. By tailoring recommendations to the unique characteristics of each category, this study provides a nuanced framework for enhancing tourism efficiency and fostering sustainable growth.

6. Conclusions

This study offers an innovative approach to constructing a multidimensional image perception model, integrating cognitive, affective, and overall image dimensions. By applying this model to create an evaluation framework for inter-provincial tourism destination similarities and differences, this research addresses an underexplored area in the literature concerning large-scale regional image perception.
First, the study systematically analyzes the image perception of high-quality tourism destinations in China. Our findings revealed a complex perceptual landscape: while most provinces enjoy a predominantly positive affective image, significant variations exist. The cognitive and overall images demonstrated high degrees of uniqueness, while also revealing patterns of homogeneity driven by factors like geographical proximity and tourist mobility.
Second, by exploring the cognitive–affective relationship, our analysis identified the specific attributes most influential in shaping tourist emotions. The findings highlight the significant impact of not only iconic attractions and general scenic terms but also micro-level cultural symbols like local foods and crucial service friction points.
Finally, the study successfully developed a novel matrix. This analytical tool, integrating both static similarity and dynamic network connections, classified inter-provincial relationships into five distinct categories, revealing that most provinces exhibit weaker or more complex relationships.
From a theoretical perspective, this study contributes by extending the traditional cognitive–affective theory with a more dynamic and relational viewpoint, demonstrating how network effects shape destination images. It also refines the theory by revealing the granular and symbolic nature of the cognitive–affective link, highlighting the powerful role of specific cultural icons in emotional formation.
From a practical perspective, this study provides several actionable insights. For example, our findings suggest that destination managers in provinces with similar cognitive but different affective images (a distinction made possible by our matrix) should prioritize emotion-focused storytelling in their branding. Furthermore, the identification of specific service friction points like ‘queuing’ provides clear, data-driven priorities for operational improvements to enhance the tourist experience. The similarity–difference matrix itself serves as a strategic tool for identifying optimal partners for regional tourism collaboration.

7. Limitations and Further Research

This study primarily focuses on the tourism image perception and explores the similarities and differences between high-quality tourism destinations across provinces based on big data. However, there are still areas that could be further improved. From a data perspective, this study relies solely on big data for analysis. Future research could integrate small data and field data, such as surveys and interviews, to further validate and supplement the findings from big data analysis [43]. Methodologically, text processing primarily focuses on word cleaning in this study. Although part-of-speech (POS) tagging was not performed, this omission did not significantly impact the overall analysis results, as the research objectives emphasize statistical features or thematic distributions at the word level rather than in-depth syntactic analysis [44]. Future research could consider incorporating POS tagging techniques to enhance the granularity of text processing [45]. Theoretically, this study only explores the relationship between affective and cognitive images in depth. Future research could quantify the complex network relationships among affective, cognitive, and overall images, building a more detailed theoretical framework [46]. Building on these points, future inquiry could also expand by conducting comparative cross-country studies, employing longitudinal analyses to track image evolution, and testing other advanced machine learning models, all of which would contribute to a deeper and more dynamic understanding of tourism image perception.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/land14101999/s1. S1: Calculation details for TF-IDF. S2: Implementation details for random forest regression model.

Author Contributions

Conceptualization, W.Z. and J.L.; methodology, W.Z.; software, W.Z.; validation, W.Z.; formal analysis, W.Z.; investigation, W.Z.; resources, W.Z.; data curation, W.Z.; writing—original draft preparation, W.Z.; writing—review and editing, W.Z., J.L., H.Z., F.L., Z.Z. and R.Z.; visualization, W.Z.; supervision, J.L.; project administration, J.L.; funding acquisition, J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China, grant number 2024YFF0809303.

Data Availability Statement

If data is needed, the author can be asked.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Ma, H.; Zou, J. Impacts of Official High-Standard Scenic Spots on Environment and Growth—Evidence from China’s 5A Scenic Spots at the City Level. Ecol. Econ. 2022, 201, 107555. [Google Scholar] [CrossRef]
  2. Xu, Q.; Cheng, X.; Zhao, H. Does the Selection of High-Quality Scenic Spots Promote the Growth of Tourism Economy? Evidence from China’s 5A-Rated Tourist Attractions. PLoS ONE 2024, 19, e0304108. [Google Scholar] [CrossRef]
  3. Lu, M.; Duan, Y.; Wu, X. Evaluation of the Coupling and Coordination Degree of Eco-Cultural Tourism System in the Jiangsu-Zhejiang-Shanghai-Anhui Region. Ecol. Indic. 2023, 156, 111180. [Google Scholar] [CrossRef]
  4. Liu, F.; Zhang, J.; Zhang, J.; Chen, D.; Liu, Z.; Lu, S. Roles and Functions of Tourism Destinations in Tourism Region of South Anhui: A Tourist Flow Network Perspective. Chin. Geogr. Sci. 2012, 22, 755–764. [Google Scholar] [CrossRef]
  5. Zhang, M.; Di, Q.; Liu, Y. Perceived Differences in Coastal Tourism Image under Tourist Experience-IPA Analysis Based on UGC Data of 12 Coastal Cities. PLoS ONE 2024, 19, e0299431. [Google Scholar] [CrossRef]
  6. Liu, T.; Yu, L.; Liu, J.; Xu, Z.; Chen, X.; Wu, H.; Zheng, S.; Zhao, Q.; Gong, P. Unveiling Interprovincial Geographic Patterns of 5A-Level Tourism Cultural Ecosystem Service Flows and Tourist Preferences in China’s Metacoupled Systems. Appl. Geogr. 2024, 172, 103432. [Google Scholar] [CrossRef]
  7. Zhang, X.; Pang, X.; Wen, X.; Wang, F.; Li, C.; Zhu, M. TriPlan: An Interactive Visual Analytics Approach for Better Tourism Route Planning. J. Vis. 2023, 26, 231–248. [Google Scholar] [CrossRef] [PubMed]
  8. Huang, Z.; Cao, F.; Jin, C.; Yu, Z.; Huang, R. Carbon Emission Flow from Self-Driving Tours and Its Spatial Relationship with Scenic Spots—A Traffic-Related Big Data Method. J. Clean. Prod. 2017, 142, 946–955. [Google Scholar] [CrossRef]
  9. Hou, Z.; Cui, F.; Meng, Y.; Lian, T.; Yu, C. Opinion Mining from Online Travel Reviews: A Comparative Analysis of Chinese Major OTAs Using Semantic Association Analysis. Tour. Manag. 2019, 74, 276–289. [Google Scholar] [CrossRef]
  10. Baloglu, S.; McCleary, K.W. A Model of Destination Image Formation. Ann. Tour. Res. 1999, 26, 868–897. [Google Scholar] [CrossRef]
  11. Wang, X.; Qin, X.; Zhou, Y. A Comparative Study of Relative Roles and Sequences of Cognitive and Affective Attitudes on Tourists’ pro-Environmental Behavioral Intention. J. Sustain. Tour. 2020, 28, 727–746. [Google Scholar] [CrossRef]
  12. Hadinejad, A.; Gardiner, S.; Kralj, A.; Scott, N.; Moyle, B.D. Cognition, Metacognition and Attitude to Tourism Destinations: The Impact of Emotional Arousal and Source Credibility. J. Hosp. Tour. Manag. 2022, 51, 502–511. [Google Scholar] [CrossRef]
  13. Ghermandi, A.; Sinclair, M. Passive Crowdsourcing of Social Media in Environmental Research: A Systematic Map. Glob. Environ. Change 2019, 55, 36–47. [Google Scholar] [CrossRef]
  14. Álvarez-Carmona, M.Á.; Aranda, R.; Rodríguez-Gonzalez, A.Y.; Fajardo-Delgado, D.; Sánchez, M.G.; Pérez-Espinosa, H.; Martínez-Miranda, J.; Guerrero-Rodríguez, R.; Bustio-Martínez, L.; Díaz-Pacheco, Á. Natural Language Processing Applied to Tourism Research: A Systematic Review and Future Research Directions. J. King Saud Univ.-Comput. Inf. Sci. 2022, 34, 10125–10144. [Google Scholar] [CrossRef]
  15. Teles Da Mota, V.; Pickering, C. Using Social Media to Assess Nature-Based Tourism: Current Research and Future Trends. J. Outdoor Recreat. Tour. 2020, 30, 100295. [Google Scholar] [CrossRef]
  16. Confente, I.; Mazzoli, V.; Camatti, N.; Bertocchi, D. Integrating Tourists’ Walk and Talk: A Methodological Approach for Tracking and Analysing Tourists’ Real Behaviours for More Sustainable Destinations. J. Sustain. Tour. 2024, 32, 2323–2343. [Google Scholar] [CrossRef]
  17. Ctrip.com. Available online: https://www.ctrip.com/ (accessed on 4 December 2024).
  18. Cheng, Y.; Zhao, B.; Peng, S.; Li, K.; Yin, Y.; Zhang, J. Effects of Cultural Landscape Service Features in National Forest Parks on Visitors’ Sentiments: A Nationwide Social Media-Based Analysis in China. Ecosyst. Serv. 2024, 67, 101614. [Google Scholar] [CrossRef]
  19. Wang, J.; Xia, Y.; Wu, Y. Sensing Tourist Distributions and Their Sentiment Variations Using Social Media: Evidence from 5A Scenic Areas in China. ISPRS Int. J. Geo-Inf. 2022, 11, 492. [Google Scholar] [CrossRef]
  20. Yuan, Z.; Jia, G. Systematic Investigation of Keywords Selection and Processing Strategy on Search Engine Forecasting: A Case of Tourist Volume in Beijing. Inf. Technol. Tour. 2022, 24, 547–580. [Google Scholar] [CrossRef]
  21. Taecharungroj, V.; Mathayomchan, B. Traveller-generated Destination Image: Analysing Flickr Photos of 193 Countries Worldwide. Int. J. Tour. Res. 2021, 23, 417–441. [Google Scholar] [CrossRef]
  22. Canoy, N.A.; Roxas, G.K.T.; Robles, A.M.Q.; Alingasa, A.P.T.; Ceperiano, A.M. From Cesspool to Fortified Paradise: Analyzing News Media Territorial Assemblages of Rehabilitating Boracay Island, Western Philippines. J. Sustain. Tour. 2020, 28, 1138–1157. [Google Scholar] [CrossRef]
  23. Grömping, U. Variable Importance Assessment in Regression: Linear Regression versus Random Forest. Am. Stat. 2009, 63, 308–319. [Google Scholar] [CrossRef]
  24. Chen, X.; Ding, H.; Fang, S.; Chen, W. Predicting the Success of Internet Social Welfare Crowdfunding Based on Text Information. Appl. Sci. 2022, 12, 1572. [Google Scholar] [CrossRef]
  25. Alsudais, A.; Alotaibi, W.; Alomary, F. Similarities between Arabic Dialects: Investigating Geographical Proximity. Inf. Process. Manag. 2022, 59, 102770. [Google Scholar] [CrossRef]
  26. Li, D.; Wei, Y.D.; Wang, T. Spatial and Temporal Evolution of Urban Innovation Network in China. Habitat Int. 2015, 49, 484–496. [Google Scholar] [CrossRef]
  27. Li, F.; Zhang, S.; Liu, J.; Jiang, L.; Gao, C. Comparative Study on Tourism Image Perception Between Beijing-Tianjin-Hebei and Yangtze River Delta Agglomerations. Econ. Geogr. 2023, 43, 194–208. [Google Scholar]
  28. Tang, H.; Ma, Y.; Ren, J. Influencing Factors and Mechanism of Tourists’ pro-Environmental Behavior—Empirical Analysis of the CAC-MOA Integration Model. Front. Psychol. 2022, 13, 1060404. [Google Scholar] [CrossRef] [PubMed]
  29. Yin, J.; Wei, D.; Qiu, Y.; Xinyuan, L.; Zhang, T. Strategies for Enhancing Tourism Efficiency in Guizhou, China: Based on Spatiotemporal Dynamic Analysis and Driving Force Decomposition. Environ. Dev. Sustain. 2024. [Google Scholar] [CrossRef]
  30. Lin, W.; Li, Y.; Li, X.; Xu, D. The Dynamic Analysis and Evaluation on Tourist Ecological Footprint of City: Take Shanghai as an Instance. Sustain. Cities Soc. 2018, 37, 541–549. [Google Scholar] [CrossRef]
  31. Mou, N.; Yuan, R.; Yang, T.; Zhang, H.; Tang, J.; Makkonen, T. Exploring Spatio-Temporal Changes of City Inbound Tourism Flow: The Case of Shanghai, China. Tour. Manag. 2020, 76, 103955. [Google Scholar] [CrossRef]
  32. Zhang, M.; Zhao, L.; Zhang, Y.; Liu, Y.; Luo, N. Effects of Destination Resource Combination on Tourist Perceived Value: In the Context of Chinese Ancient Towns. Tour. Manag. Perspect. 2021, 40, 100899. [Google Scholar] [CrossRef]
  33. Wang, Z. Social Inequality among Elderly Individuals Caused by Climate Change: Evidence from the Migratory Elderly of Mainland China. J. Environ. Manag. 2020, 272, 111079. [Google Scholar] [CrossRef]
  34. Jin, S.; Yang, J.; Wang, E.; Liu, J. The Influence of High-Speed Rail on Ice–Snow Tourism in Northeastern China. Tour. Manag. 2020, 78, 104070. [Google Scholar] [CrossRef]
  35. Jeong, Y.; Kim, S.-K.; Yu, J.-G. Sustaining Sporting Destinations through Improving Tourists’ Mental and Physical Health in the Tourism Environment: The Case of Korea. Int. J. Environ. Res. Public. Health 2019, 17, 122. [Google Scholar] [CrossRef]
  36. Mehran, J.; Olya, H.G. Canal Boat Tourism: Application of Complexity Theory. J. Retail. Consum. Serv. 2020, 53, 101954. [Google Scholar] [CrossRef]
  37. Woosnam, K.M.; Stylidis, D.; Ivkov, M. Explaining Conative Destination Image through Cognitive and Affective Destination Image and Emotional Solidarity with Residents. J. Sustain. Tour. 2020, 28, 917–935. [Google Scholar] [CrossRef]
  38. Ozili, P.K. The Acceptable R-Square in Empirical Modelling for Social Science Research. SSRN Electron. J. 2022. [Google Scholar] [CrossRef]
  39. Wankhade, M.; Rao, A.C.S.; Kulkarni, C. A Survey on Sentiment Analysis Methods, Applications, and Challenges. Artif. Intell. Rev. 2022, 55, 5731–5780. [Google Scholar] [CrossRef]
  40. Yeboah, R.; Ashie, D. Promotion of Local Food as Tourism Product in the Cape Coast Metropolis: An Explorative Research. Heliyon 2024, 10, e40950. [Google Scholar] [CrossRef]
  41. Tian, L.; Zhong, H. Study on Forming Factors and Affecting Paths of Tourist Crowding Events. Hum. Geogr. 2021, 36, 159–167. [Google Scholar] [CrossRef]
  42. Han, Y.; Zhang, T.; Wang, M. Holiday Travel Behavior Analysis and Empirical Study with Integrated Travel Reservation Information Usage. Transp. Res. Part Policy Pract. 2020, 134, 130–151. [Google Scholar] [CrossRef]
  43. Qu, Z.; Cao, X.; Ge, H.; Liu, Y. How Does National Image Affect Tourists’ Civilized Tourism Behavior? The Mediating Role of Psychological Ownership. J. Hosp. Tour. Manag. 2021, 47, 468–475. [Google Scholar] [CrossRef]
  44. Serrano, L.; Ariza-Montes, A.; Nader, M.; Sianes, A.; Law, R. Exploring Preferences and Sustainable Attitudes of Airbnb Green Users in the Review Comments and Ratings: A Text Mining Approach. J. Sustain. Tour. 2021, 29, 1134–1152. [Google Scholar] [CrossRef]
  45. Manning, C.; Surdeanu, M.; Bauer, J.; Finkel, J.; Bethard, S.; McClosky, D. The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Baltimore, Maryland, 23–24 June 2014; pp. 55–60. [Google Scholar]
  46. Sharma, P.; Nayak, J.K. Do Tourists’ Emotional Experiences Influence Images and Intentions in Yoga Tourism? Tour. Rev. 2019, 74, 646–665. [Google Scholar] [CrossRef]
Figure 1. Technical framework.
Figure 1. Technical framework.
Land 14 01999 g001
Figure 2. Geographical distribution of the 350 5A-level tourism destinations across China included in this study (image source: Ctrip).
Figure 2. Geographical distribution of the 350 5A-level tourism destinations across China included in this study (image source: Ctrip).
Land 14 01999 g002
Figure 3. Percentage of positive, neutral, and negative comments across provinces.
Figure 3. Percentage of positive, neutral, and negative comments across provinces.
Land 14 01999 g003
Figure 4. (a) Similarity of top 50 keywords across provinces. (b) Similarity of theme labels across provinces. (c) Similarity of top 50 keywords contributing the most to affective scores across provinces. A value closer to 1 indicates higher similarity, while a value closer to 0 indicates lower similarity.
Figure 4. (a) Similarity of top 50 keywords across provinces. (b) Similarity of theme labels across provinces. (c) Similarity of top 50 keywords contributing the most to affective scores across provinces. A value closer to 1 indicates higher similarity, while a value closer to 0 indicates lower similarity.
Land 14 01999 g004aLand 14 01999 g004b
Figure 5. Classified similarity of top 50 keywords across provinces. A value closer to 1 indicates higher similarity, while a value closer to 0 indicates lower similarity. (a) Tourism facilities, (b) tourism resources, (c) affective vocabulary, (d) others.
Figure 5. Classified similarity of top 50 keywords across provinces. A value closer to 1 indicates higher similarity, while a value closer to 0 indicates lower similarity. (a) Tourism facilities, (b) tourism resources, (c) affective vocabulary, (d) others.
Land 14 01999 g005aLand 14 01999 g005b
Figure 6. (a) Comprehensive similarity matrix, (b) network edge weight matrix, and (c) inter-provincial similarity–difference matrix.
Figure 6. (a) Comprehensive similarity matrix, (b) network edge weight matrix, and (c) inter-provincial similarity–difference matrix.
Land 14 01999 g006aLand 14 01999 g006b
Table 1. Evaluation of the number, proportion, and average scores of three types of comments by province.
Table 1. Evaluation of the number, proportion, and average scores of three types of comments by province.
ProvinceNumber of
Positive Comments
Percentage of
Positive Comments
Number of
Neutral Comments
Percentage of
Neutral Comments
Number of
Negative Comments
Percentage of
Negative Comments
Average Score
Shanghai703685.06%4875.89%7499.05%0.875
Yunnan19,49673.26%292911.01%418615.73%0.779
Inner Mongolia815374.25%136912.47%145813.28%0.793
Beijing20,59484.57%17377.13%20208.30%0.874
Jilin10,35779.14%142310.87%13079.99%0.832
Sichuan28,06377.54%370510.24%442212.22%0.817
Tianjin430574.33%83414.40%65311.27%0.800
Ningxia946984.79%9258.28%7736.92%0.878
Anhui19,88677.91%297111.64%266610.45%0.825
Shandong24,39275.12%352610.86%455214.02%0.794
Shanxi16,28172.55%263911.76%352215.69%0.773
Guangdong20,62577.26%335912.58%271310.16%0.820
Guangxi13,87074.42%201310.80%275414.78%0.789
Xinjiang18,46175.93%299112.30%286111.77%0.809
Jiangsu54,57880.53%684310.10%63569.38%0.845
Jiangxi17,21373.57%267911.45%350614.98%0.781
Hebei12,28774.16%211812.78%216413.06%0.793
Henan26,66980.31%31559.50%338210.18%0.840
Zhejiang39,25380.10%47719.74%498310.17%0.839
Hainan17,17584.96%15267.55%15147.49%0.877
Hubei30,63675.55%508912.55%482611.90%0.805
Hunan20,63876.92%302111.26%317111.82%0.814
Gansu975777.09%129310.22%160712.70%0.813
Fujian15,21076.49%229711.55%237711.95%0.811
Xizang899881.62%10099.15%10179.23%0.851
Guizhou12,62868.16%233412.60%356419.24%0.735
Liaoning12,90080.04%166910.36%15489.60%0.839
Chongqing14,35071.97%284714.28%274113.75%0.777
Shaanxi28,32577.79%35379.71%455012.50%0.818
Qinghai502981.06%5879.46%5889.48%0.848
Heilongjiang546971.07%115915.06%106713.87%0.772
Table 2. Theme labels by province.
Table 2. Theme labels by province.
ProvinceTheme 1 LabelTheme 2 LabelTheme 3 LabelTheme 4 LabelTheme 5 Label
ShanghaiWild AnimalsLandmarks/The Bund/LujiazuiShanghaiEducation/Science Popularization/ChinaNight View/Hours
YunnanCable CarEastern DistrictHot SpringsScenic Area/Nature
Inner MongoliaErdao Bridge/Most Beautiful/PlacesRecommendationTimeScenery
BeijingHeshenChina/BeijingScenic Area/Convenience/BeijingTickets/Places/AttractionsAncient China/Hall of Prayer for Good Harvests
JilinChildrenScenery/ChangchunCommentary/PlacesWest Slope/World/Attractions
SichuanWorld HeritageJiuzhaigouScenery/Attractions/PandasSnow Mountain/GuangwuCommentary
TianjinClay Figurine ZhangClay Figurine ZhangScenery/Scenic Area/JixianChildren/Hiking/Cost-Effectiveness
NingxiaScenery/ServiceTicketsExperience/Heritage SiteFilming/Scenes/Zhenbeibao
AnhuiScenic Area/Hours/DescentBodhisattva/TiantaiAncient Villages/Huizhou/NanhuRecommendation/Places
ShandongCable Car/Ropeway/HoursRecommendation/Attractions/Liugong IslandConvenience/Qingdao/EveningConfucius MansionTickets
ShanxiBuddha StatuesShanxi/ArchitecturePlacesQueuing/MianshanHukou Waterfall
GuangdongPlaces/West LakeChildrenTimeXiqiao Mountain/HuizhouEnvironment/Stone Towers/Scenery
GuangxiScenery/Cross-Border/SceneryBamboo Raft/Landscape/DockLijiang River/Night Tour/Duxiu PeakCulture/Experience/HistoryScenery/Tickets
XinjiangScenery/Grand Canyon/KeketuohaiSceneryTianshan/BayanbulakClear/LakesTime/Shuttle Bus
JiangsuTicketsGiant Buddha/PlacesLinggu/Meiling Palace/NanjingSuzhou/YuantouzhuJinshan/Confucius Temple
JiangxiConvenienceWorld/Lushan/SceneryChina/JiangnanMountain Climbing/Sanqingshan/Attractions
HebeiCable Car/ScenerySceneryUnderground Palace/MausoleumArchitecture/Summer Resort/Ancient City
HenanShaolinGrand Canyon/TaihangTicketsChildren/Places/AttractionsShows/Evening
ZhejiangTicketsNingbo/Yandang Mountain/HometownHangzhouWater Town/Scenic Area/Jiangnan Water TownQiandao Lake
HainanSanyaScenerySanyaSmall Cave/Places
HubeiConvenience/Fun/RecommendationPark/ChibiTicketsScenery/Suitable/PlacesPeople/Dam/Scenery
HunanGlass/Mountain/BoardwalkConvenienceLike/RecommendationScenery/Chairman Mao/PlacesNight View
GansuConvenienceDunhuang/TicketsKongtong Mountain/ChinaDesert/First Pass Under Heaven/Fortress
FujianGulangyu/Wuyi MountainAttractions/MazuRaftingRecommendation/Suitable/Fun
XizangDevotionLinzi/TibetMonasteries/Tibetan BuddhismLakeTime
GuizhouAttractionsGuizhou/CharacteristicsMountain Climbing/WeatherGuizhou/MagnificentTime/Tianxingqiao/Hours
LiaoningConvenience/AnimalsPlaces/Scenic AreaScenic AreaPlaces
ChongqingPlacesAttractions/TouristsChongqing/Attractions/WulongWaterfall/BaidichengArt/Grottoes/Beishan
ShaanxiMountain/Cable CarConvenience/AttractionsPark/Bicycle/LishanExperience/Famen Temple/ArchitectureRecommendation
QinghaiSaltwater Lakes/BeautifulCommentaryTemple/GelugpaDriver/TimeNiuxin Mountain/Faith
HeilongjiangIce and Snow/Songhua RiverTickets5APlacesScenic Area/Scenery/Magnificent
Table 3. Top 10 keywords contributing the most to affective scores by province.
Table 3. Top 10 keywords contributing the most to affective scores by province.
ShanghaiQueuingToo Many PeopleCtrip160Buy TicketsOriental Pearl TowerGo UpID CardAnimalsScience and Technology Museum
YunnanLijiangAncient TownScenic AreaInnAncient TownSceneryEggsTicketsHot SpringsElectric Cart
Inner MongoliaPackage TicketsScenic AreaQueuingSceneryGrasslandCtripCollect TicketsAttractionsNot BadTickets
BeijingID Card877CtripGreat WallQueuingTour GuideBuy TicketsCable CarEcho Wall20
JilinCtripQueuingBuy TicketsTicketsBook TicketsScenic AreaTianchiNot BadChangchun Film StudioScenery
SichuanScenic AreaCollect TicketsID CardQueuingCtripSceneryAttractionsCable CarDujiangyanTickets
TianjinJianbing GuoziScenic AreaNot BadSceneryAncient CultureTianjin60PanshanQueuingEar Hole Fried Cake
NingxiaScenic AreaCtripPackage TicketsTicketsJourney to the WestShapotouQueuingQR CodePlayDesert
AnhuiCollect TicketsScenic AreaTicketsCtripNot BadSceneryHongcunAttractionsCable CarHuangshan
ShandongScenic AreaNot BadQueuing10SceneryPenglai PavilionAttractionsRaftingTicketsBuy Tickets
ShanxiPingyaoHukou WaterfallScenic AreaAncient TownWaterfallCollect TicketsNot BadCtripAttractionsTickets
GuangdongNot BadSceneryQueuingTicketsScenic AreaFeesCollect TicketsAttractionsCtripBuy Tickets
GuangxiLijiang RiverGuilinElephant Trunk HillScenic AreaCollect TicketsCtripSceneryWaterfallBamboo RaftDuxiu Peak
XinjiangScenic AreaShuttle BusSceneryQueuingGrasslandCollect TicketsAttractionsCtripTicketsToo Hot
JiangsuID CardScenic AreaCtripAttractionsQueuingAncient TownCollect TicketsTicketsNot BadScenery
JiangxiTengwang PavilionQueuingCollect TicketsScenic AreaSceneryCable CarID CardNot BadCtripAttractions
HebeiSceneryScenic AreaNot BadCtrip10Buy TicketsAttractionsTicketsFirst Pass Under HeavenCollect Tickets
HenanScenic AreaCollect TicketsAttractionsCtripQueuingCable CarTicketsNot BadSceneryParking Lot
ZhejiangScenic AreaID CardTicketsCtripAttractionsCollect TicketsAncient TownNot BadWest Lake10
HainanScenic AreaCtripElectric CartAttractionsQueuingCollect TicketsSmall CaveBook TicketsWuzhizhou IslandTour Bus
HubeiScenic AreaNot BadSceneryAttractionsCollect TicketsWaterfallCtripID CardTicketsTourists
HunanQueuingCollect TicketsScenic AreaYueyang TowerNot BadOrange IslandSceneryAttractionsCtripTianmen Mountain
GansuScenic AreaCrescent Moon SpringJiayuguanTicketsAttractionsID CardEnterDanxiaMingsha MountainQueuing
FujianScenic AreaCollect TicketsBamboo RaftCtripNot BadWuyi MountainGulangyuAttractionsTicketsTulou
XizangLhasaReservationTicketsPotala PalaceTour GuideJokhang TempleQueuingCommentaryCtripAttractions
GuizhouWaterfallScenic AreaQueuingTianxingqiaoHuangguoshuSceneryNot BadAttractionsAncient TownCollect Tickets
LiaoningCtripQueuingBuy TicketsSceneryScenic AreaCollect TicketsWater CaveNot BadID CardBook Tickets
ChongqingNot BadSceneryScenic AreaID CardCollect TicketsTiankengCtripQueuingSceneryHours
ShaanxiHukou WaterfallScenic AreaCollect TicketsWaterfallBuy TicketsCable CarCtripCommentaryQueuingTour Guide
QinghaiKumbum MonasteryQinghai LakeScenic AreaBuy Tickets20Collect TicketsCtripEnterParking LotTibetan Buddhism
HeilongjiangWaterfallScenery100Not BadScenic AreaID CardCtripTicketsAttractionsSun Island
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, W.; Liu, J.; Zhu, H.; Li, F.; Zhu, Z.; Zhengchen, R. Inter-Provincial Similarities and Differences in Image Perception of High-Quality Tourism Destinations in China. Land 2025, 14, 1999. https://doi.org/10.3390/land14101999

AMA Style

Zhao W, Liu J, Zhu H, Li F, Zhu Z, Zhengchen R. Inter-Provincial Similarities and Differences in Image Perception of High-Quality Tourism Destinations in China. Land. 2025; 14(10):1999. https://doi.org/10.3390/land14101999

Chicago/Turabian Style

Zhao, Wudong, Jiaming Liu, He Zhu, Fengjiao Li, Zehui Zhu, and Rouyu Zhengchen. 2025. "Inter-Provincial Similarities and Differences in Image Perception of High-Quality Tourism Destinations in China" Land 14, no. 10: 1999. https://doi.org/10.3390/land14101999

APA Style

Zhao, W., Liu, J., Zhu, H., Li, F., Zhu, Z., & Zhengchen, R. (2025). Inter-Provincial Similarities and Differences in Image Perception of High-Quality Tourism Destinations in China. Land, 14(10), 1999. https://doi.org/10.3390/land14101999

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop