Amidst the rapid advancement of urbanization, coastal cities, characterized by their unique geographical locations and ecological resources, have demonstrated a significant capacity for factor agglomeration [
1]. In contrast to inland cities, coastal areas function as the primary interface for city–sea interactions, integrating urban public functions such as leisure, culture, and socialization, while also serving as corridors for marine activities and ecological perception [
2]. Recent studies have identified numerous challenges for coastal cities [
3,
4,
5]. The utilization of coastlines is limited by natural conditions and disaster prevention requirements, while land resources are constrained by economic, social, and ownership factors, among others. Heavy reliance on motorized transport systems often results in neglecting walking and cycling routes, leading to poor connectivity between scenic areas and leisure activities, which affects the accessibility and experiences of visitors and residents [
6,
7]. Ecological space fragmentation, landscape homogenization, environmental stress from poor beach cleanliness, and limited public accessibility are contradictions impeding coastal sentiment attachment and spatial identity [
8,
9]. Examining coastal experiences from the viewpoint of tourists presents valuable opportunities to explore the links between human activities, marine ecosystems, and built environments.
1.1. The Evaluation of Coastal Spaces from the Perspective of Tourist Experience
Coastal spaces serve as crucial public areas where individuals participate in leisure and social activities. Parasuraman et al. (1988) suggested that perceived value arises from users evaluating and contrasting perceived advantages and disadvantages [
10]. Sweeny and Soutar (2001) noted that users have subjective views on the overall impact of product or service features and other elements [
11]. According to the waterfront environmental theory by Hanayanagi (1999), coastal tourism relates to the interaction of human activities and waterfront environment, which extends to exploring human awareness and behavior at the waterfront, changes in behavior characteristics over time, and waterfront space value [
12]. Consequently, examining coastal spaces through the lens of user perception lays the groundwork for enhancing the current coastal environment, significantly contributing to the creation of more enjoyable and comfortable areas for users. Therefore, user perception and assessment of coastal spaces are key factors for evaluating the quality of these areas.
Conventional tourist experience research predominantly depends on comprehensive interviews, satisfaction evaluations, and questionnaire surveys, supplemented by statistical analysis. For instance, Roca et al. examined user perceptions in a coastal area of Spain using a questionnaire survey and categorized opinion groups [
13]. Nonetheless, traditional methods and spatial evaluation techniques are often constrained by quantitative surveys and static data, and their scope of data coverage and capacity to quantify users’ sentiment reactions to spatial elements frequently fall short [
14]. The low response rate of questionnaire surveys may result in a concentration of samples among “highly loyal users”, while on-site observations might not accurately reflect real usage scenarios due to limited case coverage [
15]. The recent advancements in natural language processing, combined with the vast wealth of online data resources, offer a novel method for evaluating tourist experiences in coastal areas through data generation and text mining analytics. By using text data mining techniques, especially integrating new technologies such as semantic mining and clustering, sentiment analysis, and spatial analysis, judgment can be conducted on tourist experiences and emotional changes across various spaces and seasons. This allows for a more precise evaluation of the strengths and weaknesses of the coastal spatial environment and offers insights for optimizing coastal zone management.
1.2. The Application of Improved Text Mining Approaches for Understanding Tourism Experiences
In recent years, the significance of online text data in researching images of tourist destinations has considerably increased. With the rapid expansion of the e-economy, tourists’ online reviews and opinions have become crucial intelligence sources for both businesses and consumers as they offer a more accurate and genuine reflection of users’ true experiences [
16]. Taking tourism as an example, tourism industry development and destination choices are increasingly influenced by the voice from social media [
17]. The variations in consumer positioning and consumption culture highlighted by social media data can also help in the development of urban commercial areas [
18]. Since geotagged social media data includes time and location information, tourist movement patterns can be further drawn from related spatiotemporal analysis.
Text mining new technology serves as a focused approach for examining the vast amounts of data gathered from online reviews, especially the employment of natural language processing (NLP) to uncover hidden insights within unstructured or semi-structured text data [
19]. This involves converting non-quantitative materials into quantitative data and deriving precise meanings through word and sentence inferences. For instance, ROST Content Mining 6 (ROST-CM6) is a widely used content analysis software for the Chinese language that extracts valuable information through methods such as word frequency statistics, social and semantic network analysis, and sentiment analysis [
20]. Additionally, network text analysis techniques like Word2vec word vector training, K-means clustering, Term Frequency-Inverse Document Frequency (TF-IDF) keyword extraction, and text sentiment analysis are also in vogue [
21]. However, these typical early text representation vector models are often simplistic, lacking contextual information and word-to-word associations. Recently, topic modeling has become a widely utilized tool for document representation. Related modeling methods have evolved from Latent Semantic Analysis (LSA) to Probabilistic Latent Semantic Indexing (PLS), and more recently to Latent Dirichlet Allocation (LDA). In tourism studies, the LDA model has proven powerful in identifying tourism categories. For example, nine tourism categories were identified in the analysis of tourist destinations and preferences in South Korea [
22].
By far, text mining still encounters several challenges when handling unstructured data in coastal assessments, such as comments and policies. Studies suggest that the frequent use of text in environmental impact assessments may result in inaccurate risk communication, and text mining has yet to fully address this issue [
23]. Additionally, existing research often examines user emotional or environmental data in isolation, lacking a dynamic correlation between the two [
24]. For instance, coastal research quantified the supply and demand of beach entertainment services using POIs and social media data, but it has not integrated user preferences with environmental characteristics [
25].
Zhang et al. (2011) thoroughly examined the contact relation and document relation in microblogs to aid topic mining by using the model of Micro Blog-Latent Dirichlet Allocation (MB-LDA) [
26]. Jiang et al. (2017) gathered travelogs, comments, and other social media data as initial text, utilizing complex network theory for text mining [
27]. Qiu et al. (2021) used ROST-CM6 to mine tourist comment data from online commentary texts sourced from Weibo [
28]. These studies enable travel recommendation services by extracting keywords from tourists’ online texts that reflect their emotional preferences on tourism destinations. With recent technological advancements, increasingly sophisticated tools are being integrated into this research field. Gu et al. (2019) applied a Word2vec-based word vector model for semantic mining of text big data, clustering keywords to effectively extract image perception elements, aiding in the analysis of urban tourism image structure and perception [
29]. Wan et al. (2023) employed a deep learning model, TNNFMB (Two-way Neural Network Fusion Model Based on BERT) [
30], which leverages advanced language knowledge from large-scale text data through transfer learning to address the data sparsity issue in small-sample sentiment analysis tasks. Their sentiment analyses have uncovered both positive and negative reviews from online users. Additionally, their studies have demonstrated the superior classification accuracy of the deep learning model, which achieves a significantly better classification effect in text mining.
Overall, the number of studies utilizing text mining technology to evaluate the needs and perceptions of users in specific locations remains limited, with most existing research still in the preliminary exploration phase. Compared to traditional methods, such as questionnaires, interviews, and field observations, employing text mining to systematically investigate the needs and perceptions of place users presents a technical challenge. The processing flow of comment data is overly simplistic; a more professional and in-depth approach can better capture a user’s perceived experience [
31]. Researchers often directly apply general text analysis models, such as topic models, sentiment analysis, and word frequency statistics [
32,
33,
34], without thoroughly considering how these technologies can effectively integrate with the core theoretical methods of place user needs and spatial perception. In summary, the research integrating user needs and perceptions based on text mining approach with the evaluation of coastal places is relatively nascent, lacking in-depth designs that adequately meet research needs [
35,
36].
Against the backdrop of growing attention to coastal recreational spaces in urban quality enhancement, understanding how tourists perceive and utilize these spaces has become critical for evidence-based management and experience optimization. Drawing on social media data from coastal tourists, rich in real-time, user-generated insights into on-site experiences, this study focuses on the spatial classification and temporal dynamics of coastal tourism activities. This paper aims to use text mining techniques to identify tourist experiences related to coastal areas, classify themes, and calculate sentiment values, from which the sentiment dynamics, thematic patterns, and spatial–temporal variations of coastal tourism can be captured. Ultimately, these efforts aim to provide actionable insights for refining coastal management strategies, tailoring space optimization measures to specific coastline types and enhancing the overall quality of coastal tourist experiences.