1. Introduction
The proliferation of user-generated text on digital platforms has increased both the volume and analytical relevance of unstructured data [
1,
2,
3]. Despite the abundance of such texts, transformer-based embeddings—though widely adopted—are often treated as black-box features, and their internal numerical and geometric properties remain largely unexplored [
4]. This creates a methodological gap that this study aims to address. While these texts are rich in semantic and experiential content, their unstructured and heterogeneous nature makes conventional quantitative analysis challenging [
5,
6,
7]. In domains like tourism and protected area management, visitor narratives often integrate evaluative judgments and situational descriptions, making simple text-to-number transformations insufficient. Traditional text representation methods, such as bag-of-words, TF-IDF, and probabilistic topic models, rely primarily on lexical features and are insensitive to word order or contextual nuance [
8,
9,
10]. This limits their ability to capture the complexity of natural language, especially in domains like tourism, where visitor narratives combine affective reactions, evaluative judgments, and situational descriptions. Reducing such texts to isolated word counts risks losing analytically important dimensions of meaning.
Recent advances in transformer-based language models have shifted text representation from isolated tokens to dense, context-sensitive embeddings [
11,
12]. These embeddings perform reliably across downstream tasks and have been widely adopted in tourism research for sentiment analysis and decision-support applications [
13,
14]. However, their rapid uptake has created a methodological blind spot: embeddings are often treated as black-box features, and their internal numerical and geometric structure remains largely unexplored. Embeddings are often treated as black-box features, serving as inputs to predictive or classification models without examining their internal structure [
15]. As a result, key properties such as numerical consistency, geometric organization, and variance behavior remain largely unexplored [
16]. This lack of understanding limits the reliability of embedding-based analyses in applied settings, including national parks, where misinterpretation of semantic representations can affect experience design, crowd management, and decision-making [
17].
Many statistical and machine-learning methods implicitly assume well-behaved metrics, variance distributions, and regular data structures [
18]. Introducing semantic embeddings without first examining their numerical and geometric properties leaves these assumptions untested, especially in high-dimensional contexts where variability concentration, distance concentration, and weak cluster separability can distort similarity measures and downstream analyses [
19]. Existing evaluations often focus on predictive performance, implicitly equating accuracy with representational adequacy [
20,
21,
22,
23,
24]. While suitable for task-specific objectives, this approach overlooks whether embeddings reliably capture the internal organization of semantic space. In national park reviews, visitor experiences are complex, involving mixed perceptions and trade-offs that simple task-based evaluation may fail to reflect.
The research conducted in this manuscript addresses the identified gap by developing a measurement structure for investigating transformer-based semantic embeddings as high-dimensional metric spaces [
25]. Visitor reviews from national parks are used because they represent a wide range of heterogeneous expressions. The objective of this study is not empirical generalization across tourism contexts, but methodological generalization, demonstrating how semantic embedding structures can be examined independently of domain-specific content. The manuscript treats semantic embeddings as numerical representations whose internal properties deserve attention in their own right, rather than focusing on task-specific prediction or domain-driven interpretation. The analysis concentrates on established statistical and geometric descriptors—including vector norms, cosine similarity distributions, variance structures explored through dimension reduction, and clustering validity metrics—examined prior to any predictive or optimization-oriented modeling. Taken together, this framework enables explicit assessment of numerical stability, geometric coherence, and semantic organization, providing insights into how visitor experiences are structured within the embedding space.
This manuscript contributes to the quantitative analysis of unstructured textual data by proposing a novel analytical framework for examining semantic embeddings. Unlike conventional embedding-based studies that rely on clustering, topic extraction, or similarity ranking, this approach treats embeddings as continuous high-dimensional metric spaces. It examines relational distance structures and dispersion patterns, interpreting these properties as indicators of latent experiential organization rather than thematic prevalence. Conceptually, semantic encoding is treated as a deterministic mapping from unstructured visitor narratives to structured numerical space, enabling quantitative examination of experiential meaning. The study assesses the numerical and geometric properties of transformer-based embeddings applied to heterogeneous national park reviews, focusing on internal organization and stability. This approach emphasizes similarity relations, variance behavior, and organizational coherence rather than assuming adequacy based on predictive performance.
Building on these results, the study proposes a methodological framework that can be applied prior to task-based analytics, facilitating more informed interpretation of sentiment classification, segmentation, and decision-support models in protected area management. Overall, this strengthens the methodological basis for integrating unstructured textual data while retaining the semantic nuances in visitor narratives. The manuscript is organized as follows: the next chapter provides research background and related work; the methodological framework is then presented, including data description, vector norms, cosine similarity, principal component analysis, and clustering validity metrics. The following chapter presents empirical results, followed by a discussion of methodological and managerial implications. The final chapter concludes with limitations and directions for future research.
2. Background and Related Work
Recent progress in language processing is based on models that can understand the meaning of text depending on context, using large amounts of data [
26]. Earlier approaches, such as simple word-count models or fixed word vectors, could not capture meaning in this way [
27,
28]. Modern models can link distant parts of a text and interpret the meaning of the whole message, not just individual words [
28,
29]. One practical result of this development is that text can be converted into numbers and represented in a shared “meaning space.” This makes it possible to analyze meaning quantitatively, not just by comparing individual words [
30]. Sentence-level models go a step further by representing an entire sentence or short text with a single set of numbers. This allows large amounts of user-generated text to be analyzed while still preserving the overall meaning, even when expressions vary widely [
31,
32].
These models are now used in many areas, including text meaning comparison, grouping similar content, information retrieval, and general text analysis [
33]. Their use is particularly visible in tourism and recreation research, where they are increasingly applied to analyze visitor reviews, satisfaction, and descriptions of experiences in tourist destinations and protected areas [
34]. National parks have proven especially well-suited to such approaches. The volume and richness of visitor-generated textual data capture diverse perceptions of nature quality, infrastructure, accessibility, crowding, interpretation, and overall experience [
35]. Most studies evaluate these models based on how well they perform specific tasks, such as identifying whether someone is satisfied or dissatisfied, or supporting decision-making [
36]. While this is useful in practice, it tends to overlook how meanings are actually structured within the models themselves.
Similarity in meaning between texts can be reliably measured using cosine similarity, a measure that indicates how similar two texts are in meaning regardless of their length, as well as related methods [
37]. This is particularly useful when working with large and complex datasets [
38,
39]. In applied research, these measures are often used to identify similar reviews, aggregate visitor narratives, and support recommendation and monitoring systems [
40]. Their usefulness in such contexts is well established. At the same time, many of these task-oriented applications rely on assumptions that are rarely examined explicitly. Numerical stability, spatial coherence, and the overall structure of the embedding space are usually taken for granted rather than carefully tested [
41,
42]. Embedding representations are often used as a reliable basis for analysis. However, it is rarely examined how different meanings are actually arranged within that space, or how similar, different, or well-organized they are.
Techniques for dimension reduction, including principal component analysis (PCA), are typically used for initial visualization of the embedding space and to provide a first intuition for how variance is concentrated [
43,
44]. In practical/applied research, people often interpret low-dimensional projections as major experiential themes or latent factors underlying visitor discourse [
45,
46]. Such an interpretation appears intuitive and simple when multifaceted datasets are reduced to simpler forms that can be scrutinized and communicated. However, in fact/practice, the application is largely descriptive. Formal discussion of how semantic information is distributed across latent dimensions, or of how much variance must be retained to preserve meaningful structure, is often limited or absent [
47]. That means structure is only partially or contextually defined by low-dimensional projections. Practically, in the case of national park studies, this result has consequences. Most often, visitor accounts are complex and ambiguous; positivity and negativity can easily be merged within one single account. Therefore, when such experiential narratives are reduced to a few components for purposes of visualization or reporting, the apparent experiential patterning will almost certainly be an oversimplification of the gradual, overlapping nature perceptions of visitors.
Unsupervised clustering methods are also widely applied to semantic embeddings to extract themes or visitor segments [
48]. The typical interpretation of such clusters, in tourism research and park management studies, is that they represent distinct types of visitors or experiences [
49]. This idea is appealing because it suggests that clearly defined groups can be directly translated into practical managerial decisions. In reality, however, little practical work has been conducted on checking the validity of clustering against alternative configurations. Very often, for reasons of interpretative convenience and not because it is strictly necessary from a methodological point of view, weak overlapping or continuous structures are forced into discrete partitions [
50,
51]. There is a risk in this practice. It may exaggerate categorical distinctions in visitor experiences that are, in reality, organized along slow semantic gradients by overlapping motivations, perceptions, and context-dependent trade-offs. In the simplest terms, this shows that there is a general problem. Although transformer-based semantic representations are now widely used and increasingly applied in decision-support systems [
52,
53], there is still no clear and established way to analyze them quantitatively. Embedding spaces are mostly treated as a technical step in the analysis, and their value is judged by how well they perform a given task rather than by examining their numerical and spatial structure [
54]. As a result, some important patterns in these representations may go unnoticed, even though their outputs are used in real-world decision-making.
This limitation becomes particularly significant in practical applications, such as national park management, where insights derived from textual data analysis support decision-making at multiple levels, with an emphasis on sustainable and caring capacity management [
55]. Good performance on standard measures does not necessarily mean that all crucial aspects or dimensions of visitors’ experience have been captured. Temporary experiences, mixed opinions, or early signs of problems often do not fit into predefined categories and can therefore be easily overlooked [
56,
57]. Such blind spots are hard to identify without analyzing the embedding space’s actual structure. High-performing models mask subtle but meaningful variation in visitor discourse and appear reliable. This observation speaks to a higher-level methodological need that is more broadly applicable. It is desirable to apply approaches before building specific models that carefully examine the numerical stability of semantic representations, as well as their structure and grouping, rather than assuming they function correctly on their own.
This study responds to that methodological need by proposing a clearer, more systematic quantitative approach to analyzing transformer-based semantic embeddings, treating them as complex, high-dimensional spaces. Rather than focusing on how successful a model is at a specific task or domain, the analysis examines the internal structure of semantic embeddings and how it influences subsequent analyses. By relying on clear and measurable analytical procedures, the study seeks to clarify how visitor experiences are encoded and organized within complex, high-dimensional semantic embeddings. This approach does not replace task-based analyses; instead, it complements them. It introduces a level of structural validation that enables more cautious and transparent interpretation of forecasting and classification results. In applied contexts, such as national park management, and in related domains where AI-generated textual analytics inform decision-making. The mentioned shift supports a more grounded interpretation and application of model outputs.
To contextualize this methodological position,
Table 1 provides a structured overview of prior studies that use semantic embeddings to analyze unstructured textual data. The table compares these studies by their research objectives, embedding models, analytical techniques, and key findings. This comparison enables identification of common methodological patterns and recurring limitations in existing approaches.
As summarized in
Table 1, existing studies predominantly evaluate semantic embeddings through downstream task performance or descriptive projections. Meanwhile, systematic examination of numerical stability, geometric organization, and variance structure remains largely absent. This gap motivates the analytical framework introduced in the following section, which treats semantic embeddings as objects of quantitative investigation in their own right.
3. Materials and Methods
3.1. Data Source and Corpus Description
The dataset was selected for its semantic richness and heterogeneity, making it an ideal testbed for examining transformer-based embeddings. Using national park reviews allows us to explore how experiential meaning is encoded in high-dimensional semantic spaces, independently of domain-specific content, demonstrating the broader applicability of the proposed analytical framework. The dataset was obtained from an open-access online repository, specifically the Kaggle platform, where it is distributed under the title Parks Reviews Dataset. The dataset used by the authors in this study comprises visitor reviews of national parks published during 2025. It serves as the empirical basis for the research, in which textual representations and transformer-based semantic modeling are examined. National park reviews are used as a theoretically suitable testbed due to their semantic richness; however, the proposed analytical framework is not domain-bound and can be applied to any corpus of unstructured experiential text. The dataset consists of 3034 English-language reviews that vary considerably in length, structure, and expressive complexity. Each review is treated as an independent textual observation and contains unstructured natural-language content reflecting visitors’ subjective evaluations, perceptions, and experiential accounts of park environments. In practice, these accounts rarely focus on a single aspect of the visit. Rather than focusing solely on the natural features of national parks and their undeniable beauty, the reviews typically combine descriptions of the landscape with evaluations of infrastructure, accessibility, crowding, the quality of visitor information, safety, and overall satisfaction. Taken as a whole, these texts provide a clear and balanced overview of visitor experiences in national parks. In this context, meaning does not emerge from isolated elements, but from the interaction between environmental conditions, park organization, and visitors’ emotional responses. Precisely because of this diversity, the dataset is particularly well suited for examining how meaning is encoded and organized within high-dimensional semantic spaces, without imposing predefined categories or interpretive constraints.
From a methodological perspective, national park reviews constitute a particularly demanding form of textual data. The way visitors express their views on protected areas shows considerable diversity in motivations, levels of environmental awareness, and styles of expression. Some reviews take the form of brief evaluations, while others are more extensive and written as narrative accounts. Visitors to national parks often include both positive and negative evaluations within the same review, while reviews that are entirely positive or entirely negative are very rare. Because of this diversity, the dataset is well-suited for analysis using semantic models that examine how meaning is constructed and connected into a coherent whole. Encoding models must be able to process all types of responses within a single numerical framework. In this study, such diversity is not treated as noise that should be removed or reduced. Instead, it is regarded as a central feature that allows latent semantic structure to emerge organically from the data.
Before analysis, the dataset was reviewed, and only records containing actual textual content were retained; entries without text and partial records lacking meaningful content were removed. Importantly, no additional filtering based on textual meaning or content was applied. By avoiding predefined categories during data preparation, the full diversity of how visitors describe their experiences is preserved. Rather than adapting the texts to a specific analytical objective, the dataset is treated as a direct, unfiltered record of visitors’ experiences in national parks. This approach allows meaning to emerge directly from the text and enables examination of how it is organized within semantic models, without additional assumptions or constraints.
3.2. Methodology
The overall analytical workflow is summarized in
Figure 1, illustrating the sequence from text collection and embedding generation to geometric and statistical diagnostics.
Figure 1 illustrates the methodological sequence from text collection and transformer-based embedding generation to vector normalization and exploratory geometric diagnostics, including cosine similarity analysis, principal component analysis, and clustering validity assessment. These components jointly support the interpretation of semantic space structure prior to task-specific modeling. The mathematical representation of the applied methodological approach for analyzing unstructured textual data is presented in
Section 3.2.
3.2.1. Vector Norms
A vector norm denotes a function
over a vector space
, such that for
and a scalar
, the following characteristics are established [
58]:
if
, then
, which is the zero vector.
The
vector norm is defined in the following way:
The taxicab (Manhattan grid) exists when
:
Euclidean norm is in the case when
The maximum norm for
is as follows:
Norm-based dispersion that outlines a relative spread of components with respect to the vector magnitude could be as follows:
3.2.2. Cosine Similarity
The angle between two vectors is estimated using the cosine similarity. This measure is calculated by dividing the vectors’ dot product by the product of their norm and could be computed as follows:
The range of cosine similarity is between 0 and 1. When the cosine similarity is close to 1, the two vectors are considered similar. In the opposite case, when cosine similarity is close to 0, two vectors are considered dissimilar [
59].
3.2.3. Principal Component Analysis
Principal Component Analysis (PCA) reduces the data to a few specific dimensions by computing a new set of variables. Each new variable represents a linear combination of the previous variables. A more reliable approximation is determined using more principal components, where every succeeding component is uncorrelated with the previous one. Therefore, the formula for
principal components is written as follows:
where
denotes the
matrix of original variables.
represents
matrix of new principal components, where
.
denotes
matrix of coefficients in relation to the original variables to the
principal components.
represents the residual matrix
[
60].
3.2.4. Unsupervised Clustering Validity Metrics
The clustering target is very precise—we aimed to perform an adequate clustering of the unlabeled dataset. Clustering is successful when high separation between clusters and high cohesion within clusters are achieved. The authors proposed a single measure combining separation and cohesion metrics. The most famous measures are shown below [
61,
62].
The Calinski–Harabasz coefficient (variance ratio criterion) represents a measure based on the internal cluster’s dispersion and the dispersion between clusters. Mathematically, it can be written as follows:
The Dunn index represents a ratio of the smallest and the largest distance between data from different clusters, and it is represented in the following way:
The Xie–Beni score was designed for fuzzy clustering, but it applies to crisp clustering, and it could be presented as follows:
The Ball–Hall index represents a dispersion measure based on the quadratic distances of the cluster points from the centroid:
Another measure is a Hartigan index that is based on the logarithmic relationship between the sum of squares within and between the clusters:
The silhouette coefficient is the most popular method for estimating separation and cohesion metrics, and it is calculated in three steps.
Step 1. Compute the average distance
for all points in the same cluster for all clusters:
. Compute the minimum average distance
between the considered point and the points that are in each cluster, omitting the analyzed point:
Step 3. The silhouette coefficient is determined by using the following formula:
The global silhouette coefficient is calculated as the average of the silhouette coefficients for each point in the dataset:
3.3. Text Preprocessing
Text preprocessing prior to analysis was kept to a minimum in order to allow the underlying structure of transformer-based semantic embeddings to be examined more clearly. Visitor reviews were retained in their original form to preserve their meaning, writing style, and narrative flow. This approach treats reviews as holistic descriptions of experience rather than as collections of isolated elements. In this study, diversity in the content and length of reviews is not treated as noise, but as a key characteristic of the data. Within a single review, visitors often combine descriptions of the natural environment, personal impressions, emotional responses, and practical details, with positive and negative experiences frequently appearing together.
By preserving the original form of the text, meaning can emerge from the full context, shaped by the relationships between words and their use within the narrative. Rather than steering the data toward a predefined analytical objective, the analysis focuses on observing how semantic structure emerges when visitor comments are encoded without prior categorization or labeling. Data preparation was therefore limited to technical steps necessary for reliable analysis, including the removal of non-textual metadata and header fields, verification of record integrity, and elimination of empty or malformed textual entries. These procedures do not alter the semantic content of the text, but ensure that all records constitute valid inputs for the semantic encoding process. Preserving differences in review length and writing style enables semantic models to faithfully represent the diversity of visitor experiences in national parks. This is particularly important in practice, as it ensures that insights derived from the analysis reflect real visitor experiences rather than artificially shaped representations.
3.4. Transformer-Based Semantic Encoding
Each textual review was encoded using a pre-trained transformer-based sentence embedding model. The encoder maps each input text
to a fixed-length numerical vector
384, producing a continuous semantic representation suitable for quantitative analysis. Formally, the encoding process can be expressed as follows:
where denotes the set of textual inputs and represents the transformer-based encoding function.
This approach shows how unstructured visitor review texts are translated into a numerical space where meaning can be examined through relationships between points. Reviews are treated as input texts, while their meaning is represented as positions within a shared, continuous space. In this way, meaning, which is usually interpreted in descriptive or qualitative terms, can be analyzed using statistical and spatial methods. At the level of sentences or short texts, the models assimilate word meaning, sentence composition, and wider context. Rather than assigning a single, fixed meaning to each word, transformer-based models interpret meaning in relation to surrounding words. Meaning, therefore, emerges from context, not from isolated terms. This is particularly important for national park reviews. The same words—such as “crowded”, “wild”, “quiet”, or “accessible”—can carry different meanings depending on how they are used. Their interpretation depends on the overall narrative, accompanying words, and the specific circumstance described.
By preserving contextual information, transformer-based models retain these differences in meaning within the semantic space, rather than losing them through simplification or surface-level encoding. To ensure consistent comparison, all vectors were normalized to the same length. This removes technical differences related to text length or writing style, allowing meaning to be compared solely on the basis of content similarity. That is why cosine similarity is used for comparison, as it reflects similarity in the meaning of the texts rather than differences in their length or writing style. This normalization is not a technical formality, but a deliberate methodological choice that allows reviews to be compared based on genuine similarity of experience, rather than on how they are written. The resulting embedding space can therefore be interpreted as a semantic space in which proximity reflects experiential similarity, without reliance on predefined categories or labels.
3.5. Quantitative Analysis of the Embedding Space
The analysis relied on normalized vector representations generated by a transformer-based encoder. Before any further processing, vector norms were checked to make sure that all embeddings were numerically comparable and that no residual magnitude effects remained. Cosine similarity was then used to examine relationships between review embeddings. Rather than inspecting individual text pairs, the focus was placed on the overall distribution of similarity values across the space. This provided a general view of how semantic proximity is structured in the corpus. To inspect how variance is spread across dimensions, principal component analysis (PCA) was applied to the normalized embeddings. In parallel, k-means clustering was used to explore several values of k. No assumptions were made about the existence of predefined semantic groups, and cluster validity was evaluated using standard internal metrics.
5. Discussion
5.1. Interpretation of Semantic Structure in Visitor Experience
The results of this study demonstrate that the internal organization of transformer-based semantic embeddings does not align with how their outputs are commonly interpreted in applied research and practice. In tourism studies and protected area management, semantic embeddings are frequently used to segment visitor experiences into stable categories or types. The findings presented here challenge this assumption by showing that visitor experiences in national parks are not naturally organized into clearly separable groups within the embedding space. Empirical analysis reveals that positive and negative evaluations, appreciation of natural landscapes, and criticism of infrastructure or management frequently coexist within the same textual narratives. Rather than simplifying or resolving this complexity, transformer-based models preserve it as continuous, overlapping semantic configurations. The absence of strong cluster separation and the weak explanatory power of low-dimensional projections indicate that experiential meaning is distributed across gradual semantic gradients rather than discrete experiential classes.
Misinterpretation arises when analytical techniques that assume rigid boundaries—such as hard clustering or categorical segmentation—are imposed on inherently continuous semantic spaces. While such approaches may yield apparently clear results, they introduce artificial structure that does not reflect the underlying organization of visitor discourse. As a result, analytical clarity is achieved at the expense of experiential nuance. This mismatch between data structure and interpretative practice represents a central empirical insight of the study.
5.2. Theoretical Implications
From a theoretical perspective, the findings contribute to ongoing debates in tourism and experience research by questioning frameworks that conceptualize visitor experience as stable, discrete, and classifiable. The observed semantic continuity supports theoretical views that treat experience as relational, fluid, and context-dependent rather than as a fixed set of attributes or types. The study shows that experiential meaning emerges through overlapping evaluations and situational trade-offs rather than through clearly defined experiential states. By demonstrating that semantic embeddings encode ambivalence, simultaneity, and transition rather than categorical separation, the results call into question the theoretical validity of rigid segmentation models when applied to experiential narratives. This contributes to a more nuanced understanding of the visitor experience as dynamically constructed through the interaction among environmental conditions, individual expectations, and situational context.
5.3. Methodological Implications
Methodologically, the study highlights the need to reconsider how transformer-based semantic embeddings are used in quantitative text analysis. Embeddings are often treated as neutral feature generators, with their validity inferred from downstream predictive performance. The results presented here demonstrate that such an assumption is insufficient, as the internal numerical and geometric properties of embedding spaces strongly shape analytical outcomes. The analysis shows that similarity distributions, variance concentration, and clustering behavior provide critical diagnostic information about how meaning is structured within the embedding space. When these properties are ignored, common analytical practices—such as unsupervised clustering or low-dimensional visualization—risk imposing artificial order on continuous semantic structures. This can lead to overinterpretation of weak patterns and to unwarranted claims about the existence of distinct experiential groups.
The proposed framework, therefore, contributes a methodological layer that precedes task-specific modeling. By examining numerical stability, semantic continuity, and spatial organization before applying predictive or classificatory models, researchers can better assess whether their analytical tools are appropriate to the data’s structure. This approach promotes more transparent, cautious, and structurally informed use of AI-based semantic models.
5.4. Practical Implications for Protected Area Management
The findings have direct implications for the use of textual analytics in national park management and similar applied contexts. Visitor experience management is often framed in terms of identifying satisfied and dissatisfied groups or segmenting visitors into distinct categories. The results suggest that such an approach may overlook important transitional and mixed experiences that are distributed across the semantic space. These transitional experiences often function as early indicators of emerging issues, such as crowding pressures, accessibility conflicts, or mismatches between visitor expectations and actual conditions. Analytical frameworks that rely exclusively on categorical interpretation may fail to detect such signals. By contrast, examining semantic continuity and overlap allows for a more sensitive understanding of experiential dynamics and supports more adaptive and responsive management strategies.
5.5. Limitations and Future Research Directions
Several limitations of the study should be acknowledged. First, the empirical analysis is based on a single corpus of English-language national park reviews, which limits direct generalization to other languages, cultural contexts, or forms of experiential discourse. Second, the study focuses on a specific class of transformer-based sentence embeddings; alternative architectures or dimensionalities may exhibit different geometric properties. Third, the analysis is exploratory and diagnostic in nature and does not assess causal relationships or predictive accuracy. Future research could extend the proposed framework across different domains, languages, and embedding models to evaluate the robustness of the observed structural properties. Longitudinal datasets could be used to examine how semantic spaces evolve over time, while comparative studies could explore how different embedding architectures shape experiential representation.
6. Conclusions
This study contributes to the quantitative analysis of unstructured textual data by advancing a methodological perspective that treats transformer-based semantic embeddings as analyzable high-dimensional metric spaces rather than as opaque inputs to downstream predictive models. By systematically examining numerical stability, similarity structure, variance distribution, and clustering behavior, the research demonstrates that semantic embeddings encode experiential meaning through continuous and overlapping configurations rather than through, overlapping configurations rather than discrete, naturally separable categories. The findings highlight that strong predictive performance alone is not sufficient to justify the analytical use of semantic embeddings in contexts where interpretability and decision relevance are critical. Instead, explicit examination of the internal numerical and geometric properties of embedding spaces is necessary to ensure that subsequent analyses are grounded in the actual structure of the data. The proposed framework provides a generalizable approach for conducting such diagnostics prior to task-specific modeling.
From an applied perspective, the study underscores the importance of cautious interpretation when using AI-based text analytics to support decision-making in national park management and related domains. Simplified categorical representations of visitor experience may obscure transitional, mixed, or emerging patterns that are essential for adaptive and sustainable management. Integrating representational diagnostics with task-oriented analytics enables more nuanced and responsible use of visitor-generated textual data. The framework introduced in this study is not limited to tourism research and can be applied across domains where large-scale unstructured text is used to inform quantitative analysis. Future research may extend this approach to alternative embedding architectures, multilingual corpora, and longitudinal datasets in order to further assess the robustness and generality of the observed structural properties. By shifting attention from model outputs to the structure of meaning within representations, this study supports more interpretable, reliable, and accountable integration of AI-based text analytics into empirical research and practice.