Next Article in Journal
Digitalization and Supply Chain Carbon Performance: The Role of Focal Firms
Previous Article in Journal
Exploring Factors Influencing Patients’ Intention to Adopt Generative AI on Online Healthcare Platforms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

From Big Data to Cultural Intelligence: An AI-Powered Framework and Machine Learning Validation for Global Marketing

Basic Science Research Institute, Chungbuk National University, Cheongju-si 28644, Republic of Korea
J. Theor. Appl. Electron. Commer. Res. 2025, 20(4), 288; https://doi.org/10.3390/jtaer20040288
Submission received: 6 September 2025 / Revised: 16 October 2025 / Accepted: 21 October 2025 / Published: 22 October 2025

Abstract

This research addresses the ‘cultural blind spot’ in Big Data and AI, where algorithms treat global user-generated content monolithically, fostering biased marketing models. It proposes a dynamic ‘contextual value amplification’ framework, integrating Impression Management and Construal Level Theories. The study argues that service context—luxury versus budget—systematically reconfigures how cultural values are expressed in online customer reviews. A dual-method approach was applied to 284,746 negative hotel reviews. First, a high-dimensional fixed-effects model provided evidence for ‘cultural complaint signatures’ and revealed a novel mechanism: the luxury context amplifies individualists’ focus on relational Service but dampens their focus on transactional Value. Second, an XGBoost model offered computational validation. Including these theoretically derived features improved the model’s ability to classify a reviewer’s cultural orientation by over 220%. The study proposes a dynamic, context-contingent theory of cross-cultural expression, offers a methodological template fusing econometrics and machine learning to mitigate bias, and advances a conceptual framework for ‘Cultural Intelligence’.

1. Introduction

The integration of Big Data and Artificial Intelligence (AI) has instigated a paradigm shift in marketing, fundamentally altering how firms manage customer relationships [1] and extract insights from the digital “voice of the customer.” This revolution, fueled by vast quantities of user-generated content (UGC) from online customer reviews (OCRs), has reshaped the business landscape [2]. OCRs now serve as a primary information source for consumers, significantly influencing their decision-making and purchase intentions [3,4]. Recognizing this, firms invest heavily in analyzing this data [5], assuming it holds the key to optimizing global marketing. Electronic word-of-mouth (eWOM), encompassing OCRs, is recognized for its speed, reach, and influence, often exceeding traditional WOM [2,6,7].
However, as firms scale these data-driven strategies globally, they encounter a critical and largely unexamined flaw: the ‘cultural blind spot’ [8]. The dominant practice in marketing analytics often treats global Big Data as a culturally monolithic resource [9], ignoring the deep-seated cultural schemas that systematically shape how consumers perceive, evaluate, and articulate their experiences [10,11]. This oversight presents significant challenges, as algorithms trained on culturally skewed data risk learning a biased model of service quality, leading to flawed global strategies [12,13].
Addressing this requires moving beyond purely data-driven approaches to re-engage with foundational theories of cultural influence. A nascent but growing stream of research confirms that culture shapes UGC [14], influencing everything from review valence and emotionality to the specific product aspects discussed [15,16]. For instance, studies have found that Asian guests are more likely to complain about service failures, whereas non-Asian guests focus more on room issues [17]. Similarly, Chinese reviewers tend to focus on product functionality, while American reviewers prioritize usability [18]. Culture also moderates the relationship between eWOM and market share [19] affects how consumers complain after service failures [20,21].
Despite these insights, this body of work has key limitations. First, it often focuses on dyadic country comparisons, which limits generalizability [18]. Second, and more critically, it has predominantly documented the static main effects of culture [14,17], failing to provide a unifying framework that explains how and why context alters cultural expression [22]. A significant portion of this research relies on Hofstede’s [23] cultural dimensions, a framework that has faced justifiable criticism for its potential obsolescence and questionable generalizability [24,25]. This has prompted urgent calls for alternative theoretical frameworks that can better capture the nuances of digital consumer behavior [8,26]
This study addresses these gaps by proposing and validating a framework of contextual value amplification. I integrate three theoretical lenses—foundational cultural theory [27], Impression Management (IM) Theory [28,29], and Construal Level Theory (CLT [30])—to argue that the service context (e.g., budget vs. luxury) systematically reconfigures which culturally prescribed values become most salient. The act of writing a review is a public performance aimed at managing impressions, and CLT provides the mechanism explaining how psychological distance associated with the context alters the level of abstraction, thereby amplifying or dampening the expression of these values. This leads to my primary research questions:
  • RQ1: How does a user’s national culture, through its influence on their impression management motivations, systematically predict the substantive topics they focus on in negative online reviews?
  • RQ2: How is this relationship moderated by the service-market context (i.e., budget vs. luxury), potentially via context-induced shifts in construal level?
To answer these questions, this research employs a multi-stage, AI-powered design on a dataset of 284,746 negative hotel reviews. This approach first leverages advanced, semi-supervised AI (Seeded LDA) to quantify cultural expressions from raw text, then fuses high-dimensional fixed-effects (HDFE) econometric modeling for robust identification [31] with an XGBoost machine learning model for predictive validation [32]. This study offers three primary contributions. First, I advance a dynamic, context-contingent theory of cross-cultural digital expression, moving beyond the static explanations that have dominated the literature [33]. Second, I provide a methodological template for mitigating algorithmic bias [13] by integrating theory-driven modeling with predictive machine learning. Finally, my work presents a computationally supported framework that offers a pathway for firms to advance from culturally blind Big Data [34] to actionable ‘Cultural Intelligence’ [35], offering a sustainable competitive advantage [36].

2. Theoretical Framework and Hypotheses Development

An effective predictive model is not built on data alone; its explanatory power and validity are maximized when grounded in a robust theoretical framework [35]. The successful fusion of established management theory with data science is critical for moving from correlation to causation and from prediction to actionable insight [37]. Accordingly, this section develops the theoretical underpinnings of my predictive framework by integrating three powerful and complementary lenses: foundational cultural theory, IM Theory, and CLT (See Figure 1). First, I establish the link between cultural schemas and the impression management goals that motivate online expression. I then introduce CLT as the mechanism explaining how the service context systematically moderates this expression.

2.1. Cultural Schemas, Impression Management, and Online Consumer Expression

My framework begins with the premise that the content of UGC is a form of culturally scripted impression management. Culture, the “collective programming of the mind” [23], functions as a cognitive schema that automatically guides an individual’s attention, perception, and behavior [10,11]. These cultural values have a profound impact on consumer behavior, from expectations and satisfaction to complaining intentions [3,21,38]. In the digital sphere, these schemas do not disappear; they are materialized in the text of online reviews [14,25], shaping how consumers from different backgrounds express their satisfaction or dissatisfaction [17].
The act of writing a review is not merely a passive report of an experience; it is a public performance enacted for a vast, often anonymous audience [35,39]. Consumers act as social actors engaged in IM—the goal-directed process of controlling information to influence how others perceive them [28,29]. This strategic self-presentation is deeply intertwined with a desire to project a particular social identity [40,41]. The motivations for engaging in eWOM are complex, ranging from concern for other consumers and self-enhancement to seeking social benefits and venting negative feelings [42].
A robust body of research categorizes self-presentation tactics as either acquisitive (assertive), aimed at gaining social approval, or protective (defensive), aimed at avoiding social disapproval [43,44]. The crucial link, and the foundation of my framework, is that these IM strategies are powerfully scripted by culture. Cross-cultural psychology provides compelling evidence that cultural values systematically predict a preference for one style over the other. Individualistic cultures, which value uniqueness and self-expression, tend to foster acquisitive self-presentation styles aimed at showcasing individual competence and asserting one’s rights [45,46]. In contrast, cultures that prioritize group harmony and “face,” such as many collectivistic societies, encourage protective self-presentation to avoid disrupting social relations or appearing incompetent [47,48,49]. This is consistent with findings from the OCR literature that consumers from collectivist cultures are less likely to deviate from average ratings and are more hesitant to express extreme emotions [50,51,52].
While a burgeoning stream of research using UGC provides fragmented evidence of these culturally scripted IM strategies [17,18,53], it has often lacked a unifying theoretical framework, proceeding with ad hoc hypotheses [2]. I address this gap by proposing that the acquisitive-protective IM dichotomy serves as the core organizing mechanism linking abstract cultural values to concrete online expressions. This IM-based framework allows me to move beyond simply describing differences and toward explaining them as rational, goal-directed behaviors [35].
To provide a test of this framework, I adopt a focused theoretical approach, deliberately selecting the cultural dimensions and service attributes most relevant to this acquisitive-protective dichotomy. Specifically, I focus on Individualism (IND) as the primary driver of acquisitive, self-enhancing IM [54], and on Masculinity (MAS) and Long-Term Orientation (LTO) as key drivers of protective, pragmatic IM [47]. This choice is not arbitrary; these dimensions map cleanly onto the core pillars of a service evaluation. Complaining about relational Service and transactional Value serves the acquisitive goal of asserting one’s rights and discernment, consistent with the emphasis on the independent self-concept in individualistic cultures [10,21]. A service failure can be perceived as a deprivation of power, prompting a desire to complain to regain a sense of control [20,55]. This aligns with research showing that customers in individualistic cultures have higher expectations and complain more directly [56,57].
In contrast, complaining about functional Room Features and pragmatic Value serves the protective goal of demonstrating pragmatism and prudence. This aligns with the achievement-oriented logic of MAS, where functional failures impede success [58,59], and the investment-oriented logic of LTO, where value failures represent a poor use of resources [60]. This is consistent with findings that consumers from these cultures focus more on functionality and utility in their evaluations [61].
I deliberately exclude other dimensions like Power Distance (PD) and Uncertainty Avoidance (UA) from my main effects hypotheses on content, as prior work suggests they primarily influence the style or propensity of complaining rather than its substantive focus [21,62]. While these dimensions are controlled for in my model, my focus remains on the dimensions most directly linked to the content of impression management. This focused approach allows for a clean, robust test of my central theoretical proposition: the existence of distinct cultural complaint signatures driven by divergent impression management goals.

2.2. A Construal Level Theory Perspective

The influence of culture and its associated impression management goals is not monolithic; it is powerfully shaped by situational cues that can activate or reconfigure the expression of underlying values [21,63]. While international marketing has long acknowledged the importance of context [64], the specific psychological mechanisms through which marketing contexts moderate cultural expression in digital environments remain undertheorized [35]. I introduce CLT as the central mechanism of my framework to explain this dynamic process of contextual value amplification.
CLT posits that individuals can represent events at varying levels of abstraction, from concrete and detail-oriented (a low-level construal) to abstract and essential (a high-level construal) [30]. The specific level of construal is systematically driven by psychological distance. Events perceived as psychologically distant—in time, space, social circles, or likelihood—trigger high-level construals. This abstract mindset causes people to focus on the core, desirable features of an event and ask “why” it is important, evaluating its ultimate goals and values [65]. Conversely, psychologically proximal events trigger low-level construals, causing a focus on concrete, feasibility-related details and the mechanics of “how” an event will unfold [66].
I argue that budget and luxury service contexts systematically evoke different construal levels. A budget hotel stay is often a utilitarian transaction, making it psychologically proximal [67]. The consumer’s focus is typically on concrete, feasibility-related questions: ‘Is the bed clean? How do I get to the city center?’ This low-level construal prioritizes the functional and instrumental attributes of the service, such as cleanliness and basic room features, which are common sources of complaints for lower-class hotels [17,68].
In contrast, a luxury hotel stay is inherently psychologically distant. This distance is multifaceted. It is socially distant, associated with a high-status, aspirational reference group [69], and often temporally distant as a non-routine, special occasion [18]. More critically, it is hypothetically distant, deeply intertwined with a consumer’s ideal self-concept and personal values rather than their everyday, actual self [35,70]. The evaluation shifts from the “how” of execution to the “why” of the experience—Why am I having this experience, and what does it say about who I am?—reliably triggering a high-level construal [65]. This aligns with the concept of the “customer journey,” which encompasses a customer’s cognitive, emotional, and social responses throughout their interactions with a firm [35,71].
This shift to a high-level construal is not trivial; it fundamentally reorients the basis of judgment. As CLT research robustly demonstrates, an abstract mindset acts as a cognitive spotlight, illuminating an individual’s core values and making them the primary lens through which the experience is interpreted and judged [72,73]. Abstract principles that define the self (e.g., ‘I deserve respect,’ ‘I am a discerning person’) become paramount. Service failures in this context are no longer mere inconveniences but are perceived as significant threats to one’s self-concept and social standing [20,74]. Therefore, I predict that the psychologically distant luxury context will cause a consumer’s underlying cultural schemas and their associated impression management goals to “speak louder.” This cognitive shift will reconfigure how consumers articulate a service failure, leading them to prioritize complaints that align with their most central, abstract, and culturally prescribed values. This provides a mechanism to move beyond the static, main-effects findings of prior cross-cultural UGC studies [14,25] toward a more dynamic and interactionist perspective.

2.3. Hypotheses Development

My theoretical framework posits that cultural values shape the content of online complaints through the mediating mechanism of impression management goals, and that this relationship is systematically moderated by the service context, which induces different levels of psychological construal. I develop my specific hypotheses below.
  • The Main Effects of Culture-Driven Impression Management
The relational and transactional schemas central to high-IND cultures foster an acquisitive impression management style aimed at asserting one’s status as a discerning and respected consumer [54]. The relational facet, rooted in the independent self-concept, demands dignity and respect, making a failure in Service a direct interpersonal affront that warrants a vocal complaint to assert one’s rights and regain a sense of personal power [10,20,55]. The transactional facet demands a fair exchange, making a failure in Value a violation of equity that must be called out to signal one’s astuteness and avoid being exploited [17,75]. This dual focus is strongly supported by a growing body of UGC research showing that consumers from individualistic cultures prioritize these aspects in their online reviews and complaining behaviors [17,18,21,38]. Therefore, I hypothesize:
Hypothesis 1a:
A reviewer’s Individualism is positively associated with a greater focus on Service and Value, but not on Room Features, in negative online reviews.
In sharp contrast, the instrumental schemas of high-LTO and high-MAS cultures foster a protective impression management style aimed at demonstrating practicality and prudence [47]. From this perspective, a service is a means to a functional end [35]. A failure in tangible Room Features is thus a critical impediment to performance, violating the achievement-oriented logic of MAS where the focus is on success and tangible rewards [58,59]. Similarly, a failure in Value is viewed as a “bad investment,” a misallocation of resources that runs counter to the pragmatic, future-oriented mindset of LTO [60]. Complaining about these functional and economic failures serves the protective IM goal of signaling competence and fiscal responsibility [76]. This linkage is consistent with findings that consumers from these cultures focus more on functionality and utility in their evaluations [17,61]. Thus, I propose:
Hypothesis 1b:
A reviewer’s Long-Term Orientation and Masculinity are positively associated with a greater focus on Value and Room Features, but not on Service, in negative online reviews.
  • The Moderating Effect of Context-Induced Construal Level
My framework suggests that the luxury context does not merely amplify all cultural expressions uniformly. Instead, by inducing a high-level construal, it triggers a cognitive re-prioritization of evaluative criteria [35], an effect I argue is most pronounced for the abstract, self-centric schema of Individualism.
As established by CLT, the psychological distance of the luxury context makes core personal values more salient guides for judgment [73]. For a high-IND consumer, this transforms the meaning of a service failure. The abstract value of personal respect becomes the central grievance. A failure in Service is no longer a simple inconvenience but is interpreted as a significant, abstract threat to their self-concept and social standing [74,77]. The acquisitive IM goal thus shifts from demonstrating transactional savvy to affirming one’s high status, which in turn amplifies the complaint focus on the relational aspect of Service. This aligns with research indicating that high-class hotel guests complain more about service-related issues [17,78].
Hypothesis 2a:
The positive relationship between a reviewer’s Individualism and their focus on Service will be significantly stronger (amplified) in the luxury context than in the budget context.
Conversely, in this high-level construal state, the concrete, feasibility-related concern of Value becomes cognitively secondary. As CLT predicts, when consumers focus on abstract desirability rather than concrete feasibility [30], their attention shifts away from pragmatic considerations like price. Furthermore, complaining about price in a luxury context is antithetical to the newly salient IM goal of signaling high status [69]. This cognitive shift away from concrete details and the recalibration of IM goals should therefore dampen the focus on Value. This is consistent with findings that while guests of high-class hotels complain more about service, they are less focused on functional attributes like room quality or cleanliness compared to guests of low-class hotels [17].
Hypothesis 2b:
The positive relationship between a reviewer’s Individualism and their focus on Value will be significantly weaker (dampened) in the luxury context than in the budget context.
In contrast, I propose that the evaluative frameworks for LTO and MAS, while also containing abstract goals like ‘prudence’ or ‘achievement,’ are less susceptible to moderation by the luxury context due to their primarily instrumental nature. For these cultures, a service experience—even a luxury one—is often viewed as a means to an external end (e.g., a successful business trip, a prudent family investment). The evaluative focus thus remains on the functional and pragmatic aspects that facilitate that external goal, such as Room Features and Value [17,50]. This differs fundamentally from the IND schema, where values of self-concept and social standing are intrinsically tied to the experience itself [35]. For a high-IND consumer, the luxury context is not merely a service but a stage for identity affirmation. Therefore, the shift to a high-level construal in this context makes the IND schema, which is centered on the self, uniquely sensitive to abstract, identity-relevant threats like poor service. The instrumental schemas of LTO and MAS, however, are less likely to be re-prioritized by this specific identity-focused construal. Therefore, I do not predict a significant moderation effect for these cultural dimensions.

3. Research Methodology

To test my hypotheses and validate the framework, I adopt a two-stage analytical strategy that combines econometric modeling for robust identification with machine learning for predictive validation. This dual approach allows me to not only explain the underlying drivers of cultural expression but also to test the real-world predictive power of my theoretical framework, a form of methodological triangulation that enhances the robustness and practical relevance of the conclusions [79].

3.1. Data and Sample

My research required a large-scale, global dataset of authentic consumer opinions from a context characterized by high cultural variance and a reliance on eWOM. The international hotel industry, with TripAdvisor.com as its largest eWOM platform, provides an ideal setting for this inquiry [17]. The resulting dataset exhibits the core “5V” characteristics of Big Data: high Volume, Variety, and Velocity, from which my analysis is designed to extract Value while ensuring Veracity [80].
The data collection and sample construction involved several systematic steps. I began by crawling an initial dataset of over 10 million unique reviews from TripAdvisor. To isolate cultural effects from linguistic ones, I first filtered for reviews written in English, a standard approach as English functions as the de facto lingua franca of global platforms [3,17]. I explicitly acknowledge that this filter may introduce a selection bias towards cultures with higher English proficiency; I demonstrate this bias directly with a formal check in Appendix B. After removing observations with missing data for key variables (e.g., reviewer’s home country), I obtained a clean dataset of 3,965,825 reviews.
From this large dataset, I implemented a final theoretical sampling step. As my framework posits that cultural values are most saliently expressed during potent events like service failures, which can lead to significant dissatisfaction and complaining behaviors [20,21], I focused my analysis on the content of articulated dissatisfaction. I therefore filtered for negative experiences, defined as reviews with a star rating of one or two (out of five) [17]. This yielded my final analytical sample of 284,746 negative reviews for 21,644 unique hotels, providing substantial statistical power and enhancing the generalizability of my findings.

3.2. Measurement of Variables

Dependent Variables: Complaint Focus. To transform the unstructured review text into quantitative variables, I employed Seeded Latent Dirichlet Allocation (Seeded LDA), an advanced, AI-driven semi-supervised topic modeling approach [81]. Unlike purely unsupervised LDA, which can produce topics that are difficult to interpret or theoretically irrelevant [82], this method allows my analysis to be guided by pre-defined seed words derived directly from my theoretical framework [76]. This ensures the resulting topics align with my constructs of Service, Value, and Room Features, thereby increasing construct validity. This approach builds on prior research that has successfully used LDA and other text mining techniques to extract meaningful themes from online reviews [17,18]. The model outputs a per-document topic distribution, with my dependent variable being the proportion of each topic in a given review. A detailed description of this process and the seed word list is provided in Appendix A.
Independent and Moderating Variables. My primary independent variables are the three dimensions of national culture from Hofstede’s [58] framework that align with my theoretical model: IND, MAS, and LTO. The systematic protocol for sourcing and assigning these national cultural scores to the dataset is detailed in Appendix C. While I acknowledge the risk of the ecological fallacy in applying national-level scores to individual behaviors [83], this framework remains the most robust and widely used tool for large-scale, cross-country analysis where individual-level measures are not feasible [84]. Its continued relevance in eWOM and service research is well-documented [21]. My key moderating variable, Hotel Market Tier, was operationalized using the hotel’s star rating, a salient and widely used heuristic for service context and quality. I recognize this is a proxy and discuss its limitations in Section 5.3.
Control Variables: To isolate the effect of culture, I include a comprehensive set of control variables at both the review and country levels. At the review level, I control for structural text characteristics such as Word Count (WC) and Words Per Sentence (WPS). WC accounts for the fact that longer reviews may naturally cover more topics, ensuring my dependent variable captures topical focus rather than mere verbosity, while WPS accounts for variations in linguistic complexity [26]. I also include the review date to control for time-varying factors. Furthermore, to account for linguistic style, I incorporate features from the Linguistic Inquiry and Word Count (LIWC) dictionary, specifically controlling for Analytic thinking to disentangle cognitive style from complaint content, and for emotional Tone to isolate the effect of cultural values from the intensity of the language used [85]. Finally, at the country level, I include GDP per capita and all other Hofstede cultural dimensions to ensure my variables of interest are not capturing the confounding effects of economic development or other cultural traits [25].

3.3. Analytical Strategy: A Two-Stage Approach

3.3.1. Stage 1: Econometric Modeling for Robust Identification

A primary challenge in cross-cultural research using observational data is omitted variable bias. To overcome this, my primary specification is a high-dimensional fixed-effects (HDFE) model. The model is specified as:
Topic_d,c = β1Culture_c + X’_d,cβ2 + α_chain + α_traveler + ε_d,c
Here, Topic_d,c is the proportion of a given topic in review d from a reviewer from country c. Culture_c is the vector of cultural dimension scores for country c, and X’_d,c is a vector of all control variables.
The key terms are α_chain and α_traveler, which represent full sets of hotel brand chain fixed effects and traveler type fixed effects, respectively. By including dummy variables for each hotel chain (e.g., ‘Marriott’, ‘Hilton’) and traveler type (e.g., ‘Family’, ‘Business’) associated with the review, this specification absorbs all time-invariant characteristics common to those groups. α_chain absorbs chain-specific factors, whether observed (e.g., brand standards, global marketing, quality positioning) or unobserved (e.g., underlying service philosophy), while α_traveler absorbs factors common to that travel purpose (e.g., shared expectations, price sensitivity).
This powerful identification strategy means that the coefficient of interest, β1, is estimated by comparing reviews from different cultures within the same hotel chain and from the same traveler type, effectively differencing out a vast array of confounding factors at these levels [31].

3.3.2. Stage 2: Machine Learning for Predictive Validation

To validate my framework’s practical utility, I built an XGBoost machine learning model, a state-of-the-art gradient boosting algorithm known for its high performance and efficiency with large, complex datasets [32]. The primary goal of this stage was not to build a perfect classifier, but to use a classification task as a robust method for computational validation, specifically testing whether the ‘cultural complaint signatures’ identified in my econometric analysis possess real-world predictive power [17,76].
The dependent variable for this binary classification task was a reviewer’s cultural orientation, specifically high-Individualism versus low-Individualism. I focused on IND as it is a central construct in my theory. To create this categorical target variable, I converted the continuous IND score using the sample median (89.00) as a threshold.
To isolate the predictive value of my theoretical constructs, I designed a comparison of three models based on their features. The Baseline Model used only stylistic and structural variables (e.g., Word Count, LIWC features), establishing a performance benchmark based on how a review is written. The Full Model then added my theoretically derived topic proportions (Service, Value, Room Features) to the baseline features. The central test of my framework lies in the performance lift from the Baseline to the Full Model; a significant improvement would confirm that the substantive content (what is written), as conceptualized by my theory, adds meaningful predictive power. Finally, the Optimized Model used the same features as the Full Model but applied a class weight to address the data imbalance. Given this imbalance, I evaluate the models not only on Accuracy but also on more robust metrics like Cohen’s Kappa and Specificity (the true negative rate), which are better suited for assessing classification skill on imbalanced datasets [86].

4. Results

This section presents the findings from my two-stage empirical analysis. I first report the results from the econometric models testing my hypotheses about the main and moderated effects of culture on complaint focus. I then present the results of the machine learning validation, which assesses the real-world predictive power of my theoretical framework.

4.1. Econometric Modeling Results

Table 1 presents the descriptive statistics for the key variables. The data shows substantial variance in both cultural values and review characteristics, rendering it suitable for investigating my research questions. As noted and detailed in Appendix B, the sample-weighted mean for Individualism is 72.64, significantly higher than the global population average. This confirms a notable selection bias, often observed in studies using English-language global platforms, toward more individualistic cultures, underscoring the importance of my analytical approach that controls for baseline differences. Regarding my dependent variables, complaints pertaining to Value (mean = 0.30) represent the most frequent component of articulated dissatisfaction. Complaints focusing on relational Service (mean = 0.06) and functional Room Features (mean = 0.06) are less common but still constitute a significant portion of consumer grievances, aligning with prior findings.
Table 2 summarizes the results from the HDFE model testing my main effects hypotheses (H1a and H1b). The findings offer strong support for my theory of culturally scripted impression management. In full support of H1a, Individualism is positively and significantly associated with a focus on both Service (β = 0.00006, p < 0.01) and Value (β = 0.00017, p < 0.05). This aligns with my argument that individualistic consumers employ an acquisitive IM strategy, emphasizing relational respect (Service) and transactional fairness (Value) to assert their rights.
Hypothesis 1b, which predicted that Long-Term Orientation and Masculinity would be positively associated with a focus on Value and Room Features driven by protective IM goals, received partial support. As predicted, both MAS (β = 0.00009, p < 0.001) and LTO (β = 0.00014, p < 0.001) were positively and significantly associated with a focus on Room Features, consistent with the emphasis on functionality in these cultural orientations. LTO also showed the expected positive association with Value (β = 0.00018, p < 0.001), reflecting a pragmatic concern for resource utilization. However, contrary to my hypothesis, MAS was not significantly associated with Value complaints. More notably, unexpected and significant negative relationships emerged for both MAS (β = −0.00006, p < 0.001) and LTO (β = −0.00008, p < 0.001) in relation to Service complaints. This suggests that consumers from cultures high in these dimensions not only prioritize functional aspects but may also be significantly less inclined to articulate complaints about relational service failures.
While the estimated coefficients are small in absolute magnitude, they represent statistically significant shifts in expressive focus across a massive dataset, estimated with high precision.
Table 3 presents the results of the interaction model testing my moderation hypotheses (H2a and H2b). I find strong support for my proposed contextual re-prioritization mechanism. In support of H2a, the interaction term between IND and hotel star rating is positive and significant in predicting a focus on Service (β = 0.00003, p < 0.01). This indicates that the effect of Individualism on relational complaints is significantly amplified in the luxury context, consistent with CLT’s prediction that psychological distance heightens the salience of abstract, core values like personal respect.
In support of H2b, the interaction effect between IND and hotel star rating is negative and significant for Value complaints (β = −0.00006, p < 0.05). This demonstrates that the individualistic consumer’s focus on transactional value is significantly dampened in the luxury context. This aligns with CLT’s prediction that abstract construals shift focus away from concrete feasibility concerns (like price) towards desirability, and is also consistent with the IM goal of signaling high status in luxury settings.
Consistent with my theory, the effects of LTO and MAS on complaint focus were not significantly moderated by the hotel star rating. These findings collectively support the contextual value amplification framework, suggesting that the service environment dynamically shapes how cultural values are expressed, particularly for abstract, self-relevant cultural schemas like Individualism.

4.2. Machine Learning Validation Results

Having established the systematic relationship between culture, context, and complaint content, I next sought to validate the practical predictive utility of my framework using machine learning. I tested whether the textual features derived from my theory (i.e., the topic proportions for Service, Value, and Room Features) could significantly improve the prediction of a reviewer’s cultural orientation. The purpose of this analysis was not to build a production-level classifier, but rather to perform a computational validation of the theory itself—testing if the identified ‘cultural signatures’ contain a meaningful predictive signal beyond mere stylistic variation. Table 4 and Figure 2 present the performance comparison of the three XGBoost models.
The Baseline Model, utilizing only stylistic and structural features (e.g., Word Count, LIWC Tone), achieved a high accuracy of 0.7547. However, its very low Cohen’s Kappa (0.0600) and Specificity (0.0425) reveal this performance was largely illusory. The model predominantly predicted the majority class, failing to identify the minority class—a common pitfall in models trained on imbalanced data [17].
The Full Model, which incorporated my theoretically derived topic proportions alongside the baseline stylistic features, demonstrated a marked improvement in true classification skill. While accuracy remained similar (0.7566), Kappa increased substantially to 0.1014 (a 69% improvement), and Specificity more than doubled to 0.0858. This indicates that the substantive content of the review, as conceptualized by my framework, contains significant predictive power regarding the reviewer’s cultural background, beyond simple stylistic cues.
Finally, the Optimized Model, which applied a class weight to the Full Model’s features to counteract the data imbalance, reveals the framework’s true classification potential. As expected, accuracy decreased to 0.6399; this reflects a necessary trade-off, moving away from simply guessing the majority class towards genuinely distinguishing between the classes. In exchange, the model’s ability to correctly identify the target minority class (high-IND reviewers) improved dramatically. Specificity surged to 0.5578, and Kappa, the measure of classification skill beyond chance agreement, increased significantly to 0.1925—more than tripling (a 220% increase) the Kappa score of the Baseline Model.
This final result provides strong computational validation for the practical relevance of my theoretical framework. While the absolute predictive performance of the Optimized Model (Kappa = 0.1925) indicates only ‘slight agreement’ and remains modest for reliable individual-level classification [86], the key insight lies in the significant relative improvement achieved by incorporating the theory-derived topic features. The tripling of the Kappa score demonstrates that the ‘cultural complaint signatures’ identified in my econometric analysis possess tangible predictive value. This confirms that my framework captures a robust cultural signal within the review text, sufficient to enhance classification skill well beyond chance and stylistic markers, thereby validating the utility of integrating cultural theory into predictive analytics.

5. Discussion

This research confronted the critical ‘cultural blind spot’ in modern marketing analytics, a domain where the scale of Big Data has often obscured the nuance of human culture [8,35]. My two-stage analysis provided a robust empirical answer to how culture manifests in the digital sphere and how this expression is dynamically shaped by context. My econometric models did not simply confirm that culture matters; they identified specific “cultural complaint signatures,” such as the link between Individualism and relational Service failures [17,18], and uncovered a novel mechanism of contextual re-prioritization, demonstrating that the influence of culture is not static but elastic, systematically reconfigured by the service environment [22]. The subsequent machine learning models provided strong computational validation, confirming that these theoretical signatures possess significant and actionable predictive power [17]. This chapter now turns to a deeper discussion of the theoretical contributions, strategic implications, and future directions stemming from these findings.

5.1. Theoretical Contributions

This study makes three primary and interlocking theoretical contributions that collectively advance the fields of international marketing, cross-cultural consumer behavior, and information systems.
First, I advance a dynamic, integrated framework for cross-cultural digital expression, moving beyond the static, main-effects explanations that have characterized prior research. The existing literature, while valuable, has largely documented the main effects of culture on UGC, often presenting culture as a stable, monolithic force [14,17,25]. My research provides a critical evolution by proposing and validating a micro-level psychological mechanism that explains the situational contingency of cultural expression. By synthesizing cultural schemas [10], Impression Management Theory [40], and Construal Level Theory [30], I answer the call for a more unifying, motivation-based framework [2]. It reveals an “elasticity” in cultural expression, whereby the luxury context systematically amplifies abstract, identity-relevant grievances while dampening concrete, functional ones for individualistic consumers. This finding contributes a more sophisticated, interactionist perspective to the literature [20] and directly responds to calls for alternative theoretical lenses to explain complex digital behaviors [8,26].
Second, I offer a normative framework for mitigating algorithmic bias by fusing theory-driven econometric modeling with predictive machine learning. The uncritical application of AI to global UGC risks creating a new generation of culturally biased technologies [12,13]. My two-stage methodology provides a direct corrective. By first using a high-dimensional fixed-effects model to isolate the systematic effect of culture from a vast array of confounders [31], I generate a cleaner, theory-consistent signal. This signal then serves as a high-quality input for the XGBoost model, enabling more accurate and less biased predictions. This approach demonstrates a pathway to escape the “garbage-in, garbage-out” problem that plagues atheoretical machine learning and responds to urgent calls for more responsible and transparent AI in marketing [3]. It serves as a real-world example of the powerful synergy between management theory and data science that scholars have advocated for [37], moving beyond using machine learning simply for prediction to using it for theory validation.
Finally, my work contributes to the discussion of ‘Cultural Intelligence’ as a data-driven organizational capability. Drawing on the resource-based view (RBV) of the firm, I propose that the ability to systematically decode cultural nuance from unstructured data can be framed as a strategic asset that is valuable, rare, and difficult to imitate [87]. This capability, which involves integrating complex theories with advanced analytics, possesses qualities of causal ambiguity and social complexity, making it a potential source of sustainable competitive advantage [35]. My research thus complements the common focus on Big Data as a resource, emphasizing instead that advantage is derived from the firm’s distinctive capability to transform that data into actionable intelligence [36]. A particularly noteworthy finding is the unexpected negative relationship between MAS and a focus on Service complaints. This suggests that consumers from high-MAS cultures are not merely indifferent to relational aspects but may be systematically less likely to articulate them. From an IM perspective, articulating complaints about interpersonal service might conflict with the goal of projecting a competent, task-oriented self-image [28], a stark contrast to the response in cultures where relational harmony or individual respect is paramount [21].

5.2. Managerial Implications

The strategic imperative for global firms is to transition from reactive damage control to proactive, culturally resonant customer experience design [71]. My findings provide a clear roadmap for this transition.
First, firms must build a ‘Cultural Analytics Dashboard’ as a core Decision Support System. Managers should move beyond simplistic sentiment analysis to track the prevalence of specific cultural complaint signatures across markets and service tiers [76]. For instance, a hotel chain could monitor whether complaints about Room Features from high-MAS markets are increasing, signaling a potential gap in functional service delivery that machine learning models can predict and flag [17]. This data-driven approach allows for the precise allocation of resources to the service attributes that matter most to specific cultural segments, turning analytics into a direct input for strategic decision-making [25].
Second, managers must design culturally intelligent and context-specific service recovery protocols. My moderation findings provide a direct playbook. A high-IND guest complaining about poor Service at a luxury property is expressing a threat to their self-concept; the optimal recovery is an abstract, status-affirming response, such as a sincere personal apology from a senior manager that restores their sense of power and respect [20]. Conversely, a high-LTO guest complaining about Value at a budget property is expressing a pragmatic concern about a ‘failed investment’; the optimal recovery is a concrete, transactional solution, such as a swift partial refund [21]. A one-size-fits-all recovery strategy is a missed opportunity to mitigate the powerful impact of negative eWOM.
Third, firms should architect a ‘Cultural Personalization Engine’ to power next-generation AI-driven CRM. The insights from my model can be embedded into automated systems to create culturally aware experiences [35]. This engine could power ‘culturally aware chatbots’ that adjust their communication style based on inferred cultural orientation [88], inform the content of pre-arrival emails to highlight culturally relevant benefits, or even dynamically alter website imagery to align with cultural preferences [89]. The successful deployment of such technologies requires a data-driven, culturally empathetic corporate culture, supported by continuous employee training and clear strategic commitment from leadership. This can transform OCRs from a reactive management challenge into a proactive tool for enhancing customer purchase intention and loyalty [90].

5.3. Limitations and Future Research

While this study offers a novel framework, its findings must be considered in light of several limitations, which collectively provide a clear agenda for future research.
A primary conceptual limitation is the study’s reliance on a macro-level cultural framework. The application of national-level Hofstede scores to individual-level UGC introduces the risk of the ecological fallacy [83], assuming a level of intra-country cultural homogeneity that may not exist [25]. Furthermore, the operationalization of service context via hotel star ratings is a useful but imperfect proxy for the theoretical mechanism of psychological distance, and the core mediating construct of Impression Management was inferred rather than directly measured.
Methodologically, the study’s scope is bounded by its data sample. The exclusive focus on English-language reviews, while necessary to isolate cultural from linguistic effects, results in a significant selection bias towards high-individualism cultures [24]. This constrains the generalizability of the findings. The analysis is also confined to negative reviews within the hotel industry. Cultural scripts may differ for praise, and the salience of different service attributes likely varies across industries (e.g., search, experience, and credence goods) [22].
In interpreting the results, two caveats are warranted. First, while the HDFE model provides a powerful strategy for robust identification, claims of causality from observational data must be made with caution. Second, the predictive performance of the machine learning model, while demonstrating the theoretical validity of my framework, is modest in its current form. Its primary contribution is as a computational validation of the theory’s relevance rather than as a market-ready tool for reliable individual-level classification [86].
These limitations inform a forward-looking research agenda. First, future research must move toward greater conceptual and causal granularity by incorporating individual-level measures of cultural values (e.g., CVSCALE) to mitigate the ecological fallacy. Experimental designs that directly manipulate psychological distance and IM goals are also needed to provide a more definitive causal test of the proposed mechanisms. Second, the data ecosystem should be expanded. Applying multilingual Natural Language Processing (NLP) to analyze UGC in native languages would capture a more authentic global consumer voice [22]. Future work should also examine different complaint behaviors across hotel classes and product categories. Finally, the application of this technology opens an important ethical frontier. As firms gain the ability to personalize based on culture, critical questions arise regarding the potential for reinforcing stereotypes and the governance structures needed to prevent algorithmic discrimination [12,13]. Navigating this ethical landscape will be as important as refining the technology itself.

Funding

This work was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2022S1A5B5A16056219).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

Gemini 2.5 Pro was used to improve the overall readability and clarity of the manuscript by proofreading for grammatical errors, rephrasing sentences, and enhancing stylistic consistency.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Table A1. Seed Word List for Seeded LDA.
Table A1. Seed Word List for Seeded LDA.
ConstructSeed Words
Servicestaff, service, manager, frontdesk, attitude, rude, friendly, helpful, professional, customer, reception, concierge, management, welcome, checkin, checkout
Valueprice, pay, cost, money, expensive, cheap, charge, rate, paid, worth, value, fee, deposit, booking, reservation
Room Featuresroom, bed, bathroom, shower, clean, dirty, small, old, ac, wifi, internet, comfortable, noise, amenities, condition, view, smell, maintenance

Appendix B

Table A2. Comparison of Sample vs. Population Cultural Scores.
Table A2. Comparison of Sample vs. Population Cultural Scores.
ProfileDimensionScore
Sample Weighted AveragePDI46.27
IND72.64
MAS57.81
UAI47.81
LTO44.58
IVR59.73
Population AveragePDI66.0
IND37.9
MAS46.7
UAI67.1
LTO38.2
IVR36.3

Appendix C

This appendix details the systematic process used to assign national culture scores to each of the 284,746 reviews in the final dataset.
1.
Data Source and Matching Protocol
The assignment of cultural scores was performed by linking the reviewer_home_country field, as self-reported by users on their TripAdvisor profiles, to the corresponding national scores. The data was sourced directly from the official dimension data matrix available for research use on Geert Hofstede’s academic website (https://geerthofstede.com/research-and-vsm/dimension-data-matrix/, accessed on 20 October 2025).
The matching process required a preliminary data-cleaning step to standardize country names. A custom script was developed to reconcile common variations and abbreviations (e.g., “USA,” “United States,” “America” were all mapped to “United States”; “UK” and “England” were mapped to “United Kingdom”). This ensured a clean and accurate one-to-one mapping between the review data and the Hofstede data.
2.
Handling of Intra-Country Variation
A critical methodological decision involved handling countries for which the Hofstede dataset provides distinct regional scores. To ensure analytical consistency and avoid making unsubstantiated assumptions about a reviewer’s specific regional origin, I adopted a “primary national score” policy. For instance, for reviews originating from the “United Kingdom,” a single national score was applied, rather than attempting to differentiate between regions like England and Scotland. Similarly, for reviewers from “China,” the score for mainland China was used. This conservative approach enhances the replicability and clarity of my design.
3.
Justification for Using Raw Scores
Finally, I deliberately chose to use the raw Hofstede scores (on their original 1–100 scale) without applying any normalization or transformation. This decision was made to preserve the direct interpretability of the econometric model’s coefficients. Using the raw scores means that the estimated coefficient for a cultural dimension reflects the change in the dependent variable for a one-point increase on Hofstede’s original, widely understood scale, an interpretation that is both straightforward for academic audiences and managerially intuitive.

References

  1. Ledro, C.; Nosella, A.; Vinelli, A. Artificial intelligence in customer relationship management: Literature review and future research directions. J. Bus. Ind. Mark. 2022, 37, 48–63. [Google Scholar] [CrossRef]
  2. Babić Rosario, A.; De Valck, K.; Sotgiu, F. Conceptualizing the electronic word-of-mouth process: What we know and need to know about eWOM creation, exposure, and evaluation. J. Acad. Mark. Sci. 2020, 48, 422–448. [Google Scholar] [CrossRef]
  3. Chatterjee, S.; Rana, N.P.; Tamilmani, K.; Sharma, A. The effect of AI-based CRM on organization performance and competitive advantage: An empirical analysis in the B2B context. Ind. Mark. Manag. 2021, 97, 205–219. [Google Scholar] [CrossRef]
  4. Schuckert, M.; Liu, X.; Law, R. Hospitality and tourism online reviews: Recent trends and future directions. J. Travel Tour. Mark. 2015, 32, 608–621. [Google Scholar] [CrossRef]
  5. Haenlein, M. How to date your clients in the 21st century: Challenges in managing customer relationships in today’s world. Bus. Horiz. 2017, 60, 577–586. [Google Scholar] [CrossRef]
  6. Luo, Q.; Zhong, D. Using social network analysis to explain communication characteristics of travel-related electronic word-of-mouth on social networking sites. Tour. Manag. 2015, 46, 274–282. [Google Scholar] [CrossRef]
  7. Verma, S.; Yadav, N. Past, present, and future of electronic word of mouth (EWOM). J. Interact. Mark. 2021, 53, 111–128. [Google Scholar] [CrossRef]
  8. Steenkamp, J.B.E. Global brand building and management in the digital age. J. Int. Mark. 2020, 28, 13–27. [Google Scholar] [CrossRef]
  9. Del Vecchio, P.; Mele, G.; Siachou, E.; Schito, G. A structured literature review on Big Data for customer relationship management (CRM): Toward a future agenda in international marketing. Int. Mark. Rev. 2022, 39, 1069–1092. [Google Scholar] [CrossRef]
  10. Markus, H.R.; Kitayama, S. Cultural variation in the self-concept. In The Self: Interdisciplinary Approaches; Springer: New York, NY, USA, 1991; pp. 18–48. [Google Scholar]
  11. Shavitt, S.; Barnes, A.J. Culture and the consumer journey. J. Retail. 2020, 96, 40–54. [Google Scholar] [CrossRef]
  12. Filimowicz, M. (Ed.) Digital Totalitarianism: Algorithms and Society; Routledge: Oxfordshire, UK, 2022. [Google Scholar]
  13. Noble, S.U. Algorithms of oppression: How search engines reinforce racism. In Algorithms of Oppression; New York University Press: New York, NY, USA, 2018. [Google Scholar]
  14. Filieri, R.; Mariani, M. The role of cultural values in consumers’ evaluation of online review helpfulness: A big data approach. Int. Mark. Rev. 2021, 38, 1267–1288. [Google Scholar] [CrossRef]
  15. Lin, H.C.; Kalwani, M.U. Culturally contingent electronic word-of-mouth signaling and screening: A comparative study of product reviews in the United States and Japan. J. Int. Mark. 2018, 26, 80–102. [Google Scholar] [CrossRef]
  16. Dang, A.; Raska, D. National cultures and their impact on electronic word of mouth: A systematic review. Int. Mark. Rev. 2022, 39, 1182–1225. [Google Scholar] [CrossRef]
  17. Sann, R.; Lai, P.C.; Chang, H.C. Does culture of origin have an impact on online complaining behaviors? The perceptions of Asians and non-Asians. Sustainability 2020, 12, 1838. [Google Scholar] [CrossRef]
  18. Jia, S.S. Motivation and satisfaction of Chinese and US tourists in restaurants: A cross-cultural text mining of online reviews. Tour. Manag. 2020, 78, 104071. [Google Scholar] [CrossRef]
  19. Tang, L. Mine your customers or mine your business: The moderating role of culture in online word-of-mouth reviews. J. Int. Mark. 2017, 25, 88–110. [Google Scholar] [CrossRef]
  20. Popelnukha, A.; Weng, Q.; Ali, A.; Atamba, C. When do low-power customers complain? The joint effects of chronic sense of personal power and complaint success on complaining intentions. J. Consum. Behav. 2021, 20, 101–118. [Google Scholar] [CrossRef]
  21. Rohden, S.F.; Matos, C.A.D. Online service failure: How consumers from emerging countries react and complain. J. Consum. Mark. 2022, 39, 44–54. [Google Scholar] [CrossRef]
  22. Kim, J.M.; Lee, E.; Yoon, Y. Bringing culture into the picture: Cross-cultural differences in online customer reviews. Int. Mark. Rev. 2023, 40, 528–546. [Google Scholar] [CrossRef]
  23. Hofstede, G. Culture and organizations. Int. Stud. Manag. Organ. 1980, 10, 15–41. [Google Scholar] [CrossRef]
  24. Brand, B.M.; Kopplin, C.S.; Rausch, T.M. Cultural differences in processing online customer reviews: Holistic versus analytic thinkers. Electron. Mark. 2022, 32, 1039–1060. [Google Scholar] [CrossRef]
  25. Mariani, M.; Predvoditeleva, M. How do online reviewers’ cultural traits and perceived experience influence hotel online ratings? An empirical analysis of the Muscovite hotel sector. Int. J. Contemp. Hosp. Manag. 2019, 31, 4543–4573. [Google Scholar] [CrossRef]
  26. Lee, J.; Park, C. The effects of corporate, review and reviewer characteristics on the helpfulness of online reviews: The moderating role of culture. Internet Res. 2022, 32, 1562–1594. [Google Scholar] [CrossRef]
  27. Hofstede, G. Dimensionalizing cultures: The Hofstede model in context. Online Read. Psychol. Cult. 2011, 2, 8. [Google Scholar] [CrossRef]
  28. Leary, M.R.; Kowalski, R.M. Impression management: A literature review and two-component model. Psychol. Bull. 1990, 107, 34. [Google Scholar] [CrossRef]
  29. Oh, H.J.; LaRose, R. Impression management concerns and support-seeking behavior on social network sites. Comput. Hum. Behav. 2016, 57, 38–47. [Google Scholar] [CrossRef]
  30. Trope, Y.; Liberman, N. Construal-level theory of psychological distance. Psychol. Rev. 2010, 117, 440. [Google Scholar] [CrossRef] [PubMed]
  31. Wooldridge, J.M. Econometric Analysis of Cross Section and Panel Data; MIT Press: Cambridge, MA, USA, 2010. [Google Scholar]
  32. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  33. Yuna, D.; Xiaokun, L.; Jianing, L.; Lu, H. Cross-cultural communication on social media: Review from the perspective of cultural psychology and neuroscience. Front. Psychol. 2022, 13, 858900. [Google Scholar] [CrossRef]
  34. LaBrie, R.C.; Steinke, G.H.; Li, X.; Cazier, J.A. Big data analytics sentiment: US-China reaction to data collection by business and government. Technol. Forecast. Soc. Change 2018, 130, 45–55. [Google Scholar] [CrossRef]
  35. Waqas, M.; Hamzah, Z.L.B.; Salleh, N.A.M. Customer experience: A systematic literature review and consumer culture theory-based conceptualisation. Manag. Rev. Q. 2021, 71, 135–176. [Google Scholar] [CrossRef]
  36. Wamba, S.F.; Gunasekaran, A.; Akter, S.; Ren, S.J.F.; Dubey, R.; Childe, S.J. Big data analytics and firm performance: Effects of dynamic capabilities. J. Bus. Res. 2017, 70, 356–365. [Google Scholar] [CrossRef]
  37. Davenport, T.; Harris, J. Competing on Analytics: Updated, with a New Introduction: The New Science of Winning; Harvard Business Press: Brighton, MA, USA, 2017. [Google Scholar]
  38. Schoefer, K.; Wäppling, A.; Heirati, N.; Blut, M. The moderating effect of cultural value orientations on behavioral responses to dissatisfactory service experiences. J. Retail. Consum. Serv. 2019, 48, 247–256. [Google Scholar] [CrossRef]
  39. Goffman, E. The Presentation of Self in Everyday Life; Doubleday: New York, NY, USA, 1959. [Google Scholar]
  40. Berger, J. Word of mouth and interpersonal communication: A review and directions for future research. J. Consum. Psychol. 2014, 24, 586–607. [Google Scholar] [CrossRef]
  41. Blunden, H.; Brodsky, A. A review of virtual impression management behaviors and outcomes. J. Manag. 2024, 50, 2197–2236. [Google Scholar] [CrossRef]
  42. Hennig-Thurau, T.; Gwinner, K.P.; Walsh, G.; Gremler, D.D. Electronic word-of-mouth via consumer-opinion platforms: What motivates consumers to articulate themselves on the internet? J. Interact. Mark. 2004, 18, 38–52. [Google Scholar] [CrossRef]
  43. Arkin, R.M. Self-presentation styles. In Impression Management Theory and Social Psychological Research; Academic Press: New York, NY, USA, 1981; pp. 311–333. [Google Scholar]
  44. Bolino, M.C.; Kacmar, K.M.; Turnley, W.H.; Gilstrap, J.B. A multi-level review of impression management motives and behaviors. J. Manag. 2008, 34, 1080–1109. [Google Scholar] [CrossRef]
  45. Kim, H.S.; Sherman, D.K. “Express yourself”: Culture and the effect of self-expression on choice. J. Personal. Soc. Psychol. 2007, 92, 1. [Google Scholar] [CrossRef]
  46. Vignoles, V.L.; Owe, E.; Becker, M.; Smith, P.B.; Easterbrook, M.J.; Brown, R.; González, R.; Didier, N.; Carrasco, D.; Cadena, M.P.; et al. Beyond the ‘east–west’ dichotomy: Global variation in cultural models of selfhood. J. Exp. Psychol. Gen. 2016, 145, 966. [Google Scholar] [CrossRef]
  47. Aaker, J.L.; Lee, A.Y. “I” seek pleasures and “we” avoid pains: The role of self-regulatory goals in information processing and persuasion. J. Consum. Res. 2001, 28, 33–49. [Google Scholar] [CrossRef]
  48. Lalwani, A.K.; Wang, J.J. How do consumers’ cultural backgrounds and values influence their coupon proneness? A multimethod investigation. J. Consum. Res. 2019, 45, 1037–1050. [Google Scholar] [CrossRef]
  49. Ngai, E.W.; Heung, V.C.; Wong, Y.H.; Chan, F.K. Consumer complaint behaviour of Asians and non-Asians about hotel services: An empirical analysis. Eur. J. Mark. 2007, 41, 1375–1391. [Google Scholar] [CrossRef]
  50. Fang, H.; Zhang, J.; Bao, Y.; Zhu, Q. Towards effective online review systems in the Chinese context: A cross-cultural empirical study. Electron. Commer. Res. Appl. 2013, 12, 208–220. [Google Scholar] [CrossRef]
  51. Hong, Y.; Huang, N.; Burtch, G.; Li, C. Culture, conformity, and emotional suppression in online reviews. J. Assoc. Inf. Syst. 2016, 17, 2. [Google Scholar] [CrossRef]
  52. Koh, N.S.; Hu, N.; Clemons, E.K. Do online reviews reflect a product’s true perceived quality? An investigation of online movie reviews across cultures. Electron. Commer. Res. Appl. 2010, 9, 374–385. [Google Scholar] [CrossRef]
  53. Moriuchi, E.; Moriyoshi, N. A cross-cultural study on online reviews and decision making: An eye-tracking approach. J. Consum. Behav. 2024, 23, 156–170. [Google Scholar] [CrossRef]
  54. Lalwani, A.K.; Shavitt, S. The “me” I claim to be: Cultural self-construal elicits self-presentational goal pursuit. J. Personal. Soc. Psychol. 2009, 97, 88. [Google Scholar] [CrossRef]
  55. Wong, J.; Newton, J.D.; Newton, F.J. Powerlessness following service failure and its implications for service recovery. Mark. Lett. 2016, 27, 63–75. [Google Scholar] [CrossRef]
  56. Donthu, N.; Yoo, B. Cultural influences on service quality expectations. J. Serv. Res. 1998, 1, 178–186. [Google Scholar] [CrossRef]
  57. Yuksel, A.; Kilinc, U.; Yuksel, F. Cross-national analysis of hotel customers’ attitudes toward complaining and their complaining behaviours. Tour. Manag. 2006, 27, 11–24. [Google Scholar] [CrossRef]
  58. Hofstede, G. Culture’s recent consequences: Using dimension scores in theory and research. Int. J. Cross Cult. Manag. 2001, 1, 11–17. [Google Scholar] [CrossRef]
  59. Hur, W.M.; Kang, S.; Kim, M. The moderating role of Hofstede’s cultural dimensions in the customer-brand relationship in China and India. Cross Cult. Manag. 2015, 22, 487–508. [Google Scholar] [CrossRef]
  60. Bearden, W.O.; Money, R.B.; Nevins, J.L. A measure of long-term orientation: Development and validation. J. Acad. Mark. Sci. 2006, 34, 456–467. [Google Scholar] [CrossRef]
  61. Wei, Z.; Zhang, M.; Ming, Y. Understanding the effect of tourists’ attribute-level experiences on satisfaction–a cross-cultural study leveraging deep learning. Curr. Issues Tour. 2023, 26, 105–121. [Google Scholar] [CrossRef]
  62. Gao, B.; Li, X.; Liu, S.; Fang, D. How power distance affects online hotel ratings: The positive moderating roles of hotel chain and reviewers’ travel experience. Tour. Manag. 2018, 65, 176–186. [Google Scholar] [CrossRef]
  63. Gelfand, M.J.; Erez, M.; Aycan, Z. Cross-cultural organizational behavior. Annu. Rev. Psychol. 2007, 58, 479–514. [Google Scholar] [CrossRef]
  64. Samaha, S.A.; Beck, J.T.; Palmatier, R.W. The role of culture in international relationship marketing. J. Mark. 2014, 78, 78–98. [Google Scholar] [CrossRef]
  65. Rosendo-Rios, V.; Shukla, P. When luxury democratizes: Exploring the effects of luxury democratization, hedonic value and instrumental self-presentation on traditional luxury consumers’ behavioral intentions. J. Bus. Res. 2023, 155, 113448. [Google Scholar] [CrossRef]
  66. Liberman, N.; Trope, Y. The role of feasibility and desirability considerations in near and distant future decisions: A test of temporal construal theory. J. Personal. Soc. Psychol. 1998, 75, 5. [Google Scholar] [CrossRef]
  67. Peng, J.; Zhao, X.; Mattila, A.S. Improving service management in budget hotels. Int. J. Hosp. Manag. 2015, 49, 139–148. [Google Scholar] [CrossRef]
  68. Hu, X.; Winter, S. The effects of abstract vs. concrete mindsets on the persuasiveness of online reviews: A construal level perspective. In Advances in Advertising Research X: Multiple Touchpoints in Brand Communication; Springer Fachmedien Wiesbaden: Wiesbaden, Germany, 2019; pp. 43–52. [Google Scholar]
  69. Han, Y.J.; Nunes, J.C.; Drèze, X. Signaling status with luxury goods: The role of brand prominence. J. Mark. 2010, 74, 15–30. [Google Scholar] [CrossRef]
  70. Adler, S.; Sarstedt, M. Mapping the jungle: A bibliometric analysis of research into construal level theory. Psychol. Mark. 2021, 38, 1367–1383. [Google Scholar] [CrossRef]
  71. Lemon, K.N.; Verhoef, P.C. Understanding customer experience throughout the customer journey. J. Mark. 2016, 80, 69–96. [Google Scholar] [CrossRef]
  72. Eyal, T.; Liberman, N.; Trope, Y. Judging near and distant virtue and vice. J. Exp. Soc. Psychol. 2008, 44, 1204–1209. [Google Scholar] [CrossRef]
  73. Torelli, C.J.; Shavitt, S. Culture and concepts of power. J. Personal. Soc. Psychol. 2010, 99, 703. [Google Scholar]
  74. Steele, C.M. The psychology of self-affirmation: Sustaining the integrity of the self. In Advances in Experimental Social Psychology; Academic Press: Cambridge, MA, USA, 1988; Volume 21, pp. 261–302. [Google Scholar]
  75. Oliver, R.L.; Swan, J.E. Equity and disconfirmation perceptions as influences on merchant and product satisfaction. J. Consum. Res. 1989, 16, 372–383. [Google Scholar] [CrossRef]
  76. Fang, J.; Partovi, F.Y. Criteria determination of analytic hierarchy process using a topic model. Expert Syst. Appl. 2021, 169, 114306. [Google Scholar] [CrossRef]
  77. Min, H.K.; Kim, H.J. When service failure is interpreted as discrimination: Emotion, power, and voice. Int. J. Hosp. Manag. 2019, 82, 59–67. [Google Scholar] [CrossRef]
  78. Hu, N.; Zhang, T.; Gao, B.; Bose, I. What do hotel customers complain about? Text analysis using structural topic model. Tour. Manag. 2019, 72, 417–426. [Google Scholar] [CrossRef]
  79. Blazquez, D.; Domenech, J. Big Data sources and methods for social and economic analyses. Technol. Forecast. Soc. Change 2018, 130, 99–113. [Google Scholar] [CrossRef]
  80. Kapil, G.; Agrawal, A.; Khan, R.A. A study of big data characteristics. In Proceedings of the 2016 International Conference on Communication and Electronics Systems (ICCES), Coimbatore, India, 21–22 October 2016; pp. 1–4. [Google Scholar]
  81. Jagarlamudi, J.; Daumé III, H.; Udupa, R. Incorporating lexical priors into topic models. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, 23–27 April 2012; pp. 204–213. [Google Scholar]
  82. Chang, J.; Gerrish, S.; Wang, C.; Boyd-Graber, J.; Blei, D. Reading tea leaves: How humans interpret topic models. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2009; Volume 22. [Google Scholar]
  83. Sivakumar, K.; Nakata, C. The stampede toward Hofstede’s framework: Avoiding the sample design pit in cross-cultural research. J. Int. Bus. Stud. 2001, 32, 555–574. [Google Scholar] [CrossRef]
  84. Kirkman, B.L.; Lowe, K.B.; Gibson, C.B. A quarter century of culture’s consequences: A review of empirical research incorporating Hofstede’s cultural values framework. J. Int. Bus. Stud. 2006, 37, 285–320. [Google Scholar] [CrossRef]
  85. Pennebaker, J.W.; Boyd, R.L.; Jordan, K.; Blackburn, K. The Development and Psychometric Properties of LIWC2015. 2015. Available online: https://repositories.lib.utexas.edu/items/705e81ca-940d-4c46-94ec-a52ffdc3b51f?utm_source=TodayInDigital.com&utm_medium=referral&utm_campaign=what-your-copy-should-be-doing (accessed on 20 October 2025).
  86. Lee, J.; Jung, O.; Lee, Y.; Kim, O.; Park, C. A comparison and interpretation of machine learning algorithm for the prediction of online purchase conversion. J. Theor. Appl. Electron. Commer. Res. 2021, 16, 1472–1491. [Google Scholar] [CrossRef]
  87. Erevelles, S.; Fukawa, N.; Swayne, L. Big Data consumer analytics and the transformation of marketing. J. Bus. Res. 2016, 69, 897–904. [Google Scholar] [CrossRef]
  88. Trivedi, J. Examining the customer experience of using banking chatbots and its impact on brand love: The moderating role of perceived risk. J. Internet Commer. 2019, 18, 91–111. [Google Scholar] [CrossRef]
  89. Sia, C.L.; Lim, K.H.; Leung, K.; Lee, M.K.; Huang, W.W.; Benbasat, I. Web strategies to promote internet shopping: Is cultural-customization needed? Mis Q. 2009, 33, 491–512. [Google Scholar] [CrossRef]
  90. Chakraborty, U. The impact of source credible online reviews on purchase intention: The mediating roles of brand equity dimensions. J. Res. Interact. Mark. 2019, 13, 142–161. [Google Scholar] [CrossRef]
Figure 1. Conceptual Model of Contextual Value Amplification.
Figure 1. Conceptual Model of Contextual Value Amplification.
Jtaer 20 00288 g001
Figure 2. Model Performance Comparison Chart.
Figure 2. Model Performance Comparison Chart.
Jtaer 20 00288 g002
Table 1. Descriptive Statistics.
Table 1. Descriptive Statistics.
VariableNMeanStd. Dev.MinMedianMax
Cultural Variables
Power Distance (PDI)284,74646.2717.7511.0039.00100.00
Individualism (IND)284,74672.6424.4511.0089.0091.00
Masculinity (MAS)284,74657.8113.675.0062.00100.00
Uncertainty Avoidance (UAI)284,74647.8118.568.0046.00100.00
Long-Term Orientation (LTO)284,74644.5818.154.0051.00100.00
Indulgence (IVR)284,74659.7315.724.0068.00100.00
Control Variables
Word Count (WC)284,746179.92142.025.00138.001005.00
Words Per Sentence (WPS)284,74619.6816.921.8016.50822.00
Hotel Star Rating284,7463.511.300.004.005.00
Country GDP p.c.284,74633,652.2718,589.401095.0436,085.8494,277.96
Dependent Variables
Value284,7460.300.230.0010.270.96
Service284,7460.060.080.0010.030.87
Room Features284,7460.060.090.0010.030.88
Table 2. Main Effects of National Culture on Complaint Focus (HDFE Model).
Table 2. Main Effects of National Culture on Complaint Focus (HDFE Model).
(1) Value(2) Service(3) Room_Features
Cultural Variables
IND0.00017 *0.00006 **0.00002
(0.00008)(0.00002)(0.00003)
MAS−0.00006−0.00006 ***0.00009 ***
(0.00007)(0.00002)(0.00003)
LTO0.00018 ***−0.00008 ***0.00014 ***
(0.00004)(0.00001)(0.00002)
ControlsYESYESYES
Fixed EffectsYESYESYES
Observations284,746284,746284,746
R2 (within)0.03700.02060.0155
Notes: Standard errors in parentheses. Significance levels: *** p < 0.001, ** p < 0.01, * p < 0.05.
Table 3. Moderating Effect of Hotel Market Tier on Complaint Focus (HDFE Model).
Table 3. Moderating Effect of Hotel Market Tier on Complaint Focus (HDFE Model).
(1) Value(2) Service(3) Room_Features
Cultural Main Effects
IND0.00019 **0.00006 **0.00002
(0.00007)(0.00002)(0.00003)
MAS−0.00007−0.00006 ***0.00009 ***
(0.00007)(0.00002)(0.00003)
LTO0.00017 ***−0.00008 ***0.00014 ***
(0.00004)(0.00001)(0.00002)
Interaction Terms
IND × hotel_star_rating−0.00006 *0.00003 **0.00002
(0.00003)(0.00001)(0.00001)
MAS × hotel_star_rating−0.000050.00000−0.00002
(0.00004)(0.00001)(0.00002)
LTO × hotel_star_rating0.00003−0.000010.00002
(0.00003)(0.00001)(0.00001)
ControlsYESYESYES
Fixed EffectsYESYESYES
Observations284,746284,746284,746
R2 (within)0.03710.02080.0155
Notes: Standard errors in parentheses. Significance levels: *** p < 0.001, ** p < 0.01, * p < 0.05.
Table 4. Predictive Model Performance Comparison.
Table 4. Predictive Model Performance Comparison.
ModelAccuracyKappaSpecificity
1. Baseline (Style Only)0.75470.06000.0425
2. Full Model (+Topics)0.75660.10140.0858
3. Optimized (Weighted)0.63990.19250.5578
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lee, J. From Big Data to Cultural Intelligence: An AI-Powered Framework and Machine Learning Validation for Global Marketing. J. Theor. Appl. Electron. Commer. Res. 2025, 20, 288. https://doi.org/10.3390/jtaer20040288

AMA Style

Lee J. From Big Data to Cultural Intelligence: An AI-Powered Framework and Machine Learning Validation for Global Marketing. Journal of Theoretical and Applied Electronic Commerce Research. 2025; 20(4):288. https://doi.org/10.3390/jtaer20040288

Chicago/Turabian Style

Lee, Jungwon. 2025. "From Big Data to Cultural Intelligence: An AI-Powered Framework and Machine Learning Validation for Global Marketing" Journal of Theoretical and Applied Electronic Commerce Research 20, no. 4: 288. https://doi.org/10.3390/jtaer20040288

APA Style

Lee, J. (2025). From Big Data to Cultural Intelligence: An AI-Powered Framework and Machine Learning Validation for Global Marketing. Journal of Theoretical and Applied Electronic Commerce Research, 20(4), 288. https://doi.org/10.3390/jtaer20040288

Article Metrics

Back to TopTop