1. Introduction
In digital media environments, tourism advertisements rely on visual cues to capture attention, construct destination image, and influence travel intention. While ink aesthetics are increasingly used to convey cultural meanings, the mechanisms through which different ink styles (traditional vs. modern) shape visual processing and persuasion remain underspecified.
We advance an integrated theoretical framework built on four pillars:
- (1)
Aesthetic processing and processing fluency: Aesthetic experience unfolds from early perception and attention to meaning formation and appraisal; when visual features align with internal schemas, fluency increases, enhancing liking and evaluation.
- (2)
Style–content congruity (match-up): Messages are more persuasive when form fits semantics/context, improving comprehension and memory.
- (3)
Environmental psychology (natural vs. urban): Attention Restoration Theory and Psychoevolutionary Theory suggest natural scenes elicit harmony/low arousal and restoration, whereas urban scenes afford stimulation and novelty seeking.
- (4)
Cultural schemas and cross-cultural perception: Cultural background shapes gaze allocation and style interpretation, moderating the effectiveness of stylistic cues across audiences.
From this, we propose a style–context congruity account. Traditional ink (restraint, graded tonality, negative space) should better fit natural destinations, yielding greater fluency and positive evaluations, while modern ink (contrast, geometric structure, dynamism) should better fit urban destinations, heightening attention and interest.
Congruity should manifest in process (eye-tracking) metrics and translate into outcomes (aesthetic evaluation, travel intention). Incongruity may reduce fluency and persuasion, unless reframed as creative novelty for certain segments.
Despite valuable insights into visual communication in tourism advertising, two gaps persist:
- (1)
Insufficient style coverage and cultural evidence: Systematic, mechanism-focused comparisons of culturally artistic styles—especially traditional ink—are scarce; differences between traditional and modern ink along the attention–evaluation–intention pathway remain unclear.
- (2)
Methodological siloing: Most studies rely on a single method (e.g., surveys), lacking an integrated view that combines objective process data (eye movements) with subjective judgments, obscuring the pathway from visual stimuli to psychological and behavioral outcomes.
We therefore adopt a mixed-method design (eye-tracking + subjective evaluations) to compare traditional and modern ink styles across natural/urban destinations, providing theoretical and empirical support for the mechanism of style, context congruity, attention, evaluation/intention.
Based on the above research gaps, this study selected young people in China to explore the following specific research questions:
RQ 1: How do tourism ads using different ink painting styles differ in the allocation of visual attention?
RQ 2: Does the difference in ink painting style in ads lead to changes in audiences’ aesthetic evaluations and tourism intentions?
RQ 3: How do viewers subjectively perceive and interpret the impressions brought by ads with different ink styles?
3. Materials and Methods
3.1. Participants
Young people are a very important group in modern tourism and are also an important target of tourism advertising exposure. This study targets this group and adopts a convenient abstract method to recruit participants from colleges and universities. A total of 80 undergraduates from universities in mainland China (42 males, 38 females; ages 18–26, M = 21.3, SD = 1.9) were recruited as participants. All participants had no professional training in art or advertising design, had normal or corrected-to-normal vision, and reported no history of neurological or visual-cognitive disorders. Prior to the experiment, each participant signed a written informed consent form.
To control for the influence of prior travel experience on ad evaluations, we asked participants during recruitment to fill out a basic information questionnaire indicating whether they had lived or spent an extended period during the past year in any of the four destinations involved in this study (Huangshan, Jiuzhaigou, Shanghai, Shenzhen). If a participant was very familiar with any target destination in the past year, they were excluded to avoid such prior familiarity influencing their responses to the ads.
Participants were then randomly assigned by a computer program to one of two experimental conditions: Group A (n = 40) viewed four tourism ad images in a traditional ink painting style, while Group B (n = 40) viewed four images in a modern ink painting style. Each participant completed the browsing and evaluation tasks for 4 ad images. After the experiment, we randomly selected 5 participants from each group for follow-up interviews. All participants received a remuneration of RMB 50 upon completing the experiment.
3.2. Experimental Design
We employed a 2 (ad ink painting style: traditional vs. modern) × 2 (destination type: natural landscape vs. urban scenery) mixed-factor experimental design. Ink painting style was a between-subjects factor, with each participant exposed to ads of only one style (traditional or modern). Destination type was a within-subjects factor, with each participant viewing ads of both content types (2 images of natural landscapes and 2 images of urban scenes). The combination of the two factors produced 4 experimental conditions, each corresponding to one specific ad image.
The dependent variables fell into two categories: (1) Objective visual attention metrics obtained via eye-tracking: total fixation duration (TFD). TFD refers to the cumulative fixation time on a given area of interest (AOI) by a participant, reflecting that area’s attentional attraction. (2) Subjective evaluation scores, comprising two dimensions: aesthetic preference and tourism intention. Both were measured using 7-point Likert scales (1 = strongly disagree, 7 = strongly agree). Each dimension consisted of 4 items, and the mean of the items’ scores was taken as the overall score for that dimension.
3.3. Stimuli Materials
The stimulus materials were custom-designed by the research team and consisted of 8 static tourism advertisement images covering 4 well-known tourist destinations in China: Huangshan, Jiuzhaigou, Shanghai, and Shenzhen. These four locations were chosen because they represent archetypal natural scenic sites (Huangshan, Jiuzhaigou) and modern urban landscapes (Shanghai, Shenzhen), providing a contrast in content. For each destination, two advertisement images were created: one using a traditional ink painting style and one using a modern ink painting style, forming four pairs of images. In each pair, the promotional content was kept consistent (same location) while the visual presentation style differed.
Figure 1 shows an example of an ad image.
The image creation process was as follows: first, we collected representative photographs of each destination, ensuring they depicted the site’s iconic landscapes or cityscapes. We provided a large set of photographs for each destination to a generative AI image tool (a plugin built on the DALL-E 3 model in ChatGPT 4) and prompted it to output images in traditional ink painting and modern ink painting styles. After several iterative rounds, we selected two ink-style images of different styles for each destination. We then used digital image editing software (Adobe Photoshop CC) to make fine adjustments and add textual descriptions; the text was sourced from the destinations’ official websites. The traditional ink-style images emphasize expressive effect and ink texture, employing techniques from Chinese painting such as reserved blank space, splashed ink, and ink outlines; the overall picture appears as if painted with ink on rice paper, presenting an elegant ink-wash atmosphere. The modern ink-style images, in contrast, incorporate contemporary design elements such as flat graphic composition, geometric lines, colorful ink gradients, and digital brush effects, so that the image retains the freehand spirit of ink art while also exhibiting a stylish modern visual impression. Three experts in visual design and three experts in tourism independently evaluated the stimuli. The evaluation focused on two aspects: (1) whether the images effectively represented the regional characteristics of the tourist destinations, and (2) whether the visual style of the images was appropriately maintained. Based on the experts’ feedback, the stimuli were confirmed to reflect the unique features of the destinations, and the styles of traditional and modern Chinese ink paintings were successfully preserved.
To ensure comparability of image content across conditions, we standardized the format and layout of the 8 images. All images were adjusted to 1536 × 2372 pixels in size and presented in a vertical poster orientation. In each layout, we defined 4 predetermined AOIs (
Figure 1): the title area (containing the destination name or slogan), the main image area (showcasing the primary landscape subject), the description text area (a short promotional blurb), and the information area (containing travel information such as suggestions). These four AOIs were kept uniform in relative size and position across all images to avoid layout differences influencing gaze behavior.
3.4. Instruments
Eye-tracking data were collected using Tobii Pro Fusion. The experiment was conducted in a quiet, softly lit laboratory. Participants sat approximately 60 cm from the display, and head movements were not restrained during the experiment to preserve a natural viewing state. Stimuli were presented on a 23-inch LCD monitor (1920 × 1080 resolution, 60 Hz refresh rate), with images displayed full-screen at the center against a neutral gray background to minimize visual distractions. The experimental procedure was controlled, and eye-tracking data were recorded using Tobii Pro Lab 1.102.
For subjective evaluations, we developed a questionnaire comprising two parts: aesthetic preference and tourism intention. After viewing each ad image, participants immediately completed this questionnaire, providing a subjective evaluation of the ad they had just seen. All items used 7-point Likert scales (1 = strongly disagree, 7 = strongly agree). The aesthetic preference scale contained 4 items, adapted from foundational research on visual design and aesthetic judgment [
3,
4]. Example items include: “I find this image visually attractive,” “The visual style of the image is aesthetically pleasing,” “The design elements in the image appear harmonious and well-coordinated,” and “This image presents a novel and interesting style.” The tourism intention scale also contained 4 items to assess the ad’s impact on participants’ interest in traveling and behavioral intention, adapted from classic measures in destination marketing research [
14,
34]. Example items include: “I would consider adding this destination to my future travel plans,” “This ad increased my interest in the destination,” “I want to learn more about this place,” and “If given the opportunity, I would prioritize visiting this destination.” In a pilot test, both scales showed good internal consistency (Cronbach’s α > 0.80).
Additionally, we designed a semi-structured interview outline to supplement the interpretation of the eye-tracking and survey results. The interview questions were centered on three core topics: (1) participants’ visual impressions and aesthetic evaluations of the different ink painting styles (traditional vs. modern); (2) the cultural connotations, style perceptions, and emotional connections conveyed by the images; and (3) the ads’ potential influence on participants’ desire to travel. All questions were open-ended, and the interviewer followed up with probing questions based on each interviewee’s responses to gain deeper cultural explanations and insights into individual cognition.
3.5. Procedures
Before the formal experiment began, the researcher briefly explained the study’s purpose and procedure to participants, emphasizing that this was an academic study with no right or wrong answers and that they should respond according to their true feelings. Participants then signed the informed consent form and filled out a basic information questionnaire to confirm they met the inclusion criteria. Group A corresponds to the traditional ink painting style, and Group B corresponds to the modern ink painting style. The experimental process is shown in
Figure 2.
The experiment consisted of four trials, each with the following procedure: first, at the start of each trial, a black “+” fixation point was presented in the center of the screen for 1000 ms to calibrate the participant’s initial gaze position. Next, a tourism ad image was presented (in a traditional or modern ink style according to the participant’s group), and remained on screen for 20 s. During this time, participants were instructed to view the ad naturally, without any button presses or verbal responses. After 20 s, the screen automatically switched to the subjective evaluation questionnaire interface. Participants rated the ad they had just seen on four items for each of the aesthetic preference and tourism intention dimensions, then clicked submit to proceed to the next stimulus.
To control for any potential effects of presentation order, we randomized the order of the 4 images for each participant using a Latin square design. Each participant saw 2 natural landscape ads and 2 urban scenery ads, but the sequence (which type of image was seen first or last) varied by person. This design effectively balanced out any confounding effects that viewing order might have on gaze behavior or rating results.
After completing four rounds of ad viewing and questionnaires, the system indicated the end of the experiment. The researcher then invited some participants to take part in one-on-one semi-structured interviews, each lasting about 15–20 min. The interviewer asked questions based on the predefined outline and probed or clarified according to participants’ answers. The entire experimental session lasted about 30 min per participant. All experimental procedures strictly followed the research protocol and were supervised by dedicated personnel to ensure consistency and data validity.
3.6. Data Analysis
The data analysis of this study comprised quantitative and qualitative parts, corresponding, respectively, to the experimental data from eye-tracking and questionnaires, and the textual data from interviews. A total of 96 sets of eye-tracking data were collected. Among them, 16 were excluded due to a sampling rate below 90%. As a result, data from 80 participants were included in the final analysis.
For the quantitative analysis, we used SPSS 26.0 for data processing and statistical testing. We conducted 2 × 2 mixed-design analyses of variance (ANOVAs) for each dependent variable, with ink painting style as a between-subjects factor and destination type as a within-subjects factor. For the eye-tracking data, we calculated each participant’s average TFD on the image region (AOI 2) and the text region (combined AOIs 1 + 3 + 4) of each ad, as indicators of visual attention, and then performed two-factor ANOVAs comparing between groups (traditional vs. modern) and within groups (nature vs. city). For the subjective evaluation data, we computed each participant’s mean aesthetic score and intention score across images, and similarly entered these into the mixed-design ANOVA model to test for main effects and interactions. All statistical tests were two-tailed with a significance level of α = 0.05.
For the qualitative analysis, we adopted a thematic analysis approach to code and synthesize the interview data. Specifically, two trained research assistants independently read the interview transcripts and used open coding to mark important information in the text, initially identifying a number of conceptual labels. The research team then held a discussion to compare the results of the two coders and reconcile any coding discrepancies, thereby improving consistency and reliability [
35]. After integrating perspectives, we merged similar codes and distilled them into several themes (categories), each theme representing a common viewpoint or experience among participants on a particular aspect. These themes were then used to explain the trends and differences observed in the quantitative data, helping us to understand the cognitive and emotional mechanisms behind the effects of different ad styles.
5. Discussion
By integrating objective eye-tracking data with subjective questionnaire and interview feedback, this study comprehensively revealed the similarities and differences between traditional ink-style and modern ink-style tourism advertisements in terms of visual attention, aesthetic preference, and tourism intention, and we have sought to explain and discuss these findings from theoretical perspectives.
First, our results showed that the ink painting style of the advertisement had a significant effect on viewers’ visual attention distribution. The eye-tracking experiment indicated that the two styles differ in how they capture attention: overall, the traditional ink ads demonstrated stronger sustained attraction. We recorded that in a natural landscape context, viewers’ total fixation time on the traditional ink ad images was significantly longer than on the modern style ads, whereas in an urban context, the opposite was true (the modern style held a slight advantage). In other words, when the ad’s style is highly congruent with the type of scenery (e.g., traditional ink with mountains-and-water nature, modern ink with urban scenes), viewers linger longer on the image and invest more attention. This phenomenon of style–content congruence affecting attention can be explained from the perspective of visual attention theory and perceptual fluency [
2,
36]. When a visual style matches viewers’ existing aesthetic schemas and expectations, information processing is smoother, and viewers are more willing to repeatedly gaze and savor its details [
3]. Conversely, if the style and content are incongruent, it may lead to attention being dispersed or processing being disrupted, thereby shortening viewing duration.
A potential mediator is narrative coherence [
37]. Beyond attentional capture, we propose that narrative coherence—the perceived logical fit between an ad’s visual form and the destination story it conveys—acts as a mediator linking style–content congruence to evaluation and intention. When visual style aligns with the semantic and affective affordances of the scene (e.g., traditional ink with natural landscapes; modern ink with urban settings), viewers experience a clearer storyline and higher processing fluency, which together support aesthetic appraisal [
3,
38]. Qualitative reports that traditional layouts felt “not cluttered” and “the focus is clear” suggest greater perceived coherence, while comments about modern layouts being “a bit cluttered” indicate potential coherence loss. At the process level, such coherence should manifest as more ordered scanpaths and longer coherent dwell episodes, beyond mere first-glance saliency [
2,
39,
40]. Future work could model coherence as a measured mediator (e.g., a 3–4 item scale on storyline clarity and visual–semantic fit) and test indirect effects while controlling for stimulus saliency.
Secondly, regarding aesthetic evaluation and preference, the quantitative results showed that overall, the traditional ink-style ads received higher aesthetic ratings, a superiority especially pronounced in the natural landscape scenario. Audiences’ preference for the traditional style largely stemmed from the cultural resonance and aesthetic pleasure it evoked. According to the five-stage aesthetic processing model, cultural familiarity and schema activation influence the cognitive evaluation stage of aesthetic experience [
3]. For Chinese viewers, the traditional ink painting style activates deep cultural schemas [
24], causing them to feel a sense of familiarity and a positive aesthetic bias early in perceptual processing. This perceptual fluency and familiarity-induced pleasantness explains why the traditional style ads were more readily accepted and liked—as reflected in many participants’ comments in interviews that the ads felt “comfortable” and “culturally rich” [
36,
41]. At the same time, it must be noted that the novel modern ink style is not without appeal. Although its average aesthetic rating was slightly lower than the traditional style, the modern style’s creative presentation won some viewers’ favor, and notably, the difference in aesthetic ratings between the two styles narrowed for city-themed ads. This can be understood via the dual-dimension theory of classical vs. expressive aesthetics [
4]: the traditional ink style emphasizes order, harmony, and other classical aesthetic dimensions, and thus is better suited to conveying the tranquil beauty of natural landscapes. Meanwhile, the modern ink style emphasizes novelty, creativity, and other expressive aesthetic dimensions, better showcasing the vibrancy and fashionable feel of urban scenes. When aesthetics are matched with content, viewers’ subjective evaluations of the ad also improve accordingly [
42,
43]. This was evident in our results: the modern ink style performed relatively better in urban landscape ads than it did in nature landscape ads, indicating that expressive aesthetics are more appreciated in an urban context.
A potential mediator is perceived authenticity. The interviews repeatedly referenced “cultural richness,” “familiar culture,” and doubts about “less cultural flavor,” pointing to perceived authenticity as another mediator between style–content congruence and persuasive outcomes. Congruent pairings likely signal authenticity—that the ad’s look genuinely belongs to the place—thereby increasing credibility and warmth and strengthening the affective and symbolic components of destination image that, in turn, drive intention [
23,
44,
45]. In our data, traditional ink in natural contexts appeared to heighten authenticity (e.g., “deep cultural heritage”), whereas modern ink sometimes reduced it for heritage-oriented scenes, even if it boosted novelty. Measuring authenticity (e.g., “the ad feels genuine/appropriate for this destination”) would allow a formal mediation test and clarify why equally attention-grabbing designs diverge in downstream motivation.
Lastly, the findings on tourism intention support the positive effect of aesthetic experience on attitudes and behavioral tendencies. Overall, the traditional ink ads, by enhancing viewers’ aesthetic affinity for and cultural identification with the destination, led to a stronger tendency toward travel intention. This is consistent with the theory of consumer aesthetic value: the emotional, symbolic, and aesthetic value viewers derive from an ad enhances their overall evaluation of the destination and thereby strengthens their behavioral intentions [
23]. Our survey data showed that the aesthetic pleasure and cultural closeness evoked by the traditional style ads indeed translated into higher destination desirability scores. The qualitative interviews further corroborated this trend: several participants plainly stated that the traditional ink ads sparked the idea of visiting the destination, feeling that “the scenery has more cultural story, making it worth a trip” (P1, P5). Evidently, when an ad successfully conveys a destination’s aesthetic value and cultural meaning, audiences are more likely to feel longing and curiosity and be willing to embark on a journey [
46]. By contrast, the effect of the modern ink ads on boosting tourism intention was somewhat limited. On one hand, their novel design piqued the interest of younger viewers and led to discussion, possibly aiding the destination’s word-of-mouth spread; but on the other hand, because some viewers doubted the ads’ authenticity and cultural substance, the modern style might not significantly strengthen the motivation to travel. As some interview comments pointed out, the curiosity sparked by visual creativity needs to be converted into trust in and yearning for the destination, which will influence travel decisions [
44,
45]. This suggests that the impact of ad style on tourism intention is indirect and conditional: only when the aesthetic and emotional responses elicited by the style successfully enhance the destination’s image will a positive effect on intention occur [
47,
48]. This conclusion aligns with the fundamental principle of the destination image model that cognitive, affective, and conative (intentional) components of image jointly predict travel intentions [
20].
Building on the above analysis, our findings can be further interpreted from multiple theoretical perspectives. First, Cultural Schema Theory [
24] was strongly validated in the context of this study. Traditional ink painting, as a classic symbol of Chinese culture, can rapidly activate domestic audiences’ cultural schemas, bringing cognitive familiarity and emotional identification [
49]. This process reduced viewers’ difficulty in understanding the ad information and increased the content’s credibility and intimacy. This is the deep reason we observed that the traditional style ads held an advantage in both subjective evaluations and attention investment. In the field of cross-cultural communication, research has shown that tourists of different cultural backgrounds exhibit systematic differences in eye movement patterns when viewing the same ad [
29], due to perceptual preference differences stemming from different cultural schemas. Our results are consistent with this, underscoring the role of cultural background in visual information processing: for viewers with Chinese cultural schemas, the traditional ink style fits their cognitive framework better, thus eliciting more positive responses [
50,
51].
Secondly, from the perspective of visual attention theory [
2], our study enriches the understanding of attention allocation mechanisms in advertising. Visual attention theory proposes that consumers’ visual attention is jointly determined by stimulus-driven factors and goal-directed factors. Our eye-tracking results indicate that the traditional ink ads perform better in goal-directed sustained attention, with viewers repeatedly fixating because they have a greater interest in and preference for the content. This suggests that marketing communications practice should not single-mindedly pursue “instant eye-catch,” but also consider how to sustain viewers’ attention [
52]. For tourism ads, using highly salient elements (e.g., bright colors, clever creativity) can increase immediate attention, but if the goal is for viewers to gain a deeper understanding and a lasting impression, the design must also incorporate content that resonates with viewers and prompts reflection [
39,
40]. For example, in our study, although the traditional ink style was not as bright as the modern style, its cultural imagery caused viewers to stay longer, gazing and savoring. This finding aligns with the advertising concepts of “stopping power” and “holding power”: an ideal ad should not only attract viewers to stop, but also make them willing to linger and appreciate [
53]. Therefore, advertisers should consider both the eye-catching salience and the meaningfulness of visual elements to achieve better communication results.
Thirdly, in terms of aesthetic psychology and value perception, our study supports and extends existing theoretical frameworks. The aesthetic processing model emphasizes the role of perceptual familiarity and cognitive interpretation in forming aesthetic judgments [
3]. Our results show that the traditional cultural style, by aligning with viewers’ aesthetic schemas, led to a smoother cognitive interpretation process and enhanced aesthetic pleasure, whereas the modern style required viewers to make more cognitive adjustments and adaptations, slightly affecting initial evaluations. This is consistent with the model’s predicted “prototype effect”: when an artistic stimulus is close to the viewer’s internal prototype or paradigm, it is more easily liked; conversely, innovation that deviates too far from the prototype may cause discomfort or a reserved attitude at the initial stage, even though novelty also brings some positive stimulation. It is noteworthy that the discussion of consumer value offers a broader perspective for understanding this phenomenon [
23]. It points out that consumers’ evaluation of an experience is derived from a synthesis of multi-dimensional value perceptions, including functional, emotional, symbolic, and aesthetic aspects. In our context, the traditional ink ads simultaneously satisfied viewers’ aesthetic value (pleasurable artistic enjoyment), emotional value (evoking cultural pride and resonance), and symbolic value (embodying the destination’s cultural symbolism) [
54,
55], thus receiving a higher overall evaluation. In contrast, the modern ink ads mainly stimulated novel and interesting sensory excitement (a part of emotional value) but were somewhat lacking in symbolic meaning and emotional depth; hence, their overall evaluation was slightly lower. This corroborates the view of the Consumer Aesthetic Value model: only when an ad creates value on the aesthetic, emotional, and symbolic levels will audiences give the most positive feedback [
3]. For tourism ads, the ability to balance creative expression with cultural substance is critical to their success.
Finally, our findings have direct implications for tourism destination marketing practice. On the one hand, the results indicate that incorporating indigenous cultural-artistic elements into tourism ads is an effective strategy. The traditional ink style significantly enhanced local audiences’ liking and interest in the ads, suggesting that a visual style with cultural recognizability helps boost a destination’s appeal and differentiated image. This aligns with prior research emphasizing cultural proximity: when marketing messages fit the target audience’s cultural background and aesthetic preferences, they more easily gain acceptance and trust [
36,
41]. Therefore, tourism destinations—especially those with rich historical and cultural heritage—should skillfully utilize traditional aesthetic elements in advertising to tell stories and convey emotion, thereby standing out in a homogenized marketing environment. On the other hand, we also observed that a modern creative style has advantages in attracting younger demographics [
56]. For market segments seeking fashion and novelty, appropriately using modern ink or other innovative styles can increase ad attention and shareability. In practice, marketers might consider a context-matching strategy: select an appropriate visual style based on the type of destination and the target clientele [
57]. For example, ads themed around cultural relics or natural landscapes could focus on a traditional aesthetic style to highlight their profound meaning, whereas ads for modern urban or tech-themed parks could incorporate modern artistic styles to showcase creativity and vitality. Meanwhile, caution should be taken to avoid relying excessively on visual spectacle while neglecting the conveyance of the destination’s core information. Whatever style is used, the ad content must clearly present the destination’s distinctive highlights and resonate with the audience’s travel motivations. Only by organically combining form and content can an advertisement be both “attractive” and “effective,” truly converting audience interest into action.
Despite the meaningful findings, several limitations merit caution and point to avenues for future work. First, our participants were young university students, a group typically more open to novelty; preferences for traditional versus modern styles may differ among older or international audiences. Future studies should broaden demographics and cultural backgrounds to test generalizability. Second, our stimuli focused exclusively on Chinese ink aesthetics, narrowing style diversity. Extending future studies to additional cultural art styles (e.g., oil painting, manga/comic, watercolor) would clarify whether the observed mechanisms generalize across visual traditions. Third, we began from four base photographs—two natural (Huangshan, Jiuzhaigou) and two urban (Shanghai, Shenzhen)—and rendered traditional/modern ink variants. Although this ensured a clean style × context manipulation, the small item sample raises external validity concerns and potential item-specific effects. Future work should (a) adopt broader item sampling (multiple destinations per category with intra-category diversity); (b) use mixed-effects models treating subjects and stimuli as crossed random factors; and (c) pretest and, where possible, equate low-level image properties (e.g., luminance, colorfulness, edge/texture density, visual complexity) while also collecting perceived congruence/coherence/authenticity ratings to partition alternative explanations. Fourth, the qualitative analysis relied on a random subsample of 10 interviewees out of 80, which may limit transferability and under-represent minority viewpoints. Increasing interview size, adopting stratified purposive sampling (e.g., by age, gender, art literacy, novelty seeking), documenting saturation, conducting member checking, reporting inter-coder reliability for a preregistered codebook, and complementing manual coding with computer-assisted text analysis would bolster credibility. Fifth, we used a single, laboratory-based exposure and did not track longer-term outcomes. Follow-ups (delayed recall, word-of-mouth, revisit intention) and field A/B tests would speak to durability and real-world impact. Sixth, although our discussion interprets several effects through cultural schemas, we assessed these processes qualitatively (interviews) rather than quantitatively via structured scales. As a result, we cannot formally test mediation by schema-related constructs. Future studies should include validated multi-item measures (e.g., cultural identification/familiarity with ink aesthetics, art literacy, need for cognition) and perceived style–content congruence, narrative coherence, and authenticity, enabling process-level mediation models that link style–context fit to evaluation and intention through these intermediaries.