Next Article in Journal
Synergistic Microstructures, Mechanical Properties, and Piezoresistive Performance Improvement of Nanoengineered Cementitious Composites by CNT and GNP
Previous Article in Journal
On-Site Welding Research for High-Strength S690 Steel
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Factors Influencing Natural and Cultural Soundscape Interactions on Perceptual Experiences in Forested–Historical Interface Areas

1
School of Architecture and Urban-Rural Planning, Fuzhou University, Fuzhou 350108, China
2
Key Laboratory of Southeast Coast Marine Information Intelligent Perception and Application, Ministry of Natural Resources, Zhangzhou 363000, China
3
Anhui Jianzhu University Design and Research Institute Co., Ltd., Hefei 230022, China
4
Department of Forest Resources Management, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
5
School of Architecture, Southeast University, Nanjing 210096, China
*
Authors to whom correspondence should be addressed.
Buildings 2025, 15(22), 4103; https://doi.org/10.3390/buildings15224103
Submission received: 30 September 2025 / Revised: 4 November 2025 / Accepted: 10 November 2025 / Published: 14 November 2025

Abstract

The quality of the soundscape in historical districts is receiving increasing attention from urban governments due to its significant potential to highlight historical characteristics and enhance the acoustic environment of urban areas. However, there is still a lack of research on the relationship between natural and cultural soundscapes as they interact in historic areas. Using the historical area of Wuhou Shrine Museum in Chengdu as a case study, this study analyzed the differences in sound levels, sound source recognition, and subjective perception between two distinct spatial types: the historical street and adjacent urban forest. Additionally, structural equation modeling (SEM) was employed to explore the impact of sound source recognition and sound levels on subjective perception. The results reveal the following: (1) The soundscape interaction between the historical street and the adjacent urban forest exhibits a conflicting relationship, with cultural and natural soundscapes struggling to coexist harmoniously. (2) Within the historical region, L10 has the strongest effect on subjective evaluation, while L90 has the weakest. (3) Quietness is not always positively correlated with comfort and pleasure, indicating that a tranquil environment does not necessarily enhance pleasantness. These findings provide differentiated soundscape optimization strategies tailored to historical areas.

1. Introduction

As cities transition to urban governance that prioritize quality-of-life improvements, historical cultural districts and natural green spaces—key public areas—fulfill vital historical, cultural, and social roles [1]. These areas reflect urban cultural memory and local identity [2], while offering access to and integration with natural environments. Their spatial environments combine architectural heritage, natural landscapes, and human activity, creating rich and multifaceted atmospheres. Within this context, acoustic environments have emerged as critical determinants of spatial quality, integrating cultural soundscapes (e.g., human activities, markets) and natural soundscapes (e.g., wind, birdcalls, flowing water) [2]. The interaction between these two soundscape types directly shapes visitors’ perceptual experiences [3], yet their interplay remains underexplored in historic areas.
Soundscape was defined as the “acoustic environment as perceived or experienced and/or understood by a person or people, in context” [4], highlighting the importance of sound perception. Urban soundscape research focuses on the variations of urban sounds across different temporal and spatial contexts and their effects on the public [5,6]. Soundscapes are viewed as resources [7], which can be optimized to enhance the quality of urban acoustic environments [8,9]. Current research topics include: studies on sound sources, such as the identification of urban sound sources [10] and the characteristics of specific sounds (like bird songs) [11]; studies on the temporal and spatial variations of soundscapes, such as daily changes in noise complaints [12,13,14]; studies on sound evaluation, including how personal factors influence sound preferences [15,16]; studies on indicator construction, focusing on determining soundscape descriptors [17] and physiological survey indicators; studies on multisensory influences, such as the effects of audiovisual interactions on soundscapes [18], and studies on soundscape design, including the application of soundscape resources in landscape planning [19]. The above findings collectively demonstrate the effects of sound intensity and source on human perception, as well as differential impacts across distinct population groups. Additionally, such studies are integrated with each other. For instance, Francesco utilized a large cross-cultural sample to examine how sound source type (e.g., natural vs. artificial sounds) and intensity influence perceived pleasantness [20]. Separately, Chan investigated the role of individual characteristics and found that female participants reported lower acoustic comfort when exposed to nighttime sounds, sounds from transportation/built-up land uses, and mechanical/vehicular/aircraft noises [21]. Together, these studies provide a foundational framework for future research.
Urban forest, as vital landscape elements in urban areas, provide open spaces conducive to public visitation and positively influence citizens’ physical and mental health [22,23,24,25]. Soundscapes in historical landscapes are vital for promoting the authenticity of historical culture [26], enhancing urban historical perception [27], and contributing to urban historical preservation [28,29]. Existing soundscape research has made progress in two parallel directions relevant to this study: urban forest soundscapes and historical district soundscapes. However, there is still a lack of research on the interaction between these two directions.
On the one hand, research on the soundscapes of urban forests has gradually shifted from a relatively single focus on the restorative effects of natural sounds to exploring the interaction between artificial sounds and natural sounds [30,31]. Early studies, such as those by Ratcliffe et al. (2013), have shown that bird songs can enhance people’s attention recovery and reduce noise pollution caused by traffic [32,33]. And water sound, as an active sound source, also makes a significant contribution to enhancing tourists’ restorative effects [34,35,36]. In addition, some scholars have explored the potential connection between urban forest soundscapes and physiological benefits (such as reduced heart rate) [37]. Recent studies on human sounds in forests have shown that artificial sounds play a dual role in urban forests: low-intensity human activities (such as casual conversations) may enhance the perceived social vitality without significantly disrupting the sense of tranquility; while high-intensity artificial sounds may damage the overall environmental experience [38,39]. However, most of these studies focus on general urban parks or large woodlands, rather than forests adjacent to cultural heritage areas. This means that current research may not fully explore how forest soundscapes interact with heritage-specific human-made sounds (such as folk performances related to local history). Future research can further investigate this dynamic relationship [40].
On the other hand, research on the soundscapes of historical areas has also evolved from a single focus on the value of historical sounds to a comprehensive study of all elements of the soundscapes in historical areas [41]. For example, Zhang et al. (2016) analyzed the soundscape of Chinese Han Buddhist temples and found that religious sounds (such as chanting) enhanced tourists’ perception of cultural authenticity [42]; Kim et al. (2022) explored the bell sounds in the historical center of Daejeon, revealing how these sounds shape people’s collective “acoustic memory” of the site’s past [43]. Subsequently, researchers began to explore the interrelationship between cultural soundscapes and natural soundscapes within historical areas. For example, Germán et al. (2018) demonstrated that in the historically significant monumental sites within semi-natural environments, natural sounds enhance the overall sense of pleasure [44]. Bartalucci et al. (2020) systematically explored the application of soundscape research in the field of cultural heritage, emphasizing the symbiotic relationship between natural soundscapes and cultural soundscapes [45]. Đorđević et al. (2023) explored sacred soundscapes in Serbia, highlighting the need to extend archaeological acoustic studies of sacred sites to include surrounding outdoor soundscapes [46]. Jo et al. (2021) surveyed Cheong-ju, finding that both anthropogenic and natural sounds dominate the soundscape of historical areas [47]. These studies indicate that people have begun to pay attention to the interaction between nature and culture, where natural sounds serve as temporal anchors connecting the past and the present.
Despite these advancements, the integration of natural sounds and cultural sounds in historical districts remains insufficient. Moreover, most current studies focus on the relationship between cultural and natural sounds in historical areas, with little discussion of their spatial interactions. Historical areas typically encompass historic districts and abundant green spaces. Within urban soundscapes, the soundscapes of historic districts and green space are considered worthy of preservation [29]. Historic districts represent cultural soundscapes dominated by human activities (visitor conversations, tour guides, vendor calls), while green space emphasize natural soundscapes with biophony and geophysical elements (wind, water, bird songs). The interaction between these soundscapes significantly shapes visitor perception [48], yet limited research explores this interplay—especially in settings where historical culture and natural environments coexist. Previous studies often analyze one soundscape type in isolation, neglecting their coexistence and interaction in the same area. This gap hinders the development of integrated soundscape optimization strategies for historic–nature interface areas.
This study addresses this gap by investigating how natural (urban forest) and cultural (historic street) soundscapes interact to influence visitor perception in the Wuhou Shrine Museum area, Chengdu. Existing studies mostly analyze the soundscape of a single space based on general soundscape theories, but pay relatively little attention to the composite characteristics of the ‘historical–natural interface area’. This study extends the ‘expectation–perception matching theory’ [49,50] from general urban scenarios to heritage contexts by introducing the expectation mechanism. It confirms that subjective perception in historical scenarios not only depends on sound intensity/sound sources, but is also regulated by cultural expectations and audiovisual consistency, which will supplement the soundscape theory in historical–natural composite scenarios. Specifically, we examine: (1) the spatial differences in sound intensity, sound source recognition, and subjective experience between historic streets and adjacent urban forest; (2) the impact of sound source recognition (biophony, anthropophony) and sound intensity metrics (L10, L90, etc.) on subjective evaluations. By integrating objective acoustic measurements and subjective surveys, this research aims to provide a theoretical basis for developing tailored soundscape optimization strategies in historical areas.

2. Methodology

This study adopts a three-level framework of data collection–analysis–modeling to carry out the research (Figure 1): In the first stage, objective data of sound pressure levels at 20 research points were collected using a sound level meter (AWA 6628+, Hangzhou Aihua Instruments Co., Ltd., Hangzhou, China), and 113 pieces of subjective perception data were collected simultaneously through structured questionnaires. In the second stage, kernel density analysis (ArcGIS for Desktop 10.6) was used to analyze the spatial distribution characteristics of sound intensity and sound sources, and Spearman correlation analysis was used to explore variable correlations. In the third stage, a SEM was constructed based on the soundscape perception theory to verify the influence paths of sound intensity, sound source identification, and individual information on subjective perception. All analyses passed reliability and validity tests to ensure reliability.

2.1. Study Area

The Wuhou Shrine Museum, located in Wuhou District, Chengdu, Sichuan Province, China, is a national scenic area. The museum comprises three main sections: the Three Kingdoms Historical Relic Area (Cultural Relics Zone), the Western Area (Three Kingdoms Cultural Experience Zone), and the Jinli Folk Area (Jinli Ancient Street), covering a total area of 150,000 m2. Jinli Ancient Street (550 m length) features vibrant cultural soundscapes including folk performances, vendor calls, and historical ambient sounds, while the Historical Relic and Western Areas offer contrasting natural soundscapes characterized by water features, forested areas, and bamboo groves acoustics [51]. In this study, Jinli Ancient Street is a historical street, while Historical Relic and Western Areas are an urban forest. At the same time, this forest has been developed into an urban park according to local conditions.
This study selected 20 survey locations within the Wuhou Shrine Museum for field research (Figure 2). These points were strategically placed in areas of high visitor activity and featured distinct soundscape characteristics. The survey points included intersections of pathways and green spaces, areas around water features, shaded resting zones, and temporary gathering areas. Specifically, Points 2, 5, and 6 were located within Jinli Ancient Street, Points 1, 3, 4, 7, and 8 were situated at the interface of Jinli Ancient Street and green spaces, and Points 9 to 20 were distributed across the museum’s green spaces. The spatial conditions of each measuring point are shown in Figure 3.

2.2. Theoretical Basis

This study was grounded in the evolutionary process of soundscape research, focusing on the interaction between natural and cultural soundscapes and their influence on perceptual experiences.
The concept of “soundscape” (defined by ISO 12913-1:2014 [4] as a perceived acoustic environment) emphasizes human perception as its core, evolving from early focus on natural soundscapes (biophony, geophony) [52] to integrating cultural soundscapes (anthropophony) [6]. As for humanistic soundscapes, especially those in historical districts, relevant studies usually focus on their core sound source composition (such as folk sounds) and explore how they shape perceptual experiences [3]. Recent studies usually involve the impact of different sound source types [10,11] and sound intensities [12,18,20] on human perception, and also explore the differences in perception among different groups [15,21]. As for humanistic soundscapes, especially those in historical blocks, relevant studies usually focus on their core sound source composition (such as folk sounds) and explore how they shape perceptual experiences [3].
Our study follows this process: we first identify natural (forest) and cultural (historic street) sound sources, measure sound intensity in different environments, then analyze their interaction and impact on subjective perceptions (e.g., comfort, harmony), aligning with the theoretical logic of soundscape research.

2.3. Content of Subjective and Objective Data Collection

Objective data were collected on-site using an AWA 6628+ Class I sound level meter, covering four indicators: L10, L50, L90, and LAeq. Specifically, L10 (peak sound pressure), L50 (median sound pressure), and L90 (background sound pressure) represent sound levels exceeded during 10%, 50%, and 90% of the survey time, respectively; the difference (L10-90) reflects peak-to-background sound variability. LAeq denotes the equivalent continuous sound pressure level. Data collection was conducted between 8:00 and 10:00 AM (local time), as this period captures the peak of visitor activity in the study area in the morning. This includes folk performances, early sightseeing, and vendor operations, thereby reflecting typical soundscape interactions between the historical street and forest. Acoustic surveys were performed at 20 locations, with each survey lasting 1 min and repeated three times per location, which can avoid capturing non-representative events (e.g., a sudden loud noise). The Coefficient of Variation (CV) of the sound pressure level (SP) after collection is shown in Table 1, which demonstrates good reliability. SPL metrics were prioritized to capture dynamic sound variations in historical areas, where human activities and cultural sounds dominate the acoustic environment.
Subjective perception data were gathered through a structured questionnaire consisting of three sections:
(1) Demographic information: includes variables such as gender (male, female), age group (≤18, 18–35, 36–45, 46–65, 65–80, >80), educational background (high school or below, undergraduate/diploma, graduate or above), place of residence (resident, non-local visitor), occupation (student, employed, retired, other), visit frequency (first-time visitor, 2–3 visits, weekly, multiple times per week, at least once per month), and monthly income (≤1500 CNY, 1500–3000 CNY, 3000–5000 CNY, 5000–10,000 CNY, >10,000 CNY).
(2) Sound source perception, where participants identified geophysical, biological, and anthropogenic sounds. To ensure consistency, participants received pre-survey training. Geophysical sounds refer to non-biological natural sounds such as wind, rain, and thunder. Biological sounds include those produced by living organisms, such as birdsong and human conversation, while anthropogenic sounds arise from human-made activities.
(3) Sound environment evaluation: including pleasure, comfort, quietness, and harmony. Pleasure refers to an individual’s perceived level of enjoyment and satisfaction within the soundscape. Comfort indicates a sense of relaxation and ease, while quietness reflects the perception of low-intensity sound levels in the environment. Lastly, harmony denotes the perceived balance and cohesion among various auditory elements. Soundscape evaluation used a 5-point Likert scale assessing pleasure, comfort, quietness, and harmony. Scoring criteria: very unpleasant/uncomfortable/quiet/harmonious (+1), somewhat unpleasant/uncomfortable/quiet/harmonious (+2), neutral (+3), somewhat pleasant/comfortable/quiet/harmonious (+4), very pleasant/comfortable/quiet/harmonious (+5).

2.4. Experimental Design

To ensure the feasibility and consistency of data collection, the formal survey was conducted on sunny weekdays in April 2021. The target population included two groups: (1) tourists aged 18–80 years visiting the Wuhou Shrine Museum (excluding those in a hurry or with hearing impairments, unique to each survey site); (2) 10 graduate student volunteers with prior soundscape research experience (participated in surveys at all 20 sites to control individual bias and ensure cross-site data consistency).
The overall experimental process includes: Questionnaire Validation and Pre-Training; Sampling Design; and Formal Survey Procedure.
(1)
Questionnaire Validation and Pre-Training:
Questionnaire Design Evaluation: We verified the validity of the questionnaire items through the following methods: expert review (3 soundscape scholars evaluated content validity); pre-testing (n = 30) to optimize wording and improve the accuracy of responses.
Pre-training: All participants received pre-training before the formal experiment to enhance the accuracy of the experiment. The training content was standardized as follows: (1) explaining the research purpose; (2) clearly defining biological sounds (such as bird songs, human conversations), geophysical sounds (such as wind, water flow), and anthropogenic sounds (such as vendors’ cries, broadcasts) with audio examples; (3) demonstrating the sound recording method (recording identified sounds every 20 s). Volunteers received additional advanced training, including a 5-min pilot test, to practice distinguishing subtle sound differences (such as the sounds of wind and water) and ensure consistency in recording standards. Tourists received a 5-min brief audio-visual guidance, focusing on sound category identification and subjective evaluation skills (using a 5-point Likert scale).
(2)
Sampling Design:
We have standardized the number of samples for the questionnaire survey and the three key sampling settings for tourists/volunteers as follows:
Quantity setting: The pre-test results (n = 30) showed that 3–5 respondents per site are sufficient to capture the spatial variation in sound perception. With 20 sites, the target sample size was 100–120, and we ultimately collected 113 valid responses (5–7 per site), which met the requirements for statistical and spatial representativeness.
Tourist sampling setting: At each of the 20 pre-determined survey points, tourists were selected using simple random intercept sampling. Investigators approached every 3rd tourist passing through the point (to avoid bias) and invited them to participate, ensuring a diverse range of demographics (age, origin). This method was chosen to reflect the natural variation of visitors in the area.
Volunteer sampling setting: The 10 volunteers were pre-selected based on their experience in soundscape research (to ensure consistency in sound recognition). They were assigned to all 20 survey points using a rotational schedule (each volunteer visited 4–5 points, which included parks and streets, randomized to avoid temporal bias). This design controlled for individual differences in perception, allowing valid cross-site comparisons.
(3)
Formal Survey Procedure:
Acoustic measurement: At each point, a sound level meter (AWA 6628+) recorded L10, L50, L90, and LAeq for 1 min, repeated 3 times (at 10-min intervals) to capture short-term variations.
Sound perception test: Tourists: Closed eyes and listened for 3 min, recording identified sounds every 20 s (9 intervals) on a pre-designed form; Volunteers: Completed a 5-min test with the same 20-s interval (15 recordings) to capture more detailed sound dynamics, leveraging their training.
Questionnaire administration: Immediately after the perception test, participants rated pleasure, comfort, quietness, and harmony using a 5-point Likert scale. Responses were collected on-site to avoid memory bias.
Data synchronization: All records (acoustic data, perception logs, questionnaires) were labeled with the survey point ID, time, and participant ID for cross-referencing.

2.5. Reliability and Validity of Experimental Data

A total of 120 questionnaires were distributed in the experiment, and 113 valid responses were collected (N = [5, 7]). We conducted reliability and validity tests to assess the quality of the collected questionnaire data. The reliability test evaluates the internal consistency and accuracy of the quantitative data. Results show that the Cronbach’s alpha coefficient for subjective perception is 0.811, exceeding the acceptable threshold of 0.7, indicating high reliability. The Corrected Item-Total Correlation (CITC) values for pleasantness (0.672), comfort (0.771), quietness (0.703), and harmony (0.491) are all above 0.4, suggesting good item correlation and supporting the internal consistency of the scale. These results indicate that the scale is well-designed and suitable for further analysis.
The validity was assessed through the Kaiser–Meyer–Olkin (KMO) value and Bartlett’s sphericity test. The KMO value is 0.776 (>0.60), confirming sampling adequacy for factor analysis. Bartlett’s test of sphericity was significant (p < 0.001), indicating that the data are suitable for structure detection. Exploratory factor analysis extracted three common factors, consistent with the structure of the questionnaire. The cumulative variance explained after rotation is 74.832%, exceeding the 50% threshold, which demonstrates that the items effectively capture the underlying constructs of interest.

2.6. Experimental Data Analysis

The data analysis included the following three major components:
(1) Spatial Distribution Analysis: Kernel density analysis was used to visualize the spatial distribution of sound intensity and auditory perception, thereby identifying areas where soundscape interactions occur. Kernel density analysis uses the default bandwidth optimization method of ArcGIS 10.2 (Silverman’s rule of thumb) to calculate the bandwidth h = 0.9 × min (standard deviation, interquartile range/1.34) × n^(−1/5), where n = 20 (the number of research points), and the final bandwidth value is 50 m. This bandwidth not only avoids fragmented hotspots caused by excessively small bandwidth but also prevents excessive spatial smoothing due to excessively large bandwidth, which is in line with the spatial scale of the study area (150,000 m2).
(2) Correlation Analysis: Spearman’s rank correlation was used to explore associations between acoustic indicators and subjective responses; Kernel density analysis was used to explore the spatial distribution of variables.
(3) Causal Analysis: A SEM was developed to identify key factors influencing subjective perception.
Finally, the study examined synergy and conflict between street and forest soundscapes—that is, the interaction between cultural (street) and natural (forest) auditory environments within the historical district. For instance, commercial sounds from the street may complement biological sounds from adjacent green spaces (e.g., vibrant street chatter alongside birdsong), or they may conflict (e.g., vendor noise disrupting forest tranquility). The analysis also considered potential influencing factors of this interaction, including acoustic parameters (e.g., sound intensity, type) and visitor demographics (e.g., age, site familiarity).

2.7. Structural Equation Modeling (SEM)

SEM is a multivariate data analysis method used to explore the interrelationships among multiple latent variables. In this study, SEM will be employed to further analyze how physical sound intensity and subjective sound source recognition influence subjective sound evaluations. The SEM approach adopted in this study includes three components: data processing, Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA).

2.7.1. Data Processing

For conducting SEM analysis, nested data (individual data within 20 sites) were aggregated to the site level (1 row of data per site). This study employed a data aggregation method for data preprocessing, whose steps include:
(1) Group mean: Grouping by site number, the mean value of each indicator within the group was calculated. Among them, the objective acoustic indicators (L10, L50, L90, LAeq) were the average of all measured values at each site; the subjective experience indicators (pleasure, comfort, quietness) were calculated using a weighted mean, where the weight was the proportion of the number of volunteers/visitors at the site to the total number of people at that site; the sound source identification and individual information indicators were calculated using the above-mentioned weighted mean method.
(2) Standard error (SE) calculation: For the aggregated mean of each site, the standard error was calculated to reflect the degree of data variation.
(3) Data aggregation: After aggregation, 20 rows of data were formed (1 row for each site), including the site mean and standard error of all variables, which were used for subsequent SEM analysis.

2.7.2. EFA and CFA

According to Spearman’s rho correlation analysis (Table 2), the variables of geophysical sound and harmony have a relatively low correlation with other variables. In addition, during the preliminary principal component analysis, geophysical sound was classified into the component related to subjective feelings, which is inconsistent with the general expectation. Therefore, when conducting the EFA, the two variables of geophysical sound and harmony were removed.
Through EFA (Table 3), four common factors were extracted, with a total variance explained of 73.527% (greater than 50%, indicating good explanatory power), and the KMO test is 0.643 (greater than 0.6, meeting the analysis requirements). Common Factor 1 includes the variables “LAeq”, “L10”, “L50”, and “L90”, which can be categorized as sound intensity, accounting for 22.672% of the variance. Common Factor 2 includes the variables “pleasure”, “comfort”, and “quietness”, which can be categorized as subjective perception, accounting for 15.951% of the variance. Common Factor 3 includes the variables “biological sound” and “anthropogenic sound”, which can be categorized as sound source recognition, accounting for 11.756% of the variance. Common Factor 4 includes variables such as “age”, “educational background”, “place of origin”, “occupation”, “visit frequency”, and “monthly income”, which can be categorized as individual information, accounting for 23.148% of the variance. The extracted factors align well with the predefined categories.
The common factors derived from the EFA were used as latent variables in the SEM, with their corresponding indicators treated as observed variables. To further verify the robustness of the measurement model, CFA was conducted, including both reliability and validity assessments. To adapt to the aggregated data structure of the final SEM, aggregated data at the measurement point level (n = 20) was used for CFA validation.
Reliability was evaluated using Cronbach’s alpha (CA) for each latent construct. As shown in Table 4, the CA values for “sound intensity”, “sound source recognition”, “subjective perception”, and “individual information” were all above 0.7, indicating strong internal consistency and reliability of the measurement model.
To assess convergent validity, Average Variance Extracted (AVE) and Composite Reliability (CR) were calculated. Convergent validity is considered acceptable when AVE > 0.5 and CR > 0.7. Based on the CFA results computed in AMOS Graphics 24 (Table 4), all three latent constructs showed AVE values exceeding 0.5 and CR values greater than 0.7, confirming that the measurement model demonstrated good convergent validity.

2.7.3. Structural Equation Model Construction and Hypotheses

Based on the theoretical framework, research objectives and previous studies (Table 5), four hypotheses were proposed to guide the SEM construction:
Hypothesis 1 (Ha).
Sound intensity has a positive effect on subjective experience.
Hypothesis 2 (Hb).
Sound source recognition has a negative effect on subjective experience.
Hypothesis 3 (Hc).
Individual information has a significant effect on subjective experience.
Hypothesis 4 (Hab).
Sound intensity and sound source recognition significantly influence each other.
Based on these hypotheses, the SEM was constructed as shown in Figure 4.
The SEM was developed in AMOS Graphics 24, following the relationship assumptions mentioned above. SEM models typically use fit indices to evaluate the overall goodness-of-fit. Common indices include the Chi-square to degrees of freedom ratio (X2/DF), Goodness of Fit Index (GFI), Root Mean Square Error of Approximation (RMSEA), and Comparative Fit Index (CFI). Table 4 presents the fit indices for this SEM. The initial results indicated that X2/DF, GFI, RMSEA, and CFI did not meet the recommended thresholds, requiring modifications to the model. To optimize the model’s goodness of fit and ensure the scientific nature of the corrections, this study, based on acoustic theory, soundscape perception theory, and tourism behavior theory, selected and added 7 groups of residual correlations through the Modification Index (MI). The theoretical basis and literature support are summarized as follows (Table 5). After these modifications, all fit indices met the acceptable criteria (Table 6).

3. Results

Through objective acoustic measurements, subjective questionnaire surveys, and multi-dimensional analysis, this chapter presents the spatial differences in soundscape sound source perception, sound intensity distribution, and subjective perception, and further reveals the interaction mechanism of soundscape perception in historical areas through SEM.

3.1. Analysis of the Spatial Distribution Characteristics of Sound Source Perception

This study utilized kernel density analysis and perceived occurrence (PO) statistics to uncover the spatial distribution characteristics of natural and cultural soundscapes in the street and forest areas. Figure 5 shows the kernel density estimation based on 20 research points, displaying the potential high-value areas of sound intensity/sound source perception (the results are spatial interpolations of discrete points).
Overall, biological sounds had the highest perceived occurrence across the study area (PO = 8.1), primarily consisting of bird songs, conversations, and other natural environmental sounds. Anthropogenic sounds ranked second (PO = 5.7), including broadcasting and market-related sounds, while geophysical sounds had the lowest perceived occurrence (PO = 3.2), mainly comprising wind and water sounds. Significant differences in sound source perception were observed: biological sounds were more frequently perceived within the street area than in the forest, whereas geophysical sounds were more prevalent in the forest than in the street.
Within the street area (Points 1–8), the dominant soundscapes were biological and anthropogenic sounds. Biological sounds had the highest perceived occurrence (PO = 10.9), concentrated near the street entrance and main commercial areas. Anthropogenic sounds (PO = 8.5) were predominantly observed in the vending zones and plazas, characterized by visitor conversations, vendor calls, and broadcasting sounds. Geophysical sounds were sporadically distributed in the street area, appearing near wooded or water features, but their perceived occurrence was relatively low (PO = 2.3).
In the forest area (Points 9–20), geophysical and biological sounds prevailed. Biological sounds (PO = 6.2) were primarily perceived in areas with high vegetation coverage. Geophysical sounds were concentrated around water bodies and adjacent areas, with a perceived occurrence of PO = 3.81. Anthropogenic sounds (PO = 3.79) were more scattered and less frequent than in the street, but still perceptible at high-traffic path intersections. In addition, the high concentration of geophysical sounds near water bodies, this spatial distribution may also be the reason for the poor correlation between geophysical sounds and other indicators.
At the interface between the street and forest areas (Points 1, 3, 4, 8), the perceived occurrence of biological and anthropogenic sounds were intermediate—slightly higher than in the forest but lower than in the street. Geophysical sounds were significantly less frequent compared to the forest area. The transition zone exhibited a notable complexity in soundscape composition and a relatively high soundscape diversity index (SDI), reflecting the coexistence of anthropophony (e.g., broadcast audio) and geophony (e.g., water sounds). However, this diversity was accompanied by reduced sound clarity, as overlapping and conflicting sources (e.g., loud announcements vs. soft wind sounds) diminished visitors’ ability to distinguish dominant sounds (Figure 5).
These findings indicate that cultural soundscapes (biological and anthropogenic) dominate the street area, while natural soundscapes (geophysical and biological) prevail in the forest. The interface between these zones is characterized by rich soundscape diversity but more complex and less distinguishable acoustic experiences due to mixed sound sources and higher visitor density.

3.2. Spatial Distribution Characteristics of Sound Intensity

Based on the kernel density analysis (Figure 5), overall sound intensity is higher within the forest than that within the historical district, while the boundary area generally has lower sound intensity compared to both the forest and the district. Specifically, the LAeq values are highest within the forest, especially concentrated around the water surface and near measurement point 8, followed by areas inside Jinli Ancient Street, such as the entrance. The LAeq values at the boundary between the district and the forest are relatively lower.
The L10 values, which reflect peak sound pressure, are also higher in the forest than in Jinli Ancient Street, with lower average peak sound pressure shown at the boundary. Similarly, the L50 values near the water surface and at point 8 in the forest are higher than at the entrance of Jinli Ancient Street, followed by the interior of the street, with the boundary having lower L50 values. The same trend was observed in L90 values, which are higher near the water surface and at point 8 in the forest moderate at the street entrance and interior, and lowest at the boundary. Finally, the L10-90 value, typically indicating foreground sound or signal noise, is highest at point 8, followed by the performance area inside Jinli Ancient Street and the area near point 8 in the forest.

3.3. Spatial Variations in Subjective Perception

According to the 5-point Likert scale results, visitors rated the overall soundscape of the district as follows: pleasure at 3.6, comfort at 3.7, quietness at 3.0, and harmony at 3.6. The results of Spearman’s rho correlation analysis (Table 2) show a low correlation (r = 0.384) between harmony and pleasantness, indicating no significant association. Similarly, harmony showed weak correlations with both comfort and quietness, suggesting that its evaluation may be relatively independent and possibly influenced by individual interpretations of the soundscape. The strongest correlation was found between quietness and comfort (r = 0.909), which is consistent with the common understanding of the mass.
In terms of spatial distribution, the average sound evaluation inside the forest (3.67) is higher than that inside the street (3.21). The evaluations of pleasantness, comfort, quietness, and harmony of the forest are all higher than those of the street (Table 7). Among them, the evaluation difference in quietness between the street and the forest is the largest. People generally think that the street is too noisy. However, despite the lower quietness score, the street still received relatively high ratings for comfort and pleasantness, likely due to the positive psychological effects of lively, socially engaging sounds such as human interaction and street activities. In addition, from the Spearman correlation analysis, it was found that the correlation between harmony and other indicators is poor. This may be due to individual differences in people’s understanding of harmony, as well as the fact that harmony shows relatively high scores in both forests and streets (which has certain differences from other subjective ratings).
The LAeq in the forest area is higher than that on the street, but tourists’ rating of the quietness of the forest (3.43) is significantly higher than that of the street (2.41). This contradiction may stem from the matching mechanism of “soundscape type—psychological expectation”. According to the theory of perceptual contrast in environmental psychology, the natural sounds in the forest (flowing water, bird songs) conform to tourists’ preset of tranquility and nature. Even if the sound intensity is high, they are still perceived as comfortable background sounds [51]. However, the artificial sounds on the street (vendors’ broadcasts, crowd noise) deviate from the expectation of “cultural immersion in historical blocks”. Even if the sound intensity is low, they are easily judged as “disturbing noise”. This is consistent with the conclusion of Guo et al. (2022) [55] that “tourists pay more attention to sound types rather than sound intensity in natural sound scenes”.

3.4. Structural Equation Modeling (SEM) Analysis

The modified SEM is shown in Figure 6, with the summary of regression coefficients and covariance coefficients listed in Table 6. As shown in Table 8, the hypothesized paths Ha, Hb, and Hc were all statistically significant. Specifically, Ha indicates that sound intensity positively influences subjective experience, suggesting that higher sound intensity levels are associated with more favorable acoustic perceptions. Hb reveals that sound source recognition negatively affects subjective experience, implying that increased awareness or identification of sound sources may reduce perceived soundscape quality. Moreover, Hb2 demonstrates that sound source recognition significantly influences subjective experience through the mediating role of individual information, highlighting the complex interaction between perceptual awareness and personal background characteristics. Hc shows that individual information has a significant negative effect on subjective experience, suggesting that certain demographic or behavioral traits are associated with lower soundscape satisfaction.
Further analysis of regression coefficients between individual information and its component variables indicates positive and significant relationships with age, occupation, visit frequency, and income, while negative and significant relationships were found with educational background and place of origin. This suggests that individuals who were older, had longer work experience, visited more frequently, and earned higher incomes tended to report lower levels of subjective experience. In contrast, those with higher levels of education and visitors from outside the area reported more positive evaluations of the soundscape. In addition, the results of the SEM model (Table 6) show that the standardized path coefficient of sound intensity on subjective perception is 0.437, and the factor loading of L10 (peak sound) (1.017) is significantly higher than that of L90 (background sound) (0.695), indicating that in historical areas, foreground sounds are more capable of dominating people’s perceptions.
According to Table 8, the relationship hypothesized in Hab did not show statistical significance, indicating that this path is not supported by the model. This phenomenon may be related to the division of geospace. Parks and neighborhoods have different dominant sound sources. High-intensity bird songs can enhance tourists’ satisfaction, while high-intensity artificial sounds and talking sounds only cause people to feel annoyed.

4. Discussion

4.1. Spatial Perception Differences and Coexistence Mechanisms of Natural–Cultural Soundscapes: The Role of Spatial Expectations

This study collected data on visitors’ sound source recognition, subjective experiences, and sound intensity using questionnaires and sound level meters. We propose that the influence of sound source type and sound intensity on subjective perception is significantly shaped by spatial context and cultural expectations—a dimension underemphasized in existing soundscape research on historic areas [49,55]. Tourists’ expectations regarding different spatial functions can amplify or buffer the impact of the acoustic environment on their subjective evaluations. For example, in industrial heritage sites, tourists’ expectations of mechanical sounds can enhance their sense of immersion [57]. This is consistent with the “Expectation–Perception Matching Theory” [49], but we have extended it to the context of cultural heritage. Based on the empirical evidence from the historical–natural interface zone, this study reveals two unique soundscape perception mechanisms, which supplement and refine the existing theories.
First, visitors’ context-specific expectations (quietness for forests, liveliness for historic streets) influence their subjective evaluations, potentially overriding objective sound intensity levels. For example, in our research and analysis, the LAeq were higher in the forest than in the street area, visitors rated the street as noisier than the forest. This contradiction can be explained by the expectation mechanism in environmental psychology, which refers to the positive impact of people’s expectations on their perceptions through “cognitive–emotional consistency” [58]. In the study area, tourists form a dual reference framework based on spatial functions: when entering a forest, they take “natural tranquility” as a reference (for example, expecting the sound of wind blowing through bamboo forests and distant water flows), so even high-intensity natural sounds are perceived as “consistent with the natural environment” and will not be labeled as “noise”. On the contrary, in historical streets, the reference shifts to “orderly cultural vitality” (for example, expecting folk performances and the cries of vendors), so suddenly appearing peak sounds (such as the sharp broadcast sounds of vendors) are regarded as “disrupting the expected order” and amplified as “noise”. This finding is similar to the results of the traffic noise reduction experiment conducted by Levenhagen et al. (2021) in U.S. national parks: reducing traffic noise levels did not significantly improve visitors’ soundscape pleasantness, and visitors’ evaluation of the soundscape depends on their pre-existing psychological expectations of natural sounds rather than the physical sound pressure level [59]. Liu et al. (2017) found in their study on the soundscape of natural parks that tourists’ tolerance for aircraft noise increased by 15% when they were informed in advance that “aircraft often pass by here [50]”. However, our study further specifies that this effect is strengthened in historic–natural interface areas—where natural sounds (e.g., wind through bamboo in Wuhou Shrine’s forest) are culturally framed as “complementary to historical ambiance”, whereas anthropogenic sounds (e.g., vendor calls in Jinli Street) are perceived as “disruptive to historical immersion” despite lower LAeq. In other words, while the street was objectively quieter, it was perceived as noisier—likely due to a mismatch between acoustic features and spatial expectations
Second, the difference in tourists’ perceptual evaluations between forests and urban blocks may be attributed to the composition of the dominant sound sources. According to the survey results, forest soundscapes are dominated by geophysical and biological sounds, while street soundscapes are dominated by biological and anthropogenic sounds. Based on this difference, this survey report shows that people can perceive a higher degree of pleasure, comfort, and harmony in forests. This indicates that in historical areas, people’s tolerance for high-intensity natural sounds may be higher than that for high-intensity cultural sounds. This finding is similar to the results of recent studies, mainly reflected in two aspects. On the one hand, researchers generally believe that natural soundscapes lead to higher satisfaction: a cross-cultural study by Purves and Wartmann (2023) shows that natural sounds are more often regarded as environmental backgrounds, while cultural sounds are to some extent considered invasive activities [53]; a study by Ye et al. (2024) also found that even if natural sound sources (such as the sound of flowing water) have a high sound pressure level (>60 dBA), they are still acceptable because they meet people’s expectations for the natural characteristics of historical environments [3]; at the same time, natural environments can reduce the annoyance caused by high sound pressure levels, thereby enhancing the pleasure of soundscapes [60,61]. Vegetation in urban green spaces can also soften sound perception, reduce negative emotions caused by loudness, and the reduction of human-related sounds in natural spaces can also improve people’s evaluation of soundscapes [3,62,63,64]. On the other hand, researchers believe that crowd noise is more likely to cause disgust: such sounds will destroy the spirit of the place [3]. In historical environments, human sounds, which are classified as biological sounds in this study but are essentially anthropogenic, are generally considered to have a negative impact on pleasure [44] because they destroy the authenticity of the historical soundscape that tourists pursue [49]; in addition, the density of street crowds may increase subjective loudness, which is consistent with the study by Meng et al. (2015) [65], and the dominant sound will affect tourists’ overall perception of the loudness of the space, and its impact is usually greater than that of background sounds or insignificant sounds [44]—for example, tourists in street areas regard conversation sounds and broadcast sounds as the dominant sound sources, while in forests, geophysical sounds (such as wind and water sounds) are often regarded as the dominant sound sources. Therefore, even if the sound pressure level in forests is higher than that in urban blocks, tourists still think that forests are quieter than urban blocks.
In conclusion, people’s expectations of soundscapes are shaped by their previous experiences in similar environments. When the actual soundscape aligns with these expectations, people are less likely to form negative impressions and may even be less aware of the existence of the sound environment [49,57]. For example, in studies, tourists rated the harmony, comfort, and pleasantness of street areas higher than their quietness. This indicates that despite the relatively high noise levels, tourists have certain expectations about the liveliness of the area, which helps to mitigate negative perceptions. Our research extends this insight to historical areas, showing that the “expected sense of liveliness” stems from cultural associations related to heritage (for instance, Jinli Street’s reputation as a “traditional commercial block”), making it a more important predictor of subjective experience than objective quietness.

4.2. Soundscape Conflict at the Historic–Forest Interface: Characteristics, Mechanisms, and Theoretical Implications

The Wuhou Shrine Museum’s forest (Cultural Relics and Western Areas) and historic district (Jinli Ancient Street) are directly adjacent, with multiple pathways on the north side facilitating sound interaction between the two spaces. Kernel density analysis of sound intensity, sound source recognition, and subjective experience revealed a critical phenomenon: the interface zone (Points 1, 3, 4, 8) exhibited higher soundscape diversity but lower subjective evaluations (comfort score = 3.1) compared to both the forest and the historic street. This high diversity-low satisfaction paradox defines the core of soundscape conflict at the historic–forest interface—a pattern not fully addressed in existing research on urban interface soundscapes [66].
In terms of sound intensity, the interface zone had lower LAeq than both the forest and the street, yet visitors reported more perceived noise (quietness score = 2.7, vs. forest 3.43 and street 2.41; Table 7). This discrepancy stems from sound masking by dominant anthropogenic sounds: peak sounds from the street (e.g., sudden vendor broadcasts, tourist chatter) masked subtle natural sounds in the forest (e.g., wind through bamboo, distant water flow), reducing visitors’ ability to perceive coherent soundscapes. This is consistent with recent studies, which show that anthropogenic noise can impair humans’ ability to recognize biological sounds and low-frequency sounds through spectral masking and cognitive interference [67,68]. This aligns with the study of Liu et al. (2014), who demonstrated that dominant sound sources in interface zones disrupt the detectability of complementary sounds, leading to perceived acoustic disorder [66]. Notably, the interface lacked shared activity nodes (e.g., plazas, green pavilions) to buffer sound transmission, unlike the forest’s water features or the street’s performance areas, which further exacerbated sound incoherence.
Regarding sound source recognition, the interface’s high SDI (driven by mixed geophony, biophony, and anthropophony) did not enhance perceptual richness but instead increased confusion. This finding is also consistent with that of Liu et al. (2014) [66], who found in their study that a higher SDI impairs the perception of bird songs (biological sounds). This is because anthropogenic sounds (e.g., street broadcasts, PO = 4.2) frequently overlapped with natural sounds (e.g., birdcalls, PO = 3.9) at the interface, creating “acoustic competition” that violated visitors’ context-specific expectations [66,67]. This finding extends the research of Schreckenberg et al. (2010) on landscape-soundscape interactions, which noted that “visual cues shape soundscape expectations, and mismatches intensify dissatisfaction” in heritage sites [69].
Subjective evaluations at the boundary are also lower than those on both sides. This phenomenon may be related to the influence of tourists’ visual perception on soundscape evaluation, and its underlying reasons can be further explained through the expectation mechanism [49,58]. Specifically, when the soundscape matches tourists’ cultural memories or experiential emotional memories, it triggers positive emotional responses; conversely, mismatched sounds break this connection, leading to negative evaluations. In the context of Wuhou Temple, tourists form two core emotional connections based on cultural cognition: (1) For the forest area (cultural relics area), the emotional connection is ‘historical tranquility’—tourists associate the forest with the cultural background of the Three Kingdoms, so natural sounds (such as the rustling of bamboo leaves in the wind) strengthen this connection and enhance comfort. (2) For the street area (Jinli Ancient Street), the emotional connection is ‘traditional liveliness’—tourists associate the street with ‘ancient commercial scenes’ (such as folk performances, traditional vendors’ cries), so moderate cultural sounds strengthen this connection and offset the negative impact of low quietness. However, in the interface area, mixed sounds (such as overlapping street broadcasts and bird songs) break these two emotional connections: artificial sounds disrupt the ‘historical tranquility’ of the forest, while faint natural sounds cannot support the ‘traditional liveliness’ of the street, ultimately resulting in a low harmony score. In addition, the high crowd density at path intersections also reduces tourists’ subjective evaluations [34]. It should be particularly noted that unlike the street area—where crowd noise is often regarded as “vibrant cultural vitality” [70]—the crowd sounds at the junction conflict with tourists’ expectation of a “peaceful transition between history and nature”, further exacerbating the evaluation contradiction.
Existing studies on historic or forest soundscapes often treat the two as isolated systems [31,45], but this study highlights that their interface is not a “neutral transition” but a dynamic zone of conflict shaped by three interrelated mechanisms: sound masking, audiovisual mismatch, and transient crowding. This found specifying that in historic–natural contexts, conflict arises not from high sound intensity alone, but from the “misalignment between sound composition, visual cues, and cultural expectations.” For example, the interface’s mixed sounds failed to align with either the street’s “cultural liveliness” or the forest’s “natural tranquility”, leaving visitors without a coherent perceptual frame. The revelation of the laws governing soundscape integration by this special type of space not only compensates for the limitation of existing studies that mostly focus on single spaces and ignore the complexity of boundary zones, but also provides a supplementary perspective of “cultural–natural composite scenes” for mainstream soundscape theories. For example, existing international studies are mostly based on the single logic of “natural soundscapes promoting restoration” or “cultural soundscapes enhancing identity”. However, this study, through case studies of boundary zones, confirms that the integration of these two types of soundscapes is not a simple choice between “natural sounds dominating” or “cultural sounds leading”, but rather requires building an appropriate integration model based on the “dual historical–ecological attributes” of the space itself. For heritage sites like Wuhou Temple, which are centered on the Three Kingdoms culture, the optimal state of soundscape integration in the boundary zone is “taking natural sounds (flowing water, bamboo forest sounds) as the base, and low-intensity cultural sounds (traditional musical instrument performances, soft explanations) as embellishments”. This not only retains the restorative value of the natural environment but also does not weaken the cultural immersion of historical scenes. This law not only provides a reusable theoretical framework for soundscape planning of similar “forest–historical boundary zones” (such as Suzhou gardens and surrounding urban green spaces, Kyoto’s historical districts and suburban mountains and forests) but also contributes empirical experience from the Chinese context to international soundscape research on heritage sites. It helps promote the improvement and deepening of the theory of “cultural–natural soundscape integration” in cross-cultural contexts, and further enhances the international academic attention and practical reference value of this study.

4.3. Factors Influencing Soundscape Perception: Sound Intensity, Sound Source Recognition, and Individual Characteristics

To explore how sound intensity, sound source recognition, and subjective experience interact, this study constructed a SEM to examine the effects of these variables on visitors’ perceptual outcomes. The results of the SEM (after modification: χ2/DF = 1.62, GFI = 0.9, CFI = 0.921, RMSEA = 0.075; Table 6) confirm that the three factors of sound intensity, sound source identification, and individual characteristics jointly explain subjective perception, and each has a different influence mechanism. Hypotheses Ha (sound intensity has a positive impact on subjective experience), Hb (sound source identification has a negative impact on subjective experience), and Hc (individual characteristics have a negative impact on subjective experience) all passed the significance test (Table 8). This result provides key empirical support for the multi-factor interaction research on soundscape perception and echoes and expands upon previous studies.
Hypothesis Ha (sound intensity has a positive effect on subjective experience) was supported. Among the sound intensity metrics (L10, LAeq, L50, L90), the SEM regression coefficients showed a clear hierarchy: L10 > LAeq > L50 > L90 (Table 6). This indicates that peak sounds have the strongest impact on subjective experience, while background sounds have minimal influence. This finding contrasts with Axelsson et al. (2010), who identified LAeq as the primary predictor of comfort in general urban spaces [71], yet aligns with the observation that “peak sounds dominate perceptual evaluations in heritage sites” because they disrupt the “historical immersion” visitors seek [3,44]. In addition, previous studies have shown that loud mechanical noises (frequently appearing on the street) can affect human auditory capacity, impede stress recovery, and lead to negative emotions [72,73,74]. This also explains why visitors perceive the street as noisier, as commercial streets often feature harsh, disruptive sounds such as vendor calls. For instance, even though the forest had higher LAeq than the street, the street’s more frequent peak sounds led visitors to rate it as “noisier”, highlighting that peak sound control, not just average intensity reduction, is critical for historic soundscape optimization.
Assume that Hab (there is a significant interaction between sound intensity and sound source identification) is not supported: the covariance coefficient between sound intensity and sound source identification is not significant (Table 8), which reflects the spatial segmentation of soundscapes in the study area. Historical streets are dominated by medium-to-low intensity anthropogenic sounds, while forests are characterized by high-intensity natural sounds. As found by Liu et al. (2014), when soundscapes are spatially separated (for example, separated by the bamboo forest buffer zone of Wuhou Temple), intensity and sound source type become “decoupled”—this explains why no direct interaction was observed [66]. This finding adds nuance to the model by Aletta et al. (2016), which assumes a universal interaction between intensity and sound source identification [17], whereas this study shows that this relationship depends on the spatial context. Therefore, the value of the non-significant result of Hab lies in that it reveals the spatial boundary conditions of the “sound intensity–sound source identification association”—when there is an obvious spatial separation of soundscapes, the interaction between the two becomes ineffective, which has certain significance for revising the universal model of soundscape perception.
In addition, Hypothesis Hb, which examines the effect of sound source recognition (including anthropogenic and biological sounds) on subjective experience, shows that sound source recognition (PO) has a negative impact on subjective experience, with biological sounds having the most significant effect. This result differs from our initial assumption that biological sounds, particularly non-human biophony like bird songs, would exert a neutral or positive influence on subjective experience, as supported by prior studies linking natural biophony to restorative effects [32,33,34,35,36]. The key explanation lies in the composition of “biological sounds” in our dataset: while bird songs were present, they were outnumbered by human conversations (classified as biological sounds in our framework but inherently anthropogenic in nature). Biological sounds, such as conversations, have a notable negative impact on visitors’ subjective experience when they frequent occurrence [3,70]. Studies have found that human-dominated biological sounds have destroyed “historical authenticity” [70]. The kernel density analysis (Figure 5) also reveals that areas with higher perception of biological sounds tend to have lower subjective ratings. In the forest, although there are positive sound sources such as the chirping of birds, the human conversation has a strong masking effect, which may be the reason why the high occurrence of biological sounds leads to a decrease in people’s subjective experience. Therefore, in forests located in historical areas, it is necessary to control the flow density of people to a certain extent and set up quiet visiting areas [38], so that biological sounds such as bird calls can be more distinct.
Hypothesis Hc (individual information has a significant effect on subjective experience) was supported: Individual characteristics exerted a significant negative overall effect on subjective perception (Table 8), but subgroup differences revealed critical nuances tied to the study’s questionnaire data (Section 2.4). Age correlated positively with negative perceptions: older visitors were more sensitive to peak sounds, as Schreckenberg et al. (2010) noted, “noise sensitivity increases with age due to auditory system changes” [69]. Visit frequency also mattered: frequent visitors reported lower comfort, likely because they had higher expectations for “acoustic consistency”. In contrast, education and place of origin had positive associations: visitors with graduate degrees or above showed greater tolerance for “controlled cultural noise”, while non-local tourists perceived street liveliness as a “cultural attraction” rather than a disturbance. Income also had a weak positive effect, with higher-income visitors more likely to overlook minor noise disruptions.
Furthermore, this study’s unique regional, site, and cultural backgrounds enable its findings to supplement international soundscape theory and offer differentiated contributions in three aspects: (1) Regional dimension: Unlike studies on low-density European heritage sites [44,45,46], Wuhou Temple (in the core of Chengdu, a western Chinese megacity) features “high overlap between historical space and urban life”. High visitor density in Chinese scenic spots creates frequent peak sounds (e.g., folk performances, tourist conversations), making L10 (not the commonly used LAeq) the core indicator of subjective experience, which informs research on high-density Asian urban heritage sites. (2) Site dimension: Different from Aletta et al.’s (2016) open European urban parks [70], Wuhou Temple adopts a “closed-separated” layout via “bamboo buffers + low walls”. Soundscape overlap concentrates at the junction of historical streets and urban forests, leading to an insignificant “sound intensity–sound source identification” interaction—contradicting the “universal positive interaction” assumption in international models and providing a classification reference for site-specific soundscape studies. (3) Cultural dimension: Contrasted with natural sound protection in nature reserves [72], Wuhou Temple (a core Three Kingdoms cultural heritage site) endows forest sounds (e.g., bamboo wind, water) with cultural meanings, giving them dual “ecological restoration-cultural carrying” functions. This expands soundscape theory and offers a non-Western perspective for natural soundscape protection in cultural heritage sites. In summary, this study verifies the applicability of international soundscape theory in specific scenarios, reveals soundscape laws of high-density, culturally overlapping East Asian heritage sites, and provides key empirical support for the regional and cultural diversity of international soundscape research.
To contextualize the study’s methodological and theoretical contributions, this research develops a tripartite soundscape framework tailored explicitly for historic districts—a tool designed to address the unique interplay of culture, nature, and human perception in heritage settings. This framework consists of three core components: (1) sound source typology, (2) perceptual dimensions (pleasantness, quietness, harmony, comfort), and (3) socio-demographic profiling. This framework differs from standardized soundscape scales by prioritizing context-specificity over universality. For instance, Axelsson’s model focuses on a universal Pleasure-Eventfulness-Familiarity triad [71], and ISO/TS 12913-2 emphasizes general spatial metrics [75], yet both of these tools overlook the “cultural compatibility” of sounds in heritage sites. In contrast, our framework explicitly includes “harmony” as a key perceptual dimension, defined as the alignment between acoustic elements and a site’s historical identity. In practice, “harmony” is operationally defined through survey questions such as “Does the sound of this area match its historical characteristics?” and measured using a 5-point Likert scale. This dimension was identified via exploratory factor analysis (Section 2.4), which showed that harmony loaded separately from pleasure or comfort, capturing unique variance related to perceived congruence between acoustic elements and the site’s cultural identity. This allows us to distinctively quantify how sound combinations align with heritage expectations. In addition, compared to objective bioacoustic indices (e.g., NDSI for biodiversity quantification) [76], our subjective classification captures visitor-perceived sound dominance. Socio-economic variables (income, origin) extend beyond the SSID protocol’s basic demographics [77], revealing subgroup variations (e.g., income-correlated quietness preferences). Future studies could integrate ISO-compliant acoustic indices (e.g., NDSI/ACI) with our perceptual framework to enhance ecological validity while retaining contextual strengths, or adopt machine learning for cross-cultural predictive modeling.

5. Conclusions

This study examined Jinli Ancient Street in Chengdu, China, using a combination of field measurements and visitor surveys to investigate the interactions between natural and cultural soundscapes in a historical urban setting. The research employed kernel density analysis, correlation analysis, and SEM to explore how sound intensity and sound source recognition influence subjective soundscape experiences. The key findings are as follows:
(1) Impact of Sound Intensity: High sound intensity (L10) had the greatest influence on subjective experience, while low sound intensity (L90) had a weaker effect. This highlights the importance of managing high-intensity or peak noise sources to prevent disruptions to visitors’ overall soundscape perception.
(2) Conflict Between Natural and Cultural Soundscapes: The soundscape diversity at the boundary zone—combined with mismatched audiovisual cues and elevated crowd density—contributed to a more fragmented and less satisfying acoustic experience. Future interventions should aim to alleviate these conflicts through integrated design strategies and visitor flow management, facilitating more harmonious coexistence of natural and cultural sound elements.

5.1. Theoretical Implications

To address gaps in historic soundscape theory [13,57,63], this study makes three novel, previously absent contributions:
(1) Extending the expectation–perception framework to heritage contexts: It identifies an “expectation–audiovisual matching” mechanism, showing that soundscape perception in historical interfaces depends not only on functional expectations (e.g., “forests = quiet”) but also on cross-sensory consistency with cultural identities (e.g., Three Kingdoms heritage in Wuhou Shrine).
(2) Developing a culture-adapted perception dimension: Exploratory Factor Analysis (EFA) verifies “harmony” (alignment between sounds and historical identity) as an independent perceptual dimension (explaining 15.951% of variance), filling the gap in standard tools (e.g., Axelsson’s triad, ISO/TS 12913-2) that lack cultural compatibility assessment.
(3) Revealing spatial boundary conditions for soundscape interactions: It confirms that the “sound intensity–sound source correlation” assumed in universal models disappears when soundscapes are spatially segmented (e.g., street vs. forest), highlighting “spatial segmentation” as a critical moderator for interface soundscape research.
In sum, this study takes the interface of “historical streets–urban forests” in Wuhou Temple as a case, and for the first time quantitatively reveals the contradictory mechanism of “high soundscape diversity but low subjective satisfaction” in such spaces. It confirms that the triple interaction of “sound masking + audio-visual mismatch + transient crowding” is the core inducement of soundscape conflicts in the interface area, providing a reusable empirical analysis framework for soundscape research in similar historical–natural interleaved zones.

5.2. Limitations

This study has several limitations that should be acknowledged:
Site and temporal constraints: Findings are specific to Wuhou Shrine (unique Three Kingdoms context, compact layout) and limited to weekday mornings (April 2021), excluding seasonal/diurnal soundscape variations.
Sample and classification gaps: The 113 valid responses underrepresent groups like international tourists; human biophony (e.g., conversations) was not distinguished from non-human biophony (e.g., birdcalls), and expert-naive participant perception differences were unanalyzed.

5.3. Policy Recommendations

These research findings provide actionable insights for soundscape management and sustainable development in historical–forest transition zones. They are applicable to urban environments where historical areas coexist with natural spaces, such as urban heritage parks and urban cultural parks. Based on the findings of this study, we propose specific and implementable strategies from three levels:
First, adopt a “space-time-sound source” 3D zoning system. Spatially, divide areas into “cultural sound-dominated zones” (e.g., Jinli Ancient Street core, protect folk performances, ban electronic loudspeakers), “natural sound-dominated zones” (e.g., Wuhou Shrine relics forest, limit gatherings to ≤20 people, conversation volume <50 dBA), and “transitional zones” (e.g., interface points 1, 3, 4, 8, set ≥15 m bamboo-water buffers to attenuate sound). Temporally, schedule cultural activities (e.g., folk performances) in cultural zones 9:00–11:00 to avoid forest morning exercise peaks (7:00–8:00). Classify sound sources by prioritizing bird songs/water flows, restricting human conversations, encouraging traditional instruments, and prohibiting modern construction—integrated into local Heritage Site Soundscape Protection Regulations.
Second, optimize quiet zones with “natural sound enhancement + artificial sound shielding.” Amplify natural sounds via water/wind guiding devices near forest water bodies and mixed vegetation (ginkgoes + bamboo) to extend bird songs. Block artificial sounds with composite barriers (1.2 m soil slopes + 2 m evergreen hedges) at quiet zone boundaries.
Third, apply digital simulation for “pre-evaluation—dynamic monitoring.” Pre-plan: build a soundscape model, input pedestrian density/vegetation coverage, and simulate L10/LAeq distribution to identify conflict hotspots. Monitor in real time: deploy sensors at 20 points, and warn when L10 > 65 dBA; combine with a tourist APP’s 5-point rating for “objective + subjective” dual monitoring.
These strategies enhance visitor experiences, preserve dual values, and promote sustainable coexistence of historical and natural soundscapes.

Author Contributions

Conceptualization, X.-C.H. and J.L. (Jiang Liu); methodology, X.-C.H. and J.L. (Jingsong Lin); software, J.L. (Jingsong Lin); validation, X.-C.H.; formal analysis, J.L. (Jingsong Lin); investigation, M.Z.; resources, X.-C.H. and J.L. (Jiang Liu); data curation, X.-C.H. and J.L. (Jingsong Lin); writing—original draft preparation, J.L. (Jingsong Lin); writing—review and editing, X.-C.H., J.L. (Jiang Liu) and Y.W.; visualization, J.L. (Jingsong Lin); supervision, X.-C.H.; project administration, X.-C.H.; funding acquisition, X.-C.H. and J.L. (Jiang Liu). All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Opening Project of Key Laboratory of Southeast Coast Marine Information Intelligent Perception and Application, Ministry of Natural Resources (MNR), China (NO. 23203), the National Natural Science Foundation of China (No. 52378049 and No. 52208052) and Fujian Natural Science Foundation, China (No. 2023J05108).

Data Availability Statement

The data presented in this study are available upon request from the corresponding authors due to privacy restrictions.

Conflicts of Interest

Author Mengqiao Zhang was employed by the company Anhui Jianzhu University Design and Research Institute Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Lin, H.; Wang, J.-B.; Zhang, X.; Hu, F.; Liu, J.; Hong, X.-C. Historical sensing: The spatial pattern of soundscape occurrences recorded in poems between the Tang and the Qing Dynasties amid urbanization. Humanit. Soc. Sci. Commun. 2024, 11, 730. [Google Scholar] [CrossRef]
  2. Chen, Z.; Jin, M.; Zuo, Y.; Ding, P.; Shi, X. The Effect of Soundscape on Sense of Place for Residential Historical and Cultural Areas: A Case Study of Taiyuan, China. Buildings 2024, 14, 1394. [Google Scholar] [CrossRef]
  3. Ye, J.; Li, S.; Chen, Y.; Ma, Y.; Chen, L.; He, T.; Zheng, Y. A Study of the Effects of Historical Block Context on Soundscape Perception. Buildings 2024, 14, 621. [Google Scholar] [CrossRef]
  4. ISO 12913-1: 2014; Acoustics—Soundscape—Part 1: Definition and Conceptual Framework. ISO: Geneva, Switzerland, 2014.
  5. Zhao, Y.; Xu, S.; Huang, Z.; Fang, W.; Huang, S.; Huang, P.; Zheng, D.; Dong, J.; Chen, Z.; Yan, C.; et al. Temporal and Spatial Characteristics of Soundscape Ecology in Urban Forest Areas and Its Landscape Spatial Influencing Factors. Forests 2022, 13, 1751. [Google Scholar] [CrossRef]
  6. Yang, J.; Lu, H. Visualizing the Knowledge Domain in Urban Soundscape: A Scientometric Analysis Based on CiteSpace. Int. J. Environ. Res. Public Health 2022, 19, 13912. [Google Scholar] [CrossRef]
  7. Brown, A.L. A review of progress in soundscapes and an approach to soundscape planning. Int. J. Acoust. Vib. 2012, 17, 73–81. [Google Scholar] [CrossRef]
  8. Truax, B.; Barrett, G.W. Soundscape in a context of acoustic and landscape ecology. Landsc. Ecol. 2011, 26, 1201–1207. [Google Scholar] [CrossRef]
  9. Chu, S.; Xu, W.; Zhang, D.; Lin, J.; Liu, J.; Liu, S.; Hong, X.C. Urban Blue-Green Spaces and tranquility: A comprehensive review of noise reduction and sensory perception integration. J. Asian Archit. Build. Eng. 2025, 1–22. [Google Scholar] [CrossRef]
  10. Jo, H.I.; Jeon, J.Y. Urban soundscape categorization based on individual recognition, perception, and assessment of sound environments. Landsc. Urban Plan. 2021, 216, 104241. [Google Scholar] [CrossRef]
  11. Hong, X.-C.; Wang, G.-Y.; Liu, J.; Song, L.; Wu, E.T. Modeling the impact of soundscape drivers on perceived birdsongs in urban forests. J. Clean. Prod. 2021, 292, 125315. [Google Scholar] [CrossRef]
  12. Guo, L.-H.; Cheng, S.; Liu, J.; Wang, Y.; Cai, Y.; Hong, X.-C. Does social perception data express the spatio-temporal pattern of perceived urban noise? A case study based on 3,137 noise complaints in Fuzhou, China. Appl. Acoust. 2022, 201, 109129. [Google Scholar] [CrossRef]
  13. Hong, X.C.; Zhang, D.Y.; Hu, F.B.; Guo, L.H.; Liu, J.; Guo, H. Does urban green space form influence the spatial pattern of noise complaints? Sustain. Cities Soc. 2025, 130, 106506. [Google Scholar] [CrossRef]
  14. Chu, S.; Wang, Y.; Liu, J.; Hong, X.C. Pandemic-induced shifts in urban noise perception: A spatial-temporal analysis in a Chinese megacity. J. Environ. Manag. 2025, 391, 126584. [Google Scholar] [CrossRef] [PubMed]
  15. Yu, L.; Kang, J. Factors influencing the sound preference in urban open spaces. Appl. Acoust. 2010, 71, 622–633. [Google Scholar] [CrossRef]
  16. Chen, Y.; Liu, C.; Chen, X.; Huang, L. Study on noise reduction effect of roadside trees along main roads in downtown of Fuzhou City. J. Fujian Norm. Univ. Nat. Sci. Ed. 2014, 30, 90–98. [Google Scholar]
  17. Aletta, F.; Kang, J.; Axelsson, Ö. Soundscape descriptors and a conceptual framework for developing predictive soundscape models. Landsc. Urban Plan. 2016, 149, 65–74. [Google Scholar] [CrossRef]
  18. Guo, X.; Liu, J.; Albert, C.; Hong, X.-C. Audio-visual interaction and visitor characteristics affect perceived soundscape restorativeness: Case study in five parks in China. Urban For. Urban Green. 2022, 77, 127738. [Google Scholar] [CrossRef]
  19. Chen, Z.; Hermes, J.; Liu, J.; von Haaren, C. How to integrate the soundscape resource into landscape planning? A perspective from ecosystem services. Ecol. Indic. 2022, 141, 109156. [Google Scholar] [CrossRef]
  20. Aletta, F.; Oberman, T.; Mitchell, A.; Erfanian, M.; Kang, J. Soundscape experience of public spaces in different world regions: A comparison between the European and Chinese contexts via a large-scale on-site survey. J. Acoust. Soc. Am. 2023, 154, 1710–1734. [Google Scholar] [CrossRef]
  21. Chan, T.-C.; Wu, B.-S.; Lee, Y.-T.; Lee, P.-H.; Deng, R.-H. Exploring relationships among soundscape perception, spatiotemporal sound characteristics, and personal traits through social media. Noise Mapp. 2023, 10, 20220174. [Google Scholar] [CrossRef]
  22. Ulmer, J.M.; Wolf, K.L.; Backman, D.R.; Tretheway, R.L.; Blain, C.J.; O’neil-Dunne, J.P.; Frank, L.D. Multiple health benefits of urban tree canopy: The mounting evidence for a green prescription. Health Place 2016, 42, 54–62. [Google Scholar] [CrossRef]
  23. Liu, H.; Li, F.; Li, J.; Zhang, Y. The relationships between urban parks, residents’ physical activity, and mental health benefits: A case study from Beijing, China. J. Environ. Manag. 2017, 190, 223–230. [Google Scholar] [CrossRef]
  24. Bielinis, E.; Takayama, N.; Boiko, S.; Omelan, A.; Bielinis, L. The effect of winter forest bathing on psychological relaxation of young Polish adults. Urban For. Urban Green. 2018, 29, 276–283. [Google Scholar] [CrossRef]
  25. Cobreros, C.; Medoza-Ruvalcaba, N.; Flores-García, M.; Roggema, R. Improving Psychological Well-Being in Urban University Districts through Biophilic Design: Two Cases in Mexico. Sustainability 2023, 15, 5703. [Google Scholar] [CrossRef]
  26. Kaymaz, I.; Cüre, C.T.; Baki, E. Perceived Soundscape of Urban Historical Places: A Case Study of Hamamönü, Ankara. Procedia Eng. 2016, 161, 1920–1925. [Google Scholar] [CrossRef]
  27. Leus, M.; Herssens, J. The Soundscapes of Antwerp: A Study on the Acoustic Genius Loci. Energy Procedia 2015, 78, 25–30. [Google Scholar] [CrossRef]
  28. Kang, J.; Aletta, F.; Gjestland, T.T.; Brown, L.A.; Botteldooren, D.; Schulte-Fortkamp, B.; Lercher, P.; van Kamp, I.; Genuit, K.; Fiebig, A.; et al. Ten questions on the soundscapes of the built environment. Build. Environ. 2016, 108, 284–294. [Google Scholar] [CrossRef]
  29. Jia, Y.; Ma, H.; Kang, J. Characteristics and evaluation of urban soundscapes worthy of preservation. J. Environ. Manag. 2020, 253, 109722. [Google Scholar] [CrossRef]
  30. Hong, X.-C.; Liu, J.; Wang, G.-Y. Soundscape in Urban Forests. Forests 2022, 13, 2056. [Google Scholar] [CrossRef]
  31. Ratcliffe, E. Sound and Soundscape in Restorative Natural Environments: A Narrative Literature Review. Front. Psychol. 2021, 12, 570563. [Google Scholar] [CrossRef]
  32. Ratcliffe, E.; Gatersleben, B.; Sowden, P.T. Bird sounds and their contributions to perceived attention restoration and stress recovery. J. Environ. Psychol. 2013, 36, 221–228. [Google Scholar] [CrossRef]
  33. Alvarsson, J.J.; Wiens, S.; Nilsson, M.E. Stress Recovery during Exposure to Nature Sound and Environmental Noise. Int. J. Environ. Res. Public Health 2010, 7, 1036–1046. [Google Scholar] [CrossRef]
  34. Van Renterghem, T.; Vanhecke, K.; Filipan, K.; Sun, K.; De Pessemier, T.; De Coensel, B.; Joseph, W.; Botteldooren, D. Interactive soundscape augmentation by natural sounds in a noise polluted urban park. Landsc. Urban Plan. 2020, 194, 103705. [Google Scholar] [CrossRef]
  35. Liu, F.; Liu, P.; Kang, J.; Meng, Q.; Wu, Y.; Yang, D. Relationships between landscape characteristics and the restorative quality of soundscapes in urban blue spaces. Appl. Acoust. 2022, 189, 108600. [Google Scholar] [CrossRef]
  36. Hong, J.Y.; Lam, B.; Ong, Z.-T.; Ooi, K.; Gan, W.-S.; Kang, J.; Yeong, S.; Lee, I.; Tan, S.-T. A mixed-reality approach to soundscape assessment of outdoor urban environments augmented with natural sounds. Build. Environ. 2021, 194, 107688. [Google Scholar] [CrossRef]
  37. Jo, H.; Song, C.; Ikei, H.; Enomoto, S.; Kobayashi, H.; Miyazaki, Y. Physiological and Psychological Effects of Forest and Urban Sounds Using High-Resolution Sound Sources. Int. J. Environ. Res. Public Health 2019, 16, 2649. [Google Scholar] [CrossRef] [PubMed]
  38. Jo, H.I.; Jeon, J.Y. The influence of human behavioral characteristics on soundscape perception in urban parks: Subjective and observational approaches. Landsc. Urban Plan. 2020, 203, 103890. [Google Scholar] [CrossRef]
  39. Steele, D.; Fraisse, V.; Bild, E.; Guastavino, C. Bringing music to the park: The effect of Musikiosk on the quality of public experience. Appl. Acoust. 2021, 177, 107910. [Google Scholar] [CrossRef]
  40. Parker, M.; Spennemann, D.H. Conceptualising sound making and sound loss in the urban heritage environment. Int. J. Urban Sustain. Dev. 2022, 14, 264–286. [Google Scholar] [CrossRef]
  41. Parker, M.; Spennemann, D.H.R. Classifying Sound: A Tool to Enrich Intangible Heritage Management. Acoust. Aust. 2022, 50, 23–39. [Google Scholar] [CrossRef]
  42. Zhang, D.; Zhang, M.; Liu, D.; Kang, J. Soundscape evaluation in Han Chinese Buddhist temples. Appl. Acoust. 2016, 111, 188–197. [Google Scholar] [CrossRef]
  43. Kim, M.-S. A Semantic Study on the Soundscape of the Historic Downtown of Daejeon—Focusing on the Bells of Daeheung-dong Cathedral and Enhang-dong Sungsimdang. J. Korean Inst. Landsc. Arch. 2022, 50, 64–75. [Google Scholar] [CrossRef]
  44. Pérez-Martínez, G.; Torija, A.J.; Ruiz, D.P. Soundscape assessment of a monumental place: A methodology based on the perception of dominant sounds. Landsc. Urban Plan. 2018, 169, 12–21. [Google Scholar] [CrossRef]
  45. Bartalucci, C.; Luzzi, S. The soundscape in cultural heritage. IOP Conf. Ser. Mater. Sci. Eng. 2020, 949, 012050. [Google Scholar] [CrossRef]
  46. Đorđević, Z.; Novković, D.; Dragišić, M. Intangible Mosaic of Sacred Soundscapes in Medieval Serbia. Acoustics 2023, 5, 28–45. [Google Scholar] [CrossRef]
  47. Jo, A.H.; Park, C.-J.; Haan, C.-H. Survey on the soundscape of urban districts in Cheong-ju using soundwalking. J. Acoust. Soc. Korea 2021, 40, 593–606. [Google Scholar] [CrossRef]
  48. Liu, J.; Yang, L.; Xiong, Y.; Yang, Y. Effects of soundscape perception on visiting experience in a renovated historical block. Build. Environ. 2019, 165, 106375. [Google Scholar] [CrossRef]
  49. Bruce, N.S.; Davies, W.J. The effects of expectation on the perception of soundscapes. Appl. Acoust. 2014, 85, 1–11. [Google Scholar] [CrossRef]
  50. Liu, A.; Wang, X.L.; Liu, F.; Yao, C.; Deng, Z. Soundscape and its influence on tourist satisfaction. Serv. Ind. J. 2017, 38, 164–181. [Google Scholar] [CrossRef]
  51. Chen, W.; Liu, J. Research on key acoustic characteristics of soundscapes of the classical Chinese gardens. Sci. Rep. 2023, 13, 12642. [Google Scholar] [CrossRef] [PubMed]
  52. Lorenzi, C.; Apoux, F.; Grinfeder, E.; Krause, B.; Miller-Viacava, N.; Sueur, J. Human Auditory Ecology: Extending Hearing Research to the Perception of Natural Soundscapes by Humans in Rapidly Changing Environments. Trends Hear. 2023, 27, 23312165231212032. [Google Scholar] [CrossRef] [PubMed]
  53. Purves, R.S.; Wartmann, F.M. Characterising and mapping potential and experienced tranquillity: From a state of mind to a cultural ecosystem service. Geogr. Compass 2023, 17, e12726. [Google Scholar] [CrossRef]
  54. Kaczynski, A.T.; Potwarka, L.R.; Saelens, B.E. Association of Park Size, Distance, and Features With Physical Activity in Neighborhood Parks. Am. J. Public Health 2007, 98, 1451–1456. [Google Scholar] [CrossRef]
  55. Guo, Y.; Wang, K.; Zhang, H.; Jiang, Z. Soundscape Perception Preference in an Urban Forest Park: Evidence from Moon Island Forest Park in Lu’an City. Sustainability 2022, 14, 16132. [Google Scholar] [CrossRef]
  56. Liu, H.; Li, F.; Xu, L.; Han, B. The impact of socio-demographic, environmental, and individual factors on urban park visitation in Beijing, China. J. Clean. Prod. 2017, 163, S181–S188. [Google Scholar] [CrossRef]
  57. Sun, L. Soundscape as a Tool for Place-Making in Industrial Heritage Sites. In Proceedings of the 2023 Immersive and 3D Audio: From Architecture to Automotive (I3DA), Bologna, Italy, 5–7 September 2023; pp. 1–8. [Google Scholar]
  58. Schreuder, E.; van Erp, J.; Toet, A.; Kallen, V.L. Emotional Responses to Multisensory Environmental Stimuli: A Conceptual Framework and Literature Review: A Conceptual Framework and Literature Review. SAGE Open 2016, 6, 2158244016630591. [Google Scholar] [CrossRef]
  59. Levenhagen, M.J.; Miller, Z.D.; Petrelli, A.R.; Ferguson, L.A.; Shr, Y.-H.; Taff, B.D.; Fristrup, K.M.; McClure, C.J.; Burson, S.; Giamellaro, M.; et al. Does experimentally quieting traffic noise benefit people and birds? Ecol. Soc. 2021, 26, 32. [Google Scholar] [CrossRef]
  60. Van Renterghem, T.; Botteldooren, D. View on outdoor vegetation reduces noise annoyance for dwellers near busy roads. Landsc. Urban Plan. 2016, 148, 203–215. [Google Scholar] [CrossRef]
  61. Nilsson, M.E.; Berglund, B. Soundscape quality in suburban green areas and city parks. Acta Acust. United Acust. 2006, 92, 903–911. [Google Scholar]
  62. Ferrari, R.; Rupf, R.; Reutz, B. Alpine soundscapes: Sounds and their consequences for perceived recreational quality—A case study of two Regional Nature Parks—Beverin Nature Park and Parc Ela in Switzerland. J. Prot. Mt. Areas Res. Manag. 2023, 15, 4–12. [Google Scholar]
  63. Ulrich, R.S.; Simons, R.F.; Losito, B.D.; Fiorito, E.; Miles, M.A.; Zelson, M. Stress recovery during exposure to natural and urban environments. J. Environ. Psychol. 1991, 11, 201–230. [Google Scholar] [CrossRef]
  64. Kaplan, S. The restorative benefits of nature: Toward an integrative framework. J. Environ. Psychol. 1995, 15, 169–182. [Google Scholar] [CrossRef]
  65. Meng, Q.; Kang, J. The influence of crowd density on the sound environment of commercial pedestrian streets. Sci. Total. Environ. 2015, 511, 249–258. [Google Scholar] [CrossRef]
  66. Liu, J.; Kang, J.; Behm, H.; Luo, T. Effects of landscape on soundscape perception: Soundwalks in city parks. Landsc. Urban Plan. 2014, 123, 30–40. [Google Scholar] [CrossRef]
  67. Francis, C.D.; Phillips, J.N.; Barber, J.R. Background Acoustics in Terrestrial Ecology. Annu. Rev. Ecol. Evol. Syst. 2023, 54, 351–373. [Google Scholar] [CrossRef]
  68. Kohrman, D.C.; Wan, G.; Cassinotti, L.; Corfas, G. Hidden Hearing Loss: A Disorder with Multiple Etiologies and Mechanisms. Cold Spring Harb. Perspect. Med. 2020, 10, a035493. [Google Scholar] [CrossRef] [PubMed]
  69. Schreckenberg, D.; Griefahn, B.; Meis, M. The associations between noise sensitivity, reported physical and mental health, perceived environmental quality, and noise annoyance. Noise Health 2010, 12, 7–16. [Google Scholar] [CrossRef]
  70. Aletta, F.; Kang, J.; Fuda, S.; Astolfi, A. The effect of walking sounds from different walked-on materials on the soundscape of urban parks. J. Environ. Eng. Landsc. Manag. 2016, 24, 165–175. [Google Scholar] [CrossRef]
  71. Axelsson, Ö.; Nilsson, M.E.; Berglund, B. A principal components model of soundscape perception. J. Acoust. Soc. Am. 2010, 128, 2836–2846. [Google Scholar] [CrossRef]
  72. Oberman, T.; Latini, A.; Aletta, F.; Gozzi, G.; Kang, J.; Torresin, S. Human sounds and associated tonality disrupting perceived soundscapes in protected natural areas. Sci. Rep. 2025, 15, 28759. [Google Scholar] [CrossRef]
  73. Andringa, T.C.; Lanser, J.J.L. How Pleasant Sounds Promote and Annoying Sounds Impede Health: A Cognitive Approach. Int. J. Environ. Res. Public Health 2013, 10, 1439–1461. [Google Scholar] [CrossRef] [PubMed]
  74. Van Kamp, I.; Van Kempen, E.; Ameling, C.; Swart, W.; Kruize, H. Perceived soundscapes, human restoration and related health in green urban areas. In Proceedings of the 22nd International Congress on Acoustics, Buenos Aires, Argentina, 5–9 September 2016. [Google Scholar]
  75. ISO/TS 12913–2:2018; Acoustics—Soundscape—Part 2: Data Collection and Reporting Requirements. ISO: Geneva, Switzerland, 2018.
  76. Durbridge, S.; Murphy, D.T. Assessment of soundscapes using self-report and physiological measures. Acta Acust. 2023, 7, 6. [Google Scholar] [CrossRef]
  77. Mitchell, A.; Oberman, T.; Aletta, F.; Erfanian, M.; Kachlicka, M.; Lionello, M.; Kang, J. The Soundscape Indices (SSID) Protocol: A Method for Urban Soundscape Surveys—Questionnaires with Acoustical and Contextual Information. Appl. Sci. 2020, 10, 2397. [Google Scholar] [CrossRef]
Figure 1. Research Technology Roadmap.
Figure 1. Research Technology Roadmap.
Buildings 15 04103 g001
Figure 2. Location of Case Study Area in Chengdu, Sichuan Province, and Detailed Plan of Jinli Ancient Street with Survey Point Distribution.
Figure 2. Location of Case Study Area in Chengdu, Sichuan Province, and Detailed Plan of Jinli Ancient Street with Survey Point Distribution.
Buildings 15 04103 g002
Figure 3. Photos of each survey point (taken by the authors).
Figure 3. Photos of each survey point (taken by the authors).
Buildings 15 04103 g003
Figure 4. Initial SEM: Relationships Among Sound Intensity, Sound Source Recognition, and Subjective Perception (Historic Street–Urban Forest Interface).
Figure 4. Initial SEM: Relationships Among Sound Intensity, Sound Source Recognition, and Subjective Perception (Historic Street–Urban Forest Interface).
Buildings 15 04103 g004
Figure 5. Kernel Density Maps of Sound Intensity (L10/L50/L90/LAeq), Sound Source Recognition (Biophony/Anthropophony/Geophony), and Subjective Perception (Pleasure/Comfort/Quietness/Harmony) in the Wuhou Shrine Museum.
Figure 5. Kernel Density Maps of Sound Intensity (L10/L50/L90/LAeq), Sound Source Recognition (Biophony/Anthropophony/Geophony), and Subjective Perception (Pleasure/Comfort/Quietness/Harmony) in the Wuhou Shrine Museum.
Buildings 15 04103 g005
Figure 6. Revised SEM: Path Relationships and Explained Variance Among Sound Intensity, Sound Source Recognition, and Subjective Perception.
Figure 6. Revised SEM: Path Relationships and Explained Variance Among Sound Intensity, Sound Source Recognition, and Subjective Perception.
Buildings 15 04103 g006
Table 1. Coefficient of Variation (CV) of sound pressure level data.
Table 1. Coefficient of Variation (CV) of sound pressure level data.
Survey PointCV (LAEQ)CV (L10)CV (L50)CV (L90)
172.44%150.44%98.91%129.69%
272.19%15.72%51.66%17.56%
3625.88%615.62%369.98%116.00%
4295.89%416.44%199.39%31.07%
5313.68%383.47%145.47%78.93%
6296.90%347.84%140.22%49.77%
71091.87%361.54%203.22%227.53%
8312.80%579.90%209.60%91.55%
9381.37%59.03%410.19%232.47%
10220.60%470.37%115.80%108.19%
11112.20%466.86%103.72%10.81%
12288.90%395.40%156.10%144.52%
13115.65%57.23%45.07%18.80%
14131.93%71.03%261.01%85.10%
15172.89%172.55%142.38%120.89%
16352.60%28.99%526.79%41.49%
17137.22%151.50%600.47%190.92%
18131.51%317.87%236.77%735.20%
19463.38%184.71%118.29%152.34%
20228.52%340.23%336.50%213.59%
Table 2. Spearman’s rho correlation analysis of Sound Intensity, Sound source recognition, and Subjective Perceptions.
Table 2. Spearman’s rho correlation analysis of Sound Intensity, Sound source recognition, and Subjective Perceptions.
LAeq (1)L10 (2)L50 (3)L90 (4)L10−90 (5)Geophony (6)Biophony (7)Anthropophony (8)pleasant Level (9)Comfort Level (10)Quietness (11)Harmony (12)
11.000
20.908 **1.000
30.678 **0.747 **1.000
40.592 **0.532 *0.887 **1.000
50.672 **0.795 **0.3460.0511.000
6−0.226−0.214−0.055−0.108−0.0771.000
70.2980.1500.0720.101−0.074−0.1351.000
8−0.086−0.149−0.123−0.060−0.280−0.3280.3911.000
90.1030.3690.195−0.0940.3650.351−0.236−0.3311.000
100.2800.509 *0.495 *0.1960.505 *0.351−0.309−0.3710.803 **1.000
110.1350.3880.3480.0850.446 *0.299−0.496 *−0.4120.777 **0.909 **1.000
12−0.143−0.1070.048−0.039−0.1850.473 *−0.218−0.1460.3840.469 *0.507 *1.000
Yellow background, * p < 0.05, Pink background, ** p < 0.01.
Table 3. EFA of SEM.
Table 3. EFA of SEM.
Variable NameComponentExplained Variance/%
1234
1individual information-age 0.76123.148
2individual information-educational information −0.808
3individual information-place of origin −0.79
4individual information-occupation 0.698
5individual information-visiting frequency 0.824
6individual information-monthly income 0.603
7Sound intensity-LAeq0.884 22.672
8Sound intensity-L100.913
9Sound intensity-L500.937
10Sound intensity-L900.886
11subjective perception-pleasant level 0.886 15.951
12subjective perception-comfort Level 0.878
13subjective perception-quietness 0.799
14sound source recognition-biophony 0.763 11.756
15sound source recognition-anthropophony 0.803
Total Explained variance/%73.527
Rotation method: Varimax.
Table 4. Reliability and Validity Analysis of the SEM Measurement Model.
Table 4. Reliability and Validity Analysis of the SEM Measurement Model.
Latent VariableObserved VariableSTD. ESTIMATEC.RAVECronbach’s α
Sound intensityLAeq0.7210.9210.7470.921
L100.799
L501
L900.912
Sound source recognitionBiophony10.8580.7570.835
Anthropophony0.716
Subjective perceptionPleasant level0.8820.9450.8520.886
Comfort level0.995
Quietness0.889
Individual informationAge0.5310.7510.5020.721
Educational information−0.691
Place of origin−0.62
Occupation0.928
Visiting frequency0.531
Monthly income0.476
Table 5. Theoretical Basis for SEM Path Settings.
Table 5. Theoretical Basis for SEM Path Settings.
PathCore Theoretical BasisLiterature Support
Sound intensity --> Subjective experience. (Ha)Sound intensity, as a physical stimulus, directly affects subjective arousal and emotional experience.[6,18]
Sound source --> Subjective experience. (Hb)Sound source types influence subjective experience through “emotional association”.[11,15]
Individual information --> Subjective experience. (Hc)Individual characteristics regulate subjective perception through “soundscape familiarity” and “cultural background”.[10,18,20,24]
Sound intensity <--> Sound source (Hab)There is a “dynamic synergistic effect” between sound intensity and sound sources; changes in the intensity of different types of sound sources directly affect subjective perception.[11,20]
L10 <--> L90Both belong to the sound intensity index system.[4,17]
L50 <--> L90Both belong to the sound intensity index system.[4,17,53]
L50 <--> Subjective experienceMedian sound intensity can enhance perceived controllability and directly affect comfort.[3,54]
L90 <--> Subjective experienceBackground sounds can reduce auditory fatigue and directly affect the evaluation of “sense of quietness”.[29,55]
Visiting frequency <--> Place of originLocal residents visit more frequently due to geographical accessibility.[50]
Educational information <--> Place of originChengdu has better educational resources, and the proportion of highly educated local residents may be higher.[56]
Table 6. SEM Model Fit Indices (Before and After Modification).
Table 6. SEM Model Fit Indices (Before and After Modification).
ParametersX2/DFGFICFIRMSEA
Criteria<3>0.9>0.9<0.08
Data values (before)3.290.6310.8080.181
Data values (after)1.620.90.9210.075
Table 7. Subjective evaluation differences between street and forest.
Table 7. Subjective evaluation differences between street and forest.
Pleasant LevelComfort LevelQuietnessHarmonyTotal (Average)
Street3.493.432.413.533.21
Forest3.723.863.433.663.67
Table 8. SEM Path Regression and Covariance Coefficients.
Table 8. SEM Path Regression and Covariance Coefficients.
PathEstimateS.E.C.R.pStandardized Estimate
Sound perception--->Individual information−0.0130.015−2.826***−0.195
Sound intensity--->Subjective perception0.0330.0162.121***0.437
Sound perception--->Subjective perception−0.0760.036−2.113***−0.574
Individual information--->Subjective perception−0.7270.378−2.222**−0.36
Sound intensity--->LAeq1 0.922
Sound intensity--->L101.3820.1678.274***1.017
Sound intensity--->L500.7450.1514.939***0.783
Sound intensity--->L900.570.153.8***0.695
Subjective perception--->Pleasant level1 0.899
Subjective perception--->Comfort level1.2330.1657.474***0.99
Subjective perception--->quietness2.0230.345.95***0.908
Sound perception--->biophony1 1.047
Sound perception--->anthropophony0.6430.272.38***0.685
Individual information--->age1 0.635
Individual information--->occupation1.2760.5882.17***0.842
Individual information--->Visiting frequency0.9290.5091.826***0.534
Individual information--->Educational information−0.2470.2−1.235***−0.32
Individual information--->Place of origin0.1020.240.425***0.121
Individual information--->Monthly income0.7390.3981.856***0.482
Sound perception<-->Sound intensity2.1476.9351.1650.269
** p < 0.01, *** p < 0.001.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lin, J.; Zhang, M.; Wang, Y.; Hong, X.-C.; Liu, J. Factors Influencing Natural and Cultural Soundscape Interactions on Perceptual Experiences in Forested–Historical Interface Areas. Buildings 2025, 15, 4103. https://doi.org/10.3390/buildings15224103

AMA Style

Lin J, Zhang M, Wang Y, Hong X-C, Liu J. Factors Influencing Natural and Cultural Soundscape Interactions on Perceptual Experiences in Forested–Historical Interface Areas. Buildings. 2025; 15(22):4103. https://doi.org/10.3390/buildings15224103

Chicago/Turabian Style

Lin, Jingsong, Mengqiao Zhang, Yiyang Wang, Xin-Chen Hong, and Jiang Liu. 2025. "Factors Influencing Natural and Cultural Soundscape Interactions on Perceptual Experiences in Forested–Historical Interface Areas" Buildings 15, no. 22: 4103. https://doi.org/10.3390/buildings15224103

APA Style

Lin, J., Zhang, M., Wang, Y., Hong, X.-C., & Liu, J. (2025). Factors Influencing Natural and Cultural Soundscape Interactions on Perceptual Experiences in Forested–Historical Interface Areas. Buildings, 15(22), 4103. https://doi.org/10.3390/buildings15224103

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop