Predicting Bicycle-Lane Traffic Noise from Urban Street Morphology Using Interpretable Machine Learning Models

Wu, Hupeng; Wen, Qiang; Li, Xinxin; Kang, Jian

doi:10.3390/buildings16102023

Open AccessArticle

Predicting Bicycle-Lane Traffic Noise from Urban Street Morphology Using Interpretable Machine Learning Models

¹

School of Civil Engineering and Architecture, Zhejiang Sci-Tech University, Hangzhou 310018, China

²

Hubei Engineering and Technology Research Center of Urbanization, School of Architecture & Urban Planning, Huazhong University of Science and Technology, Wuhan 430074, China

³

Institute for Environmental Design and Engineering, The Bartlett, University College London, London WC1H ONN, UK

^*

Author to whom correspondence should be addressed.

Buildings 2026, 16(10), 2023; https://doi.org/10.3390/buildings16102023

Submission received: 27 March 2026 / Revised: 13 May 2026 / Accepted: 18 May 2026 / Published: 20 May 2026

(This article belongs to the Section Building Energy, Physics, Environment, and Systems)

Download

Browse Figures

Versions Notes

Abstract

Road traffic noise in urban streets is shaped not only by traffic sources but also by sound propagation through the surrounding street geometry. Existing prediction methods are still largely source-oriented, and receptor-specific models that rely on street morphology alone remain uncommon. We developed and compared interpretable machine-learning models to predict a cyclist-side sound pressure level (SPL) under fixed source conditions, using 12 spatial parameters extracted from 5060 street sections on 195 streets in Harbin, China. Acoustic simulations were performed in ODEON under fixed source-power conditions, and four models—Linear Regression, support vector regression (SVR), extreme gradient boosting (XGBoost), and Random Forest (RF)—were evaluated through an illustrative 80/20 split, 20 repeated random 80/20 splits, and 20 road-name-based grouped holdout repetitions. The nonlinear models consistently outperformed the linear baseline. Under grouped holdout validation, XGBoost achieved the highest predictive accuracy (R² = 0.953 ± 0.018, RMSE = 0.583 ± 0.119 dB, MAE = 0.418 ± 0.082 dB). RF reached comparable accuracy (R² = 0.938 ± 0.041, RMSE = 0.662 ± 0.210 dB, MAE = 0.453 ± 0.128 dB) and was retained for the interpretation of feature importance and marginal response patterns. A computation-time comparison based on 93 representative ODEON simulations showed that ODEON required a median of 2 min 33 s per street section, whereas the trained models predicted all 5060 sections in 0.013 s with XGBoost and 0.143 s with RF. The RF-based interpretation identified vehicle-lane width, sidewalk width, and near-zone cross-sectional enclosure degree as the most influential variables. Width-related parameters dominated cyclist-side SPL prediction, while enclosure-related parameters became more relevant mainly under narrower width conditions. The framework is therefore intended as a comparative morphology-screening tool under fixed source conditions, not as a predictor of real-world traffic noise under varying traffic states.

Keywords:

urban street; traffic noise; bicycle lane; street morphology; machine learning; XGBoost

Graphical Abstract

1. Introduction

Road traffic noise is a persistent environmental burden in urban streets, affecting both adjacent buildings and outdoor activity spaces [1,2]. Beyond traffic flow and vehicle composition, the receptor-side acoustic environment is also shaped by sound propagation through the surrounding urban geometry. Reviews and recent studies show that street width, façade configuration, enclosure characteristics, and related spatial-form variables can substantially influence reflection, attenuation, and local exposure patterns [3,4,5]. Receptor locations close to the traffic lanes, including those in bicycle lanes, are therefore important near-road exposure positions for street design [6,7,8,9]. Although the effects of individual morphology factors have been discussed repeatedly, an integrated and interpretable framework for predicting noise from street spatial parameters alone is still rare, particularly for receptors close to the carriageway [1,4,5]. We address this gap by developing and comparing interpretable machine-learning models for a cyclist-side simulated SPL under fixed source conditions and by using the RF model to examine the relative importance, marginal effects, and interactions of key spatial variables.

Existing studies on road-traffic-noise prediction can be grouped into two strands [10,11]. The first comprises empirical and engineering-based frameworks, including nationally or regionally standardized models such as FHWA and CNOSSOS-EU. These frameworks remain essential for regulatory assessment but are mainly driven by source-related descriptors such as traffic volume, vehicle speed, and fleet composition [10]. The second comprises data-driven approaches, where artificial-intelligence and machine-learning applications have grown rapidly [11]. ANN and ensemble-learning models, in particular, can improve predictive performance when sufficient traffic and contextual data are available [12,13,14,15,16,17]. Most of these models, however, are still predominantly source-oriented, while urban-form studies tend to examine a limited set of morphology indicators or focus on block-scale noise distribution rather than receptor-specific exposure in street canyons [4,5,18,19]. Three questions therefore remain open: how far urban street spatial parameters alone can explain the near-road SPL under fixed source conditions, which parameters contribute most, and whether their effects are nonlinear or interactive.

Morphology-oriented research has shown that urban form can reshape traffic-noise distribution across multiple spatial scales. Street-canyon studies have demonstrated that façade configuration and canyon geometry substantially alter roadside exposure patterns [20], and Lee and Kang highlighted the role of the height-to-width ratio in sound propagation along urban streets [18]. Subsequent work extended this perspective from individual canyons to broader urban morphologies [21,22,23,24,25]. Forssén et al. showed that different urban morphologies can modify façade, sidewalk, and shielded-yard exposure through both direct and indirect propagation paths [26]. At the city and block scales, Montenegro et al. reported that urban features support street classification in road-traffic-noise estimation [1], Zhou et al. identified building layout, road organization, and land-use factors as important determinants of traffic-noise distribution in high-density cities [4], and Yang et al. proposed a spatial-form-based prediction model for residential blocks [5]. Explainable machine-learning research further suggests that built-environment and road-related predictors often act on traffic noise in non-linear ways [15]. Even so, most existing evidence concerns area-wide noise mapping, block-scale exposure, or a relatively small set of geometric descriptors; interpretable, receptor-specific prediction frameworks for near-road street sections remain comparatively rare.

Against this background, the present study addresses three questions: (1) Can urban street spatial parameters alone achieve high predictive performance for sound pressure level at bicycle-lane receptor locations under controlled source conditions? (2) Which spatial parameters contribute most to predictive performance? (3) Do these parameters show threshold-like marginal effects or interaction effects?

To answer these questions, 5060 simulation cases derived from real street morphologies were used to train and evaluate four models: Linear Regression, SVR, XGBoost, and RF. Linear Regression provides a transparent linear baseline; SVR and XGBoost serve as nonlinear benchmarks, included to test whether RF performed favorably only relative to a weak linear comparator. The predictive comparison thus rests on all four models. XGBoost achieved the highest predictive accuracy among the tested models. RF reached comparable accuracy and offers a more direct post hoc interpretation framework through permutation importance and partial dependence analysis, and was therefore used for the interpretation of morphology–SPL relationships. Deep-learning models have shown strong capability in adjacent engineering prediction tasks, such as structural damage identification under varying temperatures, bridge-response prediction, missing measurement-data recovery, and structural-response reconstruction [27,28,29,30]. The present study, however, uses structured tabular morphology variables rather than image sequences or temporal sensor data, so SVR, XGBoost, and RF were considered more suitable for balancing nonlinear predictive capacity, computational efficiency, and post hoc interpretability in this morphology-based screening task.

Compared with our earlier work on the morphological determinants of street sound propagation [19], this study advances the analysis on four points: (i) the prediction target shifts from general sound-propagation characteristics to the SPL at bicycle-lane receptor locations, focusing on a specific near-road exposure setting; (ii) the analysis is reformulated as a design-oriented surrogate model for rapid comparative assessment, with post-training prediction efficiency explicitly benchmarked against ODEON simulation time; (iii) linear and nonlinear machine-learning models—Linear Regression, SVR, XGBoost, and RF—are compared under both repeated random-split evaluation and road-name-based grouped holdout validation, while the RF model is reserved for interpretable analysis of variable importance, marginal response patterns, and interaction tendencies; and (iv) the outputs are extended from a list of influential parameters to importance rankings, threshold-like marginal responses, and interaction patterns among key street-form variables. The work is thus a receptor-specific and design-support-oriented extension of the earlier analysis rather than a repetition of it.

2. Materials and Methods

2.1. Overall Workflow

This study builds on our earlier morphology-oriented street-acoustics work [19] and reformulates it as a receptor-specific prediction task. The previously established street-parameter framework was retained, while the prediction target, receptor setting, and modeling approach were redefined for cyclist-side SPL estimation under controlled simulation conditions.

The workflow comprised eight steps. (1) A total of 5060 street sections from 195 streets in Harbin were sampled and their spatial parameters were extracted. (2) A standardized street-acoustic simulation model was set up in ODEON 14.00 Combined for each section under fixed source conditions. (3) The simulated SPL at the bicycle-lane receptor location was used as the prediction target. (4) Four models—Linear Regression, SVR, XGBoost, and RF—were developed and compared with the same input variables and the same train–test partitions. (5) Inter-variable dependence among the 12 morphological predictors was diagnosed using Pearson correlation coefficients and variance inflation factors. (6) A reduced-variable RF sensitivity analysis examined whether comparable performance could be reached with the dominant width-related variables alone or with a smaller subset of morphology descriptors. (7) RF was used for the subsequent interpretation of feature importance and partial dependence patterns, for the reasons given in Section 2.5. (8) A computation-time comparison evaluated the post-training prediction efficiency of the machine-learning models relative to ODEON 14.00 Combined simulation.

2.2. Street Samples

The dataset contained 5060 street sections from 195 streets in Harbin, China—2652 arterial-street sections, 1058 secondary-trunk-street sections, and 1350 branch-street sections. Each section was 200 m long. Urban expressways and elevated roads were excluded because their cross-sectional configurations differ substantially from the receptor-oriented street setting adopted here. Harbin was selected for the heterogeneity of its urban fabric, which contains a wide range of traditional and modern street forms suitable for model development. It is treated here as a morphologically diverse case city rather than as a representative proxy for all Chinese cities. The street database follows the same city context and sampling logic as our earlier morphology-oriented street-acoustics study, where the representativeness of Harbin’s street system and the rationale for stratified sampling are described in greater detail [19].

2.3. Spatial Parameters

The 12 input variables were adapted from a previously developed street-morphology parameter system for urban sound-propagation analysis [19]. They were retained because they capture three aspects of street form directly relevant to near-road SPL: width configuration, façade proximity, and enclosure characteristics. Specifically, W_vehicle and W_side describe the effective source-to-receptor and façade-reflection distances; B_p indicates whether a façade lies directly in front of the receptor; and H, C_s, and C_p describe zonal façade height, cross-sectional enclosure, and plan enclosure, respectively. The detailed derivation and geometric interpretation of these parameters are reported in the previous study [19]; only the definitions needed for the present model are summarized here.

Street width—the distance between the façades on both sides of a street—has been widely used in earlier studies of urban-street sound propagation. In practice, however, streets with the same total width may differ substantially in vehicle-lane and sidewalk widths, leading to different source-to-receptor and façade-reflection distances. Total street width was therefore not used directly. Instead, two width-related variables, vehicle-lane width (W_vehicle) and sidewalk width (W_side), were used to characterize the width configuration. Figure 1 illustrates the street configuration, receptor location, zonal division, and spatial parameters used in the model.

The immediate presence of a façade in front of the receptor may also influence the local SPL. Hall et al. reported that a building façade can raise the sound pressure level at a point 2 m in front of it by a mean of 3.2 ± 0.2 dB (95% CI), except at low frequencies [31]. A binary variable, B_p, was therefore introduced to indicate whether a building façade exists directly in front of the receptor point: B_p = 1 indicates presence and B_p = 0 indicates absence. This variable captures a local reflective condition that may differ even between street sections with otherwise similar geometry.

Because façade-related influences are distance-dependent, each street section was divided into three longitudinal zones according to distance from the bicycle-lane receptor. Figure 1 also defines these zones along both directions of the street axis. The near zone covers the street portion within 0–30 m of the receptor on both sides, giving a total longitudinal length of 60 m. The mid zone covers the next 30 m band on both sides (30–60 m from the receptor), again 60 m in total length. The far zone covers the following 30 m band on both sides (60–90 m from the receptor), with the same total length of 60 m. This symmetric distance-based division was adopted to represent the expected decay of façade-related reflection effects with increasing distance from the receptor: the near zone captures receptor-proximal façade geometry most likely to influence local SPL and early reflections, the mid zone represents intermediate street-boundary conditions, and the far zone reflects more distant façade-continuity effects. The enclosure-related parameters H, C_s, and C_p were therefore calculated separately for these three zones so that the spatial gradient of morphological influence could be retained.

To facilitate parameter calculation, each street section was centered at the receptor point and divided along the street axis at 3 m intervals. For a 200 m street section, the 61 middle cross-sections were considered. On each cross-section, the façade heights on both sides (H_oi and H_ni) were obtained, together with the angles formed by the line from the top of each façade to the center point of the street and the ground. When H_oi or H_ni was non-zero, a façade was considered present and the corresponding façade length (P_oi or P_ni) was recorded as 3 m; otherwise, it was recorded as 0. These geometric quantities formed the basis for calculating H, C_s, and C_p.

The mean façade height (H) represents the average vertical dimension of street façades within each zone. Because façade height often varies substantially along an urban street, a single height descriptor is insufficient. H was therefore calculated separately for the near, mid, and far zones, as shown in Equations (1)–(3).

H_{(n)} = \frac{1}{21} \sum_{i = - 10}^{10} (\frac{H_{o i} + H_{n i}}{2})

(1)

H_{(m)} = \frac{1}{20} (\sum_{i = - 20}^{- 11} (\frac{H_{o i} + H_{n i}}{2}) + \sum_{i = 11}^{20} (\frac{H_{o i} + H_{n i}}{2}))

(2)

H_{(f)} = \frac{1}{20} (\sum_{i = - 30}^{- 21} (\frac{H_{o i} + H_{n i}}{2}) + \sum_{i = 21}^{30} (\frac{H_{o i} + H_{n i}}{2}))

(3)

The cross-sectional enclosure degree (C_s) and plan enclosure degree (C_p) describe street enclosure from sectional and plan perspectives, respectively, following the framework established in the previous study [19]. C_s reflects how strongly the receptor is enclosed by the surrounding street cross-section, while C_p describes façade continuity in plain view and so indirectly indicates the presence of building gaps. Both parameters were also calculated separately for the near, mid, and far zones. Their calculation procedures are given in Equations (4)–(9).

C_{p (n)} = \frac{1}{21} \sum_{i = - 10}^{10} (\frac{P_{o i} + P_{n i}}{6})

(4)

C_{p (m)} = \frac{1}{20} (\sum_{i = - 20}^{- 11} (\frac{P_{o i} + P_{n i}}{6}) + \sum_{i = 11}^{20} (\frac{P_{o i} + P_{n i}}{6}))

(5)

C_{p (f)} = \frac{1}{20} (\sum_{i = - 30}^{- 21} (\frac{P_{o i} + P_{n i}}{6}) + \sum_{i = 21}^{30} (\frac{P_{o i} + P_{n i}}{6}))

(6)

C_{s (n)} = \frac{1}{21} \sum_{i = - 10}^{10} (\frac{θ_{o i} + θ_{n i}}{180})

(7)

C_{s (m)} = \frac{1}{20} (\sum_{i = - 20}^{- 11} (\frac{θ_{o i} + θ_{n i}}{180}) + \sum_{i = 11}^{20} \frac{θ_{o i} + θ_{n i}}{180})

(8)

C_{s (f)} = \frac{1}{20} (\sum_{i = - 30}^{- 21} (\frac{θ_{o i} + θ_{n i}}{180}) + \sum_{i = 21}^{30} (\frac{θ_{o i} + θ_{n i}}{180}))

(9)

Together, the adopted variables represent complementary dimensions of urban street morphology: W_vehicle and W_side describe width configuration; B_p captures immediate front-façade presence at the receptor; H describes average façade height; C_s describes cross-sectional enclosure; and C_p describes plan enclosure and façade continuity. Calculating H, C_s, and C_p separately for the near, mid, and far zones additionally accounts for the distance-decaying influence of surrounding street geometry on the cyclist-side SPL. Table 1 summarizes all input variables.

2.4. Acoustic Simulation Settings

ODEON v14.00 Combined (ODEON A/S, Kgs. Lyngby, Denmark) was used to simulate sound propagation in urban street environments. ODEON has been widely applied and previously validated in street-acoustics research [32,33], and earlier work has reported acceptable agreement between ODEON-based simulation and field measurements in urban-street contexts, supporting its use for morphology-related sound-propagation analysis [18,34,35].

For each sampled section, a 200 m street model was built. A fixed source-power condition was adopted to isolate the contribution of street morphology to the receptor-side SPL and to avoid confounding from the dynamic traffic-source variability. A receptor point representing a cyclist was placed in the bicycle lane at a height of 1.5 m and 1 m from the edge of the vehicle lane. The height approximates the ear height of a cyclist, and the lateral position represents a near-traffic bicycle-lane condition. Ground and façade absorption/scattering coefficients and other ODEON parameters were set according to previous studies. Under these conditions, the simulations capture variation in the cyclist-side SPL attributable to morphology under a fixed source input.

The absorption and scattering coefficients of the ground were both set to 0.1 [36]. The absorption coefficient of façades was set to 0.1 [36], and the scattering coefficient of façades was set to 0.3 [18]. For ODEON simulation parameters, the number of rays was set to 1,000,000, the transition order to 2, the number of reflection rays to 2000, and the impulse response length to 5000 ms [18].

2.5. Model Development and Evaluation

The simulated SPL at the bicycle-lane receptor location was used as the prediction target, with the 12 spatial parameters described above as input variables. To address both linear and nonlinear morphology–SPL relationships, four models were evaluated: Linear Regression, support vector regression (SVR), extreme gradient boosting (XGBoost), and Random Forest (RF). Linear Regression provides a transparent linear baseline; SVR and XGBoost serve as nonlinear benchmarks; RF is the model retained for post hoc interpretation, because it combines nonlinear predictive flexibility with a relatively direct interpretation framework based on permutation importance and partial dependence analysis.

The SPL was retained as a continuous regression target rather than discretized into categorical noise-level classes. Continuous prediction preserves the gradual morphology–SPL variations expected under fixed source-power conditions and allows the subsequent partial dependence analysis to recover threshold-like response ranges from the trained model itself, rather than imposing predefined class boundaries before modeling. A categorical formulation is noted in Section 6 as a planning-oriented extension once external validation against field measurements becomes possible.

For an initial benchmark, the full dataset was randomly divided into a training set (80%) and a test set (20%), with the same split used for all four models. Hyperparameter optimization for SVR, XGBoost, and RF was carried out on the training subset only, using RandomizedSearchCV with five-fold cross-validation. SVR was implemented with a radial-basis-function kernel inside a standardization pipeline because of its sensitivity to predictor scale. XGBoost and RF were fitted with tree-based ensemble regressors. Linear Regression was fitted directly without hyperparameter tuning. The hyperparameter search spaces for SVR, XGBoost, and RF are reported in Supplementary Table S2.

To test whether model performance was robust to sample-level data partitioning, repeated random-split evaluation was performed across 20 independent 80/20 train–test partitions. In each repetition, the same training and test subsets were used for all four models, and hyperparameter optimization was conducted on the training subset only; the best estimator was then refitted on the full training subset and evaluated on the corresponding test subset.

As a stricter robustness check, road-name-based grouped holdout validation was additionally performed using road name as the grouping variable [37,38]. In each of 20 repetitions, approximately 20% of road-name groups were withheld as the test set, ensuring that all samples from a held-out road were excluded from training. For SVR, XGBoost, and RF, hyperparameter tuning was carried out on the training road groups only, using RandomizedSearchCV with GroupKFold cross-validation. The same grouped partitions were used across all four models.

Model performance was assessed with the coefficient of determination (R²), root mean squared error (RMSE), and mean absolute error (MAE). These metrics were calculated for all four models under the illustrative random split, repeated random-split evaluation, and road-name-based grouped holdout validation. XGBoost was used to indicate the highest achievable predictive performance among the tested models, while RF was retained for the subsequent interpretation of feature importance and marginal response patterns, since it reached comparable predictive accuracy and supports a more direct post hoc interpretation framework.

2.6. Inter-Variable Dependence and Multicollinearity Diagnosis

Because the same 12 morphological predictors were used in all four models, the dependence structure among predictors was examined before model interpretation. Pearson correlation coefficients were calculated for all predictor pairs and between each predictor and simulated SPL. Variance inflation factors (VIFs) were also computed to diagnose multicollinearity in the shared predictor set.

The VIF analysis primarily targets the Linear Regression baseline, whose coefficients are sensitive to multicollinearity. For nonlinear models—SVR, XGBoost, and RF—correlated predictors do not invalidate predictive comparison, but they may introduce redundant information and affect the interpretation of feature-importance rankings. Feature importance in tree-based models is therefore interpreted as model reliance under the observed predictor-dependence structure rather than as an independent causal effect.

2.7. Reduced-Variable Sensitivity Analysis

To examine whether the full 12-variable morphology set was needed, a reduced-variable sensitivity analysis was performed using the RF model. This addresses the possibility that cyclist-side SPL prediction is mainly driven by a small number of dominant width-related predictors. Five nested predictor sets were compared: W_vehicle only; W_vehicle and W_side; W_vehicle, W_side, and C_s(n); W_vehicle, W_side, C_s(n), C_p(n), and H_(n); and the full 12-variable morphology set.

The same road-name-based grouped holdout framework used in the main model evaluation was adopted. In each repetition, approximately 20% of road-name groups were withheld as the test set, so that all samples from the same road group were kept within either the training or test subset. For each reduced-variable RF model, hyperparameter optimization was performed on the training road groups only, using RandomizedSearchCV with GroupKFold cross-validation. The same grouped partitions were used across all predictor sets, so that performance differences reflected differences in input-variable composition rather than in data splitting.

This analysis was used to determine whether the full morphology set provided additional predictive or interpretive value beyond the dominant width-related variables. The results are reported in Supplementary Table S4.

2.8. Model Interpretation

For the reasons given in Section 2.5, RF was used for the subsequent interpretation analysis. Permutation feature importance and partial dependence analysis were applied to examine how the trained RF model used the morphological predictors. Permutation importance quantifies the decrease in predictive accuracy after random reshuffling of a given feature [39,40]. Partial dependence plots (PDPs) describe average marginal response patterns of predicted SPLs across the observed predictor ranges [41,42]. Two-dimensional PDPs were also used to explore whether the influence of selected spatial parameters varied under different vehicle-lane-width conditions [41].

Because several predictors were intercorrelated, the interpretation of these tools was deliberately constrained. Permutation importance is read as RF model reliance under the observed predictor-dependence structure, not as the independent causal contribution of each variable. PDPs likewise summarize average model-based response tendencies, rather than isolated physical effects across all possible variable combinations.

2.9. Computational Efficiency Comparison

To test whether the proposed framework provides a practical computational advantage as a morphology-screening tool, a computation-time comparison was performed between ODEON simulation and machine-learning prediction. The ODEON simulation time was obtained from 93 representative street-section models drawn from the original simulation records. These models covered different street configurations and were simulated using the same ODEON version and acoustic settings described in Section 2.4.

All timing tests were run on the same workstation, an Intel Core i9-9900K CPU with 64 GB RAM running Windows 10 64-bit. For the machine-learning models, training time and post-training prediction time were measured in Python 3.12.7 on the same dataset and hardware. Training time and prediction time are reported separately because the practical efficiency of a surrogate model mainly concerns rapid prediction after training. Prediction time was measured for all 5060 street sections and converted to an average per-section value. The complete computation-time comparison is reported in Supplementary Table S5.

3. Results

3.1. Descriptive Statistics, Preliminary Relationships, and Inter-Variable Dependence

Figure 2 summarizes the distributions of the 12 spatial parameters and the simulated SPL values. Among the width-related variables, W_side was concentrated mainly within 0–21 m, whereas W_vehicle was more evenly distributed across a wider range of 7–42 m. For the zonal variables H, C_s, and C_p, distribution patterns were broadly similar across the near, mid, and far zones. H and C_s both showed skewed distributions, with H concentrated mainly between 6 and 24 m and C_s between 0.1 and 0.6. C_p was distributed more heavily towards higher values, suggesting that street sections with very low plan-enclosure values were uncommon in the sampled dataset. The simulated SPL values were mainly distributed between 62 and 70 dB.

The correlation matrix in Figure 3 shows that simulated SPL was most strongly correlated with the width-related variables. Both W_vehicle and W_side were negatively correlated with SPL, indicating that larger source–receptor and receptor–façade distances were generally associated with lower predicted noise levels. The correlation was stronger for W_vehicle (r = −0.84, p < 0.01) than for W_side (r = −0.60, p < 0.01), suggesting that vehicle-lane width was the more influential width descriptor at the bivariate level. Among the enclosure-related variables, C_s showed the strongest positive correlations with SPL (r = 0.68–0.72, p < 0.01), while C_p showed weaker positive correlations (r = 0.22–0.35, p < 0.01). B_p had a weak positive correlation with SPL (r = 0.20, p < 0.01), and H showed only very low positive correlations (r = 0.04–0.09, p < 0.01).

Predictor–predictor correlations further showed that the 12 morphological variables were not fully independent. Within the same zone, C_s and C_p were strongly correlated, with correlation coefficients of approximately 0.70–0.71. Moderate correlations were also observed between H and the enclosure-related variables. For the same morphological descriptor, inter-zone correlations were evident, especially for C_s. The zonal variables thus share some common geometric information, even though they still represent distance-specific descriptions of street morphology.

The correlation between the two most influential width-related variables, W_vehicle and W_side, was moderate rather than high (r = 0.33). They should therefore not be treated as redundant measures of a single width factor: W_vehicle mainly represents the source–receptor distance, while W_side mainly represents the receptor–façade reflection distance. The two variables are related components of street-width configuration but describe different acoustic mechanisms.

VIF analysis confirmed that multicollinearity was concentrated among the zonal enclosure and height variables rather than between W_vehicle and W_side (Supplementary Table S3). C_s(n), C_s(m), and C_s(f) showed high VIF values, followed by the H-related variables. W_side and B_p had low VIF values, and W_vehicle showed only moderate multicollinearity. The Linear Regression coefficients should therefore not be read as independent morphological effects, and Linear Regression was retained only as a transparent predictive baseline. For the nonlinear models, the correlated predictor structure does not invalidate model comparison but requires cautious interpretation of feature-importance results.

3.2. Model Performance

To address the concern that comparison with only a linear baseline was insufficient, two additional nonlinear models, SVR and XGBoost, were included. Table 2 summarizes the predictive performance of Linear Regression, SVR, XGBoost, and RF under the illustrative random split, repeated random-split evaluation, and road-name-based grouped holdout validation.

In the illustrative random 80/20 split (Figure 4), all nonlinear models clearly outperformed Linear Regression. The linear baseline reached an R² of 0.900, an RMSE of 0.895 dB, and an MAE of 0.722 dB. SVR substantially improved the prediction (R² = 0.990, RMSE = 0.289 dB, MAE = 0.173 dB), and XGBoost achieved the highest accuracy in this split (R² = 0.997, RMSE = 0.155 dB, MAE = 0.103 dB). RF showed similarly strong performance (R² = 0.996, RMSE = 0.183 dB, MAE = 0.112 dB). Predicted values from XGBoost and RF aligned closely with the simulated SPL values, while Linear Regression showed greater dispersion around the 1:1 line. The morphology–SPL relationship in this dataset was thus better captured by nonlinear models than by a purely linear one.

Across 20 repeated random 80/20 splits, the same overall ranking held. Linear Regression yielded R² = 0.897 ± 0.006, RMSE = 0.891 ± 0.019 dB, MAE = 0.718 ± 0.016 dB. SVR improved performance to R² = 0.989 ± 0.002, RMSE = 0.290 ± 0.031 dB, MAE = 0.181 ± 0.031 dB. XGBoost achieved the highest repeated-split performance (R² = 0.996 ± 0.001, RMSE = 0.167 ± 0.011 dB, MAE = 0.107 ± 0.004 dB), and RF was comparable but with slightly higher errors (R² = 0.996 ± 0.001, RMSE = 0.185 ± 0.012 dB, MAE = 0.113 ± 0.004 dB).

Under the stricter road-name-based grouped holdout validation, predictive performance decreased for all models, indicating that ordinary random splitting benefited partly from within-road similarity. The nonlinear models nevertheless retained clear advantages over Linear Regression. Linear Regression yielded R² = 0.876 ± 0.027, RMSE = 0.951 ± 0.079 dB, MAE = 0.778 ± 0.073 dB. SVR improved grouped-holdout performance to R² = 0.922 ± 0.018, RMSE = 0.755 ± 0.091 dB, MAE = 0.549 ± 0.084 dB. XGBoost achieved the highest grouped-holdout accuracy (R² = 0.953 ± 0.018, RMSE = 0.583 ± 0.119 dB, MAE = 0.418 ± 0.082 dB), and RF retained strong performance (R² = 0.938 ± 0.041, RMSE = 0.662 ± 0.210 dB, MAE = 0.453 ± 0.128 dB).

Across all evaluation settings, the nonlinear models outperformed the linear baseline. XGBoost was the most accurate predictor; RF, with comparable accuracy, is used in the following sections for interpretation.

Because all four models used the same 12 morphological predictors, the performance comparison was conducted under the same predictor-dependence structure identified in Section 3.1. The lower performance of Linear Regression should therefore be read with care: it may reflect both the limited ability of a purely linear model to capture nonlinear morphology–SPL relationships and the sensitivity of linear coefficient estimates to multicollinearity. Including SVR and XGBoost reduces the risk that RF was compared only with a weak linear baseline, while keeping the subsequent interpretation focused on RF for its balance between predictive performance and post hoc interpretability.

3.3. Reduced-Variable Model Comparison

A reduced-variable RF analysis examined whether comparable performance could be reached with a smaller subset of predictors. The results showed a clear hierarchy in predictor contribution (Supplementary Table S4). The model using W_vehicle alone achieved R² = 0.724 ± 0.063, RMSE = 1.415 ± 0.113 dB, MAE = 1.107 ± 0.087 dB. The dominant vehicle-lane-width descriptor thus captured an important share of the morphology-related SPL variation but was not sufficient on its own.

Adding W_side substantially improved performance, raising R² to 0.887 ± 0.043 and reducing RMSE and MAE to 0.898 ± 0.160 dB and 0.631 ± 0.091 dB, respectively. The two width-related variables together represented the primary width-configuration signal. Adding C_s(n) further improved the model (R² = 0.908 ± 0.031, RMSE = 0.813 ± 0.141 dB), suggesting that local cross-sectional enclosure adds information beyond width configuration alone.

Adding C_p(n) and H_(n) produced only marginal further gain, with R² rising slightly from 0.908 ± 0.031 to 0.910 ± 0.032. The full 12-variable model achieved the best overall RF performance (R² = 0.938 ± 0.041, RMSE = 0.662 ± 0.210 dB, MAE = 0.453 ± 0.128 dB). Cyclist-side SPL prediction was therefore width-dominated but not purely width-determined. The full variable set is retained as a complete morphology representation, rather than as evidence that all 12 variables contribute equally or independently.

3.4. Relative Importance of Spatial Parameters

The reduced-variable analysis showed that width-related variables accounted for the main predictive signal, while the full 12-variable set still gave the best RF performance. To examine how the full RF model used the individual predictors, permutation importance was calculated for the 12-variable RF model (Figure 5).

Permutation importance was unevenly distributed across the 12 predictors. Because the correlation and VIF analyses indicated intercorrelations among several predictors, the importance values are read as the extent to which the trained RF model relied on each variable under the observed predictor-dependence structure, rather than as independent causal effects. W_vehicle ranked first (importance = 1.008 ± 0.0145), followed by W_side and C_s(n). The importance of W_vehicle was approximately four times that of W_side and approximately 30 times that of C_s(n). W_vehicle and W_side together accounted for 93.9% of the total permutation importance, indicating that width-related variables dominated RF prediction of cyclist-side SPL.

Although the importance value of C_s(n) did not exceed 0.05, it remained higher than that of the other non-width variables, suggesting that local cross-sectional enclosure provided secondary information to the RF model. Given the correlations among the zonal variables, this should be read as evidence of additional model reliance on receptor-proximal enclosure information rather than a fully independent effect of C_s(n) alone. For the zonal variables C_s, C_p, and H, permutation importance generally decreased from the near zone to the mid zone and then to the far zone. This distance-decay pattern indicates that morphological features closer to the receptor contributed more to SPL prediction than those farther away. Overall, basic width configuration dominated model performance, while enclosure-related variables played secondary but still interpretable roles, especially in zones closest to the receptor. Given the dominant role of width-related variables and the non-negligible contribution of selected enclosure-related parameters, the marginal effects of individual predictors were further examined using partial dependence analysis.

3.5. Model-Based Average Marginal Response Patterns of Spatial Parameters

Partial dependence analysis showed that several spatial parameters exhibited threshold-like average response patterns in predicted SPLs (Figure 6). Among all predictors, the width-related variables showed the strongest and most continuous patterns within the trained model. The PDP for W_vehicle has four stages. When W_vehicle was below approximately 13 m, increasing it had little effect on predicted SPL. As W_vehicle increased from 13 to 22 m, predicted SPL decreased markedly, by approximately 3.1 dB. From 22 to 50 m, predicted SPL continued to decrease more gradually, by approximately 2.5 dB. Beyond 50 m, additional change was negligible. A similar but weaker negative pattern appeared for W_side, whose influence was concentrated mainly in the lower-width range. The PDPs thus show that width-related parameters dominated the model and that their response patterns were most pronounced within specific value ranges rather than across the entire parameter domain.

Compared with the width-related variables, C_s(n) showed a smaller but still interpretable positive effect. SPL changed little when C_s(n) increased from 0 to approximately 0.22, then increased by about 0.65 dB as C_s(n) rose to approximately 0.46, after which the curve largely flattened. The binary front-façade indicator B_p also showed a distinct marginal effect: compared with B_p = 0, the presence of a façade directly in front of the receptor (B_p = 1) raised SPL by approximately 1.34 dB. Local enclosure and immediate front-façade presence can therefore elevate cyclist-side SPL, but their effects are clearly smaller than those of the width-related variables.

Additional one-dimensional PDPs for C_s(m), C_s(f), H_(n), H(m), H(f), C_p(n), C_p(m), and C_p(f) are provided in Supplementary Figure S1. These curves show weaker or more threshold-dependent tendencies than those in Figure 6: the C_s curves generally weakened from the near zone to the far zone, the H curves remained relatively flat across most of their ranges, and the C_p curves became more influential only when plan enclosure approached near-complete continuity.

3.6. Exploratory Interaction Patterns

Given its dominant importance, W_vehicle was used as the conditioning variable in the exploratory interaction analysis. Figure 7 shows the two-dimensional partial dependence plots for W_vehicle in combination with W_side, C_s(n), and C_s(m). Figure 7a confirms that both W_vehicle and W_side were negatively associated with predicted SPL, consistent with the one-dimensional PDP results. Their joint response pattern, however, was not uniform across the full parameter space. When W_vehicle was below approximately 15 m, the influence of W_side on predicted SPL was concentrated mainly within the range of 5–10 m. When W_vehicle exceeded 15 m, the effective range of W_side broadened, and its influence was concentrated mainly within 5–15 m. The marginal benefit of increasing the sidewalk width thus varied with the vehicle-lane width rather than remaining constant across all street sections.

Figure 7b,c show that the average response patterns of C_s(n) and C_s(m) were also conditioned by W_vehicle. When W_vehicle exceeded approximately 20 m, the marginal patterns of C_s(n) and C_s(m) became broadly similar: in both cases, predicted SPL increased gradually as enclosure rose from 0 to 0.5 and changed little as enclosure rose from 0.5 to 0.8. When W_vehicle was below 20 m, however, the response pattern of C_s(n) was clearly stronger than that of C_s(m). These exploratory interaction patterns suggest that enclosure-related variables matter more under relatively narrow vehicle-lane conditions and that their marginal influence weakens once the primary width condition has improved.

3.7. Computational Efficiency

The computation-time comparison further supports the use of the trained machine-learning models as rapid morphology-screening tools. Across the 93 representative ODEON simulation records, the median simulation time for one street section was 2 min 33 s, with an interquartile range from 2 min 16 s to 3 min 22 s. The mean simulation time was 3 min 26 s, and the maximum recorded time was 19 min 38 s. Using the median ODEON time as the reference, simulating all 5060 street sections would take approximately 215.05 h; using the mean time, this would take approximately 289.56 h.

The trained machine-learning models required substantially less post-training prediction time. XGBoost required 23.24 s for training and 0.013 s to predict SPL values for all 5060 sections. RF required 195.08 s for training and 0.143 s to predict all 5060 sections. The corresponding batch prediction time per section was approximately 0.003 ms for XGBoost and 0.028 ms for RF. Relative to the median ODEON simulation time, this corresponds to a post-training batch-prediction speed-up of approximately 5.96 × 10⁷ for XGBoost and 5.42 × 10⁶ for RF.

These results do not imply that the machine-learning models can replace ODEON simulation or field measurement. Rather, they show that, once an ODEON-generated training dataset is available, the trained models can rapidly approximate ODEON-simulated SPL for additional morphology-based screening cases.

4. Discussion

4.1. Principal Findings in Relation to Previous Studies

Three principal findings emerged from the revised analysis. First, nonlinear machine-learning models predicted the cyclist-side SPL more accurately than Linear Regression under both random-split and road-name-based grouped holdout validation. Second, the reduced-variable analysis indicated that the morphology–SPL relationship was width-dominated but not purely width-determined: W_vehicle and W_side captured the primary width-configuration signal, while adding C_s(n) and the full 12-variable morphology set further improved RF performance. Third, enclosure-related variables contributed secondary and condition-dependent information, especially for receptor-proximal zones and narrower vehicle-lane-width conditions. Together, these findings suggest that near-road SPL prediction from street morphology is governed primarily by street-width configuration, while façade-related descriptors add more limited and context-dependent information.

The reduced-variable analysis also clarifies how the 12-variable input set should be read. The data do not support a view in which all 12 variables contribute equally or independently. The RF model was primarily driven by width configuration, especially W_vehicle and W_side; yet the two-variable width model did not fully reproduce the performance of the full morphology model. C_s(n) added information on local cross-sectional enclosure, and the full 12-variable set produced a further small improvement under grouped holdout validation. The full variable set is therefore best understood as a complete morphology representation for comparative screening, not as evidence that every individual parameter is indispensable.

The dominant role of width-related variables is broadly consistent with earlier studies showing that street width and canyon configuration substantially influence reflection, attenuation, and local traffic-noise exposure in urban streets [18,19,20,43]. Many of those studies, however, focused either on block-scale distribution patterns or on source-oriented prediction [4,5,10,11]. The present analysis extends this line of work to receptor-specific prediction at bicycle-lane locations under fixed source conditions. The contribution is not to restate that street width matters, but to show that, for the cyclist-side SPL close to the carriageway, width-related descriptors outweigh other morphology variables in predictive performance.

A second result of note is the distance-decay pattern of morphology-related variables and the non-uniform role of enclosure-related variables such as C_s and C_p. The stronger contribution of near-zone variables agrees with previous work showing that receptor-proximal street geometry is particularly influential for local SPL prediction [19,31]. At the same time, the limited but non-negligible role of C_s, together with the threshold-like behavior of C_p in Supplementary Figure S1, shows that enclosure-related parameters are not universally dominant predictors. Their contribution depends on both receptor proximity and the primary width condition of the street, refining earlier morphology-oriented discussions by showing that enclosure effects are configuration-dependent and sensitive to receptor location [19,20].

4.2. Mechanistic Interpretation of Morphology–Noise Relationships

The dominant role of W_vehicle can be explained by its direct control over the propagation distance between the traffic source and the bicycle-lane receptor. Under fixed source-power conditions, increasing the effective source–receptor separation lengthens the propagation path of the direct sound component before it reaches the receptor, and so reduces local SPL more effectively than many other geometric adjustments. W_side, by contrast, mainly affects the distance associated with façade-related reflections rather than the direct source–receptor path. This helps explain why both width-related variables showed negative marginal effects but W_vehicle was markedly more influential than W_side in both the permutation-importance ranking and the partial dependence analysis [18,19,20,34,43]. The threshold-like behavior in the PDPs further indicates that the acoustic benefit of widening is strongest within relatively constrained ranges and then diminishes.

The stronger effects of near-zone parameters are also physically plausible. Street geometry close to the receptor is more likely to shape the local balance between direct sound and early façade reflections, whereas features farther away contribute less to SPL at a fixed receptor point. This reading is consistent both with the geometric-reflection principle adopted in the present parameter system and with earlier morphology-oriented studies showing that receptor-proximal street form is especially relevant to local sound propagation [19,20,34,36,43]. The progressive decline in correlation strength and permutation importance from the near zone to the far zone therefore supports a distance-decay interpretation.

The PDPs in Figure 6 and Supplementary Figure S1 also clarify the differing roles of C_s, H, and B_p. Compared with mean façade height, cross-sectional enclosure provides a more integrated description of how strongly the receptor is surrounded by reflective boundaries within the street section, which likely explains why C_s showed clearer and more consistent marginal effects than H, whose influence flattened above a moderate height range. The non-negligible effect of B_p suggests in turn that the immediate presence of a façade directly in front of the receptor alters the local reflective condition more than the average height of façades distributed along the street segment [31,32,36]. For SPL prediction close to the carriageway, in other words, local reflective geometry matters more than height alone.

The behavior of C_p in Supplementary Figure S1, taken together with the interaction results, indicates that enclosure effects are strongly configuration-dependent. Across much of its range, C_p exerted only limited marginal influence, yet its effect increased noticeably as façade continuity approached complete closure. Plan enclosure is therefore unlikely to act as a uniformly strong predictor unless the street boundary becomes nearly continuous and the reflective environment correspondingly more uniform. The two-dimensional PDPs further show that the effects of C_s(n) and C_s(m) were more pronounced under relatively narrow W_vehicle conditions and weakened once the primary width condition had improved. Street width and enclosure should therefore be read together rather than as independent controls [19,20,36,43].

4.3. Planning Implications and Design Relevance

From a planning and design standpoint, width-related variables should be prioritized when comparing the morphology-related cyclist-side SPL under fixed source conditions [43]. The implication that follows from the dominance of W_vehicle and W_side concerns geometric separation, not motor-vehicle lane width as such: widening motor-vehicle lanes in real streets may induce higher traffic volumes or vehicle speeds, which could offset or even reverse the acoustic benefit predicted under fixed source-power conditions. The morphology-based strategy that follows most directly from the present results is therefore to increase the effective source-to-receptor separation, or to increase the separation between the bicycle lane and adjacent reflective boundaries, particularly within the parameter ranges where the PDPs showed the greatest SPL sensitivity [18,19,20,43].

The results also suggest that enclosure-related interventions matter more in relatively narrow streets. Where vehicle-lane width is constrained, reducing local cross-sectional enclosure may provide additional SPL reduction, whereas the benefit of enclosure-related adjustment becomes smaller once the primary width condition has improved. The behavior of C_p in Supplementary Figure S1 likewise suggests that introducing a small interruption in an otherwise continuous street façade may help reduce SPL, while further increasing the size or number of building gaps is unlikely to produce proportionally larger benefits. In design terms, increasing effective separation is the primary morphology-screening criterion, while enclosure modification is best treated as a secondary and context-dependent refinement, particularly for narrow or strongly bounded street sections [19,20,43].

4.4. Interpretation Boundary and Methodological Implications

The framework treats a morphology-based simulation chain as a receptor-specific prediction task for cyclist-side SPLs. Its primary value lies in comparative screening: training on a single-city, fixed-source-power dataset constrains the scope to morphology comparison, not regulatory prediction. The computation-time comparison clarifies this role. ODEON simulation remains necessary for generating physically based acoustic outputs and for building the training dataset, but the trained machine-learning models can approximate the ODEON-simulated morphology–SPL mapping with negligible post-training prediction time. The framework is therefore most useful for early-stage comparison of alternative street-morphology configurations, where many design cases may need to be screened rapidly before detailed acoustic simulation or field validation is undertaken.

Including SVR and XGBoost reduces the concern that RF was compared only with a linear baseline. All four models were nevertheless trained on the same 12 morphological predictors, and the correlation and VIF analyses showed that this predictor set was not fully independent. The four-model comparison should accordingly be read as a predictive-performance comparison under the same correlated morphology-variable structure, not as a basis for inferring independent causal effects of individual predictors.

The same caution applies in different forms to the linear and the tree-based models. For the Linear Regression baseline, multicollinearity directly affects coefficient estimates: the linear model remains useful as a transparent predictive reference, but its coefficients should not be read as independent effects of individual morphological variables, and the relatively lower performance of Linear Regression may reflect both the nonlinear nature of the morphology–SPL relationship and the instability of coefficient estimates under correlated predictors. For tree-based models such as RF and XGBoost, correlated predictors do not generally prevent accurate prediction, but they can influence feature-importance rankings, since importance may be shared or redistributed among correlated variables. The RF permutation-importance results reported here are therefore read as model reliance under the observed predictor-dependence structure rather than as estimates of independent causal contribution.

The same logic applies to the two width-related predictors and to the zonal variables. W_vehicle and W_side were only moderately correlated, so their combined dominance reflects a width-configuration-dominated morphology–SPL relationship rather than an artefact of duplicated predictors: W_vehicle mainly captures source–receptor distance, while W_side captures receptor–façade reflection distance. The zonal H, C_s, and C_p variables are correlated because adjacent zones belong to the same street section; they were retained because they encode distance-specific geometric information and allow the model to test whether receptor-proximal morphology contributes more strongly than morphology farther from the receptor. Their importance pattern therefore reads as distance-sensitive model reliance rather than as fully isolated zone-specific causal effects.

5. Conclusions

Drawing on 5060 ODEON-simulated street sections from 195 streets in Harbin and 12 morphology-related input variables, we compared Linear Regression, SVR, XGBoost, and RF under an illustrative random split, repeated random-split evaluation, and road-name-based grouped holdout validation to predict the cyclist-side simulated SPL under fixed source-power conditions. The nonlinear models substantially outperformed the linear baseline. XGBoost achieved the highest predictive accuracy among the tested models, while RF reached comparable accuracy and was retained for the interpretation of feature importance and marginal response patterns.

Within this simulation framework, street morphology alone accounts for most of the variation in the cyclist-side SPL, with street-width configuration as the dominant predictor. W_vehicle and W_side were the two most influential variables, together accounting for 93.9% of the total permutation importance. Morphological parameters closer to the receptor exerted stronger effects than those in the mid and far zones, underscoring the role of receptor-proximal street geometry in near-road sound propagation. Most predictors showed threshold-like marginal effects, with their acoustic influence weakening once certain values were reached.

Among the enclosure-related variables, C_s remained relevant, especially under relatively narrow W_vehicle conditions, while the effect of C_p was limited across much of its range and became more pronounced only as façade continuity approached complete closure. Width-related parameters should therefore be prioritized in morphology-based screening and design comparison under fixed source conditions, with enclosure-related parameters treated as secondary and configuration-dependent.

Overall, the framework offers a fast, morphology-sensitive screening tool for early-stage comparison of a cyclist-side simulated SPL during street planning and design. Its practical value lies in screening morphology alternatives efficiently once an ODEON-generated training dataset is available; it does not replace detailed acoustic simulation, field measurement, or regulatory traffic-noise assessment.

6. Limitations and Future Work

The study has several limitations.

The modeling dataset was derived from simulated street samples in a single city, which may limit transferability to other urban contexts with different street morphologies and building configurations. The acoustic simulations were also performed under a fixed source-power condition, so the model captures the relative effect of street morphology on cyclist-side SPL under controlled conditions rather than the full variability of real traffic noise under changing traffic flow, vehicle composition, and speed. The computational efficiency reported here should accordingly be understood as the efficiency of approximating ODEON-simulated SPL after model training, not as evidence that detailed acoustic simulation or field measurement can be omitted in the final assessment.

Several potentially relevant factors—including meteorological conditions, façade material diversity, and temporal variation in traffic states—were not explicitly incorporated. Although robustness was examined using both repeated random-split evaluation and road-name-based grouped holdout validation, the model was evaluated only within the present single-city simulation framework and was not tested against external measurements, independent datasets, or street samples from other cities.

Several morphological predictors were also correlated, especially the zonal enclosure and height variables. This does not invalidate the predictive comparison among Linear Regression, SVR, XGBoost, and RF, but it limits the interpretation of individual coefficients and feature-importance rankings. The reported RF importance values and PDP-based response patterns should therefore be read as model-based summaries under the observed predictor-dependence structure rather than as independent causal effects.

Future work should validate the framework with field measurements across multiple cities and a wider range of street types. Incorporating dynamic traffic-source descriptors—such as traffic volume, vehicle speed, and heavy-vehicle proportion—would also improve applicability under real operating conditions. Further extensions could integrate façade acoustic properties, meteorological effects, a broader set of urban morphological indicators, and stronger external validation strategies to improve generalizability and practical relevance. Categorical noise-level prediction, for example, using low-, medium-, and high-noise classes based on established acoustic thresholds, is a further direction that may reduce sensitivity to absolute simulation errors and improve interpretability for planning use.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/buildings16102023/s1, Table S1: Estimated coefficients of the linear regression baseline; Table S2: Hyperparameter search spaces for SVR, XGBoost, and Random Forest; Table S3: VIF diagnosis of the 12 morphological predictors; Table S4: Reduced-variable RF model comparison under road-name-based grouped holdout validation; Table S5: Computation-time comparison between ODEON simulation and machine-learning prediction; Figure S1: One-dimensional partial dependence plots (PDPs) for the remaining spatial predictors of cyclist-side SPL; Figure S2: Boxplots of model performance for Linear Regression, SVR, XGBoost, and Random Forest across repeated random-split evaluation and road-name-based grouped holdout validation.

Author Contributions

Conceptualization, H.W. and J.K.; methodology, H.W.; software, H.W.; validation, H.W. and X.L.; formal analysis, H.W.; investigation, H.W., Q.W. and X.L.; resources, H.W., Q.W. and X.L.; data curation, H.W. and X.L.; writing—original draft preparation, H.W.; writing—review and editing, H.W. and J.K.; visualization, H.W.; supervision, J.K.; project administration, H.W.; funding acquisition, H.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 51908506 and the Undergraduate Natural Science Innovation Fund of Huazhong University of Science and Technology, grant number 82500029.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The authors thank the anonymous reviewers for their constructive comments, which helped improve the clarity and methodological transparency of the manuscript. Random Forest, XGBoost, SVR, and Linear Regression analyses were conducted in Python 3.12.7 using scikit-learn 1.5.1 and XGBoost 3.2.0. Microsoft Excel (Office LTSC Standard for Mac 2024) was used for data organization and for preparing part of the figures.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

SPL	Sound pressure level
RMSE	Root mean square error
MAE	Mean absolute error
FHWA	Federal Highway Administration
CNOSSOS-EU	the Common Noise Assessment Methods in Europe
ANN	Artificial Neural Network
SVR	Support vector regression
XGBoost	Extreme gradient boosting
RF	Random Forest
W_vehicle	Width of vehicle lane
W_side	Width of sidewalk
B_p	Presence of a façade directly in front of the receptor (0/1)
H_(n)	Mean façade height in near zone
H_(m)	Mean façade height in mid zone
H_(f)	Mean façade height in far zone
C_s(n)	Cross-sectional enclosure degree in near zone
C_s(m)	Cross-sectional enclosure degree in mid zone
C_s(f)	Cross-sectional enclosure degree in far zone
C_p(n)	Plan enclosure degree in near zone
C_p(m)	Plan enclosure degree in mid zone
C_p(f)	Plan enclosure degree in far zone
PDPs	Partial dependence plots

References

Montenegro, A.L.; Rey-Gozalo, G.; Arenas, J.P.; Suárez, E. Streets Classification Models by Urban Features for Road Traffic Noise Estimation. Sci. Total Environ. 2024, 932, 173005. [Google Scholar] [CrossRef] [PubMed]
Yu, H.; Li, A. Assessing Traffic Noise and Its Impact on High-Rise Apartment Buildings Adjacent to an Urban Expressway: A Case Study in Chengdu, China. Buildings 2024, 14, 1377. [Google Scholar] [CrossRef]
Hamouta, S.; Zemmouri, N.; Ahriz, A. Facade Design and the Outdoor Acoustic Environment: A Case Study at Batna 1 University. Buildings 2024, 14, 3339. [Google Scholar] [CrossRef]
Zhou, Z.; Zhang, M.; Gao, X.; Gao, J.; Kang, J. Analysis of Traffic Noise Spatial Distribution Characteristics and Influencing Factors in High-Density Cities. Appl. Acoust. 2024, 217, 109838. [Google Scholar] [CrossRef]
Yang, Q.; Xia, M.; Huang, J. Research on the Effects of Spatial Forms in Residential Blocks on Road Traffic Noise Distribution in Typical City of China. Buildings 2024, 14, 2556. [Google Scholar] [CrossRef]
Apparicio, P.; Carrier, M.; Gelb, J.; Séguin, A.-M.; Kingham, S. Cyclists’ Exposure to Air Pollution and Road Traffic Noise in Central City Neighbourhoods of Montreal. J. Transp. Geogr. 2016, 57, 63–69. [Google Scholar] [CrossRef]
Apparicio, P.; Gelb, J. Cyclists’ Exposure to Road Traffic Noise: A Comparison of Three North American and European Cities. Acoustics 2020, 2, 73–86. [Google Scholar] [CrossRef]
Gelb, J.; Apparicio, P. Cyclists’ Exposure to Atmospheric and Noise Pollution: A Systematic Literature Review. Transp. Rev. 2021, 41, 742–765. [Google Scholar] [CrossRef]
Gelb, J.; Apparicio, P. Cyclists’ Exposure to Air and Noise Pollution, Comparative Approach in Seven Cities. Transp. Res. Interdiscip. Perspect. 2022, 14, 100619. [Google Scholar] [CrossRef]
Guarnaccia, C.; Mascolo, A.; Aumond, P.; Can, A.; Rossi, D. From Early to Recent Models: A Review of the Evolution of Road Traffic and Single Vehicles Noise Emission Modelling. Curr. Pollut. Rep. 2024, 10, 662–683. [Google Scholar] [CrossRef]
Umar, I.K.; Adamu, M.; Mostafa, N.; Riaz, M.S.; Haruna, S.I.; Hamza, M.F.; Ahmed, O.S.; Azab, M. The State-of-the-Art in the Application of Artificial Intelligence-Based Models for Traffic Noise Prediction: A Bibliographic Overview. Cogent Eng. 2024, 11, 2297508. [Google Scholar] [CrossRef]
Staab, J.; Weigand, M.; Schady, A.; Droin, A.; Cea, D.; Dallavalle, M.; Nikolaou, N.; Valizadeh, M.; Wolf, K.; Wurm, M.; et al. National Road Traffic Noise Estimation with Ensemble Learning and Multimodal Geodata. Transp. Res. Part D Transp. Environ. 2025, 149, 105063. [Google Scholar] [CrossRef]
Huang, J.; Fei, T.; Kang, Y.; Li, J.; Liu, Z.; Wu, G. Estimating Urban Noise along Road Network from Street View Imagery. Int. J. Geogr. Inf. Sci. 2024, 38, 128–155. [Google Scholar] [CrossRef]
Pan, J.; He, Y.; Ma, W.; An, S.; Li, L.; Huang, D.; Jia, D. Machine Learning-Enhanced 3D GIS Urban Noise Mapping with Multi-Modal Factors. ISPRS Int. J. Geo-Inf. 2025, 14, 223. [Google Scholar] [CrossRef]
Helbich, M.; Hagenauer, J.; Burov, A.; Dzhambov, A.M. Traffic Noise Assessment in Urban Bulgaria Using Explainable Machine Learning. Sustain. Cities Soc. 2025, 120, 106169. [Google Scholar] [CrossRef]
Acosta, Ó.; Montenegro, C.; González Crespo, R. Road Traffic Noise Prediction Model Based on Artificial Neural Networks. Heliyon 2024, 10, e36484. [Google Scholar] [CrossRef] [PubMed]
Fallah-Shorshani, M.; Yin, X.; McConnell, R.; Fruin, S.; Franklin, M. Estimating Traffic Noise over a Large Urban Area: An Evaluation of Methods. Environ. Int. 2022, 170, 107583. [Google Scholar] [CrossRef] [PubMed]
Lee, P.J.; Kang, J. Effect of Height-to-Width Ratio on the Sound Propagation in Urban Streets. Acta Acust. United Acust. 2015, 101, 73–87. [Google Scholar] [CrossRef]
Wu, H.; Kang, J.; Jin, H. Effects of Urban Street Spatial Parameters on Sound Propagation. Environ. Plan. B Urban Anal. City Sci. 2019, 46, 341–358. [Google Scholar] [CrossRef]
Echevarria Sanchez, G.M.; Renterghem, T.; Thomas, P.; Botteldooren, D. The Effect of Street Canyon Design on Traffic Noise Exposure along Roads. Build. Environ. 2016, 97, 96–110. [Google Scholar] [CrossRef]
Tong, H.; Kang, J. Characteristics of Noise Complaints and the Associations with Urban Morphology: A Comparison across Densities. Environ. Res. 2021, 197, 111045. [Google Scholar] [CrossRef]
Lu, X.; Kang, J.; Zhu, P.; Cai, J.; Guo, F.; Zhang, Y. Influence of Urban Road Characteristics on Traffic Noise. Transp. Res. Part D Transp. Environ. 2019, 75, 136–155. [Google Scholar] [CrossRef]
Han, X.; Huang, X.; Liang, H.; Ma, S.; Gong, J. Analysis of the Relationships between Environmental Noise and Urban Morphology. Environ. Pollut. 2018, 233, 755–763. [Google Scholar] [CrossRef]
Ryu, H.; Park, I.K.; Chun, B.S.; Chang, S.I. Spatial Statistical Analysis of the Effects of Urban Form Indicators on Road-Traffic Noise Exposure of a City in South Korea. Appl. Acoust. 2017, 115, 93–100. [Google Scholar] [CrossRef]
Salomons, E.M.; Pont, M.B. Urban Traffic Noise and the Relation to Urban Density, Form, and Traffic Elasticity. Landsc. Urban Plan. 2012, 108, 2–16. [Google Scholar] [CrossRef]
Forssén, J.; Gustafson, A.; Berghauser Pont, M.; Haeger-Eugensson, M.; Achberger, C.; Rosholm, N. Effects of Urban Morphology on Traffic Noise: A Parameter Study Including Indirect Noise Exposure and Estimated Health Impact. Appl. Acoust. 2022, 186, 108436. [Google Scholar] [CrossRef]
Zhang, J.; Huang, M.; Wan, N.; Deng, Z.; He, Z.; Luo, J. Missing Measurement Data Recovery Methods in Structural Health Monitoring: The State, Challenges and Case Study. Measurement 2024, 231, 114528. [Google Scholar] [CrossRef]
Huang, M.; Zhao, W.; Gu, J.; Lei, Y. Damage Identification of a Steel Frame Based on Integration of Time Series and Neural Network under Varying Temperatures. Adv. Civ. Eng. 2020, 2020, 4284381. [Google Scholar] [CrossRef]
Huang, M.; Zhang, J.; Hu, J.; Ye, Z.; Deng, Z.; Wan, N. Nonlinear Modeling of Temperature-Induced Bearing Displacement of Long-Span Single-Pier Rigid Frame Bridge Based on DCNN-LSTM. Case Stud. Therm. Eng. 2024, 53, 103897. [Google Scholar] [CrossRef]
Huang, M.; Wan, N.; Zhu, H. Reconstruction of Structural Acceleration Response Based on CNN-BiGRU with Squeeze-and-Excitation under Environmental Temperature Effects. J. Civ. Struct. Health Monit. 2025, 15, 985–1003. [Google Scholar] [CrossRef]
Hall, F.; Papakyriakou, M.; Quirt, J. Comparison of Outdoor Microphone Locations for Measuring Sound Insulation of Building Facades. J. Sound Vib. 1984, 92, 559–567. [Google Scholar] [CrossRef]
Ismail, M.R.; Oldham, D.J. A Scale Model Investigation of Sound Reflection from Building Façades. Appl. Acoust. 2005, 66, 123–147. [Google Scholar] [CrossRef]
Kim, M.-J.; Yang, H.-S.; Kang, J. A Case Study on Controlling Sound Fields in a Courtyard by Landscape Designs. Landsc. Urban Plan. 2014, 123, 10–20. [Google Scholar] [CrossRef]
Picaut, J.; Le Pollès, T.; L’hermite, P.; Gary, V. Experimental Study of Sound Propagation in a Street. Appl. Acoust. 2005, 66, 149–173. [Google Scholar] [CrossRef]
Guo, M.; Ni, M.Y.; Shyu, R.-J.; Ji, J.S.; Huang, J. Automated Simulation for Household Road Traffic Noise Exposure: Application and Field Evaluation in a High-Density City. Comput. Environ. Urban Syst. 2023, 104, 102000. [Google Scholar] [CrossRef]
Kang, J. Sound Propagation in Street Canyons: Comparison between Diffusely and Geometrically Reflecting Boundaries. J. Acoust. Soc. Am. 2000, 107, 1394–1404. [Google Scholar] [CrossRef]
Roberts, D.R.; Bahn, V.; Ciuti, S.; Boyce, M.S.; Elith, J.; Guillera-Arroita, G.; Hauenstein, S.; Lahoz-Monfort, J.J.; Schröder, B.; Thuiller, W.; et al. Cross-Validation Strategies for Data with Temporal, Spatial, Hierarchical, or Phylogenetic Structure. Ecography 2017, 40, 913–929. [Google Scholar] [CrossRef]
Wang, Y.; Khodadadzadeh, M.; Zurita-Milla, R. Spatial+: A New Cross-Validation Method to Evaluate Geospatial Machine Learning Models. Int. J. Appl. Earth Obs. Geoinf. 2023, 121, 103364. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Altmann, A.; Toloşi, L.; Sander, O.; Lengauer, T. Permutation Importance: A Corrected Feature Importance Measure. Bioinformatics 2010, 26, 1340–1347. [Google Scholar] [CrossRef] [PubMed]
Friedman, J.H. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Molnar, C.; Freiesleben, T.; König, G.; Herbinger, J.; Reisinger, T.; Casalicchio, G.; Wright, M.N.; Bischl, B. Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process. In Proceedings of the World Conference on Explainable Artificial Intelligence; Springer: Berlin/Heidelberg, Germany, 2023; pp. 456–479. [Google Scholar]
Kang, J. Sound Propagation in Interconnected Urban Streets: A Parametric Study. Environ. Plan. B Plan. Des. 2001, 28, 281–294. [Google Scholar] [CrossRef]

Figure 1. Schematic illustration of street configuration, receptor location, zonal division, and spatial parameters used in the model. The near, mid, and far zones correspond to 0–30 m, 30–60 m, and 60–90 m from the bicycle-lane receptor point along both directions of the street axis, respectively.

Figure 2. Distributions of the morphological predictors and simulated SPL: (a) width-related variables, W_vehicle and W_side; (b) mean façade height variables, H_(f), H_(f), and H_(f); (c) cross-sectional enclosure variables, C_s(n), C_s(m), and C_s(f); (d) plan enclosure variables, C_p(n), C_p(m), and C_p(f); and (e) simulated SPL.

Figure 3. Pearson’s correlation matrix among morphological predictors and simulated SPL. All off-diagonal correlation coefficients are significant at p < 0.01.

Figure 4. Predicted versus simulated SPL values on the illustrative random 80/20 test split for (a) Linear Regression, (b) SVR, (c) XGBoost, and (d) Random Forest.

Figure 5. Permutation importance of the 12 spatial parameters in the RF model. Error bars indicate standard deviations across repeated permutation calculations.

Figure 6. One-dimensional partial dependence plots (PDPs) showing average marginal response patterns of the four key predictors of cyclist-side SPLs: (a) W_vehicle, (b) W_side, (c) C_s(n), and (d) B_p. In (a–c), dashed lines indicate the 95% confidence intervals. In (d), the dashed line connects the average predicted SPL values for B_p = 0 and B_p = 1. Additional PDPs for the remaining predictors are provided in Supplementary Figure S1.

Figure 7. Two-dimensional PDPs showing exploratory average joint response patterns of predicted SPL for selected spatial-parameter combinations: (a) W_vehicle and W_side; (b) W_vehicle and C_s(n); (c) W_vehicle and C_s(m).

Table 1. Summary of spatial parameters used in the present study.

Variable	Description	Morphological Dimension	Zone	Expected Relevance to SPL
W_vehicle	Width of vehicle lane	Width configuration	Whole section	Affects direct sound propagation distance
W_side	Width of sidewalk	Width configuration	Whole section	Affects façade-reflection distance
B_p	Presence of a façade directly in front of the receptor (0/1)	Façade proximity	Receptor point	Captures immediate front-façade effect
H_(n)	Mean façade height in near zone	Vertical enclosure	Near zone	Describes local height-related reflection potential
H_(m)	Mean façade height in mid zone	Vertical enclosure	Mid zone	Describes intermediate height-related reflection potential
H_(f)	Mean façade height in far zone	Vertical enclosure	Far zone	Describes distant height-related reflection potential
C_s(n)	Cross-sectional enclosure degree in near zone	Section enclosure	Near zone	Describes local canyon-type enclosure
C_s(m)	Cross-sectional enclosure degree in mid zone	Section enclosure	Mid zone	Describes intermediate canyon-type enclosure
C_s(f)	Cross-sectional enclosure degree in far zone	Section enclosure	Far zone	Describes distant canyon-type enclosure
C_p(n)	Plan enclosure degree in near zone	Plan enclosure	Near zone	Describes local façade continuity/gap condition
C_p(m)	Plan enclosure degree in mid zone	Plan enclosure	Mid zone	Describes intermediate façade continuity/gap condition
C_p(f)	Plan enclosure degree in far zone	Plan enclosure	Far zone	Describes distant façade continuity/gap condition

Table 2. Predictive performance of Linear Regression, SVR, XGBoost, and Random Forest under three evaluation settings.

Evaluation Setting	Model	R²	RMSE (dB)	MAE (dB)
Illustrative random 80/20 split	Linear Regression	0.900	0.895	0.722
Illustrative random 80/20 split	SVR	0.990	0.289	0.173
Illustrative random 80/20 split	XGBoost	0.997	0.155	0.103
Illustrative random 80/20 split	Random Forest	0.996	0.183	0.112
Repeated random 80/20 splits	Linear Regression	0.897 ± 0.006	0.891 ± 0.019	0.718 ± 0.016
Repeated random 80/20 splits	SVR	0.989 ± 0.002	0.290 ± 0.031	0.181 ± 0.031
Repeated random 80/20 splits	XGBoost	0.996 ± 0.001	0.167 ± 0.011	0.107 ± 0.004
Repeated random 80/20 splits	Random Forest	0.996 ± 0.001	0.185 ± 0.012	0.113 ± 0.004
Road-name-based grouped holdout	Linear Regression	0.876 ± 0.027	0.951 ± 0.079	0.778 ± 0.073
Road-name-based grouped holdout	SVR	0.922 ± 0.018	0.755 ± 0.091	0.549 ± 0.084
Road-name-based grouped holdout	XGBoost	0.953 ± 0.018	0.583 ± 0.119	0.418 ± 0.082
Road-name-based grouped holdout	Random Forest	0.938 ± 0.041	0.662 ± 0.210	0.453 ± 0.128

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, H.; Wen, Q.; Li, X.; Kang, J. Predicting Bicycle-Lane Traffic Noise from Urban Street Morphology Using Interpretable Machine Learning Models. Buildings 2026, 16, 2023. https://doi.org/10.3390/buildings16102023

AMA Style

Wu H, Wen Q, Li X, Kang J. Predicting Bicycle-Lane Traffic Noise from Urban Street Morphology Using Interpretable Machine Learning Models. Buildings. 2026; 16(10):2023. https://doi.org/10.3390/buildings16102023

Chicago/Turabian Style

Wu, Hupeng, Qiang Wen, Xinxin Li, and Jian Kang. 2026. "Predicting Bicycle-Lane Traffic Noise from Urban Street Morphology Using Interpretable Machine Learning Models" Buildings 16, no. 10: 2023. https://doi.org/10.3390/buildings16102023

APA Style

Wu, H., Wen, Q., Li, X., & Kang, J. (2026). Predicting Bicycle-Lane Traffic Noise from Urban Street Morphology Using Interpretable Machine Learning Models. Buildings, 16(10), 2023. https://doi.org/10.3390/buildings16102023

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Predicting Bicycle-Lane Traffic Noise from Urban Street Morphology Using Interpretable Machine Learning Models

Abstract

1. Introduction

2. Materials and Methods

2.1. Overall Workflow

2.2. Street Samples

2.3. Spatial Parameters

2.4. Acoustic Simulation Settings

2.5. Model Development and Evaluation

2.6. Inter-Variable Dependence and Multicollinearity Diagnosis

2.7. Reduced-Variable Sensitivity Analysis

2.8. Model Interpretation

2.9. Computational Efficiency Comparison

3. Results

3.1. Descriptive Statistics, Preliminary Relationships, and Inter-Variable Dependence

3.2. Model Performance

3.3. Reduced-Variable Model Comparison

3.4. Relative Importance of Spatial Parameters

3.5. Model-Based Average Marginal Response Patterns of Spatial Parameters

3.6. Exploratory Interaction Patterns

3.7. Computational Efficiency

4. Discussion

4.1. Principal Findings in Relation to Previous Studies

4.2. Mechanistic Interpretation of Morphology–Noise Relationships

4.3. Planning Implications and Design Relevance

4.4. Interpretation Boundary and Methodological Implications

5. Conclusions

6. Limitations and Future Work

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI