1. Introduction
Passenger comfort in modern electric multiple units (EMUs) is strongly influenced by the tonal gear whine radiated by the traction gearbox, especially above 1 kHz, where other sources have already been mitigated [
1]. Although the baseline radiated noise is set by macro-level choices such as the helix angle or bearing topology, day-to-day variations perceived by travellers often originate from micrometre-scale corrections applied to the tooth profile during manufacturing or service—commonly termed microgeometry [
2]. Tooth repair, inclination, and carefully tuned profile relief can each shift the static transmission error and hence the acoustic signature by several decibels. Because these effects are nonlinear and frequency-dependent, engineers have traditionally relied on finite-element and multi-body simulations that require hours of CPU time per design iteration [
3]. While such computer-aided engineering (CAE) chains offer detailed insight, their computational cost renders them impractical for concept-stage trade-offs, where hundreds of candidate geometries must be sifted through rapidly.
To accelerate early design loops, recent studies have explored data-driven surrogates ranging from random forests to radial-basis-function networks [
4,
5]. Most of these algorithms, however, treat the design space as homogeneous, ignoring the fact that noise sensitivity can change abruptly when, for instance, a mesh-order harmonic crosses a structural resonance. Inspired by spatial interpolation used in geostatistics, we couple unsupervised k-means clustering with inverse distance weighting (IDW) to create a surrogate that respects local smoothness while guarding against spurious extrapolation. The proposed workflow replaces the time-consuming CAE simulation loop with a machine learning model (
Figure 1). The clustering–IDW process is outlined in
Figure 2.
The present study draws on a public dataset of 200 Romax trajectories for a 1:4 single-stage EMU gearbox, published by Tang et al. [
6]. Within this library, microgeometry parameters vary across realistic production tolerances: tooth repair depth (0–15 µm), inclination (−15° to +8°), and tip relief amplitude (0–18 µm). By partitioning this space into similarity clusters and performing IDW inside each cluster, the surrogate delivers root-mean-square (RMS) sound power predictions in milliseconds while retaining a coefficient of determination of 0.75 and a mean absolute error of 2.3 dB on unseen samples. These figures are comparable to the repeatability of the underlying CAE model and outperform classical Kriging and radial-basis-function baselines evaluated under identical splits [
6,
7].
Beyond pure speeding up, the hybrid surrogate offers two practical advantages. First, its transparent weighting scheme can be interrogated to reveal which neighbour designs dominate a given prediction, providing engineers with intuitive design leverage maps. Second, the decoupling of clustering and interpolation means that additional variables—such as loading torque or bearing clearance—can be appended without retraining the entire model, an asset for multidisciplinary optimisation. Collectively, these features position the method as a viable front-end to automated gear-train optimisation workflows, embedding acoustic targets directly into gearbox concept designs.
The aim of this paper is to (i) develop a data-driven surrogate that couples k-means clustering with inverse distance weighting (CLS-IDW) for millisecond-scale prediction of EMU gearbox noise, (ii) benchmark the model’s accuracy and computational cost against established alternatives, and (iii) perform partial sensitivity analyses to pinpoint the microgeometrical parameters that most strongly influence sound power.
2. Materials and Methods
The surrogate was trained on the open EMU gearbox library compiled by Tang et al. [
6]. The archive contains 200 steady-state operating points generated with Romax 2023.1 [
8] for a 400 kW, single-stage, helical gearbox (gear ratio 1:4). Each record stores the following:
Tooth repair depth Δr (μm): 0–15 μm.
Tooth inclination α (deg): –15°–+8°.
Tip relief amplitude Δt (μm): 0–18 μm.
RMS sound power level: Lw (dB).
The influence of microgeometry corrections on RMS sound power under the two analyzed working conditions is summarized in
Figure 3.
Table 1 summarises the statistical descriptors (mean, standard deviation, min–max) of the input variables used in the model.
Figure 4 shows the smoothed surface plots of the original simulation data across selected parameter pairs.
2.1. Clustering–IDW Surrogate
A two-stage workflow (illustrated in
Figure 2) was implemented in the ODYSSEE A-Eye 24.1 environment [
9]:
Partitioning. The input cloud is first segmented with k-means clustering (k = 2) using the squared-Euclidean distance. The choice k = 2 minimises the Bayesian Information Criterion while preserving sufficient samples per cluster (n ≥ 80).
Local interpolation. Within each cluster, the sound power level of a new query x is approximated by inverse distance weighting (IDW), with m = 10 nearest neighbours and exponent p = 10. The high exponent prevents extrapolation by reducing weights outside the local area.
Hyperparameters (k, m, p) were tuned on a 90% training subset by grid search, aimed at maximising the coefficient of determination (R2).
To investigate hyperparameter influence, we conducted a grid search over k ∈ {3, 5, 10, 15, 20} and p ∈ {–1, –2, –5, –10, –20}, keeping the original 90/10 train–test split. The selected configuration of k = 10 and p = −10 was chosen for its robustness and extrapolation control. This setting consistently yielded the R2 = 0.75 reported in the Results section. These experiments helped guide parameter selection for final testing.
2.2. Benchmark Models and Validation Protocol
To benchmark the proposed surrogate, two classical regressors were re-implemented over the same dataset and splits:
Ordinary Kriging with a Gaussian kernel and noise term σ
2 = 0.01 dB
2 [
10].
Radial-Basis-Function (RBF) Network with thin-plate splines and shape parameter ε = 1 [
4].
Data were shuffled once and divided at 90/10 into a training (180 points) and hold-out test sets (20 points). Performance was assessed using R2 and mean absolute error (MAE). Computational cost was tracked as average wall-clock time per query on an Intel® Core™ i7-(3.5 GHz, 128 GB RAM manufactured by Intel Corporation, Santa Clara, CA, USA).
2.3. Software Environment
All pre-processing and statistical analyses were conducted in Python 3.11 with scikit-learn 1.5 for clustering, pyKriging 1.6 for Kriging, and NumPy 2.0 for linear algebra. The IDW routine was implemented natively inside ODYSSEE’s embedded Jupyter kernel for tight coupling with the platform’s graphical design space exploration tools.
3. Results and Discussion
Unlike the study by Tang et al. [
6], which pursued global multi-condition optimisation with a random forest + Sparrow-Search framework and its own simulation campaign, the present work targets a different gap: we show that a transparent k-means + inverse distance weighting surrogate, trained exclusively on the publicly available Tang dataset, can deliver millisecond-scale noise predictions and explicit sensitivity maps that are suitable for rapid, concept-phase design decisions.
3.1. Surrogate Accuracy
The hybrid clustering–IDW (CLS-IDW) surrogate reproduced the 20 unseen test points with a coefficient of determination of R
2 = 0.75 and a mean absolute error of 2.3 dB. The predictive accuracy of the CLS_IDW model is visualised in
Figure 5. The near-unity slope (0.97) and negligible intercept (−0.4 dB) confirm the absence of systematic bias. A detailed comparison with the two baseline regressors is given in
Table 2.
The surrogate therefore achieves parity with the best statistical baseline while being 6× faster than the next-quickest model and three orders of magnitude faster than a full CAE rerun (≈1 h on the same hardware).
3.2. Local Sensitivities
Partial-dependence sweeps reveal distinct trends inside each cluster. In the low-inclination cluster (α > −3°), a 10 µm tooth repair depth lowers the RMS sound power by ≈2 dB, whereas further deepening yields diminishing returns. In the high-inclination cluster (α ≤ −3°), an additional −4° inclination offers a 1–2 dB benefit that stacks with repair, delivering a combined 3–5 dB reduction. The tip relief amplitude exerts only a secondary influence (<0.5 dB across its full span), consistent with simulations by Hu et al. (2016) [
11].
3.3. Interpretation of Cluster Boundaries
The two clusters effectively separate designs whose dominant mesh-order excitation falls on opposite sides of a prominent housing resonance in the mid-frequency range. Within each cluster, the noise varies smoothly; outside, the slope of the response surface changes sign, justifying the choice of a piece-wise surrogate. The rapid weight decay (exponent
p = 10) guards against unwarranted extrapolation across this boundary, improving robustness relative to global kernels [
2].
3.4. Practical Implications
For concept design, the surrogate provides an interactive map of design leverage. Engineers can identify iso-noise contours and steer microgeometry targets without resorting to time-consuming CAE reruns. The method works across gearbox sizes and can be used in other rail or automotive units if simulation data is available [
12]. Importantly, the transparent mathematical form—simple Euclidean distances and weights—facilitates scrutiny under functional safety audits, an emerging requirement for data-driven tools in rail applications.
3.5. Uncertainty Quantification
To assess the prediction reliability of the CLS–INVD surrogate model, local 95% prediction intervals (PIs) were estimated using leave-one-out residuals within each cluster. On the held-out test set, the intervals successfully captured approximately 90% of the true noise values, which aligns well with typical NVH measurement repeatability ranges. As shown in
Figure 6, vertical grey bars represent the estimated prediction intervals on the parity plot. Notably, 80% of the test points exhibit PI widths narrower than ±4 dB, which is considered acceptable for preliminary gearbox design and microgeometry optimisation stages.
3.6. Extrapolation Behaviour
Table 3 summarises in-domain versus outer-10% performance. As expected, the steep inverse distance weighting limits extrapolation: errors grow by \~0.3 dB MAE, and R
2 drops by \~0.12. This controlled degradation is preferable to unrestricted global models, which can yield unphysical noise reductions outside the training hull.
3.7. Comparison with Additional ML Baselines
Besides Kriging and RBF, a random forest regressor with 100 trees was trained on the same split. It achieved R2 ≈ 0.86 and MAE ≈ 1.9 dB, confirming that tree-based models can offer high accuracy but lack the geometric interpretability of CLS-IDW that is required for regulatory audits. Support Vector Regression (SVR) and XGBoost have been reported to achieve similar accuracy in related NVH studies, yet both demand extensive hyperparameter tuning and are not as transparent for post hoc design leverage analysis.
3.8. Practical Integration into OEM NVH Workflow
The millisecond response time (<1 ms per query) allows the surrogate to be embedded directly into early CAD configurators or gear-cutting CAM software (e.g., Hexagon Manufacturing Intelligence ODYSSEE A-Eye, version 24.1, Cobham, UK), giving real-time feedback to designers. Because the weighting scheme is fully traceable, the model satisfies functional safety review requirements (ISO 26262-10 [
13]) and can be deployed in NVH gates for passenger comfort assurance.
3.9. Multi-Condition Outlook
While the headline results focus on the high-speed working condition, the cleaned dataset also includes a continuous-operation subset. Preliminary tests indicate that a joint surrogate across both conditions maintains strong predictive performance (R2 > 0.72, with <3 dB MAE). Extending the framework to acceleration and start-up conditions is therefore straightforward and will be reported in future work.
4. Conclusions
This study demonstrated that a two-stage surrogate—k-means clustering followed by inverse distance weighting (CLS-IDW)—can predict the broadband sound power level of an EMU traction gearbox from micrometre-scale tooth-geometry corrections in milliseconds. The surrogate was not trained on proprietary data: it draws exclusively on the open, 200-case Romax dataset published by Tang et al. [
6] for a 1:4 single-stage gearbox, so no additional simulations were generated in the present work.
Using 180 points for fitting and 20 for blind testing, the model reproduced unseen operating points, with R2 = 0.75 and a mean absolute error of 2.3 dB, matching the repeatability of the underlying CAE solver while delivering a three-order-of-magnitude speed-up over a full finite-element rerun (≈0.8 ms vs. ≈1 h per query).
Sensitivity sweeps revealed that combining a −7° tooth inclination correction with a 10 µm tooth repair depth can lower the root-mean-square sound power by 3–5 dB, whereas the tip relief amplitude has only a secondary influence (<0.5 dB). The piece-wise interpolation also exposes which neighbouring designs dominate each prediction, giving engineers an intuitive “leverage map” for early-stage trade-offs.
The CLS-IDW framework can later include load, speed, or bearing clearance and support balancing acoustic and efficiency goals. By replacing hours-long simulation chains with an interpretable, millisecond-scale predictor, the approach offers a practical path to embed noise targets directly into gearbox concept design, shortening development cycles while safeguarding passenger comfort.
Taken together, the findings confirm that the objectives set out in the Introduction—developing a rapid CLS-IDW surrogate, benchmarking its performance against conventional methods, and identifying the microgeometry factors that govern EMU gearbox noise—have been fully achieved.
Future work will integrate the uncertainty-aware CLS-IDW surrogate into a multi-condition optimisation loop, enabling simultaneous minimisation of noise and power-loss targets across the full EMU duty cycle.
Author Contributions
Conceptualisation, A.Z.; methodology, K.H.; software, K.H.; validation, K.H.; formal analysis, K.H.; investigation, K.H.; resources, K.H.; data curation, K.H.; writing—original draft preparation, K.H.; writing—review and editing, K.H.; visualisation, A.Z.; supervision, A.Z.; project administration, A.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This research was supported by the EKÖP-25-3-I-SZE-82 University Research Scholarship Program of the Ministry for Culture and Innovation from the source of the National Research, Development and Innovation Fund.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data presented in this study are available on request from the corresponding author.
Acknowledgments
The authors would like to thank Hexagon Manufacturing Intelligence for providing academic access to the ODYSSEE A-Eye software (version 2024.2). All individuals acknowledged have provided their consent to be included in this section.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Kim, S.; Mobis, H.; Park, I.; Park, C. Gear Optimisation for Noise Reduction of EPB Actuator. In Proceedings of the Europe’s Braking Technology Conference & Exhibition, Online, 17–21 May 2020; EB2020-EBS-007. Available online: https://fisita-wix.s3-eu-west-1.amazonaws.com/promo/EuroBrake-2021-Preliminary-Programme.pdf?utm_source=chatgpt.com (accessed on 5 November 2025).
- Beinstingel, A.; Keller, M.; Heider, M.; Pinnekamp, B.; Marburg, S. A hybrid analytical-numerical method based on isogeometric analysis for determination of time-varying gear-mesh stiffness. Mech. Mach. Theory 2021, 160, 104291. [Google Scholar] [CrossRef]
- Mao, K. An approach for power-train gear transmission-error prediction using the nonlinear finite-element method. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2006, 220, 1455–1463. [Google Scholar] [CrossRef]
- Ghosh, S.; Chakraborty, G. On optimal tooth-profile modification for reduction of vibration and noise in spur-gear pairs. Mech. Mach. Theory 2016, 105, 145–163. [Google Scholar] [CrossRef]
- Huang, P.; Xu, L.; Luo, C.; Zhang, J.; Chi, F.; Zhang, Q.; Zhou, J. A study on noise reduction of gear pumps of wheel loaders based on the ICA model. Int. J. Environ. Res. Public Health 2019, 16, 999. [Google Scholar] [CrossRef] [PubMed]
- Tang, Z.; Lu, M.; Wang, M.; Sun, J. Research on modification and noise-reduction optimisation of EMU traction gear. PLoS ONE 2024, 19, e0298785. [Google Scholar] [CrossRef]
- Lu, M. Correlation of modification parameters with noise in each working condition.xlsx. figshare. Dataset. Port Digit. Sci. 2023. [Google Scholar] [CrossRef]
- Romax Technology. RomaxDESIGNER, v14; User Documentation; Romax Technology: Nottingham, UK, 2023; Available online: https://www.romaxtech.com/software/romax-designer/ (accessed on 5 November 2025).
- Hexagon. ODYSSEE A-Eye, v24.1; Product Brochure; Hexagon: Cobham, UK, 2024; Available online: https://hexagon.com/products/odyssee?tabId=tab-690C6655F1D74C089356830F00306D55-2-1 (accessed on 5 November 2025).
- Oh, S.; Kang, K.; Soh, K.; Kim, J. Whine noise development of engine timing gear system in heavy-duty vehicle. In Proceedings of the ASME 2011 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference (IDETC/CIE 2011), Washington, DC, USA, 28–31 August 2011; pp. 357–363. [Google Scholar] [CrossRef]
- Hu, Z.; Tang, J.; Zhong, J.; Chen, Y. Effects of tooth-profile modification on dynamic responses of a high-speed gear-rotor-bearing system. Mech. Syst. Signal Process. 2016, 76–77, 294–318. [Google Scholar] [CrossRef]
- Granados-Ortiz, F.; Ortega-Casanova, J. Machine-learning-aided design optimisation of a mechanical micromixer. Phys. Fluids 2021, 33, 063604. [Google Scholar] [CrossRef]
- ISO 26262-10:2018; Road Vehicles—Functional Safety Part 10: Guidelines on ISO 26262. ISO: Geneva, Switzerland, 2018.
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).