Next Article in Journal
Federated Learning-Enabled Building Stock Modeling for Privacy-Preserving Embodied Carbon Benchmarking in Residential Construction
Previous Article in Journal
Multi-Agent Systems and Digital Twins as a Basis for Smart Buildings with Integrated Sustainable Efficient Ventilation
Previous Article in Special Issue
Materials Selection in Biophilic Building Design: Multisensory Perception and Psycho-Physical Mapping of Wood Materials
 
 
Due to scheduled maintenance work on our servers, there may be short service disruptions on this website between 11:00 and 12:00 CEST on March 28th.
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation of Psychoacoustic Machine Learning Assessment Method for Predicting Occupant Well-Being

1
Department of Mechanical Engineering, National Taipei University of Technology, Taipei 10608, Taiwan
2
Department of Building Environment and Energy Engineering, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, China
3
Department of Computing, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, China
4
Faculty of Dentistry, The University of Hong Kong, Pok Fu Lam, Hong Kong Island, Hong Kong, China
*
Author to whom correspondence should be addressed.
Buildings 2026, 16(5), 1027; https://doi.org/10.3390/buildings16051027
Submission received: 7 February 2026 / Revised: 1 March 2026 / Accepted: 3 March 2026 / Published: 5 March 2026

Abstract

In modern buildings, the air-conditioned indoor environment is vital for occupant productivity and well-being, yet fan noise and airflow turbulence can significantly compromise these benefits. Human–environmental interactions are complex processes that traditional energy-based acoustic metrics are often insufficient to model. Therefore, this study aims to advance the multidimensional sound quality assessment framework for building acoustics. Three methods, the conventional regression approach (CRA), general prediction model (GPM), and psychoacoustic machine learning (PML) assessment methods, were evaluated for predicting three perceptual dimensions (Evaluation, Potency, Activity; EPA) and negative noise impacts on occupant well-being (O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable). Based on 432 multidimensional sound quality assessments across four general types of air-conditioned built environments, the PML achieved the best goodness-of-fit for the EPA-score perdition (adjusted R2 = 0.61) compared to CRA (0.32) and GPM (0.15) and effectively predicted all negative noise impacts (adjusted R2 = 0.53–0.61). The PML assessment method offers a smart and reliable solution for sound quality and well-being prediction through psychoacoustic heatmaps encoding time-varying psychoacoustic features in 227 × 227 pixels from 30 s soundtracks of the built environment for sustainable building design.

1. Introduction

The field of building acoustics [1,2] encompasses the assessment and control of noise within built environments, with particular emphasis on how environmental sound characteristics affect the well-being of building occupants [3]. Modern buildings increasingly rely on air-conditioning systems as integral components for maintaining satisfactory indoor environmental conditions [4]. These systems inevitably generate noise through multiple mechanisms, including mechanical fan operation and aerodynamic interactions where turbulent airflow encounters duct discontinuities [5,6]. Consequently, systematic acoustic assessment has emerged as a critical requirement for mitigating these issues [7]. Conventional acoustic assessment approaches [8,9] for air-conditioned built environments utilize established acoustic metrics including A-weighted equivalent continuous sound pressure level (LAeq), noise criteria (NC), noise rating (NR), and room criteria (RC). While these single-value metrics offer straightforward acceptability assessments of indoor noise conditions, they possess inherent limitations in characterizing the acoustic properties and complex psychological dimensions of occupant experience that extend beyond simple magnitude sensitivity [10]. Historically, noise assessment, prevention, prediction methodologies, and management frameworks have prioritized objective acoustic characterization, largely because objective metrics present fewer measurement and prediction challenges compared to subjective human responses [11]. Contemporary understanding emphasizes that human–environmental interactions constitute the foundation for advancing acoustic assessment in building acoustics. This recognition establishes multidimensional sound quality assessment [12] as fundamental for effectively analyzing human–environmental relationships and supporting evidence-based building design.
Public recognition of the health consequences [13] of daily noise exposure has increased following the update of the World Health Organization (WHO) noise guidelines in 2018 [14]. Daily noise exposure is linked to discomfort as well as stress biomarkers, disease progression, and mortality [15]. Undesired acoustic environments are recognized to cause stress [16], annoyance [17], and discomfort [18], all of which exert negative impacts on occupant well-being [19]. Human–environmental interactions with acoustic environments include sequential processes from sound generation and transmission to perception, ultimately concluding in diverse noise impact outcomes. Although ISO 15,666 [20] and ISO 12,913 standards [21,22,23] address subjective assessment of perceptual responses, these assessments exhibit substantial variability among building acoustics studies [24,25]. The semantic differential method [26], which employs questions constructed from bipolar semantic pairs representing opposing meanings [27,28], is a commonly adopted psychological scale for the quantitative evaluation of objective phenomena. A comprehensive systematic review [29] analyzing applications of the semantic differential method for subjective assessment discovered the fundamental structure of sound perception in terms of the Evaluation, Potency, and Activity (EPA) framework. The Evaluation (E-), Potency (P-), and Activity (A-) dimensions respectively represent general judgment, sensation of sound energy content, and sensation of the temporal and spectral content of sounds. Hence, acoustic metric selection for sound quality assessment should incorporate measurements addressing energy, spectral, and temporal characteristics of sound.
In objective acoustic characterization, psychoacoustic metrics comprise a collection of objective measures designed to quantify actual sound sensations, grounded in the psychoacoustic scale introduced in 1961 [30]. Total loudness (N) is the most widely recognized psychoacoustic metric, quantifying loudness perception with calculation procedures standardized in ISO 532-1 [31]. The N measurement provides valuable supplementation to conventional acoustic assessment methods for assessing indoor acoustic comfort [32] compared to NC and NR. In addition, sharpness (S), roughness (R), and fluctuation strength (FS) are usually accepted psychoacoustic metrics for evaluating the spectral properties of sounds. These metrics respectively quantify sharpness perception via energy skewness calculations, roughness perception through rapid amplitude modulation measurement, and fluctuation strength perception via slower amplitude modulation assessment [33]. Applying statistical sound level metrics such as LA90 and LA10, rather than time-equivalent levels, enables superior differentiation between ambient noise levels and event noise levels through consideration of temporal characteristics. These psychoacoustic metrics and statistical noise levels were also found to be significantly correlated with perceptual assessments across the E-, P-, and A-dimensions [34,35].
Contrasting with objective acoustic characterization, developing reliable, valid, and practical psychometric instruments for assessing multidimensional perceptual impacts remains an ongoing research priority. Various subjective assessment approaches, together with their theoretical factor models, have been proposed across different investigations [36]. However, analytical inconsistencies among these studies limit cross-study comparability. The Psychoacoustics Perception Scale (PPS) was therefore constructed based on the EPA framework to systematically evaluate all essential perceptions within the fundamental human perceptual dimensions of sound [12]. Identification of the EPA framework offers a promising approach for addressing challenges posed by extensive factor variability in multidimensional environmental sound quality assessment. Moreover, the EPA framework establishes a comparative structure across different studies and provides a basis for predicting negative noise impacts associated with occupant well-being. Recent research [37] has successfully applied multidimensional sound quality assessments to outdoor soundscapes by combining a comprehensive set of acoustic and psychoacoustic metrics with the PPS. This study evaluated the occupant well-being-related noise impacts (e.g., O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable), which the WHO links to the disease burden of environmental noise. The results confirmed the value of the EPA framework in multidimensional sound quality assessments and facilitated the development of general prediction models (GPMs) that predictively model perceptual influences on occupant well-being.
Other than the conventional regression approach (CRA), the development of artificial neural networks (ANNs) with feature learning capabilities represents a significant advancement in machine learning (ML) algorithms for predictive modeling of perceptual influences on occupant well-being. While ANNs inspired by biological neural networks [38] have revolutionized fields like image classification [39], particularly through non-saturating ReLU activation functions [40], their application in psychoacoustics remains limited. A systematic search of the ScienceDirect database revealed that most “psychoacoustic machine learning” (PML) studies focused on objective signal classification tasks such as heartbeat analysis [41], vehicle diagnostics [42], and sound categorization [43,44,45], rather than on subjective human impact. Only a few studies have applied PML to predict subjective noise impacts [46,47,48], highlighting a gap in intelligent, holistic methodologies for assessing air-conditioned built environments.
To address the limitations of CRA assessment methods in capturing temporal variations and spectral complexities, a prior study [49] developed the psychoacoustic heatmap machine learning model (PHMLM). This innovative method incorporated the EPA framework to transform dynamic psychoacoustic metrics into detailed visual formats for forecasting noise effects. Unlike classical Mel-spectrograms [50], which display energy in constant frequency bands without accounting for human auditory sensations such as loudness and sharpness, the PHMLM utilized a preprocessing technique that reduced input data size by approximately 5000-fold. By encoding multidimensional acoustic information into 227 × 227-pixel psychoacoustic heatmaps with 0.002 s temporal resolution, the PHMLM efficiently captured energy, spectral, and temporal content for deep neural network analysis of multidimensional sound quality. This prior work [49] demonstrated that PML assessment methods could achieve predictive consistency with established regression methodologies (including GPM and CRA), providing a foundation for the comparative evaluation presented in this study.
The primary objective of this research is to evaluate different assessment methods (CRA, GPM, and PML) for predicting sound quality and subsequent negative noise impacts on occupant well-being in air-conditioned built environments using multidimensional objective acoustic characteristics. Detailed model development procedures have not been repeated in Section 2, as the historical progression of different methods used to analyze the complex interaction between the acoustic environment and occupant perception has been comprehensively presented in prior published studies (CRA [12,23], GPM [37], and PML [49]). Compared to these prior studies, this work delivers a comprehensive performance evaluation, cross-method comparison, and hybrid integration analysis through balanced sampling of air-conditioned environments, independent subjective perceptual response measurements beyond the original development datasets, and cross-domain generalization testing (outdoor GPM with indoor PML). These advances identify the relative strengths of each method and optimal integration strategies for building acoustic design. This significance of this research lies in delivering a smart, reliable, efficient, and user-friendly assessment method that can inform sustainable building design and enhance indoor acoustic environments by predicting occupant well-being responses to air-conditioning noise within built environments.

2. Materials and Methods

2.1. Multidimensional Sound Quality Assessment Framework

A holistic approach to sound quality assessment [12,37,49] is essential for capturing the complex interaction between the acoustic environment and occupant perception. As illustrated in Figure 1a, this study adopts a comprehensive framework comprising four key stages I-IV for objective and subjective data collection:
  • Stage I. Objective acoustic characterization: This stage involves measuring the physical properties of sound using acoustic and psychoacoustic metrics (see Figure 1b) that capture energy content (e.g., LAeq, RC, N), spectral content (e.g., S, R, and FS), and temporal content (e.g., LA10LA90 and N5N95);
  • Stage II. Fundamental perceptual responses: This stage measures the fundamental subjective perceptual responses to sound using the PPS, which consists of nine questions (E1: Quiet–Noisy, E2: Relaxed–Tense, E3: Pleasant–Unpleasant, P1: Quiet–Loud, P2: Light–Heavy, P3: Weak–Strong, A1: Deep–Metallic, A2: Low–High, and A3: Dull–Sharp) on a 7-level semantic differential scale (see Figure 1c);
  • Stage III. EPA framework: This stage categorizes subjective responses into three fundamental dimensions (E, P, and A), serving as the core link in the human–environmental interaction process between physical sound stimuli and occupant perception (see Figure 1c);
  • Stage IV. Occupant well-being-related noise impacts: The final stage evaluates the consequently negative impacts on occupant well-being (see Figure 1c), specifically Discomfort (O1), Annoying (O2), Stressful (O3), and Unacceptable (O4).
This multidimensional structure provides a standardized ground-truth for comparative analysis, ensuring that the CRA [12,23], GPM [37], and PML [49] assessment methods are evaluated against a consistent and holistic set of inputs from objective acoustic characterization to predict sound quality and subsequent negative noise impacts on occupant well-being (arrows in Figure 1a).

2.1.1. Objective Data Collection in Air-Conditioned Built Environments

To capture a representative sample of indoor acoustic conditions, sound recordings were conducted in four distinct air-conditioned built environments: (1) Library, (2) Classroom, (3) Lecture Hall, and (4) Office (see Figure 1b). These four air-conditioned built environments (Library, Classroom, Lecture Hall, and Office) represent essential daily spaces where sustained HVAC noise exposure critically impacts occupant concentration, learning, communication, and productivity. Libraries demand quiet for individual study and reading, classrooms support interactive teaching and group discussions, lecture halls accommodate large-group instruction and presentations, and offices facilitate collaborative work and focused tasks, each presenting distinct psychoacoustic challenges from HVAC systems. Their selection reflects prevalent built environments in educational institutions and workplaces where air-conditioning noise management directly influences occupant well-being and performance. These space types encompass the majority of indoor scenarios encountered daily by students, faculty, and professionals. Detailed acoustic characteristics of these environments were analyzed and compared in Section 3 to better understand the differences between environments. Soundtracks with an equal number from different air-conditioned built environments were randomly selected in this study from the prior PML development study [49]. Although identical to prior study recordings, independent listening tests were conducted to collect subjective perceptual responses, eliminating data leakage between the training phase of the prior study and the multidimensional sound quality assessments in this study. All 30 s soundtracks were recorded using a high-precision two-channel handheld analyzer (Type 2270; Brüel & Kjær, Nærum, Denmark) equipped with binaural recording microphones. The recording position was standardized at a height of 1.2 m above the floor to simulate the typical ear height of a seated occupant. A total of 432 30 s soundtracks were extracted from the recordings for both objective and subjective analysis of noise in air-conditioned built environments. Conventional acoustic metrics were recorded directly from the sound analyzer, while psychoacoustic metrics were computed using MATLAB R2024a (MathWorks, Natick, USA). The detailed calculation equations and unit definitions based on ISO 532-1 [31] and Zwicker’s book [30] are provided in Appendix A.

2.1.2. Subjective Perceptual Response Measurement

Jury listening tests were conducted to collect subjective responses to the recorded soundtracks (see Figure 1b). The tests were conducted in an anechoic chamber at the Hong Kong Polytechnic University to eliminate the influence of background noise and room reflections. To minimize the influence of confounding environmental factors, the anechoic chamber maintained a controlled temperature of 25 ± 2 °C with adequate lighting conditions. The 30 s soundtracks were replayed using an omni-directional dodecahedron loudspeaker (Type LS02; Acoem, Limonest, France) positioned 1.2 m from the listener, matching the original recording height. To ensure accurate reproduction, the sound pressure level (SPL) was calibrated using a 1 kHz reference tone at 60 dB measured at the listener’s ear position. Before the listening tests, written informed consent was obtained from all participants, and demographic details (gender and age) were documented. Individuals with self-reported hearing impairments or chronic noise exposure were excluded to ensure data validity. This criterion minimized potential biases from undetected auditory threshold shifts that could confound perceptual judgments.
Participants evaluated a total of 12 soundtracks using a self-administered questionnaire structured in two parts to capture the subjective responses defined in the assessment framework (see Figure 1c). All 12 soundtracks per participant were randomly assigned. Individual participant sets were not environment-balanced (due to randomization), but collectively ensured the same number of assessments per environment type. A 1 min idling period separated consecutive assessments. Part A employed the PPS to quantify sound quality perceptions across the three fundamental E-, P-, and A- dimensions. Part B assessed occupant well-being-related noise impact, where participants rated the specific negative outcomes (O1O4) on a 7-point Likert scale. This dual-part instrument ensured that both the intermediate perceptual attributes and the consequential well-being impacts were systematically recorded for every acoustic stimulus.

2.2. Assessment Methods for Predictive Modeling of Subjective Responses

Three assessment methods (CRA, GPM, and PML) were applied and compared in this study to evaluate the feasibility of predicting occupant well-being.

2.2.1. Conventional Regression Approach (CRA)

The CRA represents the traditional methodology in environmental noise studies [23] involving subjective data. The objective acoustic and psychoacoustic metrics are directly used as independent variables in linear regression models (see Equation (1)) to predict subjective responses such as EPA-scores in the EPA framework.
y = B 1 x 1 + B 2 x 2 + B 3 x 3 + B n x n + C ,
where y is the dependent variable, B1Bn are the regression coefficients (unstandardized coefficients) for the independent variables (x1xn) as predictors, and C is the y-intercept.

2.2.2. General Prediction Model (GPM)

The fundamental distinction between the GPM and the CRA lies in their reliance on experimental data. While the CRA fits a regression model to the specific participant responses observed in this study, the GPM operates as a standalone predictive tool. In this context, the GPM is the “pure-objective” approach developed for occupant well-being prediction [37]. It derives predictions directly from acoustic and psychoacoustic metrics using established formulations, without further adjustment to the subjective data collected in this specific experiment.
By bypassing the need for dataset-specific correction, the GPM evaluates the generalizability and direct applicability of using established objective equations to predict subjective EPA-score (GPMEPA, Equation (2)) and occupant well-being-related noise impacts (GPAO1–4, Equations (3)–(6)). Although the GPM was originally formulated for broader applications in outdoor soundscape conditions, this study specifically applied the GPM to test its ability in indoor air-conditioned built environments.
G P M E P A = 1.03 × L A 10 L A 90 + 0.68 × L A 50 0.28 × N 30.4 × R 39.7 ,
GPM O 1 :   discomfortable   = 0.17 × G P M E P A 5.51 × F S + 3.7 × R 5 + 0.04 × L Z e q 0.03 × L A 90 + 2.06 ,
G P M O 2 :   A n n o y i n g = 0.18 × G P M E P A 5.86 × F S 0.04 × f M o d + 4.77 ,
G P M O 3 :   S t r e s s f u l = 0.18 × G P M E P A + 3.86 ,
G P M O 4 :   U n a c c e p t a b l e = 0.17 × G P M E P A 5.58 × S + 8.81 × R 5 + 3.77 .

2.2.3. Psychoacoustic Machine Learning (PML) Assessment Method

The third and most advanced approach evaluated in this study is the PML assessment method, which utilizes the PHMLM [49], originally developed for EPA-score prediction. Unlike the equation-based GPM, the PML is a data-driven framework designed to capture complex, non-linear relationships between acoustic features and subjective perception.
For each 30 s soundtrack containing air-conditioning sounds, a corresponding psychoacoustic heatmap was generated as a 227 × 227-pixel grayscale image (see Figure 1b). These dimensions align with the input requirements of the AlexNet architecture [39] (see Table 1). Detailed psychoacoustic heatmap generation procedures, including matrix selection and resolution rationale, can be found in the study of PML development [49]. The heatmap encodes temporal variations in the four most commonly applied psychoacoustic metrics in environmental noise studies (N, R, S, and FS), calculated with a time resolution of 0.002 s. This resolution utilizes 96 samples at a 48 kHz sample rate, following ISO 532-1 standards. The specific covering durations and maximum intensities mapped to the pixel value of 255 (white) were set as follows:
  • Rows 1–57: N covering 25.878 s, with a maximum intensity of 25.5 sone (0.1 sone/pixel);
  • Rows 58–114: S covering 25.878 s, with a maximum intensity of 2.55 acum (0.01 acum/pixel);
  • Rows 115–171: R covering 25.878 s, with a maximum intensity of 0.31875 asper (0.00125 asper/pixel);
  • Rows 172–227: FS covering 25.424 s, with a maximum intensity of 0.255 vacil (0.001 vacil/pixel).
For this comparative study, the PML model was deployed using the pre-trained PHMLM-EPA established in prior work [49]. As the training phase (including hyperparameter optimization and stochastic gradient descent with momentum (SGDM) solver configuration) was completed and validated in the prior study, this research utilizes the fully trained network directly to generate predictions for the current dataset. This ensures that the comparison between PML, GPM, and CRA focuses strictly on predictive performance.

2.3. Statistical Analysis

All statistical analysis were executed using IBM SPSS Statistics 26 (IBM Corp., Armonk, NY, USA) at a significance level of 0.05 [51]. As a preliminary step, the Shapiro–Wilk test [52] was applied to assess the normality of the dataset. Given that the acoustic metrics and subjective responses demonstrated non-normal distributions (p-values < 0.05), non-parametric statistical methods were adopted for the two-tailed statistical tests in this study. To characterize the acoustic variations within the study, the Kruskal–Wallis test was employed to evaluate statistically significant differences in acoustic characteristics and subjective responses across the different types of air-conditioned built environments. Upon the detection of significant variances (p-values < 0.05), Dunn’s post hoc test with Bonferroni adjustments was performed to identify specific pairwise differences between different built environments. The internal consistency of the semantic differential scales was validated using Cronbach’s α (> 0.60) [53], justifying the reliability of combining individual items into composite E-, P-, and A-scores.
The comparative evaluation of the predictive modeling approaches (CRA, GPM, and PML) focused on two key performance indicators: Spearman’s rank correlation coefficient (ρ) and the adjusted coefficient of determination (adjusted R2). Spearman’s ρ was applied to test the strength and direction of the monotonic relationships (from −1 to 1) between the model-predicted values and the ground-truth experimental data, both for EPA-score and occupant well-being-related noise impacts (O1–O4).
ρ = 1 6 d i 2 n ( n 2 1 ) ,
where n is the total sample size and di is the difference between the two ranks of each sample.
The adjusted R2 served as a metric for goodness-of-fit, quantifying the proportion of variance in the subjective responses that could be explained by each model while penalizing model complexity. A higher adjusted R2 value indicates superior explanatory power, allowing comparison between the simpler regression-based approaches (CRA and GPM) and the more complex PML assessment methods.
A d j u s t e d   R 2 = 1 1 i = 1 n y i ^ y ¯ 2   i = 1 n y i y ¯ 2   × ( n 1 ) ( n p 1 ) ,
where n is the total sample size, p is the number of predictors, and y i ^ and y ¯ are the predicted values and the mean of y, respectively.

3. Results

3.1. Descriptive Statistics of Multidimensional Sound Quality Assessment

A total of 432 jury listening tests were conducted for analysis, derived from 36 participants (69.4% male, 30.6% female; mean age 24.1 ± 2.2 years) evaluating 12 soundtracks each. The number of sound quality assessment in this study was balanced across four different types of air-conditioned built environments: (1) Library, (2) Classroom, (3) Lecture Hall, and (4) Office, with 108 assessments per type.

3.1.1. Multidimensional Objective Acoustic Characterization

The descriptive statistics for the objective acoustic and psychoacoustic metrics across the four environment types are summarized in Table 2 and Table 3. Shapiro–Wilk normality tests indicated that none of the acoustic or psychoacoustic metrics followed a normal distribution (p-values < 0.05), requiring the use of non-parametric statistical analysis. The results of the Kruskal–Wallis tests showed significant differences (p-values < 0.001) in acoustic energy, spectral, and temporal content among the environment types.
The built environment of l. Library consistently exhibited the lowest noise levels across most conventional metrics, including LAeq (Median = 45.1 dBA), LA10 (45.5 dBA), and RC ratings (42). Post hoc Dunn’s tests confirmed that the Library environment was significantly quieter than the other three types. The built environments of 2. Classroom and 3. Lecture Hall demonstrated significantly higher temporal variability (LA10LA90 = 1.1 dBA) compared to the other environment types. The results suggested that although the built environment of 4. Office is louder than the Library, the noise profile is more constant than that of the university Classroom and Lecture Hall.
Similar to the conventional acoustic metrics, the psychoacoustic characteristics exhibited significant variations across the four environment types (p-values < 0.001). The built environment of 4. Office was characterized by the highest intensities across most psychoacoustic metrics. It recorded the highest median N (19.9 sone), R (0.12 asper), and FS (0.06 vacil). Post hoc Dunn’s tests confirmed that the Office environment was significantly louder and rougher than all other types. In addition, the Office environment illustrated the highest temporal variability in loudness fluctuation (N5N95 = 2.0 sone). In contrast, the built environment of 2. Classroom, despite being relatively quiet (N = 6.7 sone), was distinguished by its spectral characteristics. It exhibited the lowest S (0.87 acum), suggesting a duller sound profile compared to other built environments.

3.1.2. Multidimensional Subjective Perceptual Responses

The subjective assessments quantified via the PPS in the EPA framework are presented in Figure 2. Significant differences were observed for all perceptual dimensions across the four environment types (p-values < 0.05). The built environment of 1. Library was consistently rated as the most favorable environment, achieving the lowest EPA-score (−7.0 ± 4.8). In the design of the PPS, lower values indicate better sound quality (e.g., −3 = “Extremely Pleasant” vs. +3 = “Extremely Unpleasant”). Therefore, the Library environment was perceived as the “Quietest” (E1/P1), most “Relaxed” (E2), and most “Pleasant” (E3). Moreover, the built environment of 2. Classroom showed the lowest A-score (−2.6 ± 4.0), a result that aligns with its lowest objective S value (0.87 acum) in Table 2.

3.1.3. Reliability of Sound Quality Assessment

Cronbach’s α analysis was conducted to verify the internal consistency of the subjective responses measured by the PPS in quantifying sound quality perceptions across the three fundamental E-, P-, and A-dimensions. As shown in Table 4, the Cronbach’s α for all perceptual E-, P-, A-, and EPA-dimensions exceeded 0.80 across all built environments, indicating good internal consistency in the collected data [53].

3.2. Performance Evaluation of Assessment Methods

3.2.1. Prediction of Individual E-, P-, and A-Scores Using the PML Assessment Method

As shown in Table 5, higher ρ values were found between the PML predictions and the individual E-, P-, and A-scores compared to conventional metrics. For the E-score, the PML prediction (ρ = 0.77) showed a 148% and 38% improvement over LAeq (ρ = 0.31) and RC (ρ = 0.56), respectively. Similarly, for the P-score, the PML prediction (ρ = 0.80) demonstrated a 166% and 38% improvement over LAeq (ρ = 0.30) and RC (ρ = 0.58). For the A-score, the PHMLM (ρ = 0.57) showed a 171% improvement over the psychoacoustic metric S (ρ = 0.21).

3.2.2. Prediction of EPA-Score

The CRAEPA model was established using stepwise linear regression to identify the most significant acoustic and psychoacoustic metrics as predictors for the EPA-score. The analysis identified RC, N5N95, fMod, LZeq and LA90 as the significant independent variables. The final regression model achieved an adjusted R2 = 0.315, F(5, 426) = 40.6, p-value < 0.001). Among the predictors, RC demonstrated the strongest positive influence (B = 3.96, p-value < 0.001), while N5N95 showed a strong negative contribution (B = −4.16, p-value < 0.001). The background noise level (LA90) also negatively impacted the EPA-score (B = −2.19, p-value = 0.006), with minor contributions from LZeq (B = −0.40, p-value = 0.001) and fMod (B = −0.03, p-value = 0.013). The resulting regression equation was found to be:
C R A E P A = 3.96 × R C 4.16 × N 5 N 95 0.03 × f M o d 0.40 × L Z e q 2.9 × L A 90 41.7
The predictive performance of the overall sound quality (EPA-score) was evaluated by comparing the PML assessment methods against the CRAEPA (Equation (9)), GPMEPA (Equation (2)), and the metrics LAeq, RC, and N. As shown in Figure 3, the PMLEPA demonstrated the highest goodness-of-fit among the models across all environment types. For all air-conditioned built environments, the PMLEPA achieved a strong correlation (ρ = 0.79) and explained the highest variance (adjusted R2 = 0.61). In contrast, the CRAEPA showed moderate performance (ρ = 0.58, adjusted R2 = 0.32), while the GPMEPA showed the weakest predictive power (ρ = 0.40, adjusted R2 = 0.15). The performance of the GPMEPA was comparable to that of single-value acoustic metrics such as N (ρ = 0.40), indicating that general models fail to capture the specific acoustic properties in different environments.
The superiority of the PML approach was consistent across individual environments. In the built environment of 3. Lecture Hall, the PMLEPA achieved an exceptional fit (adjusted R2 = 0.77), providing a 28% improvement over the CRAEPA (adjusted R2 = 0.60) and a 43% improvement over the GPMEPA (adjusted R2 = 0.54). Similarly, in the built environment of 1. Library, the PMLEPA (adjusted R2 = 0.69) outperformed the CRAEPA (adjusted R2 = 0.23) by 200%, highlighting the strength of the heatmap-based features in quiet environments where traditional metrics often struggle. Furthermore, traditional acoustic metrics (LAeq, RC, and N) showed limited predictive capability, with adjusted R2 values generally below 0.30 for all air-conditioned environment (see Figure 3d–f). The results confirmed that while conventional metrics and CRAEPA can provide a baseline assessment, they lack the sensitivity of the PMLEPA to accurately predict multidimensional sound quality in complex air-conditioned environments.

3.2.3. Prediction of Occupant Well-Being-Related Noise Impacts

The capability of the models to predict occupant well-being, specifically the negative noise impacts (O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable), was evaluated by integrating the predicted EPA-scores into the GPMO1–4, Equations (3)–(6). Two hybrid models were established: GPMO1–4-PML, which substitutes the GPMEPA in the equations with the PMLEPA, and GPMO1–4-PCRA, which substitutes it with the CRAEPA. These hybrid models were then compared against the standalone PMLEPA and CRAEPA, as well as traditional acoustic metrics (LAeq, RC, and N), to assess their predictive performance for occupant well-being-related noise impacts.
Table 6 and Table 7 demonstrated a consistent relationship where higher adjusted R2 corresponds to lower root mean squared error (RMSE) and narrower 95% confidence intervals (CIs), confirming the reliability of adjusted R2 as the primary statistics indicator across all methods for subsequent discussion in this paper. As shown in Table 6, the PMLEPA consistently achieved the highest goodness-of-fit across all well-being-related noise impacts (O1O4) for all air-conditioned built environments (n = 432). For O1: Discomfortable, the PMLEPA achieved a correlation of ρ = 0.65 (adjusted R2 = 0.42), significantly outperforming the CRAEPA (ρ = 0.50, adjusted R2 = 0.25). The hybrid model GPMO1-PML yielded nearly identical performance to the standalone PMLEPA (ρ = 0.64, adjusted R2 = 0.41), whereas the GPMO1-CRA and traditional acoustic metrics (LAeq and RC) showed weaker correlations (ρ = 0.48). The superiority of the PML assessment method was even more pronounced for O2: Annoying and O3: Stressful. The PMLEPA achieved correlations of 0.63 and 0.64 (adjusted R2 = 0.40–0.41), respectively, providing a substantial improvement over the CRAEPA (ρ = 0.46–0.50). In contrast, the GPMO2–O3 and single-value metrics (LAeq, RC, and N) showed weak predictive power, with adjusted R2 values dropping as low as 0.02 for O3: Stressful predictions using N. For O4: Unacceptable, the PMLEPA remained the best predictor (ρ = 0.60, adjusted R2 = 0.36), effectively identifying the acoustic conditions that occupants deemed unacceptable. Notably, in the library environment, the PMLEPA achieved exceptional accuracy across all dimensions, with correlations reaching ρ = 0.74–0.78 (adjusted R = 0.36 = 0.53–0.62), demonstrating its sensitivity to the subtle acoustic properties that drive dissatisfaction in quiet spaces. These results confirm that integrating the multidimensional EPA-scores predicted by the PML into well-being models GPMO1–4-PML offers a far more reliable tool for acoustic assessment than CRA or simple energy-based acoustic or psychoacoustic metrics.

4. Discussion

4.1. Multidimensional Sound Quality Assessment Framework Application

The fundamental advantage of the PML assessment method lies in its integration of “psychoacoustics” with “machine learning” to create a holistic, multidimensional sound quality assessment framework (see Figure 1). The results of the multidimensional objective acoustic characterization demonstrated that psychoacoustic metrics (see Table 3) provide critical spectral and temporal information that complements traditional energy-based metrics (see Table 2). For instance, while traditional metrics indicated varying noise levels, the psychoacoustic analysis revealed that the built environment of 2. Classroom exhibited the lowest S values. This spectral characteristic, which traditional metrics failed to capture, aligns perfectly with the lower A-scores observed in the subjective responses (see Figure 2), confirming the agreement between objective psychoacoustic features and human perception. This supports recent findings that single-number ratings often fail to capture the acoustic characteristics of noise perception in complex indoor environments [54].
Furthermore, the study demonstrated that subjective perceptual responses cannot be explained solely by energy content-related metrics. A striking example is the built environment of 4. Office, which recorded the highest energy values in Table 2. Counter-intuitively, the subjective results in Figure 2 showed that the office environment had a P-score of −1, which is lower than that of the 2. Classroom and 3. Lecture Hall. This discrepancy can be attributed to the specific psychoacoustic characteristics of the Office noise, specifically the lowest fMod observed in Table 3. The low loudness sensation associated with low-frequency noise leads to a lower perception of Potency despite high physical energy levels, a phenomenon consistent with studies on low-frequency HVAC noise impacts [55]. Consequently, this resulted in a lower overall EPA-score for the office environment compared to university Classroom and Lecture Hall environments. These findings substantiate the necessity of using the PPS for multidimensional sound quality assessment, as simple energy metrics fail to account for the complex non-linear processing of human hearing [56]. The high internal consistency (Cronbach’s α > 0.82) observed across all environment types further validates the PPS as a reliable psychometric tool for evaluating air-conditioned built environments [12,49].

4.2. Predictive Performance of PML for Sound Quality Prediction

The PML approach demonstrated superior performance in predicting the multidimensional sound quality scores (E, P, and A-scores) compared to traditional methods. Spearman’s rank correlation tests confirmed that PML predictions achieved significantly higher correlations with the individual E-, P-, and A-scores than individual acoustic and psychoacoustic metrics (see Table 5). When evaluating the overall EPA-score prediction, the PMLEPA was found to be better than both the CRAEPA and the GPMEPA. While the CRAEPA showed reasonable performance because it is a case-specific model developed using the subjective assessment results from this study, it relies on obtaining subjective data, which is often impractical for design stages. The GPMEPA, designed as a pure-objective method without the need for subjective inputs, showed lower performance in this study (adjusted R2 = 0.15). This is likely because the GPM assessment method was originally designed based on outdoor soundscapes [37], and its current formulation may not fully capture the specific noise source characteristics of indoor air-conditioning systems. However, the GPM remains valuable for building acoustics design as it offers a generalized baseline without requiring jury testing.
The significantly better performance of the PML model can be attributed to its ability to learn non-linear noise features from 227 × 227-pixel psychoacoustic heatmaps representing 0.002 s resolution evolution of loudness, sharpness, roughness, and fluctuation strength. Unlike using time-averaged stationary metrics in linear regression models (CRA and GPM), the deep learning architecture of the PML captures the complex interactions between spectral and temporal variations, offering a solution to the feature engineering problem inherent in developing pure-objective prediction models [57]. This capability makes PML the most feasible method for predicting the EPA-score, as it effectively bridges the gap between objective signal properties and subjective human perception [58], accounting the human–environmental interaction.

4.3. Implications for Occupant Well-Being Prediction

The predictive power of the PML approach extended to the assessment of occupant well-being. The standalone PMLEPA model demonstrated the highest accuracy in predicting the negative noise impacts (O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable), surpassing both the traditional metrics and the hybrid models GPMO1–4. Although the hybrid models (GPMO1–4-PML) utilized the EPA-scores predicted by the PML, they yielded slightly lower goodness-of-fit compared to the standalone PHMLMEPA model (see Table 6 and Table 7). The same domain mismatch between the GPM developed for outdoor soundscapes [37] and the PHMLMEPA developed for indoor air-conditioned environments likely explains this result. The physics-based GPM parameters optimized for outdoor psychoacoustic relationships may conflict with the non-linear mappings learned by PHMLMEPA from indoor soundscapes, constraining hybrid performance. Future investigation should examine GPM and PHMLMEPA performance across both indoor and outdoor soundscapes before attempting integration. Only after domain-specific validation can data-driven predictions be effectively incorporated into generalized frameworks applicable to both environments. This indicates that simply integrating high-quality EPA predictions into the existing GPM framework, which was originally designed for different acoustic contexts, may not fully capitalize on the precision of the PML features. Therefore, the combination mechanism between data-driven predictions (PML or CRA) and generalized frameworks (GPM) requires further investigation to optimize the predictive combined effect.
In addition, the high performance of the standalone PMLEPA confirms that the method serves as a reliable assessment method for human comfort. The significance of these findings contributes directly to the field of built environment physics in studying the impact of acoustics on occupant well-being. By providing a validated, high-accuracy tool for predicting dissatisfaction and stress, the PML method empowers engineers and designers to prioritize occupant health in the acoustic design of sustainable buildings in the future [59].

4.4. Limitations and Future Work

While the PML demonstrated superior performance as a standalone model, the hybrid approach combining PML predictions with the GPM framework GPMO1–4 yielded slightly lower goodness-of-fit for the models. This suggests that the current method of integrating data-driven inputs into the generalized model structure requires optimization. Future work should investigate more advanced fusion techniques to better leverage the collaboration between the feature extraction of PML and the generalized structure of GPM. Although some soundtracks were randomly selected from the PML training datasets, independent listening tests with new subjects generated distinct perceptual labels, validating cross-panel prediction capability. Future work should validate PML performance using completely different soundtracks from diverse HVAC systems and building types to confirm generalizability beyond the original acoustic training domain.
Moreover, the subjective perceptual responses were collected from 36 participants with a mean age of 24.1 ± 2.2 years, of whom 69.4% were male. This represents a narrow demographic range, predominantly comprising young university students. While this participant profile facilitated standardized experimental control, it limits the generalizability of well-being predictions, particularly for diverse “Office” environments where actual occupant age and gender distributions are much wider. Future research should expand participant recruitment to include a broader demographic spectrum across age (e.g., 20–65 years), gender balance, occupational diversity, and varying hearing abilities to enhance ecological validity. Furthermore, field studies in actual office settings with long-term occupant monitoring would validate model performance across real-world demographic variability. These extensions would strengthen the practical applicability of the PML method for sustainable building design across diverse occupant populations.
In addition, the PML model was trained on a specific dataset of air-conditioned sounds. Further validation is required to test its generalizability across a broader range of building types (e.g., residential, hospitals) and noise sources [60] (e.g., traffic intrusion, machinery). The AlexNet transfer learning approach may have optimized feature extraction for this particular building inventory. Future research should employ cross-dataset validation with more independent field recordings. In addition, the current study highlights the “black box” nature of ML [61]. Future research could explore Explainable-AI techniques [62] to visualize which specific regions of the psychoacoustic heatmaps contribute most to the predictions, offering deeper insights into the physical drivers of sound perception. Therefore, the impact of noise source changes on PML performance needs systematic investigation to establish stronger thresholds for practical implementation in diverse real-world scenarios. While PML demonstrated better predictive performance (adjusted R2 = 0.61 for EPA-score than CRA and GPM, this is not a comparison flaw but a valid finding of method strengths. The results reflected PML’s ability to capture non-linear temporal–spectral complexity (0.002 s resolution psychoacoustic heatmaps) inherent to HVAC noise that linear methods cannot address. However, simplicity and interpretability remain critical practical advantages of CRA and GPM. CRA offers straightforward implementation using time-averaged metrics with transparent regression coefficients that enable engineering design rules. GPM provides general applicability across environments without retraining through fixed equations that facilitate rapid assessment. PML’s computational complexity represents a trade-off against CRA/GPM’s practicality for real-time building applications. Future work should validate all three methods in more engineering-practical scenarios across additional perspectives, including real-time monitoring systems, diverse HVAC configurations, long-term field studies beyond controlled jury tests, and cross-cultural perceptual validation. Hybrid GPM-PML approaches combining non-linear feature extraction with interpretable linear outputs will balance predictive power with practical utility for sustainable building acoustic design.

5. Conclusions

This study comprehensively evaluated CRA, GPM, and PML for predicting sound quality (EPA-score) and negative noise impacts on occupant well-being (O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable) across air-conditioned built environments. PML demonstrated better performance with Spearman ρ = 0.79 (adjusted R2 = 0.61) for EPA-score prediction (n = 432) than CRA (ρ = 0.58, adjusted R2 = 0.32) and GPM (ρ = 0.40, adjusted R2 = 0.15). Furthermore, PML also demonstrated better performance in well-being predictions (O1: adjusted R2 = 0.42, O2: adjusted R2 = 0.40, and O3: adjusted R2 = 0.41) and showed exceptional accuracy in quiet library environments (adjusted R2 = 0.53–0.62). GPM’s poor performance reflects an outdoor-to-indoor domain mismatch, while CRA’s limitations stem from static time-averaged metrics. PML’s psychoacoustic heatmaps capture critical temporal variations missed by both alternatives. This cross-method comparison identifies PML as the optimal objective assessment tool for building acoustic design, delivering reliable occupant well-being predictions without subjective testing to support health-focused sustainable building practices.

Author Contributions

Conceptualization, K.W.M. and C.M.M.; methodology, K.W.M.; software, C.M.M. and F.-L.C.; validation, C.M.M.; formal analysis, K.W.M.; investigation, K.W.M.; resources, H.M.W.; data curation, K.W.M.; writing—original draft preparation, K.W.M.; writing—review and editing, K.W.M., C.M.M., F.-L.C. and H.M.W.; visualization, K.W.M.; supervision, C.M.M.; project administration, C.M.M.; funding acquisition, C.M.M. and F.-L.C. All authors have read and agreed to the published version of the manuscript.

Funding

The work described in this paper was fully supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (PolyU15219421).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

Students Hung Yeuk Lun, Au Wing Kin, Leung Wai Yuen, and Yip Lai Hei from The Hong Kong Polytechnic University are acknowledged for their assistance in sound measurements and data collection.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
PHMLMPsychoacoustic Heatmap Machine Learning Models
MLMachine Learning
PMLPsychoacoustic Machine Learning
GPMGeneral Prediction Model
CRAConventional Regression Approach
EEvaluation
PPotency
AActivity
EPAEvaluation, Potency, and Activity
PPSPsychoacoustics Perception Scale
WHOWorld Health Organization
HVACHeating, Ventilation, and Air-Conditioning
NCNoise Criteria
NRNoise Rating
RCRoom Criteria
ANNArtificial Neural Network
SDStandard Deviation
IQRInterquartile Range
SGDMStochastic Gradient Descent with Momentum
ReLURectified Linear Unit activation function
AIArtificial Intelligence
RMSERoot Mean Squared Error
CIsConfidence Intervals

Appendix A

Equations of the Psychoacoustic Metrics

N t = 0.08 E T Q E 0 0.23 0.5 + 0.5 E t E T Q 0.23 1 s o n e B a r k ,
where N’(t) is the time-varying specific loudness in sone per Bark, E(t) is the time-varying excitation of the sound, ETQ is the excitation of threshold in quiet, and E0 is the reference excitation. The loudness of a 1 kHz pure tone sound at 40 dB is defined as 1 sone and 40 phon.
N ( t ) = 0 24   B a r k N ( t ) d z ( s o n e )   ,
where N(t) is the time-varying total loudness.
S t = 0.11 0 24   B a r k N t g z z d z 0 24   B a r k N d z a c u m ,
g z : = 1 , z 14 0.00012 Z 4 0.0056 Z 3 + 0.1 Z 2 0.81 z + 3.51 , z > 14
where S(t) is the time-varying sharpness, and g(z) is critical-band-rate dependent. The sharpness of a 1 kHz pure tone sound at 60 dB is defined to be 1 acum.
R = 0.0003 f M o d 0 24   B a r k L z d z a s p e r ,
F S = 0.008 0 24   B a r k L z d z f M o d 4 + 4 f M o d v a c i l ,
where fMod is the dominant modulation frequency for the effects of frequency resolution (Hz), and ΔL is the temporal depth for masking the effect of temporal resolution (dB). A 1 kHz pure tone at 60 dB with 100% amplitude modulation at 70 Hz and 4 Hz is equal to 1 asper and 1 vacil, respectively.
For the calculation of N, the time-averaged excitation of the sound is utilized.
L N = 40 N s o n e + 0.005 0.35 , N < 1   sone       40 + 10 l o g 2 ( N s o n e ) , N 1   s o n e ( p h o n )
where LN is the loudness level converted from the time-averaged N in phon.
NP% and SP% are the values of instantaneous total loudness and sharpness that exceed P% of the elapsed time, respectively.

References

  1. Scrosati, C.; Scamoni, F. Managing measurement uncertainty in building acoustics. Buildings 2015, 5, 1389–1413. [Google Scholar] [CrossRef]
  2. Park, J.; Loftness, V.; Aziz, A. Post-occupancy evaluation and IEQ measurements from 64 office buildings: Critical factors and thresholds for user satisfaction on thermal quality. Buildings 2018, 8, 156. [Google Scholar] [CrossRef]
  3. Torresin, S.; Aletta, F.; Babich, F.; Bourdeau, E.; Harvie-Clark, J.; Kang, J.; Lavia, L.; Radicchi, A.; Albatici, R. Acoustics for supportive and healthy buildings: Emerging themes on indoor soundscape research. Sustainability 2020, 12, 6054. [Google Scholar] [CrossRef]
  4. Aflaki, A.; Esfandiari, M.; Jarrahi, A. Multi-criteria evaluation of a library’s indoor environmental quality in the tropics. Buildings 2023, 13, 1233. [Google Scholar] [CrossRef]
  5. Fry, A. Noise Control in Building Services; Elsevier: Amsterdam, The Netherlands, 1988. [Google Scholar]
  6. Munjal, M.J. Acoustics of Ducts and Mufflers; Wiley: New York, NY, USA, 2014. [Google Scholar]
  7. Mak, C.M.; Ma, K.W.; Wong, H.M. Prediction and Control of Noise and Vibration from Ventilation Systems, 1st ed.; Taylor & Francis: Oxfordshire, UK, 2023. [Google Scholar]
  8. ISO 1996-1:2016; Acoustics—Description, Measurement and Assessment of Environmental Noise—Part 1: Basic Quantities and Assessment Procedures. International Organization for Standardization: Geneva, Switzerland, 2016.
  9. ANSI/ASA S12.2-2019; Criteria for Evaluating Room Noise. Acoustical Society of America: Melville, NY, USA, 2019.
  10. Kim, A.; Wang, S.; McCunn, L.; Prozuments, A.; Swanson, T.; Lokan, K. Commissioning the acoustical performance of an open office space following the latest healthy building standard: A case study. Acoustics 2019, 1, 473–492. [Google Scholar] [CrossRef]
  11. Thampanichwat, C.; Wongvorachan, T.; Bunyarittikit, S.; Chunhajinda, P.; Phaibulputhipong, P.; Wongmahasiri, R. The Architectural Design Strategies That Promote Attention to Foster Mindfulness: A Systematic Review, Content Analysis and Meta-Analysis. Buildings 2024, 14, 2508. [Google Scholar] [CrossRef]
  12. Ma, K.W.; Mak, C.M.; Wong, H.M. Development of a subjective scale for sound quality assessments in building acoustics. J. Build. Eng. 2020, 29, 101177. [Google Scholar] [CrossRef]
  13. Ma, K.W.; Wong, H.M.; Mak, C.M. Dental Environmental Noise Evaluation and Health Risk Model Construction to Dental Professionals. Int. J. Environ. Res. Public Health 2017, 14, 1084. [Google Scholar] [CrossRef]
  14. World Health Organization. Environmental Noise Guidelines for the European Region; WHO: Geneva, Switzerland, 2018. [Google Scholar]
  15. World Health Organization. Burden of Disease from Environmental Noise: Quantification of Healthy Life Years Lost in Europe; WHO: Geneva, Switzerland, 2011. [Google Scholar]
  16. Ma, K.W.; Mak, C.M.; Wong, H.M. The perceptual and behavioral influence on dental professionals from the noise in their workplace. Appl. Acoust. 2020, 161, 107164. [Google Scholar] [CrossRef]
  17. Yonemura, M.; Lee, H.; Sakamoto, S. Subjective evaluation on the annoyance of environmental noise containing low-frequency tonal components. Int. J. Environ. Res. Public Health 2021, 18, 7127. [Google Scholar]
  18. Zhang, X.; Ba, M.; Kang, J.; Meng, Q. Effect of soundscape dimensions on acoustic comfort in urban open public spaces. Appl. Acoust. 2018, 133, 73–81. [Google Scholar] [CrossRef]
  19. Erfanian, M.; Mitchell, A.J.; Kang, J.; Aletta, F. The psychophysiological implications of soundscape: A systematic review of empirical literature and a research agenda. Int. J. Environ. Res. Public Health 2019, 16, 3533. [Google Scholar] [CrossRef]
  20. ISO/TS 15666:2003; Acoustics—Assessment of Noise Annoyance by Means of Social and Socio-Acoustic Surveys. International Organization for Standardization: Geneva, Switzerland, 2003.
  21. ISO 12913-1:2014; Acoustics—Soundscape—Part 1: Definition and Conceptual Framework. International Organization for Standardization: Geneva, Switzerland, 2014.
  22. ISO 12913-2:2018; Acoustics—Soundscape—Part 2: Data Collection and Reporting Requirements. International Organization for Standardization: Geneva, Switzerland, 2018.
  23. ISO 12913-3:2019; Acoustics—Soundscape—Part 3: Data Analysis. International Organization for Standardization: Geneva, Switzerland, 2019.
  24. Soeta, Y.; Kagawa, H. Three dimensional psychological evaluation of aircraft noise and prediction by physical parameters. Build. Environ. 2020, 167, 106445. [Google Scholar] [CrossRef]
  25. Lian, Y.; Ou, D.; Tan, R. The effects of sound source dominance and pressure level on cognitive performance and environmental perception in green space. Appl. Acoust. 2025, 240, 110897. [Google Scholar] [CrossRef]
  26. Osgood, C.E. The nature and measurement of meaning. Psychol. Bull. 1952, 49, 197–273. [Google Scholar] [CrossRef]
  27. Takada, M.; Tanaka, K.; Iwamiya, S.-I. Relationships between auditory impressions and onomatopoeic features for environmental sounds. Acoust. Sci. Technol. 2006, 27, 67–79. [Google Scholar] [CrossRef]
  28. Galiana, M.; Llinares, C.; Page, Á. Subjective evaluation of music hall acoustics: Response of expert and non-expert users. Build. Environ. 2012, 58, 1–13. [Google Scholar] [CrossRef]
  29. Ma, K.W.; Wong, H.M.; Mak, C.M. A systematic review of human perceptual dimensions of sound: Meta-analysis of semantic differential method applications to indoor and outdoor sounds. Build. Environ. 2018, 133, 123–150. [Google Scholar] [CrossRef]
  30. Zwicker, E.; Fastl, H. Psychoacoustics: Facts and Models, 3rd ed.; Springer Science & Business Media: Berlin, Germany, 2007; Volume 22. [Google Scholar]
  31. ISO 532-1:2017; Acoustics—Methods for Calculating Loudness—Part 1: Zwicker Method. International Organization for Standardization: Geneva, Switzerland, 2017.
  32. Tang, S.K. Performance of noise indices in air-conditioned landscaped office buildings. J. Acoust. Soc. Am. 1997, 102, 1657–1663. [Google Scholar] [CrossRef]
  33. Segura-Garcia, J.; Navarro-Ruiz, J.M.; Perez-Solano, J.J.; Montoya-Belmonte, J.; Felici-Castell, S.; Cobos, M.; Torres-Aranda, A.M. Spatio-temporal analysis of urban acoustic environments with binaural psycho-acoustical considerations for IoT-based applications. Sensors 2018, 18, 690. [Google Scholar] [CrossRef]
  34. Ma, K.W.; Mak, C.M.; Wong, H.M. Acoustical measurements and prediction of psychoacoustic metrics with spatial variation. Appl. Acoust. 2020, 168, 107450. [Google Scholar] [CrossRef]
  35. Ma, K.W.; Mak, C.M.; Wong, H.M. Effects of environmental sound quality on soundscape preference in a public urban space. Appl. Acoust. 2021, 171, 107570. [Google Scholar] [CrossRef]
  36. Axelsson, Ö.; Nilsson, M.E.; Berglund, B. A principal components model of soundscape perception. J. Acoust. Soc. Am. 2010, 128, 2836–2846. [Google Scholar] [CrossRef]
  37. Ma, K.W.; Mak, C.M.; Wong, H.M. Development of a sound quality model for noise impact prediction in building acoustics. J. Build. Eng. 2025, 111, 113183. [Google Scholar] [CrossRef]
  38. Jiang, J.; Trundle, P.; Ren, J. Medical image analysis with artificial neural networks. Comput. Med. Imaging Graph. 2010, 34, 617–631. [Google Scholar] [CrossRef]
  39. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012, 1097–1105. [Google Scholar] [CrossRef]
  40. Gu, J.; Wang, Z.; Kuen, J.; Ma, L.; Shahroudy, A.; Shuai, B.; Liu, T.; Wang, X.; Wang, G.; Cai, J. Recent advances in convolutional neural networks. Pattern Recognit. 2018, 77, 354–377. [Google Scholar] [CrossRef]
  41. Vican, I.; Kreković, G.; Jambrošić, K. Improved fetal heartbeat detection using pitch shifting and psychoacoustics. Biomed. Signal Process. Control 2024, 90, 105850. [Google Scholar] [CrossRef]
  42. Barros, A.; Geluykens, M.; Pereira, F.; Freitas, E.; Faria, S.; Goubert, L.; Vuye, C. Beyond noise levels: Vehicle classification using psychoacoustic indicators from pass-by road traffic noise and their correlations with speed and temperature. Appl. Acoust. 2023, 214, 109716. [Google Scholar] [CrossRef]
  43. Souli, S.; Lachiri, Z. Audio sounds classification using scattering features and support vectors machines for medical surveillance. Appl. Acoust. 2018, 130, 270–282. [Google Scholar] [CrossRef]
  44. Hvastja, A.; Ćirić, D.; Milivojčević, M.; Prezelj, J. Assessing air and noise pollution through acoustic classification of vehicles fuel types using deep learning. Heliyon 2025, 11, e43426. [Google Scholar] [CrossRef]
  45. Potočnik, P.; Olmos, B.; Vodopivec, L.; Susič, E.; Govekar, E. Condition classification of heating systems valves based on acoustic features and machine learning. Appl. Acoust. 2021, 174, 107736. [Google Scholar] [CrossRef]
  46. Wu, C.; Redonnet, S. A simple yet efficient data-driven model for the prediction of aircraft noise impact. Aerosp. Sci. Technol. 2025, 163, 110286. [Google Scholar] [CrossRef]
  47. O’Reilly, D.; White, M.; Langenheim, N.; Alambeigi, P.; Huang, X.; Yang, T. Receiver-centric mapping of pedestrian noise annoyance: A cost-effective approach using random forest, psychoacoustic metrics and open-source data. Sustain. Cities Soc. 2025, 130, 106651. [Google Scholar] [CrossRef]
  48. Dai, R.; Zhao, J.; Zhao, W.; Ding, W.; Huang, H. Exploratory study on sound quality evaluation and prediction for engineering machinery cabins. Measurement 2025, 253, 117684. [Google Scholar] [CrossRef]
  49. Ma, K.W.; Mak, C.M.; Chung, F.-L.; Wong, H.M. Advanced Acoustic Monitoring Using Psychoacoustic Heatmap Machine Learning Models for Noise Impact Prediction in Air-Conditioned Building Environments. Sensors 2026, 26, 544. [Google Scholar] [CrossRef]
  50. Zhang, T.; Feng, G.; Liang, J.; An, T. Acoustic scene classification based on Mel spectrogram decomposition and model merging. Appl. Acoust. 2021, 182, 108258. [Google Scholar] [CrossRef]
  51. Dixon, W.J.; Massey, F.J., Jr. Introduction to Statistical Aanalysis, 4th ed.; McGraw-Hill Book Company: New York, NY, USA, 1983. [Google Scholar]
  52. Razali, N.M.; Wah, Y.B. Power comparisons of shapiro-wilk, kolmogorov-smirnov, lilliefors and anderson-darling tests. J. Stat. Model. Anal. 2011, 2, 21–33. [Google Scholar]
  53. Gliem, J.A.; Gliem, R.R. Calculating, interpreting, and reporting Cronbach’s alpha reliability coefficient for Likert-type scales. In Midwest Research-to-Practice Conference in Adult, Continuing, and Community Education; The Ohio State University: Columbus, OH, USA, 2003. [Google Scholar]
  54. Zannin, P.H.T.; Marcon, C.R. Objective and subjective evaluation of the acoustic comfort in classrooms. Appl. Ergon. 2007, 38, 675–680. [Google Scholar] [CrossRef]
  55. Soeta, Y.; Shimokura, R. Sound quality evaluation of air-conditioner noise based on factors of the autocorrelation function. Appl. Acoust. 2017, 124, 11–19. [Google Scholar] [CrossRef]
  56. Abbasi, A.M.; Motamedzade, M.; Aliabadi, M.; Golmohammadi, R.; Tapak, L. Study of the physiological and mental health effects caused by exposure to low-frequency noise in a simulated control room. Build. Acoust. 2018, 25, 233–248. [Google Scholar] [CrossRef]
  57. Lionello, M.; Aletta, F.; Kang, J. A systematic review of prediction models for the experience of urban soundscapes. Appl. Acoust. 2020, 170, 107479. [Google Scholar] [CrossRef]
  58. Jafari, M.J.; Khosrowabadi, R.; Khodakarim, S.; Mohammadian, F. The effect of noise exposure on cognitive performance and brain activity patterns. Open Access Maced. J. Med. Sci. 2019, 7, 2924. [Google Scholar] [CrossRef]
  59. Starowicz, A.; Zieliński, M. Sustainable acoustics: The impact of AI on acoustics design and noise management. Tech. Sci. 2024, 27, 193–209. [Google Scholar] [CrossRef]
  60. Piczak, K.J. Environmental sound classification with convolutional neural networks. In Proceedings of the 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), Boston, MA, USA, 17–20 September 2015; pp. 1–6. [Google Scholar]
  61. Petch, J.; Di, S.; Nelson, W. Opening the black box: The promise and limitations of explainable machine learning in cardiology. Can. J. Cardiol. 2022, 38, 204–213. [Google Scholar] [CrossRef] [PubMed]
  62. Holzinger, A.; Saranti, A.; Molnar, C.; Biecek, P.; Samek, W. Explainable AI methods-a brief overview. In Proceedings of the International Workshop on Extending Explainable AI Beyond Deep Models and Classifiers, Vienna, Austria, 18 July 2020; pp. 13–38. [Google Scholar]
Figure 1. The multidimensional sound quality assessment framework and experimental workflow for the comparative evaluation of conventional regression approach (CRA), general prediction model (GPM), and psychoacoustic machine learning (PML) assessment methods in predicting occupant well-being in air-conditioned built environments. (a) The theoretical framework comprising four key stages (I–IV) linking objective acoustic characteristics to occupant well-being; (b) Experimental setup of objective data collection in Stage I, illustrating objective data collection in four air-conditioned built environments; (c) Experimental setup of subjective data collection in Stages II-IV, detailing the subjective jury listening tests and questionnaire structure (Part A: Psychoacoustic Perception Scale and Part B: occupant well-being-related noise impacts).
Figure 1. The multidimensional sound quality assessment framework and experimental workflow for the comparative evaluation of conventional regression approach (CRA), general prediction model (GPM), and psychoacoustic machine learning (PML) assessment methods in predicting occupant well-being in air-conditioned built environments. (a) The theoretical framework comprising four key stages (I–IV) linking objective acoustic characteristics to occupant well-being; (b) Experimental setup of objective data collection in Stage I, illustrating objective data collection in four air-conditioned built environments; (c) Experimental setup of subjective data collection in Stages II-IV, detailing the subjective jury listening tests and questionnaire structure (Part A: Psychoacoustic Perception Scale and Part B: occupant well-being-related noise impacts).
Buildings 16 01027 g001
Figure 2. A horizontal stacked bar chart of the perceptual data (n = 432) captured using the Psychoacoustics Perception Scale (PPS) in Evaluation, Potency, and Activity (EPA) dimensions for four types of air-conditioned built environments: (a) 1. Library, (b) 2. Classroom, (c) 3. Lecture Hall, and (d) 4. Office.
Figure 2. A horizontal stacked bar chart of the perceptual data (n = 432) captured using the Psychoacoustics Perception Scale (PPS) in Evaluation, Potency, and Activity (EPA) dimensions for four types of air-conditioned built environments: (a) 1. Library, (b) 2. Classroom, (c) 3. Lecture Hall, and (d) 4. Office.
Buildings 16 01027 g002
Figure 3. Comparative predictive performance for EPA-score prediction across four types of air-conditioned built environments (n = 432): Scatter plots of observed vs. predicted EPA-scores from (a) psychoacoustic machine learning model (PMLEPA), (b) conventional regression approach (CRAEPA), (c) general prediction model (GPMEPA), and traditional acoustic metrics (d) A-weighted equivalent continues sound pressure level (LAeq), (e) room criteria (RC), and (f) total loudness (N). The regression line (red) and coefficient of determination (adjusted R2) indicate the goodness-of-fit for each model.
Figure 3. Comparative predictive performance for EPA-score prediction across four types of air-conditioned built environments (n = 432): Scatter plots of observed vs. predicted EPA-scores from (a) psychoacoustic machine learning model (PMLEPA), (b) conventional regression approach (CRAEPA), (c) general prediction model (GPMEPA), and traditional acoustic metrics (d) A-weighted equivalent continues sound pressure level (LAeq), (e) room criteria (RC), and (f) total loudness (N). The regression line (red) and coefficient of determination (adjusted R2) indicate the goodness-of-fit for each model.
Buildings 16 01027 g003
Table 1. Architecture of a psychoacoustic heatmap machine learning model.
Table 1. Architecture of a psychoacoustic heatmap machine learning model.
LayerConfiguration/DimensionsActivation/Function
Input227 × 227 × 1 (grayscale heatmap)Input layer for psychoacoustic heatmaps
Convolution 196 filters, 11 × 11 kernel, stride 4ReLU (Feature extraction—transferred)
Max Pooling 13 × 3 max pooling, stride 2Down-sampling
Convolution 2256 filters, 5 × 5 kernel, padding 2ReLU (Feature extraction—transferred)
Max Pooling 23 × 3 max pooling, stride 2Down-sampling
Convolution 3384 filters, 3 × 3 kernel, padding 1ReLU (Feature extraction—transferred)
Convolution 4384 filters, 3 × 3 kernel, padding 1ReLU (Feature extraction—transferred)
Convolution 5256 filters, 3 × 3 kernel, padding 1ReLU (Feature extraction—transferred)
Max Pooling 33 × 3 max pooling, stride 2Down-sampling
Fully Connected 64096 neuronsReLU, Dropout (0.5) (High-level feature processing)
Fully Connected 74096 neuronsReLU, Dropout (0.5) (High-level feature processing)
Fully Connected 819 neuronsSoftmax (classification for −9 to +9 scale)
Output19-level probability distributionE-/P-/A-score prediction
Notes. ReLU = Rectified Linear Unit activation function; Dropout = regularization technique to prevent overfitting; E = Evaluation; P = Potency; A = Activity. The final predicted EPA-score is derived by combining the separate outputs of three parallel models independently trained for E-, P, and A-scores.
Table 2. Descriptive statistics of the acoustic characteristics of the soundtracks (n = 432) recorded from air-conditioned built environments in terms of nine conventional acoustic metrics.
Table 2. Descriptive statistics of the acoustic characteristics of the soundtracks (n = 432) recorded from air-conditioned built environments in terms of nine conventional acoustic metrics.
MetricUnit1. Library
(n = 108)
2. Classroom
(n = 108)
3. Lecture Hall
(n = 108)
4. Office
(n = 108)
Total
(n = 432)
Kruskal–Wallis TestPost Hoc Tests
Median (IQR)Median (IQR)Median (IQR)Median (IQR)Median (IQR)p-ValueOrder
LZeqdB66.1 (8.6)72.4 (5.8)72.7 (8.2)64.4 (9.5)68.4 (8.3)<0.0014 < 1 < 2 = 3
LAeqdBA45.1 (7.5)48.6 (8.6)49.4 (8.7)50.0 (7.0)48.4 (8.6)<0.0011 < 2 = 3 = 4
LA10dBA45.5 (7.5)49.2 (9.2)50.0 (8.1)50.3 (7.3)48.9 (8.6)<0.0011 < 2 = 3 = 4
LA50dBA45.1 (7.5)48.5 (8.5)49.3 (9.0)50.0 (7.0)48.4 (8.6)<0.0011 < 2 = 3 = 4
LA90dBA44.7 (7.4)47.9 (8.1)48.9 (9.3)49.6 (7.5)48.0 (8.3)<0.0011 < 2 = 3 = 4
LA10LA90dBA0.75 (0.2)0.81 (0.4)1.1 (0.6)0.70 (0.2)0.79 (0.4)<0.0011 = 4 < 2 = 3
NCNC43 (9)46 (9)47 (8)46 (8)46 (8)<0.0011 < 2 < 3 = 4
NRNR43 (8)46 (9)48 (7)47 (7)46 (9)<0.0011 < 2 < 3 = 4
RCRC42 (8)45 (8)47 (8)47 (8)45 (8)<0.0011 < 2 = 3 = 4
Notes. n = number of soundtracks, LZ = unweighted sound pressure level, LA = A-weighted sound pressure level, NC = noise criterion, NR = noise rating, RC = room criteria, eq = time-equivalent, 5/10/50/90/95 = percentiles of 5/10/50/90/95%, IQR = interquartile range, post hoc tests = Dunn’s tests of an acoustic metric in multiple comparisons between different environmental types.
Table 3. Descriptive statistics of the acoustic characteristics of the soundtracks (n = 432) recorded from air-conditioned built environments in terms of fifteen psychoacoustic metrics.
Table 3. Descriptive statistics of the acoustic characteristics of the soundtracks (n = 432) recorded from air-conditioned built environments in terms of fifteen psychoacoustic metrics.
MetricUnit1. Library
(n = 108)
2. Classroom
(n = 108)
3. Lecture Hall
(n = 108)
4. Office
(n = 108)
Total
(n = 432)
Kruskal–Wallis TestPost Hoc Tests
Median (IQR)Median (IQR)Median (IQR)Median (IQR)Median (IQR)p-ValueOrder
LNphon64.5 (8.7)67.4 (8.7)69.1 (9.6)83.1 (8.1)70.2 (13.5)<0.0011 < 2 < 3 < 4
Nsone5.5 (3.5)6.7 (3.8)7.5 (5.0)19.9 (10.8)8.1 (7.6)<0.0011 < 2 < 3 < 4
N5sone5.7 (3.6)6.9 (4.0)7.9 (4.8)20.7 (11.0)8.4 (7.6)<0.0011 < 2 < 3 < 4
N95sone5.1 (3.3)6.2 (3.6)6.7 (4.9)18.1 (9.9)7.5 (7.0)<0.0011 < 2 = 3 < 4
Sacum1.2 (0.1)0.87 (0.2)1.1 (0.2)1.2 (0.1)1.1 (0.3)<0.0012 < 3 < 1 < 4
S5acum1.3 (0.1)0.95 (0.3)1.2 (0.3)1.3 (0.1)1.2 (0.3)<0.0012 < 3 < 1 < 4
S95acum1.1 (0.1)0.81 (0.2)0.90 (0.3)1.2 (0.1)1.0 (0.3)<0.0012 < 3 < 1 < 4
Rasper0.07 (0.02)0.07 (0.03)0.08 (0.02)0.12 (0.05)0.08 (0.03)<0.0011 = 2 = 3 < 4
R5asper0.11 (0.03)0.11 (0.04)0.12 (0.03)0.19 (0.10)0.12 (0.05)<0.0011 = 2 < 3 < 4
R95asper0.04 (0.01)0.04 (0.02)0.05 (0.01)0.07 (0.03)0.05 (0.02)<0.0011 = 2 = 3 < 4
FSvacil0.01 (0.04)0.03 (0.02)0.02 (0.03)0.06 (0.05)0.03 (0.04)<0.0011 = 3 < 2 < 4
FS5vacil0.02 (0.05)0.04 (0.03)0.03 (0.03)0.08 (0.06)0.04 (0.05)<0.0011 < 3 < 2 < 4
FS95vacil0.005 (0.03)0.02 (0.02)0.007 (0.02)0.05 (0.04)0.02 (0.03)<0.0012 < 1 = 3 < 4
fModHertz135 (48)133 (55)141 (35)89 (70)131 (48)<0.0014 < 1 = 2 = 3
N5N95sone0.64 (0.3)0.78 (0.4)1.1 (0.4)2.0 (1.2)0.96 (0.7)<0.0011 < 2 < 3 < 4
Notes. n = number of soundtracks, 5/10/50/90/95 = percentiles of 5/10/50/90/95%, IQR = interquartile range, N = total loudness, S = sharpness, R = roughness, FS = fluctuation strength, fMod = dominant modulation frequency determined in the calculation of R, post hoc tests = Dunn’s tests of an acoustic metric in multiple comparisons between different environmental types.
Table 4. Cronbach’s α coefficients for subjective responses across four types of air-conditioned built environments.
Table 4. Cronbach’s α coefficients for subjective responses across four types of air-conditioned built environments.
Environment TypeEvaluation (E)Potency (P)Activity (A)EPA
1. Library (n = 108)0.9690.9610.9430.977
2. Classroom (n = 108)0.9290.8770.9450.916
3. Lecture Hall (n = 108)0.9150.8750.8800.872
4. Office (n = 108)0.8880.8290.8630.827
Total (n = 432)0.9340.8970.9180.923
Table 5. Spearman’s rank correlation coefficients between the individual E-, P-, and A-scores and the psychoacoustic machine learning (PML) predictions and the individual acoustic and psychoacoustic metrics.
Table 5. Spearman’s rank correlation coefficients between the individual E-, P-, and A-scores and the psychoacoustic machine learning (PML) predictions and the individual acoustic and psychoacoustic metrics.
VariablePMLLAeqLA10LA50LA90NCNRRCLNNN5N95SS5S95RR5R95fModN5N95
E-score0.770.310.560.560.560.560.550.560.560.380.380.38NANANA0.310.300.30NA0.33
P-score0.800.300.580.580.580.580.580.580.580.370.370.37NANANA0.260.250.26NA0.32
A-score0.57NA0.160.160.160.160.120.140.160.210.210.210.210.170.140.170.180.180.170.17
Notes. n = number of soundtracks, LZ = unweighted sound pressure level, LA = A-weighted sound pressure level, NC = noise criterion, NR = noise rating, RC = room criteria, eq = time-equivalent, 5/10/50/90/95 = percentiles of 5/10/50/90/95%, N = total loudness, S = sharpness, R = roughness, FS = fluctuation strength, fMod = dominant modulation frequency determined in the calculation of R. NA = no significant correlation found.
Table 6. Comparison of Spearman’s rank correlation coefficients ρ and adjusted coefficients of determination (adjusted R2 in parentheses) for predicting occupant well-being related noise impacts (O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable) in the four types of air-conditioned built environment using different assessment methods.
Table 6. Comparison of Spearman’s rank correlation coefficients ρ and adjusted coefficients of determination (adjusted R2 in parentheses) for predicting occupant well-being related noise impacts (O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable) in the four types of air-conditioned built environment using different assessment methods.
Dependent VariablePMLEPACRAEPAGPMO1–4GPMO1–4-PMLGPMO1–4-CRALAeqRCN
O1: Discomfortable
1. Library (n = 108)0.74 (0.53)0.45 (0.18)0.32 (0.10)0.73 (0.53)0.46 (0.20)0.35 (0.11)0.44 (0.21)0.43 (0.16)
2. Classroom (n = 108)0.49 (0.23)0.38 (0.14)0.31 (0.09)0.48 (0.22)0.39 (0.13)0.42 (0.15)0.66 (0.45)0.65 (0.44)
3. Lecture Hall (n = 108)0.66 (0.49)0.71 (0.50)0.57 (0.26)0.65 (0.49)0.71 (0.49)0.64 (0.42)0.40 (0.14)0.41 (0.16)
4. Office (n = 108)0.66 (0.43)0.50 (0.26)0.36 (0.11)0.66 (0.43)0.46 (0.23)0.43 (0.20)0.37 (0.14)0.35 (0.10)
Total (n = 432)0.65 (0.42)0.50 (0.25)0.32 (0.09)0.64 (0.41)0.49 (0.24)0.48 (0.22)0.48 (0.23)0.33 (0.06)
O2: Annoying
1. Library (n = 108)0.78 (0.59)0.45 (0.20)0.42 (0.13)0.70 (0.49)0.44 (0.21)0.38 (0.12)0.39 (0.15)0.36 (0.11)
2. Classroom (n = 108)0.46 (0.19)0.35 (0.12)0.30 (0.08)0.27 (0.08)0.23 (0.05)0.38 (0.12)0.36 (0.11)0.37 (0.12)
3. Lecture Hall (n = 108)0.67 (0.47)0.71 (0.48)0.56 (0.17)0.58 (0.31)0.54 (0.22)0.67 (0.43)0.68 (0.44)0.67 (0.45)
4. Office (n = 108)0.61 (0.36)0.55 (0.30)0.22 (0.05)NANA0.48 (0.23)0.48 (0.24)0.48 (0.21)
Total (n = 432)0.63 (0.40)0.50 (0.25)0.42 (0.16)0.35 (0.13)0.20 (0.04)0.46 (0.20)0.46 (0.21)0.18 (0.01)
O3: Stressful
1. Library (n = 108)0.76 (0.62)0.45 (0.20)0.35 (0.15)0.76 (0.62)0.45 (0.20)0.37 (0.12)0.37 (0.15)0.35 (0.11)
2. Classroom (n = 108)0.60 (0.33)0.46 (0.20)0.25 (0.03)0.60 (0.33)0.46 (0.20)0.46 (0.19)0.44 (0.18)0.46 (0.19)
3. Lecture Hall (n = 108)0.50 (0.27)0.46 (0.24)0.15 (0.02)0.50 (0.27)0.46 (0.24)0.44 (0.21)0.45 (0.22)0.45 (0.24)
4. Office (n = 108)0.61 (0.38)0.50 (0.26)−0.27 (0.08)0.61 (0.38)0.50 (0.26)0.45 (0.21)0.46 (0.22)0.46 (0.20)
Total (n = 432)0.64 (0.41)0.46 (0.22)0.17 (0.03)0.64 (0.41)0.46 (0.22)0.42 (0.17)0.43 (0.18)0.23 (0.02)
O4: Unacceptable
1. Library (n = 108)0.77 (0.61)0.40 (0.15)0.29 (0.08)0.75 (0.57)0.46 (0.22)0.32 (0.09)0.33 (0.11)0.31 (0.07)
2. Classroom (n = 108)0.50 (0.24)0.43 (0.19)0.34 (0.12)0.39 (0.15)0.27 (0.06)0.45 (0.19)0.44 (0.18)0.44 (0.19)
3. Lecture Hall (n = 108)0.69 (0.48)0.65 (0.45)0.65 (0.23)0.56 (0.33)0.49 (0.25)0.66 (0.45)0.68 (0.46)0.66 (0.47)
4. Office (n = 108)0.39 (0.15)0.32 (0.10)0.22 (0.05)0.36 (0.13)0.32 (0.10)0.26 (0.07)0.27 (0.08)0.27 (0.07)
Total (n = 432)0.60 (0.36)0.45 (0.20)0.42 (0.15)0.50 (0.25)0.36 (0.13)0.44 (0.19)0.44 (0.19)0.37 (0.12)
Notes. n = number of soundtracks, CRAEPA = conventional regression approach for EPA-score prediction, PMLEPA = psychoacoustic machine learning for EPA-score prediction, GPMO1–4 = general prediction model for O1O4 prediction, GPMO1–4-PML = hybrid models combining GPMO1–4 and PMLEPA, GPMO1–4-CRA = hybrid models combining GPMO1–4 and CRAEPA, LAeq = A-weighted equivalent continuous sound pressure level, RC = room criteria, N = total loudness, NA = no significant correlation found.
Table 7. Comparison of root mean squared error (RMSE) and confidence intervals (CIs) of predicted occupant well-being-related noise impacts (O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable) in the four types of air-conditioned built environments using different assessment methods.
Table 7. Comparison of root mean squared error (RMSE) and confidence intervals (CIs) of predicted occupant well-being-related noise impacts (O1: Discomfortable, O2: Annoying, O3: Stressful, and O4: Unacceptable) in the four types of air-conditioned built environments using different assessment methods.
Dependent VariablePMLEPACRAEPAGPMO1–4GPMO1–4-PMLGPMO1–4-CRALAeqRCN
O1: Discomfortable (n = 432, Mean = 3.87, SD = 1.52)
RMSE 1.161.321.451.171.331.341.331.47
95% CIs[3.77, 3.99][3.75, 4.00][3.74, 4.01][3.76, 3.99][3.75, 4.00][3.75, 4.00][3.75, 4.00][3.74, 4.01]
O2: Annoying (n = 432, Mean = 3.91, SD = 1.49)
RMSE 1.151.291.361.391.461.331.321.48
95% CIs[3.8, 4.02][3.79, 4.03][3.78, 4.04][3.78, 4.04][3.77, 4.05][3.79, 4.04][3.79, 4.04][3.77, 4.05]
O3: Stressful (n = 432, Mean = 3.70, SD = 1.43)
RMSE 1.101.271.431.101.271.301.291.42
95% CIs[3.59, 3.8][3.58, 3.82][3.56, 3.83][3.59, 3.08][3.58, 3.82][3.57, 3.82][3.57, 3.82][3.56, 3.83]
O4: Unacceptable (n = 432, Mean = 3.56, SD = 1.53)
RMSE1.171.321.411.261.381.361.351.51
95% CIs[3.44, 3.67][3.43, 3.69][3.42, 3.70][3.44, 3.68][3.42, 3.69][3.43, 3.69][3.43, 3.69][3.41, 3.71]
Notes. n = number of soundtracks, CRAEPA = conventional regression approach for EPA-score prediction, PMLEPA = psychoacoustic machine learning for EPA-score prediction, GPMO1–4 = general prediction model for O1O4 prediction, GPMO1–4-PML = hybrid models combining GPMO1–4 and PMLEPA, GPMO1–4-CRA = hybrid models combining GPMO1–4 and CRAEPA, LAeq = A-weighted equivalent continuous sound pressure level, RC = room criteria, N = total loudness.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ma, K.W.; Mak, C.M.; Chung, F.-L.; Wong, H.M. Evaluation of Psychoacoustic Machine Learning Assessment Method for Predicting Occupant Well-Being. Buildings 2026, 16, 1027. https://doi.org/10.3390/buildings16051027

AMA Style

Ma KW, Mak CM, Chung F-L, Wong HM. Evaluation of Psychoacoustic Machine Learning Assessment Method for Predicting Occupant Well-Being. Buildings. 2026; 16(5):1027. https://doi.org/10.3390/buildings16051027

Chicago/Turabian Style

Ma, Kuen Wai, Cheuk Ming Mak, Fu-Lai Chung, and Hai Ming Wong. 2026. "Evaluation of Psychoacoustic Machine Learning Assessment Method for Predicting Occupant Well-Being" Buildings 16, no. 5: 1027. https://doi.org/10.3390/buildings16051027

APA Style

Ma, K. W., Mak, C. M., Chung, F.-L., & Wong, H. M. (2026). Evaluation of Psychoacoustic Machine Learning Assessment Method for Predicting Occupant Well-Being. Buildings, 16(5), 1027. https://doi.org/10.3390/buildings16051027

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop