Skip Content
You are currently on the new version of our website. Access the old version .
JCMJournal of Clinical Medicine
  • Article
  • Open Access

29 January 2026

Interpretable Acoustic Features from Wakefulness Tracheal Breathing for OSA Severity Assessment

,
,
and
1
Biomedical Engineering Program, University of Manitoba, Winnipeg, MB R3T 5V6, Canada
2
Multidisciplinary Alfred Psychiatry Research Centre, Monash University, Melbourne, VIC 3004, Australia
3
Riverview Health Center, University of Manitoba, Winnipeg, MB R3L 2P4, Canada
4
Department of Electrical and Computer Engineering, University of Manitoba, Winnipeg, MB R3T 5V6, Canada
This article belongs to the Special Issue Sleep in Neurology—Sleep Disorders and Dysfunction of the Nervous System

Abstract

Background: Obstructive Sleep Apnea (OSA) is one of the most prevalent sleep disorders associated with cardiovascular complications, cognitive impairments, and reduced quality of life. Early and accurate diagnosis is essential. The present gold standard, polysomnography, is expensive and resource-intensive. This work develops a non-invasive machine-learning-based framework to classify four OSA severity groups (non, mild, moderate, and severe) using tracheal breathing sounds (TBSs) and anthropometric variables. Methods: A total of 199 participants were recruited, and TBS were recorded whilst awake (wakefulness) using a suprasternal microphone. The workflow included the following steps: signal preprocessing (segmentation, filtering, and normalization), multi-domain feature extraction representing spectral, temporal, nonlinear, and morphological features, adaptive feature normalization, and a three-stage feature selection that combined univariate filtering, Shapley Additive Explanations (SHAP)-based ranking, and recursive feature elimination (RFE). The classification included training ensemble learning models via bootstrap aggregation and validating them using stratified k-fold cross-validation (CV), while preserving the OSA severity and anthropometric distributions. Results: The proposed framework performed well in discriminating among OSA severity groups. TBS features, combined with anthropometric ones, increased classification performance and reliability across all severity classes, providing proof for the efficacy of non-invasive audio biomarkers for OSA screening. Conclusions: TBS-based model’s features, coupled with anthropometric information, offer a promising alternative or supplement to PSG for OSA severity detection. The approach provides scalability and accessibility to extend screening and potentially enables earlier detection of OSA, compared to cases that might remain undiagnosed without screening.

1. Introduction

Obstructive sleep apnea (OSA) is a common yet underdiagnosed sleep-related breathing disorder affecting nearly 20% of adults in North America and linked to cardiovascular disease, hypertension, diabetes, and increased perioperative risk [1,2]. Despite its prevalence, up to 80% of cases remain undiagnosed [3], creating primary healthcare and economic burdens. OSA arises from recurrent upper airway obstruction during sleep, and its severity is classified by the apnea-hypopnea index (AHI) [4]. While polysomnography (PSG) remains the diagnostic gold standard [5], it is costly, time-intensive, and often inaccessible. Screening tools such as STOP-Bang and Berlin questionnaires provide high sensitivity but low specificity, leading to frequent misclassification of OSA status (i.e., false positives) [6,7].
Recent advances in biomedical signal analysis offer promising alternatives. For instance, tracheal breathing sounds (TBSs) recorded during wakefulness have been shown to contain distinctive acoustic markers related to upper airway physiology [8,9,10,11,12,13,14,15,16]. Studies using power spectral, bispectral, and fractal analyses, and more recently, machine learning (ML) models, have demonstrated strong potential for OSA detection [8,9,10,11,12,13,14,15,16]. However, a significant gap remains between the extraction of acoustic features and their clinical interpretation. While many studies report statistically significant differences in signal characteristics, the physiological meaning of these features and their relationships with airway mechanics, airflow resistance, and neuromuscular control remain poorly understood [17,18].
Bridging this gap is essential to translate signal-based metrics into clinically interpretable and actionable tools. This study focuses on interpreting acoustic features extracted from wakefulness TBS across different OSA severity groups. By analyzing spectral power, bispectral coupling, and fractal dimensions, we explore how acoustic signatures reflect physiological mechanisms underlying airway obstruction. This approach aims to link quantitative signal analysis with clinical interpretation, supporting the development of objective, accessible, and scalable OSA screening.

2. Materials and Methods

In this study, we applied our previously validated workflow [8] for acquiring, preprocessing, and analyzing wakefulness tracheal breathing sounds (TBSs). Participants were recruited from individuals referred for overnight PSG, representing a clinically enriched cohort with elevated pre-test probability of OSA. Suprasternal TBS recordings were collected under controlled conditions: subjects were positioned supine and instructed to perform five full deep breaths through the nose with the mouth closed, followed by five deep breaths through the mouth while wearing a nose clip using a Sony ECM-77B, Tokyo, Japan omnidirectional condenser microphone (sensitivity: −52 dB ± 3.5 dB, frequency response: 40 Hz–20 kHz). Snoring history was not collected via subject self-report as part of the anthropometric questionnaire, and no snoring events were present in the wakeful breathing recordings analyzed in this study. Table 1 presents the distribution of subjects in the dataset by anthropometric features. Preprocessing followed our established procedures, including artifact inspection, adaptive segmentation of inspiration and expiration, and bandpass filtering to isolate physiological components. We then extracted a comprehensive set of spectral, nonlinear, fractal, morphological, and time-frequency features using the same methods detailed in [8]. These features were optimized for 1-vs-1 subgroup analyses to improve the interpretability and personalization of acoustic biomarkers. The complete workflow is summarized in Figure 1.
Table 1. Participants’ Severity Groups and Anthropometric Information. AHI: apnea–hypopnea index, BMI: body mass index, NC: neck circumference, MPS: Mallampati score, M/F: male/female.
Figure 1. Overview of the machine learning pipeline for wakefulness-based tracheal breathing sound analysis and OSA severity classification.
Preprocessing included careful inspection for background noise and vocal artifacts to be excluded, segmenting breathing sound signals into inspiratory/expiratory phases using adaptive thresholding of the log-variance envelope and Signal-to-Noise Ratio (SNR) computation, and bandpass filtering (75–3000 Hz, 4th-order Butterworth) to discard extraneous physiological and ambient signals [8,11,12]. Filtered signals were subsequently normalized using automated methods (mean-range scaling, z-score, min-max, and robust scaling) with mutual information to maximize feature-label dependency [8]. Then, a feature extraction method was applied to each processed mid-flow signal; the methodology spans multiple analytical domains, including spectral, temporal, and nonlinear analyses, as well as cross-domain analyses, ensuring a holistic, multidimensional representation of linear and nonlinear signal dynamics. The extracted features are grouped and explicitly optimized for 1-vs-1 labels [8]. This group-specific feature selection process enables the creation of personalized feature sets that enhance model robustness and improve interpretation for diagnostic and predictive applications [8]. The following features have been extracted:
  • Spectral features: Power spectrum density via Welch’s method, spectral centroid, entropy, kurtosis, bandwidth, flux, and crest metrics [19].
  • Bispectral features: Bootstrap-based confidence interval detection of nonstationary gaps and coupling metrics [20].
  • Fractal and nonlinear features: Hurst exponent, Lyapunov exponent, Recurrence Quantification Analysis (RQA), Katz and Higuchi fractal dimensions [21,22,23].
  • Wavelet and time-frequency features: Wavelet coefficients, Mel-Frequency Cepstral Coefficients (MFCCs), Constant-Q Transform statistics [24,25,26].
  • Morphological features: Image-based representation of spectrogram and bispectrum (bounding box area, holes, connected components, Euler number, contrast/homogeneity/correlation/energy descriptors) [20,27,28,29].
  • Time-domain metrics: Zero-crossing rate, root mean square, shimmer, jitter, and noise-to-harmonics ratio [30,31,32].
To identify stable and physiologically meaningful predictors of OSA severity, we applied the same three-stage feature selection framework described in [8], consisting of univariate statistical filtering, SHAP-based feature ranking, and RFE. Within this framework, the final feature subset was defined as the minimum number of features that preserved consistent performance across cross-validation folds while maintaining feature stability and physiological interpretability, as established in [8]. A further reduction in the feature set beyond this subset was previously shown to increase performance variability and reduce robustness; therefore, no additional feature pruning was applied in the present study. Model evaluation was conducted using a custom stratified k-fold CV scheme specifically designed to preserve the joint distribution of OSA severity labels and key anthropometric risk factors, including age, body mass index (BMI), neck circumference, sex, and Mallampati score [12]. Appendix A presents a summary of the top 35 selected features for each model, based on the extracted features.
As illustrated in Figure 2, the whole dataset is first partitioned into k folds, ensuring each fold has approximately equal representation of OSA severity classes (Non, Mild, Moderate, and Severe) and comparable distributions of the selected anthropometric variables. Rather than stratifying solely by OSA severity, the proposed strategy employs multi-criteria stratification to ensure that clinically relevant subgroups are consistently represented in both training and validation sets. This approach reduces sampling bias caused by population heterogeneity and yields a more reliable estimate of generalization performance across physiologically diverse subjects. Also, severity stratification followed standard AHI-based clinical definitions to preserve physiological granularity and enable interpretation of progressive airway dysfunction beyond binary disease detection. Table 2 shows the distribution of subjects’ anthropometric data of the k-fold splits.
Figure 2. Schematic illustration of the proposed stratified k-fold cross-validation strategy. The full dataset is partitioned into k folds while preserving the proportional distribution of OSA severity classes (Non, Mild, Moderate, Severe) and key anthropometric risk factors (age, BMI, neck circumference, sex, and Mallampati score). Each fold maintains comparable joint distributions, ensuring balanced subgroup representation during training and validation and minimizing bias due to population heterogeneity.
Table 2. Participants’ Severity Groups and Anthropometric Information for K-folds. AHI: apnea–hypopnea index, BMI: body mass index, NC: neck circumference, MPS: Mallampati score, M/F: male/female.
To evaluate the discriminative power and physiological relevance of tracheal sound features, several complementary metrics were used. Area Under the Receiver Operating Characteristic (ROC) Curve (AUC) quantified each feature’s ability to distinguish between OSA severity groups, with higher values indicating stronger discrimination [33]. Pearson correlation coefficients measured the linear association between feature values and true labels, providing insight into how consistently a feature reflects the clinical outcome across cross-validation folds [34]. To assess robustness and generalizability, Absolute Delta AUC (AbsDeltaAUC) was calculated as the absolute difference between the training and test AUCs, with smaller values indicating more stable features that are less sensitive to data variability [35]. Finally, SHAP quantified each feature’s contribution to model predictions, while accounting for interactions with other features, thereby enhancing interpretability by highlighting physiologically meaningful patterns [36]. Together, these metrics enable ranking of features by both their discriminative ability and stability, supporting the identification of robust biomarkers that link acoustic and morphological descriptors to airway dynamics, airflow turbulence, and anatomical variations associated with OSA. As shown in Figure 3, the proposed feature evaluation framework integrates analyses of discriminative, correlational, stability, and explainability to identify physiologically relevant tracheal-sound features in OSA.
Figure 3. Framework for evaluating and interpreting tracheal breathing sound features in obstructive sleep apnea (OSA). Extracted acoustic and morphological features are analyzed for discriminative power, consistency with clinical variables, robustness across cross-validation folds, and model explainability. The results are integrated into a multidimensional feature ranking, producing tables, plots, and visual maps that link sound descriptors to airway dynamics, airflow turbulence, and anatomical variability across OSA severity levels.
The final analytic framework integrates feature-level metrics, stability measures, and explainability-driven insights to produce tables and visualizations that highlight the most physiologically relevant acoustic and morphological features across OSA severity groups. Rather than focusing solely on classification performance, the analysis emphasizes how each feature contributes to model predictions and relates to underlying airway physiology. Ranked lists based on AUC [33], fold-wise stability (AbsDeltaAUC) [35], and SHAP values [36] provide a multidimensional perspective on feature importance, while correlation analyses link these features to clinical and anthropometric variables, including AHI, neck circumference (NC), and Mallampati Score (MPS) [34]. Image-based and spectro-temporal feature maps illustrate changes in sound texture, frequency patterns, and event shapes, revealing airflow turbulence, intermittent obstruction, and variations in airway mechanics. By combining quantitative metrics with visual interpretations, this framework transforms raw signal descriptors into clinically meaningful biomarkers, enhancing understanding of upper-airway dynamics, airflow irregularities, and anatomical risk factors associated with OSA severity [37].

3. Results

This section presents the key findings from the feature extraction and selection pipeline, highlighting the most discriminative tracheal-breathing-sound features for OSA severity classification. Analyses were conducted across six 1-vs-1 base models (Non-OSA vs. Mild, Non-OSA vs. Moderate, Non-OSA vs. Severe, Mild vs. Moderate, Mild vs. Severe, Moderate vs. Severe) and three folds of a custom stratified cross-validation, designed to preserve the joint distribution of severity groups and key anthropometric factors. Feature importance was assessed using both correlation-based ranking and SHAP values to identify consistently essential features. The top selected features for each model are detailed in Appendix A.
The models and selected features are empirical; their exact frequency bands or characteristics may differ for other datasets depending on the sensor used (e.g., different microphones). To keep feature names readable, we have categorized them by main characteristics, such as spectral or bispectral features, breathing type (mouth or nose), and phase (inspiration or expiration). The frequency regions from which the features were extracted are based on the 95% confidence interval of the training set, as proposed in our previous work [8]. For example, Bispectral_Centroid_Mean represents the mean bispectral energy centroid across all breathing conditions. Similarly, Spectral_Skewness_Mouth_Inspiration captures the skewness of the spectral distribution during mouth inspiration, and Spectral_FrequencyRatio_Mouth_Expiration represents the frequency ratio feature during mouth expiration. The detailed definitions of these features, including what a bounding box (BBox) is, the specific coordinates or frequency/time ranges, and the corresponding breathing conditions, are provided elsewhere (e.g., in a footnote, appendix, or table legend). This approach ensures that the main text remains readable while maintaining reproducibility and technical clarity. This general naming approach avoids dataset-specific details, makes the features more interpretable for readers outside the team, and preserves the essential information on how each feature was derived (Figure 4).
Figure 4. Representative examples of gap regions identified between two classes based on non-overlapping 95% confidence intervals. (a) Power spectral density (PSD): the green shaded areas indicate frequency bands where the 95% confidence intervals of the average PSD for Moderate-OSA and Severe-OSA do not overlap, defining statistically significant spectral gaps between the two classes. (b) Bispectrum: red boxes indicate time–frequency regions in the bispectral domain where the 95% confidence intervals of the averaged bispectrum for Moderate-OSA and Severe-OSA do not overlap. The bispectrum, a third-order spectral measure, captures quadratic phase coupling and nonlinear interactions between frequency components; gap regions therefore indicate class-specific differences in nonlinear spectral coupling.
For detailed clinical interpretation of top features, a specific appendix (Appendix B) is dedicated to these details, while for detailed comparison-specific results, all supporting tables and figures are provided in the appendices: Appendix C (Non-OSA vs. Mild OSA), Appendix D (Non-OSA vs. Moderate OSA), Appendix E (Non-OSA vs. Severe OSA), Appendix F (Mild vs. Moderate OSA), Appendix G (Mild vs. Severe OSA), and Appendix H (Moderate vs. Severe OSA). Each appendix includes the top-ranked features by test AUC, the most stable features by AbsDeltaAUC, and the strongest anthropometric- and AHI-associated features for the corresponding comparison. Higher AUC values indicate greater discrimination between the two severity groups. In contrast, smaller AbsDeltaAUC values suggest more consistent feature performance across cross-validation folds, reflecting reduced variability in AUC estimates across data partitions. Correlation analyses with anthropometric variables and AHI provide additional insight into potential physiological relevance and relationships with clinical severity. The following subsections focus on the clinical interpretation of the most consistently supported features across these comparisons.

3.1. Clinical Interpretation of Top Features

To provide a physiologically grounded interpretation of the observed acoustic differences, we adopt a Structure–Function–Symptom framework. In this narrative, anatomical and structural characteristics of the upper airway (Structure), such as tissue compliance, airway narrowing, and fat deposition, influence airflow behavior during breathing (Function), including turbulence, nonlinear coupling, and ventilatory instability. These functional alterations manifest clinically as differences in apnea–hypopnea burden and disease severity (Symptom), quantified by the AHI. The following interpretations therefore explain how each significant acoustic feature reflects a structural-functional pathway underlying OSA progression.
This subsection provides a clinical interpretation of the most discriminative features, prioritized based on agreement across AUC ranking, AbsDeltaAUC, and SHAP importance. The goal is to link key acoustic and spectro-temporal descriptors to potential physiological and airflow changes associated with early manifestations of sleep-disordered breathing. Table 3 shows an overview of the clinical interpretation of features across different models. In contrast, more detailed feature-by-feature interpretations are provided in Appendix B. Collectively, this analysis facilitates a clearer understanding of how specific acoustic patterns may reflect underlying upper airway dynamics and disease progression. Furthermore, aligning model-derived features with known clinical mechanisms enhances the interpretability and translational relevance of the proposed framework.
Table 3. Structure–Function–Symptom Interpretation of Dominant Acoustic Features Across OSA Severity Comparisons.

3.2. Top-Ranked Features

The top 10 features, identified by their overall average rank across both correlation- and SHAP-based ranking methods, are presented in Table 4. These features consistently demonstrated high importance in distinguishing between different OSA severity groups. The top features include a mix of spectral, temporal, and morphological characteristics of the tracheal breathing sounds, as shown in Table 4. Notably, features related to spectral bandwidth (range of frequencies contributing to the signal), texture energy (quantifies uniformity and repetitiveness of bispectral patterns), spectral flux (measures frame-to-frame changes in the power spectrum), and statistical moments (mean, standard deviation, kurtosis, skewness) consistently appear among the most essential features. These features are robust across cross-validation folds and severity comparisons, indicating that they reliably capture physiologically relevant changes in airflow dynamics and turbulence. These features capture various aspects of the sound signal, including its frequency distribution, temporal dynamics, and overall intensity and complexity.
Table 4. Top 10 Ranked Tracheal Breathing Sound Features for OSA Severity Classification (Lower is better).

3.3. Feature Stability Across Folds and Models

The top 10 most stable tracheal breathing sound features were identified based on the lowest absolute differences between training and testing Area Under the Curve (AUC) values, as shown in Table 5. These features demonstrate minimal variability across different data splits, highlighting their robustness and consistency in discriminating OSA severity. Lower absolute Delta AUC values indicate that the predictive power of these features is reliably maintained across training and test datasets. In contrast, lower absolute Delta Corr values reflect more consistent correlations between the features and clinical measurements, indicating stable physiological relevance across folds. Collectively, these metrics suggest that the selected features are both robust and physiologically meaningful, making them strong candidates for inclusion in predictive models and clinical decision-support systems.
Table 5. Top 10 Most Stable Tracheal Breathing Sound Features. Lower Absolute Delta AUC indicates higher feature stability across cross-validation folds, while Absolute Delta Correlation reflects the consistency of feature correlation. Features are considered stable/significant if Abs Delta AUC ≤ 0.01 and Abs Delta Correlation ≤ 0.1, reflecting minimal variation and robust performance across folds. Features with Abs Delta Correlation between 0.1 and 0.15 are considered slightly unstable but still statistically acceptable, as this slight variation does not substantially affect their overall predictive reliability.

3.4. Correlation with Anthropometric Data

Strong associations were observed between several extracted tracheal-sound features and key clinical anthropometric measurements relevant to OSA, including BMI, NC, Sex, and MPS. These relationships, summarized in Table 6, highlight the clinical relevance of the acoustic and spectro-temporal descriptors identified in this study. The table emphasizes the features most strongly correlated with anthropometric parameters, providing insight into the physiological underpinnings of OSA severity and supporting their potential utility in predictive modeling.
Table 6. Top 10 Pearson correlations between breathing sound features and anthropometric measures (NC, BMI, Sex, MPS). Each row indicates which feature is correlated with which anthropometric variable, along with the fold and severity comparison.
Pearson correlation coefficients between acoustic features and anthropometric variables were computed independently within each cross-validation fold, rather than on pooled data. For each severity comparison, correlations were calculated using the fold-specific data subset, and the corresponding fold index is explicitly reported in Table 6.
These correlations were used exclusively for interpretability and physiological analysis and did not influence model training, feature selection, or classifier optimization. As such, they should not be interpreted as estimates of generalization performance, but rather as indicators of strong fold-specific associations between acoustic characteristics and anthropometric measures.

4. Discussion

This study aimed to identify interpretable features from wakefulness tracheal breathing sounds that are clinically relevant for assessing OSA severity. The consistent emergence of specific features across different models and folds, coupled with their stability and correlation with anthropometric data, underscores their potential as robust biomarkers for OSA [8,9,10,11,12,13,17,18,38].

4.1. Clinical Relevance of Key Features

Interpreting acoustic biomarkers through a structure–function–symptom lens enables a mechanistic understanding of how anatomical vulnerability of the upper airway translates into altered airflow dynamics and ultimately manifests as increasing OSA severity. OSA manifests through complex interactions between the upper airway anatomy, airflow turbulence, and respiratory control. Identifying features from tracheal breathing sounds recorded whilst awake that reliably reflect these physiological processes is crucial for non-invasive assessment. This section focuses on clinically meaningful indicators of OSA severity [39,40,41].

4.1.1. Non-OSA vs. Mild-OSA

From a structure–function–symptom perspective, early anatomical vulnerability of the upper airway (Structure), including mild tissue compliance and partial narrowing, leads to subtle functional airflow disturbances (Function), characterized by intermittent turbulence and disrupted nonlinear airflow–tissue coupling during inspiration. These functional alterations manifest clinically (Symptom) as mild elevations in AHI without sustained airway obstruction. The distinction between Non-OSA and Mild-OSA is characterized by the emergence of subtle yet consistent early signs of the upper airway instability during breathing whilst awake. These changes are captured by features such as MouthInspiration_Range_FreqSkewness, Average_BBox_TextureEnergy, and Average_BBox_FrequencyCentroidX, which quantify shifts in spectral energy, disruption of structured bispectral coupling, and changes in dominant frequency interactions, respectively (see Appendix B.1 for full feature definitions and physiological interpretation). The observed patterns indicate a transition from predominantly laminar airflow toward intermittently turbulent inspiratory flow, consistent with early upper-airway collapsibility and soft-tissue vibration [38,40,42]. These changes suggest the onset of periodic flow limitation without sustained obstruction, aligning with early physiological manifestations of mild OSA described in prior studies [16,41].
Overall, mild OSA is marked not by significant increases in breathing sound intensity but by early disruption of airflow regularity and spectro-temporal organization, as captured by these spectral and bispectral features. This finding supports the concept that the earliest stage of OSA manifests primarily as micro-instability and intermittent turbulence rather than overt obstruction, reinforcing the value of wakeful acoustic markers for early detection [16,43,44].

4.1.2. Non-OSA vs. Moderate-OSA

Within the structure–function–symptom framework, progressive anatomical narrowing and reduced airway stiffness (Structure) produce sustained airflow limitation and elevated inspiratory effort (Function), resulting in prolonged turbulent breathing events. Clinically (Symptom), these changes correspond to a clear increase in AHI and more frequent obstructive events, consistent with moderate OSA. In contrast to mild OSA, the transition from Non-OSA to Moderate-OSA reveals a clear escalation in airflow disturbance and respiratory effort. These changes are captured by features such as Average_Range_Maximum, Average_BBox_BoundingBoxDiagonal, Average_Range_MeanPower, and Average_BBox_ConnectedComponents, which quantify peak breathing sound energy, expansion of nonlinear bispectral interactions, overall sound intensity, and fragmentation of coupling patterns, respectively (see Appendix B.2 for full feature definitions and physiological interpretation). The results demonstrate stronger, more sustained turbulent breathing events, reflecting prolonged partial airway collapse and increased inspiratory drive [13,39,45]. Breathing sounds become more energetic and fragmented, consistent with repetitive cycles of obstruction and compensatory recovery.
These acoustic characteristics indicate that moderate OSA is physiologically defined by persistent airflow instability rather than isolated abnormalities. The increased duration, intensity, and fragmentation of breathing events align with established descriptions of heightened airway collapsibility and more frequent arousal-related breathing responses in moderate disease [46,47,48].

4.1.3. Non-OSA vs. Severe-OSA

Structurally, severe OSA is characterized by pronounced upper-airway collapsibility and reduced neuromuscular compensation. Functionally, this leads to chaotic airflow, repeated collapse–reopening cycles, and highly nonlinear breathing dynamics. These functional disturbances manifest clinically as high AHI values (Symptom), reflecting frequent apneic and hypopneic events. Severe OSA exhibits a markedly distinct acoustic phenotype, dominated by chaotic, high-energy, and highly irregular breathing patterns. These changes are captured by features such as Average_BBox_MeanValue, Average_BBox_TextureEnergy, Average_BBox_FractalDimension, Average_BBox_EnergyValue, and Average_BBox_KurtosisValue, which quantify average sound intensity, heterogeneity of bispectral coupling, complexity of local patterns, total sound energy, and prevalence of abrupt or impulsive events, respectively (see Appendix B.3 for complete feature definitions and physiological interpretation). The results indicate frequent and intense airflow collapse followed by forceful recovery breaths, producing complex and impulsive acoustic events across a broad frequency range [12,38,40]. The pronounced variability and structural disruption observed are consistent with unstable ventilatory control and recurrent airway obstruction.
Physiologically, these findings reflect deep upper-airway collapsibility, exaggerated negative pressure swings, and repeated collapse–reopening cycles characteristic of advanced OSA [41,46,49]. The elevated complexity and unpredictability of the acoustic patterns are in line with prior reports linking severe disease to chaotic airflow and disordered breathing mechanics [50,51,52].

4.1.4. Mild-OSA vs. Moderate-OSA

In structure–function–symptom terms, the transition from mild to moderate OSA reflects worsening anatomical compromise of the airway (Structure), which shifts airflow behavior from intermittent to persistent instability (Function). This progression manifests clinically (Symptom) as a sustained increase in AHI and reduced effectiveness of compensatory airway control. The progression from mild to moderate OSA represents a shift from intermittent airflow disturbance to more persistent and structurally disruptive obstruction. These changes are captured by features such as Average_BBox_MeanValue, Average_BBox_TextureEnergy, Average_BBox_FractalDimension, Average_BBox_EnergyValue, and Average_BBox_KurtosisValue, which quantify average sound intensity, heterogeneity of bispectral coupling, complexity of local patterns, total sound energy, and the prevalence of sharp or impulsive events, respectively (see Appendix B.4 for complete feature definitions and physiological interpretation). The results indicate increasing turbulence during both inspiration and expiration, accompanied by broader spectral involvement and greater fragmentation of breathing sounds [12,38,44,53]. This suggests that airflow irregularities are no longer isolated but sustained throughout the respiratory cycle.
Clinically, this transition reflects worsening airway collapsibility and reduced effectiveness of neuromuscular compensation during wakefulness. Moderate OSA therefore emerges as a state in which airflow instability becomes chronic rather than episodic, consistent with physiological models of disease progression [9,17,40,50].

4.1.5. Mild-OSA vs. Severe-OSA

Here, structural airway vulnerability becomes dominant (Structure), overwhelming compensatory mechanisms. Functionally, this produces highly variable, noisy, and energetically intense airflow patterns. Clinically (Symptom), these effects correspond to severe OSA, marked by large AHI values and pronounced breathing instability. Comparisons between mild and severe OSA highlight a pronounced escalation in airflow irregularity, respiratory effort, and acoustic unpredictability. These changes are captured by features such as MouthInspiration_BBox_FrequencyCentroidX, Average_Average_BBoxes_Entropy, Average_Range_RMS, Average_BBox_EnergyValue, and MouthExpiration_Range_SpectralEnergy, which quantify shifts in dominant frequencies, overall entropy of bispectral patterns, amplitude variability, total sound energy, and broadband spectral energy, respectively (see Appendix B.5 for complete feature definitions and physiological interpretation). The results reveal prolonged and noisy inspiratory phases, increased breath-to-breath variability, and intense turbulent bursts extending into expiration [38,44,45]. These patterns indicate a breakdown of compensatory airway control mechanisms that remain partially effective in mild disease.
From a physiological standpoint, severe OSA is characterized by loss of airflow stability, where collapsibility dominates over neuromuscular control. The marked increases in variability and turbulence observed align with descriptions of unstable ventilatory control and repeated collapse–recovery dynamics in severe disease [18,38,44,48].

4.1.6. Moderate-OSA vs. Severe-OSA

From a structure–function–symptom standpoint, severe OSA represents a qualitative shift rather than a linear extension of moderate disease: deeper structural collapse and airway instability (Structure) lead to near-chaotic airflow dynamics (Function), which clinically manifest (Symptom) as extreme AHI values and frequent obstructive episodes. The transition from moderate to severe OSA is marked by a qualitative shift from structured instability to near-chaotic airflow dynamics. These changes are captured by features such as Average_BBox_MedianValue, Average_BBox_IQRValue, Average_BBox_EnergyValue, Average_BBox_KurtosisValue, Average_BBox_Compactness, Average_BBox_StdValue, and Average_BBox_EntropyValue, which quantify overall breathing sound intensity, central and total variability, abrupt peaks, diffusion of high-intensity regions, dispersion, and randomness of airflow-related acoustic patterns, respectively (see Appendix B.6 for full feature definitions and physiological interpretation). The results indicate greater breath-to-breath variability, stronger and more erratic respiratory effort, and increasingly diffuse turbulent sound patterns [39,41,48]. Acoustic events become less compact and more topologically complex, reflecting deeper and more frequent airway collapse.
These findings suggest that severe OSA represents not merely an amplification of moderate disease but a distinct physiological regime characterized by unpredictable airflow, unstable arousal responses, and diminished airway resilience [18,38,44,50,54]. This distinction supports the clinical importance of separating moderate and severe OSA in severity stratification and management.

4.1.7. Physiological Themes Across Models

Across severity comparisons, certain recurring acoustic patterns reflect underlying physiological mechanisms of OSA. By examining features related to turbulence, airflow complexity, variability, and energy, we can identify consistent markers of airway instability, vibration, and compensatory respiratory effort. The following themes summarize how these features may collectively capture the progression of OSA.
  • Escalating turbulence, bandwidth, and centroid shifts correspond to rising Reynolds number and more pronounced vibration/snoring as the airway narrows [44].
  • Event complexity (diagonals, perimeters, shape metrics): track segmented, irregular airflow fragments as OSA severity increases [43].
  • Variability (Interquartile range (IQR), Standard Deviation (SD), entropy) reveals unstable ventilatory control, frequent arousals, and abrupt collapse–recovery dynamics [43].
  • Amplitude/energy (mean, RMS, total) reflect increasing respiratory effort, loud post-obstructive inspiration, and compensatory surges in disease progression.
Each feature thus provides a physiologic aspect into how OSA disrupts the upper airway patency, generates turbulence and vibration, and drives instability and variability across both models [43,44].

4.2. Rationale for Multi-Class OSA Severity Stratification

While binary OSA classification (OSA vs. non-OSA) is common in screening-oriented studies, it does not capture the progressive and heterogeneous nature of the upper-airway dysfunction. Clinically defined AHI severity categories (mild, moderate, and severe) reflect distinct physiological states, including differences in airway collapsibility, airflow turbulence, ventilatory compensation, and symptom burden.
In this study, several acoustic features exhibited nonlinear or stage-specific behavior across severity levels, particularly in comparisons involving mild-to-moderate and moderate-to-severe transitions. Collapsing these groups into a binary framework would obscure intermediate phenotypes and reduce sensitivity to early or transitional disease mechanisms. By adopting a four-class framework, the proposed model preserves physiologically meaningful distinctions and enables severity-aware interpretation aligned with clinical risk stratification, perioperative assessment, and treatment decision-making.

4.3. Physiological and Clinical Interpretation of Feature Linkage to Severity

The high-ranking and stable features are not merely statistical constructs; they are direct acoustic manifestations of the anatomical and functional changes associated with progressive OSA severity. Tracheal sounds are generated by turbulent airflow, and their characteristics are susceptible to subtle changes in airway geometry, collapsibility, and compensatory respiratory effort, even during wakefulness [8,11].
Although all acoustic recordings in this study were obtained whilst awake under quiet breathing conditions and therefore do not contain snoring events, snoring history remains an important symptom associated with OSA severity. Chronic snoring reflects repetitive vibration of upper-airway soft tissues during sleep, which has been hypothesized to contribute to long-term structural changes such as tissue remodeling, inflammation, or altered compliance. These chronic structural modifications may persist beyond sleep and subtly influence airflow behavior and tracheal breathing acoustics even during wakefulness. Consequently, while the extracted acoustic features do not represent snoring sounds per se, they may indirectly reflect cumulative airway alterations associated with both OSA severity and a history of habitual snoring. Future studies incorporating quantitative snoring indices alongside wakeful and sleep-state recordings may further clarify this relationship.

4.3.1. Acoustic Signatures of Airway Chaos and Ventilatory Effort (AHI Correlation)

The correlations with the AHI provide the most direct clinical linkage. The following summarizes the correlations of features with AHI:
  • Decreased Texture Energy (Acoustic Disorganization): The feature exhibits the strongest negative correlation with OSA severity. Texture Energy is a quantitative measure of the uniformity and repetitiveness of local patterns in a spectrogram, computed by summing the squared values of the co-occurrence or filtered spectrogram matrix, reflecting how consistent and regular the acoustic structure is. As severity increases, the pharyngeal airway becomes intrinsically more compliant and prone to intermittent vibration and collapse, leading to flow separation and highly random, broadband turbulence. This shift from structured, laminar-like noise to chaotic, broadband turbulence disrupts the consistency of the spectrogram, resulting in a significant decrease in texture energy. This feature, therefore, serves as a powerful acoustic marker of increasing pharyngeal instability and vulnerability [39,42].
  • Increased Skewness (Compensatory Drive): Conversely, the high positive correlation (r ≈ 0.99) between spectral skewness and OSA severity indicates systematic changes in the distribution of sound amplitude. Positive skewness signifies a heavier tail toward high-amplitude values. Physiologically, this represents the subject’s increased reliance on intermittent, high-force maneuvers (such as a forceful, highly turbulent inhalation or a loud snort/gasp) to maintain adequate flow against increasing pharyngeal resistance. Clinically, this feature is an acoustic signature of heightened respiratory drive and compensatory effort, which scales directly with disease burden [41,45].

4.3.2. Morphological and Spectral Markers of Flow Limitation and Airway Dynamics

The consistently top-ranked spectral and morphological features provide a detailed view of the fluid dynamics within the compromised airway.
  • Spectral Bandwidth and Flux (Venturi Effect): These features are crucial markers of dynamic flow behavior. Airflow acceleration through a narrow, compliant pharyngeal segment (the site of flow limitation, a manifestation of the Venturi effect) generates high-velocity jets. The high spectral flux reflects the rapid, transient changes in the power spectrum as these turbulent jets form and dissipate during the breathing cycle. In contrast, increased bandwidth reflects a broader spread of acoustic energy across frequencies. Together, these changes are consistent with the presence and severity of flow-limiting segments, where the degree of narrowing modulates the strength and spectral extent of turbulent eddies [40,55].
  • Fractal Dimension and Complexity (Non-linear System Behavior): The high-ranking fractal dimension quantifies the non-linear complexity of the signal. Increased airway resistance and turbulence are hallmarks of a system pushed toward instability. A higher fractal dimension suggests a highly complex, chaotic, and less predictable airflow pattern, aligning with established non-linear control theory, which views the respiratory system as operating close to a chaotic bifurcation point [50,51].

4.3.3. Validation Through Established Anatomical Risk Factors

The extremely high correlations between specific features and anthropometric measures provide vital proof of concept: the acoustic features are not just abstract discriminators but directly encode the physical risk factors [12,56,57]. Peak Intensity is the near-perfect correlation between Neck Circumference (NC), which is physiologically profound. It is a validated proxy for fat deposition and reduced pharyngeal tissue stiffness. This deposition not only narrows the airway but also influences sound wave propagation. The high peak acoustic value in expiration is a measurable outcome of a sound wave propagating through a physically constrained, often partially occluded, and highly compressible tissue structure. At the same time, Entropy is the strong correlation between (a proxy for systemic obesity and increased soft tissue mass) and links overall body habits to the acoustic randomness and disorganization of the expired airflow pattern.
The combined evidence from correlation, feature stability, and anthropometric validation strongly supports the use of wakefulness features for the objective and clinically meaningful assessment of severity [12,56,57].

4.4. Correlation with Anthropometric Data

The strong correlations observed between several tracheal sound features and anthropometric data further strengthen their clinical utility [12,17]. For example, features such as Mouth Expiration_BBox_650_15_1_0_peakValue, which correlate strongly with Neck Circumference (NC), and Mouth Expiration_BBox_155_2_0_0_entropyValue, which correlate strongly with BMI, are particularly noteworthy. NC and BMI are well-established risk factors and indicators of OSA severity. The direct relationship between these anthropometric measures and specific acoustic features suggests that the structural characteristics of the upper airway, influenced by body posture, are reflected in tracheal breathing sounds [15,58]. This provides a mechanistic link between anatomical predispositions to OSA and the acoustic manifestations captured by our features. Similarly, correlations with the Mallampati Score (MPS), an indicator of the oral cavity and pharyngeal space, further support the idea that the airway’s physical configuration influences sound production during breathing [47]. Although some fold-wise correlations approached unity, these values reflect strong associations observed within specific severity contrasts and folds and do not imply model overfitting, as correlation analysis was conducted independently of the predictive learning pipeline.
These findings suggest that wakefulness tracheal breathing sounds contain rich information reflecting the physiological state of the upper airway. The ability of these features to discriminate OSA severity during wakefulness, their stability across different models and folds, and their strong correlations with established anthropometric risk factors are particularly significant. This offers a non-invasive, convenient, and potentially cost-effective method for screening and monitoring [9,44,59]. The interpretability of these features allows for a deeper understanding of the underlying mechanisms of OSA, moving beyond black-box model predictions [9,44,59]. This interpretability is crucial for clinical acceptance and for guiding future research into targeted interventions.

4.5. Alignment with Prior Wakefulness-Based OSA Studies

The present findings align strongly with and extend our group’s previous works [12,17]. In the 2018 study [17], the authors demonstrated that spectral, bispectral, and fractal features extracted from tracheal sounds could effectively differentiate OSA from non-OSA subjects, achieving classification accuracies of approximately 70–75% and ROC values of 0.73–0.80. They further observed that mouth-inspiratory features provided the highest discrimination power and that several acoustic descriptors were only weakly influenced by anthropometric variability, such as body mass index (BMI) or neck circumference (NC). In the other 2019 study [12], they reinforced these results using a larger dataset of 199 subjects. It confirmed that combining tracheal sounds with anthropometric features increased diagnostic performance to 81.4% accuracy (sensitivity = 82.1%, specificity = 80.9%).
Consistent with prior findings, our results achieved diagnostic performance comparable to or superior to that of previous models, yielding AUC values ranging from 0.86 to 0.97 across multiple OSA severity levels. The strongest predictive power was again observed for mouth-inspiratory and low-frequency components (150–450 Hz), which exhibited elevated spectral energy and distinct morphological patterns in moderate and severe OSA subjects. Correlation analysis in our dataset similarly showed significant relationships between acoustic features and anthropometric markers, including BMI (r = 0.52–0.75) and NC (r = 0.47–0.72), corroborating earlier physiological interpretations of airway constriction and turbulence during inspiration.
Beyond confirming the earlier results, the present study extends the prior research in three significant ways:
  • Instead of a binary OSA vs. non-OSA classification, our framework performs multi-level severity stratification (non-OSA, mild, moderate, and severe), offering finer clinical granularity.
  • We introduce novel morphological and time–frequency gap descriptors, extracted from harmonic–percussive (HP) decompositions and spectrogram bounding boxes, which capture airway-specific acoustic signatures not examined in previous work.
  • Our use of ensemble-based models with SHAP explainability provides transparent quantification of feature contributions and robustness validation (Abs ΔAUC < 0.04 across folds), establishing reproducibility across subjects and folds.
Together, these advances confirm and expand on the foundational evidence presented previously by our group [12,17], demonstrating that wakefulness-based tracheal breathing sounds, combined with simple anthropometric measures, constitute a physiologically meaningful, non-invasive, and reproducible tool for OSA detection and severity classification.

4.6. Comparison with Other Awake Screening Modalities

To contextualize the proposed methodology, it is necessary to benchmark it against the full spectrum of screening modalities whilst awake. While questionnaires (e.g., STOP-Bang) are ubiquitous due to their zero-cost administration [60], they are hindered by low specificity (often <40%), leading to high false-positive rates [60]. Facial image analysis offers a non-contact alternative, but current methods often plateau at approximately 70% accuracy or require large, balanced datasets to avoid demographic bias [61]. Functional methods such as Negative Expiratory Pressure (NEP) [62] and Acoustic Pharyngometry [63] offer high accuracy by directly measuring airway collapsibility and geometry. However, these techniques often require specialized equipment and strictly controlled protocols, reducing their utility for rapid, large-scale screening compared to microphone-based approaches [12]. Speech analysis, while similar in modality to tracheal breathing sounds analysis [64], often relies on complex phonetic tasks and has shown lower accuracy and variable specificity depending on the features used.
Table 7 presents a comprehensive quantitative comparison between the proposed framework and representative studies across five distinct wakefulness-based modalities. To ensure a fair comparison, we focus on dataset size, task formulation, and performance metrics (Sensitivity/Specificity). This comparison highlights that while functional tests (NEP/Pharyngometry) offer high precision, they lack the portability of acoustic methods. Conversely, while questionnaires are portable, they lack the diagnostic accuracy of the proposed method.
Table 7. Comparison of the proposed framework with representative wakefulness-based OSA screening modalities. Acc: accuracy, Sens: sensitivity, Spec: specificity.

4.7. Limitations and Future Work

Although the proposed framework achieved high interpretability, several technical limitations merit consideration. The analysis was performed on data collected from a single clinical site [8,11,12], which may limit the generalizability of the results to broader populations with differing acoustic environments, recording hardware, and demographic characteristics. Secondly, the sound recordings were conducted by a high-end Sony microphone. The selected features may change (although not significantly) depending on the sensor used. Future studies should incorporate multi-center datasets and cross-device validation to ensure robustness under real-world variability. Additionally, while the stratified k-fold design effectively balanced anthropometric covariates, the current results were based on a finite number of wakefulness recordings per subject, restricting the temporal representation of respiratory dynamics. Expanding the framework to include multi-cycle, sleep-stage-specific, or longitudinal tracheal sound data would enable modeling of disease progression and treatment response. Despite these constraints, the consistent feature stability and explainable ensemble structure provide a strong foundation for advancing automated, non-invasive OSA assessment.
In addition to these technical considerations, the study cohort consisted of individuals referred to overnight PSG and therefore represents a clinically enriched population with a higher pre-test probability of OSA than general or primary-care populations. This enrichment may lead to optimistic estimates of discrimination metrics, such as AUC, compared with deployment in lower-risk settings, and may particularly affect positive predictive value when disease prevalence is lower. However, the primary objective of this work was not to estimate population-level screening accuracy but to identify robust, physiologically interpretable acoustic biomarkers of OSA severity under controlled clinical conditions. These biomarkers reflect underlying airway dynamics and anatomical vulnerability, which are expected to generalize beyond referral-based cohorts.
In practical deployment, the proposed framework is best positioned as a first-line risk stratification or prioritization tool rather than a standalone diagnostic test. In primary-care or community settings, it could be used to identify individuals who would benefit most from expedited PSG, thereby improving resource allocation and reducing diagnostic delays. Future studies will focus on validating the framework in lower-prevalence populations, including primary-care and community-based cohorts, and on recalibrating decision thresholds to account for differences in disease prevalence and pre-test probability. Despite these constraints, the consistent feature stability and explainable ensemble structure provide a strong foundation for advancing automated, non-invasive OSA assessment.

5. Conclusions

In conclusion, this research successfully established an interpretable, machine-learning-driven framework that uses wakefulness tracheal breathing sounds as objective, severity-stratifying biomarkers for Obstructive Sleep Apnea. By identifying and validating stable acoustic features, texture energy, spectral bandwidth, and fractal dimension that exhibit strong mechanistic correlations with established anatomical risk factors, we have acoustically encoded the underlying physiological vulnerability of the upper airway. This interpretability moves beyond black-box diagnostics, offering clinicians clear, physiological correlates for disease progression. These findings represent a decisive step toward developing a non-invasive, cost-effective, and highly accessible screening tool, which is critically needed for timely diagnosis, perioperative risk stratification, and scalable long-term management in clinical settings globally.

Author Contributions

Conceptualization, A.M.A., W.A., B.L. and Z.M.; methodology, A.M.A., W.A., B.L. and Z.M.; software, A.M.A.; validation, A.M.A., W.A., B.L. and Z.M.; formal analysis, A.M.A., W.A., B.L. and Z.M.; investigation, A.M.A., W.A., B.L. and Z.M.; data curation, A.M.A., W.A., B.L. and Z.M.; writing original draft preparation, A.M.A., W.A., B.L. and Z.M.; writing review and editing, A.M.A., W.A., B.L. and Z.M.; visualization, A.M.A., W.A., B.L. and Z.M.; supervision, Z.M.; project administration, Z.M.; funding acquisition, Z.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Sciences and Engineering Research Council of Canada (NSERC).

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board (or Ethics Committee) of University of Manitoba’s Biomedical Research Ethics Board (protocol code HS15165(B2006:185) and date of approval 21 December 2015).

Data Availability Statement

The data presented in this study are available on request from the PI of the study (last author) due to ethical and consent-related restrictions, as the datasets contain human audio recordings and can only be shared under controlled access following approval of a signed data-use consent agreement.

Acknowledgments

We acknowledge the support of the NSERC (Natural Sciences and Engineering Research Council of Canada). During the preparation of this manuscript/study, the author(s) used Grammarly (https://www.grammarly.com/) for the purposes of improving grammar, spelling, punctuation, and overall clarity of the text.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AHIApnea–Hypopnea Index
AUCArea Under the Receiver Operating Characteristic Curve
AbsDeltaAUCAbsolute Change in AUC
BBoxBounding Box
BMIBody Mass Index
CVCross-Validation
CQTConstant-Q Transform
FMFrequency Mean
FM2MMean Frequency Ratio
HPHarmonic–Percussive
IQRInterquartile Range
k-Fold CVK-Fold Cross-Validation
MFCCMel-Frequency Cepstral Coefficients
MLMachine Learning
NCNeck Circumference
NEPNegative Expiratory Pressure
OSAObstructive Sleep Apnea
RFERecursive Feature Elimination
ROCReceiver Operating Characteristic
RQARecurrence Quantification Analysis
RMSRoot Mean Square
SDStandard Deviation
SHAPShapley Additive exPlanations
SNRSignal-to-Noise Ratio
TBSTracheal Breathing Sounds
ΔAUCChange in AUC

Appendix A. Top Selected Features for Each Model

Table A1. Summary of Top Selected Features for Each Model.
Table A1. Summary of Top Selected Features for Each Model.
Feature NumberNon-OSA vs. Mild-OSANon-OSA vs. Moderate-OSANon-OSA vs. Severe-OSAMild-OSA vs. Moderate-OSAMild-OSA vs. Severe-OSAModerate-OSA vs. Severe-OSA
1MouthExpiration_Range_SpectralEntropyAverage_Range_SpectralSkewnessNoseInspiration_Range_SpectralKurtosisNoseExpiration_Range_MeanPowerMouthExpiration_Range_SCBW_BandwidthNoseExpiration_Range_SpectralCrest
2MouthExpiration_Range_SpectralCrestNoseInspiration_Range_SpectralSkewnessNoseInspiration_Range_FreqSkewnessMouthExpiration_IQRAverage_Average_BBoxes_EnFBiDAverage_BBox_NumHoles
3MouthExpiration_Range_SpectralCrestAverage_BBox_EntropyNoseExpiration_Range_SCBW_BandwidthMouthInspiration_MFCCMeanNoseExpiration_BBox_RangeMouthInspiration_WaveletApproxEntropy
4NoseInspiration_Range_SCBW_BandwidthMouthInspiration_RangeAverage_BBox_TextureEnergyNoseExpiration_WaveletApproxSkewnessAverage_Range_SpectralCrestNoseExpiration_PeakCount
5MouthInspiration_MFCCMeanMouthExpiration_WaveletApproxSpectralBandwidthAverage_BBox_CentroidYMouthInspiration_MFCCMeanNoseInspiration_BBox_MedianMouthInspiration_MFCCMean
6MouthInspiration_MFCCMedianMouthExpiration_WaveletDetailEntropyNoseInspiration_BBox_AspectRatioMouthInspiration_MFCCStdNoseInspiration_BBox_PeakValueMouthExpiration_Range_SCBW_Bandwidth
7NoseExpiration_WaveletApproxMeanMouthInspiration_IQRMouthInspiration_BBox_EulerNumberMouthInspiration_SpectralCentroidNoseInspiration_BBox_FreqCentroidMouthInspiration_Range_RMS
8NoseExpiration_Range_BandPowerNoseInspiration_CoefVariationMouthInspiration_MeanValueMouthInspiration_SpectralBandwidthNoseInspiration_BBox_PeakValueNoseInspiration_Range_MeanPower
9NoseExpiration_Range_SpectralSkewnessNoseExpiration_BBox_EntropyMouthInspiration_StdValueMouthInspiration_MFCCKurtosisNoseInspiration_BBox_FreqCentroidNoseInspiration_Range_SCBW_Bandwidth
10MouthInspiration_BBox_NumHolesAverage_DIR_HistogramMouthExpiration_TotalEnergyMouthExpiration_WaveletSpectralCentroidNoseInspiration_BBox_SpectralFluxNoseInspiration_Range_SpectralEnergy
11MouthExpiration_BBox_FreqCentroidAverage_AVP_HistogramMouthExpiration_NormalizedEnergyMouthExpiration_CQTBandwidthDynamicRangeNoseInspiration_BBox_ConnectedComponentsNoseInspiration_Range_BandPower
12MouthExpiration_BBox_KurtosisMouthInspiration_ZeroCrossingMouthExpiration_StdAbsBiMouthExpiration_MFCCStdNoseInspiration_BBox_EulerNumberMouthInspiration_BBox_RegionArea
13MouthExpiration_BBox_AreaMouthInspiration_WaveletDetailSpectralBandwidthMouthExpiration_SymStdMouthExpiration_SpectralCentroidNoseInspiration_BBox_IQRNoseInspiration_BBox_IQR
14MouthExpiration_BBox_DiagonalMouthInspiration_WaveletDetailSpectralBandwidthMouthExpiration_PerimeterMouthExpiration_MFCCSkewnessNoseInspiration_BBox_FreqCentroidNoseInspiration_BBox_TextureHomogeneity
15MouthExpiration_BBox_NumHolesMouthInspiration_MFCCMeanMouthExpiration_CentroidYMouthExpiration_MFCCKurtosisNoseInspiration_BBox_BBoxAreaNoseInspiration_BBox_NumHoles
16MouthExpiration_BBox_AspectRatioMouthInspiration_MFCCKurtosisMouthExpiration_PerimeterMouthExpiration_MFCCKurtosisNoseInspiration_BBox_FreqCentroidNoseExpiration_BBox_TextureContrast
17MouthExpiration_BBox_TextureContrastMouthInspiration_PBP_SkewnessMouthExpiration_CentroidXNoseInspiration_ZeroCrossingNoseInspiration_BBox_TextureEnergyNoseExpiration_BBox_AspectRatio
18NoseInspiration_BBox_DiagonalMouthInspiration_TP_HistogramNoseInspiration_Average_BBox_EnFBiDNoseInspiration_WaveletApproxSkewnessNoseInspiration_BBox_EulerNumberNoseExpiration_BBox_Perimeter
19NoseInspiration_BBox_TextureEnergyMouthInspiration_TP_MaxProbNoseInspiration_Average_BBox_MeanBiDFNoseInspiration_WaveletApproxSpectralCentroidNoseInspiration_BBox_StdValueAverage_WaveletDetail_MaxToMinRatio
20Average_WaveletApproxMaxToMinRatioMouthInspiration_EP_MaxEnergyNoseInspiration_Average_BBox_WCOBDFxNoseInspiration_WaveletDetailKurtosisNoseInspiration_BBox_EntropyAverage_WaveletDetail_Kurtosis
21NoseInspiration_HurstExponentMouthExpiration_WaveletApproxKurtosisNoseInspiration_Average_BBox_WCOBDFyNoseInspiration_CQTStdPowerNoseInspiration_BBox_EntropyMouthInspiration_LyapunovExponentMean
22NoseExpiration_KatzFDMouthExpiration_WaveletApproxSkewnessNoseInspiration_Average_BBox_Hf1NoseInspiration_CQTSkewnessPowerNoseInspiration_BBox_TextureContrastMouthInspiration_BandPowerHigh
23MouthInspiration_Range_PeakFrequencyMouthExpiration_PBP_KurtosisNoseInspiration_Average_BBox_Hf2NoseInspiration_CQTTemporalCentroidNoseInspiration_BBox_TextureEnergyNoseInspiration_WaveletApproxEntropy
24MouthExpiration_BBox_CentroidYMouthExpiration_PBP_EntropyNoseInspiration_Average_BBox_ReservedNoseInspiration_CQTSpectralCentroidNoseInspiration_BBox_StdValueNoseInspiration_WaveletApproxEntropy
25MouthExpiration_BBox_PeakValueMouthInspiration_Range_FM2MFreqNoseInspiration_Average_BBox_EnFBiDNoseInspiration_CQTBandwidthDynamicRangeNoseExpiration_BBox_ConnectedComponentsNoseInspiration_WaveletDetailSpectralCentroid
26MouthExpiration_BBox_TextureCorrelationMouthInspiration_Range_MeanPowerNoseInspiration_Average_BBox_MeanBiDFNoseInspiration_MFCCSkewnessNoseExpiration_BBox_MedianNoseExpiration_WaveletApproxSkewness
27NoseInspiration_BBox_RangeMouthInspiration_Range_RMSNoseInspiration_Average_BBox_WCOBDFxNoseInspiration_MFCCKurtosisNoseExpiration_BBox_IQRAverage_AVP_Mean
28NoseInspiration_BBox_StdMouthInspiration_Range_FM2MFreqNoseInspiration_Average_BBox_WCOBDFyNoseInspiration_SpectralBandwidthNoseExpiration_BBox_TextureCorrelationMouthInspiration_KatzFD
29NoseExpiration_BBox_FractalDimensionMouthInspiration_Range_BandPowerNoseInspiration_Average_BBox_Hf1NoseInspiration_MFCCKurtosisNoseExpiration_BBox_TextureEnergyMouthInspiration_LyapunovExponentMax
30NoseExpiration_BBox_StdMouthInspiration_Range_StdNoseInspiration_Average_BBox_Hf2NoseInspiration_TP_SkewnessNoseExpiration_BBox_FractalDimensionMouthExpiration_MFCCMedian
31MouthExpiration_MFCCMedianMouthInspiration_Range_FM2MFreqNoseInspiration_Average_BBox_ReservedNoseExpiration_WaveletDetailSkewnessNoseExpiration_BBox_ConnectedComponentsNoseInspiration_WaveletDetailEntropy
32MouthExpiration_PBP_KurtosisMouthInspiration_Range_FM2MFreqNoseInspiration_Average_BBox_BisEntropyNoseExpiration_CQTMeanPowerNoseExpiration_BBox_CentroidYNoseInspiration_CQTGaborEnergyMean
33MouthExpiration_TP_HistogramMouthInspiration_Range_FM2MFreqNoseInspiration_BBox_SpectralFluxNoseExpiration_CQTSkewnessPowerNoseExpiration_BBox_CompactnessNoseInspiration_MFCCMedian
34MouthExpiration_TP_MaxProbMouthExpiration_Range_StdNoseInspiration_BBox_ConnectedComponentsNoseExpiration_CQTSpectralDynamicsStdNoseExpiration_BBox_EnergyAverage_Range_Maximum
35MouthExpiration_TP_RatioMouthExpiration_Range_StdNoseInspiration_BBox_SkewnessMouthInspiration_ZeroCrossingNoseExpiration_BBox_FreqCentroidAverage_Range_SCBW_Bandwidth

Appendix B. Analysis of Clinical Interpretation of Top Features for All Models

Appendix B.1. Non-OSA vs. Mild-OSA

  • MouthInspiration_Range_FreqSkewness: Measures asymmetry of the power spectrum during mouth inspiration within the relevant frequency range. Positive values indicate dominance of lower frequencies, negative values indicate dominance of higher frequencies. Reflects shifts in frequency energy distribution due to airflow changes, turbulence, or airway geometry.
  • Average_BBox_TextureEnergy: Indicates uniformity of bispectral texture. Higher energy suggests consistent patterns, while lower values indicate heterogeneous local coupling. In OSA, irregular airflow can produce less uniform bispectral coupling patterns, consistent with more complex or intermittent nonlinear flow-tissue interactions.
  • Average_BBox_FrequencyCentroidX: Centroid of bispectral energy along the f1-axis. Shifts in this centroid indicate changes in dominant coupling frequency, reflecting airflow-tissue interactions or turbulence associated with airway narrowing.
  • Average_BBox_TextureCorrelation: Measures gray-level co-occurrence texture correlation. Higher values indicate structured bispectral patterns, suggesting stable nonlinear coupling. Reduced correlation reflects heterogeneous coupling, consistent with variable airflow instability.
  • Average_BBox_Perimeter: Perimeter of high-intensity regions within the bispectral bounding box. Larger perimeter suggests fragmented coupling, possibly indicating variable airflow instability.
  • Average_BBox_TextureHomogeneity: Measures uniformity of gray-level values. Higher values indicate smoother patterns; lower values indicate irregular textures, reflecting turbulent airflow.
  • Average_BBox_IQRValue: Range between the first and third quartiles of bispectral intensity. Larger IQR indicates greater variability in the central portion of the sound signal, potentially associated with airflow turbulence.
  • Average_BBox_Compactness: How compact high-intensity regions are. Less compact patterns suggest more diffuse sound energy, consistent with irregular airflow.
  • MouthInspiration_Range_SpectralSkewness: Asymmetry of the power spectrum during mouth inspiration within a relevant frequency range. Changes may indicate airflow limitation or obstruction.

Appendix B.2. Non-OSA vs. Moderate-OSA

  • Average_Range_Maximum: Highest amplitude within the relevant frequency range, representing peak breathing sound energy. Higher values may indicate more turbulent airflow due to airway narrowing.
  • Average_BBox_BoundingBoxDiagonal: Length of the bounding box diagonal for a bispectral patch. Larger diagonal implies broader distribution of nonlinear interactions, consistent with more complex flow disturbances in moderate OSA.
  • Average_Range_MeanPower: Average spectral energy across the relevant frequency range. Reflects overall intensity of breathing sounds in that band.
  • Average_BBox_ConnectedComponents: Counts distinct high-intensity regions in the bispectral patch. More regions suggest fragmented coupling, potentially due to multiple flow structures or vibration sites.
  • Average_BBox_TextureContrast: Local contrast differences in bispectral patches. Higher contrast indicates stronger differences between regions of coupling, compatible with intermittent obstructions.
  • Average_BBox_FrequencyCentroidY: Centroid of bispectral energy along the f2-axis. Shifts indicate changes in dominant coupling frequencies, reflecting altered airflow-tissue interactions.
  • Average_Range_BandPower: Total spectral power within a relevant mid-frequency range. Higher power may indicate stronger turbulent airflow.
  • Average_Range_MeanPower: Average spectral energy, indicative of airflow consistency.
  • Average_Range_RMS: Standard amplitude measure within the frequency range. Higher RMS indicates more variable or intense breathing sounds, associated with stronger flow disturbances.

Appendix B.3. Non-OSA vs. Severe-OSA

  • Average_BBox_MeanValue: Average amplitude or intensity of the sound signal. In OSA, increased respiratory effort produces louder, higher-amplitude sounds.
  • Average_BBox_TextureEnergy: Uniformity of bispectral texture. Lower energy indicates more heterogeneous local coupling, reflecting complex airflow-tissue interactions.
  • MouthInspiration_Range_FreqSkewness: Asymmetry of the power spectrum during mouth inspiration within the relevant frequency range. Positive values indicate low-frequency dominance, negative values high-frequency dominance.
  • Average_BBox_FractalDimension: Complexity or irregularity of local patterns. Higher values indicate more intricate structures, compatible with chaotic airflow in severe OSA.
  • Average_BBox_MedianValue: Median local intensity; less sensitive to outliers than the mean. Changes reflect shifts in breathing sound intensity.
  • Average_BBox_EnergyValue: Total energy within the bounding box. Higher energy indicates stronger, more intense breathing sounds, often associated with airway narrowing.
  • Average_BBox_EulerNumber: Topological feature reflecting the number of connected components minus holes. Indicates complexity of sound events, altered by intermittent airflow or airway vibrations.
  • Average_BBox_AspectRatio: Width-to-height ratio of detected sound events; changes may reflect altered spectral characteristics due to airway dynamics.
  • Average_BBox_ConnectedComponents: Counts distinct high-intensity regions; more regions suggest fragmented coupling due to multiple airflow structures.
  • Average_BBox_KurtosisValue: Measures tailedness of the sound signal distribution. Higher kurtosis indicates sudden, sharp sound events associated with airway collapse or reopening.

Appendix B.4. Mild-OSA vs. Moderate-OSA

  • MouthExpiration_BBox_AspectRatio: Width-to-height ratio during mouth expiration; indicates alterations in spectral characteristics due to airway dynamics.
  • Average_BBox_MedianValue: Median local intensity. Changes reflect general shifts in breathing sound intensity.
  • Average_BBox_TextureEnergy: Uniformity of sound texture; lower values suggest heterogeneous patterns due to turbulent airflow.
  • MouthExpiration_Range_FM2MFreq: Ratio of frequency-modulated to mean spectral energy in a relevant range; reflects redistribution of low-frequency energy, indicating airflow variations.
  • Average_BBox_FrequencyBandwidthY: Effective bandwidth along the frequency axis; broader bandwidth suggests wider participation of frequencies, consistent with turbulent airflow.
  • Average_BBox_Perimeter: Perimeter of high-intensity regions; larger perimeter indicates more fragmented structure, consistent with variable airflow instability.
  • Average_BBox_FrequencyCentroidY: Centroid of bispectral energy along f2-axis; shifts indicate changes in dominant frequency regions.
  • MouthExpiration_BBox_TextureEnergy: Uniformity of sound pattern; lower values indicate less consistent sound patterns, reflecting turbulence or intermittent obstruction.

Appendix B.5. Mild-OSA vs. Severe-OSA

  • MouthInspiration_BBox_FrequencyCentroidX: Centroid along f1-axis; shifts indicate changes in dominant frequency region, reflecting airflow dynamics.
  • Average_Average_BBoxes_Entropy: Overall entropy across bounding boxes; higher values indicate irregular airflow or turbulent sounds in severe OSA.
  • Average_Range_RMS: Standard amplitude within the relevant frequency range; higher RMS indicates variable or stronger breathing sound intensity.
  • Average_BBox_EnergyValue: Total energy in bounding box; higher energy reflects increased respiratory effort or airway narrowing.
  • MouthInspiration_Range_StandardDeviation: Variability of sound amplitude; higher SD indicates greater fluctuation in airflow or turbulence.
  • Average_Range_FM2MFreq: Low-frequency energy ratio; reflects subtle airflow variations or airway patency changes.
  • MouthInspiration_BBox_TextureContrast: Local contrast; higher values indicate strong differences between adjacent sound regions.
  • MouthExpiration_BBox_FrequencyBandwidthY: Bandwidth along frequency axis; broader bandwidth indicates more turbulent airflow.
  • Average_Range_StandardDeviation: Variability of amplitude; higher SD reflects irregular airflow.
  • MouthExpiration_Range_SpectralEnergy: Total spectral energy within relevant range; higher energy corresponds to stronger, more turbulent airflow.

Appendix B.6. Moderate-OSA vs. Severe-OSA

  • Average_BBox_MedianValue: Median intensity; indicates overall breathing sound level.
  • Average_BBox_IQRValue: Range between first and third quartiles; larger IQR suggests greater variability, linked to turbulence.
  • Average_BBox_EnergyValue: Total energy; higher values reflect stronger, more intense breathing sounds due to airflow restriction.
  • Average_BBox_RangeValue: Difference between maximum and minimum values; larger range indicates pronounced variations in sound intensity.
  • Average_BBox_KurtosisValue: Measures tailedness; higher kurtosis indicates sudden peaks in sound, linked to airway collapse or reopening.
  • Average_BBox_Compactness: How compact high-intensity regions are; less compact indicates more diffuse sound energy due to turbulent airflow.
  • Average_BBox_StdValue: Dispersion of sound values; higher SD indicates greater variability, reflecting irregular airflow.
  • Average_BBox_IQRValue: Central variability of sound; larger values indicate more turbulence.
  • Average_BBox_EulerNumber: Complexity of sound events; reflects structure or continuity of airflow events.
  • Average_BBox_EntropyValue: Randomness or unpredictability of sound; higher entropy indicates more chaotic or turbulent airflow.

Appendix C. Analysis for Non-OSA vs. Mild-OSA

Table A2. Top 10 ranked features for Non-OSA vs. Mild-OSA classification based on AUC Test and correlation performance.
Table A2. Top 10 ranked features for Non-OSA vs. Mild-OSA classification based on AUC Test and correlation performance.
Feature NameAUC TestCorr Test
MouthInspiration_Range_FreqSkewness0.7870370.498762
Average_BBox_TextureEnergy0.6666670.301309
Average_BBox_FrequencyCentroidX0.6574070.145422
Average_BBox_TextureCorrelation0.6574070.145286
Average_BBox_Perimeter0.6574070.0772519
Average_BBox_TextureEnergy0.6444440.0914112
Average_BBox_TextureHomogeneity0.6388890.0837961
Average_BBox_IQRValue0.629630.211449
Average_BBox_Compactness0.629630.211449
MouthInspiration_Range_SpectralSkewness0.629630.237871
Table A3. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Non-OSA vs. Mild-OSA classification.
Table A3. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Non-OSA vs. Mild-OSA classification.
Feature NameAbs Delta AUC
MouthInspiration_Range_SpectralEntropy0.0102916
Average_BBox_TextureEnergy0.0241135
Average_BBox_FrequencyCentroidY0.0441738
Average_BBox_SpectralFlux0.0498553
Average_BBox_EntropyValue0.0613248
Average_BBox_BoundingBoxDiagonal0.0613248
Average_BBox_Compactness0.0627066
Average_BBox_SpectralFlux0.0728632
Average_BBox_FrequencyCentroidX0.0728632
Average_BBox_FractalDimension0.0860684
Table A4. Top 10 correlations between selected features and anthropometric characteristics in Non-OSA vs. Mild-OSA participants.
Table A4. Top 10 correlations between selected features and anthropometric characteristics in Non-OSA vs. Mild-OSA participants.
Feature NameAnthropometric FeaturePearson Correlation
Average_BBox_MeanValueSex0.998692
Average_BBox_NumHolesMPS0.997553
Average_BBox_FrequencyCentroidXMPS0.997288
Average_BBox_StdValueSex0.997255
Average_BBox_CoefVariationSex0.997255
Average_BBox_PerimeterSex0.99691
Average_BBox_MeanValueMPS0.994605
Average_BBox_CompactnessNC0.994605
Average_BBox_TextureContrastNC0.993768
Average_BBox_NumHolesNC0.993507
Table A5. Top 10 features showing the highest Pearson correlations with AHI in Non-OSA vs. Mild-OSA comparison.
Table A5. Top 10 features showing the highest Pearson correlations with AHI in Non-OSA vs. Mild-OSA comparison.
Feature NamePearson Correlation
Average_BBox_MeanValue0.994605
Average_BBox_Compactness0.994605
Average_BBox_AspectRatio0.984433
Average_BBox_Compactness0.984433
Average_BBox_MedianValue0.978554
Average_BBox_FrequencyCentroidY0.973494
Average_BBox_FrequencyBandwidthY0.969688
MouthExpiration_Range_SCBW_Bandwidth0.969226
Average_BBox_TextureContrast0.960109
Average_BBox_TextureEnergy0.951338

Appendix D. Analysis for Non-OSA vs. Moderate-OSA

Table A6. Top 10 ranked features for Non-OSA vs. Moderate-OSA classification based on AUC Test and correlation performance.
Table A6. Top 10 ranked features for Non-OSA vs. Moderate-OSA classification based on AUC Test and correlation performance.
Feature NameAUC TestCorr Test
MouthInspiration_Range_FreqSkewness0.7870370.498762
Average_Range_Maximum0.8497940.458931
Average_BBox_BoundingBoxDiagonal0.818930.537497
Average_Range_MeanPower0.8168720.4697
Average_BBox_ConnectedComponents0.7921810.400149
Average_BBox_TextureContrast0.7880660.421496
Average_BBox_BoundingBoxDiagonal0.7818930.479976
Average_BBox_FrequencyCentroidY0.767490.336108
Average_Range_BandPower0.7386830.437269
MouthInspiration_Range_FreqSkewness0.7386830.436397
Table A7. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Non-OSA vs. Moderate-OSA classification.
Table A7. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Non-OSA vs. Moderate-OSA classification.
Feature NameAbsDeltaAUC
Average_BBox_FrequencyBandwidthX0.00244068
Average_BBox_RangeValue0.00810185
Average_BBox_AspectRatio0.015558
Average_BBox_EnergyValue0.0266176
Average_BBox_MeanValue0.0277022
Average_BBox_EulerNumber0.0319368
Average_BBox_KurtosisValue0.0366769
Average_BBox_RangeValue0.0470624
Average_BBox_BoundingBoxArea0.0470624
Average_BBox_FrequencyBandwidthX0.047544
Table A8. Top 10 correlations between selected features and anthropometric characteristics in Non-OSA vs. Moderate-OSA participants.
Table A8. Top 10 correlations between selected features and anthropometric characteristics in Non-OSA vs. Moderate-OSA participants.
Feature NameAnthropometric FeaturePearson Correlation
Average_BBox_TextureHomogeneitySex0.999783
Average_BBox_RangeValueBMI0.997506
NoseExpiration_Range_MeanPowerSmoke History0.996784
Average_BBox_FrequencyCentroidYNC0.995961
Average_BBox_BoundingBoxDiagonalSmoke History0.990975
Average_BBox_TextureHomogeneityMPS0.989529
Average_BBox_IQRValueNC0.987245
Average_BBox_TextureHomogeneityAge0.982519
Average_BBox_MeanValueSex0.982193
Average_BBox_ConnectedComponentsSmoke History0.980487
Table A9. Top 10 features showing the highest Pearson correlations with AHI in Non-OSA vs. Moderate-OSA comparison.
Table A9. Top 10 features showing the highest Pearson correlations with AHI in Non-OSA vs. Moderate-OSA comparison.
Feature NamePearson Correlation
Average_BBox_RangeValue0.962649
Average_BBox_TextureCorrelation0.96016
Average_BBox_Perimeter0.930377
Average_BBox_MeanValue0.917263
Average_BBox_TextureEnergy0.911276
Average_BBox_FrequencyCentroidX0.89864
Average_BBox_SkewnessValue0.882517
Average_BBox_IQRValue0.877315
Average_BBox_StdValue0.844572
Average_BBox_FractalDimension0.735205

Appendix E. Analysis for Non-OSA vs. Severe-OSA

Table A10. Top 10 ranked features for Non-OSA vs. Severe-OSA classification based on AUC Test and correlation performance.
Table A10. Top 10 ranked features for Non-OSA vs. Severe-OSA classification based on AUC Test and correlation performance.
Feature NameAUC TestCorr Test
Average_BBox_MeanValue0.8602480.505319
Average_BBox_TextureEnergy0.7754630.444318
MouthInspiration_Range_FreqSkewness0.7750.413793
Average_BBox_FractalDimension0.739130.429386
Average_BBox_MedianValue0.7333330.242264
Average_BBox_EnergyValue0.7291670.216856
Average_BBox_EulerNumber0.7166670.246949
Average_BBox_AspectRatio0.7083330.200236
Average_BBox_ConnectedComponents0.7083330.215239
Average_BBox_KurtosisValue0.7083330.267374
Table A11. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Non-OSA vs. Severe-OSA classification.
Table A11. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Non-OSA vs. Severe-OSA classification.
Feature NameAbsDeltaAUC
Average_BBox_SkewnessValue0.00123128
Average_BBox_FrequencyBandwidthX0.0025107
Average_BBox_FrequencyBandwidthX0.00366667
MouthInspiration_Range_FreqSkewness0.00366667
Average_BBox_Compactness0.00418763
Average_BBox_FrequencyBandwidthX0.004408
Average_BBox_KurtosisValue0.00454831
Average_BBox_SkewnessValue0.00578492
Average_BBox_EulerNumber0.00608255
Average_BBox_ConnectedComponents0.00933082
Table A12. Top 10 correlations between selected features and anthropometric characteristics in Non-OSA vs. Severe-OSA participants.
Table A12. Top 10 correlations between selected features and anthropometric characteristics in Non-OSA vs. Severe-OSA participants.
Feature NameAnthropometric FeaturePearson Correlation
Average_BBox_EnergyValueNC0.998475
Average_BBox_AspectRatioMPS0.99834
Average_BBox_StdValueAge0.995421
Average_BBox_EulerNumberSex0.995285
Average_BBox_PerimeterMPS0.99373
Average_BBox_IQRValueSex0.991446
Average_BBox_NumHolesSmoke History0.989047
Average_BBox_ConnectedComponentsAge0.988745
Average_BBox_EnergyValueAge0.988332
Average_BBox_MeanValueAge0.987492
Table A13. Top 10 features showing the highest Pearson correlations with AHI in Non-OSA vs. Severe-OSA comparison.
Table A13. Top 10 features showing the highest Pearson correlations with AHI in Non-OSA vs. Severe-OSA comparison.
Feature NamePearson Correlation
Average_BBox_PeakValue0.895716
Average_BBox_BoundingBoxDiagonal0.881183
Average_BBox_BoundingBoxDiagonal0.85574
Average_BBox_Perimeter0.846486
Average_BBox_KurtosisValue0.835585
Average_BBox_Perimeter0.828251
Average_BBox_MeanValue0.769224
Average_BBox_FrequencyCentroidX0.759227
Average_BBox_EnergyValue0.752154
Average_BBox_SkewnessValue0.734339

Appendix F. Analysis for Mild-OSA vs. Moderate-OSA

Table A14. Top 10 ranked features for Mild-OSA vs. Moderate-OSA classification based on AUC Test and correlation performance.
Table A14. Top 10 ranked features for Mild-OSA vs. Moderate-OSA classification based on AUC Test and correlation performance.
Feature NameAUC TestCorr Test
MouthExpiration_BBox_AspectRatio0.8833330.496958
Average_BBox_MedianValue0.7888890.451368
Average_BBox_TextureEnergy0.7833330.258151
MouthExpiration_Range_FM2MFreq0.68750.187878
Average_BBox_FrequencyBandwidthY0.6718750.251735
Average_BBox_FrequencyBandwidthY0.6555560.0760239
Average_BBox_Perimeter0.6555560.0760239
Average_BBox_FrequencyCentroidY0.6445310.221861
MouthExpiration_BBox_TextureEnergy0.6444440.220908
Average_BBox_TextureEnergy0.6416670.294824
Table A15. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Mild-OSA vs. Moderate-OSA classification.
Table A15. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Mild-OSA vs. Moderate-OSA classification.
Feature NameAbsDeltaAUC
MouthExpiration_BBox_CoefVariation0.00180556
Average_BBox_TextureEnergy0.00291667
Average_BBox_FrequencyCentroidY0.00443837
MouthExpiration_Range_FM2MFreq0.00593891
Average_BBox_FrequencyBandwidthX0.00599845
Average_BBox_CentroidY0.00813298
Average_BBox_FrequencyBandwidthY0.0124323
Average_BBox_CentroidY0.0142974
Average_BBox_SkewnessValue0.0192591
Average_BBox_SkewnessValue0.0218056
Table A16. Top 10 correlations between selected features and anthropometric characteristics in Mild-OSA vs. Moderate-OSA participants.
Table A16. Top 10 correlations between selected features and anthropometric characteristics in Mild-OSA vs. Moderate-OSA participants.
Feature NameAnthropometric FeaturePearson Correlation
Average_BBox_IQRValueSex0.998547
MouthExpiration_FrequencyCentroidXMPS0.998212
Average_BBox_CentroidYMPS0.99816
MouthExpiration_StdValueNC0.997504
Average_BBox_MeanValueMPS0.997397
MouthExpiration_FrequencyCentroidXNC0.996663
Average_BBox_FrequencyCentroidYSex0.996409
Average_BBox_PerimeterAge0.996071
Average_BBox_FrequencyCentroidXSex0.995729
Average_BBox_PerimeterMPS0.994929
Table A17. Top 10 features showing the highest Pearson correlations with AHI in Mild-OSA vs. Moderate-OSA comparison.
Table A17. Top 10 features showing the highest Pearson correlations with AHI in Mild-OSA vs. Moderate-OSA comparison.
Feature NamePearson Correlation
Average_BBox_TextureHomogeneity0.990363
Average_BBox_BoundingBoxDiagonal0.984096
Average_BBox_EntropyValue0.9703
Average_BBox_EntropyValue0.964663
Average_BBox_FrequencyBandwidthX0.962357
Average_BBox_CentroidY0.962357
Average_BBox_FrequencyCentroidX0.956978
Average_BBox_MedianValue0.953038
Average_BBox_NumHoles0.952526
Average_BBox_TextureEnergy0.951101

Appendix G. Analysis for Mild-OSA vs. Severe-OSA

Table A18. Top 10 ranked features for Mild-OSA vs. Severe-OSA classification based on AUC Test and correlation performance.
Table A18. Top 10 ranked features for Mild-OSA vs. Severe-OSA classification based on AUC Test and correlation performance.
Feature NameAUC TestCorr Test
MouthInspiration_FrequencyCentroidX0.8444440.157994
Average_BBox_EnTBiD0.7777780.483544
Average_Range_RMS0.7444440.67878
Average_BBox_EnergyValue0.7333330.513402
MouthInspiration_Range_StdDev0.7222220.776042
Average_Range_FM2MFreq0.7142860.572164
MouthInspiration_TextureContrast0.7111110.380311
MouthExpiration_FrequencyBandwidthY0.693750.214055
Average_Range_StdDev0.6888890.621574
MouthExpiration_Range_SpectralEnergy0.6888890.516899
Table A19. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Mild-OSA vs. Severe-OSA classification.
Table A19. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Mild-OSA vs. Severe-OSA classification.
Feature NameAbsDeltaAUC
MouthExpiration_Perimeter0.00291667
MouthExpiration_FrequencyBandwidthX0.00291667
MouthInspiration_PeakValue0.0041751
Average_BBox_NumHoles0.0042735
MouthExpiration_Perimeter0.00583333
MouthInspiration_ConnectedComponents0.00600058
MouthExpiration_CentroidX0.00666667
MouthExpiration_NumHoles0.0075
MouthExpiration_PeakValue0.0075
MouthInspiration_StdValue0.00816946
Table A20. Top 10 correlations between selected features and anthropometric characteristics in Mild-OSA vs. Severe-OSA participants.
Table A20. Top 10 correlations between selected features and anthropometric characteristics in Mild-OSA vs. Severe-OSA participants.
Feature NameAnthropometric FeaturePearson Correlation
MouthExpiration_PeakValueNC0.99998
MouthExpiration_EntropyValueBMI0.99901
MouthInspiration_FrequencyCentroidXBMI0.998535
MouthInspiration_SpectralFluxBMI0.997205
MouthExpiration_PeakValueNC0.996188
MouthInspiration_CentroidYMPS0.996019
MouthExpiration_Range_RMSNC0.99593
MouthExpiration_Range_MeanPowerSex0.994836
MouthInspiration_EulerNumberSex0.99472
MouthExpiration_Range_BandPowerSex0.994715
Table A21. Top 10 features showing the highest Pearson correlations with AHI in Mild-OSA vs. Severe-OSA comparison.
Table A21. Top 10 features showing the highest Pearson correlations with AHI in Mild-OSA vs. Severe-OSA comparison.
Feature NamePearson Correlation
MouthInspiration_IQRValue0.988287
MouthInspiration_EulerNumber0.986967
MouthInspiration_AspectRatio0.980666
MouthInspiration_CentroidX0.965072
MouthInspiration_FrequencyBandwidthX0.958011
MouthInspiration_RegionArea0.942624
MouthExpiration_StdValue0.902562
MouthInspiration_KurtosisValue0.879297
MouthInspiration_IQRValue0.875097
MouthInspiration_MedianValue0.872257

Appendix H. Analysis for Moderate-OSA vs. Severe-OSA

Table A22. Top 10 ranked features for Moderate-OSA vs. Severe-OSA classification based on AUC Test and correlation performance.
Table A22. Top 10 ranked features for Moderate-OSA vs. Severe-OSA classification based on AUC Test and correlation performance.
Feature NameAUC TestCorr Test
Average_BBox_MedianValue0.718750.239624
Average_BBox_IQRValue0.6909720.354004
Average_BBox_EnergyValue0.6805560.361267
Average_BBox_RangeValue0.6423610.211027
Average_BBox_KurtosisValue0.6371530.397161
Average_BBox_Compactness0.6156250.423041
Average_BBox_StdValue0.6071430.121142
Average_BBox_IQRValue0.6071430.121142
Average_BBox_EulerNumber0.59375−0.0542083
Average_BBox_EntropyValue0.59375−0.0327421
Table A23. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Moderate-OSA vs. Severe-OSA classification.
Table A23. Top 10 most stable features based on absolute change in AUC (AbsDeltaAUC) for Moderate-OSA vs. Severe-OSA classification.
Feature NameAbsDeltaAUC
Average_BBox_MedianValue0.00355392
Average_BBox_RegionArea0.0053935
Average_BBox_FrequencyCentroidX0.0101997
Average_BBox_IQRValue0.0132378
MouthInspiration_Range_SpectralEnergy0.0278361
Average_BBox_KurtosisValue0.0325428
Average_BBox_PeakValue0.0373264
Average_BBox_EnergyValue0.0503472
Average_BBox_CoefVariation0.0576721
Average_BBox_SpectralFlux0.0649846
Table A24. Top 10 correlations between selected features and anthropometric characteristics in Moderate-OSA vs. Severe-OSA participants.
Table A24. Top 10 correlations between selected features and anthropometric characteristics in Moderate-OSA vs. Severe-OSA participants.
Feature NameAnthropometric FeaturePearson Correlation
Average_BBox_BoundingBoxAreaNC0.997757
Average_BBox_RangeValueSex0.997111
Average_BBox_NumHolesSmoke History0.996614
Average_BBox_PerimeterSmoke History0.996614
Average_BBox_FrequencyBandwidthXSmoke History0.995328
Average_BBox_RegionAreaNC0.992481
Average_BBox_EnergyValueMPS0.991225
NoseExpiration_Range_SpectralCrestAge0.990493
Average_BBox_PeakValueSex0.988759
Average_BBox_PerimeterSex0.987818
Table A25. Top 10 features showing the highest Pearson correlations with AHI in Moderate-OSA vs. Severe-OSA comparison.
Table A25. Top 10 features showing the highest Pearson correlations with AHI in Moderate-OSA vs. Severe-OSA comparison.
Feature NamePearson Correlation
Average_BBox_RegionArea0.977981
Average_BBox_ConnectedComponents0.966506
Average_BBox_TextureHomogeneity0.937707
Average_BBox_MeanValue0.937068
Average_BBox_RegionArea0.925231
Average_BBox_BoundingBoxArea0.915942
Average_BBox_EnergyValue0.909118
Average_BBox_CentroidY0.906971
Average_BBox_FrequencyCentroidX0.903769
Average_Range_StdDev0.890411

References

  1. Rizzo, D.; Baltzan, M.; Sirpal, S.; Dosman, J.; Kaminska, M.; Chung, F. Prevalence and regional distribution of obstructive sleep apnea in Canada: Analysis from the Canadian Longitudinal Study on Aging. Can. J. Public Health 2024, 115, 970–979. [Google Scholar] [CrossRef]
  2. Lechat, B.; Naik, G.; Reynolds, A.; Aishah, A.; Scott, H.; Loffler, K.A.; Vakulin, A.; Escourrou, P.; McEvoy, R.D.; Adams, R.J.; et al. Multinight Prevalence, Variability, and Diagnostic Misclassification of Obstructive Sleep Apnea. Am. J. Respir. Crit. Care Med. 2022, 205, 563–569. [Google Scholar] [CrossRef]
  3. Faria, A.; Allen, A.H.; Fox, N.; Ayas, N.; Laher, I. The public health burden of obstructive sleep apnea. Sleep Sci. 2021, 14, 257–265. [Google Scholar]
  4. Singh, M.; Liao, P.; Kobah, S.; Wijeysundera, D.N.; Shapiro, C.; Chung, F. Proportion of surgical patients with undiagnosed obstructive sleep apnoea. Br. J. Anaesth. 2013, 110, 629–636. [Google Scholar] [CrossRef]
  5. Kushida, C.A.; Littner, M.R.; Morgenthaler, T.; Alessi, C.A.; Bailey, D.; Coleman, J., Jr.; Friedman, L.; Hirshkowitz, M.; Kapen, S.; Kramer, M.; et al. Practice parameters for the indications for polysomnography and related procedures: An update for 2005. Sleep 2005, 28, 499–521. [Google Scholar] [CrossRef]
  6. Chen, L.; Pivetta, B.; Nagappa, M.; Saripella, A.; Islam, S.; Englesakis, M.; Chung, F. Validation of the STOP-Bang questionnaire for screening of obstructive sleep apnea in the general population and commercial drivers: A systematic review and meta-analysis. Sleep Breath. 2021, 25, 1741–1751. [Google Scholar] [CrossRef] [PubMed]
  7. Mazzotti, D.R.; Keenan, B.T.; Thorarinsdottir, E.H.; Gislason, T.; Pack, A.I. Sleep Apnea Global Interdisciplinary, C. Is the Epworth Sleepiness Scale Sufficient to Identify the Excessively Sleepy Subtype of OSA? Chest 2022, 161, 557–561. [Google Scholar] [CrossRef] [PubMed]
  8. Alqudah, A.M.; Moussavi, Z. Assessing Obstructive Sleep Apnea Severity During Wakefulness via Tracheal Breathing Sound Analysis. Sensors 2025, 25, 6280. [Google Scholar] [CrossRef]
  9. Alqudah, A.M.; Elwali, A.; Kupiak, B.; Hajipour, F.; Jacobson, N.; Moussavi, Z. Obstructive sleep apnea detection during wakefulness: A comprehensive methodological review. Med. Biol. Eng. Comput. 2024, 62, 1277–1311. [Google Scholar] [CrossRef] [PubMed]
  10. Elwali, A.; Meza-Vargas, S.; Moussavi, Z. Using tracheal breathing sounds and anthropometric information for screening obstructive sleep apnoea during wakefulness. J. Med. Eng. Technol. 2019, 43, 111–123. [Google Scholar] [CrossRef]
  11. Elwali, A.; Moussavi, Z. Obstructive Sleep Apnea Screening and Airway Structure Characterization During Wakefulness Using Tracheal Breathing Sounds. Ann. Biomed. Eng. 2017, 45, 839–850. [Google Scholar] [CrossRef] [PubMed]
  12. Elwali, A.; Moussavi, Z. A Novel Decision Making Procedure during Wakefulness for Screening Obstructive Sleep Apnea using Anthropometric Information and Tracheal Breathing Sounds. Sci. Rep. 2019, 9, 11467. [Google Scholar] [CrossRef]
  13. Elwali, A.; Moussavi, Z. Predicting Polysomnography Parameters from Anthropometric Features and Breathing Sounds Recorded during Wakefulness. Diagnostics 2021, 11, 905. [Google Scholar] [CrossRef] [PubMed]
  14. Hajipour, F.; Jozani, M.J.; Elwali, A.; Moussavi, Z. Regularized logistic regression for obstructive sleep apnea screening during wakefulness using daytime tracheal breathing sounds and anthropometric information. Med. Biol. Eng. Comput. 2019, 57, 2641–2655. [Google Scholar] [CrossRef] [PubMed]
  15. Hajipour, F.; Jozani, M.J.; Moussavi, Z. A comparison of regularized logistic regression and random forest machine learning models for daytime diagnosis of obstructive sleep apnea. Med. Biol. Eng. Comput. 2020, 58, 2517–2529. [Google Scholar] [CrossRef]
  16. Montazeri, A.; Giannouli, E.; Moussavi, Z. Assessment of obstructive sleep apnea and its severity during wakefulness. Ann. Biomed. Eng. 2012, 40, 916–924. [Google Scholar] [CrossRef]
  17. Elwali, A.; Moussavi, Z. Determining Breathing Sound Features Representative of Obstructive Sleep Apnea During Wakefulness with Least Sensitivity to Other Risk Factors. J. Med. Biol. Eng. 2018, 39, 230–237. [Google Scholar] [CrossRef]
  18. Elwali, A.; Moussavi, Z. A feature reduction and selection algorithm for improved obstructive sleep apnea classification process. Med. Biol. Eng. Comput. 2021, 59, 2063–2072. [Google Scholar] [CrossRef]
  19. Astfalck, L.C.; Sykulski, A.M.; Cripps, E.J. Debiasing Welch’s Method for Spectral Density Estimation. Biometrika 2023, 111, 1313–1329. [Google Scholar] [CrossRef]
  20. Mendel, J.M. Tutorial on Higher-Order Statistics (Spectra) in Signal Processing and System Theory: Theoretical Results and Some Applications. Proc. IEEE 1991, 79, 278–305. [Google Scholar] [CrossRef]
  21. Dlask, M.; Kukal, J. Hurst Exponent Estimation from Short Time Series. Signal Image Video Process. 2018, 12, 745–752. [Google Scholar] [CrossRef]
  22. Rosenstein, M.T.; Collins, J.J.; De Luca, C.J. A Practical Method for Calculating Largest Lyapunov Exponents from Small Data Sets. Phys. D Nonlinear Phenom. 1993, 65, 117–134. [Google Scholar] [CrossRef]
  23. Zhao, K.; Wen, H.; Guo, Y.; Scano, A.; Zhang, Z. Feasibility of Recurrence Quantification Analysis (RQA) in Quantifying Dynamical Coordination among Muscles. Biomed. Signal Process. Control 2023, 79, 104042. [Google Scholar] [CrossRef]
  24. Gosala, B.; Kapgate, P.D.; Jain, P.; Chaurasia, R.N.; Gupta, M. Wavelet Transforms for Feature Engineering in EEG Data Processing: An Overview. Biomed. Signal Process. Control 2023, 85, 104811. [Google Scholar] [CrossRef]
  25. Abdul, Z.K.; Al-Talabani, A.K. Mel Frequency Cepstral Coefficient and Its Applications: A Review. IEEE Access 2022, 10, 122136–122158. [Google Scholar] [CrossRef]
  26. Kohlrausch, A. Binaural masking experiments using noise maskers with frequency-dependent interaural phase differences. II: Influence of frequency and interaural-phase uncertainty. J. Acoust. Soc. Am. 1990, 88, 1749–1756. [Google Scholar] [CrossRef]
  27. Rangayyan, R.M.; Reddy, N.P. Biomedical Signal Analysis: A Case-Study Approach; Pergamon Press: New York, NY, USA, 2002; Volume 30. [Google Scholar]
  28. Ojala, T.; Pietikäinen, M.; Mäenpää, T. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 971–987. [Google Scholar] [CrossRef]
  29. Divya, S.; Suresh, L.P.; John, A. Image Feature Generation Using Binary Patterns—LBP, SLBP and GBP. In Advanced Computing and Intelligent Technologies; Springer Nature: Singapore, 2022; pp. 233–239. [Google Scholar]
  30. Jotz, G.P.; Cervantes, O.; Abrahão, M.; Settanni, F.A.P.; de Angelis, E.C. Noise-to-Harmonics Ratio as an Acoustic Measure of Voice Disorders in Boys. J. Voice 2002, 16, 28–31. [Google Scholar] [CrossRef]
  31. Farrús, M.; Hernando, J.; Ejarque, P. Jitter and Shimmer Measurements for Speaker Recognition. Proc. Interspeech 2007, 2007, 778–781. [Google Scholar]
  32. Borowska, M. Entropy-Based Algorithms in the Analysis of Biomedical Signals. Stud. Log. Gramm. Rhetor. 2015, 43, 21–23. [Google Scholar] [CrossRef]
  33. Hanley, J.A.; McNeil, B.J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982, 143, 29–36. [Google Scholar] [CrossRef]
  34. Benesty, J.; Chen, J.; Huang, Y.; Cohen, I. Pearson Correlation Coefficient. In Noise Reduction in Speech Processing; Springer: Berlin/Heidelberg, Germany, 2009; pp. 1–4. [Google Scholar]
  35. Guyon, I.; Weston, J.; Barnhill, S.; Vapnik, V. Gene Selection for Cancer Classification using Support Vector Machines. Mach. Learn. 2002, 46, 389–422. [Google Scholar] [CrossRef]
  36. Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
  37. Tibshirani, R. Regression Shrinkage and Selection via the Lasso. J. R. Stat. Soc. Ser. B Stat. Methodol. 1996, 58, 267–288. [Google Scholar] [CrossRef]
  38. Abeyratne, U.R.; de Silva, S.; Hukins, C.; Duce, B. Obstructive sleep apnea screening by integrating snore feature classes. Physiol. Meas. 2013, 34, 99–121. [Google Scholar] [CrossRef] [PubMed]
  39. Akhter, S.; Abeyratne, U.R.; Swarnkar, V. Variations of snoring properties with macro sleep stages in a population of Obstructive Sleep Apnea patients. In Proceedings of the 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Osaka, Japan, 3–7 July 2013; pp. 1318–1321. [Google Scholar] [CrossRef]
  40. Herer, B.; Sarig-Bahat, H.; Tarasiuk, A.; Lavie, L. Relationships between snore sound complexity and upper airway collapse level. Sleep Breath. 2018, 22, 437–445. [Google Scholar]
  41. Kim, J.; Kim, T.; Lee, D.; Kim, J.W.; Lee, K. Exploiting temporal and nonstationary features in breathing sound analysis for multiple obstructive sleep apnea severity classification. Biomed. Eng. Online 2017, 16, 6. [Google Scholar] [CrossRef] [PubMed]
  42. Ng, A.K.; Koh, T.S.; Abeyratne, U.R.; Puvanendran, K. Investigation of obstructive sleep apnea using nonlinear mode interactions in non-stationary snore signals. Ann. Biomed. Eng. 2009, 37, 1796–1806. [Google Scholar] [CrossRef] [PubMed]
  43. Janott, C.; Schuller, B.; Heiser, C. Snoring—An acoustic definition. Physiol. Meas. 2019, 40, 05T01. [Google Scholar]
  44. Ashraf, W.; Fredberg, J.J.; Moussavi, Z. Aeroacoustics of breath sounds in trachea and upper airway. Appl. Acoust. 2026, 241, 111021. [Google Scholar] [CrossRef]
  45. Dafna, E.; Herer, B.; Eberhardt, R.; Tarasiuk, A. Automatic detection of whole night snoring events using audio recordings. Physiol. Meas. 2013, 34, 1619–1634. [Google Scholar]
  46. Janott, C.; Schmitt, M.; Zhang, Y.; Qian, K.; Pandit, V.; Zhang, Z.; Heiser, C.; Hohenhorst, W.; Herzog, M.; Hemmert, W.; et al. Snoring classified: The Munich-Passau Snore Sound Corpus. Comput. Biol. Med. 2018, 94, 106–118. [Google Scholar] [CrossRef]
  47. Ng, A.K.; Koh, T.S.; Baey, E.; Lee, T.H.; Abeyratne, U.R.; Puvanendran, K. Could formant frequencies of snore signals be an alternative means for the diagnosis of obstructive sleep apnea? Sleep Med. 2008, 9, 894–898. [Google Scholar] [CrossRef]
  48. Huynh, P.K.; Setty, A.R.; Le, T.Q. Koopman spectral analysis of intermittent dynamics in complex systems: A case study in pathophysiological processes of obstructive sleep apnea. arXiv 2022, arXiv:2202.12430. [Google Scholar] [CrossRef]
  49. Kim, T.; Kim, J.W.; Lee, K. Detection of sleep disordered breathing severity using acoustic biomarker and machine learning techniques. Biomed. Eng. Online 2018, 17, 16. [Google Scholar] [CrossRef]
  50. Mikami, T.; Ueki, S.; Takahashi, H.; Yonezawa, K. Detecting nonlinear acoustic properties of snoring sounds using Hilbert–Huang Transform. In Proceedings of the International Conference on Biomedical Electronics and Devices; SCITEPRESS: Setúbal, Portugal, 2015. [Google Scholar]
  51. Majumder, S. A Gaussian mixture model method for eigenvalue-based spectrum sensing with uncalibrated multiple antennas. Signal Process. 2022, 192, 108404. [Google Scholar] [CrossRef]
  52. Sebastian, A.; Cistulli, P.A.; Cohen, G.; de Chazal, P. Automatic classification of OSA-related snoring signals from nocturnal audio recordings. arXiv 2021, arXiv:2102.12829. [Google Scholar]
  53. Akhter, S.; Bradley, T.D.; Morrell, M.J. Investigation of macro sleep-stage influences on snore characteristics in OSA. Am. J. Respir. Crit. Care Med. 2014, 189, A567. [Google Scholar]
  54. Janott, C.; Schuller, B.; Heiser, C. Acoustic information in snoring noises. HNO 2017, 65, 107–116. [Google Scholar] [CrossRef] [PubMed]
  55. Ye, Z.; Peng, J.; Zhang, X.; Song, L. Snoring Sound Recognition Using Multi-Channel Spectrograms. Arch. Acoust. 2024, 49, 169–178. [Google Scholar] [CrossRef]
  56. Hajipour, F.; Moussavi, Z. Spectral and Higher Order Statistical Characteristics of Expiratory Tracheal Breathing Sounds During Wakefulness and Sleep in People with Different Levels of Obstructive Sleep Apnea. J. Med. Biol. Eng. 2019, 39, 244–250. [Google Scholar] [CrossRef]
  57. Kevat, A.; Bernard, A.; Harris, M.A.; Heussler, H.; Black, R.; Cheng, A.; Waters, K.; Chawla, J. Impact of adenotonsillectomy on growth trajectories in preschool children with mild-moderate obstructive sleep apnea. J. Clin. Sleep Med. 2023, 19, 55–62. [Google Scholar] [CrossRef]
  58. Cao, S.; Rosenzweig, I.; Bilotta, F.; Jiang, H.; Xia, M. Automatic detection of obstructive sleep apnea based on speech and snoring sounds with AI. J. Thorac. Dis. 2024, 16, 2654–2667. [Google Scholar] [CrossRef] [PubMed]
  59. Gonzalez-Martinez, F.D.; Carabias-Orti, J.J.; Canadas-Quesada, F.J.; Ruiz-Reyes, N.; Martinez-Munoz, D.; Garcia-Galan, S. Improving snore detection under limited dataset through harmonic/percussive source separation and convolutional neural networks. Appl. Acoust. 2024, 216, 109811. [Google Scholar] [CrossRef]
  60. Nagappa, M.; Liao, P.; Wong, J.; Auckley, D.; Ramachandran, S.K.; Memtsoudis, S.; Mokhlesi, B.; Chung, F. Validation of the STOP-Bang Questionnaire as a Screening Tool for Obstructive Sleep Apnea among Different Populations: A Systematic Review and Meta-Analysis. PLoS ONE 2015, 10, e0143697. [Google Scholar] [CrossRef] [PubMed]
  61. Balaei, A.T.; Sutherland, K.; Cistulli, P.A.; de Chazal, P. Automatic Detection of Obstructive Sleep Apnea Using Facial Images. In Proceedings of the 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), Melbourne, VIC, Australia, 18–21 April 2017; pp. 215–218. [Google Scholar]
  62. Romano, S.; Salvaggio, A.; Hirata, R.P.; Lo Bue, A.; Picciolo, S.; Oliveira, L.V.; Insalaco, G. Upper airway collapsibility evaluated by a negative expiratory pressure test in severe obstructive sleep apnea. Clinics 2011, 66, 567–572. [Google Scholar] [CrossRef]
  63. Kushida, C.A.; Efron, B.; Guilleminault, C. A predictive morphometric model for the obstructive sleep apnea syndrome. Ann. Intern. Med. 1997, 127, 581–587. [Google Scholar] [CrossRef]
  64. Sola-Soler, J.; Fiz, J.A.; Torres, A.; Jane, R. Identification of Obstructive Sleep Apnea Patients from Tracheal Breath Sound Analysis during Wakefulness in Polysomnographic Studies. In Proceedings of the 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Chicago, IL, USA, 26–30 August 2014; pp. 4232–4235. [Google Scholar]
  65. El-Sayed, I.H. Comparison of four sleep questionnaires for screening obstructive sleep apnea. Egypt. J. Chest Dis. Tuberc. 2012, 61, 433–441. [Google Scholar] [CrossRef]
  66. Lee, R.W.; Petocz, P.; Prvan, T.; Chan, A.S.; Grunstein, R.R.; Cistulli, P.A. Prediction of obstructive sleep apnea with craniofacial photographic analysis. Sleep 2009, 32, 46–52. [Google Scholar] [CrossRef]
  67. Montero Benavides, A.; Fernández Pozo, R.; Toledano, D.T.; Blanco Murillo, J.L.; López Gonzalo, E.; Hernández Gómez, L. Analysis of Voice Features Related to Obstructive Sleep Apnoea and Their Application in Diagnosis Support. Comput. Speech Lang. 2014, 28, 434–452. [Google Scholar] [CrossRef]
  68. Simply, R.M.; Dafna, E.; Zigel, Y. Diagnosis of Obstructive Sleep Apnea Using Speech Signals From Awake Subjects. IEEE J. Sel. Top. Signal Process. 2020, 14, 251–260. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.