1. Introduction
The current landscape of general population otologic screening highlights a critical need for early detection and intervention strategies for hearing loss and related otologic conditions [
1]. Early assessments of children’s hearing and periodic screenings throughout life are essential for identifying otologic diseases, thereby mitigating the health and social consequences associated with hearing impairment [
2,
3,
4]. The World Health Organization (WHO) estimates that over 360 million individuals globally experience disabling hearing loss, emphasizing the urgency for effective screening strategies, particularly in low- and middle-income countries where healthcare access is often limited [
5]. In pediatric populations, otologic screening is increasingly recognized as vital due to the high incidence of conditions such as acute otitis media, which affects over 80% of children by age three [
6].
The American Academy of Audiology’s Clinical Practice Guidelines for Childhood Hearing Screening recommend initial assessments using pure tone audiometry, followed by tympanometry [
7]. If pure tone assessment is not feasible, otoacoustic emission measurements are employed [
8]. This structured approach underscores the necessity for comprehensive screening protocols that address both hearing and developmental concerns, particularly since untreated hearing loss can lead to significant cognitive and educational deficits [
9,
10,
11,
12].
In contrast, otologic screening for adults, especially those aged 50 and older, remains inconsistent. The U.S. Preventive Services Task Force (USPSTF) has historically stated that there is insufficient evidence to recommend routine screening for hearing loss in asymptomatic adults, which has resulted in a lack of standardized practices in primary care settings [
13,
14,
15]. However, recent studies advocate for the integration of hearing screening into routine health assessments, emphasizing that early detection can significantly improve quality of life and cognitive outcomes [
16,
17]. Hearing loss has been shown to severely affect social function and mental health, contributing to cognitive decline and isolation in both pediatric and adult populations [
8,
18,
19].
While screening programs are well established among children and industrial workers exposed to occupational hazards, broader population-level implementation remains limited. Disparities in screening rates, particularly among older adults and socioeconomically disadvantaged groups, underline the need for scalable, low-cost, and effective diagnostic technologies.
A key limitation in current screening practice lies in the diagnostic methods themselves. Tympanometry remains a gold standard due to its low cost and ease of use, and it is supported by robust clinical literature. It measures ear function via changes in acoustic impedance at a single frequency under variable air pressure [
20]. However, the method is not without drawbacks: it can induce discomfort or even risk damage in sensitive patients [
21], and may produce similar readings across diverse pathologies [
22]. Efforts to improve diagnostic resolution through wideband acoustic immittance (WAI), which extends the frequency range of analysis, have shown promise, but the complexity of data interpretation limits its widespread use [
23,
24,
25,
26,
27].
To overcome these limitations, the Pressure-Less Acoustic Immittance (PLAI™) system has been introduced as a truly non-invasive alternative. Based on microelectromechanical systems (MEMS), PLAI™ eliminates the need for ear canal pressurization while enabling multi-parameter measurements of middle ear function [
28]. Although promising, PLAI™ relies on a novel paradigm and must therefore be validated against gold standard methods through rigorous clinical evaluation [
29].
Initial investigations into PLAI™ suggest that its derived parameters exhibit strong physiological correlation and are sensitive to both pathology and patient age [
30,
31]. Therefore, the aim of this study is to determine whether age-dependent stratification improves diagnostic performance and whether non-invasive machine-learning based identification of common otologic pathologies can be achieved using PLAI™-derived features without reliance on conventional tympanometry.
2. Materials and Methods
2.1. Study Population
Outpatients aged from 0 to 80 years, both healthy and affected by various ear pathologies, were enrolled in the study from six Italian hospitals: Gemelli University Hospital IRCCS (Rome), “Federico II” University Hospital (Naples), Fondazione IRCCS Policlinico “San Matteo” Hospital (Pavia), “Giovanni XXIII” Children’s Hospital (Bari), “Guglielmo da Saliceto” Hospital (Piacenza), and IRCCS Maternal and Child Health Institute “Burlo Garofolo” (Trieste). Each patient underwent a clinical otologic examination, which included otoscopy and, where feasible, tympanometry to confirm the diagnosis. Following diagnosis, PLAI measurements were acquired to assess the middle ear’s acoustic properties under pressure-less conditions. The study was approved by the Ethics Committees of all participating institutions (Protocol No. 5389, 19 January 2023, IRCCS “A. Gemelli” Hospital, Rome), and by the Italian Ministry of Health (MEDWAVE-2, IT-23-03-042692, 9 May 2023). Written informed consent was obtained from adult participants; for minors, consent was provided by a parent or legal guardian.
Only patients with ears evaluated as healthy or as being affected by a single pathology were enrolled in the study, also excluding patients with a history of otologic surgery, significant comorbidities, and/or cognitive impairments that could affect measurements. All subjects underwent otoscopy and tympanometry to confirm the status of their auditory organs.
Table 1 reports the distribution of individual ears by age group and diagnosis.
2.2. PLAI™ Device Description
The Pressure-Less Acoustic Immittance (PLAI™; Neuranix Srl, Naples, Italy) system is a MEMS-based diagnostic tool designed for middle ear assessment without the need for external pressurization. Unlike conventional tympanometry, PLAI operates under ambient pressure, using a dual-sensor setup capable of measuring both acoustic pressure and particle velocity at the same location (
Figure 1). The device emits a wideband acoustic signal spanning 100 to 2000 Hz, which interacts with the ear canal and tympanic membrane. The reflected signal, containing key information about the mechanical and acoustic properties of the middle and outer ear, is captured by two microphones. These microphones measure the pressure and axial velocity of the air particles in the ear canal.
From the acquired signals, PLAI™ computes an acoustic admittance curve, from which it extracts numerous parameters, including the following:
Resonance frequency (Fres);
Peak admittance (Peak);
Minimum and maximum frequency bounds (minBnd, maxBnd);
Bandwidth amplitude (AltBnd);
Equivalent ear canal volume (Vol).
The full feature set also includes age and labels used for classification.
2.3. Statistical Analysis
To assess whether PLAI™-derived parameters differ significantly between healthy and pathological ears, each acoustic feature was compared across diagnosis groups within each age category using Wilcoxon rank-sum test (Bonferroni correction was applied for multiple comparisons). A significance threshold of p < 0.05 was set.
2.4. Classification Pipeline
Following statistical analysis, a classification pipeline was implemented to evaluate the ability of PLAI™ features to distinguish between healthy and pathological ears.
The dataset was split into training (80%) and testing (20%) subsets within each age group. Feature selection was performed using a Random Forest classifier’s intrinsic feature importance score, retaining only the most relevant features per group. SMOTE (Synthetic Minority Over-sampling Technique) was applied to the training set to mitigate class imbalance by generating synthetic samples for underrepresented diagnostic categories.
Random Forest was selected as the classifier due to its robustness to overfitting, ability to handle nonlinear relationships, and compatibility with feature importance analysis. Hyperparameters were tuned based on performance on the validation split. Model evaluation included accuracy, precision, recall, F1-score, and specificity, computed both per class and as micro- and macro-averages. ROC curves were plotted, and confusion matrices were generated in grayscale for each age category.
To support model interpretability, SHAP (SHapley Additive exPlanations) values were computed, revealing the impact of individual features on classification decisions [
32]. Feature importance visualizations are included in the results section for each age group.
All analyses and visualizations were performed in Python 3.8 using Scikit-learn, Imbalanced-learn, and SHAP libraries.
4. Discussion
This study investigated whether stratifying pediatric subjects by age groups improves the diagnostic modeling of middle ear conditions using quantitative PLAI™ features. Motivated by the known developmental changes in the auditory system during infancy and childhood [
34,
35], we hypothesized that age-related physiological differences significantly influence tympanometric measurements, thus impacting classification accuracy when predicting pathologies such as otitis media with effusion (OME), tympanic membrane perforation (Perf), otosclerosis (Otoscl), and retraction (Retract).
The primary aim of this study was to determine whether stratifying by age reveals statistically significant differences between diagnostic groups and to evaluate the performance of machine learning classifiers trained within these age-defined strata for detecting ear pathologies without relying on conventional, and potentially invasive, tympanometry.
Statistical comparisons of wideband tympanometry parameters showed that age stratification enhances the discrimination between healthy and pathological ears, particularly in the youngest (0–3 years) and oldest (12+ years) age groups. These results justified the creation of age-specific models, which subsequently demonstrated robust classification performance. Specifically, random forest classifiers, trained with SMOTE-balanced datasets, achieved macro F1-scores above 80% in both the 0–3 and 12+ groups. The SHAP analysis further revealed consistent feature relevance patterns, with Peak, Fres, and Vol emerging as the most discriminative parameters across all age groups.
Importantly, stratifying the analysis by age groups led to a marked reduction in false negative rates for conditions like OME and tympanic membrane retractions. This improvement strengthens diagnostic reliability and highlights the model’s potential utility as a non-invasive screening tool in clinical settings. In particular, the PLAI™ system could be integrated into routine examinations by general practitioners and pediatricians, enabling them to assess ear health rapidly and with minimal training. By flagging abnormal patterns suggestive of middle ear pathology, the system could support early identification and prompt referral of at-risk children for further evaluation by an otolaryngologist. This triage function may be especially valuable in primary care and community health settings where access to specialist equipment and expertise is limited.
Physiologically, these observations align with known developmental changes in the ear canal and middle ear structures. In infants and toddlers (0–3 years), the ear canal is shorter and more compliant, while the ossicular chain and tympanic membrane undergo maturation that can dramatically affect acoustic transfer functions [
36]. This likely contributes to the distinct parameter distributions observed in this group and underscores the need for specialized diagnostic thresholds in early life. Conversely, adolescents (12+ years) display more adult-like anatomy and auditory responses, which could explain the enhanced classification performance and clearer parameter separation in this group.
Among the tympanometric features, Peak admittance, representing energy flow at the resonant frequency, was repeatedly selected as a key predictor. Fres, the resonant frequency of the middle ear system, also showed high discriminative power, likely reflecting changes in ossicular ant tympanic stiffness. Volume (Vol), while highly informative in distinguishing perforations, showed some limitations in overlapping pathologies such as OME and retractions, particularly in the middle group (3–12 years), where developmental heterogeneity may blur diagnostic distinctions.
Despite promising results, this study has limitations. The dataset, while substantial, exhibited class imbalance, particularly in less frequent pathologies such as otosclerosis and retraction, which may affect classifier generalizability. While SMOTE was employed to address class imbalance, particularly for underrepresented diagnoses such as otosclerosis and tympanic membrane retraction, several limitations should be considered. SMOTE synthesizes new examples by interpolating between existing minority class samples, which may not fully capture the variability inherent in complex or rare pathologies. Moreover, age bins were defined in three broad categories; finer stratification might yield further insights but would require a larger cohort to maintain statistical power. In this study, the small samples available for these rare conditions were assumed to be representative of their respective classes. However, this assumption carries inherent risks: if the original samples do not adequately reflect the broader clinical variability of the condition, the synthetic data may reinforce sampling biases rather than mitigate them. Future work should prioritize collecting more real-world data for the minority classes to improve generalizability and reduce reliance on synthetic augmentation.
Lastly, although the models demonstrated strong diagnostic potential, especially in early and late childhood, further validation on external datasets is necessary to confirm generalizability. Expanding the dataset, especially with more balanced class representation and longitudinal follow-ups, would be beneficial.
Nonetheless, the study underscores the importance of age-aware diagnostic modeling in pediatric audiology and highlights PLAI™ as a promising non-invasive tool for early and reliable identification of middle ear conditions.
5. Conclusions
This study demonstrated that age-stratified analysis enables the classification of common otologic pathologies using a non-invasive screening methodology. By dividing subjects into physiologically meaningful age groups (0–3, 3–12, and 12+ years), we observed enhanced parameter discrimination and superior classification performance, particularly in the youngest and oldest cohorts. The application of random forest classifiers, combined with SHAP-based feature relevance analysis, identified Peak admittance, Fres, and Volume as key contributors to diagnostic accuracy.
These findings highlight the critical role of age-dependent anatomical and physiological changes in shaping acoustic immittance responses, reinforcing the necessity of age-specific diagnostic frameworks. Despite limitations related to class imbalance and sample size, our results indicate strong potential for integrating machine learning and tympanometric features into future clinical tools for early detection of middle ear pathologies.
Further research is warranted to validate these models in larger, more diverse populations and to refine age-specific diagnostic thresholds. Future work will also aim to establish diagnostic criteria for specific ear pathologies based on PLAI™ measurements and to validate these criteria through comparison with established clinical gold standards, such as otoscopic evaluation and audiological testing. Ultimately, this approach holds promise for improving diagnostic precision and guiding early interventions in pediatric audiology.