Development of an AI-Empowered Novel Digital Monitoring System for Inhalation Flow Profiles

Ziyi Fan; Yuqing Ye; Jiale Chen; Ying Ma; Jesse Zhu

doi:10.3390/s25144402

¹ Department of Biomedical Engineering, University of Western Ontario, London, ON N6A 3K7, Canada

² Department of Chemical Engineering, Nottingham Ningbo China Beacons of Excellence Research and Innovation Institute, The University of Nottingham Ningbo China, Ningbo 315100, China

³ Suzhou Inhal Pharma Co., Ltd., Suzhou 215125, China

⁴ School of Chemical and Biomolecular Engineering, Eastern Institute of Technology, Ningbo 315200, China

Sensors 2025, 25(14), 4402; https://doi.org/10.3390/s25144402

This article belongs to the Special Issue Integrated Sensor Systems for Medical Applications

Abstract

The use of dry powder inhalers (DPIs) represents a cornerstone in the treatment of chronic pulmonary diseases. However, suboptimal inhalation technique, including inadequate airflow rates, remains a persistent obstacle to effective therapeutic outcomes, as many patients are unaware of their insufficient inhalation performance. Digital monitoring systems coupled with DPIs have emerged as an effective strategy to estimate flow profiles and provide inhalation information, and the estimation can be further facilitated by artificial intelligence (AI). In this work, a novel digital system that primarily monitors pressure during DPI usage was designed, and machine learning (ML) techniques were then employed to estimate inhalation flow profiles from the captured data. Four machine learning models were selected for subsequent inhalation parameter prediction, given their superior generalization ability. Using these models, inhalation flow profiles were estimated with an accuracy of 97.7% for peak inspiratory flow rate (PIFR) and 95.2% for inspiratory capacity (IC). In summary, the pressure-based digital monitoring system empowered by AI techniques can assess inhalation flow profiles with high accuracy.

Keywords: dry powder inhalers; digital monitoring system; machine learning; flowrate estimation; inhalation flow profile

1. Introduction

Dry powder inhalation is one of the most commonly employed methods in clinical practice, especially for the treatment of pulmonary diseases such as asthma and chronic obstructive pulmonary disease (COPD), which affect over half a billion patients worldwide [1]. Most dry powder inhalers (DPIs) available on the market are breath-activated, relying heavily on inspiratory airflow to fluidize and disperse the powdered formulation [2]. To achieve optimal therapeutic outcomes, dry powder inhalation requires patient adherence to prescribed dosing regimens, correct inhaler usage, and an adequate inspiratory airflow rate [3]. Despite decades of efforts aimed at improving patient adherence and inhaler technique, a substantial proportion of patients fail to follow prescribed dosing regimens, with misuse reported in up to 80% of cases [4,5].

To enhance disease management and optimize treatment outcomes, digital monitoring systems that integrate sensor technologies and mobile connectivity [6] have been developed. These systems can continuously record the inhalation process and extract critical inhalation parameters, providing real-time feedback to patients regarding inhaler usage [3,7,8]. Such day-to-day feedback could improve patient adherence and inhaler technique by making individuals more aware of suboptimal inhalation patterns, such as insufficient inspiratory flow that could impair drug delivery efficiency [9,10,11]. Additionally, the digital systems allow for remote sharing of inhalation data with healthcare providers, facilitating timely adjustments to treatment strategies and, therefore, better disease control. Overall, a digital monitoring system with active feedback presents an opportunity to improve the therapeutic efficacy of inhaled therapy, reduce disease exacerbations, and lower the frequency of emergency hospital admissions [12,13].

Among the issues associated with inhaler misuse, the inability of patients to achieve a sufficient airflow rate is a critical challenge that impairs effective pulmonary drug delivery. To indicate a patient’s capability to achieve adequate inhalation, peak inspiratory flow rate (PIFR), defined as the maximal flow rate during an inspiratory maneuver, is frequently used as a crucial parameter of inhalation [14,15]. A PIFR above 30 L/min, ideally above 60 L/min, is recommended to ensure effective powder dispersion and dose delivery [3,16]. On the contrary, an insufficient PIFR may lead to suboptimal drug deposition and compromised therapeutic effects. In addition to PIFR, inspiratory capacity (IC) has also been used as a key parameter to indicate the inhalation of patients. A reduced IC is often associated with respiratory exacerbations, as it reflects limitations in lung volume and capacity, which may worsen during acute episodes of airway obstruction or disease progression [17].

Given the critical role of sufficient airflow rate in effective pulmonary drug delivery, studies have investigated methods for accurately estimating inhalation airflow rates. Among these methods, two of the most prominent approaches involve audio-based and pressure-based measurements. Audio-based flow rate estimation relies on microphones to capture sound signals related to inhalation. This method has been applied to remotely monitor inhalation technique during inhaler usage [3,8,18,19]. For instance, in the work by Holmes et al., audio signals were recorded using the Inhaler Compliance Assessment (INCA™) device attached to the Ellipta™ DPI [3]. The acoustic envelope (how a sound’s amplitude changes over time) of each inhalation was extracted, and regression models were applied to determine the best relationship between the inhalation audio signal and the corresponding flow rate. However, the audio-based method is highly susceptible to ambient noise, which interferes with signal quality and leads to inaccurate estimation of the flow profile [20]. In particular, at low inhalation airflow rates, flow signals can be too weak to distinguish from ambient background noise, making accurate prediction even more difficult [10,21]. The method also requires complex signal processing techniques to effectively filter noise and extract features.

The pressure-based method represents another feasible strategy, typically employing pressure sensors to measure differential pressure signals across the inhaler to estimate flow rate. One example of commercially available digital systems is Digihaler® (Teva Pharmaceutical, Parsippany, NJ, USA), which incorporates a pressure sensor inside the inhaler to estimate inhalation flow rate. Another representative example is the Sensirion® (Sensirion AG, Stäfa, Switzerland) clip-on prototype, which utilizes a differential pressure sensor to record signals associated with inhalation and airflow rate. Although the functional capabilities of these systems have been described, limited research has been published on the methodologies for estimating inhalation flow rates using such pressure-based measurements.

In recent years, artificial intelligence (AI) has been applied in inhaled therapy owing to its strengths in recognition, prediction, and data analysis [22]. Machine learning (ML), as a pivotal AI technique, has been utilized to establish correlations between multidimensional sensor data (such as sound and pressure signals) and inhalation parameters (such as PIFR and IC) [23,24,25]. For example, Fakotakis et al. revisited sound pattern recognition with machine learning techniques (decision tree model, hidden Markov model, and random forest), and the results showed excellent classification efficiency on inhalation-related activities, although no single model clearly outperformed the others [20]. Alam et al. also applied machine learning models to predict forced expiratory volume in 1 s (FEV1, a key indicator of lung function) with an accuracy of 85% by recording the voice signal of asthma patients, although this approach was not intended for DPI use monitoring [26].

Based on our review of current digital monitoring systems for DPIs, some proposed AI-empowered systems employ an acoustic method and use AI techniques to categorize inhalation activities in order to address misuse issues [27,28]. However, to date, there is no published report on the integration of AI techniques with a pressure-based system for inhalation flow estimation. Therefore, a novel digital system that utilizes pressure signals was developed, taking advantage of its superior resistance to ambient noise. Advanced ML techniques were also integrated to refine the signal processing, enhancing the accuracy and reliability of the system. The proposed system features a compact MEMS sensor that simultaneously measures pressure, temperature, and humidity signals, thereby also enabling multidimensional compensation. The approach not only overcomes the limitations of traditional analysis methods such as linear regression but also highlights the potential of machine learning techniques in improving the estimation of PIFR and IC. This study underscores the significance of integrating AI to enhance the performance of pressure-based monitoring systems in DPIs.

2. Materials and Methods

To monitor the inhalation activities, a novel digital monitoring system was developed. Its main part is a custom-designed digital module comprising mechanical frames and electronic components, attached to a capsule-based inhaler, Breezhaler®. The digital module, together with the DPI, constitutes a complete digital monitoring system designed to collect inhalation data of patients.

2.1. Digital Module Design

Figure 1 presents the trimetric view (left) and the exploded view (right) of 3D models of the DPI digital monitoring system. The 3D models of both the inhaler and the digital module were created by SolidWorks 2023. The digital module includes a base, a case, and a self-designed printed circuit board (PCB). The mechanical frames of the digital module were prototyped using a Stereolithography 3D printer (iSLA660, ZRapid Tech, Suzhou, China) and feature a specially designed pressure detection configuration. The frame design of the digital module ensures its seamless integration with the inhaler while preserving functional integrity of the DPI.

Figure 1. Trimetric view (left) and exploded view (right) of the custom-designed DPI digital monitoring system.

Figure 2 shows a cross-sectional view of the customized digital module with the DPI, Breezhaler® (Novartis, Basel, Switzerland). In the digital module, the case holds a MEMS (Micro-Electro-Mechanical Systems) sensor encased inside a “sensor cell”. During inhalation, ambient air is entrained into the system through the gas inlets of the inhaler. The sensor cell is connected to the airflow pathway, enabling precise pressure measurements for accurate detection of the inhalation flow profile without direct airflow passing through it. At the same time, the airflow stream travels into the circular chamber of the DPI, rotating the capsule and aerosolizing the formulation powders inside. This configuration for pressure detection is specially designed to minimize any interference with the original flow field of the inhaler, preserving the functional performance of the DPI.

Figure 2. The cross-sectional view of the customized digital module with the dry powder inhaler.

The mechanical frames of the digital module house a PCB that integrates a MEMS sensor, Bluetooth module, USB port, battery storage, and an LED indicator. The MEMS sensor captures real-time pressure, humidity, and temperature data, enabling comprehensive monitoring of both user activity and environmental conditions during inhalation. The Bluetooth module, which also functions as a microcontroller unit, processes the collected data and wirelessly transmits it to external devices for further analysis. The battery powers the module, while the USB port supports recharging. The LED indicator serves as a user interface, signaling system readiness and operation status. This compact, integrated design enables efficient and real-time inhalation monitoring.

2.2. Signal Collection

The experimental setup for the collection of inhalation data is shown in Figure 3. Pressure signals were obtained directly from the custom-designed digital monitoring system, while the real-time flow rate signals were simultaneously recorded using a digital bidirectional flowmeter (model SFM3000, Sensirion AG, Stäfa, Switzerland) with a 14-bit resolution and a flow range of ±200 L/min. The flowmeter was interfaced with the same data acquisition laptop, enabling real-time visualization of flow rates (serving as reference flowrate values) through the Sensirion® ControlCenter 1.40.2 DataViewer software. Both the pressure sensor and the flow sensor were configured at a sampling rate of 20 Hz, based on the previous work by Taylor et al., which indicates that a cut-off frequency of 4 Hz is sufficient to capture the rapid changes in signals during inhalation [10]; a 20 Hz sampling rate gives a Nyquist frequency of 10 Hz, which lies above this cut-off.

Figure 3. Experimental setup for the collection of inhalation data.

Eight healthy volunteers participated in this study with informed consent. To capture natural inhalation behaviors, no specific instructions on inhaler usage techniques were provided. Each participant performed 20 inhalations, resulting in a total of 160 inhalation recordings. Each recording consisted of multivariate time-series data, yielding approximately 328 data points (82 timestamps × 4 channels) per recording. The final dataset comprised 13,168 rows in tabular format, including synchronized measurements of flow rate, pressure, temperature, and humidity, along with corresponding timestamps.
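The dataset figures quoted above can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers stated in the text; the variable names are illustrative and do not come from the study's code.

```python
# Dataset-size arithmetic using only figures stated in the text.
participants = 8
inhalations_per_participant = 20
total_rows = 13_168   # rows in the final tabular dataset
channels = 4          # flow rate, pressure, temperature, humidity

recordings = participants * inhalations_per_participant
rows_per_recording = total_rows / recordings          # average timestamps
points_per_recording = round(rows_per_recording) * channels

print(recordings)                    # number of inhalation recordings
print(round(rows_per_recording, 1))  # average timestamps per recording
print(points_per_recording)          # approximate data points per recording
```

At the 20 Hz sampling rate, about 82 timestamps corresponds to roughly 4 s of signal per recording.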

2.3. Data Preprocessing and Model Selection

Typically, signal preprocessing is performed to standardize the input for machine learning models. In this study, the raw signals acquired from the digital monitoring system, including pressure, temperature, humidity, and timestamp, were intentionally subjected to minimal preprocessing. This approach was adopted to allow the models to learn underlying patterns and perform inherent denoising autonomously during training. Specifically, the pressure and flow signals were only processed using a second-order low-pass Butterworth filter with a 4 Hz cut-off frequency to enhance estimation accuracy, as recommended by Taylor et al. [10].
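The filtering step can be illustrated with a minimal, standard-library sketch of a second-order Butterworth low-pass filter at a 4 Hz cut-off and 20 Hz sampling rate. The source does not state how the filter was implemented (for example, single-pass or zero-phase), so the function names, the bilinear-transform design, and the causal single-pass application below are assumptions made for illustration.

```python
import math

def butter2_lowpass_coeffs(cutoff_hz, fs_hz):
    """Second-order Butterworth low-pass coefficients via the bilinear transform."""
    k = math.tan(math.pi * cutoff_hz / fs_hz)   # pre-warped cut-off
    q = 1.0 / math.sqrt(2.0)                    # Butterworth quality factor
    norm = 1.0 / (1.0 + k / q + k * k)
    b0 = k * k * norm
    b = (b0, 2.0 * b0, b0)
    a = (1.0, 2.0 * (k * k - 1.0) * norm, (1.0 - k / q + k * k) * norm)
    return b, a

def lowpass(signal, cutoff_hz=4.0, fs_hz=20.0):
    """Filter once, forward in time (causal, so some phase lag is introduced).

    Assumption: the filter state starts at zero, which causes a short
    start-up transient if the signal does not begin near zero.
    """
    b, a = butter2_lowpass_coeffs(cutoff_hz, fs_hz)
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x0 in signal:
        y0 = b[0] * x0 + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        out.append(y0)
        x2, x1 = x1, x0
        y2, y1 = y1, y0
    return out

# A constant input passes through unchanged once the transient has decayed.
print(round(lowpass([1.0] * 200)[-1], 6))
```

In practice a signal-processing library would normally be used for this step; the hand-written recursion is shown only to make the filter structure explicit.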

For model selection, existing ML algorithms, including individual base learners and advanced heterogeneous ensembles, were selected, applied, and optimized for the subsequent prediction of inhalation airflow. Base learners are individual machine learning models that function as the building blocks for ensemble learning models, while heterogeneous ensembles are a type of ensemble learning that combines predictions from multiple different types of base learners [29,30]. A comparative analysis of eight base learner algorithms was conducted as a foundational approach to understand and predict inhalation flow profiles. Following the evaluation of these individual algorithms, advanced ensemble learning techniques were assessed to explore their potential improvements in predictive accuracy and robustness.

ML models were trained and evaluated using a 5-fold cross-validation approach, with the exception of the blending ensembles, which employed a holdout set strategy for evaluation. To implement the cross-validation, the collected dataset was divided into a training set and a testing set, with 75% of the data allocated for training and the remaining 25% for testing. Moreover, ML model hyperparameters were tuned using a grid search approach, enabling systematic exploration and optimization of model parameters to achieve robust performance [31].
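The 75/25 split and 5-fold partitioning described above can be sketched as follows. This is an index-level illustration only: the source does not say whether the split was made per row or per recording, nor which random seed was used, so the row-level shuffle, the seed, and the function names are assumptions. With scikit-learn, `train_test_split`, `KFold`, and `GridSearchCV` would typically cover these steps.

```python
import random

def train_test_split_indices(n_samples, test_fraction=0.25, seed=0):
    """Shuffle row indices and split them into (train, test) lists."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)   # seed is an arbitrary assumption
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]

def k_fold_indices(indices, k=5):
    """Yield (training_part, validation_part) pairs for k-fold cross-validation."""
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        validation = folds[i]
        training = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield training, validation

# 13,168 rows is the dataset size reported in the text.
train_idx, test_idx = train_test_split_indices(13_168)
folds = list(k_fold_indices(train_idx, k=5))

print(len(train_idx), len(test_idx))
print([len(validation) for _, validation in folds])
```

Each hyperparameter combination in a grid search would be scored on the five validation parts and the combination with the best mean score retained.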

All model training and evaluation were performed on a personal computing platform equipped with an AMD Ryzen 7 4700U CPU @2.00 GHz processor without GPU acceleration. The machine learning algorithms were implemented primarily using the scikit-learn library (Python 3.9). Due to variations in model complexity and scope of grid search parameter tuning, the total training time differed across algorithms. A detailed summary of the training durations for each model is presented in Supplementary Table S1.

2.3.1. Base Learner Algorithms

In this section, eight base learners were examined, each tested with diverse parameters. The Decision Tree algorithm, due to its simplicity and interpretability, was used for subsequent comparisons [32]. Support Vector Machine (SVM), which is renowned for its ability to maximize the margin between classified data points, was employed to model nonlinear relationships within the dataset and enhance model performance in complex scenarios [33]. Gaussian Process Regressor (GPR) was used to generate probabilistic predictions along with uncertainty estimates. This capability is particularly valuable when dealing with small-sized samples, as it aids in understanding the variability that is inherent in inhalation flow profiles.

To improve the accuracy and robustness of predictive models, various homogeneous ensemble learning techniques were applied. Homogeneous ensembles aggregate multiple instances of the same type of base learner. They are expected to reduce overall error and enhance predictive accuracy compared to individual base learner models. For example, bagging-based algorithms, such as Random Forest (RF) and Extra Trees Regressor (ETR), were implemented to reduce variance and stabilize predictions. Boosting-based algorithms, including AdaBoost, XGBoost, and GradientBoosting, were utilized to sequentially correct errors and strengthen overall model performance. In this study, the authors grouped the individual models (Decision Tree, SVM, and GPR) together with the homogeneous ensemble models (RF, ETR, AdaBoost, XGBoost, and GradientBoosting) as base learner models to preliminarily evaluate their performance.

2.3.2. Heterogeneous Ensemble Algorithms

Despite the trials in Section 2.3.1, homogeneous ensembles may not fully capture the diversity of insights offered by distinct modeling approaches. To overcome this limitation, heterogeneous ensemble algorithms were then introduced, combining different types of base learners to harness their complementary strengths, thus further improving predictive performance and robustness. Based on the combination strategies, heterogeneous ensemble models can be further categorized into voting ensembles, stacking ensembles, and blending ensembles.

Voting is one of the fundamental heterogeneous methods in machine learning. In the training stage, as shown in Supplementary Figure S1, multiple base learners are trained independently on the same dataset. Following this, the final prediction in the output stage is commonly obtained by calculating the average predictions generated by each base model in the training stage. This averaging approach helps to mitigate individual model errors as well as enhance the robustness and generalizability of the ensemble’s prediction.
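The averaging step of a voting ensemble can be written in a few lines. The sketch below assumes equal weights for all base models, and the prediction values are hypothetical numbers chosen for illustration, not results from the study.

```python
def voting_predict(base_predictions):
    """Average the per-sample predictions of several base models (equal weights)."""
    n_models = len(base_predictions)
    return [sum(values) / n_models for values in zip(*base_predictions)]

# Hypothetical flow-rate predictions (L/min) from three base models
# for the same three samples.
preds = [
    [30.0, 62.0, 90.0],
    [32.0, 58.0, 93.0],
    [31.0, 60.0, 87.0],
]
print(voting_predict(preds))  # [31.0, 60.0, 90.0]
```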

The framework for the stacking ensemble strategy is illustrated in Supplementary Figure S2. Stacking is an advanced ensemble technique in which multiple base models are trained, and their predictions are then integrated through a meta-model to enhance predictive accuracy and generalizability. In this study, a 5-fold cross-validation strategy was employed to train the meta-model, thereby improving its robustness and mitigating overfitting. More details of this structural approach are available in the literature [34].

Blending ensemble shares similarities with stacking, as both methods use base learners to generate predictions, which are then treated as new features for a meta-model that makes the final prediction. However, blending differs by incorporating a holdout set (validation set) to train the meta-model on these new features. In this study, the holdout set was created by splitting the original training data into an 80:20 ratio. The blending framework is illustrated in Supplementary Figure S3.
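The data flow of blending, with its 80:20 holdout split, can be sketched with toy components. Everything below is a simplified stand-in chosen to keep the example self-contained: the two base learners (a least-squares line and a nearest-neighbour lookup on one feature) and the meta-model (a single blend weight chosen on the holdout set) are hypothetical and are not the models used in the study.

```python
import random

def fit_line(xs, ys):
    """Toy base learner 1: least-squares line y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return lambda x: slope * x + (my - slope * mx)

def fit_nearest(xs, ys):
    """Toy base learner 2: return the target of the nearest training point."""
    pairs = list(zip(xs, ys))
    return lambda x: min(pairs, key=lambda p: abs(p[0] - x))[1]

def fit_blend_weight(p1, p2, ys, steps=101):
    """Toy meta-model: weight w minimising the MSE of w*p1 + (1-w)*p2."""
    def mse(w):
        return sum((w * a + (1 - w) * b - y) ** 2
                   for a, b, y in zip(p1, p2, ys)) / len(ys)
    return min((i / (steps - 1) for i in range(steps)), key=mse)

def fit_blending(xs, ys, holdout_fraction=0.2, seed=0):
    """Fit base learners on 80% of the training data, meta-model on the other 20%."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)           # seed is an arbitrary assumption
    n_hold = round(len(idx) * holdout_fraction)
    hold, fit = idx[:n_hold], idx[n_hold:]
    base = [f([xs[i] for i in fit], [ys[i] for i in fit])
            for f in (fit_line, fit_nearest)]
    # Base-model predictions on the holdout set become the meta-model's features.
    hold_preds = [[m(xs[i]) for i in hold] for m in base]
    w = fit_blend_weight(hold_preds[0], hold_preds[1], [ys[i] for i in hold])
    return lambda x: w * base[0](x) + (1 - w) * base[1](x)

# Synthetic, exactly linear data for demonstration only.
xs = [float(x) for x in range(50)]
ys = [2.0 * x + 1.0 for x in xs]
model = fit_blending(xs, ys)
print(round(model(10.5), 6))
```

The key contrast with stacking is that the meta-model sees base-model predictions only for the holdout rows, rather than out-of-fold predictions for the whole training set.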

2.4. Model Evaluation

In this study, model performance across various machine learning models was evaluated using four key metrics: coefficient of determination ($R^{2}$), mean absolute error ($MAE$), mean squared error ($MSE$), and root mean squared error ($RMSE$). The metrics are defined by the following Equations (1)–(4):

$$R^{2} = 1 - \frac{\sum_{i = 1}^{N} {(\hat{y}_{i} - y_{i})}^{2}}{\sum_{i = 1}^{N} {(\bar{y} - y_{i})}^{2}} \tag{1}$$

$$MAE = \frac{1}{N} \sum_{i = 1}^{N} | \hat{y}_{i} - y_{i} | \tag{2}$$

$$MSE = \frac{1}{N} \sum_{i = 1}^{N} {(\hat{y}_{i} - y_{i})}^{2} \tag{3}$$

$$RMSE = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(\hat{y}_{i} - y_{i})}^{2}} \tag{4}$$

where $\hat{y}_{i}$ denotes the predicted value, $y_{i}$ is the actual (ground truth) value, $\bar{y}$ is the mean of all actual values, and $N$ is the total number of samples.
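Equations (1)–(4) translate directly into code. The following sketch implements them with the standard library; the function name and the sample values are illustrative and not taken from the study.

```python
import math

def regression_metrics(y_true, y_pred):
    """Compute R2, MAE, MSE and RMSE as defined in Equations (1)-(4)."""
    n = len(y_true)
    mean_true = sum(y_true) / n
    ss_res = sum((p - t) ** 2 for t, p in zip(y_true, y_pred))  # residual sum of squares
    ss_tot = sum((mean_true - t) ** 2 for t in y_true)          # total sum of squares
    mse = ss_res / n
    return {
        "R2": 1.0 - ss_res / ss_tot,
        "MAE": sum(abs(p - t) for t, p in zip(y_true, y_pred)) / n,
        "MSE": mse,
        "RMSE": math.sqrt(mse),
    }

# Hypothetical flow rates (L/min) for illustration only.
y_true = [10.0, 20.0, 30.0, 40.0]
y_pred = [12.0, 18.0, 33.0, 39.0]
m = regression_metrics(y_true, y_pred)
print(m)
```

For these sample values the residuals are 2, −2, 3, and −1, giving MAE = 2.0, MSE = 4.5, and R² = 1 − 18/500 = 0.964.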

2.5. Estimation and Evaluation of Inhalation Parameters

Among the evaluated ML models, the best models were selected to plot the airflow rate profiles of inhalation using the collected data. To assess the estimation accuracy on the flow profiles, two key parameters, PIFR and IC, were used. The two flow parameters were obtained from the flow profiles, where the PIFR is the peak flow point of the inhalation flow curve, and the IC is the area under the inhalation flow profile curve that corresponds to the total inhaled air volume. This IC was obtained through trapezoidal numerical integration of flow profiles. The accuracy of the parameter estimation was determined by comparing the estimated PIFR and IC with their corresponding reference values for each inhalation. The calculation of estimation accuracy for PIFR and IC is detailed in Equations (5) and (6):

$$\mathrm{PIFR}_{\mathrm{Accuracy}}\ (\%) = 100 - \frac{| \mathrm{PIFR}_{\mathrm{est}} - \mathrm{PIFR}_{\mathrm{act}} |}{\mathrm{PIFR}_{\mathrm{act}}} \times 100 \tag{5}$$

$$\mathrm{IC}_{\mathrm{Accuracy}}\ (\%) = 100 - \frac{| \mathrm{IC}_{\mathrm{est}} - \mathrm{IC}_{\mathrm{act}} |}{\mathrm{IC}_{\mathrm{act}}} \times 100 \tag{6}$$
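The extraction of PIFR and IC from a flow profile, together with the accuracy measure of Equations (5) and (6), can be sketched as below. The sketch assumes the flow is given in L/min as positive values at a fixed 20 Hz sampling interval (the reference flowmeter in the study reports inhalation with a negative sign, so a sign flip would be needed first); the function names and the triangular test profile are illustrative.

```python
def pifr(flow_l_min):
    """Peak inspiratory flow rate: the maximum of the flow profile (L/min)."""
    return max(flow_l_min)

def inspiratory_capacity(flow_l_min, dt_s):
    """Trapezoidal integral of flow over time, converted from (L/min)*s to litres."""
    area = sum((a + b) / 2.0 * dt_s for a, b in zip(flow_l_min, flow_l_min[1:]))
    return area / 60.0

def accuracy_percent(estimated, actual):
    """Estimation accuracy as defined in Equations (5) and (6)."""
    return 100.0 - abs(estimated - actual) / actual * 100.0

dt = 1.0 / 20.0  # 20 Hz sampling interval in seconds

# Synthetic triangular profile: 0 to 60 L/min over 1 s, then back to 0 over 1 s.
ramp = [60.0 * i / 20 for i in range(21)]
flow = ramp + ramp[-2::-1]

print(pifr(flow))                                # peak of the profile, L/min
print(round(inspiratory_capacity(flow, dt), 6))  # inhaled volume, litres
print(round(accuracy_percent(57.0, 60.0), 6))    # accuracy for a 3 L/min error
```

The triangle has an area of 0.5 × 2 s × 60 L/min = 60 (L/min)·s, which is 1 L after dividing by 60 s/min; an estimate of 57 L/min against an actual 60 L/min gives 95% accuracy.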

Results are reported as mean ± standard deviation (mean ± sd). For model comparisons, the Wilcoxon signed-rank test was employed to assess differences in prediction accuracy, with p < 0.05 considered statistically significant.

3. Results

This section presents the results of a series of machine learning models developed to predict inhalation flow profiles from sensor data. A comprehensive set of base learner algorithms and advanced ensemble algorithms was implemented to assess their effectiveness in capturing the underlying inhalation dynamics. The models were evaluated using multiple performance metrics, and the best-performing models were selected for further predictions.

First, data recorded by the digital monitoring system were collected and are presented in Figure 4. Figure 4a shows scatter plots of the relationships between the reference flowrate and the four types of collected signals (pressure, temperature, humidity, and time). The negative values of the reference flowrate only reflect the flow direction convention of the Sensirion® flowmeter. It is evident that the correlations between the reference flowrate and the collected signals do not follow a simple linear pattern. This nonlinearity may arise from complex fluid mechanics, individual variations in inhalation behaviors, sensor response characteristics, and the influence of ambient environmental conditions. Consequently, AI-based approaches were adopted, as they are better suited to modelling such nonlinear and multifactorial relationships, enabling accurate prediction of real-world inhalation flow rates.

Figure 4. Illustration of collected raw data. (a) scatter plot of collected signals vs. reference flowrate; (b) case sample of collected pressure vs. time and referenced flowrate vs. time profiles; (c) flow chart of AI techniques for real-time prediction of inhalation airflow rate.

Figure 4b presents a case sample of collected pressure and reference airflow rate profiles. The reference airflow rates were used not only for model training but also for evaluating the accuracy of the prediction. Figure 4c shows the overall flow of AI techniques for prediction. With the collected data, pressure, temperature, humidity, and time, as input variables, basic and advanced machine learning algorithms were developed and used for prediction. The timestamp was also incorporated as one of the inputs due to the intrinsic time-series nature of inhalation airflows. The feature is important for identifying the onset and termination of inhalation events.

3.1. Evaluation of Machine Learning Models

3.1.1. Using Base Learners

Following data collection, the machine learning models were developed and evaluated. In this section, eight base machine learning (ML) algorithms were employed to model the relationship between a dependent target variable (flow signal) and a set of independent features (pressure signal, temperature, humidity, and timestamp). Prior to model evaluation and prediction, ML algorithms need to be optimized by fine-tuning the corresponding hyperparameter set, defined as a configurable setting that governs the learning process and how models learn [35]. The optimal hyperparameters, as shown in Supplementary Table S2, were obtained based on the highest R² score that was achieved using grid search.

Model evaluation assesses how well a trained model performs. To ensure the accuracy and generalizability of the models, the evaluation was performed on three distinct subsets (cross-validation, testing, and training sets), and the results are summarized in Table 1. The evaluation primarily focused on performance metrics obtained from the cross-validation and testing datasets. With respect to the evaluation metrics, an R² value approaching 1 suggests a strong model fit, while lower values of the error metrics (MAE, RMSE, MSE) indicate higher predictive accuracy.

Table 1. Machine learning model performance for different models on training, cross-validation, and testing subsets.

In the evaluation of the cross-validation sets, both RF and AdaBoost stood out as top performers and reached the same highest R² of 0.944, indicative of excellent predictive accuracy and consistency across different data subsets. In contrast, GradientBoosting exhibited the lowest R² performance at 0.898, suggesting less effective generalization compared to other models. In terms of error metrics, the RF demonstrated superior generalization capabilities, evidenced by its lowest error values (MAE 2.788, RMSE 6.383, MSE 40.781). Conversely, the GradientBoosting algorithms recorded relatively large errors and exhibited significantly larger deviations, reflecting greater variability in their performance across different folds.

Upon analyzing the evaluation results on the testing sets, all the algorithms achieved R² values above 0.91, with the highest R² of 0.955 for the AdaBoost model, followed by 0.952 for RF and 0.951 for ETR. In terms of MAE, the results varied across models, with the RF model displaying the lowest MAE of 2.448, indicative of minimal average deviations, whereas GradientBoosting exhibited the highest MAE of 3.803, reflecting the largest average absolute errors on testing sets. The RMSE metrics also showed variability among the models, ranging from 5.635 for AdaBoost, which denoted highly accurate predictions with minimal error spread, to 7.629 for the SVM, which suggests a broader spread of errors on the test set. The MSE extended from 31.759 for AdaBoost to 58.207 for SVM, highlighting the squared average deviations of the predictions.

To clearly indicate the overall performance of these models, Figure 5 presents the bar plots and radar plots of the evaluation results. Figure 5a demonstrates the results of error metrics of the models with standard deviations, and one can see that the Random Forest (RF) model exhibited the highest R² values as well as the lowest error metrics values with minimal variation in errors. The radar plot shown in Figure 5b visualizes the performance metrics (RMSE Mean, MSE Mean, MAE Mean, and R²) for each model, normalized using the Min-Max scaler for direct comparability. The results indicated that the RF model, with points furthest from the origin, achieved the highest performance across these metrics. In contrast, the Gradient Boosting model is closer to the origin, demonstrating the lowest performance among the models evaluated. The RF model also stood out among the results of the testing sets, with the lowest MAE, and showed top performances regarding other scores. Therefore, Random Forest was selected as the best model for predicting flowrate and for further analysis regarding inhalation parameters.
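Min-Max scaling, as used for the radar plot, maps each metric onto the range 0 to 1 across models. The sketch below applies it to the cross-validation R² values quoted in the text for three of the eight models (the highest and lowest values among all eight are included, so the scaling range is the same). How the error metrics were oriented on the radar plot is not stated in the source; inverting them so that larger means better would be one possible choice, and is not shown here.

```python
def min_max_scale(values):
    """Scale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # degenerate case: all values identical
    return [(v - lo) / (hi - lo) for v in values]

# Cross-validation R2 values quoted in the text.
r2 = {"RF": 0.944, "AdaBoost": 0.944, "GradientBoosting": 0.898}
scaled = dict(zip(r2, min_max_scale(list(r2.values()))))
print(scaled)
```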

Figure 5. (a) Bar plots of errors of different algorithms for cross-validation results; (b) Radar plot for model selection.

3.1.2. Using Heterogeneous Ensemble Models

In this section, advanced heterogeneous ensemble models were constructed using the four best-performing base learner models. Based on the results presented in Section 3.1.1, Random Forest, AdaBoost, Extra Trees Regressor, and XGBoost demonstrated the highest R² values and were therefore identified as optimal candidates for inclusion as base models in the heterogeneous ensemble frameworks, instead of selecting only the best performer (RF model). The structures of the heterogeneous ensemble frameworks are detailed in Supplementary Table S2.

Supplementary Figure S4 provides additional insights into the prediction performance of these four models. The prediction plots on the testing sets indicate that all four models demonstrated high prediction accuracy in the high flowrate range (≥70 L/min), moderate accuracy in the medium range (30–70 L/min), and noticeably reduced performance in the low flowrate range (≤30 L/min). The reduced performance in the low flowrate range also aligns with trends reported in a previous study [36].
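A range-wise breakdown of prediction error, using the flowrate bands named above, could be computed as in the following sketch. The band boundaries come from the text; the function names and the sample values are hypothetical and do not reproduce any result from the study.

```python
def flow_range(flow_l_min):
    """Assign a flow rate to the low, medium or high band used in the text."""
    magnitude = abs(flow_l_min)  # sign only encodes flow direction
    if magnitude <= 30.0:
        return "low"
    if magnitude >= 70.0:
        return "high"
    return "medium"

def mae_by_range(reference, predicted):
    """Mean absolute error grouped by the band of the reference flow rate."""
    errors = {}
    for ref, pred in zip(reference, predicted):
        errors.setdefault(flow_range(ref), []).append(abs(pred - ref))
    return {name: sum(v) / len(v) for name, v in errors.items()}

# Made-up reference and predicted flow rates (L/min), for illustration only.
reference = [10.0, 25.0, 50.0, 65.0, 80.0, 95.0]
predicted = [14.0, 19.0, 53.0, 62.0, 81.0, 94.0]
print(mae_by_range(reference, predicted))
```

Because the bands are defined on the reference value, a relative error measure may be more informative than the absolute error in the low band, where small absolute deviations are proportionally large.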

Similar to the evaluation of the base learner models in Section 3.1.1, the evaluation of the advanced heterogeneous ensemble algorithms also focused on the cross-validation and testing datasets. Table 2 presents the evaluation metrics on the cross-validation sets for these heterogeneous ensemble models. Collectively, no significant increase or decrease in the error metrics of the heterogeneous models was observed when compared with the top four base models. Voting Ensembles 1 and 2, as well as Stacking Ensembles 1 and 3, were the top performers among the developed ensembles, showing slightly higher R². It is important to note that, due to the unique structure of the blending ensemble method described previously, its performance was evaluated only on the out-of-sample testing set, rather than through cross-validation.

Table 2. Evaluation results of heterogeneous ensemble models on cross-validation sets.

Table 3 shows the performance of the heterogeneous ensemble models, including voting, stacking, and blending ensembles, evaluated on the testing sets. Among the 14 heterogeneous ensemble models, Stacking Ensemble 3 (RF + AdaBoost + XGBoost; Meta model = ETR) demonstrated the highest R² and the lowest error metrics on the testing sets, indicating its outstanding performance. This model also maintained consistently strong performance across the cross-validation sets, substantiating its predictive reliability and generalizability. Of the four voting ensemble models, Voting Ensembles 1 and 2 performed slightly better than the others; their prediction performance was comparable, with Ensemble 1 marginally outperforming Ensemble 2. As for the blending ensemble models, Blending Ensemble 3 showed the best performance based on R² and the error metrics.

Table 3. Evaluation performance of machine learning models on testing sets.

Using these heterogeneous models, scatter plots of the estimated versus reference airflow rates were also generated, as shown in Supplementary Figures S5–S7. The scatter plots are consistent with the prediction accuracy of Voting Ensemble 1, Stacking Ensemble 3, and Blending Ensemble 3, although differences in the degree of scatter among the models are difficult to distinguish visually.

Overall, the comprehensive evaluation of the results for both cross-validation and testing sets indicates that Stacking Ensemble 3 emerged as the highest-performing model among all of the heterogeneous ensemble models. Voting Ensemble 1 and Blending Ensemble 3 demonstrated strong performance within their categories. These superior models will be employed for further prediction of airflow profiles and key parameters.
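As a rough illustration of the simplest of these schemes, a voting ensemble for regression can be sketched as a (weighted) average of the base learners' predictions. This is a generic sketch under that assumption; the member names, weights, and values below are placeholders, and the actual ensemble compositions are those listed in Table S3.

```python
def voting_predict(base_predictions, weights=None):
    """Combine per-model flowrate predictions by (weighted) averaging.

    base_predictions maps a model name to its list of predictions;
    all lists are assumed to have the same length.
    """
    names = list(base_predictions)
    if weights is None:
        weights = {name: 1.0 for name in names}  # unweighted vote
    total = sum(weights[name] for name in names)
    n_samples = len(base_predictions[names[0]])
    return [
        sum(weights[name] * base_predictions[name][i] for name in names) / total
        for i in range(n_samples)
    ]

# Hypothetical outputs of three base learners for three samples (L/min).
base = {
    "rf": [30.0, 60.0, 90.0],
    "adaboost": [33.0, 57.0, 93.0],
    "xgboost": [27.0, 63.0, 87.0],
}
print(voting_predict(base))
```

Stacking and blending differ from this in that a trained meta-model, rather than a fixed average, combines the base predictions, which is why their performance depends more strongly on how the ensemble is structured.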

3.2. Prediction and Evaluation of Inhalation Parameters

In this section, the top-performing base learner (RF) and the top-performing heterogeneous ensembles (Voting Ensemble 1, Stacking Ensemble 3, and Blending Ensemble 3) were each used to predict the inhalation flow profiles.

Table 4 shows the estimation accuracy of PIFR and IC using the top four models. The Random Forest model achieved the highest accuracy for both PIFR (97.7 ± 2.9%) and IC (95.2 ± 9.0%), indicating strong agreement between predicted and actual values. The three ensemble models yielded PIFR accuracies of 95.7–96.7% and IC accuracies of 94.3–95.1%. Statistical analysis revealed significant differences (p < 0.05) between Random Forest and each of the three ensembles for PIFR prediction. For IC prediction, the differences from the voting and blending ensembles were also significant (p < 0.05), whereas no significant difference was found between Random Forest and the stacking ensemble (p = 0.68). These results suggest that the ensemble methods provided no advantage over the best-performing single learner for these parameters, with Random Forest matching or outperforming the ensembles.

Table 4. The estimation accuracy of PIFR and IC across the whole datasets using the best base learner, voting ensemble, stacking ensemble, and blending ensemble models.

Additionally, the standard deviation of the PIFR accuracy ranged from 2.9% to 5.1% across the four models. Considering that device-specific variability and intra-patient variability in PIFR measurements are typically within 10% [37,38,39], the prediction error of the current models is deemed clinically acceptable. The variability for IC was higher, with a standard deviation ranging from 9.0% to 9.5%. Although this value remains below the 10% margin, there is currently no established clinical reference to confirm whether this level of error is acceptable.
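Values of the form "mean ± SD" in Table 4 can be obtained by computing an accuracy for each recording and then summarizing across recordings. The sketch below assumes that accuracy is defined as 100 × (1 − |predicted − reference| / reference) and that the sample standard deviation is used; both are assumptions for illustration, as is the made-up data.

```python
from statistics import mean, stdev

def percent_accuracy(reference, predicted):
    """Assumed definition: 100 * (1 - relative absolute error)."""
    return 100.0 * (1.0 - abs(predicted - reference) / reference)

def summarize_accuracy(references, predictions):
    """Return (mean, sample SD) of the per-recording accuracies in percent."""
    acc = [percent_accuracy(r, p) for r, p in zip(references, predictions)]
    return mean(acc), stdev(acc)

# Made-up PIFR values (L/min) for four recordings, for illustration only.
avg, sd = summarize_accuracy([80.0, 60.0, 50.0, 100.0], [78.0, 63.0, 50.0, 95.0])
print(f"{avg:.1f} ± {sd:.1f} %")
```

Under this definition, a large SD relative to the mean shortfall from 100% signals that a few recordings are predicted much less accurately than the rest.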

To further explore the clinical relevance of model predictions, a simple binary classification analysis was performed using a clinically meaningful threshold of PIFR ≥ 50 L/min for Breezhaler® [40]. This analysis illustrates how small prediction errors can affect the determination of sufficient inhalation effort and supports the potential of the model to inform clinical decision-making (see Table S5). Since PIFR is widely recognized as the most clinically relevant parameter, this analysis was limited to PIFR and not extended to IC.
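A threshold-based classification of this kind can be sketched as follows. The 50 L/min threshold is the one quoted above; the function name, the confusion-matrix layout, and the sample values are hypothetical and are not taken from Table S5.

```python
PIFR_THRESHOLD = 50.0  # L/min, the threshold quoted in the text

def classify_sufficiency(reference_pifr, predicted_pifr, threshold=PIFR_THRESHOLD):
    """Count agreement between reference and predicted 'sufficient effort' labels."""
    counts = {"TP": 0, "TN": 0, "FP": 0, "FN": 0}
    for ref, pred in zip(reference_pifr, predicted_pifr):
        actual = ref >= threshold       # truly sufficient inhalation
        estimated = pred >= threshold   # judged sufficient by the model
        if actual and estimated:
            counts["TP"] += 1
        elif not actual and not estimated:
            counts["TN"] += 1
        elif estimated:
            counts["FP"] += 1           # model overestimates effort
        else:
            counts["FN"] += 1           # model underestimates effort
    return counts

# Made-up PIFR values (L/min) for illustration only.
ref_pifr = [81.3, 49.0, 51.0, 35.0, 50.0]
pred_pifr = [80.0, 50.5, 49.5, 36.0, 50.2]
print(classify_sufficiency(ref_pifr, pred_pifr))
```

In this toy example, the two misclassified recordings both lie within about 1.5 L/min of the threshold, which is the situation in which a small prediction error changes the clinical label.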

Figure 6 shows a case example of an estimated airflow rate profile using these optimal models and corresponding PIFR and IC (predicted), with relative error results shown in Table S4. The case inhalation sample was selected among the recordings at random. Its collected pressure data profile is shown in red, and its simultaneously collected airflow profile (as a reference) is shown in green (Figure 6). Based on the collected pressure data, the selected best models were applied to estimate the inhalation flow profiles, which are shown in different line colors in Figure 6. One can see that the predicted flowrate profiles using these optimal models are close to the reference airflow rate profile collected.

Figure 6. Illustration of collected pressure data profile, referenced flowrate profile, and predicted flowrate profiles using the best base learner, voting ensemble, stacking ensemble, and blending ensemble models.

Additionally, as shown in Figure 6, the reference (ground-truth) PIFR was 81.342 L/min at 1.50 s (marked by a red star), and the reference IC was 2.110 L. The PIFR and IC predicted by the four models were close to these references, with relative errors within 3.0%, confirming the capability of the optimized models to generate estimates of inhalation parameters that are closely aligned with the actual values.
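To make the link between a flow profile and these two parameters concrete, the sketch below derives PIFR as the maximum of the profile and the inhaled volume as its time integral. Treating IC as the trapezoidal integral of flowrate over the inhalation is an assumption made here for illustration, and the sampled profile is synthetic, not the recording shown in Figure 6.

```python
def pifr_and_ic(times_s, flow_l_per_min):
    """Return (PIFR in L/min, time of PIFR in s, inhaled volume in L)."""
    peak_index = max(range(len(flow_l_per_min)), key=flow_l_per_min.__getitem__)
    pifr = flow_l_per_min[peak_index]
    volume = 0.0
    for i in range(1, len(times_s)):
        dt = times_s[i] - times_s[i - 1]
        # Trapezoidal rule; dividing by 60 converts (L/min) * s into L.
        volume += 0.5 * (flow_l_per_min[i] + flow_l_per_min[i - 1]) * dt / 60.0
    return pifr, times_s[peak_index], volume

def relative_error_percent(reference, predicted):
    """Relative error of a predicted parameter, in percent of the reference."""
    return 100.0 * abs(predicted - reference) / reference

# Synthetic triangular profile sampled every 0.5 s (illustration only).
times = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0]
flow = [0.0, 30.0, 60.0, 80.0, 60.0, 30.0, 0.0]
pifr, t_peak, ic = pifr_and_ic(times, flow)
print(pifr, t_peak, round(ic, 3))
```

Because both parameters are computed from the predicted profile, PIFR depends on a single sample near the peak, whereas the integrated volume accumulates errors over the entire inhalation, including the low-flow start and end.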

4. Discussion

This study presents a novel digital monitoring system and provides a feasible approach to predicting inhalation flow profiles from pressure-based signals. By leveraging various basic and advanced machine learning algorithms, the optimal models achieved high prediction accuracy while addressing some inherent challenges of the MEMS sensor.

In this research, the development of an accurate inhalation flow monitoring system requires balancing hardware functions (sensor performance, mechanical design) and software capabilities (algorithms). On the hardware side, the inherent issues of MEMS pressure sensors, such as hysteresis, noise, signal drift, and nonlinearity, pose significant challenges in capturing the dynamics of inhalation. In addition, mechanical design, another aspect of hardware design, also needs to be engineered to minimize interference with the original flow field of the dry powder inhaler (DPI). In this work, the specially designed pressure detection configuration ensures that the inhalation process remains unaltered, allowing for authentic acquisition of pressure signals without affecting inhaler resistance to inspiratory airflow. Despite careful mechanical and sensor design, limitations in hardware performance are inevitable. Therefore, software algorithms play a crucial role in compensating for these hardware imperfections. Signal processing techniques and machine learning algorithms are employed to extract meaningful patterns from noisy, imperfect raw data. By learning complex relationships between pressure signals and flow profiles, these algorithms enable accurate estimation even in the presence of sensor noise and mechanical inconsistencies.

Despite the advantages of algorithms, software-based solutions come with their own challenges, such as computational complexity, real-time processing requirements, and the dependency on high-quality training data to achieve robust performance. Thus, an optimal trade-off involves selecting sensor hardware that provides acceptable baseline accuracy, complemented by advanced software algorithms tailored to mitigate hardware limitations. This hybrid approach, combining thoughtful mechanical design and well-tuned algorithms, ensures the excellent precision required for the inhaler monitoring application. By carefully balancing the strengths and shortcomings of hardware and software, the system can achieve high accuracy without introducing excessive complexity or cost, making it suitable for large-scale deployment.

To compensate for the shortcomings of the MEMS sensor, both basic and advanced machine learning models were tested to improve the prediction accuracy of the inhalation flow profile. Among the base learner algorithms, bagging-based ensemble models, including Random Forest (RF) and Extra Trees Regressor (ETR), were found to outperform most other algorithms. These bagging-based models excel in tasks involving complex and nonlinear relationships due to their ability to reduce variance while maintaining robustness [41]. Conversely, boosting-based models exhibited suboptimal performance, with the exception of AdaBoost. This could be attributed to the limited size of the dataset, as boosting-based algorithms generally require larger datasets to achieve optimal training and minimize overfitting [42].

With regard to the advanced heterogeneous ensemble models, they demonstrated potential for improving prediction performance, which is consistent with previous findings on using advanced ensembles [43]. However, their superiority over base learners was not guaranteed, as their performance heavily depended on the structure and design of the ensembles. In this study, stacking ensembles outperformed other heterogeneous methods and the majority of individual base learners. This improvement may stem from the capability of stacking ensembles to effectively combine multiple base learners through meta-models in this task, which not only captures the strengths of each model but also corrects individual model weaknesses. The results emphasize that selecting and positioning base learners is crucial for constructing advanced predictive ensemble models, aligning with findings from previous research [32,44].

When considering the real-world application, the deployment of models should take into account both computational efficiency and hardware constraints. In this study, the RF model not only achieved excellent predictive accuracy but also offers high deployability for real-time applications due to its low inference latency and resource demands. For scenarios requiring more complex models (e.g., voting, stacking, or blending ensembles), a hybrid strategy can be adopted where data acquisition and preprocessing occur on the resource-constrained embedded devices, while inference can be handled by external platforms such as smartphones or cloud servers. This flexible architecture supports low-latency operation, ensures compatibility with low-power hardware, and facilitates scalable integration in both clinical and home-care environments.

One limitation of the present study is the relatively small and homogeneous dataset derived from a limited cohort of healthy participants. Future validation in larger and more diverse populations is necessary to further assess the proposed system and ML models. Ideally, assessment in clinical settings is expected to support the robustness and real-world applicability of the system.

The authors would also like to point out that accurately predicting flowrates in the low range (e.g., <20 L/min) remains a challenge, as evidenced by the scatter plots. This may arise from the limited sensitivity of the pressure sensor and the simplistic design of the mechanical frames, both of which can introduce noise and inaccuracies in inhalation flowrate detection. Since inhalation parameters such as PIFR are derived from the predicted flow profile, any inaccuracies in flowrate predictions can propagate to these critical measurements. Despite this limitation, the developed AI-powered digital system should be satisfactory for predicting PIFR. As noted, a minimum PIFR of 30 L/min, and ideally 60 L/min, has been suggested as a universal threshold across DPIs for effective drug delivery [3,16]. The top-performing models excelled in the prediction of PIFR values higher than 30 L/min, which is theoretically sufficient to ensure effective aerosolization and pulmonary drug delivery.

This study highlights the promising application of AI in estimating flow profiles during DPI usage. Despite the promise, advancements are still necessary to further enhance model performance and applicability. A critical next step is to expand the dataset size, as larger and more diverse datasets are essential to improve model accuracy, generalizability, and robustness. The current dataset is limited by the small number of participants with restricted diversity of inhalation signals. Future efforts will focus on collecting data from a broader participant pool. Additionally, applying multi-modal sensor fusion or improving digital module design could further enhance prediction accuracy and reliability.

Another direction to explore is advanced deep learning models, such as recurrent neural networks (RNNs) or transformer-based architectures. These models may offer improved performance in capturing complex temporal patterns of inhalation signals. However, there remains a trade-off between model complexity and accuracy, as highly complex models may demand greater computational resources and risk overfitting when working with limited datasets. Balancing these factors will be key to developing robust, efficient models that can be deployed in both clinical and real-world environments.

5. Conclusions

This study successfully developed a novel digital monitoring system capable of recording pressure, temperature, and humidity signals during inhalation with minimal interference to the functionality of the inhaler. By using advanced machine learning techniques, the digital monitoring system could achieve high-accuracy estimation (>95%) of inhalation flow parameters by integrating and analyzing the captured multidimensional signals.

This work also compares the performance of individual base learner models and advanced heterogeneous ensemble models. Although it cannot be definitively stated that any of these models significantly outperformed the others, the results highlight the feasibility and strong predictive capabilities of the approaches employed. Future research will focus on utilizing larger datasets and exploring more advanced AI models to further enhance prediction accuracy and robustness.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/s25144402/s1, Figure S1: Illustration of the voting ensemble framework, Figure S2: Illustration of stacking ensemble framework, Figure S3: Illustration of blending ensemble framework, Figure S4: (a) Scatter plots of “predicted flowrate values” versus “referenced flowrate values” for top four base learner models on both the training and testing subsets; (b) Representative inhalation flow profiles for the top four base learner models in low, medium, and high flowrate ranges, Figure S5: Scatter plots of voting ensemble models ‘predicted Flowrate values’ vs. ‘reference flowrate values’ in the training and testing subsets, Figure S6: Scatter plots of stacking ensemble models ‘predicted Flowrate values’ vs. ‘reference flowrate values’ in the training and testing subsets, Figure S7: Scatter plots of blending ensemble models ‘predicted Flowrate values’ vs. ‘reference flowrate values’ in the training and testing subsets, Table S1: The training time for each algorithm, Table S2: Values of fine-tuned optimal hyperparameters for the base learner models, Table S3: The structures of the heterogeneous ensemble frameworks, Table S4: Case examples of estimated PIFR and IC using the optimal base learner, voting ensemble, stacking ensemble, and blending ensemble models, Table S5: Performance of the models in classifying sufficient inhalation effort based on a clinical threshold of PIFR ≥ 50 L/min.

Author Contributions

Conceptualization, Z.F. and Y.Y.; Data curation, Z.F.; Formal analysis, Z.F.; Funding acquisition, J.Z.; Investigation, Z.F. and J.C.; Project administration, Y.Y., Y.M. and J.Z.; Resources, Y.M.; Software, Z.F. and J.C.; Supervision, Y.M. and J.Z.; Validation, Z.F., Y.Y. and J.C.; Writing—original draft, Z.F.; Writing—review & editing, Y.Y., J.C., Y.M. and J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Natural Sciences and Engineering Research Council of Canada (NSERC).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study. Written informed consent has been obtained from the participants to publish this paper. All experimental procedures were approved by the Human Research Ethics Board at Western University, London, ON, Canada.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The authors would like to thank Suzhou Inhal Pharma Co., Ltd., for supporting materials. During the preparation of this manuscript, the author(s) used ChatGPT 4o for the purposes of grammar check and language improvement. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

Ziyi Fan has a patent pending with Suzhou Inhal Pharma Co., Ltd. The author, Ying Ma, is employed by Suzhou Inhal Pharma Co., Ltd., but claims no conflicts of interest related to this work. The other authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Abbreviations

The following abbreviations are used in this manuscript:

DPIs	Dry powder inhalers
AI	Artificial intelligence
ML	Machine learning
PIFR	Peak Inspiratory Flow Rate
IC	Inspiratory capacity
COPD	Chronic obstructive pulmonary disease
INCA	Inhaler Compliance Assessment
MEMS	Micro-Electro-Mechanical Systems
PCB	Printed circuit board
RF	Random Forest
SVM	Support Vector Machine
GPR	Gaussian Process Regressor
ETR	Extra Trees Regressor
MAE	Mean absolute error
MSE	Mean squared error
RMSE	Root mean squared error

References

1. Carpenter, D.M.; Roberts, C.A.; Sage, A.J.; George, J.; Horne, R. A Review of Electronic Devices to Assess Inhaler Technique. Curr. Allergy Asthma Rep. 2017, 17, 17.
2. Ye, Y.; Ma, Y.; Zhu, J. The Future of Dry Powder Inhaled Therapy: Promising or Discouraging for Systemic Disorders? Int. J. Pharm. 2022, 614, 121457.
3. Holmes, M.S.; Seheult, J.N.; Geraghty, C.; D’Arcy, S.; O’Brien, U.; Crispino O’Connell, G.; Costello, R.W.; Reilly, R.B. A Method of Estimating Inspiratory Flow Rate and Volume from an Inhaler Using Acoustic Measurements. Physiol. Meas. 2013, 34, 903–914.
4. D’Arcy, S.; MacHale, E.; Seheult, J.; Holmes, M.S.; Hughes, C.; Sulaiman, I.; Hyland, D.; O’Reilly, C.; Glynn, S.; Al-Zaabi, T.; et al. A Method to Assess Adherence in Inhaler Use through Analysis of Acoustic Recordings of Inhaler Events. PLoS ONE 2014, 9, e98701.
5. Van Boven, J.F.M.; Lavorini, F.; Dekhuijzen, P.N.R.; Blasi, F.; Price, D.B.; Viegi, G. Urging Europe to Put Non-Adherence to Inhaled Respiratory Medication Higher on the Policy Agenda: A Report from the First European Congress on Adherence to Therapy. Eur. Respir. J. 2017, 49, 1700076.
6. Chrystyn, H.; Audibert, R.; Keller, M.; Quaglia, B.; Vecellio, L.; Roche, N. Real-Life Inhaler Adherence and Technique: Time to Get Smarter! Respir. Med. 2019, 158, 24–32.
7. Azzahra, N.F.; Nugraha, P.C.; Hamzah, T.; Lawal, K.O.; Larasati, A.D. Implementation of a Microcontroller Arduino for Portable Peak Expiratory Flow Rate to Examine the Lung Health. Int. J. Adv. Health Sci. Technol. 2022, 2, 54–59.
8. Taylor, T.E.; Holmes, M.S.; Sulaiman, I.; Costello, R.W.; Reilly, R.B. Monitoring Inhaler Inhalations Using an Acoustic Sensor Proximal to Inhaler Devices. J. Aerosol Med. Pulm. Drug Deliv. 2016, 29, 439–446.
9. Chetta, A.; Yorgancioglu, A.; Scuri, M.; Barile, S.; Guastalla, D.; Dekhuijzen, P.N.R. Inspiratory Flow Profile and Usability of the NEXThaler, a Multidose Dry Powder Inhaler, in Asthma and COPD. BMC Pulm. Med. 2021, 21, 65.
10. Taylor, T.E.; Lacalle Muls, H.; Costello, R.W.; Reilly, R.B. Estimation of Inhalation Flow Profile Using Audio-Based Methods to Assess Inhaler Medication Adherence. PLoS ONE 2018, 13, e0191330.
11. Weers, J.; Clark, A. The Impact of Inspiratory Flow Rate on Drug Delivery to the Lungs with Dry Powder Inhalers. Pharm. Res. 2017, 34, 507–528.
12. Van Boven, J.F.M.; Chavannes, N.H.; Van Der Molen, T.; Rutten-van Mölken, M.P.M.H.; Postma, M.J.; Vegter, S. Clinical and Economic Impact of Non-Adherence in COPD: A Systematic Review. Respir. Med. 2014, 108, 103–113.
13. Melani, A.S.; Bonavia, M.; Cilenti, V.; Cinti, C.; Lodi, M.; Martucci, P.; Serra, M.; Scichilone, N.; Sestini, P.; Aliani, M.; et al. Inhaler Mishandling Remains Common in Real Life and Is Associated with Reduced Disease Control. Respir. Med. 2011, 105, 930–938.
14. Ghosh, S.; Ohar, J.A.; Drummond, M.B. Peak Inspiratory Flow Rate in Chronic Obstructive Pulmonary Disease: Implications for Dry Powder Inhalers. J. Aerosol Med. Pulm. Drug Deliv. 2017, 30, 381–387.
15. Ohar, J.A.; Ferguson, G.T.; Mahler, D.A.; Drummond, M.B.; Dhand, R.; Pleasants, R.A.; Anzueto, A.; Halpin, D.M.; Price, D.B.; Drescher, G.S.; et al. Measuring Peak Inspiratory Flow in Patients with Chronic Obstructive Pulmonary Disease. Int. J. Chron. Obstruct. Pulmon. Dis. 2022, 17, 79–92.
16. Mahler, D.A. Peak Inspiratory Flow Rate as a Criterion for Dry Powder Inhaler Use in Chronic Obstructive Pulmonary Disease. Ann. Am. Thorac. Soc. 2017, 14, 1103–1107.
17. O’Donnell, D.E. COPD Exacerbations · 3: Pathophysiology. Thorax 2006, 61, 354–361.
18. Taylor, T.E.; De Looze, C.; MacHale, P.; Holmes, M.S.; Sulaiman, I.; Costello, R.W.; Reilly, R.B. Characterization of Patient Inhaler Inhalation Sounds Using Non-Contact and Tracheal Microphones. Biomed. Phys. Eng. Express 2016, 2, 055021.
19. Chang, M.-W.; Hsiao, F.-H.; Lin, Y.-J.; Chiu, C.-W.; Chen, W.-C.; Chen, H.-Y.; Wu, W.-J.; Bai, M.R.; Liao, Y.-T. Development of a Sound-Based Smart Inhaler With a Mechanical Acoustic Filter for Medication Identification. IEEE Sens. J. 2024, 24, 35456–35464.
20. Nikos Fakotakis, D.; Nousias, S.; Arvanitis, G.; Zacharaki, E.I.; Moustakas, K. AI Sound Recognition on Asthma Medication Adherence: Evaluation With the RDA Benchmark Suite. IEEE Access 2023, 11, 13810–13829.
21. Holmes, M.S.; D’arcy, S.; Costello, R.W.; Reilly, R.B. Acoustic Analysis of Inhaler Sounds From Community-Dwelling Asthmatic Patients for Automatic Assessment of Adherence. IEEE J. Transl. Eng. Health Med. 2014, 2, 1–10.
22. Pan, Y.; Zhang, L. Roles of Artificial Intelligence in Construction Engineering and Management: A Critical Review and Future Trends. Autom. Constr. 2021, 122, 103517.
23. Jeddi, Z.; Ghogho, M.; Bohr, A.; Botker, J.P.; Kassou, I. Estimation of Inhalation Flow Parameters for Asthma Monitoring Using Acoustic Signal Processing and Machine Learning. In Ambient Intelligence and Smart Environments; IOS Press: Amsterdam, The Netherlands, 2019.
24. Chamaon, D.; Sportel, E.; Elferink, E.; Van Der Palen, J. Validation of an AI-Powered Smart Dry Powder Inhaler (RS01X) for Asthma and COPD in a Clinical Setting. Int. J. Chron. Obstruct. Pulmon. Dis. 2025, 20, 811–819.
25. Sha, Y.; Faber, J.; Gou, S.; Liu, B.; Li, W.; Schramm, S.; Stoecker, H.; Steckenreiter, T.; Vnucec, D.; Wetzstein, N.; et al. An Acoustic Signal Cavitation Detection Framework Based on XGBoost with Adaptive Selection Feature Engineering. Measurement 2022, 192, 110897.
26. Alam, M.Z.; Simonetti, A.; Brillantino, R.; Tayler, N.; Grainge, C.; Siribaddana, P.; Nouraei, S.A.R.; Batchelor, J.; Rahman, M.S.; Mancuzo, E.V.; et al. Predicting Pulmonary Function From the Analysis of Voice: A Machine Learning Approach. Front. Digit. Health 2022, 4, 750226.
27. Qiao, Y.; Arabi, M.; Xu, W.; Zhang, H.; Abdel-Rahman, E.M. The Impact of Thermal-Noise on Bifurcation MEMS Sensors. Mech. Syst. Signal Process. 2021, 161, 107941.
28. Rahaman, A.; Park, C.H.; Kim, B. Design and Characterization of a MEMS Piezoelectric Acoustic Sensor with the Enhanced Signal-to-Noise Ratio. Sens. Actuators Phys. 2020, 311, 112087.
29. Zhang, Y.; Liu, J.; Shen, W. A Review of Ensemble Learning Algorithms Used in Remote Sensing Applications. Appl. Sci. 2022, 12, 8654.
30. Saleh, H.; Mostafa, S.; Alharbi, A.; El-Sappagh, S.; Alkhalifah, T. Heterogeneous Ensemble Deep Learning Model for Enhanced Arabic Sentiment Analysis. Sensors 2022, 22, 3707.
31. Bannigan, P.; Bao, Z.; Hickman, R.J.; Aldeghi, M.; Häse, F.; Aspuru-Guzik, A.; Allen, C. Machine Learning Models to Accelerate the Design of Polymeric Long-Acting Injectables. Nat. Commun. 2023, 14, 35.
32. Wu, T.; Zhang, W.; Jiao, X.; Guo, W.; Alhaj Hamoud, Y. Evaluation of Stacking and Blending Ensemble Learning Methods for Estimating Daily Reference Evapotranspiration. Comput. Electron. Agric. 2021, 184, 106039.
33. Eleftheriadou, A.-C.; Vafeiadis, A.; Lalas, A.; Votis, K.; Tzovaras, D. An Audio-Based Method for Assessing Proper Usage of Dry Powder Inhalers. Appl. Sci. 2020, 10, 6677.
34. Hajihosseinlou, M.; Maghsoudi, A.; Ghezelbash, R. Stacking: A Novel Data-Driven Ensemble Machine Learning Strategy for Prediction and Mapping of Pb-Zn Prospectivity in Varcheh District, West Iran. Expert Syst. Appl. 2024, 237, 121668.
35. Ali, Y.; Awwad, E.; Al-Razgan, M.; Maarouf, A. Hyperparameter Search for Machine Learning Algorithms for Optimizing the Computational Complexity. Processes 2023, 11, 349.
36. Chrystyn, H.; Saralaya, D.; Shenoy, A.; Toor, S.; Kastango, K.; Calderon, E.; Li, T.; Safioti, G. Investigating the Accuracy of the Digihaler, a New Electronic Multidose Dry-Powder Inhaler, in Measuring Inhalation Parameters. J. Aerosol Med. Pulm. Drug Deliv. 2022, 35, 166–177.
37. Hanon, S.; Vanderhelst, E.; Buls, N.; Verbanck, S. Getting the Most out of Spirometry: A Tool to Guide Dry Powder Inhaler Use. Respiration 2022, 101, 893–900.
38. Broeders, M.E.A.C.; Molema, J.; Hop, W.C.J.; Folgering, H.T.M. Inhalation Profiles in Asthmatics and COPD Patients: Reproducibility and Effect of Instruction. J. Aerosol Med. 2003, 16, 131–141.
39. Anderson, M.; Collison, K.; Drummond, M.B.; Hamilton, M.; Jain, R.; Martin, N.; Mularski, R.A.; Thomas, M.; Zhu, C.-Q.; Ferguson, G.T. Peak Inspiratory Flow Rate in COPD: An Analysis of Clinical Trial and Real-World Data. Int. J. Chron. Obstruct. Pulmon. Dis. 2021, 16, 933–943.
40. Hassan, M.I.; Laz, N.I.; Madney, Y.M.; Harb, H.S.; Abdelrahim, M.E.A. Enhancing Chronic Obstructive Pulmonary Disease Management through Optimized Peak Inspiratory Flow Rate and Inhaler Strategies: Literature Review. Bull. Pharm. Sci. Assiut Univ. 2024, 48, 387–399.
41. Kotsiantis, S.B. Bagging and Boosting Variants for Handling Classifications Problems: A Survey. Knowl. Eng. Rev. 2014, 29, 78–100.
42. Malashin, I.; Tynchenko, V.; Gantimurov, A.; Nelyub, V.; Borodulin, A. Boosting-Based Machine Learning Applications in Polymer Science: A Review. Polymers 2025, 17, 499.
43. Hasan, M.; Abedin, M.Z.; Hajek, P.; Sultan, N.; Lucey, B.M. A Blending Ensemble Learning Model for Crude Oil Price Prediction. SSRN Electron. J. 2022.
44. Nti, I.K.; Adekoya, A.F.; Weyori, B.A. A Comprehensive Evaluation of Ensemble Learning for Stock-Market Prediction. J. Big Data 2020, 7, 20.

Figure 1. Trimetric view (left) and exploded view (right) of the custom-designed DPI digital monitoring system.

Figure 2. The cross-sectional view of the customized digital module with the dry powder inhaler.

Figure 3. Experimental setup for the collection of inhalation data.

Figure 4. Illustration of collected raw data. (a) scatter plot of collected signals vs. reference flowrate; (b) case sample of collected pressure vs. time and referenced flowrate vs. time profiles; (c) flow chart of AI techniques for real-time prediction of inhalation airflow rate.

Figure 5. (a) Bar plots of errors of different algorithms for cross-validation results; (b) Radar plot for model selection.

Figure 6. Illustration of collected pressure data profile, referenced flowrate profile, and predicted flowrate profiles using the best base learner, voting ensemble, stacking ensemble, and blending ensemble models.

Table 1. Machine learning model performance for different models on training, cross-validation, and testing subsets.

	Training				Cross-Validation				Testing
ML Algorithms	R²	RMSE	MSE	MAE	R²	RMSE	MSE	MAE	R²	RMSE	MSE	MAE
Decision Tree	0.926	7.345	53.945	3.459	0.911	8.053	64.911	3.720	0.919	7.541	56.871	3.383
Random Forest (RF)	0.983	3.547	12.583	1.507	0.944	6.383	40.781	2.788	0.952	5.833	34.026	2.448
Extra Trees Regressor (ETR)	0.975	4.237	17.949	1.886	0.943	6.428	41.359	2.889	0.951	5.846	34.174	2.556
Support Vector Machine (SVM)	0.914	7.914	62.639	3.049	0.910	8.089	65.502	3.201	0.917	7.629	58.207	2.888
Gaussian Process Regressor (GPR)	0.986	3.175	10.083	1.253	0.908	8.221	67.630	3.415	0.929	7.081	50.142	2.893
AdaBoost	0.979	3.941	15.529	2.403	0.944	6.386	40.890	3.238	0.955	5.635	31.759	2.880
XGBoost	0.973	4.430	19.625	2.587	0.939	6.651	44.335	3.730	0.948	6.029	36.343	3.404
GradientBoosting	0.917	7.811	61.012	4.092	0.898	8.273	72.401	4.456	0.924	7.297	53.242	3.803

Note: the best performance is marked in bold.

Table 2. Evaluation results of heterogeneous ensemble models on cross-validation sets.

	R²		RMSE		MSE		MAE
ML Algorithms	Mean	SD	Mean	SD	Mean	SD	Mean	SD
Random Forest (RF)	0.944	0.003	6.383	0.210	40.781	2.661	2.788	0.110
AdaBoost	0.944	0.004	6.386	0.322	40.890	4.212	3.238	0.117
Extra Trees Regressor	0.943	0.003	6.428	0.192	41.359	2.460	2.889	0.113
XGBoost	0.939	0.005	6.651	0.320	44.335	4.200	3.730	0.130
Voting Ensemble_1	0.947	0.004	6.194	0.252	38.433	3.082	3.100	0.130
Voting Ensemble_2	0.947	0.003	6.233	0.230	38.899	2.835	3.093	0.124
Voting Ensemble_3	0.945	0.003	6.352	0.213	40.385	2.680	2.813	0.111
Voting Ensemble_4	0.946	0.003	6.271	0.237	39.385	2.959	3.233	0.123
Stacking Ensemble_1	0.948	0.003	6.184	0.233	38.295	2.864	2.806	0.118
Stacking Ensemble_2	0.933	0.002	7.017	0.170	49.268	2.395	3.605	0.151
Stacking Ensemble_3	0.948	0.004	6.174	0.264	38.193	3.246	2.797	0.130
Stacking Ensemble_4	0.941	0.003	6.588	0.253	43.467	3.350	2.950	0.108
Stacking Ensemble_5	0.930	0.002	7.150	0.154	51.144	2.196	3.334	0.114
Stacking Ensemble_6	0.938	0.004	6.699	0.236	44.939	3.144	3.095	0.138

Note: the best performance is marked in bold.

Table 3. Evaluation performance of machine learning models on testing sets.

ML Algorithms	R²	RMSE	MSE	MAE
Decision Tree	0.919	7.541	56.871	3.383
Random Forest (RF)	0.952	5.833	34.026	2.448
Extra Trees Regressor (ETR)	0.951	5.846	34.174	2.556
Support Vector Machine (SVM)	0.917	7.629	58.207	2.888
Gaussian Process Regressor (GPR)	0.929	7.081	50.142	2.893
AdaBoost	0.955	5.635	31.759	2.880
XGBoost	0.948	6.029	36.343	3.404
GradientBoosting	0.924	7.297	53.242	3.803
Voting Ensemble_1	0.956	5.570	31.029	2.750
Voting Ensemble_2	0.956	5.590	31.252	2.746
Voting Ensemble_3	0.952	5.784	33.456	2.471
Voting Ensemble_4	0.955	5.603	31.392	2.887
Stacking Ensemble_1	0.956	5.565	30.972	2.449
Stacking Ensemble_2	0.942	6.406	41.033	3.450
Stacking Ensemble_3	0.957	5.456	29.769	2.403
Stacking Ensemble_4	0.953	5.771	33.309	2.500
Stacking Ensemble_5	0.937	6.679	44.615	3.013
Stacking Ensemble_6	0.944	6.249	39.051	2.952
Blending Ensemble_1	0.952	5.828	33.967	2.593
Blending Ensemble_2	0.936	6.722	45.181	3.131
Blending Ensemble_3	0.953	5.754	33.103	2.553
Blending Ensemble_4	0.938	6.590	43.431	2.920

Table 4. The estimation accuracy of PIFR and IC across the whole datasets using the best base learner, voting ensemble, stacking ensemble, and blending ensemble models.

ML Models	PIFR (%)	IC (%)
Best Single Base Learner (Random Forest)	97.7 ± 2.9	95.2 ± 9.0
Best Voting Ensemble (Voting Ensemble 1)	96.5 ± 4.5 *	94.3 ± 9.5 *
Best Stacking Ensemble (Stacking Ensemble 3)	96.7 ± 3.9 *	95.1 ± 9.2
Best Blending Ensemble (Blending Ensemble 3)	95.7 ± 5.1 *	94.3 ± 9.5 *

* indicates a statistically significant difference (p < 0.05) compared with the Random Forest model.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
