A Novel Machine Learning Prediction Model for Aerosol Transport in Upper 17-Generations of the Human Respiratory Tract

: The main challenge of the health risk assessment of the aerosol transport and deposition to the lower airways is the high computational cost. A standard large-scale airway model needs a week to a month of computational time in a high-performance computing system. Therefore, developing an innovative tool that accurately predicts transport behaviour and reduces computational time is essential. This study aims to develop a novel and innovative machine learning (ML) model to predict particle deposition to the lower airways. The ﬁrst-ever study uses ML techniques to explore the pulmonary aerosol TD in a digital 17-generation airway model. The ML model uses the computational data for a 17-generation airway model and four standard ML regression models are used to save the computational cost. Random forest (RF), k -nearest neighbour ( k -NN), multi-layer perceptron (MLP) and Gaussian process regression (GPR) techniques are used to develop the ML models. The MLP regression model displays more accurate estimates than other ML models. Finally, a prediction model is developed, and the results are signiﬁcantly closer to the measured values. The prediction model predicts the deposition efﬁciency (DE) for different particle sizes and ﬂow rates. A comprehensive lobe-speciﬁc DE is also predicted for various ﬂow rates. This ﬁrst-ever aerosol transport prediction model can accurately predict the DE in different regions of the airways in a couple of minutes. This innovative approach and accurate prediction will improve the literature and knowledge of the ﬁeld.


Introduction
Particulate matter emission into the atmospheric air is a global health challenge, and emissions are increasing every day [1]. The increasing rate of respiratory health patients is evidence of the health hazards of atmospheric pollution [1]. Precise knowledge of atmospheric aerosol transport and deposition to the airways of the human lung is essential for health assessment and drug delivery purposes [2]. To date, researchers have employed wide-ranging approaches (in silico, in vivo and in vitro) to understand aerosol transport to the airways. The computational fluid dynamics (CFD) approach is the most popular method for airflow and particle transport in airways [3,4] and most of the studies to date used CFD for aerosol transport in airways. Almost all of the published literature considers the upper airways and analyses aerosol transport in airways [5][6][7]. Cheng et al. [8] performed an in vivo study for ultrafine aerosol transport and deposition in human oral and nasal airways. The study employed 10 adult healthy males cast for the analysis. The in vivo study reports the smaller cross-section area of the extrathoracic airways and that the complex airway shape influences the overall ultrafine particle deposition pattern. Later, Zhang et al. [6] studied ultrafine and microparticles transport in an idealised mouththroat and upper airways. The symmetric three-generation (G0-G3) model investigates the aerosol deposition pattern for different flow rates. The study reports a higher deposition concentration of the aerosol at the carinal angle region of the airways than in the bifurcating airways. In 2008, Zhang et al. [9] considered a 16-generation airway model and analysed the nanoparticle's transport and deposition. The study did not consider the asymmetric whole lung model for the 16 generations, and several 3-generation models were used to analyse the particle deposition.
To date, the best available and large-scale asymmetric airway model with maximum bifurcating branches has been developed by Schmidt et al. [10]. The authors developed a large-scale and highly complex airway model for the first 17 generations of human airways. Gemci et al. [5] used the model of Schmidt et al. [10] and analysed the airflow for the largescale model for the first time. The digital reference model had 1453 bronchi up to the 17th Horsfield order. This study analysed the airflow for the first time for a large-scale model and improved the knowledge of the field. Later, in 2017, Islam et al. [7] used the same 17-generation airway model and comprehensively analysed the microparticle transport and deposition in a 17-generation model. This first-ever study considered different physical activity conditions (resting, light activity and heavy physical activity) and analysed the deposition pattern for the first time. However, the CFD simulation of a 3-dimensional anatomically realistic 17-generation lung model based on high-resolution computer tomography (HRCT) data requires huge time and computational resources. The 17-generation model used by Islam et al. [7] took approximately 55 days for a single simulation in a high-performance computing system. This study did not consider all branches of the 17generation model and would significantly increase the computational cost. Islam et al. [11] also studied ultrafine particle transport and deposition to the lower airways of the 17generation model, and the first-ever study comprehensively analysed the lobe-specific deposition pattern. The study calculated the deposition hot spot for the ultrafine particles, and the computational cost for ultrafine particle transport for the 17-generation model was significantly high. The computational cost for the CFD model is extremely high due to the huge number of computational cells, and it is nearly impossible to simulate a whole lung model (23 generations) with all bifurcating branches. Therefore, coalescing machine learning algorithms (MLAs) with CFD simulation has attracted wide attention because it can accelerate the prediction and save time and computational cost [12].
Recently, many researchers have used MLAs in combination with CFD to save time and cost required in the CFD simulation. Ref. [13] employed the random forest (RF) technique to predict the viscosity of nanofluids. Kwon et al. [14] discovered that the RF model agreed with the CFD results in estimating convective heat transfer in a channel. Jamei et al. [15] used different MLAs to estimate the specific heat capacity of nanofluids. Although researchers have used MLAs for different engineering applications using CFD data, no ML model has been developed for aerosol transport and deposition to the lower airways.
Therefore, the objective of this study is to develop an innovative ML prediction model for pharmaceutical aerosol transport to the lower airways of a 17-generation model. The main goal of this study is to develop ML regression models based on the CFD dataset generated in our previous work. Next, the different ML models are compared using statistical parameters, and the best ML model is selected. For this purpose, four different MLAs are adopted to significantly decrease the computational cost and find the impact of each parameter on estimated results. Then using the best MLA model, particle transport and deposition (TD) in the transitional and respiratory zones (deeper airways) of the human lung for different particle diameter and flow rates are predicted and analysed.

Problem Definition and Numerical Methods
The present study extends our previous work in which we considered a highlyasymmetric 17-generation airway model ( Figure 1) for aerosol transport prediction. The airway model is constructed from the airway dimensions of Schmidt et al. [10] and the final model consists of 1453 bronchioles. The large-scale 17-generation model is one of the best available airway models in the literature. The airflow and particle transport in a whole lung model is highly computationally expensive as it needs to solve millions of computational cells. The 17-generation model consists of 7 million cells, and the multiphase flow for a single case takes approximately 55 days in high-performance computing systems.
To get a precise understanding of the various shape and size-specific parameters of aerosol transport to the lower airways of the large-scale model, it needs to run a large number of simulations, and it needs a significant amount of time. Our previous study analysed particle transport numerically and developed a database for aerosol transport for various flow rates and diameter particles [11]. In the present work, the database is used to develop an ML model which can accurately predict aerosol transport to the lower airways for various aerosol sizes and flow rates.   The numerical study solved mass and momentum equations for the primary and dispersed phase [7]. A proper grid independence and validation with the benchmark experimental, numerical and theoretical data was conducted by a numerical model. The detailed numerical methods, grid refinement and validation of the numerical data were available in the author's previous studies [7,11]. That study developed the database for the deposition efficiency at the right lung and the left lung of the airway model. Table 1 shows the deposition data at the right lung and the left lung for different flow rates. Table 2 shows the deposition data at the right (Upper, Middle and Lower) and left (Upper and Left) lungs for different flow rates.

Methodology
The procedure adopted in the present work is presented in the flowchart, as shown in Figure 2. Firstly, we collected the CFD data from our previous work for the ML step. The goal of using MLAs with CFD simulations was to decrease the computational time and, thus, the computational cost. The four MLAs used in the present analysis are knearest neighbours (k-NN) [16], Gaussian process regression (GPR) [17], random forest (RF) [18] and multi-layer perceptron (MLP). The best MLA among all was then used for optimisation. MLAs application requires hyper-parameters tuning to obtain the optimum result. The GridsearchCV tool was used for obtaining the optimised values of the important parameters. It is available in the Scikit-learn library [19].

Machine Learning Regression Models
The CFD problems involving multi-variables can be easily solved by using MLAs. Further, the number of variables can be easily increased in MLAs (Jalalifar et al., 2020) as they are proficient in handling many input and output quantities. The present study has three independent variables (particle size, flow rate and lung position) and one dependent variable (deposition efficiency). The functional form of the model is presented below: Two categories of models are developed in the present study. In category-I models, we selected left and right lungs only, while in Category-II, we selected left lower, left upper, right lower, right middle and right upper lungs to achieve a more detailed analysis.
The four ML techniques selected in this work have been extensively used for regression problems [20,21] k-nearest neighbour (k-NN) [22]: k-NN MLA is robust, computationally inexpensive and easy to implement. The outcome of a test point is computed based on the interpolated value of the test point's adjacent neighbours. The optimum number of neighbours, n, is indicated by the user and it is defined by the n training points with minimum Euclidean distance in the input feature space. k-NN MLA can estimate an outcome by performing data points weighted average with analogous input features.
Random forest (RF) [23]: RF is an ensemble model that increases estimating ability by combining models. The method it works on is called bagging. In this method several trees are created from input data and then data are reinstated and extracted from all data. The final result is the average of the results of all trees. The RF method is widely used because of its prediction capability.

Machine Learning Regression Models
The CFD problems involving multi-variables can be easily solved by using MLAs. Further, the number of variables can be easily increased in MLAs (Jalalifar et al., 2020) as they are proficient in handling many input and output quantities. The present study has three independent variables (particle size, flow rate and lung position) and one dependent variable (deposition efficiency). The functional form of the model is presented below: Two categories of models are developed in the present study. In category-I models, we selected left and right lungs only, while in Category-II, we selected left lower, left upper, right lower, right middle and right upper lungs to achieve a more detailed analysis.
The four ML techniques selected in this work have been extensively used for regression problems [20,21].
k-nearest neighbour (k-NN) [22]: k-NN MLA is robust, computationally inexpensive and easy to implement. The outcome of a test point is computed based on the interpolated value of the test point's adjacent neighbours. The optimum number of neighbours, n, is indicated by the user and it is defined by the n training points with minimum Euclidean distance in the input feature space. k-NN MLA can estimate an outcome by performing data points weighted average with analogous input features.
Random forest (RF) [23]: RF is an ensemble model that increases estimating ability by combining models. The method it works on is called bagging. In this method several trees are created from input data and then data are reinstated and extracted from all data. The final result is the average of the results of all trees. The RF method is widely used because of its prediction capability. Gaussian process regression (GPR) [24]: Gaussian processes can represent random variables in ML. It may be an unidentified error in a linear regression model or anything else. The data can be represented similar to a normal distribution curve in a GPR distribution. Gaussian distribution, similar to a normal distribution, has two key parameters, i.e., mean (µ) and variance (σ 2 ) ("Gaussian Distribution for Machine Learning and Data Science (Normal Distribution) | by Hemanth Nhs | Medium"). The following equation gives the Gaussian distribution for a dataset: where µ is the average value and σ is the skewness of any dataset. Multi-layer perceptron (MLP) [25]: MLP can offer improved output by learning the non-linear data pattern through network modelling. MLP has a minimum three layers: an input layer, a hidden layer and an output layer. Each node contains a neuron along with non-linear activation function, excluding the input nodes. MLP uses a Levenberg-Marquardt algorithm (back propagation) for training. MLP comprises of hyperbolic tangent sigmoid transfer function for hidden layers and a linear function at outer layer.

•
Performance evaluation In the current study, 80% of the data was selected for training, while 20% of the data was used to validate the ML models as recommended by Nguyen et al. [26]. The training and testing data were randomly selected to confirm that the datasets were typical samples of the original dataset.
The MLAs performance was evaluated with the help of some of the most commonly used statistical indicators. The particulars of the statistical parameters are revealed in Table 3 [27]. Table 3. Equations of statistical parameters.

S. No.
Statistical Parameter Equation Uncertainty at 95% (U 95 ) Table 4 depicts the configuration parameters for all the considered MLAs. The Grid-searchCV tool in Scikit-learn was used to obtain the significant hyperparameters for each MLA. The hyper-parameters of ML models were calculated based on the fit accuracy.  Figure 3 shows the performance of developed MLAs. There is a good agreement between the estimated and the measured (from CFD simulations) values for most MLAs. The deposition efficiency was estimated extremely well by all the MLAs. The MLP and GPR models performed exceptionally well, while the RF and k-NN models performed reasonably. Table 4 depicts the configuration parameters for all the considered MLAs. The GridsearchCV tool in Scikit-learn was used to obtain the significant hyperparameters for each MLA. The hyper-parameters of ML models were calculated based on the fit accuracy.       GPI was employed to improve our outcomes and eliminate anomalies in the statistical analysis and model ranking. Despotovic et al. [28] were the first to introduce GPI as a novel aspect; it is a fantastic way to integrate the effects of many statistical pointers. The particulars of the GPI technique can be found in Ref [29].
Scaled values of statistical parameters (0 to 1) and GPI and ML model ranking are depicted in Table 6. Figure 4 represents the overall GPI estimation. The MLP model ranks 1st with GPI = 1.603 then the GPR model with GPI = 1.308, followed by the RF model with GPI = −0.946. The MLP model shows the best performance among all the models in category 1. In category 2, the MLP model again shows top performance, followed by the GPR and RF models. Thus, we selected the MLP model for further analysis. Scaled values of statistical parameters (0 to 1) and GPI and ML model ranking are depicted in Table 6. Figure 4 represents the overall GPI estimation. The MLP model ranks 1st with GPI = 1.603 then the GPR model with GPI = 1.308, followed by the RF model with GPI = −0.946. The MLP model shows the best performance among all the models in category 1. In category 2, the MLP model again shows top performance, followed by the GPR and RF models. Thus, we selected the MLP model for further analysis.

Prediction Using ML Regression
The ML prediction model calculated the DE of the right lung and left lung for the 17generation model. The DE was calculated for a wide range of flow rates and particle sizes, which was unavailable from the CFD measurement. The overall DE was calculated for the left and right lung and the five different lobes for the large-scale model. The schematic in Figure 5 shows the areas for the different parts of the 17-generation model.

Prediction Using ML Regression
The ML prediction model calculated the DE of the right lung and left lung for the 17generation model. The DE was calculated for a wide range of flow rates and particle sizes, which was unavailable from the CFD measurement. The overall DE was calculated for the left and right lung and the five different lobes for the large-scale model. The schematic in Figure 5 shows the areas for the different parts of the 17-generation model.  Figure 6a shows the DE for 1 µm diameter particles at different flow rates. The overall DE of 1 µm particle in the right lung is higher than in the left, irrespective of the flow rates. For micron-sized particles, inertia plays an important role in overall transport and deposition. At high flow rates, larger particles cannot follow the air path line and deviates from the air streamline. At high flow conditions, once the particles are displaced from the original pathline, they touch the airway wall and become trapped on the highly viscous surface of the airway wall. However, aerosols can follow the air streamline at low flow inlet conditions, minimising the overall DE at the upper airways. The right lung anatomical  Figure 6a shows the DE for 1 µm diameter particles at different flow rates. The overall DE of 1 µm particle in the right lung is higher than in the left, irrespective of the flow rates. For micron-sized particles, inertia plays an important role in overall transport and deposition. At high flow rates, larger particles cannot follow the air path line and deviates from the air streamline. At high flow conditions, once the particles are displaced from the original pathline, they touch the airway wall and become trapped on the highly viscous surface of the airway wall. However, aerosols can follow the air streamline at low flow inlet conditions, minimising the overall DE at the upper airways. The right lung anatomical diameter is higher than the left lung, and the right lung consists of three airway lobes while the left contains two lobes. The complex anatomical shape and branching structure, higher flow distribution to the right lung, flow rates and particle inertia influence the overall DE at the right lung. The DE for the right and the left lung increased with the flow rates, which closely aligns with the CFD data, and it also satisfies the general aerosol deposition hypothesis in airways. Figure 6b-f show a similar trend of DE for different diameter particles. Figure 6d-f show the DE for 7 µm, 8.5 µm and 10 µm diameter particles, which is significantly higher than the smaller particles. Larger diameter particles mean the particle inertia will be higher, and the impaction mechanism is dominant for larger particles. At high flow condition, the inertial impaction became more dominant for the larger diameter particles and influenced the overall DE. while the left contains two lobes. The complex anatomical shape and branching structure, higher flow distribution to the right lung, flow rates and particle inertia influence the overall DE at the right lung. The DE for the right and the left lung increased with the flow rates, which closely aligns with the CFD data, and it also satisfies the general aerosol deposition hypothesis in airways. Figure 6b-f show a similar trend of DE for different diameter particles. Figure 6d-f show the DE for 7 µm, 8.5 µm and 10 µm diameter particles, which is significantly higher than the smaller particles. Larger diameter particles mean the particle inertia will be higher, and the impaction mechanism is dominant for larger particles. At high flow condition, the inertial impaction became more dominant for the larger diameter particles and influenced the overall DE.  . Predicted DE for different flow rates: (a) 1 µm particle, (b) 2.5 µm particle, (c) 5 µm particle, (d) 7 µm particle, (e) 8.5 µm particle and (f) 10 µm particle.
The comparison of DE in different lobes with respect to flow rate and particle diameter for particle sizes of 2.5 µm and 7.5 µm is shown in Figure 7. For a 2.5 µm diameter particle, the DE is higher at the right lower lobe and lower at the left upper lobe irrespective of the deposition parameter. It is also noticed that the DE of 2.5 µm particles at the right lower lobe increased with the flow rate. The overall DE at the right and left upper lobes is lower than the right and left lower lobes. The upper lobes' branches are in the upward direction and opposite to the gravitational force, while the lower lobes are in the downward direction. Micron-sized particles mostly travel downward due to the high inertia, which transports fewer aerosols to the upper lobes of the airways, and less deposition occurs at the upper lobes. Figure 7b shows the 7.5 µm aerosol DE at different lobes, and a similar trend is observed at various lobes. At the flow rate (15 L/min), DE at the left upper lobe is different from other lobes. The overall DE at the left upper lobe is higher for low flow rate and lower for high flow rate. At a high flow rate, larger diameter particles deviate from the streamline and transport to the lower airways, while at a low flow rate, more particles transport to the upper airways. The overall DE at the right middle lobe shows a similar trend for 2.5 µm and 7.5 µm particles.  Predicted DE for different flow rates: (a) 1 µm particle, (b) 2.5 µm particle, (c) 5 µm particle, (d) 7 µm particle, (e) 8.5 µm particle and (f) 10 µm particle.
The comparison of DE in different lobes with respect to flow rate and particle diameter for particle sizes of 2.5 µm and 7.5 µm is shown in Figure 7. For a 2.5 µm diameter particle, the DE is higher at the right lower lobe and lower at the left upper lobe irrespective of the deposition parameter. It is also noticed that the DE of 2.5 µm particles at the right lower lobe increased with the flow rate. The overall DE at the right and left upper lobes is lower than the right and left lower lobes. The upper lobes' branches are in the upward direction and opposite to the gravitational force, while the lower lobes are in the downward direction. Micron-sized particles mostly travel downward due to the high inertia, which transports fewer aerosols to the upper lobes of the airways, and less deposition occurs at the upper lobes. Figure 7b shows the 7.5 µm aerosol DE at different lobes, and a similar trend is observed at various lobes. At the flow rate (15 L/min), DE at the left upper lobe is different from other lobes. The overall DE at the left upper lobe is higher for low flow rate and lower for high flow rate. At a high flow rate, larger diameter particles deviate from the streamline and transport to the lower airways, while at a low flow rate, more particles transport to the upper airways. The overall DE at the right middle lobe shows a similar trend for 2.5 µm and 7.5 µm particles.
The comparison of DE in different lobes with respect to flow rate and particle diameter for particle sizes of 2.5 µm and 7.5 µm is shown in Figure 7. For a 2.5 µm diameter particle, the DE is higher at the right lower lobe and lower at the left upper lobe irrespective of the deposition parameter. It is also noticed that the DE of 2.5 µm particles at the right lower lobe increased with the flow rate. The overall DE at the right and left upper lobes is lower than the right and left lower lobes. The upper lobes' branches are in the upward direction and opposite to the gravitational force, while the lower lobes are in the downward direction. Micron-sized particles mostly travel downward due to the high inertia, which transports fewer aerosols to the upper lobes of the airways, and less deposition occurs at the upper lobes. Figure 7b shows the 7.5 µm aerosol DE at different lobes, and a similar trend is observed at various lobes. At the flow rate (15 L/min), DE at the left upper lobe is different from other lobes. The overall DE at the left upper lobe is higher for low flow rate and lower for high flow rate. At a high flow rate, larger diameter particles deviate from the streamline and transport to the lower airways, while at a low flow rate, more particles transport to the upper airways. The overall DE at the right middle lobe shows a similar trend for 2.5 µm and 7.5 µm particles.  A comprehensive lobe-specific DE was calculated for various diameter particles. Figure 8 shows the DE at different lobes of the 17-generation model for 15 L/min, 30 L/min and 60 L/min flow rates. The general trend shows higher DE at the lower lobes of the left and the right lung for all cases. The DE at the left lower lobe is lower than the remaining lobes in all cases. At the right upper lobe, the overall DE shows a fluctuating trend for different diameter particles. These specific findings are crucial for the health risk assessment of lung diseases and targeted drug delivery. The prediction model clearly indicates the deposition hot spot of the different diameter particles at various flow rates, which could potentially improve the efficiency of the targeted drug delivery. In conventional drug delivery tools, drug particles are mostly deposited in the upper airways. A minimum amount of drugs can reach the targeted areas of the lower part of the airways. The ML prediction model will improve the knowledge of the various diameter aerosol deposition hot spots and DE in the lower airways. An innovative drug delivery device can be developed for different lobes, and after diagnosis, a patient-specific drug delivery tool can be used, which will improve overall drug delivery efficiency. A comprehensive lobe-specific DE was calculated for various diameter particles. Figure 8 shows the DE at different lobes of the 17-generation model for 15 L/min, 30 L/min and 60 L/min flow rates. The general trend shows higher DE at the lower lobes of the left and the right lung for all cases. The DE at the left lower lobe is lower than the remaining lobes in all cases. At the right upper lobe, the overall DE shows a fluctuating trend for different diameter particles. These specific findings are crucial for the health risk assessment of lung diseases and targeted drug delivery. The prediction model clearly indicates the deposition hot spot of the different diameter particles at various flow rates, which could potentially improve the efficiency of the targeted drug delivery. In conventional drug delivery tools, drug particles are mostly deposited in the upper airways. A minimum amount of drugs can reach the targeted areas of the lower part of the airways. The ML prediction model will improve the knowledge of the various diameter aerosol deposition hot spots and DE in the lower airways. An innovative drug delivery device can be developed for different lobes, and after diagnosis, a patient-specific drug delivery tool can be used, which will improve overall drug delivery efficiency.

Conclusions
The present work advances the deposition analysis in the upper 17 generations of the human respiratory tract. Previously, researchers used CFD analysis, while a novel MLbased model is developed in the present work for the first-time. The present study developed ML prediction models from the CFD data of a 17-generation large-scale model. Different ML regression models are developed, and statistical analysis is performed to determine the best ML model for predicting pulmonary aerosol transport and deposition analysis in the upper 17 generations of the human respiratory tract. The ML models are trained and tested with the CFD data. It is witnessed that the MLP model performed well, with an overall GPI of 1.603 compared to other regression models. Furthermore, out of eight statistical indicator values, the MLP model has significant values for six. Therefore, the MLP regression model is used to predict the DE for a wide range of flow rates and particle sizes.
The MLP model predicts the DE in both lungs. The prediction shows higher DE in the right lung than in the left lung. The MLP model also reports higher DE at the left and the right lung irrespective of the inlet conditions and particle diameter. It is also observed that the MLP model gives excellent predictions with a similar trend of deposition efficiency as observed in our CFD work.
A comprehensive lobe-specific DE is calculated by using the ML prediction model. The overall DE at the right lower lobe is higher than the remaining lobes. The DE of various diameter particles is different at the lobes for different flow rates. The MLP prediction model analysed the deposition hot spot at various lobes for the first time, which would

Conclusions
The present work advances the deposition analysis in the upper 17 generations of the human respiratory tract. Previously, researchers used CFD analysis, while a novel ML-based model is developed in the present work for the first-time. The present study developed ML prediction models from the CFD data of a 17-generation large-scale model. Different ML regression models are developed, and statistical analysis is performed to determine the best ML model for predicting pulmonary aerosol transport and deposition analysis in the upper 17 generations of the human respiratory tract. The ML models are trained and tested with the CFD data. It is witnessed that the MLP model performed well, with an overall GPI of 1.603 compared to other regression models. Furthermore, out of eight statistical indicator values, the MLP model has significant values for six. Therefore, the MLP regression model is used to predict the DE for a wide range of flow rates and particle sizes.
The MLP model predicts the DE in both lungs. The prediction shows higher DE in the right lung than in the left lung. The MLP model also reports higher DE at the left and the right lung irrespective of the inlet conditions and particle diameter. It is also observed that the MLP model gives excellent predictions with a similar trend of deposition efficiency as observed in our CFD work.
A comprehensive lobe-specific DE is calculated by using the ML prediction model. The overall DE at the right lower lobe is higher than the remaining lobes. The DE of various diameter particles is different at the lobes for different flow rates. The MLP prediction model analysed the deposition hot spot at various lobes for the first time, which would improve the knowledge of the aerosol deposition in the lower airways. These specific findings would improve the knowledge of the field and could potentially improve the efficiency of the targeted drug delivery to the lower airways. Thus, the MLP model can be used to predict deposition efficiency for flow rates and particle sizes not considered in CFD analysis, resulting in considerable savings in time and cost. The developed prediction model will save a significant amount of computational time. The present study, along with a physics-informed ML modelling for the airflow and particle transport in airways, would improve the knowledge of the field.