Open Access
This article is

- freely available
- re-usable

*Materials*
**2019**,
*12*(9),
1544;
https://doi.org/10.3390/ma12091544

Article

Prediction and Sensitivity Analysis of Bubble Dissolution Time in 3D Selective Laser Sintering Using Ensemble Decision Trees

^{1}

University of Transport Technology, Hanoi 100000, Vietnam

^{2}

PIMM, ENSAM, CNRS, CNAM, HESAM Université, 151 Boulevard de l’Hôpital, 75013 Paris, France

^{3}

Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam

^{4}

NTT Hi-Tech Institute, Nguyen Tat Thanh University, Ho Chi Minh City 700000, Vietnam

^{*}

Authors to whom correspondence should be addressed.

Received: 31 March 2019 / Accepted: 6 May 2019 / Published: 10 May 2019

## Abstract

**:**

The presence of defects like gas bubble in fabricated parts is inherent in the selective laser sintering process and the prediction of bubble shrinkage dynamics is crucial. In this paper, two artificial intelligence (AI) models based on Decision Trees algorithm were constructed in order to predict bubble dissolution time, namely the Ensemble Bagged Trees (EDT Bagged) and Ensemble Boosted Trees (EDT Boosted). A metadata including 68644 data were generated with the help of our previously developed numerical tool. The AI models used the initial bubble size, external domain size, diffusion coefficient, surface tension, viscosity, initial concentration, and chamber pressure as input parameters, whereas bubble dissolution time was considered as output variable. Evaluation of the models’ performance was achieved by criteria such as Mean Absolute Error (MAE), Root Mean Squared Error (RMSE) and coefficient of determination (R

^{2}). The results showed that EDT Bagged outperformed EDT Boosted. Sensitivity analysis was then conducted thanks to the Monte Carlo approach and it was found that three most important inputs for the problem were the diffusion coefficient, initial concentration, and bubble initial size. This study might help in quick prediction of bubble dissolution time to improve the production quality from industry.Keywords:

3D selective laser sintering; artificial intelligence; decision trees; bubble dissolution time; sensitivity analysis## 1. Introduction

Selective laser sintering (SLS) is one of the most important 3D printing technique widely used in industry [1]. The main idea of this method is that powders are sintered by laser in order to produce solid objects predefined by 3D sketch models [2]. The addictive manufacturing technique exhibits many interesting behaviors not found in traditional methods. For instance, using 3D sketches combined with laser sources, SLS technology could easily handle complex geometries of objects [3,4,5,6,7]. Secondly, laser can melt a lot of materials such as metal, ceramic, polymer or glass powders, especially in case of polymeric particles [8,9,10,11]. In industry of polymer processing, the SLS method exposure also has great advantages, particularly in terms of saving time in fabrication [12]. However, combining particles and laser technique requires broadly information involving many multiphysical phenomena. For instance, depend on the nature of polymeric material, relationship between many parameters such as particle granulometry and morphology, laser power, and scan speed should be established in order to improve the production quality. Unfortunately, solid structures created using SLS technique are under the presence of many imperfections such as porosities, cracks, or lacks of fusion [12,13]. The existence of these defects could be explained by the complexity and the coupling of the different physics occurring in the process, for instance melting, coalescence, gas diffusion, and crystallization [14,15,16].

One of the most important defects using SLS technology is the formation of gas bubbles in fabricated parts [8]. As an illustration, for a considered layer during the process, laser beam irradiates polymer particles spread over the previous layer. Gas bubbles are therefore trapped in the melted layer by the difference between resulting temperatures [1]. The larger the initial size of the bubbles, the more dissolution time requires. That means the understanding of dissolution time of gas bubbles has a crucial impact on the processing of solids using SLS technique. However, it is a complex problem which depends on many factors, such as bubble internal radius, external radius, diffusion coefficient, surface tension, viscosity, saturation, and pressure [17]. Until now, no known analytical work has been proposed to predict such bubble dissolution time. Only semi-analytical or numerical methods have been established for investigating the influence of parameters on dissolution time. For instance Kontopoulou et al. [8] studied the impact of viscosity, initial bubble size, and applied pressure in order to determine the kinetic of bubble shrinkage. In another work, surface tension, diffusion coefficient, and saturation of polymer melt have been integrated to establish an equation of bubble dissolution time [18]. Many other attempts have also been introduced in order to relate the dissolution of bubbles and physical constant of the problem [19,20,21].

In general, there are three techniques commonly used in the literature to predict the mechanical properties of materials: computational modeling and simulation, experimental studies, or artificial intelligence (AI) approach. The modeling and simulation of physical–mechanical phenomena is usually complicated, requires extensive computation time, and is thus not widely used in practice. Experimental studies are costly, time consuming, and required a lot of specific equipment. Out of these, the artificial intelligence techniques have been recently used in many fields of science [22,23,24]. This technique exhibits a lot of advantages not found in traditional computational simulation [25], especially for predicting material behaviors, for instance, rheological properties of materials [26], permeability of polymers [27], composite materials [28], structure under compression [29], or concrete [30]. Nevertheless, to the best of our knowledge, no artificial intelligence approaches have been applied to predict bubble dissolution time in the SLS process as well as quantify the relationship between physical parameters of the problem. Moreover, works related to the modeling of bubble dissolution process only consider an infinite melted medium [8,18], which is not the case in SLS technique due to the entrapped gas bubble are close to the surface of powder bed. A proper approach is needed to take into account the finite medium in SLS process. In addition, laboratory experiments could only be used to observe the growth and collapse phenomena [31] but not the factors that affect the mechanism. The approach using AI algorithms, once developed, could provide efficient information and help to improve the quality of SLS production.

This contribution applied two AI algorithms namely the Ensemble Bagged Trees (EDT Bagged) and Ensemble Boosted Trees (EDT Boosted) to predict bubble dissolution time. The outline of the present study is organized as follows. The Introduction (Section 1) starts with the state-of-the-art of the problem. Section 2 (Materials and Methods) is devoted to describing the mechanism of the bubble shrinkage problem and the construction of data used for AI modeling. Bubble initial radius, external domain radius, diffusion coefficient, surface tension, viscosity, saturation, and chamber pressure were considered as inputs; and bubble dissolution time is output for the prediction tools. A brief introduction of two AI techniques based on the Ensemble Decision Trees algorithms is presented next, together with the Monte Carlo approach for the input sensitivity analysis. Principal results and discussions are presented in Section 3 (Results and Discussions), including the prediction capability of each model along with the sensitivity analysis of input parameters. Finally, the paper is completed with several conclusions and perspective in Section 4 (Conclusions).

## 2. Materials and Methods

#### 2.1. Physical Problem and Data Preparation

#### 2.1.1. Mechanical Description of the Bubble Dissolution Process

When fabricating parts in the SLS process, grains coalesce faster at the surface than deeper in the powder bed because of higher laser radiation absorption at its surface. Air porosities are then embedded in the melted layers, whose thickness can reach a few hundred of microns. Air porosity, or air bubbles, is an essential factor accounting for the final mechanical properties of the fabricated parts. Thus, understanding bubble dissolution process in the melted polymer is important, especially in SLS technique.

Bubble dissolution mechanism is a highly coupled and nonlinear physical problem. Noting that proper development of equations has been presented in our previous work, in this paper, only principal equations that have impact on the AI modeling part will be briefly recalled. The nature of the resorption process lied on the diffusion of gas inside the bubbles (the gas phase) into the melted materials, or the liquid phase, such as metal, glass, or polymer. During the resorption process, the movement of the interface between gas/liquid phases obeyed the fluid dynamics equations, represented by the equation of conservation of momentum [17]:
where P
where D stands for the diffusion coefficient, c is the concentration of gas in the fluid phase. The degree of saturation is introduced in order to take into account the boundary conditions derived from Henry’s law such as [17]:
with K
with R

$$[{P}_{b}(t)-{P}_{c}(t)]{R}_{i}(t)-2\gamma \left(1+\frac{{R}_{i}(t)}{{R}_{e}(t)}\right)+4\eta \left(\frac{{R}_{i}^{3}(t)}{{R}_{e}^{3}(t)}-1\right){\dot{R}}_{i}(t)=0$$

_{b}(t) and P_{c}(t) represent the pressure of the perfect gas inside the bubble and the chamber of the SLS machine, respectively; R_{i}(t), R_{e}(t) denotes the bubble initial radius and the radius of the outer liquid phase; ${\dot{R}}_{i}(t)$ is the rate of bubble dissolution; η refers to the fluid viscosity and γ stands for the surface tension. The liquid phase is characterized by fluid viscosity, whereas the surface tension depends on both media as it is resulted at the air-liquid interface. Besides, the rate of gas diffusion is related to the movement of the interface through the diffusion equation, or the Fick law [17]:
$$\frac{\partial c}{\partial t}=\frac{D}{{r}^{2}}\frac{\partial}{\partial r}({r}^{2}\frac{\partial c}{\partial r})$$

$$c(r,0)={x}_{sat}{K}_{H}{P}_{C}(0)$$

_{H}the Henry’s law constant. Finally, the mass balance equation at the bubble interface, in considering the decrease of perfect gas within the bubble is proportional to the flux across the interface, reads [17]:
$$\frac{\partial}{\partial t}\left(\frac{{P}_{b}(t){R}_{i}^{3}(t)}{{R}_{g}T}\right)=3{R}_{i}^{2}(t)D{\frac{\partial c(r,t)}{\partial r}|}_{r={R}_{i}}$$

_{g}the universal gas constant and T the temperature. For summary, the dissolution mechanism depends on the nature of the fluids (the molten materials), the mechanism of gas diffusion (air) and the mass balance at the gas/fluid interface. In the aim of predicting bubble dissolution time in the selective laser sintering (SLS) process, the need of experimental values of physical constants (inputs) as well as results (targets) are important. However, to the best of our knowledge, very few existing investigations clearly determining such parameters have been reported so far. These values are extremely difficult to determine. Therefore, simulation data constructed from previously developed numerical tool [17] have been used in this investigation. In this work, a finite element-based numerical tool has been developed in order to model and explain the influence of diverse parameters on the dissolution time, such as initial bubble size (R_{i}), the external radius of the fluid phase (R_{e}), the surface tension (γ), viscosity (η), diffusion coefficient (D), initial melt concentration (x_{sat}), and the chamber pressure (P). The fluid domain was assumed Newtonian. It is noteworthy that results extracted from such a numerical tool have already been compared and validated with experimental studies on the bubble removal in rotational molding process [8,18], a technique similar to the SLS.#### 2.1.2. Factors that Affecting Bubble Dissolution Time

In this paper, the prediction capability and sensitivity analysis of physical parameters that may have influence on the dissolution process were performed. It is important noticed that a change of any parameter affects all others, which is governed by the dynamic, thermodynamics, or physics of the bubble shrinkage problem [17]. For example, the change of gas inside bubble or the polymer melt causes changes in viscosity, surface tension, and diffusion coefficient applied to both liquid and gas phases. Therefore, all the parameters were varied in a broad range in order to fully collect results and analyze their effect to the bubble dissolution time.

Input parameters considered for the AI simulation were bubble radius (I

_{1}), bubble external radius (I_{2}), diffusion coefficient (I_{3}), surface tension (I_{4}), viscosity of the melted materials (I_{5}), initial saturation (I_{6}) and chamber pressure (I_{7}). The prediction output was bubble dissolution time (O). The values of physical parameters, i.e., I_{1}, I_{2}, I_{5}, (used in Equation (1)) were taken based on experimental data in the work of Kontopoulou et al. [8]. The diffusion coefficient (I_{3}) was varied around that in the case of air diffuse into polyethylene melt (used in Equations (2) and (4)), which was estimated previously in the work of Griskey [9]. Surface tension values (I_{4}) had been taken based on the case of polyethylene melts (used in Equation (1)), which was gathered from earlier study of Wu [32]. The viscosity of the melted polymer (I_{5}—used in Equation (1)) was collected following the works of Bird et al. [33] and Durrill et al. [34]. Being used in Equation (3), the initial saturation (I_{6}) and the chamber pressure (I_{7}) were varied from that of the analytical study of Gogos [18]. Parameters such as the Henry’s law constant, air parameters or the temperature dependence needed for applying an Arrhenius equation were not considered as inputs, as they are often directly related to one or several inputs I_{1}to I_{7}. Simulated data were taken to cover all the possibilities in the SLS process. A total number of 68,644 data were generated, lied on the fact that the corresponding dissolution time of these data varied from 20 s to 1200 s, which is of interest in the SLS process. If it is lower than 20 s, the dissolution process is quite quick and improvement of the SLS process appears less important. If it is higher than 1200 s, the dissolution process is too long so that it appears not interesting to be studied. A summary of the input and outputs values are presented in Table 1.As shown in Table 1, the dissolution time of bubble considered in this study varied from 20 to 1200 s, with a mean value of 194.2 s and a standard deviation of 234.9 s. That means the distribution of bubble dissolution time was highly asymmetric and exhibited a big amount of quantity distributed between 20 s and 200 s. This interval of dissolution time was very important in the SLS processing industry. The lower dissolution time of bubbles, the higher production quality.

#### 2.1.3. Datasets

In general, the training and testing datasets are required for constructing and validation of AI models. Data were divided into 2 groups at a ratio of 70/30. A total number of 48,051 training data (70%) were used to construct the prediction models, whereas 20,593 testing data (30%) were used for evaluation and validation of the models. The 70/30 ratio for generating the datasets in this study has been done as suggested by Khorsheed and Mohammad [35], and Leema et al. [36]. Description of two AI algorithms in this study will be briefly presented in the next section.

#### 2.2. Background of Models Used

#### 2.2.1. Decision Trees Methods

Decision trees (DT), proposed by Quinlan [37], is a popular machine learning method that can be applied in solving many real world problems such as identifying risk factors for drug use [38], flash flood prediction [39], landslide spatial prediction [40], prediction of a short-term photovoltaic power [41]. Main principle of the DT is to use a series of rules to identify the regions with the most homogeneous output variables to input variables on which a constant is fitted to each region. Main advantages of the DT are (i) it represents information in an intuitive and easy way for visualization, (ii) it is powerful for mining nonlinear and interactions effects between dependent and independent variables, (iii) it requires no mathematical assumptions between output and input variables, and (iv) it is capable to handle missing values and outliers [42]. On the other hands, the DT also faces several problems such as (i) it is difficult to model smooth functions, (ii) the structure of tree is sensitive with the sample data as only small change in training data can give very different results, and (iii) it has high variance and low bias [43]. Therefore, many techniques have proposed to enhance the predictive capability of the DT such as Ensemble Bagged Trees and Ensemble Boosted Trees.

#### Ensemble Bagged Trees (EDT Bagged)

EDT Bagged is a hybrid model which is a combination of bagging algorithm and decision trees [44]. Out of these methods, bagging method uses the bootstrap sampling to optimize the input training data for learning the decision trees [45]. This process can be carried out in several steps such as (i) it takes a bootstrap sample from the original training dataset, (ii) it fits the decision trees with the extracted dataset, (iii) it then finds the best fit models with the optimized dataset, (iv) it thereafter uses each of the fitted models with the optimize dataset to predict the results, and (v) it finally averages the prediction results of all the models [46]. EDT Bagged can significantly improve the prediction accuracy as bootstrap aggregation used in bagging method can reduced the variance in individual prediction methods like decision trees. EDT Bagged has been applied to solve many real world problems namely travel time prediction [44], disease prediction [45], and bankruptcy prediction [47]. In this study, we have used the EDT Bagged for prediction of bubble dissolution time when manufacturing solid structures by SLS technique.

Suppose that $x=\left({x}_{1},{x}_{2},\dots ,{x}_{n}\right)$ is a vector of input variables (n is the number of input variables), the bootstrap aggregation used in EDT Bagged to average the predictions is defined as follows [46]:
where N is the number of various training datasets generated by bootstrapping, and ${f}_{i}^{\ast}\left(x\right)$ are defined as predictors.

$${f}_{bag}\left(x\right)=\frac{1}{N}{\displaystyle \sum _{i=1}^{N}{f}_{i}^{\ast}}\left(x\right)$$

#### Ensemble Boosted Trees (EDT Boosted)

EDT Boosted is an additive regression model in which individual terms are simple trees, fitted in a forward, stagewise estimation [48]. It is well-known as an ensemble method which combines regression trees and boosting techniques in which the regression trees are used to connect output variables to input variables by recursive binary splits whereas the boosting technique is used to combine many single models to improve the predictive capability of the ensemble models [49]. EDT Boosted takes advantages of tree-based methods, and even overcomes the drawback of single tree models as (i) it is able to select relevant variables to fit accurate functions, (ii) it can be fitted with different amounts of data using stochastic boosting, and (ii) it can reduce both bias by forward stagewise fitting and variance by model averaging [48]. EDT Boosted has been applied to many fields such as banking [49], ecology [50], and medical [51]. In this study, EDT Boosted is a first time use for predicting the dissolution time of bubble removal when using laser to melt polymer powders in a SLS procedure.

Suppose that $x=\left({x}_{1},{x}_{2},\dots ,{x}_{n}\right)$ is a vector of predictors (n is the number of predictors) and y is a response, the EDT Boosted algorithm is trained by following function [52]:
where ${\alpha}_{k}$ values represent weights given to the nodes of each tree, ${y}_{k}$ is defined as the split variables, $t\left(x,{y}_{k}\right)$ represents single decision trees.

$$f\left(x\right)={\displaystyle \sum _{k}{f}_{k}}\left(x\right)={\displaystyle \sum _{k}{\alpha}_{k}}t\left(x,{y}_{k}\right)$$

#### 2.2.2. Monte Carlo Method

In many fields of science, average values of parameters are not sufficient enough for investigating the problem and also the complex relationship between parameters. For instance, material variability and structural variability could largely impact the effective mechanical behaviors of composite structure [53,54,55,56]. Along with the development of computational algorithms and computer performance, Monte Carlo method appeared then as a highly efficient technique for accounting variability of parameters when they become relevant [57]. This method could help researchers to determine which parameters are important on the overall responses following variability of each input. Indeed, Monte Carlo method exhibits high capability to quantify numerically linear or non-linear relationship in data. The main idea of this method is to repeat random sampling of inputs using probability density distribution of each. By doing so, variability and of inputs could be fully propagated to the overall properties though constructed models. Statistical analysis of outputs is next carried out in order to quantify the impact of input variability on the responses. A schematization of using Monte Carlo method for propagating input variability is presented in Figure 1.

Monte Carlo method could also be used for analyzing the sensitivity of input parameters on the responses though constructed models [58]. For instance, the considered input has no influence on the final responses if it is removed from the input space and the final responses has statistically no differences compared to the case of simulation with all inputs. This way, the sensitivity of all inputs to the overall properties could be entirely quantified. This analysis has an important role for designer, for instance, dimension of the problem could be reduced and time consuming could be saved by eliminating insignificant parameters. To the contrary, sensitive parameters exposure important role and receive so huge attention.

In this study, the “normdiff” function of random variable P was introduced in order to: (i) determine the corresponding optimal number of Monte Carlo realizations and (ii) evaluate the degree of fluctuation of P around its average value (in % unit). The normdiff function was defined as [55]:
where N

$$\mathrm{normdiff}\left({\mathrm{N}}_{\mathrm{r}}\right)=\frac{1}{\overline{\mathrm{P}}{\mathrm{N}}_{\mathrm{r}}}{\displaystyle \sum _{\mathrm{i}=1}^{{\mathrm{N}}_{\mathrm{r}}}{\mathrm{P}}_{\mathrm{i}}}$$

_{r}is the actual number of Monte Carlo runs and $\overline{\mathrm{P}}$ is the average value of P. In this work, the normdiff was applied for estimating the statistical convergence of several random variables such as Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and coefficient of determination (R^{2}) respectively (see Section 2.2.3.).Uniform distribution was applied to perform random sampling of input data for Monte Carlo simulations. Random subsets of input data were drawn uniformly between 1 and last sample index. The rectangular distribution ensures in this case no constraints for generating input realizations (i.e., all sub-spaces of the input space were accounted in the simulation).

#### 2.2.3. Validation Criteria

In this study, popular validation criteria namely Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and coefficient of determination (R
where m is considered as the number of the samples, ${v}_{i},{\overline{v}}_{i},and\overline{v}$ are the actual output, the predicted output, and the mean of the ${\overline{v}}_{i}$, respectively.

^{2}) were selected for validating the predictive capability of ensemble decision trees [59,60]. RMSE measures the average squared difference between actual and predicted values, MAE measures the average magnitude of the errors, and R^{2}measures the correlation between actual and predicted values [59,60]. Whereas R^{2}varies from 0 (lowest models) and 1 (best models), higher RMSE and MAE shows lower predictive capability of the models. We can calculate these RMSE and MAE and R^{2}by following equations [61,62]:
$$RMSE=\sqrt{\frac{1}{m}{\displaystyle \sum _{i=1}^{m}{\left({v}_{i}-{\overline{v}}_{i}\right)}^{2}}}$$

$$MAE=\frac{1}{m}{\displaystyle \sum _{i=1}^{m}\left({v}_{i}-{\overline{v}}_{i}\right)}$$

$${R}^{2}=1-\frac{{\displaystyle \sum _{i=1}^{m}{\left({v}_{i}-{\overline{v}}_{i}\right)}^{2}}}{{\displaystyle \sum _{i=1}^{m}{\left({v}_{i}-{\overline{v}}_{}\right)}^{2}}}$$

#### 2.2.4. Methodology Chart

We present here different steps of the present work’s methodology, which contains five main steps as followings (as shown in Figure 2):

**Step 1**. Numerical tools for collecting data. In this step, all input parameters such as bubble initial radius (I

_{1}), bubble external radius or the domain size (I

_{2}), the diffusion coefficient (I

_{3}), surface tension (I

_{4}), viscosity of the melted materials (I

_{5}), the initial saturation (I

_{6}), and the chamber pressure (I

_{7}) were used to compute the prediction output-bubble dissolution time (O), using the developed finite element model [17]. Fluid dynamics, mass balance and gas diffusion were taken into account in this numerical model.

**Step 2**. Dataset preparation for training AI models. In this step, all input and output parameters were used to create a complete set of data. A number of 70% data (48,051 training data) were taken from the initial dataset for training the two proposed AI models. The remaining 30% data (20,593 testing data) were used for validating step, after the AI models were trained.

**Step 3**. Training models. In this step, the AI models were trained using the training data set. Two AI algorithms were used: EDT Bagged and EDT Boosted. The algorithm of these two models were introduced in the previous section. This step was repeated until the models were successfully trained (a tolerance error criteria was reached).

**Step 4**. Validating models. In this step, after the two AI models were trained, the validation process was performed using the testing data set (30% of the initial data set). The models were validated using different statistical measures such as: Root Mean Squared Error (RMSE), Mean Absolute Error (MAE) and the coefficient of determination (R

^{2}).

**Step 5**. Sensitivity analysis. After validation of the two proposed AI models using the testing data set, the sensitivity of input parameters was performed. This process was carried out thanks to the Monte Carlo approach.

## 3. Results and Discussion

#### 3.1. Comparison of Ensemble Decision Trees (EDT) Algorithms

Validation of the two proposed AI models are illustrated in Figure 3 for the training part including 48,051 data (left) and testing part with 20,593 data (right). Error frequency are plotted in function of relative errors between target and output values for EDT Bagged (blue line) and EDT Boosted (orange line) algorithms. In both training and testing parts, the EDT Boosted exhibit more error between output and target, as the values ranging from −1 to 6. The predicted outputs are ranging from 1 time lower to 6 times higher than the target ones. As regard to the training part, the over predicted values outside the range −1 to 1 were 1880 samples (or 3.9%) for EDT Boosted and 275 samples (or 0.57%) for EDT Bagged. For the testing part, the over predicted values outside the range −1 to 1 were 893 samples (or 4.3%) for EDT Boosted and 163 samples (or 0.79%) for EDT Bagged, showing reasonable performance of the AI models. Considering the EDT Bagged algorithm, the error range between output and target is rather narrower than the EDT Boosted model, i.e., ranging from −0.5 to 2. The peaks (blue line) of both training and testing parts are centered at 0, showing excellence prediction capability of EDT Bagged. For both AI algorithms, the number of over predicted values are more than that of under predicted values, confirmed by the error curves are rather on the right side of 0. It is worth noticed that for this simulation, the error measurements are: RMSE = 24.38, MAE = 9.70, R

^{2}= 0.989 for training EDT Bagged, RMSE = 28.28, MAE = 9.91, R^{2}= 0.986 for testing EDT Bagged, whereas these values are RMSE = 141.48, MAE = 79.64, R^{2}= 0.852 for training EDT Boosted, RMSE = 144.80, MAE = 81.80, R^{2}= 0.843 for testing EDT Boosted. In general, both of methods show good predictive capability based on the validation criteria R^{2}, RMSE and MAE. Comparing the prediction performance using testing parts, EDT Bagged appears a better candidate for this problem, as the RMSE, MAE, and the number of wrong predictions are smaller than those of EDT Boosted, whereas the R^{2}values are higher than that of the EDT Boosted model.In validation stage, it is difficult to conclude the performance of given AI algorithms unless a fully analysis is performed. As 70% of data are randomly taken to train the AI prediction tools, the corresponding performance measurements (RMSE, MAE, or R

^{2}) are different for each simulation [63]. Such values depend of the choice of combination of 70% input data and can be varied from one to another simulation. In order to check the robustness of EDT Bagged and EDT Boosted models, thus, 1000 different arrangements of data were generated using uniform distribution and the error measurements in each case were collected. RMSE, MAE, or R^{2}errors for 1000 different simulations are plotted under scatter points in Figure 4 for training and testing parts. The variation of error measurements in the training part is rather smaller than that of the testing part in all cases, which clearly demonstrate the dependence of the predicted results on input combination. In the case of EDT Boosted model (Figure 4a–c), the training and testing part vary around the same values, i.e., RMSE = 141.21, MAE = 79.65, and R^{2}= 0.855. Regarding EDT Bagged algorithm (Figure 4d–f), the training results are RMSE = 25.82, MAE = 9.83 and R^{2}= 0.988, whereas the testing one are RMSE = 23.78, MAE = 9.14, and R^{2}= 0.990. This indicates that EDT Bagged outperforms EDT Boosted. However, both EDT Boosted and EDT Bagged algorithms possess ability to well-predict bubble dissolution time in the SLS process.The normalized statistical convergence analysis of the obtained results is calculated using Equation (3) and plotted in Figure 5. It is observed that over two Monte Carlo runs for EDT Boosted model are required to achieve the convergence within the 1% range around the converged values. On the contrary, RMSE and MAE criteria require at least 30 simulations to achieve converged values. For R

^{2}criterion, both algorithms are converged in a 1% error range without any simulations. These results demonstrate that the two proposed AI algorithms are promising methods for the prediction of bubble dissolution time with an interesting convergence rate. Out of these, the EDT Bagged is better in term of precision than the EDT Boosted but it requires more simulations to achieve converged statistical values.#### 3.2. Sensitivity Analysis of Input Parameters

Prediction of bubble dissolution time is a complex problem in which physical and mechanical equations are highly coupled and nonlinear. Thus, the sensitivity analysis is then carried out in order to evaluate the impact of each single input parameter to the predicted output. This could be an attempt to reduce the input space if one input variable is found not affect the final prediction results. All the input parameters were successively excluded from the input space by setting the column to zero values. Monte Carlo simulations were then performed in order to quantify the influence of each parameter, thanks to the prediction performance, i.e., RMSE, MAE, and R

^{2}. In excluding successively I_{1}, …, I_{7}, a total number of 7 groups of analysis were generated and simulated with a number of 1000 runs for each group. The statistical performances (RMSE, MAE, and R^{2}) of two AI models are highlighted in Table 2. Based on mean values of RMSE, MAE and R^{2}over 14,000 simulations, the EDT Bagged outperforms the EDT Boosted. Moreover, standard deviation values of EDT Bagged model are smaller than that of the EDT Boosted, indicating that the EDT Bagged algorithm is a more stable prediction technique.Let us consider the case where input parameters I

_{1}to I_{7}are successively excluded from the input space. It is observed that when I_{3}is excluded from the prediction process, highest mean values of RMSE and MAE are obtained, i.e., RMSE = 220.44, MAE = 165.46 for EDT Bagged algorithm and RMSE = 228.18, MAE = 149.14 for EDT Boosted model. Regarding R^{2}, lowest mean values are obtained: R^{2}= 0.122 for EDT Bagged algorithm and R^{2}= 0.168 for EDT Boosted model. This means that without the diffusion coefficient (I_{3}) in the input space, it is difficult to achieve acceptable performance of the AI prediction tools. Therefore, the diffusion coefficient (I_{3}) is the most important input variable in the prediction process of bubble dissolution time. Although there is no known analytical solution to predict the bubble dissolution process in Selective Laser Sintering until now, several observations in good agreement with the results can be found in the literature. Kontopoulou et al. [8], Gogos [18] reported the kinetic of bubble shrinkage and concluded that such mechanism is mostly controlled by the diffusion of air from the bubble to the polymer melt. The distribution of the resulted dissolution time was highly distributed in the range between 20 s and 200 s. This in good agreement with this finding, as the mean value of I_{3}was rather high, i.e., 4.1 × 10^{−9}m^{2}·s^{−1}.Based on the RMSE, MAE, and R

^{2}values, the second and the third important input factors in the prediction process are the initial concentration (saturation rate-I_{6}) and the initial bubble size (I_{1}). Indeed, without I_{6}, the as-obtained errors are: RMSE = 124.63, MAE = 71.64, R^{2}= 0.719 for EDT Bagged algorithm, whereas RMSE = 184.26, MAE = 101.63, R^{2}= 0.554 in case of EDT Boosted model. Besides, in excluding I_{1}out of the input space, the corresponding errors are: RMSE = 100.80, MAE = 48.86, R^{2}= 0.816 for EDT Bagged algorithm, whereas RMSE = 157.11, MAE = 87.85, R^{2}= 0.742 for EDT Boosted model. The higher values of RMSE, MAE and the lower values of R^{2}compared to that of the simulation using 7 inputs indicates the necessity of these variables in the prediction of bubble dissolution time which is in good agreement with the work of Kontopoulou et al. [8] where the authors mentioned that the air concentration and bubble initial size were found to be of great important in the bubble shrinkage process. Again, the air initial concentration distribution was taken to achieve mean value of I_{6}= 0.54, SD = 0.37, and the initial bubble size mean value I_{1}= 112.4 µm, SD = 19.37 µm. The skewness in the input space affected the bubble dissolution time to be in a range between 20 s–200 s, which was in good agreement with observations in the literature.In our previous work [17], the contribution lies on the simulation tool for the prediction of bubble dissolution process. The polymer domain is generally considered an infinite domain in the literature, whereas we have introduced the parameter that accounting for the size of the polymer domain (I

_{2}). In the sensitivity analysis of this study, such parameter is found to be more important than the others (I_{4}, I_{5}and I_{7}). The error when excluding I_{2}are RMSE = 58.84, MAE = 26.57, R^{2}= 0.938 for EDT Bagged algorithm and RMSE = 142.84, MAE = 80.54, R^{2}= 0.843 for EDT Boosted model. The surface tension (I_{4}), applied pressure (I_{7}), and viscosity (I_{5}) are found to be the most insensitive input parameters for the prediction problem (Table 2). The role of these parameters has also been discussed in several works relating the SLS process, such as the surface tension was neglected to derive an analytical solution [64], or its effect is negligible under certain condition, i.e., small degree of saturation [18]. The bubble shrinkage process is not influenced considerably by melt viscosity, as reported in the work of Kontopoulou et al. [8]. Last but not least, the effect of pressure is important only in case when it is applied after the formation of the bubble [8,18]. In our developed numerical tool, the pressure is considered constant during the whole SLS process [17]. Therefore, no conclusion on the effect of pressure can be deduced at this moment.The histogram of RMSE, MAE, and R

^{2}values of 14,000 simulations are plotted in Figure 6. All the peaks are rather narrow, demonstrating small values of standard deviation. The three most influenced parameters are easily observed, as they are separated to the rest. However, depending on the AI models, other input parameters can be detected or not. In case of EDT Boosted algorithm, the curves relating I_{2}, I_{4}, I_{5}, and I_{7}are superimposed for RMSE, MAE and R^{2}criteria. Regarding the EDT Bagged algorithm, these input parameters can be slightly identified by RMSE and R^{2}criteria, whereas it is difficult for MAE criterion. As a conclusion, sensitivity analysis requires not only adapted error criteria but also a good prediction model that can capture the difference influence between variables.The normalized statistical convergence analysis of the results are plotted in Figure 7 (see also Equation (3)), for the cases of three most influenced input parameters. Again, it is observed that over two Monte Carlo simulations for both EDT Boosted and EDT Bagged models are required to achieve the convergence within the 1% range around the converged values. On the contrary, for R

^{2}criterion, both algorithms are converged in a 1% error range within 10 Monte Carlo simulations. These results demonstrate that the two proposed AI algorithms are potential candidates for the prediction of bubble dissolution time with an interesting convergence rate. Out of these, the EDT Bagged is better in term of precision than the EDT Boosted but it requires more simulations to achieve converged statistical values.## 4. Conclusions

In this study, two ensemble AI models namely EDT Bagged and EDT Boosted were proposed and compared for predicting bubble dissolution time of the SLS process. A metadata of 68,644 simulated ones were created with the help of our previously developed numerical tool. They were used for generating datasets, which included input parameters (initial bubble size, external domain size, diffusion coefficient, surface tension, viscosity, initial concentration, the SLS chamber pressure) and output parameter (bubble dissolution time). Validation of the models was achieved using several criteria such as MAE, RMSE, and R

^{2}. Monte Carlo approach was finally used to carry out the sensitivity analysis for determination of the most important input parameter for the prediction of bubble dissolution time.The results showed that both AI models performed well for prediction of the bubble dissolution time of the SLS process but EDT Bagged (MAE = 9.14, RMSE = 23.78, and R

^{2}= 0.990) outperforms EDT Boosted (MAE = 79.62, RMSE = 141.16, and R^{2}= 0.856). The sensitivity analysis results showed that three most important input of the prediction problem were diffusion coefficient, initial concentration and bubble initial size. The present study simplified the mechanical problem of the SLS process in considering the dissolution of a single spherical gas. Therefore, a perspective of the study is to consider 2D and 3D problem to mimic the real bubble shrinkage problem. Other hybrid AI models should also be studied to improve the predictive capability. In the case of simplified approach, the computation time was estimated about 2 min for each dataset. However, the AI approaches would be more significant and useful when considering the prediction of 2D and 3D problem, where computation time would be extensively high for the generation of one dataset. Investigation of other sensitivity analysis method such as Global Sensitivity Analysis (GSA) [65] will also be a perspective of this study.## Author Contributions

Conceptualization, H.-B.L., T.-T.L. and E.M.; Methodology, H.-B.L. and B.T.P.; Software, H.-B.L. and T.-T.L.; Validation, H.-B.L., V.M.L. and T.-T.L.; Formal Analysis, H.-B.L. and E.M.; Investigation, H.-B.L. and T.-T.L.; Resources, H.-B.L., E.M. and G.R.; Data Curation, H.-B.L., E.M., M.D. and G.R.; Writing–Original Draft Preparation, H.-B.L., T.-T.L., V.M.L. and B.T.P.; Writing–Review & Editing, H.-B.L. and E.M.; Visualization, H.-B.L., T.-T.L. and V.M.L.; Supervision, H.-B.L., E.M. and G.R.; Project Administration, H.-B.L. and E.M.

## Funding

This research received no external funding.

## Conflicts of Interest

The authors declare no conflict of interest.

## References

- Schmid, M. Laser Sintering with Plastics: Technology, Processes, and Materials; Carl Hanser Verlag GmbH & Co. KG: München, Germany, 2018; ISBN 978-1-56990-683-5. [Google Scholar]
- Nelson, J.C. Selective Laser Sintering: A Definition of the Process and an Empirical Sintering Model; University of Texas at Austin: Austin, TX, USA, 1993. [Google Scholar]
- Oter, Z.C.; Coskun, M.; Akca, Y.; Surmen, O.; Yilmaz, M.S.; Ozer, G.; Tarakci, G.; Khan, H.M.; Koc, E. Support optimization for overhanging parts in direct metal laser sintering. Optik
**2019**, 181, 575–581. [Google Scholar] [CrossRef] - Cheng, B.; Chou, K. Geometric consideration of support structures in part overhang fabrications by electron beam additive manufacturing. Comput. Aided Des.
**2015**, 69, 102–111. [Google Scholar] [CrossRef] - Calignano, F. Design optimization of supports for overhanging structures in aluminum and titanium alloys by selective laser melting. Mater. Des.
**2014**, 64, 203–213. [Google Scholar] [CrossRef] - Yuan, S.; Zheng, Y.; Chua, C.K.; Yan, Q.; Zhou, K. Electrical and thermal conductivities of MWCNT/polymer composites fabricated by selective laser sintering. Compos. Part A Appl. Sci. Manuf.
**2018**, 105, 203–213. [Google Scholar] [CrossRef] - Josupeit, S.; Ordia, L.; Schmid, H.-J. Modelling of temperatures and heat flow within laser sintered part cakes. Addit. Manuf.
**2016**, 12, 189–196. [Google Scholar] [CrossRef] - Kontopoulou, M.; Vlachopoulos, J. Bubble dissolution in molten polymers and its role in rotational molding. Polym. Eng. Sci.
**1999**, 39, 1189–1198. [Google Scholar] [CrossRef] - Griskey, R. Polymer Process Engineering; Springer Science & Business Media: Dordrecht, The Netherlands, 2012. [Google Scholar]
- Doremus, R.H. Diffusion of Oxygen from Contracting Bubbles in Molten Glass. J. Am. Ceram. Soc.
**1960**, 43, 655–661. [Google Scholar] [CrossRef] - Wei, Q.; Zhong, J.; Xu, Z.; Xu, Q.; Liu, B. Microstructure evolution and mechanical properties of ceramic shell moulds for investment casting of turbine blades by selective laser sintering. Ceram. Int.
**2018**, 44, 12088–12097. [Google Scholar] [CrossRef] - Mokrane, A.; Boutaous, M.; Xin, S. Process of selective laser sintering of polymer powders: Modeling, simulation, and validation. C. R. Méc.
**2018**, 346, 1087–1103. [Google Scholar] [CrossRef] - Venuvinod, P.K.; Ma, W. Selective Laser Sintering (SLS). In Rapid Prototyping: Laser-based and Other Technologies; Venuvinod, P.K., Ma, W., Eds.; Springer: Boston, MA, USA, 2004; pp. 245–277. ISBN 978-1-4757-6361-4. [Google Scholar]
- Bourell, D.L.; Watt, T.J.; Leigh, D.K.; Fulcher, B. Performance Limitations in Polymer Laser Sintering. Phys. Procedia
**2014**, 56, 147–156. [Google Scholar] [CrossRef] - Schmid, M.; Wegener, K. Additive Manufacturing: Polymers Applicable for Laser Sintering (LS). Procedia Eng.
**2016**, 149, 457–464. [Google Scholar] [CrossRef] - Wudy, K.; Lanzl, L.; Drummer, D. Selective Laser Sintering of Filled Polymer Systems: Bulk Properties and Laser Beam Material Interaction. Phys. Procedia
**2016**, 83, 991–1002. [Google Scholar] [CrossRef] - Ly, H.-B.; Monteiro, E.; Dal, M.; Regnier, G. On the factors affecting porosity dissolution in selective laser sintering process. AIP Conf. Proc.
**2018**, 1960, 120014. [Google Scholar] - Gogos, G. Bubble removal in rotational molding. Polym. Eng. Sci.
**2004**, 44, 388–394. [Google Scholar] [CrossRef] - Weinberg, M.C. Surface tension effects in gas bubble dissolution and growth. Chem. Eng. Sci.
**1981**, 36, 137–141. [Google Scholar] [CrossRef] - Duda, J.; Vrentas, J. Mathematical analysis of bubble dissolution. AIChE J.
**1969**, 15, 351–356. [Google Scholar] [CrossRef] - Naji Meidani, A.R.; Hasan, M. Mathematical and physical modelling of bubble growth due to ultrasound. Appl. Math. Model.
**2004**, 28, 333–351. [Google Scholar] [CrossRef] - Haykin, S. Neural Networks: A Comprehensive Foundation., 2nd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 1998; ISBN 978-0-13-273350-2. [Google Scholar]
- Jang, J.-S.R. Neuro-Fuzzy and Soft Computing: A Computational Approach to Learning and Machine Intelligence; Prentice Hall: Upper Saddle River, NJ, USA, 1997; ISBN 978-0-13-261066-7. [Google Scholar]
- Witten, I.H.; Frank, E.; Hall, M.A.; Pal, C.J. Data Mining: Practical Machine Learning Tools and Techniques, 4th ed.; Morgan Kaufmann: San Francisco, CA, USA, 2016; ISBN 978-0-12-804357-8. [Google Scholar]
- Cosmin, A.; Elena, A.; Naif, A.; Timon, R. Artificial Neural Network Methods for the Solution of Second Order Boundary Value Problems. Comput. Mater. Contin.
**2019**, 59, 345–359. [Google Scholar] - Yusoff, N.I.M.; Ibrahim Alhamali, D.; Ibrahim, A.N.H.; Rosyidi, S.A.P.; Abdul Hassan, N. Engineering characteristics of nanosilica/polymer-modified bitumen and predicting their rheological properties using multilayer perceptron neural network model. Constr. Build. Mater.
**2019**, 204, 781–799. [Google Scholar] [CrossRef] - Hasnaoui, H.; Krea, M.; Roizard, D. Neural networks for the prediction of polymer permeability to gases. J. Membr. Sci.
**2017**, 541, 541–549. [Google Scholar] [CrossRef] - Diaconescu, R.-M.; Barbuta, M.; Harja, M. Prediction of properties of polymer concrete composite with tire rubber using neural networks. Mater. Sci. Eng. B
**2013**, 178, 1259–1267. [Google Scholar] [CrossRef] - Khader, M.H.; Hamid, G.; Xiaoying, Z.; Naif, A.; Timon, R. Computational Machine Learning Representation for the Flexoelectricity Effect in Truncated Pyramid Structures. Comput. Mater. Contin.
**2019**, 59, 79–87. [Google Scholar] - Dao, D.V.; Trinh, S.H.; Ly, H.-B.; Pham, B.T. Prediction of Compressive Strength of Geopolymer Concrete Using Entirely Steel Slag Aggregates: Novel Hybrid Artificial Intelligence Approaches. Appl. Sci.
**2019**, 9, 1113. [Google Scholar] [CrossRef] - Xu, D.; Pop-Iliev, R.; Park, C.B.; Fenton, R. Fundamental study of CBA-blown bubble growth and collapse under atmospheric pressure. J. Cell. Plast.
**2005**, 41, 519–538. [Google Scholar] [CrossRef] - Wu, S. Polymer Interface and Adhesion; CRC Press: Boca Raton, FL, USA, 1982. [Google Scholar]
- Bird, R.B.; Stewart, W.E.; Lightfoot, E.N. Transport Phenomena; John Wiley & Sons: Hoboken, NJ, USA, 2007. [Google Scholar]
- Durrill, P.L.; Griskey, R.G. Diffusion and solution of gases into thermally softened or molten polymers: Part II. Relation of diffusivities and solubilities with temperature pressure and structural characteristics. AIChE J.
**1969**, 15, 106–110. [Google Scholar] [CrossRef] - Khorsheed, M.S.; Al-Thubaity, A.O. Comparative evaluation of text classification techniques using a large diverse Arabic dataset. Lang Resour. Eval.
**2013**, 47, 513–538. [Google Scholar] [CrossRef] - Leema, N.; Nehemiah, H.K.; Kannan, A. Neural network classifier optimization using Differential Evolution with Global Information and Back Propagation algorithm for clinical datasets. Appl. Soft Comput.
**2016**, 49, 834–844. [Google Scholar] [CrossRef] - Quinlan, J.R. Induction of decision trees. Mach. Learn.
**1986**, 1, 81–106. [Google Scholar] [CrossRef] - Amirabadizadeh, A.; Nezami, H.; Vaughn, M.G.; Nakhaee, S.; Mehrpour, O. Identifying risk factors for drug use in an Iranian treatment sample: A prediction approach using decision trees. Subst. Use Misuse
**2018**, 53, 1030–1040. [Google Scholar] [CrossRef] - Khosravi, K.; Pham, B.T.; Chapi, K.; Shirzadi, A.; Shahabi, H.; Revhaug, I.; Prakash, I.; Tien Bui, D. A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Sci. Total Environ.
**2018**, 627, 744–755. [Google Scholar] [CrossRef] - Thai Pham, B.; Bui, D.T.; Prakash, I. Landslide susceptibility modelling using different advanced decision trees methods. Civ. Eng. Environ. Syst.
**2019**, 1–19. [Google Scholar] [CrossRef] - Wang, J.; Li, P.; Ran, R.; Che, Y.; Zhou, Y. A Short-Term Photovoltaic Power Prediction Model Based on the Gradient Boost Decision Tree. Appl. Sci.
**2018**, 8, 689. [Google Scholar] [CrossRef] - Akin, M.; Hand, C.; Eyduran, E.; Reed, B.M. Predicting minor nutrient requirements of hazelnut shoot cultures using regression trees. Plant Cell Tissue Organ Cult. (PCTOC)
**2018**, 132, 545–559. [Google Scholar] [CrossRef] - Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed.; Springer Series in Statistics; Springer: New York, NY, USA, 2009; ISBN 978-0-387-84857-0. [Google Scholar]
- Zhang, Y.; Haghani, A. A gradient boosting method to improve travel time prediction. Transp. Res. Part C Emerg. Technol.
**2015**, 58, 308–324. [Google Scholar] [CrossRef] - Austin, P.C.; Tu, J.V.; Ho, J.E.; Levy, D.; Lee, D.S. Using methods from the data-mining and machine-learning literature for disease classification and prediction: A case study examining classification of heart failure subtypes. J. Clin. Epidemiol.
**2013**, 66, 398–407. [Google Scholar] [CrossRef] [PubMed] - Winkler, D.; Haltmeier, M.; Kleidorfer, M.; Rauch, W.; Tscheikner-Gratl, F. Pipe failure modelling for water distribution networks using boosted decision trees. Struct. Infrastruct. Eng.
**2018**, 14, 1402–1411. [Google Scholar] [CrossRef] - Serrano-Cinca, C.; Gutiérrez-Nieto, B. Partial Least Square Discriminant Analysis for bankruptcy prediction. Decis. Support Syst.
**2013**, 54, 1245–1255. [Google Scholar] [CrossRef] - Elith, J.; Leathwick, J.R.; Hastie, T. A working guide to boosted regression trees. J. Anim. Ecol.
**2008**, 77, 802–813. [Google Scholar] [CrossRef] - Zięba, M.; Tomczak, S.K.; Tomczak, J.M. Ensemble boosted trees with synthetic features generation in application to bankruptcy prediction. Expert Syst. Appl.
**2016**, 58, 93–101. [Google Scholar] [CrossRef] - De’ath, G. Boosted Trees for Ecological Modeling and Prediction. Ecology
**2007**, 88, 243–251. [Google Scholar] [CrossRef] - Abdar, M.; Yen, N.Y.; Hung, J.C.-S. Improving the diagnosis of liver disease using multilayer perceptron neural network and boosted decision trees. J. Med. Boil. Eng.
**2017**, 38, 953–965. [Google Scholar] [CrossRef] - Chung, Y.-S. Factor complexity of crash occurrence: An empirical demonstration using boosted regression trees. Accid. Anal. Prev.
**2013**, 61, 107–118. [Google Scholar] [CrossRef] - Ghasemi, H.; Rafiee, R.; Zhuang, X.; Muthu, J.; Rabczuk, T. Uncertainties propagation in metamodel-based probabilistic optimization of CNT/polymer composite structure using stochastic multi-scale modeling. Comput. Mater. Sci.
**2014**, 85, 295–305. [Google Scholar] [CrossRef] - Vu-Bac, N.; Rafiee, R.; Zhuang, X.; Lahmer, T.; Rabczuk, T. Uncertainty quantification for multiscale modeling of polymer nanocomposites with correlated parameters. Compos. Part B Eng.
**2015**, 68, 446–464. [Google Scholar] [CrossRef] - Le, T.T.; Guilleminot, J.; Soize, C. Stochastic continuum modeling of random interphases from atomistic simulations. Application to a polymer nanocomposite. Comput. Methods Appl. Mech. Eng.
**2016**, 303, 430–449. [Google Scholar] [CrossRef] - Capillon, R.; Desceliers, C.; Soize, C. Uncertainty quantification in computational linear structural dynamics for viscoelastic composite structures. Comput. Methods Appl. Mech. Eng.
**2016**, 305, 154–172. [Google Scholar] [CrossRef] - Robert, C.; Casella, G. Monte Carlo Statistical Methods, 2nd ed.; Springer Texts in Statistics; Springer: New York, NY, USA, 2004; ISBN 978-0-387-21239-5. [Google Scholar]
- Mordechai, S. Applications of Monte Carlo Method in Science and Engineering; InTech: Rijeka, Croatia, 2011; ISBN 978-953-307-691-1. [Google Scholar]
- Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE)?—Arguments against avoiding RMSE in the literature. Geosci. Model Dev.
**2014**, 7, 1247–1250. [Google Scholar] [CrossRef] - Menard, S. Coefficients of Determination for Multiple Logistic Regression Analysis. Am. Stat.
**2000**, 54, 17–24. [Google Scholar] - Nagelkerke, N. A note on a general definition of the coefficient of determination. Biometrika
**1991**, 78, 691–692. [Google Scholar] [CrossRef] - Willmott, C.J.; Matsuura, K. Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim. Res.
**2005**, 30, 79–82. [Google Scholar] [CrossRef] - Dao, D.V.; Ly, H.-B.; Trinh, S.H.; Le, T.-T.; Pham, B.T. Artificial Intelligence Approaches for Prediction of Compressive Strength of Geopolymer Concrete. Materials
**2019**, 12, 983. [Google Scholar] [CrossRef] [PubMed] - Epstein, P.S.; Plesset, M.S. On the stability of gas bubbles in liquid-gas solutions. J. Chem. Phys.
**1950**, 18, 1505–1509. [Google Scholar] [CrossRef] - Hamdia, K.M.; Msekh, M.A.; Silani, M.; Thai, T.Q.; Budarapu, P.R.; Rabczuk, T. Assessment of computational fracture models using Bayesian method. Eng. Fract. Mech.
**2019**, 205, 387–398. [Google Scholar] [CrossRef]

**Figure 3.**Histogram of relative error between predicted and output values of training (

**left**) and testing (

**right**) part of Ensemble Bagged Trees (EDT Bagged) and Ensemble Boosted Trees (EDT Boosted) algorithms.

**Figure 4.**Error distribution (Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and coefficient of determination (R

^{2})) of different models for 1000 simulations: (

**a**–

**c**) EDT Boosted and (

**d**–

**f**) EDT Bagged algorithms.

**Figure 5.**Statistical convergence analysis of (

**a**) RMSE, (

**b**) MAE, and (

**c**) R

^{2}over 1000 Monte Carlo simulations for EDT Bagged and EDT Boosted algorithms. Blue lines representing ±1% deviation around the average value.

**Figure 6.**Histogram of RMSE, MAE, and R

^{2}for 14,000 simulations in case of: (

**a**–

**c**) EDT Boosted model, and (

**d**–

**f**) EDT Bagged algorithm.

**Figure 7.**Statistical convergence analysis of RMSE, MAE, and R

^{2}in case of: (

**a**–

**c**) diffusion coefficient excluded, (

**d**–

**f**) initial saturation excluded, and (

**g**–

**i**) initial bubble size excluded. Blue lines representing ±1% deviation around the average value.

Input Parameter | Minimum | Maximum | Average | Standard Deviation |
---|---|---|---|---|

I_{1} (bubble radius, µm) | 30 | 150 | 112.4 | 19.37 |

I_{2} (external radius, µm) | 33 | 150,000 | 333.1 | 422.15 |

I_{3} (diffusion coefficient, m^{2}·s^{−1}) | 0.1 × 10^{−9} | 50 × 10^{−9} | 4.1 × 10^{−9} | 6.79 × 10^{−9} |

I_{4} (surface tension, N·m^{−1}) | 0.0100 | 0.0500 | 0.0288 | 0.0144 |

I_{5} (viscosity, Pa·s) | 100 | 10,000 | 3670.5 | 3701.3 |

I_{6} (saturation) | 0.1 | 1 | 0.54 | 0.37 |

I_{7} (chamber pressure, atm) | 0.25 | 1.5 | 0.88 | 0.41 |

O (bubble dissolution time, s) | 20 | 1200 | 194.2 | 234.9 |

**Table 2.**Sensitivity analysis of input parameters with respect to RMSE, MAE, and R

^{2}obtained after 1000 Monte Carlo simulations for each case: full simulation with 7 inputs (No excl.) and excluding successively I

_{1}to I

_{7}(I

_{1,}…, I

_{7}excl.).

Criteria | Input Excl. | EDT Bagged | EDT Boosted | ||||||
---|---|---|---|---|---|---|---|---|---|

Min | Max | Mean | SD | Min | Max | Mean | SD | ||

RMSE | No excl. | 19.20 | 28.27 | 23.78 | 1.26 | 135.86 | 146.70 | 141.16 | 1.79 |

I_{1} excl. | 97.48 | 104.97 | 100.80 | 1.15 | 153.04 | 161.99 | 157.11 | 1.49 | |

I_{2} excl. | 56.13 | 63.04 | 58.84 | 1.02 | 138.32 | 150.13 | 142.84 | 1.68 | |

I_{3} excl. | 215.67 | 224.28 | 220.44 | 1.18 | 222.84 | 223.37 | 228.18 | 1.61 | |

I_{4} excl. | 40.06 | 48.21 | 43.71 | 1.12 | 136.29 | 147.91 | 141.28 | 1.69 | |

I_{5} excl. | 23.50 | 36.27 | 27.83 | 1.70 | 136.29 | 147.91 | 141.28 | 1.69 | |

I_{6} excl. | 120.65 | 128.45 | 124.63 | 1.12 | 178.98 | 189.90 | 184.26 | 1.67 | |

I_{7} excl. | 33.37 | 41.85 | 37.24 | 1.28 | 136.29 | 147.91 | 141.28 | 1.69 | |

MAE | No excl. | 7.93 | 10.82 | 9.14 | 0.47 | 77.30 | 82.33 | 79.62 | 0.91 |

I_{1} excl. | 46.87 | 51.16 | 48.86 | 0.69 | 85.76 | 90.81 | 87.85 | 0.82 | |

I_{2} excl. | 24.95 | 31.25 | 26.57 | 0.63 | 78.39 | 83.60 | 80.54 | 0.85 | |

I_{3} excl. | 163.07 | 167.72 | 165.46 | 0.73 | 145.98 | 152.37 | 149.14 | 0.88 | |

I_{4} excl. | 15.13 | 20.76 | 17.15 | 0.75 | 77.41 | 82.77 | 79.65 | 0.86 | |

I_{5} excl. | 10.15 | 15.99 | 12.54 | 0.92 | 77.41 | 82.77 | 79.65 | 0.86 | |

I_{6} excl. | 69.15 | 73.83 | 71.64 | 0.71 | 98.73 | 104.90 | 101.63 | 0.90 | |

I_{7} excl. | 13.24 | 19.13 | 15.18 | 0.83 | 77.41 | 82.77 | 79.65 | 0.86 | |

R^{2} | No excl. | 0.986 | 0.993 | 0.990 | 0.001 | 0.832 | 0.870 | 0.856 | 0.006 |

I_{1} excl. | 0.802 | 0.827 | 0.816 | 0.004 | 0.721 | 0.758 | 0.742 | 0.005 | |

I_{2} excl. | 0.929 | 0.943 | 0.938 | 0.002 | 0.808 | 0.858 | 0.843 | 0.007 | |

I_{3} excl. | 0.100 | 0.144 | 0.122 | 0.006 | 0.153 | 0.183 | 0.168 | 0.005 | |

I_{4} excl. | 0.960 | 0.971 | 0.966 | 0.002 | 0.831 | 0.872 | 0.855 | 0.006 | |

I_{5} excl. | 0.978 | 0.990 | 0.987 | 0.002 | 0.831 | 0.872 | 0.855 | 0.006 | |

I_{6} excl. | 0.704 | 0.731 | 0.719 | 0.004 | 0.533 | 0.573 | 0.554 | 0.007 | |

I_{7} excl. | 0.970 | 0.980 | 0.975 | 0.002 | 0.831 | 0.872 | 0.855 | 0.006 |

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).