Post Weld Heat Treatment Optimization of Dissimilar Friction Stir Welded AA2024-T3 and AA7075-T651 Using Machine Learning and Metaheuristics

Post weld heat treatment, or PWHT, is often used to improve the mechanical properties of materials that have been welded. Several publications have investigated the effects of the PWHT process using experimental designs. However, the modeling and optimization using the integration of machine learning (ML) and metaheuristics have yet to be reported, which are fundamental steps toward intelligent manufacturing applications. This research proposes a novel approach using ML techniques and metaheuristics to optimize PWHT process parameters. The goal is to determine the optimal PWHT parameters for both single and multiple objective perspectives. In this research, support vector regression (SVR), K-nearest neighbors (KNN), decision tree (DT), and random forest (RF) were ML techniques employed to obtain a relationship model between PWHT parameters and mechanical properties: ultimate tensile strength (UTS) and elongation percentage (EL). The results show that the SVR demonstrated superior performance among ML techniques for both UTS and EL models. Then, SVR is used with metaheuristics such as differential evolution (DE), particle swarm optimization (PSO), and genetic algorithms (GA). SVR-PSO shows the fastest convergence among other combinations. The final solutions of single-objective and Pareto solutions were also suggested in this research.


Introduction
Friction stir welding (FSW) was invented in 1991 at The Welding Institute (TWI) of the UK as a solid-state joining process [1]. FSW is extensively applied to high-strength aluminum alloys, especially the alloyed AA2XXX and AA7XXX series, which are widely used in important manufacturing industries such as automobiles, aircraft, aerospace, and military industries due to their high strength, low density, high fracture toughness, good electrical conductivity, and corrosion resistance [2]. FSW results in intense plastic deformation around the rotating tool and friction between the tool and workpieces. Both of these factors increase the temperature within and around the stirred zone and generate a softened region within the weld zone because of the dissolution of strengthening precipitates [1,3]. Therefore, as a consequence, precipitation-hardening aluminum alloy joints generally have a lower weld strength than base metal [4,5]. Many techniques are being considered to improve the properties of weld joints, such as welding process parameter optimization [6][7][8] and post weld heat treatment (PWHT) [9][10][11][12][13][14][15]. Güven et al. [15] compared the mechanical properties of friction stir welded AA6061 plates before and after applying PWHT. The result showed that the joint performance values of PWHT specimens in terms of proof stress, tensile strength, and elongation were slightly better at 84, 88, and 44 pct, respectively. Hakan et al. [9] also observed that the PWHT process increased the tensile strength efficiency and yield strength efficiency of the as-welded joint to 87.4% and 98%, respectively. Chaitanya et al. [10] examined the influence of PWHT on microstructure and mechanical properties. PWHT increased the size of aluminum grains in all zones of friction stir weld joints, and naturally aged joints offered the best mechanical properties. Sivaraj et al. [11] also reported that after solution treatment and artificial aging, the yield strength and tensile strength of friction stir-welded AA7075 aluminum alloy joints were increased to 346 MPa and 445 MPa, respectively, increasing joint efficiency by 9% over as-welded joints. According to the literature review, the PWHT process parameters had a direct impact on the properties of welded joints, such as tensile strength, hardness, residual stress, and toughness. For example, Vinay Kumar et al. [14] demonstrated that PWHT significantly improved the ductility and the hardness of the welded joint at 200 • C for 10 h compared to the as-welded joint and the PWHT joint at 190 • C for 10 h. In addition, we found that several researchers [16][17][18][19] studied the influence of the PWHT process on the same metal, but they used different parameters. Hence, many researchers intend to optimize the parameters of the PWHT process. Chin Chun et al. [20] used the Taguchi experimental design method to analyze optimal heat treatment parameters for the 7050 aluminum alloy. Mehmet et al. [21] also investigated the effect of PWHT parameters with the Taguchi method and used the genetic algorithm for optimization. It clearly shows that the function of PWHT is still implemented by using traditional methods such as trial-and-error, design of experiments (DOE), response surface methodology (RSM), or the Taguchi method. In contrast, the other manufacturing process uses more intelligent approaches such as AI and machine learning (ML) techniques to obtain the advanced process.
The ML technique is the science of teaching machines to learn on their own to solve real-time problems based on input data. Recently, ML has been applied in many research fields and has continuously grown with more authors to increase computer systems' intelligence [22]. In manufacturing, the applications of ML refer to pattern recognition in existing sets of data for forecasting the future behavior of the system [23] and provide the promising potential to improve quality control optimization in the manufacturing process [24]. For example, Kaneko and Funatsu [25] made an application of the online SVR model for soft sensor development. He and Wang [26] developed a statistical pattern analysis and process monitoring method based on the K-nearest neighbors (KNN) method. Tauqir et al. [27] studied the applications of ML to FSW process optimization. The results show that artificial neural networks (ANN) outperformed RSM in terms of robustness and accuracy. In addition, they found that the DOE provided a lack of a systematic approach to process parameters. In the phase of cloud manufacturing [28] and smart manufacturing, known as industry 4.0, currently, ML has the ability to become the main driver in uncovering fine-grained, intricate production patterns in the smart manufacturing paradigm [29]. On the other hand, we found no application of ML models to implement for the PWHT experiment in the literature review, because the PWHT process is difficult to test on a large scale, time-consuming, and expensive. These problems are consistent with the contribution of ML techniques to solving problems and increasing intelligence. SVR, KNN, decision tree (DT), and random forest (RF) are examples of supervised machine learning algorithms used in manufacturing [30]. The goal of these techniques is to produce a predicted model based on the training data. In this study, four ML techniques are employed to create a model of the PWHT process to determine the relationship between parameters and two mechanical properties, such as ultimate tensile strength and elongation percentage. Then, the process parameter optimization is carried out using metaheuristic algorithms. Differential evolution (DE), particle swarm optimization (PSO), and genetic algorithms (GA) are population-based metaheuristic approaches that are popularly used in the field of manufacturing process optimization [21,[31][32][33][34]. Ezra and Weihang [35] reported that GA and DE are more common for solving both process and system optimization problems among the 116 papers in their survey of the food manufacturing industry. Nizar et al. [36] demonstrated that PSO, real-coded GA, and DE obtained the same optimum for friction stir welding parameter optimization, but PSO had a faster convergence than the other algorithms. Ricardo et al. [37] discovered that PSO provided a better solution than GAs for a given micro-milling operation. For solving both single-and multi-objective problems in this research, DE, PSO and GA were executed. The specific contributions of this research are as follows: • A novel approach using ML techniques and a metaheuristic algorithm is proposed for solving single-and multi-objective problems of PWHT process parameter optimization.

•
The relationship between parameters and mechanical properties in the PWHT process of dissimilar friction stir welded 2024-T3 and 7075-T651 aluminum alloys are investigated using ML techniques to model single and multi-objective.

•
In this study, the use of ML techniques combined with metaheuristics encourages the intelligent manufacturing process, particularly the PWHT process, to be more adaptive, real-time, faster, and cost-effective.

Workpiece Material
In this study, AA7075-T651 and AA2024-T3 with dimensions 200 mm × 100 mm × 3 mm were used for the experiment. The chemical compositions of two materials were measured by the energy-dispersive X-ray fluorescence (EDXRF) method using the JEOL model JSX3400R, as shown in Table 1.

Friction Stir Welding
The FSW was used to fabricate dissimilar aluminum alloy plate joints between AA7075-T651 and AA2024-T3. The tool was made of high-carbon steel SKD61 and heat-treated to 55 HRC. The dimensions of the welding tool are in millimeters, as shown in Figure 1. The welding parameters were set at a rotational speed of 1200 rpm, a welding speed of 100 mm/min [38], and a plunge depth of 0.2 mm. The support system and welding setup are shown in Figure 2. popularly used in the field of manufacturing process optimization [21,[31][32][33][34]. Ezra and Weihang [35] reported that GA and DE are more common for solving both process and system optimization problems among the 116 papers in their survey of the food manufacturing industry. Nizar et al. [36] demonstrated that PSO, real-coded GA, and DE obtained the same optimum for friction stir welding parameter optimization, but PSO had a faster convergence than the other algorithms. Ricardo et al. [37] discovered that PSO provided a better solution than GAs for a given micro-milling operation. For solving both singleand multi-objective problems in this research, DE, PSO and GA were executed. The specific contributions of this research are as follows: • A novel approach using ML techniques and a metaheuristic algorithm is proposed for solving single-and multi-objective problems of PWHT process parameter optimization.

•
The relationship between parameters and mechanical properties in the PWHT process of dissimilar friction stir welded 2024-T3 and 7075-T651 aluminum alloys are investigated using ML techniques to model single and multi-objective.
• In this study, the use of ML techniques combined with metaheuristics encourages the intelligent manufacturing process, particularly the PWHT process, to be more adaptive, real-time, faster, and cost-effective.

Workpiece Material
In this study, AA7075-T651 and AA2024-T3 with dimensions 200 mm × 100 mm × 3 mm were used for the experiment. The chemical compositions of two materials were measured by the energy-dispersive X-ray fluorescence (EDXRF) method using the JEOL model JSX3400R, as shown in Table 1.

Friction Stir Welding
The FSW was used to fabricate dissimilar aluminum alloy plate joints between AA7075-T651 and AA2024-T3. The tool was made of high-carbon steel SKD61 and heattreated to 55 HRC. The dimensions of the welding tool are in millimeters, as shown in Figure 1. The welding parameters were set at a rotational speed of 1200 rpm, a welding speed of 100 mm/min [38], and a plunge depth of 0.2 mm. The support system and welding setup are shown in Figure 2.

Post Weld Heat Treatment (PWHT)
After the welding process, a joined plate was cut into three parts in the same transverse direction as the welding specimens, and their dimensions were 200 mm × 100 mm × 3 mm. A full factorial design of 2 4 with 4 center points and 2 replications, resulting in 36 experiments, was implemented. Each part was tested in different conditions of the PWHT process parameters, as shown in Table 2. The workpieces were heated to the solid solution temperature and soaked in a hardening furnace (Manufacturer: Nabertherm, Germany) in the first stage. Model: N41/H). Subsequently, the heated workpieces were transferred within 10 s and quenched in water to obtain a supersaturated solid solution state at different temperatures. After the cooling operation, the quenched parts were artificially aged to reach full strength.

Tensile Test
The workpieces were machined to the dimensions of a tensile test specimen following the ASTM B 557 M standard (25 mm gauge length), as shown in Figure 3. The tensile test was conducted using a universal testing machine from Cometech, model QC-506. The test rate was 5 mm/min. The ultimate tensile strength (UTS) was evaluated, and the elongation percentage (%EL) was calculated. Figure 4a,b show photographs of tensile test specimens before and after the test.

Post Weld Heat Treatment (PWHT)
After the welding process, a joined plate was cut into three parts in the same transverse direction as the welding specimens, and their dimensions were 200 mm × 100 mm × 3 mm. A full factorial design of 2 4 with 4 center points and 2 replications, resulting in 36 experiments, was implemented. Each part was tested in different conditions of the PWHT process parameters, as shown in Table 2. The workpieces were heated to the solid solution temperature and soaked in a hardening furnace (Manufacturer: Nabertherm, Germany) in the first stage. Model: N41/H). Subsequently, the heated workpieces were transferred within 10 s and quenched in water to obtain a supersaturated solid solution state at different temperatures. After the cooling operation, the quenched parts were artificially aged to reach full strength.

Tensile Test
The workpieces were machined to the dimensions of a tensile test specimen following the ASTM B 557 M standard (25 mm gauge length), as shown in Figure 3. The tensile test was conducted using a universal testing machine from Cometech, model QC-506. The test rate was 5 mm/min. The ultimate tensile strength (UTS) was evaluated, and the elongation percentage (%EL) was calculated. Figure 4a,b show photographs of tensile test specimens before and after the test.

Post Weld Heat Treatment (PWHT)
After the welding process, a joined plate was cut into three parts in the same transverse direction as the welding specimens, and their dimensions were 200 mm × 100 mm × 3 mm. A full factorial design of 2 4 with 4 center points and 2 replications, resulting in 36 experiments, was implemented. Each part was tested in different conditions of the PWHT process parameters, as shown in Table 2. The workpieces were heated to the solid solution temperature and soaked in a hardening furnace (Manufacturer: Nabertherm, Germany) in the first stage. Model: N41/H). Subsequently, the heated workpieces were transferred within 10 s and quenched in water to obtain a supersaturated solid solution state at different temperatures. After the cooling operation, the quenched parts were artificially aged to reach full strength.

Tensile Test
The workpieces were machined to the dimensions of a tensile test specimen following the ASTM B 557 M standard (25 mm gauge length), as shown in Figure 3. The tensile test was conducted using a universal testing machine from Cometech, model QC-506. The test rate was 5 mm/min. The ultimate tensile strength (UTS) was evaluated, and the elongation percentage (%EL) was calculated. Figure 4a,b show photographs of tensile test specimens before and after the test.

Modeling by Machine Learning Techniques
In this section, the Python programming language was executed on Spyder (Anaconda 3) software to subject the supervised machine learning algorithms to testing. For preprocessing, the MinMaxScaler was used to normalize the input features into a range [0, 1]. In this study, 28 out of 36 datasets (approximately 80%) were used as training datasets in machine learning models. The results suggested the most suitable hyperparameters using Grid Search Cross Validation (CV = 3). Then, 8 out of 36 datasets (approximately 20% of the dataset) were used to evaluate the model. Five variations in the random state were applied for all runs and repeated three times. The model's prediction performance was measured by the statistical MSE. The conceptual workflow in Figure 5 shows the steps of the ML technique and the connection point between ML and metaheuristic algorithms. First, the database collected from the PWHT experiment was split into training and testing datasets and normalized. Then, using the ML technique learned from the training dataset, a model was constructed using grid search CV to tune the hyperparameters. The model was validated by testing a dataset and saved. In the optimization process, the model from ML was used as a fitness function. The process started with setting the initial algorithm parameters to Iter (iteration) = 1 before calculating the fitness value. If Iter < Itermax, the fitness value was updated and reproduced. If Iter = Itermax, then the next step proceeded. The final solution of a random state was stored. It was checked whether all random states were used or not. If not, the first step was returned to, and the process was restarted.

Modeling by Machine Learning Techniques
In this section, the Python programming language was executed on Spyder (Anaconda 3) software to subject the supervised machine learning algorithms to testing. For preprocessing, the MinMaxScaler was used to normalize the input features into a range [0, 1]. In this study, 28 out of 36 datasets (approximately 80%) were used as training datasets in machine learning models. The results suggested the most suitable hyperparameters using Grid Search Cross Validation (CV = 3). Then, 8 out of 36 datasets (approximately 20% of the dataset) were used to evaluate the model. Five variations in the random state were applied for all runs and repeated three times. The model's prediction performance was measured by the statistical MSE. The conceptual workflow in Figure 5 shows the steps of the ML technique and the connection point between ML and metaheuristic algorithms. First, the database collected from the PWHT experiment was split into training and testing datasets and normalized. Then, using the ML technique learned from the training dataset, a model was constructed using grid search CV to tune the hyperparameters. The model was validated by testing a dataset and saved. In the optimization process, the model from ML was used as a fitness function. The process started with setting the initial algorithm parameters to Iter (iteration) = 1 before calculating the fitness value. If Iter < Iter max , the fitness value was updated and reproduced. If Iter = Iter max , then the next step proceeded. The final solution of a random state was stored. It was checked whether all random states were used or not. If not, the first step was returned to, and the process was restarted.

Process Parameters Optimization
In this research, differential evolution (DE), particle swarm optimization (PSO), and  In this research, differential evolution (DE), particle swarm optimization (PSO), and genetic algorithms (GA) were applied to parameter optimization. The fitness function was conducted using the ML technique as Equations (1) and (2). For all algorithms, population and maximum iteration were set to 50 and 300, respectively. Due to four input factors, the search space was limited to four dimensions. The upper and lower bounds of solid solution time (A), solid solution temperature (B), aging time (C), and aging temperature (D) are presented as follows: (1) Subject to:

Multi-Objective Optimization
In several multi-objective optimizations, it is difficult to simultaneously provide a satisfying solution for different objectives. Therefore, a possible answer for multi-objective optimization is a set of solutions called Pareto-optimal solutions or non-dominated solutions. Hajela and Lin [39] introduced a popular approach that is based on simple aggregation. They use the weighted aggregation method for fitness assignments. This approach is easy to use for solving multi-objective problems. Weight factors were varied to obtain all solutions of a combination and show the different relations of two objective functions.

1.
Weighted aggregation method The most common method to convert a multi-objective problem into a single objective is using weight factors. Based on this research, the two objective functions were summed up with varying weights, giving Equation (3).
where f 1 and f 2 represent two objective functions; X is the decision variables; w 1 and w 2 are the weight factors. Define w 1 = k/N. The combined objective function in Equation (3) can be rewritten as subject to w 1 + w 2 = 1, w 1 ≥ 0, and w 2 ≥ 0 where N is the population size, and k starts from 1 to N (use N = 20). The framework of the weight sum method is described as Algorithms 1 follows.

Hypervolume indicator
The hypervolume indicator [40] is a measure of the quality of a set p = {p (1) , p (2) , . . . , p (n) } of n nondominated objective vectors produced by the multi-objective optimizer. This indicator consists of the measure of the region dominated by P and bounded above by a reference point r ∈ R d , such that r ≥ max p p 1 , . . . , max p p d , where p = (p 1 , . . . , p d ) ∈ P ⊂ R d , and the relation ≥ applies component wise [41], as shown in Figure 6. In this work, the performance of each metaheuristic algorithm was measured by computing the hypervolume metric and comparing it with the region's size. The hypervolume indicator [40] is a measure of the quality of a set = ( ) , ( ) , … , ( ) of n nondominated objective vectors produced by the multi-objective optimizer. This indicator consists of the measure of the region dominated by P and bounded above by a reference point ∈ , such that ≥ ( , … , ), where = ( , … , ) ∈ ⊂ , and the relation ≥ applies component wise [41], as shown in Figure 6. In this work, the performance of each metaheuristic algorithm was measured by computing the hypervolume metric and comparing it with the region's size.

Results and Discussion
As mentioned, the full factorial design (2 4 with four center points and two replications) was used in the experiment. The two responses, UTS and EL, are reported in Table 3.
The analysis of variance (ANOVA) technique used statistical data analysis to provide a significant process parameter for the experimental results of the ultimate tensile strength (UTS) and elongation percentage (EL). In this study, an ANOVA was executed on Minitab 18 with a significant level of 5%, i.e., a confidence level of 95%. When the p-value is less than 0.05, the significance of the process parameters to the response value is acceptable. The ANOVA result of UTS in Table 4 shows that three parameters, solid solution temperature (B), aging time (C), and aging temperature (D), had significant effects, but solid solution time (A) was insignificant due to a p-value of >0.05 [42]. The two-way, three-way, and four-way interactions are also demonstrated to be substantial. As shown in Figure 7, the main effect plot reveals that higher UTS can be obtained with high solution temperatures. Increasing the solution time, aging time, and temperature reduces UTS, as observed by Polmear and Couper [43]. Table 3. Design matrix and experimental results of PWHT process.

StdOrder RunOrder CenterPt Blocks
Factor Response  Adj SS = Adjusted sums of squares; Adj MS = Adjusted mean squares; Fvalue = mean sum-of-squares for regression/mean sum-of-squares for residual; p-value = the smallest level of significance at which the data are significant. An ANOVA result for the EL response is shown in Table 5. Three parameters, solution time (A), solution temperature (B), and aging time (C), have significant effects with p-values smaller than 0.05. According to P. Sivaraj et al. [44], decreasing the aging time reduced the elongation percentage. The reduction in ductility is attributed to the abnormal grain growth and coarsening of the hardening precipitate. Some two-way and three-way interactions also demonstrate significance, but none of the four-way interactions. The main effect plot, as shown in Figure 8, determines the influence of each parameter on the elongation percentage. From Tables 4 and 5, the p-values of the UTS and EL models are smaller than 0.05. It means the parameters are significant to construct models for UTS and EL prediction. An ANOVA result for the EL response is shown in Table 5. Three parameters, solution time (A), solution temperature (B), and aging time (C), have significant effects with p-values smaller than 0.05. According to P. Sivaraj et al. [44], decreasing the aging time reduced the elongation percentage. The reduction in ductility is attributed to the abnormal grain growth and coarsening of the hardening precipitate. Some two-way and three-way interactions also demonstrate significance, but none of the four-way interactions. The main effect plot, as shown in Figure 8, determines the influence of each parameter on the elongation percentage. From Tables 4 and 5, the p-values of the UTS and EL models are smaller than 0.05. It means the parameters are significant to construct models for UTS and EL prediction.   In the model development section, the experimental PWHT process dataset is smaller than the general problem. Therefore, the consistency of the ML performance in a different random state and the effect of randomization on the dataset were considered by using five variations in random states. Grid search cross-validation was applied to tune hyperpa- In the model development section, the experimental PWHT process dataset is smaller than the general problem. Therefore, the consistency of the ML performance in a different random state and the effect of randomization on the dataset were considered by using five variations in random states. Grid search cross-validation was applied to tune hyperparameters for the ML technique, and the final values are shown in Tables 6-9. The ML technique used different hyperparameters in different random states. Since ML strongly depends on the input dataset, tuning is always applied before prediction. The box plot of the MSE and MAE for different ML techniques is shown in Figure 9, and the average values are shown in Table 10. For the UTS model, the average MSE of SVR, DT, KNN, and RF are 0.03748, 0.06455, 0.03, and 0.05591, respectively. For the EL model, the average MSE of SVR, DT, KNN, and RF are about 0.02072, 0.03971, 0.01659, and 0.04228, respectively. It demonstrates that SVR and KNN build the UTS model and EL model with the lowest average MSE and MAE, whereas DT and RF produce more MSE with a larger range of box plots.  1  distance  distance  4  2  3  5  3  1  3  uniform  distance  1  2  1  3  1  2  5  uniform  distance  1  4  9  5  2  3  7  distance  distance  2  2  10  9  1  1  9  distance  uniform  3  1  2  1 2 3  Figure 9. Box plot of training and testing MSE for UTS and EL model. Figure 9. Box plot of training and testing MSE for UTS and EL model. In this study, UTS and EL models were used as objective functions, with solid solution time (A), solid solution temperature (B), aging time (C), and aging temperature (D) as decision variables. The ML-based model is experimentally combined with a metaheuristic algorithm to optimize the parameters of the PWHT process and propose the appropriate combination. After a combination is run with various random states, the value of the final solution and responses are shown in a type of box plot. For the UTS model, it is observed that SVR is outstanding in giving a robust model due to providing the smallest range of box plots, as shown in Figure 10. Similar to the EL model, SVR also offers a model that does not become stuck in the local optimum during optimization [45,46] while changing the training dataset (random state) and usually gives almost the same final solution when optimization is complete, as shown in Figure 11. It also found in Figure 12a,b, SVR provide the smallest range of box plots of the final value. In contrast, DT, KNN, and RF produce an extensive range of solutions, especially DT-PSO and RF-PSO. Thus, SVR is the most suitable model based on the given data in this research, and similar results were observed by Wuest et al. [47]. Table 11 shows the value of the final solution (decision variables) and objective functions obtained by combinations of SVR and metaheuristics. When SVR works with DE and PSO algorithms, the final solutions of both UTS and EL models are the same. Additionally, GA is slightly different. The UTS can be maximized by solid solution annealing at 500 degrees Celsius for 30 min and artificial aging at 133.87 degrees Celsius for 6 h. Moreover, solid solution time, solid solution temperature, aging time, and aging temperature are suggested to be set at 30 min, 476.44 • C, 6 h, and 190 • C, respectively, for maximizing EL. To consider the convergent performance of the model, Figure 13a-f shows a graphical plot between the best fitness values and iterations. According to Figure 13b,e, PSO converges faster than other algorithms following DE and GA, respectively. that does not become stuck in the local optimum during optimization [45,46] while changing the training dataset (random state) and usually gives almost the same final solution when optimization is complete, as shown in Figure 11. It also found in Figure 12a,b, SVR provide the smallest range of box plots of the final value. In contrast, DT, KNN, and RF produce an extensive range of solutions, especially DT-PSO and RF-PSO. Thus, SVR is the most suitable model based on the given data in this research, and similar results were observed by Wuest et al. [47].       Figure 13a-f shows a graphical plot between the best fitness values and iterations. According to Figure 13b,e, PSO converges faster than other algorithms following DE and GA, respectively.    SVR was chosen for multi-objective analysis combined with metaheuristic algorithms based on its best performance on given PWHT process data in this study. Two objective functions obtained from SVR were converted into a single one by using weighted aggregation. However, the important issue in the method is the weight assignment because the solution strongly depends on the chosen weighting coefficients. Therefore, the Pareto solution was proposed to fix the problem. The result of a multi-objective optimization yields sets of trade-off solutions between UTS and EL. As shown in Figure 14, the Pareto front shows that the elongation percentage decreases with the increase in the ultimate tensile strength, as reported by [48]. Furthermore, the main advantage of the proposed multi- SVR was chosen for multi-objective analysis combined with metaheuristic algorithms based on its best performance on given PWHT process data in this study. Two objective functions obtained from SVR were converted into a single one by using weighted aggregation. However, the important issue in the method is the weight assignment because the solution strongly depends on the chosen weighting coefficients. Therefore, the Pareto solution was proposed to fix the problem. The result of a multi-objective optimization yields sets of trade-off solutions between UTS and EL. As shown in Figure 14, the Pareto front shows that the elongation percentage decreases with the increase in the ultimate tensile strength, as reported by [48]. Furthermore, the main advantage of the proposed multi-objective is that it provides sets of solutions with a variety of weighting coefficients from which to select the appropriate one to apply to the specific user-defined process as alternative solutions. In order to measure the distinction between the Pareto solutions, the hypervolume indicator was applied to quantify the quality of a solution set [49] using a single reference point r = (330,7.5). This measure aims to minimize the volume between solution sets and the reference point. It is observed from Figure 14 that the regions of hypervolume are concave due to optimizing the hypervolume in maximization problems [40,50]. The hypervolumes of SVR-DE, SVR-PSO, and SVR-GA are 12.004, 12.043, and 12.607, respectively. Table 12 shows that the SVR-DE has the smallest volume when compared to the others. It means the solution sets of SVR-DE can better reach the maximum value of two objective functions. However, the SVR-PSO also has a small volume, almost the same as the SVR-DE, whereas the SVR-PSO converges to the final solutions faster.

Conclusions
The PWHT process was conducted for dissimilarly FSWed 2024-T3 and 7075-T651 aluminum alloys. According to the experimental results and analysis carried out, it was concluded that the parameters in the experimental design significantly affected mechanical properties such as UTS and EL. The following conclusions can be drawn based on the results and discussion: 1. The SVR model demonstrated superior performance among ML techniques for both UTS and EL models based on the given data in this study.  In order to measure the distinction between the Pareto solutions, the hypervolume indicator was applied to quantify the quality of a solution set [49] using a single reference point r = (330,7.5). This measure aims to minimize the volume between solution sets and the reference point. It is observed from Figure 14 that the regions of hypervolume are concave due to optimizing the hypervolume in maximization problems [40,50]. The hypervolumes of SVR-DE, SVR-PSO, and SVR-GA are 12.004, 12.043, and 12.607, respectively. Table 12 shows that the SVR-DE has the smallest volume when compared to the others. It means the solution sets of SVR-DE can better reach the maximum value of two objective functions. However, the SVR-PSO also has a small volume, almost the same as the SVR-DE, whereas the SVR-PSO converges to the final solutions faster.

Conclusions
The PWHT process was conducted for dissimilarly FSWed 2024-T3 and 7075-T651 aluminum alloys. According to the experimental results and analysis carried out, it was concluded that the parameters in the experimental design significantly affected mechanical properties such as UTS and EL. The following conclusions can be drawn based on the results and discussion: The SVR model demonstrated superior performance among ML techniques for both UTS and EL models based on the given data in this study.
SVR-DE and SVR-PSO reached the same final solution for the UTS and EL models, but SVR-PSO shows the fastest convergence among other combinations.
The final solutions showed that the UTS could be increased by treating the solid solution at 500 • C for 30 min and aging it artificially at 133.87 • C for 6 h. For maximized EL, solid solution time, solid solution temperature, aging time, and aging temperature were set at 30 min, 476.44 • C, 6 h, and 190 • C, respectively.
The Pareto front was proposed as an alternative solution with a variety of weighting coefficients.
SVR-DE gave the smallest volume of 12.004, followed by SVR-PSO and SVR-GA with 12.043 and 12.607, respectively. It was concluded that the solution sets from SVR-DE can reach the maximum value of two objective functions more than SVR-PSO and SVR-GA.
In conclusion, this research presented contributions to developing the PWHT process in terms of model prediction and parameter optimization by using the ML technique, which is one of the main characteristics of smart manufacturing design and is expected to progress quality control. In machine learning techniques, the data used for training are not limited to the designed level like in experimental techniques. This makes the process of collecting data more flexible. When compared to the traditional design of an experiment, this would be the primary advantage. In addition, the metaheuristic approach in this study provides trade-off solutions for multi-objective problems with a variety of suitable options for different decision-makers. Further work can be applied with the finite element method (FEM) to study real-time material properties and add the significant factors of the PWHT process to obtain the results with a high-dimensional and more intelligent program.