1. Introduction
The utilization of the preforming step is important in metal forging processes because it avoids the manifestation of forging defects [
1,
2], reduces the forging load [
3,
4], and extends the die life [
5,
6]. However, preform design is conventionally performed by trial-and-error approaches based on an engineer’s experience and know-how [
7]. Thus, systematic preform design methods have been studied in recent decades.
By synthesizing the design rules in the reported literature and the experts’ know-how, an empirical equation for H-shape forging [
8], a knowledge-based system for rib-web type forgings [
9], and an expert system for gear forgings [
10] were developed to suggest the preform shapes. Although these approaches are sufficiently good for certain simple forgings, they are not practical because the guidelines are insufficient for complex geometries.
To overcome this limitation, Park et al. [
11] introduced the backward tracing scheme (BTS) method to determine a proper preform design by using the H-shapes and their boundary conditions and inversely tracing the loading path. In two-dimensional (2D) axisymmetric forging cases, Kim and Kobayashi [
12] applied the BTS method to closed-die forging, and obtained good results in terms of the metal flow for the predicted preform. Gao et al. [
13] extended this methodology to the preform design of three-dimensional (3D) blade forging by considering a non-uniform deformation. However, the BTS methodology requires a user’s initial guess for the preform design, which means that the result of this method is user-biased. Moreover, a designed preform is also influenced by the number of iterations and the loading path, which are not included in the design requirements.
Badrinarayanan and Zabaras [
14] introduced a linear sensitivity analysis (LSA)-based preform design method for axisymmetric forging. LSA approaches consider the change in the node position and compute the finite-dimensional gradients of the objective functions. Zhao et al. [
15] designed a preform for H-shaped forging considering the sensitivities of the objective functions and nodal velocities. However, because the governing equations in LSA approaches are strongly related to the forging shape, they become overcomplicated in the case of complex geometries. More recently, Lu et al. [
16] applied evolutionary structural optimization (ESO) by considering progressive element elimination during the procedure and applied it to the preform design of H-shaped geometry. Shao et al. [
17] and Yang et al. [
18] considered a bidirectional ESO method in which the elements eliminated in a certain step could be recovered in subsequent steps. However, this ESO method also requires an initial guess for the preform geometry, which becomes user-biased and is affected by aleatoric uncertainty.
To suggest a preform design shape without any initial design guess, Roy et al. [
19] utilized an artificial neural network (ANN) algorithm for H-shape forging. The ANN-based model utilizes the weight vectors and links the values of the forging geometry image to those of the preform image in a one-to-one relationship. Pathak et al. [
20] also applied the ANN method to a cylindrical billet upsetting process to reduce the barreling at the free surface of the workpiece. However, as the ANN method assigns the weight considering a one-to-one relationship for the user-defined training data, this method yields a poor level of detectable details and becomes impracticable for untrained complex forging shapes.
In contrast, a convolutional neural network (CNN) algorithm can extract the geometrical features of input data arrays through convolutional operations with weight arrays (filters) [
21]. As this algorithm does not require a one-to-one link between data, it has been widely employed in several engineering problems [
22,
23]. After the convolution operation, an array is defined as a feature map, where the extracted features of the input data are stored. The extracted feature maps are connected to an output data array by multiplying them by weights. All the utilized filters can be considered as a cumulative experience for engineers to infer appropriate output data from the input data.
Therefore, a new preform design approach based on the CNN algorithm is introduced. As the implemented CNN algorithm imitates the preform design process of the forging experts, the trained filters are expected to act as a sort of cumulative experience, utilized for the prediction of the preform shapes. Through the convolution operation with the filters, the feature maps relevant to the inputted forging shapes are identified, and the appropriate preform design is automatically determined by applying them to transform the forging geometry. Because the design of a 3D preform shape is normally a complicated and trial-and-error task, it is intended to be able to obtain multiple 3D preform candidates based on the capability of trained CNN sub-models for estimating 3D geometries. During the training procedure, filters were constructed based on the characteristics of the specified design requirements. Accordingly, we intended to derive a preform satisfying the design criterion considered in the corresponding training database. Even if different design criteria are considered during the construction of the training database, the proposed procedure can still be directly applied following the logic presented in this paper.
2. Preform Design Methodology
A flowchart of the CNN-based preform design model is presented in
Figure 1. The training database consisted of forging products and relevant preform shapes. The training materials should be designed to satisfy design requirements without user bias to eliminate the limitations that appeared in the literature survey. The product geometry was converted into a 3D digital data array (voxel array), and the preform candidates were derived from the various trained CNN models. The derived 3D data arrays of the preform candidates were converted into 3D modeling geometries. To select the best preform among the candidates, each candidate was evaluated by a finite element method (FEM) simulation. Based on the FEM results, one shape was easily selected as the preform that satisfied the design requirements of the input forging product.
2.1. Convolution Neural Network Model Structure
The CNN model utilized in this paper is based on the U-shape architecture [
24]. The model is composed of nine convolution blocks with two convolutional layers and eight pooling layers, as shown in
Figure 2. For the application of 3D geometries, the convolution operation is conducted in 3D digital volume space (voxel array).
The voxel array input to each layer is (
h ·
w ·
d ·
c).
h,
w, and
d are the height, width and depth of the array, respectively, and
c is the number of channels that signifies the information of the array. The convolution operation is conducted as the filters move in the
h,
w, and
d directions of the input array. In this study, 128 voxels were considered for the
h,
w, and
d values in the voxel array of the 3D forging and preform geometries through the conversion processes, summarized in
Appendix A. For the filter array, six voxels for each of the three axes were considered. Those values are the trade-off for the training convergence and computational efficiency.
The feature recognition process for the
i-th convolution layer begins from the input data array
(
is the number of the filters of the previous layers). To avoid dimension loss during the convolution operation, zero values are padded to the outside of an input array, resulting in
. After the convolution with
filter arrays of
, the output array is
(feature maps). The procedure in the first convolution layer is illustrated in
Figure 3 where
n is 128,
is 32 and
is 1.
The
array is reduced in half the size of
by the red arrows in
Figure 2 which represent a max-pooling operation. After that, by doubling the number of filter arrays, the features recognized in the previous convolution block are “zoomed in”. After the 4th convolution block, the size of the considered image is increased to derive the preform shape in the original dimensional scale. In this upgoing sampling procedure following the green arrows, the feature maps are mapped to the considered preform design, starting from small features (convolution block 5) to large features (convolution block 9). By concatenating the number of features in the down-sampling and the up-sampling operation, any loss of information concerning the positions of the feature positions in both the forging and preform shapes is avoided.
The general formulation of the convolution operation is defined in Equation (1).
x is the inputted array,
w is the filter array, and the bias
b is added in computation.
p,
q, and
r are the coordinates in the feature maps. The predicted value is calculated as the output of the activation function
f. The weights in filters (
w) are determined to minimize the loss between the true value
y of the given preform shapes and of the CNN model. Since the inputted data have binary values, a binary cross-entropy loss function
E, defined in Equation (2), is utilized. For the minimization process, an ADAM (Adaptive Moment Estimation) optimizer [
25] as defined in Equation (3) is utilized assuring the robust and quick weight update process by a two-moment vector,
and
. For the activation function, a ReLU (Rectified Linear Unit) function in Equation (4) is utilized for all the convolution layers except for the last one. Since the output of the ReLU function is
in the
domain, the
in Equation (3) cannot become zero. (If it becomes zero, the relevant weight is no longer updatable.) Since the last layer is utilized to predict the binary values of the array for a predicted preform shape, the sigmoid function in Equation (5), whose range is
, has been applied.
The CNN algorithm is implemented in Python 3.7.4 (Python Software Foundation, Wilmington, USA) and Keras 2.3.1(Keras, Mountain View, USA) with the Tensorflow backend framework on an NVIDIA V100-SXM2 GPU and 180 GB RAM computer systems. The hyperparameters utilized in the training procedure were determined through the random search method consisting of 300 training steps, and are equal to
,
,
, and
.
Since the derived array of preform shapes had a range from 0 to 1, a threshold of 0.7 has been employed, meaning that if the output of the sigmoid activation function is greater or equal to 0.7 the relevant position is considered as part of the preform, whereas the position is neglected. The overall CNN-based preform design concept is summarized in
Figure 4. To enhance the understanding of the proposed methodology, the explanation reported in
Figure 4 is based on 2D images of the forging product and preform shapes.
2.2. Training Database Definition
A training database consisting of the forging shapes and the corresponding preform shapes should be constructed to avoid user bias. According to the preform shape, in the training database, satisfying the objective design criterion, the weight values of the filters can be set to meet the considered criterion. In this study, both the forging and preform shapes for the training database were parametrically designed to avoid any bias. The design criterion was defined as the reduction in the forging load and the avoidance of metal flow overlapping and die unfilling.
Regarding the forging shape, a combination of four levels (0.24, 0.36, 0.48, and 0.6 D) at four heights (H1, H2, H3, and H4) was utilized to design all 96 geometries with the same outer diameter D, as shown in
Figure 5a. All cases had similar volumes in the
range. An R = 5 mm filet radius was used to connect the four regions for all cases. An example shape is shown in
Figure 5b.
To define the preform shape geometries, three levels (0.3, 0.4, and 0.5 D) were considered for five heights from H1 to H5. A B-spline function was utilized to generate 240 preform geometries, as shown in
Figure 6a. H6 was changed to match the volume of the forging shapes. Two examples of preform shapes are shown in
Figure 6b. The parametric design procedure was implemented in SolidWorks 2016 (Dassault Systèmes SolidWorks Corporation, Waltham, MA, USA) CAD software. To reduce the computation time, the forging shapes were considered as axisymmetric with a flat surface, and the preform shapes were presented using B-spline in axisymmetric shapes.
Among the 240 preform shapes, one shape satisfying the defined design rules was selected as the preform shape of the corresponding forging shape based on the FEM simulation results. The FEM simulations were implemented in the DEFORM 2D Version 11.0 (Scientific Forming Technologies Corporation, Columbus, OH, USA) software, as summarized in
Appendix B. The preform billet is defined as a deformable plastic considering AA6061 aluminum alloy using the Ludwick-type flow stress in Equation (6), calibrated by tensile test experiments on ASTM-E8 specimens at room temperature.
Although the AA6061 aluminum alloy was utilized in the implemented FEM model, the proposed procedure is independent of the material, as will be shown in the results section. Because the objective of the FEM investigation is to compare the forging load reduction and the plastic flow of the preform, and not a detailed investigation of the stress distribution, all simulations were implemented under cold and isothermal forging conditions.
2.3. Sub-Models Definition
By defining the multiple CNN sub-models, the filters have different values for each sub-model. Thus, individual preform candidates can be derived according to each CNN model. This subdivision strategy allows the final shapes that share similar geometrical features to be grouped together, increasing the level of accuracy of the relevant CNN sub-models.
In this study, six data groups were sorted by considering the geometrical characteristics (H1, H2, H3, and H4) of the forging shape, as shown in
Figure 7.
According to the defined grouping criterion in
Figure 7, one forging shape may belong to one single group or two different groups, resulting in each group having either 28 or 32 levels. The subdivision of the database was utilized for six CNN sub-models, all with the same CNN architecture. Six preform candidates were derived for an inputted forging shape according to each sub-model and further evaluated through FEM simulations to select the best preform that reduces the forging loads and avoids forging defects.
3. Numerical Experiments
For each of the multiple CNN sub-models, a range of forging load reductions could be achieved according to the corresponding training dataset, in comparison with a cylindrical billet with the same volume of the forging shape. The minimum and maximum forging load reductions for the data groups utilized for training the six CNN sub-models are summarized in
Table 1.
The proposed preform design model was applied to two 2D axis-symmetric forging cases in
Section 3.1, ¼ plane-symmetric forging in
Section 3.2, and grinding tool-like forging in
Section 3.3.
3.1. Two Axisymmetric Forging Cases
The CNN sub-models were utilized to derive preform shapes for the two axisymmetric test parts, as shown in
Figure 8 where the height features are listed in
Table 2. The dimensions of the two shapes were randomly designed to yield completely different geometrical features.
The finite element forging simulation results for the preform candidates were acquired from the 2D FEM model, summarized in
Appendix B, by considering the 2D cross-sections of the preform candidates as derived by 3D modeling shapes.
Based on the FEM results, for the test shape, as shown in
Figure 8a, the preform derived by the CNN#3 sub-model resulted in a maximum forging load reduction of 15.9% (
Figure 9). For the shape shown in
Figure 8b, the CNN#4 sub-model derived a preform resulting in a 16.5% forging load reduction (
Figure 10).
3.2. The One-Quarter Plane-Symmetric Forging
The CNN models were trained by utilizing 3D data arrays; thus, the trained models could derive the 3D preform shapes for the 3D inputted forging geometries. However, the training dataset used in this study was defined by considering axis-symmetric forgings. Therefore, to investigate the capability of the CNN sub-models trained with axisymmetric 2D images, and to accurately derive the preform shape for the case of non-axisymmetric shapes, two strategies were implemented for a ¼ plane-symmetric simple forging, as shown in
Figure 11. The first consisted of directly inputting the 3D images into the CNN sub-models. The second consisted of inputting separately the axisymmetric shapes of three cross-sections along 0°, 45°, and 90°.
Considering the first approach, the six shapes of the preform candidates are shown in
Figure 12a and
Table 3 and the preform shape derived by CNN#2 ensured the highest forging load reduction of 11.1% compared with that of a cylindrical billet (
Figure 12b). The FEM results presented in this section were acquired through a 3D thermo-mechanical FEM model, as described in
Appendix B.
Regarding the second approach, the three preforms estimated for each of the three considered directions (0°, 45°, and 90°) were combined into a single image by a Boolean union and smoothed in the transition regions between the three angles.
Considering the results reported in
Figure 12b, the preform shape derived by CNN#2 resulted in a maximum forging load reduction. For this reason, the preform shapes by inputting the 3D arrays to CNN#2 and by inputting the cross-sections at 0°, 45°, and 90° to CNN#2 were utilized in two forging simulations, and the relevant forging force versus stroke results are shown in
Figure 13b.
The comparison between the 0° and 90° cross-section dimensions of the two preforms derived through these two methods shows that they are remarkably similar in both the overall shape and dimensions. This indicates that although the training dataset was defined by considering axis-symmetric forgings, the proposed method can be applied to forging geometries in which at least one symmetry plane can be identified.
3.3. Grinding Tool-Like Forging
To test the reliability of the proposed approach, a literature case relevant to grinding tool-like forging [
26] was considered, as shown in
Figure 14. In the considered geometry, two symmetry planes were identified, as shown in
Figure 14.
The one-quarter models of the derived preform shapes are shown in
Figure 15 and
Table 4. For the estimation of the forging load reduction, 3D plane-symmetric FEM simulations were implemented in DEFORM 3D, as summarized in
Appendix B, considering the plasticine material reported in [
27].
As shown in
Figure 16, all six preform shapes allowed a maximum forging load reduction in comparison with the cylindrical preform, and the maximum reduction was identified for the preform shape derived by the CNN#5 sub-model. In
Figure 16c, the comparison between the experimental results [
26] and the authors’ FEM model results for the cylindrical billet show a particularly good agreement, proving the reliability of the implemented FEM model. However, the preform design derived by the proposed methodology reduced the forging load and eliminated forging defects.
4. Experimental Verification
To verify the applicability of the proposed preform design approach to a real industrial case, piston head forging (
Figure 17) was considered. This forging component is a one-quarter plane-symmetric shape characterized by thin walls and is designed for weight reduction.
The piston head geometry was input to the trained CNN sub-models and the derived preform shapes are shown in
Figure 18 and
Table 5. The derived preform shapes were utilized in the 3D thermo-mechanical FEM simulations, as described in
Appendix B. As for the industrial manufacturing process, the preforming stage can prevent die failure by reducing the die wear depth, which is significantly related to the forging load. Therefore, for the piston head verification step, the die wear depth according to the preform shapes was also estimated by the FEM investigations. Regarding the preform shapes derived by CNNs #4, #5, and #6, some features were considered too complicated to be obtained during a preform-forging phase, resulting in folding defects during the piston-forging phase, as highlighted in
Figure 18. Therefore, these three candidates were excluded from further investigation.
As shown in
Figure 19, the remaining three CNN-based preforms enabled the complete filling of the die, and the preform based on the CNN#3 model yielded the largest forging force reduction, estimated to be 15.09%, and promoted a wear depth reduction, estimated to be 16.05%, in comparison with the cylindrical billet.
Based on the positive results obtained from the FEM investigation in
Figure 19a, two experiments were performed considering the CNN#3 preform design and the cylindrical billet. For the hot forging process, the HKLP-800 press machine shown in
Figure 20a was utilized, and the set-up conditions for the process summarized in
Appendix B were utilized in the 3D FEM simulations. The top and bottom dies, relevant for the piston head shape, are shown in
Figure 20b.
Because the preform die has not been manufactured yet, the preform shape derived by CNN#3 (
Figure 21a) was obtained by machining from a cylindrical billet. In addition, an experiment considering a cylindrical billet was performed for comparison. When comparing the experimental and numerical forging load–stroke curves for the cylindrical and CNN#3-based preforms, as shown in
Figure 21c, the deviation between the experimental and FEM results was limited to 1.05% and 0.55% for the area beneath the curves and 1.95% and 0.93% for the case of the maximum load, respectively. These limited errors indicate that the implemented 3D thermo-mechanical FEM model can properly replicate the considered forging process. In addition, the cross-section of the forged piston head (
Figure 21b), cut through computer numerical control machining, shows the high quality of the filling as well as the absence of any overlap of the metal flow. This result proves the reliability of the defined CNN-based preform design approach and the accuracy of the implemented 3D thermo-mechanical FEM model in replicating the considered forging process conditions.
5. Discussion
As the creation of the training database is a time-consuming process, certain simplifications must be implemented, as performed in this study. Two-dimensional axisymmetric geometries were considered in the definition of the training database for both forging and preform shapes. For comparison, the 2D axisymmetric simulations required an average of 10 min each, whereas the 3D thermo-mechanical simulation for the piston head forging required approximately 2 h. For this reason, the ability to create a simplified, but still reasonable, training database is important to achieve accurate results without consuming excessive computational time. Moreover, the consideration of a cold-forging FEM can also reduce the computational time. Although the material flow resistance is strongly influenced by the forging temperature, this simplification does not alter the accuracy of the preform design because the material flow itself is independent of the processing temperature.
As presented in
Section 3 and
Section 4, the CNN sub-models automatically derived the 3D surface of the preform candidates by utilizing the trained filters. Considering the first validation phases, relevant for the application of the trained CNN sub-models to two additional axisymmetric forgings and a ¼ plane-symmetric forging, the results of the FEM investigations demonstrated the capability of the proposed preform design approach in deriving a preform shape that allows a significant reduction in the forging load by at least 11.1%. Regarding the grinding tool-like forging, the comparison between the experimental and FEM results validated the 3D forging FEM model implementation strategy and showed an improvement in the preform shape compared to that proposed in the relevant reference study [
26]. Regarding the investigation of the piston head forging described in
Section 4, the utilization of the preform shape derived by CNN#3 resulted in a reduction in the forging load of 15.09% (FEM results) and 15.94% (experimental result), as well as in the wear depth reduction estimated at 16.05%. Hence, the possibility of designing individual preform shapes, regardless of the material, allows the user to choose among the derived shapes based on the maximum force criterion and considering possible constraints for the preform manufacturing process.
Although the present study was mainly focused on forging load reduction, additional target parameters can be included as target functions for the training database definition. For instance, damage or residual stress minimization can be added as a criterion for the preform shape. However, the complexity of the resulting FEM model hinders its application to complex geometries, owing to the unreasonable amount of computational time required for the definition of the training database. This issue is critical for complex problem solutions by means of machine learning algorithms and requires more work in the future.
Furthermore, the training phase in the current study was based solely on the results of numerical simulations; however, it can also be linked to engineers’ experience by feeding previously validated sets of preforms and final shapes along with the FEM results. This combined approach is particularly interesting because the robustness of the training data can be further extended without requiring any additional computation time.
6. Conclusions
In this study, a new preform design methodology based on a CNN was proposed and investigated for a certain group of forging processes. The user-bias limitations highlighted in previously published algorithms can be overcome by considering a parametric design approach in the training database definition. The filters were successfully determined during the convolutional operations or the so-called training procedures by extracting the geometrical features of the forging product and linking them to the corresponding preform shapes, such as the building of a human design experience. Based on the multiple 3D CNN model strategies, multiple 3D preform design candidates for one inputted forging product geometry were easily acquired. Therefore, it is possible to select the best one depending on the design requirements and on the results of the forging simulation for the individual preform candidates. Although the CNN sub-models were trained three-dimensionally with 2D axisymmetric cases for simplicity, the results showed that they can be successfully extended to 3D complex forging shapes with symmetry planes. The filters were determined according to the design rules defining the training database; thus, according to the considered training set, additional design rules can be included and considered in the design of the preform geometry. It should be mentioned that the present method is a starting point for the automatization of preform design. In conclusion, the proposed preform design methodology is useful for deriving preform designs for general forging cases in the metal forging industry. However, it is expected that further investigations should be performed by including the effect of the number of training databases, the construction of a different training database, and additional manufacturing or material-related parameters.