1. Introduction
Superplastic forming (SPF), as a special process method of thermoforming, is an advanced forming technology that utilizes the superplastic large deformation capability of materials under specific temperature and strain rate conditions to manufacture thin-walled complex components (Li et al. [
1]). At present, the superplastic forming process has become a key technology in aerospace, new energy vehicles, and other fields due to its unique advantages in high-precision thin-walled complex components (Williams et al. [
2]; Yu et al. [
3]). Its research hotspots are mainly focused on the following four aspects: first, the development of new superplastic materials to meet the diversified needs for material properties in different fields (Lakshmanan et al. [
4]; Demirel et al. [
5]; Akula et al. [
6]; Dehkordi et al. [
7]). Second, the optimization of the superplastic forming process through numerical simulation and precise control of process parameters to produce high-quality molded parts. Third, the rapid generation of an appropriate superplastic forming process scheme according to the design characteristics, such as part structure, through artificial intelligence, machine learning, and other technological means. Fourth, the study of the integration of superplastic forming, diffusion bonding, additive manufacturing, and other technologies (Chandrappa et al. [
8]). With the growing demand for complex lightweight components in fields such as aerospace and new energy vehicles, efficiently and reliably designing optimal superplastic forming process solutions has become the core bottleneck constraining the broader application of this technology. Therefore, this paper aims to overcome the efficiency limitations of traditional process design through artificial intelligence methods, enabling intelligent and rapid design of superplastic forming processes.
In the research field of superplastic forming process design, the traditional approach is to use the step-by-step trial-and-error strategy that combines empirical formulas and finite element simulations. Using the finite element method, (Jun et al. [
9] and Alavi et al. [
10]) investigated the effects of different process parameters on the final part thickness in a novel hybrid forming process for Ti-6Al-4V. (Giuliano et al. [
11]) employed finite element modeling to determine precise pressure–time loading curves. Using AZ31 magnesium alloy as the experimental material, the numerical analysis results for hemispherical shell thickness exhibited a maximum deviation of less than 10% from experimental findings. However, because superplastic forming has the characteristics of high dimensionality and mutual coupling of process parameters, traditional process design methods are not only inefficient, costly for physical experiments and simulations, but also have limited ability to process a large amount of process data. As a result, they cannot fully mine the hidden rules behind the data, and it is difficult to achieve efficient optimization and innovation of the process. In contrast, AI-based design methods for superplastic forming processes demonstrate tremendous potential. With the help of artificial intelligence methods such as machine learning and deep learning, a large amount of process data can be processed quickly, complex patterns hidden within these datasets can be uncovered, and an accurate model of the mapping relationship between product design indicators and process parameters can be established. This approach reduces reliance on individual experience and provides a possibility for the intelligent generation of superplastic forming processes. Therefore, applying artificial intelligence methods to the field of superplastic forming process design—by constructing high-precision process prediction models to replace traditional trial-and-error strategies based on empirical knowledge and finite element simulations—has become an inevitable trend. To address this need, this paper aims to develop an AI-based process generation framework for superplastic forming.
For the process generation method based on artificial intelligence, many domestic and foreign scholars have carried out in-depth research. Research scholars combine knowledge graphs with certain reasoning algorithms to generate new process solutions, such as matching algorithms, machine learning, and deep learning (Xiao et al. [
12]). In the literature (Peng et al. [
13]; Li [
14]; Liu et al. [
15]), the researchers constructed the process knowledge graph by combining top–down and bottom–up methods. On this basis, combined with the similarity algorithm, a process route was recommended. Through validation, the feasibility of the model was proved, and the efficiency of the process design was effectively improved. However, the difference lies in the different research objects. (Peng et al. [
13]) took the part as the research object, and realized the recommendation of process routes by calculating the similarity between part attributes and feature topological relations. (Li [
14]) took the product as the research object, and realized the recommendation of process routes by calculating the similarity between the labels of the target product and those of similar products. (Liu et al. [
15]) took the precision transmission part as the research object, and realized the recommendation of process routes by calculating the similarity between the product process units and process route structure. (Su et al. [
16]) developed an automatic framework for building the knowledge graph of machining products and proposed an RGAT-PRotatE method for updating the knowledge periodically.
(Wang et al. [
17]) proposed a deep-learning-based method for constructing a process knowledge graph and reusing the process. The method realizes the construction of a process knowledge graph by constructing the pattern layer and the data layer. Secondly, the process knowledge inference model based on a graph neural network deep-learning algorithm is constructed. Through experimental verification, the accuracy of process recommendation reaches over 70%, which proves the effectiveness and feasibility of the method in the automatic construction of a knowledge graph and process reuse (Wang et al. [
17]). In the literature (Shen et al. [
18]; Wu et al. [
19]; Huang [
20]; Kesri et al. [
21]), researchers have studied the recommendation methods based on knowledge graph technology and graph neural network algorithms, which were used to solve the problem of ignoring the overall information in the data mining process in the existing methods. (Li et al. [
22]) proposed a method for part machining process design based on a knowledge graph and deep learning. Firstly, the process knowledge graph based on features, parts, feature work-step schemes, and part processes was constructed. Secondly, the part-to-part process mapping model based on BiLSTM + Attention, the part process sequence generation model based on Seq2Seq+Attention, and the part process decision model based on the fusion probability of the feature step scheme and the part process scheme were constructed, respectively. Finally, the effectiveness and feasibility of the method were verified by taking the pin shaft parts as an example (Li et al. [
22]). (Devireddy et al. [
23]) proposed a neural network-based process recommendation method for providing manufacturing process solutions for new components. (Ming et al. [
24]; Hussong et al. [
25]) proposed a neural network-based process recommendation method optimized by genetic algorithms. However, their objectives differ: the former addresses the optimal solution for tolerance allocation and process selection, while the latter uses the part’s 3D model as its dataset. Aiming at the problem of insufficient samples, (Ying et al. [
26]) proposed a process generation method based on small-sample knowledge learning. The method realizes sample enhancement by studying the geometric model enhancement analysis technology and process knowledge incremental learning technology, and establishes a three-dimensional digital process design method based on artificial intelligence by studying the process analysis technology of three-dimensional CAD models. The research result significantly improves the efficiency of the complex part process design (Ying et al. [
26]). (Chengwei et al. [
27]; Huang [
28]) took the ship plate surface forming as the research object, constructed the prediction model based on the improved gray wolf algorithm to optimize the SVM. The user can automatically determine the processing trajectory and processing parameters of the original steel plate based on the input of the target ship plate information, which initially meet the requirements of automatic processing of the ship plate surface. The ship plate processing has achieved the expected effect. (Zhang et al. [
29]; Wang et al. [
30]) proposed and optimized a novel four-stage drilling process method. Based on machine-learning algorithms, they developed a predictive model for femtosecond LTD technology, which enabled the rapid and successful design of optimized solutions across a wide range of process parameters.
By analyzing the above research, it is found that for the research of process generation method based on artificial intelligence, researchers mainly carry out research on process route recommendation based on similarity and deep-learning algorithms, such as graph neural network based on constructing knowledge graph, but there are some limitations in the existing research. On the one hand, they primarily focus on process route content without incorporating key process parameter values for each process. On the other hand, relying solely on similarity metrics or deep learning for parameter recommendation may result in substantial deviations from actual requirements, as highly similar products do not always guarantee suitable parameter matches. This challenge is particularly relevant in superplastic forming, where standardized processes demand precise parameter design.
Through the investigation, it is found that, firstly, the core process of superplastic forming for different products follows the standardized process of heating, feeding, mold closing, reverse inflatable forming, forward inflatable forming, cooling, and unloading. Although the process is universal, the key process parameters of each process need to be precisely designed for different part structures and material properties. Secondly, because the SPF involves high temperature and precise air pressure control, the cost of a single experiment is high, the cycle time is long, and the sample size of the public database is seriously insufficient. Thirdly, compared with traditional knowledge storage and representation, the graph structure can structurally organize concepts, entities, and their relationships, and has efficient retrieval capability (Bruendl et al. [
31]). Fourthly, compared with the regression algorithms such as SVR, RF, XGBoost, etc., the SVR, RF, and XGBoost are single-output models, while the BP neural network is a multi-output model, which has stronger nonlinear mapping ability and can more accurately fit the complex relationship between the process parameters. At the same time, it has a greater advantage in processing high-dimensional data. Compared with CNN, CNN is more suitable for processing images and time series data (Gu et al. [
32]; Wang et al. [
33]; Wang et al. [
34]; Alzubaidi et al. [
35]). Compared with LSTM, LSTM is more suitable for processing time series data (Malashin et al. [
36]; Malhotra et al. [
37]). BP neural network has the defects of slow convergence speed and being prone to falling into a local extremum. In existing studies, researchers mostly use methods such as adding momentum, global optimization, and adaptive learning rate to improve these defects, while ignoring the fact that the fixed maximum number of iterations affects the training results (Yan [
38]; Kaur et al. [
39]). Specifically, if the maximum number of iterations is too small, it leads to stopping the training before reaching the best point; if the maximum number of iterations is too large, it leads to prolonging the training time.
Therefore, this paper proposes a SPF process parameter precision design method based on a knowledge graph and artificial intelligence. In this method, the knowledge graph of the SPF process is constructed through the combination of top–down and bottom–up approaches. On this basis, a process parameter prediction model based on small samples and an improved BP neural network is constructed to realize the intelligent design of key process parameters for the target part. Finally, taking the typical hemispherical part as an example, the proposed algorithm is compared with the standard BP neural network, SVR, RF, and XGBoost to verify the effectiveness, feasibility, and superiority of the method in SPF process parameter design.
3. Superplastic Forming Process Parameter Design Method Based on Small Samples and an Improved BP Neural Network
3.1. Basic Principles of an Adaptive Iteration BP Neural Network
The concept of a BP (back propagation) neural network was first proposed by scientists led by Rumelhart and McClelland in 1986. It is a multilayer forward feedback neural network formed by forward propagation of data signals and back propagation of error signals, which has strong adaptive learning ability (Wang et al. [
43]; Wen et al. [
44]; Wen [
45]). The structure of the BP neural network is shown in
Figure 3. Among them,
,
,
,
represent the
input values of the input layer;
,
,
,
represent the
input values of the hidden layer; and
,
,
,
represent the
output values of the hidden layer.
,
,
,
represent the
input values of the output layer;
,
,
,
represent the
output values of the output layer;
represents the weights between the input layer and the hidden layer; and
represents the weights between the hidden layer and the output layer.
The basic flow of the adaptive iteration BP neural network algorithm is as follows, and the flowchart of the algorithm is shown in
Figure 4.
- (1)
Parameter Initialization
We initialize the parameters such as the number of input layer nodes, the number of hidden layer nodes, the number of output layer nodes, the learning rate, and the iteration count. Among them, the number of input layer nodes is determined by the number of influencing factor indicators, the number of output layer nodes is determined by the number of target indicators, and the number of hidden layer nodes is usually determined by the combination of empirical formulas and trial-and-error methods. The empirical formula is as follows:
In Equation (1), represents the number of hidden layer nodes, represents the number of input layer nodes, represents the number of output layer nodes, and represents a constant with values ranging from 1 to 10.
Meanwhile, the activation function, loss function, and optimizer are defined. Among them, the commonly used activation functions include the Sigmoid function and the ReLU function, the commonly used loss functions include the MSELoss and the L1Loss function, and the commonly used optimizer is the Adam optimizer.
- (2)
Forward Propagation
The input values of the hidden layer are calculated using Equation (2).
The output values of the hidden layer are calculated using Equation (3).
where
represents the input values of the hidden layer,
is the matrix with the shape of
representing the weights between the input layer and hidden layer,
represents the input matrix,
represents the output values of the hidden layer, and
represents the activation function.
The input values of the output layer are calculated using Equation (4).
The output values of the output layer are calculated using Equation (5).
where
represents the input values of the output layer,
is the matrix with the shape of
representing the weights between the hidden layer and output layer, and
represents the output values of the output layer.
- (3)
Back Propagation
The error between the predicted and actual values is calculated using the loss function. , are updated by the Adam optimizer.
- (4)
Update the Maximum Iteration Count and Save the Optimal Model
For each iteration, we determine whether the maximum iteration count has been reached. If this is attained, we update the maximum iteration count. We use the validation set to calculate the model loss function and compare it with the optimal value to determine whether to stop the iteration. Specifically, if the loss function value is less than 0.9999 times the optimal value, we update and save the current model as the optimal model; conversely, if the condition is not met for 16 consecutive iterations, we stop the iteration and complete the model training.
3.2. Superplastic Forming Process Parameter Design Method Based on Small Samples and an Improved BP Neural Network
When a new product is added to the queue, it includes basic information such as part name, part number, part size, mold material, plate material, plate size, and gas type.
First, according to the part name information in the new product queue, the knowledge graph of the superplastic forming process is searched, and the corresponding data are obtained as the product structure subgraph, which provides data support for the construction of the process parameter generation model.
Second, through analysis, the part name, part size, plate material, plate size, mold material, and gas type are determined as the influencing factor indicators. The chamber temperature and holding time in the heating process, the lowering speed and blank holder force in the mold closing process, the gas loading pressure and holding time in the reverse inflatable molding process, the gas loading pressure and holding time in the forward inflatable molding process, and the chamber temperature in the cooling process are determined as the target indicators.
Third, the model’s performance is affected by the fact that too few samples cannot fully explore the correlation relationship between the data. Therefore, this paper uses the data augmentation approach to expand the training set. Specifically, a sample is randomly selected from the training set, the feature data with string type remains unchanged, the feature data with numerical type generates random noise with a Gaussian distribution, and the noise is added to the initial data to generate new samples.
Fourth, data preprocessing is performed on the training set, validation set, and test set separately. Specifically, we perform One-Hot Encoding on string-type data to convert it into a numeric vector, and Min–Max Normalization on numeric-type data.
Fifth, we initialize the BP neural network structure and parameters, including the number of the input layer nodes, the number of the hidden layer nodes, the number of the output layer nodes, the iteration count, the learning rate, the activation function, and the loss function, among others.
Sixth, the model is trained with the training set and validated with the validation set by calculating MAE, MSE, and RMSE evaluation metrics until the iteration termination condition is met, completing the model training and saving the optimal model.
Finally, the test sample data are input into the prediction model to output the normalized values of the process parameters, and the predicted values of the process parameters are obtained after the inverse normalization of the output values.
The flowchart of the superplastic forming process parameter design method based on small samples and an improved BP neural network is shown in
Figure 5.