A Convolutional Neural Network-Based Stress Prediction Method for Airfoil Structures

Jia, Wendi; Chen, Quanlong

doi:10.3390/aerospace11121057

Open AccessArticle

A Convolutional Neural Network-Based Stress Prediction Method for Airfoil Structures

by

Wendi Jia

and

Quanlong Chen

^*

School of Aeronautics, Chongqing Jiaotong University, Chongqing 400074, China

^*

Author to whom correspondence should be addressed.

Aerospace 2024, 11(12), 1057; https://doi.org/10.3390/aerospace11121057

Submission received: 23 October 2024 / Revised: 16 December 2024 / Accepted: 20 December 2024 / Published: 23 December 2024

(This article belongs to the Section Aeronautics)

Download

Browse Figures

Versions Notes

Abstract

As a vital component of an aircraft, the structural integrity of the wing is closely linked to both flight performance and safety, making it essential to accurately predict the stresses within its structure. However, conventional stress calculation methods often encounter significant computational costs and lengthy analysis times when addressing highly nonlinear and complex geometries. To address these challenges, this paper introduces a deep learning-based stress prediction approach called the Multi-scale Attention Enhanced Unet (MA-Unet) model. The model incorporates a multi-scale feature extraction and attention mechanism based on Unet to capture complex stress distribution features more efficiently, and is applied to the stress prediction of wing skin structures. A stress field dataset is generated through numerical simulation, which is then used to train and evaluate the MA-Unet model. The prediction results are compared with those obtained from traditional convolutional neural networks (CNNs) and the Unet model. Experimental results demonstrate that the MA-Unet model achieves higher accuracy in predicting wing skin stresses and shows strong robustness across various testing conditions. This model serves as an effective method and provides valuable data support for the rapid and accurate assessment of wing structures, highlighting its significant practical applications.

Keywords:

structural strength; skinning; stress field prediction; multiscale; attention mechanism

1. Introduction

The wing serves as the primary lift-generating component of an aircraft, playing a vital role in both its design and flight performance. The wing’s skin, which interacts directly with the airflow, experiences complex aerodynamic loads during flight, potentially resulting in fatigue deformation, structural damage, and other associated risks. Consequently, conducting stress analysis on the wing skin is essential for maintaining the overall structural integrity of the wing. In contemporary engineering design, particularly within the realm of structural engineering, finite element analysis (FEM) is extensively employed to address intricate stress and deformation challenges [1,2]. This approach involves applying various boundary conditions to derive skin stress data and analyze stress distribution under differing operational scenarios. While FEM-based computational methods are capable of accurately simulating airfoil loading, traditional FEM techniques often encounter substantial computational costs and prolonged analysis times, especially in scenarios involving highly nonlinear geometries or dynamic loading conditions [3,4]. As the demand for sustainable, high-performance structures escalates, it is imperative for designers to rapidly assess the static stress performance of multiple design alternatives during the preliminary stages. Moreover, the importance of structural health monitoring for airfoils has increased to ensure their safety and reliability over extended usage periods. This evolving landscape has driven researchers to explore efficient predictive methodologies to enhance the design and evaluation processes of airfoils.

With the rapid development of Big Data and Artificial Intelligence technologies, engineers are using data-driven methods to optimize the design process and improve prediction accuracy with the help of massive data and powerful computing power [5,6,7]. Advanced data-driven techniques, such as data mining and deep learning, make it possible to extract valuable information from complex inputs, enabling rapid assessment of structural performance and failure prediction, and driving the design and application of sustainable high-performance structures [8,9,10].

Deep learning, a significant branch of machine learning, has exhibited remarkable capabilities, particularly in the processing of complex tasks such as image recognition, speech analysis, and natural language processing. An increasing number of researchers, both domestically and internationally, are investigating the applications of deep learning within the aviation sector, yielding notable outcomes [11,12,13]. Among these, Convolutional Neural Networks (CNNs) have garnered substantial attention due to their robust abilities in spatial feature extraction and information processing. Researchers have devised various network architectures and models based on CNNs tailored for diverse prediction and classification tasks, thereby offering effective solutions for optimizing the performance of aerostructures and enhancing fault prediction. In conclusion, the integration of deep learning and machine learning technologies not only augments the efficiency of design and analytical processes within engineering but also establishes a foundation for future sustainable development, further advancing the intelligent evolution of structural design and its applications [14,15,16,17,18].

Zhang et al. investigated the adaptability of CNNs for aerodynamic metamodeling, designing models tailored to various flow conditions and geometrical configurations. Through the training of multiple CNN architectures, they successfully predicted the lift coefficients of airfoils under diverse conditions, comparing the outcomes with those obtained from a multilayer perceptron (MLP). The findings indicate that the prediction accuracy of the CNN is comparable to that of the MLP, particularly under minimal geometric constraints, thereby demonstrating the efficacy of CNNs in this domain [19].

Chen et al. emphasize the significance of both symmetric and asymmetric airfoils in aircraft design and manufacturing, highlighting the necessity of acquiring their aerodynamic coefficients. They propose a CNN-based approach for predicting multiple aerodynamic coefficients. This method initially generates a transformed airfoil image (TAI) through fluidic convolution, which is then combined with the original airfoil image to create a composite airfoil image (CAI) for CNN analysis. Using the symmetric airfoil NACA 0012 as a case study for training and testing, the results demonstrate that the CNN method effectively predicts pitching moments, drag, and lift coefficients with a high degree of accuracy [20].

Bhatnagar’s research presents a CNN-based approximate model designed for predicting flow fields. This model specifically targets the prediction of velocity and pressure fields corresponding to a given pixelated object shape, with an emphasis on Reynolds-averaged Navier–Stokes (RANS) flow solutions over an airfoil. The study demonstrates that the predictive capabilities of the CNN are notably improved through the incorporation of convolutional operations, parameter sharing, and its robustness to noise [21].

In the realm of stress prediction, CNNs have proven their robust capabilities in feature extraction and spatial information capture. Sepasdar et al. employed a series of CNN architectures to predict post-failure von Mises stress fields and identify failure modes in inelastic composites [22,23,24].

Indrashish’s research investigates the design and analysis of inelastic materials under loading, emphasizing the prediction of their physical responses. To address the high time and computational costs associated with finite element method simulations, a deep learning (DL) framework is proposed for the rapid prediction of micro-scale elastic-plastic strains and stresses in two-phase media. The study introduces a novel two-step training methodology to enhance stress prediction accuracy: initially, the model is trained to predict strain fields, which are then utilized as inputs for stress field predictions. This efficient data-driven approach effectively predicts physical fields in inelastic materials using only microstructural images and load information [25].

Li et al. proposed a multiscale deep convolutional neural network (MS-DCNN) aimed at enhancing the accuracy of remaining useful life (RUL) assessments, thereby assisting decision-makers in developing effective maintenance strategies to maximize equipment utilization and mitigate costly failures. The MS-DCNN efficiently extracts features through its multiscale architecture, directly establishing a relationship between monitoring data and actual RUL. The results indicate that this method surpasses other network structures in prediction performance while accurately forecasting RUL without increasing computational demands [26].

Bhaduri’s research investigates the application of deep learning tools for local stress field prediction in fiber-reinforced composite systems, serving as an alternative to traditional finite element methods (FEMs). This methodology effectively reduces computational expenses while preserving the accuracy of the predictions [27].

Lei et al. proposed a rolling bearing fault diagnosis model that integrates Markov transfer field (MTF) and graph attention networks (GAT). This model leverages MTF to transform one-dimensional signals into two-dimensional feature maps, thereby preserving the temporal correlations inherent in the signals. By training and validating the model with various types of fault signals in a simulated real-world engineering environment, the experimental results demonstrate that the MTF-GAT model can accurately classify faults across diverse environmental conditions. Moreover, it exhibits superior recognition accuracy and generalization performance compared to other deep learning models [28].

In the aforementioned studies, CNNs have been extensively utilized for the prediction of various domains, showcasing their formidable capacity to process high-dimensional data and capture intricate features. These investigations have established a foundational understanding for implementing data-driven methodologies in engineering contexts. Nevertheless, despite the notable successes of CNNs in specific applications, current models exhibit limitations in accurately predicting both global and local stresses, particularly when confronted with complex geometries and multiscale challenges. Furthermore, there is a relative scarcity of research focusing on the application of deep learning models specifically for structural stress prediction, and existing CNN architectures have not been optimized for this particular task, leading to inadequate prediction accuracy.

To address the aforementioned challenges, this paper introduces a novel multi-scale convolutional neural network model known as the Multi-scale Attention Enhanced Unet (MA-Unet) model. This model integrates multi-scale feature extraction with an attention mechanism, employing multi-scale convolutional kernels to effectively capture the nuances of stress distribution across various levels. Additionally, the attention mechanism allows MA-Unet to adaptively focus on the most critical feature regions, thereby effectively filtering out irrelevant information and enhancing both the accuracy and robustness of the predictions. By leveraging deep learning techniques on wing-specific input parameters, MA-Unet not only adeptly identifies subtle variations in both global and local stress distributions but also demonstrates superior prediction accuracy and speed. This performance aligns with the dual requirements of efficiency and precision in contemporary engineering design.

In this paper, a finite element model is first developed for the wing design of a ventilation aircraft, with the aerodynamic load spectrum on the finite element nodes derived from aerodynamic calculation data and interpolation procedures. The flight parameters of the wing and the aerodynamic loads on the structural nodes are then organized and segmented into training and testing datasets. Finally, MA-Unet is employed to train and predict the dataset, with the results compared against those obtained from commonly used CNN and Unet models, thereby validating the superiority and applicability of the proposed MA-Unet method.

2. Theoretical Principles and Overall Process

2.1. Convolutional Neural Networks and Attention Mechanisms

CNNs were originally designed primarily for image recognition and processing tasks. Their distinctive structure, featuring convolutional and pooling layers, enables them to efficiently capture spatial hierarchies within images. As a result, CNNs have achieved significant success in various domains, including image classification, object detection, and facial recognition.

In a CNN, the operational components of the convolutional layer primarily include the convolutional kernel, stride, and padding. The convolutional kernel slides over the input data at specified intervals, multiplying its elements with the corresponding elements of the input, followed by summation. This process can be viewed as a matrix transformation. For a pressure matrix

A_{w \times h}

obtained after interpolation, let the convolution kernel be

α

, the stride be

S

, and the padding be

P

. The convolution operation can be defined as

A_{w^{'} \times h^{'}}^{'} = c o n v (A_{w, h}, α, S, P)

(1)

w^{'} = \frac{w + 2 * P - f}{s} + 1

(2)

h^{'} = \frac{h + 2 * P - f}{s} + 1

(3)

where

A_{w^{'} \times h^{'}}^{'}

, denotes the pressure matrix after the convolution operation, and

A_{w^{'} \times h^{'}}^{'}

denotes the matrix size after the operation.

The self-attention mechanism, as improved by Vaswani et al., leverages the principles of attention to associate different positions within a single sequence, allowing for the calculation of representations of that sequence. This mechanism offers superior global modeling capabilities compared to CNNs, enhancing the model’s ability to capture sample features by integrating the strengths of both CNNs and self-attention. The self-attention mechanism is fundamentally comprised of three components: the query vector, the key vector, and the value vector. Its calculation principle is shown in Figure 1. Its calculation process can be delineated into the following three steps:

(1): Assign weights to the input matrix and calculate the value of q, k, v, which is calculated as follows:

Q = W_{Q} X, K = W_{K} X, V = W_{V} X

(4)

where

W_{Q}, W_{K}, W_{V}

are initialized learnable weight matrices.

(2): Calculate the dot product between q and k and scale the result

S = \frac{Q K^{T}}{\sqrt{d_{k}}}

(5)

where

d_{k}

is the dimension of Q and K. Scaling the calculation results prevents clicks from being too large and enhances model fitting.

(3): Attentional weights were calculated using the softmax function on the computed results and pointwise multiplied with V to obtain the attentional output.

a t t e n t i o n_w e i g h t s = s o f t m a x (S)

(6)

o u t p u t = a t t e n t i o n_w e i g h t s \cdot V

(7)

That is, the final expression is

Z (Q, K, V) = s o f t m a x (\frac{Q K^{T}}{\sqrt{d_{k}}}) V

(8)

2.2. Overall Process

Stress analysis is a critical component of structural analysis, where finite element analysis methods are commonly employed for the stress calculation and analysis of complex structures and material systems. However, traditional multi-scale finite element analysis is frequently burdened by significant computational demands and time expenditures, prompting researchers to explore more efficient, data-driven machine learning approaches as viable alternatives. In response to this challenge, this paper presents a deep learning-based model for stress prediction. The proposed model integrates multi-scale feature extraction alongside an attention mechanism, leveraging the Unet architecture to more effectively capture the complexities inherent in stress distribution features. The methodology is applied to wing skin structures, with the primary workflow for stress prediction illustrated in Figure 2. The key steps involved are as follows:

(1): Numerical Simulation of the Wing and Data Acquisition

Firstly, computational fluid dynamics (CFD) software was used to perform aerodynamic calculations on the wing to obtain the skin aerodynamic load data under different working conditions. These working conditions include different flight parameters and aim to comprehensively reflect the load changes of the wing in actual flight. Subsequently, an interpolation procedure is used to accurately interpolate the CFD calculation results to the finite element nodes of the wing in order to achieve the accurate addition of loads and obtain the interpolated nodal load spectrum. Finally, hydrostatic calculations are performed with the help of finite element calculation software, so as to obtain the corresponding skin stress field data, which provides a solid data base for subsequent model training and validation.

(2): Dataset Establishment

After the obtained load spectra and flight parameters have undergone the necessary pre-processing, a multi-channel merger is performed in order to collate samples that meet the model input requirements. This process ensures the high quality and consistency of the data, which can effectively reflect the loading condition of the wing under different operating conditions. According to the design requirements, the final dataset is divided into training, validation, and test sets to facilitate subsequent model training and evaluation. In addition, dividing the dataset into training, validation, and test sets can improve the generalization ability of the model and also provide an objective basis for subsequent performance evaluation.

(3): Model Design

A novel deep learning model that can be used for wing skin stress field prediction is proposed based on the skin stress distribution. The model is based on the traditional Unet structure, incorporating multi-scale feature extraction and an attention mechanism. The multi-scale feature extraction enables the model to effectively capture the subtle changes of the stress field at different resolutions, while the attention mechanism enhances the model’s ability to learn global features and improve the model prediction performance.

(4): Model Evaluation

To ensure the validity and reliability of the model, the loss function and evaluation indexes commonly used in the field of deep learning are used to systematically evaluate and debug the model. Parameter adjustments are made by setting up model-tuning methods and observing the changes in loss values and evaluation indicators during model training. The model with the relative best performance is finally selected for stress prediction to ensure the effectiveness of the model in practical applications.

(5): Analysis of Results

After the training of the model is completed, it is validated and tested using the established dataset, and the prediction results of the model are analyzed in depth. The superiority and applicability of the MA-Unet model in stress analysis is verified by comparing the prediction results with the currently used CNN and Unet models.

3. Numerical Simulation and Stress Field Data Acquisition for Airfoils

In this study, a composite fixed-wing model of a general aviation aircraft, currently in the design phase, is employed for numerical simulations. The wing’s key parameters include a chord length of 1.45 m, a wingspan of 10.46 m, and a total airfoil area of 15.167 square meters. To generate the required training data, the research examines the aircraft’s operation at zero altitude, with flight speeds ranging from 150 to 240 km/h and angles of attack varying from 0 to 9 degrees. For aerodynamic load calculations, the commercial software Fluent is employed for the numerical simulation of the wing.

Initially, aerodynamic meshing of the wing geometry model is performed using Fluent Meshing software, employing the Poly-Hexcore body mesh generation method to enhance the number of hexahedral meshes. The aerodynamic mesh schematic of the wing skin and boundary layer is shown in Figure 3. The overall mesh size of the far field and wing wall is controlled between 0.001 m and 3 m, and the BOI encrypted mesh is set up in the leading and trailing edge regions of the wing, and a 40-layer boundary layer is set up in the wall region, with the initial height of the boundary layer being 0.001 m, and the total number of meshes is 2.75 million. In the watertight workflow, boundary conditions are set with the far-field wing designated as the pressure far-field and the wing model defined as a wall. To enhance solution accuracy, the pressure solver employs a second-order upwind discrete format. The shear stress transport (SST) k-ω model is selected as the turbulence model, with convergence determined when the lift and drag stabilize and the residuals fall below

1 \times 10^{- 5}

.

For stress load calculations, finite element modeling and stress analysis of the wing are conducted using the finite element software Nastran [29]. Figure 4 illustrates the finite element model of the wing. Among them, the girder and ribs are made of glass fiber material, and the skin is made of glass fiber and foam sandwich material. To comply with the design requirements for air-to-air aircraft, certain ultimate load conditions must be met. Specifically, the vertical displacement at the tip of the wing should remain within 6% of the wingspan, and the stress surrounding the bolt holes at the root of the wing beam must not exceed 60% of the material’s compressive strength. Fixed constraints are applied to the bolt holes at the beam’s root. Moreover, the wing structure is represented using quadrilateral shell elements, which encompass the airfoil, wing ribs, and beam components.

To ensure calculation accuracy, grid independence is verified during the early design phase by conducting numerical simulations with 15,000, 20,000, and 23,000 mesh elements, using equivalent stress as the evaluation criterion. Under flight conditions of zero altitude, a speed of 240 km/h, and an angle of attack of 5°, the results of the grid correlation verification are summarized in Table 1. The results show that the relative error in equivalent stresses from 15,000 to 23,000 grids is below 2%, and a grid-independent solution is considered to have been reached. Considering that 15,000 grids may contain less node information on some critical structures and 23,000 grids may contain too much unnecessary information on some non-critical structures, 20,000 grids were finally selected for subsequent numerical simulations, and the specific number of grid cells is shown in Table 2. This includes a total of 11,200 grid cells for the skin, 3048 grid cells for the wing ribs, and 5776 grid cells for the beam, with a maximum cell size of 46 mm and a minimum cell size of 1.7 mm throughout the wing grid.

The interpolation procedure is employed to transfer the CFD aerodynamic load calculation data to the finite element nodes, facilitating the addition of loads. The resulting interpolated calculation file is then provided to Nastran for stress analysis. The solution results from typical working conditions are selected for further examination. Figure 5 illustrates the equivalent stress distribution under varying angles of attack and flow rates. Notably, changes in the angle of attack and flow velocity have a significant impact on the stress distribution of the skin. Particularly at high angles of attack, the influence of flow velocity on stress becomes pronounced. As the angle of attack and flow velocity vary, equivalent stresses tend to concentrate at structural contact points, such as the skin, beam, and wing rib, with the von Mises stress values in the skin being higher near the wing root.

4. Data Processing and Dataset Creation

CNNs were originally employed predominantly for image recognition and processing tasks. In this study, the data required transformation into a two-dimensional format to facilitate the input as images. Specifically, we replaced the pixel value of the image with the specific value of the parameter, and then merged the processed two-dimensional data into the model as a sample. To meet the model’s input requirements, the wing skin was partitioned into upper and lower sections, with the model trained and evaluated separately on each segment.

In this paper, the simulation data under 100 flight conditions are processed, and the finite element node load spectrum, node position information, and flight parameters (including angle of attack and inflow velocity) are used as input parameters of the prediction model, while the calculated skin stress field is used as the output parameters. Figure 6 illustrates the schematic of channel merging, and the specific parameters and their ranges are shown in Table 3.

A dataset of 100 processed samples was created and split into training, validation, and test sets in an 8:1:1 ratio. The training set and validation set primarily includes operational data with angles of attack between 0 and 7 degrees to capture the effects of angle of attack and inflow velocity on stress magnitude and distribution. To evaluate the model’s performance under different flight conditions, data for angles of attack of 8 and 9 degrees was assigned to the test set, respectively.

5. Stress Prediction Model

5.1. MA-Unet Model

The Unet architecture was originally developed for medical image segmentation, demonstrating exceptional segmentation performance even with a limited number of samples. In recent years, it has proven effective in extracting potential features from various types of images. In this paper, a multi-scale convolutional neural network (MA-Unet) based on the Unet structure incorporating an attention mechanism is proposed, aiming to be used for stress field prediction. The introduction of the attention mechanism significantly enhances the global modelling capability of the model, which allows the model to pay more attention to the important feature regions in the image, thus suppressing the interference of irrelevant information during the feature extraction process and improving the overall accuracy and robustness. Meanwhile, this study also introduces the design of a multi-scale convolution kernel, a design concept that aims to allow the model to extract features at different spatial scales. By using a multi-scale convolution kernel, MA-Unet is able to capture the stress field variations and structural features more comprehensively. Specifically, the multiscale convolution kernel can process the input image through filters of different sizes, capturing multilevel features ranging from local details to global patterns. The structure of the multiscale convolutional layer is shown in Figure 7. The network’s structure is detailed in Table 4, and it can be categorized into three components: the encoder, base layer, and decoder.

In the encoder component, the model employs three distinct convolutional kernel sizes (1 × 1, 2 × 2, and 3 × 3) to extract multi-scale features from the input image. Each convolutional kernel size undergoes two convolution operations, and the resultant feature extraction outputs from the different kernel sizes are subsequently merged. To enhance the model’s focus on critical regions, a self-attention layer is incorporated following the first and last hidden layers. Downsampling is then executed via a max pooling layer to reduce the feature map’s dimensions while preserving essential spatial information. The encoder comprises three layers, containing 32, 64, and 128 convolutional kernels, respectively.

The lower section serves as an extension of the encoder, comprising 256 convolutional kernels and utilizing three convolutional kernel sizes for feature fusion. The decoder component employs transposed convolutional layers for upsampling, thereby restoring the spatial resolution of the image. The upsampled feature maps are then concatenated with the corresponding feature maps from the encoder, enabling the retention of more detailed information. The decoder is also organized into three layers, containing 32, 64, and 128 convolutional kernels, respectively. Ultimately, the output layer utilizes 1 × 1 convolutional kernels to produce a single-channel output, representing the stress field predicted by the model.

To thoroughly evaluate the performance of the proposed model, this paper conducts a comparative analysis with the Unet model and a commonly used Convolutional Neural Network (CNN) model. The structures of the relevant models are detailed in Table 5 and Table 6. This comparison aims to provide an in-depth analysis of the performance differences among the various models in predicting stress and strain fields, thereby validating the effectiveness and advantages of the proposed model.

5.2. Activation Function

In this study, both the linear function and the Rectified Linear Unit (ReLU) function are employed as activation functions within the network to augment its nonlinear representational capacity. The graphical representation of these functions is illustrated in Figure 8.

The gradient of the linear function remains constant across the entire input range, which facilitates gradient updating during backpropagation. Conversely, the ReLU function promotes convergence in the training of deep neural networks and mitigates the issue of vanishing gradients.

5.3. Loss Function and Evaluation Index

In this research, the Mean Squared Error (MSE) is employed as the model’s loss function, while the performance and explanatory power of the model are evaluated using three metrics: Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and the Coefficient of Determination (R²), with their definitions provided below:

M S E = \frac{1}{n} \sum_{i = 1}^{n} {({\hat{y}}_{i} - y_{i})}^{2}

(9)

M A E = \frac{1}{n} \sum_{i = 1}^{n} |{\hat{y}}_{i} - y_{i}|

(10)

M A P E = \frac{100 %}{n} \sum_{i = 1}^{n} |\frac{{\hat{y}}_{i} - y_{i}}{y_{i}}|

(11)

R^{2} (y, \hat{y}) = 1 - \frac{\sum_{i = 1}^{n} {({\hat{y}}_{i} - y_{i})}^{2}}{\sum_{i = 1}^{n} {({\bar{y}}_{i} - y_{i})}^{2}}

(12)

where

{\hat{y}}_{i}

represents the predicted value from the model,

y_{i}

denotes the computed value from the FEM,

n

is the total number of samples, and

{\bar{y}}_{i}

denotes the average of the FEM-calculated values.

5.4. Multi-Scale Volume-Based Feature Fusion Approach

Following the multi-scale volume and operations, the extracted feature information from each layer requires fusion. Commonly utilized feature fusion methods include addition, concatenation, and weighted fusion. Addition fusion directly combines information from each convolutional layer by summing the feature maps of different scales, which preserves information strength and suppresses noise, making it suitable for scenarios with high feature similarity and reducing the parameter count. Concatenation fusion connects feature maps of varying scales along the channel axis, thereby retaining rich information; this method is particularly beneficial when input data are diverse and complex, as it enhances model performance and expands the feature space to capture more detailed attributes. Weighted fusion dynamically adjusts the importance of features by assigning different weights to each feature map, thereby enhancing the model’s sensitivity to specific features and improving performance in complex scenarios.

To evaluate the impact of these three feature fusion methods on model performance, each method was trained separately with the number of training iterations fixed at 5000 steps. The ReLU activation function was employed in the multiscale and convolutional layers, while the output layer utilized a linear activation function, with a learning rate set at 0.001. By comparing the performance of these fusion methods in feature extraction and model training, the objective was to identify the most appropriate fusion method for this model.

Table 7 presents the evaluation metric scores for the models on the validation and test sets following training with the three feature fusion methods. The results indicate that both Add fusion and Concatenate fusion exhibit similar performance on the validation set, significantly surpassing Weighted fusion. Notably, Add fusion achieves the lowest MSE and MAE on the test set, demonstrating superior generalization ability on unseen data. In contrast, Concatenate fusion slightly underperforms compared to Add fusion on the test set, while Weighted fusion fails to match the performance of Add and Concatenate fusion across all metrics, particularly exhibiting poor results in MAE and MAPE.

Figure 9 depicts the loss degradation of the three models on the training set. It is evident that Weighted fusion experiences a higher loss in the initial stages, exhibiting greater fluctuations during the training process. In contrast, Additive fusion and Concatenate fusion demonstrate relatively smooth loss fluctuations. Overall, both Additive fusion and Concatenate fusion are more suitable for this task; however, Concatenate fusion generates larger feature maps during the fusion process, resulting in increased computation time. Given that Additive fusion performs slightly better than Concatenate fusion on the test set, it is ultimately selected as the method for fusing the multi-scale information within the model.

Through the comparison of the performance of these fusion methods in feature extraction and model training, along with an evaluation of their impact on overall model performance, this paper identifies the optimal fusion strategy suitable for the proposed model.

6. Analysis of Results

6.1. Analysis of Overall Regional Stress Prediction Results

In this study, the MA-Unet model was developed within a Conda environment using TensorFlow version 2.0. A ReLU activation function was integrated into the Volume Layer to improve the network’s ability to represent nonlinearity, enabling it to effectively capture complex feature relationships. In the output layer, a Linear activation function was used to ensure the model generates continuous stress values, leading to accurate predictions. To enhance the training process, the Adam optimizer was chosen for its excellent convergence properties. This choice significantly enhances the computational efficiency and convergence speed of the network, expediting the process of identifying the optimal solution. The specific parameters were configured as follows: a learning rate of 0.001, a dataset batch size of 6, and a total of 5000 training iterations. To prevent overfitting, a callback function is added during training to ensure that the best model is preserved for stress prediction.

Figure 10 shows the learning curves of the three models on the training and validation sets. The loss values drop sharply at the beginning of training and level off after the number of iterations is 1000 steps, which indicates that the models are able to effectively capture and adapt to the main features in the data. The model is considered to have converged after 4000 iterations when the loss value stays low and does not fluctuate between large orders of magnitude. During the overall training process, the CNN model has a higher number of loss values and a larger range of fluctuations, while the Unet and MA-Unet models have fewer fluctuations and a smaller range of fluctuations, which suggests that these two models are more stable and able to learn more efficiently when dealing with the training data. Meanwhile, the loss values of these two models are lower than those of the CNN model, which further validates their effectiveness in the current task.

Figure 11 presents the stress prediction cloud diagrams for the upper and lower skins generated by the three models under typical working conditions. A comparison of the stress prediction cloud diagrams from the three models with the results obtained from FEM calculations reveals a high level of consistency in the equivalent stress distribution predictions. This alignment indicates that all three models exhibit a high degree of accuracy in capturing the overall trends of stress distribution.

Table 8 and Table 9 present the scores of the models on the upper and lower skin validation and test sets across various evaluation metrics. All three models exhibit low loss values and good prediction accuracy for both MSE and MAE metrics. The MA-Unet model demonstrates the best performance on the upper-skinned dataset, achieving an MSE of 0.0044 and an MAE of 0.0429, with an R² value approaching 1. This result indicates that the MA-Unet model effectively captures the latent features of the data, showcasing strong generalization and prediction capabilities. The Unet model records an MSE of 0.0051 and an MAE of 0.0489 on the validation set, with both MSE and MAE values on the test set being higher than those of the MA-Unet model, suggesting slightly reduced generalization capability. In comparison, the CNN model exhibits weaker performance on these datasets, with MSE values of 0.0799 and 0.0508, and MAE values of 0.1425 and 0.1570 for the validation and test sets, respectively.

In the lower-skinned dataset, the MA-Unet model continues to demonstrate excellent performance, achieving an MSE of 0.0064 and an MAE of 0.0560 on the validation set, along with an MSE of 0.1023 on the test set. These results indicate that the MA-Unet model is well adapted to the features of the lower-skinned data, effectively reducing prediction errors. The Unet model records a commendable performance on the validation set, with an MSE of 0.0109 and an MAE of 0.0675; however, the MSE rises to 0.1373 on the test set, indicating some volatility. In contrast, the CNN model performs poorly on the lower-skinned dataset, exhibiting an MSE of 0.1452 and an MAE of 0.1761 on the validation set, and an MSE of 0.1412 and an MAE of 0.2865 on the test set, which reflects greater volatility.

In summary, all three models demonstrate improved overall prediction accuracy, with the MA-Unet model exhibiting the best performance in both upper and lower skin prediction tasks. The multi-scale feature extraction approach contributes to higher prediction accuracy and enhanced generalization capabilities across the models.

6.2. Projections for High-Stress Areas

In engineering applications, accurately predicting high-stress areas is crucial, as these regions often represent potential risk points for structural failure. Reliable stress prediction for these areas is essential for ensuring the safety and integrity of the structure, as well as for effectively reducing maintenance costs and extending the service life of the system.

Table 10 and Table 11 present the prediction metric scores for the three models in high-stress regions. A comparison of the prediction results for the upper and lower skin concentration areas reveals that the MA-Unet model outperforms the others across several assessment metrics. In the validation set for the upper skin concentration region, the MA-Unet achieves an MSE of 0.0138 and an MAE of 0.0841, values that are comparable to those of the Unet model. Although the MA-Unet shows an increased MSE of 0.0451 and an MAE of 0.1587 on the test set, it still demonstrates strong generalization capabilities when confronted with unseen data. Conversely, the CNN exhibits significantly higher MSE and MAE values of 0.1251 and 0.2576, respectively, indicating its limitations in accurately capturing complex stress distributions.

In the lower skin concentration region, the MA-Unet continues to outperform both the CNN and Unet models in the validation set, achieving an MSE of 0.0156 and an MAE of 0.0931. Conversely, the test results indicate that the CNN performs slightly worse than the Unet model, demonstrating a limited ability to generalize complex data. The MA-Unet model, equipped with its unique attention mechanism and multi-scale convolutional kernels, effectively captures key features in high-stress regions, thereby enhancing its adaptability and prediction accuracy for complex structures.

Figure 12 illustrates the relative error distributions of the three models on the upper and lower skin test sets. The MA-Unet model exhibits a more concentrated relative error distribution compared to both the Unet and CNN models across the overall dataset, with 90% of the samples demonstrating a relative error within 1%. In contrast, the relative error distributions of the Unet and CNN models in the lower skin region are primarily concentrated in the interval of [0, 0.02], showing a more uniform spread. The MA-Unet model’s ability to adaptively capture key features in high-stress regions is attributed to its unique attention mechanism and multi-scale convolution kernels, enhancing both adaptability and prediction accuracy for complex structures. This design not only improves the prediction accuracy for high-stress areas but also provides robust support for structural safety, maintenance cost reduction, and service life extension in engineering practice.

Figure 13 illustrates the predictions of the three models at the stress peak. To more clearly demonstrate the model’s prediction performance for the stress peak, the absolute error of each model’s prediction is magnified by 100 times for comparison with the FEM results. After scaling, the figure reveals that the CNN model exhibits the lowest alignment with the FEM calculation results on both the upper and lower skins of the validation and test sets, particularly on the lower skin data, which show significant fluctuations. This inconsistency may stem from the simpler structure of the CNN model, leading to a slightly weaker ability to capture data features compared to the other two models. The Unet model performs well in peak prediction on the validation set but displays some fluctuations on the test set, indicating limited generalization ability under extreme working conditions. The MA-Unet model seems to have a tendency to move away from the FEM results in the upper skinned validation set, but the overall results are still close to the FEM calculations. One reason for this may be because the validation set has fewer samples and some occasional nodes appeared, and another reason is because the loss function used for training is MSE, and the overall mean-square error is in a very small range, but occasional nodes may have some more significant errors. Overall, MA-Unet performs best on the overall dataset, with the addition of the multi-scale convolution approach and attention resulting in high prediction accuracy and strong generalization in high stress regions.

7. Conclusions

To meet the demand for the efficient and rapid assessment of the structural stress field in aircraft, this paper introduces a novel multi-scale convolutional neural network model, MA-Unet. This model integrates an attention mechanism with multi-scale convolutional kernels to accurately predict the structural stress field of wing structures, thereby offering a high-quality data source for the swift and precise evaluation of structural safety and reliability. The main conclusions of this paper are as follows:

(1): This paper utilizes input parameters instead of RGB values as the foundational elements of the model’s input tensor, merging the channels of each parameter to create input samples. The structure demonstrates that this data format effectively establishes the mapping relationship between parameters and the stress field, yielding high prediction accuracy, thus providing a reference for the application of deep learning models in stress prediction.
(2): Addressing the challenge of achieving good overall prediction accuracy while slightly underperforming in local predictions, this study employs Unet as the base structure and integrates a multi-scale convolution approach with an attention mechanism. This enhancement improves the model’s capability to capture feature information related to both global and local stresses. Additionally, the impact of three different feature fusion methods on prediction accuracy is explored, with results indicating that the Add fusion method significantly enhances model performance.
(3): In this paper, the prediction performance of the designed MA-Unet model is compared with the commonly used CNN model and Unet model. On the overall dataset, the prediction performance of the CNN model is slightly inferior to that of the Unet model and the MA-Unet model, which may be due to the insufficient ability of the CNN model to capture the features of complex nonlinear problems. The Unet model has similar predictive power to the MA-Unet model on the training and test sets, but lower predictive power than the MA-Unet model on the test set. The MA-Unet model has high prediction accuracy and strong generalisation ability for high stress regions with the addition of a multi-scale convolution approach and attention mechanism. The wing’s calculation time for one working condition in the hydrostatic simulation is about 10 s, while the MA-Unet model calculates six samples in only 132 ms, which is capable of generating a large amount of data under a large number of working conditions in a short period of time. The high accuracy and computational speed also provide strong support for the rapid assessment of structural safety and reliability, maintenance cost reduction, and service life extension, which has certain value for engineering applications. However, although the model proposed in this paper can be shaped by relying on a small number of samples, the improvement of the number of samples also helps to improve the prediction accuracy of the model for deep learning models. The model proposed in this paper has only been applied to the wing skin, and its application to the rest of the aircraft structure is not yet known. Therefore, future work will consider expanding the number of samples as well as applying the model to the rest of the wing structure to improve the prediction accuracy and applicability of the model.

Author Contributions

W.J.: Conceptualization, investigation, resources, supervision, project administration, writing original draft, writing—review and editing. Q.C.: Conceptualization, investigation, data curation, funding acquisition, writing—original draft, writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science Fund for Distinguished Young Scholars of Chongqing Municipality (CSTB2022NSCQ JQX0024).

Data Availability Statement

The data supporting the findings of this study are not publicly available due to privacy restrictions. Data access may be provided upon reasonable request, subject to appropriate privacy and ethical considerations.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Cunha, F.J.; Dahmer, M.T.; Chyu, M.K. Thermal-Mechanical Life Prediction System for Anisotropic Turbine Components. J. Turbomach. 2006, 128, 240–250. Available online: https://asmedigitalcollection.asme.org/turbomachinery/article-abstract/128/2/240/470626/Thermal-Mechanical-Life-Prediction-System-for?redirectedFrom=fulltext (accessed on 28 May 2024). [CrossRef]
Giannella, V.; Vivo, E.; Mazzeo, M.; Citarella, R. FEM-DBEM Approach to Simulate Crack Propagation in a Turbine Vane Segment Undergoing a Fatigue Load Spectrum. Procedia Struct. Integr. 2018, 12, 479–491. [Google Scholar] [CrossRef]
Santos, P.D.R.; Sousa, D.B.; Gamboa, P.V.; Zhao, Y. Effect of Design Parameters on the Mass of a Variable-Span Morphing Wing Based on Finite Element Structural Analysis and Optimization. Aerosp. Sci. Technol. 2018, 80, 587–603. [Google Scholar] [CrossRef]
Papa, U.; Russo, S.; Lamboglia, A.; Del Core, G.; Iannuzzo, G. Health Structure Monitoring for the Design of an Innovative UAS Fixed Wing through Inverse Finite Element Method (iFEM). Aerosp. Sci. Technol. 2017, 69, 439–448. [Google Scholar] [CrossRef]
Tao, F.; Qi, Q. New IT Driven Service-Oriented Smart Manufacturing: Framework and Characteristics. IEEE Trans. Syst. Man Cybern. Syst. 2019, 49, 81–91. [Google Scholar] [CrossRef]
Teferra, K.; Graham-Brady, L. A Random Field-Based Method to Estimate Convergence of Apparent Properties in Computational Homogenization. Comput. Methods Appl. Mech. Eng. 2018, 330, 253–270. [Google Scholar] [CrossRef]
Paulson, N.H.; Priddy, M.W.; McDowell, D.L.; Kalidindi, S.R. Reduced-Order Structure-Property Linkages for Polycrystalline Microstructures Based on 2-Point Statistics. Acta Mater. 2017, 129, 428–438. [Google Scholar] [CrossRef]
Jiang, W.; Chang, R.C.; Zhang, S.; Zang, S. Structural Health Monitoring and Flight Safety Warning for Aging Transport Aircraft. J. Aerosp. Eng. 2023, 36, 5. Available online: https://ascelibrary.org/doi/10.1061/JAEEEZ.ASENG-4740 (accessed on 28 May 2024). [CrossRef]
Liu, H.; Ma, T.; Lin, Y.; Peng, K.; Hu, X.; Xie, S.; Luo, K. Deep Learning in Rockburst Intensity Level Prediction: Performance Evaluation and Comparison of the NGO-CNN-BiGRU-Attention Model. Appl. Sci. 2024, 14, 5719. Available online: https://www.mdpi.com/2076-3417/14/13/5719 (accessed on 21 September 2024).
Ramezani, M.; Alandihallaj, M.; Hein, A.M. Fuel-Efficient and Fault-Tolerant CubeSat Orbit Correction via Machine Learning-Based Adaptive Control. Aerospace 2024, 11, 807. [Google Scholar] [CrossRef]
Xu, Y.; Pan, Q.; Wang, Z.; Hu, B. A Novel Trajectory Prediction Method Based on CNN, BiLSTM, and Multi-Head Attention Mechanism. Aerospace 2024, 11, 822. Available online: https://www.mdpi.com/2226-4310/11/10/822 (accessed on 23 October 2024).
Cao, Y.; Wang, X.; Wang, Y.; Xu, L.; Wang, Y. An Interval Neural Network Method for Identifying Static Concentrated Loads in a Population of Structures. Aerospace 2024, 11, 770. Available online: https://www.mdpi.com/2226-4310/11/9/770 (accessed on 23 October 2024).
Baldan, G.; Guardone, A. A Deep Neural Network Reduced Order Model for Unsteady Aerodynamics of Pitching Airfoils. Aerosp. Sci. Technol. 2024, 152, 109345. [Google Scholar] [CrossRef]
Broer, A.A.R.; Benedictus, R.; Zarouchas, D. The Need for Multi-Sensor Data Fusion in Structural Health Monitoring of Composite Aircraft Structures. Aerospace 2022, 9, 183. Available online: https://www.mdpi.com/2226-4310/9/4/183 (accessed on 28 May 2024).
Yu, T.; Wu, X.; Yu, Y.; Li, R.; Zhang, H. Establishment and Validation of a Relationship Model between Nozzle Experiments and CFD Results Based on Convolutional Neural Network. Aerosp. Sci. Technol. 2023, 142, 108694. [Google Scholar] [CrossRef]
Liu, Y.; Li, Y.; Li, L.; Xie, Y.; Zhang, D. A Fast Prediction Model of Blade Flutter in Turbomachinery Based on Graph Convolutional Neural Network. Aerosp. Sci. Technol. 2024, 148, 109119. [Google Scholar] [CrossRef]
Hu, J.; Zhang, W. Flow Field Modeling of Airfoil Based on Convolutional Neural Networks from Transform Domain Perspective. Aerosp. Sci. Technol. 2023, 136, 108198. [Google Scholar] [CrossRef]
Ren, L.; Sun, Y.; Wang, H.; Zhang, L. Prediction of Bearing Remaining Useful Life With Deep Convolution Neural Network. IEEE Access 2018, 6, 13041–13049. [Google Scholar] [CrossRef]
Zhang, Y.; Sung, W.J.; Mavris, D.N. Application of Convolutional Neural Network to Predict Airfoil Lift Coefficient. In Proceedings of the 2018 AIAA/ASCE/AHS/ASC Structures, Structural Dynamics, and Materials Conference, Kissimmee, FL, USA, 8–12 January 2018; AIAA SciTech Forum. American Institute of Aeronautics and Astronautics: Reston, VA, USA. [Google Scholar]
Chen, H.; He, L.; Qian, W.; Wang, S. Multiple Aerodynamic Coefficient Prediction of Airfoils Using a Convolutional Neural Network. Symmetry 2020, 12, 544. Available online: https://www.mdpi.com/2073-8994/12/4/544 (accessed on 23 October 2024).
Bhatnagar, S.; Afshar, Y.; Pan, S.; Duraisamy, K.; Kaushik, S. Prediction of Aerodynamic Flow Fields Using Convolutional Neural Networks. Comput Mech 2019, 64, 525–545. [Google Scholar] [CrossRef]
Yang, C.; Kim, Y.; Ryu, S.; Gu, G.X. Using Convolutional Neural Networks to Predict Composite Properties beyond the Elastic Limit. MRS Commun. 2019, 9, 609–617. [Google Scholar] [CrossRef]
Chen, W.; Iyer, A.; Bostanabad, R. Data Centric Design: A New Approach to Design of Microstructural Material Systems. Engineering 2022, 10, 89–98. [Google Scholar] [CrossRef]
Gao, W.; Lu, X.; Peng, Y.; Wu, L. A Deep Learning Approach Replacing the Finite Difference Method for In Situ Stress Prediction. IEEE Access 2020, 8, 44063–44074. Available online: https://ieeexplore.ieee.org/document/9020114 (accessed on 23 October 2024). [CrossRef]
Saha, I.; Gupta, A.; Graham-Brady, L. Prediction of Local Elasto-Plastic Stress and Strain Fields in a Two-Phase Composite Microstructure Using a Deep Convolutional Neural Network. Comput. Methods Appl. Mech. Eng. 2024, 421, 116816. [Google Scholar] [CrossRef]
Li, H.; Zhao, W.; Zhang, Y.; Zio, E. Remaining Useful Life Prediction Using Multi-Scale Deep Convolutional Neural Network. Appl. Soft Comput. 2020, 89, 106113. [Google Scholar] [CrossRef]
Bhaduri, A.; Gupta, A.; Graham-Brady, L. Stress Field Prediction in Fiber-Reinforced Composite Materials Using a Deep Learning Approach. Compos. Part B Eng. 2022, 238, 109879. [Google Scholar] [CrossRef]
Lei, C.; Xue, L.; Xia, B.; Jiao, M.; Shi, J. Rolling Bearing Fault Diagnosis Method Based on Markov Transition Field and Graph Attention Network. J. Vib. Eng. 2023, 1–10. Available online: https://kns.cnki.net/kcms/detail/32.1349.tb.20230327.0958.003.html (accessed on 23 October 2024). (In Chinese).
Chen, Q.; Han, J.; Yun, H. Effect of Engine Thrust on Nonlinear Flutter of Wings. J. Vibroeng. 2013, 15, 1731–1739. [Google Scholar]

Figure 1. Schematic of the computational process of the self-attention mechanism.

Figure 2. Stress prediction model modelling process.

Figure 3. Wing skin and boundary layer mesh.

Figure 4. Finite element model of the wing.

Figure 5. Simulated stress cloud for different angle of attack and inflow velocity.

Figure 6. Multi-Channel Data Merging.

Figure 7. Multi-scale convolutional layers.

Figure 8. Linear activation function and ReLU activation function.

Figure 9. Learning curves for different feature fusion methods.

Figure 10. Learning curves for different models in the upper and lower skinned datasets.

Figure 11. Stress prediction cloud for typical working conditions.

Figure 12. Relative error distribution on the test set in the high stress region.

Figure 13. Peak stress prediction.

Table 1. Grid correlation verification results.

Number of Mesh Elements	Maximum Stress Value (Mpa)	Relative Error
15,000	180.2	0
20,000	183	1.55%
23,000	177.3	1.16%

Table 2. Grid Element Information.

Structure	Number of Mesh Elements	Maximum Mesh Element Size (mm)	Minimum Mesh Element Size (mm)
Wing surface	5600	46	8.6
Wing rib	3048	28	5.2
Beam	5776	46	1.7

Table 3. Model input parameters.

Input Parameter	Upper Skin Parameter Range	Lower Skin Parameter Range	Unit
X-axis node coordinates (x_c)	[0–1.106]	[0–1.106]	m
Y-axis node coordinates (y_c)	[0.002–0.273]	[−0.079–0.157]	m
Z-axis node coordinates (z_c)	[0.746–5.187]	[0.746–5.187]	m
X-axis aerodynamic load component (x_a)	[−41.895–46.388]	[−5.135–4.213]	N
Y-axis aerodynamic load component (y_a)	[−241.032–268.914]	[−177.85–194.166]	N
Z-axis aerodynamic load component (z_a)	[−52.631–51.631]	[−8.507–7.796]	N
angle of attack (a)	[0, 9]	[0, 9]	Degree
Inflow velocity (v)	[150, 240]	[150, 240]	Km/h

Table 4. MA-Unet structural parameters.

Serial Number	Structure Composition	Activation Function	Data Shape
Serial Number	Structure Composition	Activation Function	Up	Down
1	Input layer		88 × 72 × 8	100 × 60 × 8
2	Multi-scale convolutional layer + self-attention layer	Relu	88 × 72 × 32	100 × 60 × 32
3	Max pooling layer		44 × 36 × 32	50 × 30 × 32
4	Convolutional layer + Convolutional layer	Relu	44 × 36 × 64	50 × 30 × 64
5	Max pooling layer		22 × 18 × 64	25 × 15 × 64
6	Convolutional layer + Convolutional layer	Relu	22 × 18 × 128	25 × 15 × 128
7	Max pooling layer		11 × 9 × 128	5 × 3 × 128
8	Multi-scale Convolutional Layer	Relu	11 × 9 × 256	5 × 3 × 256
9	Upsampling Layer + skip connection layer	Relu	22 × 18 × 256	25 × 15 × 256
10	Convolutional layer + Convolutional layer	Relu	22 × 18 × 128	25 × 15 × 128
11	Upsampling Layer + skip connection layer	Relu	44 × 36 × 128	50 × 30 × 128
12	Convolutional layer + Convolutional layer	Relu	44 × 36 × 64	50 × 30 × 64
13	Upsampling Layer + skip connection layer	Relu	88 × 72 × 64	100 × 60 × 64
14	Multi-scale convolutional layer + self-attention layer	Relu	88 × 72 × 32	100 × 60 × 32
15	Output Layer	Linear	88 × 72 × 1	100 × 60 × 1

Table 5. Unet structural parameters.

Serial Number	Structure Composition	Activation Function	Data Shape
Serial Number	Structure Composition	Activation Function	Up	Down
1	Input layer		88 × 72 × 8	100 × 60 × 8
2	Convolutional layer + Convolutional layer	Relu	88 × 72 × 32	100 × 60 × 32
3	Max pooling layer		44 × 36 × 32	50 × 30 × 32
4	Convolutional layer + Convolutional layer	Relu	44 × 36 × 64	50 × 30 × 64
5	Max pooling layer		22 × 18 × 64	25 × 15 × 64
6	Convolutional layer + Convolutional layer	Relu	22 × 18 × 128	25 × 15 × 128
7	Max pooling layer		11 × 9 × 128	5 × 3 × 128
8	Convolutional layer + Convolutional layer	Relu	11 × 9 × 256	5 × 3 × 256
9	Upsampling Layer + skip connection layer	Relu	22 × 18 × 256	25 × 15 × 256
10	Convolutional layer + Convolutional layer	Relu	22 × 18 × 128	25 × 15 × 128
11	Upsampling Layer + skip connection layer	Relu	44 × 36 × 128	50 × 30 × 128
12	Convolutional layer + Convolutional layer	Relu	44 × 36 × 64	50 × 30 × 64
13	Upsampling Layer + skip connection layer	Relu	88 × 72 × 64	100 × 60 × 64
14	Convolutional layer + Convolutional layer	Relu	88 × 72 × 32	100 × 60 × 32
15	Output Layer	Linear	88 × 72 × 1	100 × 60 × 1

Table 6. CNN model structural parameters.

Serial Number	Structure Composition	Activation Function	Data Shape
Serial Number	Structure Composition	Activation Function	Up	Down
1	Input layer		88 × 72 × 8	100 × 60 × 8
2	Convolutional layer + Convolutional layer	Relu	88 × 72 × 32	100 × 60 × 32
3	Max pooling layer		44 × 36 × 32	50 × 30 × 32
4	Convolutional layer + Convolutional layer	Relu	44 × 36 × 64	50 × 30 × 64
5	Max pooling layer		22 × 18 × 64	25 × 15 × 64
6	Convolutional layer + Convolutional layer	Relu	22 × 18 × 128	25 × 15 × 128
7	Max pooling layer		11 × 9 × 128	5 × 3 × 128
8	Convolutional layer + Convolutional layer	Relu	11 × 9 × 256	5 × 3 × 256
9	Upsampling Layer	Relu	22 × 18 × 128	25 × 15 × 128
10	Convolutional layer + Convolutional layer	Relu	22 × 18 × 128	25 × 15 × 128
11	Upsampling Layer	Relu	44 × 36 × 64	50 × 30 × 64
12	Convolutional layer + Convolutional layer	Relu	44 × 36 × 64	50 × 30 × 64
13	Upsampling Layer	Relu	88 × 72 × 32	100 × 60 × 32
14	Convolutional layer + Convolutional layer	Relu	88 × 72 × 32	100 × 60 × 32
15	Output Layer	Linear	88 × 72 × 1	100 × 60 × 1

Table 7. Assessment metrics scores for different feature fusion approaches.

Feature Fusion Method	Data Set	MSE	MAE	R²	MAPE
Add	Validation set	0.0044	0.0429	0.99999	0.6031%
Add	Test set	0.0321	0.1325	0.99997	0.8323%
Concatenate	Validation set	0.0042	0.0431	0.99999	0.7039%
Concatenate	Test set	0.0349	0.1314	0.99996	0.9478%
Weighted Fusion	Validation set	0.0338	0.1012	0.99993	1.3067%
Weighted Fusion	Test set	0.0590	0.1655	0.99994	0.9306%

Table 8. Scores for each of the evaluation metrics on the upper skin validation set and test set.

Model	Data Set	Evaluation of Indicators
Model	Data Set	MSE	MAE	R²	MAPE
CNN	Validation set	0.0799	0.1425	0.99985	2.5036%
CNN	Test set	0.0508	0.1570	0.99995	0.9639%
Unet	Validation set	0.0051	0.0489	0.99999	0.7248%
Unet	Test set	0.0528	0.1575	0.99995	0.9556%
MA-Unet	Validation set	0.0044	0.0429	0.99999	0.6031%
MA-Unet	Test set	0.0321	0.1325	0.99997	0.8323%

Table 9. Scores for each of the evaluation metrics on the lower skin validation and test sets.

Model	Data Set	Evaluation of Indicators
Model	Data Set	MSE	MAE	R²	MAPE
CNN	Validation set	0.1452	0.1761	0.99973	3.0910%
CNN	Test set	0.1412	0.2865	0.99986	1.7907%
Unet	Validation set	0.0109	0.0675	0.99997	1.2655%
Unet	Test set	0.1373	0.2560	0.99986	1.3487%
MA-Unet	Validation set	0.0064	0.0560	0.99998	0.9015%
MA-Unet	Test set	0.1023	0.1910	0.99990	1.2521%

Table 10. Predictor scores for high-stress regions of the upper skin.

Model	Data Set	Evaluation of Indicators
Model	Data Set	MSE	MAE	R²	MAPE
CNN	Validation set	0.2320	0.2513	0.99978	1.4207%
CNN	Test set	0.1251	0.2576	0.99988	0.3182%
Unet	Validation set	0.0130	0.0854	0.99998	0.3413%
Unet	Test set	0.1118	0.2298	0.99989	0.2840%
MA-Unet	Validation set	0.0138	0.0841	0.99998	0.3619%
MA-Unet	Test set	0.0451	0.1587	0.99995	0.1888%

Table 11. Predictor scores for areas of high stress in the lower skin.

Model	Data Set	Evaluation of Indicators
Model	Data Set	MSE	MAE	R²	MAPE
CNN	Validation set	0.5033	0.3585	0.99950	2.3055%
CNN	Test set	0.3911	0.5480	0.99960	0.7153%
Unet	Validation set	0.0261	0.1053	0.99997	0.4660%
Unet	Test set	0.3928	0.5070	0.99960	0.6295%
MA-Unet	Validation set	0.0156	0.0931	0.99998	0.3767%
MA-Unet	Test set	0.2905	0.3260	0.99970	0.4193%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jia, W.; Chen, Q. A Convolutional Neural Network-Based Stress Prediction Method for Airfoil Structures. Aerospace 2024, 11, 1057. https://doi.org/10.3390/aerospace11121057

AMA Style

Jia W, Chen Q. A Convolutional Neural Network-Based Stress Prediction Method for Airfoil Structures. Aerospace. 2024; 11(12):1057. https://doi.org/10.3390/aerospace11121057

Chicago/Turabian Style

Jia, Wendi, and Quanlong Chen. 2024. "A Convolutional Neural Network-Based Stress Prediction Method for Airfoil Structures" Aerospace 11, no. 12: 1057. https://doi.org/10.3390/aerospace11121057

APA Style

Jia, W., & Chen, Q. (2024). A Convolutional Neural Network-Based Stress Prediction Method for Airfoil Structures. Aerospace, 11(12), 1057. https://doi.org/10.3390/aerospace11121057

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Convolutional Neural Network-Based Stress Prediction Method for Airfoil Structures

Abstract

1. Introduction

2. Theoretical Principles and Overall Process

2.1. Convolutional Neural Networks and Attention Mechanisms

2.2. Overall Process

3. Numerical Simulation and Stress Field Data Acquisition for Airfoils

4. Data Processing and Dataset Creation

5. Stress Prediction Model

5.1. MA-Unet Model

5.2. Activation Function

5.3. Loss Function and Evaluation Index

5.4. Multi-Scale Volume-Based Feature Fusion Approach

6. Analysis of Results

6.1. Analysis of Overall Regional Stress Prediction Results

6.2. Projections for High-Stress Areas

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI