Research on a Rapid Three-Dimensional Compressor Flow Field Prediction Method Integrating U-Net and Physics-Informed Neural Networks

Wang, Chen; Ma, Hongbing

doi:10.3390/math13152396

Open AccessArticle

Research on a Rapid Three-Dimensional Compressor Flow Field Prediction Method Integrating U-Net and Physics-Informed Neural Networks

by

Chen Wang

¹ and

Hongbing Ma

^1,2,*

¹

School of Intelligence Science and Technology, Xinjiang University, Urumqi 830046, China

²

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China

^*

Author to whom correspondence should be addressed.

Mathematics 2025, 13(15), 2396; https://doi.org/10.3390/math13152396

Submission received: 11 May 2025 / Revised: 27 June 2025 / Accepted: 27 June 2025 / Published: 25 July 2025

Download

Browse Figures

Versions Notes

Abstract

This paper presents a neural network model, PINN-AeroFlow-U, for reconstructing full-field aerodynamic quantities around three-dimensional compressor blades, including regions near the wall. This model is based on structured CFD training data and physics-informed loss functions and is proposed for direct 3D compressor flow prediction. It maps flow data from the physical domain to a uniform computational domain and employs a U-Net-based neural network capable of capturing the sharp local transitions induced by fluid acceleration near the blade leading edge, as well as learning flow features associated with internal boundaries (e.g., the wall boundary). The inputs to PINN-AeroFlow-U are the flow-field coordinate data from high-fidelity multi-geometry blade solutions, the 3D blade geometry, and the first-order metric coefficients obtained via mesh transformation. Its outputs include the pressure field, temperature field, and velocity vector field within the blade passage. To enhance physical interpretability, the network’s loss function incorporates both the Euler equations and gradient constraints. PINN-AeroFlow-U achieves prediction errors of 1.063% for the pressure field and 2.02% for the velocity field, demonstrating high accuracy.

Keywords:

flow field prediction; U-Net; gradient constraints; coordinate transformation; Euler equations

MSC:

65N99

1. Introduction

The development of the aviation industry places higher demands on the rapid iterative design of aeroengines. To achieve higher compressor pressure ratios and efficiency, continuous optimization of compressor design is necessary. Accurate prediction of the internal flow characteristics of compressors, especially the fluid acceleration at the blade leading edge and the flow characteristics of the wall boundary layer, which are critical for understanding phenomena like laminar–turbulent transition in gas turbine engines [1], is crucial for the rapid iterative design of compressors. These phenomena often involve complex turbulence, which remains a significant challenge in CFD [2]. Although traditional Computational Fluid Dynamics (CFD) methods can obtain accurate solutions of the flow field through numerical simulation, their computational speed no longer meets the requirements of modern compressor design [3]. This is because thousands of evaluations are often required during optimization, and each high-fidelity 3D simulation may take hours or even days to complete.

Deep learning-based methods provide a new perspective for compressor blade flow field prediction. Deep learning models can infer potential physical laws from large amounts of data, especially showing potential in handling high-dimensional and nonlinear problems [4,5]. The latest advancements include Transformer-based architectures, which are increasingly being applied to complex fluid dynamics problems [6]. Lv Zhaoyang et al. [7] used a convolutional neural network (CNN) to build a prediction model for airfoil flow fields, with the input being the original image of the airfoil flow field represented by pixels and the output being the lift coefficient and moment coefficient of the airfoil. The experimental results showed that the CNN model’s predictions had good accuracy and the calculation speed was 40 times faster than CFD methods. N. Thuerey et al. [8] designed a U-Net-based network [9] that, trained on a large amount of CFD data, could predict the velocity and pressure distribution of airfoil flow fields from boundary conditions, with an average prediction error of less than 3%. S. Bhatnagar et al. [10] introduced Gradient Sharpening (GS) to improve the clarity of predictions and used a CNN structure with shared encoding and decoding layers to significantly improve computational efficiency.

However, in fluid dynamics applications, deep learning models face specific challenges. For example, most deep learning models commonly adopt a pixelated preprocessing method that projects data onto a uniform Cartesian grid, which leads to the loss of flow field information in small-scale regions near the wall and boundary, affecting prediction accuracy. While coordinate transformation [11,12,13] can mitigate this issue, emerging implicit neural representations [14] offer a mesh-independent continuous flow field prediction method. To capture the flow field information near the wall, Z. Yang et al. [15] proposed the AMGNET method, which relies on algebraic multigrid algorithms and graph neural networks (GNNs) to improve the accuracy of flow field prediction in small-scale regions. However, although graph neural networks can handle unstructured grids, they have problems in representational theory and limitations in hardware requirements, parallel processing, and distributed solving.

Compared to well-structured text and images, graphs are irregular. A fundamental assumption in machine learning on graphs is that the prediction target should be independent of the order of nodes on the graph. To satisfy this assumption, GNNs introduce an inductive bias called permutation invariance [16]. Specifically, the output of GNNs is independent of how the node indices of the graph are assigned and their processing order, i.e., the model parameters are independent of the node order and are shared across the entire graph. Due to the unique parameter sharing mechanism of GNNs, the way they fit data features is very different from traditional neural networks. First, most GNNs are usually used as black-box feature extractors for graphs, and it is unclear how well they can capture different graph features and topologies. Second, due to the introduction of permutation invariance, the results of the classical universal approximation theorem for neural networks (NNs) [17,18] cannot be directly generalized to GNNs [19]. Furthermore, the limitations of the expressive power of GNNs are related to some longstanding difficult problems in graph theory [20]. For example, in predicting the properties of chemical molecules, it is necessary to determine whether the molecular structure is the same as or similar to molecules with known properties, which involves problems such as graph/subgraph isomorphism judgment [21,22].

Since existing deep learning models cannot overcome the problem of information loss in small-scale regions caused by structured grids, researchers have proposed methods for grid transformation in the preprocessing stage. L.-W. Chen and N. Thuerey [11] improved the solution efficiency and maintained the ability to capture complex flow phenomena by using a U-Net network to transform unstructured grids into body-fitted Cartesian grids. W. Cao et al. [12] combined physics-informed neural networks [23] with grid transformation and achieved a reduction in error of nearly one order of magnitude compared to the second-order Finite Volume Method (FVM) on very sparse grids, solving the problem of local sharp transitions in airfoil flow fields. J. Hu and W. Zhang [13] also used grid transformation, retaining the geometric information of the fluid equations, and the prediction accuracy of the model with preprocessed input data was significantly higher than the prediction results of Principal Component Analysis (PCA). T. Zhao et al. [24] combined deep convolutional neural networks with a physics-constrained method based on the Euler equations, significantly reducing the dependence on the amount of training data and making the training process more stable. This approach is particularly relevant for complex flow regimes, including transonic flows [25]. Further advancements in physics-informed neural networks, such as cascade loss models, continue to enhance their capabilities [26].

Current research on deep learning-based flow field reconstruction mainly focuses on two-dimensional flow fields. However, in complex scenarios, two-dimensional flow fields lack crucial information in one dimension, and only three-dimensional flow fields can meet research needs, such as the airflow around vehicles, the aerodynamics of aircraft wings, and the rotating stall of compressor blade rows. To convert a three-dimensional flow field into a two-dimensional flow field for model input, it is usually necessary to perform slicing processing. This not only reduces the intuitiveness of flow field features but also requires a cumbersome preprocessing step [27]. Moreover, low-cost storage of large-scale three-dimensional simulation field data is more attractive compared to storing two-dimensional flow field data. Furthermore, the transition from studying two-dimensional flow fields to three-dimensional flow fields helps to further research and apply deep learning techniques in the field of fluid dynamics [28]. For example, Bruni et al. [29] have successfully applied deep learning to multi-stage axial compressors, demonstrating its potential for rapid design iteration and analysis in complex turbomachinery. Therefore, three-dimensional models developed for three-dimensional input are more conducive to engineering applications such as real-time fluid control, field analysis, and online prediction tasks.

In this study, we propose an end-to-end, data-driven framework for three-dimensional compressor flow field prediction that maps blade geometry and freestream conditions directly to three-dimensional steady-state flow field distributions. Unlike existing approaches that rely on intermediate surrogate modeling, dimensionality reduction, or scalar output regression (e.g., lift or loss coefficients), our proposed method performs full-field prediction without intermediate reconstruction steps, enabling a more direct and physically comprehensive mapping from design parameters to flow solutions. This framework integrates coordinate transformation, U-Net-based encoding–decoding, and physics-informed loss constraints. While each component exists in prior works, our approach introduces several key improvements that significantly enhance prediction performance and physical fidelity in compressor applications, including the integration of attention mechanisms [30] and advanced network components: (1) The model performs direct 3D flow field prediction without requiring 2D slicing or flow reconstruction, thereby preserving the spatial coherence and reducing preprocessing complexity. (2) We adopt a variable-specific decoder structure with separate optimization paths for pressure, temperature, and velocity components, improving the treatment of multi-scale physical interactions. (3) The coordinate transformation module embeds first-order metric coefficients, and the training is guided by Euler-based residual loss, enabling accurate resolution of shocks, boundary layers, and wake structures in highly curved blade passages. These innovations enhance both the efficiency and interpretability of flow predictions, offering practical advantages for iterative compressor design and real-time aerodynamic analysis.

2. Materials and Methods

To establish a physically grounded predictive model, we first define the configuration of the studied system. The object of study is a subsonic compressor blade operating under steady flow conditions.

Figure 1 illustrates the chordwise shape of the CDA (Controlled Diffusion Airfoil) blade. The curved shape represents the cross-section of a blade, with A and B marking the leading and trailing edges. The straight line connecting A and B is the chord line, and its length is the chord length (b), consistent with Figure 1. The angle

θ

again denotes the stagger angle or blade setting angle, specifying the airfoil’s orientation. The internal circles, often used conceptually, illustrate the concepts of camber and thickness, which define the airfoil’s shape.

f_{\max}

represents the maximum camber, which is the greatest perpendicular distance from the chord line to the mean camber line (the line equidistant from the upper and lower surfaces). Its longitudinal position from the leading edge along the chord is denoted by a. Similarly,

c_{\max}

is the maximum thickness of the airfoil, the greatest perpendicular distance between the upper and lower surfaces, and its axial location from the leading edge is indicated by e. In essence, this figure elucidates the fundamental parameters of an airfoil’s cross section, which directly influence its aerodynamic performance, such as lift generation and drag.

Figure 2 presents a two-dimensional representation of a linear blade cascade, typical in turbomachinery components such as compressors and turbines. It comprehensively displays the flow path through a single blade passage along with its associated velocity triangles and key geometric parameters. Lines 1-1 and 2-2 define the inlet and outlet flow sections, often parallel for linear cascades, indicating uniform flow conditions upstream and downstream. The curved lines represent the airfoil (blade) profiles, with points A and B denoting the leading and trailing edges, respectively. Geometrically, t signifies the pitch of the cascade, which is the perpendicular distance between corresponding points on adjacent blades, while b is the chord length, the straight-line distance between the leading and trailing edges of a single airfoil. From an aerodynamic perspective,

w_{1}

and

w_{2}

denote the relative flow velocities at the inlet and outlet of the cascade, respectively. The angles

β_{1}

and

β_{2}

are the inlet and outlet relative flow angles, measured between the relative velocity vectors and the axial direction (lines 1-1 and 2-2). The blade angles at the leading and trailing edges are represented by

β_{1 K}

and

β_{2 K}

, respectively. The incidence angle (i) is defined as the difference between the inlet blade angle and the inlet flow angle (

i = β_{1 K} - β_{1}

), indicating how directly the flow hits the leading edge. Conversely, the deviation angle (

δ

) is the difference between the actual outlet flow angle and the outlet blade angle (

δ = β_{2} - β_{2 K}

), reflecting the flow’s inability to perfectly align with the blade surface at the exit. The stagger angle (

θ

) defines the orientation of the entire blade, measured between the chord line (AB) and the axial direction. Additional angles,

β_{m}

and

β_{y}

, likely represent mean or geometric blade angles at the leading and trailing edges, respectively, often related to the mean camber line of the blade. The velocity triangle on the right, involving

w_{m}

(a mean velocity) and

w_{t}

(a tangential velocity component), provides a graphical depiction of the vector relationships between different velocity components and angles within the cascade, crucial for turbomachine performance analysis.

The overall workflow of the proposed prediction model is illustrated in Figure 3 and consists of four main stages: (1) CFD data preparation: We begin by generating a structured dataset of 3D steady-state CFD flow fields using a set of parametric compressor airfoils under varying inflow conditions. Each sample includes flow variables and geometric metadata. (2) Coordinate transformation (CoordTrans): The raw 3D spatial coordinates are passed through a geometry-aware transformation module, which normalizes the mesh and embeds relative geometric features into the input tensor. This module outputs transformed spatial information and blade-specific metrics (see “CoordTrans” in Figure 3). (3) Neural network inference: The transformed data, combined with global flow metadata, is fed into a U-Net-based 3D convolutional neural network (AeroFlow-U or PINN-AeroFlow-U), which predicts the full-field distribution of pressure, temperature, and velocity components. (4) Loss computation and training: During training, the model minimizes a composite loss function, including pointwise data loss, gradient consistency loss, and a physics-informed residual term based on the Euler equations. The training process is performed offline, while inference is nearly instantaneous. These four stages are visually mapped in Figure 3, which provides a schematic overview of the full learning pipeline.

2.1. Research Objectives

This study aims to construct a flow field prediction model, PINN-AeroFlow-U, based on the U-Net network architecture and incorporating physics-informed neural networks (PINNs). In the data preprocessing stage, the flow field data is mapped from the non-uniformly distributed physical space to a uniformly discrete computational space, enhancing the model’s ability to represent the complex features of the compressor flow field. In the model prediction output stage, a PINN is introduced to embed the Euler equations as constraints into the model’s loss function, further improving the model’s adherence to physical laws and prediction accuracy.

The object of this study is the subsonic compressor flow field. Under inviscid conditions, the Reynolds-Averaged Navier–Stokes (RANS) equations can be simplified to the Euler equations:

\begin{matrix} \frac{\partial ρ}{\partial t} + \nabla \cdot (ρ u) = 0, \end{matrix}

(1)

\begin{matrix} \frac{\partial (ρ u)}{\partial t} + \nabla \cdot (ρ u u) & = - \nabla p + ρ g, \end{matrix}

(2)

\begin{matrix} \frac{\partial E}{\partial t} + \nabla \cdot ((E + p) u) & = - u \cdot \nabla p + Q . \end{matrix}

(3)

In the above equations,

ρ

is the fluid density,

u

is the fluid velocity vector, p is the fluid pressure,

g

is the body force (such as gravity), E is the total energy of the fluid, and Q is the heat source term per unit volume.

In this paper, the initial geometry and inlet conditions I, grid coordinate information N, deep learning model f, and flow field variables F and their relationships are defined as

\begin{matrix} I & = [M a_{\infty}, A o A_{\infty}, C_{max}, θ, β], \end{matrix}

(4)

\begin{matrix} F & = [p, ρ, t, u, v, w], \end{matrix}

(5)

\begin{matrix} F & = f (I, N) . \end{matrix}

(6)

In Equation (6),

M a_{\infty}

represents the airflow Mach number at the inlet,

A o A_{\infty}

represents the angle of attack,

C_{max}

represents the maximum relative thickness,

θ

represents the blade camber angle, and

β

represents the airflow inlet angle.

The process of solving the model can be defined as optimizing an approximate mapping function

\hat{f}

. This function takes the initial conditions I and grid information N as input, with the true output being the flow field variables F and the model’s output being

\hat{F}

. The training objective of the model is to make the approximate mapping function

\hat{F} = \hat{f} (I, N, λ)

as close as possible to the true mapping function

F = f (I, N)

. This is a deep learning method for mining physical mapping rules from flow field data, where

λ

are the hyperparameters of the model training. The deep learning training process in this paper can be expressed as

\underset{λ}{arg min} {∥f (I, N) - \hat{f} (I, N, λ)∥}_{L^{2}}

(7)

2.2. Data Preprocessing

To reduce the errors faced by the PINN-AeroFlow-U model when predicting complex flow field regions such as the wall boundary layer and the blade leading edge, the flow field is subjected to coordinate transformation before training. This maps the CFD physical space grid coordinates to a Cartesian coordinate system. In this paper, the transformation matrix is defined as

T = [\begin{matrix} \partial_{x} ξ & \partial_{y} ξ & \partial_{z} ξ \\ \partial_{x} η & \partial_{y} η & \partial_{z} η \\ \partial_{x} ζ & \partial_{y} ζ & \partial_{z} ζ \end{matrix}] = {[\begin{matrix} \partial_{ξ} x & \partial_{η} x & \partial_{ζ} x \\ \partial_{ξ} y & \partial_{η} y & \partial_{ζ} y \\ \partial_{ξ} z & \partial_{η} z & \partial_{ζ} z \end{matrix}]}^{- 1}

(8)

and normalized according to the index

(i, j, k)

of the physical space coordinate system, yielding

ξ = \frac{i - 1}{(i_{\max} - 1)}

,

η = \frac{j - 1}{(j_{\max} - 1)}

, and

ζ = \frac{k - 1}{(k_{\max} - 1)}

.

The mapping from equidistant Cartesian space to the curvilinear coordinate frame is achieved by a geometry-conditioned projection, wherein each grid point is represented by a normalized distance vector to the nearest blade surface along the local streamwise direction. This transformation embeds the geometric context into the input tensor while maintaining spatial locality. Although no explicit Jacobian matrix or determinant is computed, the network retains sensitivity to geometric deformation through learned spatial filters applied over the transformed domain. Since both the input and output of the neural network are defined in the same transformed space, an inverse mapping is not explicitly required.

Figure 4 shows the mapping relationship between the physical space and computational space coordinates of the CDA (Controlled Diffusion Airfoil). Figure 4a shows the physical space coordinates (i.e., under the

(x, y, z)

coordinates), and Figure 4b shows the computational space (i.e., under the

(ξ, η, ζ)

coordinates, where

ξ, η, ζ \in

(0,1)).

Figure 4a shows a schematic diagram of generating a body-fitted grid in physical space, where

Γ_{1}

represents the blade surface boundary coordinates,

Γ_{3}

represents the coordinates of the flow field boundary, and

Γ_{2}

and

Γ_{4}

represent the internal interface of the computational domain. There is a one-to-one correspondence between the points in the physical plane and the points in the computational plane.

As shown in Figure 4b, the coordinate transformation method maps complex geometric regions (such as the flow region around a blade) to a Cartesian grid, allowing numerical calculations to be performed using equidistant grids without the need for dense unstructured grids around every complex geometric feature. In this process, local grid refinement is performed near the blade and in the boundary layer region through grid stretching and deformation. This improves the resolution of critical flow field regions while maintaining computational efficiency, thereby better capturing the flow details of complex flow features such as boundary layers, separation zones, and shock waves [28]. Considering that the coordinate transformation is bijective and can be accurately inversely mapped back to the original physical space, the solution obtained by the model in the computational space is equivalent to the solution in the original physical space.

To enhance numerical stability and geometric fidelity, additional implementation details were adopted in the coordinate transformation process. The shortest distance d from each grid point to the airfoil surface was computed using a nearest-neighbor projection algorithm. Specifically, the airfoil surface was discretized into a triangulated mesh, and for each point in the computational domain, the Euclidean distance to the closest triangle was computed using a k-d tree structure to accelerate nearest-point queries. This ensures efficient and accurate approximation of wall proximity, which is critical for capturing boundary layer effects. To address edge effects near the domain boundaries during the transformation, we introduced buffer zones in the computational grid. Nodes located within two grid layers of the outer boundary were excluded from gradient-based loss computations, preventing potential artifacts due to extrapolation or metric distortion. Furthermore, we applied a smoothing filter to the metric coefficients near the edges to suppress numerical noise and ensure a stable input field for network training.

Different compressor blade geometries exhibit the same boundary conditions in the computational plane

(ξ, η, ζ)

, while the difference in governing equations is reflected in different metric coefficients within the governing equations. The model uses first-order metric coefficients to represent the grid information in the computational space

(ξ_{x}, ξ_{y}, ξ_{z}, η_{x}, η_{y}, η_{z}, ζ_{x}, ζ_{y}, ζ_{z})

, the coordinates of the airfoil surface points to represent the airfoil geometric information

(x_{0}, y_{0}, z_{0})

, and the shortest distance d from each coordinate point in the flow field to the airfoil surface as a supplement to the coordinate information [26]. Finally, the grid information and initial conditions can be expressed as

Metrics = [ξ_{x}, ξ_{y}, ξ_{z}, η_{x}, η_{y}, η_{z}, ζ_{x}, ζ_{y}, ζ_{z}]

(9)

N_{i_{max} \times j_{max} \times k_{max} \times 13} = [Metrics, x_{0}, y_{0}, z_{0}, d]

(10)

I_{i_{max} \times j_{max} \times k_{max} \times 5} = [M a_{\infty}, A o A_{\infty}, C_{max}, θ, β]

(11)

2.3. PINN-AeroFlow-U Network Architecture

This study proposes a flow field prediction model named PINN-AeroFlow-U, the architecture of which is shown in Figure 5. The model is constructed in an encoder–decoder manner, with a total of 4 encoders and 4 decoders. Each encoder consists of a double convolutional layer containing two convolutional operations and a max-pooling layer for downsampling, while each decoder consists of a double convolutional layer that also performs two convolutional operations, paired with a transposed convolutional layer for upsampling. Skip connections are used to concatenate the encoder and decoder feature maps at the same level to preserve original data information [9]. To further enhance the model’s ability to extract spatial information, PINN-AeroFlow-U appends a Convolutional Block Attention Module (CBAM) to the last upsampling block of the decoder.

The input of the model consists of 18 variables related to airfoil geometry and inflow conditions from Equations (10) and (11), including the grid information in the computational space

(ξ_{x}, ξ_{y}, ξ_{z}, η_{x}, η_{y}, η_{z}, ζ_{x}, ζ_{y}, ζ_{z})

, the coordinates of the airfoil surface points

(x_{0}, y_{0}, z_{0})

, the shortest distance d from each coordinate point in the flow field to the airfoil surface, the inlet Mach number

M a_{\infty}

, the angle of attack

A o A_{\infty}

, the maximum relative thickness

C_{max}

, the airfoil camber angle

θ

, and the inlet flow angle

β

. The output consists of the 5 flow field variables listed in Equation (5), including the pressure field, the temperature field, and the velocity fields in the x, y, and z directions.

PINN-AeroFlow-U employs independent decoders for different flow field variables (such as pressure, temperature, and the three velocity components). This architecture allows the model to be optimized for each channel, thereby improving the prediction accuracy for flow field data containing complex features. The separated decoder structure also helps to reduce interference between variables during the normalization process.

The skip connection mechanism of PINN-AeroFlow-U directly concatenates the feature maps from the encoder to the feature maps in the decoder. This is equivalent to fusing high-level global shape and structural information with low-level edge and texture information, which can capture multi-scale details while preserving the original details of the input data.

By introducing the CBAM module, the PINN-AeroFlow-U model enhances its ability to capture spatial and channel dependencies in the flow field data. The spatial attention module can learn the weights of different positions in the spatial dimension of the image, thereby highlighting the key regions in the image [30]. This mechanism can help the model more accurately locate important features in the flow field, such as boundary layers and separation regions. The channel attention module enhances the model’s ability to capture flow field features by learning the importance of different channels, which helps the model distinguish the important information of different physical quantities (such as velocity, pressure, and temperature). With the help of the attention mechanism, the model can adaptively filter and focus on the features that play a decisive role in the prediction task. This improves the model’s adaptability to new data, giving it good generalization performance.

The input to the PINN-AeroFlow-U model is a tensor of dimensions

128 \times 64 \times 32 \times 18

. Here, 128 corresponds to the flow passage length, 64 to the flow passage width (or pitch), and 32 to the blade height. The last dimension, 18, represents the number of input variables or channels (e.g., coordinates, flow parameters).

After passing through the PINN-AeroFlow-U’s first input layer, this tensor is transformed into

128 \times 64 \times 32 \times 64

. This is because the U-Net typically standardizes the channel dimension to 64 for subsequent processing stages. Subsequently, during each downsampling step, the spatial dimensions (length, width, height) are halved, while the number of channels is doubled.

2.4. Dataset Preparation

To verify the generalization ability and final prediction accuracy of the PINN-AeroFlow-U model, a batch of airfoils was constructed, and the flow field variable data of the compressor airfoils was obtained by CFD calculations under certain operating conditions. Data was extracted from the CFD data using a body-fitted grid method, a curvilinear coordinate system was constructed, and the coordinates were normalized to establish the dataset required for model training. Under supersonic and high angle-of-attack conditions, the flow field around the compressor blades is extremely complex. Shock wave discontinuities frequently occur, and flow separation phenomena are also relatively strong. Affected by this, the flow field data obtained by CFD calculations under these complex conditions has the problem of poor accuracy. Considering the calculation accuracy of CFD and the fitting ability of the PINN-AeroFlow-U model, the operating conditions and design parameters of the training dataset are shown in Table 1.

In total, 2560 CFD simulations were generated by combining 40 unique airfoil geometries with 64 variations in freestream and geometric parameters. Of these, 2304 samples from 36 airfoils were used for training, while the remaining 256 samples from 4 entirely separate airfoils formed the test set. This split ensures that the evaluation reflects the model’s ability to generalize to previously unseen blade shapes and operating conditions.

The computational domain for each CFD case includes a single blade passage, with pitchwise periodic boundaries to reflect cascade behavior. The axial inlet boundary is positioned 1.0 chord length upstream of the leading edge, and the outlet is placed 1.5 chord lengths downstream of the trailing edge. These distances ensure that the inflow develops naturally before interacting with the blade and that downstream pressure recovery is captured without artificial reflection. The spanwise domain height equals one blade span and is discretized uniformly. This domain setup follows established practices in compressor CFD studies [31,32] to minimize boundary-induced distortions while maintaining computational efficiency.

A CDA (Controlled Diffusion Airfoil) is used for the compressor airfoil. This type of airfoil prevents boundary layer separation by controlling the diffusion of airflow on the suction surface and overcomes the airflow separation phenomenon at the blade tip and root [1]. Forty CDAs with different shapes were designed for CFD simulation, of which the flow field data corresponding to 36 airfoils was used for model training, and the remaining 4 airfoils were used to test the model’s generalization ability.

Ten operating conditions were set with uniform intervals within the ranges shown in Table 1, and CFD calculations were performed to generate flow field data. The CFD simulation results were extracted using a structured grid data structure, and a schematic diagram of the structured grid node distribution is shown in Figure 6. The distribution of grid nodes in the computational space after coordinate transformation is shown in Figure 7.

The dimensions of the computational space in the three directions are 128, 64, and 32, respectively. The parameter dimension at each grid node is 18, including physical variables such as (

M e t r i c s, x_{0}, y_{0}, z_{0}, d, M a_{\infty}, A o A_{\infty}, C_{max}, θ, β

). The dimensions of the entire dataset are 40 × 10 × 128 × 64 × 32 × 18, and the specific data allocation is shown in Table 2.

Since there is a significant difference in the numerical ranges of different variables in Equations (10) and (11), to avoid negative effects on model training due to feature imbalance, leading to some features dominating the training process while others are ignored, it is necessary to standardize the input variables. The specific calculation formula is as follows:

γ_{j} = \frac{γ_{i} - γ_{min}}{γ_{max} - γ_{min}}

(12)

where

γ_{max}

and

γ_{min}

represent the global maximum and minimum values of the input variable, respectively.

To ensure the numerical accuracy of the CFD data used for model training and evaluation, a mesh independence study was performed. A representative compressor airfoil case (

{Ma}_{\infty}

= 0.6), (

{AoA}_{\infty}

= 2.0°, Cmax = 0.08,

θ

= 20°) was simulated using three structured mesh resolutions: coarse, medium, and fine. The characteristics of each grid, along with comparisons of key flow quantities, are summarized below (Table 3). The differences in surface pressure coefficient (

C_{p}

) and total pressure loss between the medium and fine grids were found to be within 2.7% and 1.9%, respectively. These values indicate that the CFD results using the medium-resolution mesh are sufficiently accurate and mesh-independent for use in constructing the training and testing datasets. Therefore, the medium-resolution grid was adopted for all CFD simulations in this study.

The CFD simulations were performed using steady-state RANS equations with the Spalart–Allmaras turbulence model. Boundary conditions were defined as follows: a uniform total pressure and total temperature were imposed at the inlet, along with prescribed Mach number and angle of attack; a static pressure condition was applied at the outlet to control the operating point. Blade surfaces were modeled as no-slip, adiabatic walls with near-wall y⁺ values below 1.0 to ensure boundary layer resolution. Periodic boundaries were enforced on the pitchwise sides to replicate cascade flow behavior, while hub and shroud surfaces were treated as slip walls. These settings follow standard practices for subsonic axial compressor simulations and were applied consistently across all cases to ensure physical reliability and dataset uniformity.

3. Model Training and Results Discussion

This section focuses on a detailed exposition of the model training strategies and an in-depth analysis of the experimental results. Considering the performance differences of the AeroFlow-U model before and after the introduction of the PINN, we will meticulously discuss both the AeroFlow-U model (without the PINN loss function) and the model after incorporating PINN-AeroFlow-U. This includes the construction of the loss function, the selection of training hyperparameters, the configuration of the computing platform, the loss reduction trends, and a comparative analysis of the predicted results (pressure, temperature, and velocity components in different directions) with CFD.

3.1. Model Training Strategy

3.1.1. AeroFlow-U Loss Function

The training objective of the AeroFlow-U network is to minimize the loss function between the predicted output and the ground truth data. This loss function consists of three parts: Mean Square Error loss (MSE), Gradient loss (Grad), and Airfoil Surface loss (Layer). The specific formula is

Loss = MSE + α_{1} \cdot Grad + α_{2} \cdot Layer

(13)

The weighting coefficients

α_{1}

and

α_{2}

were selected through a grid search over candidate values ranging from

10^{- 3}

to 100. The optimal setting (

α_{1} = 1.0

,

α_{2} = 0.01

) was chosen based on the minimum mean absolute error (MAE) averaged across all five predicted flow variables (pressure, temperature, and velocity components) on a held-out validation set. This configuration was found to yield a stable balance between data fidelity and physical consistency during training.

The MSE loss function is one of the most commonly used loss functions in regression problems. It measures the model’s prediction performance by calculating the squared difference between the model’s predicted value

{\hat{Y}}_{n, i, j, k}

and the actual target value

Y_{n, i, j, k}

.

{L o s s}_{M S E} = \frac{1}{n_{y} \cdot H \cdot W \cdot D} \sum_{n = 1}^{n_{y}} \sum_{i = 1}^{H} \sum_{j = 1}^{W} \sum_{k = 1}^{D} {(Y_{n, i, j, k} - {\hat{Y}}_{n, i, j, k})}^{2}

(14)

Let H, W, and D denote the height, width, and depth of the 3D grid used to represent each flow field sample. In our setting, these correspond to the number of grid points along the Y, X, and Z directions, respectively. The indices i, j, k iterate over the grid points in these respective dimensions.

The Grad loss function is used to improve the continuity of the flow field predicted by the model. This loss function measures the smoothness of the model’s predictions by calculating the derivatives of the flow field variables along the i, j, and k directions, denoted as

\frac{\partial}{\partial h}

,

\frac{\partial}{\partial w}

, and

\frac{\partial}{\partial d}

.

When calculating the Grad loss, a second-order central difference scheme is used to approximate the first and second derivatives. The central difference scheme is a numerical differentiation method that uses the values of a function near a specific point to estimate the derivative at that point. The second-order central difference scheme is chosen for its high accuracy and low sensitivity to computational errors. The calculation formula for the first-order derivative using this method is

\frac{\partial Y_{n, i, j, k}}{\partial h} \approx \frac{Y_{n, i + 1, j, k} - Y_{n, i - 1, j, k}}{2 Δ h}

(15)

For the second-order derivative, the calculation method is extended to

\frac{\partial^{2} Y_{n, i, j, k}}{\partial h^{2}} \approx \frac{Y_{n, i + 1, j, k} - 2 Y_{n, i, j, k} + Y_{n, i - 1, j, k}}{{(Δ h)}^{2}}

(16)

Similarly, the derivatives in the w and d directions can also be calculated. The Grad loss function comprehensively considers both the first and second derivatives, and its expression is

\begin{matrix} {L o s s}_{G r a d} = & \frac{1}{n_{y} \cdot H \cdot W \cdot D} \sum_{n = 1}^{n_{y}} \sum_{i = 1}^{H} \sum_{j = 1}^{W} \sum_{k = 1}^{D} ({(\frac{\partial}{\partial h} Y_{n, i, j, k} - \frac{\partial}{\partial h} {\hat{Y}}_{n, i, j, k})}^{2} \\ + {(\frac{\partial}{\partial w} Y_{n, i, j, k} - \frac{\partial}{\partial w} {\hat{Y}}_{n, i, j, k})}^{2} + {(\frac{\partial}{\partial d} Y_{n, i, j, k} - \frac{\partial}{\partial d} {\hat{Y}}_{n, i, j, k})}^{2} \\ + {(\frac{\partial^{2}}{\partial h^{2}} Y_{n, i, j, k} - \frac{\partial^{2}}{\partial h^{2}} {\hat{Y}}_{n, i, j, k})}^{2} + {(\frac{\partial^{2}}{\partial w^{2}} Y_{n, i, j, k} - \frac{\partial^{2}}{\partial w^{2}} {\hat{Y}}_{n, i, j, k})}^{2} \\ + {(\frac{\partial^{2}}{\partial d^{2}} Y_{n, i, j, k} - \frac{\partial^{2}}{\partial d^{2}} {\hat{Y}}_{n, i, j, k})}^{2}) \end{matrix}

(17)

The Layer loss function focuses on the model’s prediction performance near the airfoil surface, which is crucial for calculating distributed and concentrated forces. The formula for the Layer loss is

{L o s s}_{L a y e r} = \frac{1}{n_{y} \cdot H \cdot D} \sum_{n = 1}^{n_{y}} \sum_{i = 1}^{H} \sum_{k = 1}^{D} {(Y_{n, i, 1, k} - {\hat{Y}}_{n, i, 1, k})}^{2}

(18)

3.1.2. PINN-AeroFlow-U Loss Function

To make the model predictions more accurate and stable, the Euler equation constraint (Euler equation error) is added to the output of the PINN-AeroFlow-U model. The Euler equations are a set of partial differential equations in fluid dynamics that describe the motion of inviscid fluids, representing the fundamental physical laws of fluid motion. Physics-informed neural networks (PINNs) require less training data compared to purely data-driven deep learning methods [23,26] and are more consistent with physical laws. The loss function formula for PINN-AeroFlow-U is

Loss = MSE + α_{1} \cdot Grad + α_{2} \cdot Layer + α_{3} \cdot Euler

(19)

In three-dimensional flow field problems, the fluid state vector U is defined as

U = {[ρ, ρ u, ρ v, ρ w, E]}^{T}

(20)

where

ρ

is the fluid density,

u, v, w

are the velocity components of the fluid in three-dimensional space, and E is the total energy of the fluid. The fluid flux functions

F (U)

,

G (U)

, and

H (U)

are defined as

\{\begin{matrix} F (U) = {[ρ u, ρ u^{2} + p, ρ u v, ρ u w, u (E + p)]}^{T} \\ G (U) = {[ρ v, ρ u v, ρ v^{2} + p, ρ v w, v (E + p)]}^{T} \\ H (U) = {[ρ w, ρ u w, ρ v w, ρ w^{2} + p, w (E + p)]}^{T} \end{matrix}

(21)

where p is the fluid pressure. The conservation form of the Euler equations is

\frac{\partial U}{\partial t} + \nabla \cdot F (U) + \nabla \cdot G (U) + \nabla \cdot H (U) = 0

(22)

In the model of this paper, since the research focuses on steady flow problems, the time derivative term

\frac{\partial U}{\partial t}

is omitted, and the Euler equations are simplified to

\nabla \cdot F (U) + \nabla \cdot G (U) + \nabla \cdot H (U) = 0

(23)

The Euler loss function is constructed by calculating the residuals of the flow field variables predicted by the model with respect to the Euler equations. Specifically, the Euler loss function is defined as

\begin{matrix} {L o s s}_{E u l e r} = & \frac{1}{T_{f}} \sum_{x \in T_{f}} |\frac{F (U_{N N (i + 1)}) - F (U_{N N (i - 1)})}{2 Δ x} + \frac{G (U_{N N (j + 1)}) - G (U_{N N (j - 1)})}{2 Δ y} \\ {+ \frac{H (U_{N N (k + 1)}) - H (U_{N N (k - 1)})}{2 Δ z}|}^{2} \end{matrix}

(24)

where

T_{f}

is the fluid domain, and

U_{N N (i + 1)}, U_{N N (i - 1)}, U_{N N (j + 1)}, U_{N N (j - 1)}, U_{N N (k + 1)}

, and

U_{N N (k - 1)}

are the predicted values of the model at adjacent grid points. By minimizing the Euler loss, the model is guided to predict flow fields that satisfy the laws of physical conservation.

3.1.3. Computational Platform Configuration

In this study, the model was trained with parallel acceleration on a Tesla V100 platform. During the training process, the batch size was set to 10, the initial learning rate was 0.001, and the Adam optimization algorithm was used to update the model’s weights and biases. The number of training epochs was set to 1000, and a dynamic learning rate adjustment strategy was adopted: when training reached 800 epochs, the learning rate was multiplied by 0.1 to promote the model to reach the optimal convergence state. In terms of total training time, the full PINN-AeroFlow-U model was trained for 1000 epochs using four NVIDIA Tesla V100 GPUs in parallel. The complete training process on the full dataset (36 airfoils × 10 operating conditions) required approximately 42 h. To enhance training stability, gradient clipping and dynamic learning rate scheduling were applied during the final 200 epochs. Regarding scalability, the model demonstrates efficient generalization. For new compressor blade geometries not present in the training set, the model can be fine-tuned using only two to four new geometries, with convergence typically achieved within 2–4 h. This makes it feasible for rapid iteration in real-world aerodynamic design tasks.

The detailed software and hardware configuration information involved in model training is listed in Table 4.

3.1.4. Model Convergence

Figure 8 presents the convergence curves of the loss function during training. As shown in Figure 8, the loss of the PINN-AeroFlow-U model on both the training and test sets decreases rapidly before reaching 200 training epochs. After 200 training epochs, the loss function curves show some fluctuations. However, after completing 800 training epochs, with the dynamic adjustment of the learning rate, the loss function curves tend to stabilize. It was calculated that the average mean square error of a single batch of the model on the training set is

6.35 \times 10^{- 6}

, and the average mean square error on the test set is

5.24 \times 10^{- 4}

. The above results indicate that the model has a good convergence effect.

4. Results

4.1. Analysis

The study compared the differences between the CFD calculation results and the PINN-AeroFlow-U prediction results to analyze the prediction capability and limitations of PINN-AeroFlow-U.

This paper tested the calculation and prediction results of the compressor airfoil flow field under the following inflow conditions: Ma = 0.60, AoA = −2.0°, and design parameters

θ_{\infty}

= 16.675°,

C_{max} = 0.062

. The experimental results analysis is shown in Figure 9. The leftmost column of contour maps shows the relevant variable results predicted by the AeroFlow-U network from top to bottom; the middle column represents the temperature, pressure, and three velocity component flow fields obtained by CFD calculations; and the rightmost column shows the relative error between the CFD calculation and the PINN-AeroFlow-U network prediction results.

The experimental results show that the average inference time of the PINN-AeroFlow-U model for the flow field under each operating condition of the test set is 1 s, while the average prediction time of the CFD method is 30 min. The inference speed of the PINN-AeroFlow-U model is significantly higher than the CFD simulation speed.

Observing the results in Figure 9, it can be found that the temperature, pressure, and x, y, z direction velocity contour maps obtained by the PINN-AeroFlow-U network prediction are highly consistent with the CFD simulation results. This indicates that the model can accurately capture the basic characteristics of the high-temperature, high-pressure, and low-speed regions formed at the blade leading edge due to airflow stagnation. This not only demonstrates the ability of the PINN-AeroFlow-U network to capture flow field details but also further confirms the accuracy of its three-dimensional predictions.

In-depth analysis of the relative error contour maps shows that the relative temperature error is controlled within a small range of 0 to 0.007, the relative pressure error is between 0 and 0.015, while the relative errors of the x, y, and z direction velocities are relatively larger, ranging from 0 to 0.02.

To evaluate the impact of the gradient constraint loss function on the computational results, we employed an ablation study method, comparing three scenarios: without gradient constraint, with gradient constraint, and based on original CFD data. By observing the spatial distribution and numerical variations of various physical quantities (such as pressure, temperature, and velocity) under different schemes, we can not only verify the role of gradient constraints in improving model performance and numerical stability but also clearly identify the importance and limitations of this constraint within specific computational regions.

For this purpose, we selected two representative airfoil designs (as shown in Figure 10), and we present the flow field distribution at three crucial spanwise sections (relative positions of 0.1, 0.5, and 0.9). Choosing these three spanwise locations allows us to reflect the subtle differences in flow field variations along the span. The results are visualized in Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16.

To evaluate the effectiveness of the gradient constraint, we compared the prediction results of the PINN-AeroFlow-U model with and without the gradient constraint. The initial conditions for the inlet Mach number varying along the blade height h according to the formula are as follows:

M a = - 62.371 \cdot h^{4} + 62.914 \cdot h^{3} - 22.813 \cdot h^{2} + 3.5373 \cdot h + 0.4891

.

To avoid excessive image space due to displaying too many variables, the prediction results only showcase the comparison of the three velocity components, as the prediction of velocity variables is often the most challenging. This comparison is presented in Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16. The analysis reveals that the model without the gradient constraint performs relatively poorly. The prediction of the x-direction velocity field exhibits not only very obvious sawtooth-like artifacts but also significant fluctuations, severely impacting the prediction accuracy.

In contrast, the velocity field prediction with the gradient constraint not only aligns with the CFD results in terms of flow direction but also demonstrates higher accuracy and better detail in regions with large velocity changes, such as the wake and boundary layer, while eliminating those severe sawtooth-like fluctuations. It is particularly important to note that under these inlet conditions, shock waves may appear in the flow field. While both models might predict the shock waves, the prediction from the unconstrained model tends to be somewhat blurry. The constrained model, however, typically yields results for complex flow features like shock waves and potential separations that are closer to CFD in terms of position, strength, and clarity, appearing more realistic.

In conclusion, the comparison results clearly demonstrate that adding a gradient constraint to the PINN-AeroFlow-U model significantly improves the prediction accuracy and realism under the given inlet conditions. Compared to the model without the constraint, the constrained model not only predicts the overall flow field more accurately but also shows greater precision in displaying details (especially in regions of rapid change) and predicting complex phenomena such as shock waves, separations, and wakes. Consequently, its prediction results are closer to the CFD simulations.

To facilitate interpretation of the comparative prediction results shown in Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16, we provide in Table 5 the full parameter matrix corresponding to each test case. Each row represents a specific combination of inflow and geometric parameters used for model evaluation and visualization. Each profile corresponds to a specific CDA blade shape selected from the test dataset, and the conditions cover typical subsonic compressor operating regimes. These cases were selected to assess the model’s performance across varying degrees of flow separation, wake formation, and shock presence.

4.2. AeroFlow-U vs. PINN-AeroFlow-U

Although the coordinate transformation-based AeroFlow-U model has demonstrated good prediction performance, there is still room for improvement in regions with drastic flow changes and large gradients. To further enhance the model’s ability to handle complex physical phenomena, we introduced PINN constraints based on the Euler equation loss, see function (24), to further improve the model’s prediction performance in specific regions.

To intuitively compare the prediction accuracy of the AeroFlow-U and PINN-AeroFlow-U models, Figure 17 shows a comparison of the prediction results of the AeroFlow-U and PINN-AeroFlow-U models and the CFD numerical simulation results under a certain operating condition. As can be seen from Figure 17, the jagged traces in the flow field prediction are reduced, the boundaries of numerical changes become smoother and closer to the results calculated by CFD data, and the performance in the more complex flow field region near the hub is better than that of the AeroFlow-U model using only the U-Net network architecture. This phenomenon indicates that the introduction of PINN constraints helps to enhance the model’s overall grasp of flow field continuity.

In addition, the PINN-AeroFlow-U model also shows higher accuracy in predicting the high-temperature, high-pressure, and low-speed regions formed at the blade leading edge due to airflow stagnation. The numerical boundaries of these regions are closer to the CFD simulation results, indicating that the introduction of PINN improves the model’s ability to capture flow field details. This improvement is attributed to the PINN guiding the model’s training process by embedding the physical laws of the flow field, such as mass and momentum conservation.

The experimental results show that the coordinate transformation-based AeroFlow-U model has already demonstrated considerable accuracy in overall flow field prediction and has significant advantages in computational efficiency. Compared with traditional methods, its prediction error is significantly reduced, fully demonstrating the improvement effect of coordinate transformation on the model’s representation ability. After further incorporating PINN constraints, the prediction accuracy in regions with drastic flow changes has been further improved, especially reflected in the accurate capture of local physical quantities.

4.3. Physical Interpretation of Flow Features

To assess the physical reliability of the predicted flow fields, we compared PINN-AeroFlow-U outputs with CFD results in terms of key aerodynamic phenomena commonly observed in axial compressors, namely, shock formation, boundary layer separation, and wake development. These features are crucial to compressor performance and flow stability, and their accurate reproduction reflects the engineering utility of the model.

In cases involving higher inlet Mach numbers (e.g.,

M a_{\infty}

= 0.7), the model correctly identifies shock waves forming near the blade leading edge. This is evidenced by the sudden increase in predicted pressure and temperature, closely matching CFD distributions. These shock structures appear at physically expected positions, indicating that the model has effectively learned compressibility effects and flow deceleration phenomena.

In addition, the model demonstrates good capability in capturing near-wall flow behavior. Velocity field predictions reveal boundary layer separation on the suction surface in regions with adverse pressure gradients, particularly near the mid-span. The inclusion of gradient constraints enhances the accuracy and smoothness of these predictions. Furthermore, wake regions downstream of the blade trailing edge are clearly resolved, exhibiting the characteristic low-speed streaks and velocity deficits typical of viscous wake development. These results underscore the model’s ability to maintain physical consistency in complex flow regimes, thereby supporting its use in design and analysis workflows where physical interpretability is essential.

4.4. Additional Generalization Assessment

To assess the model’s generalization performance beyond the initially tested four airfoils, we conducted additional evaluations focusing on both boundary geometries and out-of-distribution (OOD) conditions. The four test geometries used previously were selected to cover extreme cases in the design parameter space, including airfoils with the largest camber angles (

θ > 28^{\circ}

), thinnest and thickest profiles (

Cmax \in [0.03, 0.1]

), and operating points near stall margins. Furthermore, we synthesized five additional OOD cases by extrapolating beyond the training range, such as (

{AoA}_{\infty} = 7 . 5^{\circ}

) and

Cmax = 0.11

. While the prediction error increased moderately (pressure MAE rose from ∼1.1% to ∼2.4%), the model still produced physically plausible flow structures, capturing expected shock movements and boundary layer behavior. These results suggest that while the model performs best within the trained parameter envelope, it retains reasonable predictive capability in extrapolative settings, supporting its potential use in rapid design space exploration and early-stage concept evaluation.

4.5. Performance Comparison: With vs. Without PINN

To rigorously evaluate the contribution of the Euler equation residual in the loss function, we conducted a comparative analysis between two model variants: (1) AeroFlow-U, a baseline model utilizing the U-Net architecture with coordinate transformation and gradient/layer-based loss components; and (2) PINN-AeroFlow-U, which augments the baseline with a physics-informed constraint derived from the steady-state Euler equations. Both models were tested on an identical evaluation set, and their predictive accuracy was assessed across five fundamental flow field variables: pressure, temperature, and the three velocity components (u, v, w). The corresponding mean absolute error (MAE) and relative error (RE) metrics for each variable are reported in Table 6. As shown, the PINN-AeroFlow-U model consistently outperforms the baseline across all variables, with notable improvements in high-gradient quantities such as velocity. The relative error reduction is particularly significant in velocity predictions (averaging over 1.3% improvement), reflecting the Euler constraint’s effectiveness in preserving conservation laws and improving flow continuity.

4.6. Limitations and Future Work

While the proposed PINN-AeroFlow-U framework demonstrates promising performance in predicting three-dimensional compressor flow fields, several limitations should be acknowledged:

(1) Dependence on CFD training data: The model’s accuracy is inherently tied to the fidelity of the CFD simulations used for training. Although mesh independence was verified, uncertainties in turbulence modeling or boundary conditions may propagate into the network predictions.

(2) Limited operating condition range: The training dataset covers a predefined range of inlet Mach numbers, angles of attack, and geometric parameters. While the model showed reasonable extrapolation behavior (see Section 4.4), its robustness under extreme or unseen conditions (e.g., high-speed transonic or stall) has not yet been fully validated.

(3) Assumption of steady, inviscid flow: The current model is based on the Euler equations and targets steady-state predictions. As such, it does not account for viscous effects, unsteady dynamics, or turbulence phenomena, which may limit applicability in certain engineering contexts.

(4) High memory demand for 3D data: Although the model runs efficiently at inference, training with high-resolution 3D data requires substantial GPU memory and computational resources, potentially limiting scalability for real-time applications or large-scale optimization loops.

Future work will address these limitations by extending the dataset to include viscous and unsteady cases, incorporating RANS-based physics constraints, and exploring model compression or multi-fidelity training strategies to improve scalability and adaptability to broader aerodynamic problems.

5. Conclusions

In this study, we proposed a physics-informed, data-driven framework for direct prediction of three-dimensional steady-state flow fields over compressor blade passages. By integrating a coordinate-transformed U-Net architecture with Euler-residual-based loss constraints, the model is able to learn full-field flow variable distributions from blade geometry and inlet conditions. Extensive evaluations on a CFD-generated dataset demonstrate that the proposed method achieves accurate predictions with relative errors below 3% across pressure, temperature, and velocity components. The PINN-enhanced variant outperforms baseline architectures by 10–15.

The framework also shows robust generalization to previously unseen airfoil geometries, highlighting its potential applicability to rapid aerodynamic design and optimization workflows. Future work will focus on extending the model to transonic and unsteady flow regimes, incorporating more detailed turbulence features, and exploring transfer learning strategies across broader compressor configurations.

Author Contributions

Conceptualization, H.M.; Methodology, C.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding. The APC was funded by the authors.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Mayle, R.E. The 1991 IGTI Scholar Lecture: The Role of Laminar-Turbulent Transition in Gas Turbine Engines. J. Turbomach. 1991, 113, 509–536. [Google Scholar] [CrossRef]
Raje, P.; Parish, E.; Hickey, J.P.; Cinnella, P.; Duraisamy, K. Recent developments and research needs in turbulence modeling of hypersonic flows. Phys. Fluids 2025, 37, 031304. [Google Scholar] [CrossRef]
Chao, Y.; Feng, Q.; Yatian, Z.; Jian, Y.; Conghai, W.; Shuhai, Z. Review of development and challenges for physical modeling and numerical scheme of CFD in aeronautics and astronautics. Acta Aerodyn. Sin. 2020, 38, 829–857. [Google Scholar]
Zhang, W.; Zhu, L.; Liu, Y.; Kou, J. Application progress of machine learning in the con-struction of turbulence models. Acta Aerodyn. Sin. 2019, 37, 444–454. [Google Scholar] [CrossRef]
Zhang, W.; Kou, J.; Liu, Y. Prospect of artificial intelligence empowered fluid mechanics. Acta Aeronaut. Astronaut. Sin. 2021, 42, 524689. [Google Scholar]
Xu, Z.; Liu, J.; Chen, K.; Chen, Y.; Hu, Z.; Ni, B. AMR-Transformer: Enabling Efficient Long-range Interaction for Complex Neural Fluid Simulation. arXiv 2025, arXiv:2503.10257. [Google Scholar] [CrossRef]
Lv, Z.; Nie, X.; Zhao, A. Prediction of wing aerodynamic coefficient based on CNN. J. Beiging Univ. Aeronaut. Astronaut 2021, 49, 674–680. [Google Scholar]
Thuerey, N.; Weißenow, K.; Prantl, L.; Hu, X. Deep learning methods for Reynolds-averaged Navier–Stokes simulations of airfoil flows. AIAA J. 2020, 58, 25–36. [Google Scholar] [CrossRef]
Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015: 18th International Conference, Munich, Germany, 5–9 October 2015; Proceedings, Part III 18. Springer: Berlin/Heidelberg, Germany, 2015; pp. 234–241. [Google Scholar]
Bhatnagar, S.; Afshar, Y.; Pan, S.; Duraisamy, K.; Kaushik, S. Prediction of aerodynamic flow fields using convolutional neural networks. Comput. Mech. 2019, 64, 525–545. [Google Scholar] [CrossRef]
Chen, L.W.; Thuerey, N. Towards high-accuracy deep learning inference of compressible flows over aerofoils. Comput. Fluids 2023, 250, 105707. [Google Scholar] [CrossRef]
Cao, W.; Song, J.; Zhang, W. A solver for subsonic flow around airfoils based on physics-informed neural networks and mesh transformation. Phys. Fluids 2024, 36, 027134. [Google Scholar] [CrossRef]
Hu, J.; Zhang, W. Flow field modeling of airfoil based on convolutional neural networks from transform domain perspective. Aerosp. Sci. Technol. 2023, 136, 108198. [Google Scholar] [CrossRef]
de Vito, L.; Pinnau, N.; Dey, S. Implicit Neural Representation For Accurate CFD Flow Field Prediction. arXiv 2024, arXiv:2408.06486. [Google Scholar]
Yang, Z.; Dong, Y.; Deng, X.; Zhang, L. AMGNET: Multi-scale graph neural networks for flow field prediction. Connect. Sci. 2022, 34, 2500–2519. [Google Scholar] [CrossRef]
Battaglia, P.W.; Hamrick, J.B.; Bapst, V.; Sanchez-Gonzalez, A.; Zambaldi, V.; Malinowski, M.; Tacchetti, A.; Raposo, D.; Santoro, A.; Faulkner, R.; et al. Relational inductive biases, deep learning, and graph networks. arXiv 2018, arXiv:1806.01261. [Google Scholar]
Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst. 1989, 2, 303–314. [Google Scholar] [CrossRef]
Hornik, K. Approximation capabilities of multilayer feedforward networks. Neural Netw. 1991, 4, 251–257. [Google Scholar] [CrossRef]
Azizian, W.; Lelarge, M. Expressive power of invariant and equivariant graph neural networks. arXiv 2020, arXiv:2006.15646. [Google Scholar]
Babai, L. Graph isomorphism in quasipolynomial time. In Proceedings of the Forty-Eighth Annual ACM Symposium on Theory of Computing, Cambridge, MA, USA, 19–21 June 2016; pp. 684–697. [Google Scholar]
Levin, E.; Pieraccini, R.; Eckert, W. A stochastic model of human-machine interaction for learning dialog strategies. IEEE Trans. Speech Audio Process. 2000, 8, 11–23. [Google Scholar] [CrossRef]
Zanfir, A.; Sminchisescu, C. Deep learning of graph matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 2684–2693. [Google Scholar]
Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Zhao, T.; An, J.; Xu, Y.; He, G.; Qin, F. A physics-constrained and data-driven method for modeling supersonic flow. Phys. Fluids 2024, 36, 066118. [Google Scholar] [CrossRef]
Wassing, S.; Langer, S.; Bekemeyer, P. Physics-Informed Neural Networks for Transonic Flows around an Airfoil. arXiv 2024, arXiv:2408.17364. [Google Scholar]
Feng, Y.; Song, X.; Yuan, W.; Lu, H. Physics-informed neural networks based cascade loss model. J. Aerosp. Power 2023, 38, 1615–1625. [Google Scholar]
Nemati Taher, F.; Subaşı, A. A fast three-dimensional flow field prediction around bluff bodies using deep learning. Phys. Fluids 2024, 36, 025101. [Google Scholar] [CrossRef]
Yi, J.; Deng, F.; Qin, N.; Liu, X. Fast prediction of transonic flow field using deep learning method. Acta Aerodyn. Sin. 2022, 43, 526747. [Google Scholar] [CrossRef]
Bruni, G.; Maleki, S.; Krishnababu, S.K. Deep learning modeling of manufacturing and build variations on multistage axial compressors aerodynamics. Data-Centric Eng. 2025, 6, e9. [Google Scholar] [CrossRef]
Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
Abu-Zidan, Y.; Mendis, P.; Gunawardena, T. Optimising the computational domain size in CFD simulations of tall buildings. Heliyon 2021, 7, e06723. [Google Scholar] [CrossRef]
Li, J.; Liu, T.; Wang, Y.; Xie, Y. Integrated graph deep learning framework for flow field reconstruction and performance prediction of turbomachinery. Energy 2022, 254, 124440. [Google Scholar] [CrossRef]

Figure 1. Airfoil profile geometric parameters.

Figure 2. Turbomachinery blade cascade and flow velocity diagram.

Figure 3. Process of flow field prediction model.

Figure 4. Schematic diagram of CDA space transformation: (a) Physical plane. Here,

Γ_{1}

,

Γ_{2}

,

Γ_{3}

, and

Γ_{4}

denote the wall boundary, interface boundaries, and far-field boundary, respectively. The coordinate system is (x, y), and

ζ

indicates the grid stretching direction. Points a-s are key grid points on the boundaries. (b) Computational plane. Here,

Γ_{1}

to

Γ_{4}

represent the transformed boundaries in the computational domain with coordinate system (

ξ

,

η

). Point o is the origin, and points a–s correspond to those in the physical plane.

Figure 4. Schematic diagram of CDA space transformation: (a) Physical plane. Here,

Γ_{1}

,

Γ_{2}

,

Γ_{3}

, and

Γ_{4}

denote the wall boundary, interface boundaries, and far-field boundary, respectively. The coordinate system is (x, y), and

ζ

indicates the grid stretching direction. Points a-s are key grid points on the boundaries. (b) Computational plane. Here,

Γ_{1}

to

Γ_{4}

represent the transformed boundaries in the computational domain with coordinate system (

ξ

,

η

). Point o is the origin, and points a–s correspond to those in the physical plane.

Figure 5. Overview of the network architecture and data flow.

Figure 6. Schematic diagram of the physical space distribution of grid nodes.

Figure 7. Schematic diagram of the computational space distribution of grid nodes.

Figure 8. Training and verifying loss of PINN-AeroFlow-U.

Figure 9. Comparison between results of CFD calculations and PINN-AeroFlow-U predictions, and error map of CDA.

Figure 10. Geometry and compressor blade passage for two selected blade profiles: (a) profile A and (b) profile B.

Figure 11. Case A, axial pos: 0.05—comparison results of different physical variables and constraint types.

Figure 12. Case A, axial pos: 0.25—comparison results of different physical variables and constraint types.

Figure 13. Case A, axial pos: 0.45—comparison results of different physical variables and constraint types.

Figure 14. Case B, axial pos: 0.05—comparison results of different physical variables and constraint types.

Figure 15. Case B, axial pos: 0.25—comparison results of different physical variables and constraint types.

Figure 16. Case B, axial pos: 0.45—comparison results of different physical variables and constraint types.

Figure 17. Comparison between AeroFlow-U predictions and PINN-AeroFlow-U predictions and CFD simulation results.

Table 1. Initial condition parameter table of airfoil dataset.

Number of Airfoils	Initial Conditions
36 + 4	$M a_{\infty} \in (0.5, 0.7)$
	$A o A_{\infty} \in (- 3 . 0^{\circ}, 6 . 0^{\circ})$
	$θ_{\infty} \in (4^{\circ}, 30^{\circ})$
	$C_{max} \in (0.03, 0.1)$

Table 2. Dataset dimension allocation.

Dataset	Dimension
Train	36 × 10 × 128 × 64 × 32 × 18
Test	4 × 10 × 128 × 64 × 32 × 18

Table 3. Summary of mesh resolution and flow quantity differences for mesh independence study.

Grid Level	Total Cells	Blade Surface Nodes	Minimum First-Layer Y⁺	C_p Max Difference (%)	Total Pressure Loss Diff (%)
Coarse	1.1 million	∼4000	∼1.0	—	—
Medium	2.3 million	∼8000	∼0.8	2.7% (vs. fine)	1.9% (vs. fine)
Fine	4.6 million	∼16,000	∼0.6	—	—

Table 4. Training hardware and software configuration of model.

Name	Device Parameters
Platform	WindowsServer
CPU	AMD EPYC 7K83 64-Core Processor 2.54 GHz
GPU	Tesla V100
Cuda	11.7
PyTorch	1.13.1 + cu117

Table 5. Case matrix for test scenarios presented in Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16.

Case ID	Blade Profile	Ma_∞	AoA_∞ (°)	Cmax	$θ$ (°)
A-1	Profile A	0.6	−2	0.062	16.675
A-2	Profile A	0.6	2	0.062	16.675
A-3	Profile A	0.6	4	0.062	16.675
B-1	Profile B	0.65	−1.5	0.07	25
B-2	Profile B	0.65	1.5	0.07	25
B-3	Profile B	0.65	4	0.07	25

Table 6. Quantitative comparison between models with and without Euler-based PINN constraint.

Variable	Model	MAE	Relative Error (%)
Pressure	AeroFlow-U	8.54 $\times 10^{- 3}$	1.86
Pressure	PINN-AeroFlow-U	4.87 $\times 10^{- 3}$	1.06
Temperature	AeroFlow-U	7.12 $\times 10^{- 3}$	1.34
Temperature	PINN-AeroFlow-U	4.15 $\times 10^{- 3}$	0.79
Velocity (u)	AeroFlow-U	1.62 $\times 10^{- 2}$	3.41
Velocity (u)	PINN-AeroFlow-U	9.43 $\times 10^{- 3}$	2.01
Velocity (v)	AeroFlow-U	1.47 $\times 10^{- 2}$	3.12
Velocity (v)	PINN-AeroFlow-U	8.76 $\times 10^{- 3}$	1.92
Velocity (w)	AeroFlow-U	1.58 $\times 10^{- 2}$	3.26
Velocity (w)	PINN-AeroFlow-U	9.12 $\times 10^{- 3}$	1.95

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, C.; Ma, H. Research on a Rapid Three-Dimensional Compressor Flow Field Prediction Method Integrating U-Net and Physics-Informed Neural Networks. Mathematics 2025, 13, 2396. https://doi.org/10.3390/math13152396

AMA Style

Wang C, Ma H. Research on a Rapid Three-Dimensional Compressor Flow Field Prediction Method Integrating U-Net and Physics-Informed Neural Networks. Mathematics. 2025; 13(15):2396. https://doi.org/10.3390/math13152396

Chicago/Turabian Style

Wang, Chen, and Hongbing Ma. 2025. "Research on a Rapid Three-Dimensional Compressor Flow Field Prediction Method Integrating U-Net and Physics-Informed Neural Networks" Mathematics 13, no. 15: 2396. https://doi.org/10.3390/math13152396

APA Style

Wang, C., & Ma, H. (2025). Research on a Rapid Three-Dimensional Compressor Flow Field Prediction Method Integrating U-Net and Physics-Informed Neural Networks. Mathematics, 13(15), 2396. https://doi.org/10.3390/math13152396

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Research on a Rapid Three-Dimensional Compressor Flow Field Prediction Method Integrating U-Net and Physics-Informed Neural Networks

Abstract

1. Introduction

2. Materials and Methods

2.1. Research Objectives

2.2. Data Preprocessing

2.3. PINN-AeroFlow-U Network Architecture

2.4. Dataset Preparation

3. Model Training and Results Discussion

3.1. Model Training Strategy

3.1.1. AeroFlow-U Loss Function

3.1.2. PINN-AeroFlow-U Loss Function

3.1.3. Computational Platform Configuration

3.1.4. Model Convergence

4. Results

4.1. Analysis

4.2. AeroFlow-U vs. PINN-AeroFlow-U

4.3. Physical Interpretation of Flow Features

4.4. Additional Generalization Assessment

4.5. Performance Comparison: With vs. Without PINN

4.6. Limitations and Future Work

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI