Parallel Direct Solution of Flexible Multibody Systems Based on Block Gaussian Elimination

Cheng Yang; Bin Xia; Yuexin Wan; Pin Yang; Yifan Xie; Zhifeng Xie

doi:10.3390/app15084541


¹ School of Mechanical Engineering, Chengdu University, Chengdu 610106, China

² National Key Laboratory of Plasma Physics, Laser Fusion Research Center, Chinese Academy of Engineering Physics, Mianyang 621900, China

³ School of Aerospace Engineering, Tsinghua University, Beijing 100084, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(8), 4541; https://doi.org/10.3390/app15084541


Abstract

This paper proposes a parallel direct solution method for flexible multibody systems based on block Gaussian elimination. The Craig–Bampton method is used to model flexible bodies within the multibody system, which reduces the size of the system equations. For time integration, an implicit stiff scheme is adopted to allow large time step sizes. When the linearized system equations are formed, global sparsity in the Jacobian matrix and similar local sparsity in its submatrices can be observed. Block Gaussian elimination is then introduced for the direct solution of these linearized equations. The scheme is parallelizable at the algorithm level, with a specific processing order for the constraint submatrices. The stability of the method is guaranteed by the symmetric positive definite diagonal matrices produced by the Craig–Bampton method. The parallel efficiency and numerical stability of the method are confirmed through numerical examples using in-house codes parallelized with OpenMP.

Keywords:

flexible multibody system; Craig–Bampton method; block Gaussian elimination; algorithm-level parallel; direct solution

1. Introduction

Analysis, design, and optimization of complex flexible multibody systems (FMBS) are often limited by computational efficiency [1]. A potential solution is to develop a modeling method that offers high precision with a reduced number of degrees of freedom (DOFs) and, subsequently, address it using parallel computation.

To reduce the DOFs of flexible parts in multibody systems, component mode synthesis (CMS) methods [2,3] are widely utilized, where the floating frame of reference (FFR) [2,4] and modal coordinates are introduced to express a flexible body. Among the CMS methods, the fixed-interface Craig–Bampton (CB) method [5,6] is especially popular. The CB method preserves boundary modes, typically resulting in higher accuracy than the free-interface Craig–Chang method [7,8,9]. Nevertheless, numerous extensions and improvements have been suggested to enhance the CB method’s precision. Notably, we consider residual flexibility one of the most successful strategies; it leads to the enhanced Craig–Bampton (ECB) method proposed by Kim [10,11,12,13].

To enhance computational efficiency, large time integration step sizes and high single-step efficiency are typically desired. Implicit methods for solving stiff problems, such as the backward difference formulas (BDFs) [14], can achieve these large step sizes. The resulting nonlinear algebraic equations [2] can then be linearized and solved iteratively [1], for instance, using the Newton iteration as shown by Yang [15].

Researchers have applied parallelization techniques to these processes to achieve higher efficiency. The assembly process, which includes the calculation of general forces, their Jacobian matrices, component modes, and stress recovery, can usually be accelerated significantly through parallelization. For instance, Park [16] exploited the modular features of the FMBS to achieve active control and real-time applications. Cao [17] computed the residual vector and Jacobian matrix in parallel using OpenMP [18] in a large diesel engine analysis. Negrut [19,20] combined CPU and GPU parallelization to compute contact forces and the contact Jacobian matrix in multibody systems with many contacts. Pařík, Kim et al. [21] exploited substructure independence to obtain a parallel approach for the ECB method. Cros [22] and Murthy [23] applied parallelization to mode generation and component stress recovery for the CB method and certain other CMS methods, respectively. Motivated by the favorable parallel efficiency of this process, domain decomposition methods have been introduced to divide large FMBS into smaller substructures for better parallelization, as suggested by Li [24], Featherstone [25], Dewes [26], and Cano [27].

After assembly, the linearized system equations must be solved. As Wasfy and Noor pointed out in their review [28], parallelizing an implicit and direct solution process is particularly challenging. This is due to the need to decompose the Jacobian matrix, which involves interdependent operations. Typically, parallel operations can only be performed within a single column, as in the LU decomposition. Although commercial parallel direct solvers such as PARDISO can be used [17], their numerical stability and efficiency for different FMBS models remain unclear.

Indeed, the Jacobian matrices generated by the CB method possess unique block sparse structures that have not been fully exploited in current calculation schemes. By leveraging these characteristics and employing the block Gaussian elimination (BGE) method [29,30], we propose an algorithm-level parallel scheme for the direct solution of the linearized system equations. In the proposed scheme, the Jacobian matrix is eliminated into a small subspace of Lagrange multipliers in a block-wise and fully parallel manner. Once the Lagrange multipliers are resolved, the back substitution can also be executed fully in parallel. Moreover, by considering the matrix bandwidth, the block matrix sparsity can be retained with a specific arrangement of the block order. The numerical stability is guaranteed by the BGE stability criterion.

This paper is arranged as follows. Section 2 details the formulation of multibody systems, with flexible components modeled by the Craig–Bampton method. The algebraic-differential governing equations, the implicit integration scheme, equation linearization and assembly, DOFs reduction by the Craig–Bampton method, rigid mode removal and mode orthogonalization, and the flexible body formulation under the floating frame of reference are presented in sequence. In Section 3, we propose a parallel direct solution scheme based on block Gaussian elimination. Coordinate reordering is applied to generate a small bandwidth within the Jacobian matrix and its submatrices, and the feasibility of the sub-block Jacobian matrix is studied. The block and sparse features of the presented Jacobian matrix are investigated in detail, and we then introduce the block Gaussian elimination method to solve the linear equations with algorithm-level parallelism. Section 4 verifies the effectiveness and efficiency of the algorithm through numerical examples, which include the vibration of an aeroengine stator system and the rotation of a crank-slider mechanism.

2. Flexible Multibody Formulation with the Craig–Bampton Method

This section details the flexible multibody formulation with the Craig–Bampton method. For the system governing equations, the Lagrange equations of the first kind are adopted for convenience of code implementation. The resulting algebraic-differential equations are time-discretized using an implicit integration method, specifically the backward difference formulas, to accommodate large time step sizes. The flexible body formulation using the Craig–Bampton method is then detailed, encompassing the DOFs reduction process and the generation of the mass and stiffness matrices. A feature study of these matrices, given in a subsequent section, is the prerequisite for the presented algorithm-level parallel scheme.

2.1. Time Integration Procedures of Flexible Multibody Systems

2.1.1. System Governing Equations

The Lagrange equations of the first kind are utilized to express the governing equations of the flexible multibody system [2] as

\begin{cases} \frac{d}{d t} (\frac{\partial L (ξ, \dot{ξ})}{\partial \dot{ξ}}) - \frac{\partial L (ξ, \dot{ξ})}{\partial ξ} + {(\frac{\partial c}{\partial ξ})}^{T} λ = τ, \\ c (ξ) = 0, \end{cases}

(1)

wherein

ξ

is the general displacement of the bodies,

L = T - V

denotes the Lagrange function with the system’s kinetic energy

T

and potential energy

V

,

τ

is the general force vector of the external loads,

c

is a set of complete constraint equations that are solely dependent on the general displacement

ξ

, and

λ

corresponds to the Lagrange multipliers of the constraints.

The constraints between bodies are explicitly expressed by the algebraic equations

c (ξ) = 0

in Equation (1). This facilitates convenient computer coding. However, the variable

λ

has the same magnitude as the constraint forces and often introduces numerical problems due to its large difference in magnitude compared to the general displacement variable

ξ

. This issue can be effectively resolved by multiplying the constraint equations

c (ξ) = 0

by a scale factor

a

to balance the magnitudes between

ξ

and

λ

, as follows:

\begin{cases} \frac{d}{d t} (\frac{\partial L (ξ, \dot{ξ})}{\partial \dot{ξ}}) - \frac{\partial L (ξ, \dot{ξ})}{\partial ξ} + {(a \cdot \frac{\partial c}{\partial ξ})}^{T} λ = τ, \\ a \cdot c (ξ) = 0 . \end{cases}

(2)

A method for calculating the scale factor

a

has been detailed in the authors’ previous work [31], and readers are encouraged to consult it for further information. Since Equation (2) does not differ fundamentally from Equation (1), we still use Equation (1) instead of Equation (2) by taking

c

as

a \cdot c

in Equation (1).
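The effect of the scale factor can be illustrated on a small linear saddle-point system. The following is only a sketch under stated assumptions: the matrices, the value chosen for `a`, and the helper name `solve_saddle` are hypothetical, and the actual rule for computing the scale factor is the one given in [31]. The example shows that scaling the constraint rows leaves the displacement increment unchanged, rescales the multiplier by 1/a, and can lower the condition number.

```python
import numpy as np

def solve_saddle(A, G, r_xi, r_c, a=1.0):
    """Solve [[A, (a G)^T], [a G, 0]] [dxi; lam] = [r_xi; a r_c].

    A plays the role of the body block, G of the constraint Jacobian,
    and a of the constraint scale factor (a = 1 means no scaling).
    """
    n, k = A.shape[0], G.shape[0]
    J = np.block([[A, a * G.T], [a * G, np.zeros((k, k))]])
    rhs = np.concatenate([r_xi, a * r_c])
    sol = np.linalg.solve(J, rhs)
    return sol[:n], sol[n:], np.linalg.cond(J)

# Hypothetical data: A mimics M / h^2 for unit mass and h = 0.01.
A = 1.0e4 * np.eye(2)
G = np.array([[1.0, 1.0]])
r_xi = np.array([1.0, 2.0])
r_c = np.array([0.5])

dxi_plain, lam_plain, cond_plain = solve_saddle(A, G, r_xi, r_c)
dxi_scaled, lam_scaled, cond_scaled = solve_saddle(A, G, r_xi, r_c, a=1.0e4)
```

In this toy case the scaled multiplier satisfies `lam_plain = a * lam_scaled`, so the physical constraint force can be recovered after the solve.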

In Equation (1), the terms involving the Lagrange function are in fact the general inertial forces

F_{iner}

and general elastic forces

F_{elas}

, and they are the internal forces within the bodies. By separating them from the external forces

τ

and constraint forces

c_{ξ}^{T} λ

, the system governing equation can be rewritten as

\begin{cases} F_{iner} (ξ, \dot{ξ}, \ddot{ξ}) + F_{elas} (ξ) = τ (ξ, \dot{ξ}) - c_{ξ}^{T} (ξ) λ, \\ c (ξ) = 0, \end{cases}

(3)

wherein

c_{ξ}^{T}

is the abbreviation of

{(\partial c / \partial ξ)}^{T}

.

2.1.2. Time Integration Scheme

To solve the algebraic-differential Equation (3), we need to apply a time integration to convert the general velocity

\dot{ξ}

and acceleration

\ddot{ξ}

into displacement

ξ

at the time integration steps. Since multibody problems are typically stiff, meaning that the system responses contain both low and high frequencies simultaneously, a stiff integrator, namely the backward difference formulas (BDFs), is utilized in this paper.

For simplicity, we use a constant step size

h

and the first-order BDFs to demonstrate the integration process. To avoid confusion, we also employ a right superscript within parentheses

□^{(n)}

to denote variable values at the discrete time

t = n h

, such as

ξ^{(n)} ≜ {ξ|}_{t = n h}

and

λ^{(n)} ≜ {λ|}_{t = n h}

. Assuming the previous

n - 1

time steps are known and the

n^{th}

time step needs to be computed, the general velocity and acceleration can be expressed in terms of displacement as

\begin{cases} {\dot{ξ}}^{(n)} = \frac{ξ^{(n)} - ξ^{(n - 1)}}{h} ≜ f_{v} (ξ^{(n)}), \\ {\ddot{ξ}}^{(n)} = \frac{{\dot{ξ}}^{(n)} - {\dot{ξ}}^{(n - 1)}}{h} = \frac{ξ^{(n)} - 2 ξ^{(n - 1)} + ξ^{(n - 2)}}{h^{2}} ≜ f_{a} (ξ^{(n)}), \end{cases}

(4)

wherein

f_{v}

represents the velocity’s algebraic function and

f_{a}

represents the acceleration’s algebraic function.

Readers may notice that Formula (4) is not applicable when

n = 1

, where

ξ^{(- 1)}

is required to form

{\ddot{ξ}}^{(1)}

. A possible solution is to estimate

ξ^{(- 1)}

using the Taylor series as

ξ^{(- 1)} = ξ^{(0)} - {\dot{ξ}}^{(0)} h + {\ddot{ξ}}^{(0)} \frac{h^{2}}{2} + o (h^{2}),

(5)

wherein

ξ^{(0)}

and

{\dot{ξ}}^{(0)}

are given by the initial conditions and

{\ddot{ξ}}^{(0)}

can be calculated by solving the system Equation (3) at time

t = 0

.
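The first-order formulas (4) and the start-up estimate (5) can be sketched in a few lines. The function names below are hypothetical, and the o(h²) remainder of Equation (5) is simply dropped. The check uses ξ(t) = t², for which the Taylor estimate of ξ^{(-1)} is exact and the discrete acceleration at the first step equals 2.

```python
import numpy as np

def bdf1_velocity(xi_n, xi_nm1, h):
    """f_v of Eq. (4): first-order backward difference of the displacement."""
    return (xi_n - xi_nm1) / h

def bdf1_acceleration(xi_n, xi_nm1, xi_nm2, h):
    """f_a of Eq. (4): backward difference applied twice."""
    return (xi_n - 2.0 * xi_nm1 + xi_nm2) / h**2

def ghost_displacement(xi0, xi0_dot, xi0_ddot, h):
    """Taylor estimate of xi^(-1), Eq. (5), without the o(h^2) remainder."""
    return xi0 - xi0_dot * h + 0.5 * xi0_ddot * h**2

# Check on xi(t) = t^2: xi(0) = 0, xi_dot(0) = 0, xi_ddot(0) = 2.
h = 0.1
xi_0 = np.array([0.0])
xi_1 = np.array([h**2])
xi_m1 = ghost_displacement(xi_0, np.array([0.0]), np.array([2.0]), h)

vel_1 = bdf1_velocity(xi_1, xi_0, h)
acc_1 = bdf1_acceleration(xi_1, xi_0, xi_m1, h)
```

The velocity `vel_1` differs from the exact value 2h by O(h), which reflects the first-order accuracy of the scheme.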

By substituting Equation (4) into Equation (3), nonlinear algebraic equations at the time step

t = n h

with unknown variables

ξ^{(n)}

and

λ^{(n)}

are obtained as

\begin{cases} F_{iner} (ξ^{(n)}, f_{v} (ξ^{(n)}), f_{a} (ξ^{(n)})) + F_{elas} (ξ^{(n)}) = τ (ξ^{(n)}, f_{v} (ξ^{(n)})) - c_{ξ}^{T} (ξ^{(n)}) λ^{(n)}, \\ c (ξ^{(n)}) = 0, \end{cases}

(6)

wherein the nonlinearity may arise from the expression of

F_{iner}

,

F_{elas}

,

τ

, and

c

.

2.1.3. Equation Linearization and Assembly

The nonlinear algebraic Equation (6) can then be linearized and solved iteratively, for instance, using the Newton iteration, as shown in our previous work [15].

To start the Newton iteration, a sufficiently close initial guess is needed for

ξ^{(n)}

. With the first-order constant step BDFs, the initial guess of the differential variable

ξ^{(n)}

is simply given by the same order extrapolation formula as

ξ^{(n, 0)} = 2 ξ^{(n - 1)} - ξ^{(n - 2)},

(7)

wherein the two numbers in the right superscript, enclosed in parentheses

□^{(n, j)}

, represent the results of the

j^{th}

iteration for the

n^{th}

time step. Consequently,

ξ^{(n, 0)}

represents the initial value in the iteration for the

n^{th}

time step.

The Lagrange multipliers are not differentiable with respect to time, and the initial guess is set to zero as

λ^{(n, 0)} = 0 .

(8)

We rewrite the nonlinear Equation (6) on one side to obtain their residual form:

\begin{cases} b (ξ^{(n)}) ≜ F_{iner} (ξ^{(n)}, f_{v} (ξ^{(n)}), f_{a} (ξ^{(n)})) + F_{elas} (ξ^{(n)}) - τ (ξ^{(n)}, f_{v} (ξ^{(n)})) + c_{ξ}^{T} (ξ^{(n)}) λ^{(n)} = 0, \\ c (ξ^{(n)}) = 0, \end{cases}

(9)

and the Newton iteration is given by

[\begin{matrix} \frac{\partial b}{\partial ξ^{(n)}} & {(\frac{\partial c}{\partial ξ^{(n)}})}^{T} \\ \frac{\partial c}{\partial ξ^{(n)}} & 0 \end{matrix}] [\begin{matrix} Δ ξ^{(n, j)} \\ Δ λ^{(n, j)} \end{matrix}] = - [\begin{matrix} b (ξ^{(n, j)}) \\ c (ξ^{(n, j)}) \end{matrix}],

(10)

wherein

\frac{\partial b}{\partial ξ^{(n)}} = \frac{\partial F_{iner}}{\partial ξ} + \frac{\partial F_{iner}}{\partial \dot{ξ}} \frac{\partial f_{v}}{\partial ξ^{(n)}} + \frac{\partial F_{iner}}{\partial \ddot{ξ}} \frac{\partial f_{a}}{\partial ξ^{(n)}} + \frac{\partial F_{elas}}{\partial ξ} - \frac{\partial τ}{\partial ξ} - \frac{\partial τ}{\partial \dot{ξ}} \frac{\partial f_{v}}{\partial ξ^{(n)}} + \frac{\partial (c_{ξ}^{T} λ^{(n)})}{\partial ξ} .

(11)

Typically, the second-order differential term

\frac{\partial (c_{ξ}^{T} λ^{(n)})}{\partial ξ}

is omitted in Formula (11) to ensure the stability of the Newton iteration because of the quick variation in the variable

λ^{(n)}

. Utilizing the time discrete Formula (4), we have

\frac{\partial f_{v}}{\partial ξ^{(n)}} = \frac{1}{h}, \frac{\partial f_{a}}{\partial ξ^{(n)}} = \frac{1}{h^{2}},

(12)

and denote the linearized general mass, damping, and stiffness matrices of the bodies as

M ≜ \frac{\partial F_{iner}}{\partial \ddot{ξ}}, C ≜ \frac{\partial F_{iner}}{\partial \dot{ξ}}, K ≜ \frac{\partial F_{elas}}{\partial ξ} + \frac{\partial F_{iner}}{\partial ξ} .

(13)

The Newton iteration (10) for the nonlinear Equation (9) is rewritten as

{[\begin{matrix} \frac{M}{h^{2}} + \frac{C}{h} + K - (\frac{\partial τ}{\partial ξ} + \frac{1}{h} \cdot \frac{\partial τ}{\partial \dot{ξ}}) & {(\frac{\partial c}{\partial ξ})}^{T} \\ \frac{\partial c}{\partial ξ} & 0 \end{matrix}]}^{(n, j)} [\begin{matrix} Δ ξ^{(n, j)} \\ Δ λ^{(n, j)} \end{matrix}] = - {[\begin{matrix} b \\ c \end{matrix}]}^{(n, j)},

(14)

wherein the Jacobian matrix and the right-hand terms are calculated at variables

ξ^{(n, j)}

and

λ^{(n, j)}

. The terms related to the external forces

τ

are enclosed in parentheses.

After

Δ ξ^{(n, j)}

and

Δ λ^{(n, j)}

are solved by a linear solver, the iteration proceeds from

j

to

j + 1

as

ξ^{(n, j + 1)} = ξ^{(n, j)} + Δ ξ^{(n, j)}, λ^{(n, j + 1)} = λ^{(n, j)} + Δ λ^{(n, j)},

(15)

until the following convergence errors are satisfied:

\| Δ ξ^{(n, j)} \| \leq e_{ξ}, \| Δ λ^{(n, j)} \| \leq e_{λ}, \| b^{(n, j)} \| \leq e_{b}, \| c^{(n, j)} \| \leq e_{c},

(16)

wherein

\| □ \|

represents an arbitrary given norm and

e_{ξ}

,

e_{λ}

,

e_{b}

, and

e_{c}

are the corresponding error thresholds for displacement variables, constraint forces, body equation residuals, and constraint equation residuals, respectively.
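The predictor (7), the zero multiplier guess (8), the linearized system (14), the update (15), and the stopping test (16) can be put together for a very small example. The sketch below is an illustration only, not the authors' implementation: it assumes a planar point-mass pendulum (a hypothetical test case with C = 0, K = 0, and a constant gravity load), a single tolerance `tol` for all four thresholds, and the Euclidean norm. The second-order term of Formula (11) is omitted in the Jacobian, as described above.

```python
import numpy as np

def bdf1_newton_step(xi_nm1, xi_nm2, h, m=1.0, g=9.81, length=1.0,
                     tol=1e-9, max_iter=50):
    """One BDF1 time step of a point-mass pendulum solved by Newton iteration.

    Constraint: c(xi) = 0.5 * (xi . xi - length^2) = 0, hence c_xi = xi^T.
    Returns the new position, the multiplier, and the iteration count.
    """
    tau = np.array([0.0, -m * g])            # external (gravity) force
    xi = 2.0 * xi_nm1 - xi_nm2               # predictor, Eq. (7)
    lam = 0.0                                # multiplier guess, Eq. (8)
    for j in range(max_iter):
        acc = (xi - 2.0 * xi_nm1 + xi_nm2) / h**2     # f_a of Eq. (4)
        c_xi = xi.reshape(1, 2)                       # constraint Jacobian
        b = m * acc - tau + c_xi.ravel() * lam        # body residual, Eq. (9)
        c = np.array([0.5 * (xi @ xi - length**2)])   # constraint residual
        # Jacobian of Eq. (14) with C = K = 0 and constant tau;
        # the term d(c_xi^T lam)/d(xi) is deliberately left out.
        J = np.block([[m / h**2 * np.eye(2), c_xi.T],
                      [c_xi, np.zeros((1, 1))]])
        delta = np.linalg.solve(J, -np.concatenate([b, c]))
        xi = xi + delta[:2]                           # update, Eq. (15)
        lam = lam + delta[2]
        # Stopping test in the spirit of Eq. (16), one tolerance for all terms.
        if (np.linalg.norm(delta[:2]) <= tol and abs(delta[2]) <= tol
                and np.linalg.norm(b) <= tol and abs(c[0]) <= tol):
            return xi, lam, j + 1
    raise RuntimeError("Newton iteration did not converge")

# Pendulum released from rest in the horizontal position.
start = np.array([1.0, 0.0])
xi_1, lam_1, n_iter = bdf1_newton_step(start, start, h=0.01)
```

Because the omitted term is small compared with M/h² for this step size, the simplified Jacobian still converges in a handful of iterations here; for other problems the iteration count may differ.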

Let the system consist of

m

bodies and

k

constraints. Then, the system vector of body DOFs is

ξ = [ξ_{1}; \dots; ξ_{m}]

, and the system vector of constraint multipliers is

λ = [λ_{1}; \dots; λ_{k}]

. To assemble Equation (14), we need to calculate

K = diag (K_{1}, \dots, K_{m}), C = diag (C_{1}, \dots, C_{m}), M = diag (M_{1}, \dots, M_{m}), τ = [τ_{1}; \dots; τ_{m}],

(17)

and

c = [c_{1}; \dots; c_{k}], \frac{\partial c}{\partial ξ} = [\frac{\partial c_{1}}{\partial ξ}; \dots; \frac{\partial c_{k}}{\partial ξ}]

(18)

wherein the subscript indicates which body or constraint the variable or matrix belongs to, while the semicolon “;” separates the components of a “column” vector. Additionally,

diag (\dots)

signifies that the matrix is block-diagonally composed of submatrices.

Since the flexible bodies are modeled using the Craig–Bampton method in this paper, we will elaborate on how the general mass matrix

M_{i}

and stiffness matrix

K_{i}

(i = 1, \dots, m)

are computed in the following subsection. The damping matrix

C_{i}

may be omitted in an undamped case or be considered as a linear combination of

M_{i}

and

K_{i}

under a Rayleigh damping hypothesis.
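The block structure of Equations (14), (17), and (18) can be sketched as follows. The helper names `block_diag` and `assemble_jacobian`, the Rayleigh coefficients, and the two small bodies are hypothetical; the derivatives of the external force τ are neglected here for brevity, and symmetric body matrices are used although K_i need not be symmetric in general.

```python
import numpy as np

def block_diag(blocks):
    """Place square blocks on the diagonal of a zero matrix."""
    n = sum(b.shape[0] for b in blocks)
    out = np.zeros((n, n))
    offset = 0
    for b in blocks:
        k = b.shape[0]
        out[offset:offset + k, offset:offset + k] = b
        offset += k
    return out

def assemble_jacobian(M_blocks, K_blocks, c_xi, h, rayleigh=(0.0, 0.0)):
    """Assemble the Jacobian of Eq. (14) without the tau-derivative terms.

    rayleigh = (mu, beta) gives C_i = mu * M_i + beta * K_i for every body.
    """
    mu, beta = rayleigh
    body_blocks = []
    for M_i, K_i in zip(M_blocks, K_blocks):
        C_i = mu * M_i + beta * K_i
        body_blocks.append(M_i / h**2 + C_i / h + K_i)
    A = block_diag(body_blocks)
    k = c_xi.shape[0]
    return np.block([[A, c_xi.T], [c_xi, np.zeros((k, k))]])

# Two hypothetical bodies with two DOFs each and one coupling constraint.
M_blocks = [np.eye(2), 2.0 * np.eye(2)]
K_blocks = [np.array([[4.0, -1.0], [-1.0, 3.0]]),
            np.array([[5.0, 0.0], [0.0, 6.0]])]
c_xi = np.array([[1.0, 0.0, -1.0, 0.0]])
J = assemble_jacobian(M_blocks, K_blocks, c_xi, h=0.01, rayleigh=(0.1, 0.001))
```

The bodies are coupled only through the constraint rows and columns, which is the sparsity pattern the later elimination scheme relies on.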

2.2. Flexible Body Formulation with the Craig–Bampton Method

2.2.1. DOFs Reduction by the Craig–Bampton Method

To reduce the degrees of freedom (DOFs) of flexible parts in multibody systems, the component mode synthesis (CMS) [2,3] methods are widely utilized. In all CMS methods, the first step is to calculate the preserved modes of the flexible parts under specified boundary conditions. For the Craig–Bampton (CB) method [5,6], all interfaces for joint connections and force applications are fixed when calculating the inner flexible modes. Hence, the CB method is also referred to as a fixed-interface method. Only a few inner modes are preserved to reduce the DOFs of a flexible part. To express the connections and forces acting on the flexible body, static response modes for each boundary DOF are also calculated and preserved, which are known as the boundary modes. The boundary modes are indeed static equilibrium configurations of the flexible body when a single boundary DOF moves a unit displacement in its own direction.

Before performing the modes calculation of the flexible body

i

, we first separate the inner DOFs

u_{i}^{Λ}

and boundary DOFs

u_{i}^{Γ}

within the finite element (FE) formulation. Superscripts

□^{Λ}

and

□^{Γ}

denote the inner DOFs and boundary DOFs, respectively. Set

u_{i} = [u_{i}^{Γ}; u_{i}^{Λ}]

and rearrange the FE equations to obtain

\underbrace{[\begin{matrix} {\bar{K}}_{i}^{Γ Γ} & {\bar{K}}_{i}^{Γ Λ} \\ {\bar{K}}_{i}^{Λ Γ} & {\bar{K}}_{i}^{Λ Λ} \end{matrix}]}_{{\bar{K}}_{i}} [\begin{matrix} u_{i}^{Γ} \\ u_{i}^{Λ} \end{matrix}] + \underbrace{[\begin{matrix} {\bar{M}}_{i}^{Γ Γ} & {\bar{M}}_{i}^{Γ Λ} \\ {\bar{M}}_{i}^{Λ Γ} & {\bar{M}}_{i}^{Λ Λ} \end{matrix}]}_{{\bar{M}}_{i}} [\begin{matrix} {\ddot{u}}_{i}^{Γ} \\ {\ddot{u}}_{i}^{Λ} \end{matrix}] = [\begin{matrix} {\bar{f}}_{i} \\ 0 \end{matrix}],

(19)

wherein

{\bar{K}}_{i}

represents the stiffness matrix and

{\bar{M}}_{i}

represents the mass matrix.

{\bar{f}}_{i}

denotes the external forces applied to the boundary DOFs.

To obtain the inner modes, we fix the boundary DOFs and set

u_{i}^{Γ} = {\ddot{u}}_{i}^{Γ} = {\bar{f}}_{i} = 0

in Equation (19) to obtain

{\bar{K}}_{i}^{Λ Λ} u_{i}^{Λ} + {\bar{M}}_{i}^{Λ Λ} {\ddot{u}}_{i}^{Λ} = 0 .

(20)

Then, the vibration Equation (20) is solved, and the first

N_{i}

low-frequency modes

{\bar{φ}}_{i, j}^{Λ} (j = 1, \dots, N_{i})

are preserved for coordinates

u_{i}^{Λ}

. These modes are extended to the full DOFs

u_{i}

, and we have the vibration modes matrix as follows:

{\bar{Φ}}_{i}^{vibr} = [\begin{matrix} 0 & \dots & 0 \\ {\bar{φ}}_{i, 1}^{Λ} & \dots & {\bar{φ}}_{i, N_{i}}^{Λ} \end{matrix}] .

(21)

Assuming the number of boundary DOFs for flexible body

i

is

M_{i}

, a total of

M_{i}

boundary modes need to be calculated. To obtain the

j^{th}

boundary mode, we ignore the inertial force term and the external forces and set

u_{i}^{Γ} = [0; \dots; 0; 1; 0; \dots; 0]

with the only 1 at the

j^{th}

coordinate in Equation (19) to obtain

[\begin{matrix} {\bar{k}}_{i, j}^{Γ Γ} \\ {\bar{k}}_{i, j}^{Λ Γ} \end{matrix}] + [\begin{matrix} {\bar{K}}_{i}^{Γ Λ} \\ {\bar{K}}_{i}^{Λ Λ} \end{matrix}] u_{i}^{Λ} = [\begin{matrix} 0 \\ 0 \end{matrix}], j = 1, \dots, M_{i} .

(22)

wherein

{\bar{k}}_{i, j}^{Γ Γ}

and

{\bar{k}}_{i, j}^{Λ Γ}

are the

j^{th}

column of

{\bar{K}}_{i}^{Γ Γ}

and

{\bar{K}}_{i}^{Λ Γ}

, respectively. By solving the static equilibrium Equation (22), we obtain the static equilibrium

{\bar{ψ}}_{i, j}^{Λ} (j = 1, \dots, M_{i})

for the coordinates

u_{i}^{Λ}

. Extending these results to the full DOFs

u_{i}

in conjunction with the hypothesis

u_{i}^{Γ} = [0; \dots; 0; 1; 0; \dots; 0]

, we arrive at the boundary modes matrix as follows:

{\bar{Φ}}_{i}^{boun} = [\begin{matrix} I \\ \begin{matrix} {\bar{ψ}}_{i, 1}^{Λ} & \dots & {\bar{ψ}}_{i, M_{i}}^{Λ} \end{matrix} \end{matrix}] .

(23)

With the inner vibration modes matrix

{\bar{Φ}}_{i}^{vibr}

and the boundary modes matrix

{\bar{Φ}}_{i}^{boun}

, a total of

M_{i} + N_{i}

modes are preserved for reduction in DOFs in the flexible body

i

. The preserved modes matrix is

{\bar{Φ}}_{i} = [\begin{matrix} {\bar{Φ}}_{i}^{vibr} & {\bar{Φ}}_{i}^{boun} \end{matrix}] .

(24)
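The construction of the preserved modes in Equations (20) to (24) can be sketched with dense linear algebra. This is a minimal illustration under stated assumptions: the function names are hypothetical, the test structure is a free-free chain of five unit masses joined by unit springs, and the generalized eigenproblem is reduced to a standard one by a Cholesky factorization of the mass matrix rather than by a dedicated FE eigensolver.

```python
import numpy as np

def generalized_eigh(K, M):
    """Solve K phi = w2 M phi for symmetric K and SPD M (Cholesky reduction).

    Returns eigenvalues in ascending order and mass-normalized modes.
    """
    L = np.linalg.cholesky(M)
    L_inv = np.linalg.inv(L)
    w2, Y = np.linalg.eigh(L_inv @ K @ L_inv.T)
    return w2, L_inv.T @ Y

def craig_bampton_basis(K, M, boundary, n_modes):
    """Preserved modes of Eq. (24); rows ordered as u = [u_boundary; u_inner]."""
    n = K.shape[0]
    inner = [d for d in range(n) if d not in boundary]
    K_ii = K[np.ix_(inner, inner)]
    K_ib = K[np.ix_(inner, boundary)]
    M_ii = M[np.ix_(inner, inner)]
    # Fixed-interface vibration modes, Eqs. (20)-(21).
    _, phi = generalized_eigh(K_ii, M_ii)
    phi = phi[:, :n_modes]
    # Boundary (static) modes from the inner rows of Eq. (22).
    psi = np.linalg.solve(K_ii, -K_ib)
    vibr = np.vstack([np.zeros((len(boundary), n_modes)), phi])
    boun = np.vstack([np.eye(len(boundary)), psi])     # Eq. (23)
    return np.hstack([vibr, boun])

# Free-free chain: 5 unit masses, unit springs, end nodes as boundary DOFs.
n = 5
K = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
K[0, 0] = K[-1, -1] = 1.0
M = np.eye(n)
Phi_bar = craig_bampton_basis(K, M, boundary=[0, 4], n_modes=2)
```

For this chain the boundary modes are linear interpolations between the two end nodes, which matches the interpretation of a boundary mode as a static equilibrium shape under a unit boundary displacement.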

2.2.2. Rigid Modes Removing and Modes Orthogonalization

However, there are two defects in the modes set (24). Firstly, rigid modes with zero frequency exist in the subspace generated by the boundary modes

{\bar{Φ}}_{i}^{boun}

; and secondly, the columns in

{\bar{Φ}}_{i}^{boun}

are not orthogonal to those in

{\bar{Φ}}_{i}^{vibr}

. Rigid modes are redundant because they have been accounted for by the floating frame of reference (FFR) coordinates, while modal orthogonality is essential for producing diagonal modal matrices of stiffness and mass.

To address the two issues, reorthogonalization can be applied. First, we project Equation (19) onto the subspace of

{\bar{Φ}}_{i}

by letting

{\bar{f}}_{i} = 0

, substituting

u_{i} = {\bar{Φ}}_{i} {\bar{q}}_{i}

, and multiplying

{\bar{Φ}}_{i}^{T}

on the left side, resulting in

({\bar{Φ}}_{i}^{T} {\bar{K}}_{i} {\bar{Φ}}_{i}) {\bar{q}}_{i} + ({\bar{Φ}}_{i}^{T} {\bar{M}}_{i} {\bar{Φ}}_{i}) {\ddot{\bar{q}}}_{i} = 0,

(25)

wherein

{\bar{q}}_{i}

represents the redundant modal coordinates corresponding to modes

{\bar{Φ}}_{i}

. Secondly, we conduct a mode analysis on the projected vibration Equation (25) to derive the orthogonal modes [32]. By discarding zero-frequency modes and arranging the frequencies in descending order, the reorthogonalized non-zero-frequency modal matrix

{\bar{\bar{Φ}}}_{i}

can fulfill the following unit orthogonal condition corresponding to the mass matrix:

{\bar{\bar{Φ}}}_{i}^{T} {\bar{Φ}}_{i}^{T} {\bar{M}}_{i} {\bar{Φ}}_{i} {\bar{\bar{Φ}}}_{i} = E_{n_{i}}, {\bar{\bar{Φ}}}_{i}^{T} {\bar{Φ}}_{i}^{T} {\bar{K}}_{i} {\bar{Φ}}_{i} {\bar{\bar{Φ}}}_{i} = Ω_{i}^{2},

(26)

wherein

E_{n_{i}}

represents the unit matrix with dimension

n_{i} = M_{i} + N_{i} - 6

and

Ω_{i}^{2} = diag (ω_{i, 1}^{2}, \dots, ω_{i, n_{i}}^{2})

is the diagonal matrix with non-zero modal circular frequencies

ω_{i, 1} \geq \dots \geq ω_{i, n_{i}} > 0

in descending order.

Then, the orthogonal non-zero-frequency elastic modes matrix of the CB method is

Φ_{i} = {\bar{Φ}}_{i} {\bar{\bar{Φ}}}_{i} ≜ [\begin{matrix} φ_{i, 1} & \dots & φ_{i, n_{i}} \end{matrix}] .

(27)

The FE displacement is then expressed by the modal coordinates

q_{i} = [q_{i, 1}; q_{i, 2}; \dots; q_{i, n_{i}}]

as

u_{i} = Φ_{i} q_{i},

(28)

and the elastic vibration equations of flexible body

i

are diagonal:

Ω_{i}^{2} q_{i} + {\ddot{q}}_{i} = f_{i},

(29)

wherein

f_{i} = Φ_{i}^{T} {\bar{f}}_{i}

is the modal force of the external forces acting on the flexible body

i

.
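The reorthogonalization of Equations (25) to (27) can be sketched as a small generalized eigenproblem on the projected matrices. The names `generalized_eigh` and `reorthogonalize`, the relative threshold used to detect zero-frequency modes, and the five-mass free-free chain are assumptions made for illustration. Note that this one-dimensional chain has a single rigid mode, whereas a free spatial body has six.

```python
import numpy as np

def generalized_eigh(K, M):
    """Solve K phi = w2 M phi for symmetric K and SPD M (Cholesky reduction)."""
    L = np.linalg.cholesky(M)
    L_inv = np.linalg.inv(L)
    w2, Y = np.linalg.eigh(L_inv @ K @ L_inv.T)
    return w2, L_inv.T @ Y          # ascending w2, mass-normalized vectors

def reorthogonalize(K, M, Phi_bar, rel_tol=1e-8):
    """Project onto Phi_bar, drop zero-frequency modes, sort descending."""
    K_red = Phi_bar.T @ K @ Phi_bar            # projected stiffness, Eq. (25)
    M_red = Phi_bar.T @ M @ Phi_bar            # projected mass, Eq. (25)
    w2, Q = generalized_eigh(K_red, M_red)
    keep = w2 > rel_tol * w2.max()             # discard rigid (zero) modes
    w2, Q = w2[keep], Q[:, keep]
    order = np.argsort(w2)[::-1]               # descending, as in Eq. (26)
    return Phi_bar @ Q[:, order], w2[order]    # Phi_i of Eq. (27), Omega^2

# Free-free chain of 5 unit masses; preserved modes built in natural DOF order.
n, inner, boundary = 5, [1, 2, 3], [0, 4]
K = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
K[0, 0] = K[-1, -1] = 1.0
M = np.eye(n)
K_ii = K[np.ix_(inner, inner)]
Phi_bar = np.zeros((n, 4))
_, phi = generalized_eigh(K_ii, M[np.ix_(inner, inner)])
Phi_bar[inner, :2] = phi[:, :2]                                  # vibration modes
Phi_bar[boundary, 2:] = np.eye(2)                                # boundary rows
Phi_bar[inner, 2:] = np.linalg.solve(K_ii, -K[np.ix_(inner, boundary)])

Phi, omega2 = reorthogonalize(K, M, Phi_bar)
```

After this step the modal mass matrix is the identity and the modal stiffness matrix is diagonal, which is what makes the elastic block of Equation (29) trivial to invert.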

2.2.3. Flexible Body Formulation Under Floating Frame of Reference

The orthogonal modes

Φ_{i}

and the elastic coordinates

q_{i}

describe the elastic vibration of the flexible body

i

, while the floating frame of reference (FFR) [2] can describe its overall spatial displacement and finite rotation.

Figure 1 shows the coordinates of flexible body

i

in the FFR.

G

represents the global coordinate frame, and

B_{i}

presents the body coordinate frame (also the FFR) of flexible body

i

.

x_{i} = [x_{i}; y_{i}; z_{i}]

represents the position (column) vector from

G

to

B_{i}

expressed in

G

, and

α_{i} = [α_{i}; β_{i}; γ_{i}]

represents the rotation parameters from

G

to

B_{i}

.

s_{i, p}

represents the original position of a node

p

in

B_{i}

, and

u_{i, p}^{t}

is the translational displacement of node

p

due to flexible deformation.

Figure 1. Coordinates of flexible body

i

in the floating frame of reference.

Let

A_{i} = A_{i} (α_{i})

be the transformation matrix from frame

B_{i}

to

G

, and let

Φ_{i, p}

be the components of the modal matrix

Φ_{i}

at node

p

. After rigid motion and flexible deformation, the global position of node

p

on flexible body

i

is then expressed as

r_{i, p} = x_{i} + A_{i} (s_{i, p} + u_{i, p}^{t}),

(30)

wherein translational displacement

u_{i, p}^{t}

can be expressed with the modal coordinates

q_{i}

and

Φ_{i, p}^{t}

(i.e., the translational component of the modal matrix

Φ_{i, p}

) as

u_{i, p}^{t} = Φ_{i, p}^{t} q_{i} .

(31)

By differentiating the displacement (30) with respect to time, the global velocity of node

p

is obtained as

v_{i, p} = {\dot{x}}_{i} + A_{i} ({\dot{s}}_{i, p} + {\dot{u}}_{i, p}) + {\dot{A}}_{i} (s_{i, p} + u_{i, p}) .

(32)

wherein

\dot{□}

is the time derivative of

□

, i.e.,

\dot{□} = (d □) / (d t)

. Since

s_{i, p}

and

Φ_{i, p}^{t}

are constant, we have

{\dot{s}}_{i, p} = 0, {\dot{u}}_{i, p} = d (Φ_{i, p}^{t} q_{i}) / d t = Φ_{i, p}^{t} {\dot{q}}_{i} .

(33)

Since the component

{\dot{A}}_{i} (s_{i, p} + u_{i, p})

is caused by rotation of FFR

B_{i}

with the vector

(s_{i, p} + u_{i, p})

on it, we utilize the finite rotation theory to obtain

{\dot{A}}_{i} (s_{i, p} + u_{i, p}) = A_{i} [ω_{i} \times (s_{i, p} + u_{i, p})] = A_{i} [- (s_{i, p} + u_{i, p}) \times ω_{i}] = - A_{i} ({\tilde{s}}_{i, p} + {\tilde{u}}_{i, p}) ω_{i},

(34)

wherein

ω_{i}

represents the local (expressed in

B_{i}

) angular velocity of the frame

B_{i}

, “

\times

” represents the vector cross-product operation, and

{\tilde{s}}_{i, p}

projects the vector cross product into a matrix formulation with

s_{i, p} \times ω_{i} = {\tilde{s}}_{i, p} ω_{i}

by defining

{\tilde{s}}_{i, p} = \tilde{[\begin{matrix} s_{i, p, 1} \\ s_{i, p, 2} \\ s_{i, p, 3} \end{matrix}]} ≜ [\begin{matrix} 0 & - s_{i, p, 3} & s_{i, p, 2} \\ s_{i, p, 3} & 0 & - s_{i, p, 1} \\ - s_{i, p, 2} & s_{i, p, 1} & 0 \end{matrix}], {\tilde{u}}_{i, p} = \tilde{[\begin{matrix} u_{i, p, 1} \\ u_{i, p, 2} \\ u_{i, p, 3} \end{matrix}]} ≜ [\begin{matrix} 0 & - u_{i, p, 3} & u_{i, p, 2} \\ u_{i, p, 3} & 0 & - u_{i, p, 1} \\ - u_{i, p, 2} & u_{i, p, 1} & 0 \end{matrix}] .

(35)
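The cross-product-to-matrix operator of Formula (35) is short enough to check directly. The function name `skew` and the sample vectors are hypothetical; the comparison against `numpy.cross` confirms the identity s × ω = s̃ ω used in Equation (34).

```python
import numpy as np

def skew(v):
    """Return the 3x3 matrix v~ of Formula (35), so that v x w = skew(v) @ w."""
    return np.array([[0.0, -v[2], v[1]],
                     [v[2], 0.0, -v[0]],
                     [-v[1], v[0], 0.0]])

s = np.array([1.0, 2.0, 3.0])
omega = np.array([0.5, -1.0, 2.0])

cross_by_matrix = skew(s) @ omega      # s~ omega
cross_direct = np.cross(s, omega)      # s x omega
```

The matrix is antisymmetric, which is why transposes of s̃ and ũ in Formula (40) only change signs.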

As shown in reference [2], we can always find a coefficient matrix

D_{i} = D_{i} (α_{i})

for given attitude parameters

α_{i}

to obtain

ω_{i} = D_{i} {\dot{α}}_{i} .

(36)

Then, the global velocity of node

p

can be written as

v_{i, p} = {\dot{x}}_{i} + A_{i} Φ_{i, p}^{t} {\dot{q}}_{i} - A_{i} ({\tilde{s}}_{i, p} + {\tilde{u}}_{i, p}) D_{i} {\dot{α}}_{i} .

(37)
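Equation (37) can be checked numerically against a finite difference of the position (30). The sketch below makes a simplifying assumption that is not in the text: the frame rotates about the global z-axis only, so the rotation parameters reduce to one angle γ and the coefficient matrix becomes D = [0; 0; 1]. All function names and numerical values are hypothetical.

```python
import numpy as np

def skew(v):
    """Cross-product matrix of Formula (35)."""
    return np.array([[0.0, -v[2], v[1]],
                     [v[2], 0.0, -v[0]],
                     [-v[1], v[0], 0.0]])

def rot_z(gamma):
    """Transformation matrix A from the body frame to the global frame."""
    c, s = np.cos(gamma), np.sin(gamma)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def node_position(x, gamma, s, Phi_t, q):
    """Global node position, Eqs. (30)-(31)."""
    return x + rot_z(gamma) @ (s + Phi_t @ q)

def node_velocity(x_dot, gamma, gamma_dot, s, Phi_t, q, q_dot):
    """Global node velocity, Eq. (37), for rotation about z only."""
    A = rot_z(gamma)
    u = Phi_t @ q
    D = np.array([[0.0], [0.0], [1.0]])      # omega = D * gamma_dot
    alpha_dot = np.array([gamma_dot])
    return x_dot + A @ (Phi_t @ q_dot) - A @ (skew(s) + skew(u)) @ D @ alpha_dot

# Hypothetical state and rates.
x0, x_dot = np.array([0.1, -0.2, 0.3]), np.array([1.0, 0.5, -0.3])
gamma0, gamma_dot = 0.4, 1.2
s = np.array([0.5, 0.2, 0.1])
Phi_t = np.array([[0.1, 0.0], [0.0, 0.2], [0.05, -0.1]])
q0, q_dot = np.array([0.01, -0.02]), np.array([0.3, 0.1])

def position_at(t):
    """Position along a motion with constant rates, used for differencing."""
    return node_position(x0 + x_dot * t, gamma0 + gamma_dot * t,
                         s, Phi_t, q0 + q_dot * t)

eps = 1e-6
v_finite_difference = (position_at(eps) - position_at(-eps)) / (2.0 * eps)
v_formula = node_velocity(x_dot, gamma0, gamma_dot, s, Phi_t, q0, q_dot)
```

For general spatial rotations, D depends on the chosen attitude parameters, as stated for Equation (36), and would replace the constant column used here.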

Different from other references, we place the elastic coordinates

q_{i} = [q_{i, 1}; \dots; q_{i, n_{i}}]

at the forefront to assemble the generalized coordinates of the flexible body

i

as

ξ_{i} = [q_{i}; x_{i}; α_{i}] = [q_{i, 1}; \dots; q_{i, n_{i}}; x_{i}; y_{i}; z_{i}; α_{i}; β_{i}; γ_{i}] .

(38)

The mass and stiffness matrices of a flexible body can be obtained by calculating the overall kinetic and potential energies. In the finite element method, the kinetic energy is attributed to the nodes, and with the node mass

m_{i, p}

and the moment of inertia

H_{i, p}

lumped at node

p

, we have

T_{i} \approx \frac{1}{2} \sum_{p} (m_{i, p} v_{i, p}^{T} v_{i, p} + ω_{i, p}^{T} H_{i, p} ω_{i, p}) .

(39)

By substituting Equation (37) into Formula (39) and ignoring high-order quantities of the modal correlation parts, we have

\begin{array}{l} m_{i, p} v_{i, p}^{T} v_{i, p} + ω_{i, p}^{T} H_{i, p} ω_{i, p} = \\ {\dot{ξ}}_{i}^{T} [\begin{matrix} m_{i, p} {(Φ_{i, p}^{t})}^{T} Φ_{i, p}^{t} + {(Φ_{i, p}^{r})}^{T} H_{i, p} Φ_{i, p}^{r} & m_{i, p} {(Φ_{i, p}^{t})}^{T} A_{i}^{T} & m_{i, p} {(Φ_{i, p}^{t})}^{T} ({\tilde{s}}_{i, p}^{T} + {\tilde{u}}_{i, p}^{T}) D_{i} + {(Φ_{i, p}^{r})}^{T} H_{i, p} D_{i} \\ sym & m_{i, p} E_{3} & - A_{i} m_{i, p} ({\tilde{s}}_{i, p} + {\tilde{u}}_{i, p}) D_{i} \\ sym & sym & D_{i}^{T} [m_{i, p} ({\tilde{s}}_{i, p}^{T} + {\tilde{u}}_{i, p}^{T}) ({\tilde{s}}_{i, p} + {\tilde{u}}_{i, p}) + H_{i, p}] D_{i} \end{matrix}] {\dot{ξ}}_{i} \end{array}

(40)

wherein, for flexible body

i

,

m_{i, p}

is the lumped mass at node

p

,

H_{i, p}

is the symmetric lumped inertia matrix at node

p

,

Φ_{i, p}^{t}

is the translational components of the modes

Φ_{i}

at node

p

, while

Φ_{i, p}^{r}

is the rotational components of the modes

Φ_{i}

at node

p

,

φ_{i, p, j}^{t}

is the

j^{th}

column of

Φ_{i, p}^{t}

, the tilde denotes the cross-product-to-matrix operator defined in Formula (35), and

s_{i, p}

is the original position of node

p

in the FFR as shown in Figure 1.

The unit orthogonal condition corresponding to the mass matrix as defined in Equation (26) gives

\sum_{p} [m_{i, p} {(Φ_{i, p}^{t})}^{T} Φ_{i, p}^{t} + {(Φ_{i, p}^{r})}^{T} H_{i, p} Φ_{i, p}^{r}] = E_{n_{i}} .

(41)

Then, we substitute Formula (40) into the kinetic energy (39) to obtain the generalized mass matrix of the flexible body under the coordinates

ξ_{i}

as

M_{i} = [\begin{matrix} M_{i}^{ff} & M_{i}^{ft} & M_{i}^{fr} \\ M_{i}^{tf} & M_{i}^{tt} & M_{i}^{tr} \\ M_{i}^{rf} & M_{i}^{rt} & M_{i}^{rr} \end{matrix}] = [\begin{matrix} E_{n_{i}} & I_{i, 3}^{T} A_{i}^{T} & I_{i, 4}^{T} D_{i} \\ sym & I_{i, 1} E_{3} & - A_{i} \widetilde{(I_{i, 2} + I_{i, 3} q_{i})} D_{i} \\ sym & sym & D_{i}^{T} [I_{i, 7} - \sum_{j} q_{i, j} (I_{i, 8 j} + I_{i, 8 j}^{T})] D_{i} \end{matrix}],

(42)

wherein the superscripts f, t, and r refer to flexible, translational, and rotational components, respectively. The expressions and meanings of the modal invariant submatrices

I_{i, 1}

,

I_{i, 2}

,

I_{i, 3}

,

I_{i, 4}

,

I_{i, 7}

, and

I_{i, 8 j}

in (42) are listed in Table 1. The invariant submatrices are defined identically to those in the modal neutral file (MNF) [33], where

I_{i, 5}

and

I_{i, 6}

are commonly omitted.

Table 1. Expressions, dimensions, and mechanical meanings of the modal invariant submatrices.

For one-dimensional and two-dimensional elements, such as cables, beams, plates, and shells, the moment of inertia of the nodes

H_{i, p}

can be neglected.

The potential energy of elastic deformation is only a function of the elastic coordinates

q_{i}

. Using

u_{i} = Φ_{i} q_{i}

from Equation (28), the FE stiffness matrix

{\bar{K}}_{i}

associated with the FE variable

u_{i}

in Equation (19), and the orthogonality condition (26), the potential energy of elastic deformation is

V_{i} = \frac{1}{2} u_{i}^{T} {\bar{K}}_{i} u_{i} = \frac{1}{2} q_{i}^{T} Φ_{i}^{T} {\bar{K}}_{i} Φ_{i} q_{i} = \frac{1}{2} q_{i}^{T} Ω_{i}^{2} q_{i} .

(43)
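As a small numerical check of Equation (43), the following sketch uses a hypothetical 3-DOF spring-mass chain. The matrices `K_bar` and `M_bar` and the modal coordinates `q` are illustrative assumptions, not data from this paper. It mass-normalizes the modes and compares the elastic energy evaluated in FE coordinates with the energy evaluated in modal coordinates.

```python
import numpy as np

# Hypothetical 3-DOF chain fixed at one end (illustrative values only).
K_bar = np.array([[ 2.0, -1.0,  0.0],
                  [-1.0,  2.0, -1.0],
                  [ 0.0, -1.0,  1.0]])
M_bar = np.diag([1.0, 2.0, 1.0])

# Solve K_bar phi = omega^2 M_bar phi through the symmetric form
# M^{-1/2} K M^{-1/2}, which is valid here because M_bar is diagonal.
m_inv_sqrt = np.diag(1.0 / np.sqrt(np.diag(M_bar)))
omega2, Y = np.linalg.eigh(m_inv_sqrt @ K_bar @ m_inv_sqrt)
Phi = m_inv_sqrt @ Y               # mass-normalized modes: Phi^T M_bar Phi = E

q = np.array([0.3, -0.1, 0.05])    # arbitrary modal coordinates
u = Phi @ q                        # FE displacements, u = Phi q

V_fe = 0.5 * u @ K_bar @ u               # (1/2) u^T K_bar u
V_modal = 0.5 * np.sum(omega2 * q**2)    # (1/2) q^T Omega^2 q
print(V_fe, V_modal)
```

The two energies agree to rounding error because the mass-normalized modes diagonalize the stiffness matrix, which is the property that makes the elastic block of the Jacobian diagonal.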

The body’s general coordinate is

ξ_{i} = [q_{i}; x_{i}; α_{i}]

, and the system stiffness matrix given by Equation (13) is

K = \frac{\partial F_{elas}}{\partial ξ} + \frac{\partial F_{iner}}{\partial ξ}

, which includes both the elastic and inertial parts. Then, the stiffness matrix for body

i

is

K_{i} = [\begin{matrix} Ω_{i}^{2} & & \\ & 0 & \\ & & 0 \end{matrix}] + \frac{\partial (M_{i} {\ddot{ξ}}_{i})}{\partial ξ_{i}} .

(44)

3. Parallel Direct Solution Scheme Based on Block Gaussian Elimination

The most time-consuming tasks in FMB simulation are assembling and solving the linearized system Equation (14). The calculation and assembly of the Jacobian matrix and right-hand side of Equation (14) can generally be parallelized efficiently. Solving Equation (14) with a direct solver, however, benefits little from parallelization, mainly because direct solvers are inherently serial.

In this section, we examine the block and sparse features of the presented Jacobian matrix and then introduce the block Gaussian elimination (BGE) method to solve the linear equations with algorithm-level parallelism. Before the BGE process, the variables and sub-blocks are reordered to obtain a smaller bandwidth.

3.1. Coordinate Reordering Before BGE

To explore potential parallel strategies for directly solving Equation (14), we examine the block sparse characteristics of the Jacobian matrix derived from the CB method and then reorder the variables.

We write the linearized system Equation (14) here again for clarity:

{[\begin{matrix} \frac{M}{h^{2}} + \frac{C}{h} + K - (\frac{\partial τ}{\partial ξ} + \frac{1}{h} \cdot \frac{\partial τ}{\partial \dot{ξ}}) & {(\frac{\partial c}{\partial ξ})}^{T} \\ \frac{\partial c}{\partial ξ} & 0 \end{matrix}]}^{(n, j)} [\begin{matrix} Δ ξ^{(n, j)} \\ Δ λ^{(n, j)} \end{matrix}] = - {[\begin{matrix} b \\ c \end{matrix}]}^{(n, j)},

(45)

wherein

M = diag (M_{1}, \dots, M_{m})

,

C = diag (C_{1}, \dots, C_{m})

,

K = diag (K_{1}, \dots, K_{m})

, and

c = [c_{1}; \dots; c_{k}]

in the system with

m

bodies and

k

constraints. The superscript

□^{(n, j)}

denotes that it is the

j^{th}

iteration in the

n^{th}

time integration step.

In Equation (45),

M

and

K

are clearly block sparse, and so is

C

, since

C = γ K

under the proportional modal damping hypothesis with coefficient

γ

. If the external forces

τ

do not depend on

ξ

and

\dot{ξ}

, we also have

\frac{\partial τ}{\partial ξ} + \frac{1}{h} \cdot \frac{\partial τ}{\partial \dot{ξ}} = 0 .

(46)

However, the constraint Jacobian matrix

\partial c / \partial ξ

is typically considered dense. To limit the bandwidth in Gaussian elimination, we place the Lagrange multipliers

λ

after all the body DOFs

ξ

. When the bodies are connected solely by constraints, the system Jacobian matrix has the block sparse features as shown in Figure 2.

Figure 2. Block sparse features of the system Jacobian matrix

J

.

We will show in the following subsection that the matrix sparsity depicted in Figure 2 can be preserved throughout the BGE process. Although LU decomposition is often used for the direct solution of multibody systems, it is not suitable in this context because the LU bandwidth of the current Jacobian matrix is large.

In fact, we can expect similar block sparsity in sub-blocks

J_{i}, i = 1, \dots, m

as that in

J

. Since the Jacobian matrix of external loads is ignored, the Jacobian matrix of the

i^{th}

component has the following form, ignoring the iteration superscript

j

:

\begin{array}{l} J_{i} ≜ \frac{M_{i}}{h^{2}} + K_{i} = \frac{M_{i}}{h^{2}} + ([\begin{matrix} Ω_{i}^{2} & & \\ & 0 & \\ & & 0 \end{matrix}] + \frac{\partial (M_{i} {\ddot{ξ}}_{i})}{\partial ξ_{i}}) \\ = \frac{1}{h^{2}} [\begin{matrix} E_{n_{i}} & M_{i}^{ft} & M_{i}^{fr} \\ M_{i}^{tf} & M_{i}^{tt} & M_{i}^{tr} \\ M_{i}^{rf} & M_{i}^{rt} & M_{i}^{rr} \end{matrix}] + [\begin{matrix} Ω_{i}^{2} & & \\ & 0 & \\ & & 0 \end{matrix}] + \frac{\partial}{\partial ξ_{i}} [\begin{matrix} E_{n_{i}} {\ddot{q}}_{i} + M_{i}^{ft} (α_{i}) {\ddot{x}}_{i} + M_{i}^{fr} (α_{i}) {\ddot{α}}_{i} \\ M_{i}^{tf} (α_{i}) {\ddot{q}}_{i} + M_{i}^{tt} {\ddot{x}}_{i} + M_{i}^{tr} (q_{i}, α_{i}) {\ddot{α}}_{i} \\ M_{i}^{rf} (α_{i}) {\ddot{q}}_{i} + M_{i}^{rt} (q_{i}, α_{i}) {\ddot{x}}_{i} + M_{i}^{rr} (q_{i}, α_{i}) {\ddot{α}}_{i} \end{matrix}] \\ = [\begin{matrix} \frac{E_{n_{i}}}{h^{2}} + Ω_{i}^{2} & \frac{M_{i}^{ft}}{h^{2}} & \frac{M_{i}^{fr}}{h^{2}} + \frac{\partial (M_{i}^{ft} {\ddot{x}}_{i})}{\partial α_{i}} + \frac{\partial (M_{i}^{fr} {\ddot{α}}_{i})}{\partial α_{i}} \\ \frac{M_{i}^{tf}}{h^{2}} + \frac{\partial (M_{i}^{tr} {\ddot{α}}_{i})}{\partial q_{i}} & \frac{M_{i}^{tt}}{h^{2}} & \frac{M_{i}^{tr}}{h^{2}} + \frac{\partial (M_{i}^{tf} {\ddot{q}}_{i})}{\partial α_{i}} + \frac{\partial (M_{i}^{tr} {\ddot{α}}_{i})}{\partial α_{i}} \\ \frac{M_{i}^{rf}}{h^{2}} + \frac{\partial (M_{i}^{rt} {\ddot{x}}_{i})}{\partial q_{i}} + \frac{\partial (M_{i}^{rr} {\ddot{α}}_{i})}{\partial q_{i}} & \frac{M_{i}^{rt}}{h^{2}} & \frac{M_{i}^{rr}}{h^{2}} + \frac{\partial (M_{i}^{rf} {\ddot{q}}_{i})}{\partial α_{i}} + \frac{\partial (M_{i}^{rt} {\ddot{x}}_{i})}{\partial α_{i}} + \frac{\partial (M_{i}^{rr} {\ddot{α}}_{i})}{\partial α_{i}} \end{matrix}], \end{array}

(47)

wherein

M_{i}^{tr} (q_{i}, α_{i})

means

M_{i}^{tr}

is a function of

q_{i}

and

α_{i}

, and similarly for the other submatrices, as can be seen from the expressions detailed in Table 1.

The sub-block

J_{i}

in Equation (47) has the block sparsity shown in Figure 3a, which is similar to the global block sparsity shown in Figure 2. The pattern in Figure 3a is even more regular: the top-left submatrix is diagonal, and the dense lower-left submatrix has only six rows. The block sparsity in Figure 3b is undesirable for a direct solver, since it produces a full matrix during Gaussian elimination or LU decomposition. This is why we adopt the variable order

ξ_{i} = [q_{i}; x_{i}; α_{i}]

in Section 2.2.3 instead of the order

[x_{i}; α_{i}; q_{i}]

.

Figure 3. Block sparsity of

J_{i}

(a) for

ξ_{i} = [q_{i}; x_{i}; α_{i}]

(presented) and (b) for

ξ_{i} = [x_{i}; α_{i}; q_{i}]

(undesired).
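The effect of the two orderings in Figure 3 on fill-in can be illustrated with a small experiment. The sketch below is an illustration under stated assumptions rather than the authors' code: the sizes `n_flex` and `n_rigid` and all matrix entries are hypothetical, and a Cholesky factorization is used as a stand-in for pivot-free Gaussian elimination on a symmetric positive definite matrix. It counts the nonzeros of the factor for order (a), with the diagonal flexible block first, and for order (b), with the rigid-body block first.

```python
import numpy as np

def cholesky_nnz(A, tol=1e-12):
    """Count entries of the lower Cholesky factor of A larger than tol."""
    L = np.linalg.cholesky(A)
    return int(np.sum(np.abs(L) > tol))

rng = np.random.default_rng(0)
n_flex, n_rigid = 20, 6                        # hypothetical block sizes

D = np.diag(rng.uniform(1.0, 2.0, n_flex))     # diagonal flexible block
B = rng.standard_normal((n_rigid, n_flex))     # dense coupling block
# Choose the rigid block so that the Schur complement is the identity,
# which guarantees that the assembled matrix is positive definite.
C = B @ np.linalg.inv(D) @ B.T + np.eye(n_rigid)

J_a = np.block([[D, B.T], [B, C]])    # order [q; x; alpha], as in Figure 3a
J_b = np.block([[C, B], [B.T, D]])    # order [x; alpha; q], as in Figure 3b

nnz_a = cholesky_nnz(J_a)
nnz_b = cholesky_nnz(J_b)
print("factor nonzeros, order (a):", nnz_a)
print("factor nonzeros, order (b):", nnz_b)
```

In order (a) the factor keeps the diagonal block and only fills the six border rows, so its size grows linearly with the number of modes. In order (b) eliminating the dense rigid-body block first couples all flexible DOFs, and the trailing block of the factor becomes dense.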

3.2. Feasibility Study of BGE on Sub-Block Jacobian Matrix

First, we show that the submatrix

J_{i}

can be expected to be symmetric positive definite, so that the block Gaussian elimination (BGE) method can be applied to it. We know that

M_{i}

is symmetric positive definite. Since the component

\frac{\partial (M_{i}^{(j)} {\ddot{ξ}}^{(j)})}{\partial ξ_{i}^{T}}

in

K_{i}

is much smaller than the other terms,

K_{i}

can be considered symmetric positive semi-definite. Consequently,

J_{i} = \frac{M_{i}}{h^{2}} + K_{i}

is symmetric positive definite. Because

J_{i}

is symmetric positive definite, the BGE method can be applied and numerical stability is maintained without pivoting, according to the stability theory of BGE proved in references [29,34,35].
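The positive definiteness argument can be written out in one line. For any nonzero vector v (a symbol introduced here only for illustration), and under the approximation stated above that the stiffness matrix of body i is symmetric positive semi-definite,

v^{T} J_{i} v = \frac{1}{h^{2}} v^{T} M_{i} v + v^{T} K_{i} v \geq \frac{1}{h^{2}} v^{T} M_{i} v > 0 .

The bound also suggests that a smaller time step h increases the weight of the mass term relative to the stiffness term, so the neglected non-symmetric part of the stiffness matrix matters less as h decreases.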

After the variable reordering in Section 3.1, the BGE process can preserve the sparsity within submatrices

J_{i}

and the entire matrix

J

, thereby achieving linear computational complexities. Consider

J_{i}

in Figure 3a, for example. We apply the BGE process on the following linear equations with Jacobian matrix

J_{i}

:

[\begin{matrix} J_{i}^{ff} & J_{i}^{fR} \\ J_{i}^{Rf} & J_{i}^{RR} \end{matrix}] [\begin{matrix} Δ ξ_{i}^{f} \\ Δ ξ_{i}^{R} \end{matrix}] = - [\begin{matrix} b_{i}^{f} \\ b_{i}^{R} \end{matrix}],

(48)

wherein the superscripts ^f and ^R denote the components of flexible modes and rigid body motion, respectively. Solving Equation (48) is equivalent to solving the following Schur’s complement [16] equations within the rigid body motion DOFs:

[J_{i}^{RR} - J_{i}^{Rf} {(J_{i}^{ff})}^{- 1} J_{i}^{fR}] Δ ξ_{i}^{R} = - [b_{i}^{R} - J_{i}^{Rf} {(J_{i}^{ff})}^{- 1} b_{i}^{f}],

(49)

Because

J_{i}^{ff}

is diagonal, computing its inverse

{(J_{i}^{ff})}^{- 1}

requires only

n_{i}

divisions. Since

J_{i}^{Rf}

is a

6 \times n_{i}

submatrix and

J_{i}^{fR}

is an

n_{i} \times 6

submatrix, forming

J_{i}^{Rf} {(J_{i}^{ff})}^{- 1}

requires

6 n_{i}

operations. Then,

[J_{i}^{Rf} {(J_{i}^{ff})}^{- 1}] J_{i}^{fR}

requires another

6 \times 6 \times 2 n_{i} = 72 n_{i}

operations, and

[J_{i}^{Rf} {(J_{i}^{ff})}^{- 1}] b_{i}^{f}

requires another

6 \times 2 n_{i} = 12 n_{i}

operations. Finally, the subtractions

[J_{i}^{RR} - J_{i}^{Rf} {(J_{i}^{ff})}^{- 1} J_{i}^{fR}]

and

[b_{i}^{R} - J_{i}^{Rf} {(J_{i}^{ff})}^{- 1} b_{i}^{f}]

require another

36 + 6 = 42

operations. So, to form Schur’s complement Equation (49), the total computational complexity is

6 n_{i} + 72 n_{i} + 12 n_{i} + 42 = 90 n_{i} + 42 = O (n_{i}) .

(50)

The computational complexity of solving the

6 \times 6

Equation (49) is merely

O (1)

. Consequently, the computational complexity of BGE on

J_{i}

is linear in

n_{i}

. Similar applications of the Schur’s complement have been reported in rigid multibody system simulations involving contact problems [20,36].
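The elimination in Equations (48) and (49) can be sketched in a few lines. This is a minimal illustration that assumes the flexible block is exactly diagonal and is stored as the vector `d_ff`. The function name `solve_body_block` and the random test data are hypothetical and do not come from the paper's code.

```python
import numpy as np

def solve_body_block(d_ff, J_fR, J_Rf, J_RR, b_f, b_R):
    """Solve [[diag(d_ff), J_fR], [J_Rf, J_RR]] [dxi_f; dxi_R] = -[b_f; b_R]
    through the Schur complement on the six rigid-body DOFs, Equation (49)."""
    W = J_Rf / d_ff                        # J_Rf @ inv(J_ff) as a column scaling
    S = J_RR - W @ J_fR                    # 6 x 6 Schur complement
    rhs_R = -(b_R - W @ b_f)               # reduced right-hand side
    dxi_R = np.linalg.solve(S, rhs_R)      # small dense solve, cost independent of n_i
    dxi_f = (-b_f - J_fR @ dxi_R) / d_ff   # back-substitution into the first block row
    return dxi_f, dxi_R

rng = np.random.default_rng(0)
n_i, h = 50, 1e-2                                   # hypothetical mode count and step size
d_ff = 1.0 / h**2 + rng.uniform(1e2, 1e4, n_i)      # diagonal of E/h^2 + Omega^2
J_fR = rng.standard_normal((n_i, 6))
J_Rf = J_fR.T
J_RR = (J_Rf / d_ff) @ J_fR + 10.0 * np.eye(6)      # keeps the Schur complement well conditioned
b_f, b_R = rng.standard_normal(n_i), rng.standard_normal(6)

dxi_f, dxi_R = solve_body_block(d_ff, J_fR, J_Rf, J_RR, b_f, b_R)

# Reference: assemble the full block and solve it densely.
J_i = np.block([[np.diag(d_ff), J_fR], [J_Rf, J_RR]])
ref = np.linalg.solve(J_i, -np.concatenate([b_f, b_R]))
print(np.max(np.abs(np.concatenate([dxi_f, dxi_R]) - ref)))
```

Every operation that touches the flexible DOFs is a scaling or a product with a six-row matrix, which is where the linear cost in the number of modes comes from.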

3.3. Algorithm-Level Parallel Direct Solution of System Linear Equations Based on BGE

The block sparsity and symmetry of system Jacobian matrix

J

resemble that of submatrix

J_{i}

, which exhibits a strong principal diagonal property as shown in Figure 2 and Figure 3a. Provided that high-speed rotating parts are not directly incorporated into the system, the Jacobian matrix can still remain positive, because damping typically does not dominate the behavior of structural components [29]. Consequently, for the linearized system Equation (45), block Gaussian elimination (BGE) can be employed directly to enable algorithm-level parallel computation.

For convenience, we denote

c_{ξ_{1}} ≜ \frac{\partial c}{\partial ξ_{1}^{T}}

,

c_{ξ_{1}}^{T} ≜ \frac{\partial c^{T}}{\partial ξ_{1}}

and write Equation (45) in a partitioned form, following [37], where

τ_{i}

denotes the right-hand side block of body

i

; thus, we obtain

[\begin{matrix} J_{1} & & & & c_{ξ_{1}}^{T} \\ & J_{2} & & & c_{ξ_{2}}^{T} \\ & & ⋱ & & ⋮ \\ & & & J_{m} & c_{ξ_{m}}^{T} \\ c_{ξ_{1}} & c_{ξ_{2}} & \dots & c_{ξ_{m}} & 0 \end{matrix}] [\begin{matrix} Δ ξ_{1} \\ Δ ξ_{2} \\ ⋮ \\ Δ ξ_{m} \\ Δ λ \end{matrix}] = - [\begin{matrix} τ_{1} \\ τ_{2} \\ ⋮ \\ τ_{m} \\ c \end{matrix}] .

(51)

Then, the algorithm-level parallel scheme for the direct solution of the flexible multibody system based on BGE is as follows:

Step ①: Each block row corresponding to the submatrix $J_{i}$ is left-multiplied by the Gaussian eliminator $J_{i}^{- 1}$ in parallel to obtain

$[\begin{matrix} E_{n_{1}} & & & & J_{1}^{- 1} c_{ξ_{1}}^{T} \\ & E_{n_{2}} & & & J_{2}^{- 1} c_{ξ_{2}}^{T} \\ & & ⋱ & & ⋮ \\ & & & E_{n_{m}} & J_{m}^{- 1} c_{ξ_{m}}^{T} \\ c_{ξ_{1}} & c_{ξ_{2}} & \dots & c_{ξ_{m}} & 0 \end{matrix}] [\begin{matrix} Δ ξ_{1} \\ Δ ξ_{2} \\ ⋮ \\ Δ ξ_{m} \\ Δ λ \end{matrix}] = - [\begin{matrix} J_{1}^{- 1} τ_{1} \\ J_{2}^{- 1} τ_{2} \\ ⋮ \\ J_{m}^{- 1} τ_{m} \\ c \end{matrix}];$

(52)
Step ②: Each constraint Jacobian matrix $c_{ξ_{i}}, i = 1, \dots, m$ in the lower-left is eliminated in parallel by the unit matrix $E_{n_{i}}$ in the top-left, resulting in the Schur’s complement $- \sum_{i = 1}^{m} c_{ξ_{i}} J_{i}^{- 1} c_{ξ_{i}}^{T}$ in the subspace of Lagrange multiplier $λ$, thus forming

$[\begin{matrix} E_{n_{1}} & & & & J_{1}^{- 1} c_{ξ_{1}}^{T} \\ & E_{n_{2}} & & & J_{2}^{- 1} c_{ξ_{2}}^{T} \\ & & ⋱ & & ⋮ \\ & & & E_{n_{m}} & J_{m}^{- 1} c_{ξ_{m}}^{T} \\ & & & & - \sum_{i = 1}^{m} c_{ξ_{i}} J_{i}^{- 1} c_{ξ_{i}}^{T} \end{matrix}] [\begin{matrix} Δ ξ_{1} \\ Δ ξ_{2} \\ ⋮ \\ Δ ξ_{m} \\ Δ λ \end{matrix}] = - [\begin{matrix} J_{1}^{- 1} τ_{1} \\ J_{2}^{- 1} τ_{2} \\ ⋮ \\ J_{m}^{- 1} τ_{m} \\ c - \sum_{i = 1}^{m} c_{ξ_{i}} J_{i}^{- 1} τ_{i} \end{matrix}];$

(53)
Step ③: The linear equations within the Lagrange multiplier subspace are solved to obtain the multiplier increment as

$Δ λ = {(\sum_{i = 1}^{m} c_{ξ_{i}} J_{i}^{- 1} c_{ξ_{i}}^{T})}^{- 1} (c - \sum_{i = 1}^{m} c_{ξ_{i}} J_{i}^{- 1} τ_{i});$

(54)
Step ④: Substitute $Δ λ$ back into Equation (53) to obtain the increments $Δ ξ_{i}, i = 1, \dots, m$ in parallel for all bodies. With $J_{i}^{- 1} τ_{i}$ and $J_{i}^{- 1} c_{ξ_{i}}^{T}$ already computed in Step ①, the general coordinate increments of the bodies are

$Δ ξ_{i} = - J_{i}^{- 1} τ_{i} - J_{i}^{- 1} c_{ξ_{i}}^{T} Δ λ, i = 1, \dots, m;$

(55)
Step ⑤: Update the general coordinates by the residual increments in parallel for all bodies and constraints. Since the time step $n$ and the iteration superscript $j$ are omitted in Formulas (51)–(55), we restore them to update the Newton iteration as

$\begin{cases} ξ_{i}^{(n, j + 1)} = ξ_{i}^{(n, j)} + Δ ξ_{i}^{(n, j)} \\ λ^{(n, j + 1)} = λ^{(n, j)} + Δ λ^{(n, j)} \end{cases} .$

(56)
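Steps ① to ④ can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' OpenMP implementation: the function name `bge_solve`, the block sizes, and the random test data are hypothetical, Python threads are used only to mirror the per-body parallel structure, and each body block is solved with a dense solver instead of the Schur complement reduction of Section 3.2. The result is compared with a dense solution of the assembled system of Equation (51).

```python
import numpy as np
from concurrent.futures import ThreadPoolExecutor

def bge_solve(J_blocks, c_blocks, tau_blocks, c, workers=4):
    """Block Gaussian elimination for Equation (51).
    J_blocks[i]: (n_i, n_i) body Jacobian; c_blocks[i]: (k, n_i) constraint Jacobian;
    tau_blocks[i]: (n_i,) body right-hand side block; c: (k,) constraint residual."""
    def eliminate(args):
        J, cx, tau = args
        # Step 1: J^{-1} [c^T, tau] in a single solve per body.
        Y = np.linalg.solve(J, np.column_stack([cx.T, tau]))
        JinvCt, Jinvtau = Y[:, :-1], Y[:, -1]
        # Step 2: this body's contributions to the multiplier subspace.
        return JinvCt, Jinvtau, cx @ JinvCt, cx @ Jinvtau

    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = list(pool.map(eliminate, zip(J_blocks, c_blocks, tau_blocks)))

    S = sum(p[2] for p in parts)                 # sum_i c_i J_i^{-1} c_i^T
    r = c - sum(p[3] for p in parts)             # c - sum_i c_i J_i^{-1} tau_i
    dlam = np.linalg.solve(S, r)                 # Step 3, Equation (54), serial
    dxi = [-p[1] - p[0] @ dlam for p in parts]   # Step 4, Equation (55), per body
    return dxi, dlam

# Hypothetical test system: three bodies and three constraint equations.
rng = np.random.default_rng(1)
sizes, k = [5, 7, 4], 3
J_blocks, c_blocks, tau_blocks = [], [], []
for n in sizes:
    A = rng.standard_normal((n, n))
    J_blocks.append(A @ A.T + n * np.eye(n))     # symmetric positive definite block
    c_blocks.append(rng.standard_normal((k, n)))
    tau_blocks.append(rng.standard_normal(n))
c = rng.standard_normal(k)

dxi, dlam = bge_solve(J_blocks, c_blocks, tau_blocks, c)

# Reference: assemble the full matrix of Equation (51) and solve it densely.
N = sum(sizes)
full = np.zeros((N + k, N + k))
off = 0
for J, cx in zip(J_blocks, c_blocks):
    n = J.shape[0]
    full[off:off + n, off:off + n] = J
    full[off:off + n, N:] = cx.T
    full[N:, off:off + n] = cx
    off += n
ref = np.linalg.solve(full, -np.concatenate(tau_blocks + [c]))
print(np.max(np.abs(np.concatenate(dxi + [dlam]) - ref)))
```

Note that `bge_solve` never assembles the global matrix. Only the reference check does, and each body needs nothing beyond its own blocks until the small multiplier system is formed.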

In our proposed scheme, steps ①, ②, ④, and ⑤ are algorithm-level block parallel, whereas step ③ is serial within the

k

constraint DOFs. In practice, step ③ can still be accelerated by parallelism within a single column or row [38], but the efficiency of this parallelism [39] is not as high as that of block parallelism. Figure 4 illustrates the flowchart of the presented computational scheme integrating block parallelism.

Figure 4. Flowchart of the proposed BGE-based algorithm-level parallel scheme for direct solution of the flexible multibody systems.

Since the original matrix is sparse and no row or column interchanges are required, the assembly of the system matrix

J

is actually unnecessary. This approach saves time on topology calculation and data copying of sparse matrices, which is advantageous for parallel implementations of both OpenMP and MPI.

The proposed block-parallel scheme remains stable for a system lacking damping and gyroscopic matrix

C

, and the stability analysis process is akin to that of the linearized submatrix Equation (48) for

J_{i}

in Section 3.2. We can also anticipate that the BGE-based scheme will be stable when the damping/gyroscopic forces and their Jacobian matrices are small. Damping forces are typically small for most mechanical structures, and gyroscopic forces are also small for low-speed rotors. However, to ensure stability, bodies with high-speed rotation should not be directly incorporated into the system. Nevertheless, periodic vibrations transmitted from high-speed rotors are permissible within the system, as they do not affect the system’s Jacobian matrix. Consequently, the method presented is suitable for diagnostic platform systems with large translational displacement and for aeroengine stator systems experiencing high-speed excitation forces.

4. Numerical Examples

In this section, an aeroengine stator system and a low-speed crank-slider mechanism are used as numerical examples. The algorithm’s stability is verified by comparing its numerical accuracy with several other methods [40], and its efficiency is also compared.

4.1. Example of an Aeroengine Stator System

We consider the vibration response of an aeroengine stator system under the excitation of a rotor’s eccentric periodic force, as depicted in Figure 5. The system comprises five components, a connector, compressor, combustion chamber, turbine, and nozzle, in the order of their modeling (i.e., the sequence of parts in the linearized equations). Each component, treated as an individual entity, is analyzed using finite element software Abaqus 2020 to determine the fixed interface modes, with the lower-order orthogonal elastic modes retained. The part connection relationships, finite element mesh, and number of the CB modes are illustrated in Table 2. A sinusoidal force with a frequency of 20 Hz and an amplitude of 100,000 Newtons is applied at the same position of the fixed connection between part ③ and part ④ (i.e., at the red star point) to simulate the excited vibration of the stator system by the eccentric rotor through the bearing.

Figure 5. Example of an aeroengine stator system: its structural composition, finite element meshing, constraint connections, and position of acting forces.

Table 2. Elastic modal DOFs and constraint connections between components in the aeroengine stator system example.

The topology of the Jacobian matrix derived from the aeroengine stator system is depicted in Figure 6. We can observe very appealing block sparsity in the submatrices and the whole matrix, which exhibit self-similarity.

Figure 6. Block sparsity and self-similarity of the Jacobian matrix derived from the aeroengine stator system (nz = number of nonzero entries).

For the purpose of comparing accuracy and verifying stability, the numerical results of the aeroengine stator system were computed using the present method, commercial software ADAMS 2020, and the traditional serial Gaussian algorithm. The results are compared in Figure 7, which shows highly consistent outcomes, thereby verifying the numerical stability of the proposed BGE-based algorithm-level parallel method.

Figure 7. Numerical accuracy comparison of the elastic vibration displacement to verify numerical stability of the presented algorithm-level parallel method (using the aeroengine stator system as an example).

As shown in Table 3, when employing 4-core OpenMP parallelism to solve linear Equation (45), the computational efficiency of pivoting GE decreases by 34.7% compared with the serial case, primarily due to the consumption of parallel initialization. This phenomenon illustrates the inadaptability of the pivoting GE to parallelism. In contrast, the efficiency of the block GE parallelism increases by 35.9% compared to its serial counterpart.

Table 3. Computational efficiency comparison between the traditional pivoting GE and the presented parallel BGE method (using the aeroengine stator system as an example).

In the serial case comparison, the block GE efficiency increases by 5.1% compared to the pivoting GE, primarily due to saving of pivoting operations between rows and columns. In contrast, in the parallel case comparison, the block GE efficiency improves by a total of 54.8% over the pivoting GE parallelism. This demonstrates the high parallelism adaptability of the block GE. In summary, the computational efficiency of the presented parallel block GE method increases by 39.2% in total compared to the original serial pivoting GE method.

4.2. Sample of a Low-Speed Rotating Crank-Slider Mechanism

Consider the elastic vibration response of a crank-slider mechanism at low speed (0.1 Hz), as depicted in Figure 8. The system consists of a crank, a linker, and a slider, with the elastic DOFs and interconnections detailed in Table 4. The FEM details of the flexible crank and flexible linker for modal analysis are listed in Table 5. We ignore friction in all joints. The physical quantity measured is the elastic displacement of point A in the

y

direction with respect to the body-fixed coordinate system

O x y

.

Figure 8. Sample of a low-speed crank-slider mechanism: its components, constraint connections, and rotational driving.

Table 4. Elastic modal DOFs and constraint connections between components in the low-speed crank-slider mechanism.

Table 5. FEM details of the flexible crank and flexible linker for modal analysis.

The parallel block GE and the serial pivoting GE methods are executed for computational comparison. The elastic part of the vibration response

y_{A}

, which is the vibration displacement of point A in the body-fixed coordinate system

O x y

, within one period is shown in Figure 9. The results from both methods agree closely, which verifies the numerical stability of the presented parallel block GE method for the FMB system with low-speed rotating parts.

Figure 9. Numerical accuracy comparison of the elastic vibration displacement to verify numerical stability of the presented algorithm-level parallel method (using the low-speed crank-slider mechanism as a sample).

As indicated in Table 6, the overall computational efficiency increased by 1.9% in the low-speed crank-slider sample, where the total CPU time consumption decreased from 4089 ms to 4013 ms. The parallelization of the presented block GE method accelerates the FMB system with low-speed rotating parts, albeit not to the extent demonstrated in the example of the aeroengine stator system. This is due to the presence of five flexible components in the aeroengine example compared to only two in the current crank-slider sample. This phenomenon suggests that the parallelization efficiency of the presented method is related to the number of flexible components in the system. In other words, the parallelization is executed between sub-blocks of flexible components.

Table 6. Computational efficiency comparison between the serial pivoting GE method and the presented parallel block GE method (using the low-speed crank-slider mechanism as a sample).

It is also noteworthy that the number of solutions of the linearized system Equation (45) rose from 30,214 to 31,546, an increase of approximately 4.4%. This is because the gyroscopic component of the Jacobian matrix

C

is neglected in Equation (45) using our method. This results in the Newton iteration degenerating into the quasi-Newton iteration, which typically requires more iterations in engineering applications. Consequently, despite a 4.4% increase in the total number of iterations, a computational efficiency improvement of 1.9% is realized. As previously analyzed, the parallel efficiency can be significantly enhanced in scenarios involving more elastic modes or more flexible components within the FMB systems in question.
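The percentages quoted for the crank-slider sample can be reproduced from the four numbers given in the text. The sketch below also derives an average time per linear solve. This is an illustrative quantity introduced here, not a figure reported in Table 6, and it spreads the total CPU time (including any assembly cost contained in that total) evenly over the solves.

```python
# Values quoted in the text for the low-speed crank-slider sample.
t_serial_ms, t_parallel_ms = 4089, 4013   # total CPU time
n_serial, n_parallel = 30214, 31546       # number of linear solves

time_saving = (t_serial_ms - t_parallel_ms) / t_serial_ms
solve_increase = (n_parallel - n_serial) / n_serial

# Derived here for illustration: average total time per linear solve.
per_solve_serial = t_serial_ms / n_serial
per_solve_parallel = t_parallel_ms / n_parallel
per_solve_saving = 1.0 - per_solve_parallel / per_solve_serial

print(f"total time saving:      {100 * time_saving:.1f}%")
print(f"increase in solves:     {100 * solve_increase:.1f}%")
print(f"saving per solve (avg): {100 * per_solve_saving:.1f}%")
```

Read this way, the saving per solve is larger than the overall saving, because part of the gain is spent on the additional quasi-Newton iterations.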

5. Conclusions

In this paper, the CB method is utilized to model large, complex flexible multibody systems. System Jacobian matrices with favorable block symmetry and sparsity properties are formed by appropriately ordering the generalized coordinates of bodies and joints. A local–global two-layer parallel algorithm is constructed to directly solve the linearized systemic equations. The algorithm also circumvents the disruption of matrix topological sparsity in the traditional pivoting GE process of direct solution and achieves algorithm-level parallelism. The algorithm ensures numerical stability by employing the physical assumption of component modal synthesis methods and the specific format of the Craig–Bampton method. The numerical stability and high efficiency of the proposed block parallel algorithm are validated by comparing the results with those from the commercial software ADAMS 2020 and with the pivoting GE algorithm, using the examples of aeroengine stator system vibration and the low-speed rotation of a crank-slider mechanism.

The method detailed in this paper is unsuitable for multibody systems that directly contain high-speed rotating parts, as in such cases the gyroscopic matrix may become the primary component of the Jacobian matrix, leading to instability in the presented scheme. A potential solution is to construct an appropriate preconditioning matrix [40] to ensure numerical stability and avoid the need for pivoting operations. This approach could be explored in future papers.

Author Contributions

Conceptualization, C.Y. and Z.X.; methodology, C.Y.; software, B.X., Y.W. and Y.X.; validation P.Y.; writing—original draft preparation, C.Y.; writing—review and editing, P.Y., B.X. and Y.W.; supervision, Z.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author because the research project involved in this article involves relevant confidentiality agreements, and the overall project has not been completed yet.

Conflicts of Interest

The authors declare no conflicts of interest.

Nomenclature

$0$	Zero vector or zero matrix
$a$	Constraints scale factor for numerical conditioning
$A_{i}$	Transformation matrix from body-fixed frame to global frame
$α_{i} = [α_{i}; β_{i}; γ_{i}]$	Rotation parameters from global frame to body-fixed frame
$b$	Residual of body general force equations
$B_{i}$	Body-fixed coordinate frame (also the FFR) of flexible body $i$
$c$	Complete constraints in system
$C$	System damping matrix
$C_{i}$	Damping matrix of body $i$
$c_{ξ}$	Partial derivative of $c$ corresponding to $ξ$ , i.e., $\partial c / \partial ξ$
$D_{i}$	Coefficient matrix before ${\dot{α}}_{i}$ to obtain $ω_{i}$
$Δ λ^{(n, j)}$	Increment of $λ^{(n, j)}$ in the $j^{th}$ iteration at the $n^{th}$ time step
$Δ ξ^{(n, j)}$	Increment of $ξ^{(n, j)}$ in the $j^{th}$ iteration at the $n^{th}$ time step
$Δ λ$	Abbreviation of $Δ λ^{(n, j)}$ when there is no ambiguity
$Δ ξ$	Abbreviation of $Δ ξ^{(n, j)}$ when there is no ambiguity
$diag (\dots)$	Block-diagonal matrix composed of submatrices
$e_{ξ}$, $e_{λ}$, $e_{b}$, $e_{c}$	Error thresholds for displacement variables, constraint forces, body force equation residuals, and constraint equation residuals
$E_{n_{i}}$	Unit matrix with dimension $n_{i}$
$f_{v}$	Velocity’s algebraic function for time integration with BDFs
$f_{a}$	Acceleration’s algebraic function for time integration with BDFs
${\bar{f}}_{i}$	External force applied to the boundary DOFs in FE equations
$F_{iner}$	General inertial forces
$F_{elas}$	General elastic forces
${\bar{φ}}_{i, j}^{Λ}$	The $j^{th}$ low-frequency mode of flexible body $i$ on coordinates $u_{i}^{Λ}$
$Φ_{i}$	Elastic modes of the CB method (orthogonal and non-zero frequency)
${\bar{Φ}}_{i}$	Preserved modes matrix of body $i$
${\bar{\bar{Φ}}}_{i}$	Non-zero-frequency reorthogonalized modal matrix
${\bar{Φ}}_{i}^{boun}$	Boundary modes matrix of body $i$
${\bar{Φ}}_{i}^{vibr}$	Preserved vibration modes matrix of body $i$
$Φ_{i, p}$	Components of the modes matrix at node $p$
$Φ_{i, p}^{t}$	Translational components of $Φ_{i, p}$
$Φ_{i, p}^{r}$	Rotational components of $Φ_{i, p}$
$G$	Global coordinate frame
$γ$	Proportional modal damping coefficient
$h$	Time step size
$H_{i, p}$	Moment of inertia lumped at node $p$ of body $i$
$i$	Serial number for bodies in the system
$I_{i, 1} ~ I_{i, 8}$	Modal invariant submatrices in the modal neutral file
$J$	System Jacobian matrix
$J_{i}$	Jacobian matrix of body $i$
$J_{i}^{* #}$	Submatrix of $J_{i}$ corresponding to rows DOFs “*” and column DOFs “#”, “^f” for flexible modes, “^t” for rigid translation, “^r” for rigid rotation, “^R” for rigid translation and rotation
$k$	Total number of constraint equations in the system
$K$	Stiffness matrix of system
$K_{i}$	Stiffness matrix of body $i$
${\bar{K}}_{i}$	Stiffness matrix of finite element equations
$L$	Lagrange function of system
$λ$	Lagrangian multipliers of constraints
$λ^{(n)}$	Value of $λ$ at discrete time $t = n h$
$m$	Total number of bodies in the system
$m_{i, p}$	Lumped mass at node $p$ of body $i$
$M$	Mass matrix of system
$M_{i}$	Number of boundary modes of body $i$
$M_{i}$	Mass matrix of body $i$
${\bar{M}}_{i}$	Mass matrix of finite element equations of body $i$
$M_{i}^{* #}$	Submatrix of $M_{i}$ corresponding to rows DOFs “*” and column DOFs “#”, “^f” for flexible modes, “^t” for rigid translation, “^r” for rigid rotation, “^R” for rigid translation and rotation
$N_{i}$	Number of preserved low-frequency modes of body $i$
$n_{i}$	Total number of flexible modes of body $i$ after removal of rigid modes and orthogonalization of modes
$Ω_{i}^{2}$	Diagonal matrix of modal circular frequencies of body $i$
$ω_{i, j}$	The $j^{th}$ modal circular frequency of flexible body $i$
$ω_{i}$	Local angular velocity of the body-fixed frame
$p$	An FE node on the flexible body
${\bar{ψ}}_{i, j}^{Λ}$	The $j^{th}$ boundary mode of flexible body $i$ on coordinates $u_{i}^{Λ}$
$q_{i}$	Modal coordinates of the flexible body $i$ (excluding zero-frequency modes and ensuring orthogonality)
${\bar{q}}_{i}$	Redundant modal coordinates of the flexible body $i$ (including zero-frequency modes)
$r_{i, p}$	Global position of node $p$ of flexible body $i$
$s_{i, p}$	Original position of node $p$ of flexible body $i$ in $B_{i}$
$t$	Simulation time
$τ$	General force vector of external loads
$T$	Kinetic energy of system
$u_{i}$	Finite element DOFs of flexible body $i$
$u_{i, p}^{t}$	Translational displacement of node $p$ due to flexible deformation of flexible body $i$
$u_{i}^{Λ}$	Inner DOFs of flexible body $i$ within the FE formulation
$u_{i}^{Γ}$	Boundary DOFs of flexible body $i$ within the FE formulation
$V$	Potential energy of system
$v_{i, p}$	Global velocity of node $p$ of flexible body $i$
$x_{i} = [x_{i}; y_{i}; z_{i}]$	Position vector from global frame to body-fixed frame in global frame
$ξ$	General displacement of bodies
$\dot{ξ}$, $\ddot{ξ}$	General velocity and acceleration of bodies
$ξ^{(n, 0)}$	Initial value of $ξ$ in Newton iteration for the $n^{th}$ time step
$ξ^{(- 1)}$	Value of $ξ$ at discrete time $t = - h$
$ξ_{i}$	General displacement of body $i$
$ξ^{(n)}$	Value of $ξ$ at discrete time $t = n h$
“;”	Separator of components in a “column” vector
$□^{(0)}$	Variable values at initial time $t = 0$ for $□$
$□^{(n)}$	Variable values at discrete time $t = n h$ for $□$
$□^{(n, 0)}$	Initial values of the $n^{th}$ time step for $□$ in the Newton iteration
$□^{(n, j)}$	Results of the $j^{th}$ iteration in the $n^{th}$ time step for $□$
$□^{T}$	Vector transpose or matrix transpose of $□$
$□^{Λ}$	Inner DOFs components in the Craig–Bampton method for $□$
$□^{Γ}$	Boundary DOFs components in the Craig–Bampton method for $□$

References

Simeon, B. Computational Flexible Multibody Dynamics. A Differential-Algebraic Approach; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
Shabana, A.A. Dynamics of Multibody Systems; Wiley: New York, NY, USA, 1989. [Google Scholar]
Craig, R.R.J. Substructure methods in vibration. J. Mech. Des. 1995, 117, 207–213. [Google Scholar] [CrossRef]
Song, J.O.; Haug, E.J. Dynamic analysis of planar flexible mechanisms. Comput. Methods Appl. Mech. Eng. 1980, 24, 359–381. [Google Scholar] [CrossRef]
Hurty, W.C. Dynamic analysis of structural systems using component modes. AIAA J. 1965, 3, 678–685. [Google Scholar] [CrossRef]
Bampton, M.C.C.; Craig, J.R.R. Coupling of substructures for dynamic analyses. AIAA J. 1968, 6, 1313–1319. [Google Scholar]
Hou, S.N. Review of Modal Synthesis Techniques and a New Approach; NASA: Washington, DC, USA, 1969. [Google Scholar]
Macneal, R.H. A hybrid method of component mode synthesis. Comput. Struct. 1971, 1, 581–601. [Google Scholar] [CrossRef]
Rubin, S. Improved component-mode representation for structural dynamic analysis. AIAA J. 1975, 13, 995–1006. [Google Scholar] [CrossRef]
Kim, J.G.; Lee, P.S. An enhanced craig–bampton method. Int. J. Numer. Methods Eng. 2015, 103, 79–93. [Google Scholar] [CrossRef]
Kim, J.G.; Park, Y.J.; Lee, G.H.; Kim, D.N. A general model reduction with primal assembly in structural dynamics. Comput. Methods Appl. Mech. Eng. 2017, 324, 1–28.
Kim, J.G.; Han, J.B.; Lee, H.; Kim, S.S. Flexible multibody dynamics using coordinate reduction improved by dynamic correction. Multibody Syst. Dyn. 2018, 42, 411–429.
Han, J.B.; Kim, J.G.; Kim, S.S. An efficient formulation for flexible multibody dynamics using a condensation of deformation coordinates. Multibody Syst. Dyn. 2019, 47, 293–316.
Wanner, G.; Hairer, E. Solving Ordinary Differential Equations II; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 1996; Volume 375.
Yang, C.; Du, J.; Cheng, Z.; Wu, Y.; Li, C. Flexibility investigation of a marine riser system based on an accurate and efficient modelling and flexible multibody dynamics. Ocean Eng. 2020, 207, 107407.
Park, K.C.; Downer, J.D.; Chiou, J.C.; Farhat, C. A modular multibody analysis capability for high-precision, active control and real-time applications. Int. J. Numer. Methods Eng. 1991, 32, 1767–1798.
Cao, D.Z.; Qiang, H.F.; Ren, G.X. Parallel computing studies of flexible multibody system dynamics using OpenMP and Pardiso. J. Tsinghua Univ. Sci. Technol. 2012, 52, 1643–1649.
Dagum, L.; Menon, R. OpenMP: An industry standard API for shared-memory programming. IEEE Comput. Sci. Eng. 1998, 5, 46–55.
Negrut, D.; Tasora, A.; Mazhar, H.; Heyn, T.; Hahn, P. Leveraging parallel computing in multibody dynamics. Multibody Syst. Dyn. 2012, 27, 95–117.
Negrut, D.; Serban, R.; Mazhar, H.; Heyn, T. Parallel computing in multibody system dynamics: Why, when, and how. J. Comput. Nonlinear Dyn. 2014, 9, 041007.
Pařík, P.; Kim, J.G.; Isoz, M.; Ahn, C.U. A parallel approach of the enhanced Craig–Bampton method. Mathematics 2021, 9, 3278.
Cros, J.M. Parallel modal synthesis methods in structural dynamics. Contemp. Math. 1998, 218, 492–499.
Murthy, P.; Poschmann, P.; Reymond, M.; Schartz, P.; Wilson, C.T. Automated Component Modal Synthesis with Parallel Processing. 2000. Available online: http://www.mscsoftware.com/support/library/conf/auto00/p03900.pdf (accessed on 22 October 2024).
Li, P.; Liu, C.; Tian, Q.; Hu, H.Y.; Song, Y.P. Dynamics of a deployable mesh reflector of satellite antenna: Parallel computation and deployment simulation. J. Comput. Nonlinear Dyn. 2016, 11, 061005.
Featherstone, R. A divide-and-conquer articulated-body algorithm for parallel O(log(n)) calculation of rigid-body dynamics. Part 1: Basic algorithm. Int. J. Robot. Res. 1999, 18, 876–892.
Dewes, E.M.; Rixen, D.J. Time integration of multibody systems using nonlinear domain decomposition techniques with mixed interface conditions. In Proceedings of the 5th Joint International Conference on Multibody System Dynamics, Lisboa, Portugal, 24–28 June 2018.
Cano, J.C.; Cuenca, J.; Giménez, D.; Saura-Sánchez, M.; Segado-Cabezos, P. A parallel simulator for multibody systems based on group equations. J. Supercomput. 2019, 75, 1368–1381.
Wasfy, T.M.; Noor, A.K. Computational strategies for flexible multibody systems. Appl. Mech. Rev. 2003, 56, 553–613.
Higham, N.J. Gaussian elimination. Wiley Interdiscip. Rev. Comput. Stat. 2011, 3, 230–238.
Peiret, A.; Andrews, S.; Kövecses, J.; Kry, P.G.; Teichmann, M. Schur complement-based substructuring of stiff multibody systems with contact. ACM Trans. Graph. 2019, 38, 1–17.
Yang, C.; Cao, D.Z.; Zhao, Z.H.; Zhang, Z.R.; Ren, G.X. A direct eigenanalysis of multibody system in equilibrium. J. Appl. Math. 2012, 2012, 1–7.
Wang, G.X.; Niu, Z.P.; Feng, Y. Improved Craig–Bampton Method Implemented into Durability Analysis of Flexible Multibody Systems. Actuators 2023, 12, 65.
Ryan, R.R. ADAMS—Multibody system analysis software. In Multibody Systems Handbook; Springer: Berlin/Heidelberg, Germany, 1990; pp. 361–402.
Pan, V.Y.; Zhao, L. Numerically safe Gaussian elimination with no pivoting. Linear Algebra Its Appl. 2017, 527, 349–383.
Strawderman, R.L.; Higham, N.J. Accuracy and stability of numerical algorithms. J. Am. Stat. Assoc. 1999.
Bender, J.; Erleben, K.; Trinkle, J. Interactive simulation of rigid body dynamics in computer graphics. Comput. Graph. Forum 2014, 33, 246–270.
Golub, G.H.; Greif, C. On solving block-structured indefinite linear systems. SIAM J. Sci. Comput. 2003, 24, 2076–2092.
Faugère, J.C.; Lachartre, S. Parallel Gaussian elimination for Gröbner bases computations in finite fields. In Proceedings of the International Workshop on Parallel Symbolic Computation, Grenoble, France, 21–23 July 2010.
Peng, R.; Vempala, S. Solving sparse linear systems faster than matrix multiplication. In Proceedings of the Symposium on Discrete Algorithms, Virtual, 10–13 January 2021.
Sewell, G. Computational Methods of Linear Algebra; World Scientific Publishing Company: Singapore, 2014.

Figure 1. Coordinates of flexible body $i$ in the floating frame of reference.

Figure 2. Block sparse features of the system Jacobian matrix $J$.

Figure 3. Block sparsity of $J_{i}$ (a) for $ξ_{i} = [q_{i}; x_{i}; α_{i}]$ (presented) and (b) for $ξ_{i} = [x_{i}; α_{i}; q_{i}]$ (undesired).

Figure 4. Flowchart of the proposed BGE-based algorithm-level parallel scheme for direct solution of the flexible multibody systems.

Figure 5. Example of an aeroengine stator system: its structural composition, finite element meshing, constraint connections, and position of acting forces.

Figure 6. Block sparsity and self-similarity of the Jacobian matrix derived from the aeroengine stator system (nz = number of nonzeros).

Figure 7. Numerical accuracy comparison of the elastic vibration displacement to verify numerical stability of the presented algorithm-level parallel method (using the aeroengine stator system as an example).

Figure 8. Sample of a low-speed crank-slider mechanism: its components, constraint connections, and rotational driving.

Figure 9. Numerical accuracy comparison of the elastic vibration displacement to verify numerical stability of the presented algorithm-level parallel method (using the low-speed crank-slider mechanism as a sample).

Table 1. Expressions, dimensions, and mechanical meanings of the modal invariant submatrices.

Modal Invariant Submatrix	Dimension	Mechanical Meaning
$I_{i, 1} = \sum_{p} m_{i, p}$	$1 \times 1$	Total mass
$I_{i, 2} = \sum_{p} m_{i, p} s_{i, p}$	$3 \times 1$	Initial centroid position in the FFR
$I_{i, 3} = \sum_{p} m_{i, p} Φ_{i, p}^{t}$	$3 \times n_{i}$	Deformation-induced change in modal centroid position
$I_{i, 4} = \sum_{p} (m_{i, p} {\tilde{s}}_{i, p} Φ_{i, p}^{t} + H_{i, p} Φ_{i, p}^{r})$	$3 \times n_{i}$	Deformation-induced change in the moment of inertia
$I_{i, 7} = \sum_{p} (m_{i, p} {\tilde{s}}_{i, p}^{T} {\tilde{s}}_{i, p} + H_{i, p})$	$3 \times 3$	Initial moment of inertia
$I_{i, 8 j} = \sum_{p} m_{i, p} {\tilde{s}}_{i, p} {\tilde{φ}}_{i, p, j}^{t}, j = 1, \dots, n_{i}$	$3 \times 3$	Deformation-induced change in the modal moment of inertia (the first-order main term)

Table 2. Elastic modal DOFs and constraint connections between components in the aeroengine stator system example.

Part Number (in Order of Assembly)	Part Name	Total Number of the CB Modes	Connections with Other Components
①	Connector	50	(Ball hinge to ground) × 2
②	Compressor	32	Fixed to ①
③	Combustion chamber	26	Fixed to ②
④	Turbine	32	Fixed to ③
⑤	Nozzle	26	Ball hinge to ground

Table 3. Computational efficiency comparison between the traditional pivoting GE and the presented parallel BGE method (using the aeroengine stator system as an example).

Method for Solving Equation (45)	Total CPU Time, Serial (ms)	Total CPU Time, Parallel with 4 Cores (ms)	Computational Efficiency Improvement by Parallelization ² (%)	Total Number of Solutions of System Equation (45)
Traditional Pivoting GE	666	897	−34.7%	2733
Block GE (presented)	632	405	35.9%	2733
Computational efficiency improvement by block GE ¹ (%)	5.1%	54.8%	39.2%	-

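¹ Computational efficiency improvement by block GE (%) = (pivoting − block)/pivoting × 100%; ² Computational efficiency improvement by parallelization (%) = (serial − parallel)/serial × 100%. A positive value denotes a reduction in CPU time; the third entry of the block GE improvement row (39.2%) compares parallel block GE with serial pivoting GE.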
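
The percentages in Table 3 can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the authors' code: it assumes that "improvement" means the relative reduction in CPU time with respect to the baseline, which is the convention that reproduces the tabulated values. The function name `improvement` and the dictionary layout are illustrative choices.

```python
def improvement(baseline_ms: float, new_ms: float) -> float:
    """Relative reduction in CPU time in percent (positive means faster)."""
    return (baseline_ms - new_ms) / baseline_ms * 100.0


# Total CPU times (ms) taken from Table 3.
pivoting = {"serial": 666.0, "parallel": 897.0}
block = {"serial": 632.0, "parallel": 405.0}

results = {
    # Effect of parallelization for each solver (footnote 2).
    "pivoting GE, parallel vs. serial": improvement(pivoting["serial"], pivoting["parallel"]),
    "block GE, parallel vs. serial": improvement(block["serial"], block["parallel"]),
    # Effect of switching from pivoting GE to block GE (footnote 1).
    "serial, block vs. pivoting": improvement(pivoting["serial"], block["serial"]),
    "parallel, block vs. pivoting": improvement(pivoting["parallel"], block["parallel"]),
    # Overall gain: parallel block GE against serial pivoting GE.
    "parallel block vs. serial pivoting": improvement(pivoting["serial"], block["parallel"]),
}

for label, value in results.items():
    print(f"{label}: {value:.1f}%")
```

Under this convention the negative entry for pivoting GE indicates that the four-core run took longer than the serial run, whereas block GE ran faster in parallel.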

Table 4. Elastic modal DOFs and constraint connections between components in the low-speed crank-slider mechanism.

Part Number (in Order of Assembly)	Part Name	Total Number of the CB Modes	Connections with Other Components
①	Crank	50	Rotation to ground
②	Linker	50	Spin to ①, Spin to ③
③	Slider	0	Translation to ground

Table 5. FEM details of the flexible crank and flexible linker for modal analysis.

FEM Model Item	Value	Note
Material	Aluminum
Density	2700 kg/m³	2.7 × 10⁻⁹ tonne/mm³
Young’s modulus	70 GPa	7 × 10⁴ MPa
Poisson’s ratio	0.3
Finite element size	1 mm × 1 mm × 1 mm	For both the crank and linker
Finite element type	C3D8R	Three-dimensional 8-node hexahedral element with reduced integration

Table 6. Computational efficiency comparison between the serial pivoting GE method and the presented parallel block GE method (using the low-speed crank-slider mechanism as a sample).

Method for Solving Equation (45)	Total CPU Time Consumption (ms)	Total Number of Solutions of System Equation (45)
Serial pivoting GE	4089	30,214
Parallel block GE (presented, with 4 cores OpenMP)	4013	31,546
Computational efficiency improvement by block GE (%) ¹	1.9%	-

¹ Computational efficiency improvement by block GE (%) = (pivoting − block)/pivoting × 100%.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
