1. Introduction
The aggressive scaling of semiconductor technologies has introduced substantial performance gains but has also increased reliability concerns, particularly in power delivery networks. Among these reliability issues, electromigration (EM) has emerged as a critical failure mechanism, driven by high current densities that induce gradual material displacement. This process leads to the formation of voids and hillocks, which can cause open or short circuits and ultimately reduce chip longevity [
1]. As a result, EM is now a dominant reliability limiter and a key consideration in modern VLSI design flows [
2].
Traditional EM assessment methods, such as the Blech criterion [
3] and Black’s equation [
4], rely on empirical assumptions that fail to capture the spatial and temporal complexity of modern interconnect structures. The Blech criterion applies only to single-segment wires, overlooking stress interactions in multi-branch topologies, while Black’s equation assumes uniform geometry and constant stress, often leading to inaccurate lifetime estimates. These limitations result in overdesign, increased power, and inefficient area use, particularly in advanced nodes [
5]. To address these shortcomings, Korhonen et al. [
6] introduced a physics-based EM model formulated as a diffusion-like Partial Differential Equation (PDE). This approach describes EM dynamics more accurately by coupling atomic flux with hydrostatic stress gradients across wire segments. Building upon Korhonen’s physics-based formulation, several EM stress analysis techniques have emerged, which can be broadly categorized into analytical and numerical approaches. Analytical methods, such as [
7,
8], preserve the continuous nature of both space and time, and offer closed-form solutions suitable for large-scale interconnects. However, their practical applicability is often limited by the computational overhead of truncating infinite series expansions.
On the other hand, numerical methods, such as [
9,
10,
11], are well-established due to their simplicity but tend to be computationally prohibitive for EM analysis, particularly in large-scale interconnect structures. This is primarily because their physics-based EM models, after spatial discretization, lead to large-scale descriptor systems that are computationally costly for transient analysis. As a result, the application of efficient model order reduction (MOR) techniques has become essential for enabling scalable EM analysis [
12]. MOR approaches are commonly categorized into two main classes. The first class includes system-theoretic methods such as Balanced Truncation (BT) [
13], which provide ROMs with guaranteed approximation error bounds based on Hankel singular values. However, classical BT requires the solution of computationally demanding Lyapunov matrix equations that are infeasible for dense matrices, especially when preserving sparsity is critical. The second class includes moment-matching (MM) techniques, which have gained popularity due to their high computational efficiency [
14]. MM-based ROMs are constructed by matching a set of moments of the original system’s transfer function. However, ROM quality depends entirely on the employed Krylov subspace.
The majority of MM techniques utilize the standard or rational Krylov subspace to approximate the original model. Rational Krylov methods [
15,
16] offer improved convergence and preserve matrix sparsity, but they require careful selection of expansion points and do not provide guaranteed error bounds, which can compromise accuracy or stability. On the other hand, methods based on the standard Krylov subspace [
9,
14,
17] are easy to implement and effective in many applications. Nonetheless, they suffer from a high-frequency bias and insufficient resolution of low-frequency behavior, which is especially problematic in EM analysis. Recent studies, such as [
18], have shown that the extended Krylov subspace (EKS) can effectively capture both low- and high-frequency dynamics without the need for expansion point selection. However, its symmetric structure leads to computational inefficiencies, particularly due to the dense linear system solves and matrix–vector operations required at each iteration, thereby limiting the scalability of this subspace for large-scale EM analysis.
In this paper, we propose an MM numerical method based on the asymmetric extended Krylov subspace (AEKS) for efficient EM stress analysis of general interconnect structures. The main contributions of this paper are summarized as follows: Firstly, our method leverages the AEKS, a sparsity-aware projection strategy that adaptively enriches the subspace in the direction of the sparser matrix. This enables the proposed approach to retain the approximation quality of EKS while significantly reducing the computational overhead, making it an efficient and scalable approach for EM stress evaluation in large-scale models. Secondly, we introduce a parallelization strategy for computing the reduced-order response of large-scale EM systems, enabling scalable Krylov subspace construction and transfer function evaluation with the reuse of sparse matrix factorizations. Finally, the proposed method has been extensively validated across frequency-domain and transient EM simulations using large-scale industrial power grid benchmarks. In frequency-domain evaluations, our method consistently outperforms conventional MM approaches, closely approximating the original model response across the spectrum. In transient EM analysis, it achieves high-accuracy results, maintaining relative errors below 0.5% compared to COMSOL v5.5. Moreover, it delivers substantial runtime improvements of up to over original model simulations and over COMSOL, while demonstrating robustness under time-varying current density inputs.
The remainder of this article is organized as follows:
Section 2 presents the physical and mathematical modeling of EM-induced stress, while
Section 3 demonstrates the problem formulation of the EM analysis.
Section 4 provides the theoretical background of MOR for EM analysis, focusing on the use of MM techniques to reduce the size of descriptor systems derived from interconnect structures.
Section 5 discusses the limitations of conventional MM and EKS methods in the context of large-scale EM models.
Section 6 and
Section 7 introduce our main contributions, including the proposed AEKS method, and the parallel Krylov reduction for EM systems.
Section 8 presents a comprehensive experimental evaluation of the proposed method using IBM benchmarks, while conclusions are drawn in
Section 9.
2. Electromigration Mechanism
A representative cross-section of a Cu Dual Damascene (DD) interconnect is shown in
Figure 1, capturing the essential mechanisms involved in EM. When an electric field is applied, the resulting current creates a stream of electrons that transfers momentum to the metal atoms through collisions, gradually displacing them in the direction of the electron flow [
19]. Since the wire is physically constrained, atoms cannot exit the structure and instead begin to accumulate near the anode and deplete the cathode region. The resulting imbalance in mass transport leads to the development of a compressive stress at the anode and tensile stress at the cathode, which in turn induces a counteracting (back-stress) force. This back-stress drives atoms in the opposite direction, establishing a competing diffusion flow that partially offsets the EM-driven movement.
In advanced Cu DD interconnects that include capping and barrier layers, EM-induced metal transport is confined within individual layers. This property facilitates a layer-by-layer modeling approach, which is particularly useful for analyzing large-scale structures like power grids [
20]. Such networks can be partitioned into multi-segment interconnect trees, composed of wire segments and vias interconnected at junctions, each potentially carrying different current densities [
21]. A typical structure of a multi-segment interconnect tree is shown in
Figure 2.
The most established model to describe the temporal evolution of EM-induced stress on a specific wire segment is Korhonen’s PDE [
6]. This model relates the stress
of the segment to the corresponding distance from the cathode
x as follows:
where
=
and
=
. Here,
j is the current density through the wire segment,
is the effective charge number,
e is the electron charge,
is wire resistivity,
is the atomic volume for the metal,
is the bulk modulus of the material,
is Boltzmann’s constant,
T is temperature,
t is time, and
=
is the diffusion coefficient with
being the diffusivity constant and
the activation energy.
To extend Korhonen’s model for multi-segment interconnect trees, a set of appropriate boundary conditions (BCs) [
22] should be applied to Equation (
1). Assuming that the segments
=
are connected to a point
with degree
, the following BCs must hold at any time instance:
BC 1. Stress must be continuous at any intermediate point
:
BC 2. The total flux entering any point
must be zero:
where
is the diffusivity of segment
,
is the current density at segment
, and
is the segment width.
BC 3. The total mass across the entire interconnect must be conserved. If
denotes the set of all segments
, then the following holds:
3. Problem Formulation
In EM analysis, each segment of the interconnect tree is considered to carry a constant current density
j [
22]. As a result, the Korhonen’s Equation (
1) for each segment takes the following form:
Note that the assumption of constant current density
j is made here solely to simplify the derivation of the Korhonen equation. This assumption will be relaxed in subsequent formulations, where
is allowed to vary with time, enabling dynamic EM stress analysis.
This equation can be uniformly discretized by dividing each segment of the interconnect tree into points of equal length
, using the finite difference method (FDM). By applying a finite difference approximation of the spatial derivative in Equation (
5), for each discretized point
i of the interconnect tree, we obtain the following:
i.e.,
where
w is the width of the wire segment that point
i belongs to, and
is the total area of the segments connected to the point
i (see
Figure 3). The formula of
is described as follows:
To enforce the global mass conservation condition described in Equation (
4) within the discretized model, the local area term
is used to approximate the contribution of each interconnect segment around point
i during numerical integration. For simplicity of visualization and discretization, we assume that thickness is negligible compared to the other two dimensions, effectively treating the interconnect as a two-dimensional structure. However, the same methodology remains applicable in three-dimensional settings by incorporating thickness into the local volume term.
After applying FDM on the
m-segment interconnect tree of
Figure 2, the resulting
n discretized points may be located at five different locations, as shown in
Figure 3. Considering the boundary conditions of Equations (
2)–(
4), we can rewrite Equation (
7) for the
n discretized points into the Ordinary Differential Equation (ODE) system of Equation (
9). More specifically, Equation (
9) depicts the stamps of the two boundary points
and
, any point
at the middle of a segment, any intermediate junction point
, and any via junction point
, with
. As a result, we can write the ODE system for EM stress evolution as the following linear time-invariant (LTI) system:
4. Background of MOR by Moment-Matching
Building on the formulation in Equation (
10), the application of MOR techniques requires the system to be expressed in a state-space form suitable for reduction. For an interconnect tree comprising
m segments and discretized into
n points, the resulting system can be formulated as an LTI model:
where
is the stress diffusion matrix,
is a diagonal matrix,
denotes the vector of unknown stress values at all discretization points (constituting internal states of the system),
represents the vector of input excitations from the current densities,
is the input-to-state connectivity matrix,
is the state-to-output connectivity matrix, and
is the output vector representing the observed stress values. Moreover, by denoting the model
order as
, Equation (
11) can be interpreted as a generalized state-space representation, commonly referred to as the
descriptor form.
The objective of MOR is to produce a reduced-order model:
with
,
,
, where the order of the reduced model is
and the output error
is small. An equivalent metric of accuracy in the frequency domain (via Plancherel’s theorem [
23]) is the distance
, where
are the transfer functions of the original model and the ROM, and
is the induced
matrix norm (or the
norm of a rational transfer function).
The most important and successful MOR methods for linear systems are based on MM. They are very efficient in circuit simulation problems and are formulated in a way that has a direct application to the linear model of (
11).
By applying the Laplace transform to (
11), we obtain the
s domain equations as follows:
Assuming that
and that a unit impulse excitation is applied (i.e.,
), the system’s impulse response can be extracted. This assumption is standard in MM MOR and serves as a mathematical simplification to derive the moments of the transfer function, rather than a physical input used in simulation. Accordingly, the system can be written as follows:
By expanding the Taylor series of
around zero, we derive the following equation:
The transfer function of (
11) is a function of
s and can be expanded into a moment expansion around
as follows:
where
,
,
,
, … are the moments of the transfer function. In general,
corresponds to the steady-state (DC) solution of the underlying linear system. It represents the system response under constant input conditions, where all dynamic effects are neglected. Depending on the physical context, this may correspond to the zero-frequency response, the static stress distribution, or the equilibrium state of the system.
Moreover,
represents the first-order dynamic behavior of the system around steady-state conditions. In physical terms, it captures the main time constant controlling how the system responds dynamically to external inputs. In general,
is related to the system matrices as follows:
The goal of MM reduction techniques is the derivation of a ROM, where some moments
of the reduced-order transfer function
match some moments of the original transfer function
.
Let us now denote the two projection matrices onto a lower dimensional subspace as
. These matrices can be derived from the associated moments using one or more expansion points. As a result, if we assume that
, then matrices
and
are defined as follows:
The computed ROM matches the first
moments and is obtained by the following matrices:
This reduced model provides a good approximation around the DC point. Finally, in case we employ a one-sided Krylov method, which is usually the case, matrix
can be set equal to
, an equality that also holds for symmetric systems.
5. Specification of EKS and Its Application to MM
The essence of MM methods is to iteratively compute a projection subspace, and then project the original system into this subspace in order to obtain the ROM of (
12). The dimension of the projection subspace is increased in every iteration, until an “a priori” number of the moments is matched. More specifically, if
r is the desired order for the reduced system and
is the number of moments (where
p denotes the total number of input/output ports), then
is a projection matrix whose columns span the
r-dimensional Krylov subspace:
where
Then, the ROM is obtained through the following matrix transformations:
with
,
,
. Although efficient, this construction captures high-frequency dynamics, which is less effective in applications such as EM stress analysis, where long-term stress evolution is governed by low-frequency behavior.
The effectiveness of the projection process can be improved by expanding around more points than
, leading to rational Krylov subspaces with targeted expansion points [
15,
16]. However, the selection of expansion points is nontrivial and problem-dependent. To avoid this ambiguity, we consider the EKS, which combines both standard and inverse Krylov subspaces, thereby expanding the subspace around both
and
. Formally, the EKS is defined as follows:
This construction achieves spectral enrichment across a broad frequency range without requiring the manual selection of expansion points. It has been successfully applied in matrix function approximation [
24], as well as in Lyapunov and Sylvester equation solvers [
25,
26], demonstrating significant improvements in the performance of Krylov subspace methods. Unlike weighted approaches such as [
27], EKS avoids ambiguity in the choice of expansion points. An earlier version of this EKS formulation appeared in [
28], where the method was initially introduced.
In a later work [
18], we applied the EKS framework in the context of EM analysis to reduce the computational cost of matrix exponential evaluations during time-domain stress simulations. This earlier application demonstrated the capability of EKS to capture slow dynamics and long-term stress evolution with improved numerical stability. Nevertheless, the symmetric expansion of EKS requires both forward and inverse matrix operations at every iteration, which becomes computationally expensive for large-scale EM systems characterized by structurally unbalanced matrices (e.g., one sparse and one dense matrix).
6. Proposed Asymmetric Krylov-Based Method
To address the limitation of symmetric subspace construction in traditional EKS methods, we propose AEKS as an efficient and scalable MOR strategy tailored for EM stress analysis in large-scale interconnect structures. Unlike EKS, which symmetrically alternates between forward and inverse matrix operations, AEKS introduces a sparsity-aware expansion strategy that prioritizes computational efficiency without compromising spectral enrichment. In practical implementations, the inputs to construct the EKS are not actually the matrices
and
but the original system matrices
and
, which are very sparse in typical EM problems. This is because the generally dense inverse matrices
and
are only needed for matrix–vector products during basis construction. These products can be implemented as sparse linear solves (
and
) by employing any sparse direct [
29] or iterative [
30] algorithm. However, in typical EM problems, one of matrices
or
is considerably sparser than the other. In that case, the solves with
are much more efficient than the solves with
, and it can be very computationally beneficial to deviate from symmetry in the construction of the EKS (
24).
Following this principle, our method begins by computing the sparsity (i.e., number of non-zeros) of the matrices
and
, and expands predominantly in the direction (
or
) that will generate more sparse solves, by adding one moment block corresponding to the denser solve for every
m moment blocks of the sparser solve. For example, in the usual case where
is sparser than
, and for
, the proposed AEKS method will generate one moment block of
after three moment blocks of
and will create the following
r-dimensional subspace:
The AEKS procedure is visually represented in
Figure 4 (compared to the symmetric EKS), and the proposed AEKS construction method is presented in Algorithm 1.
In steps 1–7, the algorithm begins by checking whether the number of non-zero entries in matrix
is smaller than that in matrix
. Based on this, it constructs the initial orthonormal basis and sets the appropriate dense and sparse matrix operators accordingly. During each iteration (steps 9–19), the algorithm applies the appropriate operator (either
Dense_Op or
Sparse_Op) to compute a new candidate block vector
, orthogonalizes it against the existing basis vectors using Algorithm 2 (step 15) [
31], normalizes it through QR factorization, and then appends the result to the evolving subspace (step 17). Finally, in step 20, the basis is truncated to the desired reduced order
r.
Algorithm 1: AEKS construction method via MM. |
![Electronics 14 02749 i001]() |
Algorithm 2: Orthogonalization with respect to another matrix. |
![Electronics 14 02749 i002]() |
7. Efficient Computation of the Reduced-Order Response and Transfer Function
To efficiently compute the transfer function and the output response of a multi-input multi-output (MIMO) descriptor model like (
11), we can consider the following single-input multi-output (SIMO) subsystems:
where
is the
i-th column of matrix
, and
is the
i-th input (
). From these SIMO subsystems, the output of the MIMO descriptor system is
, where
This effectively represents the superposition property of LTI systems.
The above decomposition can be employed for the parallel computation of the reduced-order MIMO transfer function. In particular, for each SIMO subsystem of (
26), a projection matrix
can be computed, whose columns span the
r-dimensional EKS (or AEKS):
where
. The computation of the projection matrices (by Algorithm 1) is independent of one another and can be performed in parallel. The reduced-order SISO subsystems can then be computed in parallel as follows:
The
i-th column of the MIMO reduced-order transfer function would then be computed as follows:
and the whole MIMO ROM transfer function can be derived as the concatenation:
It is worth noting that this decomposition is better combined with direct solvers since the computation of the Krylov subspace requires sparse solves with and , as previously mentioned. Using direct solvers, one can precompute the proper decomposition of the above matrices and then reuse them in each parallel computation.
8. Experimental Results
In this section, we provide a detailed experimental evaluation of the proposed AEKS-MM methodology. The experiments are organized into three parts. First, we investigate the numerical performance and quality of AEKS-MM against conventional Krylov subspace approaches, including MM and EKS-MM methods. Our evaluation focuses on large-scale industrial IBM power grids, examining the accuracy, computational efficiency, and frequency-domain behavior of the generated ROMs. Second, our approach is compared to COMSOL v5.5, a FEM-based solver, for an artificial multi-segment interconnect tree. Finally, we perform EM stress analysis on representative large-scale power grids to further evaluate the applicability of our proposed method. For time-domain simulations, an implicit Backward Euler (BE) method was employed for solving the ROMs derived from AEKS-MM. All power grid structures analyzed in this work are assumed to be made of Cu DD interconnects. This assumption allows each layer and interconnect structure to be analyzed independently under the local stress models. The proposed AEKS-MM methodology was developed using MATLAB R2021a, utilizing its built-in numerical toolset for matrix operations, linear solves, and sparse matrix handling. All experiments were conducted on a high-performance Windows system equipped with a 4.7 GHz Intel Core i9-9900k CPU, 32 GB RAM, and 16 threads, ensuring robust computational capacity for large-scale transient simulations.
COMSOL v5.5 was used with the Coefficient Form PDE interface, which allows direct implementation of Korhonen’s PDE. The geometry was modeled as a 2D multi-segment geometry, and material parameters were assigned per segment. Boundary conditions were defined using the Flux/Source formulation, enabling accurate specification of both zero-flux (Neumann) and non-zero-flux conditions at segment boundaries. The mesh resolution was selected to match the spatial accuracy of the FDM discretization used in our method. In particular, quad meshing was used as the default strategy to preserve element alignment across wire segments, while triangular meshing was enabled only in the case of quad generation failure. Finally, COMSOL time-domain simulations were performed using an implicit BE solver with absolute and relative tolerances set to and a fixed time step of years matching the AEKS-MM simulation grid.
Prior to the full experimental evaluation, we conducted a sensitivity study to determine the optimal choice of the modulo parameter
m in Algorithm 1, balancing reduction time and accuracy. Four industrial benchmarks were selected, each featuring approximately 1.6 million discretization points and 900 interconnect segments on average, covering a wide range of sparsity ratios
, with
, representative of all IBM cases.
Figure 5 shows the impact of varying
on the relative error of the transfer function and the corresponding reduction time across the four benchmarks. For low sparsity (
), the best trade-off is achieved with
, while for moderate sparsity (
),
performs best. In cases of higher sparsity (
),
provides improved accuracy at a reasonable cost. These trends provide practical guidance for selecting the parameter
m according to the sparsity pattern of the problem.
8.1. Accuracy and Performance Evaluation of ROMs
For the experimental evaluation of the proposed methodology, we used the available IBM power grid benchmarks [
32]. Each power grid structure was processed using a BFS algorithm to extract representative multi-segment interconnect lines suitable for EM stress analysis. To compute the DC steady-state currents required for EM stress analysis, we performed DC simulations using the Ngspice [
33] circuit simulator. From each benchmark, we selected the interconnect line containing the highest number of discretization points, as these represent the most challenging cases for transient EM stress evolution. The corresponding ODE systems derived from the discretization were subsequently used to construct ROMs based on the different Krylov subspace methods evaluated. EKS-MM and AEKS-MM were implemented following the procedures described in
Section 5 and
Section 7, respectively, and were compared against a standard MM method using the superposition property. For a fair comparison, an appropriate number of matching moments were selected so that the resulting ROM orders were identical across EKS-MM, AEKS-MM, and MM.
Our results are reported in the remaining columns of
Table 1. The characteristics of the selected lines are shown in the first two columns of table, where
Order refers to the number of discretization nodes, and
#ports indicates the total number of junctions and end-points present in the interconnect line. Also,
ROM Order refers to the size of
and
ROM matrices,
Max Error refers to the error between the infinity norms of the transfer functions, i.e.,
, and
Time refers to the computational time (in seconds) needed to generate each submatrix
of (
30), while
Error Red. refers to the error reduction percentage achieved by EKS-MM and AEKS-MM over MM. It can be clearly verified that, compared to MM for a similar ROM order, EKS-MM and AEKS-MM produce ROMs with significantly smaller errors. As depicted in
Table 1, the
Error Red. ranges from 36.98% to 80.12% for EKS-MM, while for AEKS-MM, it ranges from 19.64% to 66.68%. The differences in error reduction percentages across benchmarks stem primarily from structural and topological variations among the selected interconnects, such as length, number of segments, and current densities. Additionally, the observed error reduction depends on the specific port locations used to define the transfer function, since the frequency content and dynamic behavior of the response can vary significantly between ports. These combined effects explain the variations in error reduction percentages across benchmarks. The execution time of EKS-MM is negligibly larger than standard MM for each moment computation, due to the expansion at two points; however, the efficient implementation can effectively mask this overhead to a substantial extent and make the procedure applicable to large-scale interconnect lines. On the other hand, AEKS-MM leverages the sparse solve operation to dramatically reduce the overall runtime, while maintaining an acceptable error margin compared to EKS-MM, with a maximum error difference of only 0.4%. More specifically, AEKS-MM achieves significant average runtime speedups of up to
over EKS-MM and
over MM.
To demonstrate the accuracy of our method, we compare the transfer functions of the original model with those of the ROMs generated by EKS-MM, AEKS-MM, and MM. The corresponding transfer functions for the selected interconnect lines from the bmpg1 and ibmpg4 benchmarks, over the frequency band
Hz, are shown in
Figure 6. Additionally,
Figure 7 presents the transfer functions of the ROMs produced by EKS-MM, AEKS-MM, and MM, along with the corresponding absolute errors with respect to the original model, for the selected line from the ibmpg7 benchmark within the same frequency band.
As can be observed, the responses of the EKS-MM and AEKS-MM ROMs closely follow the original model’s response across the entire frequency range, whereas the MM ROM exhibits noticeable deviations. In particular, MM ROMs fail to capture dips and overshoots that appear at certain frequency bands, leading to significant mismatches with the original response. This behavior highlights the advantage of using two-sided projection methods such as AEKS-MM to accurately capture broadband dynamics.
8.2. EM Stress Evaluation on a Seven-Segment Tree
To validate the accuracy of our approach on complex geometries, an artificial seven-segment interconnect tree with four T-junctions was constructed, where each segment was assigned a different width and current density. The segment widths are defined as follows:
μm,
μm, and
μm, while the segment lengths are depicted in
Figure 8. The corresponding constant current densities are set as follows:
,
,
, and
.
In addition, we performed transient simulation using the AEKS-MM ROM at
,
, and
years, as shown in
Figure 8a–c, with the colorbars depicting the spatial distribution of the stress buildup across the artificial trees. The tuples next to boundary or junction nodes are the values computed by our method and COMSOL. In
Figure 8d, the relative error along the tree is found to be well below 0.7% with a reduced order
after applying a discretization step
μm. This stress distribution reveals compressive stress near anodes and tensile stress near cathodes, as expected from EM-driven atomic flux. In particular, junction and via junction points exhibit localized stress peaks, indicating potential EM hotspots where material failure is likely to occur.
8.3. Transient EM Analysis on IBM Power Grid Benchmarks
For EM stress assessment on very large-scale power grids, we utilized the interconnect lines provided in
Table 1 to evaluate the scalability and reliability of our proposed methodology. These benchmarks include up to 3.2 million degrees of freedom (DOFs), over 2300 interconnect segments, and as many as 966 ports, representing realistic and highly complex industrial designs. In this setup, the ROMs generated by AEKS-MM were used to perform transient EM analysis over a simulation window of 20 years, with a discretization step of
years using the BE method. The results for the selected interconnect lines from the IBM benchmarks are presented in
Table 2, where we report the runtimes of the original model simulation, the runtimes of the AEKS-MM simulation (including the ROM construction time as listed in
Table 1), and the corresponding runtimes obtained from COMSOL simulation. Furthermore, the table includes the speedups achieved by our methodology with respect to both the original model simulation and COMSOL, along with accuracy metrics, where the COMSOL results are used as the reference solution.
As shown, our methodology achieves an average speedup of compared to simulating the original model and over COMSOL simulations. It is also observed that the ROM construction phase accounts for about 85% of the total simulation time, indicating that the main computational overhead stems from the reduction procedure itself. Although the reduction is performed only once for each interconnect structure during EM analysis, in large-scale power grids containing tens of thousands of interconnects, this overhead could become a bottleneck. This observation highlights the critical importance of selecting an efficient Krylov subspace generation method for MOR techniques applied to EM reliability analysis. Moreover, our methodology achieves a mean relative error of 0.4% compared to COMSOL, while maintaining an average deviation of only 0.12% from the original model. These gains are especially significant given the limitations of COMSOL in large-scale EM analysis. Although COMSOL provides a flexible environment for FEM-based PDE simulation, it solves the full-order system without inherent MOR capabilities. As a result, it suffers from poor scalability and high computational cost when applied to large-scale interconnect structures, making it impractical for fast or repeated EM stress evaluations. Furthermore, COMSOL is constrained by an upper bound of approximately one million DOFs and demands substantial memory resources, often leading to excessive runtimes or simulation failures.
Regarding the spatial accuracy of the EM stress distribution,
Figure 9 presents the hydrostatic stress along the first 27 segments (total length
μm) of the selected interconnect line from ibmpg1 benchmark after 20 years of simulation. The gradual stress buildup and eventual saturation over time indicate that the system approaches a steady-state back-stress equilibrium, consistent with Korhonen’s diffusion model. As can be observed, the stress profiles generated by AEKS-MM are in excellent agreement with the reference COMSOL solution along the entire length of the line. Finally, to assess the capability of our approach in handling time-dependent current density inputs, we applied a periodic piecewise current density input at the left boundary node of the previously selected interconnect line. The applied profile starts at zero and oscillates between
A/m
2 and
A/m
2.
Figure 10 shows both the applied input current density and the corresponding transient EM stress responses obtained by AEKS-MM and COMSOL at the left boundary node over 20 years. The results demonstrate that both methods perfectly match throughout the entire transient simulation, even with time-variant input.
9. Conclusions
In this paper, we proposed a fast numerical approach for transient analysis of EM-induced stress based on the MOR technique at discrete spatial points of general interconnect structures. By leveraging the flexibility of the AEKS-MM framework, the proposed methodology exploits sparse solve operations to dramatically reduce the overall reduction runtime, achieving up to a 7× speedup compared to conventional numerical approaches such as MM and EKS-MM. At the same time, AEKS-MM maintains an acceptable error margin in the transfer function approximation relative to the original model, while significantly improving the accuracy over standard MM techniques. A key aspect of our methodology lies in the careful selection of the Krylov subspace used for model reduction. Since numerical MOR approaches for EM stress analysis critically depend on the underlying subspace construction, optimizing this step is essential for ensuring both high accuracy and computational efficiency. For the EM stress evaluation, the experimental results demonstrated that AEKS-MM ROMs achieve high accuracy compared to full-order simulations, maintaining a relative error of less than 0.5% with respect to COMSOL solutions. Moreover, the proposed approach significantly reduces simulation runtimes, achieving average speedups of relative to COMSOL and compared to full-order transient simulations. Furthermore, AEKS-MM exhibits strong robustness under time-dependent current density inputs, accurately tracking stress evolution even in dynamically changing scenarios. Although this work focused on planar copper-based 2D power grids, the AEKS-MM framework is fully extensible to more advanced interconnect technologies. In particular, it can be applied to multi-layer metal stacks and 3D-ICs with through-silicon vias (TSVs), as EM behavior is governed by similar PDE formulations. In such complex geometries, the use of efficient ROM techniques such as AEKS-MM becomes even more crucial for scalable and reliable EM analysis. Finally, our method is applicable not only to the nucleation phase but also to the growth phase, as the generated ROMs can be utilized to model the stress evolution throughout the entire EM process.