An Improved Self-Adaptive Inertial Projection and Contraction Algorithm for Mixed-Cell-Height Circuit Legalization

Wang, Luxin; Zhou, Chencan; Shen, Qinqin

doi:10.3390/electronics15081720

by Luxin Wang ¹,², Chencan Zhou ¹,³ and Qinqin Shen ¹,³,*

¹ Key Laboratory of Computational Science and Application of Hainan Province, Haikou 570100, China
² School of Information Science and Technology, Nantong University, Nantong 226010, China
³ School of Transportation and Civil Engineering, Nantong University, Nantong 226010, China
* Author to whom correspondence should be addressed.

Electronics 2026, 15(8), 1720; https://doi.org/10.3390/electronics15081720

Submission received: 25 February 2026 / Revised: 9 April 2026 / Accepted: 15 April 2026 / Published: 18 April 2026

(This article belongs to the Special Issue AI-Enhanced Mixed-Signal Simulation and EDA for Integrated Circuit Design Using CMOS Technologies)

Abstract

In advanced technology nodes, mixed-cell-height circuit designs have become increasingly prevalent, posing significant challenges for legalization. We first formulate the legalization as a class of variational inequality (VI) problems defined over convex sets and then employ an existing self-adaptive inertial projection and contraction algorithm (SIPCA) to solve it. Building upon this framework, we further propose an improved self-adaptive inertial projection and contraction algorithm (SIPCA_IP) by incorporating the subgradient extragradient technique to enhance convergence efficiency and numerical stability. The proposed method preserves the advantages of projection and contraction schemes for handling VIs with nonsymmetric positive semidefinite system matrices while demonstrating faster convergence and improved robustness compared with the baseline SIPCA. Moreover, a rigorous convergence analysis is established to provide theoretical guarantees for the effectiveness of the proposed method. Numerical experiments demonstrate that the proposed method effectively addresses the mixed-cell-height legalization problem and provides a rigorous and extensible framework for solving related quadratic optimization problems.

Keywords: mixed-cell-height circuit; legalization; variational inequality; projection and contraction algorithm

1. Introduction

In very-large-scale integrated (VLSI) circuit design, placement is one of the most critical stages in determining overall chip performance, area, and routability [1]. In earlier integrated circuit technologies, standard cells were typically designed with a single-row height due to the relatively low design complexity. With continuous device scaling and the increasing diversity of circuit design objectives, modern standard-cell libraries commonly include cells with heterogeneous heights rather than a uniform row structure. For example, basic logic cells such as inverters and buffers are typically implemented as single-row-height cells, whereas complex functional units including flip-flops, multiplexers, and clock gates are often designed as multi-row-height cells to accommodate larger transistor stacks and more routing resources.

In VLSI design, placement is typically carried out in three sequential phases, namely, global placement, legalization, and detailed placement. During global placement, approximate cell positions are obtained by optimizing wire length and routability while allowing overlaps. In the legalization stage, overlaps are removed and cells are aligned to discrete rows and sites with minimal displacement. The detailed placement stage further refines the layout by locally adjusting cell ordering and spacing. Overall, placement determines the optimal positions and orientations of all standard cells under given constraints. In this paper, we focus on the legalization stage.

Although mixed-cell-height standard cells offer significant benefits in terms of design flexibility and area efficiency, their introduction substantially increases the complexity of legalization. Unlike the single-row-height cases, cells with varying heights span multiple placement rows, introducing intricate geometric constraints and inter-row interactions. Furthermore, heterogeneous cell structures and power-rail compatibility constraints further increase the difficulty of the legalization problem. For more details, see [2,3,4,5] and the references therein.

Heuristic algorithms and analytic algorithms are the two main categories of methods for the mixed-cell-height legalization problem. Among heuristic methods, Abacus [6] and Tetris [7] are two classical algorithms originally developed for single-row-height legalization tasks. Subsequently, improved heuristic algorithms based on these two classical types, such as Eh?Placer [8] and Jezz [9], were proposed. Although classical legalization algorithms perform well in uniform-height cases, they cannot be easily generalized to mixed-height configurations. This is because, in single-row-height cases, cell overlaps can be resolved independently in each row. In contrast, in mixed-cell-height cases, adjusting a cell in one row may introduce new overlaps in other rows. To address these challenges, several enhanced heuristic algorithms have been developed for the mixed-cell-height cases [10,11,12,13]. Since the objective of the legalization problem is to minimize the total displacement, it can be formulated as a network flow, integer linear programming, or quadratic programming (QP) model [1,14,15,16], enabling analytic methods to efficiently obtain feasible solutions.

With proper preprocessing and relaxation, the mixed-cell-height legalization problem can be transformed into a QP problem. Using the Karush–Kuhn–Tucker (KKT) optimality framework [17], the QP problem can be equivalently reformulated as a linear complementarity problem (LCP), denoted as LCP(q, A). Specifically, given $A \in \mathbb{R}^{n \times n}$ and $q \in \mathbb{R}^{n}$, the objective is to find vectors $w, z \in \mathbb{R}^{n}$ such that

w = A z + q \geq 0, \quad z \geq 0, \quad w^{T} z = 0.

(1)
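As a small illustration of conditions (1), the following sketch checks whether a candidate vector solves LCP(q, A) by evaluating the two feasibility conditions and the complementarity gap. The function name `lcp_residuals` and the 2×2 data are hypothetical and are not taken from the paper.

```python
import numpy as np

def lcp_residuals(A, q, z):
    """Return (min of w, min of z, w^T z) for w = A z + q.

    Conditions (1) hold when the first two values are >= 0 and the third is 0.
    """
    w = A @ z + q
    return float(w.min()), float(z.min()), float(w @ z)

# Hypothetical data: A is nonsymmetric, and its symmetric part is 2*I.
A = np.array([[2.0, 1.0], [-1.0, 2.0]])
q = np.array([-2.0, 2.0])
z_star = np.array([1.0, 0.0])  # candidate solution

min_w, min_z, gap = lcp_residuals(A, q, z_star)
```

Here $w = Az^{*} + q = (0, 1)^{T}$, so both vectors are nonnegative and their inner product vanishes.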

The mixed-cell-height legalization problem can be addressed using the modulus-based matrix splitting (MMS) iteration scheme [18], which has been shown to be effective under certain assumptions [1]. Based on this scheme, a robust MMS method and several accelerated variants have been proposed [3,19,20]. In addition, the LCP can be reformulated as an absolute value equation (AVE), enabling the construction of efficient iterative schemes by exploiting the structure of the system matrix together with matrix splitting techniques [21,22]. Building upon the MMS method and the AVE framework, legalization problems with additional technical, regional, and abutment constraints have been extensively studied [4,5,23,24]. However, the classical convergence theory of MMS-type algorithms relies on the system matrix $A$ being symmetric positive definite (PD) or an $H_{+}$-matrix. For mixed-cell-height legalization problems, the resulting system matrix is generally nonsymmetric positive semidefinite (PSD), which does not satisfy these assumptions. Consequently, the theoretical convergence guarantees of existing LCP-based algorithms may not apply when they are used directly.

In fact, the LCP (1) is equivalent to a VI problem defined as follows: for the function $F(z) = A z + q$, find a vector $z^{*}$ in the closed convex set $Ω = \mathbb{R}_{+}^{n}$ such that

z^{*} \geq 0, \quad F(z^{*}) \geq 0, \quad \langle z^{*}, F(z^{*}) \rangle = 0,

i.e., $\langle z - z^{*}, F(z^{*}) \rangle \geq 0$ for all $z \in Ω$. Compared with the LCP formulation, the VI framework provides a more flexible theoretical setting for algorithm design. Notably, the existence of a solution to the variational inequality is guaranteed under standard monotonicity and convexity assumptions, while uniqueness holds under strong monotonicity conditions [25]. Furthermore, a variety of effective iterative algorithms have been developed for the cases in which F is Lipschitz continuous and either strongly monotone or monotone (corresponding to A being positive definite or positive semidefinite, respectively). It has been widely observed that projection-based algorithms are particularly efficient when the closed convex set is fairly simple and the projection is relatively easy to compute. Representative projection-based algorithms include the extragradient method [26], the projection contraction method [27], and the prediction–correction method [28]. However, the convergence rate and practical performance of projection-based methods are highly sensitive to the choice of step size. Fixed step-size strategies often fail to balance convergence speed and stability, especially when dealing with ill-conditioned or nonsymmetric systems.
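For $Ω = \mathbb{R}_{+}^{n}$ the projection is a componentwise maximum with zero, which is why projection-based methods are inexpensive in this setting. A standard characterization is that $z^{*}$ solves the VI exactly when $z^{*} = P_{Ω}(z^{*} - τ F(z^{*}))$ for some (equivalently, any) $τ > 0$, so the norm of $z - P_{Ω}(z - τ F(z))$ can serve as an error measure. The sketch below illustrates this; the helper names and the data are hypothetical.

```python
import numpy as np

def project_nonneg(z):
    """Orthogonal projection onto the nonnegative orthant."""
    return np.maximum(z, 0.0)

def natural_residual(F, z, tau=1.0):
    """Norm of z - P(z - tau*F(z)); it is zero exactly at VI solutions."""
    return float(np.linalg.norm(z - project_nonneg(z - tau * F(z))))

# Hypothetical affine map F(z) = A z + q with a nonsymmetric A.
A = np.array([[2.0, 1.0], [-1.0, 2.0]])
q = np.array([-2.0, 2.0])
F = lambda z: A @ z + q

res_at_solution = natural_residual(F, np.array([1.0, 0.0]))
res_at_origin = natural_residual(F, np.zeros(2))
```

In this example the residual vanishes at $(1, 0)^{T}$ and equals 2 at the origin.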

In this paper, the mixed-cell-height legalization problem is reformulated as a VI problem. Under this formulation, the feasible region can be characterized as a nonempty closed convex set, which enables the construction of projection-type algorithms under mild assumptions on the associated operator. To efficiently solve the resulting VI problem, an existing self-adaptive inertial projection and contraction algorithm (SIPCA) is first adopted as a baseline. Building upon this framework, an improved SIPCA (SIPCA_IP) is developed by incorporating inertial acceleration and a two-step strategy based on the subgradient extragradient technique. The convergence properties of the proposed method are theoretically analyzed, and the adaptive scheme enhances convergence stability and computational efficiency. Furthermore, a lightweight Tetris-like refinement step is employed to eliminate residual overlaps after legalization. The proposed method demonstrates strong performance in solving large-scale mixed-cell-height legalization problems. The main contributions of this paper can be summarized as follows:

First, the mixed-cell-height legalization problem is reformulated as a VI, enabling efficient treatment of the LCP with a nonsymmetric positive semidefinite system matrix. The VI framework provides a flexible theoretical foundation for subsequent algorithm design.
Second, an improved algorithm, termed SIPCA_IP, is developed by incorporating an adaptive step-size scheme and a two-step iteration strategy, thereby enhancing both convergence stability and computational efficiency. Moreover, a rigorous convergence analysis is provided to establish the theoretical guarantees of the proposed method.
Third, a lightweight Tetris-like refinement strategy, adopted from existing legalization techniques, is incorporated as a postprocessing step to eliminate residual overlaps while preserving displacement quality.
Finally, numerical experiments demonstrate that SIPCA_IP outperforms the baseline SIPCA in terms of convergence speed and iterations. Moreover, comparisons with three state-of-the-art methods in terms of overlap and total displacement further confirm its superior legalization accuracy and significant improvements in placement quality.

The remainder of this paper is organized as follows: In Section 2, the mathematical model is established and subsequently reformulated as a VI. Section 3 details the baseline SIPCA and SIPCA_IP, along with an overview of the proposed framework. The experimental settings and corresponding results on seven benchmark cases are detailed in Section 4. Section 5 concludes the paper and discusses future research.

2. Problem Formulation

In this section, we formulate the mixed-cell-height legalization problem. After introducing the basic notation and constraints, the problem is first formulated as a convex QP. By exploiting the KKT conditions, the QP is further reformulated as an LCP, which is subsequently extended to a VI framework.

2.1. Modeling of Mixed-Cell-Height Legalization

Consider a mixed-cell-height legalization problem with n standard cells $C = \{c_{1}, \dots, c_{n}\}$. Let $h_{i}$ and $d_{i}$ denote the height and width of $c_{i}$, and let $h_{\mathrm{row}}$ denote the row height. The pair $(x_{i}^{(0)}, y_{i}^{(0)})$, $1 \leq i \leq n$, is the initial bottom-left corner coordinate of $c_{i}$. Legalization assigns each cell $c_{i}$ to a coordinate $(x_{i}, y_{i})$ while minimizing the total cell displacement. Figure 1 illustrates a schematic of a mixed-cell-height placement with two double-row-height cells $c_{1}$ and $c_{3}$, along with one single-row-height cell $c_{2}$, where power (VDD) and ground (VSS) lines are arranged alternately between rows. Figure 1a gives the cells’ initial positions. To ensure consistent processing, each multi-row-height cell $c_{i}$ is partitioned into $k = h_{i} / h_{\mathrm{row}}$ single-row-height subcells $c_{i1}, c_{i2}, \dots, c_{ik}$. By neglecting vertical displacement and requiring all cells to be aligned to rows consistent with their power rails, the mixed-cell-height legalization problem can be formulated as the following minimization problem:

\begin{matrix} \min & \frac{1}{2} \sum_{i = 1}^{n} (x_{i} - x_{i}^{(0)})^{2} \\ \text{s.t.} & (1) \ x_{j} - x_{i} \geq d_{i}, \ \text{if } y_{i} = y_{j} \text{ and } x_{j} \geq x_{i}, \\ & (2) \ x_{i} \geq 0. \end{matrix}

(2)

Then, Model (2) can be equivalently expressed in the following form [1]:

\begin{matrix} \min & \frac{1}{2} x^{T} Q x + c^{T} x \\ \text{s.t.} & W x \geq d, \\ & E x = 0, \\ & x \geq 0, \end{matrix}

(3)

where $x \in \mathbb{R}^{n}$ and the superscript $T$ denotes the transpose. $Q \in \mathbb{R}^{n \times n}$ is the identity matrix, and $c \in \mathbb{R}^{n}$ is a vector whose $i$th component is $c_{i} = - x_{i}^{(0)}$. $W \in \mathbb{R}^{m \times n}$ is the constraint matrix used to prevent overlaps between neighboring cells; each row contains only two nonzero entries, $-1$ and $1$. Here, m and n indicate the number of constraints and variables, respectively. $E \in \mathbb{R}^{r \times n}$ defines equality relations to ensure that the subcells of each multi-row-height cell share the same $x$-coordinate. For the $i$th adjacent cell pair, the $i$th component of $d$ is the width of the left cell. Figure 1b illustrates the coordinates of all the cells after partitioning. By ordering the adjacency relationships among cells in a left-to-right and bottom-to-top manner, we obtain

\begin{cases} x_{21} - x_{11} \geq d_{1}, \\ x_{31} - x_{21} \geq d_{2}, \\ x_{32} - x_{12} \geq d_{1}, \\ x_{41} - x_{32} \geq d_{3}, \end{cases}

and

\begin{cases} - x_{11} + x_{12} = 0, \\ - x_{31} + x_{32} = 0. \end{cases}

Then, the constraint matrices W and E along with the vector d become

W = \begin{bmatrix} -1 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & -1 & 1 & 0 & 0 \\ 0 & -1 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & -1 & 1 \end{bmatrix}, \quad E = \begin{bmatrix} -1 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1 & 1 & 0 \end{bmatrix}, \quad d = \begin{bmatrix} d_{1} \\ d_{2} \\ d_{1} \\ d_{3} \end{bmatrix},

with $c = [- x_{1}^{(0)}, - x_{1}^{(0)}, - x_{2}^{(0)}, - x_{3}^{(0)}, - x_{3}^{(0)}, - x_{4}^{(0)}]^{T}$ and $x = [x_{11}, x_{12}, x_{21}, x_{31}, x_{32}, x_{41}]^{T}$. By introducing a Lagrange multiplier $λ$ for the equality constraint, Problem (3) can be expressed as the following QP problem:

\begin{matrix} \min & \frac{1}{2} x^{T} (Q + λ E^{T} E) x + c^{T} x \\ \text{s.t.} & W x \geq d, \\ & x \geq 0. \end{matrix}

(4)

Figure 1. A schematic diagram of a mixed-cell-height placement.
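The construction of W, E, and d from the ordered adjacency relations can be sketched as follows. This is an illustrative helper under stated assumptions, not the authors' implementation: the function `build_constraints`, its argument layout, and the numeric widths are hypothetical. Variables are indexed in the order $x = [x_{11}, x_{12}, x_{21}, x_{31}, x_{32}, x_{41}]^{T}$ used in the example above.

```python
import numpy as np

def build_constraints(rows, width, groups, n):
    """Assemble W, d and E.

    rows   : list of lists; each inner list holds variable indices of one
             placement row, ordered left to right.
    width  : width[j] is the width of the cell that owns variable j.
    groups : (lower, upper) index pairs of subcells of one double-height cell.
    n      : number of variables.
    """
    W_rows, d = [], []
    for row in rows:
        for left, right in zip(row, row[1:]):
            r = np.zeros(n)
            r[left], r[right] = -1.0, 1.0      # x_right - x_left >= width_left
            W_rows.append(r)
            d.append(width[left])
    E = np.zeros((len(groups), n))
    for i, (lower, upper) in enumerate(groups):
        E[i, lower], E[i, upper] = -1.0, 1.0   # -x_lower + x_upper = 0
    return np.array(W_rows), np.array(d), E

# Figure 1 example with hypothetical widths d1=3, d2=2, d3=4, d4=1.
width = [3.0, 3.0, 2.0, 4.0, 4.0, 1.0]
rows = [[0, 2, 3], [1, 4, 5]]   # bottom row: c11, c21, c31; top row: c12, c32, c41
groups = [(0, 1), (3, 4)]       # subcells of c1 and of c3
W, d, E = build_constraints(rows, width, groups, n=6)
```

With these inputs the helper reproduces the matrices W and E of the example, and `d` becomes `[3, 2, 3, 4]`, i.e., $[d_{1}, d_{2}, d_{1}, d_{3}]^{T}$.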

Remark 1.

Evidently, for mixed-height cells consisting of both single- and double-height cells, the constraint matrix E satisfies the following rules:

1. Each row contains exactly two nonzero elements, namely, $-1$ and $1$, and all other elements are $0$. Moreover, the column index of $1$ is exactly one greater than that of $-1$.
2. Each column contains at most one nonzero element, either $1$ or $-1$.
3. The number of rows of matrix E equals the number of double-height cells.

Then, by direct computation, the matrix $E^{T} E$ is block-diagonal, where each diagonal block is either a zero matrix or a $2 \times 2$ matrix of the form

\begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}.

Further, the matrix $Q + λ E^{T} E$ is block-diagonal, where each diagonal block is either an identity matrix or a $2 \times 2$ matrix of the form

\begin{bmatrix} 1 + λ & -λ \\ -λ & 1 + λ \end{bmatrix}.
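The block structure can be checked by hand for a single double-height cell. As a worked example, restrict one row of E to the two adjacent columns it touches, so that it reads $e = [-1, \ 1]$, and recall that $Q = I$:

```latex
e^{T} e
= \begin{bmatrix} -1 \\ 1 \end{bmatrix}
  \begin{bmatrix} -1 & 1 \end{bmatrix}
= \begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix},
\qquad
I_{2} + \lambda\, e^{T} e
= \begin{bmatrix} 1 + \lambda & -\lambda \\ -\lambda & 1 + \lambda \end{bmatrix}.
```

The resulting block is symmetric with eigenvalues $1$ (eigenvector $(1, 1)^{T}$) and $1 + 2λ$ (eigenvector $(1, -1)^{T}$), so it is positive definite for any $λ \geq 0$.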

2.2. From QP to an LCP and VI

Denote $B = Q + λ E^{T} E$. Based on the KKT conditions, $x \in \mathbb{R}^{n}$ is a global optimal solution of (4) if there exist vectors $u \in \mathbb{R}^{n}$ and $v, s \in \mathbb{R}^{m}$ satisfying

\begin{cases} u = c + B x - W^{T} s \geq 0, \\ v = - d + W x \geq 0, \\ u^{T} x = 0, \quad v^{T} s = 0, \\ x, s \geq 0. \end{cases}

(5)

Consequently, Problem (4) can be reformulated as a linear complementarity problem involving a nonsymmetric positive semidefinite system matrix:

w = A z + q \geq 0, \quad z \geq 0, \quad w^{T} z = 0,

(6)

where

w = \begin{bmatrix} u \\ v \end{bmatrix}, \quad A = \begin{bmatrix} B & - W^{T} \\ W & 0 \end{bmatrix}, \quad z = \begin{bmatrix} x \\ s \end{bmatrix}, \quad q = \begin{bmatrix} c \\ - d \end{bmatrix}.
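To make the structure of (6) concrete, the sketch below assembles A and q for the Figure 1 example and checks numerically that A is nonsymmetric while its symmetric part is positive semidefinite. The value of λ, the initial coordinates, and the widths are hypothetical. The check reflects the identity $z^{T} A z = x^{T} B x$: the off-diagonal blocks $-W^{T}$ and $W$ cancel in the quadratic form.

```python
import numpy as np

lam = 10.0  # hypothetical value of the multiplier λ
W = np.array([[-1, 0, 1, 0, 0, 0],
              [0, 0, -1, 1, 0, 0],
              [0, -1, 0, 0, 1, 0],
              [0, 0, 0, 0, -1, 1]], dtype=float)
E = np.array([[-1, 1, 0, 0, 0, 0],
              [0, 0, 0, -1, 1, 0]], dtype=float)
x0 = np.array([0.0, 0.0, 2.5, 4.0, 4.0, 7.0])  # hypothetical initial x-coordinates
d = np.array([3.0, 2.0, 3.0, 4.0])             # hypothetical widths d1, d2, d1, d3

m, n = W.shape
B = np.eye(n) + lam * (E.T @ E)                # B = Q + λ E^T E with Q = I
A = np.block([[B, -W.T], [W, np.zeros((m, m))]])
q = np.concatenate([-x0, -d])                  # q = [c; -d] with c = -x0

is_symmetric = np.allclose(A, A.T)
min_eig_sym_part = np.linalg.eigvalsh(0.5 * (A + A.T)).min()
```

Here `is_symmetric` is `False`, and the smallest eigenvalue of the symmetric part is zero up to rounding, because the symmetric part is block-diagonal with blocks B and 0.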

Lemma 1 ([29]). The LCP $z \geq 0$, $A z + q \geq 0$, $z^{T} (A z + q) = 0$ is a special linear VI: find $z^{*} \in Ω = \mathbb{R}_{+}^{n}$ satisfying

(z - z^{*})^{T} (A z^{*} + q) \geq 0, \quad \forall z \in Ω,

where $A \in \mathbb{R}^{n \times n}$ is positive semidefinite, which may be nonsymmetric, and $q \in \mathbb{R}^{n}$.

Clearly, LCP (6) can equivalently be expressed as the following VI problem: find $z^{*} \in Ω = \mathbb{R}_{+}^{n + m}$ such that

\langle F(z^{*}), z - z^{*} \rangle \geq 0, \quad \forall z \in Ω,

(7)

where $F(z) = A z + q$.

To further clarify the motivation for adopting the VI framework, a comparison between the VI and LCP formulations is summarized in Table 1. As shown in Table 1, the VI formulation relaxes the requirement that A be symmetric positive definite and only requires the operator F to be monotone. This generalization provides a broader theoretical foundation for developing iterative algorithms applicable to the LCP arising from mixed-cell-height legalization.

3. Self-Adaptive Inertial Projection and Contraction Algorithm and Its Improvement

3.1. Baseline Self-Adaptive Inertial Projection and Contraction Algorithm (SIPCA)

A classical iterative algorithm for the VI problem [29] is given as follows:

z^{k+1} = P_{Ω}(z^{k} - τ F(z^{k})),

where $P_{Ω}(\cdot)$ stands for the orthogonal projection onto $Ω$, with $τ > 0$ being a fixed step size. This method is known to converge when F is strongly monotone and Lipschitz continuous. However, when the assumption is weakened to monotonicity, the algorithm may diverge [30]. To weaken the requirement of strong monotonicity, ref. [26] introduced the extragradient method, a two-step method with the following iteration:

\begin{cases} \bar{z}^{k} = P_{Ω}(z^{k} - α_{k} F(z^{k})), \\ z^{k+1} = P_{Ω}(z^{k} - α_{k} F(\bar{z}^{k})), \end{cases}

(8)

where $α_{k} \in (0, 1/L)$, with L denoting the Lipschitz constant of F. Alternatively, $α_{k}$ can be generated adaptively to satisfy

α_{k} \| F(z^{k}) - F(\bar{z}^{k}) \| \leq μ \| z^{k} - \bar{z}^{k} \|, \quad μ \in (0, 1).

(9)
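The difference between the one-step projection method and the extragradient method (8) can be seen on a textbook example. The sketch below assumes $Ω = \mathbb{R}^{2}$ (so the projection is the identity) and $F(z) = Az$ with a skew-symmetric A, which is monotone but not strongly monotone and has Lipschitz constant $L = 1$; the unique solution is $z^{*} = 0$. These choices are illustrative assumptions and are not the legalization problem itself.

```python
import numpy as np

A = np.array([[0.0, 1.0], [-1.0, 0.0]])  # skew-symmetric: <z, A z> = 0 for all z
F = lambda z: A @ z
tau = 0.5                                 # fixed step in (0, 1/L) with L = 1

z_proj = np.array([1.0, 1.0])             # one-step projection iterate
z_eg = np.array([1.0, 1.0])               # extragradient iterate
for _ in range(200):
    # One-step method: z <- P(z - tau*F(z)), with P = identity on R^2.
    z_proj = z_proj - tau * F(z_proj)
    # Extragradient: prediction step, then correction using F at the prediction.
    z_bar = z_eg - tau * F(z_eg)
    z_eg = z_eg - tau * F(z_bar)

norm_proj = float(np.linalg.norm(z_proj))
norm_eg = float(np.linalg.norm(z_eg))
```

For this A, each one-step update multiplies the norm by $\sqrt{1 + τ^{2}} > 1$, so the iterates spiral outward, whereas each extragradient update multiplies the squared norm by $1 - τ^{2} + τ^{4} < 1$, so the iterates contract toward the solution.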

Based on the extragradient method, a projection and contraction algorithm was proposed in [31], which can be described as follows:

\begin{cases} \bar{z}^{k} = P_{Ω}(z^{k} - τ F(z^{k})), \\ d(z^{k}, \bar{z}^{k}) = (z^{k} - \bar{z}^{k}) - τ (F(z^{k}) - F(\bar{z}^{k})), \\ z^{k+1} = z^{k} - γ ρ_{k} d(z^{k}, \bar{z}^{k}), \end{cases}

where

ρ_{k} = \frac{φ(z^{k}, \bar{z}^{k})}{\| d(z^{k}, \bar{z}^{k}) \|^{2}}, \quad φ(z^{k}, \bar{z}^{k}) = \langle z^{k} - \bar{z}^{k}, d(z^{k}, \bar{z}^{k}) \rangle,

and $γ \in (0, 2)$ is a relaxation parameter. Since first-order algorithms, particularly gradient-type methods, often suffer from slow convergence, various acceleration techniques have been developed. One typical method is the inertial technique, which updates each iterate by incorporating information from the two preceding iterates. Under the assumptions that F is monotone and Lipschitz continuous with constant L, an inertial projection and contraction algorithm was proposed in [32]:

\begin{cases} ω^{k} = z^{k} + α_{k} (z^{k} - z^{k-1}), \\ \bar{z}^{k} = P_{Ω}(ω^{k} - τ F(ω^{k})), \\ d(ω^{k}, \bar{z}^{k}) = (ω^{k} - \bar{z}^{k}) - τ (F(ω^{k}) - F(\bar{z}^{k})), \\ z^{k+1} = ω^{k} - γ ρ_{k} d(ω^{k}, \bar{z}^{k}), \end{cases}

(10)

with

ρ_{k} = \begin{cases} \frac{φ(ω^{k}, \bar{z}^{k})}{\| d(ω^{k}, \bar{z}^{k}) \|^{2}}, & \text{if } d(ω^{k}, \bar{z}^{k}) \neq 0, \\ 0, & \text{if } d(ω^{k}, \bar{z}^{k}) = 0, \end{cases}

and

φ(ω^{k}, \bar{z}^{k}) = \langle ω^{k} - \bar{z}^{k}, d(ω^{k}, \bar{z}^{k}) \rangle,

where $τ > 0$ and $γ \in (0, 2)$. The sequence $\{α_{k}\}$ is nondecreasing, with $α_{1} = 0$ and $0 \leq α_{k} \leq α < 1$, and $σ, δ > 0$ are such that

δ > \frac{α^{2} (1 + α) + α σ}{1 - α^{2}}, \quad 0 < γ \leq \frac{2 [δ - α ((1 + α) + α δ + σ)]}{δ [1 + α (1 + α) + α δ + σ]}.

It has been proved that, for $τ \in (0, \frac{1}{L})$, the sequence $\{z^{k}\}$ generated by (10) converges weakly to a solution of $\mathrm{VI}(Ω, F)$. In practical applications, L is usually hard to estimate. To overcome this limitation, a self-adaptive scheme incorporating the inertial technique, termed SIPCA (Algorithm 1), was proposed in [30]. Instead of using a fixed value $τ$, this method employs a backtracking procedure to adaptively compute an appropriate step size $τ_{k}$:

\begin{cases} ω^{k} = z^{k} + α_{k} (z^{k} - z^{k-1}), \\ \bar{z}^{k} = P_{Ω}(ω^{k} - \bar{τ}_{k} F(ω^{k})), \\ d(ω^{k}, \bar{z}^{k}) = (ω^{k} - \bar{z}^{k}) - \bar{τ}_{k} (F(ω^{k}) - F(\bar{z}^{k})), \\ z^{k+1} = ω^{k} - γ ρ_{k} d(ω^{k}, \bar{z}^{k}), \end{cases}

(11)

where $γ \in (0, 2)$, $\bar{τ}_{k} = μ^{l_{k}} τ_{k}$, $τ_{0} > 0$, $0 < μ < \frac{1}{2}$, and $l_{k}$ is the smallest nonnegative integer such that $\bar{τ}_{k}$ satisfies

\bar{τ}_{k} \langle ω^{k} - \bar{z}^{k}, F(ω^{k}) - F(\bar{z}^{k}) \rangle \leq δ \| ω^{k} - \bar{z}^{k} \|^{2},

with

φ(ω^{k}, \bar{z}^{k}) = \langle ω^{k} - \bar{z}^{k}, d(ω^{k}, \bar{z}^{k}) \rangle

and

ρ_{k} = \begin{cases} \frac{φ(ω^{k}, \bar{z}^{k})}{\| d(ω^{k}, \bar{z}^{k}) \|^{2}}, & \text{if } d(ω^{k}, \bar{z}^{k}) \neq 0, \\ 0, & \text{if } d(ω^{k}, \bar{z}^{k}) = 0, \end{cases}

where

0 < δ < \frac{1}{2}, \quad 0 < η < \frac{1}{2}, \quad \frac{2 (1 + α^{2})}{2 α^{2} + α + 1} < γ < \frac{2 (1 + α)}{1 + 2 α}.

Algorithm 1 SIPCA [30]

1: Input: $z^{-1}, z^{0} \in H$; $α \in (0, 1)$, $τ_{0} > 0$, $μ \in [\frac{1}{2}, 1)$, $δ \in (0, \frac{1}{2})$, $γ \in (\frac{2 (1 + α^{2})}{2 α^{2} + α + 1}, \frac{2 (1 + α)}{1 + 2 α})$, $τ' \in (\frac{1}{5}, \frac{1}{2}]$, $η \in (0, \frac{1}{2}]$, tolerance $ε > 0$.
2: $k \leftarrow 0$, $τ_{k} \leftarrow τ_{0}$
3: while true do
4:   $ω^{k} \leftarrow z^{k} + α (z^{k} - z^{k-1})$
5:   if $\| ω^{k} - P_{Ω}(ω^{k} - τ_{k} F(ω^{k})) \| < ε$ then
6:     return $z^{k}$
7:   end if
8:   $\bar{τ}_{k} \leftarrow τ_{k}$
9:   repeat
10:    $\bar{z}^{k} \leftarrow P_{Ω}(ω^{k} - \bar{τ}_{k} F(ω^{k}))$
11:    if $\bar{τ}_{k} \langle ω^{k} - \bar{z}^{k}, F(ω^{k}) - F(\bar{z}^{k}) \rangle \leq δ \| ω^{k} - \bar{z}^{k} \|^{2}$ then break
12:    $\bar{τ}_{k} \leftarrow μ \bar{τ}_{k}$
13:  until condition holds
14:  $d^{k} \leftarrow (ω^{k} - \bar{z}^{k}) - \bar{τ}_{k} (F(ω^{k}) - F(\bar{z}^{k}))$
15:  if $\| d^{k} \| > 0$ then $ρ_{k} \leftarrow \langle ω^{k} - \bar{z}^{k}, d^{k} \rangle / \| d^{k} \|^{2}$ else $ρ_{k} \leftarrow 0$
16:  $z^{k+1} \leftarrow ω^{k} - γ ρ_{k} d^{k}$
17:  if $\bar{τ}_{k} \langle ω^{k} - \bar{z}^{k}, F(ω^{k}) - F(\bar{z}^{k}) \rangle \leq τ' \| ω^{k} - \bar{z}^{k} \|^{2}$ then
18:    $τ_{k+1} \leftarrow (1 + η) \bar{τ}_{k}$
19:  else
20:    $τ_{k+1} \leftarrow \bar{τ}_{k}$
21:  end if
22:  $k \leftarrow k + 1$
23: end while
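A minimal NumPy sketch of Algorithm 1 for $Ω = \mathbb{R}_{+}^{n}$ is given below. It is an illustration under stated assumptions rather than the reference implementation of [30]: the function name `sipca`, the default parameter values (chosen to lie inside the ranges of the input line, e.g., $γ = 1.5$ for $α = 0.3$), the choice $z^{-1} = z^{0}$, the `max_iter` safeguard, and the 2×2 test problem are all hypothetical.

```python
import numpy as np

def sipca(F, z0, alpha=0.3, tau0=1.0, mu=0.5, delta=0.4, gamma=1.5,
          tau_prime=0.3, eta=0.2, eps=1e-8, max_iter=10000):
    """Sketch of Algorithm 1 on the nonnegative orthant; returns (z, iterations)."""
    proj = lambda v: np.maximum(v, 0.0)            # P_Omega for Omega = R_+^n
    z_prev, z, tau = z0.copy(), z0.copy(), tau0    # assumes z^{-1} = z^0
    for k in range(max_iter):
        w = z + alpha * (z - z_prev)               # line 4: inertial extrapolation
        Fw = F(w)
        if np.linalg.norm(w - proj(w - tau * Fw)) < eps:   # lines 5-7
            return z, k
        tau_bar = tau
        while True:                                # lines 9-13: backtracking
            z_bar = proj(w - tau_bar * Fw)
            Fz = F(z_bar)
            r = w - z_bar
            lhs, r2 = tau_bar * (r @ (Fw - Fz)), r @ r
            if lhs <= delta * r2:
                break
            tau_bar *= mu
        d = r - tau_bar * (Fw - Fz)                # line 14
        dd = d @ d
        rho = (r @ d) / dd if dd > 0 else 0.0      # line 15
        z_prev, z = z, w - gamma * rho * d         # line 16
        # lines 17-21: enlarge the next trial step if the test holds with margin
        tau = (1 + eta) * tau_bar if lhs <= tau_prime * r2 else tau_bar
    return z, max_iter

# Hypothetical test problem: F(z) = A z + q with solution z* = (1, 0).
A = np.array([[2.0, 1.0], [-1.0, 2.0]])
q = np.array([-2.0, 2.0])
z_sol, iters = sipca(lambda z: A @ z + q, np.zeros(2))
```

The backtracking loop reuses $F(ω^{k})$ across trial steps, so each extra trial costs one additional evaluation of F at the new $\bar{z}^{k}$.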

In SIPCA, the Lipschitz continuity requirement is removed, and the only assumption is that F is continuous. Lines 17–21 prevent $τ_{k}$ from becoming too small, thereby improving the computational efficiency. This adaptive rule enables the algorithm to automatically enlarge the step size when the residual decreases rapidly and to reduce it when instability is detected, thus maintaining a favorable balance between convergence speed and robustness.

3.2. Improved Self-Adaptive Inertial Projection and Contraction Algorithm

In recent years, the extragradient method (8) has attracted considerable attention, and numerous variants have been developed to enhance its performance due to its simple iterative form. The projection and contraction algorithm (10) is one of its important extensions, and its classical form [27] can be described as follows:

\begin{cases} \bar{z}^{k} = P_{Ω}(z^{k} - α_{k} F(z^{k})), \\ z^{k+1} = P_{Ω}(z^{k} - γ ρ_{k} α_{k} F(\bar{z}^{k})), \end{cases}

(12)

where $γ \in (0, 2)$, $α_{k}$ is either chosen from $(0, 1/L)$ or adaptively selected as a sequence $\{α_{k}\}_{k=0}^{\infty}$, and

ρ_{k} := \frac{\| z^{k} - \bar{z}^{k} \|^{2} - α_{k} \langle z^{k} - \bar{z}^{k}, F(z^{k}) - F(\bar{z}^{k}) \rangle}{\| (z^{k} - \bar{z}^{k}) - α_{k} (F(z^{k}) - F(\bar{z}^{k})) \|^{2}}.

(13)

Compared with the classical extragradient method (8), where the same step size $α_{k}$ is used in both projections, Algorithm (12) employs two different step sizes. This difference contributes to the superior computational efficiency of the projection and contraction algorithm relative to the extragradient method. On the other hand, the extragradient method involves two orthogonal projections onto $Ω$ per iteration. As a result, when the projection onto $Ω$ is not easy to compute, a minimum-distance problem must be solved twice to obtain the next iterate, potentially reducing efficiency and applicability. To address this issue, the subgradient extragradient method [33] replaces the second projection with an easily computable subgradient projection, leading to the following iterative scheme:

\begin{cases} \bar{z}^{k} = P_{Ω}(z^{k} - α_{k} F(z^{k})), \\ z^{k+1} = P_{T_{k}}(z^{k} - α_{k} F(\bar{z}^{k})), \end{cases}

(14)

where

T_{k} := \{ w \in H \mid \langle (z^{k} - α_{k} F(z^{k})) - \bar{z}^{k}, w - \bar{z}^{k} \rangle \leq 0 \},

(15)

and $α_{k} \in (0, 1/L)$, or the sequence $\{α_{k}\}_{k=0}^{\infty}$ is generated adaptively according to $α_{k} = σ ρ^{m_{k}}$, with $σ > 0$ and $ρ \in (0, 1)$. The integer $m_{k}$ denotes the smallest nonnegative value for which

α_{k} \| F(z^{k}) - F(\bar{z}^{k}) \| \leq μ \| z^{k} - \bar{z}^{k} \|, \quad μ \in (0, 1).

(16)
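The projection onto the half-space $T_{k}$ in (15) has a closed form, which is what makes the second step of (14) cheap. Writing $a = (z^{k} - α_{k} F(z^{k})) - \bar{z}^{k}$, the projection of a point $u$ is $u - \max\{0, \langle a, u - \bar{z}^{k} \rangle\} \, a / \|a\|^{2}$. The sketch below implements this formula; the function name and the test points are hypothetical.

```python
import numpy as np

def project_halfspace(u, a, z_bar):
    """Project u onto {w : <a, w - z_bar> <= 0}.

    If a = 0 the set is the whole space and u is returned unchanged.
    """
    aa = float(a @ a)
    if aa == 0.0:
        return u.copy()
    violation = max(0.0, float(a @ (u - z_bar)))
    return u - (violation / aa) * a

a = np.array([1.0, 0.0])          # half-space {w : w_1 <= 0} when z_bar = 0
z_bar = np.zeros(2)
p_outside = project_halfspace(np.array([2.0, 3.0]), a, z_bar)   # moved onto the boundary
p_inside = project_halfspace(np.array([-1.0, 3.0]), a, z_bar)   # already feasible
```

Only inner products and one vector update are needed, so the cost is linear in the dimension regardless of how complicated $Ω$ is.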

As discussed above, both step size-based extragradient methods and subgradient extragradient methods play important roles in influencing the convergence behavior of two-step algorithms. However, subgradient extragradient methods, as gradient-type methods, often exhibit relatively lower convergence efficiency. Therefore, it is natural to ask whether step-size adjustment, subgradient extragradient strategies, and inertial techniques can be integrated to further improve the convergence performance of projection and contraction algorithms.

Motivated by the above observations, and in order to tackle the VI problem arising from large-scale mixed-cell-height circuit legalization, we develop an improved self-adaptive inertial projection and contraction algorithm, termed SIPCA_IP (Algorithm 2), which integrates the inertial technique with the subgradient extragradient method.

Algorithm 2 SIPCA_IP

1: Input: $z^{-1}, z^{0} \in H$; $α \in (0, 1)$, $τ_{0} > 0$, $μ \in [\frac{1}{2}, 1)$, $δ \in (0, \frac{1}{2})$, $γ \in (\frac{2 (1 + α^{2})}{2 α^{2} + α + 1}, \frac{2 (1 + α)}{1 + 2 α})$, $τ' \in (\frac{1}{5}, \frac{1}{2}]$, $η \in (0, \frac{1}{2}]$, tolerance $ε > 0$.
2: $k \leftarrow 0$, $τ_{k} \leftarrow τ_{0}$
3: while true do
4:   $ω^{k} \leftarrow z^{k} + α (z^{k} - z^{k-1})$
5:   if $\| ω^{k} - P_{Ω}(ω^{k} - τ_{k} F(ω^{k})) \| < ε$ then
6:     return $z^{k}$
7:   end if
8:   $\bar{τ}_{k} \leftarrow τ_{k}$
9:   repeat
10:    $\bar{z}^{k} \leftarrow P_{Ω}(ω^{k} - \bar{τ}_{k} F(ω^{k}))$
11:    if $\bar{τ}_{k} \langle ω^{k} - \bar{z}^{k}, F(ω^{k}) - F(\bar{z}^{k}) \rangle \leq δ \| ω^{k} - \bar{z}^{k} \|^{2}$ then break
12:    $\bar{τ}_{k} \leftarrow μ \bar{τ}_{k}$
13:  until condition holds
14:  $d^{k} \leftarrow (ω^{k} - \bar{z}^{k}) - \bar{τ}_{k} (F(ω^{k}) - F(\bar{z}^{k}))$
15:  if $\| d^{k} \| > 0$ then $ρ_{k} \leftarrow \langle ω^{k} - \bar{z}^{k}, d^{k} \rangle / \| d^{k} \|^{2}$ else $ρ_{k} \leftarrow 0$
16:  $\tilde{z}^{k} \leftarrow z^{k} - γ ρ_{k} \bar{τ}_{k} F(\bar{z}^{k})$
17:  $a^{k} \leftarrow (z^{k} - \bar{τ}_{k} F(z^{k})) - \bar{z}^{k}$
18:  $z^{k+1} \leftarrow \tilde{z}^{k} - \max\{0, \langle a^{k}, \tilde{z}^{k} - \bar{z}^{k} \rangle\} a^{k} / \| a^{k} \|^{2}$ ▹ $T_{k} = \{w : \langle a^{k}, w - \bar{z}^{k} \rangle \leq 0\}$
19:  if $\bar{τ}_{k} \langle ω^{k} - \bar{z}^{k}, F(ω^{k}) - F(\bar{z}^{k}) \rangle \leq τ' \| ω^{k} - \bar{z}^{k} \|^{2}$ then
20:    $τ_{k+1} \leftarrow (1 + η) \bar{τ}_{k}$
21:  else
22:    $τ_{k+1} \leftarrow \bar{τ}_{k}$
23:  end if
24:  $k \leftarrow k + 1$
25: end while

Compared with SIPCA, the proposed SIPCA_IP introduces a key modification in the update step: Lines 16–18 replace Line 16 of the original algorithm. Instead of performing the original direct iterative update, SIPCA_IP employs a subgradient projection step, which provides a more stable search direction and enhances convergence efficiency.
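The modified update can be sketched in NumPy as follows for $Ω = \mathbb{R}_{+}^{n}$. Two points are assumptions of this sketch and should be read with care. First, the correction step and the half-space are built from the extrapolated point $ω^{k}$, following the iteration as written in Theorem 1, whereas lines 16–17 of Algorithm 2 as listed write $z^{k}$. Second, the function name `sipca_ip`, the default parameter values, the choice $z^{-1} = z^{0}$, the `max_iter` safeguard, and the 2×2 test problem are hypothetical.

```python
import numpy as np

def sipca_ip(F, z0, alpha=0.3, tau0=1.0, mu=0.5, delta=0.4, gamma=1.5,
             tau_prime=0.3, eta=0.2, eps=1e-8, max_iter=10000):
    """Sketch of the SIPCA_IP iteration on the nonnegative orthant."""
    proj = lambda v: np.maximum(v, 0.0)            # P_Omega for Omega = R_+^n
    z_prev, z, tau = z0.copy(), z0.copy(), tau0    # assumes z^{-1} = z^0
    for k in range(max_iter):
        w = z + alpha * (z - z_prev)               # inertial extrapolation
        Fw = F(w)
        if np.linalg.norm(w - proj(w - tau * Fw)) < eps:
            return z, k
        tau_bar = tau
        while True:                                # backtracking on tau_bar
            z_bar = proj(w - tau_bar * Fw)
            Fz = F(z_bar)
            r = w - z_bar
            lhs, r2 = tau_bar * (r @ (Fw - Fz)), r @ r
            if lhs <= delta * r2:
                break
            tau_bar *= mu
        d = r - tau_bar * (Fw - Fz)
        dd = d @ d
        rho = (r @ d) / dd if dd > 0 else 0.0
        # Correction point and half-space normal (built from w: an assumption).
        z_tilde = w - gamma * rho * tau_bar * Fz
        a = (w - tau_bar * Fw) - z_bar
        aa = a @ a
        # Closed-form projection of z_tilde onto T_k = {v : <a, v - z_bar> <= 0}.
        shift = max(0.0, a @ (z_tilde - z_bar)) / aa if aa > 0 else 0.0
        z_prev, z = z, z_tilde - shift * a
        tau = (1 + eta) * tau_bar if lhs <= tau_prime * r2 else tau_bar
    return z, max_iter

# Hypothetical test problem: F(z) = A z + q with solution z* = (1, 0).
A = np.array([[2.0, 1.0], [-1.0, 2.0]])
q = np.array([-2.0, 2.0])
z_sol, iters = sipca_ip(lambda z: A @ z + q, np.zeros(2))
```

Because $\bar{z}^{k}$ is the projection of $ω^{k} - \bar{τ}_{k} F(ω^{k})$ onto $Ω$, the half-space defined with this normal vector contains $Ω$, which is the property the subgradient extragradient construction relies on.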

3.3. Convergence Analysis

In this section, we investigate the convergence behavior of Algorithm 2. Let $Ω$ be a nonempty closed convex set, and let $F : Ω \to \mathbb{R}^{n}$ be monotone and Lipschitz continuous. We denote the solution set of $\mathrm{VI}(Ω, F)$ by $\mathrm{SOL}(Ω, F)$, which is assumed to be nonempty.

Theorem 1.

Let $Ω \subseteq \mathbb{R}^{n}$ be a nonempty closed convex set and $F : Ω \to \mathbb{R}^{n}$ be monotone and Lipschitz continuous. Moreover, assume that the solution set $\mathrm{SOL}(Ω, F)$ is nonempty. Under the conditions of Algorithm 2, let $\{z^{k}\}$ be generated by

ω^{k} = z^{k} + α_{k} (z^{k} - z^{k-1}),

\bar{z}^{k} = P_{Ω}(ω^{k} - τ_{k} F(ω^{k})),

d(ω^{k}, \bar{z}^{k}) := (ω^{k} - \bar{z}^{k}) - τ_{k} (F(ω^{k}) - F(\bar{z}^{k})),

ρ_{k} := \frac{\langle ω^{k} - \bar{z}^{k}, d(ω^{k}, \bar{z}^{k}) \rangle}{\| d(ω^{k}, \bar{z}^{k}) \|^{2}},

z^{k+1} = P_{T_{k}}(ω^{k} - γ ρ_{k} τ_{k} F(\bar{z}^{k})), \quad γ \in \left( \frac{2 (1 + α^{2})}{2 α^{2} + α + 1}, \frac{2 (1 + α)}{1 + 2 α} \right),

where

T_{k} := \{ w \in \mathbb{R}^{n} : \langle (ω^{k} - τ_{k} F(ω^{k})) - \bar{z}^{k}, w - \bar{z}^{k} \rangle \leq 0 \}.

Suppose that the following line-search condition holds for every $k$:

τ_{k} \| F(ω^{k}) - F(\bar{z}^{k}) \| \leq μ \| ω^{k} - \bar{z}^{k} \|, \quad μ \in [\frac{1}{2}, 1),

and that the inertial parameters satisfy

0 \leq α_{k} \leq \bar{α} < 1, \quad \sum_{k=0}^{\infty} α_{k} \| z^{k} - z^{k-1} \| < \infty.

Then, $\{z^{k}\}$ is bounded, and

\lim_{k \to \infty} \| ω^{k} - \bar{z}^{k} \| = 0.

Moreover, every cluster point of $\{z^{k}\}$ belongs to $\mathrm{SOL}(Ω, F)$. In the case where $\mathrm{VI}(Ω, F)$ has a unique solution, the sequence $\{z^{k}\}$ converges to the unique solution.

Proof.

Due to space considerations, the detailed proof is provided in Appendix A. □

Remark 2.

From the structures of $W$ and $E^{T} E$ in Remark 1, it follows that $A$ is a constant matrix. Moreover, noting that $Q = I$ and that each row of $W$ contains only two nonzero entries, $-1$ and $1$, while each block of $E^{T} E$ is bounded, one can estimate that $\| A \| \leq 3 + 2 λ$. Therefore, F is Lipschitz continuous with $L = 3 + 2 λ$. Further, the matrix $A$ is positive semidefinite, which implies that F is monotone. Consequently, the proposed algorithm is applicable to the VI considered in this work.
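The estimate $\| A \| \leq 3 + 2 λ$ can be checked numerically on the small example of Section 2.1. The sketch below computes the spectral norm of A and compares it with the bound; the value of λ is hypothetical, and a single instance is of course only a sanity check, not a proof.

```python
import numpy as np

lam = 10.0  # hypothetical value of λ
W = np.array([[-1, 0, 1, 0, 0, 0],
              [0, 0, -1, 1, 0, 0],
              [0, -1, 0, 0, 1, 0],
              [0, 0, 0, 0, -1, 1]], dtype=float)
E = np.array([[-1, 1, 0, 0, 0, 0],
              [0, 0, 0, -1, 1, 0]], dtype=float)

m, n = W.shape
B = np.eye(n) + lam * (E.T @ E)
A = np.block([[B, -W.T], [W, np.zeros((m, m))]])

spectral_norm = float(np.linalg.norm(A, 2))   # largest singular value of A
bound = 3.0 + 2.0 * lam
```

For this instance the norm of the B block is $1 + 2λ$ and the norm of W is $\sqrt{3}$, so the spectral norm of A is at most $1 + 2λ + \sqrt{3}$, which is below the bound.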

3.4. Computational Complexity Analysis

In this subsection, we analyze the computational complexity of the proposed SIPCA_IP algorithm.

At the kth iteration, the extrapolation step

ω^{k} = z^{k} + α_{k} (z^{k} - z^{k - 1})

only involves vector addition and scalar multiplication, and thus requires

O (n)

operations. The main computational cost comes from the evaluation of the mapping F and the projection step. Specifically, one evaluation of

F (ω^{k})

is needed to compute

y^{k} = P_{Ω} (ω^{k} - {\bar{τ}}_{k} F (ω^{k})),

while an additional evaluation of

F (y^{k})

is required to form

d (ω^{k}, y^{k}) = (ω^{k} - y^{k}) - {\bar{τ}}_{k} (F (ω^{k}) - F (y^{k})) .

Therefore, each iteration computes the mapping F twice. In the proposed algorithm, F is induced by a sparse matrix–vector multiplication. Hence, each evaluation of F requires

O (nnz (A))

operations, where

nnz (A)

denotes the number of nonzero entries of the system matrix A of F. If the algorithm terminates after K iterations, the total complexity becomes

O (K nnz (A))

. Since the dominant cost of SIPCA_IP is determined by sparse matrix–vector multiplications and simple projection operations, it is computationally efficient for large-scale sparse legalization problems.
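The O(nnz(A)) cost per evaluation of F can be seen directly from a compressed sparse row (CSR) matrix–vector product, sketched below in plain Python. The storage format and the tiny example matrix are assumptions made for illustration; the paper does not state which sparse format its C++ code uses.

```python
def csr_matvec(indptr, indices, data, x):
    """Return (A @ x, number of multiply-adds) for A stored in CSR form."""
    y = [0.0] * (len(indptr) - 1)
    ops = 0
    for i in range(len(y)):
        acc = 0.0
        # Only the stored nonzeros of row i are visited.
        for p in range(indptr[i], indptr[i + 1]):
            acc += data[p] * x[indices[p]]
            ops += 1
        y[i] = acc
    return y, ops

# Hypothetical 3x3 matrix with 5 nonzeros:
# [[1, 0, -1],
#  [0, 2,  0],
#  [4, 0,  3]]
indptr = [0, 2, 3, 5]
indices = [0, 2, 1, 0, 2]
data = [1.0, -1.0, 2.0, 4.0, 3.0]

y, ops = csr_matvec(indptr, indices, data, [1.0, 1.0, 1.0])
print(y, ops)
```

The operation counter equals the number of stored entries, so two evaluations of F per iteration cost about 2 nnz(A) multiply-adds, in line with the O(K nnz(A)) total stated above.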

Remark 3.

It is worth noting that establishing an explicit convergence rate for the proposed SIPCA_IP method such as

O (1 / k)

is technically challenging due to the incorporation of adaptive step-size strategies and inertial mechanisms. These components introduce additional nonlinearity into the iterative process, making standard convergence rate analysis difficult to apply directly. Therefore, the current work primarily focuses on establishing the convergence properties of the proposed method. The investigation of explicit convergence rates will be pursued in future work.

3.5. Legalization Framework

Figure 2 illustrates the overall framework for mixed-cell-height circuit legalization. The legalization stage begins with a global placement solution, where cell locations are estimated without considering overlaps. We first align each cell to the nearest feasible row while ignoring the right boundary constraints. Multi-row-height standard cells are partitioned into single-row-height subcells. Consequently, the legalization task is formulated as a QP model and then reformulated as a VI problem. The resulting VI is solved by the SIPCA and SIPCA_IP algorithms. Due to numerical precision, overlaps may still occur after restoring the multi-row-height cells. These remaining overlaps are then resolved using a Tetris-like allocation method [3].
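The first two steps of the flow, row alignment and splitting of multi-row cells, can be sketched as follows. This is only an illustration under simplifying assumptions: rows are uniform and indexed from 0, a cell is described by its lower y coordinate and its height in rows, and the function names `snap_to_row` and `split_into_subcells` are hypothetical. Any further feasibility rules used in the actual flow are not modeled.

```python
import math

def snap_to_row(y, row_height, num_rows, height_rows=1, y0=0.0):
    """Index of the nearest row such that a cell of `height_rows` rows still fits."""
    nearest = math.floor((y - y0) / row_height + 0.5)
    return max(0, min(num_rows - height_rows, nearest))

def split_into_subcells(row, height_rows):
    """Rows occupied by the single-row-height subcells of one cell."""
    return [row + r for r in range(height_rows)]

# A double-height cell near the top of a 5-row design is pushed down so it fits.
row = snap_to_row(37.0, row_height=10.0, num_rows=5, height_rows=2)
print(row, split_into_subcells(row, 2))
```

After the solver has run, the subcells of one original cell are merged back into a single cell, which is the restoration step mentioned above.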

4. Experimental Results and Discussion

This section presents numerical experiments to evaluate the convergence behavior and layout quality of the proposed SIPCA_IP in comparison with SIPCA and several representative methods, including the modulus-based method, the robust modulus-based method, and the Newton method, for mixed-cell-height circuit legalization problems. First, we compare SIPCA and SIPCA_IP in terms of convergence behavior and layout quality. Subsequently, under identical stopping criteria, both methods are further compared with the above representative algorithms. In addition, the robustness of the proposed method and its sensitivity to parameter settings are investigated.

Experiments are conducted on seven standard mixed-cell-height benchmarks from the ISPD 2015 Detailed Routing-Driven Placement Contest [34]. Since the original cell library does not contain multi-row-height cells, 10% of the cells are randomly selected, and each selected cell has its height doubled and its width halved. These benchmarks are provided by the authors of [11] and have been widely used in studies on mixed-cell-height legalization. Table 2 presents the cell statistics for these benchmarks. “T.Cell”, “S.Cell”, “D.Cell”, and “Dens.” denote the total cell count, single-row-height cell count, double-row-height cell count, and design density, respectively. “W.size” and “E.size” denote the dimensions of matrices W and E. The implementation is carried out in C++ using Microsoft Visual Studio Community 2022 (64 bit) version 17.11.4, and the experiments are executed on a machine featuring an Intel Core i5 processor with 32 GB RAM.

The efficiency of the proposed algorithm is evaluated from three perspectives: IT, CPU time, and

R E S

. Here, IT denotes the iteration number, CPU time records the running time in seconds, and

R E S

is defined by

R E S = ∥ ω^{k} - P_{Ω} (ω^{k} - τ_{k} F (ω^{k})) ∥

, which is evaluated in both algorithms. The parameters used in the experiments are selected according to the empirical settings reported in the existing literature [1,30], which have been shown to provide stable and efficient performance. The stopping tolerance is set to

ε = 10^{- 6}

, and the maximum number of iterations is set to

I T_{max} = 3000

. For the experiments with increased proportions of multi-height cells,

I T_{max}

is increased to 5000 to ensure sufficient convergence. The algorithms terminate when

R E S < ε

or

I T_{max}

is reached, with

z^{0} = {(0, 0, \dots, 0)}^{T} \in R^{n + m}

. The detailed parameter configurations and implementation settings are provided in Appendix B. Note that the stopping tolerance is set to

ε = 10^{- 6}

in the first two subsections. In the parameter sensitivity analysis (Section 4.3), a stricter tolerance

ε = 10^{- 7}

is also considered to examine the influence of the stopping criterion.
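The stopping quantity RES is the natural residual of the VI: it vanishes exactly at a solution, for any positive step size. A minimal Python sketch follows; the toy map F and the nonnegative orthant used as Ω are assumptions chosen for illustration and do not come from the benchmarks.

```python
import math

def natural_residual(w, tau, F, project):
    """RES = || w - P_Omega(w - tau * F(w)) ||."""
    Fw = F(w)
    p = project([wi - tau * fi for wi, fi in zip(w, Fw)])
    return math.sqrt(sum((wi - pi) ** 2 for wi, pi in zip(w, p)))

# Hypothetical toy VI on Omega = {z >= 0}; its solution is (1, 0).
def F(z):
    return [2 * z[0] + z[1] - 2, -z[0] + 2 * z[1] + 3]

def project(z):
    return [max(0.0, v) for v in z]

res_at_solution = natural_residual([1.0, 0.0], 0.25, F, project)
res_elsewhere = natural_residual([0.0, 0.0], 0.25, F, project)
print(res_at_solution, res_elsewhere)
```

In an iteration loop, the test `RES < ε` would be combined with the cap on the iteration count, so that the loop exits on whichever condition is met first.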

4.1. Comparison Between the Proposed Algorithms

This subsection compares SIPCA and SIPCA_IP in terms of IT, CPU time, and

R E S

. The “N.Avg” row reports the average normalized ratios of total runtime with respect to SIPCA_IP.

As summarized in Table 3, the improved SIPCA_IP algorithm consistently achieves comparable or higher accuracy with markedly fewer iterations and shorter CPU time. On average, the IT and CPU time of SIPCA are approximately 2.069× and 1.467× those of SIPCA_IP, respectively, confirming the superior convergence efficiency of SIPCA_IP.
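One plausible reading of an average normalized ratio is the mean, over benchmarks, of the per-benchmark ratio between the two methods. The sketch below computes it under that assumption. The numbers are made-up placeholders, not values from Table 3, and the paper may instead use a ratio of sums.

```python
def normalized_average(baseline, reference):
    """Mean of per-benchmark ratios baseline[i] / reference[i]."""
    ratios = [b / r for b, r in zip(baseline, reference)]
    return sum(ratios) / len(ratios)

# Placeholder iteration counts for three hypothetical benchmarks.
it_sipca = [400, 900, 300]
it_sipca_ip = [200, 300, 300]

n_avg = normalized_average(it_sipca, it_sipca_ip)
print(n_avg)
```

Normalizing per benchmark before averaging prevents a single large instance from dominating the summary figure.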

To further evaluate the solution quality achieved by the two algorithms, we compare their overlaps and total displacement. Table 4 presents the quantitative contribution of the Tetris-like refinement stage for both SIPCA and SIPCA_IP. The solver outputs before refinement, including the number of overlaps and displacement values, are reported together with the final displacement values obtained after refinement. The runtime of the refinement stage (denoted as R.Time) is also recorded separately for each benchmark instance.

After applying the Tetris-like refinement, all remaining overlaps are completely eliminated for all benchmark instances. Therefore, the overlap counts after refinement are not listed in the table. Moreover, the runtime of the refinement stage remains extremely small across all benchmark instances, typically ranging from 0.001 to 0.005 s. Meanwhile, the displacement values after refinement show only minor changes compared with those before refinement, indicating that the refinement primarily resolves residual overlaps while preserving displacement quality.

Overall, the VI solver itself determines most of the solution quality, while the Tetris-like refinement serves as an efficient postprocessing step. On average, the total displacement produced by SIPCA is 1.009× that of SIPCA_IP, and the number of overlaps generated by SIPCA is 1.434× that of SIPCA_IP. These results indicate that, under the same termination accuracy, SIPCA_IP consistently achieves better placement quality than SIPCA.

To further evaluate the robustness of the proposed method under more challenging benchmark settings, we increase the proportion of double-height cells from 10% to 20%. The double-height cells are generated using a fixed random seed (seed = 1234). The statistics of the benchmark instances, including the numbers of single-height and double-height cells, as well as the corresponding matrix dimensions, are summarized in Table 5.

Compared with the original 10% setting, increasing the proportion of double-height cells significantly enlarges the constraint matrix size and increases the complexity of the legalization problem. Due to the increased problem scale, the maximum iteration number is increased from 3000 to 5000 for both SIPCA and SIPCA_IP to ensure sufficient convergence, while the stopping tolerance (

R E S

) remains

10^{- 6}

. If the iteration number reaches 5000, it indicates that the method has reached the maximum iteration limit without satisfying the stopping criterion.

The convergence performance of SIPCA and SIPCA_IP under the 20% double-height-cell setting is presented in Table 6. As shown in the table, SIPCA reaches the maximum iteration limit on several benchmarks, such as des_perf_a, fft_a, and pci_bridge32_b, while SIPCA_IP converges on all tested benchmarks within significantly fewer iterations. The enlarged matrix dimensions and the slower convergence indicate that increasing the proportion of double-height cells to 20% makes the legalization problem considerably harder. Despite this increased difficulty, SIPCA_IP maintains stable convergence across all tested benchmarks, which confirms its robustness under more challenging mixed-cell-height cases.

4.2. Comparison with Existing Methods

In this subsection, we compare the total cell displacement of our proposed methods with that of three representative state-of-the-art legalization methods, namely, the modulus-based method [1], the robust modulus-based method [20], and the robust Newton method [4]. To ensure fair and controlled comparisons, all algorithms are implemented within the same legalization framework used in this study. Specifically, in the legalization flow shown in Figure 2, the VI formulation converted from the QP model and its corresponding solver are replaced by the respective baseline formulations and solution methods while all other procedures remain unchanged.

To ensure a consistent comparison environment, the same benchmark instances, stopping criteria, evaluation procedures, and hardware/software settings are applied to all methods. In particular, the termination conditions are unified across all methods, i.e., the iterations terminate when

R E S = ∥ ω^{k} - P_{Ω} (ω^{k} - τ_{k} F (ω^{k})) ∥ < 10^{- 6}

or when the maximum number of iterations

I T_{max} = 3000

is reached. All experiments are conducted on the same computing platform described in Appendix B.

Table 7 presents the controlled comparison results obtained under unified experimental settings. From the results, it can be seen that the proposed SIPCA_IP method achieves the smallest or highly competitive displacement values after refinement on most benchmark instances, such as des_perf_b, fft_a, and fft_b. This demonstrates the effectiveness of the proposed method in improving legalization quality under identical experimental conditions. Furthermore, the computational time of SIPCA_IP remains comparable to or lower than that of several baseline methods on multiple benchmarks, indicating that the improved performance is achieved without introducing significant computational overhead.

To further evaluate the final legalization performance, the final displacement results of all methods are summarized in Table 8. From Table 8, SIPCA_IP achieves total displacement comparable to that of SIPCA while outperforming the modulus-based and robust Newton approaches by

2.1 %

and

1.1 %

, respectively. The total displacement reported in [20] is

0.991 \times

that of SIPCA_IP. These results show that SIPCA and SIPCA_IP achieve competitive performance compared with the existing approaches in terms of total displacement.

4.3. Sensitivity Analysis with Respect to $α$ and $ε$

Both SIPCA and SIPCA_IP involve several parameters whose values are chosen according to the empirical settings suggested in the existing literature [30]. In this subsection, we investigate the sensitivity of the algorithm to two important parameters, namely, the relaxation parameter

α

and the stopping tolerance

ε

.

Since the parameter

α

controls the relaxation step in the iterative process and may affect the convergence, we first analyze the influence of

α

. To provide a more comprehensive evaluation of the parameter sensitivity, the influence of the parameter

α

is investigated on all seven benchmark instances used in this study. Specifically, both SIPCA and SIPCA_IP are tested with

α

varying from

0.1

to

0.9

with a step size of

0.1

. For each value of

α

, the iteration numbers and CPU times obtained from all benchmarks are collected, and their average values are reported to reflect the overall performance trend. The corresponding results are illustrated in Figure 3, where Figure 3a shows the average iteration numbers versus

α

and Figure 3b presents the average CPU time versus

α

.

From Figure 3, it can be observed that both the average number of iterations and the CPU time of SIPCA and SIPCA_IP decrease significantly as

α

increases. Moreover, SIPCA_IP exhibits a faster reduction than SIPCA, indicating that

α

significantly affects convergence, with SIPCA_IP being more sensitive to its choice. Based on the above observations,

α = 0.9

is adopted in all subsequent experiments, since it provides faster convergence and lower computational cost while maintaining stable performance. It should be noted that a theoretical analysis of the sensitivity of the parameter

α

, as well as the influence of other parameters on the performance of the two algorithms, deserves further investigation.

Next, we investigate the influence of the stopping tolerance

ε

, which is employed as the stopping criterion to terminate the iteration process. In general, a smaller value of

ε

leads to higher solution accuracy but may increase the computational cost. Therefore, experiments with different values of

ε

are conducted to examine the trade-off between computational efficiency and solution accuracy.

As summarized in Table 3 and Table 9, tightening the termination tolerance from

10^{- 6}

to

10^{- 7}

leads to an increased number of IT across all test cases, indicating the additional computational effort required for higher precision. However, the extent of this increase varies between the two algorithms. The improved SIPCA_IP algorithm consistently attains comparable or higher accuracy with markedly fewer iterations and a shorter CPU time. When the tolerance is further tightened to

10^{- 7}

, both algorithms require more IT; nevertheless, SIPCA_IP maintains its advantage, exhibiting smaller increases in both IT and CPU time. Moreover, for the benchmarks des_perf_a, fft_a, and pci_bridge32_b, the baseline SIPCA fails to reach the required accuracy within the maximum iteration limit, whereas SIPCA_IP successfully satisfies the tolerance in all cases.

In addition, we compare the overlap counts and total displacement of the two algorithms under different stopping tolerances. The corresponding results are summarized in Table 10. In addition to displacement values, the corresponding overlap counts and refinement runtimes are also reported for each stopping condition. This enables a further evaluation of the robustness of the Tetris-like refinement stage under varying termination criteria. It can be observed that all remaining overlap counts are completely eliminated after refinement across different stopping tolerances while the refinement runtime remains consistently small. Furthermore, Table 4 and Table 10 show that, as the stopping tolerance becomes stricter, both the overlap counts and total displacement decrease. Specifically, for SIPCA, the overlap count decreases by 18.18% and the total displacement by 0.08%; for SIPCA_IP, the overlap count decreases by 8.69% and the total displacement by 0.02%.

From the comparison under different stopping tolerances (Figure 4 and Figure 5), it can be seen that decreasing the stopping tolerance significantly increases iteration counts and CPU time for both algorithms. However, the impact of the stopping tolerance on legalization quality, measured by total displacement and overlap counts, is relatively small. This suggests that, in practice, the stopping tolerance can be moderately relaxed to achieve a better balance between computational efficiency and solution quality.

To further illustrate the relationship between computational cost and solution precision, a time–precision trade-off analysis is conducted. Specifically, the stopping tolerance

ε

is varied from

10^{- 3}

to

10^{- 7}

. For each tolerance value, both SIPCA and SIPCA_IP are executed on all seven benchmarks. The average CPU time and iteration numbers over the seven benchmarks are computed and plotted as functions of

- \log_{10} (ε)

, as shown in Figure 6.

From Figure 6a, it can be observed that the computational time increases steadily as higher precision is required. Meanwhile, Figure 6b shows that the iteration numbers also increase as the stopping tolerance decreases. In all cases, SIPCA_IP consistently requires fewer iterations and less computational time than SIPCA, demonstrating its superior efficiency under different precision requirements.

4.4. Discussion

The experimental results demonstrate that both SIPCA and SIPCA_IP achieve stable convergence, while SIPCA_IP exhibits clear advantages in convergence speed and overall performance. Compared with SIPCA, SIPCA_IP attains the prescribed accuracy with significantly fewer iterations and a shorter CPU time. In terms of displacement quality, SIPCA_IP produces smaller overlap counts and total displacement under identical termination conditions, leading to improved legalization results.

In addition, to investigate the impact of matrix size on algorithm performance, seven benchmark instances are considered, which are arranged in ascending order according to the matrix size, measured by the number of nonzero elements (nnz(A)). As shown in Figure 7a, the overall CPU time tends to increase as the matrix size grows, indicating that the computational cost generally increases with problem scale. Figure 7b presents the corresponding iteration numbers. It can be observed that the iteration numbers of SIPCA vary more significantly as the matrix size increases, suggesting that SIPCA is relatively sensitive to problem scale. In contrast, the iteration numbers of SIPCA_IP remain comparatively stable across different matrix sizes, indicating a weaker dependence of iteration counts on matrix size and thus demonstrating improved scalability over a range of problem scales.

Overall, the SIPCA_IP enhances convergence robustness, computational efficiency, and displacement quality simultaneously, providing a more reliable and scalable solution framework for large-scale problems. Although the proposed algorithm performs well on benchmark instances with matrix sizes up to

10^{5}

, future work will involve further evaluation on datasets of the order of millions.

5. Conclusions and Outlook

This study transforms the mixed-cell-height legalization problem into a VI framework and addresses it using SIPCA. Inspired by the subgradient extragradient method, we further propose SIPCA_IP, which integrates adaptive step size and a two-step strategy to enhance convergence stability and computational efficiency. Extensive experiments demonstrate that SIPCA_IP achieves faster convergence, fewer iterations, and improved legalization quality, producing smaller overlap counts and total displacement compared with the baseline SIPCA. In addition, comparative experiments with representative baseline methods conducted under unified experimental settings and identical stopping tolerances demonstrate that SIPCA_IP achieves competitive or superior performance across all benchmark instances, confirming its effectiveness and robustness for large-scale mixed-cell-height legalization problems.

In future work, the proposed VI-based framework may be extended to incorporate additional design constraints, such as half-row-height and fence-region constraints. Such extensions would require modifying the feasible set to accommodate the additional placement restrictions, while the projection-based iterative structure of the algorithm would remain applicable. Moreover, due to the multiple algorithmic parameters involved in SIPCA and SIPCA_IP, integrating the proposed framework with advanced layout engines and machine learning-based parameter optimization strategies is expected to further enhance the adaptability and efficiency in practical VLSI design.

Author Contributions

Conceptualization, L.W. and Q.S.; methodology, L.W.; software, C.Z.; validation, C.Z.; formal analysis, L.W.; investigation, L.W.; data curation, Q.S.; writing—original draft preparation, L.W.; writing—review and editing, C.Z.; supervision, Q.S.; funding acquisition, C.Z. and Q.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (grant nos. 12401700, 12471354, 92373207), the Open Fund Project of Hainan Provincial Key Laboratory of Computational Science and Applications (grant no. JSKX202402), the Jiangsu Province Postgraduate Research and Practice Innovation Program (grant no. KYCX24_3640), and the QingLan Project of Jiangsu Province, China.

Data Availability Statement

The data used to support the reported results are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A. Proof of Theorem 1

In this appendix, we provide the detailed proof of Theorem 1.

Theorem A1

(Theorem 1). Let

Ω \subseteq R^{n}

be a nonempty closed convex set and

F : Ω \to R^{n}

be monotone and Lipschitz continuous. Moreover, let the solution set

SOL (Ω, F)

be nonempty. Under the conditions of Algorithm 2, let

{z^{k}}

be generated by

ω^{k} = z^{k} + α_{k} (z^{k} - z^{k - 1}),

{\bar{z}}^{k} = P_{Ω} (ω^{k} - τ_{k} F (ω^{k})),

d (ω^{k}, {\bar{z}}^{k}) : = (ω^{k} - {\bar{z}}^{k}) - τ_{k} (F (ω^{k}) - F ({\bar{z}}^{k})),

ρ_{k} : = \frac{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}{{∥ d (ω^{k}, {\bar{z}}^{k}) ∥}^{2}},

z^{k + 1} = P_{T_{k}} (ω^{k} - γ ρ_{k} τ_{k} F ({\bar{z}}^{k})), γ \in (\frac{2 (1 + α^{2})}{2 α^{2} + α + 1}, \frac{2 (1 + α)}{1 + 2 α}),

where

T_{k} : = \{w \in R^{n} : 〈(ω^{k} - τ_{k} F (ω^{k})) - {\bar{z}}^{k}, w - {\bar{z}}^{k}〉 \leq 0\} .

Suppose that the following line-search condition holds for every k:

τ_{k} ∥ F (ω^{k}) - F ({\bar{z}}^{k}) ∥ \leq μ ∥ ω^{k} - {\bar{z}}^{k} ∥, μ \in [\frac{1}{2}, 1),

and that the inertial parameters satisfy

0 \leq α_{k} \leq \bar{α} < 1, \sum_{k = 0}^{\infty} α_{k} ∥ z^{k} - z^{k - 1} ∥ < \infty .

Then,

{z^{k}}

is bounded, and

\lim_{k \to \infty} ∥ ω^{k} - {\bar{z}}^{k} ∥ = 0 .

Moreover, every cluster point of

{z^{k}}

belongs to

SOL (Ω, F)

. In the case where

V I (Ω, F)

has a unique solution, the sequence

{z^{k}}

converges to the unique solution.

Proof.

Let

z^{*} \in SOL (Ω, F)

be arbitrary. For clarity, the proof proceeds in six steps, followed by the argument for the uniqueness case:

Step 1: The solution set is contained in $T_{k}$.

Since

{\bar{z}}^{k} = P_{Ω} (ω^{k} - τ_{k} F (ω^{k})),

the characterization of the metric projection yields

〈{\bar{z}}^{k} - (ω^{k} - τ_{k} F (ω^{k})), z - {\bar{z}}^{k}〉 \geq 0, \forall z \in Ω .

Equivalently,

〈(ω^{k} - τ_{k} F (ω^{k})) - {\bar{z}}^{k}, z - {\bar{z}}^{k}〉 \leq 0, \forall z \in Ω .

For any solution

z^{*} \in SOL (Ω, F) \subseteq Ω

, we have

〈(ω^{k} - τ_{k} F (ω^{k})) - {\bar{z}}^{k}, z^{*} - {\bar{z}}^{k}〉 \leq 0 .

Hence,

z^{*} \in T_{k}

for each k.
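The inclusion of Ω in T_k established in Step 1 can be checked numerically on a small instance. The sketch below assumes Ω is the nonnegative orthant in two dimensions and uses a hypothetical point `anchor` in the role of ω^k - τ_k F(ω^k). It is an illustration only and not a substitute for the argument above.

```python
import random

def in_half_space(w, anchor, z_bar, eps=1e-12):
    """True if <anchor - z_bar, w - z_bar> <= 0, i.e. w lies in T_k."""
    return sum((a - b) * (c - b) for a, b, c in zip(anchor, z_bar, w)) <= eps

rng = random.Random(0)
anchor = [0.5, -0.75]                    # plays the role of w^k - tau_k F(w^k)
z_bar = [max(0.0, v) for v in anchor]    # its projection onto Omega = {z >= 0}

# Every sampled point of Omega satisfies the defining inequality of T_k.
samples = [[rng.uniform(0, 5), rng.uniform(0, 5)] for _ in range(1000)]
all_inside = all(in_half_space(w, anchor, z_bar) for w in samples)

# T_k is strictly larger than Omega: (-1, 1) is infeasible but lies in T_k.
outside_omega_but_in_T = in_half_space([-1.0, 1.0], anchor, z_bar)
print(all_inside, outside_omega_but_in_T)
```

Because T_k is a single half-space, projecting onto it needs only one inner product and one vector update, whereas projecting onto a general Ω may be more expensive.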

Step 2: A positivity estimate for $d (ω^{k}, {\bar{z}}^{k})$.

From the expression of

d (ω^{k}, {\bar{z}}^{k})

, it follows that

〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉 = {∥ ω^{k} - {\bar{z}}^{k} ∥}^{2} - τ_{k} 〈 ω^{k} - {\bar{z}}^{k}, F (ω^{k}) - F ({\bar{z}}^{k}) 〉 .

By applying the Cauchy–Schwarz inequality together with the line-search condition, we obtain

τ_{k} 〈 ω^{k} - {\bar{z}}^{k}, F (ω^{k}) - F ({\bar{z}}^{k}) 〉 \leq τ_{k} ∥ ω^{k} - {\bar{z}}^{k} ∥ ∥ F (ω^{k}) - F ({\bar{z}}^{k}) ∥ \leq μ ∥ ω^{k} - {\bar{z}}^{k} ∥^{2} .

Therefore,

〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉 \geq (1 - μ) {∥ ω^{k} - {\bar{z}}^{k} ∥}^{2} .

In particular, if

ω^{k} \neq {\bar{z}}^{k}

, then

〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉 > 0,

and hence

ρ_{k}

is well defined.
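Step 2 also gives an explicit lower bound on the step factor, which is worked out here as a short supplement and is not stated in this form in the paper. By the triangle inequality and the line-search condition, whenever ω^k differs from its projected point,

```latex
∥ d (ω^{k}, {\bar{z}}^{k}) ∥ \leq ∥ ω^{k} - {\bar{z}}^{k} ∥ + τ_{k} ∥ F (ω^{k}) - F ({\bar{z}}^{k}) ∥ \leq (1 + μ) ∥ ω^{k} - {\bar{z}}^{k} ∥,

ρ_{k} = \frac{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}{{∥ d (ω^{k}, {\bar{z}}^{k}) ∥}^{2}} \geq \frac{(1 - μ) {∥ ω^{k} - {\bar{z}}^{k} ∥}^{2}}{{(1 + μ)}^{2} {∥ ω^{k} - {\bar{z}}^{k} ∥}^{2}} = \frac{1 - μ}{{(1 + μ)}^{2}} .
```

For the value μ = 0.7 listed in Appendix B, this gives ρ_k ≥ 0.3 / 2.89 ≈ 0.104. The first line also shows that the constant M used in Step 5 can be taken as 1 + μ.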

Step 3: A descent inequality.

Since

z^{k + 1} = P_{T_{k}} (ω^{k} - γ ρ_{k} τ_{k} F ({\bar{z}}^{k}))

and

z^{*} \in T_{k}

, we obtain

∥ z^{k + 1} - z^{*} ∥^{2} \leq ∥ ω^{k} - γ ρ_{k} τ_{k} F ({\bar{z}}^{k}) - z^{*} ∥^{2} - {∥ z^{k + 1} - (ω^{k} - γ ρ_{k} τ_{k} F ({\bar{z}}^{k})) ∥}^{2} .

Expanding the first term gives

∥ z^{k + 1} - z^{*} ∥^{2} \leq ∥ ω^{k} - z^{*} ∥^{2} - 2 γ ρ_{k} τ_{k} 〈 F ({\bar{z}}^{k}), ω^{k} - z^{*} 〉 + γ^{2} ρ_{k}^{2} τ_{k}^{2} {∥ F ({\bar{z}}^{k}) ∥}^{2} .

Using the standard estimate in subgradient extragradient methods together with

z^{*} \in T_{k}

, it holds

∥ z^{k + 1} - z^{*} ∥^{2} \leq ∥ ω^{k} - z^{*} ∥^{2} - γ (2 - γ) ρ_{k}^{2} {∥ d (ω^{k}, {\bar{z}}^{k}) ∥}^{2} .

By the definition of

ρ_{k}

, we have

ρ_{k}^{2} {∥ d (ω^{k}, {\bar{z}}^{k}) ∥}^{2} = \frac{{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}^{2}}{∥ d (ω^{k}, {\bar{z}}^{k}) ∥^{2}} .

Hence,

∥ z^{k + 1} - z^{*} ∥^{2} \leq {∥ ω^{k} - z^{*} ∥}^{2} - γ (2 - γ) \frac{{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}^{2}}{∥ d (ω^{k}, {\bar{z}}^{k}) ∥^{2}} .

Together with Step 2, it follows that

〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉 \geq (1 - μ) {∥ ω^{k} - {\bar{z}}^{k} ∥}^{2},

so the right-hand side contains a nonnegative descent term.

Step 4: Treatment of the inertial term $α_{k} (z^{k} - z^{k - 1})$.

Since

ω^{k} = z^{k} + α_{k} (z^{k} - z^{k - 1}),

it satisfies

ω^{k} - z^{*} = (z^{k} - z^{*}) + α_{k} (z^{k} - z^{k - 1}) .

Combining the expansion

{∥ a + b ∥}^{2} = {∥ a ∥}^{2} + 2 〈 a, b 〉 + {∥ b ∥}^{2}

with Young’s inequality yields the existence of a constant

C > 0

satisfying

∥ ω^{k} - z^{*} ∥^{2} \leq ∥ z^{k} - z^{*} ∥^{2} + C α_{k} ∥ z^{k} - z^{k - 1} ∥ .

Substituting this into the above descent estimate yields

∥ z^{k + 1} - z^{*} ∥^{2} \leq ∥ z^{k} - z^{*} ∥^{2} + C α_{k} ∥ z^{k} - z^{k - 1} ∥ - γ (2 - γ) \frac{{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}^{2}}{∥ d (ω^{k}, {\bar{z}}^{k}) ∥^{2}} .

Since

\sum_{k = 0}^{\infty} α_{k} ∥ z^{k} - z^{k - 1} ∥ < \infty,

the above inequality shows that

{z^{k}}

is quasi-Fejér monotone with respect to

SOL (Ω, F)

. Therefore, the sequence

{z^{k}}

is bounded, and

\sum_{k = 0}^{\infty} \frac{{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}^{2}}{∥ d (ω^{k}, {\bar{z}}^{k}) ∥^{2}} < \infty .

Hence,

\lim_{k \to \infty} \frac{{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}^{2}}{∥ d (ω^{k}, {\bar{z}}^{k}) ∥^{2}} = 0 .
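The summability claim in Step 4 follows from a telescoping sum, spelled out here as a supplement. Writing θ_k (a shorthand introduced here) for the fraction appearing in the descent term and summing the last inequality of Step 4 over k = 0, …, N gives

```latex
γ (2 - γ) \sum_{k = 0}^{N} θ_{k} \leq ∥ z^{0} - z^{*} ∥^{2} - ∥ z^{N + 1} - z^{*} ∥^{2} + C \sum_{k = 0}^{N} α_{k} ∥ z^{k} - z^{k - 1} ∥ \leq ∥ z^{0} - z^{*} ∥^{2} + C \sum_{k = 0}^{\infty} α_{k} ∥ z^{k} - z^{k - 1} ∥ < \infty,

θ_{k} : = \frac{{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}^{2}}{∥ d (ω^{k}, {\bar{z}}^{k}) ∥^{2}} .
```

The factor γ(2 - γ) is positive because the admissible interval for γ lies inside (0, 2) for every α in (0, 1): its upper end 2(1 + α)/(1 + 2α) is below 2 and its lower end is positive. Letting N tend to infinity shows that the series of θ_k converges, so θ_k tends to 0.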

Step 5: Residual convergence.

Since F is Lipschitz continuous and

{ω^{k}}

and

{{\bar{z}}^{k}}

are bounded, there exists

M > 0

such that

∥ d (ω^{k}, {\bar{z}}^{k}) ∥ \leq M ∥ ω^{k} - {\bar{z}}^{k} ∥ .

Combining this with Step 2, we obtain

\frac{{〈 ω^{k} - {\bar{z}}^{k}, d (ω^{k}, {\bar{z}}^{k}) 〉}^{2}}{∥ d (ω^{k}, {\bar{z}}^{k}) ∥^{2}} \geq \frac{{(1 - μ)}^{2} {∥ ω^{k} - {\bar{z}}^{k} ∥}^{4}}{M^{2} {∥ ω^{k} - {\bar{z}}^{k} ∥}^{2}} = \frac{{(1 - μ)}^{2}}{M^{2}} {∥ ω^{k} - {\bar{z}}^{k} ∥}^{2} .

Therefore,

\lim_{k \to \infty} ∥ ω^{k} - {\bar{z}}^{k} ∥ = 0 .

Step 6: Every cluster point solves $V I (Ω, F)$.

Assume that

\bar{z}

is a cluster point of

{z^{k}}

. Then, there exists a subsequence

{z^{k_{j}}}

converging to

\bar{z}

, i.e.,

z^{k_{j}} \to \bar{z} .

Since

α_{k} ∥ z^{k} - z^{k - 1} ∥ \to 0

, it follows that

ω^{k_{j}} - z^{k_{j}} \to 0,

and hence

ω^{k_{j}} \to \bar{z} .

Moreover, since

∥ ω^{k_{j}} - {\bar{z}}^{k_{j}} ∥ \to 0

, we also get

{\bar{z}}^{k_{j}} \to \bar{z} .

By the projection formula

{\bar{z}}^{k} = P_{Ω} (ω^{k} - τ_{k} F (ω^{k})),

it holds

〈{\bar{z}}^{k} - (ω^{k} - τ_{k} F (ω^{k})), z - {\bar{z}}^{k}〉 \geq 0, \forall z \in Ω .

Let

k_{j} \to \infty

. By the continuity of F, we obtain

〈 F (\bar{z}), z - \bar{z} 〉 \geq 0, \forall z \in Ω .

Thus,

\bar{z} \in SOL (Ω, F)

.

If

V I (Ω, F)

has a unique solution

z^{*}

, then every cluster point of

{z^{k}}

coincides with

z^{*}

. Since

{z^{k}}

is bounded and all its cluster points are equal, the whole sequence converges to

z^{*}

.

This completes the proof. □

Appendix B. Implementation Details and Parameter Settings

To provide a clear description of the implementation details and parameter settings used in the numerical experiments, all relevant configurations adopted in this study are summarized below.

The main parameters used in the experiments are listed as follows:

Relaxation parameter: $α = 0.9$ ;
Step-size parameter: $γ = 0.5 (\frac{2 (1 + α^{2})}{2 α^{2} + α + 1} + \frac{2 (1 + α)}{1 + 2 α})$ ;
Initial step size: $τ_{0} = 1$ ;
Regularization parameter: $μ = 0.7$ ;
Control parameter: $δ = 0.3$ ;
Secondary step size: $τ^{'} = 0.4$ ;
Scaling parameter: $η = 0.5$ ;
Penalty factor: $λ = 1000$ .
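The step-size parameter γ in the list above is the midpoint of the admissible interval from Theorem 1. The short Python sketch below evaluates that interval and its midpoint. The function names are hypothetical, and the formulas are copied from the parameter list and the theorem statement.

```python
def gamma_interval(alpha):
    """Open interval of admissible gamma values for a given alpha."""
    lower = 2 * (1 + alpha**2) / (2 * alpha**2 + alpha + 1)
    upper = 2 * (1 + alpha) / (1 + 2 * alpha)
    return lower, upper

def gamma_midpoint(alpha):
    """Midpoint of the interval, matching the step-size parameter listed above."""
    lower, upper = gamma_interval(alpha)
    return 0.5 * (lower + upper)

lower, upper = gamma_interval(0.9)
gamma = gamma_midpoint(0.9)
print(lower, upper, gamma)
```

For α = 0.9 the interval is roughly (1.028, 1.357) and the midpoint is about 1.193. The interval narrows as α decreases, for example to roughly (1.804, 1.833) at α = 0.1, so the choice of γ is tied to the choice of α.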

The maximum number of iterations is set to

I T_{max} = 3000

in the standard experiments. For the experiments with increased proportions of multi-height cells,

I T_{max}

is increased to 5000 to ensure sufficient convergence. The stopping tolerance is set to

ε = 10^{- 6}

in the first two subsections. In the parameter sensitivity analysis (Section 4.3), a stricter tolerance

ε = 10^{- 7}

is also considered.

The algorithms terminate when one of the following conditions is satisfied:

$R E S < ε$ ;
$I T_{max}$ is reached.

To generate the multi-height cell distributions, a fixed random seed (seed = 1234) is used when selecting cells to be converted into double-height cells. Unless otherwise specified, the same parameter settings are applied to all benchmark instances.
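The seeded selection of double-height cells can be sketched as below. This is an assumption-laden illustration: cells are modeled as (width, height) pairs, the function name `make_double_height` is hypothetical, and Python's generator will not reproduce the subset chosen by the authors' C++ code even with the same seed value. The sketch only shows the mechanism of a fixed seed giving a repeatable selection.

```python
import random

def make_double_height(cells, fraction=0.10, seed=1234):
    """Double the height and halve the width of a seeded random subset of cells."""
    rng = random.Random(seed)
    count = int(round(fraction * len(cells)))
    chosen = set(rng.sample(range(len(cells)), count))
    return [(w / 2, 2 * h) if i in chosen else (w, h)
            for i, (w, h) in enumerate(cells)]

# Fifty identical single-row cells of width 4 and height 1 (placeholder data).
cells = [(4.0, 1.0)] * 50
mixed = make_double_height(cells)
print(sum(1 for _, h in mixed if h == 2.0))
```

Doubling the height while halving the width keeps each cell's area unchanged, so the design density is the same before and after the conversion.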

All experiments are implemented in C++ and executed on a workstation equipped with an Intel Core i5 processor and 32 GB RAM running Windows 11 (64 bit). The programs are compiled using Microsoft Visual Studio Community 2022 (64 bit) version 17.11.4 with the MSVC compiler (version 19.41). The executable scripts used to generate the reported experimental results are available from the authors upon reasonable request.

References

1. Chen, J.-L.; Zhu, Z.-R.; Zhu, W.-X.; Chang, Y.-W. Toward optimal legalization for mixed-cell-height circuit designs. In Proceedings of the Annual Design Automation Conference, Austin, TX, USA, 18–22 June 2017; pp. 1–6.
2. Li, H.-C.; Chow, W.-K.; Yu, G.-J.C.G.B.; Young, E.F. Pin-accessible legalization for mixed-cell-height circuits. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2021, 41, 143–154.
3. Zhou, C.-C.; Qiu, J.; Cao, Y.; Yang, G.-C.; Shen, Q.-Q.; Shi, Q. An accelerated modulus-based matrix splitting iteration method for mixed-size cell circuits legalization. Integration 2023, 88, 20–31.
4. Zhou, C.-C.; Cao, Y.; Shi, Q.; Wang, L.-X.; Wen, X.-Q. A robust Newton iteration method for mixed-cell-height circuit legalization under technology and region constraints. ACM Trans. Des. Autom. Electron. Syst. 2024, 29, 1–25.
5. Zhou, C.-C.; Cao, Y.; Yang, F.; Wen, X.-Q.; Shi, Q.; Rong, R. An accelerated Newton-based matrix splitting iteration method for mixed-cell-height circuit legalization. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2026, 45, 1535–1548.
6. Spindler, P.; Schlichtmann, U.; Johannes, F.M. Abacus: Fast legalization of standard cell circuits with minimal movement. In Proceedings of the 2008 International Symposium on Physical Design, Portland, OR, USA, 13–16 April 2008; pp. 47–53.
7. Hill, D. Method and System for High Speed Detailed Placement of Cells within an Integrated Circuit Design. U.S. Patent 6,370,673B1, 9 April 2002.
8. Darav, N.K.; Kennings, A.; Tabrizi, A.F.; Westwick, D.; Behjat, L. Eh?Placer: A high performance modern technology-driven placer. ACM Trans. Des. Autom. Electron. Syst. 2016, 21, 1–27.
9. Puget, J.C.; Flach, G.; Johann, M.; Reis, R. Jezz: An effective legalization algorithm for minimum displacement. In Proceedings of the 28th Symposium on Integrated Circuits and Systems Design, Salvador, Brazil, 31 August–4 September 2015; pp. 1–5.
10. Chen, J.-L.; Lin, Z.-F.; Xie, Y.-Y.; Zhu, W.-X.; Chang, Y.-W. Mixed-cell-height placement with complex minimum-implant-area constraints. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2021, 41, 4639–4652.
11. Chow, W.-K.; Pui, C.-W.; Young, E.F. Legalization algorithm for multiple row height standard cell design. In Proceedings of the 53rd Annual Design Automation Conference, Austin, TX, USA, 5–9 June 2016; pp. 1–6.
12. Lin, Z.-Y.; Chang, Y.-W. A row-based algorithm for non-integer multiple-cell-height placement. In Proceedings of the 2021 IEEE/ACM International Conference on Computer Aided Design, Munich, Germany, 1–4 November 2021; pp. 1–6.
13. Wang, C.-H.; Wu, Y.-Y.; Chen, J.-L.; Chang, Y.-W.; Kuo, S.-Y.; Zhu, W.-X.; Fan, G.-H. An effective legalization algorithm for mixed-cell-height standard cells. In Proceedings of the 2017 22nd Asia and South Pacific Design Automation Conference, Chiba, Japan, 16–19 January 2017; pp. 450–455.
14. Brenner, U. BonnPlace legalization: Minimizing movement by iterative augmentation. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2013, 32, 1215–1227.
15. Cho, M.; Ren, H.; Xiang, H.; Puri, R. History-based VLSI legalization using network flow. In Proceedings of the 47th Design Automation Conference, Anaheim, CA, USA, 13–18 June 2010; pp. 286–291.
16. Hung, C.-Y.; Chou, P.-Y.; Mak, W.-K. Mixed-cell-height standard cell placement legalization. In Proceedings of the Great Lakes Symposium on VLSI 2017, Banff, AB, Canada, 10–12 May 2017; pp. 149–154.
17. Cottle, R.W.; Pang, J.-S.; Stone, R.E. The Linear Complementarity Problem; SIAM: Philadelphia, PA, USA, 2009.
18. Bai, Z.-Z. Modulus-based matrix splitting iteration methods for linear complementarity problems. Numer. Linear Algebra Appl. 2010, 17, 917–933.
19. Wang, L.-X.; Cao, Y.; Shen, Q.-Q. Two variants of robust two-step modulus-based matrix splitting iteration methods for mixed-cell-height circuit legalization problem. Commun. Appl. Math. Comput. 2025, 7, 1769–1790.
20. Chen, J.-L.; Zhu, Z.-R.; Zhu, W.-X.; Chang, Y.-W. A robust modulus-based matrix splitting iteration method for mixed-cell-height circuit legalization. ACM Trans. Des. Autom. Electron. Syst. 2020, 26, 1–28.
21. Cao, Y.; Shi, Q.; Zhu, S.-L. A relaxed generalized Newton iteration method for generalized absolute value equations. AIMS Math. 2021, 6, 1258–1275.
22. Wang, A.; Cao, Y.; Chen, J.-X. Modified Newton-type iteration methods for generalized absolute value equations. J. Optim. Theory Appl. 2019, 181, 216–230.
23. Zhu, Z.-R.; Chen, J.-L.; Zhu, W.-X.; Chang, Y.-W. Mixed-cell-height legalization considering technology and region constraints. In Proceedings of the International Conference on Computer-Aided Design, San Diego, CA, USA, 5–8 November 2018; pp. 1–8.
24. Chen, J.-L.; Zhu, Z.-R.; Guo, L.; Tseng, Y.-W.; Chang, Y.-W. Mixed-cell-height placement with drain-to-drain abutment and region constraints. IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst. 2022, 41, 1103–1115.
25. Facchinei, F.; Pang, J.-S. Finite-Dimensional Variational Inequalities and Complementarity Problems; Springer: New York, NY, USA, 2003.
26. Korpelevich, G.M. The extragradient method for finding saddle points and other problems. Ekon. Mat. Metod. 1976, 12, 747–756.
27. He, B.-S. A class of projection and contraction methods for monotone variational inequalities. Appl. Math. Optim. 1997, 35, 69–76.
28. He, B.-S.; Liao, L.-Z. Improvements of some projection methods for monotone nonlinear variational inequalities. J. Optim. Theory Appl. 2002, 112, 111–128.
29. He, B.-S. A new method for a class of linear variational inequalities. Math. Program. 1994, 66, 137–144.
30. Gao, X.; Cai, X.-J.; Wang, X. Self-adaptive inertial projection and contraction algorithm for monotone variational inequality. Asia-Pac. J. Oper. Res. 2022, 39, 2150021.
31. Cai, X.-J.; Gu, G.-Y.; He, B.-S. On the O(1/t) convergence rate of the projection and contraction methods for variational inequalities with Lipschitz continuous monotone operators. Comput. Optim. Appl. 2014, 57, 339–363.
32. Dong, Q.-L.; Cho, Y.-J.; Zhong, L.-L.; Rassias, T.M. Inertial projection and contraction algorithms for variational inequalities. J. Glob. Optim. 2018, 70, 687–704.
33. Censor, Y.; Gibali, A.; Reich, S. The subgradient extragradient method for solving variational inequalities in Hilbert space. J. Optim. Theory Appl. 2011, 148, 318–335.
34. Bustany, I.S.; Chinnery, D.; Shinnerl, J.R.; Yutsis, V. ISPD 2015 benchmarks with fence regions and routing blockages for detailed-routing-driven placement. In Proceedings of the Symposium on International Symposium on Physical Design, Monterey, CA, USA, 29 March–1 April 2015; pp. 157–164.

Figure 2. Our legalization flow.

Figure 3. Average CPU time and iterations versus $α$ for SIPCA and SIPCA_IP on seven benchmarks.

Figure 4. Comparison of iterations and CPU time under different stopping tolerances, $ε = 10^{-6}$ and $ε = 10^{-7}$.

Figure 5. Comparison of overlaps and total displacement under different stopping tolerances, $ε = 10^{-6}$ and $ε = 10^{-7}$.

Figure 6. Time–precision trade-off curves averaged over seven benchmarks.

Figure 7. Complexity trend with respect to matrix size measured by the number of nonzero elements (nnz(A)).

Table 1. Comparison between VI-based and LCP-based formulations.

Aspect	VI Formulation	LCP Formulation
Model type	Variational inequality	Linear complementarity
Matrix requirement	Monotone	Symmetric PD or $H_{+}$
Applicability to nonsymmetric PSD	Naturally applicable	May require special treatment
Typical solution methods	Projection-type methods	MMS and related modulus methods

Table 2. Statistics of the benchmarks.

Benchmark	T.Cell	S.Cell	D.Cell	Dens.	W.Size	E.Size
des_perf_a	108,488	99,975	8513	43%	116,375 × 116,201	8513 × 116,801
des_perf_b	112,644	103,842	8802	50%	121,146 × 121,446	8802 × 121,446
fft_2	32,281	30,297	1984	50%	34,094 × 34,265	1984 × 34,265
fft_a	30,625	28,718	1907	25%	32,132 × 32,532	1907 × 32,532
fft_b	30,625	28,718	1907	28%	32,172 × 32,532	1907 × 32,532
pci_bridge32_a	29,517	26,268	3249	38%	32,566 × 32,766	3249 × 32,766
pci_bridge32_b	28,914	25,734	3180	14%	31,694 × 32,094	3180 × 32,094

Table 3. Comparison of convergence performance for SIPCA and SIPCA_IP.

Benchmark	SIPCA IT	SIPCA RES	SIPCA CPU Time (s)	SIPCA_IP IT	SIPCA_IP RES	SIPCA_IP CPU Time (s)
des_perf_a	2916	8.3351 $\times 10^{- 7}$	4.806	716	9.4716 $\times 10^{- 7}$	2.135
des_perf_b	1859	9.9046 $\times 10^{- 7}$	3.408	833	8.8778 $\times 10^{- 7}$	2.978
fft_2	819	9.0416 $\times 10^{- 7}$	2.538	493	9.0658 $\times 10^{- 7}$	1.270
fft_a	2120	9.7426 $\times 10^{- 7}$	3.340	1329	9.6865 $\times 10^{- 7}$	2.962
fft_b	1165	9.4368 $\times 10^{- 7}$	2.103	451	9.8451 $\times 10^{- 7}$	1.228
pci_bridge32_a	1311	8.7244 $\times 10^{- 7}$	2.376	676	9.6041 $\times 10^{- 7}$	1.887
pci_bridge32_b	1616	8.3576 $\times 10^{- 7}$	2.870	1206	8.8199 $\times 10^{- 7}$	2.170
N. Avg.	2.069	-	1.467	1.000	-	1.000
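
The N. Avg. row is not defined in this part of the text. As a hedged reading, its IT entries in Table 3 are consistent with dividing each method's column average by the corresponding SIPCA_IP average, so SIPCA_IP is normalized to 1.000. The sketch below illustrates that assumed computation on the IT columns of Table 3; the helper name `normalizedAverage` and the constant names are hypothetical and are not taken from the authors' code.

```cpp
#include <cassert>
#include <numeric>
#include <vector>

// Hypothetical helper (an assumption, not the authors' code): ratio of the
// column average of `method` to the column average of `reference`.
// Both vectors hold one value per benchmark, in the same benchmark order.
double normalizedAverage(const std::vector<double>& method,
                         const std::vector<double>& reference) {
    assert(!method.empty() && method.size() == reference.size());
    const double methodSum =
        std::accumulate(method.begin(), method.end(), 0.0);
    const double referenceSum =
        std::accumulate(reference.begin(), reference.end(), 0.0);
    assert(referenceSum > 0.0);
    // The vectors have equal length, so the ratio of the sums equals
    // the ratio of the averages.
    return methodSum / referenceSum;
}

// IT columns copied from Table 3 (seven benchmarks, same row order).
const std::vector<double> kSipcaIterations = {
    2916, 1859, 819, 2120, 1165, 1311, 1616};
const std::vector<double> kSipcaIpIterations = {
    716, 833, 493, 1329, 451, 676, 1206};
```

Calling `normalizedAverage(kSipcaIterations, kSipcaIpIterations)` gives a value close to the 2.069 reported for SIPCA in the IT column, which is what motivates this reading of the row.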

Table 4. Comparison of overlaps and total displacement for SIPCA and SIPCA_IP.

Benchmark	SIPCA Overlaps Before	SIPCA Disp. Before	SIPCA Disp. After	SIPCA R.Time (s)	SIPCA_IP Overlaps Before	SIPCA_IP Disp. Before	SIPCA_IP Disp. After	SIPCA_IP R.Time (s)
des_perf_a	10	72,277	72,436	0.004	9	71,227	71,432	0.005
des_perf_b	5	71,406	71,579	0.002	2	69,451	69,569	0.001
fft_2	17	20,008	20,050	0.005	15	20,018	20,047	0.005
fft_a	9	18,142	18,191	0.005	3	18,044	18,096	0.005
fft_b	13	21,024	21,135	0.005	11	20,697	20,970	0.005
pci_bridge32_a	3	26,121	26,195	0.003	5	26,124	26,192	0.004
pci_bridge32_b	9	26,152	26,357	0.005	1	26,217	26,323	0.002
N. Avg.	1.457	1.003	1.002	1.095	1.000	1.000	1.000	1.000

Table 5. Statistics of the benchmarks (20% double-height cells).

Benchmark	T.Cell	S.Cell	D.Cell	W.Size	E.Size
des_perf_a	108,488	86,830	21,658	129,520 × 129,946	21,658 × 129,946
des_perf_b	112,644	90,115	22,529	134,873 × 135,173	22,529 × 135,173
fft_2	32,281	25,825	6456	38,566 × 38,737	6456 × 38,737
fft_a	30,625	24,500	6125	36,386 × 36,750	6125 × 36,750
fft_b	30,625	24,500	6125	36,386 × 36,750	6125 × 36,750
pci_bridge32_a	29,517	23,614	5903	35,220 × 35,420	5903 × 35,420
pci_bridge32_b	28,914	23,131	5783	34,297 × 34,697	5783 × 34,697

Table 6. Comparison of convergence performance for SIPCA and SIPCA_IP (20% double-height cells).

Benchmark	SIPCA IT	SIPCA RES	SIPCA CPU Time (s)	SIPCA_IP IT	SIPCA_IP RES	SIPCA_IP CPU Time (s)
des_perf_a	5000	6.4373 $\times 10^{- 5}$	169.427	1825	8.8675 $\times 10^{- 7}$	40.535
des_perf_b	3994	9.9402 $\times 10^{- 7}$	145.228	861	7.5326 $\times 10^{- 7}$	37.259
fft_2	4903	3.4441 $\times 10^{- 7}$	17.942	754	8.7911 $\times 10^{- 7}$	2.066
fft_a	5000	6.0134 $\times 10^{- 5}$	11.008	1735	9.8850 $\times 10^{- 7}$	4.548
fft_b	4128	3.0127 $\times 10^{- 7}$	15.956	806	9.2940 $\times 10^{- 7}$	7.278
pci_bridge32_a	4626	1.2187 $\times 10^{- 7}$	16.348	1060	7.5668 $\times 10^{- 7}$	10.739
pci_bridge32_b	5000	2.0488 $\times 10^{- 5}$	26.044	1602	9.8334 $\times 10^{- 7}$	15.799

Table 7. Controlled comparison under unified experimental settings.

Benchmark	Method	Disp. Before	Disp. After	Overlaps Before	Iter.	CPU (s)
des_perf_a	MMS [1]	71,851	72,561	15	146	2.625
	RMMS [20]	70,118	70,390	12	135	2.568
	RN [4]	71,727	71,908	10	56	2.712
	SIPCA	72,277	72,436	10	2916	4.810
	SIPCA_IP	71,227	71,432	9	716	2.141
des_perf_b	MMS [1]	71,686	71,888	5	81	3.089
	RMMS [20]	69,467	69,839	4	51	2.132
	RN [4]	71,304	71,351	4	8	2.987
	SIPCA	71,406	71,579	5	1859	3.411
	SIPCA_IP	69,451	69,569	2	833	2.979
fft_2	MMS [1]	20,862	20,979	10	56	1.487
	RMMS [20]	20,154	20,337	16	63	1.548
	RN [4]	19,069	20,152	18	15	1.352
	SIPCA	20,008	20,050	17	819	2.543
	SIPCA_IP	20,018	20,047	15	493	1.275
fft_a	MMS [1]	18,146	18,304	10	106	2.897
	RMMS [20]	17,136	17,460	15	103	3.192
	RN [4]	18,192	18,215	7	20	3.187
	SIPCA	18,142	18,191	9	2120	3.345
	SIPCA_IP	18,044	18,096	3	1329	2.967
fft_b	MMS [1]	21,459	21,671	9	91	1.479
	RMMS [20]	20,160	20,216	10	124	1.205
	RN [4]	21,192	21,235	8	10	1.432
	SIPCA	21,024	21,135	13	1165	2.108
	SIPCA_IP	20,697	20,970	11	451	1.233
pci_bridge32_a	MMS [1]	26,192	26,289	7	92	2.215
	RMMS [20]	25,621	25,978	10	84	2.134
	RN [4]	26,012	26,199	5	12	2.101
	SIPCA	26,121	26,195	3	1311	2.379
	SIPCA_IP	26,124	26,192	5	676	1.891
pci_bridge32_b	MMS [1]	25,984	26,028	9	121	2.621
	RMMS [20]	25,983	26,028	8	85	2.574
	RN [4]	26,142	26,330	10	10	2.634
	SIPCA	26,152	26,357	9	1616	2.875
	SIPCA_IP	26,217	26,232	1	1206	2.172

Table 8. Comparison of total displacement of five legalization methods.

Benchmark	Disp. (After)
Benchmark	[1]	[20]	[4]	SIPCA	SIPCA_IP
des_perf_a	72,561	70,390	71,908	72,436	72,432
des_perf_b	71,888	69,839	71,351	71,579	69,569
fft_2	20,979	20,337	20,152	20,050	20,047
fft_a	18,304	17,460	18,215	18,191	18,096
fft_b	21,671	20,216	21,235	21,135	20,970
pci_bridge32_a	26,289	25,978	26,199	26,195	26,192
pci_bridge32_b	26,028	26,028	26,330	26,357	26,232
N. Avg.	1.021	0.991	1.011	1.009	1.000

Table 9. Comparison of convergence performance for SIPCA and SIPCA_IP ($ε = 10^{-7}$).

Benchmark	SIPCA IT	SIPCA RES	SIPCA CPU Time (s)	SIPCA_IP IT	SIPCA_IP RES	SIPCA_IP CPU Time (s)
des_perf_a	3000	9.3251 $\times 10^{- 7}$	5.102	892	7.6247 $\times 10^{- 7}$	2.573
des_perf_b	2168	8.4624 $\times 10^{- 7}$	4.246	916	9.7805 $\times 10^{- 7}$	3.017
fft_2	1297	8.7469 $\times 10^{- 8}$	3.872	763	8.8913 $\times 10^{- 8}$	1.957
fft_a	3000	4.7154 $\times 10^{- 7}$	4.489	2829	5.3007 $\times 10^{- 8}$	3.962
fft_b	1687	9.9695 $\times 10^{- 8}$	2.231	1035	8.7756 $\times 10^{- 8}$	1.708
pci_bridge32_a	2673	9.5030 $\times 10^{- 8}$	3.697	1893	8.1944 $\times 10^{- 8}$	3.368
pci_bridge32_b	3000	8.1283 $\times 10^{- 7}$	4.779	1980	8.7061 $\times 10^{- 8}$	3.510
N. Avg.	1.632	-	1.414	1.000	-	1.000

Table 10. Comparison of overlaps and total displacement for SIPCA and SIPCA_IP ($ε = 10^{-7}$).

Benchmark	SIPCA Overlaps Before	SIPCA Disp. Before	SIPCA Disp. After	SIPCA R.Time (s)	SIPCA_IP Overlaps Before	SIPCA_IP Disp. Before	SIPCA_IP Disp. After	SIPCA_IP R.Time (s)
des_perf_a	10	72,277	72,436	0.004	7	71,227	71,429	0.005
des_perf_b	4	71,406	71,578	0.002	1	69,451	69,560	0.001
fft_2	16	20,008	20,044	0.005	15	20,018	20,039	0.005
fft_a	3	18,142	18,192	0.005	3	18,044	18,093	0.005
fft_b	14	21,024	20,962	0.005	14	20,697	20,962	0.005
pci_bridge32_a	2	26,121	26,192	0.003	2	26,124	26,192	0.004
pci_bridge32_b	5	26,152	26,324	0.005	0	26,304	26,304	0.001
N. Avg.	1.286	1.003	1.002	1.095	1.000	1.000	1.000	1.000

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, L.; Zhou, C.; Shen, Q. An Improved Self-Adaptive Inertial Projection and Contraction Algorithm for Mixed-Cell-Height Circuit Legalization. Electronics 2026, 15, 1720. https://doi.org/10.3390/electronics15081720
