Optimal Boundary-Flux Control of a Sharp Moving Interface in the Classical Two-Phase Stefan Problem

Khalid Ali Alanezy; Jihad Souissi

doi:10.3390/axioms14110840

¹ Department of Mathematics, King Fahd University of Petroleum & Minerals (KFUPM), Dhahran 31261, Saudi Arabia

² Department of Mathematics, Faculty of Sciences, University of Gabes, Gabes 6072, Tunisia

* Author to whom correspondence should be addressed.

Axioms 2025, 14(11), 840; https://doi.org/10.3390/axioms14110840

This article belongs to the Special Issue Nonlinear Analysis and Boundary Value Problems

Abstract

In this paper, we study the optimal boundary control of solidification governed by the classical two-phase Stefan problem with a sharp moving interface. The main objective is to formulate an optimal control problem for interface motion using boundary heat-flux control. The control acts as a Neumann heat flux on a designated boundary segment and steers the interface through the Stefan condition. Using an enthalpy formulation, we prove well-posedness under boundary control and establish Lipschitz continuity of the control-to-state map and continuous dependence on the initial data. We then derive first-order necessary optimality conditions using a Lagrangian approach and propose a practical algorithm that couples a semismooth Newton method with Sequential Quadratic Programming (SQP) to compute the optimal boundary flux.

Keywords:

optimal control; Stefan problem; free boundary problems; boundary control; necessary optimality conditions

MSC:

80A22; 35R35; 49J20; 49M25

1. Introduction

The classical two-phase Stefan problem is a foundational model for melting and solidification in materials such as aluminum [1] and steel [2]. It is a free boundary problem where a sharp solid–liquid interface

Γ_{t}

moves according to the Stefan condition, which balances heat fluxes across

Γ_{t}

with latent heat release or absorption.

Although the sharp-interface formulation directly reflects the physical mechanism, it presents substantial analytical and numerical challenges, especially in higher dimensions. The shifted temperature variable

θ : = T - T_{m}

and the enthalpy formulation provide an alternative on a fixed domain, where temperature is expressed as a piecewise linear function of enthalpy and

θ

is used throughout, with the melting temperature corresponding to

θ = 0

. The latent-heat jump appears as a small regularized slope, and boundary heat fluxes enter naturally in weak form. This reformulation implicitly captures the moving interface as the set where

θ = 0

and enthalpy lies in the latent-heat interval, which is advantageous for boundary-control analysis [3,4].

A primary challenge in controlling solidification processes is managing the moving interface. One avenue adopts a sharp-interface optimal-control formulation, treating the free boundary itself as an optimization variable and using boundary temperature to track a desired interface evolution while explicitly enforcing the Stefan condition [5]. A second, classical approach formulates the Stefan problem as a variational inequality, which enables optimal control under mixed boundary conditions and accommodates the elliptic degeneracy of the original formulation [6]. Complementing these open-loop, optimization-based methods, boundary feedback control through backstepping and energy-shaping designs guarantees exponential stabilization of the interface position at a desired setpoint in both one- and two-phase Stefan problems [7,8]. Optimization-based formulations, whether sharp-interface or variational-inequality, thus serve trajectory design and analysis, while feedback controllers supply real-time stabilization under disturbances and modeling uncertainties. The two families are complementary rather than competing, and together they cover applications ranging from crystal growth to thermal energy storage.

While optimal control of Stefan-type problems has been investigated in several works [5], most existing studies focus on distributed controls or adjustments of the initial state. In contrast, boundary control—directly relevant to practical applications—has not been examined with the same level of analytical detail. In particular, results establishing well-posedness under boundary heat-flux control, stability estimates such as Lipschitz continuity of the control-to-state mapping, and rigorous derivations of adjoint-based optimality conditions remain incomplete. Recent work on networked control systems has explored how asymmetric information and unreliable communication affect controller performance [9,10]. Studies on decentralized and remote–local control architectures under such constraints, though focused on finite-dimensional systems, provide insights on limited information exchange and robustness that are conceptually relevant to boundary control, where sensing and actuation may also be distributed or delayed.

This paper provides a rigorous analysis of the two-phase Stefan problem with boundary heat-flux control in the enthalpy formulation. We prove the well-posedness of the controlled state system and establish stability estimates, including Lipschitz continuity with respect to both control and initial conditions. Building on these results, we formulate a tracking-type optimal control problem for the solidification front and show the existence of optimal controls. We then derive first-order necessary conditions using a Lagrangian approach and analyze the associated adjoint system for this highly nonlinear and nonsmooth control problem. Finally, we introduce a numerical strategy based on semismooth Newton methods combined with Sequential Quadratic Programming (SQP), demonstrating how these theoretical conditions can be implemented computationally.

Our motivation is driven by both theoretical and applied considerations. Mathematically, the Stefan problem belongs to a class of nonlinear free boundary problems that raise fundamental questions about existence, stability, and solution sensitivity. From an applied perspective, solidification processes play a central role in metallurgy, crystal growth, and materials science, where microstructural properties such as grain size and porosity depend strongly on the evolution of the phase interface. The ability to guide these processes through boundary fluxes is of clear technological importance. The remainder of this paper is organized as follows: In Section 2, we establish well-posedness and continuity for the controlled enthalpy model. In Section 3, we formulate the optimal boundary control problem and derive adjoint-based first-order optimality conditions. In Section 4, we present the numerical discretization and algorithms, and we illustrate controlled solidification through computational experiments.

2. The Classical Two-Phase Stefan Problem

The heat equation is a parabolic partial differential equation that is central to thermodynamics and mathematical analysis. Originally formulated to describe heat’s diffusion through a medium, it has since become a cornerstone in both theoretical and applied mathematics.

The Stefan problem provides a moving-interface model for phase-change phenomena such as melting and solidification. The transition occurs within a mushy zone, called the interface, where liquid and solid phases coexist. This model has wide applications in natural sciences and industrial processes [11], as noted in the Introduction.

In the traditional temperature formulation of the two-phase Stefan problem, the location of the phase-change interface must be determined as part of the solution. This solid–liquid boundary is a free boundary whose position is unknown a priori, making the analysis particularly challenging in multiple dimensions. To avoid explicit interface tracking, the problem can be reformulated in terms of enthalpy (internal energy). In the enthalpy method, the heat equation is expressed in terms of internal energy, absorbing the Stefan condition (accounting for latent heat) into the material law rather than imposing it as a separate boundary condition. In what follows, we develop the enthalpy-based formulation for the two-phase Stefan problem.

2.1. Enthalpy Formulation of the Two-Phase Stefan Problem

Enthalpy (internal energy) represents the total heat content of a system, combining sensible heat (temperature variation) and latent heat (associated with phase transitions). For a single-phase medium with shifted temperature

θ (x, t)

and constant specific heats, the enthalpy is expressed as follows:

E (θ) = \{\begin{matrix} ρ c_{s} θ, & θ \leq 0, \\ ρ c_{ℓ} θ + ρ L, & θ > 0, \end{matrix}

where

ρ (x) > 0

is the mass density,

c_{s} (θ)

,

c_{ℓ} (θ)

are the specific heats of the solid and liquid phases, respectively, and L denotes the latent heat associated with the phase transition.

The specific heat c denotes the energy required to raise the temperature of one unit of mass by one degree. In the absence of phase change, E evolves in time according to heat diffusion. With a boundary heat-flux control g prescribed on a portion

Γ_{c}

of the boundary, the standard heat conduction model takes the following form:

E_{t} = κ Δ θ, κ \partial_{ν} θ = g,

(1)

where

κ > 0

is the thermal conductivity and

\partial_{ν}

denotes the outward normal derivative for the Neumann boundary condition. Equation (1) states that the time-rate of change in enthalpy E in a region equals the thermal diffusion of

θ

, and the boundary flux

κ \partial_{ν} θ

is controlled by g.

Now, consider a material that can exist in two phases (solid and liquid), with the phase change occurring at a constant melting (solidification) temperature, which corresponds to the shifted temperature

θ = 0

. In a two-phase (solid–liquid) scenario, the enthalpy as a function of

θ

exhibits a discontinuity at

θ = 0

due to the absorption or release of latent heat L. As the material passes through

θ = 0

, an extra energy

ρ L

(per unit volume) must be absorbed or released without a temperature change. This leads to a jump in the enthalpy–temperature relation, which is how the enthalpy formulation encodes the Stefan condition. That is, enthalpy has a jump of magnitude

ρ L

at

θ = 0

.

The two-phase Stefan problem describes heat diffusion in a medium undergoing a solid–liquid phase change, with a moving interface

Γ_{t}

separating the two phases. We denote by

Ω_{t}^{-} = {x \in Ω : θ < 0}

the solid region and by

Ω_{t}^{+} = {x \in Ω : θ > 0}

the liquid region. Thus,

Γ_{t}

is a moving free boundary, and we assume temperature continuity

θ = 0

on

Γ_{t}

.

Within each phase, heat evolves according to Fourier’s law. The general heat equation is

ρ c \frac{\partial θ}{\partial t} = κ Δ θ,

(2)

Accordingly, the governing equations in the two phases are

\{\begin{matrix} ρ_{s} c_{s} \frac{\partial θ_{s}}{\partial t} = κ_{s} Δ θ_{s}, & x \in Ω_{t}^{-}, \\ ρ_{ℓ} c_{ℓ} \frac{\partial θ_{ℓ}}{\partial t} = κ_{ℓ} Δ θ_{ℓ}, & x \in Ω_{t}^{+}, \end{matrix}

together with the condition

θ_{s} = θ_{ℓ} = 0

on

Γ_{t}

.

On a controlled portion

Γ_{c} \subset \partial Ω

of the external boundary, the normal heat flux is prescribed as a boundary control

g (t, x)

. The boundary condition is given by

κ \partial_{ν} θ = g, x \in Γ_{c},

(3)

where we take

κ

to be the thermal conductivity of the material in contact with

\partial Ω

at

Γ_{c}

. On the remaining boundary

\partial Ω ∖ Γ_{c}

, we impose insulation:

κ \partial_{ν} θ = 0

.

The interface

Γ_{t} = {x \in Ω : θ (t, x) = 0}

has unit normal

ν

and normal velocity

V (t, x)

.

Energy conservation across

Γ_{t}

then implies that the latent heat absorbed or released by the moving boundary balances the jump in conductive heat flux between the two phases:

ρ L V (t, x) = κ_{ℓ} \partial_{ν} θ_{ℓ} - κ_{s} \partial_{ν} θ_{s}, x \in Γ_{t} .

(4)

Here,

ν

points from solid to liquid, and V is the normal velocity in the direction of

ν

.

Collecting the above, the strong form of the two-phase Stefan problem is

\{\begin{matrix} ρ_{s} c_{s} \frac{\partial θ_{s}}{\partial t} = κ_{s} Δ θ_{s}, & x \in Ω_{t}^{-}, θ_{s} = 0 on Γ_{t}, \\ ρ_{ℓ} c_{ℓ} \frac{\partial θ_{ℓ}}{\partial t} = κ_{ℓ} Δ θ_{ℓ}, & x \in Ω_{t}^{+}, θ_{ℓ} = 0 on Γ_{t}, \\ ρ L V (t, x) = κ_{ℓ} \partial_{ν} θ_{ℓ} - κ_{s} \partial_{ν} θ_{s}, & x \in Γ_{t}, \\ κ \partial_{ν} θ = g, & x \in Γ_{c} \subset \partial Ω, \\ κ \partial_{ν} θ = 0, & x \in \partial Ω ∖ Γ_{c} . \end{matrix}

(5)

In System (5), the unknowns are the shifted temperature field

θ (t, x)

and the evolving interface

Γ_{t}

. Given an initial shifted temperature distribution and initial interface

Γ_{0}

, this system determines the coupled dynamics of the shifted temperature in each phase and the interface motion. Physically, the Stefan condition enforces that excess heat at the interface is consumed as latent heat to melt the solid, whereas a deficit of heat leads to solidification, thereby ensuring conservation of energy during the phase transition.

We define a set-valued enthalpy function

γ (θ)

to capture this behavior:

γ (θ) = \{\begin{matrix} ρ c_{ℓ} θ + ρ L, & θ > 0, \\ [0, ρ L], & θ = 0, \\ ρ c_{s} θ, & θ < 0, \end{matrix}

(6)

This strong form is presented for physical clarity; our analysis and subsequent results are derived from the weak enthalpy formulation. Figure 1 illustrates the enthalpy graph [11,12,13], as defined in Equation (6).

Figure 1. The enthalpy set-valued graph

γ (θ)

.

Although

γ

is multi-valued at

θ = 0

, its inverse

β

is single-valued and continuous. Let

β (E)

denote the inverse relation, giving the shifted temperature

θ

as a function of enthalpy E:

β (E) = \{\begin{matrix} \frac{1}{ρ c_{ℓ}} (E - ρ L), & E > ρ L, \\ 0, & 0 \leq E \leq ρ L, \\ \frac{1}{ρ c_{s}} E, & E < 0 . \end{matrix}

(7)

Figure 2 shows the graph of the function

β (E)

. By construction,

β (E)

is continuous and non-decreasing in E. In fact,

β

is Lipschitz-continuous globally with the Lipschitz constant

L_{β} = max \{\frac{1}{ρ c_{s}}, \frac{1}{ρ c_{ℓ}}\} .

Figure 2. The inverse enthalpy graph

β (E)

.

Thus,

β

is a continuous, monotone, Lipschitz function of enthalpy, and

γ

is its (multi-valued) inverse, which is a monotone graph. Throughout, we write

β^{'}

for the a.e. derivative, and identities involving

β^{'}

are understood to hold almost everywhere. Physically,

β (E)

shows that the shifted temperature

θ

remains constant at 0 while enthalpy changes within the latent heat interval

[0, ρ L]

.
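As a minimal illustration (a sketch, not the authors' code), the map β in (7) can be written in a few lines of Python. The material constants are arbitrary placeholder values chosen for the example, and the helper name `beta` is hypothetical. The script samples random pairs of enthalpy values and checks numerically that the difference quotients of β do not exceed the constant L_β stated above.

```python
import numpy as np

# Placeholder material constants (assumed for illustration only).
RHO, C_S, C_L, LATENT = 2.0, 0.5, 1.5, 3.0

def beta(E):
    """Shifted temperature theta = beta(E), following Equation (7)."""
    E = np.asarray(E, dtype=float)
    solid = E / (RHO * C_S)                      # branch E < 0
    liquid = (E - RHO * LATENT) / (RHO * C_L)    # branch E > rho*L
    return np.where(E < 0.0, solid, np.where(E > RHO * LATENT, liquid, 0.0))

# Lipschitz constant L_beta = max{1/(rho c_s), 1/(rho c_l)}.
L_BETA = max(1.0 / (RHO * C_S), 1.0 / (RHO * C_L))

# Sample random pairs and compute difference quotients of beta.
rng = np.random.default_rng(0)
a = rng.uniform(-10.0, 20.0, size=10_000)
b = rng.uniform(-10.0, 20.0, size=10_000)
distinct = a != b
quotients = np.abs(beta(a) - beta(b))[distinct] / np.abs(a - b)[distinct]
max_quotient = quotients.max()
```

Any enthalpy value inside the latent-heat interval is mapped to θ = 0, which is the flat segment of the graph of β.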

Using the enthalpy–temperature relation above, we can reformulate the Stefan problem entirely in terms of enthalpy

E (x, t)

on a fixed domain. Substituting

θ = β (E)

into the heat conduction law (1) gives the governing equation in enthalpy form:

E_{t} = \nabla \cdot (κ \nabla [β (E)]), κ \partial_{ν} [β (E)] = g .

(8)

For simplicity, we take

κ

to be constant in the enthalpy formulation; phase-dependent

κ

can be handled with the same analysis (variable-coefficient elliptic operators).

If a solution

E (x, t)

of (8) is known, the shifted temperature is recovered as

θ (x, t) = β (E (x, t))

. The solid–liquid interface at time

t > 0

is then the set of points

Γ_{t} = {x \in Ω : θ (x, t) = 0} = {x \in Ω : E (x, t) \in [0, ρ L]} .
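On a computational grid, this characterization suggests a simple way to locate the interface from a sampled enthalpy field. The sketch below is only illustrative: the function name `interface_mask`, the placeholder constants, and the synthetic linear enthalpy profile are assumptions made for the example, and on a coarse grid the flagged nodes should be read as cells containing the interface rather than the sharp interface itself.

```python
import numpy as np

RHO, LATENT = 1.0, 1.0  # placeholder values (assumed)

def interface_mask(E, rho=RHO, latent=LATENT):
    """Boolean mask of grid points whose enthalpy lies in [0, rho*L]."""
    E = np.asarray(E, dtype=float)
    return (E >= 0.0) & (E <= rho * latent)

# Synthetic 1D profile: solid (E < 0) on the left, liquid (E > rho*L) on the right.
x = np.linspace(0.0, 1.0, 11)
E = 3.0 * x - 1.0
mask = interface_mask(E)
interface_x = x[mask]   # grid points where theta = beta(E) = 0
```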

2.2. Weak Formulation and Time Discretization

We now establish the weak formulation of the enthalpy equation with boundary heat-flux control. Let

Ω \subset R^{d}

be a fixed domain and

Γ_{c} \subseteq \partial Ω

the portion of the boundary where a Neumann control is prescribed. Multiplying the enthalpy equation by a test function

ϕ \in H^{1} (Ω)

and applying Green’s identity, we obtain

\int_{Ω} E_{t} ϕ d x = \int_{Γ_{c}} g ϕ d s - \int_{Ω} κ \nabla β (E) \cdot \nabla ϕ d x .

(9)

Thus, in the weak form, the control g enters explicitly as a boundary Neumann term on

Γ_{c}

. Equivalently, we may write

{(κ Δ β (E), ϕ)}_{Ω} = - {(κ \nabla β (E), \nabla ϕ)}_{Ω} + {(g, ϕ)}_{Γ_{c}} .

(10)

The weak formulation implicitly encodes the global energy balance. The compatibility condition

\frac{d}{d t} \int_{Ω} E (t, x) d x = \int_{Γ_{c}} g (t, s) d s,

expresses conservation of total enthalpy in the domain. Specifically, the rate of change in total enthalpy in the domain must equal the net heat flux through the controlled boundary.

To construct solutions, we discretize in time with step size

Δ t > 0

. The backward Euler scheme for the enthalpy formulation is

\frac{E^{n} - E^{n - 1}}{Δ t} = κ Δ β (E^{n}), κ \partial_{ν} β (E^{n}) = g^{n} .

(11)

We introduce the convex functional

j : R \to R

, whose subdifferential

\partial j = γ

characterizes the enthalpy–temperature relation:

j (β) = \{\begin{matrix} \frac{1}{2} ρ c_{s} β^{2}, & β < 0, \\ \frac{1}{2} ρ c_{ℓ} β^{2} + ρ L β, & β \geq 0, \end{matrix}

(12)

Then, (11) is equivalent to the variational inclusion

κ Δ t Δ β (E^{n}) + E^{n - 1} \in \partial j (β (E^{n})), κ \partial_{ν} β (E^{n}) = g^{n} .

(13)

The weak solution

β (E^{n})

is obtained as the unique minimizer of the convex functional

J_{n} (β) = \int_{Ω} (\frac{1}{2} κ Δ t {| \nabla β |}^{2} + j (β) - E^{n - 1} (x) β) d x - Δ t {(g^{n}, β)}_{Γ_{c}} .

(14)
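To see the link between the inclusion and the minimization in the simplest setting, one can look at a scalar analogue in which the Dirichlet energy is replaced by a quadratic term: minimize ½ a θ² + j(θ) − r θ over θ ∈ R, whose optimality condition is a θ + ∂j(θ) ∋ r. The Python sketch below is an illustration under that simplification, with placeholder constants and hypothetical helper names; it compares the closed-form solution of the scalar inclusion with a brute-force minimization on a fine grid.

```python
import numpy as np

# Placeholder material constants (assumed for illustration only).
RHO, C_S, C_L, LATENT = 1.0, 2.0, 1.0, 1.5

def j(theta):
    """Convex potential from Equation (12); its subdifferential is gamma."""
    theta = np.asarray(theta, dtype=float)
    return np.where(theta < 0.0,
                    0.5 * RHO * C_S * theta**2,
                    0.5 * RHO * C_L * theta**2 + RHO * LATENT * theta)

def resolvent(r, a):
    """Closed-form solution of the scalar inclusion a*theta + gamma(theta) contains r."""
    if r < 0.0:
        return r / (RHO * C_S + a)                    # solid branch
    if r > RHO * LATENT:
        return (r - RHO * LATENT) / (RHO * C_L + a)   # liquid branch
    return 0.0                                        # latent-heat interval

def brute_force_min(r, a, grid):
    """Grid minimizer of 0.5*a*theta^2 + j(theta) - r*theta."""
    values = 0.5 * a * grid**2 + j(grid) - r * grid
    return grid[np.argmin(values)]

grid = np.linspace(-5.0, 5.0, 200_001)   # spacing 5e-5
a = 0.8
results = [(r, resolvent(r, a), brute_force_min(r, a, grid))
           for r in (-3.0, 0.7, 4.0)]
```

The middle case, with r inside the latent-heat interval, returns θ = 0, which is the scalar counterpart of a point sitting on the interface.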

The functional

J_{n}

is coercive and convex. While the Dirichlet energy term ensures convexity, we note that

j (β)

is not strictly convex over the interval where

β (E) = 0

(the mushy region). Under the given Neumann boundary conditions, the constant nullspace of the Laplacian combined with the non-strict convexity of

j (β)

in the mushy region means that uniqueness of the minimizer requires additional regularity conditions. We address this through regularization in Section 3, where we introduce a strictly convex approximation that guarantees both existence and uniqueness.
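The backward Euler step (11) can be sketched in one space dimension as follows. This is a minimal illustration under several assumptions that are not taken from the paper: a cell-centered finite-volume grid on (0, 1), an insulated left end, the control flux g applied at the right end, placeholder material constants, and a nonlinear Gauss–Seidel iteration as the solver (chosen here for brevity; it is not the semismooth Newton/SQP strategy described in the paper). The run then checks the discrete analogue of the global energy balance.

```python
import numpy as np

# Placeholder constants (assumed for illustration only).
RHO, C_S, C_L, LATENT, KAPPA = 1.0, 1.0, 1.0, 1.0, 1.0

def beta(E):
    """theta = beta(E), Equation (7)."""
    E = np.asarray(E, dtype=float)
    return np.where(E < 0.0, E / (RHO * C_S),
                    np.where(E > RHO * LATENT,
                             (E - RHO * LATENT) / (RHO * C_L), 0.0))

def resolvent(r, a):
    """Solve the scalar inclusion gamma(theta) + a*theta contains r."""
    if r < 0.0:
        return r / (RHO * C_S + a)
    if r > RHO * LATENT:
        return (r - RHO * LATENT) / (RHO * C_L + a)
    return 0.0

def backward_euler_step(E_old, g, dt, h, tol=1e-12, max_sweeps=10_000):
    """One implicit step of E_t = kappa * theta_xx with flux g at the right end."""
    n = E_old.size
    lam = dt * KAPPA / h**2
    src = np.zeros(n)
    src[-1] = dt * g / h                  # boundary flux enters the last cell
    theta = beta(E_old)                   # initial guess
    for _ in range(max_sweeps):
        change = 0.0
        for i in range(n):
            nbr_sum, a = 0.0, 0.0
            if i > 0:
                nbr_sum += theta[i - 1]
                a += lam
            if i < n - 1:
                nbr_sum += theta[i + 1]
                a += lam
            new = resolvent(E_old[i] + src[i] + lam * nbr_sum, a)
            change = max(change, abs(new - theta[i]))
            theta[i] = new
        if change < tol:
            break
    # Recover the enthalpy in conservative (flux-difference) form.
    flux = lam * np.diff(theta)
    E_new = E_old + src
    E_new[:-1] += flux
    E_new[1:] -= flux
    return E_new, theta

n, dt, g, steps = 20, 0.0025, -2.0, 40     # g < 0 extracts heat (cooling)
h = 1.0 / n
E = np.full(n, RHO * LATENT + 0.5 * RHO * C_L)   # liquid at theta = 0.5
total0 = h * E.sum()
for _ in range(steps):
    E, theta = backward_euler_step(E, g, dt, h)
total = h * E.sum()
```

In this run the total enthalpy should change by `steps * dt * g`, mirroring the statement that the rate of change in total enthalpy equals the net flux through the controlled boundary.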

2.3. A Priori Bounds for the Finite-Difference Solution

We derive uniform a priori estimates for the finite-difference scheme, which are key for compactness and for passing to the continuous-time limit as

Δ t \to 0

. Such estimates for enthalpy formulations of the Stefan problem go back to Crank [14], Damlamian [13], White [3], and Voller–Cross [4], with rigorous numerical error analysis by Elliott [15].

The main tool is the compactness result known as the Aubin–Lions compactness lemma. This result is significant in the theory of Sobolev spaces and the analysis of nonlinear evolution equations, as it provides a criterion for compact embedding into the space

L^{2} (0, T; X)

. In particular, it is used to extract strongly convergent subsequences from families of approximate solutions, such as those produced by Galerkin or time-discretization schemes. The lemma is stated as follows:

Lemma 1

(Aubin–Lions–Simon compactness lemma [16,17]). Let

X_{0}

, X,

X_{1}

be Banach spaces with

X_{0} \subseteq X \subseteq X_{1}

, where

X_{0}

is compactly embedded in X and X is continuously embedded in

X_{1}

. For

1 \leq p, q \leq \infty

, set

W = {u \in L^{p} (0, T; X_{0}) : \partial_{t} u \in L^{q} (0, T; X_{1})} .

Then, if

p < \infty

, the embedding

W ↪ L^{p} (0, T; X)

is compact, and if

p = \infty

and

q > 1

, the embedding

W ↪ C ([0, T]; X)

is compact.

While the original proof by Aubin required $X_{0}$ and $X_{1}$ to be reflexive, Simon [18] later showed that the result holds without this assumption.

Theorem 1

(Uniform bounds). Let

Ω \subset R^{d}

be a bounded Lipschitz domain with

Γ_{c} \subset \partial Ω

measurable, and let

t_{n} = n Δ t

with

N Δ t = T

. Assume that

κ > 0

and

E^{0} \in L^{2} (Ω)

. Then, the discrete solution

(E^{n}, β (E^{n}))

of (11) satisfies bounds independent of

Δ t

:

(i): Assume that $g \in L^{2} (0, T; H^{- 1 / 2} (Γ_{c}))$ . Then, the following energy bound holds:

$\int_{Ω} Φ (E^{n}) d x + κ Δ t \sum_{k = 1}^{n} ∥ \nabla β (E^{k}) ∥_{L^{2} (Ω)}^{2} \leq \int_{Ω} Φ (E^{0}) d x + C \sum_{k = 1}^{n} Δ t {∥ g^{k} ∥}_{H^{- 1 / 2} (Γ_{c})}^{2} .$

(15)
(ii): Assume in addition that $g \in L^{1} (0, T; L^{1} (Γ_{c}))$ . Then, the solution satisfies the following $L^{1}$ bound:

$\int_{Ω} | E^{n} | d x \leq M, \int_{Ω} | β (E^{n}) | d x \leq L_{β} M,$

(16)

where M depends only on $∥ E^{0} ∥_{L^{1} (Ω)}$ and ${∥ g ∥}_{L^{1} (0, T; L^{1} (Γ_{c}))}$, and $L_{β}$ is the Lipschitz constant of $β$.

Proof.

(i) Define the convex potential function

Φ (E) = \int_{0}^{E} β (z) d z = \{\begin{matrix} \frac{1}{2 ρ c_{s}} E^{2}, & E < 0, \\ 0, & 0 \leq E \leq ρ L, \\ \frac{1}{2 ρ c_{ℓ}} {(E - ρ L)}^{2}, & E > ρ L . \end{matrix}

(17)

For a convex differentiable function f, we have

f (\hat{x}) \geq f (x) + f^{'} (x) (\hat{x} - x), \forall \hat{x} .

(18)

Recall the discrete Equation (11); applying (18) with

f = Φ

,

x = E^{n}

,

\hat{x} = E^{n - 1}

yields

Φ (E^{n - 1}) - Φ (E^{n}) \geq (β (E^{n}), E^{n - 1} - E^{n}) .

(19)

Using Green’s identity and substituting into (19) gives

\int_{Ω} Φ (E^{n}) d x - \int_{Ω} Φ (E^{n - 1}) d x + κ Δ t {∥ \nabla β (E^{n}) ∥}_{L^{2} (Ω)}^{2} \leq Δ t {(g^{n}, β (E^{n}))}_{Γ_{c}} .

(20)

Summing over

k = 1, \dots, n

formally gives

\int_{Ω} Φ (E^{n}) d x + \sum_{k = 1}^{n} κ Δ t ∥ \nabla β (E^{k}) ∥_{L^{2} (Ω)}^{2} \leq \int_{Ω} Φ (E^{0}) d x + \sum_{k = 1}^{n} Δ t {∥ g^{k} ∥}_{H^{- 1 / 2} (Γ_{c})}^{2},

(21)

once the boundary term is estimated as follows. By the trace inequality and Young’s inequality,

Δ t {(g^{n}, β (E^{n}))}_{Γ_{c}} \leq Δ t ∥ g^{n} ∥_{H^{- 1 / 2} (Γ_{c})} ∥ β (E^{n}) ∥_{H^{1 / 2} (Γ_{c})} \leq C_{tr} Δ t ∥ g^{n} ∥_{H^{- 1 / 2} (Γ_{c})} {∥ β (E^{n}) ∥}_{H^{1} (Ω)}

\leq \frac{κ}{2} Δ t ∥ \nabla β (E^{n}) ∥_{L^{2} (Ω)}^{2} + C Δ t ∥ g^{n} ∥_{H^{- 1 / 2} (Γ_{c})}^{2} + C Δ t {∥ β (E^{n}) ∥}_{L^{2} (Ω)}^{2} .

Using the Lipschitz property of

β

,

∥ β (E^{n}) ∥_{L^{2} (Ω)}^{2} \leq C_{1} \int_{Ω} Φ (E^{n}) d x + C_{2},

(22)

we substitute into the inequality above. After moving the gradient term to the left-hand side of (20), we arrive at the key recursive inequality:

(1 - C Δ t) \int_{Ω} Φ (E^{n}) d x \leq \int_{Ω} Φ (E^{n - 1}) d x + C Δ t {∥ g^{n} ∥}_{H^{- 1 / 2} (Γ_{c})}^{2} + C Δ t .

(23)

Applying the discrete Gronwall lemma to (23) yields the desired uniform estimate: there exists a constant

C > 0

independent of

Δ t

such that

\int_{Ω} Φ (E^{n}) d x + \frac{1}{2} \sum_{k = 1}^{n} κ Δ t {∥ \nabla β (E^{k}) ∥}_{L^{2} (Ω)}^{2} \leq \int_{Ω} Φ (E^{0}) d x + C \sum_{k = 1}^{n} Δ t {∥ g^{k} ∥}_{H^{- 1 / 2} (Γ_{c})}^{2} .

(24)

This proves (15).

(ii) For the

L^{1}

bound, test (11) with

ϕ = ρ_{ε} (β (E^{n})),

where

ρ_{ε}

is a smooth regularization of sgn. Passing

ε \to 0

gives

\int_{Ω} | E^{n} | d x - \int_{Ω} | E^{n - 1} | d x \leq Δ t \int_{Γ_{c}} | g^{n} | d s .

(25)

Summing in n yields

\int_{Ω} | E^{n} | d x \leq M,

(26)

for some constant M. Finally,

\int_{Ω} | β (E^{n}) | d x \leq L_{β} \int_{Ω} | E^{n} | d x \leq L_{β} M,

(27)

where $L_{β}$ is the Lipschitz constant of $β$ and we used $β (0) = 0$. This proves (16). □
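The two pointwise facts behind part (i) of the proof, namely the convexity inequality (19) and a bound of the type (22), can be checked numerically. The Python sketch below is illustrative only: the constants are placeholders, and the helper names `beta` and `Phi` are hypothetical. For this piecewise-linear β, one can verify by hand that β(E)² = 2Φ(E)/(ρ c) on each phase, so (22) holds pointwise with C₁ = 2 L_β and C₂ = 0; the script confirms this on random samples.

```python
import numpy as np

# Placeholder material constants (assumed for illustration only).
RHO, C_S, C_L, LATENT = 1.0, 2.0, 0.5, 1.0

def beta(E):
    """theta = beta(E), Equation (7)."""
    E = np.asarray(E, dtype=float)
    return np.where(E < 0.0, E / (RHO * C_S),
                    np.where(E > RHO * LATENT,
                             (E - RHO * LATENT) / (RHO * C_L), 0.0))

def Phi(E):
    """Convex potential with Phi' = beta, Equation (17)."""
    E = np.asarray(E, dtype=float)
    return np.where(E < 0.0, E**2 / (2.0 * RHO * C_S),
                    np.where(E > RHO * LATENT,
                             (E - RHO * LATENT)**2 / (2.0 * RHO * C_L), 0.0))

rng = np.random.default_rng(1)
E_new = rng.uniform(-5.0, 5.0, size=5_000)
E_old = rng.uniform(-5.0, 5.0, size=5_000)

# Inequality (19): Phi(E_old) - Phi(E_new) >= beta(E_new) * (E_old - E_new).
convexity_gap = Phi(E_old) - Phi(E_new) - beta(E_new) * (E_old - E_new)

# Pointwise version of (22) with C1 = 2 * L_beta and C2 = 0.
C1 = 2.0 * max(1.0 / (RHO * C_S), 1.0 / (RHO * C_L))
growth_gap = C1 * Phi(E_new) - beta(E_new)**2
```

Both `convexity_gap` and `growth_gap` should be nonnegative up to rounding.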

Theorem 1 shows that

E^{n}

is bounded in

L^{\infty} (0, T; L^{1} (Ω))

,

β (E^{n})

is bounded in

L^{2} (0, T; H^{1} (Ω))

, and

{\partial_{t} E^{n}}

is bounded in

L^{2} (0, T; H^{- 1} (Ω))

. By the Aubin–Lions compactness lemma, using the compact embedding

H^{1} (Ω) ↪ L^{2} (Ω)

and the continuous embedding

L^{2} (Ω) ↪ H^{- 1} (Ω)

, we can conclude that

β (E^{n}) is relatively compact in L^{2} (0, T; L^{2} (Ω)),

which is the essential ingredient for passing to the limit

Δ t \to 0

and proving the existence of a weak solution.

Moreover, the estimate (24) shows that

\nabla β (E^{n}) is uniformly bounded in L^{2} ((0, T) \times Ω) .

On any set where

β^{'} (s) \geq c_{0} > 0

, this also controls

\nabla E^{n}

in

L^{2}

; combined with the compactness of

β (E^{n})

and the local invertibility of

β

, this implies the strong convergence of

E^{n}

on that set.

2.4. Convergence of Weak Solutions

We establish the existence of a weak solution to the enthalpy formulation (9) by passing to the limit

Δ t \to 0

in the discrete approximation. The compactness needed for this passage is provided by the Aubin–Lions compactness lemma.

Theorem 2

(Existence of a weak solution). Let

Ω \subset R^{d}

be a bounded Lipschitz domain with

Γ_{c} \subset \partial Ω

. Assume that

E^{0} \in L^{2} (Ω)

and

g \in L^{2} (0, T; H^{- 1 / 2} (Γ_{c}))

. Then, there exists a pair

(E, β (E))

such that

E \in L^{\infty} (0, T; L^{1} (Ω)), β (E) \in L^{2} (0, T; H^{1} (Ω)), E_{t} \in L^{2} (0, T; H^{- 1} (Ω)),

and the weak formulation

\int_{0}^{T} ⟨ E_{t}, ϕ ⟩ d t + κ \int_{0}^{T} \int_{Ω} \nabla β (E) \cdot \nabla ϕ d x d t = \int_{0}^{T} {⟨ g, ϕ ⟩}_{Γ_{c}} d t

(28)

holds for all

ϕ \in L^{2} (0, T; H^{1} (Ω))

.

Proof.

The continuous enthalpy formulation is given by

E_{t} = κ Δ β (E), κ \partial_{ν} β (E) = g .

(29)

Define the time interpolants from the discrete sequence

{(E^{n}, β (E^{n}), g^{n})}

:

{\bar{E}}_{Δ t} (t) : = E^{n}, {\bar{β (E)}}_{Δ t} (t) : = β (E^{n}), {\bar{g}}_{Δ t} (t) : = g^{n}, t \in (t_{n - 1}, t_{n}],

and the piecewise linear interpolant

E_{Δ t} (t) = E^{n - 1} + \frac{E^{n} - E^{n - 1}}{Δ t} (t - t_{n - 1}), t \in [t_{n - 1}, t_{n}],

(30)

so that

\partial_{t} E_{Δ t} (t) = \frac{E^{n} - E^{n - 1}}{Δ t}, t \in (t_{n - 1}, t_{n}) .

(31)

From the a priori estimates in Theorem 1, the following uniform bound holds

sup_{1 \leq n \leq N} \int_{Ω} Φ (E^{n}) d x + Δ t \sum_{n = 1}^{N} {∥ \nabla β (E^{n}) ∥}_{L^{2} (Ω)}^{2} \leq C (\int_{Ω} Φ (E^{0}) d x + {∥ g ∥}_{L^{2} (0, T; H^{- 1 / 2} (Γ_{c}))}^{2}) .

(32)

Consequently,

{\bar{E}}_{Δ t} is bounded in L^{\infty} (0, T; L^{1} (Ω)), {\bar{β (E)}}_{Δ t} is bounded in L^{2} (0, T; H^{1} (Ω)) .

From the discrete weak form

{(g^{n}, ϕ)}_{Γ_{c}} - {(κ \nabla β (E^{n}), \nabla ϕ)}_{Ω} = (\frac{E^{n} - E^{n - 1}}{Δ t}, ϕ), \forall ϕ \in H^{1} (Ω),

(33)

we obtain

| ⟨\frac{E^{n} - E^{n - 1}}{Δ t}, ϕ⟩ | \leq κ ∥ \nabla β (E^{n}) ∥_{L^{2} (Ω)} {∥ \nabla ϕ ∥}_{L^{2} (Ω)} + ∥ g^{n} ∥_{H^{- 1 / 2} (Γ_{c})} {∥ ϕ ∥}_{H^{1} (Ω)} .

Hence,

\partial_{t} E_{Δ t} is bounded in L^{2} (0, T; H^{- 1} (Ω)) .

(34)

Using that

β^{'} \in L^{\infty}

, we obtain

\partial_{t} {\bar{β (E)}}_{Δ t} = β^{'} ({\bar{E}}_{Δ t}) \partial_{t} E_{Δ t} \in L^{2} (0, T; H^{- 1} (Ω)),

with a uniform bound. By the Aubin–Lions compactness lemma with

X_{0} = H^{1} (Ω)

,

X = L^{2} (Ω)

,

X_{1} = H^{- 1} (Ω)

, there exists a subsequence such that

\begin{matrix} {\bar{β (E)}}_{Δ t} \to β (E) strongly in L^{2} (0, T; L^{2} (Ω)), \end{matrix}

(35)

\begin{matrix} {\bar{β (E)}}_{Δ t} ⇀ β (E) weakly in L^{2} (0, T; H^{1} (Ω)) . \end{matrix}

(36)

Moreover, since

{{\bar{E}}_{Δ t}}

is bounded in

L^{\infty} (0, T; L^{1} (Ω))

, it is weak-* relatively compact. Thus, up to a subsequence,

{\bar{E}}_{Δ t} ⇀ E

weak-* in

L^{\infty} (0, T; L^{1} (Ω))

for some

E \in L^{\infty} (0, T; L^{1} (Ω))

.

The discrete energy inequalities, derived in Theorem 1, are recalled here for the subsequent analysis. Testing the discrete weak form

(\frac{E^{n} - E^{n - 1}}{Δ t}, ϕ) = {(g^{n}, ϕ)}_{Γ_{c}} - κ {(\nabla β (E^{n}), \nabla ϕ)}_{Ω},

(37)

with

ϕ = β (E^{n})

and using convexity of

Φ

yields

\int_{Ω} (Φ (E^{n}) - Φ (E^{n - 1})) d x + κ Δ t {∥ \nabla β (E^{n}) ∥}_{L^{2} (Ω)}^{2} \leq Δ t {(g^{n}, β (E^{n}))}_{Γ_{c}} .

(38)

Summing over

n = 1, \dots, m

gives

\int_{Ω} Φ (E^{m}) d x + κ Δ t \sum_{n = 1}^{m} {∥ \nabla β (E^{n}) ∥}_{L^{2} (Ω)}^{2} \leq \int_{Ω} Φ (E^{0}) d x + Δ t \sum_{n = 1}^{m} {(g^{n}, β (E^{n}))}_{Γ_{c}} .

(39)

Since

β

is monotone, for any

ψ \in L^{2} (0, T; L^{2} (Ω))

,

\int_{0}^{T} \int_{Ω} ({\bar{β (E)}}_{Δ t} - β (ψ)) ({\bar{E}}_{Δ t} - ψ) d x d t \geq 0 .

Passing to the limit using (35) and the weak convergence of

{\bar{E}}_{Δ t}

yields

\int_{0}^{T} \int_{Ω} (β (E) - β (ψ)) (E - ψ) d x d t \geq 0 \forall ψ,

which, by the standard Minty monotonicity argument, identifies the limit of the nonlinear term as $β (E)$.

Finally, the interpolants satisfy the time-integrated discrete weak form

\int_{0}^{T} ⟨ \partial_{t} E_{Δ t}, ϕ ⟩ d t + κ \int_{0}^{T} \int_{Ω} \nabla {\bar{β (E)}}_{Δ t} \cdot \nabla ϕ d x d t = \int_{0}^{T} {⟨ {\bar{g}}_{Δ t}, ϕ ⟩}_{Γ_{c}} d t .

Using the convergences

\partial_{t} E_{Δ t} ⇀ E_{t}

in

L^{2} (0, T; H^{- 1} (Ω))

(by (34)),

\nabla {\bar{β (E)}}_{Δ t} ⇀ \nabla β (E)

in

L^{2} (0, T; L^{2} (Ω))

(by (36)), and

{\bar{g}}_{Δ t} \to g

in

L^{2} (0, T; H^{- 1 / 2} (Γ_{c}))

, we pass to the limit in each term and obtain (28). □

2.5. Lipschitz Continuity of the Enthalpy Solution with Respect to Boundary Control

Let

Ω \subset R^{d}

be a bounded Lipschitz domain,

Γ_{c} \subset \partial Ω

be a measurable subset, and define

V : = H^{1} (Ω)

,

V^{*} : = H^{- 1} (Ω)

. We consider the enthalpy formulation

E_{t} - κ Δ β (E) = 0 in Q : = Ω \times (0, T),

with boundary and initial conditions

κ \partial_{ν} β (E) = g on Σ_{c} : = Γ_{c} \times (0, T), \partial_{ν} β (E) = 0 on (\partial Ω ∖ Γ_{c}) \times (0, T), E (0) = E^{0} .

Throughout, we assume that

β : R \to R

is non-decreasing and Lipschitz-continuous, with (set-valued) inverse

γ = β^{- 1}

. The natural control space is

G : = L^{2} (0, T; H^{- 1 / 2} (Γ_{c})) .

This space is the most natural for the analysis in weak formulation, as it ensures that the boundary term is well defined. We note that the more regular space

L^{2} (Γ_{c})

is continuously embedded in

H^{- 1 / 2} (Γ_{c})

, a fact that we will use when formulating the optimal control problem.

Theorem 3

(Dual-norm Lipschitz continuity). Let

g, \hat{g} \in G

and let

E (g), E (\hat{g})

be the corresponding enthalpy solutions with initial data

E^{0}, {\hat{E}}^{0} \in V^{*}

. Then, the following stability estimate holds:

∥ E (g) - E (\hat{g}) ∥_{L^{\infty} (0, T; V^{*})} \leq ∥ E^{0} - {\hat{E}}^{0} ∥_{V^{*}} + C_{Ω} {∥ g - \hat{g} ∥}_{L^{2} (0, T; H^{- 1 / 2} (Γ_{c}))},

(40)

where the constant

C_{Ω} > 0

depends only on the domain Ω, the conductivity κ, the constants associated with the trace theorem and Neumann map, and, through the Cauchy–Schwarz step in time used in the proof, a factor $\sqrt{T}$.

Proof.

Let

δ E : = E (g) - E (\hat{g})

,

δ β : = β (E (g)) - β (E (\hat{g}))

, and

δ g : = g - \hat{g}

. The weak formulation of the enthalpy problem, tested with an arbitrary

ϕ \in V

, yields

⟨ \partial_{t} δ E, ϕ ⟩ + κ {(\nabla δ β, \nabla ϕ)}_{Ω} = {⟨ δ g, ϕ |_{Γ_{c}} ⟩}_{H^{- 1 / 2}, H^{1 / 2}} .

Let

N : V^{*} \to V

be the solution operator for the Neumann problem with zero mean. Specifically, for

f \in V^{*}

,

w = N f \in V

is the unique solution to

{(\nabla w, \nabla ψ)}_{Ω} = {⟨ f, ψ ⟩}_{V^{*}, V} \forall ψ \in V, with \partial_{ν} w = 0 on \partial Ω .

Choosing

ϕ = N δ E

gives

⟨ \partial_{t} δ E, N δ E ⟩ + κ {(\nabla δ β, \nabla N δ E)}_{Ω} = ⟨ δ g, (N δ E) |_{Γ_{c}} ⟩ .

To establish the coercivity structure, we note that the dual norm satisfies

{∥ δ E ∥}_{V^{*}}^{2} = {(\nabla N δ E, \nabla N δ E)}_{Ω} .

Differentiating this identity with respect to time yields

\frac{d}{d t} {∥ δ E ∥}_{V^{*}}^{2} = 2 {(\nabla N \partial_{t} δ E, \nabla N δ E)}_{Ω} .

However, by the definition of

N

, we have

{(\nabla N \partial_{t} δ E, \nabla N δ E)}_{Ω} = ⟨ \partial_{t} δ E, N δ E ⟩

, which establishes the key identity

⟨ \partial_{t} δ E, N δ E ⟩ = \frac{1}{2} \frac{d}{d t} {∥ δ E ∥}_{V^{*}}^{2} .

Furthermore, by the trace theorem and the boundedness of

N

, there exist constants

C_{tr}, C_{N} > 0

such that

∥ (N δ E) |_{Γ_{c}} ∥_{H^{1 / 2} (Γ_{c})} \leq C_{tr} {∥ N δ E ∥}_{H^{1} (Ω)} \leq C_{tr} C_{N} {∥ δ E ∥}_{V^{*}} .

Using Cauchy’s inequality

a b \leq \frac{1}{2} (ϵ a^{2} + ϵ^{- 1} b^{2})

and the previous bound, we obtain

\frac{1}{2} \frac{d}{d t} {∥ δ E ∥}_{V^{*}}^{2} + κ {(\nabla δ β, \nabla N δ E)}_{Ω} \leq C_{tr} C_{N} {∥ δ g ∥}_{H^{- 1 / 2} (Γ_{c})} {∥ δ E ∥}_{V^{*}} .

The coercivity now follows from the monotonicity of

β

. Since

β

is maximal monotone, we have

(δ β, δ E) \geq 0 .

By the definition of

N

, this implies that

{(\nabla δ β, \nabla N δ E)}_{Ω} = {⟨ δ E, δ β ⟩}_{V^{*}, V} \geq 0,

which provides the essential coercivity estimate. Dropping this nonnegative term and dividing by the dual norm of δE (wherever it is nonzero) yields

\frac{d}{d t} {∥ δ E ∥}_{V^{*}} \leq C_{Ω} {∥ δ g ∥}_{H^{- 1 / 2} (Γ_{c})}, C_{Ω} : = C_{tr} C_{N} .

Integrating this inequality from 0 to t and applying Cauchy–Schwarz in time gives (40) and completes the proof. □

If, in addition,

g, \hat{g} \in L^{1} (0, T; L^{1} (Γ_{c}))

and

E^{0}, {\hat{E}}^{0} \in L^{1} (Ω)

, the standard discrete sign test argument yields the sharper

L^{1}

-estimate:

∥ E (g) - E (\hat{g}) ∥_{L^{\infty} (0, T; L^{1} (Ω))} \leq ∥ E^{0} - {\hat{E}}^{0} ∥_{L^{1} (Ω)} + {∥ g - \hat{g} ∥}_{L^{1} (Σ_{c})} .

Define the set-valued nonlinear operator

A : L^{1} (Ω) ⇉ L^{1} (Ω)

with a homogeneous Neumann boundary condition by

D (A) = \{u \in L^{1} (Ω) : \exists v \in H^{1} (Ω) with v (x) \in γ (u (x)) a . e . in Ω, Δ v \in L^{1} (Ω), \partial_{ν} v = 0\},

and, for

u \in D (A)

,

A (u) = \{- Δ v \in L^{1} (Ω) : v \in H^{1} (Ω), v (x) \in γ (u (x)) a . e . in Ω, \partial_{ν} v = 0\} .

Then, A is m-accretive on

L^{1} (Ω)

and, thus, generates a nonlinear contraction semigroup. The boundary-controlled problem can be written abstractly as

E_{t} + A (E) ∋ B g in V^{*},

where

B : H^{- 1 / 2} (Γ_{c}) \to V^{*}

is the bounded linear operator mapping a boundary flux to the corresponding source term via the Neumann map. The accretivity of A combined with the boundedness of the control operator B directly implies the stability bound (40) and guarantees uniqueness in the natural

V^{*}

topology; see, e.g., [19] for the analysis of evolution equations with boundary inputs.

We adopt controls

g \in L^{2} (0, T; L^{2} (Γ_{c}))

, continuously embedded in

L^{2} (0, T; H^{- 1 / 2} (Γ_{c}))

for the state equation; all boundary pairings are interpreted via the trace map.

For the natural control space

G = L^{2} (0, T; H^{- 1 / 2} (Γ_{c}))

, the dual-norm estimate (40) is the sharp and natural stability result, directly implying continuous dependence and uniqueness. When boundary fluxes are more regular—that is, integrable on

Σ_{c}

—one recovers the classical

L^{1}

-contraction principle.

3. Optimal Control Problem for the Motion of the Interface of the Stefan Problem

In this section, we consider controlling the motion of the moving interface via a boundary heat-flux control

g (t, x)

applied on a portion of the boundary

Γ_{c}

. The goal is to control the evolution of the interface toward a desired trajectory by appropriate heating or cooling at

Γ_{c}

. This leads to an optimal control problem where the state equations are the Stefan PDEs and the objective functional measures the deviation from a desired outcome together with the cost of applying control. We use the enthalpy formulation to describe the phase-change dynamics. Let

E (t, x)

denote the enthalpy; the state system is given by

E_{t} = \nabla \cdot (κ \nabla β (E)), κ \partial_{ν} β (E) = g on Γ_{c},

together with zero-flux conditions on

\partial Ω ∖ Γ_{c}

and initial condition

E (0, x) = E_{0} (x)

.

We introduce an enthalpy-tracking cost functional that measures the discrepancy between the state and a target while penalizing the control effort.

min_{g} J (E, g) = \frac{1}{2} \int_{0}^{T} ∥ E - E_{d} ∥_{L^{2} (Ω)}^{2} d t + \frac{γ}{2} ∥ E (T) - E_{T} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} \int_{0}^{T} {∥ g ∥}_{L^{2} (Γ_{c})}^{2} d t,

(41)

where

E_{d} (t)

is the desired enthalpy trajectory and

E_{T}

is the desired terminal enthalpy state, subject to the control constraint

| g | \leq δ

, while

γ > 0

weights the terminal tracking, and

α > 0

penalizes control energy.

To define the moving-interface tracking criterion, let

{\bar{Γ}}_{t}

be a desired interface, and let

ϕ (t, x) = dist (x, {\bar{Γ}}_{t})

be the signed distance to it. The interface-tracking problem then reads

min_{g} J (E, g) = \frac{1}{2} \int_{0}^{T} \int_{Ω} {| ϕ (t, x) |}^{2} δ_{η} (β (E (x, t))) | \nabla β (E (x, t)) | d x d t + \frac{α}{2} \int_{0}^{T} {∥ g ∥}_{L^{2} (Γ_{c})}^{2} d t .

(42)

In general, the cost functional may be summarized as

min_{g} J (E, g) = \int_{0}^{T} \int_{Ω} Ψ (E) d x d t + γ Ψ (E (T)) + \frac{α}{2} \int_{0}^{T} {∥ g ∥}_{L^{2} (Γ_{c})}^{2} d t,

(43)

for some functional

Ψ

that represents the tracking objective.

While the interface-tracking formulation provides the physical motivation for controlling the solidification front, the enthalpy-tracking formulation offers significant mathematical advantages for analysis and computation. The enthalpy tracking serves as a regularization of the interface-tracking objective, where guiding the full enthalpy field indirectly controls the interface position while avoiding the mathematical complexities of singular measures and geometric sensitivities. For the remainder of this work, we employ the enthalpy-tracking formulation to establish well-posedness and derive optimality conditions.

3.1. Existence of Optimal Control

We establish the existence of an optimal boundary control for the Stefan system via the direct method of the calculus of variations, which proves existence by combining coercivity, compactness, and weak lower semicontinuity. Establishing existence is a prerequisite for any uniqueness, regularity, or numerical approximation result.

We show that the quadratic control cost enforces coercivity of the functional, while the control-to-state operator exhibits sufficient compactness properties to guarantee convergence of minimizing sequences. Together with the weak lower semicontinuity of the cost functional, these properties ensure the existence of an admissible control that minimizes the objective. Consequently, there exists an admissible boundary heat flux

g^{*}

that optimally drives the solidification interface motion toward a desired target, while balancing accuracy against control effort. For related developments in free boundary optimal control, we refer to [5,20,21].

Theorem 4

(Existence of an optimal control). Let

α > 0

. Define the admissible control set

U_{ad} : = \{g \in L^{2} (0, T; L^{2} (Γ_{c})) : | g (t, x) | \leq δ for a . e . (x, t) \in Γ_{c} \times (0, T)\} .

The optimal control problem (41), subject to the enthalpy formulation (8), admits at least one minimizer

g^{*} \in U_{ad},

with associated state

E^{*} = E (g^{*})

.

Proof.

Let

\tilde{J} (g) : = J (E (g), g)

be the reduced composite functional. By definition of J,

\tilde{J} (g) = \frac{1}{2} \int_{0}^{T} ∥ E (g) - E_{d} ∥_{L^{2} (Ω)}^{2} d t + \frac{γ}{2} ∥ E (g) (T) - E_{T} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} \int_{0}^{T} {∥ g ∥}_{L^{2} (Γ_{c})}^{2} d t

\geq \frac{α}{2} {∥ g ∥}_{L^{2} (0, T; L^{2} (Γ_{c}))}^{2} .

Thus,

\tilde{J}

is coercive on

L^{2} (0, T; L^{2} (Γ_{c}))

. Let

\{g^{n}\} \subset U_{ad}

be a minimizing sequence. Since

U_{ad}

is convex, closed, and bounded in the reflexive space

L^{2} (0, T; L^{2} (Γ_{c}))

, it is weakly sequentially compact; hence, up to a subsequence, there exists an admissible limit control such that

g^{n} ⇀ g^{*} in L^{2} (0, T; L^{2} (Γ_{c})) .

Let

E^{n} : = E (g^{n})

be the associated enthalpy solutions. From the well-posedness theory (Section 2) and

E^{0} \in L^{2} (Ω)

, we have uniform bounds

{∥ E^{n} ∥}_{L^{\infty} (0, T; L^{2} (Ω))} + ∥ β (E^{n}) ∥_{L^{2} (0, T; H^{1} (Ω))} + {∥ E_{t}^{n} ∥}_{L^{2} (0, T; H^{- 1} (Ω))} \leq C .

By the Aubin–Lions lemma, there exist

E^{*}

and

ξ

such that, up to a subsequence,

\begin{matrix} β (E^{n}) & \to ξ strongly in L^{2} (0, T; L^{2} (Ω)), \\ β (E^{n}) & ⇀ ξ in L^{2} (0, T; H^{1} (Ω)), \\ E^{n} & ⇀ E^{*} in L^{2} (0, T; L^{2} (Ω)), \\ E_{t}^{n} & ⇀ E_{t}^{*} in L^{2} (0, T; H^{- 1} (Ω)) . \end{matrix}

By the monotonicity of

β

and the strong convergence of

β (E^{n})

, we identify

ξ = β (E^{*})

a.e. For all

ϕ \in L^{2} (0, T; H^{1} (Ω))

, each pair of the minimizing sequence satisfies the weak formulation

\int_{0}^{T} ⟨ E_{t}^{n}, ϕ ⟩ d t + κ \int_{0}^{T} {(\nabla β (E^{n}), \nabla ϕ)}_{L^{2} (Ω)} d t = \int_{0}^{T} {(g^{n}, ϕ)}_{L^{2} (Γ_{c})} d t .

Using the convergences

E_{t}^{n} ⇀ E_{t}^{*}

,

\nabla β (E^{n}) ⇀ \nabla β (E^{*})

and

g^{n} ⇀ g^{*}

, we obtain

\int_{0}^{T} ⟨ E_{t}^{*}, ϕ ⟩ d t + κ \int_{0}^{T} {(\nabla β (E^{*}), \nabla ϕ)}_{L^{2} (Ω)} d t = \int_{0}^{T} {(g^{*}, ϕ)}_{L^{2} (Γ_{c})} d t,

so

(E^{*}, g^{*})

satisfies the state system.

By weak convergence

E^{n} ⇀ E^{*}

in

L^{2} (0, T; L^{2} (Ω))

and weak lower semicontinuity of the norm,

\int_{0}^{T} ∥ E^{*} - E_{d} ∥_{L^{2} (Ω)}^{2} d t \leq \underset{n \to \infty}{lim inf} \int_{0}^{T} {∥ E^{n} - E_{d} ∥}_{L^{2} (Ω)}^{2} d t .

Similarly, by weak convergence

g^{n} ⇀ g^{*}

in

L^{2} (0, T; L^{2} (Γ_{c}))

,

\int_{0}^{T} ∥ g^{*} ∥_{L^{2} (Γ_{c})}^{2} d t \leq \underset{n \to \infty}{lim inf} \int_{0}^{T} {∥ g^{n} ∥}_{L^{2} (Γ_{c})}^{2} d t .

Finally, from the boundedness

E^{n} \in L^{\infty} (0, T; L^{2} (Ω))

and

E_{t}^{n} \in L^{2} (0, T; H^{- 1} (Ω))

, we have

E^{n} (T) ⇀ E^{*} (T)

weakly in

L^{2} (Ω)

; hence,

{∥ E^{*} (T) - E_{T} ∥}_{L^{2} (Ω)}^{2} \leq \underset{n \to \infty}{lim inf} {∥ E^{n} (T) - E_{T} ∥}_{L^{2} (Ω)}^{2} .

Combining these inequalities,

\tilde{J} (g^{*}) \leq \underset{n \to \infty}{lim inf} \tilde{J} (g^{n}) = inf_{g \in U_{ad}} \tilde{J} (g),

so

g^{*}

is a minimizer. □

3.2. Lagrangian Functional and Adjoint Equation

We derive the adjoint equation for the optimal control problem using a Lagrangian approach. The adjoint variable measures the sensitivity of the cost functional with respect to small perturbations in the state. This technique is classical in PDE-constrained optimization [21] and has been applied to Stefan-type problems in [5,20].

Theorem 5

(Adjoint equation). For the regularized enthalpy function

β_{ε}

with

β_{ε}^{'} > 0

, the adjoint state p satisfies the backward parabolic equation

\begin{matrix} - p_{t} & = κ \nabla \cdot (β_{ε}^{'} (E) \nabla p) - (E - E_{d}) & in Ω \times (0, T), \\ p (T) & = - γ (E (T) - E_{T}) & in Ω, \\ \partial_{ν} p & = 0 & on \partial Ω \times (0, T) . \end{matrix}

(44)

Proof.

The state equation in weak form is

\int_{0}^{T} {(E_{t}, ϕ)}_{Ω} d t + κ \int_{0}^{T} {(\nabla β_{ε} (E), \nabla ϕ)}_{Ω} d t - \int_{0}^{T} {(g, ϕ)}_{Γ_{c}} d t = 0

for all

ϕ \in L^{2} (0, T; H^{1} (Ω))

. Define the Lagrangian functional

L (E, g, p) = J (E, g) + \int_{0}^{T} {(E_{t}, p)}_{Ω} d t + κ \int_{0}^{T} {(\nabla β_{ε} (E), \nabla p)}_{Ω} d t - \int_{0}^{T} {(g, p)}_{Γ_{c}} d t .

(45)

Integration by parts in time gives

\int_{0}^{T} {(E_{t}, p)}_{Ω} d t = (E (T), p (T)) - (E (0), p (0)) - \int_{0}^{T} {(E, p_{t})}_{Ω} d t .

For a variation h in E,

δ [{(\nabla β_{ε} (E), \nabla p)}_{Ω}] = {(β_{ε}^{'} (E) \nabla h, \nabla p)}_{Ω} .

The Lagrangian becomes

\begin{matrix} L (E, g, p) & = J (E, g) - \int_{0}^{T} {(E, p_{t})}_{Ω} d t + κ \int_{0}^{T} {(β_{ε}^{'} (E) \nabla E, \nabla p)}_{Ω} d t \\ - \int_{0}^{T} {(g, p)}_{Γ_{c}} d t + (E (T), p (T)) - (E (0), p (0)) . \end{matrix}

The Gâteaux derivative in the direction h is given by

\begin{matrix} L_{E} (E, g, p) [h] & = \int_{0}^{T} {(E - E_{d}, h)}_{Ω} d t + γ (E (T) - E_{T}, h (T)) \\ - \int_{0}^{T} {(p_{t}, h)}_{Ω} d t + κ \int_{0}^{T} {(β_{ε}^{'} (E) \nabla h, \nabla p)}_{Ω} d t \\ + (h (T), p (T)) - (h (0), p (0)) . \end{matrix}

Integrating the spatial term by parts (using

\partial_{ν} p = 0

),

κ \int_{0}^{T} {(β_{ε}^{'} (E) \nabla h, \nabla p)}_{Ω} d t = - κ \int_{0}^{T} {(\nabla \cdot (β_{ε}^{'} (E) \nabla p), h)}_{Ω} d t .

Requiring

L_{E} [h] = 0

for all admissible h with

h (0) = 0

yields the adjoint equation and terminal condition. The regularization parameter

ε > 0

ensures that

β_{ε}

is differentiable and

β_{ε}^{'} > 0

, which is essential for the well-posedness of the adjoint equation. □

We emphasize that the adjoint derivation is consistent with the weak formulation written in the enthalpy variable E, using the temperature–enthalpy relation. A similar scalar nonlinear diffusion structure is analyzed in [22], where the operator

- \nabla \cdot (β^{'} (E) \nabla \cdot)

arises naturally in the Gurtin–MacCamy-type equation. We follow this consistent formulation to ensure that the adjoint equation remains thermodynamically coherent.

Theorem 6

(Discrete existence). Let

\{t_{n}\}_{n = 0}^{N}

be a uniform partition of

[0, T]

with time step

Δ t = T / N

, where

t_{n} = n Δ t

and

t_{N} = T

. Consider the regularized enthalpy function:

β_{ε} (s) = \frac{1}{2 ε} \int_{s - ε}^{s + ε} β (z) d z, β_{ε}^{'} (s) \geq ε > 0 .

(46)

The backward Euler discretization of the adjoint equation (solved backward in time from

n = N

to

n = 0

) reads

- \frac{p^{n} - p^{n - 1}}{Δ t} = κ \nabla \cdot (β_{ε}^{'} (E^{n}) \nabla p^{n - 1}) + (E^{n} - E_{d}^{n}), \partial_{ν} p^{n - 1} = 0 .

(47)

Here,

p^{n - 1}

is computed from

p^{n}

, with

p^{N} = γ (E^{N} - E_{T})

. Then, for each n from N down to 1, given

p^{n}

, there exists a unique solution

p^{n - 1} \in H^{1} (Ω)

.

Proof.

Equation (47) is equivalent to the elliptic problem for

p^{n - 1}

:

- κ \nabla \cdot (β_{ε}^{'} (E^{n}) \nabla p^{n - 1}) + \frac{1}{Δ t} p^{n - 1} = \frac{1}{Δ t} p^{n} + (E^{n} - E_{d}^{n}) .

The weak form seeks

p^{n - 1} \in H^{1} (Ω)

such that, for all

ϕ \in H^{1} (Ω)

,

a (p^{n - 1}, ϕ) = ℓ (ϕ),

where

\begin{matrix} a (u, ϕ) & : = κ {(β_{ε}^{'} (E^{n}) \nabla u, \nabla ϕ)}_{Ω} + \frac{1}{Δ t} {(u, ϕ)}_{Ω}, \\ ℓ (ϕ) & : = \frac{1}{Δ t} {(p^{n}, ϕ)}_{Ω} + {((E^{n} - E_{d}^{n}), ϕ)}_{Ω} . \end{matrix}

Since

β_{ε}^{'} (E^{n}) \geq ε > 0

, the bilinear form

a (\cdot, \cdot)

is coercive and continuous on

H^{1} (Ω)

. The linear functional

ℓ (\cdot)

is continuous. By the Lax–Milgram theorem, there exists a unique solution

p^{n - 1} \in H^{1} (Ω)

. □
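The elliptic solve behind Theorem 6 can be illustrated in one space dimension. The following is a minimal sketch, not the implementation used in this paper: it follows the sign convention of (47) as printed, and the function name `adjoint_step`, the arithmetic averaging of the coefficient at cell faces, and the Thomas (tridiagonal) solver are assumptions made for illustration.

```python
def adjoint_step(p_next, E, E_d, dbeta, dt, kappa, dx):
    """One backward Euler adjoint step in 1D with zero-flux ends.

    Solves p/dt - kappa * d/dx(beta_eps'(E) dp/dx) = p_next/dt + (E - E_d),
    i.e. the elliptic problem equivalent to (47). `dbeta` is a callable
    returning beta_eps'(E) > 0 (hypothetical helper supplied by the caller).
    """
    n = len(p_next)
    b = [dbeta(e) for e in E]
    # Face weights kappa * beta_eps' / dx^2 between cells i and i+1
    # (arithmetic averaging is an assumption of this sketch).
    face = [kappa * 0.5 * (b[i] + b[i + 1]) / dx ** 2 for i in range(n - 1)]
    diag = [1.0 / dt] * n
    for i in range(n - 1):
        diag[i] += face[i]
        diag[i + 1] += face[i]
    rhs = [p_next[i] / dt + (E[i] - E_d[i]) for i in range(n)]

    # Thomas algorithm; the off-diagonal entries are -face[i].
    c = [0.0] * n
    d = [0.0] * n
    c[0] = -face[0] / diag[0] if n > 1 else 0.0
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        m = diag[i] + face[i - 1] * c[i - 1]
        c[i] = -face[i] / m if i < n - 1 else 0.0
        d[i] = (rhs[i] + face[i - 1] * d[i - 1]) / m
    p = [0.0] * n
    p[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        p[i] = d[i] - c[i] * p[i + 1]
    return p
```

Because the coefficient is bounded below by a positive constant, the tridiagonal matrix is strictly diagonally dominant and symmetric, which is the discrete counterpart of the coercivity used in the Lax–Milgram argument. With a vanishing source term, a constant terminal value is reproduced exactly, since constants lie in the nullspace of the Neumann diffusion operator.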

To obtain stability estimates, we test (47) with

- Δ p^{n - 1}

(formally, assuming sufficient regularity). This yields

(\frac{p^{n} - p^{n - 1}}{Δ t}, - Δ p^{n - 1}) = κ (β_{ε}^{'} (E^{n}) \nabla p^{n - 1}, \nabla p^{n - 1}) + (E^{n} - E_{d}^{n}, - Δ p^{n - 1}) .

Using the energy identity

(a - b, - Δ b) = \frac{1}{2} ({∥ \nabla a ∥}^{2} - {∥ \nabla b ∥}^{2} - {∥ \nabla (a - b) ∥}^{2}),

together with

\partial_{ν} p^{n - 1} = 0

, the left-hand side becomes

(\frac{p^{n} - p^{n - 1}}{Δ t}, - Δ p^{n - 1}) = \frac{1}{2 Δ t} (∥ \nabla p^{n} ∥_{L^{2} (Ω)}^{2} - ∥ \nabla p^{n - 1} ∥_{L^{2} (Ω)}^{2} - {∥ \nabla (p^{n} - p^{n - 1}) ∥}_{L^{2} (Ω)}^{2}) .

Applying Young’s inequality to the term

(E^{n} - E_{d}^{n}, - Δ p^{n - 1})

and using

β_{ε}^{'} \geq ε

, we obtain the following for each step:

\frac{1}{2 Δ t} (∥ \nabla p^{n} ∥_{L^{2} (Ω)}^{2} - {∥ \nabla p^{n - 1} ∥}_{L^{2} (Ω)}^{2}) + ε κ ∥ Δ p^{n - 1} ∥_{L^{2} (Ω)}^{2} \leq \frac{C}{ε} {∥ E^{n} - E_{d}^{n} ∥}_{L^{2} (Ω)}^{2} .

Multiplying by

Δ t

and summing from

n = N

down to 1 yields

∥ \nabla p^{0} ∥_{L^{2} (Ω)}^{2} + 2 ε κ \sum_{n = 1}^{N} Δ t ∥ Δ p^{n - 1} ∥_{L^{2} (Ω)}^{2} \leq ∥ \nabla p^{N} ∥_{L^{2} (Ω)}^{2} + \frac{C}{ε} \sum_{n = 1}^{N} Δ t {∥ E^{n} - E_{d}^{n} ∥}_{L^{2} (Ω)}^{2} .

This provides uniform bounds for the discrete solutions, initialized with the given value of

∥ \nabla p^{N} ∥

. By weak compactness and consistency, the limit p (as

Δ t \to 0

) satisfies

p \in L^{\infty} (0, T; H^{1} (Ω)) \cap L^{2} (0, T; H^{2} (Ω)), p_{t} \in L^{2} (0, T; L^{2} (Ω)) .

The adjoint state p propagates backward in time, reflecting the reversed causality of sensitivities: terminal perturbations influence the entire trajectory. In PDE-constrained optimization, the adjoint bridges state dynamics with the gradient of the cost functional with respect to the control. The regularization

β_{ε}

is essential for establishing well-posedness: without it,

β^{'} (E)

may vanish or blow up, leading to degeneracy. The condition

β_{ε}^{'} (E) \geq ε > 0

restores uniform ellipticity, and the energy bound above guarantees the stability and regularity of solutions.

Corollary 1

(Existence of the regularized adjoint). Let E be the enthalpy state associated with a given control g, and let

β_{ε}

be defined in (46) so that

β_{ε}^{'} \geq ε > 0

. Then, the regularized adjoint equation

- p_{t} = κ \nabla \cdot (β_{ε}^{'} (E) \nabla p) + (E - E_{d}), \partial_{ν} p = 0 on \partial Ω, p (T) = γ (E (T) - E_{T}),

admits a unique weak solution

p \in H^{1} (0, T; H^{- 1} (Ω)) \cap L^{2} (0, T; H^{1} (Ω)) .

Moreover, if Ω is

C^{2}

and

β_{ε}^{'} (E) \in W^{1, \infty} (Ω \times (0, T))

, then

p_{t} \in L^{2} (0, T; L^{2} (Ω)) and p \in L^{\infty} (0, T; H^{1} (Ω)) .

Proof.

The backward Euler scheme (47) produces a sequence

\{p^{n}\}

with the uniform estimate

∥ \nabla p^{0} ∥_{L^{2}}^{2} + 2 ε κ \sum_{n = 1}^{N} Δ t ∥ Δ p^{n - 1} ∥_{L^{2}}^{2} \leq ∥ \nabla p^{N} ∥_{L^{2}}^{2} + \frac{C}{ε} \sum_{n = 1}^{N} Δ t {∥ E^{n} - E_{d}^{n} ∥}_{L^{2}}^{2} .

Using standard time-interpolants, this yields boundedness of the interpolated sequence in

L^{\infty} (0, T; H^{1} (Ω))

and of the time differences in

L^{2} (0, T; L^{2} (Ω))

. By weak compactness (and Aubin–Lions), up to a subsequence,

p^{Δ t} ⇀ p

weak-* in

L^{\infty} (0, T; H^{1} (Ω))

, and

p^{Δ t} \to p

strongly in

L^{2} (0, T; H^{1} (Ω))

.

Since

β_{ε}^{'}

is bounded and continuous,

E^{n} \to E

a.e. implies

β_{ε}^{'} (E^{n}) \to β_{ε}^{'} (E)

strongly in

L^{q}

for any finite q (dominated convergence). Together with

\nabla p^{n - 1} ⇀ \nabla p

in

L^{2}

, this yields

β_{ε}^{'} (E^{n}) \nabla p^{n - 1} ⇀ β_{ε}^{'} (E) \nabla p

in

L^{2}

, so we can pass to the limit in (47) to obtain the weak form of the regularized adjoint. The terminal condition is inherited from the initialization

p^{N} = γ (E^{N} - E_{T})

. Uniqueness follows by testing the homogeneous equation with the difference of two solutions and using

β_{ε}^{'} \geq ε

. The temporal regularity

p_{t} \in L^{2} (0, T; L^{2} (Ω))

follows from the uniform bounds on the discrete time differences. □

Remark 1.

The regularization

β_{ε}

with

β_{ε}^{'} (s) \geq ε > 0

ensures that the corresponding functional

J_{n}^{ε} (β)

from Equation (14) becomes strictly convex. Since

β_{ε}^{'} (s) \geq ε > 0

everywhere, the convex potential associated with

β_{ε}

is strongly convex. Combined with the convex Dirichlet energy term and the linear terms, the functional

J_{n}^{ε} (β)

is strictly convex, guaranteeing both the existence and uniqueness of the minimizer.

3.3. Necessary Optimality Conditions

We derive the first-order necessary optimality conditions for the boundary control problem governed by the Stefan system. The method of Lagrange multipliers yields a coupled system of forward and backward PDEs characterizing local optima.

We define the control operator

B : L^{2} (Γ_{c}) \to H^{- 1} (Ω)

as the bounded linear operator given by

⟨ B g, ϕ ⟩ = {(g, ϕ)}_{Γ_{c}}

for all

ϕ \in H^{1} (Ω)

. Its adjoint

B^{⋆} : H^{1} (Ω) \to L^{2} (Γ_{c})

is given by

B^{⋆} ϕ = ϕ |_{Γ_{c}}

when interpreted appropriately through the trace theorem.

Theorem 7.

Let

(E^{*}, g^{*})

be a locally optimal pair for the control problem, and let p denote the associated adjoint state. Then,

(E^{*}, g^{*}, p)

satisfies the following optimality system:

\{\begin{matrix} \partial_{t} E^{*} = κ Δ β (E^{*}) & in Ω \times (0, T), \\ κ \partial_{ν} β (E^{*}) = g^{*} & on Γ_{c} \times (0, T), \\ - \partial_{t} p = κ \nabla \cdot (β^{'} (E^{*}) \nabla p) - (E^{*} - E_{d}) & in Ω \times (0, T), \\ \partial_{ν} p = 0 & on \partial Ω \times (0, T), \\ α g^{*} - B^{⋆} p = 0 & on Γ_{c} \times (0, T), \\ E^{*} (0) = E_{0}, p (T) = - γ (E^{*} (T) - E_{T}) & in Ω . \end{matrix}

(48)

Proof.

The proof follows by taking first variations of the Lagrangian functional

\begin{matrix} L (E, g, p) & = J (E, g) + \int_{0}^{T} [{(E_{t}, p)}_{Ω} + κ {(\nabla β (E), \nabla p)}_{Ω} - {(B g, p)}_{Ω}] d t, \\ J (E, g) & = \frac{1}{2} ∥ E - E_{d} ∥_{L^{2} (Q)}^{2} + \frac{α}{2} {∥ g ∥}_{L^{2} (Σ_{c})}^{2} + \frac{γ}{2} {∥ E (T) - E_{T} ∥}_{L^{2} (Ω)}^{2}, \end{matrix}

(49)

with

Q = Ω \times (0, T)

and

Σ_{c} = Γ_{c} \times (0, T)

.

For a variation

g^{*} + ε w

, the first variation gives (for all

w \in L^{2} (Σ_{c})

)

δ_{g} L (E, g, p) [w] = α {(g, w)}_{Σ_{c}} - {(B w, p)}_{Ω} = α {(g, w)}_{Σ_{c}} - {(w, B^{⋆} p)}_{Σ_{c}} = {(α g - B^{⋆} p, w)}_{Σ_{c}} = 0 .

Hence, we obtain the control condition

α g^{*} - B^{⋆} p = 0 on Γ_{c} \times (0, T),

understood in the

L^{2} (Σ_{c})

sense.

For a variation

E^{*} + ε v

with

v (0) = 0

, we compute

\begin{matrix} ⟨\frac{\partial L}{\partial E}, v⟩ & = {(E^{*} - E_{d}, v)}_{Q} + γ (E^{*} (T) - E_{T}, v (T)) \\ - \int_{0}^{T} {(p_{t}, v)}_{Ω} d t + κ \int_{0}^{T} {(\nabla β^{'} (E^{*}) v, \nabla p)}_{Ω} d t \\ = {(E^{*} - E_{d}, v)}_{Q} + γ (E^{*} (T) - E_{T}, v (T)) - \int_{0}^{T} {(p_{t}, v)}_{Ω} d t \\ - κ \int_{0}^{T} {(\nabla \cdot (β^{'} (E^{*}) \nabla p), v)}_{Ω} d t . \end{matrix}

Requiring

⟨L_{E} (E^{*}, g^{*}, p), v⟩ = 0

for all v yields the adjoint PDE

- p_{t} = κ \nabla \cdot (β^{'} (E^{*}) \nabla p) - (E^{*} - E_{d}) in Ω \times (0, T),

with terminal condition

p (T) = - γ (E^{*} (T) - E_{T})

and

\partial_{ν} p = 0

on

\partial Ω

.

Variation with respect to p recovers the weak form of the state equation with boundary flux

g^{*}

.

Let

S : g \mapsto E (g)

be the control-to-state map and define the reduced cost

\hat{J} (g) = J (S (g), g)

. At a local optimum

g^{*}

, the first variation satisfies

{\hat{J}}^{'} (g^{*}) h = 0

for all admissible

h \in L^{2} (Σ_{c})

. The adjoint construction above computes

{\hat{J}}^{'} (g^{*})

and yields the optimality relation

α g^{*} - B^{⋆} p = 0

on

Σ_{c}

. Collecting the three stationarity conditions gives exactly the optimality system (48). □

The optimality system (48) couples the forward enthalpy equation with the backward adjoint problem. The optimality condition

α g^{*} = B^{⋆} p

shows that the optimal flux

g^{*}

is determined by the boundary trace of the adjoint state through the adjoint operator

B^{⋆}

. The backward adjoint dynamics transmit information from the final-time cost to earlier times, linking the desired terminal state to the boundary control strategy.
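As a small illustration of the control condition in (48), the sketch below recovers the flux from sampled boundary values of the adjoint and evaluates the stationarity residual. The function names are hypothetical. The optional clipping to the bound δ is the standard pointwise projection for box-constrained controls; it is an assumption of this sketch and is not part of the statement of Theorem 7.

```python
def control_from_adjoint(p_trace, alpha, delta=None):
    """Return g = p|_{Gamma_c} / alpha, optionally clipped to [-delta, delta].

    `p_trace` holds the adjoint values on the control boundary. The clipping
    step is an assumption of this sketch (pointwise projection onto the box).
    """
    g = [p / alpha for p in p_trace]
    if delta is not None:
        g = [max(-delta, min(delta, gi)) for gi in g]
    return g


def stationarity_residual(g, p_trace, alpha):
    """Pointwise residual alpha * g - p|_{Gamma_c} of the control condition."""
    return [alpha * gi - pi for gi, pi in zip(g, p_trace)]
```

Without the bound, the residual vanishes up to rounding. With the bound active at some boundary point, the residual is nonzero there, which is why constrained problems are usually characterized by a variational inequality rather than an equation.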

4. Numerical Methods for Enthalpy-Based Formulation of the Controlled Two-Phase Stefan Problem

Many optimal-control treatments of Stefan-type problems yield nonsymmetric linearized/adjoint operators, motivating Krylov solvers whose performance can be sensitive to non-normality. Prior work includes free boundary control in a level-set setting [5,20] with connections to classical level-set solvers [23] and enthalpy/phase-field formulations [3,4]. In contrast, our contribution enforces an SPD structure throughout: a temperature-linearized semismooth Newton step for the forward problem and a diagonal congruence for the discrete adjoint produce strictly SPD systems, enabling preconditioned CG within a reduced-space SQP algorithm.

We develop a numerically stable scheme for optimal boundary control of the two-phase Stefan problem in enthalpy formulation. The scheme integrates a cell-centered spatial discretization with homogeneous Neumann boundary conditions and a Kronecker-sum Laplacian, a temperature-linearized semismooth Newton method whose linear systems are strictly symmetric positive definite (SPD), an SPD-transformed discrete adjoint obtained via a diagonal congruence that enables efficient solution by Conjugate Gradients (CG) for backward sweeps, and a reduced-space Sequential Quadratic Programming (SQP) algorithm that incorporates preconditioning, an Armijo line search, and checkpointing. In addition, we provide expanded numerical validation, including convergence behavior, sensitivity to the interface slope, and performance comparisons.

We consider a unit square

Ω = {[0, 1]}^{2}

discretized by a uniform Cartesian grid with

n \times n

cells and spacing

Δ x = 1 / n

. Shifted temperature is represented as

β (E)

and evaluated at cell centers (the stored primary unknown is

E_{i j}

).

Let I denote the

n \times n

identity and h the standard one-dimensional second-difference operator with zero normal flux:

h = n^{2} \cdot (\begin{matrix} 1 & - 1 & 0 & \dots & 0 \\ - 1 & 2 & - 1 & \dots & 0 \\ 0 & ⋱ & ⋱ & ⋱ & ⋮ \\ ⋮ & ⋱ & - 1 & 2 & - 1 \\ 0 & \dots & 0 & - 1 & 1 \end{matrix}), n^{2} = Δ x^{- 2} .

This h is the negative discrete Laplacian (

h = - Δ_{h, 1 D}

) with homogeneous Neumann boundary conditions; hence, it is symmetric positive semidefinite. The two-dimensional negative discrete Laplacian with homogeneous Neumann boundary conditions is the Kronecker sum

H = h \otimes I + I \otimes h (H : = - Δ_{h}) .

(50)

By construction, H is symmetric positive semidefinite (SPSD) and has a one-dimensional nullspace of constants under Neumann boundary conditions.
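The construction in (50) can be reproduced with a few lines of dense, standard-library Python. This is a minimal sketch for small grids only; the helper names `neumann_1d`, `kron`, and `kron_sum` are hypothetical, and a practical code would use sparse storage.

```python
def neumann_1d(n):
    """1D negative second-difference matrix with zero normal flux, scaled by n^2."""
    scale = float(n * n)  # n^2 = dx^-2 on the unit interval
    h = [[0.0] * n for _ in range(n)]
    for i in range(n):
        if i > 0:
            h[i][i - 1] = -scale
            h[i][i] += scale
        if i < n - 1:
            h[i][i + 1] = -scale
            h[i][i] += scale
    return h


def kron(a, b):
    """Kronecker product of two dense square matrices (nested lists)."""
    na, nb = len(a), len(b)
    out = [[0.0] * (na * nb) for _ in range(na * nb)]
    for i in range(na):
        for j in range(na):
            if a[i][j] == 0.0:
                continue
            for k in range(nb):
                for l in range(nb):
                    out[i * nb + k][j * nb + l] = a[i][j] * b[k][l]
    return out


def kron_sum(h):
    """H = h (x) I + I (x) h, the 2D negative Neumann Laplacian of (50)."""
    n = len(h)
    eye = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    left, right = kron(h, eye), kron(eye, h)
    size = n * n
    return [[left[i][j] + right[i][j] for j in range(size)] for i in range(size)]


H = kron_sum(neumann_1d(4))  # 16 x 16 operator on a 4 x 4 grid
```

Every row of the resulting matrix sums to zero, which reflects the nullspace of constants, and the matrix is symmetric because both Kronecker factors are.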

Let

Γ_{c}

be the central segments of the top and bottom boundaries. The Neumann control g acts on

Γ_{c}

. We define the discrete boundary pairing

{(g^{n}, β (E))}_{Γ} = Δ s \sum_{ξ \in Γ_{c}} β (E (ξ)) g^{n} (ξ), Δ s = Δ x

and absorb

Δ s

into the sparse injection operator B so that

B g^{n}

consistently maps boundary-flux degrees of freedom into the state equation, and the control norm is grid-independent.

Figure 3 provides intuition regarding the geometry and the effect of boundary fluxes on the shifted temperature field. Related geometric/front-capturing ideas for Stefan problems date back to level-set solvers [23], but here we keep a fixed grid with enthalpy closure.

Figure 3. Snapshot at

t = 2.5

on

Ω

; control on

Γ_{c}

via g.

The grid is cell-centered so that face-normal fluxes from g pair naturally with shifted temperature unknowns; H is assembled as a Kronecker sum to preserve symmetry and sparsity patterns.

Using backward Euler with time step

Δ t

and (for clarity in this study) a constant thermal conductivity

κ

(taken as

κ_{s}

), the discrete enthalpy state equation is

\frac{E^{n} - E^{n - 1}}{Δ t} = - κ H β (E^{n}) + B g^{n} .

(51)

Here, and throughout, we adopt the convention

H = - Δ_{h}

, so that

- κ H β (E^{n}) = κ Δ_{h} β (E^{n})

matches the continuous diffusion term

κ Δ β (E)

, where the enthalpy–shifted temperature relation is

β (E) = \{\begin{matrix} \frac{1}{ρ c_{ℓ}} (E - ρ L), & E > ρ L, \\ 0, & 0 \leq E \leq ρ L, \\ \frac{1}{ρ c_{s}} E, & E < 0 . \end{matrix}

(52)

Equation (51) is the backward Euler discretization of the enthalpy balance: the change in enthalpy over

Δ t

equals diffusion (via

- H β (E^{n})

) plus the imposed boundary flux

B g^{n}

. The piecewise-linear

β

in (52) encodes liquid/solid/mushy behavior. The analysis here assumes exact knowledge of material parameters such as

ρ

,

c_{s}

,

c_{ℓ}

, and L. In practice, these may vary, but prior studies (e.g., [8]) show that boundary-controlled Stefan systems remain stable under bounded parameter perturbations. Extending the framework to explicitly handle uncertainty through robust or adaptive control is left for future work.

The framework extends to

κ (β (E))

provided the variable-coefficient operator

- \nabla \cdot (κ (β (E)) \nabla β (E))

is discretized symmetrically (e.g., face-centered harmonic/centered averaging). In particular, we discretize the SPD form

- \nabla \cdot (κ \nabla \cdot)

; when

κ

is constant, this reduces to

- κ Δ_{h}

and is represented by

κ H

. This preserves the symmetry of the linearized operators, although the conditioning changes and may call for stronger preconditioners.

In the interface interval

0 \leq E \leq ρ L

, we regularize with a small slope

β^{'} (E) = ε_{β} \in [10^{- 6}, 10^{- 4}],

which clamps

θ

near 0 while ensuring semismooth differentiability. Unless stated otherwise,

ε_{β} = 10^{- 5}

; we provide a sensitivity study below.
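A minimal sketch of the regularized relation and its slope is given below. The function names and the default material parameters (all set to one) are placeholders. The offset added in the liquid branch, which keeps the regularized map continuous at the upper end of the interface interval, is an assumption of this sketch and slightly shifts the liquid branch of (52).

```python
def beta_reg(E, rho=1.0, c_s=1.0, c_l=1.0, L=1.0, eps_beta=1e-5):
    """Shifted temperature from enthalpy with a small slope in the mushy zone."""
    if E < 0.0:                      # solid branch
        return E / (rho * c_s)
    if E <= rho * L:                 # interface interval, slope eps_beta
        return eps_beta * E
    # Liquid branch; the offset eps_beta * rho * L enforces continuity
    # (assumption of this sketch).
    return eps_beta * rho * L + (E - rho * L) / (rho * c_l)


def dbeta_reg(E, rho=1.0, c_s=1.0, c_l=1.0, L=1.0, eps_beta=1e-5):
    """Slope of beta_reg; these values form the diagonal of Q."""
    if E < 0.0:
        return 1.0 / (rho * c_s)
    if E <= rho * L:
        return eps_beta
    return 1.0 / (rho * c_l)
```

The slope is strictly positive everywhere, so the diagonal matrix built from it is invertible, at the price of entries of size 1/ε_β in its inverse.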

We initialize with a central solid disk at 25 °C (radius 0.2) surrounded by liquid at 33 °C. To reduce grid-scale noise, we smooth by solving

(I + ε H) β (E) = β_{raw}

, and then we set

E^{0} = β^{- 1} (β (E)) .

Linearizing in shifted temperature

β (E)

improves symmetry and conditioning. Define

Q = diag (β^{'} (E))

. At Newton iteration k, we solve for the shifted temperature increment:

\begin{matrix} (\frac{Q^{- 1}}{Δ t} + κ H) δ β & = - (\frac{E_{k - 1}^{n} - E^{n - 1}}{Δ t} + κ H β (E_{k - 1}^{n}) - B g^{n}), \end{matrix}

(53)

\begin{matrix} δ E & = Q^{- 1} δ β, \end{matrix}

(54)

\begin{matrix} E_{k}^{n} & = E_{k - 1}^{n} + δ E . \end{matrix}

(55)

Since

Q^{- 1} ≻ 0

and

H ⪰ 0

, the coefficient matrix in (53) is strictly SPD. We use CG with Jacobi preconditioning

M_{fwd} = diag (Q^{- 1} / Δ t + κ \cdot diag (H)),

tolerance

10^{- 10}

, and a maximum of 100 iterations.
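The linear solve in (53) and the update (54) can be sketched as follows. For brevity, the sketch works in one space dimension with the matrix-free operator h in place of the Kronecker sum H; the names `apply_h`, `diag_h`, `pcg`, and `newton_increment` are hypothetical, and the code is an illustration under these assumptions rather than the solver used for the reported experiments.

```python
import math


def apply_h(v, scale):
    """Apply the 1D Neumann negative Laplacian (scale = dx^-2) to v."""
    n = len(v)
    out = [0.0] * n
    for i in range(n):
        if i > 0:
            out[i] += scale * (v[i] - v[i - 1])
        if i < n - 1:
            out[i] += scale * (v[i] - v[i + 1])
    return out


def diag_h(n, scale):
    """Diagonal of the 1D operator: one neighbour at the ends, two inside."""
    return [scale * ((i > 0) + (i < n - 1)) for i in range(n)]


def pcg(matvec, b, m_diag, tol=1e-10, max_iter=100):
    """Jacobi-preconditioned conjugate gradients for an SPD system."""
    x = [0.0] * len(b)
    r = list(b)
    z = [ri / di for ri, di in zip(r, m_diag)]
    p = list(z)
    rz = sum(ri * zi for ri, zi in zip(r, z))
    b_norm = math.sqrt(sum(bi * bi for bi in b)) or 1.0
    for _ in range(max_iter):
        if math.sqrt(sum(ri * ri for ri in r)) <= tol * b_norm:
            break
        ap = matvec(p)
        step = rz / sum(pi * api for pi, api in zip(p, ap))
        x = [xi + step * pi for xi, pi in zip(x, p)]
        r = [ri - step * api for ri, api in zip(r, ap)]
        z = [ri / di for ri, di in zip(r, m_diag)]
        rz_new = sum(ri * zi for ri, zi in zip(r, z))
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x


def newton_increment(residual, q, dt, kappa, scale):
    """Solve (Q^{-1}/dt + kappa*h) d_beta = -F, then d_E = Q^{-1} d_beta."""
    n = len(q)

    def matvec(v):
        hv = apply_h(v, scale)
        return [v[i] / (q[i] * dt) + kappa * hv[i] for i in range(n)]

    dh = diag_h(n, scale)
    m_diag = [1.0 / (q[i] * dt) + kappa * dh[i] for i in range(n)]
    d_beta = pcg(matvec, [-f for f in residual], m_diag)
    d_e = [d_beta[i] / q[i] for i in range(n)]
    return d_beta, d_e
```

The Jacobi preconditioner is the exact diagonal of the system matrix, so cells in the interface interval, where the inverse slope is very large, are scaled back to order one before the CG iteration sees them.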

We use the residual merit

ϕ (E) = \frac{1}{2} {∥ F (E) ∥}_{2}^{2}

with Armijo backtracking: choose

η = β^{m}

,

m = 0, 1, \dots

with

β = 0.5

,

c_{1} = 10^{- 4}

until

ϕ (E + η Δ E) \leq ϕ (E) + c_{1} η \nabla ϕ {(E)}^{⊤} Δ E, \nabla ϕ (E) = J {(E)}^{⊤} F (E),

where

J (E)

is the Jacobian of F. In practice,

m \leq 10

.
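The backtracking rule above can be written compactly as follows. This is a generic sketch: the function name `armijo_step` and its argument names are hypothetical, and the directional derivative is passed in precomputed. For a Newton direction on the residual merit, that derivative equals minus the squared residual norm.

```python
def armijo_step(phi, E, dE, slope, c1=1e-4, shrink=0.5, max_halvings=10):
    """Return eta = shrink**m with phi(E + eta*dE) <= phi(E) + c1*eta*slope.

    `slope` is the directional derivative grad(phi)(E)^T dE, which must be
    negative for a descent direction. If no trial step is accepted within
    max_halvings reductions, the smallest step tried is returned.
    """
    phi0 = phi(E)
    eta = 1.0
    for m in range(max_halvings + 1):
        eta = shrink ** m
        trial = [e + eta * d for e, d in zip(E, dE)]
        if phi(trial) <= phi0 + c1 * eta * slope:
            return eta
    return eta
```

For a well-scaled Newton direction the full step is typically accepted at once, so the line search only costs extra residual evaluations when the iterate crosses a phase boundary.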

We warm-start each time step with

E_{0}^{n} = E^{n - 1}

for

n > 1

. The Newton iteration stops when

\frac{∥ F (E_{k}^{n}) ∥_{2}}{∥ F (E_{0}^{n}) ∥_{2}} \leq 10^{- 8} and \frac{{∥ δ β ∥}_{2}}{∥ β (E_{k - 1}^{n}) ∥_{2}} \leq 10^{- 10} .

Now, we minimize the reduced cost

\begin{matrix} J (g) = \frac{γ}{2} {∥ E (T) - E_{T} ∥}_{W}^{2} + \frac{α}{2} \sum_{n = 1}^{N_{t}} Δ t {∥ g^{n} ∥}_{2}^{2} . \end{matrix}

(56)

All discrete

L^{2}

norms in the objective are implemented with standard quadrature weights. For the domain we use

{∥ E ∥}_{W}^{2} : = Δ x^{d} \sum_{i} E_{i}^{2}

, and for the boundary control,

∥ g^{n} ∥_{2}^{2}

denotes

Δ s \sum_{j} {(g_{j}^{n})}^{2}

, where

Δ s = Δ x

on

Γ_{c}

. These weighted norms converge to their continuous

L^{2}

counterparts under mesh refinement and are consistent with the theory established in Section 3.
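For concreteness, the weighted norms entering (56) can be evaluated as in the following sketch. The function name `reduced_cost` and the flat-list data layout (one list of cell values, one list of boundary values per time step) are assumptions made for illustration.

```python
def reduced_cost(E_final, E_target, g_steps, dt, dx, gamma, alpha, dim=2):
    """Discrete objective (56) with quadrature-weighted L2 norms.

    E_final, E_target : cell values at the final time (flat lists)
    g_steps           : one list of boundary-flux values per time step
    """
    cell_weight = dx ** dim          # domain quadrature weight dx^d
    ds = dx                          # boundary quadrature weight on Gamma_c
    terminal = 0.5 * gamma * cell_weight * sum(
        (e - t) ** 2 for e, t in zip(E_final, E_target)
    )
    control = 0.5 * alpha * sum(
        dt * ds * sum(gj * gj for gj in g_n) for g_n in g_steps
    )
    return terminal + control
```

Because both sums carry their quadrature weights, refining the grid while sampling the same continuous fields leaves the value of the objective approximately unchanged, which is what makes the choice of the two weighting parameters transferable between resolutions.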

The parameters

α

(control penalty) and

γ

(terminal tracking weight) do not admit a universal optimal value; they determine the trade-off between control effort and tracking accuracy. Increasing the ratio

γ / α

improves final-time matching but yields stronger boundary fluxes. The forward residual is

F^{n} (E^{n}, E^{n - 1}; g^{n}) = \frac{E^{n} - E^{n - 1}}{Δ t} + κ H β (E^{n}) - B g^{n} .

Figure 4 and Figure 5 provide visual anchors for the tracking term and transient behavior.

Figure 4. Target state

E_{T}

at

t = T

, obtained from a full-horizon simulation with constant boundary control

g = 5

.

Figure 5. Enthalpy field at the half-horizon

t = T / 2

using the computed optimal control

g^{*}

.

The standard adjoint in enthalpy variables is nonsymmetric. Therefore, we introduce the diagonal change of variables

{\tilde{p}}^{n} = {(Q^{n})}^{1 / 2} p^{n}

to obtain the symmetric system

(\frac{I}{Δ t} + κ {(Q^{n})}^{1 / 2} H {(Q^{n})}^{1 / 2}) {\tilde{p}}^{n} = \frac{1}{Δ t} {\tilde{p}}^{n + 1}, {\tilde{p}}^{N} = {(Q^{N})}^{1 / 2} γ W (E^{N} - E_{T}) .

(57)

Since H is SPSD (by our finite-difference discretization) and

{(Q^{n})}^{1 / 2}

is diagonal positive, the congruence

{(Q^{n})}^{1 / 2} H {(Q^{n})}^{1 / 2}

remains SPSD. Therefore, adding

I / Δ t ≻ 0

produces an SPD system. We note that this argument relies on our chosen symmetric discretization of the Laplacian; a nonsymmetric scheme would not generally guarantee symmetry. We solve (57) by CG with Jacobi preconditioner

M_{adj} = diag (I / Δ t + κ \cdot (Q^{n}) diag (H)),

which is algebraically equivalent to

diag ({(Q^{n})}^{1 / 2} H {(Q^{n})}^{1 / 2})

, since

Q^{n}

is diagonal, but is cheaper to assemble. The computed optimal flux often begins with a short cooling phase to stabilize the solid core, followed by heating that accelerates melting near the controlled boundary layers; this behavior is consistent with previous optimal-control studies of Stefan systems [5] and is illustrated in Figure 6.

Figure 6. Spatiotemporal distribution of

g^{*} (x, t)

on

Γ_{c} \times [0, T / 2]

(initial cooling, then heating).
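The Jacobi-preconditioned CG solve of the symmetrized adjoint system (57) can be sketched in a few lines. The following is a minimal, dependency-free illustration in one space dimension, not the authors' implementation: the Neumann finite-difference Laplacian in `apply_H`, the function names, and all parameter values are assumptions made only for this example. The matrix is applied in matrix-free form as v/Δt + κ Q^{1/2} H Q^{1/2} v, and the preconditioner is the diagonal 1/Δt + κ Q_{ii} H_{ii}.

```python
import math

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def apply_H(v, h):
    """Apply a 1D Neumann finite-difference Laplacian (symmetric, PSD). Assumed stencil."""
    m = len(v)
    out = []
    for i in range(m):
        left = v[i] - v[i - 1] if i > 0 else 0.0
        right = v[i] - v[i + 1] if i < m - 1 else 0.0
        out.append((left + right) / h**2)
    return out

def diag_H(m, h):
    """Diagonal of the same Laplacian: one neighbour at the ends, two inside."""
    return [((i > 0) + (i < m - 1)) / h**2 for i in range(m)]

def apply_A(v, q, dt, kappa, h):
    """Matrix-free product A v = v/dt + kappa * Q^{1/2} H Q^{1/2} v, Q = diag(q)."""
    s = [math.sqrt(qi) for qi in q]
    Hv = apply_H([si * vi for si, vi in zip(s, v)], h)
    return [vi / dt + kappa * si * hi for vi, si, hi in zip(v, s, Hv)]

def jacobi_pcg(q, rhs, dt, kappa, h, tol=1e-10, max_iter=500):
    """Preconditioned CG for A x = rhs with the diagonal preconditioner diag(A)."""
    m = len(rhs)
    d = [1.0 / dt + kappa * qi * hii for qi, hii in zip(q, diag_H(m, h))]
    x = [0.0] * m
    r = list(rhs)                      # residual for the zero initial guess
    z = [ri / di for ri, di in zip(r, d)]
    p = list(z)
    rz = dot(r, z)
    rhs_norm = math.sqrt(dot(rhs, rhs))
    for k in range(max_iter):
        if math.sqrt(dot(r, r)) <= tol * rhs_norm:
            return x, k
        Ap = apply_A(p, q, dt, kappa, h)
        step = rz / dot(p, Ap)         # positive because A is SPD
        x = [xi + step * pi for xi, pi in zip(x, p)]
        r = [ri - step * ai for ri, ai in zip(r, Ap)]
        z = [ri / di for ri, di in zip(r, d)]
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter

if __name__ == "__main__":
    m = 40
    q = [1.0 if i < m // 2 else 1e-5 for i in range(m)]   # two "phases" of Q
    rhs = [math.sin(0.3 * i) for i in range(m)]
    x, iters = jacobi_pcg(q, rhs, dt=0.1, kappa=1.0, h=1.0 / (m - 1))
    print("CG iterations:", iters)
```

Working with the symmetrized operator is what allows plain CG here; applying the same loop to the nonsymmetric adjoint in the original variables would not be justified.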

From

\partial F^{n} / \partial g^{n} = - B

, the reduced gradient is

\nabla J (g) (t_{n}) = α Δ t g^{n} - B^{⊤} p^{n},

(58)

which matches the discrete optimality condition

α g - B^{⊤} p = 0

on

Σ_{c}

. Here, the operator B already incorporates the boundary quadrature weight

Δ s

, so that the discrete boundary pairing corresponds to the continuous

L^{2} (Γ_{c})

inner product.

We employ a reduced-space line-search SQP method. Each outer iteration comprises one forward semismooth Newton solve, one adjoint sweep, the gradient evaluation (58), one CG solve for the reduced quadratic model, and an Armijo line search. Each Hessian–vector product requires one tangent (forward-linearized, SPD) solve and one adjoint sensitivity (backward-linearized, SPD) solve. The method is detailed in Algorithm 1.

Algorithm 1 Reduced-space SQP with SPD forward/adjoint/tangent solves and checkpointing

Require: Initial $g^{(0)} (t)$; tolerances ${tol}_{N} = 10^{- 8}$, ${tol}_{G} = 10^{- 6}$; Armijo parameters $c_{1} = 10^{- 4}$, $β = 0.5$; checkpoint interval C
1: for $j = 0, 1, 2, \dots$ do
2:   Forward (nonlinear): Solve (51) by semismooth Newton in $β (E)$ using (53) (CG with Jacobi preconditioning); store checkpoints $(E_{n}^{(j)}, Q_{n})$ every C steps.
3:   Adjoint (SPD): Initialize ${\tilde{p}}_{N} = {(Q_{N})}^{1 / 2} γ W (E_{N}^{(j)} - E_{T})$; for $n = N - 1$ down to $1$, solve (57) by CG (Jacobi). If a state at time n is not checkpointed, replay forward from the nearest prior checkpoint to reconstruct $(E_{n}, Q_{n})$ exactly.
4:   Gradient: $\nabla J (g^{(j)}) (t_{n}) = α Δ t g^{(j)} (t_{n}) - B^{⊤} p_{n}$.
5:   Search direction: Approximately solve $H_{red} d = - \nabla J$ by PCG (tol $10^{- 8}$). Each $H_{red} v$ uses one SPD tangent and one SPD adjoint sensitivity solve; reuse preconditioners.
6:   Line search: $g^{(j + 1)} = g^{(j)} + η d$, with $η = β^{m}$ until sufficient decrease in the reduced cost.
7:   if $∥ \nabla J (g^{(j + 1)}) ∥_{2} \leq {tol}_{G}$ then break
8:   end if
9: end for
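The backtracking rule in step 6 of Algorithm 1 can be sketched as follows. This is a minimal illustration under stated assumptions: `cost` stands in for the reduced cost (which in the actual method requires a forward solve per evaluation), the control is a plain list of floats, and the function name and the cap `max_backtracks` are hypothetical. The defaults for `c1` and `beta` are the Armijo parameters listed in the algorithm.

```python
def armijo_step(cost, g, grad, d, c1=1e-4, beta=0.5, max_backtracks=30):
    """Return (eta, g_new) with eta = beta**m, the first power satisfying
    cost(g + eta*d) <= cost(g) + c1 * eta * <grad, d>."""
    j0 = cost(g)
    slope = sum(gi * di for gi, di in zip(grad, d))
    if slope >= 0.0:
        raise ValueError("d is not a descent direction")
    eta = 1.0
    for _ in range(max_backtracks):
        trial = [gi + eta * di for gi, di in zip(g, d)]
        if cost(trial) <= j0 + c1 * eta * slope:
            return eta, trial
        eta *= beta                    # shrink the step and try again
    raise RuntimeError("Armijo line search did not find an acceptable step")

if __name__ == "__main__":
    # Toy quadratic standing in for the reduced cost (assumption for the demo).
    weights = [1.0, 10.0]
    cost = lambda g: 0.5 * sum(w * gi**2 for w, gi in zip(weights, g))
    g = [1.0, 1.0]
    grad = [w * gi for w, gi in zip(weights, g)]
    d = [-gr for gr in grad]           # steepest descent direction
    eta, g_new = armijo_step(cost, g, grad, d)
    print("accepted step:", eta, "new cost:", cost(g_new))
```

Because every trial step costs one evaluation of the reduced objective, a good search direction from step 5 (accepted at η = 1) keeps the line search cheap.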

The overall complexity of our approach is dominated by the Krylov solves. For a 2D grid with

N = n^{2}

spatial degrees of freedom (N here counts grid unknowns and is distinct from the final time index in (57)) and

N_{t}

time steps, each CG iteration costs

O (N)

due to the sparsity of H. For a fixed tolerance, CG converges in

O (\sqrt{κ})

iterations, where

κ

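is the spectral condition number of the preconditioned system matrix (not the coefficient κ appearing in the forward residual and in (57)). For our Jacobi-preconditioned systems,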

κ = O (n^{2})

leading to

O (n)

CG iterations per solve. Thus, each Newton step costs

O (n^{3}) = O (N^{3 / 2})

operations. The adjoint and sensitivity solves have similar complexity. The overall SQP method typically requires about 10–15 outer iterations, yielding a total complexity of roughly

O (10 \cdot N_{t} \cdot N^{3 / 2})

for the complete optimal control solution. This scaling compares favorably with dense direct factorization (

O (N^{3})

) makes our approach suitable for moderate-scale 2D problems and provides a foundation for 3D extensions with stronger preconditioners.

Figure 7 shows quadratic convergence in early Newton iterations (semismooth regime), transitioning to linear once globalization and tolerances dominate. In the SQP loop, gradient norms typically drop by several orders within ∼10–15 iterations.

Figure 7. Convergence of the SQP algorithm: norm of reduced gradient vs. iteration.

Figure 8 illustrates the trade-off: smaller

α

improves tracking but can induce oscillatory controls; larger

α

smooths controls at the cost of terminal error. We adopt

α = 10^{- 3}

.

Figure 8. Effect of regularization parameter

α

on control performance (effort vs. tracking error).
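As a rough illustration of this trade-off, consider a scalar analogue that is not part of the model above: a single control value g, a hypothetical linear response a g with a ≠ 0, and a target y. The symbols a and y are assumptions introduced only for this sketch. Minimizing

j (g) = \frac{γ}{2} {(a g - y)}^{2} + \frac{α}{2} g^{2}

gives, from the stationarity condition γ a (a g - y) + α g = 0,

g^{*} = \frac{a y}{a^{2} + α / γ}, \qquad a g^{*} - y = - \frac{(α / γ) y}{a^{2} + α / γ} .

In this toy setting only the ratio α / γ matters: as it tends to zero, the tracking residual vanishes while the control magnitude grows toward | y / a |; as it increases, the control shrinks and the residual approaches - y.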

Refining to

n = 150

and

Δ t = 0.05

alters the objective by <1.5% and shifts the interface by <2 cells, indicating adequate resolution for

n = 100

,

Δ t = 0.1

.

Varying

ε_{β} \in \{10^{- 6}, 10^{- 5}, 10^{- 4}\}

: smaller values tighten the plateau near 0 and generally increase CG iterations (stiffer

Q^{- 1}

), while larger values reduce the number of Newton iterations but can bias the interface location. The smoothing parameter was set to

ε_{β} = 10^{- 5}

, as sensitivity tests indicated that, among the values tested, it offers the best trade-off between interface sharpness and numerical stability: smaller values cause ill-conditioning, while larger ones overly diffuse the phase interface.
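The role of the smoothing parameter can be made concrete with a small sketch. It assumes one possible regularization, a piecewise-linear β with slope ε_β on the plateau 0 ≤ E ≤ L; this particular form, the function names, and the latent-heat value `latent` are illustrative assumptions and need not coincide with the regularization defined earlier in the paper. The diagonal of Q is taken as the derivative of β, so its entries range between ε_β and 1 and the ratio of largest to smallest entry is 1/ε_β.

```python
def beta_smooth(E, latent=1.0, eps=1e-5):
    """Assumed piecewise-linear regularization of the inverse enthalpy graph."""
    if E < 0.0:
        return E                       # solid branch, unit slope
    if E <= latent:
        return eps * E                 # plateau, small slope eps instead of 0
    return eps * latent + (E - latent) # liquid branch, unit slope

def beta_prime(E, latent=1.0, eps=1e-5):
    """Derivative of beta_smooth; these values form the diagonal of Q."""
    return eps if 0.0 <= E <= latent else 1.0

def q_ratio(E_values, latent=1.0, eps=1e-5):
    """Ratio of largest to smallest diagonal entry of Q for the given states."""
    q = [beta_prime(E, latent, eps) for E in E_values]
    return max(q) / min(q)

if __name__ == "__main__":
    states = [-0.5, 0.5, 1.5]          # solid, mushy, and liquid sample values
    for eps in (1e-6, 1e-5, 1e-4):
        print(eps, q_ratio(states, eps=eps))
```

Under this assumption, decreasing ε_β by a factor of ten widens the spread of the entries of Q by the same factor, which is one way to read the increase in CG iterations reported above.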

In our robustness tests, the method also converged for discontinuous initial data (sharp solid/liquid boundary), large time steps (

Δ t = 0.5

), and poor Newton initial guesses (50% perturbation).

Table 1 compares the optimal control to a constant-control baseline; the optimal control reduces the relative final-state error from 2.15% to 0.82% with nearly identical total effort (12.45 vs. 12.50), at the price of a larger peak magnitude (7.21 vs. 5.00), by leveraging temporal modulation of the boundary flux. Table 2 reports spectral condition numbers

κ (A)

for the two SPD linear systems used in our solver: the forward Newton step (53) and the adjoint system (57), evaluated on a

100 \times 100

grid (

n = 100

). Applying diagonal (Jacobi) preconditioning reduces

κ

by about three orders of magnitude (forward:

1.2 \times 10^{6} \to 4.3 \times 10^{2}

; adjoint:

8.7 \times 10^{5} \to 3.1 \times 10^{2}

). This explains the observed efficiency of our CG-based solves and justifies the choice of Jacobi as a cheap yet effective preconditioner for the SPD formulations.
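As a rough check on these numbers, the classical worst-case CG estimate can be evaluated for the condition numbers of Table 2: reducing the energy-norm error by a factor ε requires at most about ½ √κ ln(2/ε) iterations. This is a textbook bound, not a measurement from the experiments; the function name and the tolerance 10^{-8} are assumptions chosen for illustration, and `cond` denotes the spectral condition number rather than the coefficient κ in (57).

```python
import math

def cg_iteration_bound(cond, rel_tol=1e-8):
    """Worst-case CG iteration count from the bound
    ||e_k||_A <= 2 * ((sqrt(cond) - 1) / (sqrt(cond) + 1))**k * ||e_0||_A."""
    return math.ceil(0.5 * math.sqrt(cond) * math.log(2.0 / rel_tol))

if __name__ == "__main__":
    # Condition numbers as reported in Table 2 (n = 100).
    table2 = {
        "Forward Newton": (1.2e6, 4.3e2),
        "Adjoint (SPD)": (8.7e5, 3.1e2),
    }
    for name, (plain, jacobi) in table2.items():
        print(name, cg_iteration_bound(plain), cg_iteration_bound(jacobi))
```

Since the bound scales with the square root of the condition number, a reduction by three orders of magnitude corresponds to roughly a thirtyfold smaller worst-case iteration count; observed counts are usually lower than the bound.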

Table 1. Performance comparison between optimal-control and constant-control strategies.

Table 2. Illustrative condition numbers for SPD systems (

n = 100

) with/without Jacobi.

5. Conclusions

In this work, we developed a comprehensive theory–algorithm scheme for optimal boundary control of the solidification process governed by the Stefan problem. On the analytical side, we formulated an enthalpy-based variational model and proved well-posedness: existence of weak solutions via a priori estimates and the Aubin–Lions compactness lemma, together with Lipschitz stability with respect to boundary data and initial conditions. Building on this, we posed a tracking-type optimal control problem for the interface and established the existence of minimizers by coercivity, compactness of the control-to-state map, and weak lower semicontinuity. First-order necessary conditions were derived through a Lagrangian formulation, yielding a coupled state–adjoint system.

On the computational side, we designed a structure-preserving discretization and an optimization method that make all major linear subproblems symmetric positive definite, enabling efficient preconditioned CG within a reduced-space SQP setting. Numerical experiments validated the analysis, demonstrating accurate tracking, robust convergence across a wide range of parameters, and competitive cost relative to standard reference strategies.

The foundation developed in this paper opens several directions for further research and applications. Our methodology can be extended beyond the classical Stefan model. From a mathematical perspective, several open problems remain. One direction is to extend our analysis to include convection effects or anisotropic thermal conductivities, both relevant in practical settings [11]. Another direction is the incorporation of state or control constraints beyond simple bounds, such as requiring the interface to remain within prescribed regions. Finally, the development of scalable algorithms for three-dimensional problems with realistic material parameters is an important challenge [19].

Author Contributions

Conceptualization, K.A.A.; Methodology, K.A.A.; Software, K.A.A.; Validation, K.A.A. and J.S.; Formal Analysis, K.A.A. and J.S.; Investigation, K.A.A. and J.S.; Resources, K.A.A.; Writing—Original Draft Preparation, K.A.A.; Writing—Review and Editing, K.A.A. and J.S.; Visualization, K.A.A. and J.S.; Supervision, K.A.A.; Project Administration, K.A.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors are very grateful to the editors and the anonymous referees for their constructive and valuable comments and recommendations, which have greatly contributed to improving the quality and clarity of this paper.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

References

1. Bermúdez, A.; Otero, M.V. An existence result for a two-phase Stefan problem arising in metal casting. Math. Methods Appl. Sci. 2006, 29, 325–350.
2. Hill, J.M.; Wu, Y.-H. On a nonlinear Stefan problem arising in the continuous casting of steel. Acta Mech. 1994, 107, 183–198.
3. White, R.E. An enthalpy formulation of the Stefan problem. SIAM J. Numer. Anal. 1982, 19, 1129–1157.
4. Voller, V.; Cross, M. Accurate solutions of moving boundary problems using the enthalpy method. Int. J. Heat Mass Transf. 1981, 24, 545–556.
5. Hinze, M.; Ziegenbalg, S. Optimal control of the free boundary in a two-phase Stefan problem. J. Comput. Phys. 2007, 223, 657–684.
6. Pawłow, I. Optimal control of solutions of parabolic variational inequalities. Numer. Math. 1987, 51, 271–288.
7. Koga, S.; Diagne, M.; Krstić, M. Backstepping control of the one-phase Stefan problem. Automatica 2019, 100, 182–192.
8. Koga, S.; Krstić, M. Control of two-phase Stefan problems via backstepping and energy shaping. Syst. Control Lett. 2019, 123, 144–151.
9. Liang, X.; Qi, Q.; Zhang, H.; Xie, L. Decentralized Control for Networked Control Systems With Asymmetric Information. IEEE Trans. Autom. Control 2022, 67, 2076–2083.
10. Liang, X.; Xu, J. Control for Networked Control Systems with Remote and Local Controllers over Unreliable Communication Channel. Automatica 2018, 98, 86–94.
11. Tarwidi, D.; Pudjaprasetya, S.R. Godunov method for Stefan problems with enthalpy formulations. East Asian J. Appl. Math. 2013, 3, 107–119.
12. Azaïez, M.; Jelassi, F.; Brahim, M.; Shen, J. Two-phase Stefan problem with smoothed enthalpy. Commun. Math. Sci. 2016, 14, 1625–1641.
13. Damlamian, A. Some results on the multi-phase Stefan problem. Commun. Partial Differ. Equ. 1977, 2, 1017–1044.
14. Crank, J. Free and Moving Boundary Problems; Clarendon Press: Oxford, UK, 1984.
15. Elliott, C.M. The Cahn–Hilliard model for the kinetics of phase separation. In Mathematical Models for Phase Change Problems; Birkhäuser: Basel, Switzerland, 1989; pp. 35–73.
16. Aubin, J.-P. Un théorème de compacité. C. R. Acad. Sci. 1963, 256, 5012–5014.
17. Zheng, S. Nonlinear Evolution Equations; CRC Press: Boca Raton, FL, USA, 2004.
18. Simon, J. Compact sets in the space L^p(0, T; B). Ann. Mat. Pura Appl. 1986, 146, 65–96.
19. Ito, K.; Kunisch, K. Semi-smooth Newton methods for state-constrained optimal control problems. Syst. Control Lett. 2003, 50, 221–228.
20. Bernauer, M.K.; Herzog, R. Optimal control of the classical two-phase Stefan problem in level set formulation. SIAM J. Sci. Comput. 2011, 33, 342–363.
21. Ito, K.; Kunisch, K. Lagrange Multiplier Approach to Variational Problems and Applications; SIAM: Philadelphia, PA, USA, 2008.
22. Covei, D.P. A nonlinear diffusion equation of the Gurtin–MacCamy type: Existence, uniqueness, and numerical simulations. arXiv 2025, arXiv:2504.19823.
23. Chen, S.; Merriman, B.; Osher, S.; Smereka, P. A simple level set method for solving Stefan problems. J. Comput. Phys. 1997, 135, 8–29.

Figure 1. The enthalpy set-valued graph

γ (T)

.

Figure 2. The inverse enthalpy graph

β (E)

.

Figure 3. Snapshot at

t = 2.5

on

Ω

; control on

Γ_{c}

via g.

Table 1. Performance comparison between optimal-control and constant-control strategies.

Metric	Optimal Control	Constant Control
Final-State Error ( $∥ E (T) - E_{T} ∥ / ∥ E_{T} ∥$ )	0.82%	2.15%
Total Control Effort ( $\int_{0}^{T} {∥ g (t) ∥}^{2} d t$ )	12.45	12.50
Maximum Control Magnitude ( ${max}_{t} \| g (t) \|$ )	7.21	5.00

Table 2. Illustrative condition numbers for SPD systems (

n = 100

) with/without Jacobi.

System Type	Without Preconditioning	With Jacobi
Forward Newton	$1.2 \times 10^{6}$	$4.3 \times 10^{2}$
Adjoint (SPD)	$8.7 \times 10^{5}$	$3.1 \times 10^{2}$

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
