A Unified Variational Principle for Reliable Machine Learning

Velasco, Jose Manuel; Gonzalez-Perez, Beatriz

doi:10.3390/math14111994

Open AccessFeature PaperArticle

A Unified Variational Principle for Reliable Machine Learning

by

Jose Manuel Velasco

^1,*

and

Beatriz Gonzalez-Perez

²

¹

Department of Computer Architecture and Automation, Faculty of Computer Science, Universidad Complutense de Madrid, 28040 Madrid, Spain

²

Department of Statistics and Operational Research, Faculty of Mathematics, Universidad Complutense de Madrid, 28040 Madrid, Spain

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(11), 1994; https://doi.org/10.3390/math14111994

Submission received: 30 April 2026 / Revised: 26 May 2026 / Accepted: 1 June 2026 / Published: 4 June 2026

(This article belongs to the Special Issue Advanced Machine Learning Analysis and Application in Data Science)

Download

Browse Figures

Versions Notes

Abstract

Modern machine learning systems can achieve remarkable predictive performance. Nevertheless, in several fields, this is not enough to produce acceptable solutions as we need formal guarantees of robustness, fairness, and interpretability. Most existing approaches treat these properties separately or introduce them through external constraints, which makes their interaction difficult to analyze. In this work, we develop a unified variational perspective that incorporates these requirements directly into the learning objective. Concretely, we model learning as the minimization of a composite functional that combines predictive risk, regularization, and additional terms that capture robustness, fairness, and interpretability. This viewpoint allows us to study these properties within a single mathematical framework. Under standard assumptions, we prove the existence of minimizers and show that the resulting solutions are Pareto-optimal for the associated multi-objective problem. We illustrate the framework using examples based on adversarial and distributional robustness, statistical fairness criteria, and a notion of interpretability. The analysis points out the trade-offs that inevitably arise. We also examine statistical aspects of the proposed objective and show that classical generalization guarantees can still be obtained under appropriate conditions. The resulting framework provides a flexible basis for designing reliable learning systems.

Keywords:

variational methods; machine learning theory; functional analysis; multi-objective optimization; trustworthy AI

MSC:

Primary 49J45; Secondary 46N10; 90C26; 68T07; 62M45

1. Introduction

Modern machine learning systems are typically formulated through empirical or expected risk minimization. In this framework, predictors are optimized primarily to minimize prediction error, often together with regularization terms controlling model complexity. This paradigm has produced substantial empirical success across a wide range of applications. However, predictive accuracy alone is frequently insufficient in high-stakes settings. Systems deployed in healthcare, finance, public policy, and scientific research must also satisfy requirements related to robustness, fairness, transparency, and interpretability.

The rapid growth of large-scale datasets has significantly expanded the capabilities of deep neural networks. Highly overparameterized models can now learn complex hierarchical representations and capture subtle statistical dependencies that remain inaccessible in small-data regimes. This development has enabled major advances in computer vision, natural language processing, healthcare, finance, and scientific discovery. However, increased predictive performance does not necessarily imply reliability. Models with excellent test accuracy may still behave unpredictably under distributional shifts, exhibit systematic bias, or produce decisions that domain experts cannot meaningfully interpret.

These limitations become particularly significant in critical applications [1,2]. In healthcare, machine learning systems are increasingly used for diagnosis, prognosis, and treatment planning. Deep learning methods in medical imaging can achieve extremely high predictive performance [3]; nevertheless, their deployment in clinical practice remains limited by several structural difficulties. First, model predictions are often difficult to interpret, making it challenging for clinicians to determine whether decisions are medically justified. Second, state-of-the-art supervised approaches typically rely on large-scale pixel-level annotations, which are expensive and difficult to obtain in medical settings. This situation motivates the development of learning frameworks capable of incorporating structural constraints and prior information when fully supervised data are unavailable.

Similar concerns arise in finance [4,5], public policy [6], and criminal justice [7,8]. In the financial sector, machine learning models are widely used for credit scoring, fraud detection, risk assessment, and algorithmic trading. However, models trained on historical datasets may inherit or amplify pre-existing demographic and societal biases, resulting in systematically unequal outcomes across groups. Regulatory frameworks therefore increasingly require automated decisions to be explainable and auditable, limiting the deployment of opaque black-box systems. Comparable issues appear in criminal justice and public policy, where algorithmic systems influence sentencing, risk assessment, and resource allocation. In these settings, limited interpretability complicates accountability, external auditing, and legal oversight.

Robustness presents an additional challenge. Modern machine learning systems are often highly sensitive to small perturbations in the input data. In computer vision, adversarial perturbations can produce highly confident yet incorrect predictions, while in autonomous systems such sensitivity may lead to unsafe behavior under minor environmental changes. At the same time, interpretability becomes particularly important in scientific applications, where the objective extends beyond prediction to the extraction of mechanistic understanding. Machine learning models are increasingly used to identify patterns in physics, biology, climate science, and related disciplines. However, when predictive systems operate purely as black boxes without providing interpretable mechanisms, their scientific value becomes limited because they fail to generate insight into the underlying phenomena.

These observations suggest a broader limitation of standard machine learning formulations. In many existing approaches, robustness, fairness, and interpretability are introduced only after training, either as external constraints, post hoc corrections, or independent optimization objectives [1]. As a consequence, their interaction with predictive risk remains difficult to analyze systematically. The existing literature often studies predictive risk minimization, robustness, fairness, and interpretability within separate theoretical frameworks, despite the fact that these properties interact strongly in practical applications.

In this work, we argue that many of these difficulties arise because standard learning objectives are variationally under-constrained. From this perspective, the black-box behavior of modern machine learning systems is not necessarily an intrinsic property of neural networks themselves, but rather a consequence of optimization objectives that fail to encode relevant structural and functional requirements. Predictive accuracy alone does not determine whether a model is robust, fair, stable, or scientifically interpretable. To address this limitation, we introduce a unified variational framework in which robustness, fairness, and interpretability are formulated directly as functionals over the hypothesis space and incorporated into a single learning objective. This formulation extends classical risk minimization by integrating predictive performance together with structural constraints within a common functional-analytic framework. The main idea is to treat trustworthy behavior not as a secondary correction, but as an intrinsic component of the optimization problem itself. This perspective allows tools from variational analysis, functional analysis, and multi-objective optimization to be applied systematically to the study of reliable machine learning systems.

A central motivation for this framework is the need to analyze trade-offs between competing objectives in a mathematically coherent manner. In particular, predictive accuracy and fairness are known to satisfy incompatibility relations in many settings, where multiple fairness criteria cannot generally be achieved simultaneously without affecting predictive performance. The proposed formulation provides a natural setting in which such interactions can be characterized variationally.

The goal of this work is therefore not to propose a new optimization algorithm, but rather to establish a unified variational formulation for reliable machine learning. In contrast to modular or post hoc approaches, the proposed framework incorporates robustness, fairness, and interpretability directly into the learning objective. This enables a systematic analysis of trustworthy machine learning within a single mathematical framework.

The main contributions of this paper are as follows:

(Section 3) We introduce a unified variational formulation of machine learning in which predictive risk, structural regularization, robustness, fairness, and interpretability are integrated into a single-objective functional.
(Section 4) We show how several classical paradigms, including regularized learning, kernel methods, robust optimization, fairness-aware learning, sparse coding, and physics-informed learning, arise as special cases of the proposed framework.
(Section 5) We formalize robustness and fairness as structural functionals over hypothesis spaces and analyze the trade-offs that arise between predictive accuracy and structural constraints.
(Section 6) We introduce a multi-criterion interpretability functional combining simplicity, information relevance, and stability of explanations, including a discussion of finite-dimensional and RKHS-compatible complexity measures.
(Section 7) We study theoretical consequences of the unified framework, including existence of minimizers, Pareto-optimality, stability, robustness, and generalization properties under suitable assumptions.
(Section 8) We discuss computational and practical instantiations of the framework, showing how modern methodologies such as adversarial training, fairness-aware optimization, sparse coding, kernel methods, and physics-informed neural networks can be interpreted within a common variational perspective.

Finally, Section 9 discusses computational limitations, optimization challenges, and open research directions associated with the proposed framework, including scalability in highly nonconvex settings and connections with emerging large-scale machine learning systems.

2. Related Work

Modern machine learning theory already contains many of the mathematical ingredients required for trustworthy learning, including variational optimization, regularization theory, robustness analysis, fairness constraints, interpretability objectives, and multi-objective optimization. However, these components are typically developed within distinct mathematical frameworks. As a consequence, robustness, fairness, interpretability, and predictive risk are often treated as separate optimization objectives whose interactions remain difficult to analyze systematically. Comparatively few works attempt to formulate these structural requirements within a single variational principle.

A central feature of classical statistical learning theory is the formulation of learning problems as optimization over function spaces. In particular, predictors are commonly studied in Hilbert or Banach spaces, including reproducing kernel Hilbert spaces (RKHSs), Sobolev spaces, and related functional spaces that provide suitable geometric and topological structure for optimization and generalization analysis [9,10]. Within this framework, learning objectives naturally appear as functionals over infinite-dimensional hypothesis spaces, making tools from variational analysis and functional analysis directly applicable.

Variational and functional-analytic approaches provide rigorous methods for studying optimization landscapes, regularization mechanisms, implicit bias, compactness properties, and stability of learning algorithms [11,12,13,14,15,16]. In particular, these methods are fundamental for analyzing existence and uniqueness of minimizers, lower semicontinuity of objective functionals, compactness of admissible sets, and stability or generalization guarantees under suitable assumptions. Classical regularization methods can also be interpreted variationally through structural penalty functionals such as norm-based regularization in Hilbert spaces [17]. Nevertheless, these formulations primarily address predictive risk and complexity control, without explicitly incorporating robustness, fairness, or interpretability as intrinsic structural components of the objective itself.

A similar structural limitation appears in robustness formulations. Distributionally robust optimization (DRO) and adversarial training both replace standard empirical risk minimization by worst-case optimization over structured perturbation sets. In adversarial robustness, predictors are optimized against local worst-case perturbations of the input data [18,19,20]. Distributionally robust optimization instead considers uncertainty sets of probability measures surrounding the empirical data distribution, typically defined through Wasserstein distances and optimal transport theory [21,22,23,24]. From a variational perspective, both approaches introduce robustness functionals that quantify stability under adversarial or distributional perturbations.

These methods provide formal guarantees related to worst-case risk control, stability, and robustness under suitable assumptions [25]. However, robustness is usually incorporated either as an external constraint or as a standalone optimization objective. Consequently, existing robustness formulations rarely analyze systematically how robustness interacts with fairness, interpretability, or other structural requirements within a unified variational framework.

The same fragmentation appears in fairness-aware learning. Existing approaches typically introduce fairness through statistical constraints or dependence penalties imposed on predictive distributions. Group-based criteria such as demographic parity and equalized odds constrain the distribution of predictions across sensitive groups [26,27], while information-theoretic formulations measure dependence between predictions and sensitive attributes through quantities such as mutual information [28]. From a variational viewpoint, these approaches can be interpreted as introducing fairness functionals over the hypothesis space that penalize discriminatory dependence.

Importantly, different fairness criteria encode distinct and often incompatible notions of equity. Impossibility results show that multiple fairness criteria cannot generally be satisfied simultaneously without affecting predictive performance [29,30]. These results suggest that fairness cannot usually be treated as an independent post hoc correction layered on top of predictive optimization. Instead, fairness introduces competing structural objectives that interact intrinsically with predictive risk. Existing approaches therefore commonly formulate fairness either as constrained optimization or as regularization within multi-objective settings [31]. Nevertheless, fairness formulations remain largely modular and are rarely integrated jointly with robustness and interpretability within a common functional-analytic framework [32,33,34].

Interpretability introduces a related class of structural objectives. Existing methods range from sparse and transparent models to post hoc explanation techniques [1]. Information-theoretic approaches such as the Information Bottleneck formalize interpretability through trade-offs between compression and predictive relevance [35,36,37], while recent work emphasizes the stability and robustness of explanations themselves [38]. These approaches again introduce additional structural functionals associated with simplicity, compression, relevance, or explanation stability. However, interpretability objectives are typically studied independently from robustness and fairness, often without a unified variational formulation capable of analyzing their interaction systematically [2,39].

More broadly, these difficulties reflect a common structural phenomenon. Existing approaches frequently introduce robustness, fairness, interpretability, or stability through auxiliary penalties, constraints, or independent optimization objectives added to standard empirical risk minimization. While these formulations successfully encode individual structural properties, they rarely provide a unified functional-analytic framework in which multiple structural objectives are incorporated simultaneously as components of a single variational principle.

This fragmentation becomes particularly visible in multi-objective optimization frameworks [40,41,42,43,44]. Such approaches provide a natural mathematical language for describing trade-offs between predictive accuracy and structural requirements. In particular, Pareto-efficient solutions characterize predictors for which no objective can be improved without simultaneously degrading at least one competing objective. From a variational perspective, scalarized objectives therefore provide a mechanism for selecting Pareto-optimal predictors within a multi-objective trade-off space.

However, most existing multi-objective approaches operate primarily at the optimization or algorithmic level, focusing on Pareto-front computation, scalarization strategies, or constrained optimization procedures. Comparatively less attention has been devoted to developing unified functional-analytic formulations that simultaneously integrate robustness, fairness, and interpretability as structural functionals within a single variational objective together with well-posedness guarantees related to existence, compactness, semicontinuity, and Pareto-optimality.

Table 1 summarizes these distinctions from the perspective of variational structure and functional integration.

In contrast to the prior literature, the framework proposed in this work formulates robustness, fairness, and interpretability directly as structural functionals over the hypothesis space and incorporates them into a single variational objective. The main contribution is therefore not merely the aggregation of multiple objectives, but the development of a unified functional-analytic formulation that:

Integrates multiple structural requirements within a single variational principle;
Is amenable to tools from variational analysis and the calculus of variations;
Provides well-posedness guarantees under suitable assumptions;
Establishes explicit connections between scalarized optimization and Pareto-optimality.

To the best of our knowledge, existing approaches rarely combine unified variational formulations, functional-analytic well-posedness guarantees, and explicit Pareto structure within a single framework.

3. A Unified Variational Framework

3.1. Unified Functional Formulation

Let

F

be a hypothesis space of measurable functions

f : X \to T

. We define the learning problem as the minimization of the functional

J_{D} (f) = R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f),

(1)

where:

$R_{D} (f) = E_{(X, Y) \sim D} [ℓ (f (X), Y)]$ is the expected risk;
$Ω (f)$ is a structural regularizer;
$Ψ_{i} (f)$ are functionals encoding robustness, fairness, or other constraints;
$I (f)$ is an interpretability score;
$λ > 0$ , $η_{i} \geq 0$ , and $τ \geq 0$ are trade-off parameters.

The interpretability term

I (f)

is incorporated directly into the objective as a reward functional. Larger values of

I (f)

correspond to predictors that are simpler, more stable, or more transparent according to the chosen interpretability criterion. In the unified objective, the coefficient

τ

controls the relative importance assigned to interpretability compared with predictive accuracy and other structural objectives.

Classical empirical risk minimization (ERM) seeks predictors that minimize only the expected prediction error

R_{D} (f)

. The proposed framework extends this paradigm by embedding additional structural objectives directly into the variational functional. The terms

Ω (f), Ψ_{i} (f), I (f)

allow the learning problem to simultaneously account for model complexity, robustness, fairness, physical consistency, or interpretability within a single optimization principle. In many modern machine learning applications, such structural properties cannot be treated as purely post hoc corrections. For example, robustness, fairness, and interpretability may fundamentally interact with predictive performance and with one another. Incorporating these requirements directly into the objective functional allows the resulting predictors to be analyzed through a unified variational and multi-objective framework, making the trade-offs between competing criteria mathematically explicit. The geometric interpretation of these competing objectives is illustrated in Figure 1, where the scalarized functional selects Pareto-optimal predictors within a multi-objective trade-off space. The dashed lines in Figure 1 schematically illustrate the effect of the trade-off parameters

(λ, η_{i}, τ)

on the optimization process. Varying these weights changes the relative importance assigned to predictive accuracy, robustness, fairness, complexity, and interpretability, thereby inducing the selection of different Pareto-optimal solutions along the Pareto frontier.

The hypothesis space

F

is assumed to consist of measurable predictors

f : X \to T

where

X

is the input space and

T

is the prediction space. Depending on the learning setting,

F

may be a finite-dimensional parameter space, a Banach space, a reproducing kernel Hilbert space (RKHS), or a class of neural network parametrizations. Throughout the paper, the functional-analytic structure imposed on

F

is chosen so that the relevant variational properties (e.g., lower semicontinuity, coercivity, compactness) are well-defined.

3.2. Well-Posedness and Basic Properties

We now collect basic properties of the unified objective. These results follow from standard arguments in the calculus of variations and multi-objective optimization.

We assume that

F

is endowed with a topology (e.g., a Banach or Hilbert space structure) under which the functionals are defined.

Proposition 1

(Well-posedness and basic properties). Let

(X, Y) \sim D

and let

F

be a hypothesis space of measurable functions

f : X \to T

. Consider the functional

J_{D} (f) = R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f) .

Assume:

(H1): Ω is coercive on $F$ ;
(H2): $R_{D}$ , Ω, and each $Ψ_{i}$ are lower semicontinuous;
(H3): the sublevel sets of $J_{D}$ are relatively compact;
(H4): I is upper semicontinuous.

Then:

(i): Existence. There exists $f^{†} \in F$ such that

$J_{D} (f^{†}) = \inf_{f \in F} J_{D} (f) .$
(ii): Trade-off inequality. Let

$f^{*} \in \arg \min_{f \in F} R_{D} (f) .$

Then,

$R_{D} (f^{†}) - R_{D} (f^{*}) \leq \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{*}) - τ I (f^{*}) .$
(iii): Pareto optimality. The minimizer $f^{†}$ is Pareto-optimal for the multi-objective problem

$\min_{f \in F} (R_{D} (f), Ψ_{1} (f), \dots, Ψ_{k} (f), - I (f)) .$

Proof.

See Appendix A. □

Remark 1.

The above properties follow from classical arguments in variational analysis. In particular, existence is a consequence of the direct method [11] of the calculus of variations, while Pareto optimality follows from standard scalarization principles in multi-objective optimization. Their role here is to show that the unified functional preserves well-posedness while incorporating multiple structural constraints.

Remark 2.

Typical examples of reflexive Banach spaces include

L^{p}

spaces for

1 < p < \infty

and Hilbert spaces such as reproducing kernel Hilbert spaces, which are used in Section 4.2.

Remark 3

(On the assumptions). The compactness assumption (H3) can be ensured in standard settings. For example, if

F

is a reflexive Banach space and Ω is coercive, then sublevel sets of

J_{D}

are relatively compact in the weak topology.

Proposition 1 provides a unified variational perspective on learning problems with multiple structural objectives. It shows that:

Predictors can be characterized as minimizers of a composite functional;
Structural constraints such as robustness, fairness, and interpretability can be incorporated without compromising well-posedness;
Trade-offs between predictive accuracy and additional constraints arise naturally from the objective;
The unified formulation induces Pareto-optimal solutions in the corresponding multi-objective space.

This perspective serves as a foundation for the analysis and examples developed in the subsequent sections.

3.3. Intuitive Interpretation: The Control Panel View

The unified objective (1) can be understood through a simple geometric and conceptual analogy. Rather than viewing learning as the optimization of a single quantity, the proposed formulation treats it as a multi-criteria control problemin which several competing objectives must be balanced simultaneously.

From single-objective to multi-objective learning.

Classical empirical risk minimization focuses primarily on predictive accuracy, as measured by the risk

R_{D} (f)

. In this setting, the learning problem can be interpreted as minimizing a single axis: the prediction error.

However, in many real-world applications, additional requirements are essential. Robustness, fairness, and interpretability impose structural constraints that cannot, in general, be satisfied simultaneously without affecting predictive performance. The unified objective makes these requirements explicit by introducing additional terms that quantify deviations from these desired properties.

A control panel of competing objectives. Each component of the functional plays a distinct role:

$R_{D} (f)$ measures predictive accuracy: how well the model fits the data.
$Ω (f)$ controls model complexity: how simple or regular the predictor is.
$Ψ_{i} (f)$ quantify violations of structural constraints, such as lack of robustness or fairness.
$I (f)$ measures interpretability: how understandable or stable the model is.

These terms can be viewed as defining a control panel with multiple dials. The parameters

(λ, η_{i}, τ)

determine how much weight is assigned to each objective, and therefore how much predictive accuracy one is willing to trade in order to enforce structural properties.

Trade-offs and the Pareto frontier. From a geometric perspective, each predictor

f \in F

corresponds to a point in a multi-dimensional space whose coordinates are given by

(R_{D} (f), Ψ_{1} (f), \dots, Ψ_{k} (f), - I (f)) .

In this space, it is generally impossible to simultaneously minimize all coordinates. Improving one objective (e.g., fairness) may worsen another (e.g., accuracy). As a result, optimal solutions lie on the Pareto frontier. A predictor is said to be Pareto-optimal if no other predictor can improve one objective without worsening at least one of the remaining objectives. The collection of all such predictors forms the Pareto frontier. The scalarized objective (1) selects a particular point on this frontier by assigning weights to each component. Different choices of

(λ, η_{i}, τ)

correspond to different trade-offs and lead to different Pareto-optimal solutions.

Why trade-offs are unavoidable. The proposed framework emphasizes that trade-offs are intrinsic to multi-objective learning problems rather than artifacts of particular algorithms. Structural objectives often compete directly with predictive accuracy. For example, enforcing fairness constraints may reduce the use of highly predictive but sensitive features, while robustness requirements may limit highly specialized decision boundaries. Similarly, interpretability and simplicity constraints can restrict the expressive complexity of admissible predictors. Consequently, no single predictor can generally optimize all criteria simultaneously, making Pareto trade-offs unavoidable within the learning process itself.

Summary. The unified variational principle can thus be interpreted as a mechanism for navigating a space of competing objectives. Instead of searching for a single notion of optimality, it provides a structured way to explore and control trade-offs between accuracy, robustness, fairness, and interpretability. This perspective complements the formal results of Proposition 1 by providing an intuitive understanding of why Pareto-optimal solutions arise naturally in the proposed framework.

4. Reinterpreting Existing Paradigms

The unified functional formulation introduced in Section 3 provides a common variational perspective that encompasses a wide range of machine learning methodologies. In this section, we demonstrate how several established paradigms arise as special cases of the proposed framework. We now illustrate how classical methods fit within Proposition 1.

Regularized Learning. Classical statistical learning is typically formulated as empirical or expected risk minimization with a structural penalty:

\min_{f \in F} R_{D} (f) + λ Ω (f) .

(2)

This corresponds to the proposed framework with

Ψ_{i} (f) = 0

for all i. Common choices of

Ω (f)

include:

$ℓ_{2}$ regularization (ridge regression): $Ω (f) = {∥ w ∥}_{2}^{2}$ ;
$ℓ_{1}$ regularization (lasso): $Ω (f) = {∥ w ∥}_{1}$ ;
RKHS norms in kernel methods: $Ω (f) = {∥ f ∥}_{H}^{2}$ .

These regularizers control model complexity and are closely tied to generalization guarantees via capacity measures. From a variational perspective, these methods differ primarily in the choice of hypothesis space, loss functional, and structural penalties. The proposed framework therefore provides a common functional language for regularization, probabilistic inference, and structural constraints.

The connection between Bayesian inference and regularization is particularly well known [45,46]. In Bayesian learning, the maximum a posteriori (MAP) estimator combines a likelihood term with a prior distribution over the model parameters. Taking the negative logarithm of the posterior transforms Bayesian inference into a variational optimization problem consisting of a data-fitting term together with a regularization functional induced by the prior distribution. In particular, Gaussian priors on the parameters lead to quadratic

ℓ_{2}

regularization penalties, while Laplace priors induce sparsity-promoting

ℓ_{1}

regularization terms.

Bayesian Inference. In Bayesian learning, the maximum a posteriori (MAP) estimator is defined as

f^{*} = \arg \max_{f} p (f ∣ D) = \arg \min_{f} (- \log p (D ∣ f) - \log p (f)) .

(3)

Identifying

- \log p (D ∣ f)

with the empirical risk and

- \log p (f)

with a regularizer, we obtain

J_{D} (f) = R_{D} (f) + Ω (f),

(4)

showing that Bayesian inference is equivalent to regularized risk minimization. For example:

A Gaussian prior on parameters induces $ℓ_{2}$ regularization,
A Laplace prior induces $ℓ_{1}$ regularization.

Physics-Informed Learning. Physics-informed machine learning incorporates prior knowledge in the form of physical laws, typically expressed as partial differential equations (PDEs). Let

N [f] (x) = 0

denote a differential operator encoding the governing equation. This constraint can be incorporated as a functional:

Ψ_{phys} (f) = E_{x \sim D} [{∥ N [f] (x) ∥}^{2}] .

(5)

The resulting objective,

J_{D} (f) = R_{D} (f) + η_{phys} Ψ_{phys} (f)

(6)

enforces consistency with known physical principles. This formulation is widely used in physics-informed neural networks (PINNs), where f is parameterized by a deep neural network. Recent developments [47,48] further illustrate the practical importance of incorporating physical constraints directly into learning objectives. In industrial and engineering applications, PINN-type formulations are increasingly used to prevent non-physical extrapolations when training data are sparse or only partially observed. For example, in aerodynamic and thermodynamic modeling [49,50,51], additional functional penalties can enforce physical consistency conditions such as similarity mappings, conservation laws, or surge boundary constraints. Incorporating these constraints directly into the loss function ensures that the learned predictor remains within physically admissible operating regimes, even in regions where observational data are limited. From the perspective of the present framework, such constraints naturally appear as structural functionals

Ψ_{phys} (f)

integrated into the variational objective.

Robust Optimization. Distributionally robust optimization (DRO) can be expressed as

\min_{f} \sup_{D^{'} \in U (D)} R_{D^{'}} (f),

(7)

where

U (D)

is an uncertainty set (e.g., a Wasserstein ball). This is equivalent to minimizing a robustness functional:

Ψ_{rob} (f) = \sup_{D^{'} \in U (D)} |R_{D^{'}} (f) - R_{D} (f)| .

(8)

Similarly, adversarial training corresponds to penalizing worst-case perturbations at the input level:

Ψ_{adv} (f) = E_{x \sim D} [\sup_{∥ δ ∥ \leq ϵ} ℓ (f (x + δ), y)] .

(9)

Fair Representation Learning. Fairness-aware learning can be incorporated by introducing dependence penalties between predictions and protected attributes. For example,

Ψ_{fair} (f) = I (f (X); A),

(10)

or, alternatively, using kernel-based independence measures such as HSIC. This yields

J_{D} (f) = R_{D} (f) + η_{fair} Ψ_{fair} (f),

(11)

which enforces statistical independence constraints during training.

Within the unified variational framework, fairness is incorporated by penalizing statistical dependence between model predictions and sensitive attributes. In the mutual-information formulation, the functional

Ψ_{fair} (f) = I (f (X); A)

measures the amount of information that the predictions retain about a protected attribute A. Minimizing this quantity encourages statistical independence between predictions and sensitive variables, thereby reducing discriminatory dependence within the learned representation. From the variational perspective, fairness constraints therefore appear naturally as structural penalties integrated directly into the objective functional rather than as external post hoc corrections.

Deep Learning Heuristics. Several widely used techniques in deep learning can be interpreted within this framework:

Weight decay corresponds to $Ω (f) = {∥ w ∥}_{2}^{2}$ ;
Dropout can be viewed as a stochastic regularization that approximates an ensemble of subnetworks;
Batch normalization implicitly controls the geometry of the optimization landscape;
Early stopping acts as an implicit regularizer by restricting effective model complexity.

Although often introduced heuristically, these techniques can be interpreted as modifying the effective regularization or functional constraints in

J_{D} (f)

.

From a variational perspective, these heuristics modify the effective geometry of the optimization problem through explicit or implicit regularization. Weight decay induces Tikhonov-type penalties, dropout introduces stochastic regularization, and early stopping restricts effective model complexity through optimization dynamics.

Summary. These examples show that many learning paradigms can be interpreted as variational problems differing primarily in their structural functionals. The proposed framework extends this perspective by incorporating robustness, fairness, and interpretability within a single objective functional. Table 2 summarizes several representative paradigms and illustrates how they arise as particular instances of the unified variational formulation through appropriate choices of hypothesis spaces, loss functions, and structural functionals.

4.1. Comparison of Paradigms Within the Unified Framework

Unified template. Let

(X, B_{X})

be a measurable space and

(Y, B_{Y})

a standard Borel space (e.g.,

Y

finite or

R^{k}

with the Borel

σ

-algebra). Let

D

be a probability measure on

(X \times Y, B_{X} \otimes B_{Y})

, and let

(X, Y) \sim D

.

Let

(T, B_{T})

be an output measurable space (typically

T = R

or

R^{k}

). Define the hypothesis space

F \subseteq \{f : X \to T : f is B_{X} / B_{T} - measurable\} .

Let

ℓ : T \times Y \to [0, \infty]

be measurable and assume

ℓ (f (X), Y)

is integrable for

f \in F

. The (population) risk is

R_{D} (f) : = E_{(X, Y) \sim D} [ℓ (f (X), Y)] .

Let

Ω : F \to [0, \infty]

be a regularizer and let

Ψ_{i} : F \to [0, \infty]

be structural functionals (robustness, fairness, physics constraints, etc.), all assumed measurable and finite on the admissible class.

The unified variational objective is

J_{D} (f) : = R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{m} η_{i} Ψ_{i} (f) - τ I (f), λ, η_{i} \geq 0,

(12)

and learning corresponds to minimizing

J_{D}

over

F

(or an empirical approximation thereof).

Proposition 2

(Dictionary learning / sparse coding as an instance of (12)). Let

X \subset R^{d}

and let

D

be a constraint set of dictionaries, e.g.,

D : = {D \in R^{d \times k} : ∥ d_{j} ∥_{2} \leq 1 \forall j}

where

d_{j}

denotes the j-th column of

D

. Consider the hypothesis space of pairs

F : = {(D, w) : D \in D, w \in R^{k}},

and define a reconstruction model

x \approx D w

. Let the loss be

ℓ ((D, w), x) = {∥ x - D w ∥}_{2}^{2}

and let the regularizer be

Ω (D, w) = {∥ w ∥}_{1}

. Then minimizing

J (D, w) = E_{X \sim D_{X}} [{∥ X - D w ∥}_{2}^{2}] + λ {∥ w ∥}_{1}

recovers the population sparse coding objective (and its empirical version is the standard dictionary learning/sparse coding problem). If one optimizes over w for each sample and alternates with updates of

D

, one obtains the classical alternating-minimization dictionary learning algorithms.

Proof.

See Appendix B. □

Corollary 1

(Weight decay as Tikhonov regularization). Let

F = {f (\cdot; θ) : θ \in R^{p}}

be a neural network class and let

R_{D} (f)

be the expected task loss. If

Ω (f)

is chosen as

Ω (f) : = {∥ θ ∥}_{2}^{2}

, then minimizing

J_{D} (f) = R_{D} (f) + λ {∥ θ ∥}_{2}^{2}

is exactly the population objective underlying

ℓ_{2}

weight decay (and its empirical analogue is the standard training objective with weight decay).

Then dropout training can be interpreted as minimizing

J_{D} (f) = R_{D}^{drop} (f),

i.e., an instance of (12) with an additional expectation over the stochasticity.

Corollary 2

(Early stopping as implicit regularization (template-level statement)). Consider an iterative optimization method producing parameters

θ_{t}

for minimizing the empirical analogue of

R_{D} (f)

. Stopping at a finite time

t = t^{★}

defines a constrained/regularized solution map

f_{t^{★}}

. In this sense, early stopping can be viewed as selecting an approximate minimizer of (12) with an implicit regularization determined by the optimization dynamics (e.g., algorithmic stability or norm control along the trajectory); hence, it fits the unified functional perspective at the level of the induced solution operator.

Functional Equivalence Principle. The above constructions show that diverse machine learning paradigms can be interpreted as instances of a single variational principle, differing primarily in the choice of hypothesis space and structural functionals rather than in their underlying optimization structure.

In this work, we provide a unified variational formulation that integrates simultaneously robustness, fairness, and interpretability as unified functionals within a single variational learning principle with theoretical guarantees.

4.2. A Fully Rigorous Instance in a Reproducing Kernel Hilbert Space

We now present a concrete instance of the unified variational framework in a reproducing kernel Hilbert space (RKHS) [10,17], showing that the abstract assumptions of Section 3 are satisfied in a standard functional-analytic setting.

Let

X

be a measurable space and let

K : X \times X \to R

be a measurable, positive definite kernel. Denote by

H_{K}

the associated RKHS, equipped with norm

{∥ \cdot ∥}_{H_{K}}

. We take

F = H_{K} .

Assume that

(A1): The kernel K is bounded, i.e., $\sup_{x \in X} K (x, x) \leq C_{K} < \infty$ ;
(A2): The loss $ℓ : T \times Y \to R_{+}$ is convex and Lipschitz in its first argument;
(A3): The output space $Y$ is finite.

We define the components of the unified objective as follows:

The risk:

$R_{D} (f) = E_{(X, Y) \sim D} [ℓ (f (X), Y)] .$
The regularizer:

$Ω (f) = {∥ f ∥}_{H_{K}}^{2} .$
A representative structural functional (e.g., fairness):

$Ψ_{fair} (f) = I (f (X); A),$

where A denotes a protected attribute.
In the RKHS example, the interpretability score is understood with the Hilbert-compatible simplicity term

$S_{H} (f) = - {∥ f ∥}_{H_{K}}^{2},$

rather than the finite-dimensional sparsity score $- {∥ w ∥}_{1}$ . Alternatively, if a finite kernel dictionary is fixed, one may use the dictionary-based score $S_{dict} (f) = - {∥ a ∥}_{1}$ for representations of the form $f = \sum_{j = 1}^{m} a_{j} K (\cdot, x_{j})$ .

We now verify that the assumptions of Proposition 1 hold.

Coercivity. Since

Ω (f) = {∥ f ∥}_{H_{K}}^{2}

, we have

Ω (f) \to \infty

as

{∥ f ∥}_{H_{K}} \to \infty

; hence,

Ω

is coercive on

H_{K}

.

Lower semicontinuity. The RKHS

H_{K}

is a Hilbert space. Under assumption (A1), point evaluations

f \mapsto f (x)

are continuous. Combined with the Lipschitz continuity of ℓ, this implies that

R_{D}

is continuous (hence lower semicontinuous) with respect to the norm topology.

Similarly,

Ω

is continuous, and standard choices of

Ψ_{i}

(under appropriate assumptions ensuring finiteness and continuity) are lower semicontinuous.

Compactness of sublevel sets. Since

H_{K}

is a Hilbert space, closed and bounded subsets are weakly compact. The coercivity of

Ω

implies that sublevel sets of

J_{D}

are bounded in

H_{K}

, hence relatively compact in the weak topology.

Upper semicontinuity of

I (f)

. Under the assumptions of Section 6, the interpretability score

I (f)

is finite and upper semicontinuous.

Therefore, all assumptions of Proposition 1 are satisfied, and the unified objective admits a minimizer in

H_{K}

.

Remark 4.

This example demonstrates that the abstract variational framework applies naturally within a classical and widely used functional-analytic setting in machine learning. In particular, it shows that additional structural objectives such as fairness and interpretability can be incorporated directly into the learning functional while preserving key variational properties including coercivity, lower semicontinuity, compactness of sublevel sets, and existence of minimizers. Consequently, the integration of structural constraints does not destroy the well-posedness of the underlying optimization problem under the assumptions considered here.

5. Robustness and Fairness as Structural Functionals

In this section, we define concrete instances of the structural functionals

Ψ_{i} (f)

appearing in Proposition 1. These functionals encode robustness to perturbations and fairness constraints, and play a central role in shaping the trade-offs of the unified variational formulation.

5.1. Robustness Functionals

Robustness characterizes the stability of predictions under perturbations of the input or the data distribution.

In the context of machine learning, robustness measures the extent to which a predictor remains stable under perturbations, uncertainty, or shifts in the data-generating process. A robust predictor should produce consistent outputs not only for nominal inputs, but also under small adversarial perturbations, measurement noise, or moderate distributional changes. From a variational perspective, robustness functionals quantify deviations from stability and therefore act as structural penalties controlling the sensitivity of the learned predictor.

Let

∥ \cdot ∥

be a norm on X. For

ε > 0

, define the local robustness functional

Ψ_{rob}^{loc} (f) = E_{X \sim D} [\sup_{∥ δ ∥ \leq ε} ∥ f (X + δ) - f (X) ∥] .

(13)

This functional measures the worst-case sensitivity of the predictor in a neighborhood of each input.

Let

W_{p}

denote the Wasserstein distance of order p. For

ρ > 0

, define

Ψ_{rob}^{dist} (f) = \sup_{D^{'} : W_{p} (D, D^{'}) \leq ρ} |R_{D^{'}} (f) - R_{D} (f)| .

(14)

This functional quantifies the sensitivity of the risk under distributional shifts.

These two notions capture complementary aspects of robustness: local stability at the input level and global stability at the distributional level.

Fairness constraints aim to control statistical dependence between predictions and protected attributes.

Let A denote a protected attribute. We define the fairness functional

Ψ_{fair}^{DP} (f) = I (f (X); A),

(15)

where

I (\cdot; \cdot)

denotes mutual information.

This functional penalizes statistical dependence between predictions and the protected attribute.

Let Y denote the target variable. We define

Ψ_{fair}^{EO} (f) = \sum_{y \in Y} |E [f (X) ∣ A = a, Y = y] - E [f (X) ∣ A = b, Y = y]| .

(16)

This functional enforces conditional independence given the target.

5.2. A Fundamental Trade-Off: Fairness vs. Accuracy

We now formalize a structural incompatibility between fairness and predictive accuracy that arises naturally within the unified framework.

Theorem 1

(Fairness–accuracy trade-off). Assume that

$Y ⊥ ⊥ A ∣ X$ ;
the hypothesis class $F$ is sufficiently rich to approximate the Bayes optimal predictor.

Then, there exists a constant

c > 0

such that

\inf_{f : f (X) ⊥ A} R_{D} (f) \geq \inf_{f \in F} R_{D} (f) + c .

(17)

This result is related to known impossibility theorems in fairness [29,30] and can be derived under similar informational assumptions.

The underlying mechanism behind this trade-off is informational. When the protected attribute contains predictive information correlated with the target variable, imposing independence constraints necessarily restricts the amount of predictive information available to the model. As a consequence, enforcing fairness constraints may prevent the predictor from attaining the Bayes-optimal risk. This phenomenon illustrates that fairness constraints do not simply act as external ethical corrections, but fundamentally modify the statistical structure of the learning problem itself.

Interpretation. When the protected attribute carries predictive information about the target, enforcing independence between predictions and the attribute induces an irreducible loss in accuracy. This phenomenon is not an artifact of specific algorithms, but a structural property of the learning problem.

5.3. Discussion

The functionals introduced in this section provide concrete instantiations of the abstract terms

Ψ_{i} (f)

in Proposition 1. Their inclusion in the unified objective leads to:

Explicit control of robustness under adversarial perturbations and distributional shifts;
Formal incorporation of fairness constraints through statistical dependence penalties;
Systematic characterization of trade-offs between predictive accuracy and structural requirements;
A unified variational interpretation of robustness and fairness as intrinsic components of the learning objective.

More broadly, the unified framework highlights that robustness and fairness are not independent add-on properties, but interacting structural objectives that directly influence the geometry of the optimization problem. Increasing robustness may restrict model flexibility, while enforcing fairness constraints may reduce access to predictive information correlated with protected attributes. The resulting trade-offs therefore arise intrinsically from the variational structure of the learning problem rather than from specific algorithmic choices.

6. Interpretability as a Variational Functional

6.1. Axiomatic Setup and Notation

Let

(Ω, G, P)

be a probability space. Let

(X, B_{X})

be a measurable space, and let

(Y, B_{Y})

be either a finite set with the power

σ

-algebra or a standard Borel space. Let

(X, Y) : Ω \to X \times Y

be a random pair with law D.

We consider predictors

f : X \to T

, where

(T, ∥ \cdot ∥_{T})

is a normed vector space (e.g.,

R

or

R^{k}

), and the hypothesis class

F

is a set of

B_{X} / B_{T}

-measurable maps.

We model interpretability via a functional

I : F \to R,

where larger values of

I (f)

correspond to more interpretable predictors.

We impose the following qualitative desiderata:

A1 (Simplicity). $I (f)$ should be larger for models of lower effective complexity.
A2 (Relevance). $I (f)$ should reward predictors that preserve information relevant to the target variable Y.
A3 (Stability of explanations). $I (f)$ should be larger for models whose explanations are stable under small perturbations of the input.

6.2. Definition of the Interpretability Score

To reflect A1–A3, we define an interpretability score,

I (f) = α S (f) + β M (f) + γ T_{\exp} (f),

(18)

where

α, β, γ \geq 0

.

(i) Simplicity score.

The form of the simplicity score depends on the structure of the hypothesis space.

In finite-dimensional parametric models, where

f (\cdot) = f (\cdot; w), w \in R^{d},

we may define

S (f) : = - {∥ w ∥}_{1} .

This choice promotes sparse parameter representations and is appropriate when the parametrization is fixed.

In contrast, in an abstract RKHS

H_{K}

, there is in general no canonical finite-dimensional coefficient vector w. Hence an

ℓ^{1}

penalty on parameters is not intrinsically defined unless a finite dictionary or basis has been specified. In the RKHS setting, a natural Hilbert-compatible simplicity score is instead

S_{H} (f) : = - {∥ f ∥}_{H_{K}}^{2} .

This measures functional complexity directly through the RKHS norm and is compatible with the variational assumptions used in Section 4.2.

Remark 5

(On sparsity in RKHS settings). The sparsity-based score

S (f) = - {∥ w ∥}_{1}

should be understood as a finite-dimensional or dictionary-based interpretability measure. In an RKHS, such a score is rigorous only after choosing a representation,

f = \sum_{j = 1}^{m} a_{j} K (\cdot, x_{j}),

for a fixed finite dictionary

{x_{1}, \dots, x_{m}}

, in which case one may define

S_{dict} (f) : = - {∥ a ∥}_{1} .

Without such a finite representation, the canonical complexity measure is the Hilbert norm

{∥ f ∥}_{H_{K}}

rather than an

ℓ^{1}

norm of parameters.

(ii) Relevance score. We make the representation structure explicit by writing

f = g \circ ϕ,

where

ϕ : X \to Z

is measurable with

(Z, B_{Z})

and

g : Z \to T

is measurable. Define

Z : = ϕ (X)

and

M (f) : = I (Z; Y),

(19)

the mutual information between Z and Y. We view

I (f)

as a functional of f under a fixed data distribution D.

We assume throughout that

I (Z; Y) < \infty

, which holds, for example, when

Y

is finite or under suitable regularity conditions on the joint distribution.

(iii) Stability score. Let

E_{f} : X \to R^{m}

be an explanation map, assumed

B_{X} / B (R^{m})

measurable. Typical examples include gradient-based attributions or local surrogate explanations.

We treat

E_{f}

as a given operator associated with the predictor f, without specifying its construction, as its precise form depends on the chosen explanation method.

Let

Δ

be a random perturbation defined on

(Ω, G, P)

, taking values in

X

, such that

(X, Δ)

is jointly measurable.

We define

T_{\exp} (f) : = - E [∥ E_{f} (X + Δ) - E_{f} {(X) ∥}_{2}],

(20)

whenever the expectation is finite. This term penalizes variability in explanations under input perturbations.

6.3. Well-Posedness Considerations

We briefly discuss conditions ensuring that the interpretability score is well-defined.

Lemma 1

(Basic well-posedness properties). Assume the setup above.

(a): If $Y$ is finite, then

$M (f) = I (Z; Y)$

is finite and satisfies

$0 \leq I (Z; Y) \leq H (Y) .$
(b): If $E_{f}$ is measurable and

$E [∥ E_{f} (X + Δ) - E_{f} {(X) ∥}_{2}] < \infty,$

then

$T_{\exp} (f)$

is well-defined, finite, and satisfies

$T_{\exp} (f) \leq 0 .$
(c): If

$S (f) \in R, M (f) = I (Z; Y) < \infty,$

and

$T_{\exp} (f) > - \infty,$

then the interpretability score

$I (f) = α S (f) + β M (f) + γ T_{\exp} (f)$

is finite.

Proof.

See Appendix C. □

Remark 6.

The above conditions are satisfied in many standard settings. For example, when

Y

is finite and

E_{f}

is constructed via continuous transformations of f, the interpretability score is well-defined.

6.4. Integration into the Unified Objective

Since

I (f)

is a score (larger is better), it is incorporated into the unified objective by subtraction:

\min_{f \in F} R_{D} (f) + λ Ω (f) + \sum_{i} η_{i} Ψ_{i} (f) - τ I (f),

(21)

with

τ \geq 0

.

The interpretability functional appears with a negative sign because the unified objective is formulated as a minimization problem, whereas larger values of

I (f)

correspond to more interpretable predictors. Subtracting

τ I (f)

therefore rewards predictors with higher interpretability while preserving the variational minimization structure of the framework. In this sense, interpretability acts as a utility-type structural objective competing with prediction error, robustness penalties, and fairness constraints.

6.5. Multi-Objective Interpretation

The scalarization (18) corresponds to selecting a direction in a multi-objective space. An equivalent Pareto formulation is

\min_{f \in F} (R_{D} (f), Ψ_{rob} (f), Ψ_{fair} (f), - I (f)),

(22)

which characterizes the competing objectives governing trustworthy machine learning systems. In the Pareto formulation, a predictor is considered Pareto-efficient if no objective can be improved without simultaneously degrading at least one other objective. Consequently, improving predictive accuracy may require sacrificing robustness or fairness, while increasing interpretability may constrain model complexity or expressive power.

This perspective makes explicit that robustness, fairness, and interpretability are not auxiliary post hoc properties, but intrinsic structural objectives interacting directly within the variational optimization problem. The Pareto formulation therefore provides a principled mathematical framework for analyzing the trade-offs and incompatibilities that arise between competing desiderata in modern machine learning systems.

7. Refinements and Consequences of the Unified Variational Principle

In this section, we discuss several consequences of Proposition 1. The results illustrate how the unified variational formulation interacts with classical notions such as stability, generalization, and structural trade-offs.

7.1. Uniform Stability of Empirical Minimizers

Let

S = (Z_{1}, \dots, Z_{n}) \sim D^{n}

, where

Z_{i} = (X_{i}, Y_{i})

, and define the empirical risk

R_{S} (f) = \frac{1}{n} \sum_{i = 1}^{n} ℓ (f (X_{i}), Y_{i}) .

(23)

The empirical counterpart of the unified objective is

J_{S} (f) = R_{S} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f) .

(24)

It follows from standard results that classical results in statistical learning theory relate uniform stability of empirical minimization to generalization performance.

Proposition 3

(Uniform stability in convex settings). Assume that

F

is equipped with a norm

∥ \cdot ∥

such that

The loss function satisfies

$| ℓ (f, z) - ℓ (g, z) | \leq L ∥ f - g ∥$

for all $f, g \in F$ and all sample points $z = (x, y)$ ;
Ω is μ-strongly convex and $λ > 0$ ;
Each $Ψ_{i}$ is convex;
The functional $- I$ is convex.

Let

{\hat{f}}_{S} \in \arg \min_{f \in F} J_{S} (f),

where

J_{S} (f) = \frac{1}{n} \sum_{j = 1}^{n} ℓ (f (x_{j}), y_{j}) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f) .

Then the learning algorithm

S \mapsto {\hat{f}}_{S}

is uniformly stable with stability parameter

β_{n} \leq \frac{2 L^{2}}{λ μ n} .

In particular,

β_{n} = O (\frac{1}{n}) .

Proof.

See Appendix D. □

Uniform stability quantifies the sensitivity of the learning algorithm to perturbations of the training dataset. In particular, a stability bound of order

O (1 / n)

implies that replacing a single training sample produces only a small change in the learned predictor and its associated risk. Consequently, the empirical risk becomes a reliable approximation of the population risk, leading to generalization guarantees for the learning procedure.

Remark 7.

This result applies to convex instantiations of the framework. In many practical settings (e.g., deep learning or mutual information-based functionals), the objective is nonconvex, and extending stability guarantees to such cases remains an open problem.

7.2. Implications for Generalization

Under the assumptions above, uniform stability implies generalization bounds. In particular, it follows from classical results [52] that

E [R_{D} ({\hat{f}}_{S}) - R_{S} ({\hat{f}}_{S})] = O (\frac{1}{n}) .

Remark 8.

This shows that, in convex settings, the addition of structural functionals

Ψ_{i}

and I does not change the qualitative generalization rate, but rather affects the location of the minimizer.

7.3. Refined Trade-Off Inequality

We restate the trade-off relation from Proposition 1.

Proposition 4

(Trade-off interpretation). Let

f^{†}

be a minimizer of the unified objective

J_{D} (f) = R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f),

and let

f^{*} \in \arg \min_{f \in F} R_{D} (f) .

Then,

\begin{matrix} R_{D} (f^{†}) - R_{D} (f^{*}) \leq & λ (Ω (f^{*}) - Ω (f^{†})) \\ + \sum_{i = 1}^{k} η_{i} (Ψ_{i} (f^{*}) - Ψ_{i} (f^{†})) \\ - τ (I (f^{*}) - I (f^{†})) . \end{matrix}

In particular, if the structural terms are normalized so that

Ω (f^{†}) \geq Ω (f^{*}), Ψ_{i} (f^{†}) \geq 0 for all i, I (f^{†}) \geq 0,

and then,

R_{D} (f^{†}) \leq R_{D} (f^{*}) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{*}) - τ I (f^{*}) .

Proof.

See Appendix E. □

Remark 9.

This inequality quantifies the trade-off between predictive accuracy and structural objectives within the unified variational framework. More precisely, it shows that the excess prediction risk incurred by the structurally constrained solution is controlled by the extent to which the unconstrained risk minimizer violates robustness, fairness, regularization, or interpretability requirements. Consequently, improving structural properties may require sacrificing predictive optimality, reflecting an intrinsic tension between competing objectives in trustworthy machine learning systems.

7.4. Bias–Variance Interpretation

The unified formulation induces a natural bias–variance perspective.

Proposition 5

(Bias induced by structural constraints). Let

{\hat{f}}_{S}

be an empirical minimizer of

J_{S} (f) = R_{S} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f) .

Assume the hypotheses of Proposition 3. Then there exists a constant

C > 0

, independent of n, such that

E [R_{D} ({\hat{f}}_{S})] \leq \inf_{f \in F} \{R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f)\} + \frac{C}{n} .

In particular, the structural terms

λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f)

act as bias-inducing terms: they restrict the effective class of admissible solutions, while uniform stability contributes a variance/generalization term of order

O (1 / n)

.

Proof.

See Appendix F. □

Remark 10.

This decomposition shows that structural penalties act as bias-inducing terms in the classical statistical sense: by favoring predictors satisfying robustness, fairness, smoothness, or interpretability requirements, the optimization problem is restricted to a smaller effective class of admissible solutions. As a consequence, the learned predictor may deviate from the unconstrained empirical risk minimizer, thereby introducing bias. At the same time, these structural constraints can improve stability and control variance, while preserving the qualitative generalization rate under the assumptions considered here.

7.5. Robustness and Regularity

We now relate robustness to classical smoothness properties.

Proposition 6

(Robustness and Lipschitz continuity). Let

(X, ∥ \cdot ∥)

and

(T, ∥ \cdot ∥_{T})

be normed spaces, and assume that

f : X \to T

is

L_{f}

-Lipschitz; that is,

{∥ f (x) - f (y) ∥}_{T} \leq L_{f} ∥ x - y ∥ for all x, y \in X .

Define the local robustness functional by

Ψ_{rob}^{loc} (f) = E_{X} [\sup_{∥ δ ∥ \leq ε} {∥ f (X + δ) - f (X) ∥}_{T}] .

Then,

Ψ_{rob}^{loc} (f) \leq L_{f} ε .

In particular,

Ψ_{rob}^{loc} (f) = O (L_{f} ε) .

Proof.

See Appendix G. □

Remark 11.

This result establishes a direct connection between robustness and classical smoothness properties of predictors. In particular, the proposition shows that if a predictor is Lipschitz continuous, then its local robustness functional is automatically controlled by the Lipschitz constant. Consequently, smoother predictors exhibit greater stability under adversarial perturbations. From the variational perspective, robustness can therefore be interpreted as a form of regularity control closely related to geometric smoothness of the learned function.

7.6. Summary

The discussion above highlights several theoretical consequences of the unified variational framework:

Recovery of classical stability and generalization guarantees under suitable convexity assumptions;
Explicit quantitative characterization of trade-offs between predictive accuracy and structural objectives such as robustness, fairness, and interpretability;
A bias–variance interpretation in which structural penalties act as bias-inducing regularization mechanisms while preserving qualitative statistical rates;
A direct connection between robustness and regularity properties through Lipschitz continuity and smoothness estimates;
A unified functional-analytic perspective linking optimization, stability, structural constraints, and generalization behavior within a common variational formulation.

Together, these results support the view that robustness, fairness, and interpretability can be analyzed systematically as intrinsic structural components of the learning objective rather than as isolated post hoc corrections.

8. Computational Perspective and Practical Instantiations

The unified variational framework developed in this work is not merely an abstract functional construction. Many modern machine learning methodologies already optimize objectives that can be interpreted as particular instances of the variational functional

J_{D} (f) = R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f) .

From this perspective, seemingly distinct learning paradigms differ primarily in the structural functionals imposed on the hypothesis space. Regularization, robustness, fairness constraints, physical admissibility, and interpretability can therefore be viewed as manifestations of a common variational architecture.

Figure 2 summarizes this viewpoint schematically. The learning objective is represented as a scalarized variational functional balancing predictive risk, robustness, fairness, complexity control, and interpretability. The associated optimization process induces a multi-objective geometry in which predictors correspond to points in a trade-off space, while the scalarized functional selects Pareto-optimal solutions according to the coefficients

(λ, η_{i}, τ)

. The figure also illustrates how several widely used methodologies arise naturally as particular instances of this general structure. Complementing this geometric perspective, Table 3 summarizes representative learning paradigms within the unified variational framework and identifies the corresponding risk terms, structural functionals, interpretability components, and typical application domains.

8.1. Worst-Case and Robustness Functionals

Adversarial training and distributionally robust optimization can both be interpreted as variational formulations incorporating worst-case stability directly into the learning objective. A standard adversarial objective takes the form

J (f) = R_{D} (f) + η_{rob} E [\sup_{∥ δ ∥ \leq ε} ℓ (f (X + δ), Y)],

where the inner maximization defines the robustness functional

Ψ_{rob} (f) = E [\sup_{∥ δ ∥ \leq ε} ℓ (f (X + δ), Y)] .

Distributionally robust optimization provides a related construction:

J (f) = \sup_{D^{'} \in U (D)} R_{D^{'}} (f),

where

U (D)

denotes an uncertainty set, typically defined through Wasserstein or divergence-based neighborhoods of the empirical distribution. In both cases, robustness is incorporated through worst-case functionals controlling sensitivity under perturbations of either the input or the underlying data distribution.

From the present perspective, adversarial training and DRO differ primarily in the geometry of the perturbation set defining the robustness functional. Optimization procedures such as projected gradient descent therefore act as computational approximations for minimizing a particular variational objective rather than as isolated algorithmic heuristics.

8.2. Constraint-Based Structural Objectives

Fairness-aware learning and physics-informed learning both introduce structural constraints directly into the optimization problem.

In fairness-aware learning, predictive risk is augmented by statistical dependence penalties or demographic constraints:

J (f) = R_{D} (f) + η_{fair} Ψ_{fair} (f) .

Typical choices of

Ψ_{fair}

include demographic parity penalties, equalized-odds constraints, mutual-information penalties, and kernel-based dependence measures such as HSIC. For example,

J (f) = R_{D} (f) + η_{fair} I (f (X); A),

penalizes statistical dependence between predictions and a protected attribute A.

Physics-informed neural networks introduce an analogous variational mechanism. Let

N [f] (x) = 0

denote a governing differential operator. PINN-type objectives take the form

J (f) = R_{D} (f) + η_{phys} E [{∥ N [f] (X) ∥}^{2}],

where the PDE residual defines a structural functional

Ψ_{phys} (f) = E [{∥ N [f] (X) ∥}^{2}] .

In both settings, structural admissibility is enforced directly at the level of the learning functional. Fairness constraints restrict discriminatory dependence, while physical penalties enforce consistency with governing equations. The resulting predictors are therefore shaped not only by predictive accuracy, but also by additional geometric or structural requirements imposed on the hypothesis space.

8.3. Regularization and Representation Functionals

Sparse coding, kernel methods, and several deep-learning heuristics can be interpreted as instances of structural regularization within the unified variational framework.

Sparse coding and dictionary learning optimize objectives of the form

J (D, w) = E [{∥ X - D w ∥}_{2}^{2}] + λ {∥ w ∥}_{1},

where the

ℓ^{1}

term acts as a structural functional promoting sparse representations.

Similarly, kernel methods impose RKHS penalties

Ω (f) = {∥ f ∥}_{H_{K}}^{2},

which control functional complexity through the geometry of the reproducing kernel Hilbert space.

Several widely used deep-learning heuristics admit analogous interpretations. Weight decay corresponds to Tikhonov-type regularization, dropout introduces stochastic regularization, and early stopping acts as an implicit regularizer restricting effective model complexity through optimization dynamics.

From a variational viewpoint, these methods differ primarily in the structural penalties used to control complexity, sparsity, or effective geometry of the optimization landscape.

8.4. Interpretability Functionals

Many interpretable learning systems incorporate explicit structural objectives promoting simplicity, explanation stability, or representation relevance. A representative objective takes the form

J (f) = R_{D} (f) + λ Ω (f) - τ I (f),

where the interpretability score

I (f)

may combine sparsity, information relevance, and stability of explanations.

Examples include sparse linear models, saliency regularization, explanation-consistency penalties, and concept-based representation constraints. In such formulations, interpretability acts as a utility-type structural functional competing directly with predictive accuracy and other structural objectives.

The unified variational framework developed in Section 6 provides a common mathematical setting in which interpretability, robustness, and fairness can be analyzed simultaneously rather than through disconnected post hoc procedures.

8.5. Optimization and Computational Considerations

Although the unified objective may involve multiple structural functionals, many of its components admit scalable approximations compatible with modern optimization pipelines. Adversarial robustness terms are commonly approximated through projected gradient methods, fairness penalties through minibatch estimators, and interpretability objectives through sparsity or smoothness regularization. Consequently, optimization procedures such as stochastic gradient descent, Adam-type methods, proximal algorithms, and alternating minimization can often be interpreted as computational strategies for approximating minimizers of composite variational objectives.

At the same time, highly nonconvex settings may require surrogate objectives, stochastic approximations, or specialized optimization schemes, particularly when information-theoretic penalties or explanation-based functionals are involved. The variational perspective nevertheless clarifies that these computational procedures operate on a common structural optimization problem rather than on unrelated collections of heuristics.

8.6. Limitations and Future Directions

The present work is primarily theoretical and variational in nature. The paper does not attempt to provide exhaustive empirical benchmarking across all instantiated objectives or application domains. Moreover, in highly nonconvex settings involving deep neural networks, robustness penalties, or information-theoretic objectives, practical optimization may require surrogate formulations whose theoretical properties remain only partially understood.

Several directions therefore remain open, including the analysis of optimization landscapes for composite structural objectives, scalable estimation of robustness and dependence functionals, and statistical consistency of Pareto-optimal solutions in high-dimensional hypothesis spaces. More broadly, the framework suggests that many deployed machine learning methodologies can be interpreted systematically through the language of variational optimization and structural functionals rather than as isolated algorithmic constructions.

9. Discussion, Computation Considerations and Open Problems

This paper adopts a variational perspective in which predictive risk, robustness, fairness, and interpretability are formulated as functionals on a hypothesis space and incorporated directly into a unified learning objective. The central idea is that many limitations of modern machine learning systems arise because standard optimization objectives encode predictive accuracy but omit additional structural properties required for reliable deployment. From this viewpoint, robustness, fairness, and interpretability are not external post hoc corrections, but intrinsic variational components of the learning problem itself.

Beyond bringing together several existing paradigms under a common formulation, this perspective also suggests new theoretical and computational challenges. In particular, combining modern nonconvex parameterizations with nonsmooth structural penalties leads to substantial difficulties in optimization, stability analysis, and characterization of minimizers.

9.1. Optimization of Nonconvex and Nonsmooth Objectives

The objectives arising from the unified formulation tend to combine several challenging features at once: nonconvex parameterizations (e.g., neural networks), nonsmooth penalties (such as

ℓ_{0} / ℓ_{1}

sparsity or max-type robustness terms), and dependence measures that may be difficult to estimate or differentiate. This combination makes even basic optimization questions nontrivial and leads to several open problems.

Algorithmic convergence under composite structure. Establish convergence guarantees for principled algorithms (proximal gradient, alternating minimization, primal–dual schemes, mirror descent) when the objective contains multiple competing functionals, some of which may be only lower semicontinuous or only available through stochastic estimators.
Provably correct surrogates. Many practically used substitutes (e.g., replacing ${∥ w ∥}_{0}$ by ${∥ w ∥}_{1}$ , mutual information by neural estimators, Wasserstein balls by tractable relaxations) change the geometry of the problem. A natural question is how closely minimizers (or Pareto frontiers) of surrogate objectives approximate those of the original formulation, and at what rate.
Stationarity notions and certificates. For nonsmooth/nonconvex formulations, classical first-order optimality conditions are insufficient. Developing appropriate notions (Clarke stationarity, variational inequalities, weak KKT-type conditions under constraints) and computable certificates is essential for both theory and reproducibility.

9.2. Choice of Trade-Off Parameters and Identifiability of the Pareto Frontier

The parameters

(λ, η_{i}, τ)

govern the relative strength of accuracy, robustness, fairness, and interpretability. Selecting them is not merely a tuning issue; it determines which points on the Pareto set are accessible and how sensitive the solution is to modeling assumptions.

Principled calibration of weights. Develop approaches that connect weights to interpretable quantities (e.g., a bound on worst-case distribution shift size, a target fairness gap, or an interpretability budget). This suggests studying Lagrange-multiplier interpretations and dual formulations whenever constraints are used.
Sensitivity and stability of solutions. Analyze how minimizers vary with $(λ, η_{i}, τ)$ , including continuity/discontinuity of minimizers, bifurcations in nonconvex regimes, and conditions ensuring a well-behaved Pareto frontier.
Recovering the Pareto set. Linear scalarization recovers only supported Pareto optima under convexity. For nonconvex objectives, a substantial part of the frontier may be missed. Designing algorithms that explore non-supported Pareto points (e.g., $ϵ$ -constraint methods, adaptive scalarizations, or multi-objective proximal methods) remains open.

9.3. Scalability and Computational Complexity

Even when functionals are conceptually well-defined, they may be computationally prohibitive in modern regimes (large models, high-dimensional data, and streaming settings).

Efficient estimation of dependence penalties. Fairness functionals based on mutual information or conditional constraints require estimating high-dimensional dependence, often under distribution shift. Establishing sample complexity bounds and scalable estimators compatible with stochastic optimization is an important direction.
Robustness at scale. Distributional robustness over Wasserstein balls can be costly, and adversarial robustness may require expensive inner maximizations. A key challenge is to identify computationally tractable approximations with explicit error bounds, and to understand when robustness objectives lead to manageable training dynamics.
Sparse/structured interpretability for large models. Interpretability functionals that promote sparsity, modularity, or explanation stability may be natural for linear or kernel methods but become subtle for deep networks. Determining which structural constraints scale (and which collapse into vacuous penalties) is largely unresolved.

9.4. Alignment Between Mathematical Definitions and Human-Centric Notions

A central motivation for this framework is to give formal meanings to robustness, fairness, and interpretability. However, these notions originate in human expectations, legal requirements, and domain-specific semantics.

Fairness: Incompatibilities and context dependence. Different fairness definitions (demographic parity, equalized odds, calibration, individual fairness) encode distinct statistical and normative requirements and can therefore be mutually incompatible depending on the underlying data-generating process. In particular, impossibility results show that multiple fairness criteria cannot generally be satisfied simultaneously except under highly restrictive assumptions, especially when base rates differ across groups.
Interpretability: What is the object being stabilized? Stability of predictions is not the same as stability of explanations. Formalizing the explanation object (saliency maps, concept vectors, local surrogate models, attribution mechanisms) and validating that its stability corresponds to meaningful human understanding remains an open problem. Recent applied research in advanced manufacturing illustrates ongoing efforts to bridge this gap. For example, interpretability techniques such as SHAP and Grad-CAM [53,54] have been used to analyze black-box process dynamics in electrochemical machining systems [55,56], helping identify physically meaningful regions and feature interactions that align with established domain knowledge. These developments highlight the practical relevance of interpretability-oriented functionals while simultaneously emphasizing that mathematically stable explanations do not automatically guarantee scientifically or cognitively meaningful interpretations.
Robustness: Choosing the right perturbation model. Wasserstein balls, adversarial $ℓ_{p}$ perturbations, and distribution shift sets are mathematical proxies for deployment uncertainty. Selecting perturbation classes that accurately reflect real-world shifts (while remaining analyzable) is a key bridge between theory and practice.

9.5. Further Theoretical Directions

We conclude with several concrete mathematical questions suggested by the unified formulation:

Existence and compactness. Provide general conditions (coercivity, lower semicontinuity, tightness) ensuring existence of minimizers for objectives combining $R_{D}$ , robustness and fairness penalties, and interpretability scores.
Generalization under structural constraints. Extend stability and complexity-based generalization analyses to objectives with Wasserstein robust risk, dependence-based fairness penalties, and explanation-based interpretability terms, including sharp rates and minimax optimality where possible.
Duality and certificates. Identify settings where robust and fair objectives admit strong dual representations. Duality can yield both computational algorithms and verifiable certificates (e.g., worst-case shift witnesses, fairness-violation witnesses).
Axiomatic completeness. Determine whether there exist “complete” axiom systems for interpretability functionals (analogous to characterizations in risk measures), and whether different axiom choices lead to equivalent or genuinely distinct notions of interpretability.

Overall, the variational framework offers a precise language for formulating learning objectives that explicitly target reliability properties. At the same time, the open problems outlined above indicate that turning this perspective into a fully developed theory (and a practical design tool) will require progress across optimization, statistical estimation, and the formalization of human-centered notions within a coherent functional-analytic setting.

10. Conclusions

In this work, we introduced a unified variational framework for trustworthy machine learning in which robustness, fairness, and interpretability are formulated directly as structural functionals over the hypothesis space and incorporated into a single learning objective. From this perspective, standard empirical risk minimization is variationally under-specified: it optimizes predictive accuracy while leaving structural reliability properties largely implicit. As a consequence, robustness, fairness, and interpretability are often introduced only through external constraints or post hoc corrections whose interactions remain difficult to analyze systematically.

The proposed formulation treats these properties as intrinsic components of the optimization problem itself. Robustness can be encoded through perturbation-based or distributional functionals, fairness through dependence penalties or statistical constraints on predictive distributions, and interpretability through structural objectives associated with simplicity, relevance, and stability of explanations.

Within this framework, many existing paradigms can be interpreted as particular instances of a common variational architecture differing primarily in the choice of loss functionals, hypothesis spaces, and structural constraints.A central consequence of this viewpoint is that reliability objectives interact intrinsically with predictive performance and with one another. The unified formulation therefore provides a natural setting for analyzing trade-offs, Pareto-optimality, stability, and well-posedness within a common functional-analytic framework. In particular, the framework supports the interpretation of trustworthy behavior not as a secondary heuristic adjustment, but as a structural consequence of the variational principles defining the learning problem.

Several theoretical questions remain open, including the analysis of highly nonconvex composite objectives, scalable estimation of robustness and dependence functionals, and the geometric structure of Pareto-optimal solution sets in high-dimensional hypothesis spaces. More broadly, the present framework suggests that many apparently distinct trustworthy-learning methodologies can be understood through a common language of variational optimization and structural functionals.

We hope that the perspective of this work contributes toward the development of learning principles in which reliability properties become intrinsic and mathematically analyzable components of machine learning systems.

Author Contributions

Conceptualization, J.M.V.; methodology, J.M.V.; formal analysis, J.M.V. and B.G.-P.; investigation, J.M.V.; writing—original draft preparation, J.M.V.; writing—review and editing, J.M.V. and B.G.-P. All authors have read and agreed to the published version of the manuscript.

Funding

This work is funded by national funds of the Spanish Minister of Science, Innovation, and Universities and the Agencia Española de Investigación, through grants PDC2025-165077-I00 and PID2024-158129OB-I00 and by FEDER, UE: MICIU/AEI/10.13039/501100011033.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Proposition 1

Proof.

We first show that

J_{D}

is lower semicontinuous with respect to the topology of

F

. By assumption, the functionals

R_{D}

,

Ω

, and each

Ψ_{i}

are lower semicontinuous. Since I is upper semicontinuous, the functional

- I

is lower semicontinuous. Therefore, since non-negative linear combinations of lower semicontinuous functionals remain lower semicontinuous, and since

λ > 0

,

η_{i} \geq 0

, and

τ \geq 0

, the functional

J_{D} (f) = R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f)

is lower semicontinuous.

We now prove (i). Let

{(f_{n})}_{n \in N} \subset F

be a minimizing sequence; that is,

J_{D} (f_{n}) \to \inf_{f \in F} J_{D} (f) .

Choose

c \in R

such that, for all sufficiently large n,

J_{D} (f_{n}) \leq c .

Passing to a tail of the sequence if necessary, we may assume that

f_{n} \in {f \in F : J_{D} (f) \leq c}

for every n. By assumption (H3), this sublevel set is relatively compact in the topology of

F

. Hence there exist a subsequence, not relabeled, and an element

f^{†} \in F

such that

f_{n} \to f^{†}

in the topology of

F

. Since

J_{D}

is lower semicontinuous,

J_{D} (f^{†}) \leq \underset{n \to \infty}{lim inf} J_{D} (f_{n}) = \inf_{f \in F} J_{D} (f) .

On the other hand, since

f^{†} \in F

, one trivially has

\inf_{f \in F} J_{D} (f) \leq J_{D} (f^{†}) .

Combining the two inequalities yields

J_{D} (f^{†}) = \inf_{f \in F} J_{D} (f),

which proves the existence of a minimizer.

We next prove (ii). Since

f^{†}

minimizes

J_{D}

, for every

f \in F

we have

J_{D} (f^{†}) \leq J_{D} (f) .

In particular, taking

f = f^{*}

gives

R_{D} (f^{†}) + λ Ω (f^{†}) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{†}) - τ I (f^{†}) \leq R_{D} (f^{*}) + λ Ω (f^{*}) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{*}) - τ I (f^{*}) .

Rearranging terms, we obtain

R_{D} (f^{†}) - R_{D} (f^{*}) \leq λ (Ω (f^{*}) - Ω (f^{†})) + \sum_{i = 1}^{k} η_{i} (Ψ_{i} (f^{*}) - Ψ_{i} (f^{†})) - τ (I (f^{*}) - I (f^{†})) .

This estimate quantifies the trade-off between predictive risk and the structural terms appearing in the unified objective.

Finally, we prove (iii). Suppose first that all scalarization weights associated with the structural objectives are strictly positive; that is,

η_{i} > 0 for all i, τ > 0 .

Assume, by contradiction, that

f^{†}

is not Pareto-optimal for the multi-objective problem

\min_{f \in F} (R_{D} (f), Ψ_{1} (f), \dots, Ψ_{k} (f), - I (f)) .

Then, there exists

\tilde{f} \in F

such that

R_{D} (\tilde{f}) \leq R_{D} (f^{†}), Ψ_{i} (\tilde{f}) \leq Ψ_{i} (f^{†}) for all i,

and

- I (\tilde{f}) \leq - I (f^{†}),

with at least one of these inequalities being strict. Equivalently,

I (\tilde{f}) \geq I (f^{†}),

with strict improvement in at least one objective.

Since all scalarization weights are strictly positive, summing the weighted inequalities yields

J_{D} (\tilde{f}) < J_{D} (f^{†}),

which contradicts the minimality of

f^{†}

. Therefore,

f^{†}

is Pareto-optimal for the stated multi-objective problem.

If some of the weights

η_{i}

or

τ

vanish, the same argument shows that

f^{†}

is Pareto-optimal with respect to the objectives appearing with strictly positive weights in the scalarized functional

J_{D}

. □

Appendix B. Proof of Proposition 2

Proof.

We show that the stated objective is an instance of the unified variational template.

Let

F = {(D, w) : D \in D, w \in R^{k}},

where

D \subset R^{d \times k}

is a set of admissible dictionaries. For a datum

x \in X \subset R^{d}

, the model approximates x by the linear reconstruction

D w

.

Define

ℓ ((D, w), x) = {∥ x - D w ∥}_{2}^{2}, Ω (D, w) = {∥ w ∥}_{1} .

Then the corresponding population risk is

R_{D_{X}} (D, w) = E_{X \sim D_{X}} [{∥ X - D w ∥}_{2}^{2}] .

Substituting these choices into the unified objective, with

Ψ_{i} \equiv 0, I \equiv 0,

gives

J (D, w) = E_{X \sim D_{X}} [{∥ X - D w ∥}_{2}^{2}] + λ {∥ w ∥}_{1} .

Thus the unified functional reduces to the population sparse coding objective: the first term penalizes reconstruction error, while the

ℓ^{1}

term promotes sparsity of the code.

For a finite sample

x_{1}, \dots, x_{n}

, the corresponding empirical formulation is

J_{n} (D, {w_{i}}_{i = 1}^{n}) = \frac{1}{n} \sum_{i = 1}^{n} ∥ x_{i} - D w_{i} ∥_{2}^{2} + λ \sum_{i = 1}^{n} {∥ w_{i} ∥}_{1} .

This is the standard empirical dictionary learning and sparse coding problem, where each datum

x_{i}

is assigned its own sparse code

w_{i}

.

Classical algorithms solve this problem by alternating minimization: one fixes

D

and optimizes the sparse codes

w_{i}

, and then fixes the codes and updates the dictionary

D

. Although the objective is generally nonconvex jointly in

(D, {w_{i}})

, it has the standard block structure used in dictionary learning algorithms. Hence, dictionary learning and sparse coding arise as special cases of the unified variational framework. □

Appendix C. Proof of Lemma 1

Proof.

We prove each statement separately.

(a): Assume that $Y$ is finite. Then the entropy $H (Y)$ is finite. By the definition of mutual information,

$I (Z; Y) = H (Y) - H (Y ∣ Z) .$

Since conditional entropy is non-negative,

$H (Y ∣ Z) \geq 0,$

it follows that

$0 \leq I (Z; Y) \leq H (Y) < \infty .$

Therefore,

$M (f) = I (Z; Y)$

is finite and bounded above by $H (Y)$ .
(b): By assumption, the explanation map

$E_{f} : X \to R^{m}$

is measurable, and the pair $(X, Δ)$ is jointly measurable. Hence, the maps

$ω \mapsto E_{f} (X (ω) + Δ (ω))$

and

$ω \mapsto E_{f} (X (ω))$

are measurable from $Ω$ into $R^{m}$ .
Since subtraction and the Euclidean norm are continuous operations, the map

$ω \mapsto ∥ E_{f} (X (ω) + Δ (ω)) - E_{f} {(X (ω)) ∥}_{2}$

is a measurable non-negative random variable.
By assumption,

$E [∥ E_{f} (X + Δ) - E_{f} {(X) ∥}_{2}] < \infty .$

Therefore,

$T_{\exp} (f) = - E [∥ E_{f} (X + Δ) - E_{f} {(X) ∥}_{2}]$

is well-defined and finite.
Moreover, since the expectation of a non-negative random variable is non-negative,

$E [∥ E_{f} (X + Δ) - E_{f} {(X) ∥}_{2}] \geq 0,$

and therefore,

$T_{\exp} (f) \leq 0 .$
(c): Assume that

$S (f) \in R, M (f) = I (Z; Y) < \infty,$

and

$T_{\exp} (f) > - \infty .$

Then the simplicity score $S (f)$ is finite by assumption, and

$M (f) = I (Z; Y)$

is finite as well.
Moreover,

$T_{\exp} (f) > - \infty$

implies that the stability contribution is finite.
Since $α, β, γ \geq 0$ , it follows that

$I (f) = α S (f) + β M (f) + γ T_{\exp} (f)$

is a finite real number.

□

Appendix D. Proof of Proposition 3

Proof.

Let

S = (z_{1}, \dots, z_{n}), z_{i} = (x_{i}, y_{i}),

and let

S^{(i)}

denote the sample obtained from S by replacing the i-th observation

z_{i}

with an independent sample point

z_{i}^{'}

.

For each sample S, define

{\hat{f}}_{S} \in \arg \min_{f \in F} J_{S} (f),

where the empirical objective is

J_{S} (f) = \frac{1}{n} \sum_{j = 1}^{n} ℓ (f (x_{j}), y_{j}) + λ Ω (f) + \sum_{r = 1}^{k} η_{r} Ψ_{r} (f) - τ I (f) .

Similarly, let

{\hat{f}}_{S^{(i)}} \in \arg \min_{f \in F} J_{S^{(i)}} (f) .

By assumption,

Ω

is

μ

-strongly convex and

λ > 0

. Since each

Ψ_{r}

is convex and

- I

is convex, it follows that the functional

f \mapsto λ Ω (f) + \sum_{r = 1}^{k} η_{r} Ψ_{r} (f) - τ I (f)

is

λ μ

-strongly convex. Therefore both empirical objectives

J_{S}

and

J_{S^{(i)}}

are

λ μ

-strongly convex.

Set

f = {\hat{f}}_{S}, g = {\hat{f}}_{S^{(i)}} .

By strong convexity of

J_{S}

and the optimality of f,

J_{S} (g) - J_{S} (f) \geq \frac{λ μ}{2} {∥ g - f ∥}^{2} .

Similarly, by strong convexity of

J_{S^{(i)}}

and the optimality of g,

J_{S^{(i)}} (f) - J_{S^{(i)}} (g) \geq \frac{λ μ}{2} {∥ g - f ∥}^{2} .

Adding the two inequalities gives

{λ μ ∥ g - f ∥}^{2} \leq J_{S} (g) - J_{S} (f) + J_{S^{(i)}} (f) - J_{S^{(i)}} (g) .

The two empirical objectives differ only in the contribution of the replaced observation. Hence all common terms cancel, yielding

{λ μ ∥ g - f ∥}^{2} \leq \frac{1}{n} (ℓ (g, z_{i}) - ℓ (f, z_{i}) + ℓ (f, z_{i}^{'}) - ℓ (g, z_{i}^{'})),

where we use the shorthand notation

ℓ (f, z) = ℓ (f (x), y) .

Since the loss is L-Lipschitz with respect to the parameter norm,

| ℓ (g, z) - ℓ (f, z) | \leq L ∥ g - f ∥

for every sample point z. Therefore,

{λ μ ∥ g - f ∥}^{2} \leq \frac{2 L}{n} ∥ g - f ∥ .

If

g \neq f

, dividing both sides by

∥ g - f ∥

yields

∥ g - f ∥ \leq \frac{2 L}{λ μ n} .

If

g = f

, the inequality holds trivially.

Now let

z = (x, y)

be an arbitrary test point. Using again the L-Lipschitz continuity of the loss,

| ℓ ({\hat{f}}_{S} (x), y) - ℓ ({\hat{f}}_{S^{(i)}} (x), y) | \leq L ∥ {\hat{f}}_{S} - {\hat{f}}_{S^{(i)}} ∥ .

Combining this estimate with the previous bound gives

| ℓ ({\hat{f}}_{S} (x), y) - ℓ ({\hat{f}}_{S^{(i)}} (x), y) | \leq \frac{2 L^{2}}{λ μ n} .

Therefore the learning algorithm satisfies uniform stability with stability parameter

β_{n} \leq \frac{2 L^{2}}{λ μ n} .

In particular,

β_{n} = O (\frac{1}{n}),

which proves the claim. □

Appendix E. Proof of Proposition 4

Proof.

Since

f^{†}

minimizes

J_{D}

, we have

J_{D} (f^{†}) \leq J_{D} (f^{*})

for every

f^{*} \in F

. Expanding the definition of the objective functional gives

R_{D} (f^{†}) + λ Ω (f^{†}) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{†}) - τ I (f^{†}) \leq R_{D} (f^{*}) + λ Ω (f^{*}) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{*}) - τ I (f^{*}) .

Rearranging terms yields

\begin{matrix} R_{D} (f^{†}) - R_{D} (f^{*}) \leq & λ (Ω (f^{*}) - Ω (f^{†})) \\ + \sum_{i = 1}^{k} η_{i} (Ψ_{i} (f^{*}) - Ψ_{i} (f^{†})) \\ - τ (I (f^{*}) - I (f^{†})) . \end{matrix}

This proves the general trade-off inequality.

Assume now that the structural terms satisfy

Ω (f^{†}) \geq Ω (f^{*}), Ψ_{i} (f^{†}) \geq 0 for all i, I (f^{†}) \geq 0 .

Since

λ > 0

,

η_{i} \geq 0

, and

τ \geq 0

, it follows that

λ (Ω (f^{*}) - Ω (f^{†})) \leq 0,

and

- \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{†}) \leq 0 .

Moreover,

τ I (f^{†}) \geq 0 .

Substituting these estimates into the general inequality gives

R_{D} (f^{†}) - R_{D} (f^{*}) \leq \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{*}) - τ I (f^{*}) .

Equivalently,

R_{D} (f^{†}) \leq R_{D} (f^{*}) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f^{*}) - τ I (f^{*}) .

This proves the normalized form. □

Appendix F. Proof of Proposition 5

Proof.

Define the population objective

J_{D} (f) = R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f),

and its empirical counterpart

J_{S} (f) = R_{S} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f),

where

R_{S} (f) = \frac{1}{n} \sum_{j = 1}^{n} ℓ (f (x_{j}), y_{j}) .

By Proposition 3, the learning algorithm

S \mapsto {\hat{f}}_{S}

is uniformly stable with parameter

β_{n} \leq \frac{C}{n}

for some constant

C > 0

independent of n.

Standard stability estimates imply that

E [R_{D} ({\hat{f}}_{S}) - R_{S} ({\hat{f}}_{S})] \leq β_{n} .

Therefore,

E [R_{D} ({\hat{f}}_{S})] \leq E [R_{S} ({\hat{f}}_{S})] + β_{n} .

Since

{\hat{f}}_{S}

minimizes

J_{S}

, for every fixed

f \in F

,

J_{S} ({\hat{f}}_{S}) \leq J_{S} (f) .

Expanding this inequality gives

\begin{matrix} R_{S} ({\hat{f}}_{S}) \leq & R_{S} (f) + λ Ω (f) - λ Ω ({\hat{f}}_{S}) \\ + \sum_{i = 1}^{k} η_{i} (Ψ_{i} (f) - Ψ_{i} ({\hat{f}}_{S})) \\ - τ (I (f) - I ({\hat{f}}_{S})) . \end{matrix}

Taking expectations and using

E [R_{S} (f)] = R_{D} (f)

for fixed f, we obtain

\begin{matrix} E [R_{S} ({\hat{f}}_{S})] \leq & R_{D} (f) + λ Ω (f) \\ + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f) \\ - E [λ Ω ({\hat{f}}_{S}) + \sum_{i = 1}^{k} η_{i} Ψ_{i} ({\hat{f}}_{S}) - τ I ({\hat{f}}_{S})] . \end{matrix}

Dropping the final expectation term yields the upper bound

E [R_{S} ({\hat{f}}_{S})] \leq R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f) .

Combining this estimate with the stability bound gives

E [R_{D} ({\hat{f}}_{S})] \leq R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f) + β_{n} .

Since this inequality holds for every

f \in F

, taking the infimum over f yields

E [R_{D} ({\hat{f}}_{S})] \leq \inf_{f \in F} \{R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f)\} + β_{n} .

Finally, using the estimate

β_{n} \leq \frac{C}{n}

gives

E [R_{D} ({\hat{f}}_{S})] \leq \inf_{f \in F} \{R_{D} (f) + λ Ω (f) + \sum_{i = 1}^{k} η_{i} Ψ_{i} (f) - τ I (f)\} + \frac{C}{n} .

Thus the population risk of the empirical minimizer is controlled by the best regularized structural objective together with a stability term of order

O (1 / n)

. The structural penalties therefore act as bias-inducing terms, whereas the stability contribution represents the variance/generalization component. □

Appendix G. Proof of Proposition 6

Proof.

Since f is

L_{f}

-Lipschitz, for every

x \in X

and every perturbation

δ

satisfying

∥ δ ∥ \leq ε,

we have

{∥ f (x + δ) - f (x) ∥}_{T} \leq L_{f} ∥ (x + δ) - x ∥ = L_{f} ∥ δ ∥ .

Therefore,

{∥ f (x + δ) - f (x) ∥}_{T} \leq L_{f} ε .

Taking the supremum over all admissible perturbations yields

\sup_{∥ δ ∥ \leq ε} {∥ f (x + δ) - f (x) ∥}_{T} \leq L_{f} ε .

Applying expectation with respect to X gives

Ψ_{rob}^{loc} (f) = E_{X} [\sup_{∥ δ ∥ \leq ε} {∥ f (X + δ) - f (X) ∥}_{T}] \leq E_{X} [L_{f} ε] .

Since

L_{f}

and

ε

are constants independent of X,

Ψ_{rob}^{loc} (f) \leq L_{f} ε .

Hence,

Ψ_{rob}^{loc} (f) = O (L_{f} ε),

which proves the claim. □

References

Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 2019, 1, 206–215. [Google Scholar] [CrossRef] [PubMed]
Linardatos, P.; Papastefanopoulos, V.; Kotsiantis, S. Explainable AI: A Review of Machine Learning Interpretability Methods. Entropy 2021, 23, 18. [Google Scholar] [CrossRef]
Liu, X.; Faes, L.; Kale, A.; Wagner, S.; Fu, D.J.; Bruynseels, A.; Mahendiran, T.; Moraes, G.; Shamdas, M.; Kern, C.; et al. A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: A systematic review and meta-analysis. Lancet Digit. Health 2019, 1, e271–e297. [Google Scholar] [CrossRef]
Hajj, M.E.; Hammoud, J. Unveiling the Influence of Artificial Intelligence and Machine Learning on Financial Markets: A Comprehensive Analysis of AI Applications in Trading, Risk Management, and Financial Operations. J. Risk Financ. Manag. 2023, 16, 434. [Google Scholar] [CrossRef]
Mhlanga, D. Financial Inclusion in Emerging Economies: The Application of Machine Learning and Artificial Intelligence in Credit Risk Assessment. Int. J. Financ. Stud. 2021, 9, 39. [Google Scholar] [CrossRef]
Amarasinghe, K.; Rodolfa, K.T.; Lamba, H.; Ghani, R. Explainable machine learning for public policy: Use cases, gaps, and research directions. Data Policy 2020, 5, e5. [Google Scholar] [CrossRef]
Canhoto, A. Leveraging machine learning in the global fight against money laundering and terrorism financing: An affordances perspective. J. Bus. Res. 2020, 131, 441–452. [Google Scholar] [CrossRef]
Berk, R. Machine Learning Risk Assessments in Criminal Justice Settings; Springer: Cham, Switzerland, 2018; pp. 1–178. [Google Scholar] [CrossRef]
Vapnik, V. Statistical Learning Theory; Wiley: Hoboken, NJ, USA, 1998. [Google Scholar]
Steinwart, I.; Christmann, A. Support Vector Machines; Information Science and Statistics; Springer: New York, NY, USA, 2008; Available online: https://link.springer.com/book/10.1007/978-0-387-77242-4 (accessed on 29 April 2026).
Evans, L.C. Partial Differential Equations; American Mathematical Society: Providence, RI, USA, 2010. [Google Scholar]
Ekeland, I.; Témam, R. Convex Analysis and Variational Problems; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1999. [Google Scholar] [CrossRef]
Bottou, L.; Curtis, F.E.; Nocedal, J. Optimization Methods for Large-Scale Machine Learning. SIAM Rev. 2018, 60, 223–311. [Google Scholar] [CrossRef]
Mei, S.; Montanari, A.; Nguyen, P.M. A mean field view of the landscape of two-layer neural networks. Proc. Natl. Acad. Sci. USA 2018, 115, E7665–E7671. [Google Scholar] [CrossRef] [PubMed]
Du, K.L.; Zhang, R.; Jiang, B.; Zeng, J.; Lu, J. Understanding Machine Learning Principles: Learning, Inference, Generalization, and Computational Learning Theory. Mathematics 2025, 13, 451. [Google Scholar] [CrossRef]
Deisenroth, M.P.; Faisal, A.; Ong, C.S. Mathematics for Machine Learning; Cambridge University Press: Cambridge, UK, 2020. [Google Scholar]
Schölkopf, B.; Smola, A. Learning with Kernels; MIT Press: Cambridge, MA, USA, 2002; Available online: https://dl.acm.org/doi/book/10.5555/559923 (accessed on 29 April 2026).
Madry, A.; Makelov, A.; Schmidt, L.; Tsipras, D.; Vladu, A. Towards Deep Learning Models Resistant to Adversarial Attacks. In Proceedings of the 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, 30 April–3 May 2018; Conference Track Proceedings; Curran Associates, Inc.: Red Hook, NY, USA, 2018; Available online: https://dspace.mit.edu/entities/publication/b26e76fe-b1e7-4333-bdd5-bb2f87c20af5 (accessed on 29 April 2026).
Goodfellow, I.J.; Shlens, J.; Szegedy, C. Explaining and Harnessing Adversarial Examples. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015 Conference Track Proceedings, San Diego, CA, USA, 7–9 May 2015; Bengio, Y., LeCun, Y., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2015. [Google Scholar] [CrossRef]
Croce, F.; Hein, M. Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks. In Proceedings of the 37th International Conference on Machine Learning, ICML’20, Virtual, 13–18 July 2020; JMLR.org. Available online: https://dl.acm.org/doi/10.5555/3524938.3525144 (accessed on 29 April 2026).
Abadeh, S.S.; Nguyen, V.; Kuhn, D.; Esfahani, P.M. Wasserstein Distributionally Robust Kalman Filtering. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Montreal, QC, Canada, 3–8 December 2018; pp. 8474–8483. Available online: https://dl.acm.org/doi/10.5555/3327757.3327939 (accessed on 29 April 2026).
Blanchet, J.; Kang, Y.; Murthy, K. Robust Wasserstein profile inference and applications to machine learning. J. Appl. Probab. 2019, 56, 830–857. [Google Scholar] [CrossRef]
Villani, C. Optimal Transport: Old and New; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar] [CrossRef]
Gao, R.; Kleywegt, A. Distributionally Robust Stochastic Optimization with Wasserstein Distance. Math. Oper. Res. 2023, 48, 603–655. [Google Scholar] [CrossRef]
Sinha, A.; Namkoong, H.; Duchi, J.C. Certifying Some Distributional Robustness with Principled Adversarial Training. In Proceedings of the 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, 30 April–3 May 2018; Conference Track Proceedings; Curran Associates, Inc.: Red Hook, NY, USA, 2018. [Google Scholar]
Hardt, M.; Price, E.; Srebro, N. Equality of opportunity in supervised learning. In Proceedings of the 30th International Conference on Neural Information Processing Systems, NIPS’16; Curran Associates, Inc.: Red Hook, NY, USA, 2016; pp. 3323–3331. [Google Scholar]
Dwork, C.; Hardt, M.; Pitassi, T.; Reingold, O.; Zemel, R. Fairness through awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference, ITCS ’12; Association for Computing Machinery: New York, NY, USA, 2012; pp. 214–226. [Google Scholar] [CrossRef]
Kamishima, T.; Akaho, S.; Asoh, H.; Sakuma, J. Fairness-aware classifier with prejudice remover regularizer. In Proceedings of the 2012th European Conference on Machine Learning and Knowledge Discovery in Databases—Volume Part II, ECMLPKDD’12; Springer: Berlin/Heidelberg, Germany, 2012; pp. 35–50. [Google Scholar]
Kleinberg, J.; Mullainathan, S.; Raghavan, M. Inherent Trade-Offs in the Fair Determination of Risk Scores. In Proceedings of the 8th Innovations in Theoretical Computer Science Conference (ITCS 2017), Leibniz International Proceedings in Informatics (LIPIcs); Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik: Wadern, Germany, 2017; Volume 67, pp. 43:1–43:23. [Google Scholar] [CrossRef]
Chouldechova, A. Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data 2017, 5, 153–163. [Google Scholar] [CrossRef]
Liu, S.; Vicente, L.N. Accuracy and fairness trade-offs in machine learning: A stochastic multi-objective approach. Comput. Manag. Sci. 2022, 19, 513–537. [Google Scholar] [CrossRef]
Mehrabi, N.; Morstatter, F.; Saxena, N.; Lerman, K.; Galstyan, A. A Survey on Bias and Fairness in Machine Learning. ACM Comput. Surv. 2021, 54, 115. [Google Scholar] [CrossRef]
Wan, M.; Zha, D.; Liu, N.; Zou, N. In-Processing Modeling Techniques for Machine Learning Fairness: A Survey. ACM Trans. Knowl. Discov. Data 2023, 17, 35. [Google Scholar] [CrossRef]
Liu, H.; Chaudhary, M.; Wang, H. Towards Trustworthy and Aligned Machine Learning: A Data-centric Survey with Causality Perspectives. arXiv 2023, arXiv:2307.16851. [Google Scholar] [CrossRef]
Tishby, N.; Pereira, F.C.N.; Bialek, W. The information bottleneck method. arXiv 2000, arXiv:physics. [Google Scholar] [CrossRef]
Shwartz-Ziv, R.; Tishby, N. Opening the Black Box of Deep Neural Networks via Information. arXiv 2017, arXiv:1703.0081. Available online: https://arxiv.org/abs/1703.00810. [CrossRef]
Poole, B.; Ozair, S.; Van Den Oord, A.; Alemi, A.; Tucker, G. On Variational Bounds of Mutual Information. In Proceedings of the 36th International Conference on Machine Learning, Long Beach, CA, USA, 9–15 June 2019; Proceedings of Machine Learning Research; Chaudhuri, K., Salakhutdinov, R., Eds.; PMLR: Cambridge, MA, USA, 2019; Volume 97, pp. 5171–5180. Available online: https://proceedings.mlr.press/v97/poole19a/poole19a.pdf (accessed on 29 April 2026).
Alvarez-Melis, D.; Jaakkola, T.S. Towards robust interpretability with self-explaining neural networks. In Proceedings of the 32nd International Conference on Neural Information Processing Systems, NIPS’18; Curran Associates, Inc.: Red Hook, NY, USA, 2018; pp. 7786–7795. Available online: https://dl.acm.org/doi/10.5555/3327757.3327875 (accessed on 29 April 2026).
Fridkin, S.; Bendersky, M. Interpretable Machine Learning: A Comprehensive Review of Foundations, Methods, and the Path Forward. WIREs Data Min. Knowl. Discov. 2026, 16, e70075. [Google Scholar] [CrossRef]
Deb, K. Multi-Objective Optimization Using Evolutionary Algorithms; John Wiley & Sons: Hoboken, NJ, USA, 2001; Available online: https://www.egr.msu.edu/~kdeb/papers/k2011003.pdf (accessed on 29 April 2026).
Miettinen, K. Nonlinear Multiobjective Optimization; Kluwer: Boston, MA, USA, 1999; Available online: https://link.springer.com/book/10.1007/978-1-4615-5563-6 (accessed on 29 April 2026).
Boyd, S.; Vandenberghe, L. Convex Optimization; Cambridge University Press: Cambridge, UK, 2004; Available online: https://web.stanford.edu/~boyd/cvxbook/bv_cvxbook.pdf (accessed on 29 April 2026).
Jin, Y.; Sendhoff, B. Pareto-Based Multiobjective Machine Learning: An Overview and Case Studies. IEEE Trans. Syst. Man. Cybern. Part C 2008, 38, 397–415. [Google Scholar] [CrossRef]
Gardner, S.; Golovidov, O.; Griffin, J.; Koch, P.; Thompson, W.; Wujek, B.; Xu, Y. Constrained Multi-Objective Optimization for Automated Machine Learning. In 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA); IEEE: Piscataway, NJ, USA, 2019; pp. 364–373. [Google Scholar] [CrossRef]
Polson, N.G.; Sokolov, V.O. Bayesian regularization: From Tikhonov to horseshoe. Wiley Interdiscip. Rev. Comput. Stat. 2019, 11, e1463. [Google Scholar] [CrossRef]
Mohammad-Djafari, A. Regularization, Bayesian Inference, and Machine Learning Methods for Inverse Problems. Entropy 2021, 23, 1673. [Google Scholar] [CrossRef]
Cai, S.; Mao, Z.; Wang, Z.; Yin, M.; Karniadakis, G. Physics-informed neural networks (PINNs) for fluid mechanics: A review. Acta Mech. Sin. 2021, 37, 1727–1738. [Google Scholar] [CrossRef]
Cuomo, S.; Cola, V.S.D.; Giampaolo, F.; Rozza, G.; Raissi, M.; Piccialli, F. Scientific Machine Learning Through Physics–Informed Neural Networks: Where we are and What’s Next. J. Sci. Comput. 2022, 92, 88. [Google Scholar] [CrossRef]
Hanrahan, S.; Kozul, M.; Sandberg, R. Studying turbulent flows with physics-informed neural networks and sparse data. Int. J. Heat Fluid Flow 2023, 104, 109232. [Google Scholar] [CrossRef]
Cai, S.; Wang, Z.; Wang, S.; Perdikaris, P.; Karniadakis, G. Physics-Informed Neural Networks for Heat Transfer Problems. J. Heat Transf. 2021, 143, 060801. [Google Scholar] [CrossRef]
Harandi, A.; Moeineddin, A.; Kaliske, M.; Reese, S.; Rezaei, S. Mixed formulation of physics-informed neural networks for thermo-mechanically coupled systems and heterogeneous domains. Int. J. Numer. Methods Eng. 2023, 125, e7388. [Google Scholar] [CrossRef]
Bousquet, O.; Elisseeff, A. Stability and generalization. J. Mach. Learn. Res. 2002, 2, 499–526. [Google Scholar] [CrossRef][Green Version]
van Zyl, C.; Ye, X.; Naidoo, R. Harnessing eXplainable artificial intelligence for feature selection in time series energy forecasting: A comparative analysis of Grad-CAM and SHAP. Appl. Energy 2024, 353, 122079. [Google Scholar] [CrossRef]
Narkhede, J. Comparative Evaluation of Post-Hoc Explainability Methods in AI: LIME, SHAP, and Grad-CAM. In 2024 4th International Conference on Sustainable Expert Systems (ICSES); IEEE: Piscataway, NJ, USA, 2024; pp. 826–830. [Google Scholar] [CrossRef]
Tchoupe, E.; Heidemanns, L.; Küpper, U.; Klink, A.; Herrig, T.; Bergs, T. Evaluation of process stability in precise electrochemical machining using machine learning models based on extracted features. Procedia CIRP 2024, 126, 498–503. [Google Scholar] [CrossRef]
Ranganayakulu, J.; Bagi, N.; Bala, P.; Satya, G.; Murthy, N.; Rao, V. Investigation and enhancement of the process of electrochemical discharge machining: A study. Mater. Manuf. Process. 2025, 41, 136–151. [Google Scholar] [CrossRef]

Figure 1. Geometric intuition of the unified variational objective. Each predictor corresponds to a point in a multi-dimensional trade-off space. The scalarized objective selects the specific point

f^{†}

on the Pareto frontier according to the weight vector defined by the control panel parameters.

Figure 1. Geometric intuition of the unified variational objective. Each predictor corresponds to a point in a multi-dimensional trade-off space. The scalarized objective selects the specific point

f^{†}

on the Pareto frontier according to the weight vector defined by the control panel parameters.

Figure 2. Geometric interpretation of the unified variational framework. Each predictor

f \in F

corresponds to a point in a multi-objective space determined by predictive risk, robustness, fairness, complexity, and interpretability. The scalarized objective

J_{D} (f) = R_{D} (f) + λ Ω (f) + \sum_{i} η_{i} Ψ_{i} (f) - τ I (f)

selects Pareto-optimal predictors through weight-controlled trade-offs between competing structural objectives. Many modern machine learning paradigms, including adversarially robust learning, fairness-aware optimization, sparse coding, kernel methods, and physics-informed learning, arise as instances of this variational principle. Here,

f^{†}

denotes the Pareto-optimal predictor selected by the scalarized variational objective, while

\partial F^{*}

denotes the Pareto frontier (the boundary of the set of Pareto-optimal solutions).

Figure 2. Geometric interpretation of the unified variational framework. Each predictor

f \in F

corresponds to a point in a multi-objective space determined by predictive risk, robustness, fairness, complexity, and interpretability. The scalarized objective

J_{D} (f) = R_{D} (f) + λ Ω (f) + \sum_{i} η_{i} Ψ_{i} (f) - τ I (f)

selects Pareto-optimal predictors through weight-controlled trade-offs between competing structural objectives. Many modern machine learning paradigms, including adversarially robust learning, fairness-aware optimization, sparse coding, kernel methods, and physics-informed learning, arise as instances of this variational principle. Here,

f^{†}

denotes the Pareto-optimal predictor selected by the scalarized variational objective, while

\partial F^{*}

denotes the Pareto frontier (the boundary of the set of Pareto-optimal solutions).

Table 1. Structural comparison of representative trustworthy learning frameworks.

Framework	Variational Formulation	Structural Integration	Functional-Analytic Guarantees	Pareto Structure
Regularized ERM	Yes	Complexity	Yes	No
[9,10,17]		control
DRO/adversarial	Partial	Robustness	Partial	No
robustness [18,21,23]		only
Fairness-aware learning [26,27,29]	Partial	Fairness only	Limited	Partial
Interpretability	Rarely	Interpretability	Rarely	No
methods [1,2,35]		only
Multi-objective	Yes	Multiple	Limited	Yes
optimization [40,41,42]		objectives
		Robustness
Proposed framework	Yes	fairness	Yes	Yes
		interpretability

Table 2. Representative paradigms recovered as instances of the unified objective (12) by suitable choices of

F

, ℓ, and

(Ω, Ψ_{i})

.

Table 2. Representative paradigms recovered as instances of the unified objective (12) by suitable choices of

F

, ℓ, and

(Ω, Ψ_{i})

.

Paradigm	Hypothesis Space $F$	Loss ℓ	Key Functional Term(s)
Support Vector Machines (SVM)	RKHS $H_{K}$ (measurable representatives)	Hinge loss $ℓ_{hinge}$	$Ω (f) = {∥ f ∥}_{H_{K}}^{2}$ (Tikhonov/RKHS norm)
Physics-Informed Neural Networks (PINNs)	Sobolev-type space $F \subset W^{k, 2} (Ω) \cap C (Ω)$ (realized by NN parametrizations)	Data MSE/likelihood loss	$Ψ_{phys} (f) = E_{x \sim μ} [{∥ L f (x) - s (x) ∥}^{2}]$ (PDE/operator residual)
Fairness-aware learning (Demographic Parity)	Measurable predictors $f : X \to T$	Cross-entropy/logistic loss	$Ψ_{fair} (f) = I (f (X); A)$ (or an independence surrogate)
Dictionary learning/ sparse coding	Pairs $(D, w)$ with $D \in D$ , $w \in R^{k}$ ; model $x \approx D w$	Reconstruction MSE ${∥ x - D w ∥}_{2}^{2}$	$Ω (w) = {∥ w ∥}_{1}$ (sparsity of code)
Deep learning heuristics	Neural nets $f (\cdot; θ)$ , $θ \in R^{p}$	Task loss (CE/MSE)	Weight decay: $Ω (θ) = {∥ θ ∥}_{2}^{2}$ ; Dropout: stochastic $Ψ_{drop} (f)$ ; Early stopping: implicit regularization

Table 3. Examples of machine learning paradigms interpreted within the unified variational framework.

Learning Paradigm	Risk Term	Structural Functional	Interpretability	Typical Applications
ERM	$R_{D} (f)$	—	—	Standard supervised learning
Adversarial Training	$R_{D} (f)$	$Ψ_{rob} (f)$	—	Robust AI, cybersecurity
Fair ML	$R_{D} (f)$	$Ψ_{fair} (f)$	—	Credit scoring, hiring
Sparse Coding	${∥ X - D w ∥}^{2}$	${∥ w ∥}_{1}$	Sparse representations	Signal processing, imaging
PINNs	$R_{D} (f)$	$Ψ_{phys} (f)$	—	Scientific ML, inverse problems
Sparse/Explainable Models	$R_{D} (f)$	Complexity penalties	$I (f)$	Clinical AI, regulatory systems
Kernel Methods	$R_{D} (f)$	${∥ f ∥}_{H_{K}}^{2}$	RKHS simplicity	SVMs, nonparametric learning

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Velasco, J.M.; Gonzalez-Perez, B. A Unified Variational Principle for Reliable Machine Learning. Mathematics 2026, 14, 1994. https://doi.org/10.3390/math14111994

AMA Style

Velasco JM, Gonzalez-Perez B. A Unified Variational Principle for Reliable Machine Learning. Mathematics. 2026; 14(11):1994. https://doi.org/10.3390/math14111994

Chicago/Turabian Style

Velasco, Jose Manuel, and Beatriz Gonzalez-Perez. 2026. "A Unified Variational Principle for Reliable Machine Learning" Mathematics 14, no. 11: 1994. https://doi.org/10.3390/math14111994

APA Style

Velasco, J. M., & Gonzalez-Perez, B. (2026). A Unified Variational Principle for Reliable Machine Learning. Mathematics, 14(11), 1994. https://doi.org/10.3390/math14111994

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Unified Variational Principle for Reliable Machine Learning

Abstract

1. Introduction

2. Related Work

3. A Unified Variational Framework

3.1. Unified Functional Formulation

3.2. Well-Posedness and Basic Properties

3.3. Intuitive Interpretation: The Control Panel View

4. Reinterpreting Existing Paradigms

4.1. Comparison of Paradigms Within the Unified Framework

4.2. A Fully Rigorous Instance in a Reproducing Kernel Hilbert Space

5. Robustness and Fairness as Structural Functionals

5.1. Robustness Functionals

5.2. A Fundamental Trade-Off: Fairness vs. Accuracy

5.3. Discussion

6. Interpretability as a Variational Functional

6.1. Axiomatic Setup and Notation

6.2. Definition of the Interpretability Score

6.3. Well-Posedness Considerations

6.4. Integration into the Unified Objective

6.5. Multi-Objective Interpretation

7. Refinements and Consequences of the Unified Variational Principle

7.1. Uniform Stability of Empirical Minimizers

7.2. Implications for Generalization

7.3. Refined Trade-Off Inequality

7.4. Bias–Variance Interpretation

7.5. Robustness and Regularity

7.6. Summary

8. Computational Perspective and Practical Instantiations

8.1. Worst-Case and Robustness Functionals

8.2. Constraint-Based Structural Objectives

8.3. Regularization and Representation Functionals

8.4. Interpretability Functionals

8.5. Optimization and Computational Considerations

8.6. Limitations and Future Directions

9. Discussion, Computation Considerations and Open Problems

9.1. Optimization of Nonconvex and Nonsmooth Objectives

9.2. Choice of Trade-Off Parameters and Identifiability of the Pareto Frontier

9.3. Scalability and Computational Complexity

9.4. Alignment Between Mathematical Definitions and Human-Centric Notions

9.5. Further Theoretical Directions

10. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Proof of Proposition 1

Appendix B. Proof of Proposition 2

Appendix C. Proof of Lemma 1

Appendix D. Proof of Proposition 3

Appendix E. Proof of Proposition 4

Appendix F. Proof of Proposition 5

Appendix G. Proof of Proposition 6

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI