1. Introduction
Risk measures play an essential role in both academic research and financial practice, as they provide a systematic way to assess the potential losses of a financial position. Their importance has been growing in industry and academia since the work of Artzner et al. (1999), which introduced an axiomatic framework for coherent risk measures. Subsequent studies, such as (Föllmer and Schied 2011, chp. 4) or Delbaen (2000), have refined and extended these ideas.
In the monetary risk-measure framework, one models a financial position using a real-valued random variable X on a probability space $(\Omega, \mathcal{F}, P)$. The number $X(\omega)$ is the discounted net worth of the position if scenario $\omega$ occurs. A monetary risk measure, $\rho$, assigns a real number, $\rho(X)$, to the outcome of a random variable. This real number represents the minimal amount of capital needed to make X acceptable according to certain risk criteria. Desirable properties include monotonicity (increasing the payoff lowers the risk) and translation invariance (adding a sure amount of cash decreases the risk by the same amount), and when $\rho$ is further assumed to be convex or coherent, it reflects the benefits of diversification. We refer to Artzner et al. (1999); Delbaen (2000); Föllmer and Schied (2011) for standard references on these terms.
A preferred way to price financial claims in financial mathematics (and hence risk measurement, when viewed from a market perspective) is to determine the price as the expectation of the discounted payoff under an equivalent (local) martingale measure. However, in general markets and for certain price processes, a (local) martingale measure need not exist. Instead, from the more general perspective of no-arbitrage (or no free lunch with vanishing risk, to be more precise), one can only ensure the existence of an equivalent σ-martingale measure (EσMM). See, for instance, Delbaen and Schachermayer (1998); Kallsen (2004); Sohns (2025) for details on σ-martingale arguments. In addition, in incomplete markets, there may be multiple such measures, and a natural question is how to select the “best” or “preferred” measure among the many.
One popular selection criterion in incomplete markets is to pick the measure that is “closest” to the real-world probability measure $P$ in terms of the relative entropy (also known as the Kullback–Leibler divergence); see Delbaen et al. (2002); Frittelli (2000); Fujiwara and Miyahara (2003); Miyahara (2004). Minimizing relative entropy leads to the well-known minimal-entropy martingale measure, a construction that has proven valuable in option pricing and hedging and which is closely connected to maximizing the investor’s utility (see Section 4).
Most of the literature focuses on the local martingale setting when studying minimal-entropy measures. The corresponding minimal-entropy σ-martingale measure has not been studied, even though, in some markets, one must work with EσMMs. We close this gap by introducing and studying the minimal-entropy σ-martingale measure

$Q_E = \arg\min_{Q \in \mathcal{M}_\sigma} H(Q|P),$

where $\mathcal{M}_\sigma$ denotes the set of EσMMs, and $H(Q|P)$ is the relative entropy with respect to $P$. The associated minimal-entropy risk measure is then

$\rho_E(X) = E_{Q_E}[-X].$

This measure is an extension of the classical minimal-entropy martingale measure (since an ELMM is a special case of an EσMM) but is strictly more general whenever no local martingale measure exists. Notably, while the minimal-entropy martingale measure has been studied for pricing and hedging, it has not been viewed or analyzed as a traditional risk measure in the classical sense. Nor has the σ-martingale version been examined at all. In this paper, we fill this gap by proving that the minimal-entropy σ-martingale measure leads to a coherent risk measure with desirable properties.
A different risk measure, and one of the most popular ones, is the entropic risk measure. Because of the similar name, one might suspect similar definitions of the entropic risk measure and the minimal-entropy methods. Nevertheless, the entropic risk measure is typically introduced via the exponential formula

$\rho_\gamma(X) = \frac{1}{\gamma} \log E_P\left[ e^{-\gamma X} \right],$

which does not indicate any connection to entropy. Its name stems from its robust representation,

$\rho_\gamma(X) = \sup_{Q \sim P} \left( E_Q[-X] - \frac{1}{\gamma} H(Q|P) \right), \quad (1)$

showing that it penalizes deviations from $P$ in proportion to the relative entropy. Although this measure is well understood as a convex, time-consistent risk measure, the label “entropic” might not be fully transparent when beginning from an exponential definition. Here, we show that one can equivalently start with a relative-entropy-based formulation, making the “entropic” nature more obvious. We then show that, with this alternative definition, one easily arrives at the same results and conclusions.
This measure is strictly convex (rather than linear) in the payoff, X. In contrast, the minimal-entropy risk measure picks out a single measure, the one that eliminates arbitrage and has the least relative entropy, and is linear in X under that measure. As a result, it turns out to be coherent. Despite these structural differences, both measures share a fundamental entropy-based underpinning.
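To make this contrast concrete, here is a small numerical sketch on a hypothetical three-state sample space (all weights and payoffs are illustrative assumptions). It evaluates the entropic risk via its exponential formula and shows that positive homogeneity, and hence coherence, fails, whereas any fixed-measure expectation such as $E_{Q_E}[-X]$ trivially scales linearly.

```python
import numpy as np

# Hypothetical three-state sample space: real-world weights p, payoff x.
p = np.array([0.2, 0.3, 0.5])
x = np.array([-1.0, 0.5, 2.0])
gamma = 1.0

def entropic_risk(z, p, gamma):
    """Entropic risk via the exponential formula (1/gamma) * log E_P[exp(-gamma Z)]."""
    return float(np.log(np.sum(p * np.exp(-gamma * z))) / gamma)

r1 = entropic_risk(x, p, gamma)
r2 = entropic_risk(2.0 * x, p, gamma)
# Strict convexity: doubling a nonconstant position more than doubles the
# risk, so positive homogeneity (and hence coherence) fails in general.
print(r2 > 2.0 * r1)  # True
```

The strict inequality holds for any nonconstant payoff by Jensen's inequality, which is exactly why the entropic risk is convex but not coherent.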
The foundations of entropic risk measures and minimal-entropy martingale measures (MEMM) date back over two decades, but the topic remains actively researched. Recent studies address both theory and applications. Chong et al. (2019) introduced a forward entropic risk measure based on ergodic backward stochastic differential equations and studied its behavior at large maturities. Pichler and Schlotter (2020) analyzed entropy-based risk measures and, in particular, their convexity and dual representations. Wang et al. (2020) investigated the entropic measure transform and its applications in risk-sensitive decisions. Dhaene et al. (2015) studied MEMMs for markets that contain financial and actuarial risks. Ishikawa and Robertson (2020) examined optimal investment and pricing problems that include default risks through entropy-based techniques. Doldi et al. (2024) connected entropy martingale optimal transport to dualities in nonlinear pricing and hedging. Kabanov and Sonin (2025) derived explicit characterizations of MEMMs for exponential Ornstein–Uhlenbeck volatility models. McCloud (2025) studied pricing in incomplete markets through an entropic risk measure that accounts for default risks. Finally, Marthe et al. (2025) used entropic risk measures for risk-sensitive planning within Markov decision processes.
The main contributions of this paper are as follows:
We introduce the minimal-entropy σ-martingale measure for general semimartingale models, which is new to the literature.
We prove that the induced minimal-entropy risk measure is coherent and extends the classical minimal-entropy martingale measure results.
We define the entropic risk measure via its robust representation and, therefore, provide an alternative approach that highlights its connection to entropy.
We demonstrate key properties including convexity, coherence, dynamic consistency, and optimal risk transfer for both measures, thereby revealing that minimal-entropy techniques are not only pricing tools but also valid risk measures in their own right.
We provide estimates comparing the two risk measures with the expectation under the real-world measure.
The paper is organized as follows. Section 2 contains the precise definitions of both the minimal-entropy σ-martingale measure (by minimizing relative entropy under the no-arbitrage condition) and the entropic risk measure (via a relative-entropy supremum), along with existence criteria. We establish their existence and compare their properties. Section 3 focuses on the definition of monetary risk measures, convexity, and coherence, proving that the minimal-entropy risk measure is coherent, while the entropic measure is convex. In Section 4, we explore duality and highlight the deeper relationship between entropy-based valuations and risk measures; in particular, we show that our definition of the entropic risk measure is equivalent to the more common definition in the literature, and we elaborate that the setup works for general processes and probability measures. Section 5 provides dynamic versions, establishing time consistency. Finally, Section 6 discusses optimal risk transfer and how each of these risk measures behaves in that context. In Appendix A, we collect some well-known results that are used in this publication.
2. Definition and Existence
Let $(\Omega, \mathcal{F}, P)$ be a probability space, $L^\infty$ the space of bounded random variables, and $\mathcal{P}$ the set of probability measures on $(\Omega, \mathcal{F})$. Furthermore, let $(\mathcal{F}_t)_{t \in [0,T]}$ be a filtration with $\mathcal{F}_T = \mathcal{F}$, and let S be a (potentially multi-dimensional) stochastic process adapted to $(\mathcal{F}_t)$. We assume S is the discounted price process (for details on discounting, see Sohns (2023)).
In most models, at least one probability measure exists such that the discounted process, S, is a local martingale under it. However, Delbaen and Schachermayer (1998) showed that, in an arbitrage-free market (more precisely, a market that satisfies no free lunch with vanishing risk), one can only assume that a probability measure exists under which S is a σ-martingale. Therefore, it makes sense to study σ-martingales, equivalent σ-martingale measures, and derived risk measures.
First, let us recall the following:
Definition 1. A one-dimensional semimartingale, S, is called a σ-martingale if there exists a sequence of predictable sets, $(D_n)_{n \in \mathbb{N}}$, such that the following applies:
- (i) $D_n \subseteq D_{n+1}$ for all n;
- (ii) $\bigcup_{n \in \mathbb{N}} D_n = \Omega \times \mathbb{R}_+$;
- (iii) For each $n$, the process $\mathbf{1}_{D_n} \cdot S$ is a uniformly integrable martingale.
Such a sequence, $(D_n)_{n \in \mathbb{N}}$, is called a σ-localizing sequence. A d-dimensional semimartingale is called a σ-martingale if each of its components is a one-dimensional σ-martingale.
Definition 2. An equivalent σ-martingale measure (EσMM) is a probability measure $Q$ with $Q \sim P$, such that S is a σ-martingale under $Q$. The set of all EσMMs is denoted as $\mathcal{M}_\sigma$.
We also consider the broader set of measures that are merely absolutely continuous with respect to $P$ and under which S is a σ-martingale.
Definition 3. The relative entropy of a probability measure, $Q$, with respect to $P$ is defined as

$H(Q|P) = E_P\left[ \frac{dQ}{dP} \log \frac{dQ}{dP} \right]$ if $Q \ll P$, and $H(Q|P) = +\infty$ otherwise.

The relative entropy provides a notion of distance between a probability measure, $Q$, and a reference probability measure, $P$. Even though it can be interpreted as a distance, it is not a metric since neither the symmetry property nor the triangle inequality holds.
Beyond financial mathematics, relative entropy is a crucial concept in statistical physics and information theory, where it is referred to as the Kullback–Leibler divergence (Shunsuke 1993). It also plays a fundamental role in large deviations theory, where it underpins results such as Sanov’s theorem (Cover and Thomas 2012). A comprehensive summary of its applications can be found in Cherny and Maslov (2004). Further illustrations of this definition are available in Dacunha-Castelle and Duflo (1986), including a demonstration of how relative entropy can be understood as a distance measure. In statistics, this interpretation is reinforced by results such as Stein’s Lemma, which can be found in (Hesse 2003, Satz 10.4).
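As a quick illustration of the asymmetry noted above, the following sketch computes the relative entropy of two discrete distributions in both directions (the distributions themselves are arbitrary assumptions):

```python
import numpy as np

def relative_entropy(q, p):
    """H(Q|P) = sum q_i log(q_i / p_i) for discrete measures; +inf if Q is not << P."""
    q, p = np.asarray(q, float), np.asarray(p, float)
    if np.any((p == 0) & (q > 0)):
        return np.inf
    mask = q > 0
    return float(np.sum(q[mask] * np.log(q[mask] / p[mask])))

p = [0.5, 0.4, 0.1]
q = [0.3, 0.3, 0.4]
h_qp = relative_entropy(q, p)
h_pq = relative_entropy(p, q)
# Nonnegative in both directions, but H(Q|P) != H(P|Q): not a metric.
print(h_qp, h_pq)
```

Running this shows two distinct nonnegative numbers, confirming that symmetry fails even in the simplest discrete setting.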
One of the central questions in mathematical finance is how to determine the fair price of a claim, X. If the payoff at time T can be replicated by trading in the underlying asset, S, then the initial cost of the replicating strategy equals the fair price; otherwise, an arbitrage opportunity would exist.
More concretely, suppose there is an admissible self-financing strategy, $\vartheta$, such that the initial capital $V_0$ plus the stochastic integral $(\vartheta \cdot S)_T$ matches the terminal payoff X. Then the fair price is $V_0$. If there exists an equivalent probability measure, $Q$, under which S is a martingale, and if suitable integrability conditions hold, then $\vartheta \cdot S$ is also a martingale under $Q$. In this case, the initial value equals the expected terminal value:

$V_0 = E_Q[X],$

so the fair price is given via the expectation under $Q$.
An equivalent true martingale measure may not always exist. In such cases, one often uses an equivalent local martingale measure. There are also models without any local martingale measure. However, if the market is free of arbitrage, the first fundamental theorem of asset pricing ensures the existence of at least one equivalent σ-martingale measure (see Delbaen and Schachermayer (1998)), making the discounted price process a σ-martingale.
In a complete market, this measure is unique, leading to a single fair price. In incomplete markets, there are multiple equivalent σ-martingale measures. The range of fair values then lies between

$\inf_{Q \in \mathcal{M}_\sigma} E_Q[X] \quad \text{and} \quad \sup_{Q \in \mathcal{M}_\sigma} E_Q[X].$

Thus, the fair price must lie within this interval (Sohns 2023). It is, therefore, reasonable to define the minimal-entropy risk measure as an optimization problem over all equivalent σ-martingale measures.
Definition 4.
- (a) An equivalent σ-martingale measure $Q_E$ is called the minimal-entropy σ-martingale measure if it minimizes the relative entropy (or Kullback–Leibler divergence) among all $Q \in \mathcal{M}_\sigma$, i.e.,

$H(Q_E|P) = \min_{Q \in \mathcal{M}_\sigma} H(Q|P).$

The corresponding minimal-entropy risk measure is defined by

$\rho_E(X) = E_{Q_E}[-X].$

- (b) For $\gamma > 0$ and $X \in L^\infty$, the entropic risk is defined as

$\rho_\gamma(X) = \sup_{Q \sim P} \left( E_Q[-X] - \frac{1}{\gamma} H(Q|P) \right).$
Remark 1. Restricting to measures $Q$ equivalent to $P$ ensures that the relative entropy can remain finite, since any singular part would make $H(Q|P)$ infinite on a set of positive $P$-measure. Also, if $Q$ ignored events that $P$ deems possible, the entropic penalty term would lose its economic meaning by excluding potentially significant outcomes.
Remark 2. In the above definition of $\rho_\gamma$, the parameter $\gamma$ can be viewed as a risk-aversion level or scaling factor, much like in exponential-utility frameworks, where larger γ corresponds to greater risk aversion. Thus, $\rho_\gamma$ can be seen as an “entropic” or “exponential-penalized” valuation: the smaller the γ, the larger the penalty factor $1/\gamma$, and hence the stronger the penalty on deviating from the reference measure $P$.
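The role of γ can be illustrated numerically. The sketch below uses the equivalent exponential form of the entropic risk (discussed in Section 4) on an assumed three-state space; the limiting behaviors shown are the standard ones, with small γ collapsing toward the plain real-world expectation and large γ approaching the worst case:

```python
import numpy as np

p = np.array([0.25, 0.25, 0.5])
x = np.array([-2.0, 1.0, 3.0])

def entropic_risk(z, p, gamma):
    # Equivalent exponential form of the entropic risk (cf. Section 4).
    return float(np.log(np.sum(p * np.exp(-gamma * z))) / gamma)

# Small gamma: penalty factor 1/gamma is huge, risk approaches -E_P[X] = -1.25.
print(entropic_risk(x, p, 1e-4))
# Large gamma: penalty vanishes, risk approaches the worst case max(-X) = 2.0.
print(entropic_risk(x, p, 100.0))
```

The first value is close to $-E_P[X]$ and the second close to $\max(-X)$, matching the interpretation of γ as a risk-aversion level.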
Note that the existence of these risk measures is not immediately clear. In particular, the existence of a minimal-entropy σ-martingale measure can still depend on the properties of S. For instance, if the market is not arbitrage-free (e.g., the no free lunch with vanishing risk condition fails) and S represents the price process of a tradable asset, then no equivalent σ-martingale measure can exist. Even if the market does satisfy NFLVR, one may still require additional conditions on the price processes (e.g., local boundedness or boundedness from below) to ensure that there is some equivalent σ-martingale measure carrying finite relative entropy. For further details, see Delbaen and Schachermayer (1994, 1998, 2006).
Example 1. One classic example of a complete market is given via the Black–Scholes setup (Black and Scholes 1973). In this model, the asset price typically follows

$dS_t = \mu S_t \, dt + \sigma S_t \, dW_t$

for constants μ and σ > 0, where W is a Brownian motion under the real-world probability. After discounting by the risk-free rate r, one finds a unique equivalent martingale measure, $Q$, that replaces μ with r in the drift term. The market is, therefore, complete, and any bounded contingent claim can be perfectly replicated. Since there is exactly one equivalent martingale measure, pricing is straightforward: every claim has a unique fair value given by its discounted expectation under $Q$. In this case, the σ-martingale condition is also trivially satisfied by the unique risk-neutral measure; hence, minimizing any functional over the set of equivalent σ-martingale measures offers no ambiguity (there is only one element in that set). Consequently, the minimal-entropy measure (cf. Definition 4) coincides with the unique martingale measure, and the resulting minimal-entropy risk measure effectively matches the usual Black–Scholes pricing rule.
By contrast, many models for asset prices are incomplete. As a specific example, consider the Heston stochastic volatility model (Heston 1993). There, one typically assumes

$dS_t = \mu S_t \, dt + \sqrt{v_t} \, S_t \, dW_t^S, \qquad dv_t = \kappa (\theta - v_t) \, dt + \xi \sqrt{v_t} \, dW_t^v,$

where $v_t$ represents the instantaneous variance process, and $W^S$ and $W^v$ are correlated Brownian motions under the real-world probability. Unlike Black–Scholes, this model does not admit a perfect hedge for arbitrary claims, so there can be multiple local (or σ-)martingale measures equivalent to the real-world probability. In other words, the market is incomplete, so one cannot replicate every payoff. Different choices of the market price of volatility risk and jump risk (if extended) can lead to families of risk-neutral measures, all of which ensure no arbitrage but yield different pricing implications for claims.
In such an incomplete setting, the minimal-entropy σ-martingale measure (from Definition 4) provides a systematic way to single out a preferred measure among the many. By penalizing deviations from the real-world probability through the relative entropy functional, this approach selects the measure that is “closest” (in Kullback–Leibler divergence) to the original distribution. Hence, the resulting risk measure assigns a unique fair price (or risk assessment) to each claim. For more details, see Biagini et al. (2000); Boguslavskaya and Muravey (2016); Hull and White (1987); Pham (2001); Sircar and Zariphopoulou (2004); Sohns (2022); Wiggins (1987).
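A minimal sketch of this selection principle in a toy incomplete market (a one-period trinomial model; all prices and probabilities are assumptions). It uses the well-known Gibbs/exponential-tilting form of entropy minimizers under a linear constraint and solves for the tilting parameter by bisection so that the martingale condition holds:

```python
import numpy as np

# Hypothetical one-period trinomial market (all values are assumptions).
s0 = 100.0
s1 = np.array([80.0, 100.0, 130.0])   # discounted prices in the three states
p  = np.array([0.2, 0.5, 0.3])        # real-world probabilities

def entropy_tilt(lmbda):
    """Gibbs-type candidate q_i ~ p_i * exp(lmbda * s1_i): the known form of
    the relative-entropy minimizer under a linear (martingale) constraint."""
    w = p * np.exp(lmbda * (s1 - s0))  # shifted by s0 for numerical stability
    return w / w.sum()

def martingale_gap(lmbda):
    return float(entropy_tilt(lmbda) @ s1 - s0)

# Bisection on lambda: exponential tilting makes E_Q[S1] increasing in lambda.
lo, hi = -1.0, 1.0
for _ in range(200):
    mid = 0.5 * (lo + hi)
    if martingale_gap(mid) > 0:
        hi = mid
    else:
        lo = mid
q = entropy_tilt(0.5 * (lo + hi))
print(q, float(q @ s1))  # q is an equivalent martingale measure: E_Q[S1] = 100
```

Among the one-parameter family of martingale measures in this toy market, the exponentially tilted one is the entropy minimizer, which is exactly the measure the minimal-entropy approach singles out.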
Unlike the minimal-entropy σ-martingale measure, the entropic risk always exists.
Theorem 1. The entropic risk always exists.
Proof. It suffices to show that $\rho_\gamma(X)$ is indeed a finite number.
Let $M := \|X\|_\infty$. It is straightforward to verify that

$E_Q[-X] - \frac{1}{\gamma} H(Q|P) \le E_Q[-X] \le M$ for every admissible $Q$,

and

$\rho_\gamma(X) \ge E_P[-X] - \frac{1}{\gamma} H(P|P) = E_P[-X] \ge -M,$

because X is almost surely bounded and $H(Q|P) \ge 0$. We conclude, for a fixed $\gamma > 0$:

$-M \le \rho_\gamma(X) \le M < \infty.$
□
While the minimal-entropy σ-martingale measure is new, the notion of the minimal-entropy martingale measure has been extensively studied in the literature. The definition of the latter is analogous to ours, but instead of equivalent σ-martingale measures, one minimizes the entropy among all equivalent martingale measures (see, for example, Grandits and Rheinländer (2002); Lee and Rheinländer (2013); Miyahara (1999); Rheinländer and Steiger (2006); Schweizer (2010)). There are numerous criteria in the literature for the existence of the minimal-entropy martingale measure. Most of these criteria impose strong conditions on the stochastic process S. A prominent example is when S is a Lévy process (Fujiwara 2004, Theorem 3.1). Another simple result is the existence of the minimal-entropy martingale measure if S is bounded, as shown in (Frittelli 2000, Theorem 2.1). Other results depend on the specific form of the Radon–Nikodym derivative (Grandits and Rheinländer 2002) or are applicable only in discrete-time settings (Föllmer and Schied 2011, Corollary 3.27).
Things are a bit easier for the minimal-entropy σ-martingale measure. We now state and prove a theorem showing that, under a mild condition, namely the existence of at least one σ-martingale measure $Q$ with finite $H(Q|P)$, one obtains the existence (and uniqueness) of a minimal-entropy σ-martingale measure. This holds even if S is not locally bounded and even if no equivalent local martingale measure exists.
For the proof, we need some lemmas.
Lemma 1. Let $\mathcal{P}$ be the set of probability measures on $(\Omega, \mathcal{F})$. We have the following:
- (a) $H(Q|P) \ge 0$ for all $Q \in \mathcal{P}$. Furthermore, $H(Q|P) = 0$ if and only if $Q = P$.
- (b) The mapping $Q \mapsto H(Q|P)$ is convex and strictly convex on the set of probability measures that are absolutely continuous with respect to $P$.
Proof. We have the following:

$H(Q|P) = E_P\left[ f\left( \frac{dQ}{dP} \right) \right] \quad \text{for } Q \ll P. \quad (2)$

Hence, it is reasonable to look at the function $f(x) := x \log x$ for $x > 0$, which we define as a continuous extension to $[0, \infty)$ by setting $f(0) := 0$. Because of $f''(x) = 1/x > 0$ and Theorem A1, we have strict convexity of $f$ for $x > 0$, which proves the strict convexity of $f$.
It is easy to see that the function $f$ takes its minimum at $x = 1/e$. Hence, we have $f(x) \ge -1/e$, and thus, the expectation in (2) is well defined.
- (a) According to Equation (2) and Jensen's inequality, we have, for $Q \ll P$, the following:

$H(Q|P) = E_P\left[ f\left( \frac{dQ}{dP} \right) \right] \ge f\left( E_P\left[ \frac{dQ}{dP} \right] \right) = f(1) = 0.$

Now, let $H(Q|P) = 0$. We must show that the equality $Q = P$ follows. We have the following: since $f$ is strictly convex, Jensen's inequality holds with equality only if the density is constant. Hence, $P$-almost everywhere, we have $\frac{dQ}{dP} = 1$. Now, it follows that $Q = P$, $P$-almost surely. If $Q$ is not absolutely continuous with respect to $P$, the statement is obvious.
- (b) The statement is obvious for probability measures that are not absolutely continuous with respect to $P$. So, we focus on the case where $Q_1, Q_2 \ll P$. For $Q_1 \ne Q_2$ with $\lambda \in (0, 1)$, we have the following:

$H(\lambda Q_1 + (1-\lambda) Q_2 \,|\, P) = E_P\left[ f\left( \lambda \frac{dQ_1}{dP} + (1-\lambda) \frac{dQ_2}{dP} \right) \right] < \lambda H(Q_1|P) + (1-\lambda) H(Q_2|P).$

Thus, strict convexity follows. □
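The (strict) convexity in part (b) is easy to observe numerically on a finite space. The sketch below (with arbitrary assumed distributions) checks the convexity inequality for a 50/50 mixture:

```python
import numpy as np

def relative_entropy(q, p):
    """Discrete relative entropy H(Q|P) = sum q_i log(q_i / p_i)."""
    q, p = np.asarray(q, float), np.asarray(p, float)
    mask = q > 0
    return float(np.sum(q[mask] * np.log(q[mask] / p[mask])))

p  = np.array([0.4, 0.4, 0.2])
q1 = np.array([0.1, 0.6, 0.3])
q2 = np.array([0.7, 0.1, 0.2])

lam = 0.5
mix = lam * q1 + (1 - lam) * q2
lhs = relative_entropy(mix, p)
rhs = lam * relative_entropy(q1, p) + (1 - lam) * relative_entropy(q2, p)
print(lhs < rhs)  # strict inequality, since q1 != q2
```

The strict inequality reflects the strict convexity of $x \log x$, which is what drives the uniqueness of the entropy minimizer later in this section.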
Lemma 2. Suppose there exists at least one EσMM with finite entropy, and suppose there is a measure, $Q^* \ll P$, under which S is a σ-martingale, with

$H(Q^*|P) = \inf\{ H(Q|P) : Q \ll P \text{ and } S \text{ is a } \sigma\text{-martingale under } Q \} < \infty.$

Then, $Q^*$ is the unique minimal-entropy σ-martingale measure.
Proof. Define the set

$\mathcal{C} := \{ Q \ll P : S \text{ is a } \sigma\text{-martingale under } Q \text{ and } H(Q|P) < \infty \}.$

By assumption, $\mathcal{C}$ is nonempty because $Q^* \in \mathcal{C}$. Thus, we have

$\inf_{Q \in \mathcal{C}} H(Q|P) = H(Q^*|P) < \infty.$

Next, we show that $\mathcal{C}$ is convex and closed in total variation:
For any $Q_1, Q_2 \in \mathcal{C}$, their convex combination $\lambda Q_1 + (1-\lambda) Q_2$, $\lambda \in [0,1]$, is also in $\mathcal{C}$. Indeed, convex combinations preserve both the finite-entropy condition (due to convexity of relative entropy) and the σ-martingale property, as σ-martingales form a vector space.
The set of absolutely continuous σ-martingale measures can be expressed through linear constraints on $Q$ (the martingale conditions on each localized piece). Sets defined via linear constraints of this form are closed in the total variation topology. Intersecting with the closed set of measures having finite entropy preserves this closedness.
By applying Csiszár’s Theorem A6, we conclude that the set $\mathcal{C}$ has a unique I-projection $\hat{Q}$ of $P$, which satisfies

$H(\hat{Q}|P) = \min_{Q \in \mathcal{C}} H(Q|P).$

By hypothesis, $Q^*$ already satisfies this minimality, so it must coincide with $\hat{Q}$. Thus, uniqueness follows directly from the strict convexity of relative entropy.
Finally, we verify that $Q^*$ is equivalent to $P$. Suppose this is not the case. Then, there exists a $Q \in \mathcal{C}$, which is equivalent to $P$, as given by hypothesis. If $Q^*$ is not equivalent to $P$, its support is strictly smaller than that of $P$, which contradicts minimality via the Csiszár–Gibbs inequality (Theorem A7). Hence, $Q^* \sim P$.
Thus, $Q^*$ is a σ-martingale measure, is equivalent to $P$, and minimizes entropy. Therefore, it is the unique minimal-entropy σ-martingale measure. □
In the upcoming proof of Theorem 2 (the existence of the minimal-entropy σ-martingale measure), we construct a sequence in the class of absolutely continuous σ-martingale measures (each having finite entropy). We then take appropriate convex combinations (or a subsequence) whose Radon–Nikodým derivatives converge in $L^1(P)$. A priori, it is not obvious that the limit measure remains a σ-martingale measure. The following Lemma 3 guarantees exactly this: the limiting measure is still a σ-martingale measure.
Lemma 3. Let $(\Omega, \mathcal{F}, P)$ be a probability space, and let $Q_n, Q$ be probability measures on $(\Omega, \mathcal{F})$ with $Q_n \ll P$ for each n and

$\frac{dQ_n}{dP} \to \frac{dQ}{dP} \quad \text{in } L^1(P).$

Suppose X is a semimartingale that is a σ-martingale under each measure $Q_n$. Then, X is also a σ-martingale under $Q$.
Proof. Since X is a σ-martingale under each measure $Q_n$, there exists, for every n, a suitable family of predictable sets making each localized piece of X a uniformly integrable martingale under $Q_n$. By reindexing or combining these families appropriately, we can select a single sequence of predictable sets $(D_k)$ for which $\mathbf{1}_{D_k} \cdot X$ is a uniformly integrable martingale simultaneously under every measure $Q_n$.
To show that $\mathbf{1}_{D_k} \cdot X$ remains a uniformly integrable martingale under $Q$, fix arbitrary times $s \le t$ and an arbitrary set, $A \in \mathcal{F}_s$. Define the bounded random variable

$Z := \mathbf{1}_A \left( (\mathbf{1}_{D_k} \cdot X)_t - (\mathbf{1}_{D_k} \cdot X)_s \right).$

Since $\mathbf{1}_{D_k} \cdot X$ is a martingale under each $Q_n$, we immediately have $E_{Q_n}[Z] = 0$ for every n. Using the convergence of the Radon–Nikodým derivatives in $L^1(P)$ and the boundedness of Z, we get

$E_Q[Z] = \lim_{n \to \infty} E_{Q_n}[Z] = 0.$

This equality implies that the conditional expectation of $(\mathbf{1}_{D_k} \cdot X)_t$, given $\mathcal{F}_s$, under $Q$ equals $(\mathbf{1}_{D_k} \cdot X)_s$, establishing the martingale property for each localized piece. Since this argument applies to each predictable set, $D_k$, it follows that X is a σ-martingale under $Q$. □
In a discrete-time market setting, the σ-martingale condition reduces to the usual martingale requirement at each time step. Then, the statement of the lemma is more transparent: if each $Q_n$ makes X a martingale, and if $\frac{dQ_n}{dP} \to \frac{dQ}{dP}$ in $L^1(P)$, then X remains a martingale under $Q$. The “unified localizing sets” in discrete time simply become the entire time index set. The continuous-time version is conceptually the same but requires working with a single predictable sequence, $(D_k)$, across all $Q_n$ for the localization.
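The discrete-time special case can be checked directly. In the sketch below (a hypothetical one-step market, all values assumed), two measures each make X a martingale, and every convex combination of them, and hence any L1 limit of such combinations, preserves the martingale constraint, because the constraint is linear in the measure:

```python
import numpy as np

# Hypothetical one-step market: X0 = 10, terminal values in four states.
x1 = np.array([6.0, 8.0, 12.0, 14.0])
x0 = 10.0

# Two measures, each making X a martingale (E_Qn[X1] = X0).
q1 = np.array([0.25, 0.25, 0.25, 0.25])
q2 = np.array([0.5, 0.0, 0.0, 0.5])
assert abs(float(q1 @ x1) - x0) < 1e-12
assert abs(float(q2 @ x1) - x0) < 1e-12

# Convex combinations keep the martingale property, since E_Q[X1] = X0
# is a linear constraint in Q.
for lam in np.linspace(0.0, 1.0, 11):
    q = lam * q1 + (1 - lam) * q2
    assert abs(float(q @ x1) - x0) < 1e-12
print("martingale property preserved under mixtures")
```

This is the finite-dimensional shadow of Lemma 3: linear constraints survive convex combinations and L1 limits of densities.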
Theorem 2. Suppose there exists at least one measure, $Q \in \mathcal{M}_\sigma$, with finite entropy, $H(Q|P) < \infty$. Then, the unique minimal-entropy σ-martingale measure exists.
Proof. We define

$c := \inf\{ H(Q|P) : Q \ll P \text{ and } S \text{ is a } \sigma\text{-martingale under } Q \} < \infty. \quad (3)$

By assumption, there exists a sequence, $(Q_n)$, of such measures whose sequence of entropies $(H(Q_n|P))_n$ is decreasing and satisfies

$\lim_{n \to \infty} H(Q_n|P) = c.$

Because the sequence of the relative entropies is monotone, we obtain, for the convex function $f(x) = x \log x$,

$\sup_n E_P\left[ f\left( \frac{dQ_n}{dP} \right) \right] < \infty,$

and, therefore, the sequence $\left( \frac{dQ_n}{dP} \right)_n$ is uniformly integrable according to Theorem A3. Per Theorem A4, there exists a subsequence, $\left( \frac{dQ_{n_k}}{dP} \right)_k$, which converges weakly in $L^1(P)$. However, according to Mazur’s lemma (Theorem A5), there exists a sequence, $(Z_m)$, consisting of convex combinations

$Z_m = \sum_k \lambda_k^{(m)} \frac{dQ_{n_k}}{dP}, \qquad \lambda_k^{(m)} \ge 0, \quad \sum_k \lambda_k^{(m)} = 1,$

which converges in $L^1(P)$ to some $Z_\infty$.
The measure $Q_E$ is uniquely defined if we interpret $Z_\infty$ as a Radon–Nikodym derivative $\frac{dQ_E}{dP}$. One can see directly that $Q_E \ll P$. By assumption, S is a σ-martingale with respect to all $Q_n$ (and hence also with reference to all measures $\tilde{Q}_m$ defined by $\frac{d\tilde{Q}_m}{dP} = Z_m$). Now, we want to show that it is also a σ-martingale under $Q_E$. To prove this, let $(D_j)$ be a common sequence of predictable sets, such that $\mathbf{1}_{D_j} \cdot S$ is a martingale under each $Q_{n_k}$. For each bounded test variable as in Lemma 3, the expectation under $\tilde{Q}_m$ is the corresponding convex combination of the expectations under the $Q_{n_k}$, because $Z_m$ is a convex combination of their densities. Thus, $\mathbf{1}_{D_j} \cdot S$ is also a martingale under $\tilde{Q}_m$; hence, S is a σ-martingale under $\tilde{Q}_m$.
Moreover, $\frac{d\tilde{Q}_m}{dP} = Z_m$ converges in $L^1(P)$ to $\frac{dQ_E}{dP}$, and so, per Lemma 3, we have that S is a σ-martingale under $Q_E$.
Because of the convexity of the relative entropy (Lemma 1), Equation (3), and since $(H(Q_n|P))_n$ is a decreasing sequence, we have

$H(\tilde{Q}_m|P) \le \sum_k \lambda_k^{(m)} H(Q_{n_k}|P) \le H(Q_{n_1}|P).$

Thus, by applying Fatou’s Lemma, we obtain

$H(Q_E|P) \le \liminf_{m \to \infty} H(\tilde{Q}_m|P) \le c.$

Since $(H(Q_n|P))_n$ is decreasing, it follows that $H(Q_E|P)$ is already minimal. Now, all conditions of Lemma 2 are satisfied, so we conclude that $Q_E$ is equivalent to $P$, proving the existence of the minimal-entropy σ-martingale measure.
Uniqueness follows from Lemma 1 and the strict convexity of $H(\cdot|P)$. □
Remark 3. Theorem 2 shows that the “entropy-minimizing” approach extends smoothly to σ-martingale models: as soon as there is one finite-entropy σ-martingale measure, there must be a unique one of minimal entropy, even if S is unbounded or not locally bounded. This measure can be used for entropy-based valuation, risk measurement, or other applications. The local martingale approach, in contrast, might be impossible if no equivalent local martingale measure exists.
3. Convex and Coherent Risk Measures
In this section, we define the notions of monetary, convex, and coherent risk measures. We also highlight the relationships among translation invariance, monotonicity, convexity, positive homogeneity, and subadditivity.
Definition 5. Let $L^\infty$ be the linear space of bounded, real-valued random variables.
- (a) A mapping, $\rho : L^\infty \to \mathbb{R}$, is called a monetary risk measure if it satisfies, for all $X, Y \in L^\infty$:
Monotonicity: if $X \le Y$ almost surely, then $\rho(X) \ge \rho(Y)$.
Translation invariance: for all $m \in \mathbb{R}$, $\rho(X + m) = \rho(X) - m$.
- (b) The monetary risk measure ρ is called convex if it also satisfies
Convexity: for all $X, Y \in L^\infty$ and $\lambda \in [0, 1]$,

$\rho(\lambda X + (1 - \lambda) Y) \le \lambda \rho(X) + (1 - \lambda) \rho(Y).$

- (c) A convex risk measure, ρ, is called coherent if, in addition to monotonicity and translation invariance, it satisfies
Positive homogeneity: for all $\lambda \ge 0$, $\rho(\lambda X) = \lambda \rho(X)$.
Subadditivity: for all $X, Y \in L^\infty$, $\rho(X + Y) \le \rho(X) + \rho(Y)$.
Theorem 3. Let ρ be a monetary risk measure on a linear space of random variables $L^\infty$.
- (a) We say that ρ is normalized if $\rho(0) = 0$. In particular, if ρ is positively homogeneous, then $\rho(0) = 0$ follows automatically.
- (b) Suppose ρ satisfies translation invariance and monotonicity (as in Definition 5). Then, any two of the following three properties imply the remaining third:
Convexity;
Positive homogeneity;
Subadditivity.
Theorem 4. The minimal-entropy risk measure and the entropic risk are both convex risk measures. The minimal-entropy risk measure is also a coherent risk measure.
Proof. We start with the minimal-entropy risk measure. It suffices to prove coherence, since positive homogeneity and subadditivity imply convexity (Theorem 3). Because $\rho_E(X) = E_{Q_E}[-X]$ is linear in X, monotonicity, translation invariance, positive homogeneity, and subadditivity all follow directly from the corresponding properties of the expectation, using also that $Q_E$ is a probability measure equivalent to $P$.
We now address the entropic risk.
Monotonicity. From $X \le Y$, almost surely, $E_Q[-X] \ge E_Q[-Y]$ follows for all $Q$, and hence

$E_Q[-X] - \frac{1}{\gamma} H(Q|P) \ge E_Q[-Y] - \frac{1}{\gamma} H(Q|P)$ for all $Q$.

Thus, $\rho_\gamma(X) \ge \rho_\gamma(Y)$.
Cash translability. With

$E_Q[-(X + m)] = E_Q[-X] - m,$

we obtain $\rho_\gamma(X + m) = \rho_\gamma(X) - m$.
Convexity. Since, for general functions $f$ and $g$, we have

$\sup_x \left( f(x) + g(x) \right) \le \sup_x f(x) + \sup_x g(x),$

the result follows by applying this bound to the convex combination of the objective functions in the supremum.
□
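Monotonicity and cash translability of the entropic risk can be spot-checked numerically. The sketch below uses the equivalent exponential formula (cf. Section 4) on an assumed three-state space:

```python
import numpy as np

p = np.array([0.3, 0.4, 0.3])
x = np.array([-1.0, 0.0, 2.0])
y = x + np.array([0.5, 1.0, 0.0])   # y >= x pointwise
gamma = 2.0

def entropic_risk(z, p, gamma):
    """Entropic risk via the exponential formula (1/gamma) * log E_P[exp(-gamma Z)]."""
    return float(np.log(np.sum(p * np.exp(-gamma * z))) / gamma)

# Monotonicity: a pointwise larger payoff carries less risk.
print(entropic_risk(y, p, gamma) <= entropic_risk(x, p, gamma))   # True
# Cash translability: adding a sure amount m reduces the risk by m.
m = 1.5
print(np.isclose(entropic_risk(x + m, p, gamma),
                 entropic_risk(x, p, gamma) - m))                  # True
```

Both properties hold exactly (up to floating-point error), matching the proof above.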
In general, the entropic risk is not a coherent risk measure, even though it is additive for independent positions. However, there is also a coherent version of the entropic risk measure, described in Föllmer and Knispel (2011), which is quite similar to the two risk measures we are examining here.
The coherence of a risk measure is particularly significant in the context of financial regulation. Regulators, such as the Basel Committee on Banking Supervision, emphasize capital requirements that are consistent with diversification benefits and penalize concentration risk (Basel Committee on Banking Supervision 2010). In particular, the subadditivity property (cf. Definition 5 and the theorem establishing that $\rho_E$ is coherent) ensures that merging two portfolios X and Y never results in a higher combined capital charge than the sum of their individual charges. This requirement aligns with Basel III’s core principle that holistic risk management should not be discouraged by artificially additive capital rules.
When we specialize to the minimal-entropy risk measure $\rho_E$ introduced in Section 2 and shown in Section 3 to be coherent, we see that it satisfies precisely these subadditivity and positive-homogeneity conditions. Concretely, if a bank adopts $\rho_E$ to quantify its market or counterparty exposures, then merging business units or pooling risk factors would not inflate the aggregate capital requirement beyond the sum of stand-alone capital allocations. This feature could be advantageous in an internal models approach to Basel III capital calculation, provided that the bank can justify the assumptions on market completeness or the existence of σ-martingale measures with finite entropy (see Theorem 2). Theoretically, replacing non-coherent measures (such as certain forms of value at risk) with a coherent measure like $\rho_E$ better captures the true diversification effect.
Nonetheless, adopting a coherent measure in practice also raises considerations:
Model complexity. Calculating $\rho_E$ may require advanced numerical methods to identify or approximate the minimal-entropy σ-martingale measure.
Data intensity. Banks must maintain high-quality market data and stress scenarios to ensure robust parameter estimation under $P$.
Regulatory validation. Any internal model, including entropy-based approaches, must pass supervisory review, which entails transparency on modeling assumptions and backtesting (Basel Committee on Banking Supervision 2010).
Finally, to better understand these risk measures, we explore their relationship with the real-world probability measure. The following lemma will be helpful.
Lemma 4. Let $Q_1, Q_2$ be absolutely continuous σ-martingale measures, i.e., each of $Q_1, Q_2$ is absolutely continuous with respect to $P$ and makes S a σ-martingale, and assume that $H(Q_1|P)$ and $H(Q_2|P)$ are both finite. Then, for any bounded random variable, $X$, we have

$\left| E_{Q_1}[X] - E_{Q_2}[X] \right| \le \|X\|_\infty \left( \sqrt{2 H(Q_1|P)} + \sqrt{2 H(Q_2|P)} \right). \quad (4)$

Proof. First, note that, for any probability measures $Q_1, Q_2$ (absolutely continuous w.r.t. P),

$\left| E_{Q_1}[X] - E_{Q_2}[X] \right| \le 2 \|X\|_\infty \, d_{TV}(Q_1, Q_2),$

where $d_{TV}$ denotes the total variation distance:

$d_{TV}(Q_1, Q_2) := \sup_{A \in \mathcal{F}} \left| Q_1(A) - Q_2(A) \right|.$

Next, according to the triangle inequality in total variation,

$d_{TV}(Q_1, Q_2) \le d_{TV}(Q_1, P) + d_{TV}(P, Q_2).$

Finally, we invoke the Pinsker inequality (Theorem A8) and obtain

$d_{TV}(Q_i, P) \le \sqrt{\tfrac{1}{2} H(Q_i|P)}$

whenever $H(Q_1|P)$ (resp. $H(Q_2|P)$) is finite. Hence,

$d_{TV}(Q_1, Q_2) \le \sqrt{\tfrac{1}{2} H(Q_1|P)} + \sqrt{\tfrac{1}{2} H(Q_2|P)}.$

Putting these pieces together yields

$\left| E_{Q_1}[X] - E_{Q_2}[X] \right| \le 2 \|X\|_\infty \left( \sqrt{\tfrac{1}{2} H(Q_1|P)} + \sqrt{\tfrac{1}{2} H(Q_2|P)} \right) = \|X\|_\infty \left( \sqrt{2 H(Q_1|P)} + \sqrt{2 H(Q_2|P)} \right).$

That completes the proof. □
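The bound of Lemma 4 can be sanity-checked on a finite space. The sketch below (assumed distributions and payoff) compares the expectation gap with the Pinsker-based bound $\|X\|_\infty (\sqrt{2 H(Q_1|P)} + \sqrt{2 H(Q_2|P)})$:

```python
import numpy as np

def relative_entropy(q, p):
    """Discrete relative entropy H(Q|P)."""
    mask = q > 0
    return float(np.sum(q[mask] * np.log(q[mask] / p[mask])))

p  = np.array([0.4, 0.3, 0.3])
q1 = np.array([0.5, 0.3, 0.2])
q2 = np.array([0.2, 0.4, 0.4])
x  = np.array([-1.0, 0.5, 1.0])

lhs = abs(float(q1 @ x - q2 @ x))
bound = float(np.max(np.abs(x))) * (np.sqrt(2 * relative_entropy(q1, p))
                                    + np.sqrt(2 * relative_entropy(q2, p)))
print(lhs <= bound)  # True
```

The bound is not tight, but it only involves the two entropies and the sup-norm of the payoff, which is exactly what makes it useful for the estimates that follow.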
Remark 4. Inequality (4) shows that two absolutely continuous σ-martingale measures $Q_1$ and $Q_2$ with small relative entropy yield similar expectations for any bounded payoff, X. In particular, a measure $Q$ with small entropy $H(Q|P)$ is close to $P$ in total variation distance.

Theorem 5. Let $X \in L^\infty$ be bounded and $\gamma > 0$. Then, the entropic risk

$\rho_\gamma(X) = \sup_{Q \sim P} \left( E_Q[-X] - \tfrac{1}{\gamma} H(Q|P) \right)$

satisfies the following two-sided bound:

$E_P[-X] \le \rho_\gamma(X) \le E_P[-X] + \frac{\gamma \|X\|_\infty^2}{2}. \quad (5)$

Proof. By taking the specific choice $Q = P$ in the definition of $\rho_\gamma$, we see

$\rho_\gamma(X) \ge E_P[-X] - \tfrac{1}{\gamma} H(P|P) = E_P[-X],$

since $H(P|P) = 0$. This proves the left inequality in (5).
For any probability measure, $Q$, with finite entropy $H(Q|P)$, we use the total variation bound from Lemma 4 (with $Q_2 = P$) to obtain

$\left| E_Q[-X] - E_P[-X] \right| \le \|X\|_\infty \sqrt{2 H(Q|P)}.$

Hence,

$E_Q[-X] \le E_P[-X] + \|X\|_\infty \sqrt{2 H(Q|P)}.$

Therefore, for any such $Q$,

$E_Q[-X] - \tfrac{1}{\gamma} H(Q|P) \le E_P[-X] + \|X\|_\infty \sqrt{2 H(Q|P)} - \tfrac{1}{\gamma} H(Q|P).$

Define $h := H(Q|P)$. Then, we want to maximize

$g(h) := \|X\|_\infty \sqrt{2h} - \frac{h}{\gamma}$

over $h \ge 0$. A short calculus argument shows that the maximum occurs at $h^* = \gamma^2 \|X\|_\infty^2 / 2$ and that the maximum value is $g(h^*) = \gamma \|X\|_\infty^2 / 2$. Consequently,

$E_Q[-X] - \tfrac{1}{\gamma} H(Q|P) \le E_P[-X] + \frac{\gamma \|X\|_\infty^2}{2}.$

Hence, this holds for all $Q$ with $H(Q|P) < \infty$. Taking the supremum over all $Q$ leads to

$\rho_\gamma(X) \le E_P[-X] + \frac{\gamma \|X\|_\infty^2}{2}.$

This completes the proof of (5). □
Corollary 1. Let $Q_E$ be the minimal-entropy σ-martingale measure for a (bounded) price process S, and assume $H(Q_E|P) < \infty$. Then,

$\rho_\gamma(X) \ge E_{Q_E}[-X] - \frac{1}{\gamma} H(Q_E|P). \quad (7)$

Furthermore, combining this estimate with the two-sided bound of Theorem 5, we have the following:

$\max\left( E_P[-X],\; E_{Q_E}[-X] - \tfrac{1}{\gamma} H(Q_E|P) \right) \le \rho_\gamma(X) \le E_P[-X] + \frac{\gamma \|X\|_\infty^2}{2}. \quad (8)$

Proof. Since $\rho_\gamma(X)$ is defined as the supremum

$\rho_\gamma(X) = \sup_{Q} \left( E_Q[-X] - \tfrac{1}{\gamma} H(Q|P) \right),$

it clearly holds that, for any $Q$ in that admissible set,

$\rho_\gamma(X) \ge E_Q[-X] - \tfrac{1}{\gamma} H(Q|P).$

If $Q_E$ (the minimal-entropy σ-martingale measure) is in the admissible set (i.e., $Q_E \sim P$ and $H(Q_E|P) < \infty$), then simply take $Q = Q_E$ in the supremum. This yields the lower bound (7).
Finally, to deduce (8), observe that $\rho_\gamma(X) \ge E_P[-X]$ also holds by choosing $Q = P$. Then, Theorem 5 provides the upper bound, so taking the maximum of these lower estimates proves (8). □
Remark 5. The assumption that $H(Q_E|P) < \infty$ requires that the minimal-entropy measure be absolutely continuous with respect to $P$ with finite relative entropy. If, for instance, the market admits no arbitrage of the first kind (NA1), and there exists a finite-entropy EσMM, one can show $H(Q_E|P) < \infty$. In that case, (7) provides a non-trivial lower estimate on the entropic risk in terms of the minimal-entropy measure.

Remark 6. When the risk-aversion parameter γ tends to zero, the penalty factor $1/\gamma$ in the entropic risk measure

$\rho_\gamma(X) = \sup_{Q \sim P} \left( E_Q[-X] - \tfrac{1}{\gamma} H(Q|P) \right)$

becomes very large. As a result, any measure with $H(Q|P) > 0$ gets heavily penalized in the objective, so that the supremum is forced increasingly toward $P$. Formally, it can be shown that

$\lim_{\gamma \to 0} \rho_\gamma(X) = E_P[-X].$

Economically, this reflects a situation of ultra-strong aversion to deviating from the reference measure $P$. In the limit $\gamma \to 0$, the only measure not incurring an unbounded penalty is $P$ itself. Thus, the entropic risk measure collapses to $E_P[-X]$, the plain (negative) expectation under the real-world probability $P$.

4. Duality
The usual approach to defining the entropic risk measure is as follows:
and its robust representation, Equation (
1), is typically derived using dual representation theorems.
In this paper, we defined the entropic risk using its robust representation and now demonstrate that this definition is equivalent to the traditional one found in the literature. Since the convexity of Equation (
9) has not been explicitly shown here, we present an alternative proof.
Theorem 6. For and , we have the following: Proof. Define a new probability measure,
, as
Then,
Substituting this into the optimization problem yields
where the last equality follows from
and Lemma 1.
Taking the logarithm and dividing by yields the result. □
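Theorem 6 can be sanity-checked numerically on a finite probability space: the supremum in the robust representation is attained at the Gibbs change of measure dQ*/dP = e^{−γX}/E_P[e^{−γX}], and plugging Q* in recovers (1/γ) log E_P[e^{−γX}]. The following sketch uses illustrative values for P, X, and γ (none taken from the paper):

```python
import math
import random

# Finite probability space: reference measure P and a position X (illustrative values)
p = [0.2, 0.3, 0.4, 0.1]
x = [1.0, -0.5, 2.0, -3.0]
gamma = 0.7

# Traditional definition: rho_gamma(X) = (1/gamma) * log E_P[exp(-gamma X)]
mgf = sum(pi * math.exp(-gamma * xi) for pi, xi in zip(p, x))
rho_traditional = math.log(mgf) / gamma

def robust_value(q):
    """E_Q[-X] - (1/gamma) * H(Q|P) for a measure Q on the same space."""
    e_q = sum(qi * (-xi) for qi, xi in zip(q, x))
    entropy = sum(qi * math.log(qi / pi) for qi, pi in zip(q, p) if qi > 0)
    return e_q - entropy / gamma

# Gibbs maximizer dQ*/dP = exp(-gamma * X) / E_P[exp(-gamma * X)]
q_star = [pi * math.exp(-gamma * xi) / mgf for pi, xi in zip(p, x)]
rho_robust = robust_value(q_star)

# The two definitions agree at Q* ...
assert abs(rho_traditional - rho_robust) < 1e-12

# ... and Q* dominates randomly drawn measures Q (supremum property)
random.seed(0)
for _ in range(1000):
    w = [random.random() for _ in p]
    q = [wi / sum(w) for wi in w]
    assert robust_value(q) <= rho_robust + 1e-12
```

The equality at Q* is exact rather than approximate: substituting log(dQ*/dP) = −γX − log E_P[e^{−γX}] into the penalty term cancels the expectation of X, which is the computational content of the proof above.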
A detailed analysis of the dual problem for the minimal-entropy martingale measure is more complex but provides valuable insights into its economic interpretation as an equivalent local martingale measure. The dual problem, in essence, connects minimizing relative entropy to maximizing a specific utility function. Below, we elaborate briefly on this connection.
Let
U be a utility function of the form
We aim to maximize the expected utility of the terminal value of a wealth process by selecting an optimal strategy,
, from a set of admissible strategies,
. If the terminal value is represented as
, then the objective becomes
Under weak conditions,
Delbaen et al. (
2002) show
as well as
where
is the strategy that maximizes the utility in Equation (
10).
One key insight from the discussion above and, in particular, Theorem 6 is that the entropic risk measure can be derived by optimizing an exponential-based functional, namely
In principle, this representation is quite general, as it only requires that the exponential moment
be finite. Concretely, this exponential expectation is automatically well defined for
(bounded random variables). For unbounded positions, one needs
which is a standard integrability condition often seen in exponential utility frameworks (
Delbaen et al. 2002;
Föllmer and Schied 2011).
For many distributions commonly used in finance (e.g., lognormal, normal, or other exponential-family models), the condition (
11) is satisfied for all
. Thus, the entropic risk measure is well defined across a broad range of “light-tailed” or exponentially decaying distributions.
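As an illustration of the light-tailed case, for a Gaussian position X ~ N(μ, σ²) the exponential moment is finite for every γ and the entropic risk measure has the well-known closed form ρ_γ(X) = −μ + γσ²/2. A minimal numerical check (the parameter values and the quadrature grid are illustrative implementation choices, not part of the theory):

```python
import math

mu, sigma, gamma = 0.05, 0.2, 1.5   # illustrative parameters

# Evaluate E[exp(-gamma X)] for X ~ N(mu, sigma^2) by a fine grid sum; the
# trapezoidal rule is spectrally accurate for smooth, rapidly decaying integrands.
h = sigma / 20
grid = [mu + k * h for k in range(-200, 201)]   # covers mu +/- 10 sigma

def density(t):
    return math.exp(-(t - mu) ** 2 / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

exp_moment = h * sum(density(t) * math.exp(-gamma * t) for t in grid)

rho_numeric = math.log(exp_moment) / gamma
rho_closed = -mu + gamma * sigma ** 2 / 2       # closed form in the Gaussian case

assert abs(rho_numeric - rho_closed) < 1e-9
```

The closed form makes the role of γ transparent: the entropic risk of a Gaussian position is the negative mean plus a variance penalty that grows linearly in the risk aversion.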
If X can exhibit extremely heavy tails (e.g., some Pareto-type or stable distributions), there may be values of for which . In that case, the entropic risk measure may not be well defined or may be valid only for sufficiently small . In practical models, one either truncates the tails or ensures that remains in a regime where is integrable.
From the dual perspective (cf. Equation (
1)), the supremum
remains finite under fairly general conditions. Essentially, if
X belongs to an appropriate Orlicz space or
space for which the exponential integrals exist, then the entropic risk measure is valid, regardless of the specific distribution family (
Detlefsen and Scandolo 2005;
Föllmer and Schied 2011).
Hence, the underlying probability measure is not required to come from a specific family (such as Gaussian or Lévy processes) for the exponential-based or entropy-based risk measure to work. The key assumption is that one can handle the necessary moment conditions (or integrability constraints) so that the exponentials are finite and the relative entropy functionals are well defined. Otherwise, if X exhibits tails so heavy that for every positive , the approach can fail to produce a finite value.
5. Dynamic Consistency
In this section, we present dynamic versions of the entropic risk measure and the risk measure associated with the minimal-entropy martingale measure and show that they are time-consistent. Dynamic consistency is particularly important when dealing with practical financial applications. For instance, dynamic risk measures based on entropy have been successfully applied to energy markets and stochastic volatility models, as discussed in
Swishchuk (
2007).
Definition 6.
- (a) A map, , is called a dynamic risk measure.
- (b) A dynamic risk measure is called time-consistent if, for ,
- (c) The dynamic entropic risk measure is defined as
- (d) The dynamic minimal-entropy risk measure is defined as
where is the minimal-entropy martingale measure.
It is possible to allow to be an adapted process instead of a constant, resulting in a slightly more general definition of the dynamic entropic risk measure. However, in such cases, time consistency may not be guaranteed.
For
, we define the conditional relative entropy
as
The dynamic entropic risk measure can then also be expressed in a robust representation involving this conditional relative entropy:
It is straightforward to verify that the two risk measures defined above satisfy the properties of dynamic risk measures.
Theorem 7. The dynamic entropic risk measure and the dynamic minimal-entropy risk measure are time-consistent.
Proof. We start with the entropic risk measure. We aim to show that
for
. Time consistency then follows from the intertemporal monotonicity theorem (
Föllmer and Penner 2006, Proposition 4.2).
Using the tower property of conditional expectation, we have the following:
For the minimal-entropy martingale measure, we proceed similarly:
This proves time consistency for both measures. □
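The tower-property argument in the proof can be replayed on a toy two-period model: with ρ_t(X) = (1/γ) log E[e^{−γX} | F_t], time consistency says ρ_0(X) = ρ_0(−ρ_1(X)). A sketch on a two-coin space (the probabilities, payoffs, and γ below are illustrative assumptions):

```python
import math

# Two independent coin flips; Omega = {(u,u),(u,d),(d,u),(d,d)} (illustrative model)
p_up = 0.6
gamma = 2.0
X = {('u', 'u'): 1.0, ('u', 'd'): 0.2, ('d', 'u'): -0.3, ('d', 'd'): -1.5}

def prob(omega):
    """Product measure of independent flips."""
    return math.prod(p_up if s == 'u' else 1 - p_up for s in omega)

def rho0(payoff):
    """Entropic risk at time 0: (1/gamma) * log E[exp(-gamma * payoff)]."""
    m = sum(prob(w) * math.exp(-gamma * payoff[w]) for w in payoff)
    return math.log(m) / gamma

def rho1(s1):
    """Conditional entropic risk at time 1, given the first flip s1."""
    m = (p_up * math.exp(-gamma * X[(s1, 'u')])
         + (1 - p_up) * math.exp(-gamma * X[(s1, 'd')]))
    return math.log(m) / gamma

# Recursive evaluation: rho_0(-rho_1(X)); the sign reflects that rho_t(X)
# is a capital requirement, so -rho_1(X) is treated as a time-1 position.
nested = {w: -rho1(w[0]) for w in X}
assert abs(rho0(X) - rho0(nested)) < 1e-12
```

The assertion holds exactly because exp(γ·ρ_1(X)) = E[e^{−γX} | F_1], so the outer expectation collapses via the tower property, exactly as in the proof of Theorem 7.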
In practice, many other risk measures are used. Therefore, it makes sense to compare the minimal-entropy risk measure with more commonly used approaches in the financial industry, such as value at risk (VaR) and conditional value at risk (CVaR, also called the expected shortfall). Below, we incorporate the standard definitions of VaR and CVaR, discuss their strengths and limitations, and highlight how they compare with the minimal-entropy framework, especially under incomplete markets and high volatility conditions.
The most widely used and well-known risk measure is value at risk (VaR). At a confidence level
, VaR answers the following question:
“What is the smallest amount L such that X falls below −L with probability at most 1 − α?”
Formally, one writes
Due to its relative simplicity, VaR is still widely used by banks and regulatory bodies (
Basel Committee on Banking Supervision 2016). However, VaR is not in general subadditive, so it fails to be coherent (
Artzner et al. 1999) and does not quantify the magnitude of losses in the tail region beyond the threshold.
By comparison, the minimal-entropy risk measure
introduced in
Section 2 is coherent, thus reflecting diversification benefits more accurately. Moreover,
depends on underlying price dynamics and the selection of a minimal-entropy
-martingale measure, thereby embedding consistency with no-arbitrage principles in incomplete markets, while VaR typically lacks a direct link to hedging or replication arguments.
Alongside value at risk (VaR), conditional value at risk (CVaR) is likely the second most commonly used risk measure. Often referred to as the expected shortfall, CVaR at level
considers average tail losses above the VaR threshold. One standard formula is
Basel III and the Fundamental Review of the Trading Book increasingly advocate the use of CVaR for market risk calculations, recognizing its coherent properties and superior sensitivity to extreme events (
Basel Committee on Banking Supervision 2016). However, CVaR remains distribution-based, often sidelining explicit dynamic hedging or replication considerations. By contrast,
(the minimal-entropy measure) directly incorporates no-arbitrage dynamics and penalizes large deviations from the real-world measure via the Kullback–Leibler divergence. In highly volatile or incomplete markets,
thus captures both model risk (through entropy) and market risk (through
-martingale constraints) more naturally than CVaR can, though at the cost of greater computational and data requirements.
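The non-subadditivity of VaR mentioned above can be made concrete with a standard two-asset example: two independent positions, each losing 100 with probability 4%, have VaR at the 95% level equal to 0 individually, while their sum has VaR equal to 100. The helper functions below follow the textbook definitions for finite distributions (the numbers are illustrative, not taken from the paper):

```python
import itertools

# Discrete position: loses 100 with probability 0.04, else 0 (credit-style payoff)
single = {0.0: 0.96, -100.0: 0.04}
alpha = 0.95

def var(dist, alpha):
    """VaR_alpha(X) = inf{ m : P(X + m < 0) <= 1 - alpha } for a finite distribution."""
    candidates = sorted({-x for x in dist} | {0.0})
    for m in candidates:
        if sum(p for x, p in dist.items() if x + m < 0) <= 1 - alpha:
            return m
    return max(candidates)

def cvar(dist, alpha):
    """Expected shortfall: average loss in the worst (1 - alpha) tail."""
    tail = 1 - alpha
    remaining, acc = tail, 0.0
    for x, p in sorted(dist.items()):          # worst outcomes first
        take = min(p, remaining)
        acc += take * (-x)
        remaining -= take
        if remaining <= 0:
            break
    return acc / tail

# Independent sum of two such positions
pair = {}
for (x1, p1), (x2, p2) in itertools.product(single.items(), repeat=2):
    pair[x1 + x2] = pair.get(x1 + x2, 0.0) + p1 * p2

v1 = var(single, alpha)        # VaR of each position alone is 0
v_sum = var(pair, alpha)       # VaR of the diversified sum is 100: subadditivity fails
assert v1 == 0.0 and v_sum == 100.0

# CVaR, by contrast, sees the tail that a VaR of 0 hides
assert cvar(single, alpha) > v1
```

Each position looks riskless at the 95% level because its 4% loss probability sits below the 5% threshold, yet pooling the two pushes the probability of some loss to 1 − 0.96² = 7.84%, so diversification is penalized; CVaR avoids this pathology by averaging over the tail.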
Beyond VaR and CVaR, the entropic VaR (
Ahmadi-Javid 2011) also introduces an entropy-based correction to the usual VaR framework, aiming for improved coherence-like properties. Meanwhile, expectile-type risk measures (
Bellini and Di Bernardino 2017;
Zaevski and Nedeltchev 2023) focus on generalized quantiles that can exhibit coherence under certain assumptions and capture the asymmetric tail risks. Nonetheless, these constructs typically remain tied to direct distributional assumptions, without necessarily unifying real-world dynamics and no-arbitrage arguments.
In
Section 3, we see that
fulfills the coherence axioms and, thanks to Theorem 2, is firmly rooted in a no-arbitrage framework for incomplete markets. By minimizing relative entropy with respect to
,
selects a risk-neutral (or
-martingale) measure that is closest to the real-world measure, thereby unifying pricing and risk. This unification becomes particularly relevant when local martingale measures may not exist.
In contrast to the above-mentioned advantages, there are also some drawbacks. Approximating or computing the minimal-entropy -martingale measure can be significantly more involved than applying VaR or other distribution-based measures. In volatile environments, calibrating Radon–Nikodým derivatives so that the entropy remains finite also demands extensive data and robust modeling. Moreover, although regulators increasingly tolerate advanced internal models, institutions must still validate the market-incompleteness assumptions and explain the chosen measure to supervisors. Hence, some market participants prefer simpler, purely distribution-based schemes, such as CVaR or the entropic VaR, because they are more transparent and easier to backtest or justify to stakeholders.
Overall, while more sophisticated than VaR and even CVaR, the minimal-entropy risk measure furnishes a coherent and market-consistent framework that directly incorporates incomplete-market structures and penalizes divergence from real-world probabilistic views. Its principal obstacles involve higher computational costs and data intensity, potentially limiting immediate adoption in practice unless sufficiently robust calibration and infrastructure are available. Nonetheless, for investors, regulators, or institutions confronting model uncertainty and market incompleteness, the minimal-entropy methodology offers a powerful alternative that more fully respects no-arbitrage and coherent-risk requirements.
6. Optimal Risk Transfer
Optimal risk transfer is a classical topic in the study of risk measures and has been studied extensively in, e.g.,
Barrieu and El Karoui (
2005). In this section, we present a somewhat more general and extended view, highlighting in particular the property of
-dilated families of risk measures, the associated inf-convolution approach, and how these results can be restricted to the risk-neutral setting. We then provide a brief discussion of why no non-trivial
-dilation family can be constructed from the minimal-entropy coherent risk measure, whereas the standard entropic risk measure naturally fits into this framework. Finally, we show how restricting to (local) risk-neutral measures forces the minimal-entropy measure to emerge as the unique solution to the risk-transfer problem.
Let
be any (convex or coherent) risk measure on
. The fundamental question in optimal risk sharing between two entities with risk measures
and
is to split a position,
X, into two parts
and
F. The goal is to minimize the combined risk
. This optimization problem is well expressed via the inf-convolution operator:
The optimal risk transfer is solved when the infimum in (
12) is achieved via some
. Well-known results (see
Barrieu and El Karoui (
2005)) show that certain risk measures, particularly those arising from
-dilated families, enjoy a simple solution: the optimal allocation is often a linear split of
X.
Definition 7 (
-dilated family).
A family of risk measures on is called γ-dilated if there exists a base risk measure ρ, such thatwhere is some index set. An immediate (but important) example is the entropic risk measure, which is
-dilated by construction:
if one sets
. By contrast, a coherent risk measure with positive homogeneity cannot usually form a non-trivial
-dilated family unless it is purely linear in
and thus collapses to the
-type functional (more details below).
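The γ-dilation of the entropic risk measure, and the collapse of a positively homogeneous functional under the same operation, are easy to see numerically. In the sketch below, rho_base is the base entropic functional ρ(Y) = log E_P[e^{−Y}], and rho_worst is the worst-case measure, used purely as an illustrative coherent example (all numbers are assumptions):

```python
import math

# Finite space; illustrative reference measure and position
p = [0.25, 0.25, 0.5]
x = [1.0, -2.0, 0.5]

def rho_base(y):
    """Base entropic functional rho(Y) = log E_P[exp(-Y)] (gamma = 1)."""
    return math.log(sum(pi * math.exp(-yi) for pi, yi in zip(p, y)))

def rho_gamma(y, gamma):
    """Entropic risk measure with aversion parameter gamma."""
    return math.log(sum(pi * math.exp(-gamma * yi) for pi, yi in zip(p, y))) / gamma

# gamma-dilation: rho_gamma(X) = (1/gamma) * rho_base(gamma * X) for every gamma
for gamma in (0.3, 1.0, 2.5, 7.0):
    dilated = rho_base([gamma * xi for xi in x]) / gamma
    assert abs(rho_gamma(x, gamma) - dilated) < 1e-12

# By contrast, a positively homogeneous base functional is fixed by dilation:
def rho_worst(y):
    return max(-yi for yi in y)   # worst-case risk measure, positively homogeneous

for gamma in (0.3, 1.0, 2.5, 7.0):
    assert abs(rho_worst([gamma * xi for xi in x]) / gamma - rho_worst(x)) < 1e-12
```

The second loop previews the argument at the end of this section: dilating a positively homogeneous functional returns the functional itself, so no non-trivial γ-dependence can arise.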
Theorem 8. Let be a γ-dilated family of convex risk measures on . Then, the following statements hold:
- (a)
For any , we have the inf-convolution identity: - (b)
The optimal allocation solving is linear in X, specifically - (c)
Consequently, we also have - (d)
Moreover, if and are two convex risk measures that each can be embedded in a single γ-dilated family (i.e., they are both instances of for some base ρ), then, for any ,under a suitable identification. Hence, the γ-dilation structure is preserved under inf-convolution.
Sketch of Proof. The key insight is that, for a
-dilated family, scaling
X by
and adjusting the risk measure by
yields a consistent re-parametrization of the same base functional
. Once this is established, standard inf-convolution arguments for the exponentially (or more generally,
-) tilted function imply the linear sharing rule (
14). For full details, see
Barrieu and El Karoui (
2005) or references therein. □
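Parts (a)–(c) of Theorem 8 can be checked numerically for the entropic family: with aversions γ_A and γ_B, the inf-convolution equals the entropic risk with the harmonic combination γ_Aγ_B/(γ_A + γ_B), attained by the linear split in which agent B carries the fraction γ_A/(γ_A + γ_B) of X. The sketch below verifies this on a finite space (all numbers are illustrative assumptions):

```python
import math

p = [0.3, 0.2, 0.5]      # reference measure (illustrative)
x = [2.0, -1.0, 0.5]     # position to be shared

def rho(y, gamma):
    """Entropic risk with aversion gamma on the finite space (p, y)."""
    return math.log(sum(pi * math.exp(-gamma * yi) for pi, yi in zip(p, y))) / gamma

gamma_a, gamma_b = 1.0, 3.0
gamma_ab = gamma_a * gamma_b / (gamma_a + gamma_b)   # harmonic combination

# Linear sharing rule: agent B takes F* = (gamma_a / (gamma_a + gamma_b)) * X,
# i.e., the less risk-averse agent absorbs the larger share.
lam = gamma_a / (gamma_a + gamma_b)
f_star = [lam * xi for xi in x]
rest = [xi - fi for xi, fi in zip(x, f_star)]

total_at_star = rho(rest, gamma_a) + rho(f_star, gamma_b)

# (a) the inf-convolution value equals the entropic risk with combined parameter
assert abs(total_at_star - rho(x, gamma_ab)) < 1e-12

# (b) the linear split F* is optimal among linear splits F = c * X
for c in [i / 50 for i in range(-25, 76)]:
    f = [c * xi for xi in x]
    total = rho([xi - fi for xi, fi in zip(x, f)], gamma_a) + rho(f, gamma_b)
    assert total >= total_at_star - 1e-12
```

The identity in (a) is exact: at the optimal split, both exponents reduce to the same γ_AB·X, and 1/γ_A + 1/γ_B = 1/γ_AB, which is the familiar statement that risk tolerances (inverse aversions) add under optimal transfer.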
We now explain why a coherent risk measure—in particular, the minimal-entropy risk measure
—cannot generally be part of a non-trivial
-dilated family. Recall that a coherent risk measure
is positively homogeneous, i.e.,
Combining this with the
-dilation requirement
quickly forces
for all
, unless
or
. Indeed, for coherent
, we would have
Hence,
cannot genuinely “depend” on
. In short, non-trivial
-dilated families (like the entropic class) are not coherent. This explains why the minimal-entropy risk measure,
, cannot appear in a standard
-dilated framework.
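Assuming the dilation convention ρ_γ(X) = γ^{−1}ρ(γX) implicit in Definition 7 and the entropic example, the collapse argument can be written in one line:

```latex
\rho_\gamma(X) \;=\; \tfrac{1}{\gamma}\,\rho(\gamma X)
             \;=\; \tfrac{1}{\gamma}\,\gamma\,\rho(X)
             \;=\; \rho(X)
\qquad \text{for every } \gamma > 0,
```

where the middle equality uses the positive homogeneity of the coherent base measure ρ. Hence every member of a would-be γ-dilated coherent family coincides with ρ itself, which is precisely the trivial case excluded above.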