Entropy 2019, 21(7), 634

Divergence-Based Risk Measures: A Discussion on Sensitivities and Extensions
School of Economics, Sichuan University, Chengdu 610065, China
Department of Statistics and Operations Research, University of Granada, 18071 Granada, Spain
Author to whom correspondence should be addressed.
Received: 13 June 2019 / Accepted: 24 June 2019 / Published: 27 June 2019


This paper introduces a new family of convex divergence-based risk measures, obtained by specifying the $(h,\phi)$-divergence as the penalty term in the dual representation. First, the sensitivity of the modified divergence risk measure with respect to the profit and loss (P&L) and to the reference probability in the penalty term is discussed, in view of the certainty equivalent and robust statistics. Second, a similar sensitivity property of the $(h,\phi)$-divergence risk measure with respect to P&L is shown, and boundedness by the analytic risk measure is proved. Numerical studies designed for the Rényi- and Tsallis-divergence risk measures are provided. This new family integrates a wide spectrum of divergence risk measures and relates to divergence preferences.
convex risk measure; preference; sensitivity analysis; ambiguity; ϕ-divergence

1. Introduction

Over the last two decades, a well-founded theory of risk measures has developed substantially, propelled in particular by the axiomatic approach introduced in [1] in relation to the concept of coherency. Although the theory has been largely inspired and motivated by financial risk assessment, many other areas of application currently benefit, or could benefit, from the formal mathematical construction of the discipline.
The coherency axioms of monotonicity, translation invariance, positive homogeneity and sub-additivity lead to a representation for coherent risk measures of the form
$$\rho(X) = \sup_{Q \in \mathcal{Q}} E_Q[-X], \tag{1}$$
where $X$ is a real-valued measurable function on the measurable space $(\Omega, \mathcal{F})$ representing benefit, and $\mathcal{Q}$ is a certain set of probability measures on $(\Omega, \mathcal{F})$. Since its introduction, this formulation has been the subject of considerable debate, mainly because of the restrictive conditions implied by the axiom of sub-additivity, which make coherent measures of risk unsuitable in certain applications. Relaxing this axiom to a weaker convexity condition, as introduced by [2,3], provides a more flexible representation of the form
$$\rho(X) = \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - \alpha(Q) \}, \tag{2}$$
where $\mathcal{Q}$ is a suitable set of probability measures on $(\Omega, \mathcal{F})$, and $\alpha$ is a penalty function defined on $\mathcal{Q}$. The minimal penalty function for $\rho$ is convex, being defined by
$$\alpha_{\min}(Q) := \sup_{X \in \mathcal{X}} \{ E_Q[-X] - \rho(X) \},$$
where $\mathcal{X}$ is the space of all Borel-measurable functions defined on $(\Omega, \mathcal{F})$. With additional assumptions (see, for example, [4]), there exists a one-to-one pairing between $\rho$ and $\alpha_{\min}$ via Legendre-Fenchel (LF) duality. The theories of convex and coherent risk measures have been increasingly and deeply developed as a central axis of the general theory of measures of risk, owing to the contributions of many researchers (e.g., [4,5,6,7,8,9,10,11,12]; see also the early work of [13], among others). In the law-invariant case [14], the value $\rho(X)$ depends only on the distribution of $X \in \mathcal{X}$ under a prefixed probability measure $P$ on $(\Omega, \mathcal{F})$. Typical examples are value-at-risk (VaR), average value-at-risk (AVaR), tail value-at-risk (TVaR), the entropic risk measure, and the $\phi$-divergence risk measure related to the optimized certainty equivalent (OCE), among others; the authors of [15] comparatively study various generalized entropy-based forms of risk measures. Details on the VaR-related family of risk measures can be found in the extensive literature connected with duality. Here we review only some basic definitions, focusing on the entropic risk measure and the $\phi$-divergence risk measure used in the examples of Section 2.1.
In the decision theory under uncertainty, especially in the context of economics, the numerical representation of preferences shares a similar structure as the dual form of risk measures. In this context, it is postulated that an agent can evaluate the consequences of an alternative decision according to
$$U(X) = \inf_{Q \in \mathcal{Q}} \{ E_Q[u(X)] + \alpha(Q) \},$$
where $u$ denotes the utility function and the penalty function $\alpha$ is interpreted as the ambiguity attitude in this strand of the literature. For more details on the ambiguity of variational preferences, we refer to [16], which also proves that the preference relation satisfies the axioms of preferences when there exists a non-constant affine utility function, and which introduces a new class of preferences, built on the $\phi$-divergence, for handling Ellsberg-type puzzles [17]. This numerical form leads to the definition of a loss functional, $L(X) = -U(X)$, satisfying
$$L(X) = \sup_{Q \in \mathcal{Q}} \{ E_Q[-u(X)] - \alpha(Q) \},$$
but we are interested in the risk measure, the payoff of the financial position, which is equivalent to the negative certainty equivalent
$$\sup_{Q \in \mathcal{Q}} \{ E_Q[\varphi(-X)] - \alpha(Q) \},$$
where $\varphi(x) = -u(-x)$ is called the de-utility function. The (generalized) Donsker-Varadhan variational formula [18,19] plays a vital role in these connections, which have recently also been extended to the classical Gini index of income inequality [20] through the linkage tool derived from [21]. Other efforts have been made to connect these two research fields, such as [2,7] and, more recently, [22].
From the perspective of convex risk measures, intuitively, the dual representation of Equation (2) can be regarded as a risk-measure generator obeying the axiomatic framework, i.e., we are able to produce a certain risk measure through selecting among different penalty functions. For instance, [8] constructed a bridge between the expected utility framework and the ϕ -divergence risk-measure framework (Lemma A2). On the one hand, any utility function satisfying certain suitable conditions can generate its related convex risk measure. Conversely, any ϕ -divergence can provide a possibly analytic form of a risk measure useful for calculation in practice, such as the Tsallis divergence.
Thus, the penalty function plays a substantive role in the evaluation of risk, with its properties determining the behavior of the linked risk measure. To our knowledge, however, this sensitivity viewpoint has been largely overlooked in the dual theory of convex risk measures. Besides early work, such as [23,24], on sensitivity with respect to portfolio allocation in VaR-type cases, the authors of [25] have more recently proposed a framework for sensitivity analysis of both the measurement processes and the dataset, through robust statistics and the estimation procedures, respectively. These sensitivity studies focus on trend identification (direction of change), which reveals how a shift of position affects the output risk measurements. In an extensive review, the authors of [26] identify and comparatively discuss different sensitivity analysis strategies in the literature, with a clear distinction between local (deterministic) and global (probabilistic) approaches.
The contribution of this paper is twofold. First, after reviewing the literature on the ambiguity aversion (or loving) of preferences in Section 2.2, we study the sensitivity of convex risk measures by examining the modified version of the $\phi$-divergence risk measure; the analysis, presented in Section 2.2 and Section 3, is carried out with respect to both the reference probability measure and the input financial position.
Moreover, some important measures of divergence, such as the Rényi divergence, cannot be obtained as particular cases of the $\phi$-divergence. For this reason, the authors of [27] worked with an extended formulation, the so-called $(h,\phi)$-divergence, denoted by $D_\phi^h(Q,P)$, from which various well-known divergences can be obtained through the extra distortion $h$. Our second contribution is, in a first step, straightforward: by replacing the penalty function with the $(h,\phi)$-divergence in the dual form of a convex risk measure, we derive a new family of convex risk measures, the $(h,\phi)$-divergence risk measure
$$\rho_h(X) := \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - D_\phi^h(Q,P) \}.$$
Recent studies on risk measures have focused on extending the risk-neutral valuation. Under the probability measure $Q$, the term $E_Q[-X]$ corresponds to the risk-neutral evaluation. A more general approach to evaluating $X$ consists in considering $c_\varphi(X,Q) = \varphi^{-1}(E_Q[\varphi(-X)])$, where a non-linear and convex $\varphi$ leads to a risk-averse evaluation by the so-called $\varphi$-convex risk measures. Based on this extension, the authors of [7] develop a subclass of the entropic risk measure and establish connections with variational preferences by taking the de-utility function $\varphi$ to be linear or exponential. The authors of [9] extend this to a more general form based on utility theory, while the authors of [28] develop the optimal expected utility (OEU) risk measure by modifying the OCE, which has the practical advantage of being defined through an optimization over the real line.
Our article is organized as follows. In Section 2, we briefly review the content of the dual representation for convex risk measures and the concept of ambiguity in the field of the preference in decision making theory. In Section 3, we strengthen the analysis on the modified version of ϕ -divergence risk measure and its sensitivity on the financial positions, and analyze the sensitivity on the probability measure. In Section 4, we extend the penalty term to the ( h , ϕ ) -divergence for defining the new family of convex risk measures, and derive relative OCE bounds. The case of Rényi divergence is addressed in particular. Some numerical studies by cases are shown in Section 5. Conclusions and directions of continuing work are given in Section 6.

2. Risk Measures and Ambiguity

There is a deep connection between the concept and representation of a risk measure and ambiguity. In Section 2.1, we review several definitions and special cases of measures of risk in view of axioms and duality. In Section 2.2, we discuss the interpretation of the penalty term in relation to ambiguity, against the background of decision-making theory.

2.1. Dual Representation of Risk Measures

Let $\Omega$ be a fixed set of scenarios. A financial position is typically uncertain, and is modeled as a real-valued measurable function $X$ on the measurable space $(\Omega, \mathcal{F})$, for a given $\sigma$-algebra $\mathcal{F}$.
Definition 1.
A mapping $\rho: \mathcal{X} \to \mathbb{R}$ is called a ‘convex measure of risk’ if it satisfies the following conditions for all $X, Y \in \mathcal{X}$:
Convexity: $\rho(\lambda X + (1-\lambda) Y) \le \lambda \rho(X) + (1-\lambda) \rho(Y)$, for $\lambda \in [0,1]$.
Monotonicity: If $X \le Y$, then $\rho(X) \ge \rho(Y)$.
Translation Invariance: If $m \in \mathbb{R}$, then $\rho(X + m) = \rho(X) - m$.
A convex measure of risk $\rho$ is called a ‘coherent measure of risk’ if it also satisfies
Positive Homogeneity: If $\lambda \ge 0$, then $\rho(\lambda X) = \lambda \rho(X)$.
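The axioms of Definition 1 can be spot-checked numerically on a finite scenario set. The sketch below is only an illustration: the worst-case risk `worst_case_risk`, the helper `check_axioms`, and the sample positions are hypothetical choices made here, and passing a finite check does not prove that a functional satisfies the axioms.

```python
def worst_case_risk(x):
    """rho(X) = max_i(-x_i): the largest loss over a finite set of scenarios."""
    return max(-xi for xi in x)


def check_axioms(rho, x, y, lam=0.3, m=0.25, scale=2.0, tol=1e-12):
    """Evaluate the four axioms of Definition 1 on one pair of positions."""
    mix = [lam * xi + (1 - lam) * yi for xi, yi in zip(x, y)]
    lower = [min(xi, yi) for xi, yi in zip(x, y)]  # lower <= x pointwise
    shifted = [xi + m for xi in x]
    scaled = [scale * xi for xi in x]
    return {
        "convexity": rho(mix) <= lam * rho(x) + (1 - lam) * rho(y) + tol,
        "monotonicity": rho(lower) >= rho(x) - tol,
        "translation_invariance": abs(rho(shifted) - (rho(x) - m)) <= tol,
        "positive_homogeneity": abs(rho(scaled) - scale * rho(x)) <= tol,
    }


x = [0.1, 0.4, -0.2, 0.7]
y = [0.3, -0.5, 0.2, 0.6]
report = check_axioms(worst_case_risk, x, y)
```

Any other candidate functional on finite vectors can be passed as `rho` to see which of the four conditions fail on a given pair of positions.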
Following the robust representation of convex measures of risk in [2], we recall some notation. $\mathcal{M}_1 := \mathcal{M}_1(\Omega, \mathcal{F})$ denotes the class of all probability measures on $(\Omega, \mathcal{F})$, and $\mathcal{M}_{1,f} := \mathcal{M}_{1,f}(\Omega, \mathcal{F})$ denotes the class of all finitely additive and non-negative set functions $Q$ on $\mathcal{F}$ which are normalized to $Q[\Omega] = 1$. Let $\alpha: \mathcal{M}_{1,f} \to \mathbb{R} \cup \{+\infty\}$ be any functional which is bounded from below and not identically equal to $+\infty$. For each $Q \in \mathcal{M}_{1,f}$, the functional $X \mapsto E_Q[-X] - \alpha(Q)$ is convex, monotone, and translation invariant, and these three properties are preserved when taking the supremum over $Q \in \mathcal{M}_{1,f}$. Hence,
$$\rho(X) := \sup_{Q \in \mathcal{M}_{1,f}} \{ E_Q[-X] - \alpha(Q) \}$$
defines a dual form of a convex measure of risk on $\mathcal{X}$.
We recall the definitions of entropic risk measure and the ϕ -divergence risk measure (named g-divergence in [8]), which motivate our contributions.
Entropic risk measure. The standard ’entropic risk measure’ is defined by
$$e_\gamma(X) := \frac{1}{\gamma} \ln E_P[e^{-\gamma X}], \tag{4}$$
for parameter $\gamma \in [0, \infty)$. Its dual representation is given by
$$e_\gamma(X) = \sup_{Q \in \mathcal{Q}} \left\{ E_Q[-X] - \frac{1}{\gamma} D(Q,P) \right\},$$
where $D(Q,P)$ denotes the relative entropy or Kullback-Leibler (KL) divergence of $Q$ with respect to $P$.
When $D(Q,P) \le d$, for a constant $d$, the entropic risk measure is coherent [6]. The entropic risk measure is related to Varadhan’s Lemma (Lemma A3) in large deviation theory, when the rate function is the relative entropy.
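For a discrete position, the agreement between the closed form of the entropic risk measure and its dual representation can be checked directly. The following is a minimal sketch under assumptions made here: a finite sample space, an equiprobable reference measure, and the exponentially tilted measure $q_i \propto p_i e^{-\gamma x_i}$ as the candidate maximizer; all function names are hypothetical.

```python
import math


def entropic_risk(x, p, gamma):
    """e_gamma(X) = (1 / gamma) * ln E_P[exp(-gamma X)] for a discrete position."""
    return math.log(sum(pi * math.exp(-gamma * xi) for xi, pi in zip(x, p))) / gamma


def kl_divergence(q, p):
    """Relative entropy D(Q, P) = sum_i q_i ln(q_i / p_i), with 0 ln 0 = 0."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, p) if qi > 0)


def dual_objective(x, q, p, gamma):
    """E_Q[-X] - (1 / gamma) D(Q, P), the quantity maximized in the dual form."""
    return -sum(qi * xi for xi, qi in zip(x, q)) - kl_divergence(q, p) / gamma


def tilted_measure(x, p, gamma):
    """Exponentially tilted measure with q_i proportional to p_i exp(-gamma x_i)."""
    weights = [pi * math.exp(-gamma * xi) for xi, pi in zip(x, p)]
    total = sum(weights)
    return [w / total for w in weights]


x = [0.1 * k for k in range(1, 11)]  # illustrative positions 0.1, ..., 1.0
p = [0.1] * 10                       # illustrative equiprobable reference measure
gamma = 2.0
q_star = tilted_measure(x, p, gamma)
primal = entropic_risk(x, p, gamma)
dual = dual_objective(x, q_star, p, gamma)
```

Evaluating `dual_objective` at any other probability vector, for instance at `p` itself, gives a value no larger than `primal`, which is the content of the supremum in the dual form.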
$\phi$-divergence risk measure. A natural extension can be formulated as the ‘$\phi$-divergence risk measure’ in terms of a $\phi$-divergence $D_\phi(Q,P)$,
$$\rho_\phi(X) = \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - D_\phi(Q,P) \}, \tag{6}$$
with ϕ being a convex function satisfying certain suitable conditions.
Remark 1.
The ‘optimized certainty equivalent’ (OCE) representation by [8,19] can be derived from the generalized Donsker-Varadhan variational formula, as follows:
$$\rho_{OCE}(X) = -\sup_{\eta \in \mathbb{R}} \{ \eta - E_P[\phi^*(\eta - X)] \} = \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - D_\phi(Q,P) \}, \tag{7}$$
where $u(\cdot) = -\phi^*(-\,\cdot)$ denotes the utility function, with $\phi^*$ being the conjugate of $\phi$.
Remark 2.
The OCE representation of Equation (7), referring to Kusuoka representation (see also [29,30]) in case of law invariance, is interpreted as the optimal decision of the allocation of X between the present η and the future consumption.
For various choices of $\phi$ corresponding to different divergences, we refer to [31] and the applications in [15]. Since the Tsallis divergence is a member of the family of $\phi$-divergences, we can plug it into Equation (6) to obtain the Tsallis-divergence risk measure $\rho_T(X)$, with $\phi(x) = (x^{1-\alpha} - 1)/(\alpha - 1)$ for $\alpha \in (0, \infty)$.
Average value-at-risk (AVaR). The average value-at-risk (AVaR) is defined, for $\alpha \in (0,1)$, as
$$\mathrm{AVaR}_\alpha(X) = \frac{1}{\alpha} \int_0^\alpha \mathrm{VaR}_p(X)\, dp,$$
where VaR denotes value-at-risk, $\mathrm{VaR}_\alpha(X) = \inf\{ m \in \mathbb{R} : P[m + X < 0] \le \alpha \}$, and it can be rewritten in the LF-dual and the OCE-type forms as
$$\mathrm{AVaR}_\alpha(X) = \sup_{Q \in \mathcal{Q}_\alpha} E_Q[-X] = -\sup_{\eta \in \mathbb{R}} \left\{ \eta - \frac{1}{\alpha} E_P[(\eta - X)^+] \right\},$$
where $\mathcal{Q}_\alpha$ is the set of all probability measures $Q \ll P$ whose density $dQ/dP$ is $P$-a.s. bounded by $1/\alpha$. Here, for any $z \in \mathbb{R}$, we denote $(z)^+ = \max\{0, z\}$.
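The two forms of AVaR given above can be compared on a discrete position. This is a minimal sketch under assumptions made here: a finite sample space, the fact that the piecewise-linear concave OCE objective attains its supremum at one of the atoms, and the dual maximizer that loads the density cap $1/\alpha$ onto the smallest payoffs first. The function names are hypothetical.

```python
def avar_oce(x, p, alpha):
    """OCE-type form: -sup_eta { eta - (1 / alpha) E_P[(eta - X)^+] }.

    The objective is concave and piecewise linear in eta with kinks at the
    atoms of X, so it is enough to evaluate it at those atoms.
    """
    def objective(eta):
        return eta - sum(pi * max(eta - xi, 0.0) for xi, pi in zip(x, p)) / alpha

    return -max(objective(eta) for eta in x)


def avar_dual(x, p, alpha):
    """LF-dual form: sup of E_Q[-X] over densities dQ/dP bounded by 1 / alpha."""
    remaining = 1.0
    value = 0.0
    for xi, pi in sorted(zip(x, p)):          # smallest payoffs first
        qi = min(pi / alpha, remaining)       # respect the density cap
        value -= qi * xi
        remaining -= qi
    return value


x = [0.1 * k for k in range(1, 11)]  # illustrative positions 0.1, ..., 1.0
p = [0.1] * 10                       # illustrative equiprobable reference measure
alpha = 0.2
oce_value = avar_oce(x, p, alpha)
dual_value = avar_dual(x, p, alpha)
```

With these inputs the worst 20% of outcomes are the payoffs 0.1 and 0.2, so both forms return minus their average.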

2.2. Ambiguity

In the language of preference, the penalty term in the dual form of a convex measure of risk can be interpreted as ambiguity aversion. Here we summarize how ambiguity attitudes are characterized by the variational preferences of [16] and by the original framework of [32]. The preference denoted by $\succsim$ is called ‘variational’ if and only if there exists a non-constant affine function $u: \mathcal{X} \to \mathbb{R}$ and a grounded, convex and lower semicontinuous function $\alpha$ such that, for all acts $X_1, X_2 \in \mathcal{X}$,
$$X_1 \succsim X_2 \iff \inf_{Q \in \mathcal{Q}} \{ E_Q[u(X_1)] + \alpha(Q) \} \ge \inf_{Q \in \mathcal{Q}} \{ E_Q[u(X_2)] + \alpha(Q) \} \iff \rho(u(X_1)) \le \rho(u(X_2)). \tag{10}$$
For each $u$ there is a (unique) minimal $\alpha_{\min}$ that satisfies Equation (10), given by
$$\alpha_{\min}(Q) = \sup_{X \in \mathcal{X}} \{ E_Q[-u(X)] - \rho(u(X)) \} = \sup_{X \in \mathcal{X}} \{ E_Q[-u(X)] + u(x_X) \},$$
where x X is a ‘certainty equivalent’ for X (see [16]).
According to [32], the benchmark preference, the subjective expected utility (SEU) preference, is introduced to judge ambiguity neutrality comparatively (see also [33]), and is quantified by [34] via the Arrow-Pratt quadratic approximation. All variational preferences are ambiguity-averse. In order to distinguish ambiguity attitudes among variational preferences, the penalty function $\alpha$ receives a substantive interpretation in the following result. Note that $u_1 \approx u_2$ means that there exist a positive constant $d_1$ and a constant $d_2$ such that $u_1 = d_1 u_2 + d_2$.
Proposition 1
(Proposition 8 in [16]). Given two variational preferences $\succsim_1$ and $\succsim_2$, the relation $\succsim_1$ is more ambiguity-averse than $\succsim_2$ if and only if $u_1 \approx u_2$ and $\alpha_{\min,1} \le \alpha_{\min,2}$, provided that $u_1 = u_2$.
Thus, under the common normalization $u_1 = u_2$, greater ambiguity aversion corresponds to a smaller penalty $\alpha_{\min}$, and $\alpha_{\min}$ is interpreted as the ‘index of ambiguity’, in agreement with the penalty term in the convex risk measure.

3. Properties of Divergence Risk Measure

3.1. Modified ϕ -Divergence Risk Measure

The modified ϕ -divergence risk measure, involving a linear rescaling of the penalty term, is defined below.
Definition 2.
The modified $\phi$-divergence risk measure with parameter $\theta \in \mathbb{R}$ is formulated as
$$\rho_M(X) := \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - \theta D_\phi(Q,P) \}, \tag{12}$$
where D ϕ ( Q , P ) is the ϕ-divergence.
The formulation above comes from the divergence preference approach in [16] (see also [19]), where a slightly more general case of weighted $\phi$-divergence is considered. Compared to the $\phi$-divergence risk measure of Equation (8), it benefits from the extra parameter $\theta$, which can be seen as an indicator controlling the weight of the penalty, i.e., the relative importance of ambiguity aversion in the construction of the risk measure. It plays the same role as $1/\gamma$ in the standard entropic risk measure of Equation (4). The Donsker-Varadhan variational formula can be rewritten for these cases as follows.
Proposition 2.
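For parameter $\theta \in \mathbb{R}_+$,
$$\inf_{Q \in \mathcal{Q}} \{ E_Q[X] + \theta D_\phi(Q,P) \} = \sup_{\eta \in \mathbb{R}} \{ \eta - E_P[\phi_\theta^*(\eta - X)] \},$$
with $\phi_\theta^*(\cdot)$ being the Legendre-Fenchel transform of $\theta \cdot \phi(\cdot)$,
$$\phi_\theta^*(x) = \sup_{t \in \operatorname{dom} \phi} \{ t \cdot x - \theta \cdot \phi(t) \}.$$
Correspondingly, the modified $\phi$-divergence risk measure can also be rewritten in the following OCE-type form:
$$\rho_M(X) = -\sup_{\eta \in \mathbb{R}} \{ \eta - E_P[\phi_\theta^*(\eta - X)] \}. \tag{13}$$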
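The OCE-type form reduces the supremum over measures to a one-dimensional concave maximization in $\eta$. The sketch below illustrates this for one specific case chosen here as an assumption: the KL generator $\phi(t) = t \ln t - t + 1$, for which $\phi_\theta^*(y) = \theta (e^{y/\theta} - 1)$ and the risk measure has the closed form $\theta \ln E_P[e^{-X/\theta}]$. The ternary search and all function names are illustrative choices, not a method taken from the paper.

```python
import math


def oce_objective(eta, x, p, theta):
    """eta - E_P[phi_theta^*(eta - X)] with phi_theta^*(y) = theta (exp(y / theta) - 1)."""
    penalty = sum(pi * theta * (math.exp((eta - xi) / theta) - 1.0)
                  for xi, pi in zip(x, p))
    return eta - penalty


def rho_m_oce(x, p, theta, iterations=200):
    """rho_M(X) = -sup_eta objective, via ternary search on the concave objective."""
    lo, hi = min(x), max(x)  # in the KL case the maximizer lies in [min X, max X]
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if oce_objective(m1, x, p, theta) < oce_objective(m2, x, p, theta):
            lo = m1
        else:
            hi = m2
    return -oce_objective(0.5 * (lo + hi), x, p, theta)


def rho_m_closed_form(x, p, theta):
    """Closed form theta * ln E_P[exp(-X / theta)] available in the KL case."""
    return theta * math.log(sum(pi * math.exp(-xi / theta) for xi, pi in zip(x, p)))


x = [0.1 * k for k in range(1, 11)]  # illustrative positions 0.1, ..., 1.0
p = [0.1] * 10                       # illustrative equiprobable reference measure
theta = 0.5
numeric = rho_m_oce(x, p, theta)
closed = rho_m_closed_form(x, p, theta)
```

For generators without a closed form, only `oce_objective` would need to change, since the search itself uses nothing beyond concavity in $\eta$ and a bracketing interval.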
The parameter $\theta$ calibrates the relative effect of penalization in terms of the discrepancy of the measure $Q$ with respect to the reference measure $P$. The limiting cases where $\theta$ tends to $0$ or $\infty$ are shown in the next result.
Proposition 3
(Proposition 22 in [16]). The following limiting cases for θ hold:
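$$\lim_{\theta \to 0} \sup_{Q \in \mathcal{M}_{1,f}(\Omega)} \{ E_Q[-X] - \theta D_\phi(Q,P) \} = -\operatorname{ess\,inf}_{\omega} X(\omega),$$
$$\lim_{\theta \to \infty} \sup_{Q \in \mathcal{M}_{1,f}(\Omega)} \{ E_Q[-X] - \theta D_\phi(Q,P) \} = E_P[-X].$$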
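The two limiting regimes can be observed numerically. The sketch below assumes the KL generator, for which the modified risk measure equals $\theta \ln E_P[e^{-X/\theta}]$, and uses a log-sum-exp shift so that very small values of $\theta$ do not overflow; the positions, the reference measure and the function name are illustrative assumptions.

```python
import math


def rho_m_kl(x, p, theta):
    """theta * ln E_P[exp(-X / theta)], evaluated with a log-sum-exp shift."""
    shift = max(-xi for xi in x)
    total = sum(pi * math.exp((-xi - shift) / theta) for xi, pi in zip(x, p))
    return shift + theta * math.log(total)


x = [0.1 * k for k in range(1, 11)]  # illustrative positions 0.1, ..., 1.0
p = [0.1] * 10                       # illustrative equiprobable reference measure

small_theta = rho_m_kl(x, p, 1e-3)   # penalty almost switched off
large_theta = rho_m_kl(x, p, 1e4)    # penalty dominates, Q is forced towards P

worst_case = -min(x)                                     # -ess inf X
expected_loss = -sum(pi * xi for xi, pi in zip(x, p))    # E_P[-X]
```

As $\theta$ decreases the value moves from the expected loss under $P$ towards the worst-case loss, which matches the reading of $\theta$ as the credibility attributed to the reference model.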
In the context of preferences, agents who behave according to this criterion acknowledge that the reference $P$ may not be the right probability measure to reflect their interests, so that alternative probability measures $Q$ are taken into consideration, weighted through the parameter $\theta$. The larger the value of $\theta$, the higher the credibility attributed to $P$ as the correct model.

3.2. Sensitivity Analysis with Respect to X

We first analyze the static sensitivity of the modified risk measure with respect to the financial position $X$, in the direction towards $X'$ in the space $\mathcal{X}$. For convenience, we introduce some notation. Let $f_Q(X) := E_Q[-X] - \theta D_\phi(Q,P)$. The modified $\phi$-divergence risk measure can be rewritten as $\rho_M(X) = \sup_Q f_Q(X)$. It is natural to use Gâteaux differentiability ([35], Chapter 2) to describe the derivative in the direction $X' \in \mathcal{X}$, that is,
$$\frac{\partial}{\partial r} \rho_M(X_r) \Big|_{r=0} = \lim_{r \to 0} \frac{\rho_M(X_r) - \rho_M(X)}{r},$$
where $X_r$ denotes the intermediate position in the direction towards $X'$ given by
$$X_r = (1-r) X + r X'.$$
According to [7,16], we can derive the following proposition.
Proposition 4.
The modified $\phi$-divergence risk measure is everywhere differentiable for all $\theta > 0$, and, assuming that $Q_0 = \arg\sup_Q f_Q(X)$ exists,
$$\frac{\partial}{\partial r} \rho_M(X_r) \Big|_{r=0} = \lim_{r \to 0} \frac{\rho_M(X_r) - \rho_M(X)}{r} = E_{Q_0}[X - X'].$$
Remark 3.
See, for example, [4,36] for conditions ensuring the existence of Q 0 .
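Proof. At any given point $X$ for the financial position, in the direction towards $X'$, given $Q_0 = \arg\sup_Q f_Q(X)$, we have
$$\frac{\partial}{\partial r} \rho_M(X_r) \Big|_{r=0} = \lim_{r \to 0} \frac{\rho_M(X_r) - \rho_M(X)}{r} = \lim_{r \to 0} \frac{\rho_M(X_r) - f_{Q_0}(X_r)}{r} + \lim_{r \to 0} \frac{f_{Q_0}(X_r) - \rho_M(X)}{r} = \lim_{r \to 0} \frac{\rho_M(X_r) - f_{Q_0}(X_r)}{r} + E_{Q_0}[X - X'].$$
Since $\rho_M(X_r) \ge f_{Q_0}(X_r)$ for every $r$, the one-sided limits satisfy
$$\lim_{r \to 0^+} \frac{\rho_M(X_r) - f_{Q_0}(X_r)}{r} \ge 0, \qquad \lim_{r \to 0^-} \frac{\rho_M(X_r) - f_{Q_0}(X_r)}{r} \le 0,$$
and by the convexity assumption it follows that
$$\lim_{r \to 0} \frac{\rho_M(X_r) - f_{Q_0}(X_r)}{r} = 0,$$
which completes the proof. □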
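The directional derivative $E_{Q_0}[X - X']$ can be compared with a finite-difference quotient. The sketch below is an illustration under assumptions made here: the KL generator, for which $\rho_M(X) = \theta \ln E_P[e^{-X/\theta}]$ and the maximizing measure is $q_i \propto p_i e^{-x_i/\theta}$, together with illustrative positions; the function names are hypothetical.

```python
import math


def rho_m(x, p, theta):
    """KL case of the modified risk measure: theta * ln E_P[exp(-X / theta)]."""
    return theta * math.log(sum(pi * math.exp(-xi / theta) for xi, pi in zip(x, p)))


def optimal_measure(x, p, theta):
    """Maximizer Q_0 in the KL case: q_i proportional to p_i exp(-x_i / theta)."""
    weights = [pi * math.exp(-xi / theta) for xi, pi in zip(x, p)]
    total = sum(weights)
    return [w / total for w in weights]


def directional_derivative(x, x_prime, p, theta):
    """E_{Q_0}[X - X'], the Gateaux derivative at X in the direction towards X'."""
    q0 = optimal_measure(x, p, theta)
    return sum(qi * (xi - xpi) for qi, xi, xpi in zip(q0, x, x_prime))


def finite_difference(x, x_prime, p, theta, r=1e-6):
    """Central difference quotient of r -> rho_M((1 - r) X + r X') at r = 0."""
    def at(step):
        x_r = [(1 - step) * xi + step * xpi for xi, xpi in zip(x, x_prime)]
        return rho_m(x_r, p, theta)

    return (at(r) - at(-r)) / (2 * r)


x = [0.1 * k for k in range(1, 11)]  # illustrative position X
x_prime = list(reversed(x))          # illustrative direction X'
p = [0.1] * 10                       # illustrative equiprobable reference measure
theta = 0.5
analytic = directional_derivative(x, x_prime, p, theta)
numeric = finite_difference(x, x_prime, p, theta)
```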
The above sensitivity property applies only to Gâteaux-differentiable risk measures; more recently, the authors of [37] consider convex risk measures in non-Gâteaux-differentiable cases by means of the Aumann-Shapley allocation principle [38], from the viewpoint of capital allocation.

3.3. Sensitivity Analysis with Respect to P

Apart from the qualitative robustness analysis addressed in [25], which provides a systematic framework for examining Hampel’s robustness of risk estimators, the following content shows one aspect of sensitivity by theoretically deriving the error of the risk measure based on Gâteaux differentiability, considering that the reference probability measure $P$ undergoes a slight change in the direction towards a certain probability measure $P'$ in the space $\mathcal{M}_1$. The directional derivative can be seen as a measure of the sensitivity of $\rho(X)$ with respect to the mixture measure $P_\gamma = (1-\gamma) P + \gamma P'$ of $P$ and $P'$, as $\gamma$ tends to 0.
For convenience, we adopt the following notation in this section. Let $g(\eta, P) := \eta - E_P[\phi_\theta^*(\eta - X)]$. Hence, we rewrite
$$\rho_M(X) := \rho(X, P) = -\sup_\eta g(\eta, P) = -g(\eta(P), P),$$
with $\eta(P) = \arg\sup_\eta g(\eta, P)$. In what follows, $q$ denotes the argument of $\phi_\theta^*(\cdot)$.
As before, it is plausible to define the derivative of $\rho(X, P_\gamma)$ at $\gamma = 0$ for describing the degree of robustness of the modified $\phi$-divergence risk measure, which reflects the effect on $\rho_M(X)$ of a small change of the probability measure $P$ in the direction towards the probability measure $P'$.
The analysis is then addressed to assess the derivative
$$\frac{\partial}{\partial \gamma} \rho(X, P_\gamma) \Big|_{\gamma=0} = \lim_{\gamma \to 0} \frac{\rho(X, P_\gamma) - \rho(X, P)}{\gamma} = -\lim_{\gamma \to 0} \frac{g(\eta(P_\gamma), P_\gamma) - g(\eta(P), P)}{\gamma}, \tag{15}$$
where, as stated above, $P_\gamma = (1-\gamma) P + \gamma P'$. Observing the OCE-type form of the divergence risk measure (Equation (13)), it is natural to connect with the theory of robust statistics, namely the influence function (see [35]). The following lemma discusses the marginal behavior of $\eta$ with respect to $P$, where we write $\dot{\eta}$ for short,
$$\dot{\eta} = \lim_{\gamma \to 0} \frac{\eta(P_\gamma) - \eta(P)}{\gamma}.$$
Lemma 1.
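Suppose that $\phi_\theta^*(\cdot)$ is a twice-differentiable function and $\int_\Omega \frac{\partial^2}{\partial q^2} \phi_\theta^*(\eta(P) - X)\, dP \neq 0$. Then,
$$\dot{\eta} = \frac{1 - \int_\Omega \frac{\partial}{\partial q} \phi_\theta^*(\eta(P) - X)\, dP'}{\int_\Omega \frac{\partial^2}{\partial q^2} \phi_\theta^*(\eta(P) - X)\, dP}. \tag{16}$$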
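Proof. By optimization in the OCE-type form, we directly obtain the relationship between $\eta$ and the probability measure $P$. For the maximizer $\eta(P)$ with respect to $P$, we get
$$\frac{\partial}{\partial \eta} g \Big|_{\eta(P)} = 0 \iff \int_\Omega \frac{\partial}{\partial q} \phi_\theta^*(\eta(P) - X)\, dP = 1.$$
Replacing $P$ with $P_\gamma$ in the above equation, and taking the derivative on both sides with respect to $\gamma$ at $\gamma = 0$, it follows that
$$\dot{\eta} \int_\Omega \frac{\partial^2}{\partial q^2} \phi_\theta^*(\eta(P) - X)\, dP - \int_\Omega \frac{\partial}{\partial q} \phi_\theta^*(\eta(P) - X)\, dP + \int_\Omega \frac{\partial}{\partial q} \phi_\theta^*(\eta(P) - X)\, dP' = 0,$$
leading to the result. □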
Thus, according to Lemma 1, the influence function of the modified ϕ -divergence risk measure can be constructed by substituting Equation (16) into Equation (15).
Theorem 1.
Under the same assumptions of Lemma 1, we have
$$\frac{\partial}{\partial \gamma} \rho_M(X) \Big|_{\gamma=0} = \int_\Omega \phi_\theta^*(\eta(P) - X)\, (dP' - dP) = E_{P'}[\phi_\theta^*(\eta(P) - X)] - E_P[\phi_\theta^*(\eta(P) - X)].$$
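The expression in Theorem 1 can be compared with a finite-difference quotient along the mixture $P_\gamma = (1-\gamma) P + \gamma P'$. The sketch below is an illustration under assumptions made here: the KL generator, for which $\phi_\theta^*(y) = \theta(e^{y/\theta} - 1)$, $\rho(X, P) = \theta \ln E_P[e^{-X/\theta}]$ and $\eta(P) = -\rho(X, P)$, with an illustrative contaminating measure $P'$ that puts more mass on small payoffs. All names are hypothetical.

```python
import math


def rho_kl(x, p, theta):
    """KL case: rho(X, P) = theta * ln E_P[exp(-X / theta)]."""
    return theta * math.log(sum(pi * math.exp(-xi / theta) for xi, pi in zip(x, p)))


def conjugate(y, theta):
    """phi_theta^*(y) = theta * (exp(y / theta) - 1) for the KL generator."""
    return theta * (math.exp(y / theta) - 1.0)


def theorem_derivative(x, p, p_prime, theta):
    """E_{P'}[phi_theta^*(eta(P) - X)] - E_P[phi_theta^*(eta(P) - X)]."""
    eta = -rho_kl(x, p, theta)  # maximizer eta(P) in the KL case
    under_new = sum(w * conjugate(eta - xi, theta) for xi, w in zip(x, p_prime))
    under_ref = sum(w * conjugate(eta - xi, theta) for xi, w in zip(x, p))
    return under_new - under_ref


def finite_difference(x, p, p_prime, theta, gamma=1e-6):
    """Central difference quotient of gamma -> rho(X, P_gamma) at gamma = 0."""
    def mixture(g):
        return [(1 - g) * a + g * b for a, b in zip(p, p_prime)]

    return (rho_kl(x, mixture(gamma), theta) - rho_kl(x, mixture(-gamma), theta)) / (2 * gamma)


x = [0.1 * k for k in range(1, 11)]               # illustrative positions
p = [0.1] * 10                                    # illustrative reference measure P
p_prime = [(11 - k) / 55 for k in range(1, 11)]   # illustrative P', heavier on small payoffs
theta = 0.5
analytic = theorem_derivative(x, p, p_prime, theta)
numeric = finite_difference(x, p, p_prime, theta)
```

Because this $P'$ shifts mass towards the small payoffs, the derivative is positive: the measured risk increases as the reference model is contaminated in that direction.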
In [25], the influence function is described as the degree of sensitivity function in order to evaluate the level of Hampel’s robustness, which is equivalent to the continuity of the risk measure, and is evolved to the index of qualitative robustness between Hampel’s robustness and full tail sensitivity according to [39,40].

4. ( h , ϕ ) -Divergence Risk Measure

Our new extended family of risk measures derives from ( h , ϕ ) -divergence, the definition of which in [27] is recalled below.
Definition 3.
An extension of the ϕ-divergence, called ( h , ϕ ) -divergence, is defined by
$$D_\phi^h(Q,P) = \sum_{a=1}^{A} w_a\, h_a\!\left( \int \phi_a\!\left( \frac{dQ}{dP} \right) dP \right),$$
for $Q \ll P$, where $h = (h_a)_{a=1,\dots,A}$ and, for $a = 1, \dots, A$, $h_a$ are nondecreasing and continuous functions with $h_a(0) = 0$, $w_a$ are positive weights, and the functions $\phi_a$ satisfy the conditions for a $\phi$-divergence risk measure.
We denote by $\mathcal{Q}$ the set of probability measures $Q$ on $(\Omega, \mathcal{F})$ which are absolutely continuous with respect to $P$. For simplicity, here we consider the reduced form of the $(h,\phi)$-divergence for the case $A = 1$, given by
$$D_\phi^h(Q,P) = h\!\left( \int_\Omega \phi\!\left( \frac{dQ}{dP} \right) dP \right).$$
The parameter $\theta$ in the modified $\phi$-divergence risk measure can be interpreted as the simple case of a linear scale distortion of the $\phi$-divergence, whereas the operator $h$ represents the general case of a non-linear distortion.
Definition 4.
Suppose that D ϕ h ( · , P ) is convex. The ( h , ϕ ) -divergence risk measure is defined by
$$\rho_h(X) = \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - D_\phi^h(Q,P) \}.$$
By the definition of a convex measure of risk and the convexity of the penalty term in the dual form, the $(h,\phi)$-divergence risk measure is clearly a convex measure of risk. It reduces to the modified $\phi$-divergence risk measure in Equation (12) when $h$ is linear.
Observing the similar structure to the modified ϕ -divergence risk measure, the following property on the static sensitivity with respect to X still holds.
Proposition 5.
Under the same assumptions and setting as in Proposition 4, and assuming that $Q_0 = \arg\sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - D_\phi^h(Q,P) \}$ exists,
$$\frac{\partial}{\partial r} \rho_h(X_r) \Big|_{r=0} = \lim_{r \to 0} \frac{\rho_h(X_r) - \rho_h(X)}{r} = E_{Q_0}[X - X'].$$
The proof can also be derived from [7,16].
The following theorem shows that, under certain conditions, the $(h,\phi)$-divergence risk measure can be bounded in terms of the OCE-type form provided by the corresponding $(h \circ \phi)$-divergence risk measure.
Theorem 2.
Under the convexity of $D_\phi^h(\cdot, P)$, and assuming that $\phi_h := h \circ \phi$ satisfies the conditions for a $\phi$-divergence, if $h$ is concave,
$$\rho_h(X) \le \rho_{OCE}^h(X);$$
conversely, if $h$ is convex,
$$\rho_h(X) \ge \rho_{OCE}^h(X),$$
with
$$\rho_{OCE}^h(X) = \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - D_{\phi_h}(Q,P) \} = -\sup_{\eta \in \mathbb{R}} \{ \eta - E_P[\phi_h^*(\eta - X)] \},$$
where $\phi_h^*$ is the conjugate function of $\phi_h$.
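Proof. By definition,
$$\rho_h(X) = \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - D_\phi^h(Q,P) \}, \quad \text{or} \quad -\rho_h(X) = \inf_{Q \in \mathcal{Q}} \{ E_Q[X] + D_\phi^h(Q,P) \}.$$
We denote by $v$ the optimal value of the minimization problem on the right-hand side of the above equation (the notation is presented in Appendix A.1):
$$v = \inf_{z \in L_p} \left\{ h\!\left( \int_\Omega \phi(z(\omega))\, dP(\omega) \right) + \int_\Omega X(\omega) z(\omega)\, dP(\omega) \right\} \quad \text{s.t.} \quad \int_\Omega z(\omega)\, dP(\omega) = 1, \quad \alpha \le z(\omega) \le \beta.$$
The Lagrangian dual is given by
$$w := \sup_{\eta \in \mathbb{R}} \left\{ \eta + \inf_{\alpha \le z(\cdot) \le \beta} \left[ h\!\left( \int_\Omega \phi(z(\omega))\, dP(\omega) \right) - \int_\Omega (\eta - X(\omega)) z(\omega)\, dP(\omega) \right] \right\}.$$
If $h(\cdot)$ is concave, denoted by $h_c$ (e.g., the log function), then by Jensen’s inequality and Lemma A1 it follows that
$$w \ge \sup_{\eta \in \mathbb{R}} \left\{ \eta + \inf_{\alpha \le z(\cdot) \le \beta} \left[ \int_\Omega h_c \circ \phi(z(\omega))\, dP(\omega) - \int_\Omega (\eta - X(\omega)) z(\omega)\, dP(\omega) \right] \right\} = \sup_{\eta \in \mathbb{R}} \{ \eta - E_P[\phi_h^*(\eta - X)] \}.$$
By applying Lemma A2, the right-hand side is related to the convex risk measure of the OCE-type form through
$$\rho_{OCE}^h(X) = -\inf_{Q \in \mathcal{Q}} \{ E_Q[X] + D_{\phi_h}(Q,P) \} = -\sup_{\eta \in \mathbb{R}} \{ \eta - E_P[\phi_h^*(\eta - X)] \},$$
where $\phi_h^*$ is the conjugate of $\phi_h := h \circ \phi$. The proof is completed by verifying that the ‘constraint qualification’ stated in Theorem 4.2 of [8] holds, which results in $w = v$. For convex $h$, the proof is similar. □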
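The Jensen step that drives the bound can be seen on a discrete example: for concave $h$ the penalty $D_\phi^h$ is at least as large as $D_{h \circ \phi}$, and for convex $h$ the ordering is reversed, which in turn orders the two risk measures in the opposite direction. The sketch below is an illustration only; the KL generator, the two distortions $\ln(1+y)$ and $e^y - 1$, and the sample distributions are assumptions made here.

```python
import math


def phi_kl(t):
    """KL generator phi(t) = t ln t - t + 1 (non-negative, with phi(1) = 0)."""
    return t * math.log(t) - t + 1.0 if t > 0 else 1.0


def h_phi_divergence(q, p, h):
    """D_phi^h(Q, P) = h( sum_i p_i phi(q_i / p_i) ): distortion applied outside."""
    return h(sum(pi * phi_kl(qi / pi) for qi, pi in zip(q, p)))


def composed_divergence(q, p, h):
    """D_{h o phi}(Q, P) = sum_i p_i h(phi(q_i / p_i)): distortion applied inside."""
    return sum(pi * h(phi_kl(qi / pi)) for qi, pi in zip(q, p))


def h_concave(y):
    """Concave, nondecreasing distortion with h(0) = 0."""
    return math.log(1.0 + y)


def h_convex(y):
    """Convex, nondecreasing distortion with h(0) = 0."""
    return math.exp(y) - 1.0


p = [0.25, 0.25, 0.25, 0.25]  # illustrative reference measure
q = [0.4, 0.3, 0.2, 0.1]      # illustrative alternative measure

concave_outer = h_phi_divergence(q, p, h_concave)
concave_inner = composed_divergence(q, p, h_concave)
convex_outer = h_phi_divergence(q, p, h_convex)
convex_inner = composed_divergence(q, p, h_convex)
```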
Remark 4.
If h is convex, the lower bound of any ( h , ϕ ) -divergence risk measure is its related ϕ-divergence risk measure. However, when h is concave, there may be a potential range of cases for meeting the demands from the industries since they prefer both convex and smaller measures.
Remark 5.
As the ( h , ϕ ) -divergence risk measure extends from a general divergence, the degree of complexity of its sensitivity analysis increases, adapting to multi-parameter forms, such as, for instance, Sharma-Mittal divergence [41,42], rather than the one-parameter cases of Tsallis- or Rényi-divergence risk measures. To handle the sensitivity analysis in relation to the multiple parameters involved, with or without different units, we refer to the framework of the differential importance measure (see [43,44] and references therein for details).
Rényi-divergence risk measure. Intuitively, the Rényi-divergence risk measure is derived as
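$$\rho_R(X) = \sup_{Q \in \mathcal{Q}} \{ E_Q[-X] - D_R(Q,P) \},$$
where
$$D_R(Q,P) = \frac{1}{\alpha - 1} \ln \int_\Omega \left( \frac{dQ}{dP} \right)^{\alpha - 1} dQ = h\!\left( \int_\Omega \phi\!\left( \frac{dQ}{dP} \right) dP \right),$$
with $h(x) = \frac{1}{\alpha - 1} \ln(1 + \varepsilon \cdot x)$ and $\phi(x) = \varepsilon \cdot (x^\alpha - 1)$, with $\varepsilon = \operatorname{sgn}(\alpha - 1)$ and $\alpha \in (0,1)$. In the limiting case $\alpha = 1$, the Rényi divergence (as well as the Tsallis divergence) is equal to the KL divergence.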
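The decomposition of the Rényi divergence into an outer distortion $h$ and a generator $\phi$ can be verified on discrete distributions. In this sketch the symbol `eps` stands for $\operatorname{sgn}(\alpha - 1)$, and the sample distributions and function names are illustrative assumptions.

```python
import math


def renyi_direct(q, p, alpha):
    """D_R(Q, P) = ln( sum_i q_i^alpha p_i^(1 - alpha) ) / (alpha - 1)."""
    total = sum(qi ** alpha * pi ** (1.0 - alpha) for qi, pi in zip(q, p))
    return math.log(total) / (alpha - 1.0)


def renyi_h_phi(q, p, alpha):
    """Same divergence written as h( sum_i p_i phi(q_i / p_i) )."""
    eps = math.copysign(1.0, alpha - 1.0)  # sgn(alpha - 1)

    def phi(t):
        return eps * (t ** alpha - 1.0)

    def h(y):
        return math.log(1.0 + eps * y) / (alpha - 1.0)

    return h(sum(pi * phi(qi / pi) for qi, pi in zip(q, p)))


def kl_divergence(q, p):
    """Relative entropy sum_i q_i ln(q_i / p_i), the alpha -> 1 limit."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, p) if qi > 0)


p = [0.25, 0.25, 0.25, 0.25]  # illustrative reference measure
q = [0.4, 0.3, 0.2, 0.1]      # illustrative alternative measure

direct = renyi_direct(q, p, 0.5)
composed = renyi_h_phi(q, p, 0.5)
near_one = renyi_direct(q, p, 0.999)
kl = kl_divergence(q, p)
```

Values of `alpha` close to 1 bring the Rényi divergence close to the KL divergence, in line with the limiting case mentioned above.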
Remark 6.
The parameter α of ρ R ( X ) or ρ T ( X ) , as well as θ in ρ M ( X ) , can be interpreted as calibrating parameters of the level of ambiguity aversion in the preferences. However, θ essentially states the balance between the expected risk and the ambiguity aversion, yet α determines the structural profile of ambiguity, which corresponds to the nature of the preferences in the decision-making agents. Their behaviors are shown in the design of the simulation in the next section. The Rényi-divergence risk measure, among others, is deeply discussed in the recent paper [45].
It is easy to check that the Rényi-divergence risk measure is a convex measure of risk when $\alpha \in (0,1)$. Then, the lower bound of the Rényi-divergence risk measure via the convexity of $h$ is
$$\rho_R(X) \ge \rho_{OCE}^R(X) = -\sup_{\eta \in \mathbb{R}} \{ \eta - E_P[\phi_h^*(\eta - X)] \},$$
where
$$\phi_h^*(x) = \sup_{t \in \mathbb{R}} \{ t \cdot x - h \circ \phi(t) \} = \sup_{t \in \mathbb{R}} \{ t \cdot x + \ln(t^\alpha)/(1 - \alpha) \}.$$

5. Simulated Examples

The aim of the simulations described in this section is to compare the performance of the different divergence-based risk measures. The first comparison is between the Rényi-divergence risk measure and the Tsallis-divergence risk measure for $\alpha \in (0,1]$. In both divergences, which constitute an important reference in information theory and its multiple applications [46], the parameter $\alpha$ calibrates the structural deviations between two probability measures and hence, as mentioned before, the profile of ambiguity aversion of the agent in the context of preferences. The second comparison assesses the effect on the Tsallis-divergence risk measure of the extra distortion by a non-linear or linear function $h$. In this case, the parameter $\alpha$ is in $(0, \infty)$.
In the setting of the dual representation of convex measures of risk, it is difficult to perform numerical studies, which depend on the nature of the measure $P$ and on the supremum over $Q$, without a quantified OCE-type expression. Therefore, it is natural to base the simulation on the idea of perturbation from compositional data analysis. We assume an initial discrete distribution $p_i$ of the financial position $x_i$, and the optimized distribution $q_i$, for $i = 1, \dots, n$, where $n$ denotes the size of the sample space, such that $\sum_{i=1}^n p_i = 1$ and $\sum_{i=1}^n q_i = 1$. In this case, our proposed risk measure can be rewritten in the form
$$\rho_h(X) = \max_{q_i} \left\{ -\sum_{i=1}^n x_i q_i - h\!\left( \sum_{i=1}^n p_i\, \phi(q_i / p_i) \right) \right\} \quad \text{s.t.} \quad \sum_{i=1}^n q_i = 1.$$
The complexity of this form increases rapidly with the size of the sample space, so in the numerical studies we simplify the setting by taking $n = 10$ and $x_i$ ranging from 0.1 to 1. Since the reference distribution $p_i$ clearly influences the value of the risk measure, we consider five scenarios for $p_i$ with respect to $x_i$; see Figure 1. Scenario 1 is the equiprobable case, $p_i = 1/n$; Scenario 2 assigns larger probabilities to the values of $x_i$ in the middle range and smaller probabilities to the small and large values of $x_i$; Scenario 3 puts large probabilities on the small-value financial positions; Scenario 4 is the reverse of Scenario 3; Scenario 5 is the reverse of Scenario 2.
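The discrete maximization over $q$ can be approximated with elementary tools. The sketch below is not the procedure used for the figures, which the text does not specify; it is a hypothetical pairwise mass-transfer search, written here for the Rényi penalty under Scenario 1, that starts from $q = p$, moves a small amount of probability between two atoms whenever this increases the objective, and halves the step once no move helps.

```python
import math


def renyi_divergence(q, p, alpha):
    """D_R(Q, P) = ln( sum_i q_i^alpha p_i^(1 - alpha) ) / (alpha - 1), 0 < alpha < 1."""
    total = sum(max(qi, 0.0) ** alpha * pi ** (1.0 - alpha) for qi, pi in zip(q, p))
    return math.log(total) / (alpha - 1.0)


def objective(q, x, p, alpha):
    """E_Q[-X] - D_R(Q, P) for discrete positions x and candidate weights q."""
    return -sum(qi * xi for qi, xi in zip(q, x)) - renyi_divergence(q, p, alpha)


def renyi_risk(x, p, alpha, step=0.05, tol=1e-6):
    """Approximate rho_R by moving probability mass between pairs of atoms."""
    q = list(p)                        # start from the reference distribution
    best = objective(q, x, p, alpha)
    n = len(q)
    while step > tol:
        improved = False
        for i in range(n):
            for j in range(n):
                if i == j or q[j] < step:
                    continue           # keep every q_j non-negative
                q[i] += step
                q[j] -= step
                value = objective(q, x, p, alpha)
                if value > best + 1e-15:
                    best, improved = value, True
                else:                  # undo a move that does not help
                    q[i] -= step
                    q[j] += step
        if not improved:
            step /= 2.0                # refine the search once no move helps
    return best, q


x = [0.1 * k for k in range(1, 11)]    # positions 0.1, ..., 1.0
p = [0.1] * 10                         # Scenario 1: equiprobable reference
risk, q_opt = renyi_risk(x, p, alpha=0.5)
```

The returned value is only a lower approximation of the supremum. It always lies between the expected loss under $P$, attained at $q = p$, and the worst-case loss, because the Rényi penalty is non-negative. Replacing `renyi_divergence` with another penalty gives the corresponding sketch for other members of the family.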
Figure 2 depicts the performance of Rényi-divergence risk measure (the solid line in blue) and Tsallis-divergence risk measure (the solid line in red) under the five scenarios considered, with α ( 0 , 1 ] . All the five subfigures show the same declining shape of the values for these two risk measures for increasing α , hence the structures of the two penalties do not change that much in the measure of risk according to the same α ; the value of Rényi-divergence risk measure is slightly smaller than that of Tsallis-divergence in the last subfigure of Figure 2. In addition, the deviation in Scenario 4, which puts the larger probabilities on the larger values, is smaller than in other scenarios when α is small, whereas Scenario 3, which is the reverse of Scenario 4, shows almost no deviation of both measures of risk.
For the second comparison, we consider three non-linear and linear deformations of the Tsallis divergence: (1) the second-order polynomial $h(x) = x^2$; (2) the exponential $h(x) = e^x - 1$; and (3) the linear case with $\theta = 0.3$. The results are presented in Figure 3. The Tsallis-family risk measures all exhibit the same declining shape, with values decreasing for increasing $\alpha$. In particular, the value for the second-order polynomial (dashed blue line) is visibly larger but declines more gently than that of the original Tsallis-divergence risk measure (solid yellow line), and it also leads to a much stronger distortion of the scale than the exponential deformation (dash–dot red line), whose value is slightly smaller than that of the Tsallis-divergence risk measure. Moreover, even for the linear function (solid purple line) operating on the Tsallis divergence, the risk measure reveals non-linear variations. In short, all the figures confirm that the divergence-based risk measure remains sensitive to the reference distribution of the financial positions.

6. Conclusions

A basic and straightforward approach to the sensitivity analysis of the modified $\phi$-divergence convex risk measure, based on Gâteaux differentiability, is proposed. Further, by non-linearly distorting the $\phi$-divergence, a wider range of the divergence family of convex risk measures is explored. In particular, interest is focused on the deviations of the risk measure derived from slight modifications of both the input financial position in $\mathcal{X}$ and the probability measure in $\mathcal{M}$, drawing on the related formulations and frameworks of preference theory and robust statistics, respectively. The extension introduced, called the $(h, \phi)$-divergence risk measure, is inspired by the dual representation of convex risk measures and covers a larger variety of divergences as penalty functionals; it also enriches divergence preferences by expanding the class of ambiguity attitudes. A lower bound for the $(h, \phi)$-divergence risk measure is established in the case where $h$ is a convex function, in terms of the related $(h \circ \phi)$-divergence risk measure.
Several directions for the study of the $(h, \phi)$-divergence risk measure remain open beyond the sensitivity and boundedness properties discussed in this paper. First, a relationship with the certainty equivalent, similar to the general Donsker-Varadhan formula or to the relation between the $\phi$-divergence risk measure and the OCE, would be useful for efficiently quantifying risk in practical applications. The sensitivity of the $(h, \phi)$-divergence risk measure with respect to the reference probability could then also be studied through this relation. Furthermore, as shown in [39,40], the framework of risk measures on Orlicz space introduced by [47], which applies only to law-invariant convex risk measures, is useful for studying their qualitative robustness through Kusuoka representations. Exploring the specific type and comparative degree of robustness for general convex risk measures defined through the dual representation, such as the $\phi$-divergence or $(h, \phi)$-divergence risk measures, also constitutes an important challenge for continuing research.

Author Contributions

Conceptualization, M.X. and J.M.A.; Formal analysis, M.X. and J.M.A.; Funding acquisition, J.M.A.; Investigation, M.X. and J.M.A.; Methodology, M.X. and J.M.A.; Visualization, M.X. and J.M.A.; Writing—original draft, M.X. and J.M.A.


Funding

This research was funded by MINECO/FEDER, EU grant MTM2015-70840-P and MCIU/AEI/FEDER, UE grant PGC2018-098860-B-I00.


Acknowledgments

M.X. acknowledges the support of the Erasmus+:I.D (Partner Countries) Programme, University of Granada and Centre for European Studies, Sichuan University. The authors thank the reviewers for constructive and insightful comments that led to a significant improvement of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A.

Appendix A.1. Preliminary and Setup

This section follows the settings of [8] in the space $L^p(\Omega, \mathcal{F}, P)$ and its dual space. Let $(\Omega, \mathcal{F})$ be a measurable space equipped with the $\sigma$-algebra $\mathcal{F}$. Consider two probability distributions $P$ and $Q$, and let $\mu$ be an arbitrary dominating positive measure of $P$ and $Q$, such that both $P$ and $Q$ are absolutely continuous with respect to $\mu$ on $\mathcal{F}$, which we denote by $P \ll \mu$, $Q \ll \mu$. By the Radon-Nikodym theorem, there exist $\mathcal{F}$-measurable functions $x, y \ge 0$ representing the densities of $P$ and $Q$ with respect to $\mu$, respectively, written as
$$x = \frac{dP}{d\mu}, \qquad y = \frac{dQ}{d\mu}.$$
For $1 \le p \le +\infty$, let $L^p := L^p(\Omega, \mathcal{F}, P)$ be the linear space of measurable real-valued functions $f: \Omega \to \mathbb{R}$ with $\|f\|_p < \infty$. For $p \in [1, +\infty)$, we denote by $L^q$ its dual space, with $q \in (1, +\infty]$ and $p + q = qp$. Furthermore, for $y \in L^p$ and $X \in L^q$,
$$\langle y, X \rangle = \int_\Omega y(\omega) X(\omega) \, d\mu(\omega) = \int_\Omega X(\omega) \, dQ(\omega) = E_Q[X],$$
and this partially leads to the Legendre-Fenchel form for the risk measure.

Appendix A.2. Related Results

Let $\phi: \mathbb{R} \to (-\infty, +\infty]$ be a proper closed convex function such that $\operatorname{dom} \phi$ is an interval with endpoints $a < b$, thus $\operatorname{int} \operatorname{dom} \phi = (a, b)$. Since $\phi$ is closed,
$$\lim_{t \to a^+} \phi(t) = \phi(a), \qquad \lim_{t \to b^-} \phi(t) = \phi(b)$$
if $a$ and $b$ are finite. It is assumed that $1 \in \operatorname{int} \operatorname{dom} \phi$ and that the minimum of $\phi$ is 0. The class of such functions is denoted by $\Phi$.
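Definition A1 ($\phi$-divergence).
Given $\phi \in \Phi$, the $\phi$-divergence of the probability measure $Q$ with respect to $P$ is
$$D_\phi(Q, P) = \int_\Omega \phi\left(\frac{dQ}{dP}\right) dP, \quad \text{if } Q \ll P;$$
otherwise, $D_\phi(Q, P) = +\infty$.
To avoid some meaningless cases, it is assumed that
$$\phi(0) < \infty; \qquad 0 \, \phi\left(\frac{0}{0}\right) = 0; \qquad 0 \, \phi\left(\frac{s}{0}\right) = \lim_{\varepsilon \to 0} \varepsilon \, \phi\left(\frac{s}{\varepsilon}\right) = s \lim_{t \to +\infty} \frac{\phi(t)}{t}, \quad s > 0.$$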
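For a finite sample space, the definition and the conventions above can be sketched as follows. The function names, and the explicit `slope_at_inf` argument standing for $\lim_{t \to +\infty} \phi(t)/t$, are choices made for this illustration only. For cells with $p_i = 0$ the sketch applies the conventions of the last display rather than returning $+\infty$ outright.

```python
import math

def phi_kl(t):
    """phi(t) = t ln t - t + 1 for t >= 0 (with phi(0) = 1), +inf otherwise."""
    if t < 0:
        return math.inf
    if t == 0:
        return 1.0
    return t * math.log(t) - t + 1.0

def phi_divergence(q, p, phi, slope_at_inf):
    """Discrete D_phi(Q, P) = sum_i p_i * phi(q_i / p_i) with the stated conventions.

    slope_at_inf is the caller-supplied value of lim_{t -> +inf} phi(t) / t.
    """
    total = 0.0
    for qi, pi in zip(q, p):
        if pi > 0:
            total += pi * phi(qi / pi)
        elif qi > 0:
            total += qi * slope_at_inf   # 0 * phi(s / 0) = s * lim phi(t) / t
        # the case p_i = q_i = 0 contributes 0 * phi(0 / 0) = 0
    return total

# Kullback-Leibler case: phi(t) / t grows without bound, so the slope is +inf.
print(phi_divergence([1.0, 0.0], [0.5, 0.5], phi_kl, math.inf))
print(phi_divergence([0.5, 0.5], [1.0, 0.0], phi_kl, math.inf))
# phi(t) = |t - 1| has slope 1 at infinity and yields sum_i |q_i - p_i|.
print(phi_divergence([0.5, 0.5], [1.0, 0.0], lambda t: abs(t - 1.0), 1.0))
```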
Definition A2 (Legendre-Fenchel transform).
Let $\phi \in \Phi$. The conjugate of a real-valued function $\phi$ on $\mathbb{R}^d$ is another real-valued function on $\mathbb{R}^d$, defined as
$$\phi^*(x) := \sup_{t \in \mathbb{R}^d} \{ t \cdot x - \phi(t) \}.$$
When $d = 1$, the conjugate $\phi^*$ is a closed proper convex function, with $\operatorname{int} \operatorname{dom} \phi^* = (a^*, b^*)$, where
$$a^* = \lim_{t \to -\infty} t^{-1} \phi(t) \in [-\infty, +\infty), \qquad b^* = \lim_{t \to +\infty} t^{-1} \phi(t) \in (-\infty, +\infty].$$
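The transform can be checked numerically in the case $d = 1$. The sketch below approximates the supremum on a uniform grid for $\phi(t) = t \ln t - t + 1$ and compares it with the closed form $\phi^*(s) = e^s - 1$, which follows from the stationarity condition $s = \ln t$. The grid range, the step count and the function names are arbitrary choices for this example; the grid only covers maximizers $t = e^s \le 20$.

```python
import math

def phi_kl(t):
    """phi(t) = t ln t - t + 1 for t >= 0 (with phi(0) = 1), +inf otherwise."""
    if t < 0:
        return math.inf
    if t == 0:
        return 1.0
    return t * math.log(t) - t + 1.0

def conjugate_on_grid(phi, s, t_max=20.0, steps=20000):
    """Approximate phi*(s) = sup_t { t * s - phi(t) } on a uniform grid over [0, t_max]."""
    best = -math.inf
    for k in range(steps + 1):
        t = t_max * k / steps
        best = max(best, t * s - phi(t))
    return best

for s in (-1.0, 0.0, 1.0):
    approx = conjugate_on_grid(phi_kl, s)
    exact = math.exp(s) - 1.0
    print(f"s={s}: grid={approx:.6f}, closed form={exact:.6f}")
```

Because the grid is a subset of the domain, the grid value never exceeds the true supremum and approaches it from below as the grid is refined.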
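Lemma A1 (The interchange of minimization and integration).
Let $\Omega$ be a $\sigma$-finite measure space, and let $\mathcal{X} := L^p(\Omega, \mathcal{F}, P)$, $p \in [1, +\infty]$. Let $g: \mathbb{R} \times \Omega \to (-\infty, +\infty]$ be a normal integrand. Then,
$$\inf_{x \in \mathcal{X}} \int_\Omega g(x(\omega), \omega) \, dP(\omega) = \int_\Omega \inf_{s \in \mathbb{R}} g(s, \omega) \, dP(\omega).$$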
Lemma A2 (The generalized Donsker-Varadhan variational formula).
Let $\phi: \mathbb{R} \to (-\infty, +\infty]$ be a closed convex function with $g(0) = 0$. Then, for $X \in L^\infty$,
$$\inf_{Q \in \mathcal{Q}} \{ E_Q[X] + D_\phi(Q, P) \} = \sup_{\eta \in \mathbb{R}} \{ \eta - E_P[\phi^*(\eta - X)] \},$$
where $\phi^*$ denotes the conjugate of $\phi$ via the Legendre-Fenchel transform. In particular, for $\phi(x) = x \ln x - x + 1$ if $x \ge 0$ and $\phi(x) = +\infty$ otherwise, it retrieves the classical Donsker-Varadhan variational formula from large deviation theory.
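The classical case can be verified on a small discrete example. The sketch below assumes $\phi(x) = x \ln x - x + 1$, so that $D_\phi$ is the Kullback-Leibler divergence and $\phi^*(s) = e^s - 1$, and uses an illustrative ten-point position with uniform reference probabilities. Both sides are evaluated at their known optimizers, $q_i \propto p_i e^{-x_i}$ and $\eta = -\ln E_P[e^{-X}]$, and should agree with $-\ln E_P[e^{-X}]$.

```python
import math

def kl(q, p):
    """Kullback-Leibler divergence sum_i q_i ln(q_i / p_i) for strictly positive p."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, p) if qi > 0)

def primal(q, x, p):
    """E_Q[X] + KL(Q, P), the expression minimized on the left-hand side."""
    return sum(xi * qi for xi, qi in zip(x, q)) + kl(q, p)

def dual(eta, x, p):
    """eta - E_P[phi*(eta - X)] with phi*(s) = exp(s) - 1, maximized on the right-hand side."""
    return eta - sum(pi * (math.exp(eta - xi) - 1.0) for xi, pi in zip(x, p))

x = [(i + 1) / 10 for i in range(10)]   # illustrative positions 0.1, ..., 1.0
p = [0.1] * 10                          # illustrative uniform reference measure

z = sum(pi * math.exp(-xi) for xi, pi in zip(x, p))        # E_P[exp(-X)]
q_star = [pi * math.exp(-xi) / z for xi, pi in zip(x, p)]  # minimizing measure
eta_star = -math.log(z)                                    # maximizing eta

lhs = primal(q_star, x, p)
rhs = dual(eta_star, x, p)
print(f"lhs={lhs:.10f}, rhs={rhs:.10f}, -ln E_P[exp(-X)]={-math.log(z):.10f}")
```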
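Lemma A3 (Varadhan's lemma).
Suppose that $\{X^\varepsilon\}$ satisfies a large deviation principle on $\mathcal{X}$ with good rate function $I$. Then, for any bounded continuous function $\varphi: \mathcal{X} \to \mathbb{R}$, we have
$$\lim_{\varepsilon \to 0} \varepsilon \ln E\left[ e^{\varphi(X^\varepsilon)/\varepsilon} \right] = \sup_{x \in \mathcal{X}} [\varphi(x) - I(x)].$$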


References

  1. Artzner, P.; Delbaen, F.; Eber, J.M.; Heath, D. Coherent measures of risk. Math. Financ. 1999, 9, 203–228.
  2. Föllmer, H.; Schied, A. Robust preferences and convex measures of risk. In Advances in Finance and Stochastics: Essays in Honour of Dieter Sondermann; Springer Berlin Heidelberg: Berlin, Germany, 2002; pp. 39–56.
  3. Frittelli, M.; Gianin, E.R. Putting order in risk measures. J. Bank Financ. 2002, 26, 1473–1486.
  4. Föllmer, H.; Weber, S. The axiomatic approach to risk measures for capital determination. Annu. Rev. Financ. Econ. 2015, 7, 301–337.
  5. Föllmer, H.; Schied, A. Convex and Coherent Risk Measures. 2008. Available online: (accessed on 24 June 2019).
  6. Föllmer, H.; Knispel, T. Entropic risk measures: coherence vs. convexity, model ambiguity and robust large deviations. Stoch. Dynam. 2011, 11, 333–351.
  7. Laeven, R.J.A.; Stadje, M. Entropy coherent and entropy convex measures of risk. Math. Oper. Res. 2013, 38, 265–293.
  8. Ben-Tal, A.; Teboulle, M. An old-new concept of convex risk measures: The optimized certainty equivalent. Math. Financ. 2007, 17, 449–476.
  9. Vinel, A.; Krokhmal, P.A. Certainty equivalent measures of risk. Ann. Oper. Res. 2017, 249, 75–95.
  10. Kaina, M.; Rüschendorf, L. On convex risk measures on Lp-spaces. Math. Methods Oper. Res. 2009, 69, 475–495.
  11. Ahmadi-Javid, A. Entropic Value-at-Risk: a new coherent risk measure. J. Optimiz. Theory App. 2012, 155, 1105–1123.
  12. Pele, D.; Lazar, E.; Dufour, A. Information entropy and measures of market risk. Entropy 2017, 19, 226.
  13. Wang, S.S.; Young, V.R.; Panjer, H.H. Axiomatic characterization of insurance prices. Insur. Math. Econ. 1997, 21, 173–183.
  14. Frittelli, M.; Gianin, E.R. Law invariant convex risk measures. In Advances in Mathematical Economics; Springer Tokyo: Tokyo, Japan, 2005; pp. 33–46.
  15. Zhou, R.; Liu, X.; Yu, M.; Huang, K. Properties of risk measures of generalized entropy in portfolio selection. Entropy 2017, 19, 657.
  16. Maccheroni, F.; Marinacci, M.; Rustichini, A. Ambiguity aversion, robustness, and the variational representation of preferences. Econometrica 2006, 74, 1447–1498.
  17. Ellsberg, D. Risk, ambiguity, and the Savage axioms. Q. J. Econ. 1961, 75, 643–669.
  18. Dupuis, P.; Ellis, R.S. A Weak Convergence Approach to the Theory of Large Deviations; John Wiley & Sons: New York, NY, USA, 1997; Volume 313.
  19. Ben-Tal, A.; Teboulle, M. Penalty functions and duality in stochastic programming via φ-divergence functionals. Math. Oper. Res. 1987, 12, 224–240.
  20. Greselin, F.; Zitikis, R. From the classical Gini index of income inequality to a new Zenga-type relative measure of risk: A modeller’s perspective. Econometrics 2018, 6, 4.
  21. Maccheroni, F.; Marinacci, M.; Rustichini, A. A Variational Formula for the Relative Gini Concentration Index. 2004. Available online: (accessed on 24 June 2019).
  22. Borgonovo, E.; Cappelli, V.; Maccheroni, F.; Marinacci, M. Risk analysis and decision theory: A bridge. Eur. J. Oper. Res. 2018, 264, 280–293.
  23. Gourieroux, C.; Laurent, J.; Scaillet, O. Sensitivity analysis of Values at Risk. J. Empir. Finance 2000, 7, 225–245.
  24. Scaillet, O. Nonparametric estimation and sensitivity analysis of Expected Shortfall. Math. Financ. 2004, 14, 115–129.
  25. Cont, R.; Deguest, R.; Scandolo, G. Robustness and sensitivity analysis of risk measurement procedures. Quant. Financ. 2010, 10, 593–606.
  26. Borgonovo, E.; Plischke, E. Sensitivity analysis: a review of recent advances. Eur. J. Oper. Res. 2016, 248, 869–887.
  27. Menéndez, M.L.; Morales, D.; Pardo, L.; Salicrú, M. Asymptotic behaviour and statistical applications of divergence measures in multinomial populations: a unified study. Stat. Papers 1995, 36, 1–29.
  28. Geissel, S.; Sass, J.; Seifried, F.T. Optimal expected utility risk measures. Stat. Risk Model. 2017, 35, 73–87.
  29. Kusuoka, S. On law invariant coherent risk measures. In Advances in Mathematical Economics; Springer: Tokyo, Japan, 2001; pp. 83–95.
  30. Dentcheva, D.; Penev, S.; Ruszczyński, A. Kusuoka representation of higher order dual risk measures. Ann. Oper. Res. 2010, 181, 325–335.
  31. Csiszár, I. Information-type measures of difference of probability distributions and indirect observations. Stud. Sci. Math. Hung. 1967, 2, 299–318.
  32. Ghirardato, P.; Marinacci, M. Ambiguity made precise: A comparative foundation. J. Econ. Theory 2002, 102, 251–289.
  33. Cerreia-Vioglio, S.; Maccheroni, F.; Marinacci, M.; Montrucchio, L. Classical subjective expected utility. Proc. Natl. Acad. Sci. USA 2013, 110, 6754–6759.
  34. Borgonovo, E.; Marinacci, M. Decision analysis under ambiguity. Eur. J. Oper. Res. 2015, 244, 823–836.
  35. Huber, P.J. Robust Statistics; Wiley Series in Probability and Statistics; John Wiley & Sons: New York, NY, USA, 1981.
  36. Föllmer, H.; Schied, A. Stochastic Finance: An Introduction in Discrete Time, 4th ed.; De Gruyter: Berlin, Germany, 2016.
  37. Centrone, F.; Gianin, E.R. Capital allocation à la Aumann-Shapley for non-differentiable risk measures. Eur. J. Oper. Res. 2018, 267, 667–675.
  38. Aumann, R.J. Values of markets with a continuum of traders. Econometrica 1975, 43, 611–646.
  39. Krätschmer, V.; Schied, A.; Zähle, H. Qualitative and infinitesimal robustness of tail-dependent statistical functionals. J. Multivariate. Anal. 2012, 103, 35–47.
  40. Krätschmer, V.; Schied, A.; Zähle, H. Comparative and qualitative robustness for law-invariant risk measures. Financ. Stoch. 2014, 18, 271–295.
  41. Sharma, B.D.; Mittal, D.P. New nonadditive measures of entropy for discrete probability distributions. Casp. J. Math. Sci. 1975, 10, 28–40.
  42. Nielsen, F.; Nock, R. A closed-form expression for the Sharma–Mittal entropy of exponential families. J. Phys. A Math. Theor. 2011, 45, 032003.
  43. Antoniano-Villalobos, I.; Borgonovo, E.; Siriwardena, S. Which parameters are important? Differential importance under uncertainty. Risk Anal. 2018, 38, 2459–2477.
  44. Tsanakas, A.; Millossovich, P. Sensitivity analysis using risk measures. Risk Anal. 2016, 36, 30–48.
  45. Pichler, A.; Schlotter, R. Entropy based risk measures. Eur. J. Oper. Res. 2019.
  46. Naudts, J. Generalised Thermostatistics; Springer: London, UK, 2011.
  47. Cheridito, P.; Li, T. Risk measures on Orlicz hearts. Math. Financ. 2009, 19, 189–214.
Figure 1. The setting of five scenarios on the reference probability measure P.
Figure 2. Values of Rényi-divergence and Tsallis-divergence risk measures for the different scenarios and their deviations.
Figure 3. Values of Tsallis-divergence risk measure and corresponding nonlinearly and linearly distorted risk measures for the different scenarios.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (
Entropy, EISSN 1099-4300. Published by MDPI AG, Basel, Switzerland.