Next Article in Journal
Symmetrical Traditional Patterns and User Perception: A Study on Innovation in Home Textile Design
Next Article in Special Issue
Hurwicz-Type Optimal Control Problem for Uncertain Singular Non-Causal Systems
Previous Article in Journal
Research on Coordinated Control of Multi-PMSM for Shaftless Overprinting System
Previous Article in Special Issue
Robust Load Frequency Control in Hybrid Microgrids Using Type-3 Fuzzy Logic Under Stochastic Variations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Symmetric Adjustable Tail-Risk Measure for Distributionally Robust Optimization in Portfolio Allocation

School of Mathematics and Statistics, Henan University of Science and Technology, Luoyang 471003, China
*
Author to whom correspondence should be addressed.
Symmetry 2025, 17(6), 959; https://doi.org/10.3390/sym17060959
Submission received: 14 April 2025 / Revised: 29 May 2025 / Accepted: 10 June 2025 / Published: 17 June 2025
(This article belongs to the Special Issue Symmetry in Optimal Control and Applications)

Abstract

In this study, we begin by extending the mathematical formulation of the expectile risk measure through a key modification: replacing the expectation in its defining equation with expected shortfall. This substitution leads to a revised risk measure that more precisely captures downside risk. To handle the uncertainty of the underlying distribution, we then adopt a distributionally robust optimization framework. Notably, this robust optimization problem can be reformulated as a linear programming problem, and by employing suitable approximation techniques, we derive an analytical solution. In numerical experiments, our portfolio problem exhibits superior performance when compared to several traditional and distributionally robust optimized portfolio problems.

1. Introduction

In the context of portfolio optimization, while Markowitz’s [1] mean–variance question provides a theoretical foundation for asset allocation, its limitations have become increasingly evident in practical applications. Variance, as a risk measure, fails to distinguish between different types of risks and overlooks the distribution of risk, which may expose investors to unacceptable levels of loss. Additionally, parameter estimation errors, particularly in the covariance matrix and mean vector, can lead to significant variations in the optimization outcomes, thereby impacting the robustness of the portfolio [2].
To overcome these limitations, new risk measures have been proposed, with the expectile emerging as a prominent tool in both statistics and finance in recent years [3]. Compared to traditional risk measures such as the Value at Risk (VaR) and expected shortfall (ES), the expectile provides a more flexible approach to capturing different degrees of risk aversion by minimizing the weighted mean squared error. By adjusting parameter α , the expectile allows for greater emphasis on either the lower or upper tail of the distribution, making it suitable for a variety of investment objectives. This method not only effectively complements traditional risk measures but also offers a more refined and personalized risk control strategy for portfolio optimization.
Formally, the α -expectile [3] is defined by
q α ( X ) = arg min d R α E ( X d ) + 2 + ( 1 α ) E ( X d ) 2 ,
where α is the confidence level, ( x ) + denotes the positive part, and ( x ) denotes the negative part. According to [4], q α ( X ) is the unique solution to this optimization problem if and only if
α E ( X q α ( X ) ) + = ( 1 α ) E ( X q α ( X ) ) .
When α = 0.5 , the expectile coincides with the mean of X. As α moves above 0.5 , q α ( X ) progressively exceeds the mean, reflecting an increasing focus on higher losses (or lower returns). Conversely, when α < 0.5 , the expectile shifts below the mean, emphasizing outcomes below the central tendency. These features allow practitioners to adjust α according to the severity of risks they wish to emphasize, providing a flexible measure that can better capture tail risks and adapt to changing market conditions. This makes the expectile a valuable tool for tailoring risk management strategies in investment portfolios.
In the application domain, ref. [3] demonstrates that expectiles serve as rigorous and efficient risk assessment tools, characterized by properties such as coherence, elicitability, non-negative risk margins, and adaptability to structured contracts. Building on this foundation, ref. [5] provides closed-form expectile(ERM) formulas via generalized Fourier inversion, validates them across five market models, and proposes a 1.25 ERM-to-VaR scaling for Basel compliance. As coherent measures (with α 0.5 ), expectiles enhance regulatory compliance and facilitate comprehensive risk verification, while optimal reinsurance strategies effectively streamline the trade-off between risk and cost.
Additionally, Taylor’s Conditional Autoregressive Expectiles (CARE) model [6] employs asymmetric least squares to capture autoregressive structures, enabling the dynamic estimation of VaR and ES without imposing restrictive distributional assumptions. Empirical evidence indicates that the CARE model attains accuracy comparable to that of historical simulation and GARCH-EVT approaches—especially in estimating extreme quantiles—thereby overcoming the limitation of Conditional Autoregressive Value-at-Risk methods, which focus solely on VaR, and ultimately offering a more comprehensive framework for financial risk management.
To further enhance the applicability of these tools, this paper introduces a generalization of the expectile concept to encompass expected shortfall at a confidence level β , which assesses the anticipated value of losses across varying confidence levels. Moreover, this paper also attempts to use expectile for portfolio decision making. This generalization not only preserves the advantageous properties of expected shortfall but also integrates the flexibility inherent in the expectile, thereby facilitating a more comprehensive risk assessment that accommodates diverse risk preferences and confidence levels.
In this paper, we generalize the expectile, and its generalized form can be expressed as
α ES β 1 ( ( X d ) + ) = ( 1 α ) ES β 2 ( ( X d ) ) , d R , β 1 , β 2 ( 0 , 1 ) .
In this equation, ES β 1 ( ( X d ) + ) represents the expected shortfall at confidence level β 2 for the positive part of the loss X d , while ES β ( ( X d ) ) corresponds to the expected shortfall for the negative part. The solution d to this equation is referred to as the ES-expectile, denoted as q α , β 1 , β 2 ( X ) . When β 1 = β 2 = 0 , q α , β 1 , β 2 ( X ) = q α ( X ) .
The expected shortfall (ES) [7] is defined as
ES β ( X ) = 1 1 β β 1 VaR θ ( X ) d θ ,
where the Value at Risk (VaR) at confidence level θ is mathematically defined as
VaR θ ( X ) = inf { x R : P ( X x ) θ } .
Additionally, for ES, there is the Rockafellar–Uryasev formula [7], which simplifies the computation of the conditional risk measure ES. The value of ES τ is obtained by solving the following optimization problem:
ES τ ( Z ) = inf ζ R ζ + 1 1 τ E [ ( Z ζ ) + ] .
This formulation emphasizes the use of different weights for losses above and below a specified threshold, under varying confidence levels, thereby capturing the severity of extreme risk scenarios more effectively than traditional measures such as VaR.
To address this uncertainty, distributionally robust optimization (DRO) has been proposed as an effective modeling technique. The main objective of DRO is to find decisions that minimize the worst-case expected cost over an ambiguity set, which is constructed using the distributional information of uncertain data. Compared to traditional robust optimization, DRO utilizes available distributional information more effectively, making it a more conservative approach in handling uncertainty [8].
In recent years, distributionally robust optimization methods based on Wasserstein ambiguity sets have gained significant attention. To the best of our knowledge, Mohajerin Esfahani and Kuhn [9] first explored the convex reformulation of the Wasserstein distributionally robust optimization question, demonstrating several desirable properties of this approach. Gao subsequently extended these properties and explored the connection between distributionally robust optimization and variation regularization [10,11]. Today, DRO is widely applied in various fields, including machine learning [12] and portfolio optimization [13,14,15,16,17,18].
We summarize our main contributions in this paper as follows:
  • Expected Shortfall Expectile (ES-expectile). We introduce a new class of risk measures by substituting the expectation in the classical expectile definition with expected shortfall (ES). This modification preserves the coherent risk measure spirit of ES while maintaining the sensitivity of expectiles to the full distribution, yielding a refined metric that captures the entire left-tail risk profile more faithfully than either the ES or expectile alone.
  • Distributionally Robust ES-expectile via Wasserstein Ambiguity. To counter the instability arising from finite-sample empirical distributions, we formulate the ES-expectile portfolio problem within a distributionally robust optimization framework whose ambiguity set is a Wasserstein ball centered at the empirical distribution. We prove that the resulting min–max problem can be conservatively reformulated as a computationally tractable program that can be solved reliably with standard optimization software.
  • Empirical Superiority and Extreme-Weight Adjustment. Out-of-sample experiments show that our distributionally robust ES-expectile question (DRES-expectile) delivers higher Sharpe ratios and Sortino ratios compared with the classical ES-expectile portfolio question, traditional portfolio approaches, and Wasserstein DRO portfolios. Although the DRES-expectile occasionally yields extreme allocations, we mitigate this behaviour with a lightweight extreme-weight adjustment heuristic that retains most of the performance gains while reducing variance.
The remainder of this paper is organized as follows. In Section 2, the definition of the ES-expectile is presented, and its favorable properties are demonstrated through a series of lemmas and theorems, culminating in the introduction of the ES-expectile question. Section 3 begins with an introduction to the theory of distributionally robust optimization along with related theorems. Based on this foundation, the DRES-expectile is derived, and its degenerate form is also presented. Section 4 further simplifies the DRES-expectile question, and Section 5 presents numerical experiments that highlight the unique features of our question and provide guidance for portfolio adjustment decisions. The results indicate that our question outperforms both traditional portfolio questions and those based on distributionally robust optimization. Section 6 provides a dedicated discussion of the managerial and investor implications of the proposed DRES-expectile framework and translates the empirical evidence into actionable guidance for future portfolio optimization practice.
For clarity and ease of reference, Table 1 lists the key symbols and abbreviations employed in this paper.
Notations: We denote by δ ξ ^ i the Dirac measure concentrated at ξ ^ i R n , which assigns a unit mass to point ξ ^ i . The empirical distribution P N is then defined as the probability measure given by the average P N = 1 N i = 1 N δ ξ ^ i , where ξ ^ 1 , ξ ^ 2 , , ξ ^ N are the observed samples. We denote by e R n the vector where each component is 1. We denote by I A ( x ) the indicator function, which assigns a value of 1 if x A , and 0 otherwise. We denote by M ( Ξ ) the set of all Borel probability measures supported on Ξ , where Ξ R n . Let ( Ω , B , P ) represent an atomless probability space, where Ω is the set of possible states, B is a σ -algebra on Ω , and P is the probability measure. The space L 1 = L 1 ( Ω , B , P ) refers to the set of measurable functions f for which Ω | f | d P < .

2. ES-Expectile

In this section, we will define the ES-expectile question and derive some necessary theorems and formulas.
Definition 1.
For X L 1 , α ( 0 , 1 ) , and β 1 , β 2 ( 0 , 1 ) , the ES-expectile of X, denoted by q α , β 1 , β 2 ( X ) , is defined as the solution to the equation
α ES β 1 ( ( X d ) + ) = ( 1 α ) ES β 2 ( ( X d ) ) , d R .
In the special case where β 1 = β 2 = 0 , q α , β 1 , β 2 ( X ) reduces to the standard expectile q α ( X ) .
Lemma 1.
For X L 1 , d R , and β ( 0 , 1 ) , the expected shortfall ES β ( X d ) + and ES β ( X d ) can be expanded as follows
ES β ( ( X d ) + ) = ES β ( X ) d , if d VaR β ( X ) , 1 1 β E [ ( X d ) + ] , if d > VaR β ( X ) ,
ES β ( ( X d ) ) = d + β 1 β ES 1 β ( X ) 1 1 β E ( X ) , if d VaR 1 β ( X ) , 1 1 β E [ ( X d ) ] , if d < VaR 1 β ( X ) .
Proof. 
Note that VaR θ ( ( X d ) + ) = ( VaR θ ( X ) d ) + (We begin by considering the right-hand side of the equation. When d VaR θ ( X ) , ( VaR θ ( X ) d ) + equals 0. In this case, the left-hand side of the equation is also 0 because P ( max ( X d , 0 ) 0 ) α and x 0 . When d < VaR θ ( X ) , the condition P ( max ( X d , 0 ) x ) α is equivalent to P ( X x + d ) α , leading to the conclusion that VaR θ ( ( X d ) + ) = VaR θ ( X ) d .).
If d VaR β ( X ) , then VaR θ ( ( X d ) + ) = 0 for 0 θ β . Therefore,
ES β ( ( X d ) + ) = 1 1 β 0 1 VaR θ ( ( X d ) + ) d θ = 1 1 β E [ ( X d ) + ] .
Conversely, for d VaR β ( X ) , we have
ES β ( ( X d ) + ) = 1 1 β β 1 ( VaR θ ( X ) d ) d θ = ES β ( X ) d .
Note that VaR θ ( ( X d ) ) = VaR θ ( ( X d ) + ) = ( VaR θ ( X ) d ) + = ( VaR θ ( X ) d ) and VaR θ ( X ) = VaR 1 θ ( X ) . Then,
ES β ( ( X d ) ) = 1 1 β β 1 VaR 1 θ ( ( X d ) ) d θ = 1 1 β 0 1 β ( VaR θ ( X ) d ) d θ .
If d VaR 1 β ( X ) , then ( VaR θ ( X ) d ) = 0 for 1 β θ 1 , which implies
ES β ( ( X d ) ) = 1 1 β E [ ( X d ) ] .
On the other hand, for d VaR 1 β ( X ) , we have
ES β ( ( X d ) ) = 1 1 β 0 1 β ( d VaR θ ( X ) ) d θ = d + β 1 β ES 1 β ( X ) 1 1 β E ( X ) .
   □
In the subsequent sections, we will refer to Lemma 1 to establish that our definition is well defined by demonstrating that the equation in Definition 1 admits at least one solution.
Theorem 1.
For X L 1 , α ( 0 , 1 ) , and β 1 , β 2 ( 0 , 1 ) , the equation with respect to d,
α ES β 1 ( ( X d ) + ) = ( 1 α ) ES β 2 ( ( X d ) ) , d R ,
is guaranteed to have at least one solution.
Proof. 
We first define the function
f ( d ) = α ES β 1 ( ( X d ) + ) ( 1 α ) ES β 2 ( ( X d ) ) .
It is evident that f ( d ) is strictly decreasing with respect to d. Also, as d approaches negative infinity, f ( d ) tends to positive infinity, while as d approaches positive infinity, f ( d ) tends to negative infinity. Therefore, it suffices to prove that f ( d ) is continuous in order to apply the Intermediate Value Theorem, which guarantees the existence and uniqueness of a zero point for f ( d ) .
To prove the continuity of f ( d ) , it suffices to show that ES β 1 ( ( X d ) + ) is continuous. From Lemma 1, we know that ES β 1 ( ( X d ) + ) is continuous at VaR β 1 ( X ) . Therefore, we only need to further demonstrate the continuity of E [ ( X d ) + ] .
We define
g ( d ) = E [ max ( X d , 0 ) ] = ω Ω max ( X ( ω ) d , 0 ) d P ( ω ) .
For all ε > 0 , δ = ε > 0 such that if | d d 0 | < δ , then
g ( d ) g ( d 0 ) Ω max ( X ( ω ) d , 0 ) max ( X ( ω ) d 0 , 0 ) d P ( ω ) Ω d d 0 d P ( ω ) < ε .
Thus, g ( d ) is continuous, which further implies that f ( d ) has at least one zero point.    □
Therefore, our classical portfolio question is
inf d R , y R n d s . t . α ES β 1 ( ( y T ξ d ) + ) = ( 1 α ) ES β 2 ( ( y T ξ d ) ) , y i 0 , i = 1 , , n , i = 1 n y i = 1 .
Here, d represents the risk measure, y denotes the portfolio weights, and ξ is the vector of investment returns, which follows an unknown distribution.
The formulas for ES β ( ( X d ) + ) and ES β ( ( X d ) ) in Lemma 1 were derived to obtain simpler forms. However, this derivation introduces piecewise cases depending on the relationship between d and VaR β ( X ) or VaR 1 β ( X ) . To address the complexity caused by these piecewise cases, we make the following assumption to simplify the analysis and streamline the portfolio problem.
Assumption 1.
Due to the variability of y, which is allowed to vary within a specific domain, we assume that solution d lies within the range from V a R β ( y T ξ ) to positive infinity. This assumption ensures that there exists a risk measure greater than or equal to V a R β ( y T ξ ) , accommodating the potential changes in y.
Specifically, when β 1 = β 2 = β 0.5 and considering the expectation of the returns ρ , portfolio question (2) is equivalent to the following question:
inf d R , y R n d s . t . d VaR β ( y T ξ ) , α 1 α E [ ( y T ξ d ) + ] = ( 1 β ) d + β ES 1 β ( y T ξ ) E ( y T ξ ) , y i 0 , i = 1 , , n , i = 1 n y i = 1 , E [ y T ξ ] = ρ .
We know that the optimal ζ in the Rockafellar–Uryasev Formula (1) is actually the τ -quantile of the random variable Z. Therefore, if we use the Rockafellar–Uryasev formula to replace ES 1 β ( y T ξ ) in question (3), we can obtain the following:
d ζ .
This problem framework is the focus of our study. Since we have discussed the two challenging terms E [ ( y T ξ d ) + ] and ES 1 β ( y T ξ ) , the other cases can be easily generalized. Moreover, the derivation process does not rely on the requirement that β 1 = β 2 . This is also the reason why we approximate ES β ( ( X d ) + ) and ES β ( ( X d ) ) using only these two terms.
However, the problem framework has certain challenges, such as the unknown probability measure P of ξ , which corresponds to the distribution Law ( ξ ) , and the computational difficulties associated with modeling VaR and ES. These issues arise from the uncertainty in Law ( ξ ) and the complexity of handling these risk measures in practice.

3. DRO Theory and DRES-Expectile

In this section, we briefly introduce the theory of distributionally robust optimization and apply it using some derived related theorems to handle question (3).
In decision making under uncertainty, it is common to encounter situations where the probability distribution governing the uncertainty is not known precisely. Classical stochastic programming assumes that this distribution is fully known, but this assumption often does not hold in real-world scenarios. This leads to the concept of distributionally robust optimization (DRO), which seeks to hedge against the worst-case scenario by considering a family of probability distributions that are close to a nominal distribution.
One powerful tool to define this “closeness” between probability distributions is the Wasserstein distance. The Wasserstein distance, rooted in the theory of optimal transport, provides a metric with which to quantify the distance between two probability distributions on a given space. Specifically, if we consider an uncertainty set Ξ equipped with a norm · , the set of all Borel probability distributions on Ξ with finite p-th moments is denoted as P p ( Ξ ) . For any two distributions μ , ν P p ( Ξ ) , the Wasserstein distance of order p is defined as
W p ( μ , ν ) : = min γ Γ ( μ , ν ) Ξ × Ξ ξ η p γ ( d ξ , d η ) 1 / p ,
where Γ ( μ , ν ) represents the set of all couplings between μ and ν , that is, the set of all joint distributions on Ξ × Ξ that have μ and ν as marginals.
In the context of distributionally robust optimization, we use the Wasserstein distance to construct an ambiguity set M that contains all distributions within a certain Wasserstein distance from a nominal distribution ν . The goal is then to find a decision that performs optimally in the worst-case scenario over all distributions in this ambiguity set. Mathematically, this can be formulated as
min y Y sup μ M E μ [ Ψ ( y , ξ ) ] ,
where M = { μ P p ( Ξ ) : W p ( μ , ν ) δ } , with δ > 0 denoting the radius of the Wasserstein distance. Here, P p ( Ξ ) represents the set of probability distributions with finite p-th moments, and Ψ is an upper semi-continuous function, typically representing a loss or a negative payoff function. The random variable ξ represents uncertainty, and W p ( μ , ν ) denotes the Wasserstein distance between μ and ν . Since our main focus in the following discussion is on the problem of taking the supremum over distributions, specifically sup μ M E μ [ Ψ ( y , ξ ) ] , we treat y as a parameter. For convenience, we often write Ψ ( y , ξ ) as Ψ y ( ξ ) , which indicates that it is a function of the random variable ξ .
We refer to the ambiguity set M as the Wasserstein ball and use it with p = 1 in this study due to the favorable mathematical properties of the Wasserstein metric with p = 1 , which will be utilized in the subsequent derivations.
This formulation provides a robust approach to decision making under uncertainty, ensuring that the chosen decision is resilient against the worst possible distributional deviations within the specified Wasserstein distance [10].
In recent studies on distributionally robust optimization, Gao et al. ([11] Corollary 2), investigated the relationship between distributionally robust optimization and variation regularization, obtaining the following conclusion.
Theorem 2.
Assume that Ψ y is Lipschitz continuous. Suppose diam ( Ξ ) = and there exists ξ 0 Ξ such that
lim sup ξ ˜ ξ 0 Ψ y ( ξ ˜ ) Ψ y ( ξ 0 ) ξ ˜ ξ 0 = Ψ y Lip .
Then, for any δ 0 and Ψ y , we have
sup μ M E ξ μ [ Ψ y ( ξ ) ] = E ξ P N [ Ψ y ( ξ ) ] + δ Ψ y Lip ,
where P N is the empirical distribution, δ is the radius of the Wasserstein ball, and Ψ y Lip is called Lipschitz regularization.
Specifically, when condition (5) is satisfied for a particular point ξ 0 Ξ , it is also satisfied for all ξ Ξ . Condition (5) implies that the Lipschitz norm is approximately achieved between any ξ supp P N and some distant point ξ ˜ . More precisely, for any ε > 0 and r > 0 , there exists a point ξ ˜ = T Ψ ( ξ ) ( T Ψ ( ξ ) is a mapping that transforms point ξ to a distant point, ensuring that the Lipschitz continuity condition is satisfied. Specifically, it ensures that for Ψ y , as point ξ moves farther away, the increment in Ψ y approximately reaches the upper bound of the Lipschitz constant Ψ y Lip .) such that T Ψ ( ξ ) ξ > r and Ψ y ( T Ψ ( ξ ) ) Ψ y ( ξ ) ( Ψ Lip ε ) T Ψ ( ξ ) ξ . The approximately worst-case distribution then perturbs some point ξ i 0 supp P N to T Ψ ( ξ i 0 ) with a small probability of δ / N , resulting in the limiting distribution
1 N i i 0 δ ξ i + 1 δ N δ ξ i 0 + δ N δ T Ψ ( ξ i 0 ) .
It can be observed that we can simplify a problem involving the supremum over distributions, but determining the optimal solution, i.e., identifying the worst-case distribution, remains challenging. Moreover, as shown in ([9] Example 2), the worst-case distribution does not always exist. Although ([10] Theorem 1) and ([9] Corollary 4.6) mention that a worst-case distribution exists under certain conditions, sufficient and necessary conditions are still lacking.
We now reproduce Corollary 4.6 from Kun et al. ([9] Corollary 4.6).
Theorem 3.
An uncertainty set Ξ R n is convex and closed, with some lower semi-continuous, convex, and appropriate functions (Given a real-valued function f and a non-empty set X, if there exists an x X such that f ( x ) < + and for all x X , f ( x ) > , then function f is said to be appropriate on set X.) l k , where k K . Moreover, for any k K , the function l k does not take the value negative infinity on Ξ. If Ξ is compact or K = 1 , then the worst-case distribution of max k K l k exists.
Next, we apply distributionally robust optimization to q α , β 1 , β 2 ( X ) and the expected return, aiming to derive q α , β 1 , β 2 ( X ) under the worst-case distribution while considering the expected return. Furthermore, we handle ES 1 β ( y T ξ ) using the Rockafellar–Uryasev Formula (1). This leads to the following problem:
sup μ W inf ζ R ( 1 β ) d + β ζ + E μ [ ( y T ξ ζ ) + ] E μ ( y T ξ ) α 1 α E μ [ ( y T ξ d ) + ] s . t . E μ [ y T ξ ] = ρ ,
where W = { μ P p ( Ξ ) : W p ( μ , P N ) δ } .
Define f ( μ , ζ ) = β ζ + E μ [ ( y T ξ ζ ) + ] E μ ( y T ξ ) α 1 α E μ [ ( y T ξ d ) + ] = E μ [ h ( ζ , z ) ] , where z = y T ξ , and h ( ζ , z ) can be regarded as a loss function. Here, we take the upper bound of the expectation of h ( ζ , z ) over the distribution rather than E μ ( h ( ζ , z ) ) because, as discussed in the relevant distributionally robust optimization literature [9], when the objective function is written in the form of Theorem 3, i.e., max k K l k , where l k is a concave function, the problem exhibits some desirable mathematical properties. As for the specific equivalent representation of h ( ζ , z ) , we will provide it in the following sections.
Let us first consider inf ζ R sup μ W f ( μ , ζ ) . Under Assumption 1, we can easily deduce that for any ζ , the function h ( ζ , ξ ) has the same Lipschitz regularization (If there exist ζ 1 and ζ 2 such that d < ζ 1 and d > ζ 2 , the Lipschitz regularizations of the two loss functions h ( ζ 1 , ξ ) and h ( ζ 2 , ξ ) may differ. Our assumption excludes this case.). Thus, according to Theorem 2, the worst-case distribution for the problem sup μ W E μ [ h ( ζ , ξ ) ] asymptotically converges to
1 N i i 0 δ ξ i + 1 δ N δ ξ i 0 + δ N δ T h ( ζ , ξ i 0 ) .
Moreover, we know that for different ζ , the variation in the loss function, as described by h ( T h ( ζ , ξ i 0 ) , ξ i 0 ) h ( ζ , ξ i 0 ) in Theorem 2, is the same. Therefore, it can be concluded that for different loss functions h ( ζ , ξ ) , the worst-case distributions to which they asymptotically converge can be the same.
Assumption 2.
For a sequence of loss functions with respect to ζ, defined as h ( ζ , z ) = ( z ζ ) + z α 1 α ( z d ) + , it can be written as a function that conforms to Theorem 3 in the form of max ( q 1 , q 2 ) , where q 1 ( z ) = min ( z , 1 α + α 2 α 1 z + α 2 1 α ζ ) , q 2 ( z ) = min ( ζ , α α 1 z + α 1 α d ζ ) , and both q 1 and q 2 are concave functions. If z belongs to a compact uncertainty set Ξ, then on the Wasserstein ball W , centered at the empirical distribution with radius δ > 0 , the worst-case distributions exist and are the same.
The sources of q 1 and q 2 in the above assumptions are as follows. To ensure that h ( ζ , z ) = max ( q 1 , q 2 ) , in the part of q 1 ( z ) , q 2 ( z ) is not the only choice. The key is to find a linear function passing through ( ζ , ζ ) with a slope less than −1 and less than α α 1 . We choose the slope as α α 1 + α 1 , and it is easy to verify that it satisfies the conditions. By adding some further conditions, we can use Theorem 3 to ensure that the worst-case distribution exists.
Theorem 4.
For the given Wasserstein ball W with δ > 0 , β ( 0 , 1 ) , and α ( 0 , 1 ) , under the condition that inequality (4) and Assumption 2 hold, define f ( μ , ζ ) = β ζ + E μ [ ( y T ξ ζ ) + ] E μ ( y T ξ ) α 1 α E μ [ ( y T ξ d ) + ] , where y T ξ comes from a compact uncertainty set Ξ. Then, the following equality holds:
sup μ W inf ζ R f ( μ , ζ ) = inf ζ R sup μ W f ( μ , ζ ) .
Proof. 
From inequality (4) and Assumption 2, we know that there exists a probability measure μ , independent of ζ, such that
inf ζ R sup μ W f ( μ , ζ ) = inf ζ R f ( μ , ζ ) = inf ζ R f ( P N , ζ ) + f Lip .
We know that the max–min inequality holds independently of the properties of f or whether W is a convex set (For any fixed μ 0 , we have sup μ W f ( μ , ζ ) f ( μ 0 , ζ ) . Taking the infimum over ζ on both sides and then the supremum over μ 0 yields the result.), so we always have
sup μ W inf ζ R f ( μ , ζ ) inf ζ R sup μ W f ( μ , ζ ) .
For any fixed μ 0 , we also have
sup μ W inf ζ R f ( μ , ζ ) inf ζ R f ( μ 0 , ζ ) .
Let μ 0 = μ . Then,
sup μ W inf ζ R f ( μ , ζ ) inf ζ R f ( μ , ζ ) = inf ζ R sup μ W f ( μ , ζ ) .
Thus, we have completed the proof.    □
Assume Ξ is a compact set. By applying Theorem 4, we can transform problem (6), and for now, we do not need to consider minimizing over ζ .
sup μ W Ξ h ( ζ , y T ξ ) μ ( d ξ ) s . t . Ξ y T ξ μ ( d ξ ) = ρ = sup γ M ( Ξ ) Ξ h ( ζ , y T ξ ) γ ( d ξ , Ξ ) s . t . Ξ y T ξ γ ( d ξ , Ξ ) = ρ , Ξ 2 ξ η γ ( d ξ , d η ) δ , γ ( Ξ , d η ) = P N ( d η ) .
For γ ( Ξ , d η ) , we have
γ ( d ξ , Ξ ) = η Ξ γ ( d ξ , d η ) = i = 1 N γ ( d ξ | η = ξ ^ i ) P N ( ξ ^ i ) = 1 N i = 1 N P i ( d ξ ) ,
where P i ( d ξ ) represents the conditional distribution of ξ conditioned on η = ξ ^ i . Likewise, we have
γ ( d ξ , d η ) = γ ( d ξ | η ) P N ( η ) = 1 N i = 1 N δ ξ ^ i ( η ) P i ( d ξ ) .
Thus, we can transform problem (7) into
sup P i M ( Ξ ) 1 N i = 1 N Ξ h ( ζ , y T ξ ) P i ( d ξ ) s . t . 1 N i = 1 N Ξ y T ξ P i ( d ξ ) = ρ , 1 N i = 1 N Ξ ξ ξ ^ i P i ( d ξ ) δ , Ξ P i ( d ξ ) = 1 , i = 1 , , N .
If we derive the Lagrangian dual problem from the original problem, then according to an extended version of the well-known strong duality result for moment problems ([19] Proposition 3.4), we know that the maximum and minimum of the Lagrangian dual function remain unchanged even after exchanging the order of optimization.
Theorem 5.
We stipulate that the norm · in the Wasserstein distance is the infinity norm · . Problem (8) is equivalent to
min λ 1 , λ 2 , θ 1 , θ 2 λ 1 ρ + λ 2 δ + 1 N i = 1 N max ( ζ α 2 ( 1 θ 1 ) 1 α θ 1 + 1 α + α 2 1 α ( 1 θ 1 ) y ξ ^ i , d α ( 1 θ 2 ) 1 α ζ α 1 α ( 1 θ 2 ) y ξ ^ i ) s . t . | ( 1 α ) λ 1 + θ 1 α 2 + α 2 + α + 1 | λ 2 ( 1 α ) , | λ 1 + α ( 1 λ 1 θ 2 ) | λ 2 ( 1 α ) λ 2 0 , θ 1 , θ 2 ( 0 , 1 ) .
Proof. 
We derive the Lagrangian dual problem from the original problem (8)
L ( P i ; λ 1 , λ 2 , s ) = 1 N i = 1 N Ξ h ( ζ , y T ξ ) P i ( d ξ )
L ( P i ; λ 1 , λ 2 , s ) = 1 N i = 1 N Ξ h ( ζ , y T ξ ) P i ( d ξ ) + λ 1 ( α 1 N i = 1 N Ξ y T ξ P i ( d ξ ) )
+ λ 2 ( ε 1 N i = 1 N Ξ ξ ξ ^ i P i ( d ξ ) ) + i = 1 N s i ( 1 Ξ P i ( d ξ ) )
= 1 N i = 1 N Ξ h ( ζ , y T ξ ) λ 1 y T ξ λ 2 ξ ξ ^ i N s i P i ( d ξ )
+ λ 1 α + λ 2 δ + i = 1 N s i ,
where λ 1 R , λ 2 R + , and s R N . Thus, the Lagrangian dual of problem (8) is (This is because M ( Ξ ) contains all Dirac distributions, allowing the integral function to attain its supremum at P i ).
min P i ; λ 1 , λ 2 , s sup ξ Ξ L ( ξ ; λ 1 , λ 2 , s ) ,
which is given by
min λ 1 , λ 2 , s , t λ 1 ρ + λ 2 δ + i = 1 N s i + 1 N i = 1 N t i , s . t . h ( ζ , y T ξ ) λ 1 y ξ λ 2 ξ ξ ^ i N s i t i , ξ Ξ , i = 1 , , N , λ 2 0 .
We use the form h ( ζ , y ξ ) as given in Assumption 2, specifically h ( ζ , y ξ ) = max ( q 1 ( y ξ ) , q 2 ( y ξ ) ) , where q 1 ( y ξ ) = min y ξ , 1 α + α 2 α 1 y ξ + α 2 1 α ζ and q 2 ( y ξ ) = min ζ , α α 1 y ξ + α 1 α d ζ . So, problem (9) is equivalent to
min λ 1 , λ 2 , s , t λ 1 ρ + λ 2 δ + i = 1 N s i + 1 N i = 1 N t i s . t . N s i max ξ Ξ λ 2 ξ ξ ^ i λ 1 y ξ + q 1 ( y ξ ) t i , i = 1 , , N , N s i max ξ Ξ λ 2 ξ ξ ^ i λ 1 y ξ + q 2 ( y ξ ) t i , i = 1 , , N , λ 2 0 .
= min λ 1 , λ 2 , s λ 1 ρ + λ 2 δ + i = 1 N s i + 1 N i = 1 N t i s . t . N s i max ξ Ξ min z i 1 1 λ 2 z i 1 ( ξ ξ ^ i ) λ 1 y ξ + q 2 ( y ξ ) t i , i = 1 , , N , N s i max ξ Ξ min z i 1 1 λ 2 z i 2 ( ξ ξ ^ i ) λ 1 y ξ + q 2 ( y ξ ) t i , i = 1 , , N , λ 2 0 .
= min λ 1 , λ 2 , s λ 1 ρ + λ 2 δ + i = 1 N s i + 1 N i = 1 N t i s . t . N s i max ξ Ξ z i 1 ( ξ ξ ^ i ) λ 1 y ξ + q 1 ( y ξ ) t i , i = 1 , , N , N s i max ξ Ξ z i 2 ( ξ ξ ^ i ) λ 1 y ξ + q 2 ( y ξ ) t i , i = 1 , , N , z i 1 1 λ 2 , z i 2 1 λ 2 .
= min λ 1 , λ 2 , s λ 1 ρ + λ 2 δ + i = 1 N s i + 1 N i = 1 N t i s . t . N s i max ξ Ξ min z i 1 + ( 1 + λ 1 ) y ξ + z i 1 ξ ^ i , z i 1 + 1 α + α 2 1 α + λ 1 y ξ + α 2 1 α ζ + z i 1 ξ ^ i t i , i = 1 , , N , N s i max ξ Ξ min z i 2 + λ 1 y ξ ζ + z i 2 ξ ^ i , z i 2 + α 1 α + λ 1 y ξ + α 1 α d ζ + z i 2 ξ ^ i t i , i = 1 , , N , z i 1 1 λ 2 , z i 2 1 λ 2 λ 2 0 .
Let q 1 ( ξ ) =
min z i 1 + ( 1 + λ 1 ) y ξ + z i 1 ξ ^ i , z i 1 + 1 α + α 2 1 α + λ 1 y ξ + α 2 1 α ζ + z i 1 ξ ^ i
and q 2 ( ξ ) =
min z i 2 + λ 1 y ξ ζ + z i 2 ξ ^ i , z i 2 + α 1 α + λ 1 y ξ + α 1 α d ζ + z i 2 ξ ^ i .
In order for q 1 ( ξ ) and q 2 ( ξ ) to have a maximum value, there must exist θ 1 , θ 2 ( 0 , 1 ) such that (We consider a function f ( x ) = min ( a x + c , b x + d ) , where a b . The two components of the function are hyperplanes. For f ( x ) to have a maximum value, it is necessary and sufficient that a and b are opposites and neither is zero. In this case, we can express z i 1 and z i 2 as convex combinations).
z i 1 = θ 1 ( 1 + λ 1 ) + ( 1 θ 1 ) 1 α + α 2 1 α + λ 1 y
and
z i 2 = θ 2 λ 1 + ( 1 θ 2 ) α 1 α + λ 1 y .
In addition, when
z i 1 + ( 1 + λ 1 ) y ξ + z i 1 ξ ^ i = z i 1 + 1 α + α 2 1 α + λ 1 y ξ + α 2 1 α ζ + z i 1 ξ ^ i ,
i.e., when ξ = d y y 2 2 , the maximum value of q 1 ( ξ ) is
ζ α 2 ( 1 θ 1 ) 1 α θ 1 ( 1 + λ 1 ) + ( 1 θ 1 ) 1 α + α 2 1 α + λ 1 y ξ ^ i .
Similarly, the maximum value of q 2 ( ξ ) is
d α ( 1 θ 2 ) 1 α ζ θ 2 λ 1 + ( 1 θ 2 ) α 1 α + λ 1 y ξ ^ i .
Moreover, since y 1 = 1 , problem (10) is equivalent to
min λ 1 , λ 2 , θ 1 , θ 2 , s λ 1 ρ + λ 2 δ + i = 1 N s i + 1 N i = 1 N t i s . t . N s i ζ α 2 ( 1 θ 1 ) 1 α θ 1 ( 1 + λ 1 ) + ( 1 θ 1 ) 1 α + α 2 1 α + λ 1 y ξ ^ i t i , i = 1 , , N , N s i d α ( 1 θ 2 ) 1 α ζ θ 2 λ 1 + ( 1 θ 2 ) α 1 α + λ 1 y ξ ^ i t i , i = 1 , , N , | ( 1 α ) λ 1 + θ 1 α 2 + α 2 + α + 1 | λ 2 ( 1 α ) , | λ 1 + α ( 1 λ 1 θ 2 ) | λ 2 ( 1 α ) λ 2 0 , θ 1 , θ 2 ( 0 , 1 ) .
= min λ 1 , λ 2 , θ 1 , θ 2 λ 1 ρ + λ 2 δ + 1 N i = 1 N max ( ζ α 2 ( 1 θ 1 ) 1 α θ 1 + 1 α + α 2 1 α ( 1 θ 1 ) y ξ ^ i , d α ( 1 θ 2 ) 1 α ζ α 1 α ( 1 θ 2 ) y ξ ^ i ) | λ 1 + α ( 1 λ 1 θ 2 ) | λ 2 ( 1 α ) λ 2 0 , θ 1 , θ 2 ( 0 , 1 ) .
   □
However, this distributionally robust optimization problem plays the role of a constraint in portfolio optimization, so we would prefer to obtain an analytical solution. Next, we consider approximating the Lagrangian dual formulation of problem (7) using the max–min inequality, i.e.,
sup μ W inf λ R f ( μ , ζ ) + λ ρ E μ ( y T ξ ) inf λ R sup μ W f ( μ , ζ ) + λ ρ E μ ( y T ξ ) .
Using inequality (4) and Theorem 2, we find that the problem
inf λ R sup μ W f ( μ , ζ ) + λ ρ E μ ( y T ξ )
has an analytical solution.
Theorem 6.
We stipulate that the norm · in the Wasserstein distance is the Euclidean norm · 2 . Assuming E P N ( y T ξ ) > 0 , we set ρ = k E P N ( y T ξ ) with k ( 0 , 1 ) . If ρ E P N ( y T ξ ) + δ y 2 < 0 , the solution to problem (11) is negative infinity. If ρ E P N ( y T ξ ) + δ y 2 0 , the solution to problem (11) is given by
f ( P N , ζ ) + 1 2 δ y 2 ρ + E P N ( y T ξ ) if α 1 2 , f ( P N , ζ ) + α 2 2 α δ y 2 ρ + E P N ( y T ξ ) if α > 1 2 .
Proof. 
Under the premise of Conclusion 4 and applying Theorem 2, we have
inf λ R sup μ W f ( μ , ζ ) + λ ρ E μ ( y T ξ ) = inf λ R sup μ W E μ [ ( y T ξ ζ ) + ( λ + 1 ) y T ξ α 1 α ( y T ξ d ) + ] + λ ρ = inf λ R E P N [ ( y T ξ ζ ) + ( λ + 1 ) y T ξ α 1 α ( y T ξ d ) + ] + L δ y + λ ρ = inf λ R f ( P N , ζ ) + L δ y 2 + λ [ ρ E P N ( y T ξ ) ] ,
where L is the Lipschitz constant of the function t ( x ) = ( x ζ ) + ( λ + 1 ) x α 1 α ( x d ) + , that is, max ( λ + 1 , λ , λ + α 1 α ) .
Let A = δ y 2 > 0 and B = ρ E P N ( y T ξ ) < 0 , and the next step is to solve the convex problem
inf λ R A max ( | λ + 1 | , | λ | , | λ + α 1 α | ) + B λ .
When α 1 2 , 0 < α 1 α 1 , through analysis or graphing, we know that max ( λ + 1 , λ , λ + α 1 α ) = 1 2 + λ + 1 2 . When λ 1 2 , the problem becomes
inf λ 1 2 ( A + B ) λ + A .
Obviously, under the condition A + B 0 , the minimum value is A B 2 ; otherwise, it tends to negative infinity.
When λ 1 2 , the problem becomes
inf λ 1 2 ( B A ) λ .
Since B A < 0 holds, the minimum value of this problem is also A B 2 .
Under the condition α 1 2 , we obtain the following result:
sup μ W inf λ R f ( μ , ζ ) + λ ρ E μ ( y T ξ ) = f ( P N , ζ ) + 1 2 δ y 2 ρ + E P N ( y T ξ ) , s . t . ρ E P N ( y T ξ ) + δ y 2 0 .
When α > 1 2 , α 1 α > 1 , max ( | λ + 1 | , | λ | , | λ + α 1 α | ) = α 2 2 α + | λ + α 2 2 α | . Similarly, we have
sup μ W inf λ R f ( μ , ζ ) + λ ρ E μ ( y T ξ ) = f ( P N , ζ ) + α 2 2 α δ y 2 ρ + E P N ( y T ξ ) , s . t . ρ E P N ( y T ξ ) + δ y 2 0 .
   □
Thus, in taking α 1 2 as an example, if the optimal solution to problem (7), i.e., the worst-case distribution, is P , then the ES-expectile question processed through distributionally robust optimization (DRES-expectile) is
inf d R , y R n d s . t . d VaR β P ( y T ξ ) , α 1 α E P N [ ( y T ξ d ) + ] = inf ζ R ( 1 β ) d + β ζ + E P N [ ( y T ξ ζ ) + ] 1 2 [ E P N ( y T ξ ) + ρ ] + 1 2 δ y 2 , y i 0 , i = 1 , , n , i = 1 n y i = 1 , ρ E P N ( y T ξ ) + δ y 2 0 .

4. Question Simplification

In this section, to make the problem easier to solve, we consider using some approximation techniques to simplify the problem.
For the infimum part in the equation of question (12), although we know that the optimal ζ is the 1 β quantile of the random variable y T ξ , due to the complexity of sorting, non-smoothness, and the involvement of the decision variable y, we apply a relaxation instead of directly choosing the optimal value. Additionally, we introduce new variables to smooth the function ( x ) + .
Specifically, we first degrade the second constraint and introduce the variable ζ , transforming the problem into
inf d , ζ R , y R n d s . t . d VaR β P ( y T ξ ) , α 1 α E P N [ ( y T ξ d ) + ] ( 1 β ) d + β ζ + E P N [ ( y T ξ ζ ) + ] 1 2 [ E P N ( y T ξ ) + ρ ] + 1 2 δ y 2 , y i 0 , i = 1 , , n , i = 1 n y i = 1 , ρ E P N ( y T ξ ) + δ y 2 0 .
Next, we introduce new variables a i , where a i = ( y T ξ d ) + , and generate two constraints: a i ξ i ^ d and a i 0 . The same treatment is applied to ( y T ξ ζ ) + , and the problem becomes
inf d , ζ R , y R n , a , b R N d s . t . d VaR β P ( y T ξ ) , α N ( 1 α ) e T a ( 1 β ) d + β ζ + 1 N e T b 1 2 [ E P N ( y T ξ ) + ρ ] + 1 2 δ y 2 , ρ E P N ( y T ξ ) + δ y 2 0 , y i 0 , i = 1 , , n , i = 1 n y i = 1 , a i , b i 0 , i = 1 , , N , a i ξ i ^ d , i = 1 , , N , b i ξ i ^ ζ , i = 1 , , N .
For the first constraint d VaR β P ( y T ξ ) , VaR β P ( y T ξ ) represents the β -quantile of the random variable y T ξ under P . We approximate P using the empirical distribution and estimate the β -quantile using the normal distribution
VaR β P ( y T ξ ) y T μ ^ + z β y T Σ ξ ^ y ,
where μ ^ is the sample mean vector, Σ ξ ^ is the sample covariance matrix, and z β is the β -quantile of the standard normal distribution.
In addition to the constraints and numerical boundaries on the portfolio weights y, we treat all other constraints as soft constraints, handled through a penalty function approach. On the one hand, this simplifies the problem, making it easier for the algorithm to find an optimal solution; on the other hand, it prevents overemphasis on risk d at the expense of ignoring the importance of investment returns. The question after applying the penalty function essentially becomes a weighted problem concerning the expectation, standard deviation, Value at Risk, and expected shortfall. To ensure smoothness and similarity in certain terms, the penalty function is set as log ( 1 + σ e r ( x ) ) , where r ( x ) represents constraints less than or equal to zero, and σ is the penalty factor.
inf d , ζ R , y C , a , b D d + j = 1 3 log 1 + σ j e r j ( d , ζ , y ) + i = 1 N log 1 + σ a e r a i ( d , ζ , y ) + log 1 + σ b e r b i ( d , ζ , y ) ,
where
r 1 ( d , ζ , y ) = y T μ ^ + z β y T Σ ξ ^ y d , r 2 ( d , ζ , y ) = α N ( 1 α ) e T a + 1 2 E P N ( y T ξ ) + ρ ( 1 β ) d β ζ 1 N e T b 1 2 δ y 2 , r 3 ( d , ζ , y ) = E P N ( y T ξ ) ρ δ y 2 , r a i ( d , ζ , y ) = ξ ^ i d a i , i = 1 , , N , r b i ( d , ζ , y ) = ξ ^ i ζ b i , i = 1 , , N ,
and the constraint sets are
D = { x R N x i 0 } , C = { y R n y i 0 , i = 1 n y i = 1 } .

5. Numerical Experiments

In this section, we first introduce the portfolio problems for comparison and the metrics that characterize portfolio performance. Next, we gather data from online sources, calculate returns, and utilize the solutions obtained from the corresponding questions to make investment decisions. Finally, we generate graphs illustrating asset changes and compute various financial statistics, such as the expected return, standard deviation, Sharpe ratio, expected shortfall, and Sortino ratio, for comparative analysis.
During the testing phase, we test the traditional ES-expectile question under the empirical distribution. Since the question sets the value of the expected return ρ as k E P N ( y T ξ ) , the constraint becomes ( 1 k ) E P N ( y T ξ ) , which turns into an ineffective constraint. Therefore, we modify this constraint to E P N ( y T ξ ) 0 . Similarly to the approximation process from problems (12), (13), and finally (14), we first approximate problem (3) as follows:
inf d , ζ R , y R n , a , b R N d s . t . d VaR β P ( y T ξ ) , α N ( 1 α ) e T a ( 1 β ) d + β ζ + 1 N e T b E P N ( y T ξ ) , y i 0 , i = 1 , , n , i = 1 n y i = 1 , a i , b i 0 , i = 1 , , N , a i ξ i ^ d , i = 1 , , N , b i ξ i ^ ζ , i = 1 , , N .
Then, we apply a penalty function approach to transform the problem into
inf d , ζ R , y C , a , b D d + j = 1 3 log 1 + σ j e r j ( d , ζ , y ) + i = 1 N log 1 + σ a e r a i ( d , ζ , y ) + log 1 + σ b e r b i ( d , ζ , y ) ,
where
r 1 ( d , ζ , y ) = y T μ ^ + z β y T Σ ξ ^ y d , r 2 ( d , ζ , y ) = α N ( 1 α ) e T a + E P N ( y T ξ ) ( 1 β ) d β ζ 1 N e T b , r 3 ( d , ζ , y ) = E P N ( y T ξ ) , r a i ( d , ζ , y ) = ξ ^ i d a i , i = 1 , , N , r b i ( d , ζ , y ) = ξ ^ i ζ b i , i = 1 , , N ,
and the constraint sets are
D = { x R N x i 0 } , C = { y R n y i 0 , i = 1 n y i = 1 } .
Compared to question (14), the main changes are in r 2 ( d , ζ , y ) and r 3 ( d , ζ , y ) .
Nowadays, many portfolio questions with good performance have emerged in academia. In this data experiment, we selected two traditional portfolio questions and their distributionally robust optimized versions. One of them is the mean–variance question proposed by Markowitz in 1952 [1]. However, since we use the empirical distribution in this experiment, the treatment of expected returns is consistent with that in the ES-expectile question. Therefore, we replace it with a weighted sum of the mean and standard deviation:
inf y R n E P N ( y T ξ ) + ω y T Σ ξ y s . t . y i 0 , i = 1 , 2 , , n , e T y = 1 .
Here, ω > 0 represents the degree of risk aversion.
Jose Blanchet studied its distributionally robust optimization version in 2018 [13]. When the Wasserstein distance is based on the Euclidean norm and the radius of the Wasserstein ball is δ ,
inf y R n δ y + y T Σ ξ y s . t . y i 0 , i = 1 , 2 , , n , e T y = 1 , E P N ( y T ξ ) α + δ y .
Here, we set α = k E P N ( y T ξ ) .
Secondly, we select the mean–expected shortfall question [9]:
inf y R n E P N ( y T ξ ) + ω ES β ( y T ξ ) s . t . y i 0 , i = 1 , 2 , , n , e T y = 1 .
As for its distributionally robust optimized version, it is mentioned in ([9] Proposition 7.2) that under certain conditions, the optimal solution is the equally weighted portfolio.
We select a sample of 130 well-performing Chinese equities from cn.investing.com covering the period 2022–2023. Using their daily closing prices P t (with t indexing trading days), we compute simple daily returns:
R t = P t P t 1 1 ,
where P t denotes the closing price on day t, and R t is the corresponding daily return. After discarding any trading day on which one or more equities have missing data, we retain a cleaned dataset with 460 valid trading days. These are days for which complete return data are available across all selected stocks.
Across these 460 days and 130 equities (roughly 60,000 return observations), the cross-sectional mean of individual daily returns is 0.061 % , while the mean cross-sectional standard deviation is 2.56 % . The pooled distribution of daily returns is mildly right-skewed (skewness, 0.39 ) and strongly leptokurtic (excess kurtosis, 12.42 ), signaling pronounced fat tails. Single-day extremes reach + 36.76 % and 43.08 % , and about 62% of the equities exhibit excess kurtosis above 3.
To evaluate the portfolio strategy’s viability, we conducted a rolling-window back-testing experiment. A 100-day estimation window was paired with four holding periods h { 3 , 7 , 14 , 21 } . After each h-day decision interval, the estimation window advances by h days; the portfolio question is re-estimated on the updated 100-day sample and a new portfolio is formed. This cycle is repeated until the entire 360-day out-of-sample testing horizon is covered.
The initial cumulative return was set to C R 0 = 1 . If the portfolio realizes an interval return R t , t + h over the holding period [ t , t + h ] , the cumulative return updates multiplicatively as
C R t + h = C R t 1 + R t , t + h .
Unfolding the recursion yields the cumulative return at any test horizon T ( 1 T 360 ):
C R T = t = 1 T 1 + R t ,
where R t is the one-day portfolio return on day t.
For the 360-day returns, we calculate the expectation, standard deviation, Sharpe ratio, expected shortfall, and Sortino ratio [20]. The expectation, standard deviation, and expected shortfall will not be discussed again. Here, the expected shortfall is calculated as
1 0.05 0 0.05 VaR θ ( X ) d θ .
The Sharpe ratio is defined as
R p R f σ p ,
where R p is the expected return of the portfolio, R f is the risk-free rate, and σ p is the standard deviation of the portfolio returns. The Sharpe ratio measures the excess return per unit of risk. For the risk-free rate, we assume the 4% annual interest rate on U.S. Treasury bonds as the risk-free rate. Since our returns are daily, we use R f = ( 1 + 4 % ) 1 252 1 , where 252 is the number of trading days in a year.
The Sortino ratio is defined as
R p R f σ d ,
where R p is the expected return of the portfolio, R f is the risk-free rate (which can be considered the target return), and σ d is the standard deviation of the downside returns, which are the returns below the risk-free rate. Unlike the Sharpe ratio, which considers both upside and downside volatility, the Sortino ratio focuses exclusively on downside risk. This means it only accounts for volatility below the target return and ignores the volatility above it.
The advantage of the Sortino ratio is that it provides a more precise risk-adjusted performance measure, particularly suited for investors who are more concerned with avoiding losses than simply achieving higher returns. A higher Sortino ratio indicates that the portfolio is generating better returns relative to the downside risk, which is a key measure of its effectiveness in mitigating losses.
We use the ‘minimize’ method from Python’s ‘scipy.optimize’ library to solve this problem. It is a general-purpose optimizer for minimizing scalar objective functions and supports various algorithms, including both unconstrained and constrained optimization. Users can solve nonlinear problems by passing different optimization algorithm options, such as SLSQP, BFGS, Nelder-Mead, etc. In this numerical experiment, all questions were solved using Sequential Least Squares Programming (SLSQP) [21]. All experiments were run on a laptop with an Intel Core i5-12500H CPU (12 threads at 2.5 GHz) and 16 GB of RAM, under Python 3.11 with NumPy 1.26 and SciPy 1.11.4, without any GPU acceleration. Solving a single DRES-expectile portfolio problem took 7.31 ± 0.32 s of wall-clock time and required 37 ± 2 iterations for the optimizer to reach its convergence criteria.
Here, we provide explanations for the abbreviations of the tested questions. DRES-e refers to DRES-expectile (14), ES-e refers to ES-expectile (15), 1/N refers to the equally weighted portfolio, Mean-Std refers to the mean–standard deviation question (16), DRMean-Std refers to the distributionally robust optimized mean–standard deviation question (17), and Mean-ES refers to the mean–expected shortfall question (18).
It can be observed from Table 2 that regardless of the sliding window employed, the average return of DRES-expectile is significantly higher than that of other questions. It also demonstrates excellent performance in terms of the Sharpe ratio and the Sortino ratio. However, it performs inadequately in terms of standard deviation and expected shortfall. When combined with the cumulative return curve in Figure 1, Figure 2, Figure 3 and Figure 4, it can be seen that DRES-expectile exhibits an explosive upward trend, and this is inevitably accompanied by a poor standard deviation and expected shortfall. Regarding the ES-expectile question under the empirical distribution, its performance is nearly equivalent to that of the mean–variance question. We also discover that questions present diverse performance under different sliding windows. The reason behind this lies in that the question mainly captures a short-term future trend. Hence, if an investment portfolio is utilized for an overly short period, its superiority in the average sense cannot be manifested. Similarly, if an investment portfolio is used for an excessively long period, the long-term stability of returns becomes difficult to guarantee.
Furthermore, to explore the reason behind the high average returns of our question, we fix the sliding window to 7, keeping other parameters unchanged. We record the solution vectors for each question at each stage (52 in total, with each solution vector having 130 dimensions, corresponding to 130 stocks), i.e., the portfolio weights. We then calculate the following statistics: the average number of components in the solution vector greater than 0.05, 0.1, and 0.2 and average values of the maximum, minimum, median, 25th percentile, and 5th percentile components of the solution vectors.
As shown in Table 3, G 0.05 , G 0.1 , and G 0.2 represent the average number of components exceeding 0.05, 0.1, and 0.2, respectively. G max and G mid represent the average of the maximum value and the median value of the components. As for the 25th percentile, 5th percentile, and minimum value of the components, all are smaller than 10 19 and are approximately equal to zero within the arithmetic precision; thus, they are not presented or analyzed here.
Based on the five concentration indicators, G 0.05 , G 0.1 , G 0.2 , G max and G mid , a clear gradient of capital allocation emerges: DRES-expectile attains G 0.05 = G 0.1 = G 0.2 = 1 , G max = 1 , and G mid 0 , meaning that in almost every rolling window the entire budget is staked on a single stock and the remaining 129 weights are truncated to machine precision. This “all-in one bet” design amplifies stock selection skill into explosive portfolio returns whenever the portfolio question correctly pinpoints the short-term winner.
ES-expectile remains highly focused but is less extreme: On average, roughly six stocks exceed the 5 % threshold, none exceed 20 % , and the largest position is capped at 11 % ; together with G mid 0 this delivers a milder risk–return trade-off than DRES-expectile.
Mean-ES lies in between: G 0.05 = 1.45 and G 0.2 = 1.26 show that one to two stocks carry more than 20 % each, the largest weight is limited to 86 % , and G mid 1.5 × 10 4 confirms that the long tail is still heavily pruned; the portfolio question therefore preserves high concentration while curbing the tail risk observed with DRES-expectile.
Mean-Std and ES-expectile (under the empirical distribution) display a “barbell” pattern with moderate concentration ( G 0.05 5.6 , G 0.1 0.8 ) and no exposure above 20 % , whereas DRMean-Std (G thresholds all zero, G max = 0.02 , G mid = 0.007 ) spreads the weight almost uniformly across the universe.
Since it is neither realistic nor prudent to invest all the capital in a single stock in typical investment scenarios, we slightly adjust the solution of DRES-expectile by lowering the maximum investment weights to 0.5, 0.3, 0.1, and 0.05, while evenly distributing the remaining weights among the other stocks for data experiments. Additionally, we anticipate that some may question whether we are truly maximizing the expected return, so we also conducted tests to address this. In the following figures and table, we provide a brief explanation of the abbreviations used: DRES-e(50), DRES-e(30), DRES-e(10), and DRES-e(5) represent the results where the maximum investment weights are set to 0.5, 0.3, 0.1, and 0.05, respectively, with the remaining weight evenly distributed. “Mean” represents the solution to the following optimization problem
sup y R n 1 N i = 1 N y ξ ^ i s . t . y i 0 , i = 1 , , n , e T y = 1 .
Table 4 shows that imposing progressively tighter caps on the maximum single-asset weight (from 50% down to 5%) brings a sharp reduction in portfolio variance and expected shortfall, while the corresponding fall in mean return is comparatively small. This characteristic can also be directly observed from Figure 5, Figure 6, Figure 7 and Figure 8. Because volatility declines more rapidly than the mean, the Sharpe and Sortino ratios actually rise, particularly in the 14-day window, which indicates that tail risk in the unconstrained solution is driven chiefly by extreme concentration.
The mean-only benchmark produces weaker risk-adjusted results across every horizon and even shows negative average returns in the 14-day window; therefore, our optimization problem cannot be regarded as a simple return maximization exercise. Over the longest window (21 days) the advantage of DRES-expectile narrows, suggesting that its edge is strongest when the portfolio question is recalibrated frequently. We therefore recommend employing the capped DRES-expectile portfolio as a tactical sleeve on short rolling windows, while pairing it with more conservative strategies to stabilize long-term risk.

6. Discussion

In this section, we first examine the practical benefits of measuring portfolio risk using an expectile-based framework augmented with DRES-expectile, from both managerial and investor viewpoints. We then translate our empirical findings into concrete managerial implications for the design of future portfolio optimization strategies, showing how the DRES-expectile portfolio question can be deployed tactically, integrated into multi-strategy architectures, and governed under real-world risk budgets.

6.1. Risk Measurement Implications for Managers and Investors

The numerical experiments presented earlier clearly demonstrate the practical characteristics of the DRES-expectile portfolio question: superior average return and Sharpe ratio across all sliding windows, but accompanied by higher volatility and concentration risk. These results lead to two distinct sets of implications, one for portfolio managers and risk controllers, and one for investors.

6.1.1. Managerial Perspective

Tail-aware and continuously adjustable risk control. In contrast to “threshold-triggered” jump reactions typical of Value-at-Risk portfolio questions, an expectile-based risk metric offers earlier and smoother tail risk warnings. Thanks to this continuity, managers can employ gradient-based optimization to fine-tune risk in real time, greatly improving the controllability and sensitivity of portfolio adjustments.
Practical governance levers. The primary issues—elevated expected shortfall and high volatility—arise from oversized single-stock exposure. A governance committee can remedy this simply by imposing position-weight caps, a standard control tool. Once these caps are in place, the portfolio’s Sharpe ratio still surpasses that of all classical portfolio questions over 3 to 14 day windows, demonstrating that the framework aligns seamlessly with existing risk management policies.

6.1.2. Investor Perspective

Transparent risk–return trade-off. The ES-expectile metric produces a continuous, tail-sensitive spectrum of risk values. This enables investors to see not just how often losses might occur but also how severely, and how this evolves as portfolio positions are adjusted. Unlike threshold-based measures such as VaR, this “risk spectrum” facilitates better understanding and more informed investment decisions.
Downside protection with controlled sacrifice. When upper bounds are applied (DRES-e(30/10/5)), the portfolio question significantly reduces ES while preserving high Sharpe ratios. Investors obtain a more balanced growth path by accepting marginally lower peaks in exchange for markedly lower drawdowns, which is especially desirable for institutions with capital preservation mandates.

6.2. Managerial Implications for Future Portfolio Optimization Strategies

6.2.1. Strategic Application

Across every window length, the DRES-expectile portfolio question shows a strong, steadily rising performance trajectory, whereas traditional portfolio questions produce much flatter curves. Investors thus observe a continuous risk-evolution path and can detect the build-up of potential tail risk sooner, avoiding the VaR-style “risk-silent interval.” This clearer view of the risk–return trade-off empowers investors to make more forward-looking allocation decisions.
For more risk-averse investors, such as pension funds or insurance portfolios, the DRES-expectile portfolio question should be positioned as a satellite strategy—allocated only within clearly defined risk budgets. This approach enables controlled exposure to potential upside while maintaining the portfolio’s overall risk discipline.

6.2.2. Integration with Other Strategies

To further improve long-term portfolio stability, we recommend integrating the DRES-expectile portfolio question into a broader multi-strategy architecture:
  • Core satellite structure: Use low-volatility portfolio questions such as mean–variance or minimum variance for the core portfolio, and deploy DRES-expectile as a growth-seeking satellite component.
  • Dynamic window adjustment: Vary the sliding window length adaptively based on market volatility or regime shifts, ensuring the portfolio question does not overfit to a fixed horizon.
  • Risk budgeting and cap enforcement: Assign explicit volatility or VaR limits to the DRES-expectile sleeve, and coordinate with other sub-strategies to manage overall portfolio risk. This allows total portfolio exposure to remain within pre-defined limits while leveraging the growth potential of the robust tail-aware portfolio question.

7. Conclusions

We first introduced the expectation gap to extend the traditional risk measure and expectile, and demonstrated its effectiveness by proving that it has a solution. To better control the instability caused by empirical distributions, we employed distributionally robust optimization, introduced necessary theorems, and made reasonable assumptions. We then updated the ES-expectile and simplified the problem using optimization techniques. In numerical experiments, our DRES-expectile method almost exactly captured the trend in stock growth, resulting in the selection of only one stock in each decision, which led to excellent returns, though with a higher standard deviation. We acknowledge that such results may seem unrealistic and lack persuasiveness in practical applications. Therefore, we further conducted data experiments to show that this portfolio problem is not simply an expectation maximization problem. Moreover, by adjusting the weight of extreme solutions, we proposed a more practical portfolio strategy. We believe that this portfolio problem has potential when combined with other conservative investment strategies and holds certain reference value and research significance.

Author Contributions

Conceptualization, H.W., Y.Z. and C.L.; methodology, H.W., Y.Z., C.L. and X.Z.; software, H.W. and Y.Z.; validation, H.W., Y.Z., Y.G. and C.L.; formal analysis, Y.G.; investigation, H.W. and Y.G.; resources, Y.Z.; data curation, Y.G.; writing—original draft preparation, H.W., Y.Z., Y.G. and X.Z.; writing—review and editing, H.W., Y.Z., C.L. and X.Z.; visualization, Y.Z.; supervision, H.W. and C.L.; project administration, H.W. and C.L.; funding acquisition, C.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Science and Technology Research Project of Henan Province of China OF FUNDER grant number 252102240125.

Data Availability Statement

The stock closing price data used in this study are publicly available at Investing.com (https://cn.investing.com), a widely used financial market platform that provides comprehensive financial data, real-time quotes, charts, and financial news on global stocks, indices, commodities, and currencies.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Markowitz, H. The utility of wealth. J. Political Econ. 1952, 60, 151–158. [Google Scholar] [CrossRef]
  2. Chopra, V.K.; Ziemba, W.T. The effect of errors in means, variances, and covariances on optimal portfolio choice. J. Portfolio Manag. 1993, 19, 6–11. [Google Scholar] [CrossRef]
  3. Cai, J.; Weng, C. Optimal reinsurance with expectile. Scand. Actuar. J. 2016, 2016, 624–645. [Google Scholar] [CrossRef]
  4. Bellini, F.; Klar, B.; Müller, A.; Rosazza Gianin, E. Generalized quantiles as risk measures. Insur. Math. Econ. 2014, 54, 41–48. [Google Scholar] [CrossRef]
  5. Zhang, F.; Xu, Y.; Fan, C. Non-parametric inference of expectile-based value-at-risk for financial time series with application to risk assessment. Int. Rev. Financ. Anal. 2023, 90, 102852. [Google Scholar] [CrossRef]
  6. Taylor, J.W. Estimating value at risk and expected shortfall using expectiles. J. Financ. Econom. 2008, 6, 231–252. [Google Scholar] [CrossRef]
  7. Rockafellar, R.T.; Uryasev, S. Conditional value-at-risk for general loss distributions. J. Bank. Financ. 2002, 26, 1443–1471. [Google Scholar] [CrossRef]
  8. Rahimian, H.; Mehrotra, S. Frameworks and results in distributionally robust optimization. Open J. Math. Optim. 2022, 3, 4. [Google Scholar] [CrossRef]
  9. Mohajerin Esfahani, P.; Kuhn, D. Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations. Math. Program. 2018, 171, 115–166. [Google Scholar] [CrossRef]
  10. Gao, R.; Kleywegt, A.J. Distributionally robust stochastic optimization with Wasserstein distance. Math. Oper. Res. 2022, 48, 603–655. [Google Scholar] [CrossRef]
  11. Gao, R.; Chen, X.; Kleywegt, A.J. Wasserstein distributionally robust optimization and variation regularization. Oper. Res. 2022, 72, 1177–1191. [Google Scholar] [CrossRef]
  12. Shafieezadeh-Abadeh, S.; Kuhn, D.; Mohajerin Esfahani, P. Regularization via mass transportation. J. Mach. Learn. Res. 2019, 20, 1–68. [Google Scholar]
  13. Blanchet, J.; Chen, L.; Zhou, X.Y. Distributionally robust mean-variance portfolio selection with Wasserstein distances. Manag. Sci. 2022, 68, 6382–6410. [Google Scholar] [CrossRef]
  14. Chen, D.; Wu, Y.; Li, J.; Zhou, X. Distributionally robust mean–absolute deviation portfolio optimization using the Wasserstein metric. J. Glob. Optim. 2023, 87, 783–805. [Google Scholar] [CrossRef]
  15. Liu, W.; Liu, Y. Worst-case higher-moment coherent risk based on optimal transport with application to distributionally robust portfolio optimization. Symmetry 2022, 14, 138. [Google Scholar] [CrossRef]
  16. Wu, Z.; Sun, K. Distributionally robust optimization with Wasserstein metric for multi-period portfolio selection under uncertainty. Appl. Math. Model. 2023, 117, 513–528. [Google Scholar] [CrossRef]
  17. Zhang, Z.; Jing, H.; Kao, C. High-dimensional distributionally robust mean-variance efficient portfolio selection. Mathematics 2023, 11, 1272. [Google Scholar] [CrossRef]
  18. Fan, Z.; Ji, R.; Lejeune, M.A. Distributionally robust portfolio optimization under marginal and copula ambiguity. J. Optim. Theory Appl. 2024, 203, 2870–2907. [Google Scholar] [CrossRef]
  19. Shapiro, A. On duality theory of conic linear problems. Nonconvex Optim. Appl. 2001, 57, 135–155. [Google Scholar] [CrossRef]
  20. Chaudhry, A.; Johnson, H.L. The efficacy of the Sortino ratio and other benchmarked performance measures under skewed return distributions. Aust. J. Manag. 2008, 32, 485–502. [Google Scholar] [CrossRef]
  21. Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]
Figure 1. Cumulative returns of all models with a sliding window of 3.
Figure 1. Cumulative returns of all models with a sliding window of 3.
Symmetry 17 00959 g001
Figure 2. Cumulative returns of all models with a sliding window of 7.
Figure 2. Cumulative returns of all models with a sliding window of 7.
Symmetry 17 00959 g002
Figure 3. Cumulative returns of all models with a sliding window of 14.
Figure 3. Cumulative returns of all models with a sliding window of 14.
Symmetry 17 00959 g003
Figure 4. Cumulative returns of all models with a sliding window of 21.
Figure 4. Cumulative returns of all models with a sliding window of 21.
Symmetry 17 00959 g004
Figure 5. Cumulative returns with a sliding window of 3 (improved DRES-expectile vs. mean).
Figure 5. Cumulative returns with a sliding window of 3 (improved DRES-expectile vs. mean).
Symmetry 17 00959 g005
Figure 6. Cumulative returns with a sliding window of 7 (improved DRES-expectile vs. mean).
Figure 6. Cumulative returns with a sliding window of 7 (improved DRES-expectile vs. mean).
Symmetry 17 00959 g006
Figure 7. Cumulative returns with a sliding window of 14 (improved DRES-expectile vs. mean).
Figure 7. Cumulative returns with a sliding window of 14 (improved DRES-expectile vs. mean).
Symmetry 17 00959 g007
Figure 8. Cumulative returns with a sliding window of 21 (improved DRES-expectile vs. mean).
Figure 8. Cumulative returns with a sliding window of 21 (improved DRES-expectile vs. mean).
Symmetry 17 00959 g008
Table 1. Comprehensive list of symbols and acronyms.
Table 1. Comprehensive list of symbols and acronyms.
Symbol/AbbreviationEnglish Full Name
VaR θ ( X ) Value at Risk at confidence level θ
ES β ( X ) Expected shortfall at confidence level β
q α ( X ) Expectile at level α
ES-expectileExpectile defined using asymmetric
expected shortfall on both tails
q α , β 1 , β 2 ( X ) ES-expectile with parameters α , β 1 , and β 2
DRODistributionally robust optimization
DRES-expectileDistributionally robust ES-expectile
RU FormulaRockafellar–Uryasev Formula (1)
Table 2. Performance metrics for different questions across sliding windows.
Table 2. Performance metrics for different questions across sliding windows.
WindowQuestionMeanStandard DeviationSharpe RatioExpected ShortfallSortino Ratio
3 daysDRES-e6.25 × 10−34.24 × 10−21.44 × 10−1−7.58 × 10−22.56 × 10−1
ES-e6.92 × 10−44.06 × 10−31.32 × 10−1−7.99 × 10−32.37 × 10−1
1/N1.04 × 10−37.21 × 10−31.23 × 10−1−1.33 × 10−21.27 × 10−1
Mean-Std5.21 × 10−44.32 × 10−38.46 × 10−2−9.57 × 10−31.33 × 10−1
DRMean-Std1.01 × 10−37.24 × 10−31.18 × 10−1−1.37 × 10−21.97 × 10−1
Mean-ES2.98 × 10−33.06 × 10−29.21 × 10−2−5.82 × 10−21.58 × 10−1
7 daysDRES-e7.95 × 10−34.49 × 10−21.73 × 10−1−7.98 × 10−22.87 × 10−1
ES-e6.90 × 10−44.18 × 10−31.28 × 10−1−8.37 × 10−32.09 × 10−1
1/N1.03 × 10−37.24 × 10−31.21 × 10−1−1.33 × 10−22.13 × 10−1
Mean-Std5.87 × 10−44.45 × 10−39.70 × 10−2−9.77 × 10−31.46 × 10−1
DRMean-Std1.01 × 10−37.25 × 10−31.18 × 10−1−1.35 × 10−22.03 × 10−1
Mean-ES2.42 × 10−33.10 × 10−27.28 × 10−2−6.21 × 10−21.19 × 10−1
14 daysDRES-e9.12 × 10−34.79 × 10−21.87 × 10−1−7.99 × 10−23.77 × 10−1
ES-e7.56 × 10−44.19 × 10−31.43 × 10−1−8.39 × 10−32.25 × 10−1
1/N1.04 × 10−37.29 × 10−31.21 × 10−1−1.33 × 10−22.14 × 10−1
Mean-Std6.95 × 10−44.52 × 10−31.19 × 10−1−9.63 × 10−31.75 × 10−1
DRMean-Std1.01 × 10−37.26 × 10−31.17 × 10−1−1.34 × 10−22.00 × 10−1
Mean-ES−2.99 × 10−43.40 × 10−2−1.34 × 10−2−7.68 × 10−22.57 × 10−2
21 daysDRES-e5.43 × 10−34.96 × 10−21.06 × 10−1−1.02 × 10−11.55 × 10−1
ES-e7.93 × 10−44.20 × 10−31.52 × 10−1−8.57 × 10−32.31 × 10−1
1/N1.02 × 10−37.33 × 10−31.18 × 10−1−1.34 × 10−22.13 × 10−1
Mean-Std7.15 × 10−44.53 × 10−31.23 × 10−1−9.79 × 10−31.80 × 10−1
DRMean-Std9.90 × 10−47.27 × 10−31.15 × 10−1−1.36 × 10−22.01 × 10−1
Mean-ES1.66 × 10−33.03 × 10−24.97 × 10−2−6.60 × 10−27.26 × 10−2
Table 3. Statistical summary of the different questions’ solutions.
Table 3. Statistical summary of the different questions’ solutions.
Question G 0.05 G 0.1 G 0.2 G max G mid
DRES-e1.001.001.001.00 6.69 × 10 18
ES-e5.720.810.000.11 5.44 × 10 18
Mean-Std5.600.700.000.11 4.27 × 10 18
DRMean-Std0.000.000.000.02 7.11 × 10 3
Mean-ES1.451.451.260.86 1.45 × 10 4
Table 4. Performance metrics for the improved DRES-expectile solution and mean questions.
Table 4. Performance metrics for the improved DRES-expectile solution and mean questions.
WindowModelMeanVarianceSharpe RatioExpected ShortfallSortino Ratio
3 daysDRES-e(50)3.24 × 10−32.23 × 10−21.38 × 10−1−3.88 × 10−22.68 × 10−1
DRES-e(30)2.34 × 10−31.52 × 10−21.44 × 10−1−2.68 × 10−22.79 × 10−1
DRES-e(10)1.44 × 10−39.07 × 10−31.41 × 10−1−1.59 × 10−22.68 × 10−1
DRES-e(5)1.49 × 10−39.40 × 10−31.42 × 10−1−1.65 × 10−22.68 × 10−1
mu2.81 × 10−33.13 × 10−28.50 × 10−2−5.89 × 10−21.46 × 10−1
7 daysDRES-e(50)4.29 × 10−32.46 × 10−21.68 × 10−1−4.45 × 10−23.09 × 10−1
DRES-e(30)2.96 × 10−31.66 × 10−21.70 × 10−1−2.98 × 10−23.19 × 10−1
DRES-e(10)1.64 × 10−39.46 × 10−31.56 × 10−1−1.65 × 10−23.00 × 10−1
DRES-e(5)1.68 × 10−39.79 × 10−31.56 × 10−1−1.70 × 10−22.99 × 10−1
mu1.92 × 10−33.20 × 10−25.51 × 10−2−6.47 × 10−28.75 × 10−2
14 daysDRES-e(50)4.91 × 10−32.46 × 10−21.94 × 10−1−4.07 × 10−23.80 × 10−1
DRES-e(30)3.34 × 10−31.66 × 10−21.92 × 10−1−2.79 × 10−23.77 × 10−1
DRES-e(10)1.76 × 10−39.53 × 10−31.68 × 10−1−1.64 × 10−23.25 × 10−1
DRES-e(5)1.81 × 10−39.86 × 10−31.67 × 10−1−1.70 × 10−23.24 × 10−1
mu−5.00 × 10−43.45 × 10−2−1.90 × 10−2−7.82 × 10−2−2.53 × 10−2
21 daysDRES-e(50)3.21 × 10−32.66 × 10−21.15 × 10−1−5.37 × 10−21.74 × 10−1
DRES-e(30)2.32 × 10−31.78 × 10−21.22 × 10−1−3.49 × 10−21.95 × 10−1
DRES-e(10)1.43 × 10−39.79 × 10−31.30 × 10−1−1.77 × 10−22.38 × 10−1
DRES-e(5)1.48 × 10−31.01 × 10−21.31 × 10−1−1.82 × 10−22.39 × 10−1
mu1.57 × 10−33.02 × 10−24.68 × 10−2−6.60 × 10−26.87 × 10−2
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, H.; Zhao, Y.; Guo, Y.; Liu, C.; Zhang, X. Symmetric Adjustable Tail-Risk Measure for Distributionally Robust Optimization in Portfolio Allocation. Symmetry 2025, 17, 959. https://doi.org/10.3390/sym17060959

AMA Style

Wang H, Zhao Y, Guo Y, Liu C, Zhang X. Symmetric Adjustable Tail-Risk Measure for Distributionally Robust Optimization in Portfolio Allocation. Symmetry. 2025; 17(6):959. https://doi.org/10.3390/sym17060959

Chicago/Turabian Style

Wang, Haonan, Yunxiao Zhao, Yixin Guo, Changhe Liu, and Xinlin Zhang. 2025. "Symmetric Adjustable Tail-Risk Measure for Distributionally Robust Optimization in Portfolio Allocation" Symmetry 17, no. 6: 959. https://doi.org/10.3390/sym17060959

APA Style

Wang, H., Zhao, Y., Guo, Y., Liu, C., & Zhang, X. (2025). Symmetric Adjustable Tail-Risk Measure for Distributionally Robust Optimization in Portfolio Allocation. Symmetry, 17(6), 959. https://doi.org/10.3390/sym17060959

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop