A General Framework for Portfolio Theory—Part I: Theory and Various Models

Stanislaus Maier-Paape; Qiji Jim Zhu

doi:10.3390/risks6020053

and

¹

Institut für Mathematik, RWTH Aachen University, Templergraben 55, 52062 Aachen, Germany

²

Department of Mathematics, Western Michigan University, 1903 West Michigan Avenue, Kalamazoo, MI 49008, USA

^*

Author to whom correspondence should be addressed.

Risks2018, 6(2), 53;https://doi.org/10.3390/risks6020053

This article belongs to the Special Issue Computational Methods for Risk Management in Economics and Finance

Version Notes

Order Reprints

Abstract

Utility and risk are two often competing measurements on the investment success. We show that efficient trade-off between these two measurements for investment portfolios happens, in general, on a convex curve in the two-dimensional space of utility and risk. This is a rather general pattern. The modern portfolio theory of Markowitz (1959) and the capital market pricing model Sharpe (1964), are special cases of our general framework when the risk measure is taken to be the standard deviation and the utility function is the identity mapping. Using our general framework, we also recover and extend the results in Rockafellar et al. (2006), which were already an extension of the capital market pricing model to allow for the use of more general deviation measures. This generalized capital asset pricing model also applies to e.g., when an approximation of the maximum drawdown is considered as a risk measure. Furthermore, the consideration of a general utility function allows for going beyond the “additive” performance measure to a “multiplicative” one of cumulative returns by using the log utility. As a result, the growth optimal portfolio theory Lintner (1965) and the leverage space portfolio theory Vince (2009) can also be understood and enhanced under our general framework. Thus, this general framework allows a unification of several important existing portfolio theories and goes far beyond. For simplicity of presentation, we phrase all for a finite underlying probability space and a one period market model, but generalizations to more complex structures are straightforward.

Keywords:

convex programming; financial mathematics; risk measure; utility functions; efficient frontier; Markowitz portfolio theory; capital market pricing model; growth optimal portfolio; fractional Kelly allocation

MSC:

52A41; 90C25; 91G99

1. Introduction

The modern portfolio theory of Markowitz (1959) pioneered the quantitative analysis of financial economics. The most important idea proposed in this theory is that one should focus on the trade-off between expected return and the risk measured by the standard deviation. Mathematically, the modern portfolio theory leads to a quadratic optimization problem with linear constraints. Using this simple mathematical structure, Markowitz gave a complete characterization of the efficient frontier for trade-off of the return and risk. Tobin showed that the efficient portfolios are an affine function of the expected return Tobin (1958). Markowitz portfolio theory was later generalized by Lintner (1965), Mossin (1966), Sharpe (1964) and Treynor (1999) in the capital asset pricing model (CAPM) by involving a riskless bond. In the CAPM model, both the efficient frontier and the related efficient portfolios are affine in terms of the expected return (Sharpe 1964; Tobin 1958).

The nice structures of the solutions in the modern portfolio theory and the CAPM model afford many applications. For example, the CAPM model is designed to provide reasonable equilibrium prices for risky assets in the market place. Sharpe used the ratio of excess return to risk (called the Sharpe ratio) to provide a measurement for investment performance (Sharpe 1966). In addition, the affine structure of the efficient portfolio in terms of the expected return leads to the concept of a market portfolio as well as the two fund theorem (Tobin 1958) and the one fund theorem (Sharpe 1964; Tobin 1958). These results provided a theoretical foundation for passive investment strategies.

In many practical portfolio problems, however, one needs to consider more general pairs of reward and risk. For example, the growth portfolio theory can be viewed as maximizing the log utility of a portfolio. In order to address the issue that an optimal growth portfolio is usually too risky in practice, practitioners often have to impose additional restrictions on the risk (MacLean et al. 2009; Vince 2009; Vince and Zhu 2015). In particular, current drawdown (Maier-Paape 2016), maximum drawdown and its approximations (De Prado et al. 2013; Maier-Paape 2015; Vince and Zhu 2015), deviation measure (Rockafellar et al. 2006), as well as conditional value at risk (Rockafellar and Uryasev 2000) and more abstract coherent risk measures (Artzner et al. 1999) are widely used as risk measures in practice. Risk, as measured by such criteria, is reduced by diversification. Mathematically, it is to say these risk measures are convex. For these reasons, considering the trade-off between general risk measures and expected utilities are crucial in portfolio problems. In particular, including risk measures beyond positive homogeneous risk measures allows for measuring risk by drawdown (see Maier-Paape and Zhu (2017)), a concept to which many practitioners are sensitive.

The goal and main results of this paper are to extend the modern portfolio theory into a general framework under which one can analyze efficient portfolios that trade-off between a convex risk measure and a reward captured by a concave expected utility (see Section 3). We phrase our primal problem as a convex portfolio optimization problem of minimizing a convex risk measure subject to the constraint that the expected utility of the portfolio is above a certain level. Thus, convex duality plays a crucial role and the structure of the solutions to both the primal and dual problems often have significant financial implications. We show that, in the space of risk measure and expected utility, efficient trade-off happens on an increasing concave curve (cf. Proposition 8 and Theorem 4). We also show that the efficient portfolios continuously depend on the level of the expected utility (see Theorem 5), and moreover, we can describe the curve of efficient portfolios quantitatively in a precise manner (cf. Proposition 9 and Corollary 2).

To avoid technical complications, we restrict our analysis to the practical case in which the status of an underlying economy is represented by a finite sample space. Under this restriction, the Markowitz modern portfolio theory and the capital asset pricing model are special cases of this general theory. Markowitz determines portfolios of purely risky assets which provide an efficient trade-off between expected return and risk measured by the standard deviation (or equivalently the variance). Mathematically, this is a class of convex programming problems of minimizing the standard deviation of the portfolio parameterized by the level of the expected returns. The capital asset pricing model, in essence, extends the Markowitz modern portfolio theory by including a riskless bond in the portfolio. We observe that the space of the risk-expected return is, in fact, the space corresponding to the dual of the Markowitz portfolio problem. The shape of the famous Markowitz bullet is a manifestation of the well known fact that the optimal value function of a convex programming problem is convex with respect to the level of constraint. As mentioned above, the Markowitz portfolio problem is a quadratic optimization problem with linear constraint. This special structure of the problem dictates the affine structure of the optimal portfolio as a function of the expected return (see Theorem 6). This affine structure leads to the important two fund theorem (cf. Theorem 7) that provides a theoretical foundation for the passive investment method. For the capital asset pricing model, such an affine structure appears in both the primal and dual representation of the solutions, which leads to the one fund theorem in the portfolio space and the capital market line in the dual space of risk-return trade-off (cf. Theorems 8 and 9).

The flexibility in choosing different risk measures allows us to extend the analysis of the essentially quadratic risk measure pioneered by Markowitz to a wider range. For example, when a deviation measure (Rockafellar et al. 2006) is used as risk measure, which happens e.g., when an approximation of the current drawdown is considered (see Maier-Paape and Zhu (2017)), and the expected return is used to gauge the performance, we show that the affine structure of the efficient solution in the classical capital market pricing model is preserved (cf. Theorem 10 and Corollary 3), recovering and extending especially the results in Rockafellar et al. (2006). In particular, we can show that the condition in CAPM that ensures the existence of a market portfolio has a full generalization to portfolio problems with positive homogeneous risk measures (see Theorem 11). This is significant in that it shows that the passive investment strategy is justifiable in a wide range of settings.

The consideration of a general utility function, however, allows us to go beyond the “additive” performance measure in modern portfolio theory to a “multiplicative” one including cumulative returns when, for example, using the log utility. As a result, the growth optimal portfolio theory (Lintner 1965) and the leverage space portfolio theory (Vince 2009) can also be understood under our general framework. The optimal growth portfolio pursues to maximize the expected log utility that is equivalent to maximize the expected cumulative compound return. It is known that the growth optimal portfolio is usually too risky. Thus, practitioners often scale back the risky exposure from a growth optimal portfolio. In our general framework, we consider the portfolio that minimizes a risk measure given a fixed level of expected log utility. Under reasonable conditions, we show that such portfolios form a path parameterized by the level of expected log utility in the portfolio space that connects the optimal growth portfolio and the portfolio of a riskless bond (see Theorem 13). In general, for different risk measures, we will derive different paths. These paths provide justifications for risk reducing curves proposed in the leverage space portfolio theory (Vince 2009). The dual problem projects the efficient trade-off path into a concave curve in the risk-expected log utility space parallel to the role of Markowitz bullet in the modern portfolio theory and the capital market line in the capital asset pricing model. Under reasonable assumptions, the efficient frontier for log utility is a bounded increasing concave curve. The lower left endpoint of the curve corresponds to the portfolio of pure riskless bond and the upper right endpoint corresponds to the growth optimal portfolio. The increasing nature of the curve tells us that the more risk we take, the more cumulative return we can expect. The concavity of the curve indicates, however, that, with the increase of the risk, the marginal increase of the expected cumulative return will decrease.

Markowitz portfolio theory essentially maximizes a linear expected utility while the growth optimal portfolio focuses on the log utility. Other utility functions were also considered in portfolio problems. Our general framework brings them together in a unified way. Besides unifying the several important results laid out above, the general framework, furthermore, has many new applications. In this first installment of the paper, we layout the framework, derive the theoretical results of crucial importance and illustrate them with a few examples. More specific results on drawdown risk measures will appear in Maier-Paape and Zhu (2017). We arrange the paper as follows: first, we discuss necessary preliminaries in the next section. Section 3 is devoted to our main result: a framework to efficient trade-off between risk and utility of portfolios and its properties. In Section 4, we give a unified treatment of Markowitz portfolio theory and capital asset pricing model using our framework. Section 5 is devoted to a discussion of positive homogeneous risk measures under which the optimal trade-off portfolio possesses an affine structure. This situation fully generalizes Markowitz and CAPM theories and thus many of the conditions in Section 4 find an analog in Section 5. Section 6 discusses growth optimal portfolio theory and leverage portfolio theory. We conclude in Section 7 pointing to applications worthy of further investigation.

2. Preliminaries

2.1. A Portfolio Model

We consider a simple one period financial market model S on an economy with finite states represented by a sample space

Ω = {ω_{1}, ω_{2}, \dots, ω_{N}}

. We use a probability space

(Ω, 2^{Ω}, P)

to represent the states of the economy and their corresponding probability of occurring, where

2^{Ω}

is the algebra of all subsets of

Ω

. The space of random variables on

(Ω, 2^{Ω}, P)

is denoted

R V (Ω, 2^{Ω}, P)

and it is used to represent the payoff of risky financial assets. Since the sample space

Ω

is finite,

R V (Ω, 2^{Ω}, P)

is a finite dimensional vector space. We use

R V_{+} (Ω, 2^{Ω}, P)

to represent of the cone of nonnegative random variables in

R V (Ω, 2^{Ω}, P)

. Introducing the inner product

{⟨ X, Y ⟩}_{R V} = E [X Y], X, Y \in R V (Ω, 2^{Ω}, P),

R V (Ω, 2^{Ω}, P)

becomes a (finite dimensional) Hilbert space.

Definition 1.

(Financial Market) We say that

S_{t} = {(S_{t}^{0}, S_{t}^{1}, \dots, S_{t}^{M})}^{⊤}, t = 0, 1

is a financial market in a one period economy provided that

S_{0} \in R_{+}^{M + 1}

and

S_{1} \in (0, \infty) \times R V_{+} {(Ω, 2^{Ω}, P)}^{M}

. Here,

S_{0}^{0} = 1, S_{1}^{0} = R > 0

represents a risk free bond with a positive return when

R > 1

. The rest of the components

S_{t}^{m}, m = 1, \dots, M

represent the price of the m-th risky financial asset at time t.

We will use the notation

{\hat{S}}_{t} = {(S_{t}^{1}, \dots, S_{t}^{M})}^{⊤}

when we need to focus on the risky assets. We assume that

S_{0}

is a constant vector representing the prices of the assets in this financial market at

t = 0

. The risk is modeled by assuming

{\hat{S}}_{1} = {(S_{1}^{1}, \dots, S_{1}^{M})}^{⊤}

to be a nonnegative random vector on the probability space

(Ω, 2^{Ω}, P)

, that is

S_{1}^{m} \in R V_{+} (Ω, 2^{Ω}, P), m = 1, 2, \dots, M

. A portfolio is a column vector

x \in R^{M + 1}

whose components

x_{m}

represent the share of the m-th asset in the portfolio and

S_{t}^{m} x_{m}

is the portion of capital invested in asset m at time t. Hence,

x_{0}

corresponds to the investment in the risk free bond and

\hat{x} = {(x_{1}, \dots, x_{M})}^{⊤}

is the risky part.

Remark 1.

Restricting to a finite sample space avoids the distraction of technical difficulties. This is also practical since, in the real world, one can only use a finite quantity of information. Furthermore, we restrict our presentation to the one period market model. However, more complex sample spaces and market models such as multi-period financial models should be treatable with a similar approach.

We often need to restrict the selection of portfolios. For example, in many applications, we consider only portfolios with unit initial cost, i.e.,

S_{0}^{⊤} x = 1

. The following definition makes this precise.

Definition 2.

(Admissible Portfolio) We say that

A \subset R^{M + 1}

is a set of admissible portfolios provided that A is a nonempty closed and convex set. We say that A is a set of admissible portfolios with unit initial price provided that A is a closed convex subset of

{x \in R^{M + 1} : S_{0}^{⊤} x = 1}

.

2.2. Convex Programming

The trade-off between convex risks and concave expected utilities yields essentially convex programming problems. For convenience of the reader, we collect notation and relevant results in convex analysis, which are important in the discussion below. We omit most of the proofs that can be found in Borwein and Zhu (2016); Carr and Zhu (forthcoming); Rockafellar (1970). Readers who know convex programming well can skip this section.

Let X be a finite dimensional Banach space. Recall that a set

C \subset X

is convex if, for any

x, y \in C

and

s \in [0, 1]

,

s x + (1 - s) y \in C

. For an extended valued function

f : X \to R \cup {+ \infty}

, we define its domain by

dom (f) : = {x \in X : f (x) < \infty}

and its epigraph by

epi (f) : = {(x, r) \in X \times R : r \geq f (x)} .

We say f is lower semi-continuous if

epi (f)

is a closed set. The following proposition characterizes an epigraph of a function.

Proposition 1.

(Characterization of Epigraph) Let F be a closed subset of

X \times R

such that

inf {r : (x, r) \in F} > - \infty

for all

x \in R

. Then, F is the epigraph for a lower semi-continuous function

f : X \to (- \infty, \infty]

, i.e.,

F = epi (f)

, if and only if

\begin{matrix} (x, r) \in F \Rightarrow (x, r + k) \in F, \forall k > 0 . \end{matrix}

(1)

Proof.

The key is to observe that, for a set F with the structure in (1), a function

\begin{matrix} f (x) = inf {r : (x, r) \in F} \end{matrix}

(2)

is well defined and then

F = epi (f)

holds. ☐

We say a function f is convex if

epi (f)

is a convex set. Alternatively, f is convex if and only if, for any

x, y \in dom (f)

and

s \in [0, 1]

,

f (s x + (1 - s) y) \leq s f (x) + (1 - s) f (y) .

Consider

f : X \to [- \infty, + \infty)

. We say f is concave when

- f

is convex and we say f is upper semi-continuous if

- f

is lower semi-continuous. Define the hypograph of a function f by

hypo (f) = {(x, r) \in X \times R : r \leq f (x)} .

Then, a symmetric version of Proposition 1 is

Proposition 2.

(Characterization of Hypograph) Let F be a closed subset of

X \times R

such that

sup {r : (x, r) \in F} < + \infty

for all

x \in R

. Then, F is the hypograph of an upper semi-continuous function

f : X \to [- \infty, \infty)

, i.e.,

F = hypo (f)

, if and only if

\begin{matrix} (x, r) \in F \Rightarrow (x, r - k) \in F, \forall k > 0 . \end{matrix}

(3)

Moreover, the function f can be defined by

\begin{matrix} f (x) = sup {r : (x, r) \in F} . \end{matrix}

(4)

Remark 2.

The value of the function f in Proposition 1 (Proposition 2) at a given point x cannot assume

- \infty

(

+ \infty

) and therefore

{x} \times R \neg \subset F

.

Since utility functions are concave and risk measures are usually convex, the analysis of a general trade-off between utility and risk naturally leads to a convex programming problem. The general form of such convex programming problems is

\begin{matrix} v (y, z) : = inf_{x \in X} [f (x) : g (x) \leq y, h (x) = z], for y \in R^{M}, z \in R^{N}, \end{matrix}

(5)

where f, g and h satisfy the following assumption.

Assumption 1.

Assume that

f : X \to R \cup {+ \infty}

is a lower semi-continuous extended valued convex function,

g : X \to R^{M}

is a vector valued function with convex components, ≤ signifies componentwise minorization and

h : X \to R^{N}

is an affine mapping, for natural numbers

M, N

. Moreover, at least one of the components of g has compact sublevel sets.

Convex programming problems have nice properties due to the convex structure. We briefly recall the pertinent results related to convex programming. First, the optimal value function v is convex. This is a well-known result that can be found in standard books on convex analysis, e.g., Borwein and Zhu (2005).

Proposition 3.

(Convexity of Optimal Value Function) Let f, g and h satisfy Assumption 1. Then, the optimal value function v in the convex programming problem (5) is convex and lower semi-continuous.

By and large, there are two (equivalent) general approaches to help solving a convex programming problem: by using the related dual problem and by using Lagrange multipliers. The two methods are equivalent in the sense that a solution to the dual problem is exactly a Lagrange multiplier (see Borwein and Zhu (2016)). Using Lagrange multipliers is more accessible to practitioners outside the special area of convex analysis. We will take this approach. The Lagrange multipliers method tells us that, under mild assumptions, we can expect there exists a Lagrange multiplier

λ = (λ_{y}, λ_{z}) \in R^{M} \times R^{N}

with

λ_{y} \geq 0

such that

\bar{x}

is a solution to the convex programming problem (5) if and only if it is a solution to the unconstrained problem of minimizing

\begin{matrix} L (x, λ) & : = & f (x) + {⟨ λ, (g (x) - y, h (x) - z) ⟩}_{R^{M} \times R^{N}} \\ = & f (x) + {⟨ λ_{y}, g (x) - y ⟩}_{R^{M}} + {⟨ λ_{z}, h (x) - z ⟩}_{R^{N}} . \end{matrix}

(6)

The function

L (x, λ)

is called the Lagrangian. To understand why and when a Lagrange multiplier exists, we need to recall the definition of the subdifferential.

Definition 3.

(Subdifferential) Let X be a finite dimensional Banach space and

X^{*}

its dual space. The subdifferential of a lower semi-continuous convex function

ϕ : X \to R \cup {+ \infty}

at

x \in dom (ϕ)

is defined by

\partial ϕ (x) = {x^{*} \in X^{*} : ϕ (y) - ϕ (x) \geq ⟨ x^{*}, y - x ⟩ \forall y \in X} .

Geometrically, an element of the subdifferential gives us the normal vector of a support hyperplane for the convex function at the relevant point. It turns out that Lagrange multipliers of problem (5) are simply the negative of elements of the subdifferential of v as summarized in the lemma below.

Theorem 1.

(Lagrange Multiplier) Let

v : R^{M} \times R^{N} \to R \cup {+ \infty}

be the optimal value function of the constrained optimization problem (5) with

f, g

and h satisfying Assumption 1. Suppose that, for fixed

(y, z) \in R^{M} \times R^{N}

,

- λ = - (λ_{y}, λ_{z}) \in \partial v (y, z)

and

\bar{x}

is a solution of (5). Then,

(i): $λ_{y} \geq 0$ ,
(ii): the Lagrangian $L (x, λ)$ defined in (6) attains a global minimum at $\bar{x}$ , and
(iii): λ satisfies the complementary slackness condition

$\begin{matrix} ⟨ λ, (g (\bar{x}) - y, h (\bar{x}) - z) ⟩ = ⟨ λ_{y}, g (\bar{x}) - y ⟩ = 0, \end{matrix}$

(7)

where $⟨ \cdot, \cdot ⟩$ signifies the inner product.

Proof.

See (Carr and Zhu forthcoming, Theorem 1.2.15). ☐

Remark 3.

By Theorem 1 Lagrange multipliers exist when (5) has a solution

\bar{x}

and

\partial v (y, z) \neq \emptyset

. Calculating

\partial v (y, z)

requires to know the value of v in a neighborhood of

(y, z)

and is not realistic. Fortunately, the well-known Fenchel–Rockafellar theorem (see e.g., Borwein and Zhu (2005)) tells us when

(y, z)

belongs to the relative interior of

dom (v)

, then

\partial v (y, z) \neq \emptyset

. This is a very useful sufficient condition. A particularly useful special case is the Slater condition (see also Borwein and Zhu (2005)): there exists

x \in dom (f)

such that

g (x) < y

. Under this condition,

\partial v (y) \neq \emptyset

holds.

3. Efficient Trade-Off between Risk and Utility

We consider the financial market described in Definition 1 and consider a set of admissible portfolios

A \subset R^{M + 1}

(see Definition 2). The payoff of each portfolio

x \in A

at time

t = 1

is

S_{1}^{⊤} x

. The merit of a portfolio x is often judged by its expected utility

E [u (S_{1}^{⊤} x)],

where u is an increasing concave utility function. The increasing property of u models the more payoff the better. The concavity reflects the fact that, with the increase of payoff, its marginal utility to an investor decreases. On the other hand, investors are often sensitive to the risk of a portfolio that can be gauged by a risk measure. Because diversification reduces risk, the risk measure should be a convex function.

3.1. Technical Assumptions

Some standard assumptions on the utility and risk functions are often needed in the more technical discussion below. We collect them here.

Assumption 2.

(Conditions on Risk Measure) Consider a continuous risk function

r : A \to [0, + \infty)

where A is a set of admissible portfolios according to Definition 2. We will often refer to some of the following assumptions:

(r1): (Riskless Asset Contributes No risk) The risk measure $r (x) = \hat{r} (\hat{x})$ is a function of only the risky part of the portfolio, where $x^{⊤} = (x_{0}, {\hat{x}}^{⊤})$ .
(r1n): (Normalization) There is at least one portfolio of purely bonds in A. Furthermore, $r (x) = 0$ if and only if x contains only riskless bonds, i.e., $x^{⊤} = (x_{0}, {\hat{0}}^{⊤})$ for some $x_{0} \in R$ .
(r2): (Diversification Reduces Risk) The risk function $r$ is convex.
(r2s): (Diversification Strictly Reduces Risk) The risk function $\hat{r}$ is strictly convex.
(r3): (Positive homogeneous) For $t > 0$ , $\hat{r} (t \hat{x}) = t \hat{r} (\hat{x})$ .
(r3s): (Diversification Strictly Reduces Risk on Level Sets) The risk function $\hat{r}$ satisfies (r3) and, for all $\hat{x} \neq \hat{y}$ with $\hat{r} (\hat{x}) = \hat{r} (\hat{y}) = 1$ and $α \in (0, 1)$ ,

$\hat{r} (α \hat{x} + (1 - α) \hat{y}) < α \hat{r} (\hat{x}) + (1 - α) \hat{r} (\hat{y}) = 1 .$

Condition (r3) precludes (r2s). Thus, condition (r3s) serves as a replacement for (r2s) when the risk measure satisfies (r3). Moreover, we have the following useful result.

Lemma 1.

Assuming a risk measure

r

satisfies (r1), (r1n) and (r3s). Then,

(a): $r$ satisfies (r2), and
(b): $f (x) = \hat{f} (\hat{x}) = {[\hat{r} (\hat{x})]}^{2}$ satisfies (r1), (r1n) and (r2s).

Proof.

Let

α \in (0, 1)

and

\hat{x} \neq \hat{y}

be given. If

\hat{x}

and

\hat{y}

lie on the same ray through

\hat{0}

, say

\hat{x} = c \hat{y}

for some

c \geq 0

, then convexity of

\hat{r}

there is clear due to (r3). For

\hat{x}

and

\hat{y}

not on the same ray and with

\hat{x} / \hat{r} (\hat{x}) \neq \hat{y} / \hat{r} (\hat{y})

, defining

λ : = \frac{α \hat{r} (\hat{x})}{α \hat{r} (\hat{x}) + (1 - α) \hat{r} (\hat{y})},

we have

1 - λ = \frac{(1 - α) \hat{r} (\hat{y})}{α \hat{r} (\hat{x}) + (1 - α) \hat{r} (\hat{y})},

and since

\hat{r} (\hat{x} / \hat{r} (\hat{x})) = \hat{r} (\hat{y} / \hat{r} (\hat{y})) = 1

, by (r3s), we have

\begin{matrix} 1 > \hat{r} (λ \hat{x} / \hat{r} (\hat{x}) + (1 - λ) \hat{y} / \hat{r} (\hat{y})) = \hat{r} (\frac{α \hat{x} + (1 - α) \hat{y}}{α \hat{r} (\hat{x}) + (1 - α) \hat{r} (\hat{y})}) = \frac{\hat{r} (α \hat{x} + (1 - α) \hat{y})}{α \hat{r} (\hat{x}) + (1 - α) \hat{r} (\hat{y})}, \end{matrix}

(8)

verifying (r2) for

r

since

r (x) = \hat{r} (\hat{x})

depends only on

\hat{x}

by (r1).

Clearly,

\hat{f} (\hat{x}) = {[\hat{r} (\hat{x})]}^{2}

has the properties (r1) and (r1n). Squaring (8), we derive

\begin{matrix} {[\hat{r} (α \hat{x} + (1 - α) \hat{y})]}^{2} < {[α \hat{r} (\hat{x}) + (1 - α) \hat{r} (\hat{y})]}^{2} \leq α {[\hat{r} (\hat{x})]}^{2} + (1 - α) {[\hat{r} (\hat{y})]}^{2} . \end{matrix}

(9)

Furthermore, on rays

{\hat{x} ∣ \hat{x} = c \hat{y}, c \geq 0}

due to (r3), we have

\hat{f} (t \hat{y}) = t^{2} \hat{f} (\hat{y})

and the strict convexity of

\hat{f}

there is clear as well. Hence, the square of the risk measure satisfies (r2s). ☐

Remark 4.

(Deviation measure) Our risk measure is described in terms of the portfolio. Assumptions (r1), (r1n), (r2) and (r3) are equivalent to the axioms of a deviation measure in Rockafellar et al. (2006), which is described in terms of the random payoff variable generated by the portfolio. Assumption (r1) excludes the widely used coherent risk measure introduced in Artzner et al. (1999), which requires cash reserve, reduces risk.

Assumption 3.

(Conditions on Utility Function) Utility functions

u : R \to R \cup {- \infty}

are upper semi-continuous functions on their domain

dom (u) = {t \in R : u (t) > - \infty}

and are usually assumed to satisfy some of the following properties:

(u1): (Profit Seeking) The utility function u is an increasing function.
(u2): (Diminishing Marginal Utility) The utility function u is concave.
(u2s): (Strict Diminishing Marginal Utility) The utility function u is strictly concave.
(u3): (Bankrupcy Forbidden) For $t < 0$ , $u (t) = - \infty$ .
(u4): (Unlimited Growth) For $t \to + \infty$ , we have $u (t) \to + \infty$ .

Another important condition that often appears in the financial literature is no arbitrage (see (Carr and Zhu forthcoming, Definition 3.5)). In the sequel, it is also useful to have two other related concepts.

Definition 4.

Consider a portfolio

x \in R^{M + 1}

on the financial market

S_{t}

.

(a): (No Nontrivial Riskless Portfolio) We say a portfolio x is riskless if

$⟨ S_{1} - R S_{0}, x ⟩ \geq 0 .$

We say the market has no nontrivial riskless portfolio if there does not exist a riskless portfolio x with $\hat{x} \neq \hat{0}$ .
(b): (No Arbitrage) We say x is an arbitrage if it is riskless and there exists some $ω \in Ω$ such that

$⟨ S_{1} (ω) - R S_{0}, x ⟩ \neq 0 .$

We say market $S_{t}$ has no arbitrage if there does not exist any arbitrage portfolio.
(c): (Nontrivial Bond Replicating Portfolio) We say that $x^{⊤} = (x_{0}, {\hat{x}}^{⊤})$ is a nontrivial bond replicating portfolio if $\hat{x} \neq \hat{0}$ and

$⟨ S_{1} - R S_{0}, x ⟩ = 0 .$

An arbitrage is a way to make return above the risk free rate without taking any risk of losing money. If such an opportunity exists, then investors will try to take advantage of it. In this process, they will bid up the price of the risky assets and cause the arbitrage opportunity to disappear. For this reason, usually people assume a financial market does not contain any arbitrage. A trivial riskless portfolio of investing everything in the riskless asset

S_{t}^{0}

always exists. A nontrivial riskless portfolio, however, is not to be expected and we will often use this assumption. It turns out that the difference between no nontrivial riskless portfolio and no arbitrage is exactly the existence of a nontrivial bond replicating portfolio. The three conditions in Definition 4 (a), (b) and (c) are related as follows:

Proposition 4.

Consider the financial market

S_{t}

of Definition 1. There is no nontrivial riskless portfolio in

S_{t}

if and only if

S_{t}

has no arbitrage portfolio and no nontrivial bond replicating portfolio. It follows that no nontrivial riskless portfolio implies no arbitrage portfolio.

Proof.

The conclusion follows directly from Definition 4. ☐

Assuming that the financial market has no arbitrage, then no nontrivial riskless portfolio is equivalent to no nontrivial bond replicating portfolio and has the following characterization.

Theorem 2.

(Characterization of no Nontrivial Bond Replicating Portfolio) Assuming the financial market

S_{t}

in Definition 1 has no arbitrage. Then, the following assertions are equivalent:

(i): There is no nontrivial bond replicating portfolio.
(ii): For every nontrivial portfolio x with $\hat{x} \neq \hat{0}$ , there exists some $ω \in Ω$ such that

$\begin{matrix} ⟨ S_{1} (ω) - R S_{0}, x ⟩ < 0 . \end{matrix}$

(10)
(ii*): For every risky portfolio $\hat{x} \neq \hat{0}$ , there exists some $ω \in Ω$ such that

$\begin{matrix} ⟨ {\hat{S}}_{1} (ω) - R {\hat{S}}_{0}, \hat{x} ⟩ < 0 . \end{matrix}$

(11)
(iii): The matrix

$\begin{matrix} G : = [\begin{matrix} S_{1}^{1} (ω_{1}) - R S_{0}^{1} & S_{1}^{2} (ω_{1}) - R S_{0}^{2} & \dots & S_{1}^{M} (ω_{1}) - R S_{0}^{M} \\ S_{1}^{1} (ω_{2}) - R S_{0}^{1} & S_{1}^{2} (ω_{2}) - R S_{0}^{2} & \dots & S_{1}^{M} (ω_{2}) - R S_{0}^{M} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ S_{1}^{1} (ω_{N}) - R S_{0}^{1} & S_{1}^{2} (ω_{N}) - R S_{0}^{2} & \dots & S_{1}^{M} (ω_{N}) - R S_{0}^{M} \end{matrix}] \in R^{N \times M} \end{matrix}$

(12)

has rank M, in particular $N \geq M$ .

Proof.

We use a cyclic proof. (i)→ (ii): If (ii) fails, then

⟨ S_{1} - R S_{0}, x ⟩ \geq 0

for some nontrivial x. By (i), x must be an arbitrage, which is a contradiction. (ii)→ (ii*): obvious. (ii*)→ (iii): If (iii) is not true, then

G \hat{x} = 0

has a nontrivial solution that is a contradiction to (11). (iii)→ (i): Assume that there exists a portfolio

x^{*}

with

{\hat{x}}^{*} \neq \hat{0}

, which replicates the bond. Then,

⟨ S_{1} - R S_{0}, x^{*} ⟩ = 0

. This implies that

⟨ {\hat{S}}_{1} - R {\hat{S}}_{0}, {\hat{x}}^{*} ⟩ = 0

so that

G {\hat{x}}^{*} = 0

, which contradicts (iii). ☐

A rather useful corollary of Theorem 2 is that any of the conditions (i)–(iii) of that theorem ensures the covariance matrix of the risky assets to be positive definite.

Corollary 1.

(Positive Definite Covariance Matrix) Assume the financial market

S_{t}

in Definition 1 has no nontrivial riskless portfolio. Then, the covariant matrix of the risky assets

\begin{matrix} Σ : = E [({\hat{S}}_{1} - E ({\hat{S}}_{1})) {({\hat{S}}_{1} - E ({\hat{S}}_{1}))}^{⊤}] = {(E [(S_{1}^{i} - E (S_{1}^{i})) (S_{1}^{j} - E (S_{1}^{j}))])}_{i, j = 1, \dots, M} \end{matrix}

(13)

is positive definite.

Proof.

We note that, under the assumption of the corollary, for any nontrivial risky portfolio

\hat{x}

,

{\hat{S}}_{1}^{⊤} \hat{x}

cannot be a constant. Otherwise,

⟨ {\hat{S}}_{1} - R {\hat{S}}_{0}, \hat{x} ⟩

would be a constant, which contradicts

S_{t}

has no nontrivial riskless portfolio. It follows that, for any nontrivial risky portfolio

\hat{x}

,

V a r ({\hat{S}}_{1}^{⊤} \hat{x}) = {\hat{x}}^{⊤} Σ \hat{x} > 0 .

Thus,

Σ

is positive definite. ☐

Corollary 1 shows that the standard deviation as a risk measure satisfies the properties (r1), (r1n), (r2) and (r3s) in Assumption 2.

3.2. Efficient Frontier for the Risk-Utility Trade-Off

We note that, to increase the utility, one often has to take on more risk and, as a result, the risk increases. The converse is also true. For example, if one allocates all the capital to the riskless bond, then there will be no risk, but the price to pay is that one has to forgo all the opportunities to get a high payoff on risky assets so as to reduce the expected utility. Thus, the investment decision of selecting an appropriate portfolio becomes one of trading-off between the portfolio’s expected return and risk. To understand such a trade-off, we define, for a set of admissible portfolios

A \subset R^{M + 1}

in Definition 2, the set

\begin{matrix} G (r, u; A) : = {(r, μ) : \exists x \in A s . t . r \geq r (x), μ \leq E [u (S_{1}^{⊤} x)]} \subset R^{2}, \end{matrix}

(14)

on the two-dimensional risk-expected utility space for a given risk measure

r

and utility u. Given a financial market

S_{t}

and a portfolio x, we often measure risk by observing

S_{1}^{⊤} x

. The following simple proposition is useful in linking such observations to the risk measure in Assumption 2.

Proposition 5.

(Induced Risk Measure) (a) Fixing a financial market

S_{t}

as in Definition 1. Suppose that

ρ : R V (Ω, 2^{Ω}, P) \to [0, + \infty)

is a lower semi-continuous, convex and positive homogeneous function. Moreover, assume that

ρ (S_{1}^{⊤} x) = ρ ({\hat{S}}_{1}^{⊤} \hat{x})

. Then,

r : A \to [0, + \infty)

,

r (x) : = ρ (S_{1}^{⊤} x)

is a lower semi-continuous risk measure satisfying properties (r1), (r2) and (r3) in Assumption 2.

The following are two sufficient conditions ensuring

ρ (S_{1}^{⊤} x) = ρ ({\hat{S}}_{1}^{⊤} \hat{x})

that are easy to verify:

(1): When ρ is invariant under adding constants, i.e., $ρ (X) = ρ (X + c)$ , for any $X \in R V (Ω, 2^{Ω}, P)$ and $c \in R$ . A useful example is when ρ is the standard deviation.
(2): When ρ is restricted to a set of admissible portfolios A with unit initial cost. In this case, we can see that

$\begin{matrix} \hat{r} (\hat{x}) : = ρ (R + {({\hat{S}}_{1} - R {\hat{S}}_{0})}^{⊤} \hat{x}) = ρ (S_{1}^{⊤} x) . \end{matrix}$

(15)

(b) If the financial market

S_{t}

has no nontrivial riskless portfolio and ρ is strictly convex, then, for a set A of admissible portfolios with unit initial cost,

\hat{r} : A \to [0, + \infty)

satisfies (r2s) in Assumption 2.

Similarly, we are interested in when the expected utility

x \mapsto E [u (S_{1}^{⊤} x)]

of

S_{1}^{⊤} x

is strictly concave in x. Below, we provide a set of sufficient conditions guaranteeing this. The easy proof is left to the reader.

Proposition 6.

(Strict Concavity of Expected Utility) Assume that

(a): the financial market $S_{t}$ has no nontrivial riskless portfolio,
(b): the utility function u satisfies condition (u2s) in Assumption 3, and
(c): A is a set of admissible portfolios with unit initial cost as in Definition 2.

Then, the expected utility

E [u (S_{1}^{⊤} x)]

as a function of the portfolio x is upper semi-continuous and strictly concave on A.

When

r (x) = ρ (S_{1}^{⊤} x)

is induced by

ρ

as in Proposition 5 we also use the notation

G (ρ, u, A)

. Clearly, if

A^{'} \subset A

then

G (r, u; A^{'}) \subset G (r, u; A)

. The following assumption will be needed in concrete applications.

Assumption A4.

(Compact Level Sets) Either (a) for each

μ \in R

,

{x \in R^{M + 1} : μ \leq E [u (S_{1}^{⊤} x)], x \in A}

is compact or (b) for each

r \in R

,

{x \in R^{M + 1} : r \geq r (x), x \in A}

is compact.

Proposition 7.

Assume that A is a set of admissible portfolios as in Definition 2. We claim: (a) Assume that the risk measure

r

satisfies (r2) in Assumption 2 and the utility function u satisfies (u2) in Assumption 3. Then, set

G (r, u; A)

is convex and

(r, μ) \in G (r, u; A)

implies that, for any

k > 0

,

(r + k, μ) \in G (r, u; A)

and

(r, μ - k) \in G (r, u; A)

. (b) Assume furthermore that Assumption 4 holds. Then,

G (r, u; A)

is closed.

Proof.

(a) The property

(r, μ) \in G (r, u; A)

implies that, for any

k > 0

,

(r + k, μ) \in G (r, u; A)

and

(r, μ - k) \in G (r, u; A)

follows directly from the definition of

G (r, u; A)

.

Suppose that

(r_{1}, μ_{1}), (r_{2}, μ_{2}) \in G (r, u; A)

and

s \in [0, 1]

. Then, there exists

x^{1}, x^{2} \in A

such that

r_{i} \geq r (x^{i}) and μ_{i} \leq E [u (S_{1}^{⊤} x^{i})], i = 1, 2 .

Then, convexity of

r

in x yields

s r_{1} + (1 - s) r_{2} \geq s r (x^{1}) + (1 - s) r (x^{2}) \geq r (s x^{1} + (1 - s) x^{2}),

and (u2) gives

s μ_{1} + (1 - s) μ_{2} \leq s E [u (S_{1}^{⊤} x^{1})] + (1 - s) E [u (S_{1}^{⊤} x^{2})] \leq E [u (S_{1}^{⊤} (s x^{1} + (1 - s) x^{2}))] .

Thus,

s (r_{1}, μ_{1}) + (1 - s) (r_{2}, μ_{2}) \in G (r, u; A)

so that

G (r, u; A)

is convex.

(b) Suppose that

(r_{n}, μ_{n}) \to (r, μ)

, for a sequence in

G (r, u; A)

. Then, there exists a sequence

x^{n} \in A

such that

\begin{matrix} r_{n} \geq r (x^{n}) and μ_{n} \leq E [u (S_{1}^{⊤} x^{n})] . \end{matrix}

(16)

By Assumption 4, a subsequence of

x^{n}

(denoted again by

x^{n}

) converges to, say,

\bar{x} \in A

. Taking limits in (16), by the upper semicontinuity of u, we arrive at

\begin{matrix} r \geq r (\bar{x}) and μ \leq E [u (S_{1}^{⊤} \bar{x})] . \end{matrix}

(17)

Thus,

(r, μ) \in G (r, u; A)

and hence

G (r, u; A)

is a closed set. ☐

Now, we can represent a portfolio

x \in A \subset R^{M + 1}

as a point

(r (x), E [u (S_{1}^{⊤} x)]) \in G (r, u; A)

in the two-dimensional risk-expected utility space. Investors prefer portfolios with lower risk if the expected utility is the same or with higher expected utility given the same level of risk.

Definition 5.

(Efficient Portfolio and Frontier) We say that a portfolio

x \in A

is efficient provided that there does not exist any portfolio

x^{'} \in A

such that either

r (x^{'}) \leq r (x) and E [u (S_{1}^{⊤} x^{'})] > E [u (S_{1}^{⊤} x)]

or

r (x^{'}) < r (x) and E [u (S_{1}^{⊤} x^{'})] \geq E [u (S_{1}^{⊤} x)] .

We call the set of images of all efficient portfolios in the two-dimensional risk-expected utility space the efficient frontier and denote it by

G_{e f f} (r, u; A)

.

The next theorem characterizes efficient portfolios in the risk-expected utility space.

Theorem 3.

(Efficient Frontier) Efficient portfolios represented in the two-dimensional risk-expected utility space are all located in the (non vertical or horizontal) boundary of the set

G (r, u; A)

. Moreover, consider admissible portfolios

A, B

. If

B \subset A,

then

\begin{matrix} G_{e f f} (r, u; A) \cap G (r, u; B) \subset G_{e f f} (r, u; B) . \end{matrix}

(18)

Proof.

If a portfolio x represented in the risk-expected utility space as

(r, μ)

is not on the (non vertical or horizontal) boundary of the

G (r, u; A)

, then, for

ε

small enough, we have either

(r - ε, μ) \in G (r, u; A)

or

(r, μ + ε) \in G (r, u; A)

. This means x can be improved. The inclusion (18) directly follows from

G (r, u; B) \subset G (r, u; A)

. ☐

Remark 5.

(Empty Efficient Frontier) If

(α, \hat{0}) \in A

for all

α \in R

and the increasing utility function u has no upper bound, then for any risk measure

r

satisfying (r1) and (r1n) in Assumption 2,

{0} \times R \subset G (r, u; A)

. By Proposition 7

[0, + \infty) \times R \subset G (r, u; A),

which implies that

G_{e f f} (r, u; A) = \emptyset

. Thus, practically meaningful

G (r, u; A)

always correspond to sets of admissible portfolios A such that the initial cost

S_{0} \cdot x

for all

x \in A

is limited. Moreover, if the initial cost has a range and riskless bonds are included in the portfolio, then we will see a vertical line segment on the μ axis and the efficient portfolio corresponds to the upper bound of this vertical line segments. Thus, it suffices to consider sets of portfolios A with unit initial cost.

3.3. Representation of Efficient Frontier

In view of Remark 5, in this section, we will consider a set of admissible portfolios A with unit initial cost as in Definition 2. By Proposition 7, we can view the set

G (r, u; A)

as an epigraph on the expected utility-risk space or a hypograph on the risk-expected utility space. By Propositions 1 and 2, the set

G (r, u; A)

naturally defines two functions

γ : R \to R \cup {+ \infty}

and

ν : R \to R \cup {- \infty}

:

\begin{matrix} μ \mapsto γ (μ) : = inf {r : (r, μ) \in G (r, u; A)} = inf {r (x) : E [u (S_{1}^{⊤} x)] \geq μ, x \in A} \geq 0, \end{matrix}

(19)

and

\begin{matrix} r \mapsto ν (r) : = sup {μ : (r, μ) \in G (r, u; A)} = sup {E [u (S_{1}^{⊤} x)] : r (x) \leq r, x \in A}, \end{matrix}

(20)

where we assume Assumption 4 to ensure

ν

is well defined, i.e.,

ν (r) < \infty

for all

r \in R

.

Proposition 8.

(Function Related to the Efficient Frontier) Assume that the risk measure

r

satisfies (r2) in Assumption 2 and the utility function u satisfies (u2) in Assumption 3. Furthermore, assume that Assumption 4 holds for a set of admissible portfolios A with unit initial cost. Then, the functions

μ \mapsto γ (μ)

and

r \mapsto ν (r)

are increasing lower semi-continuous convex and increasing upper semi-continuous concave, respectively. Moreover, for any

(r_{0}, μ_{0}) \in G_{e f f} (r, u; A)

,

(- \infty, μ_{0}] \subset dom (γ) : = {μ \in R : γ (μ) < \infty}

and

[r_{0}, \infty) \subset dom (ν) : = {r \in R : ν (r) > - \infty}

.

Proof.

The increasing property of

γ

and

ν

follows directly from the second representation in (19) and (20), respectively.

The properties for the domains of

γ

and

ν

follow directly from Proposition 7.

The other properties of

γ

and

ν

follow directly from Propositions 1 and 2 since

G (r, u; A)

is closed and convex according to Proposition 7.

Alternatively, we can also directly apply Proposition 3 to the second representation in (19) and (20) to derive the convexity and concavity of

γ

and

ν

, respectively. ☐

To describe a representation of the efficient frontier in the next theorem, we will use the exchange operator

\hat{P} : R^{2} \to R^{2}

defined by

\hat{P} (x_{1}, x_{2}) = (x_{2}, x_{1})

.

Theorem 4.

(Representation of the Efficient Frontier) Assume that the risk measure

r

satisfies (r2) in Assumption 2 and the utility function u satisfies (u2) in Assumption 3. Furthermore, assume that Assumption 4 holds for a set of admissible portfolios A with unit initial cost. Then, the efficient frontier has the following representation

\begin{matrix} G_{e f f} (r, u; A) = \hat{P} [graph (γ)] \cap graph (ν) \end{matrix}

(21)

or equivalently

\begin{matrix} G_{e f f} (r, u; A) = {(γ (μ), μ) : μ \in dom (γ) \subset R} \cap {(r, ν (r)) : r \in dom (ν) \subset R} . \end{matrix}

(22)

More specifically, setting

\begin{matrix} I : = dom (ν) \cap range (γ) = {r \in R : \exists μ with (r, μ) \in G_{e f f} (r, u; A)} \end{matrix}

(23)

and

\begin{matrix} J : = dom (γ) \cap range (ν) = {μ \in R : \exists r with (r, μ) \in G_{e f f} (r, u; A)}, \end{matrix}

(24)

we find that I and J are intervals and the representation

\begin{matrix} G_{e f f} (r, u; A) = \hat{P} [graph (γ ∣_{J})] = graph (ν ∣_{I}) \end{matrix}

(25)

holds, where

γ : J \to R

and

ν : I \to R

are continuous. Moreover,

γ : J \to I

and

ν : I \to J

are strictly increasing, bijective and inverse to each other, i.e.,

\begin{matrix} γ \circ ν (r) = r \forall r \in I and ν \circ γ (μ) = μ \forall μ \in J . \end{matrix}

(26)

Proof.

First, we show that the right-hand side of (21) is a subset of the left-hand side. Let

(r_{0}, μ_{0}) \in \hat{P} [graph (γ)] \cap graph (ν)

. Since

\hat{P} [graph (γ)] : = {(γ (μ), μ) : μ \in R}

and

graph (ν) = {(r, ν (r)) : r \in R}

necessarily

(r_{0}, μ_{0}) \in R^{2}

. Note that, in particular, (22) holds. Using

(r_{0}, μ_{0}) \in graph (ν)

, we get from (20)

\begin{matrix} μ_{0} = ν (r_{0}) = sup {E [u (S_{1}^{⊤} x)] : r (x) \leq r_{0}, x \in A} . \end{matrix}

(27)

Similarly, from (19)

\begin{matrix} r_{0} = γ (μ_{0}) = inf {r (x) : E [u (S_{1}^{⊤} x)] \geq μ_{0}, x \in A} . \end{matrix}

(28)

With (27), we can select a sequence

x_{n} \in A

such that

r (x_{n}) \leq r_{0}

and

E [u (S_{1}^{⊤} x_{n})] ↗ μ_{0}

. By Assumption 4, either

{x \in A : r (x) \leq r_{0}}

or

{x \in A : E [u (S_{1}^{⊤} x)] \geq μ_{0} - 1}

is compact. Hence, without loss of generality, we may assume that

x_{n} \to x^{*} \in A

with

r (x^{*}) \leq r_{0}

and

E [u (S_{1}^{⊤} x^{*})] \geq μ_{0}

by the upper semicontinuity of

x \mapsto E [u (S_{1}^{⊤} x)]

. Note that

r (x^{*}) < r_{0}

would contradict (28). Thus,

r (x^{*}) = r_{0}

, so that

(r_{0}, μ_{0}) \in G (r, u; A)

. Now, consider

(r_{1}, μ_{1}) \in G (r, u; A)

. If

μ_{1} > μ_{0}

and

r_{1} \leq r_{0}

, then

ν (r_{1}) : = sup {μ : (r_{1}, μ) \in G (r, u; A)} \geq μ_{1} > μ_{0} = ν (r_{0})

contradicting that

ν

is increasing. On the other hand, if

r_{1} < r_{0}

and

μ_{1} \geq μ_{0}

, then

γ (μ_{1}) : = inf {r : (r, μ_{1}) \in G (r, u; A)} \leq r_{1} < r_{0} = γ (μ_{0}),

contradicting the increasing property of

γ

. Thus,

(r_{0}, μ_{0}) \in G_{e f f} (r, u; A)

.

To conclude (21), it remains to show that the left-hand side of (21) is a subset of the right-hand side. Let

(r_{0}, μ_{0}) \in G_{e f f} (r, u; A) \subset G (r, u; A) \subset R^{2}

. Then, there exists some efficient

x^{*} \in A

with

r_{0} = r (x^{*})

and

μ_{0} = E [u (S_{1}^{⊤} x^{*})]

. This means both the supremum in (27) and the infimum in (28) are attained at

x^{*}

so that

r_{0} = γ (μ_{0})

and

μ_{0} = ν (r_{0})

. It follows that

(r_{0}, μ_{0}) \in \hat{P} [graph (γ)] \cap graph (ν) .

Since, by Proposition 8,

ν

and

γ

are convex and concave functions, respectively, they are continuous in the interior of its domain. When

G_{e f f} (r, u; A)

is not a single point, it is therefore a continuous curve except for the possible finite endpoints. By Proposition 8, if

G_{e f f} (r, u; A)

contains

(r, μ),

then

(- \infty, μ] \subset dom (γ)

and

[r, \infty) \subset dom (ν)

. Thus, if

G_{e f f} (r, u; A)

has a finite left endpoint, we can represent it in the form

(γ (μ_{e}), μ_{e})

where

μ_{e}

is in the interior of

dom (γ)

. Thus, for any

μ \to μ_{e} +

,

(γ (μ), μ) \to (γ (μ_{e}), μ_{e})

so that

G_{e f f} (r, u; A)

is right continuous. Similarly, if

G_{e f f} (r, u; A)

has a finite right endpoint, then it is left continuous at this endpoint. Finally, representation (22) implies that the projection of

G_{e f f} (r, u; A)

onto the r and

μ

axises are intervals I and J, respectively, giving (23) and (24). Moreover, the representations in (25) follow immediately. Furthermore, since

G_{e f f} (r, u; A)

contains no vertical or horizontal lines (see Theorem 3),

γ : J \to I

and

ν : I \to J

are strictly increasing. Thus, both are injective, and surjectivity follows from (23) and (24). Finally, (26) follows from (22). ☐

3.4. Efficient Portfolios

We now turn to analyze how the corresponding efficient portfolios behave. Ideally, we would want that each point on the efficient trade-off frontier corresponds to exactly one portfolio. For this purpose, we need additional assumptions on risk measures and utility functions.

Theorem 5.

(Efficient Portfolio Path) Consider a financial market

S_{t}

as defined in Definition 1 and assume that A is a set of admissible portfolios with unit initial cost as in Definition 2. We also assume Assumption 4 holds and

(c0): there exists some $\bar{x} \in A$ with $\bar{μ} : = E [u (S_{1}^{⊤} \bar{x})]$ and $\bar{r} : = r (\bar{x})$ finite.

In addition, suppose that one of the following conditions holds:

(c1): The risk measure $r$ satisfies conditions (r1) and (r2s) in Assumption 2 and the utility function satisfies conditions (u1) and (u2) in Assumption 3.
(c2): The risk measure $r$ satisfies conditions (r1) and (r2) in Assumption 2 and the utility function satisfies conditions (u1) and (u2s) in Assumption 3.
(c3): The risk measure $r$ satisfies conditions (r1), (r1n) and (r3s) in Assumption 2 and the utility function satisfies conditions (u1) and (u2) in Assumption 3.

Then, each point

(r, μ) \in G_{e f f} (r, u; A)

corresponds to a unique efficient portfolio

x (r, μ) \in A

and the mapping

(r, μ) \mapsto x (r, μ)

is continuous on

G_{e f f} (r, u; A)

(onesided continuous at the finite endpoint(s)). Moreover, efficient portfolios have the continuous representation

r \mapsto x (r, ν (r))

and

μ \mapsto x (γ (μ), μ)

on intervals I defined in (23) and J defined in (24), respectively.

Proof.

Note that Assumption 4 and condition (c0) ensures that

G_{e f f} (r, u; A)

is nonempty.

We first show the uniqueness of the efficient portfolio. Suppose that portfolios

x^{1} \neq x^{2}

both correspond to

(r, μ) \in G_{e f f} (r, u; A)

. We consider only the case when (c1) is satisfied (and the case when (c2) or (c3) is satisfied can be argued in a similar way). Then, by (r1) and (21), we must have

r = \hat{r} ({\hat{x}}^{1}) = \hat{r} ({\hat{x}}^{2}) = r (x^{1}) = r (x^{2}) = γ (μ)

and

E [u (S_{1}^{⊤} x^{i})] = μ, x^{i} \in A, i = 1, 2

. Note that because A has unit initial cost,

{\hat{x}}^{1} \neq {\hat{x}}^{2}

. Since A is convex,

x^{*} = (x^{1} + x^{2}) / 2 \in A

. Conditions (r2s) and (u2) imply that

E [u (S_{1}^{⊤} x^{*})] \geq μ

and due to the strict convexity of

\hat{r}

by (r1),

r (x^{*}) = \hat{r} ({\hat{x}}^{*}) < γ (μ)

, a contradiction. Thus, the efficient portfolio corresponding to

(r, μ) \in G_{e f f} (r, u; A)

is unique and we denote it by

x (r, μ)

. The mapping

(r, μ) \to x (r, μ)

is well defined.

Next, we show the continuity of the mapping

(r, μ) \to x (r, μ)

. If

G_{e f f} (r, u; A)

is a single point, there is nothing to prove. When

G_{e f f} (r, u; A)

is not a single point by Theorem 4, we can represent all the efficient portfolios either as the image of the mapping

r \mapsto x (r, ν (r))

on I or as the image of the mapping

μ \mapsto x (γ (μ), μ)

on J. Suppose that

x (r, μ)

is discontinuous at

(\bar{r}, \bar{μ}) \in G_{e f f} (r, u; A)

. We first focus on the case when Assumption 4 (a) holds. Then, for a fixed positive number

ε_{0} > 0

, there exist sequences

μ_{n} \to \bar{μ}

(

μ_{n} ↗ \bar{μ}

if

\bar{μ} = max (J)

or

μ_{n} ↘ \bar{μ}

if

\bar{μ} = min (J)

) and such that

∥ x (γ (μ_{n}), μ_{n}) - x (γ (\bar{μ}), \bar{μ}) ∥ \geq ε_{0}

where

\begin{matrix} E [u (S_{1}^{⊤} x (γ (μ_{n}), μ_{n}))] \geq μ_{n} and r (x (γ (μ_{n}), μ_{n})) = \hat{r} (\hat{x} (γ (μ_{n}), μ_{n})) = γ (μ_{n}) . \end{matrix}

(29)

By Assumption 4 (a), we may assume without loss of generality that

x (γ (μ_{n}), μ_{n})

converges to some portfolio

x^{*}

with

∥ x^{*} - x (γ (\bar{μ}), \bar{μ}) ∥ \geq ε_{0}

. Furthermore, by Proposition 8,

μ \mapsto γ (μ)

is concave, and by Theorem 4 continuous on J. Taking limits in (29) and using the upper semicontinuity of

x \mapsto E [u (S_{1}^{⊤} x)]

yields

\begin{matrix} E [u (S_{1}^{⊤} x^{*})] \geq \bar{μ} and \hat{r} ({\hat{x}}^{*}) = γ (\bar{μ}) = \bar{r} . \end{matrix}

(30)

However, the uniqueness of the efficient portfolio (30) implies that

x^{*} = x (γ (\bar{μ}), \bar{μ})

, which is a contradiction. If Assumption 4 (b) holds, we can use the mapping

r \mapsto x (r, ν (r))

on the interval I to obtain a similar contradiction. ☐

Remark 6.

Interval

I = dom (ν) \cap range (γ)

is always bounded from below by 0 because the risk measure is always none negative, other than that, both

I = dom (ν) \cap range (γ)

and

J = dom (γ) \cap range (ν)

can be open, closed, half open and half closed. They can be finite or infinite. Although various situations are possible, we do have a precise characterization of their endpoints in the next proposition.

Proposition 9.

Under the conditions of Theorem 5, define

r_{min} : = inf [dom (ν) \cap range (γ)] = inf I,

r_{max} : = sup [dom (ν) \cap range (γ)] = sup I,

μ_{min} : = inf [dom (γ) \cap range (ν)] = inf J,

and

μ_{max} : = sup [dom (γ) \cap range (ν)] = sup J .

Then,

\begin{matrix} r_{min} = inf {r (x) : E [u (S_{1}^{⊤} x)] > - \infty, x \in A} \geq 0, \end{matrix}

(31)

\begin{matrix} μ_{max} = sup {E [u (S_{1}^{⊤} x)], x \in A} > - \infty, \end{matrix}

(32)

\begin{matrix} μ_{min} = lim_{r ↘ r_{min}} sup {E [u (S_{1}^{⊤} x)] : r (x) \leq r, x \in A} \leq μ_{max}, \end{matrix}

(33)

and

\begin{matrix} r_{max} = lim_{μ ↗ μ_{max}} inf {r (x) : E [u (S_{1}^{⊤} x)] \geq μ, x \in A} \geq r_{min} . \end{matrix}

(34)

Proof.

We start with (31). Let

\bar{r} : = inf {r (x) : E [u (S_{1}^{⊤} x)] > - \infty, x \in A}

. It is clear that, for any

μ

,

\bar{r} \leq γ (μ)

so that

\bar{r}

is a lower bound for

I = dom (ν) \cap range (γ)

, i.e.,

\bar{r} \leq r_{min}

. For any

r > \bar{r}

, there exist some finite

μ

such that

\begin{matrix} S (μ, r) : = {x \in A : E [u (S_{1}^{⊤} x)] \geq μ > - \infty and r (x) \leq r} \neq \emptyset . \end{matrix}

(35)

By Assumption 4,

S (μ, r)

is compact. Thus,

γ (μ) \in [\bar{r}, r]

is attained by some

x^{*} \in A

with

E [u (S_{1}^{⊤} x^{*})] \geq μ

. It follows that

S (μ, γ (μ))

defined in (35) is nonempty and, therefore, compact by Assumption 4. Thus,

ν (γ (μ)) > - \infty

implying

γ (μ) \in dom (ν) \cap range (γ) = I

and hence

γ (μ) \geq r_{min}

. However, since

r > \bar{r}

was arbitrary,

γ (μ)

can be chosen close to

\bar{r}

implying

\bar{r} \geq r_{min}

and in conclusion

\bar{r} = r_{min}

.

Note that, since

r (x)

is always finite, we have

sup {E [u (S_{1}^{⊤} x)], x \in A} = sup {E [u (S_{1}^{⊤} x)], r (x) < \infty, x \in A} .

Thus, the proof of (32) is parallel to that of (31). Having determined

r_{min}

and

μ_{max}

, we have

r_{max} = {lim}_{μ ↗ μ_{max}} γ (μ)

and

μ_{min} = {lim}_{r ↘ r_{min}} ν (r)

. Hence, representations (33) and (34) directly follow from the definitions of

ν

and

γ

, respectively. ☐

Corollary 2.

Under the conditions of Theorem 5, we have

(a)

r_{min} \in I

if and only if

μ_{min} \in J

, and

r_{max} \in I

if and only if

μ_{max} \in J

.

(b)

If

r_{min} \in I

then

μ_{min} = ν (r_{min})

and

γ (μ_{min}) = r_{min}

.

(c)

If

μ_{max} \in J

then

r_{max} = γ (μ_{max})

and

ν (r_{max}) = μ_{max}

.

(d)

(i): If $r_{min} \in I$ and $μ_{max} \in J$ then $I = [r_{min}, r_{max}]$ and $J = [μ_{min}, μ_{max}]$ .
(ii): If $r_{min} \notin I$ and $μ_{max} \in J$ then $I = (r_{min}, r_{max}]$ and $J = (- \infty, μ_{max}]$ .
(iii): If $r_{min} \in I$ and $μ_{max} \notin J$ then $I = [r_{min}, \infty)$ and $J = [μ_{min}, μ_{max})$ .
(iv): If $r_{min} \notin I$ and $μ_{max} \notin J$ then $I = (r_{min}, \infty)$ and $J = (- \infty, μ_{max})$ .

Proof.

Let

r_{min} \in I \subset dom (ν)

. Then,

r_{min} = γ (\bar{μ})

for some

\bar{μ} \in J

by Theorem 4. Since

γ

is an increasing function, we have

\bar{μ} = min J

. Hence,

\bar{μ} = μ_{min}

and

r_{min} = γ (μ_{min})

. Then,

ν (r_{min}) = \bar{μ} = μ_{min}

follows since

γ \circ ν = i d

is the identity mapping on I. The converse and the case for max can be proved analogously. This proves (a), (b) and (c). Moreover, (d)(i) directly follows from (b) and (c).

If

r_{min} \notin I

, we show

μ_{min} = - \infty

. In fact, if

μ_{min} > - \infty

, then, for any natural number n, we can select

x^{n} \in A

such that

r (x^{n}) \leq r_{min} + 1 / n

and

E [u (S_{1}^{⊤} x^{n})] \geq μ_{min}

. By Assumption 4, we may assume without loss of generality that

x^{n} \to x^{*} \in A

. Taking limits as

n \to \infty

, we conclude that

r (x^{*}) \leq r_{min}

and

E [u (S_{1}^{⊤} x^{*})] \geq μ_{min}

and both have to be equality. Thus,

(r_{min}, μ_{min}) \in G_{e f f} (r, u; A)

, a contradiction. This shows (d)(ii).

Analogously, one gets that

μ_{max} \notin J

implies

r_{max} = \infty

, which shows (d)(iii) and (d)(iv). ☐

Remark 7.

Several interesting cases when

G_{e f f} (r, u; A)

has finite endpoints are discussed below:

(a) The quantity

r_{min}

is always finite and

μ_{min}

may be finite as well as illustrated in Figure 1. However,

μ_{min}

may also be

- \infty

, as Example 1 shows. A typical efficient frontier corresponding to this case is illustrated in Figure 2.

Figure 1. Efficient frontier with both

r_{min}

and

μ_{min}

are finite and attained.

Figure 2. Efficient frontier with

μ_{min} = - \infty

.

(b) Suppose

μ_{max}

is finite and attained at an efficient portfolio

x (γ (μ_{max}), μ_{max})

. Under the conditions of Theorem 5, the portfolio

κ : = x (γ (μ_{max}), μ_{max})

is unique and independent of the risk measure. A graphic illustration is given in Figure 3.

Figure 3. Efficient frontier when

r_{min} > 0

and

μ_{max}

is finite and attained as maximum.

(c) Trade-off between utility and risk is thus implemented by portfolios

x (γ (μ), μ)

that trace out a curve in the so-called leverage space introduced by Vince (2009). Note that the curve

x (γ (μ), μ)

depends on the risk measure

r

as well as the utility function u. This provides a method for systematically selecting portfolios in the leverage space to reduce risk exposure.

(d) If, in addition,

r

satisfies (r1n) in Assumption 2 and

u (R) > - \infty

then

r_{min} = 0

,

μ_{min} = u (R)

and

x (r_{min}, μ_{min}) = {(1, {\hat{0}}^{⊤})}^{⊤}

(see Figure 4).

Figure 4. Efficient frontier with

{(1, {\hat{0}}^{⊤})}^{⊤} \in A

.

(e) Unlike in (b),

μ_{max}

finite can also happen when the efficient frontier is unbounded (see Example 2).

Example 1.

(for

μ_{min} = - \infty

) Consider a portfolio problem with the log utility on a financial market that contains no bond and two risky assests (i.e.,

M = 2

)

\begin{matrix} ν (r) : = sup_{\hat{x} \in R^{2}} {E [ln ({\hat{S}}_{1}^{⊤} \hat{x})] : \hat{r} (\hat{x}) \leq r, {\hat{S}}_{0}^{⊤} \hat{x} = 1} . \end{matrix}

(36)

The financial market

{\hat{S}}_{t} = {(S_{t}^{1}, S_{t}^{2})}^{⊤}

(since the riskless asset is not involved in (36), it is irrelevant to the problem) is specified as follows:

{\hat{S}}_{0} = {[1, 1]}^{⊤}

,

{\hat{S}}_{1}

is a random vector on the sample space

Ω = {ω_{1}, ω_{2}, ω_{3}}

with

P (ω_{1}) = P (ω_{2}) = P (ω_{3}) = 1 / 3

and a payoff matrix

\begin{matrix} [{\hat{S}}_{1} (ω_{1}), {\hat{S}}_{1} (ω_{2}), {\hat{S}}_{1} (ω_{3})] = [\begin{matrix} 1 & 3 & 0.5 \\ 0.5 & 0.8 & 1.2 \end{matrix}] . \end{matrix}

(37)

Note that, for instance with

R = 1

, this market has no nontrivial riskless portfolio. We use the risk measure

\begin{matrix} \hat{r} (\hat{x}) : = \sqrt{{(x_{1} - 2 x_{2})}^{2} + 100 {(2 x_{1} + x_{2})}^{2}}, \end{matrix}

(38)

which satisfies (r1), (r1n) and (r3s) and, therefore, Assumption 4(b) holds. Clearly,

ν (\hat{r} ({[1, 0]}^{⊤})) > 0

and finite. Notice that on the feasible set

{\hat{S}}_{0}^{⊤} \hat{x} = 1

, i.e.,

x_{2} = 1 - x_{1}

. It follows that the risk measure

\hat{r} (\hat{x}) : = \sqrt{{(3 x_{1} - 2)}^{2} + 100 {(x_{1} + 1)}^{2}}

attains a minimum

r_{m} = \frac{50}{109} \sqrt{109}

at

{\hat{x}}_{m} = {(- 94 / 109, 203 / 109)}^{⊤}

. Observing

{\hat{S}}_{1}^{⊤} (ω_{2}) {\hat{x}}_{m} < 0

, we must have

r_{min} > r_{m}

and

μ_{min} = lim_{r ↘ r_{min}} ν (r) = - \infty .

Example 2.

(for

μ_{max} < \infty

and

r_{max} = \infty

) Consider the same risk measure as in the previous example, but use instead the utility function

u (t) = 1 - e^{- t}

. We analyze

\begin{matrix} ν (r) : = sup_{\hat{x} \in R^{2}} {E [u ({\hat{S}}_{1}^{⊤} \hat{x})] : \hat{r} (\hat{x}) \leq r, {\hat{S}}_{0}^{⊤} \hat{x} = 1}, \end{matrix}

(39)

where the financial market is defined by

\begin{matrix} [{\hat{S}}_{1} (ω_{1}), {\hat{S}}_{1} (ω_{2}), {\hat{S}}_{1} (ω_{3})] = [\begin{matrix} 1 & 3.6 & 0.5 \\ 0.5 & 1.2 & 0.3 \end{matrix}] \end{matrix}

(40)

on the sample space

Ω = {ω_{1}, ω_{2}, ω_{3}}

with

P (ω_{1}) = P (ω_{2}) = P (ω_{3}) = 1 / 3

. Again, on the feasible set

{\hat{S}}_{0}^{⊤} \hat{x} = 1

, i.e.,

x_{2} = 1 - x_{1}

. The portfolio as a function of

x_{1}

implies

{({\hat{S}}_{1}^{⊤} (ω_{i}) {[x_{1}, 1 - x_{1}]}^{⊤})}_{i = 1, 2, 3} = [0.5, 1.2, 0.3] + x_{1} [0.5, 2.4, 0.2] .

As

x_{1} \to \infty

, we can see that

\hat{r} (\hat{x}) \to \infty

and

E [u ({\hat{S}}_{1}^{⊤} \hat{x})] \to 1

. Hence,

r_{max} = \infty

and

μ_{max} = 1 < \infty

. Notice that (40) with, e.g.,

R = 1

, has an arbitrage portfolio

{\hat{x}}^{*} = {(1, - 1)}^{⊤}

, but the existence of an arbitrage seems to be necessary in constructing such an example.

4. Markowitz Portfolio Theory and CAPM Model

Let us now turn to applications of the general theory. We show that the results in the previous section provide a general unified framework for several familiar portfolio theories. They are Markowitz portfolio theory, the CAPM model, growth optimal portfolio theory and leverage space portfolio theory. Of course, when dealing with concrete risk measures and expected utilities related to these concrete theories, an additional helpful structure in the solutions often emerge. Although many different expositions of these theories do already exist in the literature, for the convenience of readers, we include brief arguments using Lagrange multiplier methods. In this entire section, we will assume that the market

S_{t}

from Definition 1 has no nontrivial riskless portfolio.

4.1. Markowitz Portfolio Theory

Markowitz portfolio theory that considers only risky assets (see Markowitz (1959)), can be understood as a special case of the framework discussed in Section 3. The risk measure is the standard deviation

σ

and the utility function is the identity function. Thus, we face the problem

\begin{matrix} min & σ ({\hat{S}}_{1}^{⊤} \hat{x}) \\ Subject to & E [{\hat{S}}_{1}^{⊤} \hat{x}] \geq μ, {\hat{S}}_{0}^{⊤} \hat{x} = 1 . \end{matrix}

(41)

We assume

E [{\hat{S}}_{1}]

is not proportional to

{\hat{S}}_{0}

, that is, for any

α \in R

,

\begin{matrix} E [{\hat{S}}_{1}] \neq α {\hat{S}}_{0} . \end{matrix}

(42)

Since the variance is a monotone increasing function of the standard deviation, we can minimize half of the variance for convenience:

\begin{matrix} min_{\hat{x} \in R^{M}} & \hat{r} (\hat{x}) : = \frac{1}{2} Var ({\hat{S}}_{1}^{⊤} \hat{x}) = \frac{1}{2} σ^{2} ({\hat{S}}_{1}^{⊤} \hat{x}) = \frac{1}{2} {\hat{x}}^{⊤} Σ \hat{x} \\ Subject to & E [{\hat{S}}_{1}^{⊤} \hat{x}] \geq μ, {\hat{S}}_{0}^{⊤} \hat{x} = 1 . \end{matrix}

(43)

Optimization problem (43) is already in the form (19) with

A = {x \in R^{M + 1} : S_{0}^{⊤} x = 1, x_{0} = 0}

. We can check if condition (c1) in Theorem 5 is satisfied. Moreover, Corollary 1 implies that

Σ

is positive definite since

S_{t}

has no nontrivial riskless portfolio. Hence, the risk function

\hat{r}

has compact level sets. Thus, Assumption 4 is satisfied and Theorem 5 is applicable. Let

\hat{x} (μ)

be the optimal portfolio corresponding to

μ

. Consider the Lagrangian

\begin{matrix} L (\hat{x}, λ) : = \frac{1}{2} {\hat{x}}^{⊤} Σ \hat{x} + λ_{1} (μ - {\hat{x}}^{⊤} E [{\hat{S}}_{1}]) + λ_{2} (1 - {\hat{x}}^{⊤} {\hat{S}}_{0}), \end{matrix}

(44)

where

λ_{1} \geq 0

. Thanks to Theorem 1, we have

\begin{matrix} 0 = \nabla_{\hat{x}} L = Σ \hat{x} (μ) - (λ_{1} E [{\hat{S}}_{1}] + λ_{2} {\hat{S}}_{0}) . \end{matrix}

(45)

In other words,

\begin{matrix} \hat{x} (μ) = Σ^{- 1} (λ_{1} E [{\hat{S}}_{1}] + λ_{2} {\hat{S}}_{0}) . \end{matrix}

(46)

We must have

λ_{1} > 0

because otherwise

\hat{x} (μ)

would be unrelated to the payoff

{\hat{S}}_{1}

. The complementary slackness condition implies that

E [{\hat{S}}_{1}^{⊤} \hat{x} (μ)] = μ

. Left multiplying (45) by

{\hat{x}}^{⊤} (μ),

we have

\begin{matrix} σ^{2} (μ) = λ_{1} μ + λ_{2} . \end{matrix}

(47)

To determine the Lagrange multipliers, we need the numbers

α = E {[{\hat{S}}_{1}]}^{⊤} Σ^{- 1} E [{\hat{S}}_{1}]

,

β = E {[{\hat{S}}_{1}]}^{⊤} Σ^{- 1} {\hat{S}}_{0}

and

γ = {\hat{S}}_{0}^{⊤} Σ^{- 1} {\hat{S}}_{0}

. Left multiplying (46) by

E {[{\hat{S}}_{1}]}^{⊤}

and

{\hat{S}}_{0}^{⊤}

, we have

\begin{matrix} μ = λ_{1} α + λ_{2} β \end{matrix}

(48)

and

\begin{matrix} 1 = λ_{1} β + λ_{2} γ . \end{matrix}

(49)

Solving (48) and (49), we derive

\begin{matrix} λ_{1} = \frac{γ μ - β}{α γ - β^{2}} and λ_{2} = \frac{α - β μ}{α γ - β^{2}}, \end{matrix}

(50)

where

\begin{matrix} α γ - β^{2} = \det ([E [{\hat{S}}_{1}^{⊤}], {\hat{S}}_{0}^{⊤}] Σ^{- 1} [\begin{matrix} E [{\hat{S}}_{1}] \\ {\hat{S}}_{0} \end{matrix}]) > 0, \end{matrix}

(51)

since

Σ^{- 1}

is positive definite and condition (42) holds. Substituting (50) into (47), we see that the efficient frontier is determined by the curve

\begin{matrix} σ (μ) = \sqrt{\frac{γ μ^{2} - 2 β μ + α}{α γ - β^{2}}} = \sqrt{\frac{γ}{α γ - β^{2}} {(μ - \frac{β}{γ})}^{2} + \frac{1}{γ}} \geq \frac{1}{\sqrt{γ}}, \end{matrix}

(52)

usually referred to as the Markowitz bullet due to its shape. A typical Markowitz bullet is shown in Figure 5 with an asymptote

\begin{matrix} μ = \frac{β}{γ} + σ (μ) \sqrt{\frac{α γ - β^{2}}{γ}} . \end{matrix}

(53)

Figure 5. Markowitz Bullet.

Note that

G (\frac{1}{2} Var, i d, {S_{0}^{⊤} x = 1, x_{0} = 0}) = G (σ, i d, {S_{0}^{⊤} x = 1, x_{0} = 0})

. Thus, relationships (52) and (53) describe the efficient frontier

G_{e f f} (σ, i d, {S_{0}^{⊤} x = 1, x_{0} = 0})

as in Definition 5. In addition, note that (52) implies that

μ_{min} = β / γ

and

r_{min} = 1 / \sqrt{γ}

. Thus, as a corollary of Theorem 5, we have

Theorem 6.

(Markowitz Portfolio Theorem) Assume that the financial market

S_{t}

has no nontrivial riskless portfolio and

E [{\hat{S}}_{1}]

is not proportional to

{\hat{S}}_{0}

(see (42)). The Markowitz efficient portfolios of (41) represented in the

(σ, μ)

-plane are given by

G_{e f f} (σ, i d; {S_{0}^{⊤} x = 1, x_{0} = 0}) .

They correspond to the upper boundary of the Markowitz bullet given by

σ (μ) = \sqrt{\frac{γ μ^{2} - 2 β μ + α}{α γ - β^{2}}}, μ \in [\frac{β}{γ}, + \infty) .

The optimal portfolio

\hat{x} (μ)

can be determined by (46) and (50) as

\begin{matrix} \hat{x} (μ) = μ \frac{Σ^{- 1} (γ E [{\hat{S}}_{1}] - β {\hat{S}}_{0})}{α γ - β^{2}} + \frac{Σ^{- 1} (α {\hat{S}}_{0} - β E [{\hat{S}}_{1}])}{α γ - β^{2}}, \end{matrix}

(54)

which is affine in μ.

The structure of the optimal portfolio in (54) implies the well known two fund theorem derived by Tobin (1958).

Theorem 7.

(Two Fund Theorem) Select two distinct portfolios on the Markowitz efficient frontier. Then, any portfolio on the Markowitz efficient frontier can be represented as the linear combination of these two portfolios.

4.2. Capital Asset Pricing Model

The capital asset pricing model (CAPM) is a theoretical equilibrium model independently proposed by Lintner (1965), Mossin (1966), Sharpe (1964) and Treynor (1999) for pricing a risky asset according to its expected payoff and market risk, often referred to as the beta. The core of the capital asset pricing model is including a riskless bond in the Markowitz mean-variance analysis. Thus, we can apply the general framework in Section 3 with the same setting as in Section 4.1. Similar to the previous section, we can consider the equivalent problem of

\begin{matrix} min_{x \in R^{M + 1}} & \frac{1}{2} σ^{2} (S_{1}^{⊤} x) = \frac{1}{2} {\hat{x}}^{⊤} Σ \hat{x} = : \hat{r} (\hat{x}) \\ Subject to & E [S_{1}^{⊤} x] \geq μ, S_{0}^{⊤} x = 1 . \end{matrix}

(55)

Similar to the last section problem (55) is in the form (19) with

A = {x \in R^{M + 1} : S_{0}^{⊤} x = 1}

. We can check that condition (c1) in Theorem 5 is satisfied. Again, the risk function

\hat{r}

has compact level sets since

Σ

is positive definite. Thus, Assumption 4 is satisfied and Theorem 5 is applicable. The Lagrangian of this convex programming problem is

\begin{matrix} L (x, λ) : = \frac{1}{2} {\hat{x}}^{⊤} Σ \hat{x} + λ_{1} (μ - x^{⊤} E [S_{1}]) + λ_{2} (1 - x^{⊤} S_{0}), \end{matrix}

(56)

where

λ_{1} \geq 0

. Again, we have

\begin{matrix} 0 = \nabla_{x} L = (0, Σ \hat{x} (μ)) - (λ_{1} E [S_{1}] + λ_{2} S_{0}) . \end{matrix}

(57)

Using

S_{1}^{0} = R

and

S_{0}^{0} = 1

, the first component of (57) implies

\begin{matrix} λ_{2} = - λ_{1} R, \end{matrix}

(58)

so that (57) becomes

\begin{matrix} 0 = \nabla_{x} L = (0, Σ \hat{x} (μ)) - λ_{1} (E [S_{1}] - R S_{0}) . \end{matrix}

(59)

Clearly,

λ_{1} > 0

for

\hat{x} (μ) \neq 0

. Using the complementary slackness condition

E [S_{1}^{⊤} x (μ)] = μ,

we derive

\begin{matrix} σ^{2} (μ) = {\hat{x}}^{⊤} (μ) Σ \hat{x} (μ) = λ_{1} (μ - R), \end{matrix}

(60)

by left multiplying

x^{⊤} (μ)

in (59). Solving

\hat{x} (μ)

from (59), we have

\begin{matrix} \hat{x} (μ) = λ_{1} Σ^{- 1} (E [{\hat{S}}_{1}] - R {\hat{S}}_{0}) . \end{matrix}

(61)

Left multiplying with

E [{\hat{S}}_{1}^{⊤}]

and

{\hat{S}}_{0}^{⊤}

and using the

α, β

and

γ

introduced in the previous section, we derive

\begin{matrix} μ - x_{0} (μ) R = λ_{1} (α - R β) \end{matrix}

(62)

and

\begin{matrix} 1 - x_{0} (μ) = λ_{1} (β - R γ), \end{matrix}

(63)

respectively. Multiplying (63) by R and subtracting it from (62), we get

\begin{matrix} μ - R = λ_{1} (α - 2 β R + γ R^{2}) . \end{matrix}

(64)

Combining (60) and (64), we arrive at

\begin{matrix} σ^{2} (μ) = \frac{{(μ - R)}^{2}}{α - 2 β R + γ R^{2}} . \end{matrix}

(65)

Clearly, efficient portfolios only occur for

μ \geq R

, since, for

μ = R

, the pure bond portfolio

{(1, {\hat{0}}^{⊤})}^{⊤}

is the only efficient (and risk free) portfolio. Relation (65) defines a straight line on the

(σ, μ)

-plane

\begin{matrix} σ (μ) = \frac{μ - R}{\sqrt{Δ}} or μ = R + σ (μ) \sqrt{Δ}, \end{matrix}

(66)

where

Δ : = α - 2 β R + γ R^{2} > 0

if

\begin{matrix} E [{\hat{S}}_{1}] - R {\hat{S}}_{0} \neq \hat{0}, \end{matrix}

(67)

since

Σ

is positive definite. The line given in (66) is called the capital market line.

In addition, combining (61), (63) and (64), we have

\begin{matrix} x^{⊤} (μ) = Δ^{- 1} [α - β R - μ (β - γ R), (μ - R) (E [{\hat{S}}_{1}^{⊤}] - R {\hat{S}}_{0}^{⊤}) Σ^{- 1}] . \end{matrix}

(68)

Again, we see the affine structure of the solution. Note that, although the computation is done in terms of the risk function

\hat{r} (\hat{x}) = \frac{1}{2} {\hat{x}}^{⊤} Σ \hat{x}

, relationships in (66) are in terms the risk function

σ (S_{1}^{⊤} x)

. Thus, they describe the efficient frontier

G_{e f f} (σ, i d; {S_{0}^{⊤} x = 1})

as in Definition 5. In summary, we have

Theorem 8.

(CAPM) Assume that the financial market

S_{t}

of Definition 1 has no nontrivial riskless portfolio. Moreover, assume that condition (67) holds. The efficient portfolios for the CAPM model

G_{e f f} (σ, i d; {S_{0}^{⊤} x = 1})

represented in the

(σ, μ)

-plane are a straight line passing through

(0, R)

corresponding to the portfolio of pure risk free bond. The optimal portfolio

x (μ)

can be determined by (68), which is affine in μ and can be represented as points in the

(σ, μ)

-plane as located on the capital market line

μ = R + σ \sqrt{Δ}, σ \geq 0 .

In particular, when

μ = R

and

μ = (α - β R) / (β - γ R),

we derive, respectively, the portfolio

{(1, {\hat{0}}^{⊤})}^{⊤}

that contains only the riskless bond and the portfolio

{(0, (E [{\hat{S}}_{1}^{⊤}] - R {\hat{S}}_{0}^{⊤}) Σ^{- 1} / (β - γ R))}^{⊤}

that contains only risky assets. We call this portfolio the market portfolio and denote it

x_{M}

. The market portfolio corresponds to the coordinates

\begin{matrix} (σ_{M}, μ_{M}) = (\frac{\sqrt{Δ}}{β - γ R}, R + \frac{Δ}{β - γ R}) . \end{matrix}

(69)

Since the risk

σ

is non negative, we see that the market portfolio exists only when

β - γ R > 0 .

This condition is

\begin{matrix} {\hat{S}}_{0}^{⊤} Σ^{- 1} (E [{\hat{S}}_{1}] - R {\hat{S}}_{0}) > 0 . \end{matrix}

(70)

By Theorem 3,

\begin{matrix} (σ_{M}, μ_{M}) & \in & G_{e f f} (σ, i d; {S_{0}^{⊤} x = 1}) \cap G (σ, i d; {S_{0}^{⊤} x = 1, x_{0} = 0}) \\ \subset & G_{e f f} (σ, i d; {S_{0}^{⊤} x = 1, x_{0} = 0}) . \end{matrix}

(71)

Thus, the market portfolio has to reside on the Markowitz efficient frontier. Moreover, by (68), we can see that the market portfolio

x_{M}

is the only portfolio on the CAPM efficient frontier that consists of purely risky assets. Thus,

\begin{matrix} G_{e f f} (σ, i d; {S_{0}^{⊤} x = 1}) \cap G (σ, i d; {S_{0}^{⊤} x = 1, x_{0} = 0}) = {(σ_{M}, μ_{M})}, \end{matrix}

(72)

so that the capital market line is tangent to the Markowitz bullet at

(σ_{M}, μ_{M})

as illustrated in Figure 6.

Figure 6. Capital Market Line and Markowitz Bullet.

Remark 8.

Observe that

Σ^{- 1} (E [{\hat{S}}_{1}] - R {\hat{S}}_{0})

is proportional to the optimal portfolio in (61). Thus, condition (70) means that any optimal portfolio should have an positive initial cost. Note that (70) also implies (67).

The affine structure of the solutions is summarized in the following one fund theorem Sharpe (1964); Tobin (1958).

Theorem 9.

(One Fund Theorem) Assume that the financial market

S_{t}

has no nontrivial riskless portfolio. Moreover, assume that condition (70) holds. All the optimal portfolios in the CAPM model (55) are generalized convex combinations of the riskless bond and the market portfolio

x_{M} = {(0, (E [{\hat{S}}_{1}^{⊤}] - R {\hat{S}}_{0}^{⊤}) Σ^{- 1} / (β - γ R))}^{⊤}

, which corresponds to

(σ_{M}, μ_{M})

. The capital market line is tangent to the boundary of the Markowitz bullet at the coordinates of the market portfolio

(σ_{M}, μ_{M})

and intercepts the μ-axis at

(0, R)

(see Figure 6).

Alternatively, we can write the slope of the capital market line as

\begin{matrix} \sqrt{Δ} = \frac{μ_{M} - R}{σ_{M}} . \end{matrix}

(73)

This quantity is called the price of risk and we can rewrite the equation for the capital market line (66) as

\begin{matrix} μ = R + \frac{μ_{M} - R}{σ_{M}} σ . \end{matrix}

(74)

5. Affine Efficient Frontier for Positive Homogeneous Risk Measure

The affine dependence of the efficient portfolio on the return

μ

observed in the CAPM still holds when the standard deviation is replaced by the more general deviation measure (see Rockafellar et al. (2006)). In this section, we derive this affine structure using the general framework discussed in Section 3 and provide a proof different from that of Rockafellar et al. (2006). Moreover, we provide a sufficient condition for the existence of the master fund in the one fund theorem generalizing condition

β - R γ > 0

(see (70)) for the existence of the market portfolio in the CAPM model. We also construct a counter-example showing that the two fund theorem (Theorem 7) fails in this setting. Let us consider a risk measure

r

that satisfies (r1), (r1n), (r2) and (r3) in Assumption 2 and the related problem of finding efficient portfolios becomes

\begin{matrix} min_{x \in R^{M + 1}} & r (x) = \hat{r} (\hat{x}) \\ Subject to & E [S_{1}^{⊤} x] \geq μ, S_{0}^{⊤} x = 1 . \end{matrix}

(75)

Since, for

μ = R

, there is an obvious solution

x (R) = {(1, {\hat{0}}^{⊤})}^{⊤}

corresponding to

r (x (R)) = \hat{r} (\hat{0}) = 0

, we have

r_{min} = 0

and

μ_{min} = R

. In what follows, we will only consider

μ > R

. Moreover, we note that for

\hat{r}

satisfying the positive homogeneous property (r3) in Assumption 2,

\hat{y} \in \partial \hat{r} (\hat{x})

implies that

\begin{matrix} \hat{r} (\hat{x}) = ⟨ \hat{y}, \hat{x} ⟩ . \end{matrix}

(76)

In fact, for any

t \in (- 1, 1)

,

\begin{matrix} t \hat{r} (\hat{x}) = \hat{r} ((1 + t) \hat{x}) - \hat{r} (\hat{x}) \geq t ⟨ \hat{y}, \hat{x} ⟩, \end{matrix}

(77)

and (76) follows. Now we can state and prove the theorem on affine dependence of the efficient portfolio on the return

μ

.

Theorem 10.

(Affine Efficient Frontier for Positive Homogeneous Risk Measures) Assume that the risk measure

r

satisfies assumptions (r1), (r1n), (r2) and (r3) in Assumption 2 with

A = {x \in R^{M + 1} : S_{0}^{⊤} x = 1}

and Assumption 4 (b) holds. Furthermore, assume

\begin{matrix} E [{\hat{S}}_{1}] - R {\hat{S}}_{0} \neq \hat{0} . \end{matrix}

(78)

Then, there exists an efficient portfolio

x^{1}

corresponding to

(r_{1}, μ_{1}) : = (r (x^{1}), R + 1)

on the efficient frontier for problem (75) such that the efficient frontier for problem (75) in the risk-expected return space is a straight line that passes through the points (0,R) corresponding to a portfolio of pure bond

{(1, {\hat{0}}^{⊤})}^{⊤}

and

(r_{1}, μ_{1})

corresponding to the portfolio

x^{1}

, respectively. Moreover, the straight line connecting

{(1, {\hat{0}}^{⊤})}^{⊤}

and

x^{1}

in the portfolio space, namely for

μ \geq R

,

\begin{matrix} x (μ) = (μ_{1} - μ) {(1, {\hat{0}}^{⊤})}^{⊤} + (μ - R) x^{1} \end{matrix}

(79)

represents a set of efficient portfolios for (75) that corresponds to

\begin{matrix} (γ (μ), μ) = ((μ - R) r_{1}, μ) \end{matrix}

(80)

in the risk-expected return space (see Definition 5 and (19)).

Proof.

The Lagrangian of this convex programming problem (75) is

\begin{matrix} L (x, λ) : = r (x) + λ_{1} (μ - x^{⊤} E [S_{1}]) + λ_{2} (1 - x^{⊤} S_{0}), \end{matrix}

(81)

where

λ_{1} \geq 0

and

λ_{2} \in R

.

Condition (78) implies that there exists some

\bar{m} \in {1, 2, \dots, M}

, such that

E [S_{1}^{\bar{m}}] \neq R S_{0}^{\bar{m}}

. Hence, for any

μ

, there exists a portfolio of the form

y = {(y_{0}, 0, \dots, 0, y_{\bar{m}}, 0, \dots, 0)}^{⊤}

satisfying

\begin{matrix} [\begin{matrix} E [S_{1}^{⊤} y] \\ S_{0}^{⊤} y \end{matrix}] = [\begin{matrix} R y_{0} + E [S_{1}^{\bar{m}}] y_{\bar{m}} \\ y_{0} + S_{0}^{\bar{m}} y_{\bar{m}} \end{matrix}] = [\begin{matrix} R & E [S_{1}^{\bar{m}}] \\ 1 & S_{0}^{\bar{m}} \end{matrix}] [\begin{matrix} y_{0} \\ y_{\bar{m}} \end{matrix}] = [\begin{matrix} μ \\ 1 \end{matrix}] \end{matrix}

(82)

because the matrix in (82) is invertible. Thus, for any

μ \geq R

, Assumption 4 (b) with

A = {x \in R^{M + 1} : S_{0}^{⊤} x = 1}

and condition (78) ensure the existence of an optimal solution to problem (75).

Denoting one of those solutions by

x (μ)

(may not be unique), we have

\begin{matrix} γ (μ) = r (x (μ)) = \hat{r} (\hat{x} (μ)) . \end{matrix}

(83)

Fixing

μ_{1} = R + 1 > R

, denote

x^{1} = x (μ_{1})

. Then,

\begin{matrix} λ_{1} E [S_{1}] + λ_{2} S_{0} \in \partial r (x^{1}) . \end{matrix}

(84)

Since

r

is independent of

x_{0}

, we have

\begin{matrix} λ_{1} E [S_{1}^{0}] + λ_{2} S_{0}^{0} = 0 or λ_{2} = - λ_{1} R . \end{matrix}

(85)

Substituting (85) into (84) we have

\begin{matrix} λ_{1} E [{\hat{S}}_{1} - R {\hat{S}}_{0}] \in \partial \hat{r} ({\hat{x}}^{1}) \end{matrix}

(86)

so that, for all

\hat{x} \in R^{M}

,

\begin{matrix} \hat{r} (\hat{x}) - \hat{r} ({\hat{x}}^{1}) \geq λ_{1} E [{({\hat{S}}_{1} - R {\hat{S}}_{0})}^{⊤} (\hat{x} - {\hat{x}}^{1})] = λ_{1} (E [{({\hat{S}}_{1} - R {\hat{S}}_{0})}^{⊤} \hat{x}] - (μ_{1} - R)) \end{matrix}

(87)

because, at the optimal solution

{\hat{x}}^{1},

the constraint is binding. Using (r3), it follows from (76) and (86) that

\begin{matrix} \hat{r} ({\hat{x}}^{1}) = λ_{1} E [{({\hat{S}}_{1} - R {\hat{S}}_{0})}^{⊤} {\hat{x}}^{1}] = λ_{1} (μ_{1} - R) = λ_{1} . \end{matrix}

(88)

Thus, we can write (87) as

\begin{matrix} \hat{r} (\hat{x}) \geq \hat{r} ({\hat{x}}^{1}) E [{({\hat{S}}_{1} - R {\hat{S}}_{0})}^{⊤} \hat{x}] . \end{matrix}

(89)

For

t \geq 0,

define the homotopy between

x^{0} : = {(1, {\hat{0}}^{⊤})}^{⊤}

and

x^{1}

\begin{matrix} x^{t} : = (t x_{0}^{1} + (1 - t), t {\hat{x}}^{1}) . \end{matrix}

(90)

We can verify that

S_{0}^{⊤} x^{t} = 1

and

E [S_{1}^{⊤} x^{t}] = R + t

so that

\begin{matrix} E [{(S_{1} - R S_{0})}^{⊤} x^{t}] = t . \end{matrix}

(91)

On the other hand, it follows from assumptions (r1) and (r3) that

\begin{matrix} r (x^{t}) = \hat{r} (t {\hat{x}}^{1}) = t \hat{r} ({\hat{x}}^{1}) . \end{matrix}

(92)

Thus, for any x satisfying

S_{0}^{⊤} x = 1

and

E [S_{1}^{⊤} x] \geq R + t,

it follows from (89) that

\begin{matrix} \hat{r} (\hat{x}) \geq \hat{r} ({\hat{x}}^{1}) t . \end{matrix}

(93)

For any

μ > R

, letting

t_{μ} : = μ - R

, we have

μ = R + t_{μ}

and hence

x^{t_{μ}} = x (μ)

. Thus, by inequality (93), we have

\hat{r} (\hat{x} (μ)) \geq t_{μ} \hat{r} ({\hat{x}}^{1})

. On the other hand,

x (μ)

is an efficient portfolio implies that

\hat{r} (\hat{x} (μ)) \leq \hat{r} ({\hat{x}}^{t_{μ}}) = t_{μ} \hat{r} ({\hat{x}}^{1})

yielding equality

\begin{matrix} γ (μ) = \hat{r} (\hat{x} (μ)) = \hat{r} ({\hat{x}}^{t_{μ}}) = t_{μ} \hat{r} ({\hat{x}}^{1}) = (μ - R) \hat{r} ({\hat{x}}^{1}), for μ \geq R . \end{matrix}

(94)

In other words,

γ (μ)

is an affine function in

μ

. In addition, we conclude that points

(γ (μ), μ)

on this efficient frontier correspond to efficient portfolios

\begin{matrix} x (μ) = x^{t_{μ}} = ((μ - R) x_{0}^{1} + μ_{1} - μ, (μ - R) {\hat{x}}^{1}) = (μ_{1} - μ) {(1, {\hat{0}}^{⊤})}^{⊤} + (μ - R) x^{1} \end{matrix}

(95)

as an affine mapping of the parameter

μ

into the portfolio space showing (79).

In addition, using

r_{1},

we can write (94) as

\begin{matrix} γ (μ) = r_{1} (μ - R) . \end{matrix}

(96)

That is to say, the efficient frontier of (75) in the risk-expected return space is given by the parameterized straight line (80). ☐

Corollary 3.

In Theorem 10, if instead of (r3) the stronger condition (r3s) holds, then the portfolio

x^{1}

constructed there is unique and, therefore, for each fixed

μ \geq R

, the efficient portfolio

x (μ)

in (79) is unique.

Proof.

Apply Theorem 5 with condition (c3). ☐

Theorem 10 and Corollary 3 manifest a full generalization of Theorem 8 on the capital market pricing model to positive homogeneous risk measures. Note that the necessary conditions on the financial market in (67) and (78) are the same.

Remark 9.

(a) Clearly,

x^{t_{R}}

corresponds to the portfolio

{(1, {\hat{0}}^{⊤})}^{⊤}

with

γ (R) = \hat{r} (\hat{0}) = 0

. If

x_{0}^{1} < 1

, setting

μ_{M} : = \frac{μ_{1} - R x_{0}^{1}}{1 - x_{0}^{1}}

and

r_{M} : = γ (μ_{M}) = \hat{r} ({\hat{x}}^{1}) / (1 - x_{0}^{1}),

we see that

(r_{M}, μ_{M})

on the efficient frontier corresponds to a purely risky efficient portfolio of (75)

\begin{matrix} x_{M} : = x^{t_{μ_{M}}} = {(0, \frac{1}{1 - x_{0}^{1}} {({\hat{x}}^{1})}^{⊤})}^{⊤} . \end{matrix}

(97)

Since

x_{M}

belongs to the image of the affine mapping in (95), the family of efficient portfolios as described by the affine mapping in (95) contains both the pure bond

{(1, {\hat{0}}^{⊤})}^{⊤}

and the portfolio

x_{M}

that consists only of purely risky assets. In fact, we can represent the affine mapping in (95) as a parametrized line passing through

{(1, {\hat{0}}^{⊤})}^{⊤}

and

x_{M}

as

\begin{matrix} x^{t_{μ}} : = (1 - \frac{μ - R}{μ_{M} - R}) {(1, {\hat{0}}^{⊤})}^{⊤} + \frac{μ - R}{μ_{M} - R} x_{M}, for μ \geq R, \end{matrix}

(98)

which is a similar representation of the efficient portfolios as (79). The portfolio

x_{M}

is called a master fund in Rockafellar et al. (2006). When

r = σ

, it is the market portfolio in the CAPM. For a general risk measure

r

satisfying conditions (r1), (r1n), (r2) and (r3) in Assumption 2, the master funds

x_{M}

are not necessarily unique. However, all master funds correspond to the same point

(r_{M}, μ_{M})

in the risk-expected return space.

(b) We can also consider problem (75) on the set of admissible portfolios of purely risky assets, namely

G_{e f f} (r, i d; {S_{0}^{⊤} x = 1, x_{0} = 0})

. Then, similar to the relationship between the Markowitz efficient frontier and the capital market line, it follows from Theorem 10 that

\begin{matrix} G (r, i d; {S_{0}^{⊤} x = 1, x_{0} = 0}) \cap G_{e f f} (r, i d; {S_{0}^{⊤} x = 1}) = {(r_{M}, μ_{M})}, \end{matrix}

(99)

as illustrated in Figure 7.

Figure 7. Capital Market Line for (75) when

1 - x_{0}^{1} > 0

.

(c) If

x_{0}^{1} = 1

, then the efficient portfolios in (79) are related to μ in a much simpler fashion

\begin{matrix} {(1, {\hat{0}}^{⊤})}^{⊤} + (μ - R) {(0, {({\hat{x}}^{1})}^{⊤})}^{⊤} . \end{matrix}

(100)

There is no master fund as observed in Rockafellar et al. (2006) in this case. In the language of Rockafellar et al. (2006), the portfolio

x^{1}

is called a basic fund. Thus, Theorem 10 recovers the results in Theorem 2 and Theorem 3 in Rockafellar et al. (2006) with a different proof and a weaker condition (condition (78) is weaker than (A2) on page 752 of Rockafellar et al.). However, Corollary 3 is a significant improvement yielding uniqueness in case (r3s) holds. This will help below when we derive a sufficient condition for the existence of a master fund, which is solely depending on the risk measure and the financial market.

We see in Remark 9 that the existence of a master fund depends on whether or not

x_{0}^{1} < 1

. Below, we characterize this condition in terms of

f (\hat{x}) : = {[\hat{r} (\hat{x})]}^{2} / 2

and its Fenchel conjugate

f^{*} : R^{M} \to R

, defined by

f^{*} (\hat{y}) : = {sup}_{\hat{x} \in R^{M}} {⟨ \hat{y}, \hat{x} ⟩ - f (\hat{x})}

.

Theorem 11.

Under the conditions of Corollary 3, assuming that

f^{*}

is differentiable at

E [{\hat{S}}_{1} - R {\hat{S}}_{0}]

, a master fund exists if and only if

{\hat{S}}_{0}^{⊤} \nabla f^{*} (E [{\hat{S}}_{1} - R {\hat{S}}_{0}]) > 0

.

Proof.

Combining (86) and (88) and using the chain rule, we can see that

\begin{matrix} {[\hat{r} ({\hat{x}}^{1})]}^{2} E [{\hat{S}}_{1} - R {\hat{S}}_{0}] \in \partial f ({\hat{x}}^{1}) . \end{matrix}

(101)

By virtue of the Fenchel–Young equality (see (Carr and Zhu forthcoming, Proposition 1.3.1)), we have

\begin{matrix} f ({\hat{x}}^{1}) + f^{*} ({[\hat{r} ({\hat{x}}^{1})]}^{2} E [{\hat{S}}_{1} - R {\hat{S}}_{0}]) = ⟨ {[\hat{r} ({\hat{x}}^{1})]}^{2} E [{\hat{S}}_{1} - R {\hat{S}}_{0}], {\hat{x}}^{1} ⟩, \end{matrix}

(102)

and

\begin{matrix} \nabla f^{*} ({[\hat{r} ({\hat{x}}^{1})]}^{2} E [{\hat{S}}_{1} - R {\hat{S}}_{0}]) = {\hat{x}}^{1} . \end{matrix}

(103)

It follows that

x_{0}^{1} < 1

is equivalent to

\begin{matrix} 0 < 1 - x_{0}^{1} = {\hat{S}}_{0}^{⊤} {\hat{x}}^{1} = {\hat{S}}_{0}^{⊤} \nabla f^{*} ({[\hat{r} ({\hat{x}}^{1})]}^{2} E [{\hat{S}}_{1} - R {\hat{S}}_{0}]) = {[\hat{r} ({\hat{x}}^{1})]}^{4} {\hat{S}}_{0}^{⊤} \nabla f^{*} (E [{\hat{S}}_{1} - R {\hat{S}}_{0}]) . \end{matrix}

(104)

The last equality is because

f (t \hat{x}) = t^{2} f (\hat{x})

implies

f^{*} (t \hat{y}) = t^{2} f^{*} (\hat{y})

. ☐

Remark 10.

We refer to Borwein and Vanderwerff (2009) for conditions ensuring the differentiability of

f^{*}

in Theorem 11. In the CAPM model

f (\hat{x}) = \frac{1}{2} {\hat{x}}^{⊤} Σ \hat{x}

and

f^{*} (\hat{y}) = \frac{1}{2} {\hat{y}}^{⊤} Σ^{- 1} \hat{y}

. Thus, the master fund exists if and only if

β - R γ = {\hat{S}}_{0}^{⊤} Σ^{- 1} E [{\hat{S}}_{1} - R {\hat{S}}_{0}] > 0,

which exactly recovers the condition in (70) for the existence of a market portfolio in the one fund theorem (cf. Theorem 9).

In general, for a risk measure with (r1), (r1n) and (r3s), if

f (\hat{x}) = {[r (\hat{x})]}^{2} / 2

is

C^{2}

, then

f (\hat{x}) = \frac{1}{2} {\hat{x}}^{⊤} \hat{Σ} \hat{x}

where

\hat{Σ}

is the Hessian of f at

\hat{0}

. Thus, a criterion for the existence of a master fund similar to (70) holds with Σ replaced by

\hat{Σ}

.

Another very useful case is

\hat{r} (\hat{x}) = {∥ \hat{x} ∥}_{max}

. It is not hard to show that the conjugate of

f (\hat{x}) = {∥ \hat{x} ∥}_{max}^{2} / 2

is

f^{*} (\hat{y}) = {∥ \hat{y} ∥}_{1}^{2} / 2

. In fact, it follows from the Cauchy inequality that

∥ \hat{x} ∥_{max}^{2} / 2 + {∥ \hat{y} ∥}_{1}^{2} / 2 \geq ⟨ \hat{x}, \hat{y} ⟩ .

Thus,

\begin{matrix} ∥ \hat{y} ∥_{1}^{2} / 2 \geq f^{*} (\hat{y}) . \end{matrix}

(105)

On the other hand, for any

\hat{y} = {(y_{1}, \dots, y_{M})}^{⊤}

, defining

{\hat{x}}_{t} : = t {(sgn (y_{1}), \dots, sgn (y_{M}))}^{⊤}

, we have

\begin{matrix} ⟨ {\hat{x}}_{t}, \hat{y} ⟩ - ∥ {\hat{x}}_{t} ∥_{max}^{2} / 2 = t {∥ \hat{y} ∥}_{1} - t^{2} / 2 . \end{matrix}

(106)

The maximum of the expression in (106) as a function of t is

∥ \hat{y} ∥_{1}^{2} / 2

. It follows that

\begin{matrix} ∥ \hat{y} ∥_{1}^{2} / 2 \leq f^{*} (\hat{y}) . \end{matrix}

(107)

Combining (105) and (107), we arrive at

∥ \hat{y} ∥_{1}^{2} / 2 = f^{*} (\hat{y})

. This example illustrates that using

{\hat{r}}^{2} / 2

and its conjugate often helps. In fact,

f^{*}

is differentiable everywhere except for the coordinate axises. However,

{∥ \cdot ∥}_{max}^{*}

is an indicator function on the closed set

{\hat{y} : ∥ \hat{y} ∥_{1} \leq 1}

(see (Carr and Zhu forthcoming), Proposition 2.4.2)), whose derivative is 0 at any differentiable point and, therefore, is not useful for our purpose.

Since the standard deviation satisfies Assumptions (r1), (r1n), (r2) and (r3s), the result above is a generalization of the relationship between the CAPM model and the Markowitz portfolio theory. We note that the standard deviation is not the only risk measure that satisfies these assumptions. For example, some forms of approximation to the expected drawdowns also satisfy these assumptions (cf. Maier-Paape and Zhu (2017)).

Theorem 10 and Corollary 3 are a full generalization of Theorem 8 on the CAPM and Theorem 11 is a generalization of the one fund theorem in Theorem 9. On the other hand, in Rockafellar et al. (2006), footnote 10, it has been noted that a similar generalization of the two fund theorem (Theorem 7) is not to be expected. We construct a concrete counter-example below.

Example 3.

(Counter-example to a Generalized Two Fund Theorem) Let us consider, for example,

\begin{matrix} min_{\hat{x} \in R^{3}} & \hat{r} (\hat{x}) \\ Subject to & E [{\hat{S}}_{1}^{⊤} \hat{x}] \geq μ, {\hat{S}}_{0}^{⊤} \hat{x} = 1, \end{matrix}

(108)

with

M = 3

.

Choose all

S_{0}^{m} = 1

, so that

{\hat{S}}_{0}^{⊤} \hat{x} = 1

is

x_{1} + x_{2} + x_{3} = 1

. Choose the payoff

S_{1}

such that

E [{\hat{S}}_{1}^{⊤} \hat{x}] = x_{1}

so that

x_{1} = μ

at the optimal solution. Finally, let us construct

\hat{r} (\hat{x})

so that the optimal solution

\hat{x} (μ)

is not affine in μ.

We do so by constructing a convex set G with

0 \in int G

(interior of G) and then set

\hat{r} (\hat{x}) = 1

for

\hat{x} \in \partial G

(boundary of G) and extend

\hat{r}

to be positive homogeneous. Then, (r1), (r1n), (r2) and (r3) are satisfied.

Now, let us specify G. Take the convex hull of the set

[- 5, 5] \times [- 1, 1] \times [- 1, 1]

and five other points. One point is

E = {(10, 0, 0)}^{⊤}

and the other four points

A, B, C

and D, are the corner points of a square that lies in the plane

x_{1} = 9

and has unit side length. To obtain that square, take the standard square with unit side length in

x_{1} = 9

, i.e., the square with corner points

{(9, \pm 1 / 2, \pm 1 / 2)}^{⊤}

and rotate this square by 30 degrees counter clockwise in the

x_{2} x_{3}

-plane. Doing some calculation, one gets:

\begin{matrix} A & = & {(9, (- 1 + \sqrt{3}) / 4, (1 + \sqrt{3}) / 4)}^{⊤}, B = {(9, (- 1 - \sqrt{3}) / 4, (- 1 + \sqrt{3}) / 4)}^{⊤}, \\ C & = & (9, (1 - \sqrt{3}) / 4, - (1 + \sqrt{3}) / 4))^{⊤}, D = (9, (1 + \sqrt{3}) / 4), (1 - \sqrt{3}) / 4 {))}^{⊤} . \end{matrix}

Obviously for

μ = 1,

the optimal solution is

\hat{x} (1) = {(1, 0, 0)}^{⊤}

with

\hat{r} (\hat{x} (1)) = 1 / 10

. For

μ = 1 + ϵ

with

ϵ > 0

small, we have

\hat{x} (1 + ϵ) = (1 + ϵ, ϵ \sqrt{3} (+ 1 - \sqrt{3}) / 6, ϵ \sqrt{3} (- 1 - \sqrt{3}) / 6))^{⊤}

(they lie on the ray through a point on the convex combination of C and

{(10, 0, 0)}^{⊤}

), and, for

μ = 1 + d

with

d > 0

large, we have

\hat{x} (1 + d) = {(1 + d, - d / 2, - d / 2)}^{⊤}

(they lie on the ray through a point on the set

{{(x_{1}, - 1, - 1)}^{⊤} : x_{1} \in (2, 5)}

. Therefore,

\hat{x} (μ)

cannot be affine in μ.

6. Growth Optimal and Leverage Space Portfolio

Growth portfolio theory is proposed by Lintner (1965) and is also related to the work of Kelly (1956). It is equivalent to maximizing the expected log utility:

\begin{matrix} max_{x \in R^{M + 1}} & E [ln (S_{1}^{⊤} x)] \\ Subject to & S_{0}^{⊤} x = 1 . \end{matrix}

(109)

Remark 11.

Problem (109) is equivalent to

\begin{matrix} max_{\hat{x} \in R^{M}} E [ln (R + {({\hat{S}}_{1} - R {\hat{S}}_{0})}^{⊤} \hat{x})] . \end{matrix}

(110)

The following theorem establishes the existence of the growth optimal portfolio as a corollary of our results in Section 3. This theorem reconfirms previous results in Hermes and Maier-Paape (2017) with somewhat different conditions and a shorter proof.

Theorem 12.

(Growth Optimal Portfolio) Assume that the financial market

S_{t}

of Definition 1 has no nontrivial riskless portfolio. Then, problem (109) has a unique optimal portfolio, which is often referred to as the growth optimal portfolio and is denoted

κ \in R^{M + 1}

.

To prove Theorem 12, we need the following lemma.

Lemma 2.

Assume that the financial market

S_{t}

of Definition 1 has no nontrivial riskless portfolio. Let u be a utility function satisfying (u3) in Assumption 3. Then, for any

μ \in R

,

\begin{matrix} {x \in R^{M + 1} : E [u (S_{1}^{⊤} x)] \geq μ, S_{0}^{⊤} x = 1} \end{matrix}

(111)

is compact (and possibly empty in some cases).

Proof.

Since, by Assumption 3, u is upper semi-continuous, the set in (111) is closed. Thus, we need only to show it is also bounded. Assume the contrary that there exists a sequence of portfolios

x^{n}

with

\begin{matrix} S_{0}^{⊤} x^{n} = 1 \end{matrix}

(112)

and

∥ x^{n} ∥ \to \infty

satisfying

\begin{matrix} E [u (S_{1}^{⊤} x^{n})] \geq μ . \end{matrix}

(113)

Equation (112) implies that

∥ {\hat{x}}^{n} ∥ \to \infty

. Then, without loss of generality, we may assume

x^{n} / ∥ {\hat{x}}^{n} ∥

converges to

x^{*} = {(x_{0}^{*}, {({\hat{x}}^{*})}^{⊤})}^{⊤}

where

∥ {\hat{x}}^{*} ∥ = 1

. Condition (u3) and (113) for arbitrary

μ \in R

imply that, for each natural number n,

\begin{matrix} S_{1}^{⊤} x^{n} \geq 0 . \end{matrix}

(114)

Dividing (112) and (114) by

∥ {\hat{x}}^{n} ∥

and taking limits as

n \to \infty,

we derive

S_{0}^{⊤} x^{*} = 0

and

S_{1}^{⊤} x^{*} \geq 0

. Thus, we have

\begin{matrix} {({\hat{S}}_{1} - R {\hat{S}}_{0})}^{⊤} {\hat{x}}^{*} \geq 0, \end{matrix}

(115)

and thus

x^{*}

is a nontrivial riskless portfolio, which is a contradiction. ☐

Proof of Theorem 12.

We can verify that the utility function

u = ln

satisfies conditions (u1), (u2s), (u3) and (u4). In addition,

{x : E [ln (S_{1}^{⊤} x)] \geq ln (R), S_{0}^{⊤} x = 1} \neq \emptyset

because it contains

{(1, {\hat{0}}^{⊤})}^{⊤}

. Thus, Lemma 2 implies that problem (109) has at least one solution and

μ_{max} = max_{x \in R^{M + 1}} {E [ln (S_{1}^{⊤} x)] : S_{0}^{⊤} x = 1}

is finite. By Proposition 6,

x \mapsto E [ln (S_{1}^{⊤} x)]

is strictly concave. Thus, problem (109) has a unique optimal portfolio. ☐

Assuming one repeatedly invests in the identical one period financial market, the growth optimal portfolio has the nice property that it provides the fastest compounded growth of the capital. By Remark 7(b), it is independent of any risk measures. In the special case that all the risky assets are representing a certain gaming outcome,

κ

is the Kelly allocation in Kelly (1956). However, the growth portfolio is seldomly used in investment practice for being too risky. The book (MacLean et al.2009) provides an excellent collection of papers with chronological research on this subject. These observations motivated Vince (2009) to introduce his leverage space portfolio to scale back from the growth optimal portfolio. Recently, De Prado et al. (2013); Vince and Zhu (2015) further introduce systematical methods to scale back from the growth optimal portfolio by, among other ideas, explicitly accounting for limiting a certain risk measure. The analysis in Vince and Zhu (2015) and De Prado et al. (2013) can be phrased as solving

\begin{matrix} γ (μ) : = inf {r (x) = \hat{r} (\hat{x}) : E [ln (S_{1}^{⊤} x)] \geq μ, S_{0}^{⊤} x = 1}, \end{matrix}

(116)

where

r

is a risk measure that satisfies conditions (r1) and (r2). Alternatively, to derive the efficient frontier, we can also consider

\begin{matrix} ν (r) : = sup {E [ln (S_{1}^{⊤} x)] : r (x) = \hat{r} (\hat{x}) \leq r, S_{0}^{⊤} x = 1} . \end{matrix}

(117)

Applying Proposition 8, Theorem 5 and Remark 7 to the set of admissible portfolios

A = {x \in R^{M + 1} : S_{0}^{⊤} x = 1}

, we derive:

Theorem 13.

(Leverage Space Portfolio and Risk Measure) We assume that the financial market

S_{t}

in Definition 1 has no nontrivial riskless portfolio and that the risk measure

r

satisfies conditions (r1), (r1n) and (r2). Then, the problem

\begin{matrix} sup_{x \in R^{M + 1}} & E [ln (S_{1}^{⊤} x)] \\ subject to & r (x) = \hat{r} (\hat{x}) \leq r, S_{0}^{⊤} x = 1 \end{matrix}

(118)

has a bounded efficient frontier that can be parameterized as follows:

(a) problem (116) defines

γ (μ) : [ln (R), μ_{κ}] \to R

as a continuous increasing convex function, where

μ_{κ} : = E [ln (S_{1}^{⊤} κ)]

and κ is the optimal growth portfolio. Moreover, problem (116) has a continuous path of unique solutions

z (μ) : = x (γ (μ), μ)

that maps the interval

[ln (R), μ_{κ}]

into a curve in the leverage portfolio space

R^{M + 1}

. Finally,

z (ln (R)) = {(1, {\hat{0}}^{⊤})}^{⊤}

,

z (μ_{κ})) = κ

,

γ (ln (R)) = \hat{r} (\hat{0}) = 0

and

γ (μ_{κ}) = r (κ)

.

(b) problem (117) defines

ν (r) : [0, r (κ)] \to R

as a continuous increasing concave function, where κ is the optimal growth portfolio. Moreover, problem (117) has a continuous path of unique solutions

y (r) : = x (r, ν (r))

that maps the interval

[0, r (κ)]

into a curve in the leverage portfolio space

R^{M + 1}

. Finally,

y (0) = {(1, {\hat{0}}^{⊤})}^{⊤}

,

y (r (κ)) = κ

,

ν (0) = ln (R)

and

ν (r (κ)) = μ_{κ}

.

Proof.

Note that Assumption 4 (a) holds due to Lemma 2 and (c2) in Theorem 5 is also satisfied. Then, (a) follows straightforwardly from Theorem 5, where

μ_{max} = μ_{κ}

and

μ_{min} = ln (R)

are finite and attained and (b) follows from Theorem 5 with

r_{min} = 0

and

r_{max} = r (κ)

. ☐

Remark 12.

Theorem 13 relates the leverage portfolio space theory to the framework setup in Section 3. It becomes clear that each risk measure satisfying conditions (r1), (r1n) and (r2) generates a path in the leverage portfolio space connecting the portfolio of a pure riskless bond to the growth optimal portfolio. Theorem 13 also tells us that different risk measures usually correspond to different paths in the portfolio space. Many commonly used risk measures satisfy conditions (r1) and (r2). The curve

z (μ)

provides a pathway to reduce risk exposure along the efficient frontier in the risk-expected log utility space. As observed in De Prado et al. (2013); Vince and Zhu (2015), when investments have only a finite time horizon, then there are additional interesting points along the path

z (μ)

such as the inflection point and the point that maximizes the return/risk ratio. Both of which provide further landmarks for investors.

Similar to the previous sections, we can also consider the related problem of using only portfolios involving risky assets, i.e.,

\begin{matrix} max_{\hat{x} \in R^{M}} & E [ln ({\hat{S}}_{1}^{⊤} \hat{x})] \\ subject to & {\hat{S}}_{0}^{⊤} \hat{x} = 1 . \end{matrix}

(119)

Theorem 14.

(Existence of Solutions) Suppose that

\begin{matrix} S_{1}^{i} (ω) > 0, \forall ω \in Ω, i = 1, \dots, M . \end{matrix}

(120)

Then, problem (119) has a solution.

Proof.

As in the proof of Theorem 13, we can see that Assumption 4 (a) holds due to Lemma 2. Observe that, for

{\hat{x}}^{*} = {(1 / M, 1 / M, \dots, 1 / M)}^{⊤}

, we get from (120) that

E [ln ({\hat{S}}_{1}^{⊤} {\hat{x}}^{*})]

is finite. Then, we can directly apply Theorem 5 with

A = {x \in R^{M + 1} : S_{0}^{⊤} x = 1, x_{0} = 0}

. ☐

With the help of Theorem 14, we can conclude that problem

\begin{matrix} sup_{\hat{x} \in R^{M}} & E [ln ({\hat{S}}_{1}^{⊤} \hat{x})] \\ subject to & \hat{r} (\hat{x}) \leq r, {\hat{S}}_{0}^{⊤} \hat{x} = 1 \end{matrix}

(121)

generates an efficient frontier as well (comparable to the Markowitz bullet for

u = i d

). However, due to the involvement of the log utility function, the relative location of efficient frontiers stemming from (118) and (121) may have several different configurations. The following is an example.

Example 4.

Let

M = 1

. Consider a sample space

Ω = {0, 1}

with probability

P (0) = 0.45

and

P (1) = 0.55

and a financial market involving a riskless bond with

R = 1

and one risky asset specified by

S_{0}^{1} = 1

,

S_{1}^{1} (0) = 0.5

and

S_{1}^{1} (1) = 1 + α

with

α > 9 / 22

so that

E [S_{1}^{1}] > S_{0}^{1}

. Use the risk measure

r_{1} (x_{0}, x_{1}) = | x_{1} |

(which is an approximation of the drawdown cf. Vince and Zhu (2015)). Then, it is easy to calculate that the efficient frontier corresponding to (118) is

\begin{matrix} ν (r) = 0.55 ln (1 + α r) + 0.45 ln (1 - 0.5 r), r \in [0, r_{max}^{α}], \end{matrix}

(122)

where

r_{max}^{α} = (22 α - 9) / 20 α

. On the other hand, the efficient frontier stemming from (121) is a single point

{(1, ν (1))},

where

ν (1) = 0.55 ln (1 + α) - 0.45 ln (2)

.

When

α \in (9 / 22, 9 / 2)

, the two efficient frontiers corresponding to (118) and (121) have no common points (see Figure 8). However, when

α \geq 9 / 2

,

G_{e f f} (r_{1}, ln; {S_{0}^{⊤} x = 1, x_{0} = 0}) \subset G_{e f f} (r_{1}, ln; {S_{0}^{⊤} x = 1})

(see Figure 9). In particular, when

α = 9 / 2

,

G_{e f f} (r_{1}, ln; {S_{0}^{⊤} x = 1, x_{0} = 0})

coincides with the point on

G_{e f f} (r_{1}, ln; {S_{0}^{⊤} x = 1})

corresponding to the growth optimal portfolio as illustrated in Figure 10.

Figure 8. Separated efficient frontiers.

Figure 9. Touching efficient frontiers.

Figure 10. Touching efficient frontiers at growth optimal.

In fact, a far more common restriction to the set of admissible portfolios are limits of risk. For this example, if, for instance, we restrict the risk by

r_{1} (x) \leq 0.5

, then we will create a shared efficient frontier from (118) when

r

is a priori restricted (see Figure 11).

Figure 11. Shared efficient frontiers.

Remark 13.

(Efficiency Index) Although the growth optimal portfolio is usually not implemented as an investment strategy, the maximum utility

μ_{max}

corresponding to the growth optimal portfolio κ, empirically estimated using historical performance data, can be used as a measure to compare different investment strategies. This is proposed in Zhu (2007) and called the efficiency index. When the only risky asset is the payoff of a game with two outcomes following a given playing strategy, the efficiency coefficient coincides with Shannon’s information rate (see Kelly (1956); Shannon and Weaver (1949); Zhu (2007)). In this sense, the efficiency index gauges the useful information contained in the investment strategy it measures.

7. Conclusions

Following the pioneering idea of Markowitz to trade-off the expected return and standard deviation of a portfolio, we consider a general framework to efficiently trade-off between a concave expected utility and a convex risk measure for portfolios. Under reasonable assumptions, we show that (i) the efficient frontier in such a trade-off is a convex curve in the expected utility-risk space, (ii) the optimal portfolio corresponding to each level of the expected utility is unique and (iii) the optimal portfolios continuously depend on the level of the expected utility. Moreover, we provide an alternative treatment and enhancement of the results in Rockafellar et al. (2006) showing that the one fund theorem (Theorem 9) holds in the trade-off between a deviation measure and the expected return (Theorem 11) and construct a counter-example illustrating that the two fund theorem (Theorem 7) fails in such a general setting. Furthermore, the efficiency curve in the leverage space is supposedly an economic way to scale back risk from the growth optimal portfolio (Theorem 13).

This general framework unifies a group of well known portfolio theories. They are Markowitz portfolio theory, capital asset pricing model, the growth optimal portfolio theory, and the leverage portfolio theory. It also extends these portfolio theories to more general settings.

The new framework also leads to many questions of practical significance worthy of further explorations. For example, quantities related to portfolio theories such as the Sharpe ratio and efficiency index can be used to measure investment performances. What other performance measurements can be derived using the general framework in Section 3? Portfolio theory can also inform us about pricing mechanisms such as those discussed in the capital asset pricing model and the fundamental theorem of asset pricing (see (Carr and Zhu forthcoming, Section 2.3). What additional pricing tools can be derived from our general framework?

Clearly, for the purpose of applications, we need to focus on certain special cases. Drawdown related risk measures coupled with the log utility attracts much attention in practice. In Part II of this series Maier-Paape and Zhu (2017), several drawdown related risk measures are constructed and analyzed.

Author Contributions

S.M.-P. and Q.J.Z. contributed equally to the work reported.

Acknowledgments

We thank Andreas Platen for his constructive suggestions after reading earlier versions of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Artzner, Philippe, Freddy Delbaen, Jean-Marc Eber, and Davia Heath. 1999. Coherent measures of risk. Mathematical Finance 9: 203–27. [Google Scholar] [CrossRef]
Borwein, Jonathan M., and Jon Vanderwerff. 2009. Differentiability of conjugate functions and perturbed minimization principles. Journal of Convex Analysis 16: 707–11. [Google Scholar]
Borwein, Jonathan M., and Qiji Jim Zhu. 2005. Techniques of Variational Analysis. New York: Springer. [Google Scholar]
Borwein, Jonathan M., and Qiji Jim Zhu. 2016. A variational approach to Lagrange multipliers. Journal of Optimization Theory and Applications 171: 727–56. [Google Scholar] [CrossRef]
Carr, Peter, and Qiji Jim Zhu. Forthcoming. Convex Duality and Financial Mathematics. Berlin: Springer.
De Prado, Marcos Lopez, Ralph Vince, and Qiji Jim Zhu. 2013. Optimal risk budgeting under a finite investment horizon. SSRN Electronic Journal, 2364092. Available online: https://ssrn.com/abstract=2364092 (accessed on 29 March 2018).
Hermes, Andreas, and Stanislaus Maier-Paape. 2017. Existence and Uniqueness for the Multivariate Discrete Terminal Wealth Relative. Risks 5: 44. [Google Scholar] [CrossRef]
Kelly, John L. 1956. A new interpretation of information rate. Bell System Technical Journal 35: 917–26. [Google Scholar] [CrossRef]
Lintner, John. 1965. The valuation of risk assets and the selection of risky investments in stock portfolios and capital budgets. Review of Economics and Statistics 47: 13–37. [Google Scholar] [CrossRef]
MacLean, Leonard C., Edward O. Thorp, and William T. Ziemba, eds. 2009. The Kelly Capital Growth Criterion: Theory and Practice. Singapore: World Scientific. [Google Scholar]
Maier-Paape, Stanislaus. 2015. Optimal f and diversification. International Federation of Technical Analysts Journal 15: 4–7. [Google Scholar]
Maier-Paape, Stanislaus. 2016. Risk Averse Fractional Trading Using the Current Drawdown. Report No. 88. Aachen: Institut für Mathematik, RWTH Aachen. [Google Scholar]
Maier-Paape, Stanislaus, and Qiji Jim Zhu. 2017. A General Framework for Portfolio Theory. Part II: Drawdown Risk Measures. Report No. 92. Aachen: Institut für Mathematik, RWTH Aachen. [Google Scholar]
Markowitz, Harry. 1959. Portfolio Selection. Cowles Monograph, 16. New York: Wiley. [Google Scholar]
Mossin, Jan. 1966. Equilibrium in a Capital Asset Market. Econometrica 34: 768–83. [Google Scholar] [CrossRef]
Rockafellar, Ralph Tyrell. 1970. Convex Analysis. Vol. 28 Princeton Math. Series; Princeton: Princeton University Press. [Google Scholar]
Rockafellar, R. Tyrrell, and Stanislav Uryasev. 2000. Optimization of conditional value-at-risk. Journal of Risk 2: 21–42. [Google Scholar] [CrossRef]
Rockafellar, R. Tyrrell, Stan Uryasev, and Michael Zabarankin. 2006. Master funds in portfolio analysis with general deviation measures. Journal of Banking and Finance 30: 743–78. [Google Scholar] [CrossRef]
Shannon, Claude E., and Warren Weaver. 1949. The Mathematical Theory of Communication. Urbana: University of Illinois Press. [Google Scholar]
Sharpe, William F. 1964. Capital asset prices: A theory of market equilibrium under conditions of risk. Journal of Finance 19: 425–42. [Google Scholar]
Sharpe, William F. 1966. Mutual fund performance. Journal of Business 1: 119–38. [Google Scholar] [CrossRef]
Tobin, James. 1958. Liquidity preference as behavior towards risk. The Review of Economic Studies 26: 65–86. [Google Scholar] [CrossRef]
Treynor, Jack L. 1999. Toward a theory of market value of risky assets. In Asset Pricing and Portfolio Performance: Models, Strategy and Performance Metrics. Edited by Robert A. Korajczyk. London: Risk Books, pp. 15–22. [Google Scholar]
Vince, Ralph. 2009. The Leverage Space Trading Model. Hoboken: John Wiley and Sons. [Google Scholar]
Vince, Ralph, and Qiji Jim Zhu. 2015. Optimal betting sizes for the game of blackjack. Risk Journals: Portfolio Management 4: 53–75. [Google Scholar] [CrossRef]
Zhu, Qiji Jim. 2007. Mathematical analysis of investment systems. Journal of Mathematical Analysis and Applications 326: 708–20. [Google Scholar] [CrossRef]

Figure 1. Efficient frontier with both

r_{min}

and

μ_{min}

are finite and attained.

Figure 2. Efficient frontier with

μ_{min} = - \infty

.

Figure 3. Efficient frontier when

r_{min} > 0

and

μ_{max}

is finite and attained as maximum.

Figure 4. Efficient frontier with

{(1, {\hat{0}}^{⊤})}^{⊤} \in A

.

Figure 5. Markowitz Bullet.

Figure 6. Capital Market Line and Markowitz Bullet.

Figure 7. Capital Market Line for (75) when

1 - x_{0}^{1} > 0

.

Figure 8. Separated efficient frontiers.

Figure 9. Touching efficient frontiers.

Figure 10. Touching efficient frontiers at growth optimal.

Figure 11. Shared efficient frontiers.

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

A General Framework for Portfolio Theory—Part I: Theory and Various Models

Abstract

1. Introduction

2. Preliminaries

2.1. A Portfolio Model

2.2. Convex Programming

3. Efficient Trade-Off between Risk and Utility

3.1. Technical Assumptions

3.2. Efficient Frontier for the Risk-Utility Trade-Off

3.3. Representation of Efficient Frontier

3.4. Efficient Portfolios

4. Markowitz Portfolio Theory and CAPM Model

4.1. Markowitz Portfolio Theory

4.2. Capital Asset Pricing Model

5. Affine Efficient Frontier for Positive Homogeneous Risk Measure

6. Growth Optimal and Leverage Space Portfolio

7. Conclusions

Author Contributions

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics