Comparing Distributions of Sums of Random Variables by Deficiency: Discrete Case

Vladimir E. Bening; Victor Y. Korolev

doi:10.3390/math10030454

and

¹

Faculty of Computational Mathematics and Cybernetics, Moscow State University, 119991 Moscow, Russia

²

Moscow Center for Fundamental and Applied Mathematics, 119991 Moscow, Russia

³

Federal Research Center “Computer Science and Control”, Russian Academy of Sciences, 119333 Moscow, Russia

^*

Author to whom correspondence should be addressed.

Mathematics2022, 10(3), 454;https://doi.org/10.3390/math10030454

This article belongs to the Special Issue Stability Problems for Stochastic Models: Theory and Applications II

Version Notes

Order Reprints

Abstract

In the paper, we consider a new approach to the comparison of the distributions of sums of random variables. Unlike preceding works, for this purpose we use the notion of deficiency that is well known in mathematical statistics. This approach is used, first, to determine the distribution of a separate random variable in the sum that provides the least possible number of summands guaranteeing the prescribed value of the

(1 - α)

-quantile of the normalized sum for a given

α \in (0, 1)

, and second, to determine the distribution of a separate random variable in the sum that provides the least possible number of summands guaranteeing the prescribed value of the probability for the normalized sum to fall into a given interval. Both problems are solved under the condition that possible distributions of random summands possess coinciding three first moments. In both settings the best distribution delivers the smallest number of summands. Along with distributions of a non-random number of summands, we consider the case of random summation and introduce an analog of deficiency which can be used to compare the distributions of sums with random and non-random number of summands. The main mathematical tools used in the paper are asymptotic expansions for the distributions of

R

-valued functions of random vectors, in particular, normalized sums of independent identically distributed r.v.s and their quantiles. Along with the general case, main attention is paid to the situation where the summarized random variables are independent and identically distributed. The approach under consideration is applied to determination of the distribution of insurance payments providing the least insurance portfolio size under prescribed Value-at-Risk or non-ruin probability.

Keywords:

limit theorem; sum of independent random variables; random sum; asymptotic expansion; asymptotic deficiency; kurtosis

1. Introduction

1.1. The Problem under Consideration and the Structure of the Paper

The problem considered in the paper is very close to the problem of stochastic ordering and even may be considered as a a version of this problem. In probability theory and statistics, a stochastic order quantifies the concept of one random variable being “bigger” or “smaller” than another. Many different orders exist, which have different applications, see, e.g., the book [1]. Here we propose an approach to establishing stochastic order for the distributions of sums of independent random variables (r.v.s) based on the notion of deficiency that is well known in asymptotic statistics, see, e.g., [2] and later publications [3,4,5]. Roughly speaking, in statistics the deficiency of a statistical procedure with respect to an ‘optimal’ procedure is the number of additional observations required to attain the same quality of inference as is guaranteed by the ‘optimal’ procedure.

In this paper we deal with the case where the deficiency is measured in natural-valued discrete units (number of ‘additional’ summands) and therefore here we deal with discrete case. The notion of deficiency can be extended to the case of the continuous parameter, say, time. This case will be considered in another work.

Along with the general case, in the paper main attention is paid to the situation where the r.v.s being summed are assumed to be independent and identically distributed.

The first problem to be considered below consists in determination of the distribution of a separate random variable in the sum that provides the least possible number of summands guaranteeing the prescribed value of the

(1 - α)

-quantile of the normalized sum for a given

α \in (0, 1)

. The second problem considered in the paper consists in determination of the distribution of a separate random variable in the sum that provides the least possible number of summands guaranteeing the prescribed value of the probability for the normalized sum to fall into a given interval. Actually, in both problems we deal with ‘fine tuning’ of the distribution of a separate summand since we assume that different possible distributions of random summands possess coinciding three first moments, so that they can differ only by their kurtosis. In both settings the best distribution delivers the smallest number of summands.

We also consider the problem where some additional randomization is introduced so that the number of summands in the sum can be random itself. This randomization may not be artificially induced, but also may occur when the exact number of summands is a priori unknown and only some its ‘expected’ value can be available as the parameter of the problem. For this case we introduce an analog of deficiency which can be used to compare the distributions of sums with random and non-random number of summands.

Both problems are closely related with the problem of quantification of the accuracy of approximations provided by limit theorems of probability theory. The main mathematical tools used in the paper are asymptotic expansions for the distributions of normalized sums of independent identically distributed r.v.s and their quantiles.

The formal settings mentioned above can be applied to solving practical problems where the models of the observed statistical regularities have the form of distributions of sums of r.v.s and the number of summands plays a substantial role. For example, consider an insurance company whose portfolio consists of a finite number of insurance contracts. Formally, the portfolio is assumed to be a finite set of r.v.s each of which characterizes the income of the company related to a separate contract. Instead of income we can speak of loss assuming that income is a negative loss or that loss is a negative income.

In these terms, the first setting concerns the problem of determination of the distribution of a possible loss within a separate insurance contract (say, the distribution of an insurance payment) providing the least possible portfolio size and guaranteeing the prescribed Value-at-Risk for the average losses. The approach considered in the paper can be used when the distributions of the summands (possible losses) are known only up to their three first moments and the exact Value-at-Risk is not known for sure. In the second setting the latter requirement is replaced by that of guaranteeing the prescribed ‘non-ruin’ probability. Within the framework of this example in both settings the problem consists in the description of the best strategy of the insurance company, if by a strategy we mean the choice of the terms of a contract (e.g., the amount of insurance payment related to each possible insurance event), that is, of the distribution of possible loss within a separate contract. Briefly, the problem is to choose an optimal distribution of a separate loss among the distributions that have the same first three moments so that the portfolio size is least possible.

The paper is organized as follows. Section 1.2 contains a short overview of the properties of statistical deficiency. In Section 2 we outline some results concerning the asymptotic expansions for the distributions of

R

-valued measurable functions of r.v.s and, in particular, for the distributions of normalized sums of r.v.s, as well as for their quantiles. In Section 3 the problem of comparison of the distributions of two sums of independent r.v.s by their deficiency is considered. The notion of asymptotic deficiency is introduced and some formulas for the calculation of asymptotic deficiency are presented. Section 3.1 contains the solution of this problem for these distributions providing a prescribed value of the

(1 - α)

-quantile for a given

α \in (0, 1)

. In Section 3.2 this problem is considered for the distributions of sums of independent r.v.s guaranteeing a prescribed probability for an

R

-valued measurable function of r.v.s, in particular, for a normalized sum of r.v.s, to fall into a given interval. Section 4 contains an example of extension of the results of Section 3 to the case of a random number of summands in the sum (random portfolio size, in terms of the example dealing with an insurance company). In Section 4.1 asymptotic expansions for the asymptotic

(1 - α)

-quantile (called

α

-reserve here) under a random portfolio size are presented and an analog of deficiency of the sum of a random number of summands (or the strategy with a random portfolio size) with respect to the distribution of the sum of a non-random number of summands (or a strategy with a non-random portfolio size) is considered. In Section 4.2 the problem of comparison of these distributions by an analog of deficiency is considered in a special case of three-point distribution of portfolio size.

Everywhere in what follows the set of real numbers is denoted by

R

, the set of natural numbers is denoted by

N

. The distribution function of the standard normal law will be denoted by

Φ (x)

,

Φ (x) = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{x} φ (y) d y, φ (x) = \frac{1}{\sqrt{2 π}} exp \{- \frac{x^{2}}{2}\}, x \in R .

The distribution of a random vector

(X_{1}, \dots, X_{n})

will be denoted

L (X_{1}, \dots, X_{n})

.

1.2. Asymptotic Deficiency

Following the classical terminology of [6], consider two decision rules (say, two statistical procedures)

D_{n}^{*}

and

D_{n}

whose quality is characterized by the quantities

π_{n}^{*}

and

π_{n}

, respectively. Here n is the number of observations

X_{1}, \dots, X_{n}

delivering the information underlying the decision rules. Assume that the rule

D_{n}^{*}

is in some sense optimal whereas the rule

D_{n}

is competing. For example, in the problem of estimation usually

π_{n}^{*}

and

π_{n}

are mean square deviations and

π_{n}^{*} \leq π_{n}

. In the problem of testing hypotheses usually

π_{n}^{*}

and

π_{n}

are powers of tests so that

π_{n}^{*} \geq π_{n}

.

By

m (n)

denote the number of observations required for the decision rule

D_{m (n)}

based on

m (n)

observations

X_{1}, \dots, X_{m (n)}

to attain the same quality as the ‘best’ rule

D_{n}^{*}

based on n observations

X_{1}, \dots, X_{n}

. In what follows we will keep to the asymptotic approach assuming that

n \to \infty

. Following [7], by the asymptotic relative efficiency (a.r.e.) of the rule

D_{n}

with respect to the rule

D_{n}^{*}

we will mean the limit

e \equiv lim_{n \to \infty} \frac{n}{m (n)}

(if it exists and does not depend on the sequence

m (n)

).

Instead of the ratio of the required number of observations, the difference

m (n) - n

can be considered as well, vividly showing the additional number of observations required by the decision rule

D_{n}

. However, many authors considered the ratio

n / m (n)

, possibly, because the asymptotic analysis of its properties is simpler.

The systematic analysis of the asymptotic behavior of the difference

m (n) - n

was first carried out by Hodges and Lehmann in 1970 [2]. They suggested to call the difference

m (n) - n

deficiency of the competing decision rule

D_{n}

with respect to the rule

D_{n}^{*}

and introduced the notation

d_{n} = m (n) - n .

(1)

If the limit

{lim}_{n \to \infty} d_{n}

exists, then it is called the asymptotic deficiency of the competing decision rule

D_{n}

with respect to the rule

D_{n}^{*}

and is denoted d. The number d is often called the deficiency of

D_{n}

with respect to

D_{n}^{*}

. Note that if a.r.e.

e \neq 1

, then

d = \infty

, so that this case is not so interesting. In [2] it was also noticed that for some decision rules (statistical procedures) there typically appear cases

e = 1

(see, e.g., the book [8]), that is, in these cases the a.r.e. cannot give an answer to the question, which rule is better, whereas the deficiency can clarify the case, because, generally speaking, in this case the asymptotic deficiency can be arbitrary.

So, the deficiency of

D_{n}

with respect to

D_{n}^{*}

shows, how many additional observations (that is, how much extra information) is required to attain the desired quality, if the decision rule

D_{n}

is used instead of the ‘optimal’ decision rule

D_{n}^{*}

. Therefore, the notion of deficiency provides natural grounds for the asymptotic comparison of

D_{n}

and

D_{n}^{*}

in the case

e = 1

. The study of the asymptotic behavior of the deficiency

d_{n}

requires more sophisticated techniques than is used to find the limit e. As a rule, this techniques employ the construction of asymptotic expansions (a.e.s) for the corresponding functions characterizing the quality of decision rules (see, e.g., the books [7,8,9]).

Since the rules

D_{n}^{*}

and

D_{n}

have the quality characteristics

π_{n}^{*}

and

π_{n}

, respectively, then, by the definition of the deficiency

d_{n} = m (n) - n

, for every n we have

π_{n}^{*} = π_{m (n)} .

(2)

So solve Equation (2), the integer-valued quantity

m (n)

should be treated as a variable taking arbitrary real values. For this purpose the function

π_{m (n)}

can be defined for non-integer

m (n)

by the formula

π_{m (n)} = (1 - m (n) + [m (n)]) π_{[m (n)]} + (m (n) - [m (n)]) π_{[m (n)] + 1}

(see [2]).

The functions

π_{n}^{*}

and

π_{n}

are usually unknown, so, in practice, their approximations are used. Assume that the a.e.s

π_{n}^{*} = \frac{a}{n^{r}} + \frac{b}{n^{r + s}} + o (n^{- r - s}),

(3)

and

π_{n} = \frac{a}{n^{r}} + \frac{c}{n^{r + s}} + o (n^{- r - s}),

(4)

hold, where a, b and c are some numbers that do not depend on n, and

r > 0

, and

s > 0

are constants determining the rate of decrease of these quality criteria in n. The first terms in these expansions coincide which means that the a.r.e. of the corresponding rules equals one. It can be easily obtained from relations (1)–(4) that

d_{n} = \frac{c - b}{r a} n^{1 - s} + o (n^{1 - s})

(5)

(see [2] or [7]). Thus, the asymptotic deficiency has the form

d = \{\begin{matrix} \pm \infty, & 0 < s < 1, \\ \frac{c - b}{r a}, & s = 1, \\ 0, & s > 1 . \end{matrix} .

(6)

The asymptotic deficiency possesses the following obvious property of transitivity: if there is some third decision rule

{\bar{D}}_{n}

with the quality characteristic

{\bar{π}}_{n}

admitting an a.e. of the form (4), then the deficiency

{\bar{d}}_{n}

of the rule

{\bar{D}}_{n}

with respect the the rule

D_{n}^{*}

satisfies the equality

{\bar{d}}_{n} = {\tilde{d}}_{n} + d_{n},

where

{\tilde{d}}_{n}

is the deficiency of the rule

{\bar{D}}_{n}

with respect to

D_{n}

and

d_{n}

is the deficiency of

D_{n}

with respect to

D_{n}^{*}

.

The case where

s = 1

is most interesting, because in this case the asymptotic deficiency is finite. In the paper [2] some simple examples are given illustrating that this case is quite natural in mathematical statistics (also see the book [8]).

2. Asymptotic Expansions for the Distributions of Normalized Sums of Random Variables

We begin with most general case. Let

n \in N

. Consider a finite set of r.v.s

X_{1}, \dots, X_{n}

. For the time being we do not assume that the r.v.s

X_{1}, \dots, X_{n}

are independent and identically distributed. Let

L_{n} = L_{n} (X_{1}, \dots, X_{n})

be an

R

-valued measurable function of

X_{1}, \dots, X_{n}

. (In what follows when dealing with the example of the portfolio of an insurance company we will call this function generalized loss). In particular,

L_{n}

may be of the form

L_{n} = \sqrt{n} T_{n}

where

T_{n}

is the arithmetic mean,

T_{n} \equiv \frac{1}{n} \sum_{i = 1}^{n} X_{i} .

(7)

As it has been already said, the problem consists in description of the distribution of r.v.s

X_{i}

providing the least possible number of summands n and guaranteeing the prescribed value of the

(1 - α)

-quantile of the function

L_{n}

for a given

α \in (0, 1)

.

Let

α \in (0, 1)

be a small number. Consider the quantity

c_{α} (n)

defined by the asymptotic relation

P (L_{n} \geq c_{α} (n)) = α + o (n^{- 1}), n \to \infty .

(8)

The quantity

c_{α} (n)

is the asymptotic

(1 - α)

-quantile of

L_{n}

. If

L_{n} = \sqrt{n} T_{n}

, then

c_{α} (n)

can be interpreted as the threshold, the exceedance of which by

L_{n}

is undesirable and is assumed to have the prescribed small probability

α

. In terms of an insurance company,

c_{α} (n)

is the asymptotic Value-at-Risk.

By applying the Taylor formula it is not difficult to obtain the following result.

Lemma 1.

Assume that there exist distribution function

G (x)

and functions

g_{1} (x)

and

g_{2} (x)

such that

sup_{x \in R} | P (L_{n} < x) - G (x) - \frac{1}{\sqrt{n}} g_{1} (x) - \frac{1}{n} g_{2} (x) | = o (n^{- 1}),

where the functions

G (x)

,

g_{1} (x)

and

g_{2} (x)

are smooth enough. Then the asymptotic

(1 - α)

-quantile

c_{α} (n)

of

L_{n}

admits the a.e.

c_{α} (n) = c_{α} - \frac{g_{1} (c_{α})}{\sqrt{n} G^{'} (c_{α})} - \frac{1}{n} [\frac{G^{''} (c_{α}) g_{1}^{2} (c_{α})}{2 {(G^{'} (c_{α}))}^{3}} + \frac{G^{'} (c_{α}) g_{2} (c_{α}) - g_{1} (c_{α}) g_{1}^{'} (c_{α})}{{(G^{'} (c_{α}))}^{2}}] + o (n^{- 1}),

where

c_{α}

satisfies the equation

G (c_{α}) = 1 - α

.

Consider the application of this lemma to the case where

X_{1}, X_{2}, \dots

are independent identically distributed r.v.s such that

E X_{1} = 0, E X_{1}^{2} = 1, E {| X_{1} |}^{k + δ} < \infty, k \in N, k \geq 3, δ > 0

(9)

and the function

L_{n}

has the form

L_{n} = \sqrt{n} T_{n}

with

T_{n}

defined by (7). Here the condition

E X_{1} = 0

means that the separate losses are centered by their expectations. Assume that the characteristic function

f (t)

of the r.v.

X_{1}

satisfies the Cramér condition (C)

\underset{| t | \to \infty}{lim sup} | f (t) | < 1 .

(10)

Under conditions (9) and (10), from Theorem 6.3.2 of [10] (also see [9]) it follows that there exist functions

Q_{1} (x), \dots, Q_{k - 2} (x)

and a

C_{k, δ} \in (0, \infty)

such that

sup_{x} |P (\sqrt{n} T_{n} < x) - Φ (x) - \sum_{i = 1}^{k - 2} n^{- i / 2} Q_{i} (x)| \leq \frac{C_{k, δ}}{n^{(k - 2 + δ) / 2}}, n \in N,

(11)

For the definition of the functions

Q_{1} (x), \dots, Q_{k - 2} (x)

see the book [10]. In particular,

Q_{1} (x) = - (x^{2} - 1) φ (x) \frac{E X_{1}^{3}}{6},

Q_{2} (x) = - (x^{3} - 3 x) φ (x) \frac{E X_{1}^{4} - 3}{24} - (x^{5} - 10 x^{3} + 15 x) φ (x) \frac{{(E X_{1}^{3})}^{2}}{72} .

(12)

Relations (11) and (12) and Lemma 1 directly imply the a.e. for the asymptotic

(1 - α)

-quantile

c_{n} (α)

of

L_{n}

presented in the following lemma.

Lemma 2.

Let conditions

(9)

and

(10)

hold with

k = 4

,

δ > 0

. Then the the asymptotic

(1 - α)

-quantile

c_{n} (α)

of

L_{n}

admits the a.e.

c_{α} (n) = u_{α} + \frac{E X_{1}^{3}}{6 \sqrt{n}} (u_{α}^{2} - 1) + \frac{1}{12 n} [\frac{E^{2} X_{1}^{3}}{3} (5 u_{α} - 2 u_{α}^{3}) + \frac{E X_{1}^{4} - 3}{2} (u_{α}^{3} - 3 u_{α})] + o (n^{- 1}),

where

u_{α}

is the

(1 - α)

-quantile of the standard normal distribution:

Φ (u_{α}) = 1 - α

.

3. The Comparison of the Distributions of Two Normalized Sums of Random Variables

3.1. The Asymptotic Deficiency of the Distributions of Summands Providing a Given $(1 - α)$ -Quantile of the Normalized Sums

In this section we will present an approach to the comparison of the distributions of two sums of r.v.s in terms of the number of summands. The distribution of the random vector

X_{1}, \dots, X_{n}

will be denoted

L (X_{1}, \dots, X_{n})

. Consider an

R

-valued measurable function of

X_{1}, \dots, X_{n}

.

From Lemma 1 we can easily obtain the following result.

Lemma 3.

Consider a sequence

{ϵ_{n}}_{n \geq 1}

such that

ϵ_{n} \to 0

as

n \to \infty

. Under the conditions of Lemma 1 we have

sup_{x \in R} | P (L_{n} (X_{1}, \dots, X_{n}) < x + ϵ_{n}) - P (L_{n} (X_{1}, \dots, X_{n}) < x) -

- ϵ_{n} G^{'} (x) - \frac{ϵ_{n}^{2}}{2} G^{''} (x) - \frac{ϵ_{n}}{\sqrt{n}} g_{1}^{'} (x) | = o (max \{ϵ_{n}^{2}, \frac{ϵ_{n}}{\sqrt{n}}, n^{- 1}\}) .

Along with the r.v.s

X_{1}, \dots, X_{n}

resulting in the value

L_{n} (X_{1}, \dots, X_{n})

of the function

L_{n}

, consider another set of r.v.s

Y_{1}, \dots, Y_{n}

, according to which the value of the function

L_{n}

is

L_{n} (Y_{1}, \dots, Y_{n})

. For example,

L_{n} (X_{1}, \dots, X_{n})

may have the form

L_{n} (X_{1}, \dots, X_{n}) = \sqrt{n} T_{n}

with

T_{n}

defined by (7) and

L_{n} (Y_{1}, \dots, Y_{n})

may have the form

L_{n} (Y_{1}, \dots, Y_{n}) = \sqrt{n} U_{n}

where

U_{n} = \frac{1}{n} \sum_{i = 1}^{n} Y_{i} .

(13)

Let to the distribution

L (Y_{1}, \dots, Y_{n})

there correspond the asymptotic

(1 - α)

-quantile

{\bar{c}}_{α} (n)

of

L_{n}

:

P (L_{n} (Y_{1}, \dots, Y_{n}) \geq {\bar{c}}_{α} (n)) = α + o (n^{- 1}), n \to \infty .

(14)

Assume that the a.e. for the distribution function of

L_{n} (Y_{1}, \dots, Y_{n})

has the form

P (L_{n} (Y_{1}, \dots, Y_{n}) < x) = G (x) + \frac{1}{\sqrt{n}} g_{1} (x) + \frac{1}{n} {\bar{g}}_{2} (x) + o (n^{- 1}),

(15)

where the functions

G (x)

,

g_{1} (x)

and

{\bar{g}}_{2} (x)

are smooth enough. The a.e. (15) differs from the a.e. for the distribution function of

L_{n} (X_{1}, \dots, X_{n})

established by Lemma 1 only by the term of order

n^{- 1}

, which means that the two distributions are rather close. Define the sequence of natural numbers

{m (n)}_{n \geq 1}

by the equality

P (L_{m (n)} (Y_{1}, \dots, Y_{m (n)}) \geq c_{α} (m (n))) = α + o (n^{- 1}), n \to \infty .

(16)

If

m (n) - n = d + o (1)

,

d \in R

,

n \to \infty

, then d is the asymptotic deficiency of the distribution

L (Y_{1}, \dots, Y_{1})

with respect to the distribution

L (X_{1}, \dots, X_{n})

. In other words, d is the asymptotic number of ‘additional’ r.v.s be included in the set

Y_{1}, \dots, Y_{1}

in order that the distribution

L (Y_{1}, \dots, Y_{m (n)})

provides the same quality as the distribution

L (X_{1}, \dots, X_{n})

.

Theorem 1.

Assume that the conditions of Lemma 1 and (15) hold and

G^{'} (c_{α}) c_{α} \neq 0

. Then the asymptotic deficiency d of the distribution

L (Y_{1}, \dots, Y_{1})

with respect to the distribution

L (X_{1}, \dots, X_{n})

has the form

d = \frac{2 [g_{2} (c_{α}) - {\bar{g}}_{2} (c_{α})]}{G^{'} (c_{α}) c_{α}} + o (1) .

Proof.

From Lemma 1 and condition (15) it directly follows that

{\bar{c}}_{α} (n) = c_{α} - \frac{g_{1} (c_{α})}{\sqrt{n} G^{'} (c_{α})} - \frac{1}{n} [\frac{G^{''} (c_{α}) g_{1}^{2} (c_{α})}{2 {(G^{'} (c_{α}))}^{3}} + \frac{G^{'} (c_{α}) {\bar{g}}_{2} (c_{α}) - g_{1} (c_{α}) g_{1}^{'} (c_{α})}{{(G^{'} (c_{α}))}^{2}}] + o (n^{- 1})

(17)

and therefore

ϵ_{n} \equiv \sqrt{\frac{m (n)}{n}} {\bar{c}}_{α} (m (n)) - c_{α} (m (n)) = \frac{d}{2 n} c_{α} - \frac{1}{n} \frac{(g_{2} (c_{α}) - {\bar{g}}_{2} (c_{α}))}{G^{'} (c_{α})} + o (n^{- 1}) .

(18)

Further, with the account of the definitions of

m (n)

(see (16)) and

ϵ_{n}

we have

α + o (n^{- 1}) = P (L_{m (n)} (Y_{1}, \dots, Y_{m (n)}) \geq {\bar{c}}_{α} (m (n))) =

= P (L_{m (n)} (Y_{1}, \dots, Y_{m (n)}) \geq \sqrt{\frac{n}{m (n)}} (c_{α} (m (n)) + ϵ_{n})))

(19)

Applying Lemma 3 to the right-hand side of (19) we obtain

α + o (n^{- 1}) = P (L_{m (n)} (Y_{1}, \dots, Y_{m (n)}) \geq c_{α} (m (n))) - ϵ_{n} G^{'} (c_{α}) + o (n^{- 1}) .

Now from (16) and (18) it follows that

d = \frac{2 [g_{2} (c_{α}) - {\bar{g}}_{2} (c_{α})]}{G^{'} (c_{α}) c_{α}} + o (1) .

The theorem is proved. □

Now consider an example of the application of Theorem 1 to the optimization of the portfolio size of an insurance company. Let the possible losses

X_{1}, X_{2}, \dots

related with each insurance contract in the portfolio be independent identically distributed r.v.s satisfying conditions (9) and (10). Consider another distribution, under which the possible losses

Y_{1}, Y_{2}, \dots

are assumed to be independent identically distributed r.v.s such that

E Y_{1} = 0, E Y_{1}^{2} = 1, E {| Y_{1} |}^{4 + δ} < \infty, δ > 0 .

(20)

Assume that the characteristic function

p (t)

of the r.v.

Y_{1}

satisfies the Cramér

(C)

condition

\underset{| t | \to \infty}{lim sup} | p (t) | < 1 .

(21)

For each n consider the average losses

U_{n}

defined by (13). Assume that

E X_{1}^{3} = E Y_{1}^{3},

(22)

(for example, the r.v.s

X_{i}

and

Y_{i}

are centered by their expectations and the distributions of these centered r.v.s are symmetric). From Lemma 2 and Theorem 1 we directly obtain the following statement.

Lemma 4.

Let conditions (9), (10) and (20)–(22) hold. Then the asymptotic (as

n \to \infty)

deficiency of the distribution

L (Y_{1}, \dots, Y_{n})

with respect to the distribution

L (X_{1}, \dots, X_{n})

(the ‘additional number of contracts’)d has the form

d = \frac{(E X_{1}^{4} - E Y_{1}^{4}) (3 - u_{α}^{2})}{12} + o (1) .

Lemma 4 illustrates that if the distributions are close, then the deficiency is determined by the kurtosis.

3.2. The Asymptotic Deficiency of the Distributions of Summands Providing a Given Probability for the Normalized Sum to Fall into a Given Interval

To begin with, in this section we again consider the values of a measurable

R

-valued function

L_{n} (X_{1}, \dots, X_{n})

and

L_{n} (Y_{1}, \dots, Y_{n})

on random vectors

(X_{1}, \dots, X_{n})

and

(Y_{1}, \dots, Y_{n})

with the the distributions

L (X_{1}, \dots, X_{n})

and

L (Y_{1}, \dots, Y_{n})

, respectively. The goal is to provide that the value of

L_{n}

falls into the interval

[S_{1}, S_{2})

for some given numbers

S_{1} < S_{2}

. As a quality characteristic consider the probabilities

π_{n} = P (S_{1} \leq L_{n} (X_{1}, \dots, X_{n}) < S_{2}), {\bar{π}}_{n} = P (S_{1} \leq L_{n} (Y_{1}, \dots, Y_{n}) < S_{2}) .

(23)

If

L_{n} (X_{1}, \dots, X_{n}) = \sqrt{n} T_{n}

(see (7)) and

L_{n} (Y_{1}, \dots, Y_{n}) = \sqrt{n} U_{n}

(see (22)), that is, normalized sums of r.v.s are considered, then relation (23) means that

π_{n}

and

{\bar{π}}_{n}

are probabilities of that the normalized sums of r.v.s are inside the interval

[S_{1}, S_{2})

.

From the definition of

π_{n}

we directly obtain the following result.

Lemma 5.

Assume that for some

r > 0

and

s > 0

there exist a distribution function

H (x)

and functions

h_{1} (x)

,

h_{2} (x)

and

{\bar{h}}_{2} (x)

such that

sup_{x \in R} | P (L_{n} (X_{1}, \dots, X_{n}) < x) - H (x) - \frac{1}{n^{r}} h_{1} (x) - \frac{1}{n^{r + s}} h_{2} (x) | = o (n^{- r - s}),

sup_{x \in R} | P (L_{n} (Y_{1}, \dots, Y_{n}) < x) - H (x) - \frac{1}{n^{r}} h_{1} (x) - \frac{1}{n^{r + s}} {\bar{h}}_{2} (x) | = o (n^{- r - s}),

and, moreover, the functions

h_{1} (x)

,

h_{2} (x)

and

{\bar{h}}_{2} (x)

are measurable. Then

π_{n}

and

{\bar{π}}_{n}

admit a.e.s

π_{n} = H (S_{2}) - H (S_{1}) + \frac{h_{1} (S_{2}) - h_{1} (S_{1})}{n^{r}} + \frac{h_{2} (S_{2}) - h_{2} (S_{1})}{n^{r + s}} + o (n^{- r - s}),

{\bar{π}}_{n} = H (S_{2}) - H (S_{1}) + \frac{h_{1} (S_{2}) - h_{1} (S_{1})}{n^{r}} + \frac{{\bar{h}}_{2} (S_{2}) - {\bar{h}}_{2} (S_{1})}{n^{r + s}} + o (n^{- r - s}) .

Corollary 1.

Let

ϵ_{n} ↓ 0

as

n \to \infty

and

S_{2} = S_{1} + ϵ_{n}

. Assume that the functions

H (x)

,

h_{1} (x)

,

h_{2} (x)

and

{\bar{h}}_{2} (x)

are smooth enough and

h_{1} (S_{2}) \neq h_{1} (S_{1})

. Then

ϵ_{n}^{- 1} π_{n} = H^{'} (S_{1}) + \frac{ϵ_{n}}{2} H^{''} (S_{1}) + \frac{ϵ_{n}^{2}}{6} H^{'''} (S_{1}) + o (ϵ_{n}^{2}) +

+ \frac{1}{n^{r}} h_{1}^{'} (S_{1}) + \frac{1}{2 n^{r}} h_{1}^{''} (S_{1}) ϵ_{n} + o (ϵ_{n} n^{- r}) + \frac{1}{n^{r + s}} h_{2}^{'} (S_{1}) + o (n^{- r - s} ϵ_{n}^{- 1}),

ϵ_{n}^{- 1} {\bar{π}}_{n} = H^{'} (S_{1}) + \frac{ϵ_{n}}{2} H^{''} (S_{1}) + \frac{ϵ_{n}^{2}}{6} H^{'''} (S_{1}) + o (ϵ_{n}^{2}) +

+ \frac{1}{n^{r}} h_{1}^{'} (S_{1}) + \frac{1}{2 n^{r}} h_{1}^{''} (S_{1}) ϵ_{n} + o (ϵ_{n} n^{- r}) + \frac{1}{n^{r + s}} {\bar{h}}_{2}^{'} (S_{1}) + o (n^{- r - s} ϵ_{n}^{- 1}) .

Lemma 5, Corollary 1 and formula (6) directly imply the expression for the asymptotic deficiency with quality characteristics (23).

Theorem 2.

Let conditions of Lemma 5 hold with

s = 1

. Then the deficiency

d_{n}

of the distribution

L (Y_{1}, \dots, Y_{n})

with the quality characteristic

{\bar{π}}_{n}

with respect to the distribution

L (X_{1}, \dots, X_{n})

with the quality characteristic

π_{n}

has the form

d_{n} = \frac{{\bar{h}}_{2} (S_{2}) - h_{2} (S_{2}) + {\bar{h}}_{2} (S_{1}) - {\bar{h}}_{2} (S_{1})}{r (h_{1} (S_{2}) - h_{1} (S_{1}))} + o (1) .

(24)

If

S_{2} = S_{1} + ϵ_{n}

with

ϵ_{n} ↓ 0

as

n \to \infty

and

h_{1}^{'} (S_{1}) \neq 0

, then the formal passage to the limit in (3.13) yields the formula

d_{n} = \frac{{\bar{h}}_{2}^{'} (S_{1}) - h_{2}^{'} (S_{1})}{r h_{1}^{'} (S_{1})} + o (1) .

Consider an example of the application of Theorem 2 to the optimization of the portfolio size of an insurance company. Let the possible losses

X_{1}, X_{2}, \dots

related with each insurance contract in the portfolio be independent identically distributed r.v.s satisfying conditions (9) and (10). Consider another distribution, under which the possible losses

Y_{1}, Y_{2}, \dots

are assumed to be independent identically distributed r.v.s satisfying conditions (20) and (21). Assume that in (9) and (20)

k = 3

. We are interested in the asymptotic behavior of the average losses

T_{n}

(see (7)) and

U_{n}

(see (13)). With the account of Lemma 5 we obtain the following statement.

Lemma 6.

Let conditions

(9)

,

(10)

,

(19)

and

(20)

hold with

k = 3

. Then

P (\sqrt{n} T_{n} < x) = Φ (x) + \frac{Q_{1} (x)}{\sqrt{n}} + \frac{Q_{2} (x)}{n} + o (n^{- 1}),

P (\sqrt{n} U_{n} < x) = Φ (x) + \frac{{\bar{Q}}_{1} (x)}{\sqrt{n}} + \frac{{\bar{Q}}_{2} (x)}{n} + o (n^{- 1}),

uniformly in

x \in R

,

π_{n} = Φ (S_{2}) - Φ (S_{1}) + \frac{Q_{1} (S_{2}) - Q_{1} (S_{1})}{\sqrt{n}} + \frac{Q_{2} (S_{2}) - Q_{2} (S_{1})}{n} + o (n^{- 1}),

{\bar{π}}_{n} = Φ (S_{2}) - Φ (S_{1}) + \frac{{\bar{Q}}_{1} (S_{2}) - {\bar{Q}}_{1} (S_{1})}{\sqrt{n}} + \frac{{\bar{Q}}_{2} (S_{2}) - {\bar{Q}}_{2} (S_{1})}{n} + o (n^{- 1}),

where the functions

Q_{1} (x)

and

Q_{2} (x)

are defined in

(12)

,

{\bar{Q}}_{1} (x) = - (x^{2} - 1) φ (x) \frac{E Y_{1}^{3}}{6},

{\bar{Q}}_{2} (x) = - (x^{3} - 3 x) φ (x) \frac{E Y_{1}^{4} - 3}{24} - (x^{5} - 10 x^{3} + 15 x) φ (x) \frac{{(E Y_{1}^{3})}^{2}}{72} .

Corollary 2.

Let

ϵ_{n} ↓ 0

as

n \to \infty

and

S_{2} = S_{1} + ϵ_{n}

. Assume that conditions of Lemma 6 hold. Then

ϵ_{n}^{- 1} π_{n} = φ (S_{1}) + \frac{ϵ_{n}}{2} φ^{'} (S_{1}) + \frac{ϵ_{n}^{2}}{6} φ^{''} (S_{1}) + o (ϵ_{n}^{2}) +

+ \frac{1}{\sqrt{n}} Q_{1}^{'} (S_{1}) + \frac{ϵ_{n}}{2 \sqrt{n}} Q_{1}^{''} (S_{1}) + o (ϵ_{n} n^{- 1 / 2}) \frac{1}{n} Q_{2}^{'} (S_{1}) + o (n^{- 1} ϵ_{n}^{- 1}),

ϵ_{n}^{- 1} {\bar{π}}_{n} = φ (S_{1}) + \frac{ϵ_{n}}{2} φ^{'} (S_{1}) + \frac{ϵ_{n}^{2}}{6} φ^{''} (S_{1}) + o (ϵ_{n}^{2}) +

+ \frac{1}{\sqrt{n}} {\bar{Q}}_{1}^{'} (S_{1}) + \frac{ϵ_{n}}{2 \sqrt{n}} {\bar{Q}}_{1}^{''} (S_{1}) + o (ϵ_{n} n^{- 1 / 2}) + \frac{1}{n} {\bar{Q}}_{2}^{'} (S_{1}) + o (n^{- 1} ϵ_{n}^{- 1}) .

Theorem 2, Lemma 5 and formula (5) directly imply the following statement.

Theorem 3.

Let, in addition to the conditions of Lemma 5.,

E X_{1}^{3} = E Y_{1}^{3}

. Then the deficiency

d_{n}

of the distribution

L (Y_{1}, \dots, Y_{n})

with the quality characteristic

{\bar{π}}_{n}

with respect to the strategy

L (X_{1}, \dots, X_{n})

with the quality characteristic

π_{n}

(the ‘additional number of contracts’) has the form

d_{n} = 2 \frac{{\bar{Q}}_{2} (S_{2}) - Q_{2} (S_{2}) + Q_{2} (S_{1}) - {\bar{Q}}_{2} (S_{1})}{Q_{1} (S_{2}) - Q_{1} (S_{1})} n^{1 / 2} + o (n^{1 / 2}) .

Consider an example where the asymptotic deficiency is finite.

Corollary 3.

Let

ϵ_{n} = \frac{1}{n}

and

S_{2} = S_{1} + \frac{1}{n}

,

E X_{1}^{3} = E Y_{1}^{3} = 0

. Then under the conditions of Lemma 5 we have

π_{n} = \frac{φ (S_{1})}{n} + \frac{φ^{'} (S_{1}) + 2 Q_{2}^{'} (S_{1})}{n^{2}} + o (n^{- 2}),

π_{n} = \frac{φ (S_{1})}{n} + \frac{φ^{'} (S_{1}) + 2 {\bar{Q}}_{2}^{'} (S_{1})}{n^{2}} + o (n^{- 2}) .

Moreover, the deficiency

d_{n}

has the form

d_{n} = \frac{2 ({\bar{Q}}_{2}^{'} (S_{1}) - Q_{2}^{'} (S_{1}))}{φ (S_{1})} + o (1) = \frac{S_{1}^{4} - 6 S_{1}^{2} + 3}{12} (E Y_{1}^{4} - E X_{1}^{4}) + o (1) .

4. Random Number of Summands

4.1. Asymptotic Expansions for the Asymptotic $(1 - α)$ -Quantile of $R$ -Valued Measurable Functions of a Random Number of Random Variables

In this section we consider the case where an additional randomization can be introduced into the problem. In this case the number of summands in the sum can be considered as random. This randomization may not be artificially induced, but also may occur when the exact portfolio size can be unknown beforehand and only some ‘expected’ number of summands can be available as the parameter of the problem.

Let natural-valued r.v.s

N_{1}, N_{2}, \dots

and r.v.s

X_{1}, X_{2}, \dots

be defined on one and the same probability space

(Ω, A, P)

. In what follows we will assume that n is the expected value of

N_{n}

,

E N_{n} = n .

(25)

Assume that for each

n \geq 1

the r.v.

N_{n}

is independent of the sequence

X_{1}, X_{2}, \dots

. As above, for each

n \geq 1

, consider the value of an

R

-valued measurable function

L_{n} = L_{n} (X_{1}, \dots, X_{n})

. For each

n \geq 1

consider the r.v.

L_{N_{n}}

defined as

L_{N_{n}} (ω) \equiv L_{N_{n} (ω)} (X_{1} (ω), \dots, X_{N_{n} (ω)} (ω)), ω \in Ω .

Below we will assume that the following condition holds.

Condition A.There exist

k \in N ∖ {1}

,

α_{i, n} \in R

,

i = 1, \dots, k

,

β_{n} > 0

,

C_{k} > 0

, a differentiable distribution function

G (x)

and measurable functions

g_{j} (x)

,

j = 1, \dots, k

such that

β_{n} \to 0, max_{1 \leq i \leq k} | α_{i, n} | \to 0

as

n \to \infty

and

sup_{x} |P (L_{n} < x) - G (x) - \sum_{i = 1}^{k} α_{i, n} g_{i} (x)| \leq C_{k} β_{n}, n \in N .

Lemma 7.

Let the function

L_{n} = L_{n} (X_{1}, \dots, X_{n})

satisfy Condition A. Then

sup_{x} |P (L_{N_{n}} < x) - G (x) - \sum_{i = 1}^{k} g_{i} (x) E α_{i, N_{n}}| \leq C_{k} E β_{N_{n}} .

The elementary proof of this lemma directly follows by the formula of total probability.

Consider an example of application of Lemma 7. Let

X_{1}, X_{2}, \dots

be independent identically distributed r.v.s satisfying conditions (9) and (10). Assume that the function

L_{n}

is the normalized arithmetic mean (or, which is the same, the normalized sum)

L_{n} = \sqrt{n} T_{n}

with

T_{n}

defined in (7). Then, in accordance with what has been said in Section 2, relation (11) holds implying the validity of Condition A. From (11) playing the role of Condition A and Lemma 7 we obtain the following statement.

Lemma 8.

Assume that

L_{n} = \sqrt{n} T_{n}

with

T_{n}

defined in (7) and conditions (9) and (10) hold. Then

sup_{x} |P (\sqrt{N_{n}} T_{N_{n}} < x) - Φ (x) - \sum_{i = 1}^{k - 2} Q_{i} (x) E N_{n}^{- i / 2}| \leq C_{k, δ} E N_{n}^{- (k - 2 + δ) / 2},

where the functions

Q_{i} (x)

are defined in Theorem 6.3.2 of [10].

Relation (11) and Lemma 8 imply the following statement.

Lemma 9.

Let conditions (9) and (10) hold with

k = 4

and

δ > 0

. Assume that condition (25) holds and

E N_{n}^{- 1 / 2} = \frac{1}{\sqrt{n}} + \frac{a}{n} + o (n^{- 1}), a \in R,

E N_{n}^{- 1} = \frac{b}{n} + o (n^{- 1}), E N_{n}^{- (2 + δ) / 2} = o (n^{- 1}), b \in R .

Then

sup_{x} |P (\sqrt{n} T_{n} < x) - Φ (x) - \frac{Q_{1} (x)}{\sqrt{n}} - \frac{Q_{2} (x)}{n}| = o (n^{- 1})

and

sup_{x} |P (\sqrt{N_{n}} T_{N_{n}} < x) - Φ (x) - \frac{Q_{1} (x)}{\sqrt{n}} - \frac{b Q_{2} (x) + a Q_{1} (x)}{n}| = o (n^{- 1}) .

We will use Lemma 9 in order to determine the asymptotic

(1 - α)

-quantile of

L_{n}

and calculate the asymptotic deficiency.

Recall that, for

α \in (0, 1)

, the asymptotic

(1 - α)

-quantile of

L_{n}

is the quantity

c_{α} (n)

satisfying the asymptotic equality

P (L_{n} \geq c_{α} (n)) = α + o (n^{- 1}), n \to \infty .

(26)

Correspondingly, we define the the asymptotic

(1 - α)

-quantile

{\tilde{c}}_{α} (n)

of

L_{N_{n}}

by the equation

P (L_{N_{n}} \geq {\tilde{c}}_{α} (n)) = α + o (n^{- 1}), n \to \infty .

(27)

From Lemmas 1 and 9 we directly obtain the a.e.s for these asymptotic

(1 - α)

-quantiles.

Lemma 10.

Under the conditions of Lemma 8, we have

c_{α} (n) = u_{α} + \frac{E X_{1}^{3}}{6 \sqrt{n}} (u_{α}^{2} - 1) + \frac{1}{12 n} [\frac{E^{2} X_{1}^{3}}{3} (5 u_{α} - 2 u_{α}^{3}) + \frac{E X_{1}^{4} - 3}{2} (u_{α}^{3} - 3 u_{α})] + o (n^{- 1}),

{\tilde{c}}_{α} (n) = u_{α} + \frac{E X_{1}^{3}}{6 \sqrt{n}} (u_{α}^{2} - 1) +

+ \frac{1}{12 n} [\frac{E^{2} X_{1}^{3}}{3} (5 u_{α} - 2 u_{α}^{3}) + \frac{b (E X_{1}^{4} - 3)}{2} (u_{α}^{3} - 3 u_{α}) + 2 a E X_{1}^{3} (u_{α}^{2} - 1)] + o (n^{- 1}),

where

u_{α}

satisfies the equation

Φ (u_{α}) = 1 - α

.

Now define the sequence

m (n)

of natural numbers by the relation

P (\sqrt{n} L_{N_{m (n)}} \geq \sqrt{m (n)} c_{α} (m (n))) = α + o (n^{- 1}), n \to \infty .

(28)

If

m (n) = n + d + o (1),

(29)

n = 1, 2, \dots

, then d can have the meaning of the expected additional number of summands to be included in the sum in order that the function

L_{N_{n}}

exceeds

c_{α} (n)

for the loss under a non-random number n of summands. The quantity d will be called the asymptotic deficiency.

In the same way that Theorem 1 was proved, we can establish the following statement.

Theorem 4.

Assume that

E N_{n} = n, E N_{n}^{- 1 / 2} = \frac{1}{\sqrt{n}} + \frac{a}{n} + o (n^{- 1}), a \in R,

E N_{n}^{- 1} = \frac{b}{n} + o (n^{- 1}), E N_{n}^{- (2 + δ) / 2} = o (n^{- 1}), b \in R,

and there exist

δ > 0

, a differentiable distribution function

G (x)

and measurable functions

g_{1} (x)

and

g_{2} (x)

such that

sup_{x} |P (L_{n} < x) - G (x) - \frac{g_{1} (x)}{\sqrt{n}} - \frac{g_{2} (x)}{n}| \leq \frac{C}{n^{(2 + δ) / 2}}

and

G^{'} (c_{α}) c_{α} \neq 0

. Then the expected number d of additional summands (see

(28)

and

(29)

) in the normalized random sum

L_{N_{n}}

with respect to the normalized sum

L_{n}

has the form

d = \frac{2 [g_{2} (c_{α}) (1 - b) - a g_{1} (c_{α})]}{G^{'} (c_{α}) c_{α}} + o (1),

where

c_{α}

satisfies the equation

G (c_{α}) = 1 - α

.

Theorem 4 implies the following statement.

Corollary 4.

Under the conditions of Lemma 8 the expected additional number of summands d (see (28) and (29)) corresponding to the normalized sum

\sqrt{N_{n}} T_{N_{n}}

with a random number of summands with respect to the normalized sum

\sqrt{n} T_{n}

has the form

d = \frac{2 ((1 - b) Q_{2} (u_{α}) - a Q_{1} (u_{α}))}{φ (u_{α}) u_{α}} + o (1) .

If additionally

E X_{1}^{3} = 0

, then

d = \frac{(1 - b) (3 - u_{α}^{2}) (E X_{1}^{4} - 3)}{12} + o (1) .

4.2. An Example of Three-Point Distribution of the Number of Summands

In this section, keeping to the terminology of the example related to optimization of the portfolio size of an insurance company, we will use Corollary 4 to obtain a.e.s for the asymptotic Value-at-Risk (asymptotic

(1 - α)

-quantile of the normalized average loss, or asymptotic normalized

α

-reserve) in the case where the portfolio size

N_{n}

has a special distribution concentrated in three points so that is symmetric around the central point.

Assume that the portfolio size

N_{n}

has the distribution of the form

P (N_{n} = n - h_{n}) = P (N_{n} = n) = P (N_{n} = n + h_{n}) = \frac{1}{3},

(30)

where

h_{n} \in N

,

h_{n} < n

,

n = 1, 2, \dots

, and

lim_{n \to \infty} \frac{h_{n}}{n} = 0 .

(31)

Lemma 11.

Let the random portfolio size

N_{n}

have distribution (30) and let condition (31) hold. Then

E N_{n} = n

and, as

n \to \infty

,

E N_{n}^{- 1 / 2} = \frac{1}{\sqrt{n}} - \frac{1}{4 \sqrt{n}} {(\frac{h_{n}}{n})}^{2} + O (\frac{1}{\sqrt{n}} {(\frac{h_{n}}{n})}^{3}),

E N_{n}^{- 1} = \frac{1}{n} + \frac{2}{3 n} {(\frac{h_{n}}{n})}^{2} + O (\frac{1}{n} {(\frac{h_{n}}{n})}^{4}), E N_{n}^{- 3 / 2} = \frac{1}{n^{3 / 2}} + O (\frac{1}{n^{3 / 2}} {(\frac{h_{n}}{n})}^{2}) .

Proof.

The desired statements follow from the relations

E N_{n}^{- 1} = \frac{3 n^{2} - h_{n}^{2}}{3 n (n^{2} - h_{n}^{2})} = \frac{1}{n} (1 - \frac{h_{n}^{2}}{3 n}) (1 + \frac{h_{n}^{2}}{n^{2}} + O (\frac{h_{n}^{4}}{n^{4}})) = \frac{1}{n} + \frac{2}{3 n} {(\frac{h_{n}}{n})}^{2} + O (\frac{1}{n} {(\frac{h_{n}}{n})}^{4}),

E N_{n}^{- 3 / 2} = \frac{1}{3 n^{3 / 2}} (\frac{1}{{(1 - h_{n} / n)}^{3 / 2}} + 1 + \frac{1}{{(1 + h_{n} / n)}^{3 / 2}}) = \frac{1}{n^{3 / 2}} + O (\frac{1}{n^{3 / 2}} {(\frac{h_{n}}{n})}^{2}) .

The formula for

E N_{n}^{- 1 / 2}

is established in a similar way. □

Lemmas 10 and 11 imply the following statement.

Theorem 5.

Assume that the normalized average loss has the form

L_{n} = \sqrt{n} T_{n}

with

T_{n}

defined in (7). Let the r.v.

N_{n}

be distributed according to (30) and condition (31) hold. Under the conditions of Lemma 9, for the asymptotic α-reserve

{\tilde{c}}_{α} (n)

corresponding to the normalized average loss

\sqrt{N_{n}} T_{N_{n}}

there holds the relation

{\tilde{c}}_{α} (n) = c_{α} (n) - \frac{E X_{1}^{3} (u_{α}^{2} - 1)}{24 \sqrt{n}} {(\frac{h_{n}}{n})}^{2} + o (n^{- 1}), n \to \infty .

Remark 1.

In addition to the conditions of Theorem 5, let

h_{n} = γ n^{β} + o (n^{β}), γ \geq 0, 0 \leq β < 1 .

Then, as

n \to \infty

,

n^{5 / 2 - 2 β} (c_{α} (n) - {\tilde{c}}_{α} (n)) \to \frac{γ^{2}}{24} E X_{1}^{3} (u_{α}^{2} - 1) .

Applying Lemma 9, by simple calculations we obtain the following statement.

Lemma 12.

Assume that conditions (9) and (10) hold with

k = 4

and

0 < δ \leq 1

. Let conditions (30) and (31) hold. Then

sup_{x} |P (\sqrt{N_{n}} T_{N_{n}} < x) - Φ (x) - (1 - \frac{h_{n}^{2}}{4 n^{2}}) \frac{Q_{1} (x)}{\sqrt{n}} - (1 + \frac{2 h_{n}^{2}}{3 n^{2}}) \frac{Q_{2} (x)}{n}| = O (\frac{h_{n}^{(4 + 2 δ) / 3}}{n^{7 (2 + δ) / 6}}) .

Corollary 5.

Let conditions of Lemma 12 hold and

h_{n} = n^{3 / 4}

. Then

sup_{x \in R} | P (\sqrt{N_{n}} T_{N_{n}} < x) - Φ (x) - \frac{1}{\sqrt{n}} Q_{1} (x) - \frac{1}{n} (Q_{2} (x) - \frac{1}{4} Q_{1} (x)) | = o (n^{- 1}) .

Relations (12), Lemmas 10 and 11 yield the following theorem.

Theorem 6.

Let the conditions of Corollary 5 hold. Then the asymptotic α-reserves

c_{α} (n)

and

{\tilde{c}}_{α} (n)

related to the normalized average losses

\sqrt{n} T_{n}

and

\sqrt{N_{n}} T_{N_{n}}

have the form

c_{α} (n) = u_{α} + \frac{E X_{1}^{3}}{6 \sqrt{n}} (u_{α}^{2} - 1) + \frac{1}{12 n} [\frac{E^{2} X_{1}^{3}}{3} (5 u_{α} - 2 u_{α}^{3}) + \frac{E X_{1}^{4} - 3}{2} (u_{α}^{3} - 3 u_{α})] + o (n^{- 1}),

{\tilde{c}}_{α} (n) = u_{α} + \frac{E X_{1}^{3}}{6 \sqrt{n}} (u_{α}^{2} - 1) +

+ \frac{1}{12 n} [\frac{E^{2} X_{1}^{3}}{3} (5 u_{α} - 2 u_{α}^{3}) + \frac{E X_{1}^{4} - 3}{2} (u_{α}^{3} - 3 u_{α}) - \frac{1}{2} E X_{1}^{3} (u_{α}^{2} - 1)] + o (n^{- 1}),

where

u_{α}

satisfies the equation

Φ (u_{α}) = 1 - α

. The corresponding expected additional number d of contracts has the form

d = \frac{Q_{1} (u_{α})}{2 φ (u_{α}) u_{α}} + o (1) = \frac{(1 - u_{α}^{2}) E X_{1}^{3}}{12 u_{α}} + o (1) .

5. Conclusions

The paper deals with an approach to the comparison of distributions of sums of a finite number of independent random variables by deficiency. The notion of asymptotic deficiency of the distribution of a measurable

R

-valued function of a random vector with respect to the distribution of the same function of another random vector was introduced. Some formulas for the calculation of asymptotic deficiency were presented in the cases where the function has the form of a normalized sum of independent identically distributed r.v.s. The formulas for the asymptotic deficiency were obtained as the solution of two problems, one of which deals with the description of the distribution of a separate summand minimizing the number of summands and providing a prescribed value of the

(1 - α)

-quantile of the normalized sum for a given

α \in (0, 1)

. The second problem deals with minimization of the number of summands and guaranteeing a prescribed probability for a normalized sum of r.v.s to fall into a given interval. These results were extended to the case of a random number of summands in the sum (or random portfolio size, in terms of the example dealing with an insurance company). For this case, an analog of deficiency of the sum of a random number of summands with respect to the distribution of the sum of a non-random number of summands was introduced. The problem of comparison of these distributions by an analog of deficiency was considered in a special case of three-point distribution of portfolio size. The main mathematical tools used in the paper were asymptotic expansions for the distributions of average losses and their quantiles.

Author Contributions

Conceptualization, V.E.B. and V.Y.K.; Formal analysis, V.Y.K.; Funding acquisition, V.Y.K.; Investigation, V.E.B. and V.Y.K.; Writing – original draft, V.E.B. and V.Y.K. All authors have read and agreed to the published version of the manuscript.

Funding

The research was supported by the Ministry of Science and Higher Education of the Russian Federation, project No. 075-15-2020-799.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors thank the anonymous referees for their comments and suggestions that improved the paper. We also thank A.K. Gorshenin for his help in formatting the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Müller, A.; Stoyan, D. Comparison Methods for Stochastic Models and Risks; J. Wiley & Sons: Chichester, UK, 2002; 352p, ISBN 978-0-471-49446-1. [Google Scholar]
Hodges, J.L.; Lehmann, E.L. Deficiency. Ann. Math. Stat. 1970, 41, 783–801. [Google Scholar] [CrossRef]
Torgersen, E. Comparison of Statistical Experiments; Printed online: May 2013; Cambridge University Press: Cambridge, UK, 1991. [Google Scholar] [CrossRef]
Xiang, X. Deficiency of the sample quantile estimator with respect to kernel quantile estimators for censored data. Ann. Stat. 1995, 23, 836–854. [Google Scholar] [CrossRef]
Bening, V.E.; Korolev, V.Y.; Zeifman, A.I. Calculation of the deficiency of some statistical estimators constructed from samples with random sizes. Colloq. Math. 2019, 157, 157–171. [Google Scholar] [CrossRef]
Blackwell, D.; Girshick, M.A. Theory of Games and Statistical Decisions. Wiley Publications in Statistics; J. Wiley & Sons: New York, NY, USA; Chapman & Hall: London, UK, 1954; p. XI, 355. [Google Scholar]
Lehmann, E.L.; Casella, G. Theory of Point Estimation; Springer: Berlin, Germany, 1998; 589p. [Google Scholar]
Bening, V.E. Asymptotic Theory of Testing Statistical Hypotheses: Efficient Statistics, Optimality, Power Loss, and Deficiency; Walter de Gruyter: Berlin, Germany, 2011; 277p, ISBN 978-3-11-093599-8. [Google Scholar]
Cramér, H. Mathematical Methods of Statistics; Princeton University Press: Princeton, NJ, USA, 1946; 647p. [Google Scholar]
Petrov, V.V. Limit Theorems of Probability Theory: Sequences of Independent Random Variables; Clarendon Press: Oxford, UK, 1985; 437p. [Google Scholar]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Comparing Distributions of Sums of Random Variables by Deficiency: Discrete Case

Abstract

1. Introduction

1.1. The Problem under Consideration and the Structure of the Paper

1.2. Asymptotic Deficiency

2. Asymptotic Expansions for the Distributions of Normalized Sums of Random Variables

3. The Comparison of the Distributions of Two Normalized Sums of Random Variables

3.1. The Asymptotic Deficiency of the Distributions of Summands Providing a Given $(1 - α)$ -Quantile of the Normalized Sums

3.2. The Asymptotic Deficiency of the Distributions of Summands Providing a Given Probability for the Normalized Sum to Fall into a Given Interval

4. Random Number of Summands

4.1. Asymptotic Expansions for the Asymptotic $(1 - α)$ -Quantile of $R$ -Valued Measurable Functions of a Random Number of Random Variables

4.2. An Example of Three-Point Distribution of the Number of Summands

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

Comparing Distributions of Sums of Random Variables by Deficiency: Discrete Case

Abstract

1. Introduction

1.1. The Problem under Consideration and the Structure of the Paper

1.2. Asymptotic Deficiency

2. Asymptotic Expansions for the Distributions of Normalized Sums of Random Variables

3. The Comparison of the Distributions of Two Normalized Sums of Random Variables

3.1. The Asymptotic Deficiency of the Distributions of Summands Providing a Given ( 1 − α ) -Quantile of the Normalized Sums

3.2. The Asymptotic Deficiency of the Distributions of Summands Providing a Given Probability for the Normalized Sum to Fall into a Given Interval

4. Random Number of Summands

4.1. Asymptotic Expansions for the Asymptotic ( 1 − α ) -Quantile of R -Valued Measurable Functions of a Random Number of Random Variables

4.2. An Example of Three-Point Distribution of the Number of Summands

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

3.1. The Asymptotic Deficiency of the Distributions of Summands Providing a Given $(1 - α)$ -Quantile of the Normalized Sums

4.1. Asymptotic Expansions for the Asymptotic $(1 - α)$ -Quantile of $R$ -Valued Measurable Functions of a Random Number of Random Variables