1. Introduction
Random shifting and random scaling in insurance applications are natural phenomena for latent unknown risk factors, time-value of money, or the need of allowing financial risks to be dependent. In this contribution, we are concerned with three principal stochastic models related to credibility theory, ruin theory, and extreme value modeling of large losses.
In credibility theory (e.g., [
1]) often stochastic models are defined via a conditional argument. As an illustration, consider the classical Gaussian model assuming that the conditional random variable
has the normal distribution
. If further the random variable Θ has the normal distribution
, we obtain the credibility premium formula for the Bayesian premium (calculated under the
loss function)
for any
and
positive. The relation explained by Equation (1) can be directly derived by introducing a random shift. Indeed, let
Y be, independent of Θ, a random variable with
distribution. We have the equality in distribution
Consequently, Equation (1) follows immediately by the fact that the conditional random variable
is normally distributed for any
.
The random shifting in this approach is related to Θ , which shifts
Y. The random shift model given in Equation (2) has natural extensions. For instance,
Y can be a
d-dimensional normally distributed random vector with Θ being some
d-dimensional random vector; a more general case is recently discussed in [
2]. Another extension is to consider
Y having an elliptical distribution; see
Section 4.
In ruin (or risk) theory, realistic stochastic models for claim sizes (or risks)
should allow for the dependence among them. Furthermore, dependent claim sizes need to have a tractable and transparent dependence structure. In several contributions (see [
1,
3] and the references therein) dependent claim sizes (or risks) are introduced by resorting to a dependence structure implied by the Archimedean copula. Recall that an Archimedean copula in
d-dimension (denoted by
) is defined by
where
ψ, called the generator of
, is required to be positive, strictly decreasing, and continuous with
and
, and
; see, e.g., [
4] and the references therein.
A similar idea was used in the context of ruin theory in [
5] where conditional on the positive random variable Θ
holds for any positive constants
. Proposition 1 of the aforementioned paper shows the link of such dependence structure (determined by Equation (4)) with the Archimedean copula. In fact, instead of dealing with the conditional random model defined in Equation (4) we can consider the following equivalent random scale model
where
are independent random variables with unit exponential distribution being further independent of the positive random variable Θ. Clearly,
has joint survival function given by Equation (4). The random scale model Equation (5) is interesting since it leads to certain simplifications; see [
4].
At this point, we emphasize one extremely important issue, which seems to have been very often overlooked in the literature. Claim sizes form an infinite sequence of random variables, and thus specializing particular finite dimensional distributions is not enough for completely defining the infinite sequence of the claim sizes. Of course, if the claim sizes are independent, say , no caution is needed for specializing the dependence for any n. However, if the claim sizes are assumed to be dependent, then particular dependence structures for finite n (like the one in Equation (5)) lead to randomly scaled independence structures; this is further illustrated below in our Theorem 1.
In view of the above discussions, some possible approaches for modeling dependent claim sizes (or risks) include:
copula-based models (here one needs to be careful since dependence structures for infinite sequences are needed!);
conditional dependence models;
random scale models;
transformation of simple independence models.
The last point above means that if
are independent claim sizes, then
, with
f some given deterministic function and
some indices, form a dependent sequence of claim sizes. One important example in this direction is the multivariate Pareto distribution of the second kind dealt with in [
6,
7]. Many other dependence models, like
m dependence or common shock models, can be introduced by this simple transformation of independent risks.
Of course these are only a few possibilities that lead to tractable dependence structures with certain appeal to actuarial applications; see also [
1,
8,
9,
10,
11,
12,
13,
14,
15] and the references therein.
Finally, we mention that there are several other aspects of actuarial models where random shifting and scaling are intrinsically present. For instance, in [
14] a new interesting copula model was studied, which can be alternatively introduced by a random scale of independent risks; see discussions in
Section 4.
The principal goal of this contribution is to discuss various aspects of random shift and random scale paradigms in actuarial models. Our analysis leads to new derivations and insights concerning the calculation of the Bayesian premium. Furthermore, we show that modeling claim sizes by a class of Dirichlet random sequences can be done in the framework of a tractable random scale model. Further, we point out that random scaling approach is of interest for modeling large losses as in the setup of [
14]. As a byproduct, a new class of
Dirichlet random vectors is introduced.
The paper is organized as follows. In
Section 2 we consider the Bayesian premium through certain random shift model. Our main finding is presented in
Section 3, which generalizes Theorem 1 in [
4].
Section 4 is dedicated to discussions and extensions.
2. Credibility Premium in Random Shift Models
For a given
d-dimensional distribution function
F, we define a shift family of distribution functions
. Typically, the assumption on a loss random vector
X is that
follows a distribution function parametrized by
θ, say it follows
. A direct way to formulate this model is via the random shift representation
where
Y has distribution function
F and is independent of
Θ. If
Θ possesses a probability density function (pdf)
h, then clearly
X also possesses a pdf given by
E. Consequently, the Bayesian premium (under a
loss function), when it exists, is given by
where for the derivation of the last equality Equation (7) we assumed additionally that
Y also possesses a pdf. Clearly, if
we have further
The random shift model Equation (6) is transparent and offers a clear advantage in comparison with the conditional model, if the joint distribution of (or ) can be easily found as illustrated below.
Example 1. Suppose that
with
(here
stands for the
d-dimensional normal distribution with mean
ν and covariance matrix
A). Suppose further that
is positive definite. It follows that
with
independent of
Θ. Therefore, in the light of [
1] the fact that
Z is normally distributed in
implies that
is normally distributed with mean
Consequently
Particularly, if Σ is positive definite
where
denotes the
identity matrix.
Clearly, (1) is immediately established by the above for the special case that
and
. It is worth pointing out that Equation (10) was derived by [
2] when
is non-singular using an indirect (in that case complicated) approach, whereas Example 1 gives a short direct proof for the formula of the Bayesian premium in the random shift Gaussian model where we can further allow
to be singular.
3. Dirichlet Claim Sizes & Random Scaling
A fundamental question when constructing models for claim sizes
is how to introduce tractable dependence structures. As mentioned in the Introduction, one common approach in the actuarial literature is to assume that the survival copula of
is an
n-dimensional Archimedean copula; see, e.g., [
5,
16] and the references therein. In view of the link between Archimedean copula and Dirichlet distribution explained in [
17], we choose the direct approach for modeling claim sizes by a Dirichlet random sequence as in [
4].
With motivation from the definition of
Dirichlet random vectors, we introduce next
d-dimensional
Dirichlet random vectors. Let
denote the Gamma distribution with positive parameters
. It is known that its pdf is
, where
stands for the Euler Gamma function. Fix some positive constants
, and
p. In the rest of the paper, without special indication, let
denote a sequence of positive independent random variables defined on some probability space
such that, for any
has
distribution with parameters
and
p. It follows easily that the pdf of
is given by
We say that
is a
d-dimensional
Dirichlet random vector, if the stochastic representation
holds with some positive random variable
R defined on
, which is independent of the random vector
O. The reason for the name of
Dirichlet random vector (and distribution) is that the angular component
O lives on the unit
-sphere of
,
i.e.,
When
,
O has the Dirichlet distribution on the unit simplex; see [
17].
The main result of this section displayed in the next theorem shows that the model with Dirichlet claim sizes can be explained by a random scale model.
Theorem 1. Let be positive random variables. If, for any ,
the random vector has a d-dimensional Dirichlet distribution with representation ,
thenwith S a non-negative random variable defined on ,
independent of .
Proof: By definition, it is sufficient to show that, for any
for the non-negative random variable
S required. Since for any
the random vector
has an
Dirichlet distribution, then we have the stochastic representation
with
. Therefore, we have the convergence in distribution (denoted here as
)
as
. Clearly, by the strong law of large numbers, as
we have the almost sure convergence
, which entails
as
, meaning that
In light of Theorem 3.9.4 in [
18], by the independence of
and
we conclude that
with
S some non-negative random variable defined on
such that
implying Equation (13), and thus the claim follows. ☐
The following corollary is a generalization of Theorem 1 in [
4].
Corollary 2. If the claim sizes are identically distributed, then under the assumptions and notation of Theorem 1, Equation (12) holds with a sequence of independent random variables with common pdf , for some .
In view of the well-known Beta-Gamma algebra (see, e.g., [
19]) if
, then
in Corollary 2 can be rewritten as
with
a Beta distribution with parameters
and
being exponentially distributed with mean
p. Further,
are mutually independent. Consequently
Note that
is a
d-dimensional
Dirichlet random vector.
4. Discussions and Extensions
The conditional credibility model considered in
Section 2 is simple since we used a single distribution function
F to define a shift family of distributions,
i.e.,
. Of course, we can consider a more general case that
is a family of
d-dimensional distributions and assume that
has distribution function
. Hence the random shift model is
, where
has distribution function
. It is clear that the random shift model is again specified via a conditional distribution, so there is no essential simplification by rewriting the conditional model apart from the case that the joint distribution of
is known.
We consider briefly a tractable instance that
has an elliptical distribution in
,
i.e.,
with
independent
distributed random variables being further independent of
, and
C a square matrix in
. For more details and actuarial applications of elliptically symmetric multivariate distributions, see [
1].
Let and . For any matrix A, denote as the sub-matrix of A obtained by selecting the elements with row indices in I and column indices in J. Similarly, for any row vector , define and to be the sub-vectors of ν. Further, denote by the transpose of matrix A.
By the stochastic representation Equation (15) we obtain that
where
Set
and assume that
B is non-singular. As in the Gaussian case, for the more general class of elliptically symmetric distributions, the conditional random vector
is again elliptically symmetric with stochastic representation (suppose for simplicity
)
where
D is a square matrix such that
, and the random variable
is independent of
U; see, e.g., [
20]. Consequently, since
U has components being symmetric about 0, we obtain the Bayesian premium formula
provided that
E. In the special case that
and
have all entries equal to 0, and further
we conclude that Equation (9) holds.
The random vector
O defined in Equation (11) has components
such that
has beta distribution with parameters
; see, e.g., [
4]. In the special case that
for any
, properties of
O and
with
independent of
O are studied in [
21]. Our result in Corollary 2 agrees with the finding of Theorem 4.4 in the aforementioned paper. Note that, for the case
, the corresponding result of Theorem 1 for spherically symmetric random sequences is well-known, see, e.g., [
22,
23].
Weighted
Dirichlet random vectors are naturally introduced by using indicator random variables. Specifically, let
and
O be given as in Equation (11). Further, let
be independent Bernoulli random variables, defined on
with
, which are further independent of the random vector
. The random vector
X with stochastic representation
is referred to as a weighted
Dirichlet random vector with indicators
and parameters
.
If
’s are independent and identically distributed with
and
then
Therefore, if
is chi-square distributed with
d degrees of freedom, then
X has a centered Gaussian distribution with
independent components. The introduction of the weighted Dirichlet random vectors is important since it includes the normal distribution as a special case. In addition, weighted Dirichlet random vectors are suitable for modeling claim sizes in certain ruin models with double-sided jumps; see Example 4 in [
4].
As in the case of
Dirichlet random sequences, in the weighted case the dependence structure can be given through a random scale model as well. More precisely, if the random sequence
is such that, for any fixed
,
is a weighted
Dirichlet random vector with indicators
and parameters
, then
with
S some non-negative random variable defined on
which is independent of
.
With motivation from credibility theory, we propose to consider a new class of multivariate distributions called Dirichlet distributions, which is naturally introduced by letting the parameter p in our definition above to be random (a common feature of credibility models where parameters are random elements).
Specifically, let
P be a positive random variable, and let
be positive constants. Further, let
be independent random variables, which are further independent of
P such that
has
distribution. We say that
is a
d-dimensional
Dirichlet random vector if
holds for some positive random variable
R independent of
P and
. Here, forany
This multivariate distribution can be used in the context of credibility models, models for large losses, models for risk aggregation, or models for claim sizes. If we assume that
is a sequence of claim sizes such that, for any
,
has a
d-dimensional
Dirichlet distribution, then an extension of Theorem 1 for this case is possible. More precisely, choosing now
, we conclude that
with some non-negative random variable
S independent of
.
In what follows, we consider a new copula class introduced in [
14], which is referred to as MGB2 copula. Let Θ be a positive random variable having an inverse Gamma distribution with shape parameter
and a unit scale,
i.e.,
has
distribution. In view of the aforementioned paper
has an MGB2 distribution (or MGB2 copula) if
’s are positive random variables and
are independent with pdf
given by
Here the parameters
are all positive constants. Instead of using the conditional argument, we can directly define
via a random scale model as follows
with
being independent positive random variables such that, for any fixed
,
has
distribution. One advantage of the random scale model Equation (20) is that, for modeling purposes, it can be rewritten as a random shift model
Another advantage of the random scale model Equation (20) becomes clearer if of interest is the joint tail asymptotic behavior of
, as discussed in [
14]. As illustrated below, the regular variation of survival function of Θ is enough for the joint tail asymptotic behavior of
; distributional assumptions on Θ are not really necessary.
Example 2. Let
be defined as above with the parameters therein and further assume that
. Define
through Equation (20) with Θ a random variable independent of
having a regularly varying tail behavior at infinity with index
,
i.e.,
For modeling joint behavior of large losses, of interest is the calculation of the following limit
for
positive constants, see, e.g., [
24] and [
1]. In our case we have
as
where in the last step we applied Breiman’s lemma; see, e.g., [
25]. Since the asymptotic dependence function
is positive, an appropriate extreme value model for the joint survival function of
and
is the one that allows for Fréchet marginals and asymptotic dependence.