On the Evaluation of the Distribution of a General Multivariate Collective Model: Recursions versus Fast Fourier Transform

Raluca Vernic

doi:10.3390/risks6030087

¹

Faculty of Mathematics and Computer Science, Ovidius University of Constanta, 900527 Constanta, Romania

²

Institute for Mathematical Statistics and Applied Mathematics, 050711 Bucharest, Romania

Risks2018, 6(3), 87;https://doi.org/10.3390/risks6030087

Version Notes

Order Reprints

Abstract

With the purpose of introducing dependence between different types of claims, multivariate collective models have recently gained a lot of attention. However, when it comes to the evaluation of the corresponding compound distribution, the problems increase with the dimensionality of the model. In this paper, we consider a multivariate collective model that generalizes a model already studied from the point of view of recursive and FFT evaluation of its distribution, and we extend the same study to the general model. With the intention to see which method works better for this general model, we compare the recursive method with the FFT technique, and emphasize the advantages and drawbacks of each one, based on numerical examples.

Keywords:

multivariate collective model; multivariate compound distribution; recursions; Fast Fourier Transform

1. Introduction

Recently, Robe-Voinea and Vernic (2016a, 2016b, 2017, 2018) and Vernic (2018) studied the recursive and Fast Fourier Transform (FFT)-based evaluation of the distribution of the following multivariate collective model:

(S_{1}, \dots, S_{m}) = (\sum_{l = 0}^{N_{1}} U_{1 l} + \sum_{k = 0}^{N_{0}} L_{1 k}, \dots, \sum_{l = 0}^{N_{m}} U_{m l} + \sum_{k = 0}^{N_{0}} L_{m k}), m \geq 2,

(1)

which may arise in different contexts (see, e.g., the discussion in Section 14.1 of Reference Sundt and Vernic (2009)), from which we mention the case where a policyholder has m types of policies, such as auto, home, business, etc., that can be simultaneously affected by some claim events, such as floods, storms or earthquakes. More precisely, in this case,

\sum_{l = 0}^{N_{j}} U_{j l}

denotes the aggregate claims affecting solely the policy of type j, while

N_{0}

denotes the random variable (r.v.) number of claims simultaneously affecting all m types of policies, with

L_{j k}

denoting the size of the kth such claim corresponding to the policy of type j. The assumptions under which this model was considered are: Each set of claim sizes

{(U_{j l})}_{l \geq 1}

are non-negative, independent and identically distributed (i.i.d.) r.v.s,

1 \leq j \leq m;

they are also independent of the claim numbers and of the other claim sizes,

(L_{1 k}, \dots, L_{m k})

included; the random vectors

{(L_{1 k}, \dots, L_{m k})}_{k \geq 1}

are non-negative i.i.d. as the generic random vector

L = (L_{1}, \dots, L_{m}),

and independent of the claim numbers, while the components of

L

, however, are dependent; by convention,

U_{j 0} = L_{j 0} = 0, 1 \leq j \leq m .

Note that the above model assumes that a claim event affects either a single type of insurance line or all the insurance lines at once; there is no middle way, i.e., an event cannot affect only, say, lines 1 and 2, without causing claims in the other lines.

To overcome this drawback, in this paper we consider the more general multivariate collective model:

S = (S_{1}, \dots, S_{m}) = \sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} \sum_{i = 0}^{N_{i_{1} \dots i_{k}}} X_{i}^{(i_{1} \dots i_{k})},

(2)

where

The m-variate claim size random vectors ${(X_{i}^{(i_{1} \dots i_{k})})}_{i \geq 1}$ are i.i.d. as the generic m-variate random vector $X^{(i_{1} \dots i_{k})},$ whose jth univariate component $X_{j}^{(i_{1} \dots i_{k})} = 0$ if $j \notin \{i_{1}, \dots, i_{k}\},$ meaning that $X^{(i_{1} \dots i_{k})}$ results from those claim events simultaneously affecting solely the lines $\{i_{1}, \dots, i_{k}\}$ ; these events are counted by the r.v. $N_{i_{1} \dots i_{k}}$ . Moreover, the $X_{i}^{(i_{1} \dots i_{k})}$ s are also independent of the other claim size random vectors (i.e., of each ${(X_{i}^{(j_{1} \dots j_{t})})}_{i \geq 1}$ , where $\{i_{1}, \dots, i_{k}\} \neq \{j_{1}, \dots, j_{t}\}$ ) and of the claim numbers. We let $X_{i, j}^{(i_{1} \dots i_{k})}$ denote the jth univariate component of $X_{i}^{(i_{1} \dots i_{k})},$ $f_{i_{1} \dots i_{k}}$ the probability function (p.f.) of $X^{(i_{1} \dots i_{k})}$ (in the discrete case) and, by convention, $X_{0}^{(i_{1} \dots i_{k})} = 0$ .
The components of the random vector number of claims $N = {(N_{i_{1} \dots i_{k}})}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}$ are dependent r.v.s, in total (maximum) $ν = 2^{m} - 1 .$

We adopt the actuarial terminology in which the distribution of

S

is called “compound” and the distribution of

N

is called “counting”.

To evaluate the distribution of this model, we shall consider that all the claim distributions are of the discrete type (e.g., they have been previously discretized; this is a usual assumption for collective models). We start the next section by presenting the exact formula of the p.f. of

S

based on convolutions, which, unfortunately, is unpractical. Therefore, we also aim at developing recursions for the evaluation of this distribution, an approach that requires the introduction of supplementary assumptions under which it is possible to obtain recursive formulas; examples of such recursions are given in Section 2.1. Apart from the restrictive assumptions, another important drawback of recursions is that they become very time consuming when the dimensionality m of the model increases (see the numerical examples in Section 2.3). To overcome these drawbacks, in Section 2.2 we propose the use of the Fast Fourier Transform (FFT) technique, which can be applied whenever we know the form of the characteristic function of

S

and which is very efficient when we want to evaluate the distribution’s tail. However, this remarkably fast method is an approximate one, and we must pay a special attention to its specific errors; this aspect is illustrated by the numerical examples discussed in Section 2.3.

For simplicity, let us introduce more notation: We denote by

f_{S}

the p.f. of

S

, by g and

φ

the probability generating function (pgf) and the characteristic function (cf), respectively, of a r.v., which will be indexed with the r.v.’s name. Also,

n, t, x, y

are vectors whose corresponding dimension results from the context,

0

is the zero-vector, while the difference

x - y

is componentwise. By

x_{+} = \sum_{i = 1}^{m} x_{i}

we denote the sum of the components of the vector

x

and by

f^{* n}

the n-fold convolution of f. To shorten the formulas, we rewrite the sum

\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m}

as

\sum_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}

.

2. Evaluation of the Compound Distribution

We start by presenting the exact formula of the p.f. of

S

based on convolutions. This formula is so complex that, in general, it cannot be directly applied to find the distribution of

S

.

Proposition 1.

The p.f. of the multivariate collective model (2) is given by

f_{S} (x) = \sum_{n} Pr (N = n) \sum_{\begin{matrix} x = & \sum & x_{i_{1} \dots i_{k}} \\ 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}} (\prod_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}} f_{i_{1} \dots i_{k}}^{* n_{i_{1} \dots i_{k}}} (x_{i_{1} \dots i_{k}})),

where

n = {(n_{i_{1} \dots i_{k}})}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}, n_{i_{1} \dots i_{k}} \in N

.

Proof.

We have

\begin{matrix} f_{S} (x) & = & Pr (S = x) = \sum_{n} Pr (N = n) Pr (\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} \sum_{i = 0}^{n_{i_{1} \dots i_{k}}} X_{i}^{(i_{1} \dots i_{k})} = x) \\ = & \sum_{n} Pr (N = n) \sum_{\begin{matrix} x = & \sum & x_{i_{1} \dots i_{k}} \\ 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}} Pr (⋂_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} (\sum_{i = 0}^{n_{i_{1} \dots i_{k}}} X_{i}^{(i_{1} \dots i_{k})} = x_{i_{1} \dots i_{k}})), \end{matrix}

which immediately yields the result. ☐

We shall also need the pgf and the cf of

S

.

Proposition 2.

The pgf and cf of the general multivariate collective model (2) are, respectively, given by

\begin{matrix} g_{S} (t) & = & g_{N} ({(g_{X^{(i_{1} \dots i_{k})}} (t))}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}), \end{matrix}

(3)

\begin{matrix} φ_{S} (t) & = & g_{N} ({(φ_{X^{(i_{1} \dots i_{k})}} (t))}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}) . \end{matrix}

(4)

Proof.

We prove only the pgf formula (the one for the cf follows along the same lines). Considering the independence assumptions of the model, we have

\begin{matrix} g_{S} (t) & = & E [\prod_{j = 1}^{m} t_{j}^{S_{j}}] = E [\prod_{j = 1}^{m} t_{j}^{\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} \sum_{i = 0}^{N_{i_{1} \dots i_{k}}} X_{i, j}^{(i_{1} \dots i_{k})}}] \\ = & E [\prod_{k = 1}^{m} \prod_{1 \leq i_{1} < \dots < i_{k} \leq m} E [\prod_{j = 1}^{m} t_{j}^{\sum_{i = 0}^{N_{i_{1} \dots i_{k}}} X_{i, j}^{(i_{1} \dots i_{k})}}| N]] \\ = & E [\prod_{k = 1}^{m} \prod_{1 \leq i_{1} < \dots < i_{k} \leq m} g_{X^{(i_{1} \dots i_{k})}}^{N_{i_{1} \dots i_{k}}} (t)], \end{matrix}

hence the formula (3). ☐

2.1. Recursive Evaluation

Due to the difficulty of directly applying the exact formula from Proposition 1, we present in the following examples of alternative recursive formulas for obtaining the p.f. of

S

under some supplementary assumptions. These assumptions are chosen such that the multivariate compound distribution of

S

can be rewritten as a compound distribution with a univariate counting distribution, for which we can apply the already existing recursions.

2.1.1. Case 1 Assumptions

As in Reference Robe-Voinea and Vernic (2017), we assume that

N

follows the multivariate Poisson distribution

M P o (λ; \tilde{λ})

with parameters

λ > 0

and

\tilde{λ} = {(λ_{i_{1} \dots i_{k}})}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m} > 0,

having the pgf (see, e.g., Johnson et al. (1997))

g_{N} (t) = exp \{λ (\prod_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} t_{i_{1} \dots i_{k}} - 1) + \sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} λ_{i_{1} \dots i_{k}} (t_{i_{1} \dots i_{k}} - 1)\} .

As a consequence, Proposition 2 easily yields the following pgf and cf

\begin{matrix} g_{S} (t) & = & exp \{λ (\prod_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} g_{X^{(i_{1} \dots i_{k})}} (t) - 1) + \sum_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} λ_{i_{1} \dots i_{k}} (g_{X^{(i_{1} \dots i_{k})}} (t) - 1)\}, \end{matrix}

(5)

\begin{matrix} φ_{S} (t) & = & exp \{λ (\prod_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} φ_{X^{(i_{1} \dots i_{k})}} (t) - 1) + \sum_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} λ_{i_{1} \dots i_{k}} (φ_{X^{(i_{1} \dots i_{k})}} (t) - 1)\} . \end{matrix}

(6)

Also, two recursive formulas for evaluating the distribution of

S

are obtained in the following proposition, where we denote by

f_{X_{+}}

the p.f. of the sum r.v.

\sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} X^{(i_{1} \dots i_{k})}

.

Proposition 3.

Under the assumption that

N \sim M P o (λ; \tilde{λ})

it holds that

\begin{matrix} f_{S} (x) & = \sum_{k = 1}^{m} \sum_{\begin{matrix} 1 \leq i_{1} < \dots < i_{k} \leq m \\ l \in \{i_{1}, \dots, i_{k}\} \end{matrix}} λ_{i_{1} \dots i_{k}} \sum_{\begin{matrix} 0 < (y_{i_{1}}, \dots, y_{i_{k}}) \leq (x_{i_{1}}, \dots, x_{i_{k}}) \\ y_{i_{k + 1}} = \dots = y_{i_{m}} = 0 \end{matrix}} \frac{y_{l}}{x_{l}} f_{i_{1} \dots i_{k}} (y) f_{S} (x - y) \\ + λ \sum_{0 < y \leq x} \frac{y_{l}}{x_{l}} f_{X_{+}} (y) f_{S} (x - y), x_{l} \geq 1, x_{j} \geq 0, \forall j \neq l, \end{matrix}

and

\begin{matrix} f_{S} (x) & = \frac{1}{x_{+}} [\sum_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} λ_{i_{1} \dots i_{k}} \sum_{\begin{matrix} 0 < (y_{i_{1}}, \dots, y_{i_{k}}) \leq (x_{i_{1}}, \dots, x_{i_{k}}) \\ y_{i_{k + 1}} = \dots = y_{i_{m}} = 0 \end{matrix}} y_{+} f_{i_{1} \dots i_{k}} (y) f_{S} (x - y) \\ + λ \sum_{0 < y \leq x} y_{+} f_{X_{+}} (y) f_{S} (x - y)], x_{+} \geq 1, \end{matrix}

(7)

with starting value

f_{S} (0) = exp \{λ \prod_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} f_{i_{1} \dots i_{k}} (0) + \sum_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} λ_{i_{1} \dots i_{k}} f_{i_{1} \dots i_{k}} (0) - λ_{+}\},

where

λ_{+} = λ + \sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} λ_{i_{1} \dots i_{k}}

. In the above formulas,

y = (y_{1}, \dots, y_{m})

is such that

(y_{i_{1}}, \dots, y_{i_{m}})

is a permutation of its components.

Proof.

Due to the independence of the random vectors

X^{(i_{1} \dots i_{k})},

we have that

g_{X_{+}} = \prod_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} g_{X^{(i_{1} \dots i_{k})}};

therefore, we can rewrite the pgf (5) as

\begin{matrix} g_{S} (t) & = & exp \{λ_{+} (\frac{λ}{λ_{+}} (g_{X_{+}} (t) - 1) + \sum_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} \frac{λ_{i_{1} \dots i_{k}}}{λ_{+}} (g_{X^{(i_{1} \dots i_{k})}} (t) - 1))\} \\ = & exp \{λ_{+} (\frac{λ}{λ_{+}} g_{X_{+}} (t) + \sum_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} \frac{λ_{i_{1} \dots i_{k}}}{λ_{+}} g_{X^{(i_{1} \dots i_{k})}} (t) - 1)\}, \end{matrix}

meaning that in this case, the distribution of Model (2) is also a compound distribution, with a univariate Poisson counting distribution. More precisely,

S

can also be rewritten as

S = \sum_{k = 0}^{\tilde{N}} C_{k},

(8)

where

\tilde{N} \sim P o (λ_{+})

,

C_{0} = 0,

while the random vectors

C_{1}, C_{2}, \dots

are i.i.d. as the m-variate random vector

C

having the mixture p.f.

f_{C} (x) = \frac{λ}{λ_{+}} f_{X_{+}} (x) + \sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} \frac{λ_{i_{1} \dots i_{k}}}{λ_{+}} f_{i_{1} \dots i_{k}} (x) .

(9)

Regarding model (8), with

\tilde{N}

satisfying Panjer’s recursion (see Panjer (1981)) with parameters

a, b \in R

, i.e.,

Pr (\tilde{N} = n) = (a + \frac{b}{n}) Pr (\tilde{N} = n - 1), \forall n \geq 1,

from Reference Sundt (1999) (see, also, formulas (15.4) and, respectively, (15.5) in Sundt and Vernic (2009)) it holds that

\begin{matrix} f_{S} (x) & = & \frac{1}{1 - a f_{C} (0)} \sum_{0 < y \leq x} (a + b \frac{y_{l}}{x_{l}}) f_{C} (y) f_{S} (x - y), x_{l} \geq 1, \end{matrix}

(10)

\begin{matrix} f_{S} (x) & = & \frac{1}{1 - a f_{C} (0)} \sum_{0 < y \leq x} (a + b \frac{y_{+}}{x_{+}}) f_{C} (y) f_{S} (x - y), x > 0 . \end{matrix}

(11)

Since in our case

\tilde{N} \sim P o (λ_{+}),

we have

a = 0

and

b = λ_{+}

. Based on this, we insert Equation (9) into Equation(10) and obtain for

x_{l} \geq 1,

f_{S} (x) = λ_{+} \sum_{0 < y \leq x} \frac{y_{l}}{x_{l}} (\frac{λ}{λ_{+}} f_{X_{+}} (y) + \sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} \frac{λ_{i_{1} \dots i_{k}}}{λ_{+}} f_{i_{1} \dots i_{k}} (y)) f_{S} (x - y) .

We know that

X_{j}^{(i_{1} \dots i_{k})} = 0

if

j \notin \{i_{1}, \dots, i_{k}\},

hence, concerning the argument

y

of

f_{i_{1} \dots i_{k}} (y),

we can take the components

y_{i_{k + 1}} = \dots = y_{i_{m}} = 0

. Therefore, if

l \notin \{i_{1}, \dots, i_{k}\},

clearly

y_{l} = 0

in the argument

y

of

f_{i_{1} \dots i_{k}} (y),

which yields the first stated formula. The second formula results in a similar way by inserting Equation (9) into Equation (11), while the starting value is immediate from

f_{S} (0) = g_{S} (0)

and from the above form of

g_{S}

. This completes the proof. ☐

2.1.2. Case 2 Assumptions

Similarly to Robe-Voinea and Vernic (2016a, 2016b), the supplementary assumptions are now:

A1: The p.f. of the total number of claims $N_{t o t} = \sum_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m} N_{i_{1} \dots i_{k}}$ satisfies Panjer’s recursion for $a, b \in R$ .
A2: Given $N_{t o t} = n,$ the conditional distribution of the random vector number of claims $N$ is assumed to be multinomial $M n o m (n; p)$ with parameters $n \in N$ and $p = {(p_{i_{1} \dots i_{k}})}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m},$ where $p_{i_{1} \dots i_{k}} \in (0, 1)$ such that $\sum_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} = 1 .$ Therefore, with $n = {(n_{i_{1} \dots i_{k}})}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}$ and $n = \sum_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m} n_{i_{1} \dots i_{k}},$

$Pr (N = n | N_{t o t} = n) = \frac{n!}{\prod_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m} n_{i_{1} \dots i_{k}}!} \prod_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}}^{n_{i_{1} \dots i_{k}}} .$

Under these assumptions, the pgf, the cf of

S

and two alternative recursive formulas are presented in the following.

Proposition 4.

Under the assumptions (A1 and A2), the pgf and cf of the general multivariate collective model (2) become, respectively,

\begin{matrix} g_{S} (t) & = & g_{N_{t o t}} (\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} g_{X^{(i_{1} \dots i_{k})}} (t)), \end{matrix}

(12)

\begin{matrix} φ_{S} (t) & = & g_{N_{t o t}} (\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} φ_{X^{(i_{1} \dots i_{k})}} (t)) . \end{matrix}

(13)

Proof.

To obtain the pgf formula, we recall that the pgf of the multinomial distribution

M n o m (n; p)

is (see, e.g., Johnson et al. (1997))

g (t) = {(\sum_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} t_{i_{1} \dots i_{k}})}^{n}

, so that the pgf of

N

becomes for

t = {(t_{i_{1} \dots i_{k}})}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m},

\begin{matrix} g_{N} (t) & = & E [E [\prod_{k = 1}^{m} \prod_{1 \leq i_{1} < \dots < i_{k} \leq m} t_{i_{1} \dots i_{k}}^{N_{i_{1} \dots i_{k}}}| N_{t o t}]] = E [{(\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} t_{i_{1} \dots i_{k}})}^{N_{t o t}}] \\ = & g_{N_{t o t}} (\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} t_{i_{1} \dots i_{k}}) . \end{matrix}

Inserting this into Equation (3) easily yields Equation (12). Equation (13) follows in a similar way, which completes the proof. ☐

Proposition 5.

Under the assumptions (A1 and A2) of Model (2), with starting value

f_{S} (0) = g_{S} (0) = g_{N_{t o t}} (\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} f_{i_{1} \dots i_{k}} (0)),

the following recursive formula holds for

x_{l} \geq 1, x_{j} \geq 0, \forall j \neq l,

\begin{matrix} f_{S} (x) & = & K \sum_{k = 1}^{m} \{\sum_{\begin{matrix} 1 \leq i_{1} < \dots < i_{k} \leq m \\ l \in \{i_{1}, \dots, i_{k}\} \end{matrix}} p_{i_{1} \dots i_{k}} \sum_{\begin{matrix} 0 < (y_{i_{1}}, \dots, y_{i_{k}}) \leq (x_{i_{1}}, \dots, x_{i_{k}}) \\ y_{i_{k + 1}} = \dots = y_{i_{m}} = 0 \end{matrix}} (a + b \frac{y_{l}}{x_{l}}) f_{i_{1} \dots i_{k}} (y) f_{S} (x - y) \\ + a \sum_{\begin{matrix} 1 \leq i_{1} < \dots < i_{k} \leq m \\ l \notin \{i_{1}, \dots, i_{k}\} \end{matrix}} p_{i_{1} \dots i_{k}} \sum_{\begin{matrix} 0 < (y_{i_{1}}, \dots, y_{i_{k}}) \leq (x_{i_{1}}, \dots, x_{i_{k}}) \\ y_{i_{k + 1}} = \dots = y_{i_{m}} = 0 \end{matrix}} f_{i_{1} \dots i_{k}} (y) f_{S} (x - y)\}, \end{matrix}

(14)

while for

x_{+} > 0,

f_{S} (x) = K \sum_{_{\begin{matrix} 1 \leq k \leq m \\ 1 \leq i_{1} < \dots < i_{k} \leq m \end{matrix}}} p_{i_{1} \dots i_{k}} \sum_{\begin{matrix} 0 < (y_{i_{1}}, \dots, y_{i_{k}}) \leq (x_{i_{1}}, \dots, x_{i_{k}}) \\ y_{i_{k + 1}} = \dots = y_{i_{m}} = 0 \end{matrix}} (a + b \frac{y_{+}}{x_{+}}) f_{i_{1} \dots i_{k}} (y) f_{S} (x - y),

(15)

where

K = {[1 - a \sum_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} f_{i_{1} \dots i_{k}} (0)]}^{- 1}

and

y = (y_{1}, \dots, y_{m})

is such that

(y_{i_{1}}, \dots, y_{i_{m}})

is a permutation of its components.

Proof.

Considering the assumptions (A1 and A2), we rewrite Model (2) as

S = \sum_{k = 0}^{N_{t o t}} C_{k},

where

C_{0} = 0,

while the random vectors

C_{1}, C_{2}, \dots

are i.i.d. as the m-variate random vector

C

with the p.f.

f_{C} (y) = \sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} f_{i_{1} \dots i_{k}} (y) .

(16)

We use again Equations (10) and (11). By inserting Equation (16) into Equation (10), the stated formula of the constant K is easily obtained and, for

x_{l} \geq 1,

\begin{matrix} f_{S} (x) & = & K \sum_{0 < y \leq x} (a + b \frac{y_{l}}{x_{l}}) (\sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} f_{i_{1} \dots i_{k}} (y)) f_{S} (x - y) \\ = & K \sum_{k = 1}^{m} \sum_{1 \leq i_{1} < \dots < i_{k} \leq m} p_{i_{1} \dots i_{k}} \sum_{0 < y \leq x} (a + b \frac{y_{l}}{x_{l}}) f_{i_{1} \dots i_{k}} (y) f_{S} (x - y) . \end{matrix}

Using reasoning similar with the one used in the proof of Proposition 3, we obtain Equation (14). Similarly, Equations (11) and (16) lead to Equation (15). This completes the proof. ☐

Particular case:

m = 3

. Let us now have a look at a recursive formula in the trivariate case, where the general Model (2) is

S = (S_{1}, S_{2}, S_{3})

with

\begin{matrix} S_{1} & = & \sum_{i = 0}^{N_{1}} X_{i, 1}^{(1)} + \sum_{i = 0}^{N_{12}} X_{i, 1}^{(12)} + \sum_{i = 0}^{N_{13}} X_{i, 1}^{(13)} + \sum_{i = 0}^{N_{123}} X_{i, 1}^{(123)}, \\ S_{2} & = & \sum_{i = 0}^{N_{2}} X_{i, 2}^{(2)} + \sum_{i = 0}^{N_{12}} X_{i, 2}^{(12)} + \sum_{i = 0}^{N_{23}} X_{i, 2}^{(23)} + \sum_{i = 0}^{N_{123}} X_{i, 2}^{(123)}, \\ S_{3} & = & \sum_{i = 0}^{N_{3}} X_{i, 3}^{(3)} + \sum_{i = 0}^{N_{13}} X_{i, 3}^{(13)} + \sum_{i = 0}^{N_{23}} X_{i, 3}^{(23)} + \sum_{i = 0}^{N_{123}} X_{i, 3}^{(123)} . \end{matrix}

For example, Equation (15) becomes

\begin{matrix} f_{S} (x) & = & K \{\sum_{i = 1}^{3} p_{i} \sum_{\begin{matrix} 1 \leq y_{i} \leq x_{i} \\ y_{2} = y_{3} = 0 \end{matrix}} (a + b \frac{y_{i}}{x_{+}}) f_{i} (y) f_{S} (x - y) \\ + \sum_{1 \leq i_{1} < i_{2} \leq 3} p_{i_{1} i_{2}} \sum_{\begin{matrix} 0 < (y_{i_{1}}, y_{i_{2}}) \leq (x_{i_{1}}, x_{i_{2}}) \\ y_{i_{3}} = 0 \end{matrix}} (a + b \frac{y_{i_{1}} + y_{i_{2}}}{x_{+}}) f_{i_{1} i_{2}} (y) f_{S} (x - y) \\ + p_{123} \sum_{0 < y \leq x} (a + b \frac{y_{+}}{x_{+}}) f_{123} (y) f_{S} (x - y)\}, \end{matrix}

(17)

where

K = {[1 - a (\sum_{i = 1}^{3} p_{i} f_{i} (0) + \sum_{1 \leq i_{1} < i_{2} \leq 3} p_{i_{1} i_{2}} f_{i_{1} i_{2}} (0) + p_{123} f_{123} (0))]}^{- 1}

.

2.1.3. Case 3 Assumptions

Another assumption under which recursive formulas already exist is the univariate mixed Poisson counting distribution. To this purpose, we assume that, given that a positive univariate r.v.

Θ

takes the value

θ,

the r.v.s

N_{i_{1} \dots i_{k}}

are all i.i.d. Poisson distributed such that

N_{i_{1} \dots i_{k} |Θ = θ} \sim P o (θ λ_{i_{1} \dots i_{k}}), λ_{i_{1} \dots i_{k}} > 0, 1 \leq k \leq m, 1 \leq i_{1} < \dots < i_{k} \leq m .

Then, the pgf of

S

given

Θ = θ

becomes, from Equation (3):

\begin{matrix} g_{S |Θ = θ} (t) & = & g_{N |Θ = θ} ({(g_{X^{(i_{1} \dots i_{k})}} (t))}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}) \\ = & exp \{θ λ_{+} (\sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} \frac{λ_{i_{1} \dots i_{k}}}{λ_{+}} g_{X^{(i_{1} \dots i_{k})}} (t) - 1)\}, \end{matrix}

where

λ_{+} = \sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} λ_{i_{1} \dots i_{k}}

. This is the pgf of a compound distribution with univariate Poisson

P o (θ λ_{+})

counting distribution and multivariate claims distribution having p.f.

h = \sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} \frac{λ_{i_{1} \dots i_{k}}}{λ_{+}} f_{i_{1} \dots i_{k}};

hence, the conditional distribution of

S

, given

Θ = θ,

can be evaluated based on Equations (10) and (11), with

a = 0

and

b = θ λ_{+}

. To find the unconditional distribution of

S,

we use the technique described in Chapter 20 of Sundt and Vernic (2009). Therefore, with U denoting the distribution function of

Θ

, we introduce the auxiliary functions

v_{i} (x) = \int_{0}^{\infty} θ^{i} f_{S |Θ = θ} (x) d U (θ), i = 0, 1, 2, \dots,

and note that

f_{S} = v_{0} .

Multiplying Equations (10) and (11) by

θ^{i}

and integrating yields the following two recursions for

v_{i}

\begin{matrix} v_{i} (x) & = & \frac{λ_{+}}{x_{l}} \sum_{0 < y \leq x} y_{l} h (y) v_{i + 1} (x - y), x_{l} \geq 1, \\ v_{i} (x) & = & \frac{λ_{+}}{x_{+}} \sum_{0 < y \leq x} y_{+} h (y) v_{i + 1} (x - y), x_{+} > 0, \end{matrix}

with starting value

v_{i} (0) = \int_{0}^{\infty} θ^{i} e^{θ λ_{+} (h (0) - 1)} d U (θ)

. Therefore, the algorithm for evaluating

f_{S} (y)

for all

0 \leq y \leq x

is more complex and implies the backward evaluation of all

v_{i} (y), 0 \leq y \leq x, i = 1, \dots, x_{+}

(here backward means by decreasing i, see, e.g., the algorithm in Section 20.4.1 in Reference Sundt and Vernic (2009)). Being very time consuming, we don’t insist on this algorithm. However, we note that the recursions can be refined under the assumption that the mixing distribution U is of the continuous type, with the density denoted by u satisfying the condition

\frac{d}{d θ} ln u (θ) = \frac{\sum_{i = 0}^{k} η_{i} θ^{i}}{\sum_{i = 0}^{k} χ_{i} θ^{i}} .

This is also called Willmot’s mixing distribution, see Reference Willmot (1993).

Remark 1.

In view of the FFT, we also display the formula of the cf of

S

given

Θ = θ,

φ_{S |Θ = θ} (t) = exp \{θ (\sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} λ_{i_{1} \dots i_{k}} φ_{X^{(i_{1} \dots i_{k})}} (t) - λ_{+})\},

where

φ_{S} (t) = \int_{0}^{\infty} φ_{S |Θ = θ} (t) d U (θ) .

(18)

Particular case: Simpler recursions are obtained when

Θ

is gamma

G a (δ, β)

distributed, with

δ, β > 0

. In this case, the univariate mixed Poisson

P o (θ λ_{+})

distribution becomes a Negative Binomial distribution

N B (δ, \frac{β}{β + λ_{+}}),

which satisfies Panjer’s recursion with

a = \frac{λ_{+}}{β + λ_{+}}

and

b = (δ - 1) \frac{λ_{+}}{β + λ_{+}}

. Since

f_{S} (x) = \int_{0}^{\infty} f_{S |Θ = θ} (x) d U (θ) = \sum_{n} h^{* n} (x) \int_{0}^{\infty} Pr ({\tilde{N}}_{θ} = n) d U (θ) = \sum_{n} Pr (\tilde{N} = n) h^{* n} (x),

where

{\tilde{N}}_{θ} \sim P o (θ λ_{+}),

hence

\tilde{N} \sim N B (δ, \frac{β}{β + λ_{+}}),

and it follows that we can use Equations (10) and (11) to obtain direct recursions for

f_{S},

i.e.,

\begin{matrix} f_{S} (x) & = & \frac{λ_{+}}{β + λ_{+} - λ_{+} h (0)} \sum_{0 < y \leq x} (1 + (δ - 1) \frac{y_{l}}{x_{l}}) h (y) f_{S} (x - y), x_{l} \geq 1, \\ f_{S} (x) & = & \frac{λ_{+}}{β + λ_{+} - λ_{+} h (0)} \sum_{0 < y \leq x} (1 + (δ - 1) \frac{y_{+}}{x_{+}}) h (y) f_{S} (x - y), x > 0, \end{matrix}

(19)

with starting value

f_{S} (x) = {(\frac{β}{β + λ_{+} - λ_{+} h (0)})}^{δ} .

Moreover, regarding the cf, we easily obtain

\begin{matrix} φ_{S} (t) & = & \frac{β^{δ}}{Γ (δ)} \int_{0}^{\infty} φ_{S |Θ = θ} (t) θ^{δ - 1} e^{- β θ} d θ \\ = & {(\frac{β}{β - \sum_{_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}} λ_{i_{1} \dots i_{k}} φ_{X^{(i_{1} \dots i_{k})}} (t) + λ_{+}})}^{δ} . \end{matrix}

(20)

2.2. Fast Fourier Transform Evaluation

The recursive method is an exact one, but, as already mentioned in the introduction, it has some important drawbacks: It can be applied only on some particular models and it becomes quite slow with the increasing of the dimensionality of

S

. A much faster and less restrictive way to evaluate the p.f. of

S

is provided by the Fast Fourier Transform method, which is an approximate technique used to strongly reduce the computing time, especially when evaluating the distribution’s tail. As an advantage, this method can be applied to any model as long as its cf (4) (on which it is based) has a closed form, even if there is no recursive formula available. Therefore, the FFT technique received special consideration in the actuarial literature (see, e.g., References Bühlmann (1984), Embrechts et al. (1993), Jin and Ren (2014) or Robe-Voinea and Vernic (2018)). It consists of an algorithm that computes the discrete Fourier transform of a multivariate function, as well as its inverse, extremely fast. Let

f (x)

denote an m-variate function defined on the integer support

\underset{j = 1}{\overset{m}{\times}} \{0, 1, \dots, r_{j} - 1\}

; then its discrete Fourier transform,

\tilde{f}

, and, respectively, the inverse mapping, can defined by (definition consistent with the functions fftn and ifftn in Matlab)

\begin{matrix} \tilde{f} (c) & = & \sum_{x_{1} = 0}^{r_{1} - 1} \dots \sum_{x_{m} = 0}^{r_{m} - 1} f (x) exp \{- 2 π i \sum_{j = 1}^{m} \frac{x_{j} c_{j}}{r_{j}}\}, c_{j} = 0, \dots, r_{j} - 1, 1 \leq j \leq m, \\ f (x) & = & \frac{1}{\prod_{j = 1}^{m} r_{j}} \sum_{c_{1} = 0}^{r_{1} - 1} \dots \sum_{c_{m} = 0}^{r_{m} - 1} \tilde{f} (c) exp \{2 π i \sum_{j = 1}^{m} \frac{x_{j} c_{j}}{r_{j}}\}, x_{j} = 0, \dots, r_{j} - 1, 1 \leq j \leq m . \end{matrix}

In general, the FFT method requires that the values

r_{j}

are powers of two for all j. For the multivariate model (2), this algorithm becomes:

FFT Algorithm for model (2)

Step 1. After setting the truncation point for each claim size random vector

X^{(i_{1} \dots i_{k})}

at the same

r = (r_{1}, \dots, r_{m})

, the corresponding truncated claim size distribution is obtained as

f^{(i_{1} \dots i_{k})} = {[f_{i_{1} \dots i_{k}} (j)]}_{0 \leq j \leq r - 1}

; if necessary, the resulting

f^{(i_{1} \dots i_{k})}

will be filled with zeros (e.g., to constraint the

r_{j}

s to be powers of two).

Step 2. Apply the m-dimensional FFT to each

f^{(i_{1} \dots i_{k})}

, which results in the multidimensional table

{\tilde{f}}^{(i_{1} \dots i_{k})} .

Step 3. Use Equation (4) in the general case to obtain the discrete cf

{\tilde{φ}}_{S} (j) = g_{N} ({({\tilde{f}}^{(i_{1} \dots i_{k})} (j))}_{1 \leq k \leq m; 1 \leq i_{1} < \dots < i_{k} \leq m}), 0 \leq j \leq r - 1

.

Step 4. Apply the multidimensional IFFT to

{\tilde{φ}}_{S}

to obtain the p.f. of

S

.

Usually, to find the optimal

r_{j}

s, one gradually increases them until the differences between the actual solutions and the previous ones are under a certain threshold (e.g., we increase

r_{j}

as 32, 64, 128, 256 etc.). However, when dealing with heavy tailed claim size distributions, the results of this method can be strongly affected by a specific error caused by the discrete Fourier transform, which consists of placing under the truncation point the compound probability mass which is in fact above this point. This so-called “aliasing error” (AE) can be significantly reduced by applying to the claim size distributions an exponential change of measure, hence, forcing the tails of these distributions to decrease at an exponential rate; this transformation is known under the name of “exponential tilting” (for more details on this transformation see, e.g., Reference Grübel and Hermesmeier (1999)).

Particular cases: Under the particular assumptions considered in the previous section to allow for a recursive evaluation, one should use the following formulas at Step 3 of the above algorithm:

-: When $N \sim M P o (λ; \tilde{λ}), {\tilde{φ}}_{S}$ is given by Equation (6);
-: Under the Case 2 assumptions (A1 and A2), ${\tilde{φ}}_{S}$ is given by Equation (13);
-: Under the Case 3 mixed Poisson assumption, ${\tilde{φ}}_{S}$ is given by Equation (18).

2.3. Numerical Illustration

In this section, we consider a particular trivariate model (2) with

\begin{matrix} S_{1} & = & \sum_{i = 0}^{N_{1}} X_{i, 1}^{(1)} + \sum_{i = 0}^{N_{12}} X_{i, 1}^{(12)} + \sum_{i = 0}^{N_{13}} X_{i, 1}^{(13)} + \sum_{i = 0}^{N_{123}} X_{i, 1}^{(123)}, \\ S_{2} & = & \sum_{i = 0}^{N_{2}} X_{i, 2}^{(2)} + \sum_{i = 0}^{N_{12}} X_{i, 2}^{(12)} + \sum_{i = 0}^{N_{123}} X_{i, 2}^{(123)}, \\ S_{3} & = & \sum_{i = 0}^{N_{3}} X_{i, 3}^{(3)} + \sum_{i = 0}^{N_{13}} X_{i, 3}^{(13)} + \sum_{i = 0}^{N_{123}} X_{i, 3}^{(123)}, \end{matrix}

for which we implemented both the recursive formulas and the FFT algorithm, under different assumptions.

As claim size distributions, we considered only type II Pareto distributions with the purpose to emphasize the effect of the exponential tilting on the FFT technique. We recall that the decumulative distribution (or survival) function of the m-variate type II Pareto distribution

P a_{m} I I (α, {(σ_{i})}_{i = 1, \dots, m}), α, σ_{i} > 0, i = 1, \dots, m,

is given by

\bar{F} (x) = {(1 + \sum_{i = 1}^{m} \frac{x_{i}}{σ_{i}})}^{- α}, x_{i} > 0, i = 1, \dots, m .

The expected value of each marginal exists only if

α > 1

, while the variance exists only when

α > 2 .

We took (mainly from the numerical Example 4 in Reference Robe-Voinea and Vernic (2018))

\begin{matrix} X^{(1)} & \sim & P a_{1} I I (1, 1), X^{(2)} \sim P a_{1} I I (2, 2), X^{(3)} \sim P a_{1} I I (3, 1), \\ X^{(12)} & \sim & P a_{2} I I (1.5, 1, 2), X^{(13)} \sim P a_{2} I I (2, 1, 1), X^{(123)} \sim P a_{3} I I (1.5; 2, 2, 2) . \end{matrix}

The expected value of

X^{(1)}

and the variances of

X^{(2)}, X^{(12)}, X^{(13)}, X^{(123)}

do not exist, hence we can see the effect of the exponential tilting in the heavy-tailed case. To discretize these distributions, we used the method of rounding considering the span

h = 1

(good enough for illustration, but not optimal, see the discussion in Reference Robe-Voinea and Vernic (2018)).

Concerning the FFT method, as discussed in Section 2.2, we increased the truncation point

r = (r, r, r)

(we took

r_{1} = r_{2} = r_{3} = r

for simplicity) from 16 till 128 (unfortunately,

r = 256

generated an “out of memory” warning), and noticed that

r = 128

yielded enough accurate results (for our data) compared to the exact method (see Tables 1, 3 and 5). Moreover, we also varied the tilting parameter

θ = (θ_{1} = θ_{2} = θ_{3})

and noticed that an increasing of

θ

improves the results till

θ = 7 / r,

while a larger value like

θ = 9 / r

doesn’t significantly improve the results (see Table 4 in Example 2).

As expected, there is an important difference between the computing times requested by the two methods. This difference increases with the increasing of the truncation point

r

and becomes really huge for

r = 32

in Example 1 and for

r = 128

in Examples 2 and 3. Therefore, we decided to compare the resulting p.f.s only up to a certain right endpoint denoted by

x_{M} = (x_{M} - 1, x_{M} - 1, x_{M} - 1),

even if the support of the FFT was much larger. Note that the discretization time was not taken into account in the displayed computing times since discretization is needed by both methods (the total discretization time up to

r = 128

was about 160 s).

To emphasize the differences between the FFT and the recursive results, we used the cumulative distribution function (cdf), the AE and the maximum absolute error evaluated between the exact p.f. and the FFT one; these last two are defined, respectively, by

\begin{matrix} A E & = & \sum_{0 \leq x \leq x_{M}} |f_{S} (x) - f_{S}^{F F T} (x)|, \\ M a x . e r r & = & max_{0 \leq x \leq x_{M}} |f_{S} (x) - f_{S}^{F F T} (x)| . \end{matrix}

We shall now present three examples based on the three particular cases considered in Section 2.1. From these examples, we also note that in cdf terms,

F_{S}^{F F T} (x_{M} {) \geq F}_{S} (x_{M}),

an inequality caused by the AE that places compound mass below the truncation point.

Example 1.

We assume that

N \sim M P o (λ; \tilde{λ}),

where

λ = 1, λ_{1} = 3, λ_{2} = 3, λ_{3} = 2, λ_{12} = 2, λ_{13} = 1.7, λ_{123} = 1.5

; since for this particular model, the recursive method (we implemented Equation (7)) implies the evaluation of the p.f.

f_{X_{+}}

(i.e., multivariate convolutions), the corresponding computing time increases tremendously with

r .

Therefore, starting with

r = 32,

we took only

x_{M} = 20,

which needed about 30 minutes only for the convolution part. However, the FFT was ready in only a few s even for

r = 128

, see Table 1, where we also display a comparison of the accuracy of the two methods. This example clearly emphasizes the speed discrepancy between the two methods and the important advantage of the FFT speed.

Table 1. Example 1: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

Example 2.

We now assume that

N_{t o t}

follows a Poisson distribution

P o (λ = 5),

for which we recall that

a = 0, b = λ

and

g_{N_{t o t}} = e^{λ (t - 1)} .

Numerically, we took the multinomial parameters

p_{1} = 0.3, p_{2} = 0.2, p_{3} = 0.2, p_{12} = 0.15, p_{13} = 0.1, p_{123} = 0.05

. We implemented the recursive Equation (17) and performed it up to the maximum

x_{M} = 70

in about 35 min. The speed difference between the two methods can be seen in Table 2, where we displayed the relative computing times Rec/FFT (for

r = 128,

FFT took about 8 s).

Table 2. Example 2: Relative performances of the two methods when varying r (

θ = 7 / r

).

The accuracy comparison of the two methods is presented in Table 3 and the effect of changing the tilting parameters in Table 4, both supporting the above conclusions regarding the choices of r and

θ .

Table 3. Example 2: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

Table 4. Example 2: Comparing recursive and FFT methods for

h = 1, r = 64

and various

θ

.

Example 3.

This example is related to Case 3, i.e.,

N

follows a mixed Poisson distribution and, for simplicity, we let

Θ \sim G a (δ, β) .

Therefore, we implemented recursion (19) and the FFT based on Equation (20). The values of the parameters are:

δ = β = 2, λ_{1} = λ_{2} = 2.5, λ_{3} = λ_{12} = 2, λ_{13} = 1.7, λ_{123} = 1.5 .

The comparison between the two methods is presented in Table 5, from where we note once again that a value of

r = 128

is sufficient to obtain good enough results by FFT (at least for these data). Concerning the computing times, the values were similar with the ones obtained in Example 2, see Table 2.

Table 5. Example 3: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

3. Conclusions

In this paper, we proposed a general multivariate collective model that allows for dependence between the r.v.s number of claims, and, moreover, between the different r.v.s claim sizes. Since the evaluation of the resulting compound distribution is not straightforward, we discussed two types of techniques to deal with it: The recursive method that was presented in Section 2.1 and the FFT algorithm that was described in Section 2.2. Unfortunately, even if the recursive method has the advantage of being exact, it has two main drawbacks compared with the FFT method: First, recursions are available under some restrictive assumptions and second, they become very slow with the increasing of the dimensionality of the model. On the other hand, the main drawback of the FFT method consists in its specific errors, especially the aliasing error. However, the FFT technique is so fast compared with the exact recursions, that it is quite worthwhile to use it, especially when values from the tail of the compound distribution are needed (nevertheless, it is important to pay attention when choosing optimal values for the truncation points and for the tilting parameters). Another advantage of the FFT is that specific functions are already implemented in existing software, even for higher dimensions, with, eventually, the disadvantage of memory limitation.

To conclude, we would recommend the following approach: If recursive formulas are available for the considered model, they should be used to evaluate the compound distribution until some reasonable (in computing time terms) upper limit is reached, and then the FFT method should be applied for a more extended domain; to validate the accuracy of the FFT results, they should be compared with the ones obtained by the recursive method.

Funding

This research received no external funding.

Acknowledgments

The author gratefully acknowledges the two anonymous referees for their nice and valuable comments, and the prompt help of the associate editor.

Conflicts of Interest

The author declares no conflict of interest.

References

Bühlmann, Hans. 1984. Numerical evaluation of the compound Poisson distribution: Recursion or fast Fourier transform? Scandinavian Actuarial Journal 1984: 116–26. [Google Scholar] [CrossRef]
Embrechts, Paul, R. Grübel, and S. M. Pitts. 1993. Some applications of the fast Fourier transform algorithm in insurance mathematics. Statistica Neerlandica 47: 59–75. [Google Scholar] [CrossRef]
Grübel, Rudolf, and Renate Hermesmeier. 1999. Computation of compound distributions I: Aliasing errors and exponential tilting. ASTIN Bulletin: The Journal of the IAA 29: 197–214. [Google Scholar] [CrossRef]
Jin, Tao, and Jiandong Ren. 2014. Recursions and fast Fourier transforms for a new bivariate aggregate claims model. Scandinavian Actuarial Journal 2014: 729–52. [Google Scholar] [CrossRef]
Johnson, Norman Lloyd, Samuel Kotz, and Narayanaswamy Balakrishnan. 1997. Discrete Multivariate Distributions. New York: Wiley. [Google Scholar]
Panjer, Harry H. 1981. Recursive evaluation of a family of compound distributions. ASTIN Bulletin: The Journal of the IAA 12: 22–26. [Google Scholar] [CrossRef]
Robe-Voinea, Elena-Gratiela, and Raluca Vernic. 2016a. On the recursive evaluation of a certain multivariate compound distribution. Acta Mathematicae Applicatae Sinica, English Series 32: 913–20. [Google Scholar] [CrossRef]
Robe-Voinea, Elena-Gratiela, and Raluca Vernic. 2016b. Another approach to the evaluation of a certain multivariate compound distribution. Analele Universitatii “Ovidius” Constanta-Seria Matematica 24: 339–49. [Google Scholar] [CrossRef]
Robe-Voinea, Elena-Gratiela, and Raluca Vernic. 2017. On a multivariate aggregate claims model with multivariate Poisson counting distribution. Proceedings of the Romanian Academy Series A 18: 3–7. [Google Scholar]
Robe-Voinea, Elena-Gratiela, and Raluca Vernic. 2018. Fast Fourier Transform for multivariate aggregate claims. Computational and Applied Mathematics 37: 205–19. [Google Scholar] [CrossRef]
Sundt, Bjørn. 1999. On multivariate Panjer recursions. ASTIN Bulletin: The Journal of the IAA 29: 29–45. [Google Scholar] [CrossRef]
Sundt, Bjørn, and Raluca Vernic. 2009. Recursions for Convolutions and Compound Distributions with Insurance Applications. Berlin: Springer Science & Business Media. [Google Scholar]
Vernic, Raluca. 2018. On the evaluation of some multivariate compound distributions with Sarmanov’s counting distribution. Insurance Mathematics and Economics 79: 184–93. [Google Scholar] [CrossRef]
Willmot, Gordon E. 1993. On recursive evaluation of mixed Poisson probabilities and related quantities. Scandinavian Actuarial Journal 1993: 114–33. [Google Scholar] [CrossRef]

Table 1. Example 1: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

Table 1. Example 1: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

	$r = x_{M} = 16$	$r = 32, x_{M} = 20$	$r = 64, x_{M} = 20$	$r = 128, x_{M} = 20$
Rec. $F_{S} (x_{M})$	0.219737	0.312845	0.312845	0.312845
FFT $F_{S}^{F F T} (x_{M})$	0.219884	0.312909	0.312855	0.312847
FFT time up to $r$	0.016 s	0.124 s	0.952 s	9.484 s
$A E$	1.4743 × $10^{- 4}$	6.3771 × $10^{- 5}$	1.0251 × $10^{- 5}$	1.7488 × $10^{- 6}$
$M a x . e r r$	8.8571 × $10^{- 8}$	2.0810 × $10^{- 8}$	3.5244 × $10^{- 9}$	6.9685 × $10^{- 10}$

Table 2. Example 2: Relative performances of the two methods when varying r (

θ = 7 / r

).

Table 2. Example 2: Relative performances of the two methods when varying r (

θ = 7 / r

).

	$r = 16$	$r = 32$	$r = 64$
Rec/FFT in time	12	130	781

Table 3. Example 2: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

Table 3. Example 2: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

	$r = x_{M} = 16$	$r = x_{M} = 32$	$r = x_{M} = 64$	$r = 128, x_{M} = 70$
Rec. $F_{S} (x_{M})$	0.80035	0.91543	0.96436	0.96804
FFT $F_{S}^{F F T} (x_{M})$	0.80039	0.91544	0.96436	0.96804
$A E$	4.3580 × $10^{- 5}$	1.4294 × $10^{- 5}$	3.8798 × $10^{- 6}$	9.0937 × $10^{- 7}$
$M a x . e r r$	7.9393 × $10^{- 7}$	1.8276 × $10^{- 7}$	4.1642 × $10^{- 8}$	9.6893 × $10^{- 9}$

Table 4. Example 2: Comparing recursive and FFT methods for

h = 1, r = 64

and various

θ

.

Table 4. Example 2: Comparing recursive and FFT methods for

h = 1, r = 64

and various

θ

.

	Recursion	FFT no tilt.	$\begin{matrix} FFT tilt . \\ θ = 5 / r \end{matrix}$	$\begin{matrix} FFT tilt . \\ θ = 7 / r \end{matrix}$	$\begin{matrix} FFT tilt . \\ θ = 9 / r \end{matrix}$
$F_{S} (x_{M})$	0.96436	0.96863	0.96439	0.96436	0.96436
$A E$		4.2693 × $10^{- 3}$	2.8485 × $10^{- 5}$	3.8798 × $10^{- 6}$	5.4867 × $10^{- 6}$
$M a x . e r r$		4.5823 × $10^{- 5}$	3.0770 × $10^{- 7}$	4.1642 × $10^{- 8}$	2.7813 × $10^{- 8}$

Table 5. Example 3: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

Table 5. Example 3: Comparing recursive and FFT methods for

h = 1, θ = 7 / r

and various

r, x_{M}

.

	$r = x_{M} = 16$	$r = x_{M} = 32$	$r = x_{M} = 64$	$r = 128, x_{M} = 70$
Rec. $F_{S} (x_{M})$	0.49044	0.72191	0.88701	0.90087
FFT $F_{S}^{F F T} (x_{M})$	0.49055	0.72200	0.88705	0.90088
$A E$	1.0650 × $10^{- 4}$	8.3639 × $10^{- 5}$	3.5553 × $10^{- 5}$	6.4368 × $10^{- 6}$
$M a x . e r r$	1.6674 × $10^{- 7}$	3.2871 × $10^{- 8}$	6.8457 × $10^{- 9}$	1.5338 × $10^{- 9}$

© 2018 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

On the Evaluation of the Distribution of a General Multivariate Collective Model: Recursions versus Fast Fourier Transform

Abstract

1. Introduction

2. Evaluation of the Compound Distribution

2.1. Recursive Evaluation

2.1.1. Case 1 Assumptions

2.1.2. Case 2 Assumptions

2.1.3. Case 3 Assumptions

2.2. Fast Fourier Transform Evaluation

2.3. Numerical Illustration

3. Conclusions

Funding

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics