Lambert W Random Variables and Their Applications in Loss Modelling

Käärik, Meelis; Selart, Anne; Puhkim, Tuuli; Tee, Liivika

doi:10.3390/sym15101877

Open AccessArticle

Lambert W Random Variables and Their Applications in Loss Modelling

by

Meelis Käärik

^*

,

Anne Selart

,

Tuuli Puhkim

and

Liivika Tee

Institute of Mathematics and Statistics, University of Tartu, Narva mnt 18, 51009 Tartu, Estonia

^*

Author to whom correspondence should be addressed.

Symmetry 2023, 15(10), 1877; https://doi.org/10.3390/sym15101877

Submission received: 7 September 2023 / Revised: 25 September 2023 / Accepted: 4 October 2023 / Published: 6 October 2023

(This article belongs to the Section Mathematics)

Download

Browse Figures

Versions Notes

Abstract

Several distributions and families of distributions are proposed to model skewed data, e.g., with skew-normal and related distributions. Lambert W random variables offer an alternative approach in which, instead of constructing a new distribution, a certain transformation is proposed. Such an approach allows the construction of a Lambert W skewed version from any distribution. Here, we choose the Lambert W normal distribution as a natural starting point and include the Lambert W exponential distribution due to the simplicity and shape of the exponential distribution, which, after skewing, may produce a reasonably heavy tail for loss models. In the theoretical part, we focus on the mathematical properties of obtained distributions, including the range of skewness. In the practical part, the suitability of the corresponding Lambert W transformed distributions is evaluated on real insurance data. Finally, the results are compared with those obtained using common loss distributions.

Keywords:

asymmetry; skewness; loss distributions; non-life insurance; probability distributions; Lambert W function

1. Introduction

Loss modelling is an essential part of actuarial and financial mathematics. Several distributional models have been applied over the years, and the increasing volumes of data and computational power have motivated the use of even more complex distributions to fit the data.

In the actuarial and financial fields, the data are usually skewed. Several classical distributions can be used to fit skewed data (see, e.g., [1,2]). A generic approach for skewing symmetric distributions was introduced in Azzalini [3], where the shape of the normal distribution is deformed by a certain skewness parameter. Similarly, other asymmetric distributions (e.g., skew t-distribution) have been developed [4]. Unified overviews of skewed distributions are provided in [5,6], while a review of different applications of skew-elliptical distributions in actuarial and financial mathematics is provided in [7].

In [8], another method of generating skewness was introduced through the Lambert W function that, when applied to symmetric distributions, can produce skewness and a heavy tail. In addition, Lambert W random variables can be seen as a generalization, as the input distribution can be arbitrary and not necessarily symmetric. When using the Lambert W function, instead of using the parametric manipulation of the original symmetric density function to introduce skewness, the random variable itself is transformed.

Another Lambert W transformation related to random variables was studied in [9], namely, a class of log-Lambert W random variables with applications to likelihood-based inference of normal random variables.

A different approach using the Lambert W function was introduced in [10,11], where the transformation is applied to the cumulative distribution function of the continuous positive valued random variable.

The Lambert W function has proven useful in mathematics, physics, chemistry, biology, engineering, risk theory, and other fields, though it has been less widely used in statistical modelling. Nonetheless, there are a number of noteworthy examples. In [12], the Lambert W approach was applied to normalize a vector regardless of its actual distribution. The use of the Lambert W distribution in matrix factorization with an implementation in probabilistic programming was presented in [13]. The Lambert W function has been used to derive the exact distribution of the likelihood ratio test statistic and to solve related problems in [14,15,16].

The approach of modelling the skewed random variables and symmetrizing the data using the Lambert W function as a variable transformation was used in [8,12,17,18]. We use [8] as the basis of our construction in this paper.

The rest of this paper is organized as follows. In the first section, we provide a short overview of the Lambert W function. In Section 3, general definitions and the expressions of the cumulative density functions and probability density functions of the Lambert W random variables are introduced, followed by more detailed results concerning the Lambert W normal and exponential distributions. In Section 4, we describe the results of fitting the Lambert W normal and exponential distributions to two insurance-related datasets, then compare the fit with several typical insurance models. Proofs of several properties, technical details of estimation, and additional figures showing the fitted distributions are presented in the Appendix A, Appendix B and Appendix C.

2. The Lambert W Function and Its Properties

In the following, we define the Lambert W function and provide a brief overview of its properties; refer to [19,20,21] for more details on the topic.

The Lambert W function is a set of inverse functions for the following function:

f (x^{'}) = x^{'} e^{x^{'}} (x^{'} \in R)

, in other words,

x^{'} = f^{- 1} (x^{'} e^{x^{'}}) = W (x^{'} e^{x^{'}}) .

Substituting

x = x^{'} e^{x^{'}}

leads to the definition of the Lambert W function.

Definition 1.

The Lambert W function

W (x)

is defined by the following equality:

W (x) e^{W (x)} = x, x \in [- \frac{1}{e}, \infty) .

(1)

Note that, in general, the function

W (x)

can be defined for real or complex arguments, and that Equation (1) has infinitely many solutions, most of which are complex. Following the notation of [21], we denote the different branches of the function by

W_{k} (x)

, where the branch index

k \in {0, \pm 1, \pm 2, \dots}

and

x \in C

. For real x, all branches other than

W_{0} (x)

and

W_{- 1} (x)

are complex. For

x \in (- \infty, - \frac{1}{e})

, the equation has only complex solutions. We denote the branch corresponding to

W (x) \geq - 1

by

W_{0} (x)

, which we call the principal branch, and the branch corresponding to

W (x) \leq - 1

by

W_{- 1} (x)

, which we call the non-principal branch.

Among the characteristic properties of the function (see Figure 1 as well) are:

$W (0) = 0$
$W_{0} (- \frac{1}{e}) = W_{- 1} (- \frac{1}{e}) = - 1$
$W (e) = 1$
$W (1) = e^{- W (1)} = ln (\frac{1}{W (1)}) = - ln W (1) \approx 0.5671433$
${lim}_{x \to 0 -} W_{- 1} (x) = - \infty$
${lim}_{x \to \infty} W_{0} (x) = \infty$

Based on its construction as an inverse of a certain exponential function, the asymptotes of W are similar to those of the natural logarithm. More precisely, the limits can be found as follows:

lim_{x \to \infty} \frac{W_{0} (x)}{ln x} = lim_{x \to \infty} \frac{x W_{0} (x)}{x (1 + W_{0} (x))} = lim_{x \to \infty} \frac{1}{\frac{1}{W_{0} (x)} + 1} = 1

and

lim_{x \to 0 -} \frac{W_{- 1} (x)}{ln (- x)} = lim_{x \to 0 -} \frac{x W_{- 1} (x)}{x (1 + W_{- 1} (x))} = lim_{x \to 0 -} \frac{1}{\frac{1}{W_{- 1} (x)} + 1} = 1 .

At the same time, the absolute difference between the Lambert’s W function and the natural logarithm

| W_{0} (x) - ln x |

goes to infinity for

x \to \infty

[20].

3. Lambert W Random Variables

3.1. Definitions

Next, we present the definitions of different types of Lambert random variables based on Goerg [8]. We provide the formulae of the cumulative distribution function (cdf) and probability density function (pdf) for scale and location–scale random variables.

Definition 2.

Let U be a continuous random variable with a cdf

F_{U} (u) = P (U \leq u)

,

u \in R

and pdf

f_{U} (u)

; then,

Y : = U exp (γ U), γ \in R

(2)

is a noncentral and nonscaled Lambert

W \times F_{U}

random variable with skewness parameter γ.

The skewness parameter

γ

can take any value on the real line; however, as the exponential function is always positive, the transformation (2) preserves the sign. Thus, if

γ = 0

, then

Y = U

. The effect of the transformation on the shape of the distribution depends on the original variable U. If U has both positive and negative values, then positive

γ

folds back the tail with negative values at a point

- \frac{1}{γ}

, relocating part of negative U values, while on the positive side the values move further away, making the right tail heavier. Negative

γ

acts the other way around. Note that for a skewed U, the Lambert W transform can produce a more symmetric random variable.

The transformation in (2) is not scale- or location-invariant. In order to keep these properties, which are needed, for example, to construct the Lambert W normal random variables, it is necessary to include the transformed variable’s location and scale parameters in the definition. For more details about the location-scale family of distributions, refer to [22] (pp. 116–121).

Definition 3.

Let X be a continuous random variable from a location-scale family with cdf

F_{X} (x | β)

, where β is the corresponding parameter vector. Let

U = \frac{X - μ}{σ}

be the zero-mean unit variance version of X. Then,

Y : = {U exp (γ U)} σ + μ, γ \in R, σ > 0

(3)

is a location-scale Lambert

W \times F_{X}

random variable with parameter vector

(β, γ)

.

If

γ > 0

, the location-scale Lambert

W \times F_{X}

random variable takes values in the interval

(μ - \frac{σ}{γ e}, \infty)

. For a negative

γ

, on the contrary, Y has an upper bound, and the values are in the interval

(- \infty, μ - \frac{σ}{γ e})

.

For

γ > 0

, the cdf and pdf of a location-scale Lambert

W \times F_{X}

random variable are respectively

F_{Y} (y | β, γ) = \{\begin{matrix} 0, & if y \leq μ - \frac{σ}{γ e}, \\ F_{X} (\frac{W_{0} (γ z)}{γ} σ + μ| β) \\ - F_{X} (\frac{W_{- 1} (γ z)}{γ} σ + μ| β), & if μ - \frac{σ}{γ e} < y < μ, \\ F_{X} (\frac{W_{0} (γ z)}{γ} σ + μ| β), & if y \geq μ, \end{matrix}

(4)

and

f_{Y} (y | β, γ) = \{\begin{matrix} 0, & if y \leq μ - \frac{σ}{γ e}, \\ f_{X} (\frac{W_{0} (γ z)}{γ} σ + μ| β) \frac{W_{0}^{'} (γ z)}{γ} \\ - f_{X} (\frac{W_{- 1} (γ z)}{γ} σ + μ| β) \frac{W_{- 1}^{'} (γ z)}{γ}, & if μ - \frac{σ}{γ e} < y < μ, \\ f_{X} (\frac{W_{0} (γ z)}{γ} σ + μ| β) \frac{W_{0}^{'} (γ z)}{γ}, & if y \geq μ, \end{matrix}

(5)

where

z = \frac{y - μ}{σ}

and we denote the derivative of

W (γ z)

by z as

W^{'} (γ z) = \frac{d W (γ z)}{d z} = \frac{exp (- W (γ z))}{1 + W (γ z)} γ = \frac{W (γ z)}{z (1 + W (γ z))} .

(6)

In (6), the principal and non-principal branches are not distinguished, as the same holds for both.

The derivation of these expressions can be found in Goerg [8]. The derivation and resulting expressions for

γ < 0

are similar, except that the three regions considered are pivoted: the first region is

y \leq μ

, where only the principal branch is used; the second region is

μ < y < μ - \frac{σ}{γ e}

, where both branches are used; and

y \geq μ - \frac{σ}{γ e}

for last region, where the cdf reaches 1 and the pdf is equal to 0.

For a non-negative X from the scale family, for example, an exponentially-distributed X, we can define the corresponding scale-family Lambert random variable as follows.

Definition 4.

Let X be a non-negative continuous random variable from a scale family with cdf

F_{X} (x | β)

, where β is the parameter vector. Let

U = \frac{X}{σ}

be the unit-variance version of X. Then,

Y : = {U exp (γ U)} σ = X exp (γ X / σ), γ \in R, σ > 0

(7)

is a scale Lambert

W \times F_{X}

random variable with parameter vector

(β, γ)

.

If

γ > 0

, then the cdf and pdf for a scale Lambert random variable can be found easily, as the transformation (7) takes values only on the positive side of the real line; as we apply the transformation W on positive arguments as well, only the principal branch plays a role. Hence, the cdf has the following form:

F_{Y} (y | β, γ) = \{\begin{matrix} 0, & if y < 0, \\ F_{X} (\frac{W_{0} (γ y / σ)}{γ} σ| β), & if y \geq 0 . \end{matrix}

(8)

Taking the derivative of (8), we obtain the following form for the pdf:

f_{Y} (y | β, γ) = \{\begin{matrix} 0, & if y < 0, \\ f_{X} (\frac{W_{0} (γ y / σ)}{γ} σ| β) \frac{exp (- W_{0} (γ y / σ))}{1 + W_{0} (γ y / σ)} & if y \geq 0 . \end{matrix}

(9)

Our primary focus is on positive

γ

that produces a heavier right tail to right-skewed distribution, possibly making the distribution more suitable for describing insurance losses. Yet, the results for

γ < 0

are not as straightforward as for the location-scale family case. Thus, to complete the theory, we analyze this situation as well and derive the cdf and pdf. First, the cdf:

\begin{matrix} F_{Y} (y) & = P (Y \leq y) = P (U exp (γ U) σ \leq y) = P (γ U exp (γ U) \geq γ y / σ) \\ = 1 - P (γ U exp (γ U) \leq γ y / σ) . \end{matrix}

Now, as the argument

γ y / σ

is negative for

y > 0

, both branches are needed when we apply the Lambert function. Hence,

\begin{matrix} F_{Y} (y) & = 1 - P (W_{- 1} (γ y / σ) \leq γ U \leq W_{0} (γ y / σ)) \\ = 1 - P (W_{- 1} (γ y / σ) / γ \geq U \geq W_{0} (γ y / σ) / γ) \\ = 1 - F_{X} (\frac{W_{- 1} (γ y / σ)}{γ} σ| β) + F_{X} (\frac{W_{0} (γ y / σ)}{γ} σ| β) . \end{matrix}

The principal and non-principal branches are equal at point

y = - \frac{σ}{γ e}

; thus, this is the point where the cdf

F_{Y}

reaches 1. In summary, if

γ < 0

, then

F_{Y} (y | β, γ) = \{\begin{matrix} 0, & if y \leq 0, \\ 1 - F_{X} (\frac{W_{- 1} (γ y / σ)}{γ} σ| β) + F_{X} (\frac{W_{0} (γ y / σ)}{γ} σ| β), & if 0 < y < - \frac{σ}{γ e}, \\ 1, & if y \geq - \frac{σ}{γ e} \end{matrix}

(10)

and the corresponding pdf is

f_{Y} (y | β, γ) = \{\begin{matrix} 0, & if y \leq 0 or y \geq - \frac{σ}{γ e}, \\ f_{X} (\frac{W_{0} (γ y / σ)}{γ} σ| β) \frac{exp (- W_{0} (γ y / σ))}{1 + W_{0} (γ y / σ)} \\ - f_{X} (\frac{W_{- 1} (γ y / σ)}{γ} σ| β) \frac{exp (- W_{- 1} (γ y / σ))}{1 + W_{0} (γ y / σ)}, & if 0 < y < - \frac{σ}{γ e} . \end{matrix}

(11)

3.2. Lambert W Normal Distribution

In this section, we apply the Lambert location-scale transformation (3) on a normal random variable

X \sim N (μ, σ)

. The resulting random variable

Y = \frac{X - μ}{σ} exp (γ \frac{X - μ}{σ}) σ + μ

is a Lambert

W \times N (μ, σ)

random variable with parameter vector

(μ, σ, γ)

. Without loss of generality, we assume that the skewness parameter

γ

is positive; the situation is mirrored for negative

γ

-s (i.e., left skew instead of right skew). Using (4), the cdf for a positive skewness parameter

γ

can be written as

F_{Y} (y | μ, σ, γ) = \{\begin{matrix} 0, & if y \leq μ - \frac{σ}{γ e}, \\ Φ (\frac{W_{0} (γ z)}{γ}) - Φ (\frac{W_{- 1} (γ z)}{γ}), & if μ - \frac{σ}{γ e} < y < μ, \\ Φ (\frac{W_{0} (γ z)}{γ}), & if y \geq μ, \end{matrix}

where

z = \frac{y - μ}{σ}

and

Φ

is the standard normal cdf. Likewise, using (5), we obtain the pdf for

γ > 0

as

f_{Y} (y | μ, σ, γ) = \{\begin{matrix} 0, & if y \leq μ - \frac{σ}{γ e}, \\ f_{0} (\frac{y - μ}{σ}) - f_{- 1} (\frac{y - μ}{σ}), & if μ - \frac{σ}{γ e} < y < μ, \\ f_{0} (\frac{y - μ}{σ}), & if y \geq μ, \end{matrix}

where

f_{0} (z)

and

f_{- 1} (z)

are the components of the pdf corresponding to the principal and non-principal branch, respectively:

\begin{matrix} f_{0} (z) & = \frac{1}{\sqrt{2 π}} exp (- \frac{{(W_{0} (γ z))}^{2}}{2 γ^{2}}) \frac{exp (- W_{0} (γ z))}{1 + W_{0} (γ z)}, \end{matrix}

(12)

\begin{matrix} f_{- 1} (z) & = \frac{1}{\sqrt{2 π}} exp (- \frac{{(W_{- 1} (γ z))}^{2}}{2 γ^{2}}) \frac{exp (- W_{- 1} (γ z))}{1 + W_{- 1} (γ z)} . \end{matrix}

(13)

Examples of the cdf and pdf for the Lambert

W \times N (0, 1)

distribution with

γ > 0

are shown in Figure 2.

In the following, we provide several results that describe the behaviour of the pdf of a Lambert W normal random variable. To keep our proofs technically cleaner, the analysis is applied to Lambert

W \times N (0, 1)

random variables, as generalization to Lambert

W \times N (μ, σ)

is straightforward. Proofs of these lemmas are presented in Appendix A.

Lemma 1.

The pdf of a Lambert

W \times N (0, 1)

random variable Z,

f_{Z}

has an asymptote at

- \frac{1}{γ e}

:

lim_{z \to - \frac{1}{γ e}} f_{Z} (z) = \infty .

The point

- \frac{1}{γ e}

where

f_{Z}

has an asymptote can be thought of as a point where the transformation folds the left tail of

N (0, 1)

and fits it into the interval

(- \frac{1}{γ e}, 0)

. At this turning point, the density accumulates; see Figure 2, Figure 3, Figure 4 and Figure 5 for examples. Although the transformation squeezes the negative values of

N (0, 1)

into a fixed interval and makes the right tail heavier, it continues to have zero as a point where the probability mass is divided into equal halves. Furthermore, at point

z = 0

, the pdf

f_{Z}

is equal to the pdf of

N (0, 1)

, i.e.,

f_{Z} (0) = \frac{1}{\sqrt{2 π}}

. This property is pointed out in the right-hand panels of Figure 5 and Figure 6.

Lemma 2.

The principal branch component of the pdf of a Lambert

W \times N (0, 1)

random variable

f_{0}

has the following properties. The function

f_{0} (z)

:

(a): has two local extrema (maximum and minimum) if $γ \in (0, \sqrt{2} - 1)$ ; and
(b): is monotone decreasing if $γ > \sqrt{2} - 1$ .

Lemma 3.

The non-principal branch component of the pdf of a Lambert

W \times N (0, 1)

random variable

f_{- 1}

has the following properties. The function

f_{- 1} (z)

:

(a): is monotone increasing (to 0) if $γ \in (0, \sqrt{2} + 1)$ ; and
(b): has two local extrema (maximum and minimum) if $γ > \sqrt{2} + 1$ .

Consequently, depending on the value of the skewness parameter

γ

, it is possible to distinguish three main shapes of the pdf of a Lambert W normal random variable. First, if

γ \in (0, \sqrt{2} - 1)

, then the pdf has two local extrema due to the principal branch component

f_{0}

; see Figure 3 or the left panel of Figure 4 for examples. Second, if

γ \in [\sqrt{2} - 1, \sqrt{2} + 1]

, then the pdf is a strictly decreasing function of z, as in the right panel of Figure 4. Third, if

γ > \sqrt{2} + 1

, then the pdf again has two local extrema, now due to the non-principal branch component

f_{- 1}

, and compared to the first case, the overall shape of pdf is different, as seen in Figure 5 and Figure 6. In these two figures, the right panel provides a more detailed view of the interval where the maximum is placed. As notably seen in Figure 6, the apparently sharp peak turns out to be quite smooth if examined more closely.

Lastly, we provide the expressions of the moments and skewness coefficient of a Lambert

W \times N (μ, σ)

random variable. The moments of a Lambert

W \times N (0, 1)

random variable can be found using the moment generating function (mgf) of the underlying standard normal distribution. Let Z be a Lambert

W \times N (0, 1)

random variable. The moments for Z are then as follows [8]:

E (Z^{k}) = \frac{1}{k^{k}} \frac{\partial^{k}}{\partial γ^{k}} M_{N (0, 1)} (γ k) = \frac{1}{k^{k}} \frac{\partial^{k}}{\partial γ^{k}} exp (\frac{γ^{2} k^{2}}{2}),

where

M_{N (0, 1)}

denotes the mgf of

N (0, 1)

. For the general case, i.e., for a Lambert

W \times N (μ, σ)

random variable Y, we can use the properties of the location-scale family, meaning that we have

E (Y^{k}) = E ({(Z σ + μ)}^{k}) .

As the moments are found using the derivatives of an exponential function, the moments of any order k exist and are finite. Using the above expressions, we can derive the formulae for the mean of Y

E Y = μ + σ γ e^{γ^{2} / 2},

(14)

the variance of Y

V a r Y = σ^{2} e^{γ^{2}} (e^{γ^{2}} (1 + 4 γ^{2}) - γ^{2}),

(15)

and the skewness coefficient

γ_{1} (Y)

:

γ_{1} (Y) = γ (\frac{e^{3 γ^{2}} (9 + 27 γ^{2}) - e^{γ^{2}} (3 + 12 γ^{2}) + 2 γ^{2}}{{(e^{γ^{2}} (1 + 4 γ^{2}) - γ^{2})}^{\frac{3}{2}}}) .

(16)

The skewness coefficient is a monotone function of

γ

, and has the same sign. As

γ \to \pm \infty

, we have

γ_{1} (Y) \to \pm \infty

, and the speed of growth is exponential. For example, if we look at the range of values

γ \in (\sqrt{2} - 1; \sqrt{2} + 1)

, where the pdf is monotone decreasing, the skewness coefficient grows from around 3 to 20,000 (see Figure 7).

3.3. Lambert W Exponential Distribution

Let X be an exponentially distributed random variable with parameter

λ > 0

(

λ

as rate). Then, the transformed random variable

Y = X e^{γ λ X}

has a Lambert

W \times E x p (λ)

distribution with parameter vector

(λ, γ)

. According to (7), for positive

γ

, the cdf of Y is

F_{Y} (y | λ, γ) = 1 - exp (- \frac{W_{0} (γ λ y)}{γ}), y \geq 0,

and, using (9), the pdf of Y is

f_{Y} (y | λ, γ) = λ exp (- \frac{W_{0} (γ λ y)}{γ}) \frac{exp (- W_{0} (γ λ y))}{1 + W_{0} (γ λ y)}, y \geq 0 .

For

γ < 0

, the expressions for cdf and pdf additionally involve the non-principal branch of the Lambert W function, as seen in (10) and (11):

F_{Y} (y | λ, γ) = 1 - exp (- \frac{W_{0} (γ λ y)}{γ}) + exp (- \frac{W_{- 1} (γ λ y)}{γ}), 0 \leq y < - \frac{1}{e γ λ},

and

\begin{matrix} f_{Y} (y | λ, γ) = λ exp (- \frac{W_{0} (γ λ y)}{γ}) \frac{exp (- W_{0} (γ λ y))}{1 + W_{0} (γ λ y)} \\ - λ exp (- \frac{W_{- 1} (γ λ y)}{γ}) \frac{exp (- W_{- 1} (γ λ y))}{1 + W_{- 1} (γ λ y)}, 0 \leq y < - \frac{1}{e γ λ} . \end{matrix}

For examples of the pdf and cdf for the Lambert

W \times E x p (1)

distribution, see Figure 8 and Figure 9. As apparent from Figure 8, the Lambert random variables have a heavier tail in the case of positive

γ

as compared to the exponential distribution.

For negative

γ

values (see Figure 9), the random variable Y takes values in the fixed interval

(0, - \frac{1}{e γ λ})

, as the transformation relocates the larger values of the underlying exponential random variable X. While it can be argued that this kind of transformation is not relevant for typically heavy-tailed insurance data, our example (see Section 4) shows an adequate fit when using the Lambert W exponential random variables with

γ < 0

for log claims of Danish fire loss data. In the case of

γ < 0

, if the absolute value of

γ

is small, this produces a distribution with a suitably large cut-off point to fit data with moderate tails, as is the case for the Danish log claims data. Similarly, only small values of

γ

are of practical use for positive

γ

, as the tail quickly becomes heavy very. For example, if

γ \geq 1

, then Lambert

W \times E x p (λ)

random variables do not have a finite first moment. For

γ < 1

, the first moment is

\frac{1}{λ {(1 - γ)}^{2}}

. In general, the following expression holds:

E Y^{k} = \frac{k!}{λ^{k} {(1 - k γ)}^{k + 1}}, i f γ < \frac{1}{k} .

The skewness coefficient for a Lambert

W \times E x p (λ)

random variable with

γ < \frac{1}{3}

can be calculated as follows:

\begin{matrix} γ_{1} (Y) = 2 \sqrt{\frac{{(1 - 2 γ)}^{9}}{{(2 γ^{4} - 2 γ + 1)}^{3}}} (\frac{3 {(1 - γ)}^{4} ({(1 - γ)}^{2} {(1 - 2 γ)}^{3} - {(1 - 3 γ)}^{4})}{{(1 - 3 γ)}^{4} {(1 - 2 γ)}^{3}} + 1) . \end{matrix}

(17)

If

γ \geq \frac{1}{3}

, then the third moment of Y is infinite, and the coefficient

γ_{1} (Y)

cannot be found. The skewness coefficient is a non-monotonic function of

γ

(see Figure 10). If

γ = 0

, the distribution simplifies to exponential, and the skewness coefficient

γ_{1} = 2

. For a Lambert

W \times E x p (λ)

distribution, the skewness coefficient

γ_{1}

can exceed a value of 2, and approaches infinity as

γ \to \frac{1}{3}

. For values

- \infty < γ < - 1

, the skewness coefficient is a decreasing function of

γ

with a minimum value of

- \frac{9 \sqrt{15}}{50}

, while for

γ \in (- 1, \frac{1}{3})

it is increasing (see Figure 10).

4. Fitting Lambert W Random Variables to Insurance Data

In this subsection, we fit the Lambert W normal and exponential random variables on two well-known datasets, the US indemnity data introduced in [23] and the Danish fire data introduced in [24], then compare the fit with previous results.

These datasets have been widely used in field-specific literature before; see, e.g., [25,26] for the US indemnity data and [27,28,29] for the Danish fire data, among others. A consolidated overview of previous results is provided in [30].

To recall the distributions of these example datasets, see Figure 11 for the US indemnity data and Figure 12 for the Danish fire loss data. In both figures, the left panel presents the data on the original scale (thousands of USD for US indemnity and millions of DKK for Danish fire data), and the right panel presents the same data after log transformation. In the case of the log-transformed data, we use a similar shift to the one in [30] in order to keep the results comparable. More precisely, the transformation

ln (y) - min (ln (y)) + 10^{- 10}

is applied on the original variable y.

It is evident that both datasets exhibit significant skewness when observed on the original scale. The skewness is more extreme for the Danish fire data, with a skewness coefficient of

γ_{1} = 18.74

, as compared to

γ_{1} = 9.15

for the US indemnity data. In the case of the US indemnity data, the log-transformed data produces an almost symmetric histogram that is very similar to a normal distribution. The log-transform reduces the skewness for Danish fire data as well, although the result remains skewed, with

γ_{1} = 1.76

.

In [30], nineteen distributions were fitted to the two aforementioned datasets, with the result that the skew-normal and skew t distributions are reasonably competitive compared to other models commonly used for insurance data.

In our research, we follow this construction and include all fitted continuous distributions while adding three more distributions to the list: the Lambert W normal and exponential distributions as our main contribution, and the Pareto distribution, which was previously missing due to technical problems. We use the maximum likelihood method for parameter estimation, as in [30]. For more details of the estimation process, see Appendix B.

To compare these models with competitors, we measure the goodness of fit between the data and distribution using the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). The BIC is included because the number of parameters of the distributions ranges from 1 to 5, making the penalty of the AIC quite small compared to the flexibility that additional parameters can provide.

Before the comparison, we first examine the parameter estimates of the Lambert W distributions in Table 1.

In the case of the Lambert W exponential model for the US indemnity data, the

γ

estimate

0.496

provides an infinite skewness coefficient. At the same time, the fit according to the BIC is good relative to other models; see Table 2 and later discussion. As the US indemnity data are close to normal on the log scale, the Lambert W exponential is not really a suitable model here. However, the estimate

\hat{γ} = - 0.321

corresponds to a skewness coefficient value of

0.09

, i.e. this model is able to pick up the symmetry of the data.

What is interesting in the case of the Danish log data is the negative

γ

estimate, as it produces a distribution with an upper bound of

- \frac{1}{γ e λ}

, here resulting in

7.82

. As the maximum value in data is around

5.57

, this model allows even higher claim values than in the data. Furthermore, this model suits the data well, as it ranks high according to BIC value (see Table 3, discussed in detail later on). The fit is not good for the same data on the original scale, which are highly skewed, and the

γ

estimate

0.096

can be considered unexpectedly low.

From the estimates of the Lambert W normal parameter, we can point out that the

γ

estimates are in the interval that produces monotone decreasing pdf for both datasets on the original scale. For US indemnity data on the log scale, the

γ

estimate

- 0.021

produces a distribution very similar to normal, which is in agreement with the histogram. For the Danish log data, the estimate for

γ

is

0.373

, which is in the interval

(0, \sqrt{2} - 1)

, which corresponds to the pdf shape with some downward bend between the asymptote and maximum, as in the left panel of Figure 4. As shown in the following analysis, the fit provided by the Lambert W transformed random variables is promising.

The results of model fitting are presented in Table 2 and Table 3. The distributions are sorted in ascending order by the number of parameters, with the two newly added Lambert W distributions always shown at the top of the table. In every column, the first three results are marked: the best result is in bold, the second-best is underlined, and the third-best is underlined and in italics.

In the case of the US indemnity data (see Table 2), we have seen earlier that the log-transformed data closely resemble the normal distribution. Therefore, the log-normal distribution can be expected to provide the best fit for the data in the original scale. However, the Lambert W exponential model provides a good fit as well, with the second-best AIC and BIC values. For the log-transformed data, the two smallest AIC values are almost equal, with the following block having very close values. Thus, the skew-normal and Lambert W normal distributions share first place, and skew t follows at the top of the next block. Based on the BIC, the normal distribution provides the best fit, having fewer parameters than the skew-normal or Lambert W normal. The skew-normal and Lambert W normal distributions fall to second and third place, respectively. The pdfs for the best three models with data histograms are plotted in Figure A1 in Appendix C. It is apparent from the latter graph that the top three models exhibit a high degree of similarity, with the primary distinction residing in the region of small claims when viewed on the original scale. For the log-transformed data, the three curves practically coincide.

From Table 3, it can be seen that for the Danish fire data on the original scale, the two best-fitting models are the skew t and Lambert W normal distributions. For the Danish log data, the Lambert W normal distribution again has the best fit based on the AIC, followed by the skew t. Based on the BIC, the best model is the Lambert W normal distribution, while the Lambert W exponential has the second best result; for further illustration, see Figure A2 in Appendix C. The three best pdfs for the original data are very similar. On the log-transformed data, the discrepancies are not large either, though they are more clearly visible. In conclusion, the Lambert W models provide a good fit to both the original and log-transformed data.

5. Summary

In this paper, we have addressed the Lambert W transform-based approach and the properties of the resulting distributions, thoroughly investigating the Lambert W normal and Lambert W exponential distributions. We introduce the skewness via the Lambert W transform and the skewness parameter

γ

. Without loss of generality, we focus on positive values of

γ

, as these are more of interest in loss modelling applications. For the Lambert W standard normal distribution with a positive skewness parameter

γ

, the pdf

f (y)

has an asymptote at

y = - \frac{1}{γ e}

. We establish the following three regions based on the shape of the pdf:

(a): If $γ \in (0, \sqrt{2} - 1)$ , then the pdf has two local extrema;
(b): If $γ \in (\sqrt{2} - 1, \sqrt{2} + 1)$ , then the pdf is monotone decreasing;
(c): If $γ > \sqrt{2} + 1$ , then the pdf has two local extrema.

In the first range, where

γ \in (0, \sqrt{2} - 1)

, the shape of the distribution is at first glance not the most suitable for loss modelling, and needs additional explanations. Nevertheless, it can be argued that the asymptote effect is reasonably small, meaning that the distribution can provide a good fit, as in the Danish fire log data. Such a shape might be suitable in zero-altered models as well, where zero claims are included. The second and most appealing range, where the pdf is monotone decreasing, covers a wide range of the skewness coefficient values; see Figure 7. If

γ = \sqrt{2} + 1

, then the skewness coefficient is about 20,000; thus, the not-very-suitable shape in the third range is not a problem for most practical applications.

For the Lambert W exponential distribution, we establish that it allows a wider choice of the skewness coefficient than the exponential distribution. Moreover, one additional parameter relaxes the rigid relationship between the mean and variance of the exponential distribution. These properties make the Lambert exponential distribution a promising model for insurance loss data.

Our results in the practical part show that the Lambert W transformed distributions operating in a wide range of skewness represent a viable choice for insurance loss modelling. Both the normal and exponential distribution-based transforms show a reasonably good fit. An especially illustrative proof of this flexibility is visible in the Danish fire data, where the results of the Lambert W normal model are well at the top for both the original and log-transformed datasets.

Clearly, the choices available for the Lambert W approach are not limited to normal and exponential random variables. While the normal and exponential distributions seem to be a natural starting point for loss modelling, other distributions can offer valuable contributions as well.

Author Contributions

Conceptualization, M.K. and A.S.; methodology, M.K., A.S. and T.P.; software, A.S., T.P. and L.T.; validation, A.S.; formal analysis, all authors; data curation, A.S.; writing—original draft preparation, M.K., T.P. and L.T.; writing—review and editing, M.K. and A.S.; visualization, A.S. and T.P.; supervision, M.K.; project administration, A.S. and M.K.; funding acquisition, M.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Estonian Research Council, grant PRG1197.

Data Availability Statement

We used the R package fExtremes [31] to access the US indemnity data and the package copula [32] for the Danish fire loss data.

Acknowledgments

The authors are thankful to Roel Verbelen for constructive discussions and comments on an earlier draft of the paper. The authors also thank all the anonymous referees for their valuable and constructive feedback.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proofs of the Properties of Lambert W Standard Normal Random Variables

In this appendix, we provide the proofs of the properties of the Lambert W standard normal random variables formulated in Lemmas 1–3 in Section 3.2.

Proof of Lemma 1.

Recall that the density

f_{Z} (z)

can be expressed as

f_{0} (z) - f_{- 1} (z)

for

z \in (- \frac{1}{γ e}, 0]

, where

f_{0}

and

f_{- 1}

are the principal and non-principal branch components of the pdf, respectively.

Furthermore, recall the form of the principal branch component

f_{0} (z)

as specified in (12):

f_{0} (z) = \frac{1}{\sqrt{2 π}} exp (- \frac{{(W_{0} (γ z))}^{2}}{2 γ^{2}}) \frac{exp (- W_{0} (γ z))}{1 + W_{0} (γ z)}

for

z > - \frac{1}{γ e}

.

Looking separately at the components of this expression, it is easy to see that

W_{0} (γ z) \to - 1

,

{(W_{0} (γ z))}^{2} \to 1

, and

1 + W_{0} (γ z) \to 0 +

if

z \to - \frac{1}{γ e} +

. Thus, the ratio

\frac{exp (- W_{0} (γ z))}{1 + W_{0} (γ z)}

tends to infinity in the process, which implies that the principal branch component

f_{0} (z)

specified in (12) goes to infinity if

z \to - \frac{1}{γ e} +

.

A similar argument holds for the non-principal branch component

f_{- 1} (z)

. First, recall that, as stated in Formula (13), the non-principal branch component has the following form:

f_{- 1} (z) = \frac{1}{\sqrt{2 π}} exp (- \frac{{(W_{- 1} (γ z))}^{2}}{2 γ^{2}}) \frac{exp (- W_{- 1} (γ z))}{1 + W_{- 1} (γ z)}

with

z \in (- \frac{1}{γ e}, 0]

.

Analyzing the components of this expression separately, it can be seen that

W_{- 1} (γ z) \to - 1

,

{(W_{- 1} (γ z))}^{2} \to 1

, and

1 + W_{- 1} (γ z) \to 0 -

in the process where

z \to - \frac{1}{γ e} +

. This implies that

\frac{exp (- W_{- 1} (γ z))}{1 + W_{- 1} (γ z)} \to - \infty

, which, in summary, results in

{lim}_{z \to - \frac{1}{γ e} +} f_{- 1} (z) = - \infty

.

In conclusion, because

f_{Z} (z) = f_{0} (z) - f_{- 1} (z)

for

z \in (- \frac{1}{γ e}, 0]

, we have

{lim}_{z \to - \frac{1}{γ e} +} f_{Z} (z) = \infty

. The lemma is proved. □

Proof of Lemma 2.

We first note that Formulas (12) and (13) differ only in the specification of the branch (

W_{0}

or

W_{- 1}

). Because most of the following argumentation holds for both branches, we do not specify the branch unless explicitly needed. In other words, we start by searching for the extrema of the function

\frac{1}{\sqrt{2 π}} exp (- \frac{{(W (γ z))}^{2}}{2 γ^{2}}) \frac{exp (- W (γ z))}{1 + W (γ z)} .

(A1)

To investigate the existence of extrema for different values of

γ > 0

, we first have to take the derivative from the expression (A1) by z, ignoring the constant in front:

\begin{matrix} {(\frac{exp (- \frac{{(W (γ z))}^{2}}{2 γ^{2}} - W (γ z))}{1 + W (γ z)})}^{'} & = \frac{exp (- \frac{{(W (γ z))}^{2}}{2 γ^{2}} - W (γ z)) {(- \frac{{(W (γ z))}^{2}}{2 γ^{2}} - W (γ z))}^{'}}{(1 + W (γ z))} \\ - \frac{exp (- \frac{{(W (γ z))}^{2}}{2 γ^{2}} - W (γ z)) {(1 + W (γ z))}^{'}}{{(1 + W (γ z))}^{2}} . \end{matrix}

(A2)

Using Formula (6), we can write

\begin{matrix} {(- \frac{{(W (γ z))}^{2}}{2 γ^{2}} - W (γ z))}^{'} & = - \frac{2 W (γ z) W^{'} (γ z)}{2 γ^{2}} - γ W^{'} (γ z) \\ = - \frac{W (γ z) exp (- W (γ z))}{γ (1 + W (γ z))} - \frac{γ exp (- W (γ z))}{1 + W (γ z)} \\ = \frac{- exp (- W (γ z)) (W (γ z) + γ^{2})}{γ (1 + W (γ z))} \end{matrix}

and

{(1 + W (γ z))}^{'} = γ W^{'} (γ z) = \frac{γ exp (- W (γ z))}{1 + W (γ z)} .

Now, substituting the results into Formula (A2) leads to

\frac{exp (- \frac{{(W (γ z))}^{2}}{2 γ^{2}} - W (γ z)) (- exp (- W (γ z))) ({(W (γ z))}^{2} + (1 + γ^{2}) W (γ z) + 2 γ^{2})}{γ {(1 + W (γ z))}^{3}} = 0 .

(A3)

The equality (A3) holds if the numerator is zero and the denominator is not. As we assume that

γ > 0

, the denominator provides a restriction

z \neq - \frac{1}{γ e}

, which is already accounted for. In the numerator, because the value of the exponential function is positive for any fixed argument (except for

z = 0

for the non-principal branch, which is dealt with separately), we need to solve the following quadratic equation:

{(W (γ z))}^{2} + (1 + γ^{2}) W (γ z) + 2 γ^{2} = 0

(A4)

with respect to

W (γ z)

. The solution to this equation is of the form

W (γ z) = - \frac{1 + γ^{2}}{2} \pm \sqrt{{(\frac{1 + γ^{2}}{2})}^{2} - 2 γ^{2}} = \frac{- 1 - γ^{2} \pm \sqrt{γ^{4} - 6 γ^{2} + 1}}{2} .

(A5)

Let us now look more closely at the expression under the square root that determines the number of solutions for Formula (A5). Solving this equation

γ^{4} - 6 γ^{2} + 1 = 0

with respect to

γ

results in

γ^{2} = 3 \pm 2 \sqrt{2} \Leftrightarrow γ = \pm \sqrt{3 \pm 2 \sqrt{2}} = \pm (\sqrt{2} \pm 1) .

Because of the initial assumption

γ > 0

, we are interested in two of these four solutions:

γ^{(1)} = \sqrt{2} - 1 \approx 0.4142

and

γ^{(2)} = \sqrt{2} + 1 \approx 2.4142

.

We have established that the pdf of a Lambert

W \times N (0, 1)

random variable has two extrema in the following regions of the skewness parameter

γ

:

γ > \sqrt{2} + 1

0 < γ < \sqrt{2} - 1

. If

\sqrt{2} - 1 < γ < \sqrt{2} + 1

, then there are no real solutions for Formula (A5).

Now, let us restrict ourselves to the principal branch of the pdf and check which of the found values for

γ

are within the range of values of the principal branch, i.e.,

W (γ z) > - 1

. Formula (A5) then leads to the equality

\frac{- 1 - γ^{2} \pm \sqrt{γ^{4} - 6 γ^{2} + 1}}{2} > - 1 \Leftrightarrow γ^{2} - 1 < \pm \sqrt{γ^{4} - 6 γ^{2} + 1},

where the solution for positive values for

γ

is

0 < γ \leq \sqrt{2} - 1

. Thus, the function has two extrema in the interval

0 < γ \leq \sqrt{2} - 1

and is monotone decreasing for

γ > \sqrt{2} - 1

. The lemma is proved. □

Proof of Lemma 3.

To prove this result, we can use the reasoning in the previous proof up to Formula (A5), as this holds for both the principal and non-principal branches. Then, it sufficient to check which solutions comply with the restriction

W (γ z) < - 1

. Solving the inequality

\frac{- 1 - γ^{2} \pm \sqrt{γ^{4} - 6 γ^{2} + 1}}{2} < - 1 \Leftrightarrow γ^{2} - 1 > \pm \sqrt{γ^{4} - 6 γ^{2} + 1}

for positive values of

γ

results in

γ \geq \sqrt{2} + 1

.

Thus, the function has two extrema if

γ \in (\sqrt{2} + 1, \infty)

, and we have proved part (b) of the lemma. Similarly, the Equation (A4) has no real solutions for

γ < \sqrt{2} + 1

. Now, the assertion that

{lim}_{z \to 0} f_{- 1} (z) = 0

follows from the construction (see Formula (13)), which proves part (a) of the lemma. The lemma is proved. □

Appendix B. Details of Estimation

We use several R [33] packages for parameter estimation. For the hyperbolic, generalized hyperbolic, variance gamma, and normal inverse Gaussian distributions, we use a routine from the ghyp package [34]; for the skew-normal and skew t distributions, we use the sn package [35]; and for all other cases, we use the fitdistrplus package [36]. In addition, we apply several functions from the LambertW package [18] to produce the pdf and cdf for the Lambert W normal distribution.

To access the US indemnity data, we use the R package fExtremes [31], while for the Danish fire loss data we use the copula package [32].

We use the default starting values in each package for the relevant MLE routines, except for the case of the Lambert W distributions. As the Lambert W approach is relatively new, the consistency and stability properties of the MLE estimator have not been thoroughly studied, though the simulations provided in [8] are promising. In the following, we apply the method of moments to find the starting point for MLE. Next, we provide a more detailed overview of our selection of these starting values.

Lambert W normal distribution. To derive the starting values for the Lambert

W \times N (μ, σ)

distribution, we use the mean, variance, and skewness coefficient of the Lambert

W \times N (μ, σ)

random variable Y provided in Formulas (14)–(16). We first equate (16) with the sample skewness coefficient and solve it numerically to produce

γ_{0}

. Next, the expressions for the mean and variance are used, first substituting

γ_{0}

and a sample variance

s_{y}^{2}

of Y into (15) and then solving it for

σ

to obtain the starting value of

σ_{0} = \sqrt{\frac{s_{y}^{2}}{(e^{γ_{0}^{2}} (e^{γ_{0}^{2}} (1 + 4 γ_{0}^{2}) - γ_{0}^{2}))}} .

Lastly, we substitute

γ_{0}

,

σ_{0}

and the sample mean

\bar{y}

into (14) and solve for

μ

to obtain

μ_{0} = \bar{y} - σ_{0} γ_{0} e^{γ_{0}^{2} / 2} .

Lambert W exponential distribution. In the case of the Lambert

W \times E x p (λ)

distribution, we use the formula for the skewness coefficient (17) with the sample-based estimate

{\hat{γ}}_{1}

to find a starting value

γ_{0}

for the skewness parameter. As the skewness coefficient

γ_{1}

is a non-monotone function of

γ

(see Figure 10), only solutions in the interval

(- 1, \frac{1}{3})

are used, as values

γ < - 1

produce too drastic of truncation. For the rate parameter

λ

, we use the expression of the first moment and solve

\bar{y} = \frac{1}{λ {(1 - γ_{0})}^{2}}

to derive the formula

λ_{0} = \frac{1}{\bar{y} {(1 - γ_{0})}^{2}}

for the starting value of

λ

.

Appendix C. Data Histograms with the Three Best Fitting Models

Figure A1. Left panel: US indemnity data (in thousands of USD). For a better overview, values above 100 are not shown on the histogram. Right panel: the same data after log-transformation. The added lines represent the best three estimates based on BIC.

Figure A2. Left panel: Danish fire data claims (in millions of DKK). For a better overview, values above 20 are not shown on the histogram. Right panel: the same data after log-transformation. The added lines represent the best three estimates based on BIC.

References

Hogg, R.V.; Klugman, S.A. Loss Distributions; John Wiley & Sons: Hoboken, NJ, USA, 1984. [Google Scholar]
Klugman, S.A.; Panjer, H.H.; Willmot, G.E. Loss Models: From Data to Decisions; John Wiley & Sons: Hoboken, NJ, USA, 2012; Volume 715. [Google Scholar]
Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Azzalini, A.; Capitanio, A. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t-distribution. J. R. Stat. Soc. Ser. B Stat. Methodol. 2003, 65, 367–389. [Google Scholar] [CrossRef]
Genton, M.G. Skew-Elliptical Distributions and Their Applications: A Journey Beyond Normality; CRC Press: Boca Raton, FL, USA, 2004. [Google Scholar]
Nadarajah, S.; Kotz, S. Skewed distributions generated by the normal kernel. Stat. Probab. Lett. 2003, 65, 269–277. [Google Scholar] [CrossRef]
Adcock, C.; Eling, M.; Loperfido, N. Skewed distributions in finance and actuarial science: A review. Eur. J. Financ. 2015, 21, 1253–1281. [Google Scholar] [CrossRef]
Goerg, G.M. Lambert W random variables—A new family of generalized skewed distributions with applications to risk estimation. Ann. Appl. Stat. 2011, 5, 2197–2230. [Google Scholar] [CrossRef]
Witkovsk’y, V.; Wimmer, G.; Duby, T. Logarithmic Lambert W × F random variables for the family of chi-squared distributions and their applications. Stat. Probab. Lett. 2014, 96, 223–231. [Google Scholar] [CrossRef]
Iriarte, Y.A.; de Castro, M.; Gómez, H.W. The Lambert-F Distributions Class: An Alternative Family for Positive Data Analysis. Mathematics 2020, 8, 1398. [Google Scholar] [CrossRef]
Iriarte, Y.A.; de Castro, M.; Gómez, H.W. An Alternative One-Parameter Distribution for Bounded Data Modeling Generated from the Lambert Transformation. Symmetry 2021, 13, 1190. [Google Scholar] [CrossRef]
Peterson, R.A. The R Journal: Finding Optimal Normalizing Transformations via bestNormalize. R J. 2021, 13, 294–313. [Google Scholar] [CrossRef]
Klami, A.; Lagus, J.; Sakaya, J. Lambert Matrix Factorization. In Proceedings of the Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2018, Dublin, Ireland, 10–14 September 2018; Proceedings, Part II 18. Springer: Berlin/Heidelberg, Germany, 2019; pp. 311–326. [Google Scholar]
Stehlík, M. Distributions of exact tests in the exponential family. Metrika 2003, 57, 145–164. [Google Scholar] [CrossRef]
Stehlík, M. Exact likelihood ratio scale and homogeneity testing of some loss processes. Stat. Probab. Lett. 2006, 76, 19–26. [Google Scholar] [CrossRef]
Stehlík, M.; Economou, P.; Kisel’ák, J.; Richter, W.D. Kullback–Leibler life time testing. Appl. Math. Comput. 2014, 240, 122–139. [Google Scholar] [CrossRef]
Goerg, G.M. The Lambert way to Gaussianize heavy-tailed data with the inverse of Tukey’s h transformation as a special case. Sci. World J. 2015, 2015, 909231. [Google Scholar] [CrossRef] [PubMed]
Goerg, G.M. LambertW: Probabilistic Models to Analyze and Gaussianize Heavy-Tailed, Skewed Data; R Package Version 0.6.7-1; R Foundation for Statistical Computing: Vienna, Austria, 2022. [Google Scholar]
Brito, P.; Fabião, F.; Staubyn, A. Euler, Lambert, and the Lambert W-function today. Math. Sci. 2008, 33, 127–133. [Google Scholar]
Dence, T. A Brief Look into the Lambert W Function. Appl. Math. 2013, 4, 887–892. [Google Scholar] [CrossRef]
Corless, R.; Gonnet, G.; Hare, D.; Jeffrey, D.; Knuth, D. On the Lambert W Function. Adv. Comput. Math. 1996, 5, 329–359. [Google Scholar] [CrossRef]
Casella, G.; Berger, R.L. Statistical Inference; Duxbury Press: Pacific Grove, CA, USA, 2002. [Google Scholar]
Frees, E.W.; Valdez, E.A. Understanding relationships using copulas. N. Am. Actuar. J. 1998, 2, 1–25. [Google Scholar] [CrossRef]
McNeil, A.J. Estimating the tails of loss severity distributions using extreme value theory. ASTIN Bull. J. IAA 1997, 27, 117–137. [Google Scholar] [CrossRef]
Klugman, S.A.; Parsa, R. Fitting bivariate loss distributions with copulas. Insur. Math. Econ. 1999, 24, 139–148. [Google Scholar] [CrossRef]
Dupuis, D.J.; Jones, B.L. Multivariate Extreme Value Theory And Its Usefulness In Understanding Risk. N. Am. Actuar. J. 2006, 10, 1–27. [Google Scholar] [CrossRef][Green Version]
Resnick, S.I. Discussion of the Danish Data on Large Fire Insurance Losses. Astin Bull. 1997, 27, 139–151. [Google Scholar] [CrossRef]
Cooray, K.; Ananda, M.M.A. Modeling actuarial data with a composite lognormal-Pareto model. Scand. Actuar. J. 2005, 2005, 321–334. [Google Scholar] [CrossRef]
Dell’Aquila, R.; Embrechts, P. Extremes and Robustness: A Contradiction? Financ. Mark. Portf. Manag. 2006, 20, 103–118. [Google Scholar] [CrossRef][Green Version]
Eling, M. Fitting insurance claims to skewed distributions: Are the skew-normal and skew-student good models? Insur. Math. Econ. 2012, 51, 239–248. [Google Scholar] [CrossRef]
Wuertz, D.; Setz, T.; Chalabi, Y. fExtremes: Rmetrics—Modelling Extreme Events in Finance; R Package Version 4021.83.; R Foundation for Statistical Computing: Vienna, Austria, 2022. [Google Scholar]
Hofert, M.; Kojadinovic, I.; Maechler, M.; Yan, J. Copula: Multivariate Dependence with Copulas; R Package Version 1.1-2; R Foundation for Statistical Computing: Vienna, Austria, 2023. [Google Scholar]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022. [Google Scholar]
Weibel, M.; Luethi, D.; Breymann, W. ghyp: Generalized Hyperbolic Distribution and Its Special Cases; R Package Version 1.6.3; R Foundation for Statistical Computing: Vienna, Austria, 2022. [Google Scholar]
Azzalini, A. The R Package sn: The Skew-Normal and Related Distributions Such as the Skew-t and the SUN (Version 2.1.0); Università degli Studi di Padova: Padua, Italia, 2022. [Google Scholar]
Delignette-Muller, M.L.; Dutang, C. fitdistrplus: An R Package for Fitting Distributions. J. Stat. Softw. 2015, 64, 1–34. [Google Scholar] [CrossRef]

Figure 1. Lambert W function.

Figure 2. Plots of the pdf (left panel) and cdf (right panel) of

W \times N (0, 1)

distributions with different

γ

values.

Figure 2. Plots of the pdf (left panel) and cdf (right panel) of

W \times N (0, 1)

distributions with different

γ

values.

Figure 3. Examples of Lambert

W \times N (0, 1)

pdf with

γ = 0.2

and

γ = 0.3

.

Figure 3. Examples of Lambert

W \times N (0, 1)

pdf with

γ = 0.2

and

γ = 0.3

.

Figure 4. Examples of Lambert

W \times N (0, 1)

pdf with

γ = 0.4

and

γ = 0.5

.

Figure 4. Examples of Lambert

W \times N (0, 1)

pdf with

γ = 0.4

and

γ = 0.5

.

Figure 5. Example of Lambert

W \times N (0, 1)

pdf when

γ = 2.5

. The right panel shows a closer view of the interval marked with grey in the left panel.

Figure 5. Example of Lambert

W \times N (0, 1)

pdf when

γ = 2.5

. The right panel shows a closer view of the interval marked with grey in the left panel.

Figure 6. Example of Lambert

W \times N (0, 1)

pdf with

γ = 3

. The right panel shows a closer view of the interval marked with grey in the left panel.

Figure 6. Example of Lambert

W \times N (0, 1)

pdf with

γ = 3

. The right panel shows a closer view of the interval marked with grey in the left panel.

Figure 7. Skewness coefficient

γ_{1}

for Lambert

W \times N (μ, σ)

random variables for different ranges of the skewness parameter

γ

.

Figure 7. Skewness coefficient

γ_{1}

for Lambert

W \times N (μ, σ)

random variables for different ranges of the skewness parameter

γ

.

Figure 8. Plots of the pdf (left panel) and cdf (right panel) of

W \times E x p (1)

distributions with different positive

γ

values.

Figure 8. Plots of the pdf (left panel) and cdf (right panel) of

W \times E x p (1)

distributions with different positive

γ

values.

Figure 9. Plots of the pdf (left panel) and cdf (right panel) of

W \times E x p (1)

distributions with different negative

γ

values.

Figure 9. Plots of the pdf (left panel) and cdf (right panel) of

W \times E x p (1)

distributions with different negative

γ

values.

Figure 10. Skewness coefficient

γ_{1}

for Lambert

W \times E x p (λ)

with parameter

γ \in (- 5, \frac{1}{3})

.

Figure 10. Skewness coefficient

γ_{1}

for Lambert

W \times E x p (λ)

with parameter

γ \in (- 5, \frac{1}{3})

.

Figure 11. Left panel: US indemnity data (in thousands of USD). Right panel: same data after log-transformation.

Figure 12. Left panel: Danish fire data claims (in millions of DKK). Right panel: same data after log-transformation.

Table 1. Parameter estimates for Lambert distributions.

Data	$W \times Exp (λ)$		$W \times N (μ, σ)$
Data	$λ$	$γ$	$μ$	$σ$	$γ$
US indemnity	0.080	−0.496	13.444	28.829	−0.789
US indemnity, log	0.093	−0.321	17.106	21.635	−0.021
Danish fire	0.386	−0.096	11.923	21.417	−0.564
Danish fire, log	1.176	−0.040	10.542	20.549	−0.373

Table 2. US indemnity data: AIC and BIC values for fitted distributions.

Distribution		AIC		BIC
Distribution	Npar	Original	Log	Original	Log
Lambert W exponential	2	13,141.92	7845.81	13,152.55	7856.44
Lambert W normal	3	13,397.48	5737.79	13,413.42	5753.73
exponential	1	14,157.93	8869.95	14,163.24	8875.26
gamma	2	13,537.17	6442.22	13,547.80	6452.85
log-normal	2	13,137.53	8895.12	13,148.16	8905.74
logistic	2	16,544.91	5753.92	16,555.54	5764.55
normal	2	18,156.65	5740.44	18,167.27	5751.06
Weibull	2	13,321.70	5923.95	13,332.33	5934.58
Cauchy	2	14,518.07	6264.44	14,528.69	6275.07
Pareto	2	13,148.51	8871.95	13,159.13	8882.58
symm hyperbolic	3	15,884.38	5738.41	15,900.32	5754.35
symm NIG ¹	3	14,515.76	5738.38	14,531.70	5754.32
symm VG ²	3	14,261.53	5738.65	14,277.47	5754.59
student t	3	14,492.64	5738.12	14,508.58	5754.06
skew-normal	3	16,315.13	5737.79	16,331.07	5753.73
asymm hyperbolic	4	14,163.24	5738.16	14,184.49	5759.41
asymm NIG	4	13,148.66	5738.12	13,169.91	5759.37
asymm VG	4	14,177.46	5738.61	14,198.71	5759.86
symm ghyp ³	4	14,494.64	5740.43	14,515.89	5761.68
skew t	4	13,197.79	5738.06	13,219.05	5759.32
asymm ghyp	5	13,145.91	5740.61	13,172.48	5767.17

¹ normal inverse Gaussian; ² variance gamma; ³ generalized hyperbolic.

Table 3. Danish fire data: AIC and BIC values for fitted distributions.

Distribution		AIC		BIC
Distribution	Npar	Original	Log	Original	Log
Lambert W exponential	2	9264.10	3282.22	9275.46	3293.58
Lambert W normal	3	6699.82	2978.46	6716.86	2995.50
exponential	1	9620.79	3297.61	9626.47	3303.30
gamma	2	9538.19	3299.61	9549.55	3310.98
log-normal	2	8119.79	5504.62	8131.16	5515.98
logistic	2	11,479.71	4421.17	11,491.08	4432.53
normal	2	15,431.52	4709.15	15,442.89	4720.52
Weibull	2	9611.24	3294.27	9622.61	3305.63
Cauchy	2	8240.17	4589.38	8251.53	4600.74
Pareto	2	9249.67	3818.07	9261.03	3829.43
symm hyperbolic	3	10,433.17	4363.90	10,450.21	4380.95
symm NIG ¹	3	8237.61	4303.93	8254.66	4320.97
symm VG ²	3	9089.69	4375.17	9106.73	4392.21
student t	3	8237.85	4299.90	8254.90	4316.94
skew-normal	3	12,608.36	3441.49	12,625.40	3458.54
asymm hyperbolic	4	8109.27	3307.83	8132.00	3330.56
asymm NIG	4	6806.79	3378.14	6829.52	3400.86
asymm VG	4	7404.07	3281.06	7426.80	3303.78
symm ghyp ³	4	8224.65	4298.21	8247.38	4320.93
skew t	4	6683.02	3274.24	6705.75	3296.96
asymm ghyp	5	6775.85	3283.06	6804.26	3311.46

¹ normal inverse Gaussian; ² variance gamma; ³ generalized hyperbolic.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Käärik, M.; Selart, A.; Puhkim, T.; Tee, L. Lambert W Random Variables and Their Applications in Loss Modelling. Symmetry 2023, 15, 1877. https://doi.org/10.3390/sym15101877

AMA Style

Käärik M, Selart A, Puhkim T, Tee L. Lambert W Random Variables and Their Applications in Loss Modelling. Symmetry. 2023; 15(10):1877. https://doi.org/10.3390/sym15101877

Chicago/Turabian Style

Käärik, Meelis, Anne Selart, Tuuli Puhkim, and Liivika Tee. 2023. "Lambert W Random Variables and Their Applications in Loss Modelling" Symmetry 15, no. 10: 1877. https://doi.org/10.3390/sym15101877

APA Style

Käärik, M., Selart, A., Puhkim, T., & Tee, L. (2023). Lambert W Random Variables and Their Applications in Loss Modelling. Symmetry, 15(10), 1877. https://doi.org/10.3390/sym15101877

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Lambert W Random Variables and Their Applications in Loss Modelling

Abstract

1. Introduction

2. The Lambert W Function and Its Properties

3. Lambert W Random Variables

3.1. Definitions

3.2. Lambert W Normal Distribution

3.3. Lambert W Exponential Distribution

4. Fitting Lambert W Random Variables to Insurance Data

5. Summary

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Proofs of the Properties of Lambert W Standard Normal Random Variables

Appendix B. Details of Estimation

Appendix C. Data Histograms with the Three Best Fitting Models

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI