Next Article in Journal
Development of Risk Management Mechanism and the System of Risk Metrics to Evaluate and Enhance the Long-Term Orientation of the Strategies of Non-Financial Companies
Previous Article in Journal
Multi-Variate Risk Measures under Wasserstein Barycenter
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bonus-Malus Premiums Based on Claim Frequency and the Size of Claims

by
Adisak Moumeesri
1 and
Tippatai Pongsart
2,*
1
Department of Mathematics, Khon Kaen University, Khon Kaen 40002, Thailand
2
Department of Statistics, Khon Kaen University, Khon Kaen 40002, Thailand
*
Author to whom correspondence should be addressed.
Risks 2022, 10(9), 181; https://doi.org/10.3390/risks10090181
Submission received: 10 July 2022 / Revised: 20 August 2022 / Accepted: 5 September 2022 / Published: 9 September 2022

Abstract

:
The bonus-malus system (BMS) is one of the most widely used tools in merit-rating automobile insurance, with the primary goal of ensuring that fair premiums are paid by all policyholders. The traditional BMS is dependent only on the claim frequency. Thus, an insured person who makes a claim with a small severity is penalized unfairly compared to an individual who makes a large severity claim. This study proposes a model for estimating the bonus-malus premium by employing a limit value (monetary unit) which distinguishes claim size into small and large based on claim frequency and claim severity distributions. This assists in determining the penalties for policyholders with claim sizes falling above and below the limit value. The number of claims is assumed to follow a Poisson distribution, and the total number of claims with a size greater than the limit value is considered a binomial distribution. The underlying risk of each policyholder is assumed to follow a beta Lindley distribution and is referred to as the prior distribution. Each policyholder’s claim size is also assumed to follow a gamma distribution, with the Lindley distribution considered as the prior distribution. Bonus-malus premiums are calculated following the Bayesian method. Practical examples using an actual data set are provided, and the results generated are compared to those produced using the traditional Poisson binomial-exponential beta model. This methodology provides a more equitable mechanism for penalizing policyholders in the portfolio.

1. Introduction

One crucial actuarial task is designing a charge structure that fairly distributes the responsibility of claims among policyholders. This is computed using the best model available to determine insurance premiums. The bonus-malus system (BMS) adjusts the premiums according to individual claim history and is popularly used, especially in automobile insurance, to determine that fair premiums are paid by all policyholders. As a reward for accident-free driving, a bonus is offered as a discount, whereas a malus results in an increase in the premium. The introduction of a BMS that links the premium to the number of reported claims will cause a tendency for policyholders to shoulder small claims on their own and not report them to their company to avoid a premium increase. “Hunger for bonus” is the term used to describe this phenomenon (Lemaire 1985). Lemaire (1976, 1977) studied the hunger for bonus and proposed a dynamic programming algorithm to determine the optimal claiming behavior. The BMS is used by insurance companies for two main reasons. The first is to encourage insureds to drive more cautiously throughout the year to avoid claims, and the latter is to ensure that insureds pay premiums that are proportional to their risks based on their claim experiences (Bühlmann 1967). The basic principle of this system is that greater claim frequencies result in higher premiums. The traditional BMS was solely dependent on the random variable of number of claims (Déniz 2016). However, not all events result in the same claim amount for an insured person. Because of different claim sizes, it makes sense to design a BMS based on both claim frequency and claim severity.
Several methodologies under the BMS framework have been used to determine fair premiums paid by policyholders. One of the most popular ways to determine premiums is the Bayesian method. Each policyholder has a constant as an unequal underlying risk. This unknown constant is referred to as a risk parameter and is treated as a random variable, with a specific probability distribution known as a prior distribution or structure-function distribution. This method is useful for calculating premiums in the BMS based on specific transition rules that distinguish policyholders as a bonus or malus. By dividing a posterior mean of the parameter by a prior mean, the basic Bayesian tool is easily applied to Bayes’ theorem. This tool employs the net premium principle and provides an estimate of risk parameters to distinguish between low- and high-risk policyholders.
The Poisson distribution is normally used to explain an independent random event of claims in vehicle insurance and expresses the behavior probability of individual policyholders (Déniz 2016). However, it cannot adequately describe the number of claims in an insurance portfolio. The negative binomial distribution, a type of mixing distribution with a Poisson distribution and a gamma distribution, was studied by Greenwood and Yule (1920). This marked the start of the process of constructing mixed Poisson distributions. A mixed Poisson was proposed for claim frequency distribution by Tremblay (1992). He designed an optimal BMS by mixing the Poisson distribution with the inverse Gaussian distribution, which has a wide variety of applications in medicine, finance, business, survival analysis and even in the maximum durability problem of investment in the gold market (Moumeesri and Klongdee 2019). In the automobile insurance sector, the random number of claims was also assumed to be Poisson distributed, while the randomly expected inherent risks of each insured person followed an inverse Gaussian distribution. Here, the Bayesian method was applied to estimate the posterior portfolio distribution function for a scenario covering the past t years. The expected premium value was calculated based on the BMS principle. Several papers have discussed mixing other distributions to obtain an optimal bonus-malus premium for the claim frequency (Dionne and Vanasse 1992; Lemaire 1995; Walhin and Paris 1999; Tzougas and Frangos 2014; Tzougas et al. 2019; Tzougas 2020). However, premium payments based on the BMS show no difference between a claim made by a policyholder for a small loss and another with a big loss. This phenomenon is called “hunger for bonus” (Lemaire 1985). For instance, a claim of USD 50 by a policyholder should not be penalized by the same increase in premium as a claim of USD 500. Déniz (2016) and Hernawati et al. (2017) proposed a model to determine premiums based on a BMS that distinguished two types of claims: those less than the limit value and those that exceeded this value. The result was a fairer method of penalizing all policyholders in the portfolio. Déniz and Calderín-Ojeda (2018) further developed the proposal of Déniz (2016). They studied a trivariate model where claims were distinguished into three types, while Pongsart et al. (2022) proposed a model for computing Bayesian bonus-malus premiums by distinguishing different multiple types of claims.
Our analysis identified situations where this concept should not be regarded as the fairest approach for penalizing policyholders. These included cases where only a few claims were made but the claim amount was large, or where a policyholder made a higher number of claims but the total claim amount was small. Therefore, the premium calculation based on the BMS should also assess the total claim amount. An optimal method to determine the premium charged to an insured must take both frequency and severity components into account (Frangos and Vrontos 2001). Many studies have attempted to determine premiums based on the BMS by including more factors than using only claim frequency in a model (Frangos and Vrontos 2001; Mert and Saykan 2005; Ibiwoye et al. 2011; Ni et al. 2014; Tzougas et al. 2015, 2017, 2018; 2020; Santi et al. 2016; Oh et al. 2019; Moumeesri et al. 2020; Jacob and Wu 2020). They proposed a model in which claim frequency and claim severity were included jointly to compute the bonus-malus premiums. This model was constructed by multiplying the bonus-malus premium based only on the number of claims with the bonus-malus premium based on the individual claim size. In addition, there was the work done to integrate the BMS into the farm insurance product. Boucher (2022) suggested developing unique BMSs using recursive partitioning techniques to each type of insured.
Here, we propose a model in which claim frequency and claim severity are included jointly to determine bonus-malus premiums by distinguishing two types of claims: those below a limit value classified as small, and the rest classified as large. The number of claims was assumed to be Poisson distributed, while the total number of claims with claim size larger than the limit value was assumed to follow a binomial distribution. The underlying risk of each policyholder was taken to be Lindley and beta distributed for the prior distributions. For claim severity distribution, we assume that the claim size of each policyholder follows a gamma distribution. The prior distribution was introduced as a Lindley distribution. To calculate the posterior structure functions for claim frequency and claim severity distributions, we followed the Bayesian approach. The mean of these functions was used to calculate the premiums paid by a policyholder.
The remainder of this paper is structured as follows. Section 2 discusses the methodology, divided into two parts as claim frequency distribution and claim severity distribution. Mixing distributions, the Bayesian method and the premium calculation are also explained. Section 3 illustrates the numerical applications and discusses the results using real claim data separated as claim frequency components and claim severity components. The results obtained are compared with those derived from the traditional Poisson-Lindley model that does not distinguish the type of claims (see Moumeesri et al. 2020 for details), and those achieved under the Poisson binomial-exponential beta model (see Hernawati et al. 2017 for details). The conclusions drawn are presented in Section 4.

2. Methodology

We assumed that the claim frequency and severity of each policyholder were independent. Claim frequency and claim severity distributions were divided into subsections as described below.

2.1. Claim Frequency Distribution

2.1.1. Mixing Distribution

In vehicle insurance, the Poisson distribution is used to describe the random occurrence of claims and represents the probability of individual policyholder behavior. Assume that the number of claims K of each policyholder, given the parameter θ > 0 , follows a Poisson distribution. Then, its probability mass function (pmf) can be represented as:
f ( k | θ ) = e θ θ k k ! ,               k = 0 , 1 , 2 , .
The expected value of the Poisson random variable is E [ K | θ ] = θ .
When the j th policyholder causes a total claim amount w j , this has a certain size, x i , that can be considered as a small or large claim. To distinguish the two types of claims, the limit value denoted by Φ separates the two sub-events as follows.
Let Z i be the random variable corresponding to the claim size over the limit values.
Z i = { 1 ,       x i > Φ 0 ,       x i < Φ
where Φ is a positive real number and x i > 0 .
Z i is modeled as a mutually independent and identically distributed random variable (i.i.d.) with a parameter of 0 < p < 1 and Bernoulli distribution. Then, its pmf can be given as follows:
f ( z i | p ) = p z i ( 1 p ) 1 z i ,             z i = 0 , 1 .
We also assume that Z = i = 1 k Z i is the total number of claims with claim size larger than Φ . Then, the remainder, K Z , is the total number of claims with claim size below Φ . Assume that Z has a binomial distribution. The conditional pmf of each Z i can be provided in the following form:
f ( z | k , p ) = ( k z ) p z ( 1 p ) k z ,       z = 0 , 1 , 2 , , k .
The expected value of the binomial random variable is E [ Z | k , p ] = k p .
Thus, the joint distribution of the number of claims ( K ) and the number of claims corresponding to the limit value ( Z ) can be represented as:
f ( k , z | θ , p ) = f ( k | θ ) f ( z | k , p ) = e θ θ k k ! ( k z ) p z ( 1 p ) k z ,
where k = 0 ,   1 ,   2 , and z = 0 ,   1 ,   2 , , k . This is the joint distribution of Poisson ( θ ) and binomial ( k , p ).
All policyholders have a constant representing their expected inherent risk. This is the mean of the number of claims of each insured, denoted by θ . The parameter θ in Equation (1) denotes the different underlying risks of each insured having an accident and varies from one individual to another. This parameter is assumed to be a random variable Θ according to the Lindley distribution, with parameter δ . Then, the probability density function (pdf) of θ can be represented as:
π 1 ( θ ) = δ 2 δ + 1 ( θ + 1 ) e δ θ ,       θ > 0 ,   δ > 0 .
The expected value of the Lindley random variable is E [ Θ ] = δ + 2 δ ( δ + 1 ) .
The parameter p in Equation (2) varies from one individual to another and is assumed to be a random variable P that follows a beta distribution, with parameters α > 0 and β > 0 . Thus, its pdf can be represented as:
π 2 ( p ) = 1 B ( α , β ) p α 1 ( 1 p ) β 1 ,         0 < p < 1 ,
where B ( α , β ) is the beta function defined by B ( α , β ) = Γ ( α ) Γ ( β ) Γ ( α + β ) , and Γ ( ) is the gamma function defined as Γ ( α ) = 0 t α 1 e t d t . The expected value of the beta random variable is E [ P ] = α α + β .
When the random variables Θ and P are assumed to be independent, then the joint prior distribution for two random variables can be given by:
π ( θ , p ) = π 1 ( θ ) π 2 ( p ) = δ 2 δ + 1 ( θ + 1 ) e δ θ 1 B ( α , β ) p α 1 ( 1 p ) β 1 .
The mixed Poisson binomial distribution in Equation (3) with Lindley beta distribution in Equation (4) leads to in the following equations:
f ( k , z ) = 0 0 1 f ( k , z | θ , p ) π ( θ , p )   d p d θ = 0 0 1 e θ θ k k ! ( k z ) p z ( 1 p ) k z δ 2 δ + 1 ( θ + 1 ) e δ θ 1 B ( α , β ) p α 1 ( 1 p ) β 1   d p d θ = δ 2 δ + 1 1 k ! ( k z ) B ( α + z , β + k z ) B ( α , β ) [ Γ ( k + 2 ) ( 1 + δ ) k + 2 + Γ ( k + 1 ) ( 1 + δ ) k + 1 ] = B ( α + z , β + k z ) B ( α , β ) ( k z ) ! z ! δ 2 Γ ( k + 1 ) ( k + δ + 2 ) ( 1 + δ ) k + 3 = ( k z ) B ( α + z , β + k z ) B ( α , β ) δ 2 ( k + δ + 2 ) ( 1 + δ ) k + 3 ,
where α > 0 ,   β > 0 ,   δ > 0 ,   k = 0 , 1 , 2 , and z = 0 , 1 , 2 , , k .

2.1.2. Bayesian Method

The bonus-malus premium calculation has been rigorously studied, with the Bayesian method being one of the most popular computing options (Déniz et al. 2014). The main purpose of this approach is to obtain the posterior distribution function. Bayesian methodology can be applied when data for all policyholders are available, either based on claims history or insured profiles.
Let ( k , z ) = { ( k 1 , z 1 ) , ( k 2 , z 2 ) , , ( k t , z t ) } be a sample, where t is the sample size. Then, N = i = 1 t k i is the total number of claims that a policyholder made in t years, k i is the number of claims that the policyholder made in the year i , i = 1 , 2 , , t , and M = i = 1 t z i represents the total number of claims where the claim size was larger than Φ in t years.
The posterior distribution function of random variables Θ and P can be considered according to Bayes’ theorem as proportional to the product of the prior distribution and the likelihood function.
The likelihood function considered by Equation (3) is proportional to
L ( θ , p ; ( k , z ) ) = f ( ( k , z ) | θ , p ) = i = 1 t e θ θ k i k i ! ( k i z i ) p z i ( 1 p ) k i z i = p i = 1 t z i ( 1 p ) i = 1 t k i i = 1 t z i e t θ θ i = 1 t k i i = 1 t 1 k i ! ( k i z i )   p M ( 1 p ) N M e t θ θ N .
The prior distribution considered by Equation (4) is proportional to
π ( θ , p ) p α 1 ( 1 p ) β 1 ( θ + 1 ) e δ θ .
We then obtain the posterior distribution function of Θ and P for a policyholder or a group of policyholders with claim history ( k 1 , z 1 ) , ( k 2 , z 2 ) , , ( k t , z t ) . Then, the posterior distribution function which is proportional to the product of the likelihood function in Equation (6) and the prior distribution in Equation (7) can be represented as
π * ( θ , p | ( k , z ) )   p M + α 1 ( 1 p ) N M + β 1 e ( t + δ ) θ ( θ + 1 ) θ N .
Consider
0 0 1 π * ( θ , p | ( k , z ) )   d p d θ 0 0 1 p M + α 1 ( 1 p ) N M + β 1 e ( t + δ ) θ ( θ + 1 ) θ N   d p d θ
then
0 0 1 π * ( θ , p | ( k , z ) )   d p d θ = 0 0 1 A   p M + α 1 ( 1 p ) N M + β 1 e ( t + δ ) θ ( θ + 1 ) θ N   d p d θ = 1 ,
where A is constant.
Let u = ( t + δ ) θ , θ = u t + δ , d u = ( t + δ ) d θ , d θ = d u t + δ , thus
A 0 e ( t + δ ) θ ( θ + 1 ) θ N 0 1 p M + α 1 ( 1 p ) N M + β 1   d p d θ = 1 A   B ( M + α , N M + β ) [ 0 e u u N + 1 ( t + δ ) N + 1 d u t + δ + 0 e u u N ( t + δ ) N d u t + δ ] = 1 A   B ( M + α , N M + β ) [ Γ ( N + 2 ) ( t + δ ) N + 2 + Γ ( N + 1 ) ( t + δ ) N + 1 ] = 1 A   B ( M + α , N M + β ) Γ ( N + 1 ) ( t + δ ) N + 1 [ N + 1 + t + δ t + δ ] = 1 .
then
A = ( t + δ ) N + 2 Γ ( N + 1 ) B ( M + α , N M + β ) ( N + 1 + t + δ ) .
Therefore, the posterior distribution function can be represented as the following equation:
π * ( θ , p | ( k , z ) ) = ( t + δ ) N + 2 p M + α 1 ( 1 p ) N M + β 1 e ( t + δ ) θ ( θ + 1 ) θ N Γ ( N + 1 ) B ( M + α , N M + β ) ( N + 1 + t + δ ) .

2.1.3. Premium Calculation

Many principles are involved in pricing insurance premiums. In this study, the net premium is denoted by H [ X ] . This is the basic principle in the sense that premiums should cover the expected value of losses. The net premium is calculated by expected values or means of claims or the number of claims to be paid by the insurers.
A premium calculation principle is a rule for assigning a premium to an insurance risk. This principle also assigns any risk X with probability density function f X ( x | θ ) , where x takes values in the sample space X and θ is considered a realization of a parameter space Θ . A premium is a real number. Heilmann (1989), Landsman and Makov (1998) and Young (2000) introduced the net premium principle, which can be represented as
P ( θ ) = E f [ X | θ ] = X x f X ( x | θ ) d x ,           θ Θ .
For the loss function L :   Θ × , the risk function R :   Θ × is given by
R ( θ , P ) = E f [ L ( θ , P ) ] = X L ( θ , P ) f X ( x | θ ) d x .
The number of claims made by a policyholder in a given period is assigned by two random variables ( K , Z ) , with the pdf f ( k , z | θ , p ) depending on an unknown risk parameter ( θ , p ) that has a prior distribution π ( θ , p ) . In the actuarial field, an unknown premium P ( θ , p ) , called the risk premium, is calculated by minimizing the expected loss Ef[L(g(k,z), P )], where g ( k , z ) is a suitable function of the number of claims with claim size relating to the limit values and P is the action space. If this experience is not available, then the collective premium is computed by minimizing the risk function Eπ(θ,p)[L(P(θ,p), P )], where π ( θ , p ) is the prior distribution of the unknown parameter ( θ , p ) . Conversely, if the experience is available, then the sample ( k , z ) will be used by the actuary to estimate the unknown risk premium P * , called the Bayesian premium. Déniz et al. (2000) and Heilmann (1989) further developed the premium calculation principle as follows:
  • The risk premium:
    P ( θ , p ) = E f [ g ( k , z ) ] = K Z g ( k , z ) f ( k , z | θ , p ) .
  • The collective premium:
    P = E π [ P ( θ , p ) ] = θ p P ( θ , p ) π ( θ , p ) d p   d θ .
  • The Bayesian premium:
    P * = θ p P ( θ , p ) π * ( θ , p | ( k , z ) ) d p   d θ .
    g ( k , z ) can be represented as
    g ( k , z ) = p l z + p s ( k z ) ,
    where p l and p s are suitable weights related to the number of claims, with sizes above and below the limit value, respectively. We also assume that 0 p s p l 1 .
The risk premium can be obtained using the net premium principle as shown below:
P ( θ , p ) = E [ g ( k , z ) ] = k = 0 z = 0 k g ( k , z ) f ( k , z | θ , p ) = k = 0 z = 0 k ( p l z + p s ( k z ) ) e θ θ k k ! ( k z ) p z ( 1 p ) k z = k = 0 e θ θ k k ! [ p l z = 0 k z ( k z ) p z ( 1 p ) k z + p s z = 0 k ( k z ) ( k z ) p z ( 1 p ) k z ] = k = 0 e θ θ k k ! ( p l k p + p s k p s k p ) = [ ( p l p s ) p + p s ] θ .
The risk premium can be reduced to P ( θ , p ) = θ if p l = p s = 1 , and considered under the net premium principle or the traditional model obtained by the mean of the Poisson distribution.
If the value of ( θ , p ) is known, then the premium is a fair amount to penalize a policyholder. However, these values cannot be observed in practice since the risk premium is a theoretical amount that cannot be determined exactly but must be estimated from the data. The average premium for all possible risk premiums is called a prior or collective premium. This can be represented by the equations below.
P = E [ P ( θ , p ) ] = 0 0 1 P ( θ , p ) π ( θ , p ) d p   d θ = 0 0 1 [ ( p l p s ) p + p s ] θ δ 2 δ + 1 ( θ + 1 ) e δ θ 1 B ( α , β ) p α 1 ( 1 p ) β 1   d p   d θ = p l α + p s β α + β δ + 2 δ ( δ + 1 ) .
If all weights are set to be p l = p s = 1 , then the collective premium calculated under the traditional distribution or net premium principle is P = δ + 2 δ ( δ + 1 ) , where P is the mean of the Lindley distribution.
The posterior premium or Bayesian premium can be represented as the following equations:
P t * ( k , z ) = 0 0 1 P ( θ , p ) π * ( θ , p | ( k , z ) ) d p   d θ = 0 0 1 [ ( p l p s ) p + p s ] θ ( t + δ ) N + 2 p M + α 1 ( 1 p ) N M + β 1 e ( t + δ ) θ ( θ + 1 ) θ N Γ ( N + 1 ) B ( M + α , N M + β ) ( N + 1 + t + δ ) d p   d θ = p l ( M + α ) + p s ( N M + β ) N + α + β N + 1 t + δ N + 2 + t + δ N + 1 + t + δ ,
where N = i = 1 t k i represents the total number of claims and M = i = 1 t z i represents claims with size larger than Φ . Note that P t * ( k , z ) = P 0 * ( 0 , 0 ) = P . Thus, the Bayesian premium in Equation (11) is similar to the prior premium in Equation (10) when no information is available.
If we assume that the initial premium or base premium at time t = 0 is 100, then, at time t + 1 , the Bayesian bonus-malus premium can be determined from the number of claims and expressed as:
Premium t + 1 = P t * ( k , z ) P 0 * ( 0 , 0 ) 100 = P t * ( k , z ) P 100 ,
i.e.,
Premium t + 1 = α + β p l α + p s β p l ( M + α ) + p s ( N M + β ) N + α + β N + 1 t + δ N + 2 + t + δ N + 1 + t + δ   δ ( δ + 1 ) δ + 2 100 .

2.2. Claim Severity Distribution

2.2.1. Mixing Distribution

Suppose that the amount x is the claim size of each policyholder. We assume that the conditional distribution of the claim size, given by the parameter β 1 , follows a gamma distribution. Its pdf can then be given by
f ( x | β 1 ) = β 1 τ Γ ( τ ) x τ 1 e β 1 x ,       x > 0 ,   τ > 0 ,   β 1 > 0 .
The expected value of the gamma random variable is E [ X | β 1 ] = τ β 1 .
The parameter β 1 is not the same for all insureds. Therefore, our prior belief for λ in Equation (13) can be expressed in the form of a distribution. If the parameter β 1 is distributed according to the Lindley distribution with the parameter δ 1 , then its pdf is provided by
π ( β 1 ) = δ 1 2 δ 1 + 1 ( β 1 + 1 ) e δ 1 β 1 ,       β 1 > 0 ,   δ 1 > 0 .
The expected value of β 1 will be E [ β 1 ] = δ 1 + 2 δ 1 ( δ 1 + 1 ) .
Then, the unconditional distribution of claim size x can be obtained as follows
f ( x ) = 0 f ( x | β 1 ) π ( β 1 ) d β = 0 β 1 τ Γ ( τ ) x τ 1 e β 1 x δ 1 2 δ 1 + 1 ( β 1 + 1 ) e δ 1 β 1 d β 1 = δ 1 2 δ 1 + 1 x τ 1 Γ ( τ ) 0 ( e ( x + δ 1 ) β 1 β 1 τ + 1 + e ( x + δ 1 ) β 1 β 1 τ ) d β 1 = τ δ 1 2 x τ 1 δ 1 + 1 x + τ + δ 1 + 1 ( x + δ 1 ) τ + 2 ,
where x > 0 ,   τ > 0 and δ 1 > 0 . Figure 1 illustrates the pdf plots of gamma-Lindley distribution for the unconditional distribution of claim size x .

2.2.2. Bayesian Method

Consider that an insured is in the portfolio for t years and that the number of claims made in the year i is denoted by k i . Let N = i = 1 t k i be the total number of claims that a policyholder made in t years. Then, claim size history can be represented as vector x = ( x 1 , x 2 , , x N ) . The total claim size over t years made by a policyholder in the portfolio can be denoted by S = k = 1 N x k .
Bayes’ theorem can be applied to consider the likelihood function and the prior distribution to obtain the posterior distribution function of the parameter β 1 given the claim size history of the insured x 1 , x 2 , , x N as follows:
The likelihood function is
L ( β 1 ; x 1 , x 2 , , x N ) = f ( x 1 , x 2 , , x N | β 1 ) = k = 1 N β 1 τ Γ ( τ ) x k τ 1 e β 1 x k β 1 τ N e β 1 S .
The prior distribution is
π ( β 1 ) = δ 1 2 δ 1 + 1 ( β 1 + 1 ) e δ 1 β 1 ( β 1 + 1 ) e δ 1 β 1 .
The posterior distribution for a policyholder or a group of policyholders with claim history x 1 , x 2 , , x N is proportional to the product of the likelihood function in Equation (15) and the prior distribution in Equation (16) and can be represented as:
π * ( β 1 | x 1 , x 2 , , x N )   f ( x 1 , x 2 , , x N | β 1 ) π ( β 1 ) = β 1 τ N e β 1 S ( β 1 + 1 ) e δ 1 β 1 = e ( S + δ 1 ) β 1 ( β 1 + 1 ) β 1 τ N
Consider
0 π * ( β 1 | x 1 , x 2 , , x N ) d β 1   0 e ( S + δ 1 ) β 1 ( β 1 + 1 ) β 1 τ N d β 1 ,
then
0 π * ( β 1 | x 1 , x 2 , , x N ) d β 1 = 0 B   e ( S + δ 1 ) β 1 ( β 1 + 1 ) β 1 τ N d β 1 = 1 .
where B is a constant.
Let u = ( S + δ 1 ) β 1 , β 1 = u S + δ 1 , d u = ( S + δ 1 ) d β 1 , d β 1 = d u S + δ 1 , then
B [ 0   e ( S + δ 1 ) β 1 β 1 τ N + 1 d β 1 + 0   e ( S + δ 1 ) β 1 β 1 τ N d β 1 ] = 1 B [ 0   e u u τ N + 1 ( S + δ 1 ) τ N + 1 d u S + δ 1 + 0 e u u τ N ( S + δ 1 ) τ N   d u S + δ 1 ] = 1 B [ Γ ( τ N + 2 ) ( S + δ 1 ) τ N + 2 + Γ ( τ N + 1 ) ( S + δ 1 ) τ N + 1 ] = 1 B Γ ( τ N + 1 ) ( S + δ 1 ) τ N + 1 [ τ N + 1 + S + δ 1 S + δ 1 ] = 1 .
Hence,
B = ( S + δ 1 ) τ N + 2 Γ ( τ N + 1 ) ( τ N + 1 + S + δ 1 ) .
Therefore, the posterior distribution function results in the following Equation (17).
π * ( β 1 | x 1 , x 2 , , x N ) = ( S + δ 1 ) τ N + 2 Γ ( τ N + 1 ) ( τ N + 1 + S + δ 1 ) e ( S + δ 1 ) β 1 ( β 1 + 1 ) β 1 τ N .

2.2.3. Premium Calculation

Many principles are involved in pricing insurance premiums. In this article, we determined the net premium principle for both claim severity distribution and claim frequency distribution. The basic principle is that premiums should be the expected value of losses. The expected value of the posterior distribution function in Equation (17) for the gamma-Lindley distribution was
β ^ 1 = E [ β 1 | x 1 , x 2 , , x N ] = 0 β 1 ( S + δ 1 ) τ N + 2 Γ ( τ N + 1 ) ( τ N + 1 + S + δ 1 ) e ( S + δ 1 ) β 1 ( β 1 + 1 ) β 1 τ N d β 1 = ( τ N + 1 ) ( τ N + 2 + S + δ 1 ) ( S + δ 1 ) ( τ N + 1 + S + δ 1 ) .
From E [ β 1 | x 1 , x 2 , , x N ] = β ^ 1 then E [ x 1 , x 2 , , x N | β 1 ] = τ β ^ 1 .
Therefore,
E [ x 1 , x 2 , , x N | β 1 ] = τ ( S + δ 1 ) ( τ N + 1 + S + δ 1 ) ( τ N + 1 ) ( τ N + 2 + S + δ 1 ) .
The Bayesian bonus-malus premium that must be paid by a specific group of insureds will be equal to the product of the Bayesian premium based on the frequency component in Equation (11) and the severity component in Equation (18). This can be expressed by
Premium = [ p l ( M + α ) + p s ( N M + β ) N + α + β N + 1 t + δ N + 2 + t + δ N + 1 + t + δ ] × [ τ ( S + δ 1 ) ( τ N + 1 + S + δ 1 ) ( τ N + 1 ) ( τ N + 2 + S + δ 1 ) ] .
The model was constructed considering only the frequency component to set the premium for each policyholder and considering the premium in the severity component as a constant. Therefore, this model was not fair when penalizing policyholders. For instance, a policyholder making a claim of AUD 100 should not be penalized by the same amount as other policyholders making claims of AUD 1000. The total amount of claims should be considered as the main factor for setting fair premiums for all policyholders. The model in Equation (19) offers fairer penalties to all policyholders than a model considering only the frequency component.
To determine the bonus-malus premiums that must be paid according to the proposed model, we need to know the number of claims, age of the policy and total claim amounts. All these values are generally shown in the portfolio.
The Bayesian bonus-malus premium at time t = 0 is called the base premium. This is the initial premium paid by a new policyholder who joins the insurance scheme. The base premium is given by
Premium 0 = [ p l α + p s β α + β δ + 2 δ ( 1 + δ ) ] × [ τ δ 1 ( 1 + δ 1 ) 2 + δ 1 ] .

3. Numerical Applications

3.1. Data

The data set used to calculate model premiums introduced in this paper was based on 1-year automobile insurance policies taken out in 2004 or 2005. This data set can be found on the website of the Faculty of Business and Economics, Macquarie University (Sydney, Australia), see also De Jong and Heller (2008). Out of 67,856 policies in the total portfolio, 4624 claims were made. Histograms of the data set and histograms in log scale are shown in Figure 2 and Figure 3, respectively. There were 4333 policyholders who made claims once, 271 twice, 18 three times and 2 four times. The limit value was taken as Φ = 500 monetary units, and data were derived from Déniz (2016) to distinguish between the two types of claims. The number of policyholders that made claims once with claim size above the limit value was 2615, 123 twice, 7 three times and 1 four times, as shown in Table 1.

3.2. Parameter Estimation and Data Fitting

3.2.1. Claim Frequency Distribution

Model parameters can be estimated after the prior distributions. The parameters were estimated from the data using the unconditional distribution of ( K , Z ) provided in Equation (5). Let ω f = ( α , β , δ ) be the vector of the parameters in Equation (5).
Let ( k , z ) = { ( k 1 , z 1 ) , ( k 2 , z 2 ) , , ( k n , z n ) } be a sample with n observations taken from the pdf in Equation (5). A parameter estimation method for the frequency component, called the minimum chi-square estimator (MCSE), was used to minimize the statistical value of the chi-square goodness of fit test ( χ 2 ) for the frequency distribution. The advantage of this method is that it gives a better fit to the data set. The χ 2 formula is defined by
χ 2 = i = 1 n ( O i E i ) 2 E i ,
where χ 2 is the value of the chi-square goodness of fit test, O i is the observed frequency count for the i t h level of the categorical variable and E i is the expected frequency count for the i t h level of the categorical variable.
A randomized neighborhood search (RNS) technique was used to minimize the statistical value of χ 2 . Let ω i be the vector of the parameters for some iteration i = 0, 1, 2, …. Let ϑ i be the representative of the statistical value ( χ 2 ) corresponding to the parameter ω i . The RNS process (see Pongsart et al. (2022) for detail) is explained by the following steps.
Step 1: Set the initial parameter ω and compute the statistical value ϑ with the parameter ω .
Step 2: Randomly change the parameter ω i to ω i * :
ω i * = ω i + r i ω i ,
where r i [ 0.5 , 0.5 ] .
Step 3: Compute the statistical value ϑ * with the parameter ω i * .
Step 4: Compare ϑ * and ϑ . If ϑ * < ϑ , then we set ω i = ω i * and go to Step 5. Else go to Step 2.
Step 5: If | ϑ * ϑ | 10 5 then the process is completed. Else go to Step 2.
MATLAB programming was used for parameter estimation.
Claim frequency distribution was applied using the Poisson binomial-Lindley beta distribution introduced in this article. The estimated parameter values of the Poisson binomial-Lindley beta distribution were α ^ = 4.1061, β ^ = 2.9352 and δ ^ = 14.5654 with chi-square goodness of fit test χ 2 = 17.4035 and AIC = 42,582.3175. Values of observed claim frequency and expected frequency were compared in Table 2 between the traditional Poisson binomial-exponential beta distribution (Hernawati et al. 2017) and Poisson binomial-Lindley beta distribution. Results showed that our proposed Poisson binomial-Lindley beta distribution gave a better fit to the data than the traditional Poisson binomial-exponential beta distribution.

3.2.2. Claim Severity Distribution

The maximum likelihood estimation (MLE) is widely used for estimating the model parameters for claim severity distribution. The unconditional distribution of X is provided in Equation (14).
Let X = ( x 1 , x 2 , , x n ) T be an identically independent observation for the gamma-Lindley distribution with pdf in Equation (14). To find the most likely value of the parameters δ 1 and τ , we require maximizing the likelihood function L where
L ( δ 1 , τ ; x i ) = i = 1 n f ( x i ; δ 1 , τ ) = i = 1 n [ τ δ 1 2 x i τ 1 δ 1 + 1 x i + τ + δ 1 + 1 ( x i + δ 1 ) τ + 2 ] .
Then, the log-likelihood function is
ln L ( δ 1 , τ ; x i ) = n ln τ + 2 n ln δ 1 + ( τ 1 ) i = 1 n ln x i + i = 1 n ln ( x i + τ + δ 1 + 1 ) n ln ( δ 1 + 1 )   ( τ + 2 ) i = 1 n ln ( x i + δ 1 ) .
The estimators δ ^ 1 and τ ^ of the parameters δ 1 and τ respectively can be obtained by solving the equation:
d d δ 1 ln L ( δ 1 , τ ; x i ) = 0   and   d d τ ln L ( δ 1 , τ ; x i ) = 0 .
where
2 n δ 1 + i = 1 n ( 1 x i + τ + δ 1 + 1 ) n δ 1 + 1 ( τ + 2 ) i = 1 n 1 x i + δ 1 = 0
and
n τ + i = 1 n ln x i + i = 1 n ( 1 x i + τ + δ 1 + 1 ) i = 1 n ln ( x i + δ 1 ) = 0 .
The parameters δ 1 and τ cannot be estimated in closed form and the numerical iteration technique is used to solve the equations above.
To evaluate whether a given distribution is suited to a data set, the Kolmogorov-Smirnov test (K-S test) represents the goodness of fit test for the claim severity distribution. The K-S test statistic is defined by
D = max | F n ( x ) F ( x ) | ,
where F ( x ) is the theoretical cumulative distribution of claim severity distributions and F n ( x ) is denoted by
F n ( x ) = 1 n [ number   of   observations   x ] ,
where n is the sample size.
The results gave the maximum likelihood estimators δ ^ 1 = 1501.5620 and τ ^ = 0.8012, with D-value = 0.0339 and AIC = 82,083.9448. Figure 4 shows the P-P plot of the Gamma-Lindley distribution.

3.3. Bonus-Malus Premiums

3.3.1. Claim Frequency Component

The Bayesian bonus-malus premiums based only on the frequency component were determined and calculated from Equation (12), with results shown in Table 3. For comparison, a limit value Φ was assumed to be 500 monetary units, with weights p l = 0.8 and p s = 1.
From the results in Table 3, a bonus with no claims in the first year represented 6.77% of the base premium. Policyholders who made one claim with a claim size below and above the limit value during the first year must pay a malus at 82.92% and 87.97%, respectively of the base premium. Premiums decrease if policyholders have a claim-free year and increase if claims occur. The Bayesian bonus-malus premiums achieved under the traditional Poisson-Lindley model were compared (see Moumeesri et al. 2020 for details). The bonus-malus premiums were the same as the bonus ( N = 0) and different for the remainder of the malus ( N 1 ). For these malus classes, policyholders with a claim size below the limit value are rewarded compared to those with premiums determined under the traditional Poisson-Lindley model. The remaining premiums may be larger or smaller than those obtained in this model. A comparison of premiums between the traditional Poisson-Lindley model and the proposed Poisson binomial-Lindley beta model in the first year for various numbers of claims is shown in Figure 5.
For comparative purposes, the Bayesian bonus-malus premiums achieved under the Poisson binomial-exponential beta model were computed (see Hernawati et al. 2017 for details), with the results shown in Table 4.
In Table 4, a policyholder with no claims from the first year to the seventh year is awarded by bonus classes that equal the premiums determined under the Poisson binomial-Lindley beta model. An insured who made one claim with a claim size below and above the limit value of during the first year must pay a malus at 83.15% and 88.79%, respectively of the base premium. The results in Table 4 show that malus values based on the Poisson binomial-Lindley beta model both below and above the limit value are less than those based on the Poisson binomial-exponential beta model.
The Bayesian bonus-malus premiums computed from the traditional Poisson binomial-exponential beta model were stricter for high-risk policyholders than the proposed Poisson binomial-Lindley beta model in the case of claim sizes both below and above the limit value. Premiums obtained from the traditional model were similar to the proposed model for low-risk policyholders.

3.3.2. Claim Frequency and Claim Severity Components

Bonus-malus premiums using the proposed model are shown in Equation (19). To find the bonus-malus premiums that must be paid, we need to know the number of claims made in these years, the number of claims corresponding to the limit value, the age of the policy and the total claim amount from a portfolio.
In some cases, the total claim amount corresponding to the limit value was equal to 400, 1500 and 2500. To better understand how the BMS operates, Table 5 shows the premiums that must be paid by the policyholder for various numbers of claims when the age of the policy is up to 7 years. The base premium, an initial premium paid by all new policyholders, is 80.52, and this decreases with a claim-free year. Conversely, after an accident with a claim size of 400 in the first year, the policyholder will pay 103.57.
If an accident occurs during the second year with a claim size of 1100, then a surcharge will be enforced and the insured will have to pay 160.54 as the premium for two accidents, with a claim amount that exceeds the limit value one time ( M = 1) for a total claim size of 1500 in 2 years, as shown in Table 6.
If no accident occurs in the third year, then the premium will reduce with a claim-free year and the insured must pay 151.04, which is the premium for two accidents of total claim size 1500 in 3 years (see Table 6).
If two accidents occur during the fourth year with claim sizes 900 and 100, then a surcharge will be enforced of 194.88, which is the premium for four accidents with a claim amount that exceeds the limit value two times ( M = 2) for total claim size 2500 in 4 years, as shown in Table 7.
The Bayesian bonus-malus premiums were compared under the traditional Poisson binomial-exponential beta distribution for the frequency component and the gamma-Lindley distribution for the severity component. The results are shown in Table 8, Table 9 and Table 10.
The base premiums and premiums with a claim-free year for low-risk policyholders are slightly different between the bonus-malus premiums proposed in Table 5, Table 6 and Table 7 and the traditional bonus-malus premiums in Table 8, Table 9 and Table 10. Moreover, the traditional bonus-malus premiums in Table 8, Table 9 and Table 10 are stricter for high-risk policyholders than premiums presented in Table 5, Table 6 and Table 7 when the total number of claims with claim size over the limit value is high. However, the traditional bonus-malus premiums in Table 8, Table 9 and Table 10 are more generous for high-risk policyholders than premiums presented in Table 5, Table 6 and Table 7 when the total number of claims with the claim size over the limit value is small.
Table 11 shows the premiums (malus) with Poisson binomial-Lindley beta for claim frequency distribution and gamma-Lindley for claim severity distribution that must be paid by policyholders during the first year of the policy, with the number of accidents ranging from 1 to 4, number of claims corresponding to the limit value and the aggregate claim amount of accidents ranging from AUD 100 to AUD 30,000. The results shown in Table 11 are depicted in Figure 6. Premiums were dependent on aggregate claim amount, number of claims and number of claims with claim size corresponding to the limit value.

4. Conclusions

A model was proposed to determine optimal bonus-malus premiums that considered both claim frequency and claim severity components by distinguishing two types of claims based on claim amounts beneath and above a threshold value. Two mixed distributions, the Poisson binomial with the Lindley beta distribution and a mixed gamma with Lindley distribution, were investigated and employed in the model as frequency distribution and severity distribution, respectively. The Bayesian method was applied in the model as a frequency component and a severity component. Both models were then integrated. The new model rewarded and penalized all policyholders in a fairer manner.
An example of real automobile insurance data was used to illustrate our model. Fitted results of claim frequency were better than the traditional Poisson binomial-exponential beta model. When claims occurred, high-risk policyholders were liable to more severe penalties under the traditional model. Our proposed model presents a reasonable alternative method for rewarding and penalizing both low- and high-risk policyholders. The obtained premiums could be used by insurers to gain market share in the highly competitive insurance industry.
To increase impartiality of assigning premium to all policyholders, further different types of claims could be distinguished as a topic for further research.

Author Contributions

Conceptualization, A.M. and T.P.; methodology, A.M. and T.P.; software, A.M.; validation, A.M. and T.P.; formal analysis, A.M. and T.P.; investigation, A.M. and T.P.; data curation, A.M.; writing—original draft preparation, A.M.; writing—review and editing, A.M. and T.P.; visualization, A.M.; supervision, T.P.; project administration, T.P.; funding acquisition, T.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Office of the Permanent Secretary, Ministry of Higher Education, Science, Research and Innovation, grant number RGNS 63-054.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to express our heartfelt appreciation to Watcharin Klongdee for the initial idea and helpful advice in completing this paper successfully. The authors would like to thank the referees for their careful reading and numerous helpful suggestions that helped to improve this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Boucher, Jean-Philippe. 2022. Multiple Bonus–Malus Scale Models for Insureds of Different Sizes. Risks 10: 152. [Google Scholar] [CrossRef]
  2. Bühlmann, Hans. 1967. Experience Rating and Credibility. Astin Bulletin 4: 199–207. [Google Scholar] [CrossRef]
  3. De Jong, Piet, and Gillian Z. Heller. 2008. Generalized Linear Models for Insurance Data. Cambridge: Cambridge University Press. [Google Scholar]
  4. Déniz, E. Gómez, Francisco José Vázquez Polo, and A. Hernández Bastida. 2000. Robust Bayesian premium principles in actuarial science. Journal of the Royal Statistical Society Series D (The Statistician) 49: 241–52. [Google Scholar] [CrossRef]
  5. Déniz, Emilio. 2016. Bivariate credibility bonus malus premiums distinguishing between two types of claims. Insurance: Mathematics and Economics 70: 117–24. [Google Scholar]
  6. Déniz, Emilio, Agustín Hernández-Bastida, and M. Pilar Fernández-Sánchez. 2014. Computing credibility Bonus-Malus premiums using the total claim amount distribution. Hacetepe Journal of Mathematics and Statistics 43: 1047–61. [Google Scholar]
  7. Déniz, Emilio, and Enrique Calderín-Ojeda. 2018. Multivariate Credibility in Bonus-Malus Systems Distinguishing between Different Types of Claims. Risks 6: 34. [Google Scholar] [CrossRef]
  8. Dionne, Georges, and Charles Vanasse. 1992. Automobile Insurance Ratemaking in the Presence of Asymmetrical Information. Journal of Applied Econometrics 7: 149–65. [Google Scholar] [CrossRef]
  9. Frangos, Nicholas E., and Spyridon D. Vrontos. 2001. Design of optimal bonus-malus systems with a frequency and a severity component on an individual basis in automobile insurance. Astin Bulletin 31: 1–22. [Google Scholar] [CrossRef]
  10. Greenwood, Major, and G. Udny Yule. 1920. An Inquiry into the Nature of Frequency Distributions Representative of Multiple Happenings with Particular Reference to the Occurrence of Multiple Attacks of Disease or of Repeated Accidents. Journal of the Royal Statistical Society A 83: 255–79. [Google Scholar] [CrossRef]
  11. Heilmann, Wolf-Rüdiger. 1989. Decision theoretic foundations of credibility theory. Insurance: Mathematics and Economics 8: 77–95. [Google Scholar] [CrossRef]
  12. Hernawati, Lili, I. Gusti Putu Purnaba, and I. Wayan Mangku. 2017. Comparing bonus malus premiums of two types claim which is assessed using Bayesian method. Applied Mathematical Sciences 11: 1687–94. [Google Scholar] [CrossRef]
  13. Ibiwoye, Ade, I. A. Adeleke, and Sunday A. Aduloju. 2011. Quest for optimal bonus-malus in automobile insurance in developing economies. An Actuarial Perspective. International Business Research 4: 74–83. [Google Scholar] [CrossRef]
  14. Jacob, Azaare, and Zhao Wu. 2020. An Alternative Pricing System through Bayesian Estimates and Method of Moments in a Bonus-Malus Framework for the Ghanaian Auto Insurance Market. Journal of Risk and Financial Management 13: 143. [Google Scholar] [CrossRef]
  15. Landsman, Zinoviy M., and Udi E. Makov. 1998. Exponential dispersion models and credibility. Scandinavian Actuarial Journal 1: 89–96. [Google Scholar] [CrossRef]
  16. Lemaire, Jean. 1976. Driver versus company: Optimal behaviour of the policyholder. Scandinavian Actuarial Journal 4: 209–19. [Google Scholar] [CrossRef]
  17. Lemaire, Jean. 1977. La soif du bonus. Astin Bulletin 9: 181–90. [Google Scholar] [CrossRef]
  18. Lemaire, Jean. 1985. Analysis of the Hunger for Bonus. In Automobile Insurance. Huebner International Series on Risk, Insurance and Economic Security; New York and Dordrecht: Springer, pp. 183–74. [Google Scholar]
  19. Lemaire, Jean. 1995. Bonus-malus systems in automobile insurance. Insurance: Mathematics and Economics 3: 277. [Google Scholar]
  20. Mert, Mehmet, and Yasemin Saykan. 2005. On a bonus-malus system where the claim frequency distribution is geometric and the claim severity distribution is Pareto. Hacettepe Journal of Mathematics and Statistics 34: 75–81. [Google Scholar]
  21. Moumeesri, Adisak, and Watcharin Klongdee. 2019. The Maximum Durability Problem for Investing in Gold Market. WSEAS Transactions on Business and Economics 16: 68–77. [Google Scholar]
  22. Moumeesri, Adisak, Watcharin Klongdee, and Tippatai Pongsart. 2020. Bayesian Bonus-Malus Premium with Poisson-Lindley Distributed Claim Frequency and Lognormal-Gamma Distributed Claim Severity in Automobile Insurance. WSEAS Transactions on Mathematics 19: 443–51. [Google Scholar] [CrossRef]
  23. Ni, Weihong, Corina Constantinescu, and Athanasios Pantelous. 2014. Bonus–Malus systems with Weibull distributed claim severities. Annals of Actuarial Science 8: 217–33. [Google Scholar] [CrossRef]
  24. Oh, Rosy, Peng Shi, and Jae Youn Ahn. 2019. Bonus-Malus premiums under the dependent frequency-severity modeling. Scandinavian Actuarial Journal 2020: 172–95. [Google Scholar] [CrossRef]
  25. Pongsart, Tippatai, Adisak Moumeesri, Tidadeaw Mayureesawan, and Wikanda Phaphan. 2022. Computing Bayesian Bonus-Malus Premium Distinguishing Among Different Multiple Types of Claims. Lobachevskii Journal of Mathematics 43: 148–57. [Google Scholar] [CrossRef]
  26. Santi, D. N., I. G. P. Purnaba, and I. W. Mangku. 2016. Bonus-Malus System with the Claim Frequency Distribution is Geometric and the Severity Distribution is Truncated Weibull. IOP Conference Series: Earth and Environmental Science 31: 012006. [Google Scholar] [CrossRef]
  27. Tremblay, Luc. 1992. Using the Poisson inverse Gaussian in bonus-malus systems. Astin Bulletin 22: 97–106. [Google Scholar] [CrossRef] [Green Version]
  28. Tzougas, George. 2020. EM estimation for the Poisson-Inverse Gamma regression model with varying dispersion: An application to insurance ratemaking. Risks 8: 97. [Google Scholar] [CrossRef]
  29. Tzougas, George, and Nicholas Frangos. 2014. The Design of an Optimal Bonus-Malus System Based on the Sichel Distribution. Cham: Springer International Publishing. [Google Scholar]
  30. Tzougas, George, Dimitris Karlis, and Nicholas Frangos. 2017. Confidence intervals of the premiums of optimal Bonus Malus Systems. Scandinavian Actuarial Journal 2: 129–44. [Google Scholar]
  31. Tzougas, George, Spyridon Vrontos, and Nicholas Frangos. 2015. Optimal Bonus-Malus Systems using finite mixture models. Astin Bulletin 44: 417–44. [Google Scholar] [CrossRef]
  32. Tzougas, George, Spyridon Vrontos, and Nicholas Frangos. 2018. Bonus-Malus systems with two component mixture models arising from different parametric families. North American Actuarial Journal 22: 59–91. [Google Scholar] [CrossRef]
  33. Tzougas, George, Wei Li Hoon, and Jun Ming Lim. 2019. The negative binomial-inverse Gaussian regression model with an application to insurance ratemaking. European Actuarial Journal 9: 323–44. [Google Scholar] [CrossRef]
  34. Tzougas, George, Woo Hee Yik, and Muhammad Waqar Mustaqeem. 2020. Insurance ratemaking using the Exponential-Lognormal regression model. Annals of Actuarial Science 14: 42–71. [Google Scholar] [CrossRef]
  35. Walhin, Jean-François, and Jose Paris. 1999. Using mixed Poisson processes in connection with bonus-malus systems. Astin Bulletin 29: 81–99. [Google Scholar] [CrossRef]
  36. Young, Virginia R. 2000. Credibility using semiparametric models and a loss function with a constancy penalty. Insurance: Mathematics and Economics 26: 151–56. [Google Scholar] [CrossRef]
Figure 1. Gamma-Lindley distribution density plots.
Figure 1. Gamma-Lindley distribution density plots.
Risks 10 00181 g001
Figure 2. Histogram of the data set.
Figure 2. Histogram of the data set.
Risks 10 00181 g002
Figure 3. Histogram (log scale) of the data set.
Figure 3. Histogram (log scale) of the data set.
Risks 10 00181 g003
Figure 4. P-P plot of gamma-Lindley distribution.
Figure 4. P-P plot of gamma-Lindley distribution.
Risks 10 00181 g004
Figure 5. Comparison of the premiums between the traditional Poisson-Lindley model and the proposed Poisson binomial-Lindley beta model in the first year.
Figure 5. Comparison of the premiums between the traditional Poisson-Lindley model and the proposed Poisson binomial-Lindley beta model in the first year.
Risks 10 00181 g005
Figure 6. The premiums based on Poisson binomial-Lindley beta for frequency distribution and gamma-Lindley for severity distribution with various number of claims and claim sizes in the first year.
Figure 6. The premiums based on Poisson binomial-Lindley beta for frequency distribution and gamma-Lindley for severity distribution with various number of claims and claim sizes in the first year.
Risks 10 00181 g006
Table 1. Observed claim frequency for the number of claims ( k ) and the total number of claims with claim size larger than a limit value ( z ) with Φ = 500.
Table 1. Observed claim frequency for the number of claims ( k ) and the total number of claims with claim size larger than a limit value ( z ) with Φ = 500.
k z Total
01234
063,232 63,232
118402493 4333
237117117 271
31557 18
4001012
Total65,11026151237167,856
Table 2. Observed frequency and expected frequency for estimated parameter values for the number of claims ( k ) and the total number of claims with claim size larger than a limit value ( z ) with Φ = 500.
Table 2. Observed frequency and expected frequency for estimated parameter values for the number of claims ( k ) and the total number of claims with claim size larger than a limit value ( z ) with Φ = 500.
k z Observed FrequencyExpected Frequency
Poisson Binomial-Exponential BetaPoisson Binomial-Lindley Beta
0063,23263,262.476063,234.5099
1018401768.26681795.7044
1124932514.29782512.0406
203759.165459.6707
21117121.0755124.5243
22117109.6681108.3118
3012.22622.2053
3155.33705.5043
3256.95737.1421
3375.10494.9526
4000.09100.0880
4100.23870.2436
4210.36450.3780
4300.38490.3910
4410.24930.2367
Total67,85667,85667,856
Estimated parameter values α ^ = 3.6490
β ^ = 2.5663
λ ^ = 13.7721
α ^ = 4.1061
β ^ = 2.9352
δ ^ = 14.5654
χ 2 18.069617.4035
AIC42,586.241442,582.3175
Table 3. Bonus-malus premiums based on the Poisson binomial-Lindley beta distribution for the frequency component where N represents the total number of claims and M represents those that the claim size is larger than Φ = 500.
Table 3. Bonus-malus premiums based on the Poisson binomial-Lindley beta distribution for the frequency component where N represents the total number of claims and M represents those that the claim size is larger than Φ = 500.
Number of Claims t
01234567
N = 0 M = 0100.0093.2387.3282.1177.4973.3569.6466.28
N = 1 M = 0 182.92171.38161.20152.16144.07136.80130.22
M = 1 187.97176.11165.65156.35148.04140.57133.81
N = 2 M = 0 270.16253.19238.21224.90212.98202.26192.56
M = 1 276.87259.48244.13230.48218.27207.28197.35
M = 2 283.58265.77250.04236.07223.56212.31202.13
N = 3 M = 0 355.63333.37313.72296.24280.59266.50253.76
M = 1 363.66340.90320.80302.93286.93272.52259.49
M = 2 371.70348.43327.89309.62293.26278.54265.22
M = 3 379.73355.96334.97316.31299.60284.56270.95
N = 4 M = 0 439.76412.32388.09366.52347.21329.82314.08
M = 1 448.87420.87396.13374.12354.41336.66320.59
M = 2 457.98429.41404.17381.71361.60343.49327.10
M = 3 467.09437.95412.21389.30368.79350.32333.60
M = 4 476.20446.49420.25396.89375.98357.16340.11
Table 4. Bonus-malus premiums based on the Poisson binomial-exponential beta distribution for the frequency component where N represents the total number of claims and M represents those that the claim size is larger than Φ = 500.
Table 4. Bonus-malus premiums based on the Poisson binomial-exponential beta distribution for the frequency component where N represents the total number of claims and M represents those that the claim size is larger than Φ = 500.
Number of Claims t
01234567
N = 0 M = 0100.0093.2387.3282.1177.4973.3669.6566.30
N = 1 M = 0 183.15171.54161.31152.24144.13136.84130.25
M = 1 188.79176.82166.28156.92148.56141.05134.26
N = 2 M = 0 270.98253.80238.66225.23213.24202.45192.71
M = 1 278.40260.75245.20231.40219.08208.00197.98
M = 2 285.82267.70251.74237.57224.92213.54203.26
N = 3 M = 0 357.38334.72314.77297.06281.23267.01254.15
M = 1 366.21342.99322.54304.39288.17273.60260.43
M = 2 375.03351.25330.31311.72295.12280.19266.70
M = 3 383.85359.51338.08319.05302.06286.78272.98
N = 4 M = 0 442.79414.72389.99368.05348.44330.82314.89
M = 1 452.74424.03398.75376.31356.27338.25321.97
M = 2 462.69433.35407.51384.58364.10345.68329.04
M = 3 472.63442.67416.27392.85371.92353.11336.11
M = 4 482.58451.99425.04401.12379.75360.55343.19
Table 5. Bonus-malus premiums based on Poisson binomial-Lindley beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 400.
Table 5. Bonus-malus premiums based on Poisson binomial-Lindley beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 400.
Number of Claims t
01234567
N = 0 M = 080.5275.0770.3166.1262.3959.0656.0753.37
N = 1 M = 0 103.5797.0491.2786.1581.5777.4673.73
M = 1 106.4399.7193.7988.5383.8279.5975.76
N = 2 M = 0 105.8799.2293.3588.1383.4779.2675.46
M = 1 108.50101.6995.6790.3285.5481.2377.34
M = 2 111.13104.1597.9992.5187.6183.2079.21
N = 3 M = 0 106.5699.8994.0088.7684.0879.8576.04
M = 1 108.97102.1596.1290.7785.9781.6677.75
M = 2 111.37104.4098.2592.7787.8783.4679.47
M = 3 113.78106.66100.3794.7889.7785.2781.19
N = 4 M = 0 106.66100.0194.1388.9084.2180.0076.18
M = 1 108.87102.0896.0890.7485.9681.6577.76
M = 2 111.08104.1598.0392.5887.7083.3179.34
M = 3 113.29106.2299.9894.4289.4584.9780.91
M = 4 115.50108.29101.9396.2691.1986.6382.49
Table 6. Bonus-malus premiums based on Poisson binomial-Lindley beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 1500.
Table 6. Bonus-malus premiums based on Poisson binomial-Lindley beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 1500.
Number of Claims t
01234567
N = 0 M = 080.5275.0770.3166.1262.3959.0656.0753.37
N = 1 M = 0 163.52153.20144.10136.02128.79122.29116.40
M = 1 168.03157.42148.07139.77132.34125.66119.61
N = 2 M = 0 167.15156.65147.38139.14131.77125.14119.14
M = 1 171.30160.54151.04142.60135.05128.25122.10
M = 2 175.45164.43154.70146.06138.32131.36125.06
N = 3 M = 0 168.23157.71148.41140.14132.74126.07120.04
M = 1 172.03161.27151.76143.30135.73128.92122.75
M = 2 175.83164.83155.11146.47138.73131.77125.47
M = 3 179.64168.39158.46149.63141.73134.62128.18
N = 4 M = 0 168.39157.89148.61140.35132.96126.30120.27
M = 1 171.88161.16151.69143.26135.71128.91122.76
M = 2 175.37164.43154.76146.16138.46131.53125.25
M = 3 178.86167.70157.84149.07141.22134.15127.74
M = 4 182.35170.97160.92151.98143.97136.76130.24
Table 7. Bonus-malus premiums based on Poisson binomial-Lindley beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 2500.
Table 7. Bonus-malus premiums based on Poisson binomial-Lindley beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 2500.
Number of Claims t
01234567
N = 0 M = 080.5275.0770.3166.1262.3959.0656.0753.37
N = 1 M = 0 218.01204.26192.13181.35171.71163.04155.20
M = 1 224.02209.89197.42186.35176.44167.53159.48
N = 2 M = 0 222.86208.86196.50185.52175.69166.85158.85
M = 1 228.39214.04201.38190.12180.05170.99162.79
M = 2 233.93219.23206.26194.73184.42175.13166.73
N = 3 M = 0 224.30210.26197.87186.84176.97168.09160.05
M = 1 229.37215.01202.34191.06180.97171.89163.66
M = 2 234.44219.76206.81195.28184.97175.68167.28
M = 3 239.50224.51211.28199.50188.97179.48170.89
N = 4 M = 0 224.52210.51198.13187.12177.27168.39160.35
M = 1 229.17214.87202.24191.00180.94171.88163.67
M = 2 233.82219.23206.34194.88184.61175.37167.00
M = 3 238.47223.59210.45198.75188.28178.85170.32
M = 4 243.12227.95214.55202.63191.96182.34173.64
Table 8. Bonus-malus premiums based on Poisson binomial-exponential beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 400.
Table 8. Bonus-malus premiums based on Poisson binomial-exponential beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 400.
Number of Claims t
01234567
N = 0 M = 080.0974.6769.9365.7662.0658.7655.7853.10
N = 1 M = 0 103.1496.6090.8485.7381.1777.0673.35
M = 1 106.3299.5893.6488.3783.6679.4375.61
N = 2 M = 0 105.6298.9293.0387.7983.1178.9175.11
M = 1 108.51101.6395.5790.2085.3981.0777.17
M = 2 111.41104.3498.1292.6087.6783.2379.23
N = 3 M = 0 106.5199.7693.8188.5383.8179.5875.74
M = 1 109.14102.2296.1290.7285.8881.5477.61
M = 2 111.77104.6898.4492.9087.9583.5079.48
M = 3 114.40107.14100.7695.0990.0285.4781.35
N = 4 M = 0 106.82100.0594.0888.7984.0679.8175.96
M = 1 109.22102.2996.1990.7885.9581.6077.67
M = 2 111.62104.5498.3192.7887.8383.3979.38
M = 3 114.02106.79100.4294.7789.7285.1881.08
M = 4 116.42109.04102.5496.7791.6186.9882.79
Table 9. Bonus-malus premiums based on Poisson binomial-exponential beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 1500.
Table 9. Bonus-malus premiums based on Poisson binomial-exponential beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 1500.
Number of Claims t
01234567
N = 0 M = 080.0974.6769.9365.7662.0658.7655.7853.10
N = 1 M = 0 162.84152.52143.42135.35128.14121.66115.80
M = 1 167.85157.21147.83139.52132.08125.40119.37
N = 2 M = 0 166.75156.18146.87138.60131.22124.58118.59
M = 1 171.32160.46150.89142.40134.81128.00121.83
M = 2 175.89164.73154.91146.20138.41131.41125.08
N = 3 M = 0 168.15157.49148.10139.77132.32125.63119.58
M = 1 172.31161.38151.76143.22135.59128.73122.54
M = 2 176.46165.27155.41146.67138.86131.83125.49
M = 3 180.61169.16159.07150.12142.12134.93128.44
N = 4 M = 0 168.64157.95148.53140.17132.71126.00119.93
M = 1 172.43161.50151.87143.32135.69128.83122.62
M = 2 176.22165.05155.21146.47138.67131.66125.32
M = 3 180.01168.59158.54149.62141.65134.49128.01
M = 4 183.80172.14161.88152.77144.63137.32130.71
Table 10. Bonus-malus premiums based on Poisson binomial-exponential beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 2500.
Table 10. Bonus-malus premiums based on Poisson binomial-exponential beta distribution for the frequency component and gamma-Lindley distribution for the severity component with a total claim size of S = 2500.
Number of Claims t
01234567
N = 0 M = 080.0974.6769.9365.7662.0658.7655.7853.10
N = 1 M = 0 217.11203.35191.22180.46170.85162.21154.40
M = 1 223.79209.60197.10186.01176.10167.20159.15
N = 2 M = 0 222.32208.23195.81184.80174.95166.10158.11
M = 1 228.41213.93201.18189.86179.74170.65162.44
M = 2 234.50219.64206.54194.92184.53175.20166.77
N = 3 M = 0 224.20209.98197.46186.35176.42167.50159.44
M = 1 229.73215.16202.34190.95180.78171.64163.37
M = 2 235.26220.35207.21195.55185.13175.77167.31
M = 3 240.80225.53212.08200.15189.49179.90171.24
N = 4 M = 0 224.84210.59198.03186.89176.93167.99159.90
M = 1 229.90215.32202.48191.09180.91171.76163.49
M = 2 234.95220.05206.93195.29184.88175.53167.08
M = 3 240.00224.78211.38199.49188.86179.31170.68
M = 4 245.05229.51215.83203.69192.83183.08174.27
Table 11. Comparison of premiums for various number of claims and claim sizes in the first year.
Table 11. Comparison of premiums for various number of claims and claim sizes in the first year.
Claim SizeNumber of Claims
N = 1N = 2N = 3N = 4
M = 0M = 1M = 0M = 1M = 2M = 0M = 1M = 2M = 3M = 0M = 1M = 2M = 3M = 4
1008790899194909294969092949597
1000136140139143146140143147150140143146149152
5000354364362371380364373381389365372380387395
10,000627644641657672645659674689645659672686699
20,00011721204119812271257120512331260128712071232125712821307
30,00017171764175517981842176618061846188617681804184118781914
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Moumeesri, A.; Pongsart, T. Bonus-Malus Premiums Based on Claim Frequency and the Size of Claims. Risks 2022, 10, 181. https://doi.org/10.3390/risks10090181

AMA Style

Moumeesri A, Pongsart T. Bonus-Malus Premiums Based on Claim Frequency and the Size of Claims. Risks. 2022; 10(9):181. https://doi.org/10.3390/risks10090181

Chicago/Turabian Style

Moumeesri, Adisak, and Tippatai Pongsart. 2022. "Bonus-Malus Premiums Based on Claim Frequency and the Size of Claims" Risks 10, no. 9: 181. https://doi.org/10.3390/risks10090181

APA Style

Moumeesri, A., & Pongsart, T. (2022). Bonus-Malus Premiums Based on Claim Frequency and the Size of Claims. Risks, 10(9), 181. https://doi.org/10.3390/risks10090181

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop