Abstract
Often, the claims reserves exceed the available equity of non-life insurance companies and a change in the claims reserves by a small percentage has a large impact on the annual accounts. Therefore, it is of vital importance for any non-life insurer to handle claims reserving appropriately. Although claims data are time series data, the majority of the proposed (stochastic) claims reserving methods is not based on time series models. Among the time series models, state space models combined with Kalman filter learning algorithms have proven to be very advantageous as they provide high flexibility in modeling and an accurate detection of the temporal dynamics of a system. Against this backdrop, this paper aims to provide a comprehensive review of stochastic claims reserving methods that have been developed and analyzed in the context of state space representations. For this purpose, relevant articles are collected and categorized, and the contents are explained in detail and subjected to a conceptual comparison.
  1. Introduction
1.1. The Importance of Claims Reserving in Non-Life Insurance
The insurance industry offers a wide range of products that enable policyholders to insure themselves against almost any form of loss. Insurance companies therefore differentiate their products according to various criteria. In this paper, we focus on the problem of claims reserving for a branch of insurance products known as Non-Life Insurance (Continental Europe), General Insurance (United Kingdom) and Property and Casualty Insurance (USA). While this branch encompasses all insurance products that are different from life insurance, life insurance includes only life-related products and disability insurance (see ). There are two reasons for this separation. On the one hand, life and non-life products differ considerably, which is mainly reflected in the contract terms, types of claims and risk drivers. This also explains why different stochastic models and methods are used in the two branches. On the other hand, in many countries (such as Germany or Switzerland), there is a strict legal separation between life and non-life. A non-life insurer is therefore prohibited from offering life products, and vice versa. For this reason, it is not uncommon for insurance corporations to establish separate companies in order to sell products from both branches. The following lines of business belong to the non-life insurance branch: motor/car insurance, property insurance, liability insurance, accident insurance, health insurance, marine insurance, and other insurance products such as aviation, credit insurance, epidemic insurance, legal protection, travel insurance, and so on (see ).
The amount of money that a policyholder has to pay to the insurer for insurance coverage is called the premium. By paying a premium, the policyholder under an insurance policy transfers the risk to the insurer (risk transfer), who has to compensate/settle the potential loss occurring under the contract via corresponding claims payments (in whole or in part). This practice represents the insurance principle of non-life insurance. Thus, in contrast to life insurance, non-life insurance is loss insurance, i.e., payments are made by the insurer to the policyholder only in the event of a specific loss.
At the end of each fiscal year, the insurer is confronted with the situation in which the premiums are known, but the claim amount is unknown. This uncertainty of the total loss liabilities is mainly due to (1) a reporting delay, (2) a long-lasting claim settlement, and (3) the unexpected re-opening of a closed claim (see ). Therefore, appropriate claims reserves for the outstanding loss liabilities have to be calculated by the responsible actuary. Since these loss reserves are often the largest share on the liability side of the balance sheet, adequate claims reserving is required, that is, forecasting these liabilities and quantifying their uncertainty is a key actuarial issue (see ).
Although claims data are time series data, the majority of the proposed (stochastic) claims reserving methods is not based on time series models. Among the time series models, state space models combined with Kalman filter learning algorithms have proven to be very advantageous as they provide high flexibility in modeling and an accurate detection of the temporal dynamics of a system (see ). Against this backdrop, this paper aims to provide a comprehensive review of stochastic claims reserving methods that have been developed and analyzed in the context of state space representations. For this purpose, relevant articles are collected and categorized, and their contents are explained in detail and subjected to a conceptual comparison.
1.2. State Space Models in the Claims Reserving Literature
The actuarial literature contains various articles in which state space models and the Kalman filter learning algorithms are applied to improve stochastic claims reserving (see ). As a pioneer,  () constructed a state space model for the payment stream of incremental payments, took business volume and inflation indices into account, and presented a method to estimate the states underlying the observations of the upper triangle and to predict the outstanding loss liabilities of the lower triangle. Afterwards,  () used the relationship between the two-way ANOVA and the Chain Ladder (CL) method to establish a state space model for the so-called linear CL model.  () constructed a model for incremental payments and employed the state space approach to model variations in parameters across different accident years.  () extended the state space model of  () to weaken the homogeneity property of the CL method, which allows for development factors that do not necessarily have to be identical across all accident years.  () considered different recursive representations, including state space models based on the general form introduced by  () and discussed calendar year effects in claims development triangles.
 () presented four models for Reported But Not Settled (RBNS) claims, including state space models following  (, and ).  () proposed a univariate distribution-free state space model, where incremental payments are modeled as a function of payments of the first development year, i.e., the accident year itself.  () discussed a generalized Kalman filter that accounts for non-linearities in the observation equation.  () considered the so-called development correlation model, which is a (state space) model that accounts for correlations between individual development factors in the first two development years. In addition,  () not only discussed the development correlation model, but two further approaches taking correlations related to accident and calendar years into account.
 () compared various claims reserving methods including the state space model of  (). A completely different approach from the previous articles is taken by  (), who did not model the Incurred But Not Reported (IBNR) run-off data in chronological form, but as a univariate time series with missing observations.  () combined the approach of  () and  () and included an additional lag of the state vector into the state equation.  () presented a scalar state space model for cumulative payments. Most recently,  () and  () extended the row-wise stacking approach from  () through the inclusion of tail effects and multivariate considerations that allow for dependency modeling between correlated lines of business, respectively.
1.3. Categorization of Articles and Organization of the Paper
Figure 1 shows the history of the considered articles in stochastic claims reserving. All articles are ordered chronologically and classified into five categories according to the similarity of their contents: “Parametric evolution”, “Log-normal model”, “Correlation models”, “Univariate models”, and “Row-wise stacking”. These categories are not mutually exclusive; each article is assigned to a category according to the main approach it uses. The first category includes the articles by  (),  (),  (),  (), and  (), as they are based on the assumption of a parametric evolution of the run-off data across the development years. The second category includes the articles by  (, ),  (),  () because of the considered log-normal model for incremental payments. The third category consists of the articles by  (, and ) who discusses three types of models that incorporate correlations within claims development triangles. In the fourth category, there are the articles by  () and  (), where models are presented that avoid complex matrix-based structures. Finally, the fifth category includes the articles by  (),  (), and  (), who propose a row-wise stacking of the claims data and associated state space representations. The solid arrows in Figure 1 represent content-related similarities among the papers in their modeling approaches. The dashed arrows, in contrast, indicate that the respective state space models are included in papers where different stochastic claims reserving methods are compared (see ; ). In addition, state space models and the Kalman filter learning algorithms are discussed in the context of stochastic claims reserving in standard textbooks such as  ().
      
    
Figure 1. Chronology and categorization of the papers.
  
In the following, the articles are presented category by category. Within each of the five categories, the individual articles are presented in chronological order. For the sake of consistency, a unified notation is used throughout the paper. Since this paper is devoted to state space representations, all essential contents concerning state space models are presented in the following, whereas less relevant contents are omitted or only referenced. In particular, the state space representations given in the articles are developed in full detail, often in much more detail than in the original papers.
The paper is organized as follows. In Section 2, articles are discussed that are based on the assumption of a parametric evolution of the claims data across development years (Category 1). Section 3 presents articles in which incremental payments are assumed to be log-normally distributed and are modeled using a log-normal model (Category 2). Section 4 includes articles where correlation models are considered (Category 3). In Section 5, state space models are presented that have a scalar structure (Category 4). Section 6 contains articles where the row-wise stacking approach is considered to re-organize the claims data (Category 5). Subsequently, Section 7 provides a conceptual comparison of the presented approaches and state space representations. In Section 8, concluding remarks are given.
2. Parametric Evolution of Claims Data (Category 1)
In this section, we present papers that are based on the assumption of a parametric evolution of the claims data across development years:
- ▸ (): Claims Reserving, State-Space Models and the Kalman Filter;
- ⊳ (): A Stochastic Method for Claims Reserving in General Insurance;
- ⊳ (): Kalman Filters with Applications to Loss Reserving;
- ▸ (): Loss Reserving: Past, Present and Future;
- ▸ (): Application of State Space Model in Outstanding Claims Reserve.
 
The three articles marked with ▸ are mainly based on state space models and Kalman filter learning theory and are therefore presented in detail. The models of the two articles marked with ⊳ are treated more briefly, as state space models are not the focus of their methodologies.
2.1. Claims Reserving, State Space Models and the Kalman Filter
 () laid the foundation for the use of state space models and the Kalman filter in stochastic claims reserving with their article “Claims Reserving, State-Space Models and the Kalman Filter”. The proposed state space model is constructed for the payment stream of the incremental payments and presumes known, time-varying system matrices.
 Modeling the payment stream of incremental payments
The modeling is based on claims development triangles in which incremental payments  are given for accident years  and development years . The payment stream of incremental payments is modeled with increasing development year  and decreasing accident year  for a fixed calendar year  via
        
      
        
      
      
      
      
    
        see also Figure 2. Here, the quantity  is generally the expected claim payment to be made in accident year i and development year j of the t-th calendar year, and  is a noise term with .
      
    
Figure 2. Modeling the payment stream of incremental payments.
  
 () propose an optional modification of (1) by including additional information such as the volume of business transacted in each accident year and the inflation factor for each calendar year. To this end, let  denote an appropriate index for the volume of business transacted in accident year i and  denote an appropriate price index for payments in the t-th calendar year. Using both these quantities, (1) can be extended to
        
      
        
      
      
      
      
    
        where  is the expected value of the inflation-adjusted and volume-weighted incremental payments in accident year i and development year j of calendar year t.
 Development of an appropriate state space representation
The modeling of the payment stream via (1) and (2) is promising with respect to the construction of an appropriate observation and state equation of a state space model, respectively. The following discussion in this regard is based on (1), but can be applied to (2) with minor modifications. In the first step of modeling the observation equation, (1) is transferred into a vector representation in such a way that  represents the vector of observations  of the t-th calendar year,  forms the vector of expected claims payments , and  is the vector of noise terms  with . Thus, the incremental payments made in calendar year t can be specified via
        
      
        
      
      
      
      
    
        or briefly as . In the second step, the vector  is to be modeled in such a way that it is obtained by the product of a system matrix  and a state vector . For this purpose,  () take  for a given accident year i as a function depending on the development year j and thus construct for each accident year a distributed lag model of the form
        
      
        
      
      
      
      
    
        where  are known functions in j and  are unknown parameters depending on the respective accident year i.  () justified the approach (4) by an overall smooth evolution of  characterized by a firstly increasing and then decreasing behavior in j for a given accident year i. A variation of (4) for  is the so-called Hoerl curve
      
        
      
      
      
      
    
        which  () use in their empirical application example. In addition, (4) can be easily transferred into vector notation by using
        
      
        
      
      
      
      
    
        as follows:
      
        
      
      
      
      
    
Substituting (7) into (3) then gives
        
      
        
      
      
      
      
    
        or in a more compact form
        
      
        
      
      
      
      
    
        with  and
        
      
        
      
      
      
      
    
        for all . Thus, given , , the system matrix  is a known time-varying diagonal matrix, and the state vector  contains unknown parameter vectors  for . Assuming a Hoerl curve according to (5), the observation Equation (9) of the t-th calendar year results in (due to ):
      
        
      
      
      
      
    
Subsequently,  () specify an appropriate state equation, in which they establish a connection between the state vector  of the t-th calendar year and the state vector  of the -th calendar year. The basic idea is again to model a smooth evolution, but in a slightly different form than in (4). The starting point is the sequence , but with the difference that the accident years i are now varied for a fixed development year j, whereas previously the development years j were varied for a fixed calendar year t (see Figure 3).
      
    
Figure 3. Sequences  for a given development year j.
  
For a given development year j,  () propose modeling  via
        
      
        
      
      
      
      
    
        with , where  is a noise term with . Thus, in contrast to (4),  is not modeled in a deterministic way but as a random variable. Further, they assume that the conditional expected value on the right-hand side of (10) is a polynomial in i of degree  that passes through . This leads to
        
      
        
      
      
      
      
    
        with known
        
      
        
      
      
      
      
    
        for . Substituting (7) on both sides into (11) for  yields
        
      
        
      
      
      
      
    
        where the ()-dimensional matrix  and the p-dimensional vector  are given by
        
      
        
      
      
      
      
    
        respectively. If both sides of Equation (12) are multiplied from the left by the inverse  of the matrix  (the existence of the inverse is ensured, see ), one obtains
        
      
        
      
      
      
      
    
Transferring (13) into matrix notation, we obtain
        
      
        
      
      
      
      
    
        or in a more compact form
        
      
        
      
      
      
      
    
        with  and
        
      
        
      
      
      
      
    
        as well as  for all . The identity matrices , zero matrices  and scalar matrices  with  in (14) are each of dimension . Note also that the system matrices  and  are known in the state Equation (15).
A variation of the state Equation (15) is given for  (i.e., assuming a Hoerl curve as in (5)) and the parameters  of different accident years  are connected by a random walk
        
      
        
      
      
      
      
    
        that is, , , . Since we have , the relation  holds. For this reason,  () aim to obtain  and thus a state equation in the form of the random walk (16), i.e., they choose without loss of generality the fixed development year .
With respect to (10) and (13), the use of (16) implies
        
      
        
      
      
      
      
    
        for all . Accordingly, it follows for the system matrix  that it has the value one at positions  and zeros otherwise, while  corresponds to a t-dimensional unit vector with the value one at position . The state Equation (15) thus simplifies to:
      
        
      
      
      
      
    
Table 1 gives an overview of the dimensions of vectors and matrices in the state space model of  ().
       
    
Table 1. Dimensions in the state space model of  ().
  
If one intends to model the observation and state equations by using (2) instead of (1), there are only changes in the observation Equation (9), while the state Equation (15) remains unchanged: each row  of the system matrix  has to be multiplied by a weighting factor consisting of volume and inflation indices, i.e., by .
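The construction above can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a hypothetical three-parameter basis `(1, log(j+1), j)` as a stand-in for the Hoerl curve in (5), development years counted from 0, observations of calendar year t ordered by accident year, and a homoscedastic observation noise variance `sigma2`. All function names are illustrative. The state vector grows by one parameter block per calendar year, in line with the random walk (16), and the observation matrix is block-structured as in (9).

```python
import numpy as np

P_DIM = 3  # assumed number of curve parameters per accident year


def basis(j):
    """Hypothetical Hoerl-type basis in development year j = 0, 1, ... (an assumption)."""
    return np.array([1.0, np.log(j + 1.0), float(j)])


def observation_matrix(t):
    """System matrix of calendar year t: row i maps the parameters of
    accident year i to its expected payment in development year t - i."""
    H = np.zeros((t, t * P_DIM))
    for i in range(1, t + 1):
        H[i - 1, (i - 1) * P_DIM:i * P_DIM] = basis(t - i)
    return H


def extend_state(x, P, Q):
    """Random-walk step: append the parameters of the new accident year,
    equal to those of the latest accident year plus noise with covariance Q."""
    n = x.size
    T = np.vstack([np.eye(n),
                   np.hstack([np.zeros((P_DIM, n - P_DIM)), np.eye(P_DIM)])])
    G = np.vstack([np.zeros((n, P_DIM)), np.eye(P_DIM)])
    return T @ x, T @ P @ T.T + G @ Q @ G.T


def kalman_update(x, P, y, H, R):
    """Standard Kalman measurement update for y = H x + noise, noise covariance R."""
    S = H @ P @ H.T + R                 # innovation covariance
    K = np.linalg.solve(S, H @ P).T     # gain P H' S^{-1} (P and S symmetric)
    return x + K @ (y - H @ x), (np.eye(x.size) - K @ H) @ P


def run_filter(diagonals, x0, P0, Q, sigma2):
    """Filter the calendar-year diagonals of the upper triangle one by one."""
    x, P = x0.copy(), P0.copy()
    for t, y in enumerate(diagonals, start=1):
        if t > 1:
            x, P = extend_state(x, P, Q)
        x, P = kalman_update(x, P, np.asarray(y, dtype=float),
                             observation_matrix(t), sigma2 * np.eye(t))
    return x, P


# Synthetic example with I = 4 calendar years (made-up parameter values)
rng = np.random.default_rng(0)
beta = np.array([5.0, 1.0, -0.5])
diagonals = [np.array([basis(t - i) @ beta for i in range(1, t + 1)])
             + rng.normal(0.0, 0.1, t) for t in range(1, 5)]
x_filt, P_filt = run_filter(diagonals, np.zeros(P_DIM), 100.0 * np.eye(P_DIM),
                            0.01 * np.eye(P_DIM), 0.01)
```

After processing calendar year I, `x_filt` stacks one parameter block per accident year and `P_filt` is the associated filtered covariance matrix. To use (2) instead of (1), each row of the observation matrix would be scaled by the corresponding volume and inflation weight.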
 Forecasting the outstanding loss liabilities
As the system matrices  are assumed to be known for all , the outstanding loss liabilities for individual and aggregated accident years can be predicted by using  and  in a straightforward way. To this end, all future incremental payments are collected in the vector
        
      
        
      
      
      
      
    
All these future observations belong to one of the accident years , and therefore, they are based on the corresponding state . Accordingly, the state vector  corresponds to the vector  of the current calendar year I, which is why the state Equation (15) is given by  (i.e., , ). The system matrix  of the observation equation is obtained on the basis of (1) similar to that in (8), i.e., it consists mostly of zero vectors, and the entries  with  are ordered such that they are multiplied by the states  from  of the corresponding accident year  of  from . Thus, the future observations can be predicted via
        
      
        
      
      
      
      
    
		(given by (9)) and
        
      
        
      
      
      
      
    
        respectively. The variance–covariance matrix of the prediction error  is given by:
      
        
      
      
      
      
    
Since , ,  are known at time , a prediction of the outstanding loss liabilities for individual and aggregated accident years is straightforward. With respect to the aggregated accident years, all components from  are to be added to the total loss reserve, while for individual accident years only those components from  related to the respective accident year  are to be added. An extraction of these components can be carried out via a diagonal matrix , which has a value of one at the respective positions and otherwise zeros. The variance–covariance matrix belonging to  is thus
        
      
        
      
      
      
      
    
However, if the modified payment stream according to (2) is used, additional uncertainty is induced via the inflation index  of future calendar years , which is unknown at time . This is due to the unknown entries  for  instead of the known entries  in the system matrix .
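The forecasting step for model (1) can be sketched as follows; it is an illustration under stated assumptions rather than the authors' code. The basis `(1, log(j+1), j)`, development years counted from 0, the homoscedastic noise variance `sigma2`, and all function names are hypothetical. The inputs `x` and `P` stand for the filtered state vector and its covariance matrix at calendar year I. The prediction error covariance is taken in the standard form "system matrix times state covariance times transposed system matrix plus observation noise covariance", and individual accident years are extracted with a diagonal selection matrix.

```python
import numpy as np

P_DIM = 3  # assumed number of curve parameters per accident year


def basis(j):
    """Hypothetical Hoerl-type basis in development year j = 0, 1, ... (an assumption)."""
    return np.array([1.0, np.log(j + 1.0), float(j)])


def future_cells(I):
    """Cells (i, j) of the lower triangle: accident years i = 2..I,
    development years j = I - i + 1, ..., I - 1."""
    return [(i, j) for i in range(2, I + 1) for j in range(I - i + 1, I)]


def future_system_matrix(I):
    """Each row picks the parameter block of the accident year of a future cell."""
    cells = future_cells(I)
    H = np.zeros((len(cells), I * P_DIM))
    for row, (i, j) in enumerate(cells):
        H[row, (i - 1) * P_DIM:i * P_DIM] = basis(j)
    return H, cells


def reserve_forecast(x, P, sigma2, I, accident_year=None):
    """Predicted reserve and prediction variance, either for all accident
    years (accident_year=None) or for a single accident year."""
    H, cells = future_system_matrix(I)
    y_hat = H @ x                                      # predicted future payments
    mse = H @ P @ H.T + sigma2 * np.eye(len(cells))    # prediction error covariance
    pick = np.array([1.0 if accident_year in (None, i) else 0.0 for i, _ in cells])
    D = np.diag(pick)                                  # diagonal selection matrix
    return float(np.sum(D @ y_hat)), float(np.sum(D @ mse @ D))


# Example with I = 4 and made-up filtered quantities
I = 4
x = np.tile([5.0, 1.0, -0.5], I)
P = 0.01 * np.eye(I * P_DIM)
total, total_var = reserve_forecast(x, P, sigma2=0.01, I=I)
per_year = [reserve_forecast(x, P, 0.01, I, accident_year=i)[0] for i in range(2, I + 1)]
```

Because the reserve is linear in the predicted payments, the individual reserves in `per_year` add up to `total`. The variances do not add up in general, since payments of different accident years are correlated through the state covariance.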
2.2. A Stochastic Method for Claims Reserving in General Insurance
 () primarily establishes a model for incremental payments that includes a state space approach, where the variation of the parameters is modeled over different accident years. Thus, although the model of  () is not mainly based on state space models and the Kalman filter theory, it embeds them in a model framework as one component. In the following, therefore, the model for incremental payments and the state space model are presented (for further details, see ).
 Construction of the model for claims payments
The modeling is built on development triangles that include incremental payments  in accident years  and development years . The proposed model is based on the assumption that incremental payments  are composed of the sum of  independent and identically distributed (i.i.d.) payments  (which are stochastically independent of ), that is, . Thus,  () uses the collective risk model and  has a mixture distribution (see, e.g., ). The lags j of individual incremental payments  between the accident year of the claim and the actual payment are modeled as i.i.d. random variables, which is why  with  is defined as the probability of payments regarding claims of accident year i in a given development year j. Let the number  of payments for claims of accident year i in development year j be Poisson-distributed with parameter , i.e., ; then, the incremental payments  follow a mixture Poisson distribution. Following the convolution property of the Poisson distribution, the total number of claims payments  of an accident year i also follows a Poisson distribution with parameter
        
      
        
      
      
      
      
    
        where the  for different j are assumed to be stochastically independent random variables and the parameter  serves as a measure for the exposure of accident year i. As for modeling of the probability ,  () gives two alternatives, the stochastic CL and the Hoerl curve model. While in the first alternative it is assumed that the probabilities  are identical over all accident years i, the second alternative (preferred by ) provides a modeling via a Hoerl curve of the form
        
      
        
      
      
      
      
    
        with constants ,  and  to be estimated and  and  as functions depending on j. Using (17), the expected value and variance of  are as follows:
      
        
      
      
      
      
    
In addition to the number  of payments,  () also models the amount of individual payments  for claims of an accident year i in the j-th development year, which, like the , are also assumed to be stochastically independent for various j. The first two moments of  are modeled distribution-free with help of
        
      
        
      
      
      
      
    
        with proper (unknown) constants , ,  and inflation parameter . While such a modeling of the expected value with different  and K provides a variety of possibilities, the modeling of the variance results from the assumption that the coefficient of variation
        
      
        
      
      
      
      
    
        is time-invariant and corresponds to . The optional term  in (19) with
        
      
        
      
      
      
      
    
        and  as the average annual inflation rate between calendar years  and k, on the other hand, are used to account for inflation; i.e.,  reflects the inflation factor from the first calendar year to calendar year . However,  () proposes using
        
      
        
      
      
      
      
    
        and therefore assumes a constant inflation rate .
Considering (18)–(20), and using the moments of the mixture Poisson distribution, the expected value and variance of the incremental payments  in  are obtained via
        
      
        
      
      
      
      
    
        and
        
      
        
      
      
      
      
    
        where  are stochastically independent for different j due to the assumptions regarding  and . Moreover,  () normalizes the incremental payments  with the help of
        
      
        
      
      
      
      
    
        with exposure defined by
        
      
        
      
      
      
      
    
By using (17), (21), (23), (24), the expected value  of the normalized incremental payments  can be stated as follows:
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
    
Assuming that  and  are known, one obtains a generalized linear model of the form
        
      
        
      
      
      
      
    
        with the exponential response function , linear predictor  consisting of
        
      
        
      
      
      
      
    
        and noise term  with
        
      
        
      
      
      
      
    
        where the parameter estimators  and variance–covariance matrices  can be determined for all i using the Fisher scoring algorithm such that  is approximately satisfied. However, since  and  are usually unknown,  () proposes an iterative approach using parameter initializations to determine initial values for  and . Considering this approach, all accident years are run sequentially and the results of all accident years are subsequently used to obtain new estimates of the parameters for the next run.
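The moment calculations for the Poisson sum of individual payments can be checked numerically. The sketch below relies on the standard identities for a compound Poisson sum, E[X] = λ·E[Y] and Var[X] = λ·E[Y²], with E[Y²] = E[Y]²·(1 + CV²) for a severity with coefficient of variation CV. The source models the first two moments of the individual payments distribution-free; the gamma severity used for the simulation, the function names, and the parameter values are assumptions made only for illustration.

```python
import numpy as np


def compound_poisson_moments(lam, mean_y, cv_y):
    """Mean and variance of X = Y_1 + ... + Y_N with N ~ Poisson(lam) and
    i.i.d. payments Y_k with mean mean_y and coefficient of variation cv_y."""
    second_moment = mean_y ** 2 * (1.0 + cv_y ** 2)  # E[Y^2]
    return lam * mean_y, lam * second_moment


def simulate_incremental_payments(lam, mean_y, cv_y, n_sim, seed=0):
    """Simulate X with gamma-distributed payments (an illustrative assumption)."""
    rng = np.random.default_rng(seed)
    shape, scale = 1.0 / cv_y ** 2, mean_y * cv_y ** 2
    counts = rng.poisson(lam, size=n_sim)
    # A sum of N i.i.d. gamma(shape, scale) variables is gamma(N * shape, scale);
    # cells with N = 0 are set to zero afterwards.
    totals = rng.gamma(np.maximum(counts, 1) * shape, scale)
    return np.where(counts > 0, totals, 0.0)


theory_mean, theory_var = compound_poisson_moments(lam=20.0, mean_y=100.0, cv_y=0.5)
sample = simulate_incremental_payments(20.0, 100.0, 0.5, n_sim=200_000)
print(theory_mean, sample.mean())
print(theory_var, sample.var())
```

With a time-invariant coefficient of variation, the variance of an incremental payment is determined by the Poisson parameter and the mean individual payment alone, which is what makes the distribution-free specification of the first two moments sufficient.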
 Modeling the parameter variation via a state space model
To increase the reliability of the estimators ,  () models the variation in the parameters  for different accident years i via
        
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
    
By defining  with the help of
        
      
        
      
      
      
      
    
        and by using (26), (27) can be written as
        
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
    
        where  and
        
      
        
      
      
      
      
    
        hold for all . Thus, Equation (28) forms the state equation of a state space model. Considering the estimators  as observations , the associated observation equation can be obtained via
        
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
    
        and ,
        
      
        
      
      
      
      
    
        and  for all . Accordingly, a complete state space model with  and  is specified via Equations (28) and (29).
2.3. Kalman Filters with Applications to Loss Reserving
 () states that this article arose from various lecture notes on statistics and actuarial science and should be viewed primarily as an introduction to Kalman filter theory and ordinary least squares (OLS) estimation and their close relationship to Bayes estimation. Thus,  () derives Kalman recursions for (multiple) linear regression models and the local level model, shows the connections of sample-based updates with Bayes updates in OLS estimators, and discusses state space models and the general Kalman filter algorithms.
The focus in the experimental and empirical applications is primarily not on an application of the Kalman filter, but on an investigation of the trend properties within claims development triangles. In the experimental application, a simulation of incremental payments  in accident years  and development years  is performed via
        
      
        
      
      
      
      
    
        i.e., a variation of the Hoerl curve. The factor  reflects the basic level of incremental payments, while the factor  describes their decreasing behavior over the development years. Based on this, calendar year effects (in the form of inflation factors) are illustrated and the problem of overparameterization is addressed, which arises, e.g., when there are too many parameters for the individual accident years, but can be remedied by recursively evolving parameters. However, no specific state space representation is developed.
2.4. Loss Reserving: Past, Present and Future
 () give a classification scheme for claims reserving methods whose higher-level criteria make a division between static and dynamic methods. In the framework of this taxonomic classification and especially with respect to the dynamic methods, they discuss a generalized Kalman filter, which allows for non-linearities in the observation equation and noise terms following a distribution of the Exponential Dispersion Family (EDF). They present two modeling approaches based on different types of claims data and state space representations constructed specifically for these data.
 Accident year-based state space modeling
In the first modeling approach, an accident year-based state space representation is constructed, which is based on Payments Per Claim Incurred (PPCI) of a workers’ compensation insurance policy as claims data. The PPCI of an accident year  in the development year  are denoted by  and belong to the ()-th calendar year with .
The state space model considered by  () is based on a linear state equation of the form
        
      
        
      
      
      
      
    
        with five-dimensional random vectors , transition matrix ,  and
        
      
        
      
      
      
      
    
        for , while the observation equation
        
      
        
      
      
      
      
    
        with ()-dimensional random vectors , system matrix ,  and
        
      
        
      
      
      
      
    
        is based on a generalized linear model with link function h (i.e., response function ) and linear predictor  for all . Moreover,  holds for all , the initial state  is uncorrelated with  and  for all  and  is assumed to be EDF-distributed for all . Thus, any strictly monotonic and differentiable link function h (such as a logarithm function) can be used to link the EDF-distributed observations  and the systematic component .  () refer to the resulting recursive equations as the EDF filter, which includes the Kalman filter as a special case, namely for the identity link function and normally distributed noise terms . The observation vector  in (32) includes all PPCIs of an accident year  of the upper claims development triangle (see Figure 4).
      
    
Figure 4. Accident year-based modeling of the observation vector.
  
 () propose a logarithm function as a link function, the noise terms  are assumed to be gamma-distributed and the -th row of the linear predictor  for an accident year  is given by
        
      
        
      
      
      
      
    
        with respect to the development year . Here,  denotes the Kronecker delta,
        
      
        
      
      
      
      
    
        which can be used to model the peak in development year . Thus, the observation Equation (32) of accident year  can be stated as follows:
      
        
      
      
      
      
    
On the other hand,  () do not provide any information on the concrete form of the state Equation (31).  () model the evolution of the PPCI over the development years according to (33) in a similar way to  (),  () and  (), who specify the evolution of incremental payments over the development years with the help of a Hoerl curve.  () apply this approach to the PPCI, as their evolution over the development years is similar to that of incremental payments: They reach their peak in development year  and then drop relatively quickly to zero. This evolution of the PPCI is also the justification of  () for the choice of the logarithm function as a link function and the assumption of a gamma distribution for the measurement noise.
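The shape described above (a rise to a peak followed by a quick decay, combined with a logarithmic link) can be illustrated with a small sketch. The parameterization below is an assumption made for illustration only: it uses a generic Hoerl-type linear predictor (intercept, log term, linear term) plus a Kronecker delta for the peak year. The names `hoerl_design_row`, `expected_ppci`, `theta` and `peak_year` are hypothetical, and the code does not reproduce the exact form of (33).

```python
import numpy as np

def hoerl_design_row(j, peak_year):
    """Row of a hypothetical design matrix for development year j (j >= 1).

    Columns: intercept, log(j), j, Kronecker delta for the peak year.
    """
    kronecker = 1.0 if j == peak_year else 0.0
    return np.array([1.0, np.log(j), float(j), kronecker])

def expected_ppci(theta, n_dev, peak_year):
    """Expected values under a log link: mean = exp(linear predictor)."""
    design = np.vstack(
        [hoerl_design_row(j, peak_year) for j in range(1, n_dev + 1)]
    )
    eta = design @ theta   # linear predictor per development year
    return np.exp(eta)     # response function (inverse of the log link)

# Illustrative parameter values (assumed, not taken from any data set).
theta = np.array([1.0, 1.5, -0.8, 0.4])
mu = expected_ppci(theta, n_dev=8, peak_year=2)
```

With these assumed values, the expected payments peak in the second development year and then decay towards zero; the log link guarantees positive means, which is compatible with a gamma-distributed response.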
 Calendar year-based state space modeling
For the second modeling approach,  () use a data set from  () that consists of motor vehicle bodily injury claim closure rates. Here, rather than collecting the observations from each accident year, they stack the observations from each calendar year into observation vectors. This is due to the fact that claim closure rates are relatively flat across development years, but are subject to calendar year effects.
The state space model proposed by  () provides a linear state equation and an observation equation in the form of a generalized linear model, but differs from the first approach by the time index (calendar years t instead of accident years i) and by the matrix dimensions. They consider the following state space model consisting of the state equation
        
      
        
      
      
      
      
    
        with ()-dimensional random vectors , , a ()-dimensional random vector  and transition matrix  for , and the observation equation of the t-th calendar year
        
      
        
      
      
      
      
    
        with ()-dimensional random vectors , , and -dimensional system matrix  for , where the assumptions concerning the noise terms correspond to those of the first approach (transferred to calendar years).
 () choose the identity function as a link function and the measurement noise is assumed to be normally distributed, which is why one obtains an ordinary linear observation equation and the usual linear Kalman filter can be used. This choice is motivated by the sufficiently high number of claims closures in the underlying claims data, and the assumption of an approximate normal distribution is justified by the central limit theorem, although the assumption of a discrete probability distribution such as the binomial distribution would be more appropriate. As for the development of the expected claim closure rate  with respect to the claims of an accident year  over the development years ,  () assume
        
      
        
      
      
      
      
    
        with  as effect of the t-th calendar year and Kronecker Delta . The observation vector
        
      
        
      
      
      
      
    
        of the t-th calendar year with  contains all  claim closure rates  of the respective calendar year  (see Figure 5), which is why the -dimensional state vector can be stated as
        
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        for .
      
    
Figure 5. Calendar year-based modeling of the observation vector.
  
While the state vector  in the first modeling approach only contains the parameters of the i-th accident year, the state vector  contains all parameters up to the t-th accident year plus the corresponding calendar year effect. This is due to the fact that the observations of the t-th calendar year pass through all accident years . The observation Equation (35) is thus given by
        
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
     according to (37) and  according to (38) for all  as well as three-dimensional zero vectors . The state Equation (34) is then
        
      
        
      
      
      
      
    
        where  and  in  are identity and zero matrices of dimensions , respectively,  in  are three-dimensional zero vectors and ,  are given as follows:
      
        
      
      
      
      
    
Thus, the state equation involves a dynamic estimation of the parameters  and  via
        
      
        
      
      
      
      
    
        for . Finally, Table 2 gives an overview of the dimensions of vectors and matrices in the state space models of  ().
       
    
Table 2. Dimensions in the state space models of  ().
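For the calendar year-based approach with identity link and normally distributed measurement noise, the usual linear Kalman filter applies. The following is a minimal sketch of one measurement update followed by one time update for a generic linear Gaussian state space model. The function name `kalman_step` and all variable names are illustrative assumptions; the example matrices are toy values and not the system matrices of the models discussed above. The second call shows that the observation vector may change its dimension from one calendar year to the next.

```python
import numpy as np

def kalman_step(x, P, y, H, R, F, Q):
    """One measurement update and one time update of a linear Kalman filter.

    x, P : predicted state mean and covariance for the current period
    y    : observation vector of the current period
    H, R : observation matrix and measurement noise covariance
    F, Q : transition matrix and process noise covariance
    """
    # Measurement update
    S = H @ P @ H.T + R                  # innovation covariance
    K = np.linalg.solve(S, H @ P).T      # Kalman gain P H' S^{-1}
    x_filt = x + K @ (y - H @ x)
    P_filt = (np.eye(x.size) - K @ H) @ P
    # Time update
    x_pred = F @ x_filt
    P_pred = F @ P_filt @ F.T + Q
    return x_filt, P_filt, x_pred, P_pred

# Toy example: constant scalar state, observed once and then twice.
x0, P0 = np.array([0.0]), np.array([[1.0]])
F, Q = np.array([[1.0]]), np.array([[0.0]])
H1, R1 = np.array([[1.0]]), np.array([[1.0]])
x_filt, P_filt, x_pred, P_pred = kalman_step(
    x0, P0, np.array([2.0]), H1, R1, F, Q
)
H2, R2 = np.ones((2, 1)), np.eye(2)
x_filt2, P_filt2, _, _ = kalman_step(
    x_pred, P_pred, np.array([1.0, 3.0]), H2, R2, F, Q
)
```

In this toy setting with zero process noise, the filtered mean after both steps equals the precision-weighted average of the prior mean and the three observations.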
  
2.5. The Application of State Space Model in Outstanding Claims Reserve
 () largely adopt the second modeling approach from  (), but without integrating calendar year effects. They extend the state equation by including a further lag of the state vector. Accordingly, the state space model they consider is given by
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        with , ,
        
      
        
      
      
      
      
    
        for all . Table 3 gives an overview of the dimensions of vectors and matrices in the state space model of  ().
       
    
Table 3. Dimensions in the state space model of  ().
  
The observation vector  contains all observations  of the t-th calendar year, i.e., all  with . However, the nature of the claims data is not obvious and the authors refer to it only as “times of compensation”. Therefore, in view of the magnitude of the observations and their modeling, claims data are assumed to be incremental payments. The expected incremental payments of an accident year  are assumed to have a parametric evolution over the development years  similar to (33) via
        
      
        
      
      
      
      
    
        with Kronecker Delta . Thus, the observation Equation (39) of the t-th calendar year () results in a similar form as achieved within the second modeling approach of  (),
        
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
    
        for all .  () do not give the general representation of the state equation according to (40), but the reduced form
        
      
        
      
      
      
      
    
        which solely contains the last four rows of (40) that are of interest. For the remaining -dimensional parameter matrices, they assume scalar matrices  and  for all , which is why the state Equation (42) is given by:
      
        
      
      
      
      
    
If, on the other hand, one intends to express the state equation in the form (40), the upper -dimensional part of  corresponds to an identity matrix, while the last four rows in the last four columns of  form the scalar matrix  and otherwise contain zeros. The parameter matrix  has only zeros in the -dimensional upper part and also in the last four rows except for the last four columns, which correspond to the -dimensional scalar matrix . The noise vector  is equal to a zero vector in the first  rows and to the vector  in the remaining rows.
3. Log-Normal Models for Incremental Payments (Category 2)
This section presents articles in which incremental payments are assumed to be log-normally distributed and are modeled using a log-normal model:
 - ▸  (): A State Space Representation of the Chain Ladder Linear Model;
 - ▸  (): A Method for Modelling Varying Run-Off Evolutions in Claims Reserving;
 - ⊳  (): Bayesian Modelling of Outstanding Liabilities Incorporating Claim Count Uncertainty;
 - ⊳  (): Comparison of Stochastic Reserving Methods.
 
The articles of  (, ) are presented in detail because they rely mainly on state space models and Kalman filter learning theory (marked with ▸ in the above listing), while the models in the papers of  () and  () are treated more concisely (marked with ⊳ in the above listing).
3.1. A State Space Representation of the Chain Ladder Linear Model
 () discusses various state space representations based on the model of a two-way ANOVA, and thus follows  (), who shows a close connection between the CL method and the two-way ANOVA. In addition to a dynamic estimation of the parameters by means of the Kalman filter algorithms,  () also considers static models without and with prior information.
 The linear Chain Ladder model
The modeling is based on increments  with . The restriction to positive values is necessary because of the logarithmic transformation of . In practice, the model of  () can therefore be applied to paid data, but not to incurred data, whose increments can be negative. For the increments , a multiplicative model
        
      
        
      
      
      
      
    
        with  as a parameter of the accident year i,  as a parameter of the development year j and  as noise term with  for all  is assumed. Further, the increments are presumed to follow a log-normal distribution, so a logarithmic transformation of the increments is performed, i.e., . Thus, the variables  are normally distributed. Taking logarithms on both sides of the multiplicative model (43) leads to the (additive) model of the two-way ANOVA with normally distributed residuals
        
      
        
      
      
      
      
    
        with population mean , row parameter , column parameter  and  for all . As for the model parameters,  () assumes  and
        
      
        
      
      
      
      
    
        with , and it holds  for all . Due to the fact that (44) is a model for logarithmized increments, it is referred to in the actuarial literature as log-normal model.  (), on the other hand, chooses to refer to it as linear CL model because it is very similar to the CL method (in an additive representation).  () shows this similarity of the classical CL method to the two-way ANOVA by estimating the parameters of the model (44) via OLS estimation for the two-way ANOVA and then reversing the logarithmic transformations. The predictor for the ultimate claim of an accident year ,
        
      
        
      
      
      
      
    
        is similar to the CL predictor except for a different parameterization. However,  () argues that (45) is neither an MLE nor an unbiased estimator of the expected ultimate claim, so he proposes using Bayes estimators instead. In addition,  () develops several state space representations of the linear CL model (44), which are the focus of the following discussion.
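The OLS estimation of the two-way ANOVA model on logarithmized increments can be sketched as follows. This is a minimal illustration under stated assumptions: the identifiability constraints are taken to be that the row and column parameters of the first accident and development year equal zero, the triangle is synthetic and noise-free, and the names `fit_loglinear_cl`, `mu`, `alpha` and `beta` are hypothetical labels for the population mean, row parameters and column parameters.

```python
import numpy as np

def fit_loglinear_cl(increments):
    """OLS fit of log(increment) = mu + alpha_i + beta_j on the upper triangle.

    Assumes alpha_1 = beta_1 = 0 and strictly positive observed increments.
    Cells below the main anti-diagonal are never accessed.
    """
    n = increments.shape[0]
    rows, y = [], []
    for i in range(n):
        for j in range(n - i):
            x = np.zeros(2 * n - 1)
            x[0] = 1.0                 # population mean
            if i > 0:
                x[i] = 1.0             # row parameter of accident year i + 1
            if j > 0:
                x[n - 1 + j] = 1.0     # column parameter of development year j + 1
            rows.append(x)
            y.append(np.log(increments[i, j]))
    coef, *_ = np.linalg.lstsq(np.array(rows), np.array(y), rcond=None)
    mu = coef[0]
    alpha = np.concatenate([[0.0], coef[1:n]])
    beta = np.concatenate([[0.0], coef[n:]])
    return mu, alpha, beta

# Synthetic, noise-free 4 x 4 triangle built from assumed parameters.
true_mu = 5.0
true_alpha = np.array([0.0, 0.1, 0.2, 0.3])
true_beta = np.array([0.0, -0.5, -1.0, -1.5])
n = 4
triangle = np.full((n, n), np.nan)
for i in range(n):
    for j in range(n - i):
        triangle[i, j] = np.exp(true_mu + true_alpha[i] + true_beta[j])

mu_hat, alpha_hat, beta_hat = fit_loglinear_cl(triangle)
```

Because the synthetic data contain no noise, the fit recovers the assumed parameters exactly; exponentiating the fitted values for the unobserved cells would then reverse the logarithmic transformation, without any bias correction.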
 Development of an appropriate state space representation
In order to specify a state space representation and to be able to use dynamic estimation methods, the linear CL model has to be specified in a recursive form. For this purpose,  () collects the incremental payments of a calendar year  in the t-dimensional vector . However, unlike  (), he does not use the available observations , but the logarithmized observations :
      
        
      
      
      
      
    
		Hence, the entries , , of the t-th diagonal are arranged in the observation vector of the t-th calendar year from top right to bottom left (i.e., opposite to ); see Figure 6.
      
    
Figure 6. Modeling the observation vector in  ().
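The arrangement of the t-th diagonal from top right to bottom left can be sketched in a few lines. The function name `calendar_year_vector` and the zero-based array layout (rows as accident years, columns as development years) are assumptions made for this illustration.

```python
import numpy as np

def calendar_year_vector(increments, t):
    """Logarithmized increments of calendar year t (1-based).

    Collects the cells (i, j) with i + j - 1 = t, ordered from the top right
    (accident year 1, development year t) to the bottom left
    (accident year t, development year 1).
    """
    return np.array(
        [np.log(increments[i - 1, t - i]) for i in range(1, t + 1)]
    )

# Toy triangle whose logarithm encodes the cell position: log = 10 * row + column.
n = 4
toy = np.array([[np.exp(10.0 * r + c) for c in range(n)] for r in range(n)])
y3 = calendar_year_vector(toy, 3)
```

For the third calendar year, the vector contains the cells (1, 3), (2, 2) and (3, 1) in this order.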
  
Using a state vector containing the model parameters  up to the t-th accident and development year, an appropriate observation equation for the t-th calendar year based on (44) can be stated as
        
      
        
      
      
      
      
    
        or in a more compact form as
        
      
        
      
      
      
      
    
        with t-dimensional observation vector , system matrix , -dimensional state vector , and t-dimensional Gaussian white noise process  with  and
        
      
        
      
      
      
      
    
        for all . For the third calendar year, for instance, (46) results in:
      
        
      
      
      
      
    
For the state equation,  () gives several alternatives, where the most general variant is
        
      
        
      
      
      
      
    
        with system matrices , , the u-dimensional stochastic input vector  as well as the -dimensional Gaussian white noise process  with  and
        
      
        
      
      
      
      
    
        for . Here, , ,  are pairwise stochastically independent for all  and the input vector  is independent of the state vector . Table 4 gives an overview of the dimensions of the vectors and matrices in the state space model of  ().
       
    
Table 4. Dimensions in the state space model of  ().
  
The dynamics of the system depend on the matrices ,  and the distribution of the input vector  in the state Equation (48). The simplest case is when  and  are zero vectors for all  and the parameters at time  are the same as those at time t. Then, (48) is given by:
      
        
      
      
      
      
    
If, on the other hand, one wants to realize different parameters at time  and t, the following variant of the state Equation (48) can be used:
      
        
      
      
      
      
    
The variant (50) of the state equation means that parameters already determined remain unchanged and the new parameters are treated as stochastic inputs. While static parameter estimation is performed in the cases (49) and (50), dynamic parameter estimation can be achieved using the Kalman filter when a stochastic noise term  is added. For dynamic modeling,  () proposes state equations for two cases: a dynamic estimation of the row parameters only, and a dynamic estimation of both row and column parameters simultaneously. A dynamic estimation of the row parameters with the help of the random walk  can be achieved via the following state equation:
      
        
      
      
      
      
    
If, on the other hand, a dynamic estimation of both the row and column parameters according to the random walks
        
      
        
      
      
      
      
    
        is intended, an input vector is no longer needed and a suitable state equation can be stated as follows:
      
        
      
      
      
      
    
Thus, dynamic parameter estimation lies between the identical and the different parameter cases: the parameters in  are related to the parameters in t, but do not necessarily have to match. As an example, the state Equation (53), which allows for a dynamic estimation of both row and column parameters, is given for :
      
        
      
      
      
      
    
3.2. A Method for Modelling Varying Run-Off Evolutions in Claims Reserving
 () adopts the state space model presented in the work of  () with the aim of modeling a run-off evolution that is not necessarily homogeneous across the accident years within the CL method. With this approach, he addresses one of the main criticisms of the CL method, the homogeneity property. Since the state space model from  () is a linear CL model according to (44),  () shows how this model can be adjusted when the development pattern varies across accident years.
 Connection between CL factors and column parameters
A possible method to model a not necessarily homogeneous run-off evolution across the accident years is, for example, to use the individual CL factors  for all  instead of the CL development factors . Such modeling would allow for deviating development factors in different accident years, but comes with the disadvantage of overparameterization. It is therefore reasonable to strike a balance between both these extremes, i.e., between the CL development factors that are identical across the accident years and individual CL factors. For this purpose,  () uses the connection
        
      
        
      
      
      
      
    
        between the CL factors and the column parameters  in the linear CL model (44) (see ) to be able to indirectly relax the homogeneity property of the CL method via modifications to the linear CL model.
 Development of an appropriate state space representation
 () modifies the linear CL model of  () such that the column parameters  with  need not be identical across all accident years. He differentiates the parameters  by accident years  via an extension of the notation to , where  corresponds to the column parameter  in the i-th accident year.  () does not give general definitions of the observation and state equations, but in the following we provide such representations. As for the observation equation in the t-th calendar year, it can be given in general form as follows:
      
        
      
      
      
      
    
As an example, the observation equation in  results in:
      
        
      
      
      
      
    
A connection between the parameters of successive accident years can be established by the state Equation (48). In this regard, a dynamic estimation of the row parameters can be achieved via
        
      
        
      
      
      
      
    
        with  and  for all  to avoid overparameterization of the model. The column parameters  of a development year j are supposed to be connected across accident years i in such a way that they follow a random walk
        
      
        
      
      
      
      
    
        with ,  and  for all  and . In this manner, the parameters related to a specific development year are similar across accident years and may be identical, but need not be. If one assumes a variance of zero for the noise terms  for all , one obtains the state Equation (51) from  (), i.e., the column parameters  of development year j are identical across all considered accident years i and correspond to the column parameter  of the linear CL model (44). The larger the variance chosen for the noise terms , the more the parameters  can vary across accident years. Accordingly, the variances of the individual noise terms can be used to reflect indications of changes in the development pattern.
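The role of the noise variance in such a random walk can be illustrated with a scalar local-level filter. This is a deliberately simplified sketch under stated assumptions: a single parameter observed directly with unit measurement variance, a nearly diffuse prior, and hypothetical names (`filter_level`, `q`, `r`). It is not the multivariate state equation of the model above, but it shows the two limiting cases and the intermediate behavior.

```python
def filter_level(observations, q, r=1.0, prior_mean=0.0, prior_var=1e6):
    """Filtered level of a scalar random walk observed with noise.

    q : variance of the random walk increments (process noise)
    r : variance of the measurement noise
    """
    level, var = prior_mean, prior_var
    for y in observations:
        var = var + q                 # time update of the random walk
        gain = var / (var + r)        # scalar Kalman gain
        level = level + gain * (y - level)
        var = (1.0 - gain) * var      # measurement update of the variance
    return level

obs = [1.0, 2.0, 6.0]
static_estimate = filter_level(obs, q=0.0)       # identical parameters
dynamic_estimate = filter_level(obs, q=1.0)      # related parameters
unrelated_estimate = filter_level(obs, q=1e6)    # practically unrelated parameters
```

With zero process variance, the estimate is essentially the mean of all observations; with a very large variance, it essentially equals the latest observation; for intermediate values, recent observations receive a higher weight than older ones.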
As an example, for , the state equation is given by:
      
        
      
      
      
      
    
Finally, when estimates of the column parameters  for all  are obtained (determined by means of the Kalman filter), the individual CL factors  can be determined separately for individual accident years via
        
      
        
      
      
      
      
    
        according to (54) for . In this manner, a not necessarily homogeneous run-off evolution across all accident years can be modeled within the CL method and the problem of overparameterization is avoided due to the recursive development of the column parameters. Furthermore, it should be emphasized that a dynamic estimation of the parameters has a considerable advantage over the static CL estimation: the observations of more recent accident years have a higher weight with respect to the prediction of the outstanding loss liabilities, whereas CL assigns the same weight to all the observations.
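The step from estimated column parameters to development factors can be sketched as follows. The relation used here is an assumption, since it is derived only from the multiplicative structure of the model: if the expected increments of an accident year are proportional to the exponentiated column parameters, the ratio of expected cumulative payments in successive development years equals the ratio of their partial sums. The function name `cl_factors_from_column_params` is hypothetical, and the sketch ignores any log-normal bias correction.

```python
import numpy as np

def cl_factors_from_column_params(beta):
    """Development factors implied by column parameters of one accident year.

    Assumed relation: factor_j = sum_{k <= j} exp(beta_k) / sum_{k < j} exp(beta_k),
    returned for development years j = 2, ..., n.
    """
    b = np.exp(np.asarray(beta, dtype=float))
    partial_sums = np.cumsum(b)
    return partial_sums[1:] / partial_sums[:-1]

# Illustrative column parameters (assumed): increments halve every year.
beta_i = np.log([1.0, 0.5, 0.25])
factors_i = cl_factors_from_column_params(beta_i)
```

Applying the function separately to the column parameters of each accident year yields accident year-specific factors; if the column parameters coincide across accident years, so do the factors.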
3.3. Bayesian Modelling of Outstanding Liabilities Incorporating Claim Count Uncertainty
 () consider four models based on claims development triangles that include incremental payments and claim counts for RBNS claims. They assume that claims are settled via one-off payments and justify this assumption with their empirical application, which uses run-off data from a large Greek motor insurance company: according to Greek legislation, claims must be reported within three working days, and they are usually settled in the form of a one-off payment. The proportion of claims paid in more than one installment is minimal and is therefore neglected by  ().
Two models are based solely on incremental payments, while the other two models incorporate incremental payments and claim counts, thus using Payments Per Claim Finalized (PPCF).  () adjust the incremental payments  by the inflation index  of the corresponding calendar year  and log-transform the inflation-adjusted incremental payments that are assumed to be log-normally distributed via
        
      
        
      
      
      
      
    
        such that  for all . The definition of  is different for the four models under consideration:
 - ⊳ Log-normal model for incremental payments (Model 1);
 - ⊳ Log-normal model for PPCF (Model 2);
 - ⊳ State space model for incremental payments (Model 3);
 - ⊳ State space model for PPCF (Model 4).
 
		but it is generally based on the two-way ANOVA model and thus also on the linear CL model from  (, ) according to (44). In the framework of models 3 and 4,  () consider state space models; however, they only specify the ANOVA model, recursive relationships of the parameters and model extensions without developing a specific state space representation. The reason for this is that they do not employ the Kalman filter to fit the model and to predict the outstanding loss liabilities, but instead they use a Bayesian approach in combination with Markov Chain Monte Carlo (MCMC). As the article by  () does not mainly rely on state space models and the Kalman filter theory, the models are presented briefly, and, in particular, details on the Bayesian approach are omitted.
 Log-normal model for incremental payments (Model 1)
The log-normal model for incremental payments, where the expected value  is given by
        
      
        
      
      
      
      
    
        for all  with , has already been considered by various authors. That is, the expected incremental payments  for claims of the i-th accident year that are paid with a lag of  years are modeled via a linear predictor. This predictor consists of the sum of  (expected inflation- and log-adjusted claims payments of the first accident year that are settled in the same development year),  (row parameter reflecting expected changes in the i-th accident year), and  (column parameter reflecting expected changes in the j-th development year). According to  (), the ANOVA model has the disadvantage that it includes only one source of information (i.e., incremental payments) and omits claim counts. For example, this model would not be able to take into account a strong increase in incremental payments due to an unexpected increase in the claim counts.
 Log-normal model for PPCF (Model 2)
The log-normal model for PPCF extends the first model by additionally considering claim counts in the modeling. For this purpose,  () give a two-stage model, where the first stage is related to incremental payments,
        
      
        
      
      
      
      
    
        with  and claim counts  for all . Compared with model 1, the ANOVA model (57) was additively extended by the term , which is why  in (58) can be interpreted as the logarithmized expected PPCF of the first accident year in the first development year, and the parameters  and  can be considered as expected deviations from  in the later accident and development years, respectively. The second stage of the model is related to the claim counts  with . It is given by the log-linear model
        
      
        
      
      
      
      
    
        with constraints ,  for all , hyper-parameters  and , and , where  holds,  is the probability that a claim will be settled with a lag of  years, and  denotes the total number of claims for a given accident year i. In this model, an increase in incremental payments induced by higher claim counts is accounted for.
 State space model for incremental payments (Model 3)
The state space model for incremental payments is based on the discussion of  () and the extension of the column parameters  to  as proposed by  ():
      
        
      
      
      
      
    
Here, the row and column parameters  and  follow the recursions
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        with  and  as well as  for all . Thus, for the variance of the individual log-transformed and inflation-adjusted incremental payments ,  holds for  or  and  holds for , as in each subsequent accident year after accident year , the weighted sum of the variance terms ,  (see recursions (59) and (60)) is added to the variance term . That is, this model differs from model 1 in two ways: the column parameters  are extended to , and both row and column parameters evolve recursively. The recursions (59) and (60) are thereby decisively affected by the variances  and  of their noise terms: If  is assumed to be close to zero, all row parameters tend to zero due to . If, on the other hand,  is assumed, models 1 and 3 are identical (except for the -recursion) because the column parameters are the same across all accident years, i.e.,  holds for all i.
 State space model for PPCF (Model 4)
The state space model for PPCF extends model 3 by incorporating claim counts. Like the second model, it is designed as a two-stage model, with stage 1 related to incremental payments and stage 2 related to claim counts. Thus, the first stage of model 4 is described via
        
      
        
      
      
      
      
    
        for all  with recursions (59) and (60), and the second stage is identical to the second stage of model 2. Hence, models 2 and 4 differ in the same way as models 1 and 3, namely in the accident year-specific column parameters and in the recursive relationships of the row and column parameters.
3.4. Comparison of Stochastic Reserving Methods
 () compares several stochastic claims reserving methods, including a state space model, in terms of forecasting the outstanding loss liabilities. The considered state space model
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        is based on the common assumptions regarding the noise terms (as, for example, in ), and it is constructed in analogy to  () via the log-normal model for incremental payments and the linear CL model (44), respectively: the observation vector  includes all logarithmized incremental payments  with  of the t-th calendar year ( with ), where the  have an expected value of  with . The measurement noise  that overlays the expected logarithmized incremental payments follows a Gaussian white noise process (). The state vector  includes , row parameters , and column parameters ; thus, unlike  (), column parameters beyond  for  are also included. Table 5 gives an overview of the dimensions of the vectors and matrices in the state space model of  ().
       
    
Table 5. Dimensions in the state space model of  ().
  
The part on the left-hand side of the vertical line in the system matrix  is generally of dimensions , and the part on the right-hand side consists of  zero columns for all . Thus, if ,  only includes the -dimensional part on the left-hand side of the vertical line and no zero columns. As for the state Equation (62),  () proposes a dynamic estimation of the row parameters according to  with  for :
      
        
      
      
      
      
    
For , the -th column of  thus contains in the rows  and t the value one and otherwise only zeros. In the case , however,  deviates from (63) by having only zeros in the second row because of . The noise term  corresponds in each case to the t-th component of the vector .
4. Correlation Models (Category 3)
This section presents two articles:
 - ⊳  (): State Space Models in Actuarial Science;
 - ▸  (): Forecasting Runoff Triangles.
 
Here, correlations regarding the different dimensions of claims development triangles are considered. As the conference paper by  () can be seen as a preprint of  () (with respect to the remarks on claims reserving), it is briefly presented, while  () is highlighted in the listing (as in the previous sections) with ▸ since it is significantly based on state space models and Kalman filter learning theory.
4.1. State Space Models in Actuarial Science
 () discusses two applications of state space models in actuarial science, one relating to mortality and one relating to cumulative payments in run-off triangles. As for the latter, he extends the model of  () and proposes the so-called development correlation model. This model is already presented in a prior working paper by  (), where two additional models, the accident correlation model and the calendar correlation model, are proposed, but without discussing their state space representations. This extension, i.e., an embedding of the three models into state space representations and model fitting via the Kalman filter, is carried out in the work of  (). Thus, with respect to applications of state space models in claims reserving,  () is a variant of  () that deals with only one of the correlation models. For this reason, we refer to the following subsection, in which the article of  () is presented.
4.2. Forecasting Runoff Triangles
 () aims to predict the outstanding loss liabilities using three different models that can account for correlations within the claims data. In each case,  () gives state space representations for these models in order to be able to apply the Kalman filter to predict the claims reserves and to quantify their precision. Based on these results, he simulates the complete shape of the liability distribution. In the following, the focus is mainly on the state space representations of the considered models.
The proposed correlation models in the work of  () are generally based on a model of  (), which is extended in such a way that correlations between the individual accident, development or calendar years can be incorporated into the modeling. The models consider the logarithmized individual development factors
        
      
        
      
      
      
      
    
        with ,  and . Using the individual development factors (64), the future growth rate  of cumulative payments in each accident year  can be decomposed as follows:
      
        
      
      
      
      
    
Considering (65), the outstanding loss liabilities  of an accident year  are given by:
      
        
      
      
      
      
    
An aggregation of (66) across all accident years yields the total outstanding loss liabilities:
      
        
      
      
      
      
    
Thus, in order to predict the outstanding loss liabilities, it is necessary to estimate the growth rates  according to (65) and the future logarithmized individual development factors  for , respectively. For this purpose,  () considers three extended variants of the model proposed by  (). The model of  (),
        
      
        
      
      
      
      
    
        with ,  and , is a simple model for logarithmized individual development factors in which the  are assumed to be uncorrelated for all , . Here,  and , i.e., expected value and variance of the logarithmized individual development factors  only depend on the development year j.
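The chain from logarithmized individual development factors to outstanding loss liabilities can be sketched for the simple uncorrelated model. The sketch rests on several assumptions: the expected value per development year is estimated by the sample mean of the observed logarithmized factors, the future growth rate is the sum of these means over the remaining development years, and the reserve is obtained by plugging the growth rate into the exponential without any log-normal bias correction. The function name `reserves_from_log_factors` and the zero-based array layout are hypothetical.

```python
import numpy as np

def reserves_from_log_factors(cum):
    """Plug-in reserves from a cumulative run-off triangle.

    cum[i, j] holds cumulative payments of accident year i + 1 up to
    development year j + 1; cells below the main anti-diagonal are ignored.
    """
    n = cum.shape[0]
    mean_log_factor = np.zeros(n - 1)
    for j in range(n - 1):
        # Observed log factors from development year j + 1 to j + 2.
        observed = [np.log(cum[i, j + 1] / cum[i, j]) for i in range(n - 1 - j)]
        mean_log_factor[j] = np.mean(observed)
    reserves = np.zeros(n)
    for i in range(n):
        latest = n - 1 - i                         # last observed column of row i
        growth = mean_log_factor[latest:].sum()    # future growth rate
        reserves[i] = cum[i, latest] * (np.exp(growth) - 1.0)
    return mean_log_factor, reserves

# Toy triangle with constant development factors 2.0 and 1.5.
toy_cum = np.array([
    [100.0, 200.0, 300.0],
    [200.0, 400.0, np.nan],
    [300.0, np.nan, np.nan],
])
log_factors, reserves = reserves_from_log_factors(toy_cum)
total_reserve = reserves.sum()
```

In the toy triangle, the oldest accident year is fully developed, so its reserve is zero; the total outstanding loss liabilities are the sum over all accident years.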
To incorporate correlations of the logarithmized individual development factors into the model of  (),  () presents the development, accident, and calendar correlation models, which consider correlations between development years j, accident years i, and calendar years , respectively. To obtain appropriate state space representations of these models,  () suggests the general state space model
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        with , where the t-dimensional observation vector  contains the logarithmized individual development factors  of the t-th calendar year (see Figure 7).
      
    
    Figure 7.
      Modeling of the observation vector in  ().
  
Due to the fact that  () aims to embed all three models into the same general state space model, the state space representations obtained in this way are excessive in their complexity. This is in contrast to the underlying compact models, in particular the development correlation model with only one model equation.
 Development correlation model
The development correlation model allows correlations of  across development years  for a given accident year  to be modeled and is defined by
        
      
        
      
      
      
      
    
        with  for  and . Here, the correlation between development years j and  (i.e., between  and ) is modeled via . Based on empirical evidence,  () argues that only correlations between the first two development years are relevant, so only the correlation between  and  is considered. Thus, the correlation coefficient between  and  results in
        
      
        
      
      
      
      
    
        i.e., the correlation between  and  is based solely on . Thus, if , then  and  are uncorrelated as in the model of  (). Furthermore, setting  in (71) for all  results in the original model of  ().
The development correlation model (71) can be transferred into a state space representation with the observation equation
        
      
        
      
      
      
      
    
        and state equation
        
      
        
      
      
      
      
    
		by using (69) and (70). The matrix  consists of the last  rows of the row-permuted identity matrix ; that is,  corresponds to the row-permuted identity matrix on the left-hand side of the vertical line for , and it reduces by one row for each t before the -th calendar year. Considering, for example,  and , the state space representation of the development correlation model (71) is given by:
      
        
      
      
      
      
    
 Accident correlation model
The accident correlation model allows for correlations between accident years and implies that more recent accident years receive a higher weight for prediction. To achieve this goal, the expected value  in (68) is extended by a row index i to  and a random walk is assumed across the accident years (, ):
      
        
      
      
      
      
    
Here, ,  and  hold for all . Thus, the expected value  of a development year can change slowly across accident years. This change is influenced by the parameter : the larger , the higher the weight of  of more recent accident years. Setting  equal to zero for all j, the accident correlation model corresponds to the model of  (), since the expected value  of a development year is identical across all accident years. The accident correlation model (72) can be transferred into a state space representation with the observation equation
        
      
        
      
      
      
      
    
        and state equation
        
      
        
      
      
      
      
    
		by using (69) and (70). The matrix  consists exclusively of zeros, apart from the value of one at position . Thus, for  it corresponds to the entire (-dimensional) part on the left-hand side of the vertical line. Considering, for example,  and , the state space representation of the accident correlation model (72) is given by:
      
        
      
      
      
      
    
 Calendar correlation model
The calendar correlation model
        
      
        
      
      
      
      
    
        with ,  and  for all ,  is appropriate to consider correlations between calendar years . The calendar year effects  are modeled as a random walk across calendar years, which is why all logarithmized individual development factors  of a given calendar year change equally. The effect of  on individual development factors is measured by  and it is modeled proportionally to the standard deviation of . Setting , the calendar correlation model (73) corresponds to model (68), since the effects  are the same for all calendar years  and the term  is considered as part of . The calendar correlation model (73) can be transferred into a state space representation with the observation equation
        
      
        
      
      
      
      
    
        and the state equation
        
      
        
      
      
      
      
    
		by using (69) and (70). The matrix  contains the last  rows of the row-permuted identity matrix  and a row of zeros as the last row, i.e., for  it corresponds to the entire (-dimensional) part on the left-hand side of the vertical line, and for each t before the -th calendar year it reduces by one row. Considering, for example,  and , the state space representation of the calendar correlation model (73) is given by:
      
        
      
      
      
      
    
Finally, Table 6 gives an overview of the dimensions of vectors and matrices in the above three state space models of  ().
       
    
    Table 6.
    Dimensions in the state space models of  ().
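The weighting behavior described for the accident correlation model can be illustrated with a scalar local level filter for a single development year. This is a minimal sketch under stated assumptions: the observations are the logarithmized development factors of one development year across accident years, the expected value follows a random walk with variance `q`, the observation noise has variance `sigma2`, and the initial state is nearly diffuse. The function name and the data are hypothetical and do not reproduce the full multivariate representation of the cited work.

```python
import numpy as np

def local_level_filter(y, sigma2, q, m0=0.0, p0=1e8):
    """Scalar Kalman filter: y_i = mu_i + noise, mu_i = mu_{i-1} + random walk step."""
    m, p = m0, p0
    for obs in y:
        p = p + q                 # prediction: the random walk adds variance q
        k = p / (p + sigma2)      # Kalman gain
        m = m + k * (obs - m)     # update of the filtered mean
        p = p * sigma2 / (p + sigma2)
    return m, p

# Illustrative log development factors of one development year, oldest first.
eta_j = np.array([0.40, 0.42, 0.39, 0.47, 0.50, 0.52])

# q = 0: the expected value is constant across accident years.
m_static, _ = local_level_filter(eta_j, sigma2=0.01, q=0.0)
# q > 0: the expected value may drift, so recent accident years weigh more.
m_dynamic, _ = local_level_filter(eta_j, sigma2=0.01, q=0.01)

print(round(m_static, 4), round(m_dynamic, 4))
```

With `q = 0` the filtered value coincides (up to the negligible prior influence) with the sample mean over all accident years, while a positive `q` pulls the estimate toward the most recent observations.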
  
5. Univariate State Space Models (Category 4)
In this section, we present articles where univariate state space models are proposed:
- ▸ (): A State Space Model for Run-Off Triangles;
- ▸ (): State Space Models and the Kalman Filter in Stochastic Claims Reserving: Forecasting, Filtering and Smoothing.
 
Both articles are mainly devoted to state space models and the Kalman filter learning algorithms, so they are highlighted with ▸ in the above listing.
5.1. A State Space Model for Run-Off Triangles
 () present a univariate distribution-free state space model for incremental payments to predict claims reserves and to calculate their precision. They assume that the incremental payments of more recent development years are not related to the respective payments of the previous development year, but to the payments made in the accident year. This is in contrast to the common CL method, which is based on the assumption that cumulative payments in more recent development years are proportional to the cumulative payments of the previous development year, with the proportionality factor being assumed to be constant across all accident years under consideration (homogeneity property).  (), on the other hand, assume that the proportionality factor linking the incremental payments of more recent development years to the value of the 0th development year may also vary across accident years, so they do not require the common assumption of independent accident years often found in stochastic claims reserving methods.
The observation equation thus links the incremental payments  of the ith accident year () in the jth development year ( and ) via factor  to the payments  that already occurred in accident year i (see also Figure 8):
      
        
      
      
      
      
    
      
    
    Figure 8.
      Modeling of the incremental payments in the work of  ().
  
Here, the incremental payments  act as observations, while the  for all  correspond to the unknown states. The state equation is constructed as an AR(1) model with the expected value  and  as a function of :
      
        
      
      
      
      
    
As for the noise terms, they are assumed as white noise processes with
        
      
        
      
      
      
      
    
        as well as  for all  and . The strictest assumption of the model is that the incremental payments of more recent development years depend on the payments of the 0th development year, whereas the columns for  are independent of each other.
Setting the variances  and the coefficients  equal to zero for all , (75) simplifies to , i.e.,  is constant across all accident years and corresponds to the expected value  of the j-th development year. In this case, the observation Equation (74) results in . On the other hand, if the coefficients  are all set equal to one and  also holds for all , then the state equation is , which is why the coefficients are constant over all accident years, and the observation equation results in . The state equation would thus be obsolete in both cases and the state space modeling would simplify to a regression model. Thus, the general model (see (74) and (75)) can be seen as a simple regression model of each  on , where the time-varying parameters  follow an AR(1) process.
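The two special cases can be checked numerically with a scalar Kalman filter whose observation coefficient is the payment of the 0th development year. The sketch below assumes the AR(1) state equation in the mean-reverting form b_i = mu + phi (b_{i-1} - mu) + noise, which is consistent with both limiting cases described above but is an assumption about the exact parametrization. All names and numbers are hypothetical.

```python
import numpy as np

def filter_coefficients(y, x, mu, phi, q, r, b0, p0):
    """Kalman filter for y_i = b_i * x_i + eps_i with AR(1) state b_i (fixed development year)."""
    b, p = b0, p0
    path = []
    for yi, xi in zip(y, x):
        # Prediction step of the AR(1) state.
        b = mu + phi * (b - mu)
        p = phi**2 * p + q
        # Update step with time-varying observation coefficient xi.
        s = xi**2 * p + r          # innovation variance
        k = p * xi / s             # Kalman gain
        b = b + k * (yi - xi * b)
        p = p * r / s              # equals (1 - k * xi) * p, numerically more stable
        path.append(b)
    return np.array(path), p

# Illustrative payments: x = development year 0, y = a later development year.
x = np.array([100.0, 110.0, 120.0, 130.0])
y = np.array([50.0, 58.0, 55.0, 66.0])

# Case 1: phi = 0 and q = 0, the coefficient is the constant mu for all accident years.
path_const, _ = filter_coefficients(y, x, mu=0.5, phi=0.0, q=0.0, r=1.0, b0=0.0, p0=1.0)

# Case 2: phi = 1 and q = 0 with a nearly diffuse prior, the filter reduces to
# recursive least squares for a regression through the origin.
path_rls, _ = filter_coefficients(y, x, mu=0.0, phi=1.0, q=0.0, r=1.0, b0=0.0, p0=1e8)
ols = np.sum(x * y) / np.sum(x**2)

print(path_const, round(path_rls[-1], 6), round(ols, 6))
```

In the second case the last filtered coefficient agrees with the ordinary least squares slope, which is the regression interpretation mentioned in the text.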
5.2. State Space Models and the Kalman Filter in Stochastic Claims Reserving: Forecasting, Filtering and Smoothing
 () propose a scalar state space model for cumulative payments in order to employ the Kalman filter for calculating the claims reserves and for measuring their precision. It is assumed that there are unobservable states  underlying the observed cumulative payments  with  for , i.e., the “real cumulative payments” are modeled as latent variables and there may be a potential observation error in the claims data. The introduced state space model then makes it possible to determine the entire unobservable upper and lower run-off triangles, that is, forecasting, filtering and smoothing of all states  with  (see Figure 9).
      
    
    Figure 9.
      Unobservable states, observations and Kalman smoothings (), filterings () and predictions ().
  
The authors consider a linear state space model, which consists of the observation equation
        
      
        
      
      
      
      
    
        with ,  and  for ,  as well as the state equation
        
      
        
      
      
      
      
    
        with ,  and  for , . The white noise processes  and  are uncorrelated, i.e.,  holds for all ,  and . This assumption is due to the fact that there is no reason to assume a systematic relationship between the measurement noise  and the process noise .
In (78) and (79),  and  with  and  are appropriate linear functions. As a consequence of the model assumptions,
        
      
        
      
      
      
      
    
        hold for all ,  with , . Thus, the initial state  of an accident year  is uncorrelated with  and  for all j.
As for the prediction of the future cumulative payments  with  for ,  in the lower triangle, the Kalman learning algorithms for one- and h-step predictions () can be used. Considering the underlying states  of the observations  in the upper triangle, the Kalman learning algorithms for filtering (for ) and the Kalman learning algorithms for smoothing (for ) can be applied to identify outliers in the observations and to replace them by filtered or by smoothed observations as well as to quantify outlier effects. Another key application of smoothing and filtering algorithms is the interpolation of missing values in the upper run-off triangle (e.g., resulting from a merger).
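Filtering, smoothing and the interpolation of a missing value can be sketched for a single accident year as follows. The sketch assumes a scalar model in which the latent cumulative payment evolves as x_j = a_j x_{j-1} + w_j and is observed as y_j = x_j + v_j. The factors `a`, the variances `q` and `r`, and the data are hypothetical choices, since the concrete linear functions of the cited model are not reproduced here. A missing observation is coded as NaN and simply skips the update step.

```python
import numpy as np

def kalman_filter_smoother(y, a, q, r, m0, p0):
    """Scalar Kalman filter with missing values plus a Rauch-Tung-Striebel smoother."""
    n = len(y)
    m_pred, p_pred = np.zeros(n), np.zeros(n)
    m_filt, p_filt = np.zeros(n), np.zeros(n)
    m, p = m0, p0
    for j in range(n):
        if j > 0:                                  # prediction step
            m, p = a[j] * m, a[j] ** 2 * p + q
        m_pred[j], p_pred[j] = m, p
        if not np.isnan(y[j]):                     # update only if observed
            k = p / (p + r)
            m, p = m + k * (y[j] - m), p * r / (p + r)
        m_filt[j], p_filt[j] = m, p
    # Backward pass: smoothed states use all observations of the accident year.
    m_smooth, p_smooth = m_filt.copy(), p_filt.copy()
    for j in range(n - 2, -1, -1):
        g = p_filt[j] * a[j + 1] / p_pred[j + 1]
        m_smooth[j] = m_filt[j] + g * (m_smooth[j + 1] - m_pred[j + 1])
        p_smooth[j] = p_filt[j] + g ** 2 * (p_smooth[j + 1] - p_pred[j + 1])
    return m_filt, p_filt, m_smooth, p_smooth

# Illustrative cumulative payments with one missing value in development year 2.
y = np.array([100.0, 152.0, np.nan, 171.0, 172.0])
a = np.array([1.0, 1.5, 1.1, 1.03, 1.01])          # a[0] is not used
m_filt, p_filt, m_smooth, p_smooth = kalman_filter_smoother(
    y, a, q=4.0, r=9.0, m0=100.0, p0=100.0
)
print(np.round(m_filt, 2), np.round(m_smooth, 2))
```

At the missing position the filtered variance grows because no update takes place, whereas the smoothed variance is smaller because later observations of the same accident year are taken into account.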
6. Row-Wise Stacking Approaches (Category 5)
In this section, we discuss articles where the claims data is stacked row-wise:
- ▸ (): A row-wise Stacking of the Runoff Triangle: State Space Alternatives for IBNR Reserve Prediction;
- ▸ (): State space models for predicting IBNR reserve in row-wise ordered runoff triangles: Calendar year IBNR reserves and tail effects;
- ▸ (): Applying State Space Models to Stochastic Claims Reserving.
 
These articles are all marked with ▸ because the proposed methods are mainly based on state space models and the Kalman filter learning algorithms.
6.1. A Row-Wise Stacking of the Runoff Triangle: State Space Alternatives for IBNR Reserve Prediction
In contrast to most of the above approaches,  () do not stack the observations of individual accident, development or calendar years in a vector representation, but consider the claims data as a univariate time series with various missing observations. The time series is then modeled using a structural model in a state space representation. As for the prediction of the claims reserves and the estimation of the corresponding MSEP for individual and aggregated accident years,  () present two approaches, the blocks method and the cumulating method. Although both approaches differ in some aspects, they provide the same numerical results.
 Development of an appropriate state space representation
 () consider claims development triangles that include incremental payments  in accident years  and development years . They put the incremental payments into a representation as univariate time series by simply stacking the observations of more recent accident years to the observations of the first accident year. Thus, the common double indexing  is omitted and replaced by the simple index t, which, however, cannot be interpreted in chronological form as usual for time series. The time series  constructed in this way, with , has more and more missing observations for increasing t, which lead to the outstanding loss liabilities for aggregated accident years as follows:
      
        
      
      
      
      
    
Figure 10 shows the row-wise “stacked” incremental payments using the notation  instead of , where the observed time series values correspond to those of the upper triangle and the missing values to those of the lower triangle.
      
    
    Figure 10.
      Row-wise stacked incremental payments in the work of  ().
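The row-wise stacking can be sketched in a few lines. The example assumes a square array of incremental payments with NaN in the lower triangle and uses zero-based indices, whereas the text counts from one. The function name and the data are illustrative.

```python
import numpy as np

def stack_rowwise(triangle):
    """Append each accident year (row) to the previous one to form a univariate series."""
    return np.asarray(triangle, dtype=float).ravel()

# Illustrative incremental payments; NaN marks the lower triangle.
Y = np.array([
    [100.0, 50.0, 15.0, 5.0],
    [110.0, 58.0, 20.0, np.nan],
    [120.0, 55.0, np.nan, np.nan],
    [130.0, np.nan, np.nan, np.nan],
])
J = Y.shape[1]

y = stack_rowwise(Y)
missing = np.isnan(y)

# The single index t maps back to accident year i and development year j.
t = np.arange(y.size)
i, j = np.divmod(t, J)

print(y)
print("missing positions:", t[missing])
```

The missing positions become more frequent toward the end of the series, and the double index can always be recovered from `t` by integer division, so no information is lost by the stacking.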
  
 () model the row-wise stacked incremental payments  via a structural model that includes a level component , a periodic component , and a regression term . Hence, they obtain
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        with ,  and . Here, the level component captures the mean level of incremental payments, while the periodic component reflects the column effect (i.e., the development pattern) and the regression term is incorporated to address intervention effects (related to outliers in the observations).
To represent the structural model consisting of Equations (80)–(82) as a state space model,  () consider the general state space model
        
      
        
      
      
      
      
    
        with normal assumptions
        
      
        
      
      
      
      
    
        for . As for the noise terms  and , it is assumed that ,  for  and  for all . Moreover, the initial state  is proposed to be independent of  and  for all t. Incorporating the structural model into a state space representation, the observation equation results in
        
      
        
      
      
      
      
    
        with , , ,  and  and the state equation is given by
        
      
        
      
      
      
      
    
        with
        
      
        
      
      
      
      
    
Table 7 gives an overview of the dimensions of vectors and matrices in the state space model of  ().
       
    
    Table 7.
    Dimensions in the state space model of  ().
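How a level and a periodic component fit into a J-dimensional state vector can be sketched as follows. The sketch assumes the standard dummy-type periodic component of basic structural models, in which the periodic effects of J consecutive time points sum to zero up to noise, and it omits the regression term and all noise terms. The ordering of the state vector and the function name are assumptions made for illustration and need not match the representation in the cited work.

```python
import numpy as np

def structural_matrices(J):
    """Observation vector Z and transition matrix T for level plus periodic component.

    Assumed state ordering: (level, gamma_t, gamma_{t-1}, ..., gamma_{t-J+2}).
    """
    Z = np.zeros(J)
    Z[0] = 1.0                      # level enters the observation
    Z[1] = 1.0                      # current periodic effect enters the observation
    T = np.zeros((J, J))
    T[0, 0] = 1.0                   # level follows a random walk
    T[1, 1:] = -1.0                 # new periodic effect = minus the sum of the last J - 1
    T[2:, 1:-1] = np.eye(J - 2)     # shift the stored periodic effects by one lag
    return Z, T

J = 4
Z, T = structural_matrices(J)

# Noise-free iteration: the signal Z @ alpha repeats with period J, i.e., the
# same development pattern recurs in every row of the stacked triangle.
alpha = np.array([10.0, 3.0, -1.0, -4.0])
signal = []
for _ in range(2 * J):
    signal.append(Z @ alpha)
    alpha = T @ alpha
signal = np.array(signal)
print(signal)
```

Because the periodic block of the transition matrix returns to the identity after J steps, the stacked series inherits one column effect per development year without requiring one free parameter for each cell of the triangle.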
  
In the following, the cumulating method, one of the two approaches proposed by  () to predict the loss reserves and to estimate their MSEP for individual and aggregated accident years, is presented.
 Cumulating method
The cumulating method adds components to the state vector that accumulate estimates of the missing observations in the lower triangle, so that the MSEP of the claims reserves can be determined directly using the Kalman filter. In the following,  denotes an index set containing all t-indices belonging to observations , and  stands for total, i.e., for aggregated accident years. If one is interested only in the claims reserves along with the MSEP for aggregated accident years, the state vector can be extended by the additional component  that accumulates all estimates of missing observations across all accident years. The state space model is then given by
        
      
        
      
      
      
      
    
        with , the J-dimensional zero vector  in the transition matrix, the two-dimensional zero vector  and the J-dimensional row vector
        
      
        
      
      
      
      
    
        where the changes in the dimensions within the system compared to (83) and (84) are given, while , , , , ,  remain unchanged. If one is also interested in individual accident years, further components corresponding to the respective accident years  have to be added to the state vector. This leads to the inclusion of the J-dimensional vector
        
      
        
      
      
      
      
    
        in which the component  related to aggregated accident years is also included. The modified state space model is then given by
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        with , the -dimensional zero matrix  and identity matrix  in the transition matrix, the -dimensional zero matrix  and the -dimensional matrix  with  components
        
      
        
      
      
      
      
    
        as well as component  according to (85). Thus, the vector  includes the claims reserves for individual and aggregated accident years, but without taking into account the effects of the regression terms  with , which are excluded from the accumulation process and therefore have to be added separately.
6.2. State Space Models for Predicting IBNR Reserve in Row-Wise Ordered Runoff Triangles: Calendar Year IBNR Reserves and Tail Effects
 () extend the row-wise stacking approach of  () and the corresponding state space representation of the structural model by implementing (1) a calendar year IBNR reserve prediction and (2) tail effects for the row-wise ordered triangle. In this way they intend (1) to improve the possibilities of an insurance company to predict short-term IBNR reserves and (2) to make IBNR predictions more conservative and thus more effective to protect insurance companies from insolvency risks.
As for the first extension,  () consider the cumulating method proposed by  () and simply add a further cumulating entry to the state vector, in particular, to the vector (86). The additional cumulating entry  is related to the calendar year IBNR reserve and accumulates all estimates of missing observations associated with a specific calendar year.
As for the second extension,  () consider one-step-ahead tail effects for both the columns and the rows of the claims development triangle. Thus, the triangle is extended by an additional row for the -th accident year and an additional column for the -th development year. Following  (), this short period for the tail effects does not lead to a substantial loss of generality, as it was empirically shown that the last column payments are expected to be lower than the first ones. In order to incorporate the tail effects into the structural model,  () assume that  have the same periodicity behavior (i.e., “seasonality”) as the respective previous observation of the time series. Against this backdrop, the following changes are made to the system matrices of the state space representation (see (87) and (88)):
      
        
      
      
      
      
    
That is, the modified state space representation for the cumulating method is the same as in the work of  () for the observations that are not affected by a column tail effect. As for the observations with the tail effect, the above modifications force the periodicity component to be exactly the same as those from the preceding observations.
6.3. Applying State Space Models to Stochastic Claims Reserving
 () discuss and compare various common approaches in stochastic claims reserving such as log-normal models or Hoerl curve approaches in the framework of state space models. In particular, the authors use the approach of a row-wise stacking of the claims development data ordered as a time series proposed by  () to handle common claims reserving methods via unified state space representations and the Kalman filter learning algorithms. This approach has the benefit that all the different models can be handled within the same framework and the results can be easily compared. Since the row-wise stacking approach in a state space representation has practical advantages over other state space approaches,  () exploit these benefits to handle different approaches within the same state space framework.
In the following, the log-normal model for incremental payments according to (44) investigated by  () and other authors is considered (see Section 3). This model is converted into a state space representation following the row-wise stacking approach. In the first step,  for all  are row-wise stacked (as proposed in the work of  ()), and the common time series notation via  with  is used. In contrast to  (),  () take the observations of the first column ( for all i) for each accident year as initial values in the observation equation. This is done so that the initial level for the recursions is set more appropriately, which has a positive impact on the calculations when data are scarce and especially when there are missing values. Thus, the row-wise stacked log-normal model for incremental payments can be stated as
        
      
        
      
      
      
      
    
        with , . The corresponding state space representation with state vector
        
      
        
      
      
      
      
    
        can then be given as follows:
      
        
      
      
      
      
    
In addition,  () consider the multivariate case for all the discussed approaches. This leads to a further benefit of state space models in claims reserving, as it becomes possible to incorporate claims activity dynamics and to model dependencies between correlated lines of business. This does not require any additional effort by the practitioner, since multivariate models can be implemented in state space form in a simple way and are largely analogous to the univariate case.
In the following, the multivariate log-normal model for incremental payments is considered in a state space representation. In addition to the unknown parameters in the above univariate case (), there are further parameters describing the correlations between the run-off triangles in the multivariate setting. Hence, considering N run-off triangles, the  for all  and  are modeled via the log-normal model for incremental payments in a row-wise stacked manner as follows
        
      
        
      
      
      
      
    
        with , . As for achieving a suitable state space representation, the vectors
        
      
        
      
      
      
      
    
        can be used, and the variance–covariance matrices  and  contain the correlation parameters that have to be estimated. Therefore, the following state space representation for the multivariate log-normal model for incremental payments is obtained:
      
        
      
      
      
      
    
Finally, Table 8 gives an overview of the dimensions of vectors and matrices in the above exemplary state space models of  ().
       
    
    Table 8.
    Dimensions in the state space models of  ().
  
7. Conceptual Comparison
In this section, a conceptual comparison of the proposed methods is conducted. In particular, we compare the objectives behind the methods, the modeling approaches for claims data, and the state space representations. Further, we give insights from practical applications discussed in the papers.
7.1. Objectives and Claims Data
The vast majority of articles (; ; ; ; ; ; ; ; ) aim to forecast the outstanding loss liabilities and to calculate the corresponding prediction error. In addition, there are other objectives such as the estimation of the underlying states of the observations in the upper triangle (; ), the extension of the CL method to not necessarily homogeneous development patterns across accident years ( ()), the illustration of calendar year effects ( ()), or the simulation of the shape of the liability distribution (; ).
While most models are based on incremental payments, e.g., the log-normal models (see  , ; ; ), the Hoerl curve approaches (see ; ; ) as well as the methods presented in the work of  (),  (),  (),  (),  (), there are also models constructed for other data situations, such as cumulative payments ( , ; ), incurred incremental data (), PPCF (), claim closure rates (), and PPCI (). Some models also incorporate additional information such as inflation indices (; ; ), business volume (), or exposure ().
Often, the claims data are directly embedded in the objective and thus are an essential component of the modeling. For example, log-normal models for incremental data require strictly positive claims data, which is why they are unsuitable for incurred incremental data. Additionally, modeling via a Hoerl curve needs incremental payments and cannot be easily applied to incurred incremental data. In some articles, such as  () and  (), the claims data even form the foundation of the modeling, i.e., the state space representations are motivated by and constructed specifically for the underlying claims data.
7.2. Modeling of Claims Data
The categories “Parametric evolution of claims data” and “Log-normal models for incremental payments” include the most common modeling approaches for claims data.
Within the first category,  (),  (), and  () assume that incremental payments increase very quickly in early development years and decrease exponentially over the following development years, which is why they model incremental payments via a Hoerl curve (see (5), (17) and (30)). The general exponential-logarithmic Hoerl curve is given by
        
      
        
      
      
      
      
    
        with development year parameter  for all  and . An advantage of treating development time j as a continuous covariate is that extrapolation is possible beyond the range of development times observed (see, e.g., ). The Hoerl curve is the most popular parametric form used for modeling the evolution of incremental payments over development years j, since it behaves very similarly to the typical run-off of incremental payments: it rises very quickly to its peak and then tends to zero exponentially. Following the Hoerl curve approach,  (),  (), and  () propose modeling the expected incremental payments in
		 by means of variations of (89) as follows (see (5), (21) and (30)):
		
| () | |
| () | |
| () | 
In addition, by implementing state space models,  () and  () allow the accident year parameters to evolve recursively over the accident years, see (16) and (26), i.e., they implement dynamic estimation of the parameters that has the advantage of avoiding overparameterization of the model.
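The typical shape of a Hoerl curve, a fast rise to a peak followed by an exponential decay, can be reproduced with a short sketch. It assumes the commonly used parametrization exp(a + b log(j) + c j) with development time j counted from one, b > 0 and c < 0. The exact form of (89) and the variants used in the cited works may differ, and the parameter values are purely illustrative.

```python
import numpy as np

def hoerl(j, a, b, c):
    """Assumed Hoerl-type curve: exp(a + b * log(j) + c * j) for j >= 1."""
    j = np.asarray(j, dtype=float)
    return np.exp(a + b * np.log(j) + c * j)

a, b, c = 5.0, 1.5, -0.5          # illustrative parameters with b > 0 and c < 0
j = np.arange(1, 11)              # observed development times
m = hoerl(j, a, b, c)

peak = j[np.argmax(m)]            # the continuous maximum is located at -b / c
tail = hoerl(15.0, a, b, c)       # extrapolation beyond the observed range

print(np.round(m, 1), peak, round(float(tail), 3))
```

Because development time enters as a continuous covariate, the curve can be evaluated at development times beyond those observed, which is the extrapolation property mentioned in the text.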
Since the evolution of incremental payments can be applied in a similar way to PPCI and claim closure rates,  () also use a parametric approach to model the evolution over the development years in a suitable way. For this purpose, however, they do not choose a variant of the Hoerl curve, but approaches similar to discounting. In particular,  () calculate the expected PPCI  and the expected claim closure rate  via
		
        for a given accident year  over the development years  (see (33), (36)).  () follow the modeling approach of the linear predictor for the PPCI according to (33) in the work of  () and adopt their approach for incremental payments (see (41)):
		
| () | |
| ( | 
| () | 
For the most part, the modeling approaches in these articles do not require any distributional assumptions. The only exceptions are  (), where the number of payments is assumed to be Poisson-distributed, and  (), where the noise terms and thus the observations are assumed to be EDF-distributed.
Considering the second category “Log-normal models for incremental payments”, all the models are based on explicit distributional assumptions, since the incremental payments are assumed to be log-normally distributed. The logarithmized incremental payments
		 in  are then specified via the log-normal model for incremental payments (also called the linear CL model, following ). In particular,  () and  () use the common basic model (see (44))
		
	 whereas  () and  () suggest a variant of this model that allows for variations in the column parameters across accident years,
			
        where the column parameters  may evolve according to (56). In addition to incremental payments,  () also incorporate claim counts, and therefore consider PPCF as claims data. In compliance with the approaches of the first category and also by utilizing state space models, the authors implement recursions for the model parameters to achieve dynamic estimation and to avoid the overparameterization of the model (see, e.g., (52)).
| (; ) | 
| (; ) | 
In contrast to the above approaches, there are other ways of modeling the claims data:  () (and to some extent also ) presents correlation models where correlations between accident, development or calendar years are considered (see (71)–(73)),  () and  () propose univariate state space models (see (74), (75) as well as (76), (77)), and  (),  (), and  () discuss row-wise stacking approaches for the claims data to get a univariate time series (see, e.g., the structural model (80)–(82)).
In particular,  () extends the model  (, ) for logarithmized individual development factors (64) from  () by including correlations of  across development years, accident years or calendar years (see (71)–(73)):
					
| () | |
In  (), it is proposed to model the incremental payments  in  as a function of the payments  of the respective accident year  by means of
					
        see (74). Thus, the total amount of claims incurred in accident year i that has been paid j years later is proportional to the claims incurred and paid in accident year i. This proportion varies randomly with i and j, which is why  () consider the AR(1) process , see (75). By applying this approach, the common assumption of independent accident years is not required.
| () | 
 () propose to model the observed cumulative payments  as a function of unobservable latent variables , . Against this backdrop, they presume the relationship
					
        according to (76), where  is additionally assumed to follow the recursion  (see (77)) that is implemented by using a state space model. The approach by  () therefore addresses potential observation errors in the claims data.
| () | 
The authors  () and  () discuss a structural model for incremental payments with a local level component , a stochastic periodic component  and a regression term ,
		
        see (80)–(82). This approach is inspired by the nature of the claims process: the level component accounts for the mean value of claims in each accident year, while the periodic component is intended to capture the development year effect. The regression term is mainly motivated by the need for intervention effects due to the presence of outliers. That is, the approach of  (), and hence also of  () and  (), differs from other proposals in that it is not directly based on claims data with the usual double indexing; instead, the claims data are modeled as a whole as a univariate time series. This allows the use of tools that are available for time series, and thus considerably expands the modeling spectrum including diagnostic checking and model selection criteria.
| (; ) | 
7.3. Modeling Approaches of State Space Representations
Most of the state space representations are based on the approach of a calendar year-based modeling, in which the claims data of the individual calendar years are stacked into separate observation vectors. Similar approaches are an accident year-based modeling (see ) or a development year-based modeling (see ) of the observation vectors. Beyond these most common approaches, there are univariate state space representations and state space models based on the row-wise stacking approach.
The popularity of the approaches that are aligned to the dimensions of claims development triangles (see Figure 11) is explained by the fact that they enable the modeling of effects related to accident, development or calendar years. Because of the relationship of calendar years  to accident years  and development years , it is clear that only two of these three directions (diagonal, vertical, horizontal) are “independent” of each other. While the vertical direction captures trends across accident years and the horizontal direction captures trends across development years, the diagonal direction reflects trends across calendar years (see Figure 12, left-hand side). The vertical and horizontal directions are orthogonal to each other, i.e., trends in one direction are not projected onto the other. However, the diagonal direction is not orthogonal to either of the other two directions, i.e., trends in calendar years are projected onto both the horizontal and vertical directions. Accordingly, diagonal or calendar year effects at a level of  are equivalent in their effect to a combined vertical and horizontal effect each at a level of  (see Figure 12, right-hand side). Calendar year effects include trends and structural breaks (e.g., due to extraordinary events such as floods, hurricanes, terrorist attacks, etc.), changes in the inflation rate, in individual case reserving, in the underwriting policy, in legislation, and organizational changes such as the implementation of new claims processing systems or the emergence of new phenomena (see, e.g., ).
      
    
Figure 11. Modeling approaches of the state space representations.
  
      
    
Figure 12. Trend properties of claims development triangles.
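The projection of calendar year trends onto the other two directions can be checked numerically. The following is a minimal sketch, assuming zero-based accident year index i, development year index j, calendar year index i + j, and a purely linear trend with a hypothetical slope `gamma`; the function name and all parameter values are illustrative and not taken from the reviewed articles.

```python
import numpy as np


def trend_surfaces(n, gamma):
    """Return linear trend surfaces on an n x n grid (illustrative only).

    Assumption: accident year i indexes rows, development year j indexes
    columns, and the calendar year index is i + j (all zero-based).
    """
    i, j = np.indices((n, n))
    calendar = gamma * (i + j)   # trend along the diagonals
    accident = gamma * i         # trend along the vertical direction
    development = gamma * j      # trend along the horizontal direction
    return calendar, accident, development


calendar, accident, development = trend_surfaces(n=5, gamma=0.03)

# A calendar year trend equals an accident year trend plus a development
# year trend of the same slope.
print(np.allclose(calendar, accident + development))

# An accident year trend is constant along each row, so it leaves no trace
# in the development direction (and vice versa).
print(np.allclose(np.diff(accident, axis=1), 0.0))
print(np.allclose(np.diff(development, axis=0), 0.0))
```

Under these assumptions, all three checks hold, which mirrors the statement that only two of the three directions can be treated as independent.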
  
Following the above explanations, an adequate embedding of calendar year effects into claims reserving models is essential. This also accounts for the fact that these approaches are the most widespread. Moreover, the calendar year-based approach can be justified as follows (see ):
- It corresponds to a natural modeling of the claims data, as annually added observations build up a new diagonal in the run-off triangle.
 - As for estimation and prediction, more recent observations should get a higher weight compared to past observations. The recursive and dynamic nature of the Kalman filter learning algorithms complies with this requirement, especially with respect to the calendar year-based approach.
 
In the following, an exemplary calendar year-based state space representation from the category “Log-normal models for incremental payments” is given. This state space representation is based on the linear CL model discussed by  () and can also be found in a similar form in the work of  () and  (). It consists of the observation equation
        
      
        
      
      
      
      
    
        corresponding to calendar year  that implies (44) for each  of calendar year t, and the state equation
        
      
        
      
      
      
      
    
        that allows dynamic estimation of the accident and development year parameters via (52).
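To illustrate how a calendar year-based representation of this kind can be processed recursively, the following minimal sketch implements one predict/update cycle of the standard Kalman filter for a generic linear Gaussian state space model. The notation (`F`, `G`, `V`, `W`, `m`, `C`) is an assumption made for this example and is not claimed to match the symbols of the equations above; the toy numbers are hypothetical.

```python
import numpy as np


def kalman_step(m, C, y, F, G, W, V):
    """One predict/update cycle of a linear Gaussian state space model.

    Assumed (hypothetical) notation:
        observation equation  y_t     = F_t theta_t + v_t,      v_t ~ N(0, V_t)
        state equation        theta_t = G_t theta_{t-1} + w_t,  w_t ~ N(0, W_t)
    m, C are the filtered mean and covariance of theta_{t-1}.
    """
    # Prediction of the state for calendar year t
    a = G @ m
    R = G @ C @ G.T + W
    # One-step-ahead forecast of the observation vector (one diagonal)
    f = F @ a
    Q = F @ R @ F.T + V
    # Kalman gain and update with the newly observed diagonal
    K = R @ F.T @ np.linalg.inv(Q)
    m_new = a + K @ (y - f)
    C_new = R - K @ Q @ K.T
    return m_new, C_new


# Toy example: one scalar state observed twice in the same calendar year,
# so the observation vector has length 2 while the state has length 1.
m0, C0 = np.array([0.0]), np.array([[1.0]])
F = np.array([[1.0], [1.0]])
G, W, V = np.eye(1), np.zeros((1, 1)), np.eye(2)
m1, C1 = kalman_step(m0, C0, np.array([2.0, 2.0]), F, G, W, V)
print(m1, C1)
```

Because `F`, `V` and `y` are passed anew in every step, the same function can be called with observation vectors of different lengths, which is what a calendar year-based stacking requires.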
However, the approaches shown in Figure 11 have the drawback that the dimensions of the vectors and matrices in the corresponding state space representations are time-variant. For the calendar year-based approach, this is because each new calendar year adds a complete diagonal to the run-off triangle, and each diagonal has one more observation than the previous one. Thus, the current calendar year has the most observations, after which the number of future observations in the lower triangle decreases with each further calendar year (when considering claims development triangles). Depending on the modeling (e.g., via a Hoerl curve or the log-normal model), these additional observations induce correspondingly growing state vectors, system matrices, hyper-parameters and noise terms. This can considerably complicate parameter estimation, practical handling, and the simultaneous treatment of multiple run-off triangles (see ).
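The time-variant dimension of the observation vectors can be made concrete with a small sketch. It assumes a square development array with zero-based accident year i, development year j and calendar year index t = i + j; the helper `diagonal_cells` is hypothetical.

```python
def diagonal_cells(n, t):
    """Cells (i, j) of an n x n development array that lie on calendar year t.

    Assumption: zero-based accident year i and development year j with
    calendar year index t = i + j.
    """
    return [(i, t - i) for i in range(n) if 0 <= t - i < n]


n = 4
# Number of cells per calendar year, first for the observed upper triangle
# (t = 0, ..., n - 1), then for the future lower triangle (t = n, ..., 2n - 2).
sizes = [len(diagonal_cells(n, t)) for t in range(2 * n - 1)]
print(sizes)  # [1, 2, 3, 4, 3, 2, 1]
```

The observation vector grows by one entry per calendar year up to the current diagonal and shrinks afterwards, so any system matrix tied to it has to change its dimension as well.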
The above drawbacks can be avoided by choosing state space models based on the row-wise stacking approach (; ; ), which provide a unified framework for handling different models. Further, as demonstrated by  (), the row-wise stacking approach makes it possible to incorporate claims activity dynamics and to model dependencies between correlated lines of business. It should also be noted that, although the row-wise stacking approach is not a calendar year-based modeling approach, calendar year effects can still be captured by adding a further component to the structural model.
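The basic data arrangement behind row-wise stacking can be sketched as follows. The sketch assumes that the rows of a square run-off array are concatenated into a single univariate series and that the not yet observed cells of the lower triangle are marked as missing values; the function `stack_rowwise` and the toy figures are hypothetical and only illustrate the ordering, not the models of the cited articles.

```python
import numpy as np


def stack_rowwise(triangle):
    """Concatenate the rows of an n x n run-off array into one series.

    Assumption: cell (i, j) is observed if i + j <= n - 1 (zero-based);
    all other cells are future values and are set to NaN.
    """
    tri = np.asarray(triangle, dtype=float)
    n = tri.shape[0]
    i, j = np.indices(tri.shape)
    observed = (i + j) <= (n - 1)
    return np.where(observed, tri, np.nan).ravel()


# Toy incremental payments; entries below the anti-diagonal are placeholders.
toy = [[10.0, 5.0, 2.0],
       [12.0, 6.0, 0.0],
       [11.0, 0.0, 0.0]]
series = stack_rowwise(toy)
print(series)
```

The resulting series has a fixed length of n squared, so the dimensions of a state space model built on it do not change over time, and the missing entries are the quantities to be predicted.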
There are a few articles in which a Bayesian approach is employed for estimation, as an alternative or in addition to the Kalman filter (see ; ; ). This is because the two approaches are closely related. As is well known, the Kalman filter is based on two basic ideas: first, using new information to update estimators based on previous observations; second, filtering, i.e., separating signals from noise. The first idea goes back to  (), who was the first to show how new observations can be used to update previous estimators. In the usual Bayesian approach, a posterior density is first generated from the prior density and the current observation, and this posterior density then serves as the prior density for the next step. This process is repeated sequentially for all upcoming observations (see, e.g., ). The particular benefit of Bayesian estimation is that it allows the practitioner/researcher to incorporate prior information from other sources (see, e.g., ). Following  (), the Bayesian approach also increases the computational flexibility, and MCMC sampling strategies can be used to generate samples for each posterior distribution of interest.
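The sequential prior-to-posterior mechanism can be illustrated in the simplest conjugate setting. The sketch below assumes a constant scalar parameter with a normal prior and normally distributed observations with known variance; the function names and numbers are hypothetical. In this special case, the Bayesian update has exactly the gain form of a Kalman filter update, and processing the observations one by one yields the same posterior as processing them all at once.

```python
def bayes_update(mean, var, y, obs_var):
    """Posterior of a normal mean after one observation y ~ N(theta, obs_var)."""
    gain = var / (var + obs_var)          # plays the role of the Kalman gain
    return mean + gain * (y - mean), (1.0 - gain) * var


def sequential_posterior(prior_mean, prior_var, observations, obs_var):
    """Use each posterior as the prior for the next observation."""
    m, c = prior_mean, prior_var
    for y in observations:
        m, c = bayes_update(m, c, y, obs_var)
    return m, c


def batch_posterior(prior_mean, prior_var, observations, obs_var):
    """Closed-form posterior using all observations at once."""
    n = len(observations)
    precision = 1.0 / prior_var + n / obs_var
    mean = (prior_mean / prior_var + sum(observations) / obs_var) / precision
    return mean, 1.0 / precision


obs = [2.0, 2.0, 3.5]
print(sequential_posterior(0.0, 1.0, obs, obs_var=1.0))
print(batch_posterior(0.0, 1.0, obs, obs_var=1.0))
```

Both calls return the same mean and variance up to floating-point rounding.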
Finally, it is worth mentioning that most of the state space representations considered in the articles of this review are linear state space models, i.e., they consist of a linear observation equation and a linear state equation. This directly implies linear system properties and a restriction to linear processes. An exception is given by  (), who consider a non-linear observation equation and EDF-distributed measurement noise, that is, a generalized linear model. This approach allows for any strictly monotonic and differentiable link function (e.g., the logarithm). However, linear system properties are not a fundamental drawback, as a non-linear system with differentiable system equations can be approximated by a linear system by linearizing the system equations around the current state estimate. This directly leads to the extended Kalman filter (see, e.g., ).
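The linearization idea can be sketched for a logarithmic link. The example below assumes an observation equation of the form y = exp(F theta) + v with additive Gaussian noise and performs one extended Kalman filter update by replacing the non-linear function with its first-order Taylor expansion around the predicted state. This is a simplified illustration under stated assumptions and not the estimation procedure of the cited work, which deals with EDF-distributed noise; all names are hypothetical.

```python
import numpy as np


def ekf_update_log_link(a, R, y, F, V):
    """One extended Kalman filter update for y = exp(F theta) + v.

    a, R: predicted state mean and covariance; V: covariance of v.
    """
    eta = F @ a                    # linear predictor
    h = np.exp(eta)                # non-linear observation mean at a
    J = h[:, None] * F             # Jacobian of exp(F theta) evaluated at a
    Q = J @ R @ J.T + V            # approximate forecast covariance
    K = R @ J.T @ np.linalg.inv(Q)
    m = a + K @ (y - h)            # update with the linearized model
    C = R - K @ Q @ K.T
    return m, C


a = np.array([0.0])
R = np.array([[1.0]])
F = np.array([[1.0]])
V = np.array([[1.0]])
m, C = ekf_update_log_link(a, R, np.array([2.0]), F, V)
print(m, C)
```

The update uses the same gain formulas as the linear Kalman filter, with the Jacobian `J` taking the place of the observation matrix.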
7.4. Insights from Practical Applications
In the following, some selected implications of empirical applications discussed in the above papers are given in chronological order:
- () present a simple illustrative example based on a data set from a UK general insurance company (1970–1974), where volume and inflation indices are also available. They give estimated states for the observations of the upper triangle and predicted future incremental payments of the lower triangle. () conclude that the results confirm the regular nature of the data and therefore the appropriateness of the “constant” transition model for according to (16). Further, the projected future incremental payments decline smoothly to zero with increasing delay due to the Hoerl curve approach (5).
 - () performs comprehensive practical applications using the benchmark data set from () that includes data from the motor bodily injury class of business in one Australian state (1972–1981). In particular, he compares static models with recursive Bayesian estimation and dynamic models, where row and column parameters are estimated dynamically. The results show that the Kalman filter and empirical Bayes methods outperform the OLS (i.e., uninformative prior) approach: the estimates of row (and column) parameters are smoother and the standard errors are lower. This is due to the fact that more information is used for parameter estimation.
 - () considers the data set from () for an illustrative example and emphasizes that comprehensive examples covering all possibilities are not feasible. In particular, () focuses solely on the development parameters and shows that the proposed model allows them to evolve over time.
 - The modeling approaches in the work of () are motivated by their RBNS data set from a major Greek motor insurance company. The data are characterized by claims that are reported within three working days according to Greek legislation and are usually settled by a one-off payment. By comparing the predictive performance of the proposed models, () state that the predictive ability of models 1 and 2 seems to be better compared to models 3 and 4 for the considered data set.
 - As for the accident year-based approach, () discuss a practical application based on a workers’ compensation portfolio, in which benefits are dominated by payments of weekly compensation. The data show a strong upward movement of the PPCI at the beginning and a steady slow decrease in later years. Based on this evolution, () choose a logarithmic link function and a gamma distribution for the measurement noise. As for the calendar year-based approach, they use motor vehicle bodily injury data from (). The claim closure rates are relatively flat over the development years, but there are shocks that tend to affect whole calendar years. The filtered results follow the data closely at their general level, that is, there is minor smoothing of the calendar year effects but considerable smoothing across development years.
 - () discuss two application examples based on real data sets: paid claims from the motor branch of a Portuguese insurance company (1984–1996) and the data set from (). The authors compare various claims reserving methods and conclude that Hoerl curve approaches lead to the largest MSEP of the claims reserves. Further, they suggest that the log-normal transformation of the data results in larger values of the MSEP, and therefore, the original observations should be used unless there is strong evidence of log-normally distributed data. For both data sets, however, the state space model proposed by () leads to reserves with the smallest MSEP.
 - () performs a case study for the development correlation model using a data set from the Historical Loss Development Study that includes cumulative payments related to Automatic Facultative General (AFG) liability (1981–1990). In the first step, he applies the model of () to the AFG data and concludes that it is not suitable to adequately represent the data, mainly due to remaining (negative) correlations in the standardized residuals regarding the development years zero and one. For this reason, () uses the development correlation model (71) in the second step, which considers the correlation between the first two development years. Then, the residuals no longer contain any correlations and the correlation between the first two development years can be explained via the development correlation model.
 - () also use the AFG data set and especially discuss three results of their analysis regarding the row-wise stacking approach. First, it provides computational feasibility and efficiency. Second, the accuracy of the reserve prediction is increased. Third, the approach is flexible with respect to IBNR modeling possibilities. As a particularly interesting aspect, they highlight that blocks and cumulating methods yield the same numerical results.
 - () provide a comparison of various claims reserving methods with state space representations (; ; ; ) and popular methods such as CL, Bornhuetter–Ferguson (BF) and overdispersed Poisson using the data set from (). Considering the claims reserves, their MSEP and the coefficient of variation, no model can be identified that provides the best or the worst results for the given data set.
 - () present a practical example based on the data set from () and compare their extended row-wise stacking approach with a modified CL approach and heteroskedastic regression models. For the given data set, their proposed method outperforms the three competitors with respect to IBNR reserve prediction. In particular, by applying the competitors, the insurance company might overestimate the claims reserves (thus leading to overpriced insurance contracts). On the other hand, employing the original approach by () would lead to underestimated reserves.
 - The most comprehensive empirical comparison of various state space models is conducted by (), who consider five data sets, including data sets from (), from a Belgian insurance industry, and the data set from (). They compare their introduced models with the models proposed by (), (), and () as well as CL and BF methods. Following (), their presented state space models are adequate for routine actuarial situations. Further, they give information about the distribution of the predicted claims reserves.
 
The empirical application examples are clearly heterogeneous: they often show only facets of the presented methods, and the results are not consistently compared with other methods. There is no empirical comparison of different state space models that includes, even approximately, all methods introduced up to now; the most comprehensive empirical comparisons can be found in the works of  (),  (),  (), and  (). However, it is also evident that a larger-scale empirical comparison of all the models presented is subject to narrow limits. This is due to several factors, such as different objectives, different claims data or the inclusion of additional information. Since the run-off data are often closely integrated in the model building and the objectives in the articles sometimes differ considerably (see Section 7.1), it is not possible to perform an empirical comparison of all the models that would do them justice. Otherwise, models would be applied to claims data and objectives for which they were not constructed. Moreover, some models require the incorporation of further information, such as inflation or volume indices, the availability of which cannot generally be assured (and which, in the case of the benchmark data set from , are not available), but the omission of which would counteract the idea behind model building. Likewise, no recommendation can be formulated as to which model is best suited for actuarial practice. The decision for a specific model depends on numerous factors and should mainly rely on the verification of the model assumptions on the underlying data.
8. Conclusions
In this paper, we have provided a comprehensive review on the topic of stochastic claims reserving methods with state space representations. We have identified 16 relevant articles in this field and grouped them into five categories considering their key content similarities. Most of the articles fall into categories “Parametric evolution” (#5) and “Log-normal models” (#4), but there are also articles devoted to “Correlation models” (#2), “Univariate models” (#2), and “Row-wise stacking” (#3). Moreover, models for incremental payments (#12) and the calendar year-based state space modeling approach (#8) are the most prevalent.
Our main intentions were to identify where state space models have been used for improving stochastic claims reserving and to consolidate the topic in order to aid new researchers in this area. With these objectives in mind, we have structured and categorized the relevant articles. Ideally, this sound basis will assist researchers currently focused on state space models in stochastic claims reserving and lead to fruitful future research in this area.
As for promising directions for future research in the field of stochastic claims reserving based on state space models, we mainly suggest conducting micro-level claims reserving and implementing non-linear systems (see  ()). Moreover, both within and beyond state space models, we would like to emphasize the use of granular models as well as of machine learning and soft computing techniques in future research projects. Although models based on aggregate data are widely used, especially in actuarial practice, they are often characterized by rather simple model assumptions that are inadequate for the underlying data. Thus, there is a need for more flexible models that are able to deal appropriately with data for which the common model assumptions are violated (see  ()).
Author Contributions
Conceptualization, N.C. and A.J.; methodology, N.C. and A.J.; formal analysis, N.C. and A.J.; investigation, N.C. and A.J.; writing—original draft preparation, A.J.; writing—review and editing, N.C.; project administration, A.J. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Acknowledgments
The authors would like to thank both anonymous reviewers for their valuable feedback and suggestions, which were helpful in further improving this paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Alpuim, Teresa, and Isabel Ribeiro. 2003. A State Space Model for Run-Off Triangles. Applied Stochastic Models in Business and Industry 19: 105–20.
- Atherino, Rodrigo, Adrian Pizzinga, and Cristiano Fernandes. 2010. A row-wise Stacking of the Runoff Triangle: State Space Alternatives for IBNR Reserve Prediction. ASTIN Bulletin 40: 917–46.
- Barker, Allen L., Donald E. Brown, and Worthy N. Martin. 1995. Bayesian estimation and the Kalman filter. Computers & Mathematics with Applications 30: 55–77.
- Bayes, Thomas. 1763. Essay towards solving a problem in the doctrine of chances. Biometrika 45: 293–315.
- Chukhrova, Nataliya, and Arne Johannssen. 2017. State Space Models and the Kalman-Filter in Stochastic Claims Reserving: Forecasting, Filtering and Smoothing. Risks 5: 30.
- Chukhrova, Nataliya, and Arne Johannssen. 2021. Kalman Filter Learning Algorithms and State Space Representations for Stochastic Claims Reserving. Risks 9: 112.
- Costa, Leonardo, and Adrian Pizzinga. 2020. State-space models for predicting IBNR reserve in row-wise ordered runoff triangles: Calendar year IBNR reserves & tail effects. Journal of Forecasting 39: 438–48.
- De Jong, Piet, and Ben Zehnwirth. 1983. Claims Reserving, State-Space Models and the Kalman Filter. Journal of the Institute of Actuaries 110: 157–81.
- De Jong, Piet. 2004. Forecasting General Insurance Liabilities. Research Paper No. 2004/03. Sydney: Division of Economic and Financial Studies, Macquarie University.
- De Jong, Piet. 2005. State Space Models in Actuarial Science. Paper presented at the Second Brazilian Conference on Statistical Modelling in Insurance, Institute of Mathematics and Statistics, University of São Paulo, Maresias, Brazil, August 28–September 3.
- De Jong, Piet. 2006. Forecasting Runoff Triangles. North American Actuarial Journal 10: 28–38.
- England, Peter D., and Richard J. Verrall. 2002. Stochastic Claims Reserving in General Insurance. British Actuarial Journal 8: 443–518.
- Hendrych, Radek, and Tomas Cipra. 2021. Applying State Space Models to Stochastic Claims Reserving. ASTIN Bulletin 51: 267–301.
- Hertig, Joakim. 1985. A Statistical Approach to IBNR-Reserves in Marine Reinsurance. ASTIN Bulletin 15: 171–83.
- Johannssen, Arne. 2016. Stochastische Schadenreservierung unter Verwendung von Zustandsraummodellen und des Kalman-Filters. Hamburg: Dr. Kovac.
- Julier, Simon J., and Jeffrey K. Uhlmann. 2004. Unscented filtering and nonlinear estimation. Proceedings of the IEEE 92: 401–22.
- Kaas, Rob, Marc Goovaerts, Jan Dhaene, and Michel Denuit. 2009. Modern Actuarial Risk Theory—Using R, 2nd ed. Berlin: Springer.
- Kremer, Erhard. 1982. IBNR-Claims and the Two-Way Model of ANOVA. Scandinavian Actuarial Journal 1982: 47–55.
- Li, Jackie. 2006. Comparison of Stochastic Reserving Methods. Australian Actuarial Journal 12: 489–569.
- Ntzoufras, Ioannis, and Petros Dellaportas. 2002. Bayesian Modelling of Outstanding Liabilities incorporating Claim Count Uncertainty. North American Actuarial Journal 6: 113–28.
- Pang, Liyan, and Siqi He. 2012. The Application of State-Space Model in Outstanding Claims Reserve. Paper presented at the 2012 International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII), Sanya, China, October 20–21; pp. 271–74.
- Taylor, Greg C. 2000. Loss Reserving: An Actuarial Perspective. Boston: Kluwer Academic Publishers.
- Taylor, Greg C. 2019. Loss Reserving Models: Granular and Machine Learning Forms. Risks 7: 82.
- Taylor, Greg C., and Frank R. Ashe. 1983. Second Moments of Estimates of Outstanding Claims. Journal of Econometrics 23: 37–61.
- Taylor, Greg C., Gráinne McGuire, and Alan Greenfield. 2003. Loss Reserving: Past, Present and Future. Research Paper No. 109. Melbourne: University of Melbourne.
- Verrall, Richard J. 1989. A State Space Representation of the Chain Ladder Linear Model. Journal of the Institute of Actuaries 116: 589–610.
- Verrall, Richard J. 1991. Chain Ladder and Maximum Likelihood. Journal of the Institute of Actuaries 118: 489–99.
- Verrall, Richard J. 1994. A Method for Modelling Varying Run-off Evolutions in Claims Reserving. ASTIN Bulletin 24: 325–32.
- Verrall, Richard J. 2004. Kalman Filter, Reserving Methods. In Encyclopedia of Actuarial Science. Edited by Jozef L. Teugels and Bjørn Sundt. Chichester: John Wiley & Sons, vol. 1, pp. 952–55.
- Wright, Thomas S. 1990. A Stochastic Method for Claims Reserving in General Insurance. Journal of the Institute of Actuaries 117: 677–731.
- Wüthrich, Mario V., and Michael Merz. 2008. Stochastic Claims Reserving Methods in Insurance. Chichester: John Wiley & Sons.
- Zehnwirth, Ben. 1997. Kalman Filters with Applications to Loss Reserving. Working Paper.
 
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).