Abstract
In this paper, we consider a loss reserving model for a general insurance portfolio consisting of a number of correlated run-off triangles that can be embedded within the quantile regression model for longitudinal data. The model proposes a combination of the between- and within-subportfolios (run-off triangles) estimating functions for regression parameter estimation, which take into account the correlation and variation of the run-off triangles. The proposed method is robust to the error correlation structure, improves the efficiency of parameter estimators, and is useful for the estimation of the reserve risk margin and value at risk (VaR) in actuarial and finance applications.
1. Introduction
The protection of the policyholders and the financial stability of the insurance market industry is a crucial aspect and regulatory authorities intervene to ensure it. Based on Solvency II and IFRS Phase II regulations, each insurance or reinsurance company is obliged to evaluate its insurance liabilities on a risk-adjusted basis to allow for uncertainty in cash flows that arises from the liability of the insurance contracts. Australian Prudential Regulation Authority (APRA) requires estimating a 75th percentile of the distribution of outstanding claims for recording in profit and loss statements and the risk margin should be established on a basis that is intended to secure the insurance liabilities of the insurance company at a given level of sufficiency ).
In recent years, quantile regression has become a very popular methodology that has incorporated several new reforms in insurance and finance. The least squares estimators investigate only changes in the mean when the entire shape of the claims distribution may change dramatically. Quantile regression characterizes any particular point of a distribution and thus provides a more complete description of the distribution in comparison to linear regression. Quantile regression techniques can differentiate risk factors that lead to high level claims from those that lead to low level claims. Quantile regression estimation may be more efficient than the ordinary least squares when the distribution is not normal. Furthermore, quantile regression is more robust against outliers and does not require the specification of any error distribution. Therefore, quantile regression may be more appropriate than least squares estimation in the context of the insurance industry (see Buchinsky 1998; Koenker 2005).
In the actuarial literature, few papers deal with quantiles. Pitt (2006) used censored regression quantiles to analyze claim termination rates at different quantiles of the distribution of claim duration for income protection insurance. Chan et al. (2008) proposed a robust Bayesian analysis of loss reserves data using the generalized -distribution. Dong et al. (2015) presented in detail the use of parametric and nonparametric quantile regression in non-life applications. Nevertheless, the above approaches have been used for univariate quantile regression models and are suitable for a single line of business (single run-off triangle).
In this paper, we consider a quantile regression application in a multivariate context alternative to a multivariate Chain Ladder model for a portfolio of within and between correlated run-off triangles. For a dependent line of business, it is difficult to describe and specify the underlying correlation structure, which may prevent insurance practitioners from using the quantile regression. We propose a reserving problem for a non-life insurance portfolio consisting of several run-off subportfolios corresponding to different lines of business that can be embedded within the quantile regression longitudinal model and can provide solutions to the estimation of more extreme and capital margins.
The remainder of this paper is structured as follows. Section 2 presents the types of dependence (correlation) modeling in loss reserving triangles. In Section 3, we present brief introductions of quantile functions and quantile regression. Section 4 illustrates how correlated run-off triangles can be embedded in a quantile regression longitudinal model. In Section 5, we present a numerical implementation of the longitudinal quantile regression model with two run-off triangles. Section 6 provides the calculation of the risk margin (RM) based on the solvency capital requirement (SCR) estimation and according to the cost-of-capital (CoC) methodology. Finally, in Section 7, some concluding remarks are presented.
2. Dependence (Correlation) Modeling in Loss Run-Off Triangles
In this section, we highlight three types of dependence structures that may appear in loss reserving estimation. In many stochastic claims reserving models, in a single line of business, it is assumed that accident years are independent. In practice, it is evident that different accident years are not independent because of accounting year effects, claims inflation, and other exogenous factors that affect all accident years simultaneously. As pointed out by Ajne (1994) and many other researchers, when dealing with a portfolio with several lines of business, the Chain Ladder predictors for the whole portfolio differ from the sums of the Chain Ladder predictors for the different individual lines of business, because the dependence structure between the subportfolios of a portfolio is not taken into consideration. Dependence modeling was also discussed by Merz et al. (2012).
2.1. Correlation Within Claims Reserving Triangles
Barnett and Zehnwirth (2000) studied calendar year effects within probabilistic trend family models, which allowed calculating distributions of, and correlations between, future payment streams. De Jong (2012), based on time series models, investigated the correlation between accident and calendar years within triangles. In this spirit, Wuthrich (2010) presented a Bayesian inference approach that allows for the study of accounting (calendar) year effects. By restricting to the lognormal approach to chain-ladder-type models, Kuang et al. (2011) proposed a method suitable for cases where there is a sudden change in the economic environment affecting the policies for all accident years in the reserving triangle. Pešta and Okhrin (2014) proposed a generalized time series model that allows for modeling the conditional mean and variance of the claim amounts, for the claims development. They used a copula framework to incorporate modeling dependencies within the loss triangles.
2.2. Correlation Between Claims Reserving Triangles
The dependence structure between lines of business (run-off triangles) that is related to the accurate estimation of risk’s diversification is very important to the solvency capital requirement (SCR) the company should hold in order to remain healthy and avoid holding unnecessarily high levels of capital (see Avanzi et al. (2016b)).
Clark (2006), by using a times series model, estimated the correlation between future payments, due to inflation, in two or more loss reserving run-off triangles, concluding that the payments move with inflation and the variability due to inflation can be related to economic forecast models. When companies have several lines of business, it is useful to examine the presence of possible dependencies within the structure of the company and investigate the financial effect of these dependencies. Barnett and Zehnwirth (2000) proposed a model that allows dependency between all the observations that belong to the same calendar year for each line of business. According to Braun (2004), the correlation between run-off triangles can be attributed to the claims inflation affecting all or most of the run-offs of a portfolio in a similar way. Shi and Frees (2011) used copulas to model the association among multiple run-off triangles and Abdallah et al. (2015) assumed a dependence structure that links the calendar years of different lines of business. Zehnwirth and Barnett (2001) considered a more complicated situation of correlated loss triangles. Braun (2004) extended the distribution-free method of Mack (1993) to estimate the prediction error of the Chain Ladder method for a portfolio of several correlated run-off triangles. Pröhl and Schmidt (2005) proposed a multivariate Chain Ladder method that is suitable for a portfolio consisting of several subportfolios with a certain dependence structure.
By using the theory of linear mixed models, Antonio and Beirlant (2006) built a flexible loss reserving model in the framework of longitudinal data. Quarg and Mack (2004) proposed a bivariate Chain Ladder predictor for the paid and incurred aggregate claims of the same portfolio aiming to reduce the gap between the univariate Chain Ladder predictors for the paid and incurred aggregate claims of the same portfolio. Taylor and Mcguire (2005, 2007) considered the claims reserving problem in a multivariate context and generalized linear models were used to estimate loss reserves of several stochastically dependent lines of business, individually and in aggregate. Schmidt (2006) provided a review on recent multivariate models and methods of loss reserving. Merz and Wüthrich (2008a, 2008b) considered the multivariate Chain Ladder method for a portfolio of correlated run-off triangles based on multivariate age-to-age factors and derived an estimator for the mean square error of prediction for the Chain Ladder predictor of the ultimate claim of the total portfolio. Shi and Frees (2011) demonstrated the role of dependencies in the aggregation of claims from multiple run-off triangles, proposing a copula regression model for the prediction of unpaid losses for dependent lines of business. Zhang (2010) presented a general multivariate stochastic reserving model by using the seemingly unrelated regression technique. This model not only specifies contemporaneous correlation, but also allows structural connections among triangles. Hudecová and Pešta (2013) proposed the application of generalized estimating equations (GEE) for the estimation of claims reserves, where claim triangles are handled as panel data, assuming dependent claim amounts within the same accident year. De Jong (2012) developed and applied a model for loss triangles that facilitates the structuring and measurement of dependence between loss triangles. Bermudez et al. (2013), by using linear regression techniques and copulas, and assuming several dependence structures between lines of business, estimated the risk-based capital reserve for the economic capital requirements under the Standard Model and the Internal Model approach. Their sensitivity analysis on the correlation matrix assumptions between lines of business showed that modifications of the correlation and dependence assumptions have a significant impact on the solvency capital requirement (SCR) estimation, under Solvency II regulations. In insurance applications, normality assumptions may be misleading as a measure of dependency in the tails of the variables. The impact of loss triangle dependence on risk margins was also considered by Hubert et al. (2017), who proposed the FastSUR algorithm, in order to robustify the general multivariate Chain Ladder method of Zhang (2010), where the parameters were estimated using seemingly unrelated regression(SUR). Based on MM-estimators, Peremans et al. (2018) proposed a robust alternative that estimates the SUR parameters in a more outlier resistant way.
Avanzi et al. (2016a, 2016b) proposed a multivariate Tweedie approach to capture cell-wise dependence and the dependency between business segments in the non-life insurance industry, respectively, in loss reserving. In addition, Avanzi et al. (2018) constructed a broad and flexible family of models, where dependency is induced by common shock components. For an extensive analysis of loss reserving techniques, the reader may refer to the books of Taylor (2000), Wuthrich and Merz (2008) and Radtke et al. (2012).
2.3. Correlation Within and Between Claims Reserving Triangles
In actuarial applications, an insurance portfolio is subdivided into several homogeneous lines of business (subportfolios). Losses from a line of business can be viewed as a financial risk. Social and economic environment may affect several lines of business of an insurance portfolio, simultaneously. Thus, we may consider that, in addition to within correlation, insurance lines of business (run-off triangles) are related to each other due to the calendar effects. Zhang et al. (2012) proposed a Bayesian non-linear hierarchical model, where data from individual companies are treated as repeated measurements of various run-off triangles of claims, thus respecting the correlation between successive observations. Shi et al. (2012) examined calendar year effects in a multivariate loss-reserving context through a log-normal model. They used random effects to accommodate the correlation due to accounting year effects within and between run-off triangles. This specification is in line with De Jong (2012), who introduced the calendar year effects through the correlation matrix. In the spirit of Shi et al. (2012), Merz and Wüthrich (2008a) defined a multivariate log-normal model that allows modeling both dependence between different run-off triangles and dependence within run-off triangles, such as claims inflation.
2.4. Why Quantile Regression Models with Correlated Run-off Triangles?
In the insurance practice, some regulation rules indicate that some changes over time occurred across the claim distribution. Therefore, it is very important to investigate these changes at different points of the distributions. For example, Australian insurance regulations require that a risk margin should be established at 75% percentile of the discounted value less than the best estimate (see Pitt (2006)). Most studies consider only correlation as a measure of dependency, focusing on reserving at mid-range . Consideration of capital margins at more extreme opens up the question of tail dependency, and a whole new field of exploration (see Avanzi et al. (2016b)).
Each quantile regression characterizes a particular point of a distribution, and thus provides more complete description of the distribution, taking into account the correlations in the tail of distributions. Furthermore, quantile regression is more robust against outliers and does not require specifying any error distribution (see Fu and Wang (2012)).
One implication of our model is the diversification effect of a portfolio of reserve risks and can be used as a risk measure with applications in actuarial science. Practically, our quantile approach leads to a provision of a specified probability, say 80%, sufficient to cover the run-off claims. Adding the necessary margin to the central estimate, the evaluation of the claims liability is provided and the provision is sufficient to cover the future liabilities.
3. Preliminaries on Quantile Functions and on Quantile Regression
Here, we provide some preliminaries on quantile regression that are needed below for loss reserving estimation.
3.1. Quantile Function
For a random variable with cumulative distribution function , the th quantile of is defined as the inverse function
where . In case that is a strictly increasing and continuous probability distribution function, is the unique real number such that (Gilchrist 2000).
Quantiles are connected with operations of ordering the sample observations that are used to define them. For a random sample of , the general th sample quantile may be formulated as the solution of the optimization problem
and denotes the indicator function. This loss function is an asymmetric absolute loss function because it is a weighted sum of absolute deviations, where the weight is assigned to the negative deviations while the weight is assigned to the positive deviations.
3.2. Quantile Regression Estimation
In the regression case, we assume a sample , , where is the dependent variable, is a vector of explanatory variables, and is a vector of coefficients. The general linear model has the form
while the th conditional quantile of given can be written as (see Koenker and Basset 1982)
We consider the th sample quantile . Mosteller proved the limiting normality of that provides a realization of the least estimation of the form
where is a vector to be estimated and is the error term. The linear conditional quantile function, , can be estimated by solving
where is already defined in Equation (2) and is the indicator function for any quantile . The case , which minimizes the sum of absolute residuals, corresponds to median regression, which is also known as regression. The minimization of Equation (6) was produced by Koenker and D’Orey (1994).
Under certain conditions, for independent observations, the asymptotic variances for can be obtained as (see Buchinsky 1998)
and the covariance matrix of is
With quantile regression, we can show how various financial characteristics are different at different quantiles. Thus, the quantile regression method allows the marginal effects to change for claims at different points in the conditional distribution by estimating using several different values of , . This means that the quantile regression allows for parameter heterogeneity across different types of claims.
5. Numerical Illustrations
In this section, we present a numerical implementation of longitudinal quantile regression model with two correlated run-off triangles. The codes of this paper were implemented in R, using own routines, the “ChainLadder" (Gesmann et al. 2018) and the “quantreg" (Koenker 2018) packages.
5.1. Numerical Example Based on Average Premium Per Exposure
In this section, based on average premium per exposure, the loss reserving model is implemented by using the longitudinal quantile regression model. We suppose that we have two blocks of business for which we are trying to calculate reserve indications. Both companies operate in Greece. Company A mainly focuses on motor business and underwrites all vehicle categories apart from taxis and trucks while Company B underwrites all vehicle categories for motor business. Table 3 and Table 4 show the triangles of the incremental incurred claims (paid and outstanding claims) for both companies.
Table 3.
Motor triangle and premiums for Company A.
Table 4.
Motor triangle and premiums for Company B.
In the sequel, we apply the following regression setting in a quantile longitudinal form,
where, for the triangle, is the average premium per exposure of the r accident year and of the j development year, is the overall mean, is the effect of the r accident year, is the effect of the j development year, is the error term, and the design matrix is of dimension .
It is obvious that, for Company A, for the accident year 2007, a big claim has been paid 10 years after the accident date (the amount of this claim is embedded at the total incremental amount of 10,190) and could be represented as an outlier claim. This claim will dramatically change the development pattern of the payments in case of using the Chain Ladder method because the loss development factor for this year increases from 1.0117 to 1.0171 (see Table 5). This is commonly observed in motor business where claims are settled many years later, especially when accidents with large compensations (such as partial or total disability, deaths, etc.) are observed. This can also be observed in accident year 2008 where a large amount is observed at the last known development year (13,809). We have a similar situation for the run-off triangle for Company B, but not to that extent as for Company A. For Company B, the data of the triangle seem to be more stable. Moreover, the premiums of the companies by accident year are presented at the run-off triangles. For the implementation of the dataset, we divided the payments by exposures for each year of business before the analysis was carried out.
Table 5.
Chain ladder: loss development factors.
The number of exposures (counts of incurred losses) for each accident year is also provided (see Table 6 and Table 7) for each lines of business.
Table 6.
Counts of incurred claims for Company A.
Table 7.
Counts of incurred claims for Company B.
If we were trying to calculate the expected value of the reserve run-off, we could simply calculate the expected value for each line of business separately and add all the expectations together. However, when we quantify a value other than the mean, such as a quantile, we cannot simply sum across the lines of business. In such a case, we would overstate the aggregate reserve need.
Remark 3.
The only time the sum of a th quantile would be appropriate for the aggregate reserve indication is when all the lines of business are fully correlated with each other which is of course a highly unlikely situation.
Figure 1 and Figure 2 show claims development charts of Companies A and B, respectively, with individual panels for each origin period. Chain Ladder loss development factors for each company are also presented in Table 5. According to the claims development chart, we observe that the patterns of Companies A and B appear similar (see Figure 3).
Figure 1.
Claims development chart of Company A with individual panels for each origin period.
Figure 2.
Claims development chart of Company B with individual panels for each origin period.
Figure 3.
Claims development chart of the triangles with one line per origin period.
Table 8 and Table 9 display the values of reserves and ultimate paid claims based on individual quantile regression method, for Companies A and B, respectively, for different quantiles. Loss ratios for each quantile are also provided at the end of each of the Tables. Table 10 and Table 11 display the values of reserves and ultimate paid claims based on longitudinal quantile regression method, for Companies A and B, respectively, for different quantiles.
Table 8.
Reserves and ultimate claims of Company A based on Individual Quantile Regression.
Table 9.
Reserves and ultimate claims of Company B based on Individual Quantile Regression.
Table 10.
Reserves and ultimate claims of Company A based on Longitudinal Quantile Regression.
Table 11.
Reserves and ultimate claims of Company B based on Longitudinal Quantile Regression.
Loss ratios () for motor car insurance typically range from 40% to 60%. In this case, insurance companies are collecting more premiums than the amount paid in claims. Loss ratio is considered as one of the tools which explains a company’s suitability for coverage. A high loss ratio means is considered bad, which leads to bad financial health because the insurance company may not collect enough premiums to pay claims and expenses while also making a reasonable profit.
Table 12 provides the values of reserves based on the individual quantile regression (IQR) and based on longitudinal quantile regression (LQR). To examine the role of dependence, it is important to calculate the reserves for each IQR, and then use the sum to compare it with the sum of the run-off triangles resulting from the LQR (last line of Table 12).
Table 12.
Estimated reserves using Individual Quantile Regression (IQR) and the Longitudinal Quantile Regression (LQR).
Applying individual quantile regression, a higher quantile leads to larger total reserve. Nevertheless, for Company A, quantiles over 95% provide equal values of reserves, while, for Company B, quantiles over 90% provide equal values of reserves. The longitudinal algorithm gives different estimations for each quantile. Applying longitudinal quantile regression, the estimated ultimate reserves for both Companies A and B are smaller than the sum of individual estimated reserves for each of Companies A and B based on individual quantile regression.
5.2. Comparison Criteria
For model comparison, four criteria, namely the root mean squared error (RMSE), the percentage total (PT), the mean absolute error (MAE), and the mean absolute percentage error (MAPE), were used.
RMSE is a measure of accuracy and is useful for comparing different models for a particular dataset (Hyndman and Koehler 2006). RMSE is the square root of the average of squared errors. The effect of each error on RMSE is proportional to the size of the squared error. Thus, larger errors have a disproportionately large effect on RMSE. Consequently, RMSE is sensitive to outliers (Pontius et al 2008; Willmott and Matsuura 2006).
RMSE for one triangle and for many run-off triangles is given, respectively, by
where represents the total number of the known incremental data (the left upper triangle) and is the counter for each triangle.
The percentage total (PT) was also a comparison criterion, which is defined for one and for many triangles, respectively, as
RMSE and PT measure the model-fit with respect to observations, where a PT value closer to 100 is accepted, while, for RMSE, we prefer the smallest values.
According to the comparison criteria, the longitudinal algorithm provides the smaller RMSE when using the 75% quantile, resulting to better fit of the data. Thus, a combination of different companies or lines of business provides a better estimation of the total reserve. In case of using the PT criterion, we take exactly the same results and the 75% quantile produces the best fit. If we make estimations separately, the suggested models for both triangles use quantiles below 75% which means weak prudence (Table 13).
Table 13.
RMSE and PT for Individual and Longitudinal Quantile Regression.
The MAE calculates the average amount of the errors by computing the absolute differences between prediction and actual observation divided by the total number of the observations. The lower the value of MAE is, the better the model fits.
Finally, the MAPE is the average of absolute percentage errors. MAPE has the significant disadvantage of producing infinite or undefined values when the actual values are zero or close to zero (Kim and Kim 2016). If the actual values are very small, then MAPE yields extremely large percentage errors (outliers).
The MAE criterion for single run-off triangle and the total MAE for run-off triangles is calculated, respectively, by
The MAPE criterion for single run-off triangle and the total MAPE for run-off triangles, is given, respectively, by
For the MAPE and MAE criteria, the smallest values indicate the best fit. The computations of the values for each criterion are given in Table 14. According to the MAE criterion, the 60% quantile is the suggested value to be used, while, based on MAPE, the 50% quantile is the most appropriate. Nevertheless, the difference between the MAE values for 50% and 60% is not so big, which means that 60% could also be a good choice.
Table 14.
MAE and MAPE for Individual and Longitudinal Quantile Regression.
Figure 4 and Figure 5 display the reserve estimation for each accident year using the individual quantile regression and the longitudinal quantile regression. Each plot provides the reserve values in different quantiles for each accident year.
Figure 4.
Reserves estimation for individual QR and longitudinal QR (Company A).
Figure 5.
Reserves estimation for individual QR and longitudinal QR (Company B).
Figure 6 illustrates the values of the ultimate reserves, for different quantiles, based on the individual and the longitudinal quantile regression models, in comparison to the ultimate reserves based on the Chain Ladder method for Companies A and B. The ultimate reserves based on the Chain Ladder estimation for Company A is 1,624,721 and for Company B is 1,901,883. More specifically, for Company A, the Chain Ladder reserve value coincides with the individual quantile regression reserve value at 50-quantile, and with the longitudinal quantile regression reserve value around the 80-quantile. For Company B, the Chain Ladder reserve value coincides with the individual quantile regression reserve value around 50-quantile, and with the longitudinal quantile regression reserve value around the 60-quantile.
Figure 6.
Ultimate reserves for individual QR and longitudinal QR.
6. Risk Capital Requirement and Risk Margin
Solvency II and IFRS bring some significant changes, particularly in relation to the estimation of insurance liabilities. Generally, the Probability of Sufficiency is a measure of solvency in liability valuation (DalMoro and Krvavych 2017):
- Probability of sufficiency below 50% indicates that the technical provisions are set below the central estimate, which leads to an under-reserved position.
- Probability of sufficiency with values between 50% and 60% indicates that the technical provisions are approximately at the level of central estimate, which leads to weak prudence.
- For values of probability of sufficiency around 75%, the technical provisions are above the central estimate, which leads to adequate prudence.
- Finally, if the probability of sufficiency is above 75%, the technical provisions are enough to lead to strong prudence.
Risk Margin
Based on Solvency II directive, risk margin represents the potential costs to transfer the insurance obligations to a third party and it is calculated based on the cost-of-capital (CoC) method. Solvency II considers risk over a one-year time horizon.
According to this approach, when the capital requirements for each future year are given, the risk margin is equal to the sum of the discounted costs of capital, which are the capital requirements multiplied by the cost-of-capital rate (6%). The risk margin (RM) is exclusively based on the solvency capital requirement (SCR) estimation and according to the cost-of-capital (CoC) methodology is calculated as follows (CEIOPS 2009):
where is the estimated reserve for the accident year , is the risk-free rate for maturity, is the Solvency Capital Requirement for the accident year , and is the cost of capital rate. The solvency capital requirement is the difference between the 99.5% quantile and the 50% quantile of the reserves. The cost of capital is 6% (as Solvency II suggests) and we suppose that the risk-free rate for maturity is for all accident years (the real risk-free rate for maturity is given by EIOPA).
Along with the best estimate (BE), risk margin makes up the technical provisions and ensures that their value is equivalent to the amount that an (re)insurer would be expected to require in order to take over and meet the insurance obligations. Generally, risk margin increases the value of the technical provisions from the BE up to an amount which is equivalent to a theoretical level needed to transfer obligations to another (re)insurer. Risk margin represents what an (re)insurer would have to pay to the market to take on the BE liabilities. When the market takes on your BE liabilities, they will have to set aside capital to cover the SCR. Therefore, holding the SCR incurs a cost. Risk margin represents this cost. Dong et al. (2015) showed how one can provide an accurate estimation of risk margin and hence provision, instead of estimating the mean and then applying a risk margin. Their method is more robust when the data are heavy tailed. Their approach has been used for the univariate quantile regression model and is suitable for a simple line of business (single run-off triangle).
Table 15 presents the calculated risk margins for each individual line of business (Companies A and B), based on the univariate quantile regression model and Table 16 presents the corresponding risk margin for each individual line of business based on our longitudinal model.
Table 15.
Risk Margin based on Individual Quantile Regression.
Table 16.
Risk Margin based on Longitudinal Quantile Regression.
Finally, Table 17 presents the calculated risk margins based on the bootstrap method and the resulting Figure 7 and Figure 8 illustrate: (a) the histograms of total simulated IBNRs; (b) the empirical cumulative distribution functions of IBNRs; (c) the simulated ultimate claim cost against ultimate claim cost; and (d) the latest actual incremental claims cost for the latest available calendar period against latest incremental claims, for Companies A and B, respectively.
Table 17.
Risk Margin based on Bootstrap (Poisson).
Figure 7.
Bootstrap graphs for Company A.
Figure 8.
Bootstrap graphs for Company B.
From histograms and the empirical cumulative distribution functions, we observe that the IBNRs may follow distributions with positive skewness. We also observe that the simulated data may not follow the same trend as the actual data (specifically in Company B). It indicates that the original data might have some trends that are not reflected in the model.
Remark 4.
If the distribution of the reserves were known, then the mean of this distribution would be the BE, i.e., the amount to be paid as compensation to the beneficiaries. Nevertheless, this distribution is not known and for that reason many methodologies are used to estimate the BE such as the bootstrap method. In case of quantile regression models, a specific quantile, which provides estimations close to the mean, would be used to estimate the BE. For that reason, we use the 50% quantile, but a larger one could be used especially for long tail distributions.
7. Concluding Remarks
We propose quantile regression for longitudinal data in the framework of a general multivariate loss reserving model. Our model considers a combination of the between and within lines of business, taking into account the correlations and variation of run-off triangles. We investigated a general insurance portfolio that consists of two correlated subportfolios (two auto run-off triangles). The least squares estimators investigated only changes in the mean, while the quantile regression characterized a particular point of a distribution, which provides a more complete description of the entire shape of the claims distribution. According to Solvency II and IFRS, the solvency capital requirement (SCR) was provided based on the best estimate (BE) and, in the sequel, the overall risk margin (RM), based on the cost-of-capital (CoC) methodology, was calculated.
Author Contributions
I.B., G.P. contributed equally to this work. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Acknowledgments
This work was partly supported by the University of Piraeus Research Center.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Abdallah, Anas, Jean-Philippe Boucher, and Hélène Cossette. 2015. Modeling Dependence between Loss Triangles with Hierarchical Archimedean Copulas. ASTIN Bulletin 45: 577–99. [Google Scholar] [CrossRef]
- Ajne, Björn. 1994. Additivity of chain–ladder projections. ASTIN Bulletin 24: 313–8. [Google Scholar] [CrossRef]
- Antonio, Katrien, and Jan Beirlant. 2006. Actuarial statistics with generalized linear mixed models. Insurance: Mathematics and Economics 75: 643–76. [Google Scholar] [CrossRef]
- Avanzi, Benjamin, Greg Taylor, Phuong Anh Vu, and Bernard Wong. 2016a. Stochastic loss reserving with dependence: A flexible multivariate Tweedie approach. Insurance: Mathematics and Economics 71: 63–78. [Google Scholar] [CrossRef]
- Avanzi, Benjamin, Greg Taylor, and Bernard Wong. 2016b. Correlations between insurance lines of business: An illusion or a real phenomenon? Some methodological considerations. ASTIN Bulletin 46: 225–63. [Google Scholar] [CrossRef]
- Avanzi, Benjamin, Greg Taylor, and Bernard Wong. 2018. Common Shock Models for Claim Arrays. ASTIN Bulletin 48: 1109–36. [Google Scholar] [CrossRef]
- Barnett, Glen, and Ben Zehnwirth. 2000. Best estimates for reserves. Proceedings of the Casualty Actuarial Society 87: 245–321. [Google Scholar]
- Bartlett, M. S. 1951. An inverse matrix adjustment arising in discriminant analysis. Annals of Mathematical Statistics 22: 107–11. [Google Scholar] [CrossRef]
- Bermúdez, Lluis, Antoni Ferri, and Montserrat Guille. 2013. A Correlation Sensitivity Analysis for non-life underwriting risk module SCR. ASTIN Bulletin 43: 21–37. [Google Scholar] [CrossRef]
- Braun, Christian. 2004. The prediction error of the chain ladder method applied to correlated run-off triangles. ASTIN Bulletin 34: 399–423. [Google Scholar] [CrossRef]
- Brown, Bruce Maxwell, and You-Gan Wang. 2005. Standard errors and covariance matrices for smoothed rank estimators. Biometrika 92: 149–58. [Google Scholar] [CrossRef]
- Buchinsky, Moshe. 1998. Recent advances in regression models: A practical guideline for empirical research. The Journal of Human Resources 33: 88–126. [Google Scholar] [CrossRef]
- CEIOPS. 2009. Advice for Level 2 Implementing Measures on Solvency II: Technical Provisions, Article 86 (d). Frankfurt: CEIOPS. [Google Scholar]
- Chan, Jeniffer S. K., S. T. Boris Choy, and Udi E. Makov. 2008. Dynamic and robust models for loss reserves using generalized-t distribution. ASTIN Bulletin 38: 207–30. [Google Scholar] [CrossRef]
- Chen, Li, Lee-Jen Wei, and Michael I. Parzen. 2004. Quantile regression for correlated observations. In Proceedings of the Second Seattle Symposium in Biostatistics. Lecture Notes in Statistics. New York: Springer, vol. 179, pp. 51–69. [Google Scholar]
- Christofides, S. 1990. Regression models based on log-incremental payments. In Claims Reserving Manual 2. London: Institute of Actuaries. [Google Scholar]
- Clark, David R. 2006. Variance and covariance due to inflation. CAS Forum 11: 61–95. [Google Scholar]
- Dal Moro, Eric, and Yuriy Krvavych. 2017. Probability of sufficiency of solvency II reserve risk margins: Practical approximations. ASTIN Bulletin 47: 737–85. [Google Scholar] [CrossRef]
- De Jong, Piet. 2012. Modeling Dependence between Loss Triangles. North American Actuarial Journal 16: 74–86. [Google Scholar] [CrossRef]
- Diggle, Peter J., Patrick J. Heagerty, Kung-Yee Liang, and Scott L. Zeger. 2002. Analysis of Longitudinal Data. New York: Oxford University Press. [Google Scholar]
- Dong, Alice X. D., Jennifer S. K. Chan, and Gareth W. Peters. 2015. Risk margin quantile function via parametric and non-parametric bayesian approaches. ASTIN Bulletin 45: 503–50. [Google Scholar] [CrossRef]
- Fu, Liya, and You-Gan Wang. 2012. Quantile regression for longitudinal data with a working correlation model. Computational Statistics & Data Analysis 56: 2526–38. [Google Scholar]
- Gesmann, M., D. Murphy, Y. Zhang, A. Carrato, M. Wuthrich, F. Concina, and E. Dal Moro. 2018. ChainLadder: Statistical Methods and Models for Claims Reserving in General Insurance. R package version 0.2.9. Available online: https://CRAN.R-project.org/package=ChainLadder (accessed on 27 May 2019).
- Gilchrist, Warren G. 2000. Statistical Modelling with Quantile Functions. London: Chapman & Hall. [Google Scholar]
- Harrison, David A., and Charles L. Hulin. 1989. Investigations of absenteeism: Using event-history models to study the absence-taking process. Journal of Applied Psychology 74: 300–16. [Google Scholar] [CrossRef]
- Holmberg, Randall D. 1994. Correlation and the measurement of loss reserve variability. CAS Forum 1: 247–78. [Google Scholar]
- Hubert, Mia, Tim Verdonck, and Özlem Yorulmaz. 2017. Fast robust SUR with economical and actuarial applications. Statistical Analysis and Data Mining 10: 77–88. [Google Scholar] [CrossRef]
- Hudecová, Šárka, and Michal Pešta. 2013. Modeling dependencies in claims reserving with GEE. Insurance: Mathematics and Economics 53: 786–94. [Google Scholar] [CrossRef]
- Hyndman, Rob J., and Anne B. Koehler. 2006. Another look at measures of forecast accuracy. International Journal of Forecasting 22: 679–88. [Google Scholar] [CrossRef]
- Jung, Sin-Ho. 1996. Quasi-likelihood for median regression models. Journal of the American Statistical Association 91: 251–7. [Google Scholar] [CrossRef]
- Kim, Sungil, and Heeyoung Kim. 2016. A new metric of absolute percentage error for intermittent demand forecasts. International Journal of Forecasting 32: 669–79. [Google Scholar] [CrossRef]
- Koenker, Roger, and Gilbert Basset. 1982. Robust Tests for Heteroscedasticity Based on Regression Quantiles. Econometrica 50: 43–61. [Google Scholar] [CrossRef]
- Koenker, Roger, and Vasco D’Orey. 1994. A remark on algorithm AS229: Computing dual regression quantiles and regression rank scores. Applied Statistics 43: 410–4. [Google Scholar] [CrossRef]
- Koenker, Roger. 2005. Quantile Regression. Cambridge: University Press. [Google Scholar]
- Koenker, Roger. 2018. Quantreg: Quantile Regression. R Package Version 5.36. Available online: https://CRAN.R-project.org/package=quantreg (accessed on 13 December 2019).
- Kremer, Erhard. 2005. The correlated chain-ladder method for reserving in case of correlated claims developments. Blatter DGVFM 27: 315–22. [Google Scholar] [CrossRef]
- Kuang, Di, Bent Nielsen, and Jens Perch Nielsen. 2011. Forecasting in an Extended Chain-Ladder–Type Model. The Journal of Risk and Insurance 78: 345–59. [Google Scholar] [CrossRef]
- Mack, Thomas. 1993. Distribution-free calculation of the standard error of chain-ladder reserve estimates. ASTIN Bulletin 23: 213–25. [Google Scholar] [CrossRef]
- Merz, Michael, and Mario Wüthrich. 2008a. Prediction error of the chain ladder reserving method applied to correlated run off trapezoids. Annals of Actuarial Science 2: 25–50. [Google Scholar] [CrossRef]
- Merz, Michael, and Mario Wüthrich. 2008b. Prediction error of the multivariate chain ladder reserving method. North American Actuarial Journal 12: 175–97. [Google Scholar] [CrossRef]
- Merz, Michael, Mario Wüthrich, and Enkelejd Hashorva. 2012. Dependence modelling in multivariate claims run-off triangles. Annals of Actuarial Science 7: 3–25. [Google Scholar] [CrossRef][Green Version]
- Pang, Lei, Wenbin Lu, and Huixia Judy Wang. 2012. Variance estimation in censored quantile regression via induced smoothing. Computational Statistics and Data Analysis 56: 785–96. [Google Scholar] [CrossRef]
- Parzen, M. I., L. J. Wei, and Z. Ying. 1994. A resampling method based on pivotal estimating functions. Biometrika 81: 341–50. [Google Scholar] [CrossRef]
- Peremans, Kris, Stefan Van Aelst, and Tim Verdonck. 2018. A robust general multivariate chain ladder method. Risks 6: 108. [Google Scholar] [CrossRef]
- Pešta, Michal, and Ostap Okhrin. 2014. Conditional least squares and copulae in claims reserving for a single line of business. Insurance: Mathematics and Economics 56: 28–37. [Google Scholar] [CrossRef]
- Pitt, David G. W. 2006. Regression quantile analysis of claim termination rates for income protection insurance. Annals of Actuarial Science 1: 345–57. [Google Scholar] [CrossRef][Green Version]
- Pontius, Robert Gilmore, Olufunmilayo Thontteh, and Hao Chen. 2008. Components of information for multiple resolution comparison between maps that share a real variable. Environmental Ecological Statistics 15: 111–42. [Google Scholar] [CrossRef]
- Pröhl, Carsten, and Klaus D. Schmidt. 2005. Multivariate Chain-Ladder. Dresdner Schriften zur Versicherungsmathematik. Available online: https://www.math.tu-dresden.de/sto/schmidt/dsvm/dsvm2005-3.pdf (accessed on 13 December 2019).
- Quarg, Gerhard, and Thomas Mack. 2004. Munich chain ladder. Blatter DGVFM 4: 597–630. [Google Scholar] [CrossRef]
- Radtke, Michael, Klaus D. Schmidt, and Anja Schnaus. 2012. Handbook on Loss Reserving. EAA Lecture Notes. European Actuarial Academy. Berlin: Springer. [Google Scholar]
- Schmidt, Klaus D. 2006. Optimal and additive loss reserving for dependent lines of business. Paper presented at 2006 CAS Casualty Loss Reserve Seminar, Atlanta, GA, USA, September 11–12; pp. 319–51. [Google Scholar]
- Searle, Shayle R., George Casella, and Charles E. McCulloch. 1992. Variance Components. New York: John Wiley and Sons. [Google Scholar]
- Shi, Peng, and Edward Frees. 2011. Dependent loss reserving using copulas. ASTIN Bulletin 41: 449–86. [Google Scholar]
- Shi, Peng, Sanjib Basu, and Glenn G. Meyers. 2012. A Bayesian log-normal model for multivariate loss reserving. North American Actuarial Journal 16: 29–51. [Google Scholar] [CrossRef]
- Stoner, Julie A., and Brian G. Leroux. 2002. Analysis of clustered data: A combined estimating equations approach. Biometrika 89: 567–78. [Google Scholar] [CrossRef]
- Taylor, Greg, and Grainne Mcguire. 2005. Synchronous bootstrapping of seemingly unrelated regressions. Paper presented at the 36th International ASTIN Colloquium, ETH, Zurich, September 4–7. [Google Scholar]
- Taylor, Greg, and Grainne Mcguire. 2007. A synchronous bootstrap to account for dependencies between lines of business in the estimation of loss reserve prediction error. North American Actuarial Journal 11: 70–88. [Google Scholar] [CrossRef]
- Taylor, Gregory. 2000. Loss Reserving—An Actuarial Perspective. Norwell: Kluwer Academic Publishers. [Google Scholar]
- Wang, You-Gan, Quanxi Shao, and Min Zhu. 2009. Quantile regression without the curse of unsmoothness. Computational Statistics and Data Analysis 53: 3696–705. [Google Scholar] [CrossRef]
- Willmott, Cort, and Kenji Matsuura. 2006. On the use of dimensioned measures of error to evaluate the performance of spatial interpolators. International Journal of Geographic Information Science 20: 89–102. [Google Scholar] [CrossRef]
- Wüthrich, Mario, and Michael Merz. 2008. Stochastic Claims Reserving Methods in Insurance. Hoboken: Wiley Finance. [Google Scholar]
- Wüthrich, Mario V. 2010. Accounting Year Effects Modeling in the Stochastic Chain Ladder Reserving Method. North American Actuarial Journal 14: 235–55. [Google Scholar] [CrossRef]
- Yin, Guosheng, and Jianwen Cai. 2005. Quantile regression models with multivariate failure time data. Biometrics 61: 151–61. [Google Scholar] [CrossRef]
- Zehnwirth, Ben, and Glen Barnett. 2001. Reserving for multiple excess layers. Paper presented at the ASTIN Colloquium at the ASTIN Colloquium, Washington, DC, USA, July 8–11. [Google Scholar]
- Zhang, Yanwei, Vanja Dukic, and James Guszcza. 2012. A Bayesian non-linear model for forecasting insurance loss payments. Journal Royal Statistical Society A 175: 637–56. [Google Scholar] [CrossRef]
- Zhang, Yanwei. 2010. A general multivariate chain ladder model. Insurance: Mathematics and Economics 46: 588–99. [Google Scholar] [CrossRef]
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).