Next Article in Journal
Predicting High-Cost Healthcare Utilization Using Machine Learning: A Multi-Service Risk Stratification Analysis in EU-Based Private Group Health Insurance
Previous Article in Journal
AI Risk Management: A Bibliometric Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Mack Chain Ladder and Data Granularity for Preserved Development Periods

by
Greg Taylor
School of Risk and Actuarial Studies, University of New South Wales, Randwick, NSW 2052, Australia
Risks 2025, 13(7), 132; https://doi.org/10.3390/risks13070132
Submission received: 4 April 2025 / Revised: 11 June 2025 / Accepted: 13 June 2025 / Published: 7 July 2025

Abstract

This paper is concerned with the choice of data granularity for the application of the Mack chain ladder model to forecast a loss reserve. It is a sequel to a related paper by Taylor, which considers the same question for the EDF chain ladder model. As in the earlier paper, it considers the question as to whether a decrease in the time unit leads to an increase or decrease in the variance of the loss reserve estimate. The question of whether a Mack chain ladder that is valid for one time unit (here called mesh size) remains so for another is investigated. The conditions under which the model does remain valid are established. There are various ways in which the mesh size of a data triangle may be varied, two of them of particular interest. The paper examines one of these, namely that in which development periods are preserved. Two versions of this are investigated: 1. the aggregation of development periods without change to accident periods; 2. the aggregation of accident periods without change to development periods. Taylor found that, in the case of the Poisson chain ladder, an increase in mesh size always increases the variance of the loss reserve estimate (subject to mild technical conditions). The case of the Mack chain ladder is more nuanced in that an increase in variance is not always guaranteed. Whether or not an increase or decrease occurs depends on the numerical values of certain of the age-to-age factors actually observed. The threshold values of the age-to-age factors at which an increase transitions to a decrease in variance are calculated. In the case of a change in the mesh of development periods, but with no change to accident periods, these values are computed for one particular data set, where it is found that variance always increases. It is conjectured that data sets in which this does not happen would be relatively rare. The situation is somewhat different when changes in mesh size over accident periods are considered. Here, the question of an increase or decrease in variance is more complex, and, in general terms, the occurrence of an increase in variance with increased mesh size is less likely.

1. Introduction

1.1. Background

The chain ladder is a widely used model for insurance loss reserving (Mack 1993; Taylor 1985, 2000; Wüthrich and Merz 2008). The data set to which it is applied is typically triangular, with rows labelled by accident period and columns by development period.
Accident and development periods are commonly years, but other units of time are possible, e.g., quarters, months, and weeks. As the time unit is decreased, the number of data points increases, but the volatility of each point increases. This leads to a question as to whether a decrease in the time unit leads to an increase or decrease in the variance of the loss reserve estimate. A natural companion to this question is whether there is an optimal time unit at which the variance of the forecast loss reserve is minimized.
A parallel question on the effects of mesh size on modelling and forecasting arises in a wide range of scientific areas. Taylor (2025) cites references.
The prior actuarial literature contains little comment on these matters, but Taylor (2025) investigated the question in relation to the EDF chain ladder (Wüthrich and Merz 2008; Taylor 2009), where EDF refers to the exponential dispersion family, and particularly the Poisson chain ladder. The conclusion in this last case was subject to a number of conditions, but, in broad terms, it was that a choice of the most granular data possible (i.e., a choice of accident and development periods of short duration) would minimize the variance of the forecast loss reserve.

1.2. Purpose of the Paper

Taylor (2025) noted that there were two distinct formulations of the chain ladder model, namely the EDF chain ladder referred to above and the Mack chain ladder (Mack 1993). Whereas Taylor investigated the former, the present paper considers similar questions of data granularity in relation to the Mack chain ladder.
The present paper thus discusses the influence of data granularity on the variance of the loss reserve forecast by the Mack chain ladder. Under suitable conditions, it is possible to identify the granularity that minimizes this variance.
In Taylor (2025), a distribution from the EDF was assigned to each observation. This enabled an appeal to sufficient statistics in the search for minimum variance unbiased estimators (“MVUEs”), and these statistics carried much of the theoretical load in that paper. By contrast, the Mack model is distribution-free, so sufficient statistics cannot be defined. The identification of MVUEs must proceed by different means.

1.3. Layout of the Paper

The paper considers changes in mesh size on the triangular data set, which is to say changes in the units of time used for accident and development periods. The effect of an increase in mesh size (i.e., an increase in the amount of time spanned by one of these periods) on the variance of the estimated loss reserve is investigated.
There are various ways in which the mesh size of a data set can be varied. Some are more sensible than others, and two fundamental types of variation were identified in Taylor (2025), specifically those that preserve
  • Calendar periods;
  • Development periods.
These types of variation are described in Section 2, where a specific notation is developed for each type of variation to allow translation between the original data set and that resulting from the change in mesh size.
Both types of variation are deserving of study in the context of the Mack chain ladder. However, considerations of space have limited the present paper to just one, namely changes in mesh size that preserve development periods. Those that preserve calendar periods may be explored in a separate paper.
Section 3 reviews the Mack chain ladder model, including its forecast of loss reserve, with an emphasis on the variance of the quantity under forecast.
Section 4 examines the effect of changed mesh size on the variance of the loss reserve when the change preserves development periods. First, the conditions are established under which the Mack chain ladder continues to be a valid model under changes in mesh (Section 4.1). In the cases where the Mack chain ladder remains a valid model, the effect of the changed mesh on the variance of the estimated loss reserve is then calculated.
Two cases are considered:
  • That in which the development period mesh is changed (Section 4.2.1);
  • That in which the accident period mesh is changed (Section 4.2.2).

2. Notation and Mathematical Preliminaries

The notational and mathematical apparatus required here is much the same as in Taylor (2025), and the majority of it is taken from that source.

2.1. Fundamentals

Let i = 1,2 , , I denote the accident period and j = 1,2 , , I the development period. Taylor (2025) denoted the range of j as 0,1 , , I 1 but the present notation will be more natural later when development periods are merged and a notation is required for this.
Let Y i j , with mean μ i j , denote the random variable representing the amount of claim payments during development period j of accident period i . These quantities are usually referred to as incremental claim payments. For the moment, all accident and development periods are of equal duration, though this will change later.
A calendar period consists of all those combinations of i and j such that i + j is constant. For definiteness, label the calendar t if i + j 1 = t .
It will be assumed that a standpoint is taken at the end of calendar period I and one is in possession of certain data from calendar periods 1,2 , , I . Specifically, these data comprise the claim triangle  D U = Y i j : i = 1 , , I , j = 1 , , I i + 1 . This will be referred to as the upper triangle. One wishes to use these data to forecast the lower triangle  D L = Y i j : i = 2 , , I , j = I i + 2 , , I . Let D = D U D L .
In a loss-reserving context, a realization of the upper triangle will have been observed. This will be denoted d U , which is the same as D U with each random variable Y i j replaced by its realization y i j .
It will also be convenient to define cumulative claim payments. Thus, define
C i j =   k = 1 j Y i k   ,
which is the random variable representing the amount of cumulative claim payments up to the end of development period j of accident period i . Further, let c i j be defined in the same way in terms of the y i k so that c i j is a realization of C i j .
It is assumed that there is no claim activity beyond development period I , in which case C i I is the ultimate claim cost of accident period i .
The objective of the loss-reserving exercise is to forecast unobserved lower triangle. This will be denoted D ^ L , which is the same as D L with each random variable Y i j replaced by its forecast Y ^ i j .
The upper and lower triangles have been defined in terms of incremental claim payments. It will sometimes be convenient to define them in terms of cumulative claim payments, e.g., D U = C i j : i = 1 , , I , j = 0 , , I i + 1 . As there is an equivalence relation between the two forms of triangle, the terms upper and lower triangle will be used, with a slight abuse of terminology, to refer to either form, provided that the context removes any ambiguity.
Let R i denote row i of the incremental d U , i.e., R i = y i j : j = 1 , , I i + 1 , and let r j denote the sum of this row:
r j = R i y i j =   c i , I i + 1   .
Further, let C j denote column j of the cumulative d U , i.e., C j = c i j : i = 1 , , I j + 1 , and let s j denote the sum of this column:
s j = C j c i j =   i = 1 I j + 1 c i j   .
It will also be convenient to define
s j =   i = 1 I j c i j   .
As noted above, C i I is the ultimate claim cost of accident period i , and so C ^ i I is an estimate of this quantity. The amount of outstanding losses (the loss reserve) for this accident period is equal to
L i = C i I C i , I i + 1 ,
and is estimated by
L ^ i = C ^ i I c i , I i + 1 .
The total reserve for all accident periods of interest is
L = i = 2 I L i ,
which is estimated by
L ^ = i = 2 I L ^ i .
For a generic random variable X , the notation X   ~   μ , σ 2 will mean that X has mean and variance μ , σ 2 with no specified distribution.

2.2. Mesh Size

2.2.1. Preservation of Calendar Periods

Section 2.1 is phrased in terms of accident, developmentm and calendar periods, without any specification of the meaning of “period”. Often, in the literature, this unit is a year. But it need not be; it might be a quarter, month, week, or any other convenient length of time. This will be referred to as the mesh size of the data triangle.
Consider how the mesh size might be changed. Suppose that, in D U , I = N q for some strictly positive integers N , q . The mesh size can be changed from one unit of time to q units. There will then be N accident and development periods, instead of the original I .
An example would be the case in which the units of time in D U are quarters and I = 40 , N = 10 , q = 4 . Here, the mesh size is changed from quarters to years, and D U contains 10 accident and development years.
In the case of general N and q , the change of mesh will induce a new upper triangle D U in which row i will be obtained from the merger of rows q i 1 + 1 , q i 1 + 2 , , q i from D U .
Now suppose, in addition, that the change of mesh is required to preserve calendar periods. By this is meant that calendar period t in D U will comprise calendar periods q t 1 + 1 , q t 1 + 2 , , q t from D U .
For given i and t , development period j = t i + 1 . Then the i , j cell of D U will consist of all pairs of q i 1 + r , q t 1 + s q i 1 + 1 + r , r , s = 1 , , q from D U such that the second member of the pair is non-negative. Equivalently, it consists of all pairs q i 1 + r , m a x 0 , q j 1 + s + 1 r , r , s = 1 , , q . Let Y i j , C i j be defined in the same way as Y i j , C i j but for the i , j cell of D U .
This is the form of aggregation from unit to q -unit periods commonly used in commercial practice. It is illustrated in Figure 1, in which a 40 × 40 quarterly triangle is collapsed to a 10 × 10 yearly triangle. The upper triangle is shaded yellow, and the lower one is shaded green. Calendar years are delineated by the red diagonals. Development years 5 and 6 are shaded blue and purple, respectively. Development year 1 is also shown in orange to illustrate its exceptional nature.
It will also be useful to define the following quantities in parallel with s j and s j :
s j =   i = 1 N j + 1 c i j   ,
s j =   i = 1 N j c i j   .

2.2.2. Preservation of Development Periods

It is seen in Figure 1 that the quarterly development periods that contribute to a specific yearly development period differ by accident quarter. Although this is standard commercial practice, one might regard this as sometimes undesirable.
An alternative is as follows. Define merged accident periods exactly as in Section 2.2.1, but define merged development periods in such a way that, for all accident periods, development period j always consists of the same development periods from D U .
In this case, there is no requirement that accident and development periods be subject to the same mesh size. There is not even a requirement that all development periods be of the same duration.
To develop a notation for changed mesh size, we commence with the ordered set 0 , , I . This is the set labels for (unmerged) development periods, supplemented by a zero. The reason for this is that the development period label j will be taken to relate to the end of that period, and so the zero denotes the point of commencement of development period 1. Then 0 , , I will span the entire duration of the original I development periods.
Now introduce J + 1 integer cut-points ω j , j = 0 , , J to partition the set 0 , , I with ω 0 = 0 , ω J = I . The segments of this partition are merged development periods. There are J of these, and the j -th is the union of the unmerged development periods ω j 1 + 1 , , ω j , j = 1 , , J and is of duration ω j ω j 1 time periods.
A simple example would be similar to the one introduced in Section 2.2.1, in which the original development periods are quarterly, and the mesh size is changed from quarterly to annual. In this case, with I = 40 , one sets ω j = 4 j , and the merged (annual) development period j comprises unmerged development periods 4 j 1 + r , r = 1,2 , 3,4 .
A more general situation is illustrated in Figure 2, where just three cut-points have been inserted, namely ω 1 = 16 , ω 2 = 20 , ω 3 = 24 . This means that development quarters 17 to 20 have been merged into a single development year, here shaded blue, and similarly development quarters 21 to 24, shaded purple. The other development quarters have been left intact. Note that, for each of these development years, there is a triangle of data relating to the latest accident periods. These have not been shaded as they are incomplete and not comparable with the merged development years for earlier accident periods.
The quantities ω j provide a mapping between the original development periods j and the development periods under the increased mesh size. This is illustrated in Figure 3, which displays the mapping that applies to Figure 2.
In general, let quantities associated with merged development periods be denoted as starred versions of the analogous quantities associated with unmerged development periods. For example, whereas Y i j denotes the claim payments in unmerged development period j of accident period i , Y i , k : l will denote the claim payments in merged development period k : l of the same accident period, i.e., in development periods k + 1 , , l , equivalently, from the end of development period k to the end of development period l .
Greater brevity is possible in denoting cumulative claim payments. Whereas C i j denotes the cumulative claim payments to the end of unmerged development period j of accident period i , C i j will denote same quantity, provided that a merged development period ends in the unmerged development period j .
Sometimes it will be useful to notate quantities on the merged time scale. For example, C i j will denote cumulative claim payments to the end of the merged development period j . Evidently, C i j = C i , ω j .
With this adjustment of mesh size, the data set D U to which a chain ladder will be applied comprises all cells of D U that form complete development periods under the enlarged mesh. In Figure 2, these are the blue and purple cells for (original) development periods 16 to 23 and the yellow cells for other development periods.
In the general case described earlier in this sub-section,
D U = Y i , j : k : i = 1 , , I , j   s u c h   t h a t   j = ω j 1 , k = ω j I i + 1   f o r   s o m e   j ,
where
Y i , j : k = l = ω j 1 + 1 ω j Y i l   .
It is noteworthy that not all rows of D U extend as far as the I -th diagonal of D U . That is to say, the re-constituted data set does not contain all of the most recently available data. It will be useful to construct an indicator of the most recent usable (original) cell of each row. This will be the last cell of the row up to and including the I -th diagonal that falls within a complete development period under the enlarged mesh.
Define
j ~ i = m a x j : ω j I i + 1 ,
j ~ i = ω j ~ i   ,
which is to say that j ~ i is the last complete merged development period in row i , and j ~ i is the last unmerged development period contained in j ~ i .
Similarly, define the maximum row containing the complete development period j as
i ~ j = m a x i : ω j I i + 1 .
It will also be useful to define the following quantities, in parallel with s j and s j ,
s j =   i = 1 i ~ j c i j   ,
s j =   i = 1 i ~ j + 1 c i j   .

3. Mack Chain Ladder

3.1. Model Assumptions

Assumption 1
(Mack assumption (1)).  E C i , j + 1 C i 1 , , C i j = C i j f j  for all  C i j D  with  j < I .
The factor f j > 0 is usually referred to as an age-to-age factor or a link ratio.
Assumption 2
(Mack assumption (2)). Different accident periods are stochastically independent; i.e.,  C i 1 , , C i I , C k 1 , , C k I  are independent for  i k .
Assumption 3
(Mack assumption (3)).  V a r C i , j + 1 C i 1 , , C i j = C i j σ j 2  for all  C i j D  with  j < I .
This is the totality of Mack’s assumptions, but it will be convenient to add a few further very mild assumptions here.
Assumption 4.
σ j 2 > 0 , j = 1 , , I . This asserts only that no observations are deterministic.
Assumption 5.
c i , I i + 1 > 0 , i = 1 , , I . The purpose here is to avoid a non-positive forecast of future claims (see (21) below).
Assumption 6.
s j 0 , j = 1 , , I 1 . The purpose here is to ensure that the chain ladder algorithm is well-defined (see (18) below). It seems that this assumption is implicit in Mack’s paper.

3.2. Estimation of Loss Reserve

The chain ladder proceeds to estimate loss reserve as follows. As given by Mack, the age-to-age factor f j and dispersion factor σ j 2 are estimated by
f ^ j = s j + 1 s j
and
σ ^ j 2 = 1 I j 2 i = 1 I j 1 c i j F i j f ^ j 2   ,
where
F i j = c i , j + 1 c i j   .
The forecasts C ^ i j D ^ L are then calculated as
C ^ i j = c i , I i + 1 f ^ I i + 1 f ^ I i + 2 f ^ j 1 .
As noted in Section 2, C i , I is the ultimate claim cost of accident period i , and so C ^ i , I is an estimate of this quantity. The amount of outstanding losses for this accident period is equal to
R i = C i , I c i , I i + 1 ,
and is estimated by
R ^ i = C ^ i , I c i , I i + 1 .
The total reserve for all accident periods of interest is
R = i = 2 I R i ,
which is estimated by
R ^ = i = 2 I R ^ i .
Mack derives the following results on the basis of the assumptions of Section 3.1.
Proposition 1
(Mack’s Theorem 2). Under Assumptions 1 and 2, the estimators f ^ j , j = 1 , , I 1  are unbiased and uncorrelated. It follows that  R ^ i  is an unbiased estimator of  R i .

3.3. Variance of Forecast

It will be convenient to introduce the notation
F i , l : m = F i l F i , l + 1 F i , m 1 ,
F l : m = f l f l + 1 f m 1 ,
F ^ l : m = f ^ l f ^ l + 1 f ^ m 1 ,
subject to the convention that F i , l : m = F l : m = F ^ l : m = 1 for l m .
Remark 1.
It will be found useful later to remark that  F l : m > 0 , which follows from the fact that  f j > 0  in Assumption 1.
The variance of the estimated reserve R ^ i is of interest, specifically V a r R ^ i | D U = V a r C ^ i I c i , I i + 1 | D U = V a r C ^ i I | D U = V a r c i , I i + 1 F ^ i | D U , where F ^ i is the further abbreviation F ^ i = F ^ I i + 1 : I .
It will be convenient to write
V a r R ^ i k | D U = C i I k 2 V i k ,
where R ^ i k denotes the estimate of R i with C i I forecast by C ^ i I k on the basis of data to the end of the development period k I i + 1 , i.e.,
R ^ i k = C ^ i I k c i , I i + 1 = c i k F ^ k : I c i , I i + 1 ,
and where
C i I k = c i k F k : I ,
and the addition of a hat has the usual meaning.
Accident period i has developed as far as development period I i + 1 , and so the estimate of loss reserve that maximizes the use of data is the one appearing in (23), which may be expressed as R ^ i I i + 1 , and its variance, from (29), is
V a r R ^ i I i + 1 | D U = C i I I i + 1 2 V i I i + 1 .
This (or, more precisely, an estimate of it) was calculated by Mack (1993, p. 217) and also by Gisler (2019, p. 806). Wüthrich and Merz (2008, pp. 53–54) reproduce the same quantity, though they go on to suggest alternative results based on varying bootstrap re-sampling strategies.
The following reconciles with Mack’s result in the case k = I i + 1 , and extends it to general k . The extension follows straightforwardly from the algebra in Mack’s paper dealing with the special case.
V i k = j = k I 1 σ j 2 f j 2 1 c i , k F k : j + 1 s j ,
which may be conveniently abbreviated as follows:
V i k = j = k I 1 B i j k + σ j 2 f j 2 1 s j ,
where
B i j k = σ j 2 f j 2 1 c i k F k : j   .
There is a recurrence relation that relates values of V i k for consecutive evaluation points k and is useful for the computation of these quantities. It is proven in Appendix A.1, and set out in the following.
Proposition 2.
With  V i k  defined by (33), the following recurrence relation holds.
V i k 1 = B i , k 1 , k 1 + σ k 1 2 f k 1 2 1 s k 1 + 1 γ i , k 1 j = k I 1 σ j 2 f j 2 1 s j + γ i , k 1 V i k ,
where
γ i k = F i k f k   .
The ratio γ i , k 1 appearing here plays a crucial role in the following and is of interest. The numerator is the observed development of the accident period i from the end of development period k 1 to k , and the denominator is its expected value. So, for example, γ i , k 1 > 1 indicates that development over that period exceeds expected.
It is sometimes useful to introduce the parameter ϕ j = σ j 2 / f j . Then, (34) takes the form
V i k = j = k I 1 ϕ j B ~ i j k + 1 f j 1 s j ,
where
B ~ i j k = 1 f j 1 c i k F k : j = 1 c i k F k : j + 1   .
An interesting case of (33) occurs when the ratio ϕ j is independent of j , which includes the case where all cells of D are subject to over-dispersed Poisson distributions with common scale parameter. The value of V i is then given by the following result.
Proposition 3.
Consider the case in which the following assumption holds in addition to Assumptions 1 to 6.
Assumption 7.
σ j 2 / f j = ϕ ,  const. for all  C i j D .
Then (38) reduces to
V i k = ϕ j = k I 1 B ~ i j k + 1 f j 1 s j ,
Now consider the total reserve R ^ , with each accident period i evaluated at the end of calendar period I , i.e., development period I i + 1 and R ^ i = R ^ i I i + 1 . By Assumption 2, (25) and (29) yield
V a r R ^ | D U = V a r i = 2 I R ^ i I i + 1 | D U        = i = 2 I C i I I i + 1 2 V i I i + 1 + 2 i , k = 2 i < k I c i , I i + 1 c k , I k + 1 C o v F ^ i , F ^ k | D U .
where, in the last summation, the summand draws on (21).
This may be simplified by taking account of Proposition 1. The covariance term becomes (remembering that i < k )
C o v F ^ i , F ^ k | D U = C o v F ^ i , F ^ k , I k + 1 : I i + 1 F ^ i | D U = E F ^ k , I k + 1 : I i + 1 V a r F ^ i | D U         = F I k + 1 : I i + 1 C i I I i + 1 c i , I i + 1 2 V i I i + 1   ,
where the second equality follows from Proposition 1 and the last equality from (29) and (30). With substitution of (42), result (41) becomes
V a r R ^ | D U = i = 2 I C i I I i + 1 2 V i I i + 1        + 2 i , k = 2 i < k I c i , I i + 1 c k , I k + 1 F k , I k + 1 : I i + 1 C i I I i + 1 c i , I i + 1 2 V i I i + 1        = i = 2 I C i I I i + 1 2 V i I i + 1 1 + 2 k = i + 1 I c k , I k + 1 c i , I i + 1 F k , I k + 1 : I i + 1        = i = 2 I C i I I i + 1 2 V i I i + 1 k = 2 I C k , I i + 1 c i , I i + 1 .

4. Effect of Change of Mesh Size Under the Preservation of Development Periods

Section 2.2 discussed two types of change of mesh size, preserving, respectively, calendar and development periods. These two types of change carry very different implications for chain ladder models. The effects of both types of change on the EDF chain ladder were discussed by Taylor (2025).
It is also necessary to consider the effects of both on the Mack chain ladder. This, however, would expand the present paper beyond a reasonable size. For this reason, the following sections consider only changes in mesh that preserve development periods. The preservation of calendar periods may be considered in a separate paper.
A forerunner to the examination of the effects of change of mesh size under the preservation of development periods is a consideration of whether or not the Mack chain ladder structure is maintained under such changes. This will be the subject of Section 4.1.

4.1. Maintenance of Model Assumptions

4.1.1. Cell Means

Consider the change of mesh described in Section 2.2.2. The notation will be as introduced there. For brevity, just label the merged development periods j = 0 , , J 1 . It is necessary to check whether the Mack chain ladder model is applicable under the changed mesh size, i.e., whether Assumptions 1 to 6 of Section 3.1 continue to hold. The present sub-section examines Assumptions 1, 2, 5 and 6; the next examines Assumptions 3 and 4.
  • Discussion of Assumption 1
Consider a generic merged development period j + 1 ω j : ω j + 1 . According to the notation of Section 2.2.2,
E C i j + 1 | C i 1 , , C i j = k = ω j + 1 ω j + 1 E C i k | C i 1 , , C i j   ,
Now, for k ω j ,
E C i k | C i 1 , , C i j = E C i , k 1 f k 1 | C i 1 , , C i j        = E C i , k 2 f k 2 f k 1 | C i 1 , , C i j = = C i j F ω j : k ,
By the repeated application of Assumption 1.
Substitution of (45) into (44) then yields
E C i j + 1 | C i 1 , , C i j = C i j k = ω j + 1 ω j + 1 F ω j : k   ,
which is of the form required by Assumption 1, and so this form of assumption continues to hold for the enlarged mesh size.
  • Discussion of Assumption 2
Note that C i ω 1 , , C i ω J is a subset of C i 1 , , C i I . It then follows from Assumption 2 that C i ω 1 , , C i ω J , C i k , , C k ω J are independent for i k . This is of the same form as Assumption 2, and so this form of assumption continues to hold for the enlarged mesh size.
  • Discussion of Assumption 5
With the change of mesh size, this assumption requires replacement by the following.
Assumption 5*.
c i , j ~ i > 0 , i = 1 , , I .
  • Discussion of Assumption 6
With the change of mesh size, this assumption requires replacement by the following.
Assumption 6*.
s j 0 , j = 1 , , J 1 .

4.1.2. Cell Variances

  • Discussion of Assumption 3
Mack (1993, Theorem 3) evaluates variances of the form V a r C i I | C i 1 , , C i j , and the calculation given there is easily adapted to the case V a r C i k | C i 1 , , C i j for j < k I . The result is
V a r C i k | C i 1 , , C i j = C i j l = j k 1 f j f l 1 σ l 2 f l + 1 2 f k 1 2 .
Simple adaptation to the present situation yields
V a r C i j + 1 + 1 | C i 0 , , C i j = C i j l = ω j ω j + 1 1 f ω j f l 1 σ l 2 f l + 1 2 f ω j + 1 1 2   .
This can be expressed in the form
V a r C i j + 1 | C i 0 , , C i j = C i j σ j 2
where
σ j 2 = l = ω j ω j + 1 1 f ω j f l 1 σ l 2 f l + 1 2 f ω j + 1 1 2   .
The relation (49) is of the form required by Assumption 3, which therefore continues to hold for the enlarged mesh size.
  • Discussion of Assumption 4
By Assumptions 1 and 3, f j , σ j 2 > 0 , and it immediately follows from (50) that σ j 2 > 0 , as required by Assumption 4.
The reasoning of Section 4.1.1 and the present sub-section leads to the following result.
Proposition 4.
Consider the data array  D , and suppose it is subject to a Mack chain ladder model in the sense of satisfying Assumptions 1 to 6 of Section 3.1. Now impose the change of mesh size described in Section 2.2.2. This induces new data sets D U , D L  and  D = D U D L . If Assumptions 5 and 6, are replaced by 5* and 6*, then  D  will also be subject to a Mack chain ladder model.

4.1.3. Estimation of Loss Reserve

The estimation proceeds in parallel with Section 3.2. In place of (18) to (20), one writes
f ^ j = s j + 1 s j   ,   j = 1 , ,   J 1 ,
and
σ ^ j 2 = 1 i ~ j 2 i = 1 i ~ j 1 c i j F i j f ^ j 2   ,   j = 1 , ,   J 2 .
where
F i j = c i j + 1 c i j   .
The forecasts C ^ i k are calculated as
C ^ i k = c i j ~ i f ^ j ~ i f ^ j ~ i + 1 f ^ k 1 , k = j ~ i + 1 , j ~ i + 2 , , J .
With an obvious notation parallel to (26)–(28), this last forecast may be abbreviated as follows:
C ^ i k = c i j ~ i F ^ j ~ i : k , k = j ~ i + 1 , j ~ i + 2 , , J .
The forecast ultimate claim cost of accident period i is then
C ^ i I = C ^ i J = c i j ~ i F ^ j ~ i : J .
Corresponding to (23) is
R ^ i = C ^ i J c i , I i + 1 ,
and (25) follows as previously. Note here that the value of c i , I i + 1 is known (it is the total amount of claims paid to the valuation date in respect of accident period i ) though it is not usable in the estimation of an age-to-age factor.
The forecasts in (54) are based on the data point c i j ~ i , and, as noted in Section 2.2.2, this does not always coincide with the most recent data point c i , I i + 1 under the original mesh size. Definition (13) shows that, when it does not, ω j < I i + 1 , meaning that development period j ~ i does not include data from the I -th diagonal of D U . The consequence of this is as follows.
Proposition 5.
When an increase in mesh size converts data triangle  D U  to  D U , and a chain ladder model is applied to  D U  in accordance with Section 4.1.1, the forecasts (54) for some accident periods will rely on earlier data than was available for the same accident periods in  D U .
Nevertheless, the following can be said of this model’s forecasts, in parallel with Proposition 1. The reasoning is the same as there.
Proposition 6.
Under Assumptions 1 and 2, the estimators  f ^ j , j = 1 , ,   J 1  are unbiased and uncorrelated. It follows that  R ^ i  is an unbiased estimator of  R i .

4.2. Forecast Variance

4.2.1. Variation of Mesh over Development Periods

It will be convenient to commence with a consideration of the case in which an enlargement of mesh size is effected by the coalescence of just two consecutive development periods, say j and j + 1 . In the notation of Section 2.2.2, 1 , , I is partitioned into the I 1 subsets by cut-points ω j = j for j = 0,1 , ,   j 1 and ω j = j + 1 for j =   j , , I 1 .
In this case, (13) and (15) yield
j ~ i = I i + 1 , i > I j + 1 = I i , i I j + 1
and
i ~ j = I j + 1 , j < j = I j , j j .
The variance of loss reserve estimate in (29) and (33) is now replaced by the following.
V a r R ^ i k | D U = C i I k 2 V i k .
where V i k is yet to be determined and is the quantity corresponding to V i k in (33) when development periods j and j + 1 are merged, as above.
To calculate how V i differs from V i , it is necessary to consider three cases as follows.
Case I:  i < I j + 1 . In this case I i + 1 > j , which means that past development includes the two merged development periods j , j + 1 , and so accident period i will in the future pass through only development periods with “normal” age-to-age factors. It also follows from (58) that j ~ i = I i , which corresponds with j = I i + 1 .
Case II:  i > I j + 1 . In this case I i + 1 < j , which means that past development has not yet reached development period j , the first of the two to be merged, and so the accident period i will in the future pass through both development periods j , j + 1 . It also follows from (58) that j ~ i = I i + 1 , w h i c h   c o r r e s p o n d s   w i t h   j = I i + 1 .
Case III:  i = I j + 1 . In this case I i + 1 = j , which means that past development has passed through the first of the “abnormal” development periods, j , but not the second, j +1. It may be noted that the merged cell I j + 1 , j is incomplete, since the unmerged cell I j + 1 , j + 1 lies in the future. It also follows from (58) that j ~ i = I i , which corresponds with j = I i .
It is evident from the description of Case I that, the data point c i , I i + 1 is available in row i , and the variance of the estimated loss reserve may be calculated in the “normal” way. Specifically, (60) may be applied with k = I i :
V a r R ^ i I i | D U = C i I I i 2 V i I i = C i I I i + 1 2 V i I i + 1 ,
with the last two terms on the right obtained from (60) and (61).
Case II may be dealt with similarly, and again (61) arises. The fact that accident period i is still to pass through the two merged development periods j , j + 1 has no effect on the stochastic behavior of the variance.
Case III is a little different. Here (61) is replaced by
V a r R ^ i I i | D U = C i I I i 2 V i I i = C i I I i 2 V i I i = C I j + 1 , I j 1 2 V I j + 1 j 1 .
In this case, the data point c I j + 1 , j must be forfeited as it forms part of an incomplete merged cell, and so the valuation standpoint must be taken at the end of the development period I i instead of the usual I i + 1 .
The following proposition summarizes the situation.
Proposition 7.
With  V i k  defined by (34), the relevant variances of loss reserve estimates are the following:
  • For  i < I j + 1 , V a r R ^ i I i | D U = C i I I i + 1 2 V i I i + 1   ,
  • For  i > I j + 1 , V a r R ^ i I i + 1 | D U = C i I I i + 1 2 V i I i + 1   ,
  • For  i = I j + 1 , V a r R ^ i I i | D U = C I j + 1 , I j 1 2 V I j + 1 j 1   .
It is evident from this result and (29) that the variance of estimated loss reserve is unchanged by the merger of the two development periods for all i I j + 1 . However, this equality does not hold for i = I j + 1 , and the question of interest concerns the relative magnitudes of V a r R ^ i I i | D U   a n d   V a r R ^ i I i + 1 | D U .
To study this question, one can write
V a r R ^ i I i | D U V a r R ^ i I i + 1 | D U          = C I j + 1 , I j 1 2 V I j + 1 j 1 C I j + 1 , I j 2 V I j + 1 j .
Define
π i k = j = k I 1 B i j k j = k I 1 B i j k + j = k I 1 σ j 2 f j 2 1 s j = j = k I 1 B i j k V i k   ,
and note that
1 π i k = j = k I 1 σ j 2 f j 2 1 s j j = k I 1 B i j k + j = k I 1 σ j 2 f j 2 1 s j = j = k I 1 σ j 2 f j 2 1 s j V i k   ,
And
V i k j = k I 1 σ j 2 f j 2 1 s j = j = k I 1 B i j k   ,
where (34) has been used to obtain the last three relations.
It is evident from (64) that
0 < π i k < 1   .
The following result, proven in Appendix A.2, establishes the ordering of V a r R ^ i | D U   a n d   V a r R ^ i | D U .
Proposition 8.
For the case  i = I j + 1 , a necessary and sufficient condition that  V a r R ^ i | D U > ( = , < ) V a r R ^ i | D U  is
γ i , k 1 < = , > γ i , k 1 0   ,
where  k = j  and the threshold value  γ i , k 1 0  is given by
γ i , k 1 0 =   ½ π i k + 1 ½ π i k 1 + ω i , k 1 1 ½ π i k 2 ½ .
with
ω i , k 1 = B i , k 1 , k 1 + σ k 1 2 f k 1 2 1 s k 1 V i k   .
Corollary 1.
It follows from (70) that  γ i , k 1 0 > 1 , and so a sufficient condition that  V a r R ^ i | D U > V a r R ^ i | D U  is that  γ i , k 1 < 1 ,   i . e .   F i , k 1 < f k 1 , where  i , k  take the same values as in Proposition 8.
The main conclusion to be drawn from Proposition 8 is that V a r R ^ i | D U does not always exceed V a r R ^ i | D U . That is to say, the merger of development periods (or decreasing granularity) does not always result in increased predictive variance. This result differs from the corresponding one for the EDF chain ladder in Taylor (2025).
The difference arises from the unique feature of the Mack chain ladder according to which the variance of predicted loss reserve is conditioned by, and in fact is proportional to, the squared amount of claims paid to the date of valuation (see (32)). It is seen above that when development periods j and j + 1 are merged, the estimated variance of forecast loss reserve accident for accident period i = I j + 1 becomes proportional to c i , I i instead of c i , I i + 1 .
If c i , I i is small relative to c i , I i + 1 , then it can turn out that V a r R ^ i | D U < V a r R ^ i | D U . This is precisely what happens when F i , I i = c i , I i + 1 / c i , I i is larger than its expected value f I i by a sufficient margin.
Corollary 1 points out that the threshold value γ i , k 1 0 > 1 , and, to this extent, V a r R ^ i | D U will tend to exceed V a r R ^ i | D U ; i.e., decreasing granularity will tend to lead to increased variance of loss reserve forecast, but not invariably.
A numerical example of these thresholds is now given. It is based on the numerical example from Mack (1993), which in turn uses the data set from Taylor and Ashe (1983). The data set is reproduced in Appendix B.
The parameter estimates of f j and σ j 2 given by Mack just after his Table 1, after mild smoothing, are taken as parameters for the purpose of this example and are displayed in Table 1 below.
The effect of merging two development years on the variance of predicted loss reserve will be studied. For this purpose, values of V i k will be calculated from (34) for i = 2 , , 8 , k = 11 i , and then V i k 1 will be calculated from (36). The standard deviation of the estimated loss reserve is then derived from (32). The results of these calculations are reported in Table 2, which also includes values of C i I k , the estimate ultimate claim cost of accident year i , as estimated at the end of development year k .
The interpretation of this table is as follows. Consider accident year 2. Its forecast ultimate claim cost, on the basis of the full triangle in Table 1, is 5,445,867 with a standard deviation of 79,922 (which is also the standard deviation of the loss reserve). Now suppose that development years 9 and 10 are merged, i.e., j = 9 .
Then, according to Proposition 7, variances of forecast loss reserves are unchanged except in the case of accident year 2, which must be calculated as at the end of development year 8; i.e., the valuation point is changed from end of calendar year 10 to 9. The table then shows that the ultimate cost of the accident year is estimated as 5,212,813, with a standard deviation of 118,419. The standard deviation has increased by 48%.
A similar situation is found for all other mergers of development years. For example, if development years 4 and 5 are merged, i.e., j = 4 , then the affected row of Table 2 will be that relating to accident year 7, and here the increase in variance of forecast reserve is 35%. In fact, it is seen from the table that any merger of two consecutive development years leads to an increase in variance.
Proposition 8 shows that this result will occur only if the observed development factors are not too large. This aspect of the data set is now studied.
Table 3 displays the threshold values γ i , k 1 0   of the age-to-age factors in the numerical example under study, calculated from (69). The intermediate quantities π i k and ω i , k 1 are also shown, as well as the observed value of γ i , k 1 corresponding to the threshold value γ i , k 1 0 .
The table is read as follows. For accident year j = 4 , for example, corresponding to the merger of development years 4 and 5 (see the commentary on Table 2), the loss reserve must be evaluated at the end of development year k = 6 (instead of 7). According to Proposition 8, this will lead to an increased variance of loss reserve unless γ 4,6 > γ 4,6 0 = 1.34 .   I n f a c t , t h e   t a b l e indicates that γ 4,6 = 0.97 (=1.047/1.08, from Table A2).
Indeed, the observed values of γ i , k 1 0   l i e well below the threshold value γ i , k 1 0 for all i in the table. This indicates that variance is increased for any merger of a pair of development years, a result that is consistent with Table 2.
Remark 2.
The above analysis has considered only the merger of development periods j  and  j + 1 . However, the analysis can be repeated for further mergers. Suppose, for example, that the original development periods were months and one wished to form quarterly periods, perhaps involving the merger of  j , j + 1  and  j + 2 .  One could commence with the merger of periods  j  and  j + 1 , as just analyzed. This led to the subset cut-points
ω j = j , j = 0,1 , ,   j 1 j + 1   for   j = j , , I 1 .
One could then merge period  j + 2  with the first two. This would create new cut-points
ω j = j   for   j = 0,1 , ,   j 1 = j + 2   for   j = j , , I 2 .
The above analysis would then apply equally to this merger of two development periods. Two-period mergers could be continued in this way until the desired final set of merged development periods was attained. The conclusions would be as above, specifically that
  • The reduction in granularity increases the variance of the estimated loss reserve unless a condition parallel to (68) is breached;
  • At each merger, the thresholds appearing in (68) can be calculated, and the occurrence or non-occurrence of that breach assessed.

4.2.2. Variation of Mesh over Accident Periods

Section 4.2.1 considered just the merger of development periods into longer periods. It might be desired that accident periods be merged at the same time. As an example, suppose that accident periods i and i + 1 are merged. Consistently with the notation adopted in Section 3, let the new accident periods be denoted by i , where
i = i   for   i = 1,2 , ,   i = i 1   for   i =   i + 2 , , I 1 ,
and the new accident period i = i comprises the merged i and i + 1 .
In parallel with the notation introduced in Section 2.2.2, C i j will denote cumulative claim payments to the end of (a possibly merged) development period j with respect to the newly defined accident period i , and so, for i = i ,
C i j = C i , j + C i + 1 , j
Note that j is used to index development periods that may have already been merged. For the purpose of the present sub-section, D is taken to denote the claim triangle after any such mergers.
Let the data set consisting of all completed development periods after this merger of accident years be denoted D U . Let D L denote the corresponding lower triangle, and D = D U D L . This data set is illustrated in Figure 4 for the case i = 11 with accident periods i , i + 1 merged.
Consider the extent to which Assumptions 1 to 6 of Section 3.1 continue to hold in these changed circumstances. The briefest reflection reveals that Assumptions 2, 4 and 5 continue to hold, so attention is now turned to the other assumptions.
  • Assumption 1
This assumption evidently holds for all accident periods i other than i , so consider this one. Note that
E C i j + 1 | C i 1 , C i j , C i + 1,1 , C i + 1 , j         = E C i , j + 1 + C i + 1 , j + 1 | C i 1 , C i j , C i + 1,1 , C i + 1 , j         = C i j f j .
where the penultimate equality follows from Assumption 1 in Section 3.1. The only data point on which right side here is C i j , and so it follows that
E C i j + 1 | C i 1 , , C i j = C i j f j   ,
This shows that Assumption 1 continues to hold in the present circumstances. Note that the age-to-age factors f j are unchanged by the merger of accident periods.
  • Assumption 3
In this case,
V a r C i j + 1 | C i 0 , , C i j = V a r C i , j + 1 + C i + 1 , j + 1 | C i 0 , , C i j = V a r C i , j + 1 | C i 0 , , C i j + V a r C i + 1 , j + 1 | C i 0 , , C i j   ,
by Assumption 2. Then, by Assumption 3 and an argument similar to that leading to (75),
V a r C i j + 1 | C i 0 , , C i j = C i j σ j 2   .
This shows that Assumption 3 continues to hold in the present circumstances. Note that the variance factors σ j 2 are unchanged by the merger of accident periods.
  • Assumption 6
In this case, Assumption 6 of Section 3.1 must be modified to the following.
Assumption 6**.
i = 1 I j c i j 0 , j = 1 , , I 1 .
Here, and in the following, the ** notation always indicates quantities relating to D .
When Assumptions 1 to 5 and 6** hold, the Mack chain ladder model applies to the data set D . It is seen from Figure 4 that a Mack chain ladder model could be imposed on this data set in the usual way, even for the merged accident period, but with the exception of cell (11, 30), or in general cell i , I i + 1 . For completeness, the merged cell containing this one must also include i + 1 , I i + 1 in order to be usable, but this is as yet unobserved.
Hence, the chain ladder would be imposed on the data set D U \ i , I i + 1 , and it would be required that the forecast of the merged accident period be based on the last complete observation C i , I i . Subject to the exclusion of this single cell, parameter estimation is exactly as for a conventional Mack chain ladder. The same is true for the forecast, except for the merged accident period i . As foreshadowed just above, the forecast of ultimate claim cost in this case is (c.f. (56))
C ^ i , I = c i , I i F ^ I i : I   ,
Corresponding to (23) and (57) is
R ^ i = C ^ i , I c i , I i + 1 + c i + 1 , I i .
As noted in relation to (57), the value of c i , I i + 1 is known for the purpose of (78) though not usable in the estimation of an age-to-age factor.
Lemma 1 now gives the variance of the estimated loss reserve for the combined accident periods i and i + 1 . The proof is given in Appendix A.3.
Lemma 1.
The variance  V a r R ^ i | D U  takes the following form:
V a r R ^ i | D U = C i I I i + C i + 1 I I i 2 V i I i ,
where  V i I i   is given by
V i I i = j = I i I 1 B i , j , I i + σ j 2 f j 2 1 s j ,
B i , j , I i = σ j 2 f j 2 1 c i , I i F I i : j = σ j 2 f j 2 1 C i , I i + C i + 1 , I i F I i : j   .
 and
s j = k = 1 i 1 c k j = s j c i j , j = I i , = s j , j > I i .
Further
V a r R ^ i + R ^ i + 1 | D U = 1 + 2 ζ i , I i f I i C i I I i + 1 2 V i I i + 1 + C i + 1 , I I i 2 V i + 1 I i .
where
ζ i , I i = c i + 1 , I i c i , I i + 1   .
It is now possible to compare V a r R ^ i | D U and V a r R ^ i + R ^ i + 1 | D U . The difference between the variance of estimated loss forecast for merged and unmerged accident periods is now given by
V a r R ^ i | D U V a r R ^ i + R ^ i + 1 | D U         = C i I I i + C i + 1 I I i 2 V i I i         1 + 2 ζ i , I i f I i C i I I i + 1 2 V i I i + 1 + C i + 1 , I I i 2 V i + 1 I i ,
It is of interest to find characterizations of the positivity and negativity of this difference. Lemma 2 is useful for this purpose. The strategy here is to re-express (85) in such a way that all C terms appearing are the same, and similarly all V terms. Specifically, the only C terms appearing are C i I I i + 1 and the only V terms are V i I i + 1 . The proof is given in Appendix A.4.
Lemma 2.
For brevity here,  i  and  I i  are abbreviated to  i and k . Then the subject of (85) may be expressed as
V a r R ^ i | D U V a r R ^ i + R ^ i + 1 | D U = V a r R ^ i | D U V a r R ^ i + R ^ i + 1 | D U = { γ i k 2 1 + 2 ζ i k 1 π i k + 1 + ζ i k 2 η i k ω i k [ ω i , k 1 + 1 π i k + 1 + π i k + 1 γ i k ] 1 + 2 ζ i k f k } C i I k + 1 2 V i k + 1 .
where
η i k = s k c i k 1 1 σ k 2 f k 2 1 s k B i k k > 0   .
Proposition 9.
A necessary and sufficient condition that  V a r R ^ i | D U > ( = , < ) V a r R ^ i + R ^ i + 1 | D U  is
γ i k < = , > γ ~ i k 0   ,
where, for brevity,  i , k   d e n o t e   i , I i + 1 , and  γ ~ i , k 1 0  is defined by
γ ~ i k 0 = ½ π ~ i k + 1 + 1 ½ π ~ i k + 1 1 + ω ~ i k + ξ i k 1 1 ½ π ~ i k + 1 2 ½ ,
where
ξ i k = 1 + 2 ζ i k f k 1 1 + 2 ζ i k 1 π i k + 1 + ζ i k 2 η i k ω i k ,
π ~ i k + 1 = ξ i k π i k + 1 ,
ω ~ i k = ξ i k ω i k   .
Note the similarity between (69) and (89). After the conversion of quantities to tilda versions, the two results are almost identical.
A fairly weak condition is required to establish that γ i k 0 1 / ξ i k , as testified by Corollary 2, which is proved in Appendix A.6.
Corollary 2.
A sufficient condition that  γ i k 0 1 / ξ i k  is that
1 + ω i k 1 π i k f k .
Note that, as ξ i k increases, the lower bound on the critical value γ i k 0 becomes steadily smaller. For large values of ξ i k , the bound approaches zero. This means that only very small values of γ i k are shown to guarantee that increased granularity reduces the variance of the predicted loss reserve. This result stands in stark relief against that of Corollary 1 in the case of an increased granularity within a single accident period.
The physical meaning of large or small values of ξ i k is not clear, but it is a function of ζ i k that does have a clear meaning (see (84)). There is therefore value in examining the relation between ζ i k and ξ i k . Table 4 does so, examining three key values of ζ i k .
For an intuition for these values, recall from (64) that 0 < π i k < 1 and from (70) and (87) that, in many applications, ω i k and ω i k will be small relative to unity.
By (84), ζ i k measures the volume of claims in the accident period i + 1 relative to that in i . Combining Corollary 2 with Table 4 gives an indication of how variation of ζ i k changes the effect of granularity on the variance of forecast loss reserve. For example, if accident period i + 1 heavily dominates i ( ζ i k ), then ξ i k is large, and Corollary 2 indicates that the separation of the two accident periods is unlikely to be of benefit to the variance. This is consistent with intuition.
As a further example, suppose that ζ i k = 1 , implying equal claim volumes in accident periods i + 1 and i . Suppose also that π i k = ½ and f k = 1.2 . Then, by Table 4, ξ i k will be somewhat greater than 0.59. Then, if condition (93) is satisfied, Corollary 2 reveals that γ i k 0 is somewhat less than 1.7.
If the conclusion of Corollary 2 is a relatively tight inequality, then there is considerable benefit in the increased granularity of maintaining the separation of accident periods i + 1 and i .
The reasons why a decrease in granularity does not always lead to an enlargement of the variance of forecast loss reserve are apparent from (79) and (83).
In the former of these relations, it is seen that the merger of the two accident periods causes both to be forecasts as at time I i (instead of I i + 1 in one case), resulting in a loss of information. This is exactly what happened in the merger of development periods in Section 4.2.1.
On the other hand, however, relation (83) shows that, when the separation of accident periods is maintained, positive covariance arises between their forecasts of loss reserve. This is the term involving ζ i k , and it increases the variance of loss reserve of the aggregated accident periods.
Whether or not increased granularity is beneficial depends on the net effect of these opposing forces.
Remark 3.
The above analysis has considered only the merger of accident periods  i  and  i + 1  . However, the analysis can be repeated for further mergers. Suppose, for example, that the original development periods were months and one wished to form quarterly periods, perhaps involving the merger of  i , i + 1  and  i + 2 . One could commence with the merger of periods  i  and  i + 1 , as just analyzed.
One could then merge period  i + 2  with the combination of the first two. The above analysis would then apply equally to this merger of two accident periods. Two-period mergers could be continued in this way until the desired final set of merged accident periods was attained. The conclusions would be as above, specifically that
  • The reduction in granularity increases the variance of estimated loss reserve unless (88) holds with the relation >;
  • At each merger, the thresholds  γ i , k 1 0  appearing in (88) can be calculated and the likelihood of that relation assessed.

5. Discussion and Conclusions

Taylor (2025) showed that, when the EDF chain ladder is applied to a data triangle, greater data granularity always reduces the variance of the estimated loss reserve. The present paper considers a parallel question in relation to the Mack chain ladder, an alternative model form.
As in Taylor (2025), two types of increases in the mesh size of the data triangle are introduced:
  • Preserving calendar periods;
  • Preserving development periods;
but considerations of space limit the present paper to just the latter (Section 4). The former type of change of mesh may be considered in a separate paper.
The first question requiring consideration concerns whether the assumptions underpinning the Mack chain ladder in relation to the original data set continue to hold when specific development periods are merged. Proposition 4 finds that they do so under some mild regularity conditions.
Section 4.2 studies the changes in variance of the estimated loss reserve when the mesh size is changed, first when development periods are merged (Section 4.2.1) and then when accident periods are merged (Section 4.2.2). The consideration of changes in both of these dimensions is necessary as one may wish to compare the situations in, say, a 40 × 40 triangle that uses quarterly units of time and its derivative 10 × 10 triangle.
Section 4.2.1 considers the case in which just two consecutive development periods are merged, but all other development periods and all accident periods are left unchanged.
Here, results similar, but not identical, to those in Taylor (2025) are found. The variances of loss reserves are unchanged except in the case of a single specific accident period.
It is no longer the case that greater data granularity is guaranteed to reduce the variance of the estimated loss reserve. Whether or not a reduction occurs depends on the data points observed. The reason for this difference is essentially that, in the case of the Mack model, the standard deviation of the estimated loss reserve with respect to a particular accident period is proportional to the cumulative claim payments to date, whereas this is not so for the EDF chain ladder.
A reduction in variance does in fact occur if a defined condition on the data of the specific accident period subject to change is satisfied. The condition is that the ratio of the last observed age-to-age factor for the accident period to its expected value falls below a threshold whose value is defined in Proposition 8. It takes the form of a closed-form function of the data and the chain ladder parameters (age-to-age factors).
It is noteworthy that this threshold value always exceeds unity. Hence, an observed value of the ratio in question below unity is sufficient to indicate a reduction in the variance of the loss reserve estimate (Corollary 1). When the ratio exceeds unity, the occurrence or otherwise of variance depends on the threshold value, which is a matter of numerical evaluation.
A numerical example is given in Section 4.2.1, where it is found that the condition for variance reduction is comfortably satisfied for each possible merger of two consecutive development periods. There is plenty of scope for further numerical investigation, but the ease with which the necessary condition is met in this one example prompts a conjecture that counter-examples might sometimes require a certain degree of contrivance and that greater granularity would frequently, if not usually, lead to a reduction in variance of estimated loss reserve in practical cases.
All of these results hitherto have concerned the simple case in which two consecutive development periods are merged. However, Remark 2 points out that it is a simple matter to extrapolate to parallel results for an arbitrary merger of development periods.
Section 4.2.2 considers the merger of accident periods. Again, the Mack chain ladder remains valid under mild regularity conditions. It is found that the conditions for a reduction in variance of estimated loss reserve in this case are much more complex than in Section 4.2.1 for the merger of development periods.
As in Section 4.2.1, a reduction in variance with increased granularity is guaranteed if the ratio of the last observed age-to-age factor for the accident period to its expected value falls below a threshold. The threshold value is defined in Proposition 9 and, once again, it takes the form of a closed form function of the data and the chain ladder parameters.
In this case, however, the function is much more complex and less transparent than in Section 4.2.1. Moreover, as pointed out in Corollary 2, it produces a much stricter condition on the chain ladder parameters if variance reduction is to be guaranteed.
This point is discussed in Section 4.2.2, where the reason is identified as the correlation between the forecast loss reserves associated with distinct and unmerged accident periods. Whether variance reduction is achieved depends heavily on the relative volumes of claims in the respective accident periods subject to merger.
Although the threshold values mentioned above are readily available numerically, specific analytic results appear hard to come by in this area. Section 4.2.2 contains some indicative numerical clues.
All in all, the influence of data granularity on the variance of forecast loss reserve differs vastly between the EDF chain ladder and the Mack chain ladder. This seems remarkable for two models that produce precisely the same forecasts using precisely the same estimation algorithm.

Funding

This research received no external funding.

Data Availability Statement

The data set used here is that of Taylor and Ashe (1983), as presented in Mack (1993).

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Appendix A.1

Proof of Proposition 2.
Equation (34) yields the following at evaluation point k 1 :
V i k 1 = j = k 1 I 1 B i j , k 1 + + σ j 2 f j 2 1 s j = B i , k 1 , k 1 + + σ k 1 2 f k 1 2 1 s k 1 + j = k I 1 B i j , k 1 + σ j 2 f j 2 1 s j .
Now,
B i j , k 1 = σ j 2 f j 2 1 c i , k 1 F k 1 : j = σ j 2 f j 2 c i k c i , k 1 f k 1   1 c i k F k : j = c i k c i , k 1 f k 1   B i j k   .
The substitution of (A2) into (A1) yields
V i k 1 = B i , k 1 , k 1 + σ k 1 2 f k 1 2 1 s k 1 + 1 c i k c i , k 1 f k 1 j = k I 1 σ j 2 f j 2 1 s j + c i k c i , k 1 f k 1 j = k I 1 B i j k + σ j 2 f j 2 1 s j ,
and the proposition follows. □

Appendix A.2

Proof of Proposition 8.
The substitution of (21) into the quantity on the right side of (63) yields
C I j + 1 , I j 1 2 V I j + 1 j 1 C I j + 1 : I j 2 V I j + 1 j        = c I j + 1 , j 1 F j 1 : I 2 V I j + 1 j 1 c I j + 1 , j F j : I 2 V I j + 1 j   .
For brevity, it will be convenient to temporarily replace (just for the present proof) I j + 1 by i and j by k , whereupon (A4) becomes
C I j + 1 , I j 1 2 V I j + 1 j 1 C I j + 1 : I j 2 V I j + 1 j        = c i , k 1 F k 1 : I 2 V i k 1 c i k c i , k 1 f k 1 2 V i k        = c i , k 1 F k 1 : I 2 V i k 1 γ i , k 1 2 V i k ,
where use has been made of (27) and (37).
Now substitute (36) for V i k 1 to obtain
V i k 1 γ i , k 1 2 V i k = B i , k 1 , k 1 + σ k 1 2 f k 1 2 1 s k 1 + j = k I 1 σ j 2 f j 2 1 s j        + γ i , k 1 V i k j = k I 1 σ j 2 f j 2 1 s j γ i , k 1 2 V i k   .
Substitute (66) into (A6), divide through by V i k , and then substitute (64) into the result to obtain
V i k 1 γ i , k 1 2 V i k V i k = 1 π i k + ω i , k 1 + π i k γ i , k 1 γ i , k 1 2   ,
where ω i , k 1 is defined by (70).
The right side is merely a quadratic in γ i , k 1 , and its zeros are
γ i , k 1 = ½ π i k ½ π i k 2 + 4 1 π i k + ω i , k 1 ½        = ½ π i k 1 ½ π i k 1 + ω i , k 1 1 ½ π i k 2 ½ .
Note that, since the square root here is strictly positive, one of these zeros is > 1 , and the other is < π i k 1 . It was shown in (67) that π i k < 1 , and so this second zero is strictly negative. This zero is meaningless as γ i , k 1 cannot be negative. The first zero is meaningful and will be denoted by γ i , k 1 0 > 1 .
It is evident from the signs of the coefficients in (A7) that the quadratic changes from positive to negative at γ i , k 1 = γ i , k 1 0 , and so V i k 1 γ i , k 1 2 V i k > < 0 according to γ i , k 1 < > γ i , k 1 0 . The required result is then obtained by restoring i , k to their original values, I j + 1 , j . □

Appendix A.3

Proof of Lemma 1.
As seen in the preamble to the lemma, when the available data set is D U , the chain ladder model is applied to D U \ i , I i + 1 , and expected age-to-age and variance factors are unchanged relative to the model applied to D U :
f j = f j , σ j 2 = σ j 2 .
Moreover,
C i j = C i , j + C i + 1 , j , i = i ; = C i , j , i < i = C i + 1 , j , i > i   .
The quantity V a r R ^ i | D U is represented by (32) to (35), adapted to the new data set D U , specifically,
V a r R ^ i | D U = V a r R ^ i I i | D U = C i I I i 2 V i I i = C i I I i + C i + 1 I I i 2 V i I i ,
where V i I i is given by (80) to (82).
To prove (83), note that
V a r R ^ i + R ^ i + 1 | D U         = V a r R ^ i I i + 1 | D U + V a r R ^ i + 1 I i | D U         + 2 C o v R ^ i I i + 1 , R ^ i + 1 I i | D U .
For brevity, in the following, i and k will be used to denote i and I i , respectively. By (29),
V a r R ^ i + r k + 1 r | D U = C i + r , I k + 1 r 2 V i + r k + 1 r , r = 0,1 .
Now
R ^ i + r k + 1 r = c i + r , k + 1 r F ^ k + 1 r : I c i + r , k + 1 r , r = 0,1 ,
and so
C o v R ^ i k + 1 , R ^ i + 1 k | D U = c i , k + 1 c i + 1 , k C o v F ^ k : I , F ^ k + 1 : I = c i , k + 1 c i + 1 , k f k C i I k + 1 c i , k + 1 2 V i k + 1 ,
where the final expression follows from (42).
Substitution of (84) into (A15) reduces the latter to
C o v R ^ i k + 1 , R ^ i + 1 k | D U = ζ i k f k C i I k + 1 2 V i k .
Finally, the substitution of (A13) and (A16) into (A12) yields the stated result of the lemma after the translation of i , k back to their original meanings. □

Appendix A.4

Proof of Lemma 2.
The translation of (85) into the “ i , k notation” introduced in the lemma is
V a r R ^ i | D U V a r R ^ i + R ^ i + 1 | D U          = C i I k + C i + 1 , I k 2 V i k          1 + 2 ζ i k f k C i I k + 1 2 V i k + 1 + C i + 1 , I k 2 V i + 1 k .
As mentioned in the preamble to the lemma, all C and V terms are adjusted so that only C i I k + 1 and V i k + 1 appear.
C terms
By (31) and (37),
C i I k = c i k F k : I c i , k + 1 F k + 1 : I C i I k + 1 = c i k f k c i , k + 1 C i I k + 1 = γ i k 1 C i I k + 1 .
By (31), (84) and (A18),
C i + 1 , I k = c i + 1 , k F k : I c i k F k : I C i I k = ζ i k γ i k 1 C i I k + 1 .
V terms
Consider the term V i k , defined by (80) to (82). By (35), (81), and (84),
B i j k = C i k C i k + C i + 1 , k B i j k = 1 + ζ i k 1 B i j k   .
Further, by comparison of (82) with (4),
j = k I 1 σ j 2 f j 2 1 s j = j = k I 1 σ j 2 f j 2 1 s j + σ k 2 f k 2 1 s k c i k s k = 1 π i k + η i k ω i k V i k   ,
where π i k , η i k , ω i k ,   are defined by (64), (70), and (87), respectively.
By (A20) and (A21),
V i k = 1 + ζ i k 1 π i k + 1 π i k + η i k ω i k V i k   .
The term of (A17) involving V i + 1 k can be dealt with similarly by the decomposition of V i + 1 k and separate treatment of its two components.
V i + 1 k = j = k I 1 B i + 1 , j k + σ j 2 f j 2 1 s j ,
where
B i + 1 , j k = σ j 2 f j 2 1 c i + 1 , k F k : j = ζ i k 1 σ j 2 f j 2 1 c i k F k : j = ζ i k 1 B i j k ,
and substitution of this in (A23) yields
V i + 1 k = ζ i k 1 π i k + 1 π i k V i k   .
Now re-express (A22) and (A25) in terms of V i k + 1 . This can be achieved by re-writing (36) in the form
V i k = ω i k + 1 γ i k 1 π i k + 1 + γ i k V i k + 1        = ω i , k 1 + 1 π i k + 1 + π i k + 1 γ i k V i k + 1 .
This converts (A22) and (A25) to the following forms:
V i k = 1 + ζ i k 1 π i k + 1 π i k + η i k ω i k ω i k + 1 π i k + 1 + π i k + 1 γ i k V i k + 1   ,
V i + 1 k = ζ i k 1 π i k + 1 π i k ω i k + 1 π i k + 1 + π i k + 1 γ i k V i k + 1   .
The three members on the right side of (A17) can now be compiled one by one. By (A18), (A19) and (A27),
C i I k + C i + 1 , I k 2 V i k = γ i k 2 1 + ζ i k 2 1 + ζ i k 1 π i k + 1 π i k + η i k ω i k ( ω i k          + 1 π i k + 1 + π i k + 1 γ i k ) C i I k + 1 2 V i k + 1 .
By (A19) and (A28),
C i + 1 , I k 2 V i + 1 k = γ i k 2 ζ i k 2 ζ i k 1 π i k + 1 π i k ( ω i k + 1 π i k + 1 + π i k + 1 γ i k ) C i I k + 1 2 V i k + 1 .
The difference between these two quantities is
C i I k + C i + 1 , I k 2 V i k C i + 1 , I k 2 V i + 1 k          = γ i k 2 π i k + 1 + 2 ζ i k 1 π i k + 1 + ζ i k 2 η i k ω i k ( ω i k          + 1 π i k + 1 + π i k + 1 γ i k ) C i I k + 1 2 V i k + 1 .
Finally, including the remaining member from the right side of (A17) yields
V a r R ^ i | D U V a r R ^ i + R ^ i + 1 | D U          = { γ i k 2 1 + 2 ζ i k 1 π i k + 1 + ζ i k 2 η i k ω i k [ ω i k + 1 π i k + 1          + π i k + 1 γ i k ] 1 + 2 ζ i k f k } C i I k + 1 2 V i k + 1 .

Appendix A.5

Proof of Proposition 9.
The establishment of the necessary and sufficient condition stated in the proposition requires the evaluation of the zeros of the quantity (86) in Lemma 2, as a function of γ i k , or, equivalently, the quantity
Q = 1 + 2 ζ i k 1 π i k + 1 + ζ i k 2 η i k ω i k ω i k + 1 π i k + 1 + π i k + 1 γ i k         1 + 2 ζ i k f k γ i k 2 .
This is a quadratic in γ i k , so coefficients of different powers of this variable are collected, with the following result.
Q = 1 + 2 ζ i k f k γ i k 2 1 + 2 ζ i k 1 π i k + 1 + ζ i k 2 η i k ω i k π i k + 1 γ i k         1 + 2 ζ i k 1 π i k + 1 + ζ i k 2 η i k ω i k ω i k + 1 π i k + 1 ,
This may be abbreviated to
1 + 2 ζ i k f k 1 Q = γ i k 2 ξ i k π i k + 1 γ i k ξ i k ω i k + 1 π i k + 1
where ξ i k is defined by (90).
The zeros of this quadratic are obtained by the same process as in (A8), with modification for multiplier ξ i k . Note the negative sign of the constant term in (A35), which implies one positive and one negative zero. Only the positive zero is of interest. This completes the proof of the proposition. □

Appendix A.6

Proof of Corollary 2.
The condition to be proved is equivalent to γ ~ i k 0 1 .
By (89),
γ ~ i k 0 = ½ π ~ i k + 1 + 1 ½ π ~ i k + 1 1 + X ,
where ω ~ i k + ξ i k 1 is a sufficient condition for X > 0 .
There are two cases to be considered.
Case I:  π ~ i k + 1 2 . In this case,
γ ~ i k 0 = 1 + 1 ½ π ~ i k + 1 X 1 .
Case II:  π ~ i k + 1 > 2 . In this case, (89) becomes
γ ~ i k 0 = ½ π ~ i k + 1 + ½ π ~ i k + 1 1 1 + X = π ~ i k + 1 1 + ½ π ~ i k + 1 1 X > 1
Thus, ω ~ i k + ξ i k 1 > 0 is a sufficient condition for γ ~ i k 0 1 in general.
Therefore, the remainder of this proof will address the requirement that ω ~ i k + ξ i k 1 > 0 . The quantity on the left side of this inequality may be expressed as
ω ~ i k + ξ i k 1 = ξ i k 1 + ω i k 1         = 1 + 2 ζ i k f k 1 { 1 + ω i k [ 1 + 2 ζ i k 1 π i k         + 1 + ζ i k 2 η i k ω i k ] 1 + 2 ζ i k f k }         = 1 + 2 ζ i k f k 1 { η i k ω i k 1 + ω i k ζ i k 2         + 2 ζ i k 1 + ω i k 1 π i k + η i k ω i k f k         + ω i k 1 + η i k 1 + ω i k } .
Thus
1 + 2 ζ i k f k η i k ω i k 1 + ω i k 1 ω ~ i k + ξ i k 1        = ζ i k 2 + 2 ζ i k 1 + 1 π i k η i k ω i k f k η i k ω i k 1 + ω i k        + 1 + η i k 1 + ω i k η i k 1 + ω i k        = ζ i k + 1 + 1 π i k η i k ω i k f k η i k ω i k 1 + ω i k 2        + 1 + 1 η i k 1 + ω i k 1 + 1 + ω i k 1 π i k f k η i k ω i k 1 + ω i k 2   .
Now the penultimate term here is greater than unity, and the final term is unity or less when (93) holds, and then the totality of the right side of (A40) is strictly positive. This completes the proof. □

Appendix B

The data set of Taylor and Ashe (1983), as presented in Mack (1993), appears in Table A1.
Table A1. Data set of cumulative claim payments.
Table A1. Data set of cumulative claim payments.
Accident
Year
Cumulative Claim Payments to End of Development Year
12345678910
1357,8481,124,7881,735,3302,218,2702,745,5963,319,9943,466,3363,606,2863,833,5153,901,463
2352,1181,236,1392,170,0333,353,3223,799,0674,120,0634,647,8674,914,0395,339,085
3290,5071,292,3062,218,5253,235,1793,985,9954,132,9184,628,9104,909,315
4310,6081,418,8582,195,0473,757,4474,029,9294,381,9824,588,268
5443,1601,136,3502,128,3332,897,8213,402,6723,873,311
6396,1321,333,2172,180,7152,985,7523,691,712
7440,8321,288,4632,419,8613,483,130
8359,4801,421,1282,864,498
9376,6861,363,294
10344,014
The age-to-age factors F i j derived from this table are set out in Table A2.
Table A2. Age-to-age factors.
Table A2. Age-to-age factors.
Accident
Year
Age-to-Age Factor from Development Year
12345678910
13.1431.5431.2781.2381.2091.0441.0401.0631.018
23.5111.7551.5451.1331.0841.1281.0571.086
34.4481.7171.4581.2321.0371.1201.061
44.5681.5471.7121.0731.0871.047
52.5641.8731.3621.1741.138
63.3661.6361.3691.236
72.9231.8781.439
83.9532.016
93.619
10

References

  1. Gisler, Alois. 2019. The reserve uncertainties in the chain ladder model of Mack revisited. ASTIN Bulletin 49: 787–821. [Google Scholar] [CrossRef]
  2. Mack, Thomas. 1993. Distribution-free calculation of the standard error of chain ladder reserve estimates. ASTIN Bulletin 23: 213–25. [Google Scholar] [CrossRef]
  3. Taylor, Greg C., and Frank R. Ashe. 1983. Second moments of estimates of outstanding claims. Journal of Econometrics 23: 37–61. [Google Scholar] [CrossRef]
  4. Taylor, Gregory. 2000. Loss Reserving: An Actuarial Perspective. Boston: Kluwer Academic Publishers. [Google Scholar]
  5. Taylor, Gregory. 2009. The chain ladder and Tweedie distributed claims data. Variance 3: 96–104. [Google Scholar]
  6. Taylor, Gregory. 2025. The EDF chain ladder and data granularity. Risks 13: 65. [Google Scholar] [CrossRef]
  7. Taylor, Gregory Clive. 1985. Claim Reserving in Non-Life Insurance. Amsterdam: North-Holland. [Google Scholar]
  8. Wüthrich, Mario V., and Michael Merz. 2008. Stochastic Claims Reserving Methods in Insurance. Chichester: John Wiley & Sons Ltd. [Google Scholar]
Figure 1. Change of mesh size from quarterly to yearly.
Figure 1. Change of mesh size from quarterly to yearly.
Risks 13 00132 g001
Figure 2. Change of mesh size with preservation of development periods.
Figure 2. Change of mesh size with preservation of development periods.
Risks 13 00132 g002
Figure 3. Mapping between original development periods and those under increased mesh size.
Figure 3. Mapping between original development periods and those under increased mesh size.
Risks 13 00132 g003
Figure 4. Data set after the merger of accident periods.
Figure 4. Data set after the merger of accident periods.
Risks 13 00132 g004
Table 1. Mack chain ladder parameters.
Table 1. Mack chain ladder parameters.
j f j σ j 2 / 1000
13.49160
21.7545
31.4635
41.1716
51.1012
61.088
71.062
81.041
91.020.5
Table 2. Effect of enlarged mesh on prediction error.
Table 2. Effect of enlarged mesh on prediction error.
Accident YearEstimated at t = 10Estimated at t = 9Change in
Ultimate Claim CostStandard DeviationUltimate Claim CostStandard DeviationStandard Deviation
Amount%Age
25,445,86779,9225,212,813118,41938,49748
35,207,801118,3365,204,969167,91249,57642
45,159,269166,9105,321,496292,498125,58875
54,703,764270,7194,545,440376,655105,93639
64,931,552395,6024,666,544495,43999,83725
75,443,915542,9255,521,854734,851191,92535
86,536,467812,1935,674,999967,892155,69919
Table 3. Threshold values of age-to-age factors.
Table 3. Threshold values of age-to-age factors.
Accident
Year
Value ofObserved
k 1 π i k ω i , k 1 γ i , k 1 0 γ i , k 1
280.580.571.331.04
370.490.501.281.00
460.420.661.340.97
550.360.501.261.03
640.330.401.211.06
730.320.441.230.99
820.320.401.211.15
Table 4. Relation between ζ i k and ξ i k .
Table 4. Relation between ζ i k and ξ i k .
Value   of   ζ i k Value   of   ξ i k
0 1 + η i k ω i k
1 1 + 2 f k 1 3 2 π i k + 4 η i k ω i k
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Taylor, G. The Mack Chain Ladder and Data Granularity for Preserved Development Periods. Risks 2025, 13, 132. https://doi.org/10.3390/risks13070132

AMA Style

Taylor G. The Mack Chain Ladder and Data Granularity for Preserved Development Periods. Risks. 2025; 13(7):132. https://doi.org/10.3390/risks13070132

Chicago/Turabian Style

Taylor, Greg. 2025. "The Mack Chain Ladder and Data Granularity for Preserved Development Periods" Risks 13, no. 7: 132. https://doi.org/10.3390/risks13070132

APA Style

Taylor, G. (2025). The Mack Chain Ladder and Data Granularity for Preserved Development Periods. Risks, 13(7), 132. https://doi.org/10.3390/risks13070132

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop