Abstract
This paper is concerned with the choice of data granularity for the application of the Mack chain ladder model to forecast a loss reserve. It is a sequel to a related paper by Taylor, which considers the same question for the EDF chain ladder model. As in the earlier paper, it considers the question as to whether a decrease in the time unit leads to an increase or decrease in the variance of the loss reserve estimate. The question of whether a Mack chain ladder that is valid for one time unit (here called mesh size) remains so for another is investigated. The conditions under which the model does remain valid are established. There are various ways in which the mesh size of a data triangle may be varied, two of them of particular interest. The paper examines one of these, namely that in which development periods are preserved. Two versions of this are investigated: 1. the aggregation of development periods without change to accident periods; 2. the aggregation of accident periods without change to development periods. Taylor found that, in the case of the Poisson chain ladder, an increase in mesh size always increases the variance of the loss reserve estimate (subject to mild technical conditions). The case of the Mack chain ladder is more nuanced in that an increase in variance is not always guaranteed. Whether or not an increase or decrease occurs depends on the numerical values of certain of the age-to-age factors actually observed. The threshold values of the age-to-age factors at which an increase transitions to a decrease in variance are calculated. In the case of a change in the mesh of development periods, but with no change to accident periods, these values are computed for one particular data set, where it is found that variance always increases. It is conjectured that data sets in which this does not happen would be relatively rare. The situation is somewhat different when changes in mesh size over accident periods are considered. Here, the question of an increase or decrease in variance is more complex, and, in general terms, the occurrence of an increase in variance with increased mesh size is less likely.
Keywords:
data granularity; forecast efficiency; Mack chain ladder; loss reserve; mesh size; Poisson 1. Introduction
1.1. Background
The chain ladder is a widely used model for insurance loss reserving (Mack 1993; Taylor 1985, 2000; Wüthrich and Merz 2008). The data set to which it is applied is typically triangular, with rows labelled by accident period and columns by development period.
Accident and development periods are commonly years, but other units of time are possible, e.g., quarters, months, and weeks. As the time unit is decreased, the number of data points increases, but the volatility of each point increases. This leads to a question as to whether a decrease in the time unit leads to an increase or decrease in the variance of the loss reserve estimate. A natural companion to this question is whether there is an optimal time unit at which the variance of the forecast loss reserve is minimized.
A parallel question on the effects of mesh size on modelling and forecasting arises in a wide range of scientific areas. Taylor (2025) cites references.
The prior actuarial literature contains little comment on these matters, but Taylor (2025) investigated the question in relation to the EDF chain ladder (Wüthrich and Merz 2008; Taylor 2009), where EDF refers to the exponential dispersion family, and particularly the Poisson chain ladder. The conclusion in this last case was subject to a number of conditions, but, in broad terms, it was that a choice of the most granular data possible (i.e., a choice of accident and development periods of short duration) would minimize the variance of the forecast loss reserve.
1.2. Purpose of the Paper
Taylor (2025) noted that there were two distinct formulations of the chain ladder model, namely the EDF chain ladder referred to above and the Mack chain ladder (Mack 1993). Whereas Taylor investigated the former, the present paper considers similar questions of data granularity in relation to the Mack chain ladder.
The present paper thus discusses the influence of data granularity on the variance of the loss reserve forecast by the Mack chain ladder. Under suitable conditions, it is possible to identify the granularity that minimizes this variance.
In Taylor (2025), a distribution from the EDF was assigned to each observation. This enabled an appeal to sufficient statistics in the search for minimum variance unbiased estimators (“MVUEs”), and these statistics carried much of the theoretical load in that paper. By contrast, the Mack model is distribution-free, so sufficient statistics cannot be defined. The identification of MVUEs must proceed by different means.
1.3. Layout of the Paper
The paper considers changes in mesh size on the triangular data set, which is to say changes in the units of time used for accident and development periods. The effect of an increase in mesh size (i.e., an increase in the amount of time spanned by one of these periods) on the variance of the estimated loss reserve is investigated.
There are various ways in which the mesh size of a data set can be varied. Some are more sensible than others, and two fundamental types of variation were identified in Taylor (2025), specifically those that preserve
- Calendar periods;
- Development periods.
These types of variation are described in Section 2, where a specific notation is developed for each type of variation to allow translation between the original data set and that resulting from the change in mesh size.
Both types of variation are deserving of study in the context of the Mack chain ladder. However, considerations of space have limited the present paper to just one, namely changes in mesh size that preserve development periods. Those that preserve calendar periods may be explored in a separate paper.
Section 3 reviews the Mack chain ladder model, including its forecast of loss reserve, with an emphasis on the variance of the quantity under forecast.
Section 4 examines the effect of changed mesh size on the variance of the loss reserve when the change preserves development periods. First, the conditions are established under which the Mack chain ladder continues to be a valid model under changes in mesh (Section 4.1). In the cases where the Mack chain ladder remains a valid model, the effect of the changed mesh on the variance of the estimated loss reserve is then calculated.
Two cases are considered:
- That in which the development period mesh is changed (Section 4.2.1);
- That in which the accident period mesh is changed (Section 4.2.2).
2. Notation and Mathematical Preliminaries
The notational and mathematical apparatus required here is much the same as in Taylor (2025), and the majority of it is taken from that source.
2.1. Fundamentals
Let denote the accident period and the development period. Taylor (2025) denoted the range of as but the present notation will be more natural later when development periods are merged and a notation is required for this.
Let , with mean , denote the random variable representing the amount of claim payments during development period of accident period . These quantities are usually referred to as incremental claim payments. For the moment, all accident and development periods are of equal duration, though this will change later.
A calendar period consists of all those combinations of and such that is constant. For definiteness, label the calendar if .
It will be assumed that a standpoint is taken at the end of calendar period and one is in possession of certain data from calendar periods . Specifically, these data comprise the claim triangle . This will be referred to as the upper triangle. One wishes to use these data to forecast the lower triangle Let .
In a loss-reserving context, a realization of the upper triangle will have been observed. This will be denoted , which is the same as with each random variable replaced by its realization .
It will also be convenient to define cumulative claim payments. Thus, define
which is the random variable representing the amount of cumulative claim payments up to the end of development period of accident period . Further, let be defined in the same way in terms of the so that is a realization of .
It is assumed that there is no claim activity beyond development period , in which case is the ultimate claim cost of accident period .
The objective of the loss-reserving exercise is to forecast unobserved lower triangle. This will be denoted , which is the same as with each random variable replaced by its forecast .
The upper and lower triangles have been defined in terms of incremental claim payments. It will sometimes be convenient to define them in terms of cumulative claim payments, e.g., . As there is an equivalence relation between the two forms of triangle, the terms upper and lower triangle will be used, with a slight abuse of terminology, to refer to either form, provided that the context removes any ambiguity.
Let denote row of the incremental , i.e., , and let denote the sum of this row:
Further, let denote column of the cumulative , i.e., , and let denote the sum of this column:
It will also be convenient to define
As noted above, is the ultimate claim cost of accident period , and so is an estimate of this quantity. The amount of outstanding losses (the loss reserve) for this accident period is equal to
and is estimated by
The total reserve for all accident periods of interest is
which is estimated by
For a generic random variable , the notation will mean that has mean and variance with no specified distribution.
2.2. Mesh Size
2.2.1. Preservation of Calendar Periods
Section 2.1 is phrased in terms of accident, developmentm and calendar periods, without any specification of the meaning of “period”. Often, in the literature, this unit is a year. But it need not be; it might be a quarter, month, week, or any other convenient length of time. This will be referred to as the mesh size of the data triangle.
Consider how the mesh size might be changed. Suppose that, in , for some strictly positive integers . The mesh size can be changed from one unit of time to units. There will then be accident and development periods, instead of the original .
An example would be the case in which the units of time in are quarters and . Here, the mesh size is changed from quarters to years, and contains 10 accident and development years.
In the case of general and , the change of mesh will induce a new upper triangle in which row will be obtained from the merger of rows from .
Now suppose, in addition, that the change of mesh is required to preserve calendar periods. By this is meant that calendar period in will comprise calendar periods from .
For given and , development period . Then the cell of will consist of all pairs of from such that the second member of the pair is non-negative. Equivalently, it consists of all pairs . Let be defined in the same way as but for the cell of .
This is the form of aggregation from unit to -unit periods commonly used in commercial practice. It is illustrated in Figure 1, in which a quarterly triangle is collapsed to a yearly triangle. The upper triangle is shaded yellow, and the lower one is shaded green. Calendar years are delineated by the red diagonals. Development years 5 and 6 are shaded blue and purple, respectively. Development year 1 is also shown in orange to illustrate its exceptional nature.
Figure 1.
Change of mesh size from quarterly to yearly.
It will also be useful to define the following quantities in parallel with and :
2.2.2. Preservation of Development Periods
It is seen in Figure 1 that the quarterly development periods that contribute to a specific yearly development period differ by accident quarter. Although this is standard commercial practice, one might regard this as sometimes undesirable.
An alternative is as follows. Define merged accident periods exactly as in Section 2.2.1, but define merged development periods in such a way that, for all accident periods, development period always consists of the same development periods from .
In this case, there is no requirement that accident and development periods be subject to the same mesh size. There is not even a requirement that all development periods be of the same duration.
To develop a notation for changed mesh size, we commence with the ordered set . This is the set labels for (unmerged) development periods, supplemented by a zero. The reason for this is that the development period label will be taken to relate to the end of that period, and so the zero denotes the point of commencement of development period 1. Then will span the entire duration of the original development periods.
Now introduce integer cut-points to partition the set with . The segments of this partition are merged development periods. There are of these, and the -th is the union of the unmerged development periods and is of duration time periods.
A simple example would be similar to the one introduced in Section 2.2.1, in which the original development periods are quarterly, and the mesh size is changed from quarterly to annual. In this case, with , one sets , and the merged (annual) development period comprises unmerged development periods .
A more general situation is illustrated in Figure 2, where just three cut-points have been inserted, namely This means that development quarters 17 to 20 have been merged into a single development year, here shaded blue, and similarly development quarters 21 to 24, shaded purple. The other development quarters have been left intact. Note that, for each of these development years, there is a triangle of data relating to the latest accident periods. These have not been shaded as they are incomplete and not comparable with the merged development years for earlier accident periods.
Figure 2.
Change of mesh size with preservation of development periods.
The quantities provide a mapping between the original development periods and the development periods under the increased mesh size. This is illustrated in Figure 3, which displays the mapping that applies to Figure 2.
Figure 3.
Mapping between original development periods and those under increased mesh size.
In general, let quantities associated with merged development periods be denoted as starred versions of the analogous quantities associated with unmerged development periods. For example, whereas denotes the claim payments in unmerged development period of accident period , will denote the claim payments in merged development period of the same accident period, i.e., in development periods , equivalently, from the end of development period to the end of development period .
Greater brevity is possible in denoting cumulative claim payments. Whereas denotes the cumulative claim payments to the end of unmerged development period of accident period , will denote same quantity, provided that a merged development period ends in the unmerged development period .
Sometimes it will be useful to notate quantities on the merged time scale. For example, will denote cumulative claim payments to the end of the merged development period . Evidently, .
With this adjustment of mesh size, the data set to which a chain ladder will be applied comprises all cells of that form complete development periods under the enlarged mesh. In Figure 2, these are the blue and purple cells for (original) development periods 16 to 23 and the yellow cells for other development periods.
In the general case described earlier in this sub-section,
where
It is noteworthy that not all rows of extend as far as the -th diagonal of . That is to say, the re-constituted data set does not contain all of the most recently available data. It will be useful to construct an indicator of the most recent usable (original) cell of each row. This will be the last cell of the row up to and including the -th diagonal that falls within a complete development period under the enlarged mesh.
Define
which is to say that is the last complete merged development period in row , and is the last unmerged development period contained in .
Similarly, define the maximum row containing the complete development period as
It will also be useful to define the following quantities, in parallel with and ,
3. Mack Chain Ladder
3.1. Model Assumptions
Assumption 1
(Mack assumption (1)). for all with .
The factor is usually referred to as an age-to-age factor or a link ratio.
Assumption 2
(Mack assumption (2)). Different accident periods are stochastically independent; i.e., are independent for .
Assumption 3
(Mack assumption (3)). for all with .
This is the totality of Mack’s assumptions, but it will be convenient to add a few further very mild assumptions here.
Assumption 4.
. This asserts only that no observations are deterministic.
Assumption 5.
. The purpose here is to avoid a non-positive forecast of future claims (see (21) below).
Assumption 6.
. The purpose here is to ensure that the chain ladder algorithm is well-defined (see (18) below). It seems that this assumption is implicit in Mack’s paper.
3.2. Estimation of Loss Reserve
The chain ladder proceeds to estimate loss reserve as follows. As given by Mack, the age-to-age factor and dispersion factor are estimated by
and
where
The forecasts are then calculated as
As noted in Section 2, is the ultimate claim cost of accident period , and so is an estimate of this quantity. The amount of outstanding losses for this accident period is equal to
and is estimated by
The total reserve for all accident periods of interest is
which is estimated by
Mack derives the following results on the basis of the assumptions of Section 3.1.
Proposition 1
(Mack’s Theorem 2). Under Assumptions 1 and 2, the estimators are unbiased and uncorrelated. It follows that is an unbiased estimator of .
3.3. Variance of Forecast
It will be convenient to introduce the notation
subject to the convention that for .
Remark 1.
It will be found useful later to remark that , which follows from the fact that in Assumption 1.
The variance of the estimated reserve is of interest, specifically , where is the further abbreviation .
It will be convenient to write
where denotes the estimate of with forecast by on the basis of data to the end of the development period , i.e.,
and where
and the addition of a hat has the usual meaning.
Accident period has developed as far as development period , and so the estimate of loss reserve that maximizes the use of data is the one appearing in (23), which may be expressed as , and its variance, from (29), is
This (or, more precisely, an estimate of it) was calculated by Mack (1993, p. 217) and also by Gisler (2019, p. 806). Wüthrich and Merz (2008, pp. 53–54) reproduce the same quantity, though they go on to suggest alternative results based on varying bootstrap re-sampling strategies.
The following reconciles with Mack’s result in the case , and extends it to general . The extension follows straightforwardly from the algebra in Mack’s paper dealing with the special case.
which may be conveniently abbreviated as follows:
where
There is a recurrence relation that relates values of for consecutive evaluation points and is useful for the computation of these quantities. It is proven in Appendix A.1, and set out in the following.
Proposition 2.
With defined by (33), the following recurrence relation holds.
where
The ratio appearing here plays a crucial role in the following and is of interest. The numerator is the observed development of the accident period from the end of development period to , and the denominator is its expected value. So, for example, indicates that development over that period exceeds expected.
It is sometimes useful to introduce the parameter . Then, (34) takes the form
where
An interesting case of (33) occurs when the ratio is independent of , which includes the case where all cells of are subject to over-dispersed Poisson distributions with common scale parameter. The value of is then given by the following result.
Proposition 3.
Consider the case in which the following assumption holds in addition to Assumptions 1 to 6.
Assumption 7.
const. for all .
Then (38) reduces to
Now consider the total reserve , with each accident period evaluated at the end of calendar period , i.e., development period and . By Assumption 2, (25) and (29) yield
where, in the last summation, the summand draws on (21).
This may be simplified by taking account of Proposition 1. The covariance term becomes (remembering that )
where the second equality follows from Proposition 1 and the last equality from (29) and (30). With substitution of (42), result (41) becomes
4. Effect of Change of Mesh Size Under the Preservation of Development Periods
Section 2.2 discussed two types of change of mesh size, preserving, respectively, calendar and development periods. These two types of change carry very different implications for chain ladder models. The effects of both types of change on the EDF chain ladder were discussed by Taylor (2025).
It is also necessary to consider the effects of both on the Mack chain ladder. This, however, would expand the present paper beyond a reasonable size. For this reason, the following sections consider only changes in mesh that preserve development periods. The preservation of calendar periods may be considered in a separate paper.
A forerunner to the examination of the effects of change of mesh size under the preservation of development periods is a consideration of whether or not the Mack chain ladder structure is maintained under such changes. This will be the subject of Section 4.1.
4.1. Maintenance of Model Assumptions
4.1.1. Cell Means
Consider the change of mesh described in Section 2.2.2. The notation will be as introduced there. For brevity, just label the merged development periods . It is necessary to check whether the Mack chain ladder model is applicable under the changed mesh size, i.e., whether Assumptions 1 to 6 of Section 3.1 continue to hold. The present sub-section examines Assumptions 1, 2, 5 and 6; the next examines Assumptions 3 and 4.
- Discussion of Assumption 1
Consider a generic merged development period . According to the notation of Section 2.2.2,
Now, for ,
By the repeated application of Assumption 1.
Substitution of (45) into (44) then yields
which is of the form required by Assumption 1, and so this form of assumption continues to hold for the enlarged mesh size.
- Discussion of Assumption 2
Note that is a subset of It then follows from Assumption 2 that are independent for . This is of the same form as Assumption 2, and so this form of assumption continues to hold for the enlarged mesh size.
- Discussion of Assumption 5
With the change of mesh size, this assumption requires replacement by the following.
Assumption 5*.
.
- Discussion of Assumption 6
With the change of mesh size, this assumption requires replacement by the following.
Assumption 6*.
.
4.1.2. Cell Variances
- Discussion of Assumption 3
Mack (1993, Theorem 3) evaluates variances of the form , and the calculation given there is easily adapted to the case for . The result is
Simple adaptation to the present situation yields
This can be expressed in the form
where
The relation (49) is of the form required by Assumption 3, which therefore continues to hold for the enlarged mesh size.
- Discussion of Assumption 4
By Assumptions 1 and 3, , and it immediately follows from (50) that , as required by Assumption 4.
The reasoning of Section 4.1.1 and the present sub-section leads to the following result.
Proposition 4.
Consider the data array , and suppose it is subject to a Mack chain ladder model in the sense of satisfying Assumptions 1 to 6 of Section 3.1. Now impose the change of mesh size described in Section 2.2.2. This induces new data sets and . If Assumptions 5 and 6, are replaced by 5* and 6*, then will also be subject to a Mack chain ladder model.
4.1.3. Estimation of Loss Reserve
The estimation proceeds in parallel with Section 3.2. In place of (18) to (20), one writes
and
where
The forecasts are calculated as
With an obvious notation parallel to (26)–(28), this last forecast may be abbreviated as follows:
The forecast ultimate claim cost of accident period is then
Corresponding to (23) is
and (25) follows as previously. Note here that the value of is known (it is the total amount of claims paid to the valuation date in respect of accident period ) though it is not usable in the estimation of an age-to-age factor.
The forecasts in (54) are based on the data point , and, as noted in Section 2.2.2, this does not always coincide with the most recent data point under the original mesh size. Definition (13) shows that, when it does not, , meaning that development period does not include data from the -th diagonal of . The consequence of this is as follows.
Proposition 5.
When an increase in mesh size converts data triangle to , and a chain ladder model is applied to in accordance with Section 4.1.1, the forecasts (54) for some accident periods will rely on earlier data than was available for the same accident periods in .
Nevertheless, the following can be said of this model’s forecasts, in parallel with Proposition 1. The reasoning is the same as there.
Proposition 6.
Under Assumptions 1 and 2, the estimators are unbiased and uncorrelated. It follows that is an unbiased estimator of .
4.2. Forecast Variance
4.2.1. Variation of Mesh over Development Periods
It will be convenient to commence with a consideration of the case in which an enlargement of mesh size is effected by the coalescence of just two consecutive development periods, say and . In the notation of Section 2.2.2, is partitioned into the subsets by cut-points for and for .
In this case, (13) and (15) yield
and
The variance of loss reserve estimate in (29) and (33) is now replaced by the following.
where is yet to be determined and is the quantity corresponding to in (33) when development periods and are merged, as above.
To calculate how differs from , it is necessary to consider three cases as follows.
Case I: . In this case , which means that past development includes the two merged development periods , and so accident period will in the future pass through only development periods with “normal” age-to-age factors. It also follows from (58) that , which corresponds with .
Case II: . In this case , which means that past development has not yet reached development period , the first of the two to be merged, and so the accident period will in the future pass through both development periods . It also follows from (58) that .
Case III: . In this case , which means that past development has passed through the first of the “abnormal” development periods, , but not the second, +1. It may be noted that the merged cell is incomplete, since the unmerged cell lies in the future. It also follows from (58) that , which corresponds with .
It is evident from the description of Case I that, the data point is available in row , and the variance of the estimated loss reserve may be calculated in the “normal” way. Specifically, (60) may be applied with :
with the last two terms on the right obtained from (60) and (61).
Case II may be dealt with similarly, and again (61) arises. The fact that accident period is still to pass through the two merged development periods has no effect on the stochastic behavior of the variance.
Case III is a little different. Here (61) is replaced by
In this case, the data point must be forfeited as it forms part of an incomplete merged cell, and so the valuation standpoint must be taken at the end of the development period instead of the usual .
The following proposition summarizes the situation.
Proposition 7.
With defined by (34), the relevant variances of loss reserve estimates are the following:
- For
- For
- For
It is evident from this result and (29) that the variance of estimated loss reserve is unchanged by the merger of the two development periods for all . However, this equality does not hold for , and the question of interest concerns the relative magnitudes of .
To study this question, one can write
Define
and note that
And
where (34) has been used to obtain the last three relations.
It is evident from (64) that
The following result, proven in Appendix A.2, establishes the ordering of .
Proposition 8.
For the case , a necessary and sufficient condition that is
where and the threshold value is given by
with
Corollary 1.
It follows from (70) that , and so a sufficient condition that is that , where take the same values as in Proposition 8.
The main conclusion to be drawn from Proposition 8 is that does not always exceed . That is to say, the merger of development periods (or decreasing granularity) does not always result in increased predictive variance. This result differs from the corresponding one for the EDF chain ladder in Taylor (2025).
The difference arises from the unique feature of the Mack chain ladder according to which the variance of predicted loss reserve is conditioned by, and in fact is proportional to, the squared amount of claims paid to the date of valuation (see (32)). It is seen above that when development periods and are merged, the estimated variance of forecast loss reserve accident for accident period becomes proportional to instead of .
If is small relative to , then it can turn out that . This is precisely what happens when is larger than its expected value by a sufficient margin.
Corollary 1 points out that the threshold value , and, to this extent, will tend to exceed ; i.e., decreasing granularity will tend to lead to increased variance of loss reserve forecast, but not invariably.
A numerical example of these thresholds is now given. It is based on the numerical example from Mack (1993), which in turn uses the data set from Taylor and Ashe (1983). The data set is reproduced in Appendix B.
The parameter estimates of and given by Mack just after his Table 1, after mild smoothing, are taken as parameters for the purpose of this example and are displayed in Table 1 below.
Table 1.
Mack chain ladder parameters.
The effect of merging two development years on the variance of predicted loss reserve will be studied. For this purpose, values of will be calculated from (34) for , and then will be calculated from (36). The standard deviation of the estimated loss reserve is then derived from (32). The results of these calculations are reported in Table 2, which also includes values of , the estimate ultimate claim cost of accident year , as estimated at the end of development year .
Table 2.
Effect of enlarged mesh on prediction error.
The interpretation of this table is as follows. Consider accident year 2. Its forecast ultimate claim cost, on the basis of the full triangle in Table 1, is 5,445,867 with a standard deviation of 79,922 (which is also the standard deviation of the loss reserve). Now suppose that development years 9 and 10 are merged, i.e., .
Then, according to Proposition 7, variances of forecast loss reserves are unchanged except in the case of accident year 2, which must be calculated as at the end of development year 8; i.e., the valuation point is changed from end of calendar year 10 to 9. The table then shows that the ultimate cost of the accident year is estimated as 5,212,813, with a standard deviation of 118,419. The standard deviation has increased by 48%.
A similar situation is found for all other mergers of development years. For example, if development years 4 and 5 are merged, i.e., , then the affected row of Table 2 will be that relating to accident year 7, and here the increase in variance of forecast reserve is 35%. In fact, it is seen from the table that any merger of two consecutive development years leads to an increase in variance.
Proposition 8 shows that this result will occur only if the observed development factors are not too large. This aspect of the data set is now studied.
Table 3 displays the threshold values of the age-to-age factors in the numerical example under study, calculated from (69). The intermediate quantities and are also shown, as well as the observed value of corresponding to the threshold value
Table 3.
Threshold values of age-to-age factors.
The table is read as follows. For accident year , for example, corresponding to the merger of development years 4 and 5 (see the commentary on Table 2), the loss reserve must be evaluated at the end of development year (instead of 7). According to Proposition 8, this will lead to an increased variance of loss reserve unless indicates that 0.97 (=1.047/1.08, from Table A2).
Indeed, the observed values of well below the threshold value for all in the table. This indicates that variance is increased for any merger of a pair of development years, a result that is consistent with Table 2.
Remark 2.
The above analysis has considered only the merger of development periods and . However, the analysis can be repeated for further mergers. Suppose, for example, that the original development periods were months and one wished to form quarterly periods, perhaps involving the merger of and One could commence with the merger of periods and , as just analyzed. This led to the subset cut-points
One could then merge period with the first two. This would create new cut-points
The above analysis would then apply equally to this merger of two development periods. Two-period mergers could be continued in this way until the desired final set of merged development periods was attained. The conclusions would be as above, specifically that
- The reduction in granularity increases the variance of the estimated loss reserve unless a condition parallel to (68) is breached;
- At each merger, the thresholds appearing in (68) can be calculated, and the occurrence or non-occurrence of that breach assessed.
4.2.2. Variation of Mesh over Accident Periods
Section 4.2.1 considered just the merger of development periods into longer periods. It might be desired that accident periods be merged at the same time. As an example, suppose that accident periods and are merged. Consistently with the notation adopted in Section 3, let the new accident periods be denoted by , where
and the new accident period comprises the merged and .
In parallel with the notation introduced in Section 2.2.2, will denote cumulative claim payments to the end of (a possibly merged) development period with respect to the newly defined accident period , and so, for ,
Note that is used to index development periods that may have already been merged. For the purpose of the present sub-section, is taken to denote the claim triangle after any such mergers.
Let the data set consisting of all completed development periods after this merger of accident years be denoted . Let denote the corresponding lower triangle, and . This data set is illustrated in Figure 4 for the case with accident periods merged.
Figure 4.
Data set after the merger of accident periods.
Consider the extent to which Assumptions 1 to 6 of Section 3.1 continue to hold in these changed circumstances. The briefest reflection reveals that Assumptions 2, 4 and 5 continue to hold, so attention is now turned to the other assumptions.
- Assumption 1
This assumption evidently holds for all accident periods other than , so consider this one. Note that
where the penultimate equality follows from Assumption 1 in Section 3.1. The only data point on which right side here is , and so it follows that
This shows that Assumption 1 continues to hold in the present circumstances. Note that the age-to-age factors are unchanged by the merger of accident periods.
- Assumption 3
In this case,
by Assumption 2. Then, by Assumption 3 and an argument similar to that leading to (75),
This shows that Assumption 3 continues to hold in the present circumstances. Note that the variance factors are unchanged by the merger of accident periods.
- Assumption 6
In this case, Assumption 6 of Section 3.1 must be modified to the following.
Assumption 6**.
.
Here, and in the following, the ** notation always indicates quantities relating to .
When Assumptions 1 to 5 and 6** hold, the Mack chain ladder model applies to the data set . It is seen from Figure 4 that a Mack chain ladder model could be imposed on this data set in the usual way, even for the merged accident period, but with the exception of cell (11, 30), or in general cell . For completeness, the merged cell containing this one must also include in order to be usable, but this is as yet unobserved.
Hence, the chain ladder would be imposed on the data set , and it would be required that the forecast of the merged accident period be based on the last complete observation . Subject to the exclusion of this single cell, parameter estimation is exactly as for a conventional Mack chain ladder. The same is true for the forecast, except for the merged accident period . As foreshadowed just above, the forecast of ultimate claim cost in this case is (c.f. (56))
Corresponding to (23) and (57) is
As noted in relation to (57), the value of is known for the purpose of (78) though not usable in the estimation of an age-to-age factor.
Lemma 1 now gives the variance of the estimated loss reserve for the combined accident periods and . The proof is given in Appendix A.3.
Lemma 1.
The variance takes the following form:
where
is given by
and
Further
where
It is now possible to compare and The difference between the variance of estimated loss forecast for merged and unmerged accident periods is now given by
It is of interest to find characterizations of the positivity and negativity of this difference. Lemma 2 is useful for this purpose. The strategy here is to re-express (85) in such a way that all terms appearing are the same, and similarly all terms. Specifically, the only terms appearing are and the only terms are . The proof is given in Appendix A.4.
Lemma 2.
For brevity here, and are abbreviated to and . Then the subject of (85) may be expressed as
where
Proposition 9.
A necessary and sufficient condition that is
where, for brevity, , and is defined by
where
Note the similarity between (69) and (89). After the conversion of quantities to tilda versions, the two results are almost identical.
A fairly weak condition is required to establish that , as testified by Corollary 2, which is proved in Appendix A.6.
Corollary 2.
A sufficient condition that is that
Note that, as increases, the lower bound on the critical value becomes steadily smaller. For large values of , the bound approaches zero. This means that only very small values of are shown to guarantee that increased granularity reduces the variance of the predicted loss reserve. This result stands in stark relief against that of Corollary 1 in the case of an increased granularity within a single accident period.
The physical meaning of large or small values of is not clear, but it is a function of that does have a clear meaning (see (84)). There is therefore value in examining the relation between and . Table 4 does so, examining three key values of .
Table 4.
Relation between and
For an intuition for these values, recall from (64) that and from (70) and (87) that, in many applications, and will be small relative to unity.
By (84), measures the volume of claims in the accident period relative to that in . Combining Corollary 2 with Table 4 gives an indication of how variation of changes the effect of granularity on the variance of forecast loss reserve. For example, if accident period heavily dominates (), then is large, and Corollary 2 indicates that the separation of the two accident periods is unlikely to be of benefit to the variance. This is consistent with intuition.
As a further example, suppose that , implying equal claim volumes in accident periods and . Suppose also that and . Then, by Table 4, will be somewhat greater than 0.59. Then, if condition (93) is satisfied, Corollary 2 reveals that is somewhat less than 1.7.
If the conclusion of Corollary 2 is a relatively tight inequality, then there is considerable benefit in the increased granularity of maintaining the separation of accident periods and .
The reasons why a decrease in granularity does not always lead to an enlargement of the variance of forecast loss reserve are apparent from (79) and (83).
In the former of these relations, it is seen that the merger of the two accident periods causes both to be forecasts as at time (instead of in one case), resulting in a loss of information. This is exactly what happened in the merger of development periods in Section 4.2.1.
On the other hand, however, relation (83) shows that, when the separation of accident periods is maintained, positive covariance arises between their forecasts of loss reserve. This is the term involving , and it increases the variance of loss reserve of the aggregated accident periods.
Whether or not increased granularity is beneficial depends on the net effect of these opposing forces.
Remark 3.
The above analysis has considered only the merger of accident periods and . However, the analysis can be repeated for further mergers. Suppose, for example, that the original development periods were months and one wished to form quarterly periods, perhaps involving the merger of and . One could commence with the merger of periods and , as just analyzed.
One could then merge period with the combination of the first two. The above analysis would then apply equally to this merger of two accident periods. Two-period mergers could be continued in this way until the desired final set of merged accident periods was attained. The conclusions would be as above, specifically that
- The reduction in granularity increases the variance of estimated loss reserve unless (88) holds with the relation >;
- At each merger, the thresholds appearing in (88) can be calculated and the likelihood of that relation assessed.
5. Discussion and Conclusions
Taylor (2025) showed that, when the EDF chain ladder is applied to a data triangle, greater data granularity always reduces the variance of the estimated loss reserve. The present paper considers a parallel question in relation to the Mack chain ladder, an alternative model form.
As in Taylor (2025), two types of increases in the mesh size of the data triangle are introduced:
- Preserving calendar periods;
- Preserving development periods;
but considerations of space limit the present paper to just the latter (Section 4). The former type of change of mesh may be considered in a separate paper.
The first question requiring consideration concerns whether the assumptions underpinning the Mack chain ladder in relation to the original data set continue to hold when specific development periods are merged. Proposition 4 finds that they do so under some mild regularity conditions.
Section 4.2 studies the changes in variance of the estimated loss reserve when the mesh size is changed, first when development periods are merged (Section 4.2.1) and then when accident periods are merged (Section 4.2.2). The consideration of changes in both of these dimensions is necessary as one may wish to compare the situations in, say, a triangle that uses quarterly units of time and its derivative triangle.
Section 4.2.1 considers the case in which just two consecutive development periods are merged, but all other development periods and all accident periods are left unchanged.
Here, results similar, but not identical, to those in Taylor (2025) are found. The variances of loss reserves are unchanged except in the case of a single specific accident period.
It is no longer the case that greater data granularity is guaranteed to reduce the variance of the estimated loss reserve. Whether or not a reduction occurs depends on the data points observed. The reason for this difference is essentially that, in the case of the Mack model, the standard deviation of the estimated loss reserve with respect to a particular accident period is proportional to the cumulative claim payments to date, whereas this is not so for the EDF chain ladder.
A reduction in variance does in fact occur if a defined condition on the data of the specific accident period subject to change is satisfied. The condition is that the ratio of the last observed age-to-age factor for the accident period to its expected value falls below a threshold whose value is defined in Proposition 8. It takes the form of a closed-form function of the data and the chain ladder parameters (age-to-age factors).
It is noteworthy that this threshold value always exceeds unity. Hence, an observed value of the ratio in question below unity is sufficient to indicate a reduction in the variance of the loss reserve estimate (Corollary 1). When the ratio exceeds unity, the occurrence or otherwise of variance depends on the threshold value, which is a matter of numerical evaluation.
A numerical example is given in Section 4.2.1, where it is found that the condition for variance reduction is comfortably satisfied for each possible merger of two consecutive development periods. There is plenty of scope for further numerical investigation, but the ease with which the necessary condition is met in this one example prompts a conjecture that counter-examples might sometimes require a certain degree of contrivance and that greater granularity would frequently, if not usually, lead to a reduction in variance of estimated loss reserve in practical cases.
All of these results hitherto have concerned the simple case in which two consecutive development periods are merged. However, Remark 2 points out that it is a simple matter to extrapolate to parallel results for an arbitrary merger of development periods.
Section 4.2.2 considers the merger of accident periods. Again, the Mack chain ladder remains valid under mild regularity conditions. It is found that the conditions for a reduction in variance of estimated loss reserve in this case are much more complex than in Section 4.2.1 for the merger of development periods.
As in Section 4.2.1, a reduction in variance with increased granularity is guaranteed if the ratio of the last observed age-to-age factor for the accident period to its expected value falls below a threshold. The threshold value is defined in Proposition 9 and, once again, it takes the form of a closed form function of the data and the chain ladder parameters.
In this case, however, the function is much more complex and less transparent than in Section 4.2.1. Moreover, as pointed out in Corollary 2, it produces a much stricter condition on the chain ladder parameters if variance reduction is to be guaranteed.
This point is discussed in Section 4.2.2, where the reason is identified as the correlation between the forecast loss reserves associated with distinct and unmerged accident periods. Whether variance reduction is achieved depends heavily on the relative volumes of claims in the respective accident periods subject to merger.
Although the threshold values mentioned above are readily available numerically, specific analytic results appear hard to come by in this area. Section 4.2.2 contains some indicative numerical clues.
All in all, the influence of data granularity on the variance of forecast loss reserve differs vastly between the EDF chain ladder and the Mack chain ladder. This seems remarkable for two models that produce precisely the same forecasts using precisely the same estimation algorithm.
Funding
This research received no external funding.
Data Availability Statement
The data set used here is that of Taylor and Ashe (1983), as presented in Mack (1993).
Conflicts of Interest
The author declares no conflict of interest.
Appendix A
Appendix A.1
Proof of Proposition 2.
Equation (34) yields the following at evaluation point :
Now,
The substitution of (A2) into (A1) yields
and the proposition follows. □
Appendix A.2
Proof of Proposition 8.
The substitution of (21) into the quantity on the right side of (63) yields
For brevity, it will be convenient to temporarily replace (just for the present proof) by and by , whereupon (A4) becomes
where use has been made of (27) and (37).
Now substitute (36) for to obtain
Substitute (66) into (A6), divide through by , and then substitute (64) into the result to obtain
where is defined by (70).
The right side is merely a quadratic in , and its zeros are
Note that, since the square root here is strictly positive, one of these zeros is , and the other is . It was shown in (67) that , and so this second zero is strictly negative. This zero is meaningless as cannot be negative. The first zero is meaningful and will be denoted by .
It is evident from the signs of the coefficients in (A7) that the quadratic changes from positive to negative at , and so according to . The required result is then obtained by restoring to their original values, . □
Appendix A.3
Proof of Lemma 1.
As seen in the preamble to the lemma, when the available data set is , the chain ladder model is applied to , and expected age-to-age and variance factors are unchanged relative to the model applied to :
Moreover,
The quantity is represented by (32) to (35), adapted to the new data set , specifically,
where is given by (80) to (82).
To prove (83), note that
For brevity, in the following, and will be used to denote and , respectively. By (29),
Now
and so
where the final expression follows from (42).
Substitution of (84) into (A15) reduces the latter to
Finally, the substitution of (A13) and (A16) into (A12) yields the stated result of the lemma after the translation of back to their original meanings. □
Appendix A.4
Proof of Lemma 2.
The translation of (85) into the “ notation” introduced in the lemma is
As mentioned in the preamble to the lemma, all and terms are adjusted so that only and appear.
C terms
By (31) and (37),
By (31), (84) and (A18),
V terms
Consider the term , defined by (80) to (82). By (35), (81), and (84),
Further, by comparison of (82) with (4),
where are defined by (64), (70), and (87), respectively.
By (A20) and (A21),
The term of (A17) involving can be dealt with similarly by the decomposition of and separate treatment of its two components.
where
and substitution of this in (A23) yields
Now re-express (A22) and (A25) in terms of . This can be achieved by re-writing (36) in the form
This converts (A22) and (A25) to the following forms:
The three members on the right side of (A17) can now be compiled one by one. By (A18), (A19) and (A27),
By (A19) and (A28),
The difference between these two quantities is
Finally, including the remaining member from the right side of (A17) yields
□
Appendix A.5
Proof of Proposition 9.
The establishment of the necessary and sufficient condition stated in the proposition requires the evaluation of the zeros of the quantity (86) in Lemma 2, as a function of , or, equivalently, the quantity
This is a quadratic in , so coefficients of different powers of this variable are collected, with the following result.
This may be abbreviated to
where is defined by (90).
The zeros of this quadratic are obtained by the same process as in (A8), with modification for multiplier . Note the negative sign of the constant term in (A35), which implies one positive and one negative zero. Only the positive zero is of interest. This completes the proof of the proposition. □
Appendix A.6
Proof of Corollary 2.
The condition to be proved is equivalent to .
By (89),
where is a sufficient condition for .
There are two cases to be considered.
Case I:
. In this case,
Case II:
. In this case, (89) becomes
Thus, is a sufficient condition for in general.
Therefore, the remainder of this proof will address the requirement that . The quantity on the left side of this inequality may be expressed as
Thus
Now the penultimate term here is greater than unity, and the final term is unity or less when (93) holds, and then the totality of the right side of (A40) is strictly positive. This completes the proof. □
Appendix B
Table A1.
Data set of cumulative claim payments.
Table A1.
Data set of cumulative claim payments.
| Accident Year | Cumulative Claim Payments to End of Development Year | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
| 1 | 357,848 | 1,124,788 | 1,735,330 | 2,218,270 | 2,745,596 | 3,319,994 | 3,466,336 | 3,606,286 | 3,833,515 | 3,901,463 |
| 2 | 352,118 | 1,236,139 | 2,170,033 | 3,353,322 | 3,799,067 | 4,120,063 | 4,647,867 | 4,914,039 | 5,339,085 | |
| 3 | 290,507 | 1,292,306 | 2,218,525 | 3,235,179 | 3,985,995 | 4,132,918 | 4,628,910 | 4,909,315 | ||
| 4 | 310,608 | 1,418,858 | 2,195,047 | 3,757,447 | 4,029,929 | 4,381,982 | 4,588,268 | |||
| 5 | 443,160 | 1,136,350 | 2,128,333 | 2,897,821 | 3,402,672 | 3,873,311 | ||||
| 6 | 396,132 | 1,333,217 | 2,180,715 | 2,985,752 | 3,691,712 | |||||
| 7 | 440,832 | 1,288,463 | 2,419,861 | 3,483,130 | ||||||
| 8 | 359,480 | 1,421,128 | 2,864,498 | |||||||
| 9 | 376,686 | 1,363,294 | ||||||||
| 10 | 344,014 | |||||||||
The age-to-age factors derived from this table are set out in Table A2.
Table A2.
Age-to-age factors.
Table A2.
Age-to-age factors.
| Accident Year | Age-to-Age Factor from Development Year | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
| 1 | 3.143 | 1.543 | 1.278 | 1.238 | 1.209 | 1.044 | 1.040 | 1.063 | 1.018 | |
| 2 | 3.511 | 1.755 | 1.545 | 1.133 | 1.084 | 1.128 | 1.057 | 1.086 | ||
| 3 | 4.448 | 1.717 | 1.458 | 1.232 | 1.037 | 1.120 | 1.061 | |||
| 4 | 4.568 | 1.547 | 1.712 | 1.073 | 1.087 | 1.047 | ||||
| 5 | 2.564 | 1.873 | 1.362 | 1.174 | 1.138 | |||||
| 6 | 3.366 | 1.636 | 1.369 | 1.236 | ||||||
| 7 | 2.923 | 1.878 | 1.439 | |||||||
| 8 | 3.953 | 2.016 | ||||||||
| 9 | 3.619 | |||||||||
| 10 | ||||||||||
References
- Gisler, Alois. 2019. The reserve uncertainties in the chain ladder model of Mack revisited. ASTIN Bulletin 49: 787–821. [Google Scholar] [CrossRef]
- Mack, Thomas. 1993. Distribution-free calculation of the standard error of chain ladder reserve estimates. ASTIN Bulletin 23: 213–25. [Google Scholar] [CrossRef]
- Taylor, Greg C., and Frank R. Ashe. 1983. Second moments of estimates of outstanding claims. Journal of Econometrics 23: 37–61. [Google Scholar] [CrossRef]
- Taylor, Gregory. 2000. Loss Reserving: An Actuarial Perspective. Boston: Kluwer Academic Publishers. [Google Scholar]
- Taylor, Gregory. 2009. The chain ladder and Tweedie distributed claims data. Variance 3: 96–104. [Google Scholar]
- Taylor, Gregory. 2025. The EDF chain ladder and data granularity. Risks 13: 65. [Google Scholar] [CrossRef]
- Taylor, Gregory Clive. 1985. Claim Reserving in Non-Life Insurance. Amsterdam: North-Holland. [Google Scholar]
- Wüthrich, Mario V., and Michael Merz. 2008. Stochastic Claims Reserving Methods in Insurance. Chichester: John Wiley & Sons Ltd. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).



