Abstract
The chain ladder model is defined by a set of assumptions about the claim array to which it is applied. It is, in practice, applied to claim arrays whose data relate to different frequencies, e.g., yearly, quarterly, monthly, weekly, etc. There is sometimes a tacit assumption that one can shift between these frequencies at will, and that the model will remain applicable. It is not obvious that this is the case. One needs to check whether a model whose assumptions hold for annual data will continue to hold for a quarterly (for example) representation of the same data. The present paper studies this question in the case of preservation of calendar periods, i.e., (in the example) annual calendar periods are dissected into quarters. The study covers the two most common forms of chain ladder model, namely the Tweedie chain ladder and Mack chain ladder. The conclusion is broadly, if not absolutely, negative. Certain parameter sets can indeed be found for which the chain ladder structure is maintained under a change in data frequency. However, while it may be technically possible to maintain the chain ladder model under such a change to the data, it is not possible in any reasonable, practical sense.
1. Introduction
1.1. Purpose of the Paper
The present paper is concerned with the effect of data granularity on the chain ladder (; , ; ). The limitations of this model have been pointed out by many authors, e.g., () (non-stochastic model); () (model specification error); () (parameter uncertainty); () (rapid change in exposure, changing speed of claim settlement); () (changing business mix, unusual events, trends and shocks); () (outlier observations).
None of the nominated limitations involve data granularity. Seemingly, only one contribution to the prior literature does so. This is (), who considers the difference between the conventional chain ladder, applied to discrete data, and a continuous form of the model.
The effect of data granularity on the performance of the chain ladder was investigated by (, ). Performance was defined in terms of variance of forecast loss reserve. Granularity was defined in terms of the duration of the cells of the claim array; shorter durations equated to greater granularity.
The first of the two cited papers considered the EDF chain ladder, and showed that increased granularity always reduced variance. The second paper dealt with the Mack chain ladder, and considered the special case in which reductions in granularity amounted to coalescence of development periods. The same coalescences occurred over all accident periods and, in this sense, the changes in granularity “preserved development periods”.
In this case, it was found that increased granularity “usually” reduced variance, where a precise meaning was given to the qualifier “usually”.
Both of the cited papers pointed out that changes of granularity can be defined so as to preserve either development periods or calendar periods. The former case considered by () naturally prompts a question as to how the performance of chain ladder forecasts are affected by changes of granularity that preserve calendar periods.
As discussed in the prior papers, there is one preliminary issue that requires consideration before any examination of the performance of the chain ladder under changed granularity. Suppose one commences with a claim array that satisfies chain ladder assumptions, and decreases granularity by aggregating cells in one way or another.
This will define a new (aggregated) array, and one needs to consider whether or not it still satisfies chain ladder assumptions. If it does not, one cannot (validly) apply the chain ladder, and questions about chain ladder forecast variance become meaningless.
(, ) showed that chain ladder assumptions continued to hold under aggregation when that aggregation preserved development periods. It was also shown in () that this will not occur for the special case of a Poisson chain ladder unless the model parameters satisfy quite specific constraints. If these constraints are not met, the chain ladder will no longer be applicable, and questions about change in forecast variance will not be sensible.
However, the implications of these constraints were not followed up in the earlier paper. Although those constraints were stated definitively, the statement was abstract, and its full implications not made manifest.
The purpose of the present paper is to fill this gap by examining the parameter constraints in detail, and to determine whether the restriction they impose on calendar period aggregation is a mere minor nuisance or a fundamental impediment.
In practical cases, one might (or might not) find that aggregation or disaggregation of calendar periods creates numerically small inconsistencies with the chain ladder (see e.g., Table 1 and related discussion), and that these are inconsequential. This may even be true as far as point forecasts are concerned. However, even under such a pragmatic empirical approach, it is advisable that one be fully aware of the properties of the model in use.
Table 1.
EDF chain ladder parameters.
Moreover, the further one delves into the fine detail of a model implementation, the more significant these inconsistencies might become. For example, suppose one wishes to bootstrap a chain ladder (). It will be assumed that chain ladder residuals are identically distributed, but this will not be so if the chain ladder assumption on cell mean structure is violated. Is this still inconsequential? The answer is now less clear.
Thus, while the headline results of the model might be accompanied by some robustness against small failings in the model formulation, other quantities of more delicate and sophisticated construction might be more sensitive.
1.2. Layout of the Paper
The remainder of the paper is structured as follows. Section 2 introduces notation and fundamental mathematical entities. It also defines granularity in precise terms, specifically in terms of mesh size. Section 3 introduces the two forms of chain ladder model with which the paper is concerned. These are the Tweedie and Mack models.
Section 4.1 narrows the focus of the Tweedie chain ladder to the CMVR Tweedie model from which the widely familiar chain ladder algorithm emerges. The effects of aggregation of calendar periods are considered in this context. Section 4.1.1 shows that certain of the chain ladder assumptions that hold for a given data array continue to hold when calendar periods are aggregated.
However, Section 4.1.2 finds that the cell mean algebraic structure assumed by the chain ladder is preserved under aggregation only if some constraints on the model parameters are satisfied. The sub-section proceeds to analyze in detail the extent to which these constraints compress the natural parameter space.
Section 4.2 then contains an analysis parallel to that of Section 4.1, but this time in relation to the Mack chain ladder model. Some concluding remarks appear in Section 5.
2. Notation and Mathematical Preliminaries
The notational and mathematical apparatus required here is much the same as in (, ), and the majority of it is taken from those sources.
2.1. Fundamentals
Let denote accident period and development period. Let , with mean , denote the random variable representing the amount of claim payments during development period of accident period . These quantities are usually referred to as incremental claim payments. Let denote the entire data array: . A realization of will be denoted .
A calendar period consists of all those combinations of and such that is constant. For definiteness, label the calendar if .
It will also be convenient to define cumulative claim payments. Thus define
which is the random variable representing the amount of cumulative claim payments up to the end of development period of accident period . Further, let be defined in the same way in terms of the , so that is a realization of .
A -dimensional Euclidean space will be denoted by , and the interior of its positive orthant by The unit matrix of dimension will be denoted by , the -dimensional column vector with all entries unity by , and indicates the Kronecker matrix product, defined as follows: if is an matrix with entries and a matrix, then is the matrix
2.2. Tweedie Distribution
The Tweedie family of distributions was introduced by () as a sub-family of the exponential dispersion family (“EDF”) (). A member of the Tweedie family has the pdf
for some function , and where , is a dispersion parameter, and is a parameter with that defines the shape of the distribution. It will be said that . For such ,
There are known special cases for specific values of . One of particular interest that was discussed in () is the case , which yields the over-dispersed (“ODP”) Poisson distribution.
There is a further sub-family of the Tweedie that will be of interest here. This is the constant mean-variance ratio (“CMVR”) sub-family studied by (). In relation to a given data array , the will lie within the CMVR Tweedie sub-family if and , independent of . By Equation (3), this requires that
Note that, for ODP distributed observations (), the requirement is that
This CMVR sub-family will be denoted , where is the common mean-variance ratio. Thus, . [With a slight abuse of notation, the random variable will be notated as belonging to the sub-family if its distribution does].
Theorem 2.5 and related comment in () amount to the following.
Proposition 1.
The sub-family is closed under addition of stochastically independent random variables.
2.3. Mesh Size
Section 2.1 is phrased in terms of accident, development and calendar periods, without any specification of the meaning of “period”. Often, in the literature, this unit is a year. But it need not be; it might be a quarter, month, week, or any other convenient length of time. This will be referred to as the mesh size of the data array. Changes of mesh are described in Section 2.2.1 of (), from which the following explanation is extracted.
Suppose that for some strictly positive integers . The mesh size can be changed from one unit of time to units. There will then be accident and development periods, instead of the original .
An example would be the case in which the units of time in a data set are quarters and . Here, the mesh size is changed from quarters to years, and the aggregated data set contains 10 accident years and 10 development years.
It also contains aggregated calendar periods, each consisting of unaggregated calendar periods. In the case of general , denote these by . Accident periods will also be aggregated with the same mesh size, i.e., .
The aggregated triangle will then consist of cells defined by the ordered pairs . These cells can be expressed in the more familiar accident period/development period form by defining . The cell defined by can also be defined by the ordered pair with defined as just stated.
This cell consists of all such that . Consider the last of these inequalities, which is equivalent to
The penultimate inequality may be similarly treated, so that the cell consists of all such that
where it is required that .
Note that, by reference to the constraints on in relations (6) the quantity satisfies the following:
This aggregation from single-unit to -unit periods is commonly used in commercial practice. It is illustrated in Figure 1, in which a quarterly array is collapsed to a yearly array (). The upper triangle, representing past data in a loss reserving context, is shaded yellow, the lower green. Calendar years are delineated by the red diagonals.
Figure 1.
Change in mesh size from quarterly to yearly. Yellow denotes past cells; green future. Blue and purple mark yearly cells under aggregation of calendar quarters. The construction of these cells for development year 1 is exceptional, and these are shaded orange.
Development years 5 and 6 are shaded blue and purple, respectively. Note their nature, as staggered aggregations of accident quarters. Development year 1 is also shaded orange to illustrate its exceptional nature.
As an example of the application of relations (6), consider the case Inequalities (6) become , which yields cells and , and this reconciles with the diagram.
The same application of relations (6) to the case produces non-positive values of , and these require exclusion. This explains the exceptional nature of in the diagram.
With this definition of aggregated cells, payments in these cells can be defined. Let denote claim payments made in the aggregated cell, where square backets indicate that the suffixes are to be read in the context of the enlarged mesh. Then, by the above reasoning,
where each double summation includes summands. Let denote the entire array under the increased mesh: .
Similarly, let denote cumulative claim payments to the end of aggregated cell . For accident period within , the final unaggregated development period in is , and so
where the last equality follows from Equation (1).
3. Chain Ladder Models
3.1. Tweedie Chain Ladder
Chapter 6 of () discusses the EDF chain ladder, a version of the chain ladder in which the distributions of all observations are members of the EDF. The Tweedie family of distributions was defined in Section 2.2, as was the CMVR sub-family for a given data set. Since these are sub-families of the EDF, chain ladder models may be defined on them, as special cases of the EDF chain ladder.
Of interest here is the CMVR Tweedie chain ladder, a special case in which the distributions of all observations are members of for fixed . The model is defined by the following assumptions.
Assumption 1.
For , a data array, all for fixed .
Assumption 2.
The are stochastically independent.
Assumption 3.
For all , for some parameters .
This last assumption immediately yields the following result, which will be useful later.
Proposition 2.
Under Assumption 3, , which depends on but not .
3.2. Mack Chain Ladder
The Mack chain ladder was defined by (). The same definition will be used here, and so will be repeated below.
Assumption 4 (Mack assumption (1)).
for all with .
The factor is usually referred to as an age-to-age factor or a link ratio.
Assumption 5 (Mack assumption (2)).
Different accident periods are stochastically independent, i.e.,
Assumption 6 (Mack assumption (3)).
for all with .
This is the totality of Mack’s assumptions. () added several others, which were necessary for consideration of estimation of chain ladder parameters and application of the model to the forecast of a loss reserve. However, they are not required here.
4. Change in Mesh Size Under Preservation of Calendar Periods
This section commences with a data array that satisfies one of the two sets of chain ladder assumptions. The mesh size is then increased, while preserving calendar periods in the sense of Section 2.3 to form a new (aggregated) data array , and the question of whether this array satisfies chain ladder assumptions is considered. Section 4.1 and Section 4.2 consider the CMVR Tweedie and Mack chain ladder models, respectively.
4.1. CMVR Tweedie Chain Ladder
4.1.1. Cell Distributions and Independence
Proposition 3.
For , all for the same fixed as in Proposition 1.
Proof.
Follows immediately from Assumption 1, Equation (8) and Proposition 1. □
Proposition 4.
The are stochastically independent.
Proof.
First note that the summands of the various in Equation (8) form disjoint subsets of . The proposition then follows immediately from Assumption 2. □
Thus, at least Assumptions 1 and 2, suitably modified, hold for the data array .
4.1.2. Cell Means
This sub-section considers whether Assumption 3 continues to hold for the data array .
By Equation (8) and Assumption 3,
By convention, the non-positive values of that occur as summation indexes when are ignored.
To lighten the notation, it will be useful to write . Then the lower limit of the summation over in Equation (10) may be expressed as . Now set , which reduces Equation (10) to
where
It is convenient to re-express Equation (11) in vector form:
where the upper denotes transposition and the vectors are defined as . Note that .
If an assumption parallel to T3 is to hold for , then a result parallel to Proposition 2 must also hold, i.e.,
where the equality follows from Equation (13).
Even without further analysis, this is seen to be a highly restrictive condition, as it is a constraint on the allowable model parameters. However, there is benefit in evaluating the extent of the restriction. This will be taken up later in the present sub-section.
For the moment, the following numerical example deals with a case in which Equation (14) does not hold.
Numerical example. Consider the case of . One of the implications of Equation (14) is
By definition of , this last equality is
and then , where
with similar definitions of the other vectors in Equation (14).
Now suppose that the relevant values of are as in Table 1, where the values of , calculated from Equation (12), are also displayed.
The ratio in Equation (14) can be calculated from the tabulated values, using the expressions in Equation (16), and it is found that
and so Equation (14) fails to hold for this set of parameters . In this case, if the original array satisfies the chain ladder assumptions, then its reduction under lesser granularity fails to satisfy them.
For the purpose of investigating precisely which sets of parameters satisfy requirement Equation (14), it will be convenient to re-express this requirement in the following equivalent form:
For each
Substitution of Equation (13) into this relation yields, for each ,
where
It is evident that is a real anti-symmetric matrix.
Before a systematic analysis is conducted, some properties of the and special cases of parameter sets that satisfy Equation (14) (equivalently Equation (19)) can be identified.
Proposition 5.
If for some constant , then (rank zero), and in this case Equation (19) is satisfied for any . Otherwise, .
Proof.
See Appendix B.1. □
Corollary 1.
can be of full rank only in the case .
Proposition 6.
(seasonal variation in exposures). Suppose that, in Assumption 3,
Then Equation (19) holds.
Proof.
See Appendix B.2. □
Remark 1.
Note the form of Equation (21). In the absence of , the row effect follows a cycle with period . This might be viewed as seasonal variation in risk exposure. The multiplier adds variation from one -unit accident period to another.
Equation (19) places an apparent constraints on the vectors for known matrices . However, these constraints are not all independent. Proposition 7 determines which of the constraints must be retained.
Proposition 7.
To establish Equation (18) for any given , it will be sufficient to prove that
and, similarly, requirement Equation (19) may be restricted to the following:
No smaller set of equalities than these is sufficient.
Proof.
See Appendix B.3. □
Now consider the conditions under which Equation (23) holds. For this purpose, it will be convenient to re-express it in the following equivalent form:
where is the -vector
and is the block matrix that consists of blocks with in the and block positions, thus
It is seen that Equation (24) imposes the necessary constraints on the -vector .
A key matrix in the following is
Let
Remark 2.
Although, according to Proposition 5, for each , it is quite possible, even likely in practice, that will be of full rank, i.e., .
Proposition 8.
Consider the condition Equation (19) and suppose that the values of are fixed and known. Suppose further that . Then the solutions of Equation (19) for form a set where:
- 1.
- is the -dimensional linear subspace on which for where is defined by (A9); and
- 2.
- is a smooth -dimensional sub-manifold of where
This set of solutions is smooth except possibly where the closure of intersects .
Proof.
See Appendix B.4. □
It is of interest to examine the dimensions of the sub-manifolds emerging in Proposition 8.
Remark 3 (the case r = 0).
By Proposition 5, this is the case for all . Proposition 8 then yields as the set of solutions in , i.e., there is no restriction on the vector .
Remark 4.
More generally, one might consider the proportion of dimensions available to that are lost from its natural parameter space in consequence of the constraints Equation (19). Here, natural parameter space means that which would be available to in the absence of the constraints, namely . The proportion lost is
- in respect of ;
- in respect of .
As it is possible that (Remark 2), the loss of dimensionality in can be very severe indeed.
Example 1.
Consider again the numerical example earlier in the present sub-section in which . As pointed out in Section 2.3, this can relate to the aggregation of a quarterly claim array to form a annual array. Suppose further that , whence are subsets of each of dimension 4.
This example demonstrates how, in a realistic setting, the solution set of Equation (19) may be extremely restricted. There is an almost vanishingly small proportion of cases in which the CMVR Tweedie chain ladder model remains valid after aggregation of calendar periods.
The analysis of the present sub-section has been conducted on the basis that the matrix (based on parameters ) is known and one must find a vector (based on parameters ) that satisfies Equation (19). However, it is apparent from Assumption 3 that the two and parameter sets contribute exchangeably to . This is a fact that has been noted in the previous actuarial literature.
It follows that any procedure applied to these two parameter sets will yield the same results if applied to their exchange, i.e., with . Here, it would be necessary to replace by
and replace by where
Then, for given , an aggregated claim array will satisfy CMVR Tweedie chain ladder assumptions if and only if (compare Equation (19)) satisfies
It is also necessary to replace the matrices and by and , respectively, by means of the appropriate replacements in Equations (26) and (27) and denote by .
All of the results from Proposition 5 to Example 1 follow with these replacements. It is worthwhile, in particular, providing the re-statement of the main result, Proposition 8.
Proposition 9.
(Proposition 8 re-stated for exchanged parameter sets). Consider the condition Equation (31), and suppose that the values of are fixed and known. Suppose further that . Then the solutions of Equation (31) for form a set where:
- (i)
- is the -dimensional linear subspace on which for ; and
- (ii)
- is a smooth -dimensional sub-manifold of where
This set of solutions is smooth except possibly where the closure of intersects .
Remark 5.
All above results concerning the maintenance of a chain ladder model under a change in mesh size have been expressed in terms of enlargement of mesh size (aggregation of calendar periods). However, each such result can be re-stated in an alternative and equivalent form in terms of reduction in mesh size. That is to say, if chain ladder structure is (is not) maintained on enlargement of mesh size, then it is (is not) on the inverse change (reduction) of mesh size.
4.2. Mack Chain Ladder
As in Section 4.1, aggregation of a data array to form a new array will be considered.
4.2.1. Row Independence
Proposition 10.
Different accident periods of are stochastically independent, i.e.,
Proof.
Note that the accident periods contributing to distinct aggregated periods and in Equation (9) form disjoint subsets. The stated result then follows from Assumption 5. □
Thus, at least Assumption 5, suitably modified, continues to hold for the data array
4.2.2. Cell Means
Consider whether a modified version of Assumption 4 holds for . Specifically, consider whether parameters exist such that for all with .
The cell is defined in terms of unaggregated cells as set out in Equation (11), whence
Note that
and so
where
and
a compound age-to-age factor.
Equation (35) can be expressed in the more informative form
where the terms are weights
and are subject to
It will be convenient to represent the terms from Equation (37) in vector form. Specifically, let
Moreover, let
whence Equation (37) may be re-expressed as
This may be compared with Equation (13), and one important difference may be noted. Whereas the leading vector in Equation (13) depended on only , the leading vector in Equation (44) depends on both . This will be of significance in the subsequent analysis.
Now Assumption 4 requires the existence of an age-to-age factor , independent of for each . The corresponding assumption for aggregated data would be that be independent of . The following proposition states the condition that ensures this.
Proposition 11.
Suppose that the values of are fixed and known. Suppose further that . Then, for any fixed , the vectors must be chosen from an -dimensional linear subspace of if the condition that be independent of is to be satisfied.
Proof.
See Appendix B.5. □
Remark 6.
In parallel with Propositions 8 and 9, the requirement that the Mack chain ladder remain applicable to a claim array when calendar periods are aggregated requires a substantial reduction in dimension of the natural parameter space (as defined in Remark 4) available to , namely . The proportionate reduction in dimensions is . This occurs for every column of the claim array.
Remark 7.
Just as in the case of the CMVR Tweedie chain ladder, the loss of parameter dimensionality required for maintenance of the Mack chain ladder in the case of calendar period aggregation can be severe. However, the situation is markedly worse in the case of the Mack chain ladder, as the constrained vectors are stochastic. Constraints on stochastic quantities will almost never be satisfied.
5. Discussion and Conclusions
(, ) considered the question of whether increased data granularity reduced prediction uncertainty arising from application of chain ladder models. The answer was very largely affirmative, but those papers considered only the case in which increased granularity was achieved by dissecting development periods.
A natural extension of that work consists of a similar examination of the increase in granularity achieved by the dissection of calendar periods. Before embarking on this study, one needs to check whether the chain ladder assumptions that hold for some specific data set continue to hold under a change in granularity arising from either dissection or aggregation of calendar periods.
() had already noted that maintenance of these assumptions was not automatic and, at least in the case of a Poisson chain ladder, occurred only when the model’s parameter set was subject to a set of constraints. Here, that finding has been examined in detail in relation to both the CMVR Tweedie and Mack chain ladders.
The general shape of the admissible parameter set under aggregation of calendar periods is established in Propositions 8 (CMVR Tweedie chain ladder) and 11 (Mack chain ladder). The properties of these two parameter sets are different but, in both cases, their dimensionalities are found to be very substantially less than those of the natural parameter spaces (the spaces before imposition of constraints).
In other words, the parameter constraints that are required to maintain chain ladder as a valid model are extremely strict. The consequence is that, while it may be technically possible to maintain the chain ladder model under aggregation of calendar periods, it is not possible in any reasonable, practical sense.
This conclusion is especially strong in the case of the Mack chain ladder. In this case, the constraints apply to stochastic quantities (Remark 7), and their fulfilment will depend not on the model’s parametric structure but on the happenstance of the drawing of the data set.
What does this mean for a practitioner? Consider a case in which the practitioner is pondering the question of whether to work with a quarterly triangle or collapse into yearly. Before considering the implications of the current paper, it should not be forgotten that the quarterly form of data can be expected to produce the smaller forecast error (, ).
But consider the implications of the current paper on top of this. The practitioner may well be entitled to assume that a chain ladder adequately represents either the yearly or the quarterly triangle, but not both. There will probably be little harm in selecting one or the other and working it through to broad conclusions. However, any comparative results for the alternative forms of data will be invalid.
Funding
This research received no external funding.
Data Availability Statement
The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.
Conflicts of Interest
The author declares no conflicts of interest.
Appendix A
Some Results in Differential Geometry
The following theorem will be required for the proof given in Appendix B.4.
The theorem is a generalization of the pre-image theorem. In its full generality (), it is a result in differentiable topology, but only the special case related to Euclidean space is required here.
Theorem A1 (Regular value theorem).
Let be an open connected -dimensional subset of and be differentiable with Jacobian matrix . A sufficient condition for the zero set to be a smooth -dimensional sub-manifold with for all and is that is non-empty and a regular level set, i.e., for each , has rank on a neighbourhood of .
Theorem A2 (Application to quadratic functions).
Let with and for known symmetric matrices . Let be the matrix and let . Then the zero set of is where:
- (i)
- is the -dimensional linear subspace on which for ; and
- (ii)
- is either empty or a smooth -dimensional sub-manifold of where
This zero set is smooth except possibly where the closure of intersects .
Proof.
First evaluate the Jacobian matrix of as
Stack these to obtain the Jacobian matrix of
where is as in the statement of the theorem.
Now consider the two cases according to whether is or is not equal to zero.
Case I: . This case occurs if and only if, by (A1),
or, equivalently,
The values of that satisfy this condition form a linear subspace of dimension . It is evident that, when (A3) holds,
and so the zero set of includes .
Case II: . This is the open set . Consider the rank of on this set, which is given by
From the block form of given in (A2), this is seen to be a matrix whose rows are linearly independent, and then its rank is . Substitution of this result into (A6) yields
Theorem A1 may now be applied to conclude that the zero set of on is either empty or a smooth -dimensional sub-manifold .
Full zero set of . The zero set of on is , which is smooth except possibly where the closure of intersects . □
Appendix B
Proofs of Results.
Appendix B.1
Proof of Proposition 5.
By Equation (20), each column of is a linear combination of the two vectors and , and so . Indeed, the -th column is . Suppose the two vectors are linearly independent. Then the -th column of will be zero if and only if . All columns will be zero if and only if , which contradicts their linear independence. So .
Consider whether may equal unity. This would require all columns to be scalar multiples one of another, which cannot occur if and are linearly independent. In conclusion, linear independence of these vectors implies .
If and are linearly dependent, then for some constant , and then Equation (20) immediately reduces to (rank zero). □
Appendix B.2
Proof of Proposition 6.
Let denote the vector , whence Equation (21) yields . Substitution of this in Equation (14) gives
which is independent of as required. □
Appendix B.3
Proof of Proposition 7.
Let be an acyclic undirected complete graph with a set of vertices. Since it is complete, it contains an edge between every pair of vertices, with denoting the edge between the -th and -th vertices. Let the set of edges be denoted by , so that now .
For fixed , associate the ratio , from Equation (18), with the -th vertex. Now suppose that Equation (18) is satisfied, and let the generic equality in Equation (18) be associated with the edge . The graph now represents the equalities expressed by Equation (18). The question concerns the smallest subset of these that implies all .
A spanning tree of is a sub-graph of with which is a tree, i.e., contains a unique path between any two given vertices. There may be more than one spanning tree, but each contains edges ().
It is clear that is minimal in the sense that deletion of any of its edges will eliminate the path between at least one pair of vertices. Thus, any spanning tree will identify a minimal subset of equalities in Equation (18) that imply the whole of (Equation (18). One such spanning tree is with and the typical member of this set is associated in Equation (18) with the following specific equality from Equation (18):
□
Appendix B.4
Proof of Proposition 8.
Apply Theorem A2 with and the substitutions . In this case, becomes
Note that, in view of Equation (26), each block column of contains each listed times, and so that column has the same rank as , i.e., . The block columns are all linearly independent in , and so .
Thus, Theorem A2 may be applied with the replacements . In consequence, the zero set of is where:
- (i)
- is the -dimensional linear subspace on which for ; and
- (ii)
- is either empty or a smooth -dimensional sub-manifold of where .
This zero set is smooth except possibly where the closure of intersects .
There remains the question of whether . Consider of the form
for arbitrary (positive) . This is a special case Equation (21) and so, by Proposition 6, it is a solution of Equation (19).
Now either or not. If so, then immediately . So now consider the case , whence . Then and, by (A9), (A10) and Equation (26), for each .
If each is of full rank, this cannot occur. If all are rank deficient, select any , and suppose that . Then will map dimensions of its domain to a -dimensional vector space and the remaining to zero. Thus, it is possible to select such that provided that .
It is not known whether all components of are positive as required, so make the replacement for constant . Then . Allow , and for sufficiently small , all components of will be positive.
Thus, unless for all it is always possible to find a member in the zero set, and hence , and . Finally, in the case for all , the zero set consists of the whole of , by Proposition 5. □
Appendix B.5
Proof of Proposition 11.
The vector may be expressed in the form
where , the subspace of that is orthogonal to , and are scalars.
By substitution of (A11) into Equation (44), and taking account of orthogonality,
Similarly for , and this will be the same as if and only if
Note that there is no restriction on .
Now rotate co-ordinates to a new set in which one co-ordinate axis is in the direction of and the remaining axes all lie within . By (A11), has co-ordinate in the direction. Co-ordinates in this direction are subject to the restriction (A13), which means that the dimensions that would be available to the in the absence of (A13) are reduced to just one dimension, a loss of dimensions. It follows that the total number of dimensions available to the vector is . □
References
- Berquist, James R., and Richard E. Sherman. 1977. Loss reserve adequacy testing: A comprehensive, systematic approach. Proceedings of the Casualty Actuarial Society 64: 123–84. [Google Scholar]
- Bondy, John Adrian, and Uppaluri Siva Ramachandra Murty. 2008. Graph Theory. GTM 244. Berlin and Heidelberg: Springer. [Google Scholar]
- England, Peter, and Richard Verrall. 1999. Analytic and bootstrap estimates of prediction errors in claims reserving. Insurance: Mathematics and Economics 25: 281–93. [Google Scholar]
- Hiabu, Munir. 2017. On the relationship between classical chain ladder and granular reserving. Scandinavian Actuarial Journal 2017: 708–29. [Google Scholar] [CrossRef]
- Lo, Joseph. 2011. Implementing the Mack Model. Paper presented at GIRO Conference and Exhibition, Liverpool, UK, October 11–14; London: Institute and Faculty of Actuaries. Available online: https://www.actuaries.org.uk/system/files/documents/pdf/a9-implementing-mack-model-v1-0-giro-site.pdf (accessed on 30 September 2025).
- Mack, Thomas. 1993. Distribution-free calculation of the standard error of chain ladd, Ler reserve estimates. ASTIN Bulletin 23: 213–25. [Google Scholar] [CrossRef]
- Nelder, John Ashworth, and Robert W. M. Wedderburn. 1972. Generalised linear models. Journal of the Royal Statistical Society, Series A 135: 370–84. [Google Scholar] [CrossRef]
- Sloma, Przemyslaw. 2019. Generalized Mack Chain-Ladder Model of Reserving with Robust Estimation. Variance 12: 226–48. [Google Scholar]
- Taylor, Greg. 1985. Claim Reserving in Non-Life Insurance. Amsterdam: North-Holland. [Google Scholar]
- Taylor, Greg. 2000. Loss Reserving: An Actuarial Perspective. Boston: Kluwer Academic Publishers. [Google Scholar]
- Taylor, Greg. 2021. A special Tweedie sub-family with application to loss reserving prediction error. Insurance: Mathematics and Economics 101B: 262–88. [Google Scholar]
- Taylor, Greg. 2025a. The EDF chain ladder and data granularity. Risks 13: 65. [Google Scholar]
- Taylor, Greg. 2025b. The Mack chain ladder and data granularity for preserved development periods. Risks 13: 132. [Google Scholar] [CrossRef]
- Tu, Loring W. 2010. An Introduction to Manifolds. New York: Springer. [Google Scholar]
- Tweedie, Maurice C. K. 1984. An index which distinguishes between some important exponential families. In Statistics: Applications and New Directions, Paper presented at Indian Statistical Golden Jubilee International Conference, Calcutta, 16–19 December 1981. Edited by Jayanta K. Ghosh and Jogabrata Roy. Calcutta: Indian Statistical Institute, pp. 579–604. [Google Scholar]
- Verrall, Richard J. 2000. An investigation into stochastic claims reserving models and the chain-ladder technique. Insurance: Mathematics and Economics 26: 91–99. [Google Scholar]
- Wüthrich, Mario V., and Michael Merz. 2008. Stochastic Claims Reserving Methods in Insurance. Chichester: John Wiley & Sons Ltd. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).