Calendar Effect and In-Sample Forecasting Applied to Mesothelioma Mortality Data

Isakson, Alex; Krummaker, Simone; Martínez-Miranda, María Dolores; Rickayzen, Ben

doi:10.3390/math9182260

¹ Bayes Business School (Formerly Cass), City, University of London, London EC1Y 8TZ, UK

² Department of Statistics and Operations Research, University of Granada, 18071 Granada, Spain

* Author to whom correspondence should be addressed.

Mathematics 2021, 9(18), 2260; https://doi.org/10.3390/math9182260

Submission received: 14 July 2021 / Revised: 6 September 2021 / Accepted: 10 September 2021 / Published: 14 September 2021

(This article belongs to the Special Issue Methodological and Applied Contributions on Stochastic Modelling and Forecasting)

Abstract

In this paper, we apply and further illustrate a recently developed extended continuous chain ladder model to forecast mesothelioma deaths. Making such a forecast has always been a challenge for insurance companies as exposure is difficult or impossible to measure, and the latency of the disease usually lasts several decades. While we compare three approaches to this problem, we show that the extended continuous chain ladder model is a promising benchmark candidate for asbestosis mortality forecasting due to its flexible and simple forecasting strategy. Furthermore, we demonstrate how the model can be used to provide an update for the forecast of the number of deaths due to mesothelioma in Great Britain using recent Health and Safety Executive (HSE) data.

Keywords:

continuous chain ladder; age-period-cohort model; backfitting; density estimation; kernel smoothing

JEL Classification:

C14; C53

1. Introduction

1.1. Motivation

Since the 1960s, mesothelioma or asbestos-related cancer has gained worldwide interest as a result of its increasing incidence, related medico-legal issues and poor prognosis. Mesothelioma is mainly caused by occupational exposure to asbestos fibres in sectors such as mining, road, railway and general construction as well as shipyards, etc., which have a mainly male workforce [1]. A brief, or even indirect, exposure to a small dose of asbestos fibres might be enough to trigger the disease much later in life [2]. For example, the latency period of mesothelioma is between 20 and 70 years with an average of around 40 years. Once the symptoms appear, it is rapidly fatal, with the majority of deaths occurring amongst those over 60 years of age [1]. In Great Britain, mesothelioma mortality has been steadily increasing in recent years, with 2101 deaths recorded in 2016 and a trend for the average age of deaths to slowly increase over time. Notifications of mesothelioma claims have exhibited a stable but increasing trend so far [3].

Asbestos-related claims have a lasting impact on the global insurance industry. The industry is paying, on average, 1.9 billion for mesothelioma claims annually (2013–2017, [4]) under policies that covered, for example, employer or product liability at the time of exposure. This has resulted in multiple insurer insolvencies since the 1940s. Still, today, it is difficult to project the industry’s ultimate loss exposure due to advances in treatment, increasing life expectancies, changes in litigation and the number of new claimants emerging. A core uncertainty for the insurance industry is, therefore, whether the amounts of technical reserves set aside to cover future claims are sufficient. The UK insurance market estimates are based on population mesothelioma deaths projected by the Health and Safety Executive (HSE), which is the UK independent regulator with respect to health and safety in the workplace.

With respect to mesothelioma mortality forecasting, there are two key questions for insurers to answer: (i) when the numbers of deaths are going to peak (and establishing the peak value); and (ii) how the deaths will develop after the peak, that is, the shape of the forecasts. As insurers have to set aside reserves for future claims payments, the ultimate claim amounts and shape of how the number of deaths due to mesothelioma will reduce over time are of critical importance. If the shape is incorrectly forecasted, reserves might be overestimated or underestimated.

1.2. Literature Review

For decades, mortality forecasting has been an important tool for decision making in many fields such as actuarial science, economics, epidemiology and demography, to name only a few. With such great interest in the topic, numerous mortality forecasting approaches have been developed. The literature on mortality forecasting approaches is large, and we do not aim to provide a full review here (see for example [5] for a recent description). The first step in understanding and forecasting mortality patterns is to construct a model describing observed death counts or mortality rates, across age groups or within cohorts. The Lee–Carter model [6] is the current benchmark in mortality studies used by government agencies and pension funds. The model assumes that the dynamics of the logarithms of the central death rates are driven by an age specific constant plus the speed of change at each age multiplied by an overall time trend of mortality rates. The model has many extensions in the literature, which provide improved estimation procedures [7], a relaxation of assumptions [8] and adjustments to the model [9,10,11], among others.

Standard mortality approaches rest on dose–response analyses where both death counts and exposure are available. However, this is not the case in mesothelioma mortality forecasting, where the number of people who have been exposed to asbestos is unknown. There are two possible approaches to this problem. The first one is to construct a synthetic measure for exposure and to use a dose–response model. The second approach is to model only the observed number of deaths. The first approach has been used in the UK by the Health and Safety Executive (HSE), using a birth-cohort model [12,13,14], which assumes that the risk of mesothelioma depends on age and years of exposure, and that an individual’s asbestos exposure is related to the year of exposure. However, a key problem with this approach is that the affected individuals could have been silently exposed to asbestos over prolonged periods of time so that there is no reliable measure for the exposure to asbestos. Therefore, the estimates include a high degree of uncertainty, and regular adjustments are necessary.

A discussion on modelling mortality with synthetic exposure can be found in [15]. Other approaches involving synthetic exposure include [16], who describes an application of the Lee–Carter model to forecast mesothelioma mortality in Argentina, while [17] considered Generalised Interactive Linear Models for Italian data, and [18] used Generalised Additive Models for Brazil. A different and simpler approach to model the observed number of deaths has been proposed by [15,19]. It does not rely on exposure measures and, therefore, avoids the difficulties associated with extrapolating them into the future. The approach is inspired by the so-called “chain ladder” method introduced by [20] in actuarial science. This is a technique used to calculate the liabilities in the form of outstanding claims faced by an insurance company. For an overview of the classical chain ladder method, see [21]. While the method was introduced as a deterministic algorithm, it has been shown that it consists of an age-period-cohort (APC) model estimated using Poisson regression [22].

APC models have been studied for a long time. Refer to [23] for a general review of age-period-cohort models, and [24] for a recent review and comparison for cancer studies. They are primarily descriptive tools for data in Lexis diagrams when there are non-trivial age, period and cohort effects. A key issue in APC models is the overparameterisation induced by the relation

age = period - cohort

. The consequence of this relationship is that, without further assumptions, APC models are not uniquely identifiable, i.e., the model has infinitely many fits with infinitely many interpretations [25]. The modelling challenges that come with this problem have been well formulated by [26,27] in discrete time and by [28] in continuous time. When facing overparametrisation, there are two choices, either to work with the over-parametrised model, using for instance an ad hoc identification of the parameters in order to specify them uniquely, or to use a unique and well-defined parametrisation based on a maximal invariant (in generalized linear models, this is the canonical parameter). Following the second approach, [22] proposed second differences in order to uniquely parametrise APC models. While the canonical parametrisation has important theoretical advantages, it might feel less intuitive to many researchers and applied analysts. Therefore, we can find many examples in the literature where non-unique parametrisations are used. Some additional issues arise when looking at models with non-linear parametrisations, such as the Lee–Carter model—see [29,30] for a discussion on the identification problem in this model. Other common approaches in mortality analysis include Bayesian methods or random effects methods (e.g., [31,32]). Unfortunately the identification problem remains in these cases—see [30] for a discussion on additional issues with these methods.

Usually, the objective of a mortality study is to forecast the future mortality, and extrapolation is often used. A major challenge when using extrapolative approaches is to identify the underlying long-term mortality trend that can be extrapolated. Unfortunately, the data might not contain enough information, and a careful analysis of past mortality and its determinants is often required. Within the APC models, the period and cohort parameters are typically treated as independent time series processes, which are used to project the parameters stochastically into the future. Reference [33] combines the canonical parametrisation of [22] with standard methods for the forecasting of non-stationary time series. Reference [34] considers Box–Jenkins procedures to determine the time series processes generating the parameters. Recently, [35,36] defined the term “in-sample forecasting” to mean forecasting a structured function in regions where the function is not observed, but where it is determined by its values in the observed region. This has several advantages over methods based on time series analysis, as [37] discussed. Standard APC models can be understood as discrete density models with a simple multiplicative (or log-linear) structure. There are many examples in actuarial science and epidemiology where in-sample forecasting is possible under an age-cohort (AC) model. Reference [38] called this approach the “continuous chain ladder” because of its relationship to the chain ladder method used in non-life insurance. Our paper shows that mesothelioma mortality forecasting is one such example even when a period effect is included in the model.

1.3. Aim and Outline

The aim of our paper is to propose, apply and further illustrate in-sample forecasting, by providing a projection of future mesothelioma deaths using simple methodology. We build on the approach of [37] by using the updated dataset from the HSE to conduct a study that forecasts the number of deaths due to mesothelioma. Furthermore, we analyse the differences in forecasts of future expected mesothelioma deaths in respect of three models: the model applied in this paper, the discrete APC model of [19] and the model using synthetic exposure measures adopted by HSE [39].

Our paper makes two main contributions: First, we illustrate the method in which the approach of [37] addresses the problem of the lack of exposure data by applying the method to a real dataset that would normally pose a challenge to an insurer’s claim reserving methodology. Our study expands on the method in terms of usability and applications. While [37] developed an extended continuous chain ladder model which allows for a calendar (period) effect and they illustrated the broad applicability using two empirical examples, we are able to explore further implications of the method when applying it to the challenging problem of mesothelioma mortality forecasting. We show that the approach is very flexible by introducing a smoothing technique for the temporal effects. Second, we provide an up-to-date estimate of the future number of asbestos related deaths using this new methodology, together with the recent data released by the HSE for 1968–2016.

The remainder of the paper is organised as follows: In Section 2 we formulate the approach of [37] in the context of mortality forecasting and show how it can be applied to provide forecasts of the number of deaths due to mesothelioma. We conduct this in three parts: first, a statistical structured density model is motivated and formulated; second, the density components are estimated non-parametrically; and finally, mortality projections are made into the future. Under the formulated model, we describe how the past data can provide the density component estimates as well as a complete forecast of the future; hence, the name “in-sample forecasting” is used. Section 3 presents the mesothelioma mortality forecasting analysis, together with discussion, before the paper finishes with concluding remarks in Section 4. Further details on the theoretical approach are provided in Appendix A and Appendix B.

2. Materials and Methods

2.1. Density Model

Let X denote cohort (birth-year) of an individual and let Y denote age at death for that same individual. Thus,

X + Y

is the period or calendar year of death. Let us consider X and Y as the two main time effects so we aim to describe the number of deaths as a function of X and Y. According to these variables, past observations

\{(X_{i}, Y_{i}); i = 1, \dots, n\}

are typically supported by a trapezium. Figure 1 shows the special trapezium support of the observations in the dataset analysed later in this paper (see more details in the next section). In this context, the aim is to forecast the number of deaths which will occur in the periods beyond the most recent one where we have observations. The area with observations is a trapezium, and the area we aim to forecast is a triangle. Both areas are presented in Figure 1.

To derive forecasts for the targeted triangle, Reference [36] suggested modelling and estimating the density of deaths on the full rectangle shown in Figure 1, which is the two dimensional density f of

(X, Y)

. The problem is that, from the available observations, we can only estimate the density from the data contained within the trapezium. The solution to the problem comes from imposing a suitable multiplicative structure for the density. A simple structured density model is as follows:

f (x, y) = f_{1} (x) f_{2} (y),

(1)

where

f_{1}

represents the cohort density, and

f_{2}

denotes the age at death density. With the help of a backfitting algorithm, Reference [36] derived estimated components

{\hat{f}}_{1}

and

{\hat{f}}_{2}

from the data in the trapezium and used them to forecast the density on the target triangle. This can be performed since the data provide information for all cohorts and ages involved in the forecasting region.

Model (1) is, however, too simple in many respects. From a mathematical perspective it means that variables X and Y are independent. In the context of this paper, this means that mesothelioma deaths only depend on birth cohort and age of individuals. Reference [36] extended the simple model described above by including a third component

f_{3}

in the multiplicative structure, corresponding to the period or calendar time, that is, the variable

X + Y

. In this paper, we used such an extended model and described the density f of the observations with the following multiplicative structure:

f (x, y) = f_{1} (x) f_{2} (y) f_{3} (x + y),

(2)

where

f_{3}

represents the calendar or period effect on deaths. This model can be described as being a continuous APC model, while model (1) is a continuous AC model.

To simplify the notation in these sections we normalize

X_{i}

and

Y_{i}

to take values in the unit interval

[0, 1]

(see left panel of Figure 2) and assume that the observations

(X_{i}, Y_{i})

are given on the trapezium

I = \{(x, y) \in {[0, 1]}^{2} : c \leq x + y \leq 1\}

for some

c > 0

, where

[c, 1]

is the interval of calendar times where we have observations. The forecast region corresponds to the triangle

I^{fc} = \{(x, y) \in {[0, 1]}^{2} : x + y > 1\}

. The assumed trapezium and the forecast region are shown in the right panel of Figure 2.

A key observation at this point is that model (2) is not identified, which means that the functions

f_{1}

,

f_{2}

and

f_{3}

are not uniquely determined. Identification of this model relates to the identification of the commonly used age-period-cohort models (see [22,30]). On a logarithmic scale, the model becomes

log f (x, y) = log f_{1} (x) + log f_{2} (y) + log f_{3} (x + y)

, which can be rewritten as

log f (x, y) = log g_{1} (x) + log g_{2} (y) + log g_{3} (x + y)

, where the following is the case:

\begin{matrix} log g_{1} (x) & = - a_{1} - b x + log f_{1} (x) \\ log g_{2} (y) & = - a_{2} - b y + log f_{2} (y) \\ log g_{3} (x + y) & = a_{1} + a_{2} + b (x + y) + log f_{3} (x + y), \end{matrix}

with three arbitrary real-valued constants

a_{1}

,

a_{2}

and b. This means that the density components (on a logarithmic scale) can only be determined up to two linear trends. To overcome this problem we imposed the following three identification constraints:

\begin{matrix} \int_{0}^{1} f_{1} (x) d x & = 1 \end{matrix}

(3)

\begin{matrix} \int_{0}^{1} f_{2} (y) d y & = 1 \end{matrix}

(4)

\begin{matrix} f_{3} (z) & = constant for all z \in [1 - κ, 1], \end{matrix}

(5)

for an appropriate parameter

0 \leq κ \leq 1

, which must be determined later on. Notice that setting

κ = 1

means that model (2) becomes the simple continuous AC model (1). In this sense,

κ

represents the distance from a (continuous) APC model to an AC submodel (the distance being bigger for small values of

κ

).

Although the conditions (3)–(5) might seem very restrictive, they can always be fulfilled for smooth functions. Conditions (3) and (4) ensure that

f_{1}

and

f_{2}

can be interpreted as proper densities, and they can easily be achieved just by rescaling. Thus, the only assumption to be ensured is that

f_{3}

is constant in the near past (5), which is justified from smoothness considerations. Assuming that

log f_{3} (z)

is differentiable at

z = 1

, it can be approximated by a linear function in that region, i.e.,

log f_{3} (z) = a + b z + χ (z)

for

z \in [1 - κ, 1]

, with

κ

being small, where

χ (z)

is approximately zero, and a and b are constants. Notice that this interval might be very small at worst, but it could also be the entire interval

[0, 1]

. Now we can move the linear trend

a + b z

to

log (f_{1})

and

log (f_{2})

, and condition (5) is fulfilled. This is illustrated in Appendix A with a simple example.
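The lack of identification discussed above can be checked numerically. The following Python sketch is only an illustration: the components f1, f2, f3 and the constants a1, a2 and b are made-up assumptions, not quantities estimated from the data. It confirms that the transformed components g1, g2 and g3 reproduce the same two-dimensional density.

```python
import numpy as np

# Hypothetical smooth components on [0, 1] (illustrative assumptions only).
def f1(x): return np.exp(-2.0 * (x - 0.4) ** 2)
def f2(y): return np.exp(-3.0 * (y - 0.7) ** 2)
def f3(z): return 1.0 + 0.5 * z

# Arbitrary constants that shift level and slope between the components.
a1, a2, b = 0.3, -1.2, 0.8

def g1(x): return np.exp(-a1 - b * x) * f1(x)
def g2(y): return np.exp(-a2 - b * y) * f2(y)
def g3(z): return np.exp(a1 + a2 + b * z) * f3(z)

x, y = np.meshgrid(np.linspace(0, 1, 50), np.linspace(0, 1, 50), indexing="ij")
f = f1(x) * f2(y) * f3(x + y)
g = g1(x) * g2(y) * g3(x + y)

# The two products agree up to floating-point rounding.
max_abs_diff = float(np.max(np.abs(f - g)))
print(max_abs_diff)
```

Any choice of a1, a2 and b gives the same product, which is why constraints such as (3)–(5) are needed before the individual components can be interpreted.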

2.2. Data

We consider data provided by the UK Health and Safety Executive (HSE) that consist of annual aggregated counts of deaths caused by exposure to asbestos in Great Britain. The original data are given by age and calendar year of death between 1968 and 2016 (all given in years). We consider only data corresponding to males with ages between 25 and 89, which gives an array with dimensions 65 (age levels) by 49 (calendar years). The observed total number of deaths (sample size) is n = 49,750. The authors of [19] analysed the same data, but only up to calendar year 2013, using a discrete APC model. They showed that the main variables to consider in relation to death due to mesothelioma are the cohort and the age (see also [15] for similar conclusions from data up to 2007). Figure 3 shows the data according to these effects and visualizes the special trapezium support available for estimation.
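As a rough illustration of this layout, the following Python sketch indexes cells by cohort and age, using cohort = period - age, and counts the cells in the observed band and in the forecast triangle. It assumes single-year age and period cells; the variable names are hypothetical and no actual death counts are used.

```python
import numpy as np

ages = np.arange(25, 90)            # 65 age levels (25 to 89)
periods = np.arange(1968, 2017)     # 49 calendar years (1968 to 2016)

# Cohort (birth year) = period - age, so cohorts run from 1879 to 1991.
cohorts = np.arange(periods[0] - ages[-1], periods[-1] - ages[0] + 1)

c, a = np.meshgrid(cohorts, ages, indexing="ij")
observed = (c + a >= periods[0]) & (c + a <= periods[-1])   # observed support
forecast = c + a > periods[-1]                               # future calendar years

n_observed = int(observed.sum())    # 65 * 49 = 3185 cells
n_forecast = int(forecast.sum())    # 2080 cells in the forecast triangle
print(len(cohorts), n_observed, n_forecast)
```

In cohort and age coordinates the observed cells form a diagonal band, and the forecast region is the triangle of cells whose calendar year lies beyond 2016.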

As discussed in Section 1.2, the only actual data available for forecasting mesothelioma deaths are the observed numbers of deaths (by age and period of death), while the number of people at risk (exposure) is not known. The risk set consists of those who have survived relative to the time of exposure and who have then been exposed. The long latency period of mesothelioma, and its rapid fatal end once discovered, contributes to the problem of finding reliable measures of exposure, as well as data on mortality from competing risks. Many researchers in this area, including the UK Health and Safety Executive, chose to estimate the exposure and used it in conjunction with the actual data. However, constructing these estimates is nontrivial, and they include a high degree of uncertainty; moreover, regular adjustments are necessary when these measures of exposure are extrapolated [39]. Our approach, which follows that of [15], is to avoid having to estimate the exposure. Instead, we only need to model the observed number of deaths. In fact, in our specific case where we are adopting a continuous approach, we modelled the density of deaths. This suffices since our objective is to forecast aggregated mortality, and its simplicity is an advantage in forecasting.

2.3. Estimation

The density components

f_{1}

,

f_{2}

and

f_{3}

in model (2) are the building blocks of any information about mortality in

I

. If the density

f (x, y)

is known, all sorts of information could be extracted, such as the number of future deaths in

I^{fc}

. Let us denote this number as

D_{I^{fc}}

. An estimate can be calculated through the following expression:

\begin{matrix} D_{I^{fc}} = τ \int_{I^{fc}} f (x, y) d x d y, \end{matrix}

where

τ

is the number of all deaths in the full rectangle in Figure 3. Notice that

τ

satisfies the relation

τ \int_{I} f (x, y) d x d y = D_{I}

, where

D_{I}

is the number of observed deaths in

I

. This implies that

τ

can be estimated, in practice, given an estimate

\hat{f} (x, y)

of

f (x, y)

evaluated on

I

just by computing

\hat{τ} = D_{I} {(\int_{I} \hat{f} (x, y) d x d y)}^{- 1}

. Then, the forecast

D_{I^{fc}}

is obtained by

{\hat{D}}_{I^{fc}} = \hat{τ} \int_{I^{fc}} {\hat{f}}^{fc} (x, y) d x d y

, given a forecast

{\hat{f}}^{fc} (x, y)

of

f (x, y)

on

I^{fc}

. Thus, the problem of forecasting the future number of deaths reduces to a density estimation and forecasting problem.
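The scaling argument above can be sketched on a grid. In the following Python illustration the density estimate, the value of c and the grid size are hypothetical assumptions; only the observed count 49,750 is taken from Section 2.2. The function name forecast_deaths is also an assumption made for this sketch.

```python
import numpy as np

def forecast_deaths(f_hat, in_I, in_fc, D_I, h):
    """Scale a gridded density estimate so that it reproduces the observed
    count D_I on the trapezium, then integrate it over the forecast triangle."""
    tau_hat = D_I / (f_hat[in_I].sum() * h * h)      # estimate of tau
    return tau_hat * f_hat[in_fc].sum() * h * h      # forecast number of deaths

m = 200
h = 1.0 / m
grid = (np.arange(m) + 0.5) * h                      # cell midpoints in [0, 1]
x, y = np.meshgrid(grid, grid, indexing="ij")

c = 0.3                                              # hypothetical start of the observed periods
in_I = (x + y >= c) & (x + y <= 1.0)                 # observed trapezium
in_fc = x + y > 1.0                                  # forecast triangle

# Hypothetical density estimate, needed only up to a multiplicative constant.
f_hat = np.exp(-4.0 * (x - 0.3) ** 2) * np.exp(-6.0 * (y - 0.6) ** 2)

D_I = 49_750                                         # observed deaths (Section 2.2)
D_fc_hat = forecast_deaths(f_hat, in_I, in_fc, D_I, h)
print(D_fc_hat)
```

Because the estimate of τ is a ratio, the forecast does not change if the density estimate is multiplied by a constant, so the estimate does not have to integrate to one over the full rectangle.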

Next, we describe how to estimate the density components

f_{1}

,

f_{2}

and

f_{3}

from a data sample

\{(X_{i}, Y_{i}) : i = 1, \dots, n\}

. Consider the following notation: Let

S = \{(x, y) \in I : x \leq 1 - δ, y \leq 1 - δ\}

, with a small

δ > 0

, be a subset of

I

where we have sufficiently many data points for estimation. Define the following:

\begin{matrix} J_{1} (y) & = \{x \in [0, 1] : (x, y) \in S\} \\ J_{2} (x) & = \{y \in [0, 1] : (x, y) \in S\} \\ J_{3} (z) & = \{x \in [0, 1] : (x, z - x) \in S\}, \end{matrix}

and the corresponding integrals of f over these sets:

\begin{matrix} f_{w, 1} (x) & = \int_{J_{2} (x)} f (x, y) d y \\ f_{w, 2} (y) & = \int_{J_{1} (y)} f (x, y) d x \\ f_{w, 3} (z) & = \int_{J_{3} (z)} f (x, z - x) d x . \end{matrix}

Under model (2), the density components

f_{1}

,

f_{2}

and

f_{3}

fulfill the following integral equations:

\begin{matrix} f_{1} (x) & = \frac{f_{w, 1} (x)}{\int_{J_{2} (x)} f_{2} (y) f_{3} (x + y) d y} \end{matrix}

(6)

\begin{matrix} f_{2} (y) & = \frac{f_{w, 2} (y)}{\int_{J_{1} (y)} f_{1} (x) f_{3} (x + y) d x} \end{matrix}

(7)

\begin{matrix} f_{3} (z) & = \frac{𝟙_{[0, 1 - κ)} (z) f_{w, 3} (z)}{\int_{J_{3} (z)} f_{1} (x) f_{2} (z - x) d x} + \frac{𝟙_{[1 - κ, 1]} (z) \int_{1 - κ}^{1} f_{w, 3} (v) d v}{\int_{1 - κ}^{1} \int_{J_{3} (v)} f_{1} (x) f_{2} (v - x) d x d v}, \end{matrix}

(8)

where

𝟙_{A} (x)

is the indicator function defined by

𝟙_{A} (x) = 1

if

x \in A

and

𝟙_{A} (x) = 0

otherwise.

In practice, the exact solutions of the above integral equations are unknown because f is unknown. To estimate

f_{1}

,

f_{2}

and

f_{3}

, we formulated empirical integral equations by substituting f in (6)–(8) by a suitable estimator. We considered the local linear density estimator of [40] with a bandwidth vector

(b_{1}, b_{2})

, which is defined in Appendix B. With this choice, the empirical integral equations are defined as follows:

\begin{matrix} {\hat{f}}_{1} (x) & = {\hat{ϕ}}_{1} \frac{{\hat{f}}_{w, 1} (x)}{\int_{J_{2} (x)} {\hat{f}}_{2} (y) {\hat{f}}_{3} (x + y) d y} \end{matrix}

(9)

\begin{matrix} {\hat{f}}_{2} (y) & = {\hat{ϕ}}_{2} \frac{{\hat{f}}_{w, 2} (y)}{\int_{J_{1} (y)} {\hat{f}}_{1} (x) {\hat{f}}_{3} (x + y) d x} \end{matrix}

(10)

\begin{matrix} {\hat{f}}_{3} (z) & = {\hat{ϕ}}_{3} \frac{𝟙_{[0, 1 - κ)} (z) {\hat{f}}_{w, 3} (z)}{\int_{J_{3} (z)} {\hat{f}}_{1} (x) {\hat{f}}_{2} (z - x) d x} + {\hat{ϕ}}_{3} \frac{𝟙_{[1 - κ, 1]} (z) \int_{1 - κ}^{1} {\hat{f}}_{w, 3} (v) d v}{\int_{1 - κ}^{1} \int_{J_{3} (v)} {\hat{f}}_{1} (x) {\hat{f}}_{2} (v - x) d x d v} \end{matrix}

(11)

under the following constraints:

\int_{S} {\hat{f}}_{1} (x) d x = 1, \int_{S} {\hat{f}}_{2} (y) d y = 1 and \int_{S} {\hat{f}}_{1} (x) {\hat{f}}_{2} (y) {\hat{f}}_{3} (x + y) d x d y = \hat{ϑ},

(12)

where

\hat{ϑ} = n^{- 1} \sum_{i = 1}^{n} 𝟙 ((X_{i}, Y_{i}) \in S)

is an estimator of

ϑ = \int_{S} f (x, y) d x d y

. The coefficients

{\hat{ϕ}}_{j}

(

j = 1, 2, 3

) in (9)–(11) are chosen such that the constraints in (12) are satisfied. Here, we have used the notation

{\hat{f}}_{w, l}

for the estimator of

f_{w, l}

above (

l = 1, 2, 3

), which has been obtained by replacing f with

\hat{f}

.

Reference [37] proved the existence of a unique solution for the above empirical equations (see also [41] for related theoretical tools); however, the solution cannot be explicitly obtained, and a backfitting algorithm is required in practice to derive estimates

{\hat{f}}_{1}

,

{\hat{f}}_{2}

and

{\hat{f}}_{3}

. The algorithm can be written as follows.

Step 0.

Let

{\hat{f}}_{1}^{[0]}

and

{\hat{f}}_{2}^{[0]}

be starting values for estimating

f_{1}

and

f_{2}

, which satisfy the first two constraints in (12). Calculate the following:

{\tilde{f}}_{3}^{[0]} (z) = \{\begin{matrix} \frac{{\hat{f}}_{w, 3} (z)}{\int_{J_{3} (z)} {\hat{f}}_{1}^{[0]} (x) {\hat{f}}_{2}^{[0]} (z - x) d x} & for z \in [0, 1 - κ) \\ \frac{\int_{1 - κ}^{1} {\hat{f}}_{w, 3} (v) d v}{\int_{1 - κ}^{1} \int_{J_{3} (v)} {\hat{f}}_{1}^{[0]} (x) {\hat{f}}_{2}^{[0]} (v - x) d x d v} & for z \in [1 - κ, 1] \end{matrix}

and set

{\hat{f}}_{3}^{[0]} (z) = {\hat{ϕ}}_{3}^{[0]} {\tilde{f}}_{3}^{[0]} (z)

, where

{\hat{ϕ}}_{3}^{[0]}

is chosen such that the third constraint in (12) is satisfied.

Step r.

Let

{\hat{f}}_{1}^{[r - 1]}

,

{\hat{f}}_{2}^{[r - 1]}

and

{\hat{f}}_{3}^{[r - 1]}

be the backfitting estimates from the previous iteration step. Compute updates as follows.

(a): Calculate the following:

${\tilde{f}}_{1}^{[r]} (x) = \frac{{\hat{f}}_{w, 1} (x)}{\int_{J_{2} (x)} {\hat{f}}_{2}^{[r - 1]} (y) {\hat{f}}_{3}^{[r - 1]} (x + y) d y}$

and set ${\hat{f}}_{1}^{[r]} (x) = {\hat{ϕ}}_{1}^{[r]} {\tilde{f}}_{1}^{[r]} (x)$ , where ${\hat{ϕ}}_{1}^{[r]}$ is chosen such that the first constraint of (12) is fulfilled.
(b): Calculate the following:

${\tilde{f}}_{2}^{[r]} (y) = \frac{{\hat{f}}_{w, 2} (y)}{\int_{J_{1} (y)} {\hat{f}}_{1}^{[r]} (x) {\hat{f}}_{3}^{[r - 1]} (x + y) d x}$

and set ${\hat{f}}_{2}^{[r]} (y) = {\hat{ϕ}}_{2}^{[r]} {\tilde{f}}_{2}^{[r]} (y)$ , where ${\hat{ϕ}}_{2}^{[r]}$ is chosen such that the second constraint of (12) is satisfied.
(c): Compute ${\hat{f}}_{3}^{[r]}$ analogous to ${\hat{f}}_{3}^{[0]}$ in Step 0.

Step r in the algorithm is iterated (

r = 2, 3, \dots

) until convergence. As a convergence criterion, we evaluate the change

| {\hat{f}}_{i}^{[r]} (x) - {\hat{f}}_{i}^{[r - 1]} (x) | / max {\hat{f}}_{i}^{[r]} (x)

and stop when it is smaller than a tiny constant

c > 0

, for all

i = 1, 2, 3

. In our empirical analyses, we used

c = 10^{- 7}

with a maximum number of iterations of 1000.
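The steps above can be sketched on a grid. The following Python code is a simplified illustration rather than the implementation used in the paper: it replaces the integrals by sums over grid cells, it takes any positive gridded pilot estimate in place of the local linear estimator of Appendix B, and it expresses κ as a number of grid periods. The names backfit, f_hat, S and n_recent are assumptions made for this sketch.

```python
import numpy as np

def backfit(f_hat, S, n_recent, f1=None, f2=None, tol=1e-7, max_iter=1000):
    """Grid version of the backfitting steps for f(x, y) = f1(x) f2(y) f3(x + y).

    f_hat    : (m, m) pilot density estimate on cell midpoints, positive on S
    S        : (m, m) boolean mask; every row and column must intersect S
    n_recent : number (>= 1) of latest observed periods where f3 is held constant
    """
    m = f_hat.shape[0]
    h = 1.0 / m
    i, j = np.indices((m, m))
    k = i + j                                    # period index of cell (i, j)
    k_obs = np.unique(k[S])                      # sorted observed period indices
    recent, early = k_obs[-n_recent:], k_obs[:-n_recent]
    F = np.where(S, f_hat, 0.0)
    fw1, fw2 = F.sum(axis=1) * h, F.sum(axis=0) * h
    fw3 = np.bincount(k.ravel(), weights=F.ravel(), minlength=2 * m - 1) * h
    theta = F.sum() * h * h                      # mass of the pilot estimate on S

    def update_f3(f1, f2):
        prod = np.where(S, np.outer(f1, f2), 0.0)
        den = np.bincount(k.ravel(), weights=prod.ravel(), minlength=2 * m - 1) * h
        f3 = np.zeros(2 * m - 1)                 # stays zero at unobserved periods
        f3[early] = fw3[early] / den[early]
        f3[recent] = fw3[recent].sum() / den[recent].sum()
        mass = (prod * f3[k]).sum() * h * h
        return f3 * theta / mass                 # third constraint in (12)

    def normalise(g):
        return g / (g.sum() * h)                 # first two constraints in (12)

    f1 = normalise(np.ones(m) if f1 is None else np.asarray(f1, float))
    f2 = normalise(np.ones(m) if f2 is None else np.asarray(f2, float))
    f3 = update_f3(f1, f2)                       # Step 0
    for _ in range(max_iter):                    # Step r
        old = (f1, f2, f3)
        f1 = normalise(fw1 / (np.where(S, f2[None, :] * f3[k], 0.0).sum(axis=1) * h))
        f2 = normalise(fw2 / (np.where(S, f1[:, None] * f3[k], 0.0).sum(axis=0) * h))
        f3 = update_f3(f1, f2)
        change = max(np.max(np.abs(new - prev)) / np.max(new)
                     for new, prev in zip((f1, f2, f3), old))
        if change < tol:                         # convergence criterion
            break
    return f1, f2, f3
```

In this sketch the returned f3 is indexed by the period index i + j and is only meaningful at observed periods. If the pilot estimate is exactly multiplicative, with f3 constant on the recent window, the true components are a fixed point of the iteration.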

Notice that from the above algorithm we obtain estimates for

f_{1} (x)

and

f_{2} (y)

for

x, y \in [0, 1]

; that is, for all observed cohorts and ages. However, we only obtain estimates of

f_{3} (x + y)

for

(x, y)

in the trapezium

I

, that is, for past calendar times

x + y \leq 1

. Thus, to derive the required forecasts, we will need to extrapolate

f_{3}

. Among other aspects, this issue makes the model (2) more challenging than the simpler model (1). A convenient extrapolation of

f_{3}

is described in Section 2.4.

2.4. Forecasting

Under model (2), from the density components’ estimates that were derived in the previous section, we obtained an estimator of the density f of deaths observed in the past calendar times, i.e., in the trapezium

I

. However, the aim in mesothelioma mortality forecasting is to derive mortality projections for future calendar years, i.e., those lying in the triangle

I^{fc} = \{(x, y) \in {[0, 1]}^{2} : x + y > 1\}

shown on the right panel of Figure 2. To satisfy this aim we need to estimate the two-dimensional density f also on

I^{fc}

. Under model (2), this can be performed from the previous backfitting estimates

{\hat{f}}_{1}

and

{\hat{f}}_{2}

, along with a method of extrapolating

{\hat{f}}_{3}

to the future calendar year points (

z = x + y \in (1, 2]

). Recall that the backfitting algorithm only estimates the calendar effect density

f_{3} (z)

up to the present time point (

z = 1

).

From the identification constraints (3)–(5), we have assumed that the calendar effect

f_{3}

is constant around the present time point

z = 1

, to be more precise on the interval

[1 - κ, 1]

. The authors of [37] used this assumption to extrapolate the calendar effect as a constant into the future, that is, by setting

{\hat{f}}_{3}^{fc} (z) = {\hat{f}}_{3} (1)

for

z > 1

(recall that

z = 1

represents the most recent observed calendar time). Thus, projections of

f (x, y)

at future time points

(x, y) \in I^{fc}

are given by the following.

{\hat{f}}^{fc} (x, y) = {\hat{f}}_{1} (x) {\hat{f}}_{2} (y) {\hat{f}}_{3}^{fc} (x + y) .
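The constant extrapolation can be sketched as follows. This Python illustration assumes that the components are available on an m by m grid of cell midpoints and that the calendar effect is stored by period index; the function name forecast_density and the array layout are assumptions made for this sketch.

```python
import numpy as np

def forecast_density(f1, f2, f3_obs):
    """Forecast density on the triangle x + y > 1.

    f1, f2 : length-m arrays with the cohort and age components
    f3_obs : length-m array with the calendar effect at period indices
             0, ..., m - 1 (the last entry corresponds to z = 1)
    Returns an (m, m) array that is NaN outside the forecast triangle.
    """
    f1, f2, f3_obs = (np.asarray(v, float) for v in (f1, f2, f3_obs))
    m = len(f1)
    i, j = np.indices((m, m))
    future = (i + j) > (m - 1)             # cells beyond the latest observed period
    f3_fc = f3_obs[-1]                     # constant extrapolation: f3(z) = f3(1) for z > 1
    out = np.full((m, m), np.nan)
    out[future] = (np.outer(f1, f2) * f3_fc)[future]
    return out
```

For example, with m = 5, f1 = (1, 2, 3, 4, 5), f2 constant at 2 and a last observed calendar effect of 5, the forecast at the latest cohort and oldest age is 5 × 2 × 5 = 50.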

In contrast to other forecasting strategies, such as the

I (0)

,

I (1)

and

I (2)

forecaster of [33], the above extrapolation strategy is a natural method for describing the future based only on the past data (in-sample) and smoothness considerations. While the former forecasters extrapolate the (logarithmic) calendar effect linearly into the future, estimating the slope of such a line in three different ways, our approach eliminates the calendar effect from the model, normalizing it to have zero slope in the recent past by the imposed identification constraints.

At this point there is only one issue left: how to choose, in practice, the constant

κ

. This parameter can be interpreted as the length of the recent past which should be used to estimate and forecast the calendar effect. Therefore, it is of interest in practice to illustrate the effect of such a parameter on the forecasts. In our application to mesothelioma mortality, we perform this and conclude with a data-driven choice derived by cross-validation. The cross-validation method estimates

κ

from the data as follows: pick some small

δ > 0

and define the following.

\begin{matrix} S_{δ}^{<} & = \{(x, y) \in S : x + y \leq 1 - δ\} \\ S_{δ}^{>} & = \{(x, y) \in S : x + y > 1 - δ, x \leq 1 - δ, y \leq 1 - δ\} . \end{matrix}

Let

D

be the set of data points

(X_{i}, Y_{i})

that lie in

S_{δ}^{<}

, that is,

(X_{i}, Y_{i}) \in S_{δ}^{<}

. For any

κ \in (δ, 1]

, compute the backfitting estimators

{\hat{f}}_{1}^{κ}

,

{\hat{f}}_{2}^{κ}

,

{\hat{f}}_{3}^{κ}

from the data sample

D

, as described in Section 2.3, where the set

S

is replaced by

S_{δ}^{<}

. Next, we define the following.

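{\hat{f}}_{3}^{κ} (z) = {\hat{f}}_{3}^{κ} (1 - δ) for z \in (1 - δ, 1], \quad {\hat{f}}^{κ} (x, y) = {\hat{f}}_{1}^{κ} (x) {\hat{f}}_{2}^{κ} (y) {\hat{f}}_{3}^{κ} (x + y) .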

The estimator of

κ

is defined as follows:

\hat{κ} = arg \min_{κ \in (δ, 1]} CV (κ) .

(13)

where the following is the case.

CV (κ) = \int_{S_{δ}^{>}} {{\hat{f}}^{κ} (x, y)}^{2} d x d y - \frac{2}{n} \sum_{i = 1}^{n} 1 ((X_{i}, Y_{i}) \in S_{δ}^{>}) {\hat{f}}^{κ} (X_{i}, Y_{i}) .

The justification of the above criterion comes from the fact that CV is an estimator of the Mean Integrated Squared Error (MISE) of

{\hat{f}}^{κ} (x, y)

in the set

S_{δ}^{>}

, which ideally one would minimise to choose

κ

. To observe this, we expand the MISE as follows.

\begin{matrix} MISE ({\hat{f}}^{κ} (x, y)) & = \int_{S_{δ}^{>}} {({\hat{f}}^{κ} (x, y) - f (x, y))}^{2} d x d y \\ & = \int_{S_{δ}^{>}} [ {\hat{f}}^{κ} {(x, y)}^{2} - 2 f (x, y) {\hat{f}}^{κ} (x, y) + f {(x, y)}^{2} ] d x d y . \end{matrix}

Since the last term is positive and does not depend on

κ

, we can minimise the above expression for MISE and ignore the final term. Then, as the second term depends on the unknown f, we can replace the integral in it with the simple non-parametric estimator

n^{- 1} \sum_{i = 1}^{n} 𝟙 ((X_{i}, Y_{i}) \in S_{δ}^{>}) {\hat{f}}^{κ} (X_{i}, Y_{i})

. This provides the above expression for CV

(κ)

.
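The criterion can be evaluated numerically once a fitted density is available. The sketch below is a minimal illustration and not the authors' code: it assumes that the normalized support S is the triangle with x, y ≥ 0 and x + y ≤ 1, that the fitted density is passed as a vectorized function `f_hat(x, y)`, and that the integral is approximated by a midpoint rule. The names `in_triangle`, `cv_score` and `select_kappa` are hypothetical.

```python
import numpy as np

def in_triangle(x, y):
    # Assumed normalized support S: cohort x and age y non-negative with x + y <= 1.
    return (x >= 0) & (y >= 0) & (x + y <= 1)

def cv_score(f_hat, X, Y, delta, in_support=in_triangle, m=400):
    """Approximate CV(kappa) for one fitted density f_hat (vectorized in x and y)."""
    def in_validation(x, y):
        # Indicator of the validation set S_delta^>.
        return (in_support(x, y) & (x + y > 1 - delta)
                & (x <= 1 - delta) & (y <= 1 - delta))

    # Midpoint rule on an m-by-m grid for the integral of f_hat^2 over S_delta^>.
    g = (np.arange(m) + 0.5) / m
    V, W = np.meshgrid(g, g, indexing="ij")
    integral = np.sum(f_hat(V, W) ** 2 * in_validation(V, W)) / m**2

    # Empirical term: (2/n) times the sum of f_hat over data points inside S_delta^>.
    X, Y = np.asarray(X, dtype=float), np.asarray(Y, dtype=float)
    empirical = 2.0 / len(X) * np.sum(f_hat(X, Y) * in_validation(X, Y))
    return integral - empirical

def select_kappa(scores):
    """Return the kappa with the smallest score from a dict {kappa: CV(kappa)}."""
    return min(scores, key=scores.get)

# Toy check with a constant "fit" and two data points, one of them in S_delta^>.
toy_score = cv_score(lambda x, y: np.ones_like(x), [0.5, 0.1], [0.4, 0.1], delta=0.2)
```

In practice `f_hat` would be the product of the three backfitting components fitted with a given κ on the data in the training region, and `select_kappa` would be applied to the scores over a grid of candidate κ values.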

3. Results

Using the methodology described above, we have analysed a dataset consisting of annual aggregated counts of deaths caused by exposure to asbestos in Great Britain for males aged between 25 and 89. From these data, we can update the results from the previous analysis of [19] using the additional data and applying model (2), which allows us to take into account the calendar year effect on the mesothelioma deaths.

Therefore, we assume that the two-dimensional density

f (x, y)

, with x denoting the cohort and y the age, can be written as the product of three functions: the densities corresponding to the cohort and the age effects,

f_{1}

and

f_{2}

, as well as a third function,

f_{3}

, describing the effect of the year (period) of death. In order to estimate the three density components, we have used the backfitting algorithm described in Section 2.3. For the local linear estimator

\hat{f}

of the two-dimensional density f, we have considered bandwidths

{\hat{b}}_{1} = 6

,

{\hat{b}}_{2} = 4.2

years. The bandwidths

{\hat{b}}_{1}

and

{\hat{b}}_{2}

were obtained as follows: we first computed the common cross-validated bandwidths

{\tilde{b}}_{1}

and

{\tilde{b}}_{2}

for the local linear estimator

\hat{f}

and then rescaled them by the factor

n^{- 1 / 5} / n^{- 1 / 6}

. The justification of this rescaling is based on theory. The authors of [37] proved the consistency of the backfitting estimates assuming that the component bandwidths satisfy the condition

n^{1 / 5} b_{j} \to c_{j}

for some constant

c_{j} > 0

(

j = 1, 2

), i.e., they have convergence order of

n^{- 1 / 5}

. The cross-validated estimates derived for the two-dimensional density have order

n^{- 1 / 6}

(see [40]), so rescaling the cross-validated bandwidths with the above factor provides the theoretical requirements.
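The rescaling amounts to multiplying each cross-validated bandwidth by n^{-1/5} / n^{-1/6} = n^{-1/30}. A small sketch of this arithmetic follows; the function name `rescale_bandwidth` and the input values are hypothetical and are not the values used in the analysis.

```python
def rescale_bandwidth(b_cv, n):
    """Rescale a two-dimensional cross-validated bandwidth by n^(-1/5) / n^(-1/6)."""
    return b_cv * n ** (-1 / 5) / n ** (-1 / 6)

# Hypothetical inputs: a cross-validated bandwidth of 8 years and n = 50,000 observations.
b_hat = rescale_bandwidth(8.0, 50_000)
```

Because the exponent is only -1/30, the factor shrinks slowly with n, so the rescaled bandwidth stays of the same order of magnitude as the cross-validated one for realistic sample sizes.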

The estimated density components are shown in Figure 4. The graphs display the estimates produced by the backfitting algorithm for different values of the parameter

κ

. Recall that this parameter defines the length of the most recent time interval where the calendar effect function

f_{3}

is constant. It can be also interpreted as the length of the recent past, which is used to estimate the calendar effect. For an easier interpretation here, we provide this parameter in number of years so the value

κ = 49

corresponds to a constant calendar effect over the whole period from 1968 to 2016,

κ = 10

corresponds to a constant calendar effect in the last 10 years, i.e., from 2007 to 2016, and non-constant from 1968 to 2006; in general,

κ = p

corresponds to a constant calendar effect in the last p years, i.e., from

2016 - p + 1

to 2016, and non-constant from 1968 to

2016 - p

. This can be seen in the last graph of Figure 4 where the bigger

κ

s correspond to nearly constant estimates of

f_{3}

, while smaller values allow for general shapes. The estimates of

f_{1}

and

f_{2}

also vary slightly with the value of

κ

. Notice that the shapes of the densities do not allow us to observe the variations in more detail in the graphs.
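The correspondence between κ (in years) and the calendar years with a constant period effect can be written as a small helper. This is an illustrative sketch; the function name `constant_window` is hypothetical, and the default years are the observation period 1968 to 2016 stated above.

```python
def constant_window(kappa_years, first_year=1968, last_year=2016):
    """Return (start, end) of the calendar years on which f3 is held constant."""
    n_years = last_year - first_year + 1
    if not 1 <= kappa_years <= n_years:
        raise ValueError(f"kappa must be between 1 and {n_years} years")
    return last_year - kappa_years + 1, last_year

window_10 = constant_window(10)   # (2007, 2016)
window_49 = constant_window(49)   # (1968, 2016), i.e., the whole observed period
```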

Applying the forecasting method described in Section 2.4 in conjunction with the density component estimates for different

κ

’s, we have calculated the predicted total number of deaths in future years (that is, in the years following 2016). The results are shown in Figure 5 for

κ = 10, \dots, 49

, along with the observed number of past deaths. We can observe that

κ

does not have a big effect on the forecasts, which agrees with the slight variations in the density component estimates for each

κ

shown in Figure 4.

It is of interest to predict the peak value (highest number of deaths) as well as the year of peaking. From Figure 5, we can observe that the peak had already been reached in 2016, and it is confirmed for all the

κ

values considered. The peak value was 2101 observed deaths. This agrees with the provisional data for 2017 published by the HSE [39], where the number of deaths in the year 2017 had decreased to 2087.

To derive our final forecasts, we consider the cross-validation method defined above to choose the value of

κ

, see Equation (13). In order to minimize the function CV, we evaluate it on

κ = 10, \dots, 49

. The CV function is quite flat, showing slightly smaller values for the bigger

κ

’s. This seems to be in line with the stable behaviour of the density component estimates shown in Figure 4 and suggests choosing the maximum value of

κ

, which is

κ = 49

years. This value corresponds to the case of

f_{3}

being constant for every period, that is, it corresponds to a simple age-cohort model. For this choice we predicted 2063 deaths in the year 2017, which is slightly lower than the available provisional figure of 2087 for 2017 [39]. The forecast number of deaths then decreases slowly, reaching 2032 in 2020. Table 1 shows our forecasts until 2022 using the full dataset (third column), as well as using only data up to 2013 (sixth column). The available data are shown for assessing forecasts derived from data up to 2013 (second column).

For comparison purposes we have included in the table forecasts derived from the discrete approach of [19] (fourth and last columns) and the more recent HSE forecasts [39] (fifth column). The discrete approach of [19] is computed by truncating the data corresponding to the youngest cohorts, that is, from 1966 as those authors suggested. This is necessary since the approach does not perform any smoothing and cannot properly deal with sparsity in the data corresponding to those cohorts. These forecasts have been computed using the apc package [42] and are shown for years up to 2022. Our approach and the discrete approach of [19] provide similar forecasts, but we do not need to perform any arbitrary truncation in contrast to the discrete approach. Moreover, truncating cohorts from 1966 means that mortality is only projected from ages above 50, which might explain the slightly lower forecasts from the discrete model. On the other hand, HSE provides a similar forecast for calendar year 2018 but differs substantially for future periods. Figure 6 shows the differences in the shapes of forecasts being compared. HSE projects a notably faster decline of deaths than the other approaches. For reference, we have also shown in this figure the forecasts published in [19] using (truncated) data up to 2013 (the truncation in this case resulted in projections only for ages above 47 years). Moreover, we have added our forecasts with data up to 2013 (without truncation) and constant calendar effect (corresponding to the value of

κ

chosen by cross-validation).

By restricting the data for estimation up to 2013, we can assess whether our proposal would have been able to predict the peak in 2016. Figure 7 shows the peak forecasts for different values of

κ

along with the year of peak. Peak values vary with

κ

between 2032 and 2077 and the year of peaking between 2016 and 2018. The observed peak of 2101 in 2016 is, therefore, hardly predicted using data up to 2013. The same happens with the forecasts derived by [19], which predicted a peak of 2079 deaths in 2017. This seems to be natural since the statistical projections describe the expected future mortality as a smooth curve, while the actual numbers in years close to the peak are expected to fluctuate above and below due to year-on-year random variation [39].
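The comparison with the observed data can be reproduced from the values in Table 1. The sketch below transcribes the two forecast columns based on data up to 2013 and the available observations; the helper names `peak` and `mean_abs_error` are hypothetical, and the mean absolute error is only a simple summary added here for illustration (the 2017 observation is provisional).

```python
# Values transcribed from Table 1 (forecasts based on data up to 2013).
years = [2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022]
our_2013 = [2048, 2062, 2071, 2074, 2072, 2063, 2049, 2028, 2002]
apc_2013 = [2056, 2070, 2077, 2079, 2074, 2063, 2045, 2018, 1988]
observed = {2014: 2032, 2015: 2042, 2016: 2101, 2017: 2087}

def peak(years, values):
    """Return (year, value) of the largest forecast."""
    i = max(range(len(values)), key=values.__getitem__)
    return years[i], values[i]

def mean_abs_error(years, values, observed):
    """Mean absolute difference between forecasts and the available observations."""
    errors = [abs(v - observed[y]) for y, v in zip(years, values) if y in observed]
    return sum(errors) / len(errors)

our_peak = peak(years, our_2013)                      # (2017, 2074)
apc_peak = peak(years, apc_2013)                      # (2017, 2079)
our_mae = mean_abs_error(years, our_2013, observed)   # 19.75
apc_mae = mean_abs_error(years, apc_2013, observed)   # 21.0
```

Both smooth forecast curves place the peak in 2017, one year after the observed maximum of 2101 deaths in 2016, which is consistent with the year-on-year fluctuation discussed above.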

4. Discussion and Conclusions

Standard methods and common benchmarks in the literature on mortality forecasting rely on dose–response models where both deaths and exposure are observed. Such methods involve the non-trivial and risky exercise of estimating the exposure when it is not known, as happens in the case of asbestos-related mortality. Our paper demonstrates that the methodology of [37] can be another benchmark with important benefits. First, it does not require any modelling, estimating and extrapolating of the exposure when it is unknown. This makes the approach more robust compared with those that are more detailed in this regard, especially when the model for the exposure is mis-specified. Second, it is very intuitive due to its connection with the popular APC models. Third, it takes advantage of the powerful non-parametric structured models, which exhibit excellent theoretical and practical properties, compared to the standard (discrete) APC models. Finally, forecasting is entirely determined by the data, avoiding the need to use time series modelling and other more sophisticated extrapolation techniques. This further contributes to the robustness of the approach in practice.

We applied our method to actual data consisting of the number of deaths due to mesothelioma between 1968 and 2016, which we obtained from the Health and Safety Executive. From this, we have been able to produce an up-to-date forecast for the number of deaths in the future. We have also compared the results from our model to those from the discrete AC model of [19] and the model used by the Health and Safety Executive. While our forecasts, and those from the discrete age-cohort model, show very similar shapes, the HSE forecasts differ notably, showing a much faster decline in the number of deaths up to calendar year 2030. In addition, we have provided forecasts with truncated data which could be compared to data for the following years. In all cases, the forecasts were reasonable but differed depending on the constant

κ

, which is the length of the interval in the past for which the period function is set as constant. While choosing

κ

is challenging, we described a cross-validation approach for choosing it in practice.

Modelling the number of deaths by our density approach suffices when the objective is to forecast aggregated mortality. However, more detailed information might be required beyond this objective. For instance, if a further study considers the prevention of deaths due to mesothelioma, then it would be necessary to model the length of the latency period and to take into account survival from competing risks.

A limitation of previous approaches compared in this paper ([19,39]) is that the age-profile is assumed to be common for all cohorts [12]. The model considered in this paper relaxes this assumption by introducing some dependence between age and cohorts. More general structures could be assumed under a similar density approach, but with the risk of increasing the uncertainty arising out of such more general models. Therefore, further research would be required to find a good compromise between model complexity and uncertainty. Evaluating the uncertainty of the forecasts is another issue which will require further work, e.g., to derive confidence bands for the forecasts.

Author Contributions

Conceptualization, M.D.M.-M.; Data curation, M.D.M.-M.; Formal analysis, A.I. and M.D.M.-M.; Investigation, M.D.M.-M.; Methodology, M.D.M.-M.; Project administration, A.I., S.K., M.D.M.-M. and B.R.; Resources, M.D.M.-M.; Software, M.D.M.-M.; Validation, A.I. and M.D.M.-M.; Visualization, S.K. and B.R.; Writing—original draft, M.D.M.-M.; Writing—review and editing, A.I., S.K., M.D.M.-M. and B.R. All authors have contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

M.D. Martínez-Miranda gratefully acknowledges support from the Spanish Ministry of Economy and Competitiveness, through grant numbers MTM2016-76969P and PID2020-116587GB-I00, which includes support from the European Regional Development Fund (ERDF).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Health and Safety Executive (HSE) (2019). Mesothelioma statistics for Great Britain, 2019. Annual statistics available from www.hse.gov.uk/statistics/ (accessed on 14 July 2021).

Acknowledgments

The authors thank Robert J. Brooks for providing the data analysed in the paper. Moreover, discussions with Jens P. Nielsen are gratefully acknowledged. Finally, we thank two anonymous reviewers for their many valuable comments and suggestions which have helped to improve the quality of the article.

Conflicts of Interest

The authors declare no potential conflicts of interest.

Appendix A

Here, we describe how the identification constraint (5) can be fulfilled in practice by using a simple example. Recall that, under smoothness assumptions, this amounts to removing from

f_{3}

a log-linear trend in the interval

[1 - κ, 1]

so it becomes (approximately) constant.

Define the density components in model (2) as

f_{1} (x) = 1

(cohort effect),

f_{2} (y) = exp (- y)

(age effect), for

0 \leq x, y \leq 1

and

f_{3} (z) = exp (z / 3)

, for

0 \leq z \leq κ

, and

f_{3} (z) = exp (- κ / 2 + 5 z / 6)

for

1 - κ \leq z \leq 1

with

κ = 0.4

(period effect). The three components are displayed in black in Figure A1. Now, we remove the log-linear trend of

f_{3}

in the interval

[0.6, 1]

, that is,

exp (5 z / 6) = exp (5 x / 6) exp (5 y / 6)

, and allocate

exp (5 x / 6)

into

f_{1} (x)

and

exp (5 y / 6)

into

f_{2} (y)

. The resulting density components are displayed in red in the same figure. Thus, we have rewritten the density components in the model in such a manner that

f_{3}

fulfills the constraint (5). Notice that this practice can be extended to any function

f_{3}

as long as it is smooth around

z = 1

for at least a small

κ

.
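The reallocation in this example can be checked numerically. The following sketch only considers the branch of the period effect on the recent interval, as defined above with κ = 0.4; the function names are hypothetical. It verifies that moving the log-linear trend into the cohort and age components leaves the product unchanged and makes the period component constant on that interval.

```python
import math

KAPPA = 0.4
SLOPE = 5 / 6  # log-linear slope of f3 on the recent interval

def f1(x):
    return 1.0                                    # cohort effect

def f2(y):
    return math.exp(-y)                           # age effect

def f3_recent(z):
    return math.exp(-KAPPA / 2 + SLOPE * z)       # period effect on the recent interval

# Reallocate exp(SLOPE * z) = exp(SLOPE * x) * exp(SLOPE * y) into f1 and f2.
def f1_new(x):
    return f1(x) * math.exp(SLOPE * x)

def f2_new(y):
    return f2(y) * math.exp(SLOPE * y)

def f3_new(z):
    return f3_recent(z) * math.exp(-SLOPE * z)    # equals exp(-KAPPA / 2) for every z

x, y = 0.5, 0.3                                   # x + y = 0.8 lies in [0.6, 1]
before = f1(x) * f2(y) * f3_recent(x + y)
after = f1_new(x) * f2_new(y) * f3_new(x + y)
```

The same factor exp(-5 z / 6) would also multiply the earlier branch of the period effect, which keeps its non-constant shape there.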

Figure A1. Illustration of how the identification works in an example. The log-linear trend of

f_{3}

in the interval

[1 - κ, 1]

is removed and allocated into the components

f_{1}

and

f_{2}

.

Appendix B. Two-Dimensional Local Linear Density Estimator

The local linear estimator of a two-dimensional density function

f (x, y)

was introduced by [40] as follows. Let

{\tilde{f}}_{h_{1}, h_{2}} (x, y) = \frac{1}{n h_{1} h_{2}} \sum_{i = 1}^{n} K (\frac{X_{i} - x}{h_{1}}) K (\frac{Y_{i} - y}{h_{2}}) W_{i}

be a standard kernel density estimator of f (see for example [43]), where

W_{i} = 1 ((X_{i}, Y_{i}) \in S)

, K is a two-dimensional kernel function, and

(h_{1}, h_{2})

is the bandwidth vector. Consider the following minimization problem:

\begin{matrix} \hat{η} (x, y) = \arg \min_{η = (η_{0}, η_{1}, η_{2})} \lim_{h_{1}, h_{2} \to 0} \int_{S} & [{\tilde{f}}_{h_{1}, h_{2}} (v, w) - a {(v, w; x, y)}^{⊤} η (x, y)]^{2} \\ & \times K (\frac{v - x}{b_{1}}) K (\frac{w - y}{b_{2}}) d v d w, \end{matrix}

(A1)

where

a (v, w; x, y) = {(1, (v - x) / b_{1}, (w - y) / b_{2})}^{⊤}

, and let

\hat{η} = ({\hat{η}}_{0}, {\hat{η}}_{1}, {\hat{η}}_{2})

denote its solution. It can be shown [37] that the following is the case:

\hat{η} (x, y) = A {(x, y)}^{- 1} b (x, y),

where

\begin{matrix} A (x, y) & = \int_{S} a (v, w; x, y) a {(v, w; x, y)}^{⊤} b_{1}^{- 1} b_{2}^{- 1} K (\frac{v - x}{b_{1}}) K (\frac{w - y}{b_{2}}) d v d w \\ b (x, y) & = \frac{1}{n} \sum_{i = 1}^{n} a (X_{i}, Y_{i}; x, y) b_{1}^{- 1} b_{2}^{- 1} K (\frac{X_{i} - x}{b_{1}}) K (\frac{Y_{i} - y}{b_{2}}) W_{i} . \end{matrix}

The local linear estimator

\hat{f}

is defined as the first component of the vector

\hat{η}

.
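The closed form above can be sketched numerically as follows. This is a minimal illustration under several assumptions that are not taken from the paper: an Epanechnikov kernel, a support contained in the unit square and described by a hypothetical indicator function `in_support`, and a midpoint rule on an m-by-m grid to approximate the integral defining A(x, y). The name `local_linear_density` is also hypothetical.

```python
import numpy as np

def epanechnikov(u):
    # Assumed kernel choice: K(u) = 0.75 * (1 - u^2) on [-1, 1].
    return np.where(np.abs(u) <= 1.0, 0.75 * (1.0 - u**2), 0.0)

def local_linear_density(x, y, X, Y, b1, b2, in_support, m=200):
    """First component of A(x, y)^{-1} b(x, y), the local linear density estimate."""
    # Matrix A: midpoint-rule integral of a a^T times the scaled product kernel over S.
    g = (np.arange(m) + 0.5) / m
    V, W = np.meshgrid(g, g, indexing="ij")
    kern = epanechnikov((V - x) / b1) * epanechnikov((W - y) / b2) / (b1 * b2)
    kern = kern * in_support(V, W)
    a_grid = np.stack([np.ones_like(V), (V - x) / b1, (W - y) / b2])
    A = np.einsum("jvw,kvw,vw->jk", a_grid, a_grid, kern) / m**2

    # Vector b: average over the data of a times the scaled product kernel times W_i.
    kern_i = epanechnikov((X - x) / b1) * epanechnikov((Y - y) / b2) / (b1 * b2)
    kern_i = kern_i * in_support(X, Y)
    a_data = np.stack([np.ones_like(X), (X - x) / b1, (Y - y) / b2])
    b_vec = (a_data * kern_i).sum(axis=1) / len(X)

    return np.linalg.solve(A, b_vec)[0]

def unit_square(v, w):
    return (v >= 0) & (v <= 1) & (w >= 0) & (w <= 1)

# Toy data: a regular lattice, which mimics a uniform density equal to 1 on the square.
pts = (np.arange(40) + 0.5) / 40
X, Y = (arr.ravel() for arr in np.meshgrid(pts, pts, indexing="ij"))
interior = local_linear_density(0.5, 0.5, X, Y, 0.3, 0.3, unit_square)
corner = local_linear_density(0.05, 0.05, X, Y, 0.3, 0.3, unit_square)
```

On the toy lattice both estimates should be close to 1, including near the corner of the support, which illustrates the boundary correction that motivates the local linear construction.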

References

1. Selby, K. Mesothelioma Statistics. Available online: https://www.asbestos.com/mesothelioma/statistics/ (accessed on 25 August 2021).
2. O’Reilly, K.M.; Mclaughlin, A.M.; Beckett, W.S.; Sime, P.J. Asbestos-related lung disease. Am. Fam. Physician 2007, 75, 683–688.
3. UK Asbestos Working Party. Update from UK Asbestos Working Party. Available online: www.actuaries.org.uk/practice-areas/general-insurance/research-working-parties/uk-asbestos (accessed on 18 December 2020).
4. AM Best. Asbestos and Environmental Losses Continue. Available online: http://news.ambest.com/articlecontent.aspx?refnum=281133&altsrc=43 (accessed on 18 December 2020).
5. Janssen, F. Advances in mortality forecasting: Introduction. Genus 2018, 74, 21.
6. Lee, R.; Carter, L. Modeling and forecasting U.S. mortality. J. Am. Stat. Assoc. 1992, 87, 659–671.
7. Hatzopoulos, P.; Haberman, S. A parameterized approach to modeling and forecasting mortality. Insur. Math. Econ. 2009, 44, 103–123.
8. Booth, H.; Tickle, L. Mortality modelling and forecasting: A review of methods. Ann. Actuar. Sci. 2008, 3, 3–43.
9. Hyndman, R.J.; Ullah, M. Robust forecasting of mortality and fertility rates: A functional data approach. Comput. Stat. Data Anal. 2007, 51, 4942–4956.
10. Renshaw, A.; Haberman, S. A cohort-based extension to the Lee-Carter model for mortality reduction factors. Insur. Math. Econ. 2006, 38, 556–570.
11. Russolillo, M.; Giordano, G.; Haberman, S. Extending the Lee-Carter model: A three-way decomposition. Scand. Actuar. J. 2011, 2011, 97–117.
12. Hodgson, J.; McElvenny, D.; Darnton, A.; Price, M.; Peto, J. The expected burden of mesothelioma mortality in Great Britain from 2002 to 2050. Br. J. Cancer 2005, 92, 587–593.
13. Tan, E.; Warren, N. Projection of mesothelioma mortality in Great Britain. In Health and Safety Executive, Research Report; HSE Books: Norwich, UK, 2009; p. 728.
14. Tan, E.; Warren, N.; Darnton, A.J.; Hodgson, J.T. Projection of mesothelioma mortality in Britain using Bayesian methods. Br. J. Cancer 2010, 103, 430–436.
15. Miranda, M.D.M.; Nielsen, B.; Nielsen, J.P. Inference and forecasting in the age–period–cohort model with unknown exposure with an application to mesothelioma mortality. J. R. Stat. Soc. Ser. A Stat. Soc. 2015, 178, 29–55.
16. Trotta, A.; Santana, V.S.; Andreozzi, L. P010 Forecasting of Mesothelioma Mortality in Argentina, 2014–2023. Available online: http://dx.doi.org/10.1136/oemed-2016-103951.335 (accessed on 24 August 2021).
17. Oddone, E.; Bollon, J.; Nava, C.R.; Consonni, D.; Marinaccio, A.; Magnani, C.; Gasparrini, A.; Barone-Adesi, F. Effect of Asbestos Consumption on Malignant Pleural Mesothelioma in Italy: Forecasts of Mortality up to 2040. Cancers 2021, 13, 3338.
18. Algranti, E.; Saito, C.A.; Carneiro, A.P.S.; Moreira, B.; Mendonça, E.M.C.; Bussacos, M.A. The next mesothelioma wave: Mortality trends and forecast to 2030 in Brazil. Cancer Epidemiol. 2015, 39, 687–692.
19. Martínez-Miranda, M.D.; Nielsen, B.; Nielsen, J.P. Simple benchmark for mesothelioma projection for Great Britain. Occup. Environ. Med. 2016, 73, 561–563.
20. Zehnwirth, B. Probabilistic Development Factor Models with Applications to Loss Reserve Variability, Prediction Intervals and Risk Based Capital; Casualty Actuarial Society Forum: Arlington, VA, USA, 1994; Volume 2, pp. 447–606.
21. England, P.D.; Verrall, R.J. Stochastic claims reserving in general insurance. Br. Actuar. J. 2002, 8, 443–518.
22. Kuang, D.; Nielsen, B.; Nielsen, J.P. Identification of the age-period-cohort model and the extended chain-ladder model. Biometrika 2008, 95, 979–986.
23. O’Brien, R. Age-Period-Cohort Models: Approaches and Analyses with Aggregate Data; Chapman and Hall CRC Press: Boca Raton, FL, USA, 2014.
24. Smith, T.R.; Wakefield, J. A review and comparison of age-period-cohort models for cancer incidence. Stat. Sci. 2016, 31, 591–610.
25. Carstensen, B. Age–period–cohort models for the Lexis diagram. Stat. Med. 2007, 26, 3018–3045.
26. Clayton, D.; Schifflers, E. Models for temporal variation in cancer rates. I: Age–period and age–cohort models. Stat. Med. 1987, 6, 449–467.
27. Clayton, D.; Schifflers, E. Models for temporal variation in cancer rates. II: Age–period–cohort models. Stat. Med. 1987, 6, 469–481.
28. Keiding, N. Statistical inference in the Lexis diagram. Philos. Trans. R. Soc. Lond. Ser. Phys. Eng. Sci. 1990, 332, 487–509.
29. Beutner, E.A.; Reese, S.; Urbain, J.P. Identificability issues of age-period and age-period-cohort models of the Lee-Carter type. Insur. Math. Econ. 2017, 75, 117–125.
30. Nielsen, B.; Nielsen, J.P. Identification and forecasting in mortality models. Sci. World J. 2014, 2014, 347043.
31. Berzuini, C.; Clayton, D. Bayesian analysis of survival on multiple time scales. Stat. Med. 1994, 13, 823–838.
32. Yang, Y.; Land, K.C. Age–period–cohort analysis of repeated cross-section surveys: Fixed or random effects? Sociol. Methods Res. 2008, 36, 297–326.
33. Kuang, D.; Nielsen, B.; Perch Nielsen, J. Forecasting in an extended chain-ladder-type model. J. Risk Insur. 2011, 78, 345–359.
34. Hunt, A.; Blake, D. Identifiability in age/period/cohort mortality models. Ann. Actuar. Sci. 2020, 14, 500–536.
35. Lee, Y.; Mammen, E.; Nielsen, J.P.; Park, B. Asymptotics for in-sample density forecasting. Ann. Stat. 2015, 43, 620–651.
36. Mammen, E.; Miranda, M.D.M.; Nielsen, J.P. In-sample forecasting applied to reserving and mesothelioma mortality. Insur. Math. Econ. 2015, 61, 76–86.
37. Mammen, E.; Martínez-Miranda, M.D.; Nielsen, J.P.; Vogt, M. Calendar effect and in-sample forecasting. Insur. Math. Econ. 2021, 96, 31–52.
38. Miranda, M.D.M.; Nielsen, J.P.; Sperlich, S.; Verrall, R. Continuous Chain Ladder: Reformulating and generalizing a classical insurance problem. Expert Syst. Appl. 2013, 40, 5588–5603.
39. Health and Safety Executive (HSE). Mesothelioma Statistics for Great Britain. Annual Statistics. 2019. Available online: www.hse.gov.uk/statistics/ (accessed on 14 July 2021).
40. Nielsen, J.P. Multivariate boundary kernels from local linear estimation. Scand. Actuar. J. 1999, 1999, 93–95.
41. Mohammadi, B.; Shole Haghighi, A.A.; Khorshidi, M.; De la Sen, M.; Parvaneh, V. Existence of Solutions for a System of Integral Equations Using a Generalization of Darbo’s Fixed Point Theorem. Mathematics 2020, 8, 492.
42. Nielsen, B. apc: Age-Period-Cohort Analysis, R Package Version 1.4. Available online: https://cran.r-project.org/web/packages/apc/index.html (accessed on 14 July 2021).
43. Wand, M.; Jones, M. Kernel Smoothing; Chapman and Hall: London, UK, 1995.

Figure 1. Support of the observations and forecast region.

Figure 2. Normalized support of the observations (green) and corresponding forecast region (red).

Figure 3. Histogram of the mesothelioma mortality data in UK from 1968 to 2016. The numbers of deaths are shown according to the cohort (x) and age of death (y), for periods (

x + y

) between 1968 and 2016.

Figure 4. Estimated density components for the mesothelioma mortality data considering different

κ

values (given in years). The dashed black line shows the estimated density components for the

κ = 49

years chosen by cross-validation.

Figure 5. Forecasts of the annual number of deaths using different values of

κ

. The observed past numbers of deaths are indicated by dots.

Figure 6. Comparison of forecasts. The observed past numbers of deaths are indicated by dots.

Figure 7. Peak forecasts (left) and year of peak (right) for different values of

κ

. Based on data up to 2013.

Table 1. Mesothelioma mortality forecasts in the UK. Five forecasts of numbers of deaths are compared: “our-201x” corresponds to our proposal using data up to 201x and constant calendar effect; “apc-201x*” to the discrete approach of [19] using data up to 201x and truncating cohorts from 1966 (x = 3, 6); and HSE projections also using provisional data for 2017, “HSE-2017p”. Available data are shown for assessing forecasts.

Period	Data	Our-2016	apc-2016*	HSE-2017p	Our-2013	apc-2013*
2014	2032				2048	2056
2015	2042				2062	2070
2016	2101				2071	2077
2017	2087	2063	2069		2074	2079
2018		2058	2062	2068	2072	2074
2019		2048	2049	2036	2063	2063
2020		2032	2030	1994	2049	2045
2021		2010	2002	1943	2028	2018
2022		1982	1969	1885	2002	1988

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
