Next Article in Journal
Implementation of Person-Centered Care: A Feasibility Study Using the WE-CARE Roadmap
Next Article in Special Issue
Cost-Free LTC Model Incorporated into Private Pension Schemes
Previous Article in Journal
Mental Health Status of Healthcare Professionals and Students of Health Sciences Faculties in Kuwait during the COVID-19 Pandemic
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Do Different Models Induce Changes in Mortality Indicators? That Is a Key Question for Extending the Lee-Carter Model

1
Centro de Gestión de la Calidad y del Cambio, Universitat Politècnica de València, Camino de Vera s/n, E-46022 Valencia, Spain
2
Cass Business School, University of London, London EC1Y 8TZ, UK
3
Department of Statistics and Operations Research, Universitat de València, E-46100 Burjassot, Spain
4
Department of Economics, Università di Messina, 98122 Messina, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Environ. Res. Public Health 2021, 18(4), 2204; https://doi.org/10.3390/ijerph18042204
Submission received: 13 January 2021 / Revised: 8 February 2021 / Accepted: 15 February 2021 / Published: 23 February 2021
(This article belongs to the Special Issue Advances in Measuring Health and Wellbeing)

Abstract

:
The parametric model introduced by Lee and Carter in 1992 for modeling mortality rates in the USA was a seminal development in forecasting life expectancies and has been widely used since then. Different extensions of this model, using different hypotheses about the data, constraints on the parameters, and appropriate methods have led to improvements in the model’s fit to historical data and the model’s forecasting of the future. This paper’s main objective is to evaluate if differences between models are reflected in different mortality indicators’ forecasts. To this end, nine sets of indicator predictions were generated by crossing three models and three block-bootstrap samples with each of size fifty. Later the predicted mortality indicators were compared using functional ANOVA. Models and block bootstrap procedures are applied to Spanish mortality data. Results show model, block-bootstrap, and interaction effects for all mortality indicators. Although it was not our main objective, it is essential to point out that the sample effect should not be present since they must be realizations of the same population, and therefore the procedure should lead to samples that do not influence the results. Regarding significant model effect, it follows that, although the addition of terms improves the adjustment of probabilities and translates into an effect on mortality indicators, the model’s predictions must be checked in terms of their probabilities and the mortality indicators of interest.

1. Introduction

In the context of rapid recent demographic changes, such as the finding that “human senescence has been delayed by a decade” in [1], the development of new models for fitting and projecting life tables is a key major direction of research for demographers, actuaries, epidemiologists and other biomedical researchers. Even if new proposals such as [2] are emerging, different extensions of the seminal Lee-Carter model introduced by [3] are still being developed. Thus recently, [4] compared three probability models (Poisson, Negative Binomial and Binomial) based on the Generalized Linear Model (GLM) framework of the Lee-Carter model. In fact, the majority of the existing models proposed in the actuarial and demographic literature fall into an age/period/cohort framework that builds on the Lee-Carter model [5].
It is also essential to have an appropriate set of indicators for studying these phenomena, including life expectancy, the Gini index, and the modal age at death [6,7]. All of these indicators can be projected using the predictions of annual age-specific mortality probabilities, q x t , obtained from different methodologies, which are based, in this paper, on Lee-Carter models. The errors associated with these estimations can be calculated employing a block-bootstrap methodology [8], and confidence intervals can be provided.
This paper aims to evaluate whether improvement in models will be reflected in significant differences between the projections of these mortality indicators. Therefore, we consider the Lee-Carter model with one or two temporal terms, and we also consider the addition of a cohort effect. Authors such as [9,10] find there is little change in the ability to forecast life expectancy in comparison with the original Lee-Carter model. There has been extensive work on the number of components to be included in Lee-Carter type models. [11] conclude that there is no real penalty for adding extra terms and recommend using six components, using the results of [12] for providing some theoretical reasons for this conclusion. [13,14,15] are extensive studies which compared many models and countries. Consequently, using only two-period components in Lee-Carter type models could seem inadequate in the light of these empirical and theoretical results, but we aim to investigate the statistical methodology for comparing the forecasts of mortality indicators. The methodology can be extended to any forecasting results.
Our study provides two important innovations which are significant contributions to the existing literature on mortality modelling. First, we use the block-bootstrap technique of [8] to produce the confidence intervals for the mortality indicators allowing for parameter error. We extend this by testing whether the forecasts are affected using the block-bootstrap from fitting mortality models other than those used to forecast the indicator. Our second methodological innovation is the use of functional analysis techniques to detect differences between the projections of the mortality indicators. This is important because the projections of an indicator over time will consist of correlated values, which requires either the use of longitudinal data techniques or functional data methods to analyze precisely.
The paper is structured as follows. Material and methods are described in Section 2. Then, Section 2.1 gives a brief summary of Lee-Carter models and Section 2.2 discusses the indicators of mortality used in this analysis. The following Section 2.3 introduces the block-bootstrap techniques for estimating prediction intervals. Functional data analysis techniques allowing the comparison of mortality indicators are then discussed in Section 2.4. Later, in Section 3 we present results of our analysis of Spanish mortality data by means of the methods and indicators discussed previously. Finally, Section 4 draws conclusions from these results and summarises our findings.

2. Materials and Methods

Mortality is one of the demographic components that began to be studied as early as the 17th century in England, but today it presents challenges that are more relevant than ever. The main tool to study mortality is the life tables. Mortality rates are often presented in the form of life tables, giving the probabilities of dying at each age (conditional on survival to that age) for a population. There are two types of life tables: cohort and period tables. The cohort life table presents the mortality experience of a specific cohort (born in the same year), and hence it reflects the actual mortality rates experience from birth until all individuals in the cohort have died. In contrast, the period life table presents what would happen to a synthetic cohort if it experienced the mortality conditions of a particular time period throughout its entire life. Cohort tables therefore require data over many years and will only be complete on exhaustion of the cohort. Because of this, we normally use mortality indicators based on the period life table.
Crude mortality probabilities at age x and time t are typically based on the corresponding number of deaths recorded, d x t , relative to the population aged x last birthday at the start of the calendar year, i.e., the initial exposure-to-risk, E x t . Typically, it is assumed that these crude probabilities are random realisations of a smooth underlying function of age, period and birth cohort, which can be found by statistical analysis. Static life tables assume that the mortality probabilities do not change with time and so are functions of age only. In contrast, dynamic life tables are rectangular arrays of mortality probabilities, q x t , with each column in this array representing the period life table for year t (Figure 1). See [16] for a more detailed discussion of dynamic life tables.

2.1. Lee-Carter Models

The Lee-Carter model was introduced in [3] and has become the standard model for projecting dynamic life tables. The model was used with the logarithm of the annual age-specific central mortality rates, m x t , which are equivalent to the force of mortality, μ x t , under the assumption that μ x t is constant over integer ages and years. For the calculation of mortality indicators, the projected central mortality rates are converted into annual age-specific probabilities of death using standard formulae. In contrast, this paper models the annual age-specific probability of death, q x t . Therefore, this approach has the advantage that the indicators studied in this paper can be calculated directly.
In [3], least-squares estimation via singular value decomposition of the matrix of the log age-specific central mortality rates was used to fit the model. This implicitly assumed that the death counts are homoscedastic across ages and years. In contrast, the maximum-likelihood method, assuming that death counts follow the Poisson distribution, avoids this drawback [17]. However, using the Poisson model for the death counts is inconsistent with our desire to model the age-specific probabilities of death, and so instead we model death counts as binomial random variables. In addition, [18] compared Poisson models for the force of mortality and binomial models for probabilities of death for six different countries and, in most of them, the binomial models outperform the Poisson ones. However, the results using the binomial model and probabilities of death show evidence of overdispersion, since the deviance statistic is greater than the degrees of freedom in the model. Therefore, in this study, the quasi-binomial family model is used to model the probabilities of death, which overcomes this problem. To do this, we use the extended version [19,20] of the Lee-Carter predictor structure in conjunction with the logit transformation of the probability of death, q x t , i.e., we use
ln q x t 1 q x t = a x + i = 1 r b x ( i ) k t ( i ) + ϵ x t .
In our application with Spanish mortality data, we have used (Equation (1)) with r = 1 (i.e., the classical Lee-Carter structure) and r = 2 (as investigated in [20]), and consequently the corresponding models will be denoted LC1 and LC2, respectively. The estimation of the parameters for these models is carried out by means of maximum likelihood methods assuming the quasi-binomial distribution for the death counts, using the gnm library published by [21] in R [22]. A detailed description of these models and the alternative method to fit them to data can be found in [23]. Besides, the fitting procedure of a range of different extensions to the Lee-Carter model using the gnm library from [21] can be found in [18].
In addition to extending the Lee-Carter model with additional age/period terms, various authors have proposed modifying the Lee-Carter model to include the influence of the year of birth (cohort) c = t x . In [24], a model H1 is analyzed and recommended as the simplest version of the Lee-Carter model with a cohort term [25]. We use this H1 model with the logit transformation of the probability of death q x t to give,
ln q x t 1 q x t = a x + b x k t + γ t x + ϵ x t ,
which will also be referred to as H1 in this paper.
To forecast q x t , we first model k t ( i ) , i = 1 , 2 and γ t x as time series using the Box-Jenkins methodology. Separate univariate ARIMA processes are applied to the first two-period components of the LC2 model, due to the underlying assumption that k t ( 1 ) and k t ( 2 ) are independent. This is a consequence of using an orthogonal decomposition [26].
Finally, we note that many studies (for instance [10,18,27]) have found that although the introduction of a cohort term to the Lee-Carter models generally leads to an improvement in overall fit, models may become less robust when fitting it to data and cause problems when forecasting mortality rates. To overcome this, we have implemented in Section 3.1 a two stage fitting procedure, as suggested in [25]. According to [28] the two-stage approach will generally lead to more imperfect fits to the available data than a one-stage approach and are therefore not proper maximum likelihood estimates. However, the potential loss of goodness of fit of the two-stage approach may be justified if it gives a model that is more robust to changes in the data [29].

2.2. Mortality Indicators

When projecting dynamic life tables, it is important to be able to summarise the changes in mortality rates across different ages. In previous studies, such as [10,13,14,24], the key indices of interest have been mortality rates, life expectancies and discounted annuity values. The last two of these are indicators related to the typical life span, and are especially important in an actuarial context. However, for an accurate assessment of risk, measures of the dispersion of life spans are also important. In this paper, we study three period-based mortality indicators: life expectancy and modal age of death (which are both measures of typical life span in a population) and the Gini index (to measure the dispersion of life span). Each of these mortality indicators can be calculated from a dynamic period life table, as described in [6], and they are briefly described here below.
The life expectancy for individuals with age x is given by
e x t = T x t l x t ,
where l x t is the hypothetical number of people alive from the synthetic cohort experiencing mortality rates for year t at the beginning of the age interval [ x , x + 1 ) and T x t is the total number of years lived by this synthetic cohort during and after this age interval. In Section 3, we calculate life expectancies at birth and at age 65, i.e., e 0 t and e 65 t , respectively
The modal age of death is age-associated with the maximum frequency of death from the synthetic cohort experiencing mortality rates for year t. Its expression according to [30] is,
M ( t ) = { x | max [ d x t ] f o r x > 5 } .
This indicator’s choice is justified because it can reflect changes in the probability of death q x t , which are not detected with life expectancy [31].
However, life expectancy and the modal age of death are both measures of the “typical” length of life of the synthetic cohort, and so do not provide any information about whether changes in mortality apply equally to different age groups. In contrast, the Gini coefficient can be used as a measure of inequality in the length of life, as discussed in [32], as well as being the most common statistical index of diversity or inequality for other variables in the social sciences. Other indices such as the Interquartile range (IQR), which also allow the measurement of this unequal contribution, do not have some desirable basic properties for measuring inequality [32].
The Gini index is calculated using the area underneath the Lorenz curve for the distribution of time of death, which is the curve obtained when we plot the proportion of our synthetic cohort who have died before age x, f x t , given by
f x t = l 0 t l x t l 0 t = 1 l x t l 0 t ,
on the x-axis, and the cumulative proportion of the years lived by this (deceased) population compared with the total for the synthetic cohort, g x t , given by
g x t = T 0 t T x t x l x t T 0 t ,
on the y-axis. By definition, 0 f x t 1 and 0 g x t 1 , with f 0 t = g 0 t = 0 and f ω t = g ω t = 1 , where ω is the oldest age in the life table. Furthermore, the Lorentz curve is always below the diagonal f x t = g x t with equality only in the case where the entire cohort dies at one specific age. For discrete life tables, one of the most widely used approaches for estimating the Gini index is,
I G t = x = 0 ( ω 1 ) ( f x t g x t ) x = 0 ( ω 1 ) f x t ,
where ω is the last age observed.
Hence, the Gini index measures the dispersion of deaths in the synthetic cohort across the age range. It varies from 0, meaning that all individuals die at the same age, to 1, where almost the entire cohort dies at birth except one individual who dies at age ω . [7] found it to be an excellent indicator to discriminate European Union countries.

2.3. Block-Bootstrap Prediction Intervals

Forecasts of the mortality indicators in the future are highly uncertain. Therefore, we illustrate this uncertainty by calculating confidence intervals for the forecast indicators. To consider parameter error, several different bootstrapping procedures have been proposed in [33,34].
In the case where the residuals from the fitting model have dependence across two-dimensions (i.e., age, and time), the ordinary bootstrap in [34] is not valid. Therefore, we use a residual-based block-bootstrap of the fitted residuals, as proposed in [8], because this technique partially retains the underlying dependence structure observed in the residuals and hence generates more realistic resamples [35].
The models in this paper are fitted using the quasi-binomial model. Therefore, the deviance residuals from fitting the model to data are given by expression (Equation (2))
r x t = s i g n ( d x t d ^ x t ) 2 d x t log d x t d ^ x t + ( E x t d x t ) log E x t d x t E x t d ^ x t ,
where d x t is the observed and d ^ x t is the fitted number of deaths for age x and year t.
The block-bootstrap procedure starts with the rectangular array of deviance residuals, r x t , with age x in rows, and calendar time t in columns. To obtain a new artificial set of residuals, r ^ x t ( n ) , we partition an empty rectangular array, with the same dimensions as the original matrix of residuals, into smaller, non-overlapping rectangular blocks. Each block is then filled by a randomly selected block of residuals from the original array. This randomly selected block consists of all residuals in the rectangle to the southeast from a randomly selected element from the original matrix.
To select the appropriate block size, in the absence of firm theoretical guidance, [8] plot correlograms and contour maps of the original residuals and compare these with the resampled residuals. If these plots match reasonably well, this suggests that there is a similar underlying dependence structure in the resampled residuals and in the original set. In this paper, our initial guesses are based on the dependence structure observed for the contour maps of variograms or covariance function of the raw residuals. The observation of these maps allows us to identify the distance in years and age from which we can admit the independence between the residuals, and these values define the dimensions of the block. Further details can be found in [10]. These resampled residuals, r ^ x t ( n ) , are then combined with the fitted death counts, d ^ x t , from the original model, by solving
( r ^ x t ( n ) ) 2 = 2 d x t log d x t d ^ x t + ( E x t d x t ) log E x t d x t E x t d ^ x t ,
to give resampled “observations” of the death counts, the modified d x t ( n ) . These resampled death counts are then fitted using one of the three mortality models being considered, providing new estimates of the model parameters.
However, it should not be essential that the model used to fit these resampled death counts is the same as the original model used to generate the residuals and fitted mortality rates. Therefore, there are nine possible combinations of models given by the three models used to generate the residuals and fitted mortality rates used by the block-bootstrap technique. The process is repeated to give N bootstrap samples of death counts, which, in turn, provide N resampled sets of model parameters. For each of these, the k t ( i ) s and γ t x s are projected on the basis of an A R I M A model selected using the Box-Jenkins procedure. Hence, we obtain predictions for the mortality rates and the corresponding mortality indicators for desired future years allowing fully for parameter error. The 95% confidence intervals, I C 95 , for the different mortality indicators are obtained by selecting the 2.5% and 97.5% percentiles of the projected indicators for the desired future years, for example, I C 95 e 0 , 2020 = [ p 0.025 e 0 , 2020 , p 0.975 e 0 , 2020 ] represents the 95% confidence interval for period life expectancy at birth in 2020, based on the 2.5% and 97.5% percentiles of the projected distribution for this indicator.

2.4. ANOVA for Functional Data Analysis

The experimental design we have just described is a two fixed factor design: the model used for fitting, and the sample obtained from the residual used for resampling. Its structure is shown in Table 1.
It is a balanced design with the same number of repetitions in each cell, N, equal to the number of block-bootstrap samples. Each of these repetitions is 20 values, the projected mortality indicator corresponding to the years 2013 to 2032. The factor model has three categories, the Lee-Carter with one time-dependent term, LC1, the Lee-Carter with two time-dependent terms, LC2, and the Lee-Carter with one time-dependent term and one cohort term, H1. In turn, the factor sample also has three categories reflecting the origin of the fitted and residuals used in the bootstrap process, RLC1, RLC2 and RH1 according to whether they were obtained from the fitting of the original data with the LC1, LC2 or H1 model.
As mentioned at the beginning, we aim to check differences among the projections obtained with each combination of model and block-bootstrap sample. A classic ANOVA method should not be applied in this context. We have to deal with functional data, and the use of functional data analysis methods is a natural way to proceed. Several authors have dealt with the development of ANOVA techniques for these kinds of data, but not all of these methods can be used in a design with two fixed factors. Some of them address a single factor [36,37] and others address mixed-effects [38,39]. Therefore, we will resort to the method proposed by [40], which is an effective, flexible, and easy to compute technique able to deal with complicated ANOVA designs requiring no normality assumptions and as few additional hypotheses as possible. The method uses random projections to transform functional data into univariate data, solves the ANOVA problem in this simple situation, and obtains conclusions for the functional data by collecting the information from several projections. Following [40], we can decompose any functional data X i m o d , s a m ( t ) as,
X i m o d , s a m ( t ) = m ( t ) + f m o d ( t ) + g s a m ( t ) + h m o d , s a m ( t ) + ϵ i m o d , s a m ( t ) ,
where m is non-random and describes the overall shape of the projections, i = 1 , , N , and the functions f m o d , g s a m and h m o d , s a m account for the main effect and interaction of model and sample. Finally, ϵ i m o d , r e s are independent and identically distributed random trajectories centered on the mean.

3. Results

The data used in this analysis come from the life tables published by the Spanish National Institute of Statistics (INE) and are available on its web page www.ine.es (accessed on 15 May 2020). In particular, we have worked with the crude estimates of probabilities of death, q x t , obtained using the methodology proposed by [41], based on [42]. This more recent Spanish dataset is computed using micro-mortality data, based on individual dates of death, and with no smoothing procedure applied at the oldest ages. Therefore, this dataset is more accurate than those provided by the Human Mortality Database [41]. Also, this paper describes the steps that are taken, and R-packages are quoted for the sake of replicability and reproducibility.

3.1. Model Fitting

The models described in Section 2.1 (i.e., the LC1, LC2 and H1 models) have been used to fit mortality data for Spain for the period 1991–2012 and ages 0 to 99. As discussed in Section 2.1, the estimation of the parameters for these models is carried out by means of maximum likelihood methods assuming the quasi-binomial distribution for the death counts, using the gnm R-package [21].
However, the models proposed in Section 2.1 are over-parameterised, and require additional identifiability constraints in order to obtain unique estimates of the parameters. The model LC1 is usually fitted with the location constraint t k t ( 1 ) = 0 and the scale constraint b x ( 1 ) = 1 [3,27]. However, in this paper, we use the alternative constraints k t 0 ( 1 ) = 0 and b 0 ( 1 ) = 1 , because these are simple to be specified using the gnm function with the additional constrain and constrainTo inputs. In addition, we have used the residSVD function to provide initial estimates of the parameters, to speed up and ensure convergence of the fitting algorithm.
In a similar manner, the LC2 model can be fitted with the gnm library using the instances function. For consistency with the LC1 model, we apply the restrictions k t 0 ( 1 ) = 0 , b 0 ( 1 ) = 1 , k t 0 ( 2 ) = 0 and b 0 ( 2 ) = 1 .
For the H1 model, the issue of identifiability constraints becomes more difficult due to the relationship between the factors, c o h o r t = p e r i o d a g e , as noted by [25]. To overcome this, [24] propose carrying the estimation out in two stages. First, a x is fixed, in a similar manner as was originally done in [3], i.e.,
a ^ x = t ln q x t 1 q x t T .
Then, the remaining parameters can be estimated with the fixed values of a x by using the offset term in the gnm function. In this paper, for the H1 model, we further impose the constraints k t 0 ( 1 ) = 0 , b 0 ( 1 ) = 1 to identify the model and an additional constraint, b x ( 1 ) > 0 x , to ensure that mortality rates are positively correlated across the age range. Furthermore, we have set the weight of the first and last five cohorts to zero, as done in [24], to avoid estimating parameters for which we have very little data.
For the sake of brevity, we have only fitted data for males and have not reproduced our results for females. Figure 2a–c show the behaviour of the logit of the crude mortality rates according to age x, period t and cohort t x , respectively, by grouping the crude mortality rates by the factors age, period and cohort and plotting box-and-whisker diagrams them. We note that Figure 2c shows outliers for cohorts born in 1990 until 2009, which correspond to the high rates of mortality at birth. The goodness of fit of these models to the data is evaluated using the total deviance, which is shown for all three models with the corresponding number of parameters in Table 2.
Additionally, [25] suggest carrying out diagnostic checks on the fitted model by plotting residuals. Figure 3a–c show the underlying dependence structure in the deviance residuals given by expression (Equation (2)) for LC1, LC2 and H1, respectively. As can be seen in Figure 3c, the inclusion of the cohort effect for model H1 has partially eliminated the diagonal effect appearing in the other two models. However, the residuals for any of the models still show significant two-dimensional dependence.

3.2. Prediction Intervals for Mortality Indicators

As discussed in Section 2.3, the standard bootstrap of [34] is not valid in the case where the residuals show two-dimensional dependence, since it is based on simple random sampling. Instead, use the block-bootstrap technique suggested by Liu [8]. To determine the optimum block sizes, we analyse the dependence structure observed for the residuals in Figure 3a–c using a variogram [10]. These plots show large patches of positive and negative values (some of which are of very large magnitude), which is suggestive of a high degree of two-dimensional dependence and so requires the use of large block sizes.
Therefore, we use block sizes of 3 × 9 , 3 × 9 and 3 × 14 for LC1, LC2 and H1, respectively. As described in Section 2.3, we then use these resampled death counts from the bootstrapping procedure to refit the models and then forecast confidence intervals for the mortality indicators for the period 2013–2032. In general, these confidence intervals show that life expectancy at birth, residual life expectancy at age 65, and the modal age of death continue to increase, and the forecast Gini index decreases.
Table 3 shows the INE estimations and the corresponding confidence intervals for LC1 for predicted life expectancy at birth and at 65, noting that the LC1 model furnishes the highest values and the values closest to the INE ones.
Confidence intervals for forecasted Spanish life expectancy at birth for the period 2013–2032 are shown in Figure 4 where the interval corresponding to LC2 model is the widest. In addition, life expectancies for the period 2013–2019 published by INE are outside of all intervals for the first two years (2013 and 2014).
Confidence intervals for forecasted Gini index for the period 2013–2032 are shown in Figure 5 where the interval corresponding to the H1 model is the widest. Also, the H1 model shows the Gini index with greatest decrease.

3.3. ANOVA for Functional Data Analysis

We use the functional ANOVA procedure to test differences in the behavior of the mortality indicators for the nine different combinations of the model and the block-bootstrap sample. Thus, we expect a model effect, but no sample or interaction effects since the samples represent the same population regardless of the model that generated them.
Using Equation (4), we test the following null hypotheses
H 0 m o d : f L C 1 = f L C 2 = f H 1 = 0 H 0 s a m : g R L C 1 = g R L C 2 = g R H 1 = 0 H 0 m o d , s a m : h m o d e l , s a m p l e = 0 , m o d e l = { L C 1 , L C 2 , H 1 } , r e s i d u a l = { R L C 1 , R L C 2 , R H 1 } .
for each indicator, the alternative hypothesis being that some of the equalities are not true. As discussed in Section 2.4, the technique we use to do this was introduced by [40]. These authors have also developed an R-package [43], which obtains p-values by three different approaches: the Bonferroni method, the false discovery rate method and the bootstrap method. As [43] point out, the Bonferroni correction is conservative. In contrast, the bootstrap method is time consuming, and requires proofs to show that it is appropriate in the specific contexts in which it is applied to. Therefore, the we use the false discovery rate (FDR) method, proposed by [44].
Table 4 summarizes the result of the functional ANOVA obtained with the False Discovery Rate method, where NR stands for the non-rejection of the null hypothesis and R for rejection.
We have also performed multiple comparisons to establish whether there are homogeneous groups among the categories of the main factors tested above. The results of these are shown in Table 5. Figure 6 and Figure 7 help us to understand the results in Table 5. From Figure 6a, we note that LC1 model provides greater projections of the life expectancy than the other two models, and Figure 6b displays small but significant differences between the bootstrap samples. As regards the Gini index, the LC1 and LC2 models show a similar trend over time, but there is a more pronounced decrease corresponding to the H1 model (Figure 7a). For the sample factor, the lowest values of the Gini index corresponds to the RLC1 sample of residuals whose projections are significantly different from the other two (Figure 7b). The equivalent graphs for the other two indicators (life expectancy at age 65 and modal age at death) are not reproduced here for the sake of brevity. The interaction between the model and the residuals deserves particular comment.

4. Conclusions

In terms of the key indices of interest for mortality forecasting, specially in an actuarial context, refs. [10,13,14,24,45,46] considered death rates, life expectancies and discounted annuity values. This paper evaluates if differences between three different extensions of the Lee-Carter model are reflected in the forecasts of different mortality indicators. The three mortality indicators used are the life expectancy and modal age of death (which are measures of typical life span in a population) and the Gini index (which is a measure of the dispersion of life span). As far as we are aware, there are no previous studies to date on the impact of model risk on forecasting of the Gini index. To illustrate the uncertainty in our forecasts, we calculate confidence intervals for the forecast indicators using the block-bootstrap technique proposed in [8] for the fitted residuals.
To evaluate whether the differences in the forecast mortality indicators between the different mortality models are statistically significant, we have used functional ANOVA to test the two-factor design resulting from crossing models and block-bootstrap samples, as discussed in Section 2.3 and Section 2.4. In Table 4, we show a statistically significant model, sample, and interaction effects between these two factors for all of the mortality indicators under consideration. Although it was not our main objective, it is essential to point out that the sample effect should not be present since they must be realizations of the same population, and therefore the procedure should lead to samples that do not influence the results. Most authors test their models with the samples derived from their fit, leading to biased conclusions from their results.
We find that our predictions for e 0 t and e 65 t with Spanish mortality data are higher than those obtained in previous studies [6,10,23]. Also, there are significant differences in the forecasts of all of the indicators between the different models which we investigate in this study. These differences in forecasts between models is a key component of the “longevity risk”, as identified by the IMF in [47]. Larger predictions may be more realistic than those obtained previously and may represent a better response to the financial challenge that “longevity risk” implies, as noted by the IMF in a recent report [47]. Therefore, in terms of longevity risk, when different models disagree, the preferred model could be the one with the greatest predicted life expectancy, which in our case is LC1. Although the addition of further terms improves the adjustment of probabilities and translates into an effect on any derived mortality indicators, the models’ predictions must be checked in terms of their probabilities and the mortality indicators obtained.
In this study, we use the functional ANOVA to provide an objective criterion for measuring the impact of different techniques for forecasting mortality indicators. This technique is complementary to measures such as goodness-of-fit statistics, as used in [13,14,24]. Although the conclusions that we reach are based on the dataset for Spanish males, the use of the block-bootstrap and the statistical tools proposed provide a framework for investigating a wide range of mortality modeling hypotheses. However, we leave it to future work to look at other datasets to examine whether our conclusions are consistent for other populations and to draw more general conclusions about the impact of model risk on the forecasting of mortality indicators.

Author Contributions

All authors have read and agreed to the published version of the manuscript. Conceptualization, A.D. and F.M.; methodology, F.M., S.H. and E.O.; software, A.D.; validation, A.D.; formal analysis, A.D.; investigation, A.D.; writing—review and editing, A.D. and S.H.; visualization, A.D.; supervision, F.M., S.H. and E.O.; project administration, A.D.; funding acquisition, A.D.

Funding

This research was funded by “José Castillejo” Program from Ministerio de Educación grant number JC2011-0169, by Universitat Politècnica de València (PAID-00-12), and the Faculty of Business Administration.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this analysis come from the life tables published by the Spanish National Institute of Statistics (INE) and are available on its web page www.ine.es (accessed on 15 May 2020).

Acknowledgments

We appreciate the opportunity to discuss our ideas with faculty and doctoral students at Cass Business School, especially with Andrés Villegas and Andrew Hunt. The authors are indebted to the anonymous referees whose suggestions improved the original manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Vaupel, J.W. Biodemography of human ageing. Nature 2010, 464, 536–542. [Google Scholar] [CrossRef] [Green Version]
  2. Atance, D.; Balbás, A.; Navarro, E. Constructing dynamic life tables with a single-factor model. Decis. Econ. Financ. 2020, 43, 787–825. [Google Scholar] [CrossRef]
  3. Lee, R.; Carter, L. Modelling and forecasting U.S. mortality. J. Am. Stat. Assoc. 1992, 87, 659–671. [Google Scholar]
  4. Azman, S.; Pathmanathan, D. The GLM framework of the Lee-Carter model: A multi-country study. J. Appl. Stat. 2020, 1–12. [Google Scholar] [CrossRef]
  5. Villegas, A.M.; Kaishev, V.K.; Millossovich, P. StMoMo: An R Package for Stochastic Mortality Modeling. J. Stat. Softw. 2018, 84, 1–38. [Google Scholar] [CrossRef] [Green Version]
  6. Debón, A.; Martínez-Ruiz, F.; Montes, F. Temporal Evolution of Mortality Indicators. N. Am. Actuar. J. 2012, 16, 364–377. [Google Scholar] [CrossRef]
  7. Debón, A.; Chaves, L.; Haberman, S.; Villa, F. Characterization of between-group inequality of longevity in European Union countries. Insur. Math. Econ. 2017, 75, 151–165. [Google Scholar] [CrossRef] [Green Version]
  8. Liu, X.; Braun, W.J. Investigating Mortality Uncertainty Using the Block Bootstrap. J. Probab. Stat. 2010, 2010, 385–399. [Google Scholar] [CrossRef] [Green Version]
  9. Lazar, D.; Denuit, M.M. A multivariate time series approach to projected life tables. Appl. Stoch. Model. Bus. Ind. 2009, 25, 806–823. [Google Scholar] [CrossRef]
  10. Debón, A.; Martínez-Ruiz, F.; Montes, F. A geostatistical approach for dynamic life tables: The effect of mortality on remaining lifetime and annuities. Insur. Math. Econ. 2010, 47, 327–336. [Google Scholar] [CrossRef] [Green Version]
  11. Hyndman, R.J.; Booth, H. Stochastic population forecasts using functional data models for mortality, fertility and migration. Int. J. Forecast. 2008, 24, 323–342. [Google Scholar] [CrossRef]
  12. Greenshtein, E. Best subset selection, persistence in high-dimensional statistical learning and optimization under l1 constraint. Ann. Stat. 2006, 34, 2367–2386. [Google Scholar] [CrossRef] [Green Version]
  13. Booth, H.; Hyndman, R.J.; Tickle, L.; de Jong, P. Lee-Carter mortality forecasting: A multi-country comparison of variants and extensions. Demogr. Res. 2006, 15, 289–310. [Google Scholar] [CrossRef]
  14. Cairns, A.J.; Blake, D.; Dowd, K.; Coughlan, G.D.; Epstein, D.; Khalaf-Allah, M. Mortality density forecasts: An analysis of six stochastic mortality models. Insur. Math. Econ. 2011, 48, 355–367. [Google Scholar] [CrossRef] [Green Version]
  15. Shang, H.L.; Booth, H.; Hyndman, R. Point and interval forecasts of mortality rates and life expectancy: A comparison of ten principal component methods. Demogr. Res. 2011, 25, 173–214. [Google Scholar] [CrossRef] [Green Version]
  16. Pitacco, E.; Denuit, M.; Haberman, S.; Olivieri, A. Modelling Longevity Dynamics for Pensions and Annuity Business; Oxford University Press: Oxford, UK, 2009. [Google Scholar]
  17. Brouhns, N.; Denuit, M.; Vermunt, J. Measuring the longevity risk in mortality projections. Bull. Swiss Assoc. Actuar. 2002, 2, 105–130. [Google Scholar]
  18. Currie, I.D. On fitting generalized linear and non-linear models of mortality. Scand. Actuar. J. 2016, 2016, 356–383. [Google Scholar] [CrossRef]
  19. Booth, H.; Maindonald, J.; Smith, L. Applying Lee-Carter under conditions of variable mortality decline. Popul. Stud. 2002, 56, 325–336. [Google Scholar] [CrossRef] [PubMed]
  20. Renshaw, A.; Haberman, S. Lee-Carter mortality forecasting with age specific enhancement. Insur. Math. Econ. 2003, 33, 255–272. [Google Scholar] [CrossRef]
  21. Turner, H.; Firth, D. Generalized Nonlinear Models in R: An Overview of the Gnm Package, R package version 1.1-1; R Foundation for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
  22. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
  23. Debón, A.; Montes, F.; Puig, F. Modelling and Forecasting mortality in Spain. Eur. J. Oper. Res. 2008, 189, 624–637. [Google Scholar] [CrossRef] [Green Version]
  24. Haberman, S.; Renshaw, A. A comparative study of parametric mortality projection models. Insur. Math. Econ. 2011, 48, 35–55. [Google Scholar] [CrossRef] [Green Version]
  25. Renshaw, A.; Haberman, S. A cohort-based extension to the Lee-Carter model for mortality reduction factors. Insur. Math. Econ. 2006, 38, 556–570. [Google Scholar] [CrossRef]
  26. Hyndman, R.J.; Shahid Ullah, M. Robust forecasting of mortality and fertility rates: A functional data approach. Comput. Stat. Data Anal. 2007, 51, 4942–4956. [Google Scholar] [CrossRef] [Green Version]
  27. Cairns, A.J.; Blake, D.; Dowd, K.; Coughlan, G.D.; Epstein, D.; Ong, A.; Balevich, I. A quantitative comparison of stochastic mortality models using data from England and Wales and the United States. N. Am. Actuar. J. 2009, 13, 1–35. [Google Scholar] [CrossRef]
  28. Hunt, A.; Villegas, A.M. Robustness and convergence in the Lee–Carter model with cohort effects. Insur. Math. Econ. 2015, 64, 186–202. [Google Scholar] [CrossRef]
  29. Haberman, S.; Renshaw, A. On age-period-cohort parametric mortality rate projections. Insur. Math. Econ. 2009, 45, 255–270. [Google Scholar] [CrossRef] [Green Version]
  30. Canudas-Romo, V. Three measures of longevity: Time trends and record values. Demogr. 2010, 47, 299–312. [Google Scholar] [CrossRef]
  31. Canudas-Romo, V. The modal age at death and the shifting mortality hypothesis. Demogr. Res. 2008, 19, 1179–1204. [Google Scholar] [CrossRef]
  32. Shkolnikov, V.; Andreev, E.; Begun, A. Gini coefficient as a life table function: Computation from discrete data, decomposition of differences and empirical examples. Demogr. Res. 2003, 8, 305–358. [Google Scholar] [CrossRef] [Green Version]
  33. Brouhns, N.; Denuit, M.; Keilegom, I.V. Bootstrapping Poisson log-bilinear model for mortality forecasting. Scand. Actuar. J. 2005, 2005, 212–224. [Google Scholar] [CrossRef]
  34. Koissi, M.; Shapiro, A.; Högnäs, G. Evaluating and extending the Lee-Carter model for mortality forecasting confidence interval. Insur. Math. Econ. 2006, 38, 1–20. [Google Scholar] [CrossRef]
  35. Efron, B.; Tibshirani, R. An Introduction to the Bootstrap; Chapman & Hall: New York, NY, USA; London, UK, 1993. [Google Scholar]
  36. Cuevas, A.; Febrero-Bande, M.; Fraiman, R. An anova test for functional data. Comput. Stat. Data Anal. 2004, 47, 111–122. [Google Scholar] [CrossRef]
  37. Martínez-Camblor, P.; Corral, N. Repeated measures analysis for functional data. Comput. Stat. Data Anal. 2011, 55, 3244–3256. [Google Scholar] [CrossRef]
  38. Abramovich, F.; Angelini, C. Testing in mixed-effects FANOVA models. J. Stat. Plan. Inference 2006, 136, 4326–4348. [Google Scholar] [CrossRef]
  39. Antoniadis, A.; Sapatinas, T. Estimation and inference in functional mixed-effects models. Comput. Stat. Data Anal. 2007, 51, 4793–4813. [Google Scholar] [CrossRef]
  40. Cuesta-Albertos, J.; Febrero-Bande, M. A simple multiway ANOVA for functional data. TEST 2010, 19, 537–557. [Google Scholar] [CrossRef]
  41. Muriel, S.; Cantalapiedra, M.; López, F. Towards Advanced Methods for Computing Life Tables; Technical Report. Instituto Nacional de Estadística, 2010. Available online: https://www.ine.es/ss/Satellite?L=es_ES&c=INEDocTrabajo_C&cid=1259925434773&p=1254735839320&pagename=MetodologiaYEstandares%2FINELayout (accessed on 10 May 2011).
  42. Elandt-Johnson, R.; Johnson, N. Survival Models and Data Analysis; Wiley: New York, NY, USA, 1980. [Google Scholar]
  43. Febrero-Bande, M.; Oviedo de la Fuente, M. Statistical Computing in Functional Data Analysis: The R Package fda.usc. J. Stat. Softw. 2012, 51, 1–28. [Google Scholar] [CrossRef] [Green Version]
  44. Benjamini, Y.; Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 2001, 29, 1165–1188. [Google Scholar]
  45. Shang, H.; Haberman, S. Forecasting age distribution of death counts: An application to annuity pricing. Ann. Actuar. Sci. 2020, 14, 150–169. [Google Scholar] [CrossRef] [Green Version]
  46. Bozikas, A.; Pitselis, G. An empirical study on stochastic mortality modelling under the age-period-cohort framework: The case of greece with applications to insurance pricing. Risks 2018, 6, 44. [Google Scholar] [CrossRef] [Green Version]
  47. International Monetary Fund. The Financial Impact of Longevity Risk. Technical Report. 2012. Available online: https://www.imf.org/~/media/Websites/IMF/imported-flagship-issues/external/pubs/ft/GFSR/2012/01/pdf/_c4pdf.ashx (accessed on 8 May 2012).
Figure 1. Illustration of age, cohort and period dimensions.
Figure 1. Illustration of age, cohort and period dimensions.
Ijerph 18 02204 g001
Figure 2. Behaviour of the logit of the crude mortality rates. (a) for age, (b) time and (c) cohort.
Figure 2. Behaviour of the logit of the crude mortality rates. (a) for age, (b) time and (c) cohort.
Ijerph 18 02204 g002
Figure 3. Residuals for age-period for each model for males. (a) LC1, (b) LC2 and (c) H1.
Figure 3. Residuals for age-period for each model for males. (a) LC1, (b) LC2 and (c) H1.
Ijerph 18 02204 g003
Figure 4. Confidence intervals for forecasted Spanish life expectancy for the period 2013–2032. (a) LC1, (b) LC2 and (c) H1.
Figure 4. Confidence intervals for forecasted Spanish life expectancy for the period 2013–2032. (a) LC1, (b) LC2 and (c) H1.
Ijerph 18 02204 g004
Figure 5. Confidence intervals for forecasted Gini index for the period 2013–2032. (a) LC1, (b) LC2 and (c) H1.
Figure 5. Confidence intervals for forecasted Gini index for the period 2013–2032. (a) LC1, (b) LC2 and (c) H1.
Ijerph 18 02204 g005
Figure 6. Average forecasting estimations for life expectancy. (a) Model (b) Sample.
Figure 6. Average forecasting estimations for life expectancy. (a) Model (b) Sample.
Ijerph 18 02204 g006
Figure 7. Average forecasting functions for the Gini index. (a) Model (b) Sample.
Figure 7. Average forecasting functions for the Gini index. (a) Model (b) Sample.
Ijerph 18 02204 g007
Table 1. Experimental design for comparison of functional indicators.
Table 1. Experimental design for comparison of functional indicators.
Model
LC1LC2H1
LC1NNN
SampleRLC2NNN
RH1NNN
Table 2. Goodness-of-fit measures for the different models.
Table 2. Goodness-of-fit measures for the different models.
LC1LC2H1
Deviance2911.452136.052192.48
Number of parameters100 + 22 + 100 100 + 22 × 2 + 100 × 2 100 + 22 + 100 + 121
Table 3. Confidence intervals for forecasted Spanish life expectancy for the period 2013–2032 using LC1 only for men.
Table 3. Confidence intervals for forecasted Spanish life expectancy for the period 2013–2032 using LC1 only for men.
Life Expectancy at BirthLife Expectancy at 65
RLC1RLC2RH1 RLC1RLC2RH1
YearINEp 0.025 p 0.975 p 0.025 p 0.975 p 0.025 p 0.975 INEp 0.025 p 0.975 p 0.025 p 0.975 p 0.025 p 0.975
201379.9379.4679.7379.4979.6879.4979.7118.9218.5918.7618.6218.7218.6218.74
201480.1279.6479.9079.7079.8479.6579.8919.0618.7118.8618.7318.8218.7218.86
201579.9279.8780.1479.9280.0879.9080.1218.7918.8419.0118.8718.9718.8719.00
201680.3180.0680.3580.1380.2980.1080.3319.1418.9719.1419.0019.1018.9919.14
201780.3780.2780.5780.3480.5080.3180.5519.1219.1019.2719.1319.2419.1219.27
201880.4680.4780.7780.5480.7180.5180.7619.2219.2319.4119.2619.3719.2519.41
201980.8580.6680.9880.7480.9280.7280.9719.5219.3519.5519.3919.5019.3819.54
2020 80.8681.1980.9481.1280.9281.18 19.4819.6819.5219.6319.5119.67
2021 81.0581.3981.1481.3281.1281.38 19.6119.8219.6419.7719.6419.81
2022 81.2481.5981.3381.5281.3181.58 19.7319.9519.7719.9019.7619.94
2023 81.4381.7881.5381.7281.5081.77 19.8620.0919.9020.0319.8920.07
2024 81.6281.9881.7281.9181.6981.97 19.9820.2220.0220.1620.0220.20
2025 81.8082.1781.9182.1081.8882.16 20.1120.3520.1520.2920.1420.33
2026 81.9982.3782.1082.2982.0782.36 20.2320.4920.2720.4220.2720.46
2027 82.1782.5682.2882.4782.2582.54 20.3520.6220.4020.5520.3920.59
2028 82.3582.7582.4682.6682.4382.73 20.4820.7520.5220.6720.5120.72
2029 82.5282.9382.6482.8482.6182.92 20.6020.8820.6420.8020.6420.85
2030 82.7083.1182.8183.0282.7983.10 20.7221.0020.7720.9320.7620.98
2031 82.8783.2982.9983.2082.9683.28 20.8421.1320.8921.0520.8821.11
2032 83.0483.4783.1683.3783.1483.46 20.9621.2621.0121.1821.0021.23
Table 4. Rejection (R) or non-rejection (NR) of null hypothesis with functional ANOVA with FDR.
Table 4. Rejection (R) or non-rejection (NR) of null hypothesis with functional ANOVA with FDR.
ModelResidualModel × Residual
Indicator H 0 mod H 0 sam H 0 mod , sam
e 0 t RRR
e 65 t RRR
Gini indexRRR
Modal ageRRR
Table 5. Multiple comparison for Models and Samples.
Table 5. Multiple comparison for Models and Samples.
Model
f LC 1 = f LC 2 f LC 1 = f H 1 f LC 2 = f H 1
e 0 t RRR
e 65 t RRR
Gini indexRRR
Modal ageRRR
Sample
g RLC 1 = g RLC 2 g RLC 1 = g RH 1 g RLC 2 = g RH 1
e 0 t RRR
e 65 t RRR
Gini indexRRR
Modal ageRRR
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Debón, A.; Haberman, S.; Montes, F.; Otranto, E. Do Different Models Induce Changes in Mortality Indicators? That Is a Key Question for Extending the Lee-Carter Model. Int. J. Environ. Res. Public Health 2021, 18, 2204. https://doi.org/10.3390/ijerph18042204

AMA Style

Debón A, Haberman S, Montes F, Otranto E. Do Different Models Induce Changes in Mortality Indicators? That Is a Key Question for Extending the Lee-Carter Model. International Journal of Environmental Research and Public Health. 2021; 18(4):2204. https://doi.org/10.3390/ijerph18042204

Chicago/Turabian Style

Debón, Ana, Steven Haberman, Francisco Montes, and Edoardo Otranto. 2021. "Do Different Models Induce Changes in Mortality Indicators? That Is a Key Question for Extending the Lee-Carter Model" International Journal of Environmental Research and Public Health 18, no. 4: 2204. https://doi.org/10.3390/ijerph18042204

APA Style

Debón, A., Haberman, S., Montes, F., & Otranto, E. (2021). Do Different Models Induce Changes in Mortality Indicators? That Is a Key Question for Extending the Lee-Carter Model. International Journal of Environmental Research and Public Health, 18(4), 2204. https://doi.org/10.3390/ijerph18042204

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop