1. Introduction
Accurate mortality forecasts are crucial for social, economic, financial, and medical decisions, but no consensus exists about what mortality variable to use and how to produce its forecasts (see
Basellini et al. 2023;
Booth and Tickle 2008, for detailed reviews). A standard life table contains age-specific death rates, probabilities of death, survival probabilities, life-table deaths, and life expectancy. The most commonly modeled mortality variable is the age-specific mortality rate, popularized by the
Lee and Carter (
1992) model. However, the LC model assumes a constant mortality improvement rate over time, which would under-estimate life expectancy if there is an accelerating trend in mortality improvements (
Booth et al. 2002). This becomes a significant issue when health advancements and socioeconomic factors have accelerated mortality improvements in recent decades (
Siu et al. 2011).
This deficiency prompts many researchers to consider other mortality variables, such as life-table deaths (see, e.g.,
Bergeron-Boucher et al. 2017,
2018;
Oeppen 2008) or age-specific probabilities of death (see, e.g.,
Cairns et al. 2006,
2009). The life-table deaths resemble a probability density function with a non-negativity constraint and summability to the radix
. Similarly, the age-specific probabilities of death have two constraints: the non-negativity constraint and summability to one. When forecasting the probabilities of deaths, the logit transformation is often preferred due to the constraints (see, e.g.,
Li et al. 2015;
Sweeting 2011). We study the age-specific survival function, similar to the cumulative probability of deaths, whose values are bounded in a unit interval. Compared to the life-table deaths or probabilities of death, this mortality variable has only one constraint, which is that its values lie within the unit interval.
With a logit transformation, a time series of survival functions is transformed to the unbounded data without constraints except for one in the last age group. For all age groups but the last one, we introduce a functional time-series forecasting method to model the historical patterns and extrapolate them into the future. The forecasting method performs a functional principal component analysis to extract the first few functional principal components and their associated principal component scores. The principal component scores can be modeled and forecasted by a univariate time-series forecasting method. By multiplying the forecasted scores with the estimated functional principal components, we obtain h-step-ahead forecasts within the transformed space. By taking the inverse logit transformation, the forecast survival functions are obtained by adding zeros for the last age group.
In the insurance industry, insurers rely on accurate mortality forecasts to price annuities or set reserves for mortality-related products. Annuity sales in voluntary markets have historically been low in Australia, primarily because of the availability of publicly funded age pensions and mandatory private retirement savings, such as superannuation. The demand for and profitability of annuity products is impacted by the misalignment that exists between retirees’ perceptions of their remaining lifespan and insurance companies’ assumptions and projections. In addition, heterogeneity in mortality risk across various factors further dampens demand for generic annuities (
Brown and McDaid 2003).
However, the landscape has evolved in recent years. The desire for greater income, increased choice, and enhanced flexibility during retirement has surged, particularly fueled by rising inflation rates. Lifetime or fixed-term income stream products, particularly those offering investment-linked options, have witnessed growing popularity. With investment risk effectively transferred from insurers to policyholders, the level of confidence insurers place in their mortality forecasts significantly influences annuity pricing.
In this paper, we employ functional principal component analysis to forecast age-specific survival functions. We begin by introducing age- and sex-specific survival functions for Australia with logit transformation in
Section 2. In
Section 3, we apply
Hyndman and Ullah (
2007)’s method to model and forecast a time series of age- and sex-specific survival functions and then compare it with the widely used Lee–Carter method. In
Section 4, we develop a single-premium temporary immediate annuity pricing model based on survival functions and subsequently compare the results with annuity prices on sales.
Section 5 concludes with reflections on broader applications for the methods presented herein.
2. Age-Specific Survival Functions
We consider age- and sex-specific survival functions for Australia. These data can be publicly obtained from the
Human Mortality Database (
2024). For a given year
t, the survival rates reduce as age increases. There are 111 ages, which are age
. At the last age group of 110+, the survival rates reach their minimum value of zero.
The survival function represents the probability that an individual survives to age
x. According to this conditional feature, we construct survival function
as follows:
where
is the mortality rate at which people living to age
x are expected to die before reaching age
x + 1 and
. In another way, we can obtain the survival function
from the following expression:
where
i represents all time points from age 0 to
.
To understand the principal features of the data,
Figure 1 presents rainbow plots of the age- and sex-specific survival functions in Australia from 1921 to 2020. The time ordering of the curves follows the color code of a rainbow, where data from the distant past years are shown in red, and data from more recent years are shown in purple (see, e.g.,
Hyndman and Shang 2010). For both males and females, the increasing ‘rectangularization’ (
Wilmoth and Horiuchi 1999) of the survival curve indicates improving mortality across the lifespan. In contrast to the death rates, the survival functions are smooth.
The logit function can map data within bounded intervals onto unbounded data. The inverse logit function can ensure that the predicted values remain within the unit interval, which is important for probabilities. By taking the logit transformation, we obtain a time series of transformed data in
Figure 2. When the survival functions reach zero in the last age group, the logit transformation produces undefined values. So, we removed the ones from the logit transformation and added them to the forecasts.
3. Functional Time-Series Forecasting Method
3.1. Functional Principal Component Analysis
The model proposed by
Lee and Carter (
1992) is a widely utilized statistical tool in actuarial science and demography for forecasting mortality rates and life expectancy. The two-factor (Age–Period) Lee–Carter (LC) model can be expressed as follows:
where
represents the central death rate at age
x in year
t,
is the average log mortality at age
x over time,
measures the relative change in the log death rate at each age,
is the general level of the log death rate in year
t, and
is the residual error term at age
x and year
t.
The Lee–Carter model in Equation (
1) is under-determined. Therefore, two constraints were introduced to ensure the Lee–Carter model yields consistent results before re-estimating
based on observed total deaths in each year. The first constraint limits the sum of
to 1, while the second constraint requires that the sum of
equals zero.
As an extension to the LC method,
Hyndman and Ullah (
2007) (HU) introduced the HU method, which employs Functional Principal Component Analysis to decompose a set of curves into orthogonal functional principal components and their independent principal component scores. The HU method enhances the LC method through three key points:
- (1)
Smoothing the log mortality rates is achieved using weighted penalized regression splines, with a specific emphasis on preserving monotonicity at older ages.
- (2)
Utilizing multiple principal components.
- (3)
A wider array of univariate time series models can be employed to forecast the principal component scores.
Let
be a time series of cumulative probabilities of deaths, and let
be a time series of logit-transformed data defined on a common function support
;
is a function support and
R is the real line.
and
, where
are elements of the Hilbert space
equipped with the inner product
, where
u represents a continuum. Each function is a square-integrable function with a finite second moment. A non-negative definite covariance function is given by
for all
. The covariance function
in Equation (
2) allows the covariance operator of
, denoted by
, to be defined as
Via
Mercer (
1909)’s lemma, there exists an orthonormal sequence
of continuous functions in the square-integrable space and a non-increasing sequence
of positive numbers, such that
Via
Karhunen (
1947)–
Loève (
1978) expansion, a stochastic process
can be expressed as
where
denotes the mean function approximated by
, the
kth set of the principal component scores,
, is given by the projection of decentered data
in the direction of the
kth eigenfunction
,
denotes the model errors with zero mean and finite variance, and
denotes the number of retained functional principal components.
There exist various methods for determining the number of retained components:
- (a)
Scree plots or the fraction of variance explained by the first few functional principal components (
Chiou 2012);
- (b)
Pseudo-versions of Akaike information criterion and Bayesian information criterion (
Yao et al. 2005);
- (c)
- (d)
- (e)
As pointed out by
Hyndman et al. (
2013), the forecast accuracy is robust to over-estimating
K, but under-estimating
K can be detrimental. We consider
as in
Hyndman et al. (
2013). In contrast to the
Lee and Carter (
1992) method, the higher-order terms capture additional non-random patterns.
Table 1 gives the total variance explained by different choices of
K for comparison.
Conditional on the estimated mean function and functional principal components, the forecast curves can be obtained via the forecast principal component scores. Differing from the random walk with drift used in
Lee and Carter (
1992), a broader range of univariate time-series forecasting methods can be applied. While
Hyndman and Shang (
2009) proposed univariate time-series forecasting methods,
Aue et al. (
2015) considered multivariate time-series forecasting methods. As pointed out by
Peña and Sánchez (
2007), the multivariate time-series method can capture contemporaneous and instantaneous correlations among the principal component scores. However, it cannot handle a large number of parameters in the model. The univariate time-series method can not model cross-correlation between two sets of the principal component scores. However, it can handle the non-stationarity exhibited in any set of principal component scores. Between the univariate and multivariate time-series forecasting methods, we apply an exponential smoothing state space model of
Hyndman et al. (
2008).
Considering the typical retirement age in Australia, we set age 65 as the starting point. For instance, when determining the annuity price for an individual aged 65 to 110, a prediction spanning 45 years is required. Based on the Australian female and male data from 1921 to 2020, we forecast age- and sex-specific survival functions from 2021 to 2065 as
Figure 3.
3.2. Comparison of the Point Forecast Accuracy
Backtesting is performed by splitting the Australian female and male survival functions into two sets: the training dataset from 1921 to 2000 and the testing dataset from 2001 to 2020. The forecast results from each method are then compared with the testing dataset to evaluate the method’s finite-sample performance. As an illustration,
Figure 4 illustrates the forecast results of the survival function in 2001, 2010, and 2020. For both males and females in each sample year, the figure demonstrates that the FPCR curve aligns more closely with the actual data compared to the LC curve.
To evaluate and compare the forecast accuracy, we consider mean absolute forecast error (MAFE) and mean absolute percentage error (MAPE). The MAFE quantifies the average absolute deviation between predicted values and observed values. The MAPE resembles the MAFE but is represented as a percentage of the actual value. With our focus on the post-retirement stage, we observe individuals ranging from 65 to 110 years old, encompassing a total of 46 ages in
h-step-ahead survival function forecast. The MAFE and MAPE are then defined as follows:
where
denotes the actual survival function for the
age and
forecasting year, while
denotes the point forecasts for survival function.
Figure 5 displays the 1-to-20-step-ahead point forecast errors averaged over 20 forecast horizons. The errors become larger for both methods in longer forecast horizons. The FPCR produces lower MAFE and MAPE values than the ones from the LC for the male and female data.
The FPCR produces more accurate survival function forecasts than the ones obtained from the LC method. Another finding is that our model fits better to Australian female data than the male data. A summary of the averaged test results can be found in
Table 2.
3.3. Comparison of the Interval Forecast Accuracy
Prediction intervals are crucial tools for assessing the probabilistic uncertainty associated with point forecasts. This uncertainty can arise from systematic factors like parameter or model uncertainty, as well as random fluctuations such as error terms in specific models. As emphasized by
Chatfield (
1993,
2000), interval forecasts are as vital as point forecasts for several reasons:
- (1)
Quantify the levels of future uncertainty;
- (2)
Help decision makers to understand the range of possible outcomes before making more informed choices;
- (3)
Assess the quality of different forecasting methods more thoroughly;
- (4)
Take varying assumptions to explore diverse scenarios.
To measure forecast uncertainty, we construct the pointwise prediction intervals by using a nonparametric bootstrap method. The
forecast.ftsm function from the ftsa package (
Hyndman and Shang 2024) in
version 6.4 provides a solution to apply this method. The nonparametric bootstrap method can be applied through a simple procedure as below:
- (1)
Sample Resampling: The nonparametric model can be utilized to estimate the h-step-ahead forecasts for the kth principal component scores and model fit errors . That is, we can generate bootstrap samples of principal component scores and model residual terms by randomly sampling with replacements from the estimated principal component scores and residual terms.
- (2)
Statistic Calculation: After obtaining the bootstrapped principal component scores and errors in the model residuals, one can then obtain the bootstrapped survival function
. We take the first six principal components as suggested by
Hyndman and Booth (
2008) and express
where
is the estimated mean function,
denotes the
kth functional principal component,
denotes the bootstrapped principal component scores,
represents the bootstrapped model residuals, and
denotes the number of replications.
- (3)
Repeat Resampling: Repeat processes 1 and 2 above numerous times to construct a bootstrap distribution. The nonparametric bootstrap method closely resembles a Monte Carlo simulation experiment. We choose to run it B times to ensure that calculations based on the bootstrap distribution remain unaffected by significant Monte Carlo errors. Consequently, we obtain 1000 values of the bootstrapped survival functions , which we use to compute the 95% pointwise prediction intervals of the forecast results.
4. Single-Premium Temporary Immediate Annuity Pricing
Annuities guarantee the purchaser regular future payments, typically monthly and often extending for life. Under this overarching concept, various annuity types cater to distinct financial objectives. The primary categories include fixed or variable annuities, temporary or lifetime annuities, and immediate or deferred annuities.
A fixed annuity ensures a consistent payment throughout the agreement’s term, remaining constant without decrease or increase. In contrast, a variable annuity’s value fluctuates in line with the performance of the underlying mutual funds it is invested in, allowing for potential increase or decrease. Temporary annuities provide payments for a specified period, such as 5, 10, or 20 years, while lifetime annuities cover the remainder of the annuitant’s life. An immediate annuity initiates payouts immediately after the annuitant makes a lump-sum payment to the insurer. In contrast, a deferred annuity commences payments at a future date that is predetermined by the annuitant.
Annuitants commonly purchase annuities to complement their existing retirement income sources, such as pensions and superannuation. An annuity offers an assured income stream to provide a sense of security to annuitants. Even if their other financial resources diminish, they will still receive an additional steady income stream from the annuity.
Guaranteed lifetime annuities can serve as a suitable option for individuals seeking a steady income to complement their superannuation, pensions, or other investments. However, lifetime annuities might be perceived as expensive due to several reasons, such as longevity risk, guaranteed payouts, and administrative costs.
4.1. Annuity Pricing Based on Survival Functions
We study term annuity pricing by considering maturity-specific contracts that make the annuity almost lifetime for those individuals aged over 65 (see also
Shang and Haberman 2020). The available maturity dates range from 5 to 45, increasing in 5-year increments, to encompass ages up to 110. These temporary annuities guarantee a fixed income level that exceeds the income provided by a lifetime annuity for a similar premium. They serve as an alternative to lifetime annuities and offer the buyer the flexibility to explore the option of purchasing a deferred annuity at a later date.
The cost of an annuity, which has a maturity term of
T up to 45 years, is a stochastic variable since it depends on the zero-coupon bond price and future mortality rates. The price of the annuity for an individual aged
x years with a benefit of one Australian dollar per year under a constant interest rate (
i%) environment can be expressed as follows:
where
represents the survival probability that can be derived from methods discussed in
Section 3. A cohort approach is adopted in calculating the survival probabilities. The probability of survival for a person at time
(or year 2021) for
years, given their current age
x, can be obtained from the formula below:
where
represents the survival function for age
at time
(e.g.,
years after year 2021) and
is the survival function for an
x-year-old person at time 0.
By considering annuity purchase ages from 65 to 110, 45-step-ahead point and interval forecasts of survival functions are constructed to measure forecast uncertainty. The maturity term of
T years is selected carefully to meet the condition that sum of age and maturity is less than 110. By combining Equations (
3) and (
4), the annuity price for each purchase age and maturity date can be expressed as follows:
If we add the overhead cost loading
L into the calculation, Equation (
5) leads to
Taking survival function forecast results from
Section 3, we can calculate the best estimate of the annuity prices with different ages and maturity dates for Australian female and male policyholders in
Table 3.
We exclude overhead cost loading in our calculations, i.e., . The reason is that overhead cost loading varies between companies and can be easily incorporated by multiplying the final results by . The constant interest rate is set at 2.5%, based on Australia’s long-term inflation target of 2% to 3%.
The 95% pointwise prediction intervals for the annuity pricing results can also be computed for the Australian female and male residents in
Table 4.
The assumption of a constant interest rate greatly affects the calculation of annuity prices. To gauge this sensitivity, we perform a sensitivity test using interest rate assumptions of 1% and 5%. Detailed results for both the Australian male and female residents are provided in the
Appendix A.
4.2. Compare Results with Annuity Price on Sales
Australia’s annuity landscape would greatly benefit from increased competition and diversity among providers. As of the end of 2023, Challenger Life stands as Australia’s largest and dominant annuity provider, furnishing steady income to thousands of clients. According to
Challenger (
2024), the Challenger Lifetime Annuity (Liquid Lifetime) offers three options, so customers can tailor it to meet their needs. These three options can be differentiated by payment delivery options, namely, immediate payments, deferred payments, and market-linked payments. An immediate payment annuity allows individuals to convert a lump sum of money into a stream of income payments right away. A deferred payment annuity enables individuals to make either a lump-sum payment or periodic premiums to an insurance company and then receive regular income payments at a future date in return. Market-linked payments, as the name suggests, are tied to market indexes, influencing the payments customers receive.
We take the prices for immediate payment annuity from
Challenger (
2024) to compare with the annuity prices obtained from
Section 4.1 based on the survival function forecast. The provided rates serve as estimates and are intended for illustration purposes only. Challenger typically updates these rates on a weekly basis. Prices are segregated by gender and are based on an initial investment of
$100,000 to obtain annuity payments at different starting ages, as detailed in
Table 5.
The Challenger lifetime annuity permits withdrawals up to the specified withdrawal period for each starting age. The withdrawal amount is determined by the remaining product value on the withdrawal date, and may be subject to deduction of administration fees and other charges. Following the end of the withdrawal period, the annuity product is considered to have no remaining withdrawal value, and, therefore, cannot be withdrawn or have any purpose for withdrawal. Payment rates vary depending on the chosen inflation protection options, including full protection, partial protection, or no protection. The payment amount can also be linked to the Reserve Bank of Australia (RBA) cash rate rather than the consumer price index.
The prices linked to the longest maturity dates in our survival function model can be considered as lifetime benefits, as
Human Mortality Database (
2024) assumes no individuals are alive beyond age 110. To build up a fair comparison, we backtrack to determine the prices for the Challenger product, calculating the initial investment needed to obtain a
$1 annual benefit for a lifetime for each payment option. We opt to compare the non-inflation-protected annuity payment options Challenger offers with the outcome of our survival function models for each starting age and gender in
Table 6.
The comparison yields intriguing insights. For individuals commencing at ages 65 and 70,
Challenger (
2024)’s non-inflation-protected annuity prices align neatly between the 2.5% and 5% interest sensitivity test outcomes for survival function, which is logical given the RBA cash rate of 4.35% as of 17 March 2024. However, this seems not the case when we look into the results for starting ages of 75 and 80. At age 75, Challenger’s prices go closer to our model’s 2.5% interest results, but they converge towards the 1% interest outcomes at age 80. This suggests a potential divergence in mortality curve conservatism or added loadings reflecting the greater uncertainty in outcomes and wider prediction intervals for older age groups in
Challenger (
2024)’s pricing model. These discoveries carry significant weight, suggesting that insurers might be able to fine-tune annuity pricing to foster competition and boost business activity, provided our mortality forecast models attain adequate credibility.
5. Conclusions
Survival probabilities are less prone to random fluctuations due to the cumulative nature of the indicator (from birth to age
x), suggesting that their use could provide more robust forecasts. We begin by introducing two distinct mortality forecasting methods employed for projecting survival functions: the Lee–Carter method and the functional principal component regression method. For the FPCR, we choose
Hyndman and Ullah (
2007)’s model to forecast a time series of age- and sex-specific survival functions. The LC method and the FPCR are subsequently subjected to a backtesting process, utilizing both Australian male and female mortality data spanning from 1921 to 2000 from the
Human Mortality Database (
2024).
Our objective is to validate the performance of these methods by comparing their projections for the period 2001 to 2020 with the actual data from the same dataset. After the backtesting phase, we recommend the use of the FPCR for forecasting based on multiple evaluation metrics, namely MAFE and MAPE. Following the selection of the FPCR, this article proceeds to apply this method to forecast survival functions for the Australian population from 2021 to 2065. The focus of the forecast is on age groups ranging from 65 to 110, known as the retirement stage.
The survival function forecasts for the Australian population are utilized in a single-premium temporary immediate annuity pricing model. This model facilitates the calculation of annuity prices for Australian policyholders across various starting ages and maturity dates. The integration of these forecast outcomes into the annuity pricing model offers insights into the pricing dynamics for single-premium annuities for policyholders in Australia. The annuity pricing results also come with 95% pointwise prediction intervals, offering a measure of forecast uncertainty.
We compare the annuity prices derived from survival functions with the Challenger immediate payment annuity prices for each starting age. It appears that Challenger sets higher annuity prices for older age groups, which may be due to the inherent uncertainty in long-term mortality forecasts. To ensure reproducibility, the
codes for generating point and interval forecasts using our method are available at
https://www.github.com/wondersky8086/survival-function-forecast.git (accessed on 19 June 2024).
There are several ways in which the methodology presented here can be further extended, and we briefly mention two:
- (1)
We consider an independent functional time-series forecasting method to separately model and forecast female and male survival functions. One could consider a joint modeling technique, such as multivariate and multilevel functional time-series methods of
Shang and Kearney (
2022), which captures the possible correlation between the series.
- (2)
Our data consist of all available years from the
Human Mortality Database (
2024), but it may be subject to structural change. One possible idea is to apply a change point detection method from
Shang and Xu (
2022) to identify potential change points, and then fit our forecasting method to a truncated sample period.