Next Article in Journal
Sustainable Development of Entrepreneurship through Operational Risk Management: The Role of Corporate Social Responsibility
Next Article in Special Issue
Fair and Sustainable Pension System: Market Equilibrium Using Implied Options
Previous Article in Journal
Towards Diagnosing and Mitigating Behavioral Cyber Risks
Previous Article in Special Issue
Use of Prediction Bias in Active Learning and Its Application to Large Variable Annuity Portfolios
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Forecasting Age- and Sex-Specific Survival Functions: Application to Annuity Pricing

by
Shaokang Wang
1,*,
Han Lin Shang
1,
Leonie Tickle
1 and
Han Li
2
1
Department of Actuarial Studies and Business Analytics, Macquarie University, Macquarie Park, NSW 2109, Australia
2
Department of Economics, The University of Melbourne, Melbourne, VIC 3010, Australia
*
Author to whom correspondence should be addressed.
Risks 2024, 12(7), 117; https://doi.org/10.3390/risks12070117
Submission received: 19 June 2024 / Revised: 10 July 2024 / Accepted: 18 July 2024 / Published: 22 July 2024
(This article belongs to the Special Issue Risks Journal: A Decade of Advancing Knowledge and Shaping the Future)

Abstract

:
We introduce the function principal component regression (FPCR) forecasting method to model and forecast age-specific survival functions observed over time. The age distribution of survival functions is an example of constrained data whose values lie within a unit interval. Because of the constraint, such data do not reside in a linear vector space. A natural way to deal with such a constraint is through an invertible logit transformation that maps constrained onto unconstrained data in a linear space. With a time series of unconstrained data, we apply a functional time-series forecasting method to produce point and interval forecasts. The forecasts are then converted back to the original scale via the inverse logit transformation. Using the age- and sex-specific survival functions for Australia, we investigate the point and interval forecast accuracies for various horizons. We conclude that the functional principal component regression (FPCR) provides better forecast accuracy than the Lee–Carter (LC) method. Therefore, we apply FPCR to calculate annuity pricing and compare it with the market annuity price.

1. Introduction

Accurate mortality forecasts are crucial for social, economic, financial, and medical decisions, but no consensus exists about what mortality variable to use and how to produce its forecasts (see Basellini et al. 2023; Booth and Tickle 2008, for detailed reviews). A standard life table contains age-specific death rates, probabilities of death, survival probabilities, life-table deaths, and life expectancy. The most commonly modeled mortality variable is the age-specific mortality rate, popularized by the Lee and Carter (1992) model. However, the LC model assumes a constant mortality improvement rate over time, which would under-estimate life expectancy if there is an accelerating trend in mortality improvements (Booth et al. 2002). This becomes a significant issue when health advancements and socioeconomic factors have accelerated mortality improvements in recent decades (Siu et al. 2011).
This deficiency prompts many researchers to consider other mortality variables, such as life-table deaths (see, e.g., Bergeron-Boucher et al. 2017, 2018; Oeppen 2008) or age-specific probabilities of death (see, e.g., Cairns et al. 2006, 2009). The life-table deaths resemble a probability density function with a non-negativity constraint and summability to the radix 10 5 . Similarly, the age-specific probabilities of death have two constraints: the non-negativity constraint and summability to one. When forecasting the probabilities of deaths, the logit transformation is often preferred due to the constraints (see, e.g., Li et al. 2015; Sweeting 2011). We study the age-specific survival function, similar to the cumulative probability of deaths, whose values are bounded in a unit interval. Compared to the life-table deaths or probabilities of death, this mortality variable has only one constraint, which is that its values lie within the unit interval.
With a logit transformation, a time series of survival functions is transformed to the unbounded data without constraints except for one in the last age group. For all age groups but the last one, we introduce a functional time-series forecasting method to model the historical patterns and extrapolate them into the future. The forecasting method performs a functional principal component analysis to extract the first few functional principal components and their associated principal component scores. The principal component scores can be modeled and forecasted by a univariate time-series forecasting method. By multiplying the forecasted scores with the estimated functional principal components, we obtain h-step-ahead forecasts within the transformed space. By taking the inverse logit transformation, the forecast survival functions are obtained by adding zeros for the last age group.
In the insurance industry, insurers rely on accurate mortality forecasts to price annuities or set reserves for mortality-related products. Annuity sales in voluntary markets have historically been low in Australia, primarily because of the availability of publicly funded age pensions and mandatory private retirement savings, such as superannuation. The demand for and profitability of annuity products is impacted by the misalignment that exists between retirees’ perceptions of their remaining lifespan and insurance companies’ assumptions and projections. In addition, heterogeneity in mortality risk across various factors further dampens demand for generic annuities (Brown and McDaid 2003).
However, the landscape has evolved in recent years. The desire for greater income, increased choice, and enhanced flexibility during retirement has surged, particularly fueled by rising inflation rates. Lifetime or fixed-term income stream products, particularly those offering investment-linked options, have witnessed growing popularity. With investment risk effectively transferred from insurers to policyholders, the level of confidence insurers place in their mortality forecasts significantly influences annuity pricing.
In this paper, we employ functional principal component analysis to forecast age-specific survival functions. We begin by introducing age- and sex-specific survival functions for Australia with logit transformation in Section 2. In Section 3, we apply Hyndman and Ullah (2007)’s method to model and forecast a time series of age- and sex-specific survival functions and then compare it with the widely used Lee–Carter method. In Section 4, we develop a single-premium temporary immediate annuity pricing model based on survival functions and subsequently compare the results with annuity prices on sales. Section 5 concludes with reflections on broader applications for the methods presented herein.

2. Age-Specific Survival Functions

We consider age- and sex-specific survival functions for Australia. These data can be publicly obtained from the Human Mortality Database (2024). For a given year t, the survival rates reduce as age increases. There are 111 ages, which are age 0 , 1 , , 109 , 110 + . At the last age group of 110+, the survival rates reach their minimum value of zero.
The survival function represents the probability that an individual survives to age x. According to this conditional feature, we construct survival function S ( x ) as follows:
S ( x + 1 ) = S ( x ) × ( 1 q x )
where q x is the mortality rate at which people living to age x are expected to die before reaching age x + 1 and S ( 0 ) = 1 . In another way, we can obtain the survival function S ( x ) from the following expression:
S ( x ) = i = 0 x 1 ( 1 q i )
where i represents all time points from age 0 to x 1 .
To understand the principal features of the data, Figure 1 presents rainbow plots of the age- and sex-specific survival functions in Australia from 1921 to 2020. The time ordering of the curves follows the color code of a rainbow, where data from the distant past years are shown in red, and data from more recent years are shown in purple (see, e.g., Hyndman and Shang 2010). For both males and females, the increasing ‘rectangularization’ (Wilmoth and Horiuchi 1999) of the survival curve indicates improving mortality across the lifespan. In contrast to the death rates, the survival functions are smooth.
The logit function can map data within bounded intervals onto unbounded data. The inverse logit function can ensure that the predicted values remain within the unit interval, which is important for probabilities. By taking the logit transformation, we obtain a time series of transformed data in Figure 2. When the survival functions reach zero in the last age group, the logit transformation produces undefined values. So, we removed the ones from the logit transformation and added them to the forecasts.

3. Functional Time-Series Forecasting Method

3.1. Functional Principal Component Analysis

The model proposed by Lee and Carter (1992) is a widely utilized statistical tool in actuarial science and demography for forecasting mortality rates and life expectancy. The two-factor (Age–Period) Lee–Carter (LC) model can be expressed as follows:
ln ( m x , t ) = a x + b x k t + ϵ x , t ,
where m x , t represents the central death rate at age x in year t, a x is the average log mortality at age x over time, b x measures the relative change in the log death rate at each age, k t is the general level of the log death rate in year t, and ε x , t is the residual error term at age x and year t.
The Lee–Carter model in Equation (1) is under-determined. Therefore, two constraints were introduced to ensure the Lee–Carter model yields consistent results before re-estimating k t based on observed total deaths in each year. The first constraint limits the sum of b x to 1, while the second constraint requires that the sum of k t equals zero.
As an extension to the LC method, Hyndman and Ullah (2007) (HU) introduced the HU method, which employs Functional Principal Component Analysis to decompose a set of curves into orthogonal functional principal components and their independent principal component scores. The HU method enhances the LC method through three key points:
(1)
Smoothing the log mortality rates is achieved using weighted penalized regression splines, with a specific emphasis on preserving monotonicity at older ages.
(2)
Utilizing multiple principal components.
(3)
A wider array of univariate time series models can be employed to forecast the principal component scores.
Let S ( x ) = [ S 1 ( x ) , S 2 ( x ) , , S n ( x ) ] be a time series of cumulative probabilities of deaths, and let X ( u ) = [ X 1 ( u ) , X 2 ( u ) , , X n ( u ) ] be a time series of logit-transformed data defined on a common function support u I R ; I is a function support and R is the real line. S t ( x ) and X t ( u ) , where t { 1 , , n } are elements of the Hilbert space H equipped with the inner product w , v = I w ( u ) v ( u ) d u , where u represents a continuum. Each function is a square-integrable function with a finite second moment. A non-negative definite covariance function is given by
c ( u , v ) : = Cov [ X ( u ) , X ( v ) ] = E { [ X ( u ) μ ( u ) ] [ X ( v ) μ ( v ) ] } ,
for all u , v I . The covariance function c ( u , v ) in Equation (2) allows the covariance operator of X , denoted by K , to be defined as
K ( ϕ ) ( u ) = I c ( u , v ) ϕ ( v ) d v .
Via Mercer (1909)’s lemma, there exists an orthonormal sequence ( ϕ k ) of continuous functions in the square-integrable space and a non-increasing sequence λ k of positive numbers, such that
c ( u , v ) = k = 1 λ k ϕ k ( u ) ϕ k ( v ) .
Via Karhunen (1947)–Loève (1978) expansion, a stochastic process X t ( u ) can be expressed as
X t ( u ) = μ ( u ) + k = 1 β t , k ϕ k ( u ) = μ ( u ) + k = 1 K β t , k ϕ k ( u ) + e ( u ) ,
where μ ( u ) denotes the mean function approximated by X ¯ ( u ) = 1 n t = 1 n X t ( u ) , the kth set of the principal component scores, β t , k = X t ( u ) μ ( u ) , ϕ k ( u ) , is given by the projection of decentered data X t ( u ) μ ( u ) in the direction of the kth eigenfunction ϕ k , e ( u ) denotes the model errors with zero mean and finite variance, and K < n denotes the number of retained functional principal components.
There exist various methods for determining the number of retained components:
(a)
Scree plots or the fraction of variance explained by the first few functional principal components (Chiou 2012);
(b)
Pseudo-versions of Akaike information criterion and Bayesian information criterion (Yao et al. 2005);
(c)
Predictive cross-validation leaving out one or more curves (Ramsay and Silverman 2006);
(d)
Bootstrap methods (Hall and Vial 2006);
(e)
Eigenvalue ratio criterion (EVR) (Lam and Yao 2012).
As pointed out by Hyndman et al. (2013), the forecast accuracy is robust to over-estimating K, but under-estimating K can be detrimental. We consider K = 6 as in Hyndman et al. (2013). In contrast to the Lee and Carter (1992) method, the higher-order terms capture additional non-random patterns. Table 1 gives the total variance explained by different choices of K for comparison.
Conditional on the estimated mean function and functional principal components, the forecast curves can be obtained via the forecast principal component scores. Differing from the random walk with drift used in Lee and Carter (1992), a broader range of univariate time-series forecasting methods can be applied. While Hyndman and Shang (2009) proposed univariate time-series forecasting methods, Aue et al. (2015) considered multivariate time-series forecasting methods. As pointed out by Peña and Sánchez (2007), the multivariate time-series method can capture contemporaneous and instantaneous correlations among the principal component scores. However, it cannot handle a large number of parameters in the model. The univariate time-series method can not model cross-correlation between two sets of the principal component scores. However, it can handle the non-stationarity exhibited in any set of principal component scores. Between the univariate and multivariate time-series forecasting methods, we apply an exponential smoothing state space model of Hyndman et al. (2008).
Considering the typical retirement age in Australia, we set age 65 as the starting point. For instance, when determining the annuity price for an individual aged 65 to 110, a prediction spanning 45 years is required. Based on the Australian female and male data from 1921 to 2020, we forecast age- and sex-specific survival functions from 2021 to 2065 as Figure 3.

3.2. Comparison of the Point Forecast Accuracy

Backtesting is performed by splitting the Australian female and male survival functions into two sets: the training dataset from 1921 to 2000 and the testing dataset from 2001 to 2020. The forecast results from each method are then compared with the testing dataset to evaluate the method’s finite-sample performance. As an illustration, Figure 4 illustrates the forecast results of the survival function in 2001, 2010, and 2020. For both males and females in each sample year, the figure demonstrates that the FPCR curve aligns more closely with the actual data compared to the LC curve.
To evaluate and compare the forecast accuracy, we consider mean absolute forecast error (MAFE) and mean absolute percentage error (MAPE). The MAFE quantifies the average absolute deviation between predicted values and observed values. The MAPE resembles the MAFE but is represented as a percentage of the actual value. With our focus on the post-retirement stage, we observe individuals ranging from 65 to 110 years old, encompassing a total of 46 ages in h-step-ahead survival function forecast. The MAFE and MAPE are then defined as follows:
MAFE ( h ) = 1 46 × ( 21 h ) x = 65 110 ζ = 1 21 h S ( x , ζ ) S ^ ( x , ζ ) MAPE ( h ) = 100 46 × ( 21 h ) x = 65 110 ζ = 1 21 h S ( x , ζ ) S ^ ( x , ζ ) S ( x , ζ )
where S ( x , ζ ) denotes the actual survival function for the x th age and ζ th forecasting year, while S ^ ( x , ζ ) denotes the point forecasts for survival function.
Figure 5 displays the 1-to-20-step-ahead point forecast errors averaged over 20 forecast horizons. The errors become larger for both methods in longer forecast horizons. The FPCR produces lower MAFE and MAPE values than the ones from the LC for the male and female data.
The FPCR produces more accurate survival function forecasts than the ones obtained from the LC method. Another finding is that our model fits better to Australian female data than the male data. A summary of the averaged test results can be found in Table 2.

3.3. Comparison of the Interval Forecast Accuracy

Prediction intervals are crucial tools for assessing the probabilistic uncertainty associated with point forecasts. This uncertainty can arise from systematic factors like parameter or model uncertainty, as well as random fluctuations such as error terms in specific models. As emphasized by Chatfield (1993, 2000), interval forecasts are as vital as point forecasts for several reasons:
(1)
Quantify the levels of future uncertainty;
(2)
Help decision makers to understand the range of possible outcomes before making more informed choices;
(3)
Assess the quality of different forecasting methods more thoroughly;
(4)
Take varying assumptions to explore diverse scenarios.
To measure forecast uncertainty, we construct the pointwise prediction intervals by using a nonparametric bootstrap method. The forecast.ftsm function from the ftsa package (Hyndman and Shang 2024) in Risks 12 00117 i001 version 6.4 provides a solution to apply this method. The nonparametric bootstrap method can be applied through a simple procedure as below:
(1)
Sample Resampling: The nonparametric model can be utilized to estimate the h-step-ahead forecasts for the kth principal component scores β ^ n + h | n , k and model fit errors e ^ n + h . That is, we can generate bootstrap samples of principal component scores and model residual terms by randomly sampling with replacements from the estimated principal component scores and residual terms.
(2)
Statistic Calculation: After obtaining the bootstrapped principal component scores and errors in the model residuals, one can then obtain the bootstrapped survival function S ^ ( x ) . We take the first six principal components as suggested by Hyndman and Booth (2008) and express
S ^ n + h | n ( b ) ( x ) = μ ^ ( x ) + k = 1 6 β ^ n + h | n , k ( b ) ϕ k ( x ) + e ^ n + h ( b ) ( x ) , b = 1 , 2 , B ,
where μ ^ ( x ) is the estimated mean function, ϕ k ( x ) denotes the kth functional principal component, β ^ n + h | n , k ( b ) denotes the bootstrapped principal component scores, e ^ n + h ( b ) ( x ) represents the bootstrapped model residuals, and B = 1000 denotes the number of replications.
(3)
Repeat Resampling: Repeat processes 1 and 2 above numerous times to construct a bootstrap distribution. The nonparametric bootstrap method closely resembles a Monte Carlo simulation experiment. We choose to run it B times to ensure that calculations based on the bootstrap distribution remain unaffected by significant Monte Carlo errors. Consequently, we obtain 1000 values of the bootstrapped survival functions { S ^ n + h | n ( 1 ) ( x ) , S ^ n + h | n ( 2 ) ( x ) , , S ^ n + h | n ( 1000 ) ( x ) } , which we use to compute the 95% pointwise prediction intervals of the forecast results.

4. Single-Premium Temporary Immediate Annuity Pricing

Annuities guarantee the purchaser regular future payments, typically monthly and often extending for life. Under this overarching concept, various annuity types cater to distinct financial objectives. The primary categories include fixed or variable annuities, temporary or lifetime annuities, and immediate or deferred annuities.
A fixed annuity ensures a consistent payment throughout the agreement’s term, remaining constant without decrease or increase. In contrast, a variable annuity’s value fluctuates in line with the performance of the underlying mutual funds it is invested in, allowing for potential increase or decrease. Temporary annuities provide payments for a specified period, such as 5, 10, or 20 years, while lifetime annuities cover the remainder of the annuitant’s life. An immediate annuity initiates payouts immediately after the annuitant makes a lump-sum payment to the insurer. In contrast, a deferred annuity commences payments at a future date that is predetermined by the annuitant.
Annuitants commonly purchase annuities to complement their existing retirement income sources, such as pensions and superannuation. An annuity offers an assured income stream to provide a sense of security to annuitants. Even if their other financial resources diminish, they will still receive an additional steady income stream from the annuity.
Guaranteed lifetime annuities can serve as a suitable option for individuals seeking a steady income to complement their superannuation, pensions, or other investments. However, lifetime annuities might be perceived as expensive due to several reasons, such as longevity risk, guaranteed payouts, and administrative costs.

4.1. Annuity Pricing Based on Survival Functions

We study term annuity pricing by considering maturity-specific contracts that make the annuity almost lifetime for those individuals aged over 65 (see also Shang and Haberman 2020). The available maturity dates range from 5 to 45, increasing in 5-year increments, to encompass ages up to 110. These temporary annuities guarantee a fixed income level that exceeds the income provided by a lifetime annuity for a similar premium. They serve as an alternative to lifetime annuities and offer the buyer the flexibility to explore the option of purchasing a deferred annuity at a later date.
The cost of an annuity, which has a maturity term of T up to 45 years, is a stochastic variable since it depends on the zero-coupon bond price and future mortality rates. The price of the annuity for an individual aged x years with a benefit of one Australian dollar per year under a constant interest rate (i%) environment can be expressed as follows:
a x T = τ = 1 T τ p x e i τ
where τ p x represents the survival probability that can be derived from methods discussed in Section 3. A cohort approach is adopted in calculating the survival probabilities. The probability of survival for a person at time t = 0 (or year 2021) for τ years, given their current age x, can be obtained from the formula below:
τ p x = S ( x + τ , τ ) S ( x , 0 )
where S ( x + τ , τ ) represents the survival function for age x + τ at time τ (e.g., τ years after year 2021) and S ( x , 0 ) is the survival function for an x-year-old person at time 0.
By considering annuity purchase ages from 65 to 110, 45-step-ahead point and interval forecasts of survival functions are constructed to measure forecast uncertainty. The maturity term of T years is selected carefully to meet the condition that sum of age and maturity is less than 110. By combining Equations (3) and (4), the annuity price for each purchase age and maturity date can be expressed as follows:
a x T = τ = 1 T S ( x + τ , τ ) S ( x , 0 ) e i τ .
If we add the overhead cost loading L into the calculation, Equation (5) leads to
a x T = ( 1 + L ) × τ = 1 T S ( x + τ , τ ) S ( x , 0 ) e i τ .
Taking survival function forecast results from Section 3, we can calculate the best estimate of the annuity prices with different ages and maturity dates for Australian female and male policyholders in Table 3.
We exclude overhead cost loading in our calculations, i.e., L = 0 . The reason is that overhead cost loading varies between companies and can be easily incorporated by multiplying the final results by 1 + L . The constant interest rate is set at 2.5%, based on Australia’s long-term inflation target of 2% to 3%.
The 95% pointwise prediction intervals for the annuity pricing results can also be computed for the Australian female and male residents in Table 4.
The assumption of a constant interest rate greatly affects the calculation of annuity prices. To gauge this sensitivity, we perform a sensitivity test using interest rate assumptions of 1% and 5%. Detailed results for both the Australian male and female residents are provided in the Appendix A.

4.2. Compare Results with Annuity Price on Sales

Australia’s annuity landscape would greatly benefit from increased competition and diversity among providers. As of the end of 2023, Challenger Life stands as Australia’s largest and dominant annuity provider, furnishing steady income to thousands of clients. According to Challenger (2024), the Challenger Lifetime Annuity (Liquid Lifetime) offers three options, so customers can tailor it to meet their needs. These three options can be differentiated by payment delivery options, namely, immediate payments, deferred payments, and market-linked payments. An immediate payment annuity allows individuals to convert a lump sum of money into a stream of income payments right away. A deferred payment annuity enables individuals to make either a lump-sum payment or periodic premiums to an insurance company and then receive regular income payments at a future date in return. Market-linked payments, as the name suggests, are tied to market indexes, influencing the payments customers receive.
We take the prices for immediate payment annuity from Challenger (2024) to compare with the annuity prices obtained from Section 4.1 based on the survival function forecast. The provided rates serve as estimates and are intended for illustration purposes only. Challenger typically updates these rates on a weekly basis. Prices are segregated by gender and are based on an initial investment of $100,000 to obtain annuity payments at different starting ages, as detailed in Table 5.
The Challenger lifetime annuity permits withdrawals up to the specified withdrawal period for each starting age. The withdrawal amount is determined by the remaining product value on the withdrawal date, and may be subject to deduction of administration fees and other charges. Following the end of the withdrawal period, the annuity product is considered to have no remaining withdrawal value, and, therefore, cannot be withdrawn or have any purpose for withdrawal. Payment rates vary depending on the chosen inflation protection options, including full protection, partial protection, or no protection. The payment amount can also be linked to the Reserve Bank of Australia (RBA) cash rate rather than the consumer price index.
The prices linked to the longest maturity dates in our survival function model can be considered as lifetime benefits, as Human Mortality Database (2024) assumes no individuals are alive beyond age 110. To build up a fair comparison, we backtrack to determine the prices for the Challenger product, calculating the initial investment needed to obtain a $1 annual benefit for a lifetime for each payment option. We opt to compare the non-inflation-protected annuity payment options Challenger offers with the outcome of our survival function models for each starting age and gender in Table 6.
The comparison yields intriguing insights. For individuals commencing at ages 65 and 70, Challenger (2024)’s non-inflation-protected annuity prices align neatly between the 2.5% and 5% interest sensitivity test outcomes for survival function, which is logical given the RBA cash rate of 4.35% as of 17 March 2024. However, this seems not the case when we look into the results for starting ages of 75 and 80. At age 75, Challenger’s prices go closer to our model’s 2.5% interest results, but they converge towards the 1% interest outcomes at age 80. This suggests a potential divergence in mortality curve conservatism or added loadings reflecting the greater uncertainty in outcomes and wider prediction intervals for older age groups in Challenger (2024)’s pricing model. These discoveries carry significant weight, suggesting that insurers might be able to fine-tune annuity pricing to foster competition and boost business activity, provided our mortality forecast models attain adequate credibility.

5. Conclusions

Survival probabilities are less prone to random fluctuations due to the cumulative nature of the indicator (from birth to age x), suggesting that their use could provide more robust forecasts. We begin by introducing two distinct mortality forecasting methods employed for projecting survival functions: the Lee–Carter method and the functional principal component regression method. For the FPCR, we choose Hyndman and Ullah (2007)’s model to forecast a time series of age- and sex-specific survival functions. The LC method and the FPCR are subsequently subjected to a backtesting process, utilizing both Australian male and female mortality data spanning from 1921 to 2000 from the Human Mortality Database (2024).
Our objective is to validate the performance of these methods by comparing their projections for the period 2001 to 2020 with the actual data from the same dataset. After the backtesting phase, we recommend the use of the FPCR for forecasting based on multiple evaluation metrics, namely MAFE and MAPE. Following the selection of the FPCR, this article proceeds to apply this method to forecast survival functions for the Australian population from 2021 to 2065. The focus of the forecast is on age groups ranging from 65 to 110, known as the retirement stage.
The survival function forecasts for the Australian population are utilized in a single-premium temporary immediate annuity pricing model. This model facilitates the calculation of annuity prices for Australian policyholders across various starting ages and maturity dates. The integration of these forecast outcomes into the annuity pricing model offers insights into the pricing dynamics for single-premium annuities for policyholders in Australia. The annuity pricing results also come with 95% pointwise prediction intervals, offering a measure of forecast uncertainty.
We compare the annuity prices derived from survival functions with the Challenger immediate payment annuity prices for each starting age. It appears that Challenger sets higher annuity prices for older age groups, which may be due to the inherent uncertainty in long-term mortality forecasts. To ensure reproducibility, the Risks 12 00117 i001 codes for generating point and interval forecasts using our method are available at https://www.github.com/wondersky8086/survival-function-forecast.git (accessed on 19 June 2024).
There are several ways in which the methodology presented here can be further extended, and we briefly mention two:
(1)
We consider an independent functional time-series forecasting method to separately model and forecast female and male survival functions. One could consider a joint modeling technique, such as multivariate and multilevel functional time-series methods of Shang and Kearney (2022), which captures the possible correlation between the series.
(2)
Our data consist of all available years from the Human Mortality Database (2024), but it may be subject to structural change. One possible idea is to apply a change point detection method from Shang and Xu (2022) to identify potential change points, and then fit our forecasting method to a truncated sample period.

Author Contributions

Writing—original draft, Shaokang Wang; Supervision, Han Lin Shang, Leonie Tickle and Han Li. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: https://www.github.com/wondersky8086/survival-function-forecast.git (accessed on 19 June 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Sensitivity Testing for Annuity Price Calculation under Different Interest Rates

In Table A1, we compute the best estimated fixed-term annuity prices for various policy starting ages and maturity periods under different interest rates.
Table A1. Australian female and male annuity prices with 1% and 5% interest rates.
Table A1. Australian female and male annuity prices with 1% and 5% interest rates.
Maturity (T)
InterestSexAge51015202530354045
1%F654.789.1613.1116.519.1720.9521.7821.9822.00
704.738.9612.5515.3417.1317.9218.1018.12
754.648.5411.5013.3414.1114.2814.29
804.437.729.6810.4610.6210.63
854.046.347.227.397.4
903.354.544.764.78
952.492.922.96
1001.691.81
1051.15
M654.739.0212.8316.1318.8320.7921.7922.0222.03
704.678.7612.2214.9616.8417.7217.9117.92
754.548.2911.1412.9713.7713.9313.94
804.327.459.3410.0910.2410.25
853.916.086.897.047.05
903.234.334.524.53
952.412.812.84
1001.671.78
1051.17
5%F654.257.449.8011.4612.5313.1113.3413.3813.39
704.217.299.4410.8011.5211.7911.8411.84
754.136.988.759.659.9710.0210.03
803.956.367.547.927.998.00
853.615.315.845.925.93
903.013.904.044.04
952.262.592.61
1001.551.64
1051.06
M654.217.339.6111.2212.312.9513.2213.2713.27
704.157.139.2010.5511.311.5911.6511.65
754.056.788.499.399.719.779.77
803.856.157.287.667.727.72
853.505.105.595.665.67
902.903.723.843.85
952.192.492.51
1001.531.62
1051.08
In Table A2, we compute the 95% pointwise prediction intervals of the fixed-term annuity prices for various policy starting ages and maturity periods under different interest rates.
Table A2. The 95% pointwise prediction intervals of Australian female and male annuity prices with 1% and 5% interest rates.
Table A2. The 95% pointwise prediction intervals of Australian female and male annuity prices with 1% and 5% interest rates.
Interest Maturity (T)
RateSexAge51015202530
1%F65(4.62, 4.84)(8.52, 9.42)(11.58, 13.75)(13.67, 17.8)(14.77, 21.54)(15.18, 24.88)
70(4.50, 4.83)(7.98, 9.4)(10.37, 13.67)(11.64, 17.64)(12.12, 21.14)(12.24, 24.14)
75(4.28, 4.83)(7.14, 9.35)(8.69, 13.53)(9.28, 17.26)(9.43, 20.41)(9.45, 22.99)
80(3.94, 4.80)(5.96, 9.23)(6.76, 13.17)(6.96, 16.54)(6.99, 19.27)(6.99, 21.26)
85(3.42, 4.75)(4.67, 8.98)(4.99, 12.58)(5.03, 15.59)(5.04, 17.77)
90(2.89, 4.61)(3.58, 8.56)(3.68, 11.76)(3.68, 14.27)
95(2.15, 4.50)(2.4, 8.24)(2.41, 10.96)
100(1.59, 4.43)(1.66, 7.74)
105(1.13, 4.23)
M65(4.82, 4.84)(9.06, 9.42)(12.75, 13.76)(15.58, 17.87)(17.49, 21.74)(18.63, 25.37)
70(4.7, 4.84)(8.64, 9.42)(11.84, 13.75)(14.02, 17.83)(15.24, 21.66)(15.76, 25.11)
75(4.58, 4.84)(8.13, 9.41)(10.73, 13.72)(12.18, 17.76)(12.79, 21.4)(12.89, 24.2)
80(4.4, 4.83)(7.42, 9.39)(9.28, 13.65)(10.01, 17.49)(10.14, 20.48)(10.14, 21.78)
85(4.14, 4.83)(6.47, 9.35)(7.45, 13.41)(7.63, 16.48)(7.64, 17.99)
90(3.65, 4.79)(5.18, 9.11)(5.41, 12.43)(5.43, 13.95)
95(2.9, 4.67)(3.4, 8.15)(3.42, 9.78)
100(1.66, 4.11)(1.75, 5.92)
105(0.93, 2.87)
5%F65(4.12, 4.30)(6.96, 7.64)(8.79, 10.22)(9.82, 12.20)(10.26, 13.69)(10.40, 14.78)
70(4.01, 4.30)(6.55, 7.62)(7.99, 10.17)(8.62, 12.10)(8.81, 13.51)(8.85, 14.49)
75(3.82, 4.29)(5.92, 7.59)(6.86, 10.08)(7.15, 11.90)(7.21, 13.16)(7.22, 14.01)
80(3.53, 4.27)(5.02, 7.49)(5.50, 9.85)(5.61, 11.50)(5.62, 12.59)(5.62, 13.24)
85(3.08, 4.23)(4.01, 7.31)(4.2, 9.46)(4.22, 10.93)(4.23, 11.81)
90(2.62, 4.11)(3.14, 6.98)(3.20, 8.90)(3.20, 10.13)
95(1.96, 4.01)(2.15, 6.73)(2.16, 8.37)
100(1.46, 3.94)(1.52, 6.37)
105(1.04, 3.78)
M65(4.29, 4.30)(7.38, 7.64)(9.58, 10.23)(10.96, 12.23)(11.73, 13.78)(12.11, 14.97)
70(4.19, 4.30)(7.06, 7.64)(8.97, 10.21)(10.04, 12.21)(10.53, 13.74)(10.71, 14.87)
75(4.08, 4.3)(6.67, 7.63)(8.23, 10.20)(8.95, 12.17)(9.19, 13.63)(9.23, 14.55)
80(3.92, 4.29)(6.13, 7.61)(7.25, 10.16)(7.61, 12.03)(7.67, 13.23)(7.67, 13.67)
85(3.7, 4.29)(5.41, 7.58)(6.00, 10.01)(6.10, 11.52)(6.10, 12.13)
90(3.28, 4.26)(4.41, 7.41)(4.56, 9.40)(4.57, 10.16)
95(2.63, 4.16)(3.01, 6.71)(3.02, 7.70)
100(1.54, 3.68)(1.60, 5.02)
105(0.87, 2.61)

References

  1. Aue, Alexander, Diogo Dubart Norinho, and Siegfried Hörmann. 2015. On the prediction of stationary functional time series. Journal of the American Statistical Association: Theory and Methods 110: 378–92. [Google Scholar] [CrossRef]
  2. Basellini, Ugofilippo, Carlo Giovanni Camarda, and Heather Booth. 2023. Thirty years on: A review of the Lee-Carter method for forecasting mortality. International Journal of Forecasting 39: 1033–49. [Google Scholar] [CrossRef]
  3. Bergeron-Boucher, Marie-Pier, James E. Oeppen, Vladimir Canudas-Romo, and James W. Vaupel. 2017. Coherent forecasts of mortality with compositional data analysis. Demographic Research 37: 527–66. [Google Scholar] [CrossRef]
  4. Bergeron-Boucher, Marie-Pier, Violetta Simonacci, Jim Oeppen, and Michele Gallo. 2018. Coherent modeling and forecasting of mortality patterns for subpopulations using multiway analysis of compositions: An application to Canadian provinces and territories. The North American Actuarial Journal 22: 92–118. [Google Scholar] [CrossRef]
  5. Booth, Heather, John Maindonald, and Len Smith. 2002. Applying Lee-Carter under conditions of variable mortality decline. Population Studies 56: 325–36. [Google Scholar] [CrossRef] [PubMed]
  6. Booth, Heather, and Leonie Tickle. 2008. Mortality modelling and forecasting: A review of methods. Annals of Actuarial Science 3: 3–43. [Google Scholar] [CrossRef]
  7. Brown, Robert L., and Joanne McDaid. 2003. Factors affecting retirement mortality. North American Actuarial Journal 7: 24–43. [Google Scholar] [CrossRef]
  8. Cairns, Andrew, David Blake, and Kevin Dowd. 2006. A two-factor model for stochastic mortality with parameter uncertainty: Theory and calibration. Journal of Risk and Insurance 73: 687–718. [Google Scholar] [CrossRef]
  9. Cairns, Andrew, David Blake, Kevin Dowd, Guy Coughlan, D. Epstein, A. Ong, and I. Balevich. 2009. A quantitative comparison of stochastic mortality models using data from England and Wales and the United States. North American Actuarial Journal 13: 1–35. [Google Scholar] [CrossRef]
  10. Challenger. 2024. Challenger Website, Technical Report, Challenger Group Limited. Available online: https://www.challenger.com.au/ (accessed on 20 March 2024).
  11. Chatfield, Chris. 1993. Calculating interval forecasts. Journal of Business and Economics Statistics 11: 121–35. [Google Scholar] [CrossRef]
  12. Chatfield, Chris. 2000. Time-Series Forecasting. Boca Raton: Chapman & Hall. [Google Scholar]
  13. Chiou, Jeng-Min. 2012. Dynamical functional prediction and classification with application to traffic flow prediction. The Annals of Applied Statistics 6: 1588–614. [Google Scholar] [CrossRef]
  14. Hall, Peter, and Céline Vial. 2006. Assessing the finite dimensionality of functional data. Journal of the Royal Statistical Society: Series B 68: 689–705. [Google Scholar] [CrossRef]
  15. Human Mortality Database. 2024. University of California, Berkeley (USA), and Max Planck Institute for Demographic Research (Germany). Available online: www.mortality.org (accessed on 19 January 2024).
  16. Hyndman, Rob J., and Heather Booth. 2008. Stochastic population forecasts using functional data models for mortality, fertility and migration. International Journal of Forecasting 24: 323–42. [Google Scholar] [CrossRef]
  17. Hyndman, Rob J., and Han Lin Shang. 2009. Forecasting functional time series (with discussions). Journal of the Korean Statistical Society 38: 199–221. [Google Scholar] [CrossRef]
  18. Hyndman, Rob J., and Han Lin Shang. 2010. Rainbow plots, bagplots, and boxplots for functional data. Journal of Computational and Graphical Statistics 19: 29–45. [Google Scholar] [CrossRef]
  19. Hyndman, Rob J., and Han Lin Shang. 2024. ftsa: Functional Time Series Analysis, R Package Version 6.4; Available online: https://CRAN.R-project.org/package=ftsa (accessed on 1 June 2024).
  20. Hyndman, Rob J., and Md Shahid Ullah. 2007. Robust forecasting of mortality and fertility rates: A functional data approach. Computational Statistics and Data Analysis 51: 4942–56. [Google Scholar] [CrossRef]
  21. Hyndman, Rob J., Heather Booth, and Farah Yasmeen. 2013. Coherent mortality forecasting: The product-ratio method with functional time series models. Demography 50: 261–83. [Google Scholar] [CrossRef] [PubMed]
  22. Hyndman, Rob J., Anne B. Koehler, J.Keith Ord, and Ralph D. Snyder. 2008. Forecasting with Exponential Smoothing: The State Space Approach. Berlin: Springer. [Google Scholar]
  23. Karhunen, Kari. 1947. Über lineare Methoden in der Wahrscheinlichkeitsrechnung. Helsinki: Kirjapaino-oy sana. [Google Scholar]
  24. Lam, Clifford, and Qiwei Yao. 2012. Factor modeling for high-dimensional time series inference for the number of factors. The Annals of Statistics 40: 694–726. [Google Scholar] [CrossRef]
  25. Lee, Ronald D., and Lawrence R. Carter. 1992. Modeling and forecasting U.S. mortality. Journal of the American Statistical Assocation: Applications & Case Studies 87: 659–71. [Google Scholar]
  26. Li, Han, Colin O’Hare, and Xibin Zhang. 2015. A semiparametric panel approach to mortality modeling. Insurance: Mathematics and Economics 61: 264–70. [Google Scholar] [CrossRef]
  27. Loève, Michel. 1978. Probability Theory II, 4th ed. New York: Springer. [Google Scholar]
  28. Mercer, James. 1909. Functions of positive and negative type, and their connection with the theory of integral equations. Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character 209: 415–46. [Google Scholar]
  29. Oeppen, Jim. 2008. Coherent forecasting of multiple-decrement life tables: A test using japanese cause of death data. In In Proceedings of the European Population Conference, Barcelona, Spain, July 9–12; Available online: https://api.semanticscholar.org/CorpusID:1030770 (accessed on 1 May 2024).
  30. Peña, Daniel, and Ismael Sánchez. 2007. Measuring the advantages of multivariate vs. univariate forecasts. Journal of Time Series Analysis 28: 886–909. [Google Scholar] [CrossRef]
  31. Ramsay, J. O., and B. W. Silverman. 2006. Functional Data Analysis, 2nd ed. New York: Springer. [Google Scholar]
  32. Shang, Han Lin, and Fearghal Kearney. 2022. Dynamic functional time-series forecasts of foreign exchange implied volatility surfaces. International Journal of Forecasting 38: 1025–49. [Google Scholar] [CrossRef]
  33. Shang, Han Lin, and Ruofan Xu. 2022. Change point detection for COVID-19 excess deaths in Belgium. Journal of Population Research 39: 557–65. [Google Scholar] [CrossRef] [PubMed]
  34. Shang, Han Lin, and Steven Haberman. 2020. Forecasting age distribution of death counts: An application to annuity pricing. Annals of Actuarial Science 14: 150–69. [Google Scholar] [CrossRef]
  35. Siu, Johnny, Hang Li, and Mary Hardy. 2011. Measuring basis risk in longevity hedges. North American Actuarial Journal 15: 177–200. [Google Scholar]
  36. Sweeting, P. J. 2011. A trend-change extension of the Cairns-Blake-Dowd model. Annals of Actuarial Science 5: 143–62. [Google Scholar] [CrossRef]
  37. Wilmoth, John R., and Shiro Horiuchi. 1999. Rectangularization revisited: Variability of age at death within human populations. Demography 36: 475–95. [Google Scholar] [CrossRef]
  38. Yao, Fang, Hans-Georg Müller, and Jane-Ling Wang. 2005. Functional data analysis for sparse longitudinal data. Journal of the American Statistical Association: Theory and Methods 100: 577–90. [Google Scholar] [CrossRef]
Figure 1. A time series of survival functions for Australian females and males between ages 0 and 110+ from 1921 to 2020.
Figure 1. A time series of survival functions for Australian females and males between ages 0 and 110+ from 1921 to 2020.
Risks 12 00117 g001
Figure 2. A time series of logit-transformed age-specific survival functions for Australian females and males between ages 0 and 110+ from 1921 to 2020.
Figure 2. A time series of logit-transformed age-specific survival functions for Australian females and males between ages 0 and 110+ from 1921 to 2020.
Risks 12 00117 g002
Figure 3. Forecasts of age- and sex-specific survival functions from 2021 to 2065 obtained by the FPCR.
Figure 3. Forecasts of age- and sex-specific survival functions from 2021 to 2065 obtained by the FPCR.
Risks 12 00117 g003
Figure 4. Backtesting on the holdout survival functions in 2001, 2010 and 2020 in Australia.
Figure 4. Backtesting on the holdout survival functions in 2001, 2010 and 2020 in Australia.
Risks 12 00117 g004
Figure 5. MAFE and MAPE for Australian male and female data.
Figure 5. MAFE and MAPE for Australian male and female data.
Risks 12 00117 g005
Table 1. Sensitivity for choice of K.
Table 1. Sensitivity for choice of K.
Gender/KK = 1K = 2K = 6
Male0.93850.99840.9999
Female0.96290.99941.0000
Table 2. Point forecast accuracy by average across all forecast horizons.
Table 2. Point forecast accuracy by average across all forecast horizons.
FemaleMale
MethodMAFEMAPEMAFEMAPE
LC0.00492.69650.00733.7589
FPCR0.00321.64310.00321.9565
Table 3. Australian female and male annuity prices with different ages and maturities (T).
Table 3. Australian female and male annuity prices with different ages and maturities (T).
Maturity (T)
SexAge51015202530354045
Female654.578.4611.7114.3016.2017.3717.8818.0018.01
704.528.2811.2413.3714.6415.1715.2815.29
754.447.9010.3511.7512.3012.4212.42
804.247.178.789.399.509.51
853.875.936.656.786.79
903.224.294.474.48
952.402.792.82
1001.631.74
1051.12
Male654.538.3311.4713.9915.9117.2017.8117.9417.95
704.468.1010.9513.0514.3814.9615.0815.09
754.357.6810.0311.4312.0012.1112.11
804.146.928.489.069.169.17
853.755.686.356.476.48
903.104.094.254.26
952.332.682.71
1001.611.72
1051.14
Table 4. The 95% pointwise prediction intervals of age-specific survival functions, from which we calculate annuity prices with different ages and maturities (T) for Australian females and males, based on the assumption of a constant interest rate of 2.5%.
Table 4. The 95% pointwise prediction intervals of age-specific survival functions, from which we calculate annuity prices with different ages and maturities (T) for Australian females and males, based on the assumption of a constant interest rate of 2.5%.
Maturity (T)
SexAge51015202530
Female65(4.42, 4.63)(7.89, 8.7)(10.41, 12.26)(12.01, 15.35)(12.79, 18.00)(13.06, 20.20)
70(4.31, 4.62)(7.4, 8.67)(9.38, 12.2)(10.35, 15.22)(10.69, 17.71)(10.77, 19.68)
75(4.10, 4.62)(6.65, 8.63)(7.93, 12.07)(8.38, 14.92)(8.49, 17.16)(8.50, 18.86)
80(3.78, 4.59)(5.58, 8.52)(6.24, 11.77)(6.40, 14.35)(6.42, 16.28)(6.42, 17.59)
85(3.29, 4.54)(4.40, 8.30)(4.67, 11.27)(4.70, 13.57)(4.71, 15.11)
90(2.78, 4.42)(3.40, 7.92)(3.49, 10.56)(3.49, 12.48)
95(2.08, 4.31)(2.30, 7.63)(2.31, 9.87)
100(1.54, 4.24)(1.61, 7.19)
105(1.10, 4.05)
Male65(4.61, 4.63)(8.37, 8.7)(11.42, 12.27)(13.58, 15.4)(14.94, 18.15)(15.69, 20.54)
70(4.50, 4.63)(8.00, 8.69)(10.64, 12.25)(12.30, 15.38)(13.17, 18.09)(13.52, 20.36)
75(4.39, 4.63)(7.54, 8.68)(9.69, 12.23)(10.79, 15.32)(11.23, 17.9)(11.30, 19.75)
80(4.21, 4.62)(6.90, 8.67)(8.43, 12.18)(8.99, 15.11)(9.09, 17.23)(9.09, 18.10)
85(3.97, 4.62)(6.04, 8.63)(6.85, 11.98)(6.99, 14.32)(7.00, 15.40)
90(3.50, 4.59)(4.87, 8.42)(5.07, 11.15)(5.08, 12.33)
95(2.79, 4.47)(3.25, 7.57)(3.26, 8.92)
100(1.61, 3.94)(1.69, 5.56)
105(0.91, 2.77)
Table 5. Challenger female and male annual payments—Liquid Lifetime (immediate payments).
Table 5. Challenger female and male annual payments—Liquid Lifetime (immediate payments).
AMOUNT OF INFLATION PROTECTION
SexAgeWithdrawal PeriodFullPartialNilRBA Cash Linked
Female6522 years$4924$5965$6807$7063
7018 years$5583$6593$7366$7586
7514 years$6520$7500$8203$8385
8010 years$7986$8947$9586$9761
Male6519 years$5222$6256$7074$7315
7015 years$5997$7006$7760$7966
7512 years$7026$8001$8684$8862
809 years$8709$9666$10,284$10,473
Table 6. The survival function annuity prices were compared to those in Challenger (2024). The highlighted values represent payment rates without inflation protection, providing a basis for comparison with the payment rates derived from survival function results.
Table 6. The survival function annuity prices were compared to those in Challenger (2024). The highlighted values represent payment rates without inflation protection, providing a basis for comparison with the payment rates derived from survival function results.
Inflation ProtectionSurvival Function
SexAgeFullPartialNilRBA1%2.5%5%
Female65$20.31$16.76$14.69$14.16$22.00$18.01$13.39
70$17.91$15.17$13.58$13.18$18.12$15.29$11.84
75$15.34$13.33$12.19$11.93$14.29$12.42$10.03
80$12.52$11.18$10.43$10.24$10.63$9.51$8.00
Male65$19.15$15.98$14.14$13.67$22.03$17.95$13.27
70$16.68$14.27$12.89$12.55$17.92$15.09$11.65
75$14.23$12.50$11.52$11.28$13.94$12.11$9.77
80$11.48$10.35$9.72$9.55$10.25$9.17$7.72
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, S.; Shang, H.L.; Tickle, L.; Li, H. Forecasting Age- and Sex-Specific Survival Functions: Application to Annuity Pricing. Risks 2024, 12, 117. https://doi.org/10.3390/risks12070117

AMA Style

Wang S, Shang HL, Tickle L, Li H. Forecasting Age- and Sex-Specific Survival Functions: Application to Annuity Pricing. Risks. 2024; 12(7):117. https://doi.org/10.3390/risks12070117

Chicago/Turabian Style

Wang, Shaokang, Han Lin Shang, Leonie Tickle, and Han Li. 2024. "Forecasting Age- and Sex-Specific Survival Functions: Application to Annuity Pricing" Risks 12, no. 7: 117. https://doi.org/10.3390/risks12070117

APA Style

Wang, S., Shang, H. L., Tickle, L., & Li, H. (2024). Forecasting Age- and Sex-Specific Survival Functions: Application to Annuity Pricing. Risks, 12(7), 117. https://doi.org/10.3390/risks12070117

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop