Next Article in Journal
Some Results on Measures of Interaction between Paired Risks
Previous Article in Journal
Moments of Compound Renewal Sums with Dependent Risks Using Mixing Exponential Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Evaluation of the Distribution of a General Multivariate Collective Model: Recursions versus Fast Fourier Transform

1
Faculty of Mathematics and Computer Science, Ovidius University of Constanta, 900527 Constanta, Romania
2
Institute for Mathematical Statistics and Applied Mathematics, 050711 Bucharest, Romania
Risks 2018, 6(3), 87; https://doi.org/10.3390/risks6030087
Submission received: 26 July 2018 / Revised: 22 August 2018 / Accepted: 23 August 2018 / Published: 26 August 2018

Abstract

:
With the purpose of introducing dependence between different types of claims, multivariate collective models have recently gained a lot of attention. However, when it comes to the evaluation of the corresponding compound distribution, the problems increase with the dimensionality of the model. In this paper, we consider a multivariate collective model that generalizes a model already studied from the point of view of recursive and FFT evaluation of its distribution, and we extend the same study to the general model. With the intention to see which method works better for this general model, we compare the recursive method with the FFT technique, and emphasize the advantages and drawbacks of each one, based on numerical examples.

1. Introduction

Recently, Robe-Voinea and Vernic (2016a, 2016b, 2017, 2018) and Vernic (2018) studied the recursive and Fast Fourier Transform (FFT)-based evaluation of the distribution of the following multivariate collective model:
( S 1 , , S m ) = l = 0 N 1 U 1 l + k = 0 N 0 L 1 k , , l = 0 N m U m l + k = 0 N 0 L m k , m 2 ,
which may arise in different contexts (see, e.g., the discussion in Section 14.1 of Reference Sundt and Vernic (2009)), from which we mention the case where a policyholder has m types of policies, such as auto, home, business, etc., that can be simultaneously affected by some claim events, such as floods, storms or earthquakes. More precisely, in this case, l = 0 N j U j l denotes the aggregate claims affecting solely the policy of type j, while N 0 denotes the random variable (r.v.) number of claims simultaneously affecting all m types of policies, with L j k denoting the size of the kth such claim corresponding to the policy of type j. The assumptions under which this model was considered are: Each set of claim sizes ( U j l ) l 1 are non-negative, independent and identically distributed (i.i.d.) r.v.s, 1 j m ; they are also independent of the claim numbers and of the other claim sizes, ( L 1 k , , L m k ) included; the random vectors ( L 1 k , , L m k ) k 1 are non-negative i.i.d. as the generic random vector L = ( L 1 , , L m ) , and independent of the claim numbers, while the components of L , however, are dependent; by convention, U j 0 = L j 0 = 0 , 1 j m .
Note that the above model assumes that a claim event affects either a single type of insurance line or all the insurance lines at once; there is no middle way, i.e., an event cannot affect only, say, lines 1 and 2, without causing claims in the other lines.
To overcome this drawback, in this paper we consider the more general multivariate collective model:
S = ( S 1 , , S m ) = k = 1 m 1 i 1 < < i k m i = 0 N i 1 i k X i i 1 i k ,
where
  • The m-variate claim size random vectors X i i 1 i k i 1 are i.i.d. as the generic m-variate random vector X i 1 i k , whose jth univariate component X j i 1 i k = 0 if j i 1 , , i k , meaning that X i 1 i k results from those claim events simultaneously affecting solely the lines i 1 , , i k ; these events are counted by the r.v. N i 1 i k . Moreover, the X i i 1 i k s are also independent of the other claim size random vectors (i.e., of each X i j 1 j t i 1 , where i 1 , , i k j 1 , , j t ) and of the claim numbers. We let X i , j i 1 i k denote the jth univariate component of X i i 1 i k , f i 1 i k the probability function (p.f.) of X i 1 i k (in the discrete case) and, by convention, X 0 i 1 i k = 0 .
  • The components of the random vector number of claims N = N i 1 i k 1 k m ; 1 i 1 < < i k m are dependent r.v.s, in total (maximum) ν = 2 m 1 .
We adopt the actuarial terminology in which the distribution of S is called “compound” and the distribution of N is called “counting”.
To evaluate the distribution of this model, we shall consider that all the claim distributions are of the discrete type (e.g., they have been previously discretized; this is a usual assumption for collective models). We start the next section by presenting the exact formula of the p.f. of S based on convolutions, which, unfortunately, is unpractical. Therefore, we also aim at developing recursions for the evaluation of this distribution, an approach that requires the introduction of supplementary assumptions under which it is possible to obtain recursive formulas; examples of such recursions are given in Section 2.1. Apart from the restrictive assumptions, another important drawback of recursions is that they become very time consuming when the dimensionality m of the model increases (see the numerical examples in Section 2.3). To overcome these drawbacks, in Section 2.2 we propose the use of the Fast Fourier Transform (FFT) technique, which can be applied whenever we know the form of the characteristic function of S and which is very efficient when we want to evaluate the distribution’s tail. However, this remarkably fast method is an approximate one, and we must pay a special attention to its specific errors; this aspect is illustrated by the numerical examples discussed in Section 2.3.
For simplicity, let us introduce more notation: We denote by f S the p.f. of S , by g and φ the probability generating function (pgf) and the characteristic function (cf), respectively, of a r.v., which will be indexed with the r.v.’s name. Also, n , t , x , y are vectors whose corresponding dimension results from the context, 0 is the zero-vector, while the difference x y is componentwise. By x + = i = 1 m x i we denote the sum of the components of the vector x and by f n the n-fold convolution of f. To shorten the formulas, we rewrite the sum k = 1 m 1 i 1 < < i k m as 1 k m 1 i 1 < < i k m .

2. Evaluation of the Compound Distribution

We start by presenting the exact formula of the p.f. of S based on convolutions. This formula is so complex that, in general, it cannot be directly applied to find the distribution of S .
Proposition 1.
The p.f. of the multivariate collective model (2) is given by
f S ( x ) = n Pr N = n x = x i 1 i k 1 k m 1 i 1 < < i k m 1 k m 1 i 1 < < i k m f i 1 i k n i 1 i k x i 1 i k ,
where n = ( n i 1 i k ) 1 k m ; 1 i 1 < < i k m , n i 1 i k N .
Proof. 
We have
f S ( x ) = Pr S = x = n Pr N = n Pr k = 1 m 1 i 1 < < i k m i = 0 n i 1 i k X i i 1 i k = x = n Pr N = n x = x i 1 i k 1 k m 1 i 1 < < i k m Pr 1 k m 1 i 1 < < i k m i = 0 n i 1 i k X i i 1 i k = x i 1 i k ,
which immediately yields the result. ☐
We shall also need the pgf and the cf of S .
Proposition 2.
The pgf and cf of the general multivariate collective model (2) are, respectively, given by
g S ( t ) = g N g X i 1 i k t 1 k m ; 1 i 1 < < i k m ,
φ S ( t ) = g N φ X i 1 i k t 1 k m ; 1 i 1 < < i k m .
Proof. 
We prove only the pgf formula (the one for the cf follows along the same lines). Considering the independence assumptions of the model, we have
g S ( t ) = E j = 1 m t j S j = E j = 1 m t j k = 1 m 1 i 1 < < i k m i = 0 N i 1 i k X i , j i 1 i k = E k = 1 m 1 i 1 < < i k m E j = 1 m t j i = 0 N i 1 i k X i , j i 1 i k N = E k = 1 m 1 i 1 < < i k m g X i 1 i k N i 1 i k t ,
hence the formula (3). ☐

2.1. Recursive Evaluation

Due to the difficulty of directly applying the exact formula from Proposition 1, we present in the following examples of alternative recursive formulas for obtaining the p.f. of S under some supplementary assumptions. These assumptions are chosen such that the multivariate compound distribution of S can be rewritten as a compound distribution with a univariate counting distribution, for which we can apply the already existing recursions.

2.1.1. Case 1 Assumptions

As in Reference Robe-Voinea and Vernic (2017), we assume that N follows the multivariate Poisson distribution M P o ( λ ; λ ˜ ) with parameters λ > 0 and λ ˜ = ( λ i 1 i k ) 1 k m ; 1 i 1 < < i k m > 0 , having the pgf (see, e.g., Johnson et al. (1997))
g N t = exp λ 1 k m 1 i 1 < < i k m t i 1 i k 1 + k = 1 m 1 i 1 < < i k m λ i 1 i k t i 1 i k 1 .
As a consequence, Proposition 2 easily yields the following pgf and cf
g S ( t ) = exp λ 1 k m 1 i 1 < < i k m g X i 1 i k t 1 + 1 k m 1 i 1 < < i k m λ i 1 i k g X i 1 i k t 1 ,
φ S ( t ) = exp λ 1 k m 1 i 1 < < i k m φ X i 1 i k t 1 + 1 k m 1 i 1 < < i k m λ i 1 i k φ X i 1 i k t 1 .
Also, two recursive formulas for evaluating the distribution of S are obtained in the following proposition, where we denote by f X + the p.f. of the sum r.v. 1 k m ; 1 i 1 < < i k m X i 1 i k .
Proposition 3.
Under the assumption that N M P o ( λ ; λ ˜ ) it holds that
f S ( x ) = k = 1 m 1 i 1 < < i k m l i 1 , , i k λ i 1 i k 0 < y i 1 , , y i k x i 1 , , x i k y i k + 1 = = y i m = 0 y l x l f i 1 i k y f S ( x y ) + λ 0 < y x y l x l f X + y f S ( x y ) , x l 1 , x j 0 , j l ,
and
f S ( x ) = 1 x + 1 k m 1 i 1 < < i k m λ i 1 i k 0 < y i 1 , , y i k x i 1 , , x i k y i k + 1 = = y i m = 0 y + f i 1 i k y f S ( x y )      + λ 0 < y x y + f X + y f S ( x y ) , x + 1 ,
with starting value
f S ( 0 ) = exp λ 1 k m 1 i 1 < < i k m f i 1 i k 0 + 1 k m 1 i 1 < < i k m λ i 1 i k f i 1 i k 0 λ + ,
where λ + = λ + 1 k m ; 1 i 1 < < i k m λ i 1 i k . In the above formulas, y = y 1 , , y m is such that y i 1 , , y i m is a permutation of its components.
Proof. 
Due to the independence of the random vectors X i 1 i k , we have that g X + = 1 k m ; 1 i 1 < < i k m g X i 1 i k ; therefore, we can rewrite the pgf (5) as
g S ( t ) = exp λ + λ λ + g X + t 1 + 1 k m 1 i 1 < < i k m λ i 1 i k λ + g X i 1 i k t 1 = exp λ + λ λ + g X + t + 1 k m 1 i 1 < < i k m λ i 1 i k λ + g X i 1 i k t 1 ,
meaning that in this case, the distribution of Model (2) is also a compound distribution, with a univariate Poisson counting distribution. More precisely, S can also be rewritten as
S = k = 0 N ˜ C k ,
where N ˜ P o λ + , C 0 = 0 , while the random vectors C 1 , C 2 , are i.i.d. as the m-variate random vector C having the mixture p.f.
f C ( x ) = λ λ + f X + ( x ) + 1 k m ; 1 i 1 < < i k m λ i 1 i k λ + f i 1 i k ( x ) .
Regarding model (8), with N ˜ satisfying Panjer’s recursion (see Panjer (1981)) with parameters a , b R , i.e.,
Pr N ˜ = n = a + b n Pr N ˜ = n 1 , n 1 ,
from Reference Sundt (1999) (see, also, formulas (15.4) and, respectively, (15.5) in Sundt and Vernic (2009)) it holds that
f S ( x ) = 1 1 a f C ( 0 ) 0 < y x a + b y l x l f C ( y ) f S ( x y ) , x l 1 ,
f S ( x ) = 1 1 a f C ( 0 ) 0 < y x a + b y + x + f C ( y ) f S ( x y ) , x > 0 .
Since in our case N ˜ P o λ + , we have a = 0 and b = λ + . Based on this, we insert Equation (9) into Equation(10) and obtain for x l 1 ,
f S ( x ) = λ + 0 < y x y l x l λ λ + f X + ( y ) + 1 k m ; 1 i 1 < < i k m λ i 1 i k λ + f i 1 i k ( y ) f S ( x y ) .
We know that X j i 1 i k = 0 if j i 1 , , i k , hence, concerning the argument y of f i 1 i k y , we can take the components y i k + 1 = = y i m = 0 . Therefore, if l i 1 , , i k , clearly y l = 0 in the argument y of f i 1 i k y , which yields the first stated formula. The second formula results in a similar way by inserting Equation (9) into Equation (11), while the starting value is immediate from f S ( 0 ) = g S ( 0 ) and from the above form of g S . This completes the proof. ☐

2.1.2. Case 2 Assumptions

Similarly to Robe-Voinea and Vernic (2016a, 2016b), the supplementary assumptions are now:
A1 
The p.f. of the total number of claims N t o t = 1 k m ; 1 i 1 < < i k m N i 1 i k satisfies Panjer’s recursion for a , b R .
A2 
Given N t o t = n , the conditional distribution of the random vector number of claims N is assumed to be multinomial M n o m ( n ; p ) with parameters n N and p = ( p i 1 i k ) 1 k m ; 1 i 1 < < i k m , where p i 1 i k ( 0 , 1 ) such that 1 k m ; 1 i 1 < < i k m p i 1 i k = 1 . Therefore, with n = ( n i 1 i k ) 1 k m ; 1 i 1 < < i k m and n = 1 k m ; 1 i 1 < < i k m n i 1 i k ,
Pr ( N = n | N t o t = n ) = n ! 1 k m ; 1 i 1 < < i k m n i 1 i k ! 1 k m ; 1 i 1 < < i k m p i 1 i k n i 1 i k .
Under these assumptions, the pgf, the cf of S and two alternative recursive formulas are presented in the following.
Proposition 4.
Under the assumptions (A1 and A2), the pgf and cf of the general multivariate collective model (2) become, respectively,
g S ( t ) = g N t o t k = 1 m 1 i 1 < < i k m p i 1 i k g X i 1 i k t ,
φ S ( t ) = g N t o t k = 1 m 1 i 1 < < i k m p i 1 i k φ X i 1 i k t .
Proof. 
To obtain the pgf formula, we recall that the pgf of the multinomial distribution M n o m ( n ; p ) is (see, e.g., Johnson et al. (1997)) g ( t ) = 1 k m ; 1 i 1 < < i k m p i 1 i k t i 1 i k n , so that the pgf of N becomes for t = t i 1 i k 1 k m ; 1 i 1 < < i k m ,
g N ( t ) = E E k = 1 m 1 i 1 < < i k m t i 1 i k N i 1 i k N t o t = E k = 1 m 1 i 1 < < i k m p i 1 i k t i 1 i k N t o t = g N t o t k = 1 m 1 i 1 < < i k m p i 1 i k t i 1 i k .
Inserting this into Equation (3) easily yields Equation (12). Equation (13) follows in a similar way, which completes the proof. ☐
Proposition 5.
Under the assumptions (A1 and A2) of Model (2), with starting value
f S ( 0 ) = g S ( 0 ) = g N t o t k = 1 m 1 i 1 < < i k m p i 1 i k f i 1 i k 0 ,
the following recursive formula holds for x l 1 , x j 0 , j l ,
f S ( x ) = K k = 1 m 1 i 1 < < i k m l i 1 , , i k p i 1 i k 0 < y i 1 , , y i k x i 1 , , x i k y i k + 1 = = y i m = 0 a + b y l x l f i 1 i k y f S ( x y ) + a 1 i 1 < < i k m l i 1 , , i k p i 1 i k 0 < y i 1 , , y i k x i 1 , , x i k y i k + 1 = = y i m = 0 f i 1 i k y f S ( x y ) ,
while for x + > 0 ,
f S ( x ) = K 1 k m 1 i 1 < < i k m p i 1 i k 0 < y i 1 , , y i k x i 1 , , x i k y i k + 1 = = y i m = 0 a + b y + x + f i 1 i k y f S ( x y ) ,
where K = 1 a 1 k m ; 1 i 1 < < i k m p i 1 i k f i 1 i k 0 1 and y = y 1 , , y m is such that y i 1 , , y i m is a permutation of its components.
Proof. 
Considering the assumptions (A1 and A2), we rewrite Model (2) as
S = k = 0 N t o t C k ,
where C 0 = 0 , while the random vectors C 1 , C 2 , are i.i.d. as the m-variate random vector C with the p.f.
f C ( y ) = k = 1 m 1 i 1 < < i k m p i 1 i k f i 1 i k y .
We use again Equations (10) and (11). By inserting Equation (16) into Equation (10), the stated formula of the constant K is easily obtained and, for x l 1 ,
f S ( x ) = K 0 < y x a + b y l x l k = 1 m 1 i 1 < < i k m p i 1 i k f i 1 i k y f S ( x y ) = K k = 1 m 1 i 1 < < i k m p i 1 i k 0 < y x a + b y l x l f i 1 i k y f S ( x y ) .
Using reasoning similar with the one used in the proof of Proposition 3, we obtain Equation (14). Similarly, Equations (11) and (16) lead to Equation (15). This completes the proof. ☐
Particular case: m = 3 . Let us now have a look at a recursive formula in the trivariate case, where the general Model (2) is S = ( S 1 , S 2 , S 3 ) with
S 1 = i = 0 N 1 X i , 1 1 + i = 0 N 12 X i , 1 12 + i = 0 N 13 X i , 1 13 + i = 0 N 123 X i , 1 123 , S 2 = i = 0 N 2 X i , 2 2 + i = 0 N 12 X i , 2 12 + i = 0 N 23 X i , 2 23 + i = 0 N 123 X i , 2 123 , S 3 = i = 0 N 3 X i , 3 3 + i = 0 N 13 X i , 3 13 + i = 0 N 23 X i , 3 23 + i = 0 N 123 X i , 3 123 .
For example, Equation (15) becomes
f S ( x ) = K i = 1 3 p i 1 y i x i y 2 = y 3 = 0 a + b y i x + f i y f S ( x y ) + 1 i 1 < i 2 3 p i 1 i 2 0 < y i 1 , y i 2 x i 1 , x i 2 y i 3 = 0 a + b y i 1 + y i 2 x + f i 1 i 2 y f S ( x y ) + p 123 0 < y x a + b y + x + f 123 y f S ( x y ) ,
where K = 1 a i = 1 3 p i f i 0 + 1 i 1 < i 2 3 p i 1 i 2 f i 1 i 2 0 + p 123 f 123 0 1 .

2.1.3. Case 3 Assumptions

Another assumption under which recursive formulas already exist is the univariate mixed Poisson counting distribution. To this purpose, we assume that, given that a positive univariate r.v. Θ takes the value θ , the r.v.s N i 1 i k are all i.i.d. Poisson distributed such that N i 1 i k Θ = θ P o θ λ i 1 i k , λ i 1 i k > 0 , 1 k m , 1 i 1 < < i k m . Then, the pgf of S given Θ = θ becomes, from Equation (3):
g S Θ = θ ( t ) = g N Θ = θ g X i 1 i k t 1 k m ; 1 i 1 < < i k m = exp θ λ + 1 k m ; 1 i 1 < < i k m λ i 1 i k λ + g X i 1 i k t 1 ,
where λ + = 1 k m ; 1 i 1 < < i k m λ i 1 i k . This is the pgf of a compound distribution with univariate Poisson P o θ λ + counting distribution and multivariate claims distribution having p.f. h = 1 k m ; 1 i 1 < < i k m λ i 1 i k λ + f i 1 i k ; hence, the conditional distribution of S , given Θ = θ , can be evaluated based on Equations (10) and (11), with a = 0 and b = θ λ + . To find the unconditional distribution of S , we use the technique described in Chapter 20 of Sundt and Vernic (2009). Therefore, with U denoting the distribution function of Θ , we introduce the auxiliary functions
v i x = 0 θ i f S Θ = θ ( x ) d U θ , i = 0 , 1 , 2 , ,
and note that f S = v 0 . Multiplying Equations (10) and (11) by θ i and integrating yields the following two recursions for v i
v i x = λ + x l 0 < y x y l h y v i + 1 x y , x l 1 , v i x = λ + x + 0 < y x y + h y v i + 1 x y , x + > 0 ,
with starting value v i 0 = 0 θ i e θ λ + h 0 1 d U θ . Therefore, the algorithm for evaluating f S y for all 0 y x is more complex and implies the backward evaluation of all v i y , 0 y x , i = 1 , , x + (here backward means by decreasing i, see, e.g., the algorithm in Section 20.4.1 in Reference Sundt and Vernic (2009)). Being very time consuming, we don’t insist on this algorithm. However, we note that the recursions can be refined under the assumption that the mixing distribution U is of the continuous type, with the density denoted by u satisfying the condition
d d θ ln u θ = i = 0 k η i θ i i = 0 k χ i θ i .
This is also called Willmot’s mixing distribution, see Reference Willmot (1993).
Remark 1.
In view of the FFT, we also display the formula of the cf of S given Θ = θ ,
φ S Θ = θ ( t ) = exp θ 1 k m ; 1 i 1 < < i k m λ i 1 i k φ X i 1 i k t λ + ,
where
φ S ( t ) = 0 φ S Θ = θ ( t ) d U θ .
Particular case: Simpler recursions are obtained when Θ is gamma G a δ , β distributed, with δ , β > 0 . In this case, the univariate mixed Poisson P o θ λ + distribution becomes a Negative Binomial distribution N B δ , β β + λ + , which satisfies Panjer’s recursion with a = λ + β + λ + and b = δ 1 λ + β + λ + . Since
f S ( x ) = 0 f S Θ = θ ( x ) d U θ = n h n ( x ) 0 Pr N ˜ θ = n d U θ = n Pr N ˜ = n h n ( x ) ,
where N ˜ θ P o θ λ + , hence N ˜ N B δ , β β + λ + , and it follows that we can use Equations (10) and (11) to obtain direct recursions for f S , i.e.,
f S ( x ) = λ + β + λ + λ + h 0 0 < y x 1 + δ 1 y l x l h ( y ) f S ( x y ) , x l 1 , f S ( x ) = λ + β + λ + λ + h 0 0 < y x 1 + δ 1 y + x + h ( y ) f S ( x y ) , x > 0 ,
with starting value f S ( x ) = β β + λ + λ + h 0 δ . Moreover, regarding the cf, we easily obtain
φ S ( t ) = β δ Γ δ 0 φ S Θ = θ ( t ) θ δ 1 e β θ d θ = β β 1 k m ; 1 i 1 < < i k m λ i 1 i k φ X i 1 i k t + λ + δ .

2.2. Fast Fourier Transform Evaluation

The recursive method is an exact one, but, as already mentioned in the introduction, it has some important drawbacks: It can be applied only on some particular models and it becomes quite slow with the increasing of the dimensionality of S . A much faster and less restrictive way to evaluate the p.f. of S is provided by the Fast Fourier Transform method, which is an approximate technique used to strongly reduce the computing time, especially when evaluating the distribution’s tail. As an advantage, this method can be applied to any model as long as its cf (4) (on which it is based) has a closed form, even if there is no recursive formula available. Therefore, the FFT technique received special consideration in the actuarial literature (see, e.g., References Bühlmann (1984), Embrechts et al. (1993), Jin and Ren (2014) or Robe-Voinea and Vernic (2018)). It consists of an algorithm that computes the discrete Fourier transform of a multivariate function, as well as its inverse, extremely fast. Let f x denote an m-variate function defined on the integer support × m j = 1 0 , 1 , , r j 1 ; then its discrete Fourier transform, f ˜ , and, respectively, the inverse mapping, can defined by (definition consistent with the functions fftn and ifftn in Matlab)
f ˜ ( c ) = x 1 = 0 r 1 1 x m = 0 r m 1 f ( x ) exp 2 π i j = 1 m x j c j r j , c j = 0 , , r j 1 , 1 j m , f ( x ) = 1 j = 1 m r j c 1 = 0 r 1 1 c m = 0 r m 1 f ˜ ( c ) exp 2 π i j = 1 m x j c j r j , x j = 0 , , r j 1 , 1 j m .
In general, the FFT method requires that the values r j are powers of two for all j. For the multivariate model (2), this algorithm becomes:
FFT Algorithm for model (2)
Step 1. After setting the truncation point for each claim size random vector X i 1 i k at the same r = ( r 1 , , r m ) , the corresponding truncated claim size distribution is obtained as f i 1 i k = f i 1 i k j 0 j r 1 ; if necessary, the resulting f i 1 i k will be filled with zeros (e.g., to constraint the r j s to be powers of two).
Step 2. Apply the m-dimensional FFT to each f i 1 i k , which results in the multidimensional table f ˜ i 1 i k .
Step 3. Use Equation (4) in the general case to obtain the discrete cf φ ˜ S ( j ) = g N f ˜ i 1 i k j 1 k m ; 1 i 1 < < i k m , 0 j r 1 .
Step 4. Apply the multidimensional IFFT to φ ˜ S to obtain the p.f. of S .
Usually, to find the optimal r j s, one gradually increases them until the differences between the actual solutions and the previous ones are under a certain threshold (e.g., we increase r j as 32, 64, 128, 256 etc.). However, when dealing with heavy tailed claim size distributions, the results of this method can be strongly affected by a specific error caused by the discrete Fourier transform, which consists of placing under the truncation point the compound probability mass which is in fact above this point. This so-called “aliasing error” (AE) can be significantly reduced by applying to the claim size distributions an exponential change of measure, hence, forcing the tails of these distributions to decrease at an exponential rate; this transformation is known under the name of “exponential tilting” (for more details on this transformation see, e.g., Reference Grübel and Hermesmeier (1999)).
Particular cases: Under the particular assumptions considered in the previous section to allow for a recursive evaluation, one should use the following formulas at Step 3 of the above algorithm:
-
When N M P o ( λ ; λ ˜ ) , φ ˜ S is given by Equation (6);
-
Under the Case 2 assumptions (A1 and A2), φ ˜ S is given by Equation (13);
-
Under the Case 3 mixed Poisson assumption, φ ˜ S is given by Equation (18).

2.3. Numerical Illustration

In this section, we consider a particular trivariate model (2) with
S 1 = i = 0 N 1 X i , 1 1 + i = 0 N 12 X i , 1 12 + i = 0 N 13 X i , 1 13 + i = 0 N 123 X i , 1 123 , S 2 = i = 0 N 2 X i , 2 2 + i = 0 N 12 X i , 2 12 + i = 0 N 123 X i , 2 123 , S 3 = i = 0 N 3 X i , 3 3 + i = 0 N 13 X i , 3 13 + i = 0 N 123 X i , 3 123 ,
for which we implemented both the recursive formulas and the FFT algorithm, under different assumptions.
As claim size distributions, we considered only type II Pareto distributions with the purpose to emphasize the effect of the exponential tilting on the FFT technique. We recall that the decumulative distribution (or survival) function of the m-variate type II Pareto distribution P a m I I α , σ i i = 1 , , m , α , σ i > 0 , i = 1 , , m , is given by
F ¯ x = 1 + i = 1 m x i σ i α , x i > 0 , i = 1 , , m .
The expected value of each marginal exists only if α > 1 , while the variance exists only when α > 2 . We took (mainly from the numerical Example 4 in Reference Robe-Voinea and Vernic (2018))
X 1 P a 1 I I 1 , 1 , X 2 P a 1 I I 2 , 2 , X 3 P a 1 I I 3 , 1 , X 12 P a 2 I I 1.5 , 1 , 2 , X 13 P a 2 I I 2 , 1 , 1 , X 123 P a 3 I I 1.5 ; 2 , 2 , 2 .
The expected value of X 1 and the variances of X 2 , X 12 , X 13 , X 123 do not exist, hence we can see the effect of the exponential tilting in the heavy-tailed case. To discretize these distributions, we used the method of rounding considering the span h = 1 (good enough for illustration, but not optimal, see the discussion in Reference Robe-Voinea and Vernic (2018)).
Concerning the FFT method, as discussed in Section 2.2, we increased the truncation point r = r , r , r (we took r 1 = r 2 = r 3 = r for simplicity) from 16 till 128 (unfortunately, r = 256 generated an “out of memory” warning), and noticed that r = 128 yielded enough accurate results (for our data) compared to the exact method (see Tables 1, 3 and 5). Moreover, we also varied the tilting parameter θ = θ 1 = θ 2 = θ 3 and noticed that an increasing of θ improves the results till θ = 7 / r , while a larger value like θ = 9 / r doesn’t significantly improve the results (see Table 4 in Example 2).
As expected, there is an important difference between the computing times requested by the two methods. This difference increases with the increasing of the truncation point r and becomes really huge for r = 32 in Example 1 and for r = 128 in Examples 2 and 3. Therefore, we decided to compare the resulting p.f.s only up to a certain right endpoint denoted by x M = x M 1 , x M 1 , x M 1 , even if the support of the FFT was much larger. Note that the discretization time was not taken into account in the displayed computing times since discretization is needed by both methods (the total discretization time up to r = 128 was about 160 s).
To emphasize the differences between the FFT and the recursive results, we used the cumulative distribution function (cdf), the AE and the maximum absolute error evaluated between the exact p.f. and the FFT one; these last two are defined, respectively, by
A E = 0 x x M f S ( x ) f S F F T ( x ) , M a x . e r r = max 0 x x M f S ( x ) f S F F T ( x ) .
We shall now present three examples based on the three particular cases considered in Section 2.1. From these examples, we also note that in cdf terms, F S F F T ( x M ) F S ( x M ) , an inequality caused by the AE that places compound mass below the truncation point.
Example 1.
We assume that N M P o ( λ ; λ ˜ ) , where λ = 1 , λ 1 = 3 , λ 2 = 3 , λ 3 = 2 , λ 12 = 2 , λ 13 = 1.7 , λ 123 = 1.5 ; since for this particular model, the recursive method (we implemented Equation (7)) implies the evaluation of the p.f. f X + (i.e., multivariate convolutions), the corresponding computing time increases tremendously with r . Therefore, starting with r = 32 , we took only x M = 20 , which needed about 30 minutes only for the convolution part. However, the FFT was ready in only a few s even for r = 128 , see Table 1, where we also display a comparison of the accuracy of the two methods. This example clearly emphasizes the speed discrepancy between the two methods and the important advantage of the FFT speed.
Example 2.
We now assume that N t o t follows a Poisson distribution P o λ = 5 , for which we recall that a = 0 , b = λ and g N t o t = e λ t 1 . Numerically, we took the multinomial parameters p 1 = 0.3 , p 2 = 0.2 , p 3 = 0.2 , p 12 = 0.15 , p 13 = 0.1 , p 123 = 0.05 . We implemented the recursive Equation (17) and performed it up to the maximum x M = 70 in about 35 min. The speed difference between the two methods can be seen in Table 2, where we displayed the relative computing times Rec/FFT (for r = 128 , FFT took about 8 s).
The accuracy comparison of the two methods is presented in Table 3 and the effect of changing the tilting parameters in Table 4, both supporting the above conclusions regarding the choices of r and θ .
Example 3.
This example is related to Case 3, i.e., N follows a mixed Poisson distribution and, for simplicity, we let Θ G a δ , β . Therefore, we implemented recursion (19) and the FFT based on Equation (20). The values of the parameters are: δ = β = 2 , λ 1 = λ 2 = 2.5 , λ 3 = λ 12 = 2 , λ 13 = 1.7 , λ 123 = 1.5 . The comparison between the two methods is presented in Table 5, from where we note once again that a value of r = 128 is sufficient to obtain good enough results by FFT (at least for these data). Concerning the computing times, the values were similar with the ones obtained in Example 2, see Table 2.

3. Conclusions

In this paper, we proposed a general multivariate collective model that allows for dependence between the r.v.s number of claims, and, moreover, between the different r.v.s claim sizes. Since the evaluation of the resulting compound distribution is not straightforward, we discussed two types of techniques to deal with it: The recursive method that was presented in Section 2.1 and the FFT algorithm that was described in Section 2.2. Unfortunately, even if the recursive method has the advantage of being exact, it has two main drawbacks compared with the FFT method: First, recursions are available under some restrictive assumptions and second, they become very slow with the increasing of the dimensionality of the model. On the other hand, the main drawback of the FFT method consists in its specific errors, especially the aliasing error. However, the FFT technique is so fast compared with the exact recursions, that it is quite worthwhile to use it, especially when values from the tail of the compound distribution are needed (nevertheless, it is important to pay attention when choosing optimal values for the truncation points and for the tilting parameters). Another advantage of the FFT is that specific functions are already implemented in existing software, even for higher dimensions, with, eventually, the disadvantage of memory limitation.
To conclude, we would recommend the following approach: If recursive formulas are available for the considered model, they should be used to evaluate the compound distribution until some reasonable (in computing time terms) upper limit is reached, and then the FFT method should be applied for a more extended domain; to validate the accuracy of the FFT results, they should be compared with the ones obtained by the recursive method.

Funding

This research received no external funding.

Acknowledgments

The author gratefully acknowledges the two anonymous referees for their nice and valuable comments, and the prompt help of the associate editor.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Bühlmann, Hans. 1984. Numerical evaluation of the compound Poisson distribution: Recursion or fast Fourier transform? Scandinavian Actuarial Journal 1984: 116–26. [Google Scholar] [CrossRef]
  2. Embrechts, Paul, R. Grübel, and S. M. Pitts. 1993. Some applications of the fast Fourier transform algorithm in insurance mathematics. Statistica Neerlandica 47: 59–75. [Google Scholar] [CrossRef]
  3. Grübel, Rudolf, and Renate Hermesmeier. 1999. Computation of compound distributions I: Aliasing errors and exponential tilting. ASTIN Bulletin: The Journal of the IAA 29: 197–214. [Google Scholar] [CrossRef]
  4. Jin, Tao, and Jiandong Ren. 2014. Recursions and fast Fourier transforms for a new bivariate aggregate claims model. Scandinavian Actuarial Journal 2014: 729–52. [Google Scholar] [CrossRef]
  5. Johnson, Norman Lloyd, Samuel Kotz, and Narayanaswamy Balakrishnan. 1997. Discrete Multivariate Distributions. New York: Wiley. [Google Scholar]
  6. Panjer, Harry H. 1981. Recursive evaluation of a family of compound distributions. ASTIN Bulletin: The Journal of the IAA 12: 22–26. [Google Scholar] [CrossRef]
  7. Robe-Voinea, Elena-Gratiela, and Raluca Vernic. 2016a. On the recursive evaluation of a certain multivariate compound distribution. Acta Mathematicae Applicatae Sinica, English Series 32: 913–20. [Google Scholar] [CrossRef]
  8. Robe-Voinea, Elena-Gratiela, and Raluca Vernic. 2016b. Another approach to the evaluation of a certain multivariate compound distribution. Analele Universitatii “Ovidius” Constanta-Seria Matematica 24: 339–49. [Google Scholar] [CrossRef]
  9. Robe-Voinea, Elena-Gratiela, and Raluca Vernic. 2017. On a multivariate aggregate claims model with multivariate Poisson counting distribution. Proceedings of the Romanian Academy Series A 18: 3–7. [Google Scholar]
  10. Robe-Voinea, Elena-Gratiela, and Raluca Vernic. 2018. Fast Fourier Transform for multivariate aggregate claims. Computational and Applied Mathematics 37: 205–19. [Google Scholar] [CrossRef]
  11. Sundt, Bjørn. 1999. On multivariate Panjer recursions. ASTIN Bulletin: The Journal of the IAA 29: 29–45. [Google Scholar] [CrossRef]
  12. Sundt, Bjørn, and Raluca Vernic. 2009. Recursions for Convolutions and Compound Distributions with Insurance Applications. Berlin: Springer Science & Business Media. [Google Scholar]
  13. Vernic, Raluca. 2018. On the evaluation of some multivariate compound distributions with Sarmanov’s counting distribution. Insurance Mathematics and Economics 79: 184–93. [Google Scholar] [CrossRef]
  14. Willmot, Gordon E. 1993. On recursive evaluation of mixed Poisson probabilities and related quantities. Scandinavian Actuarial Journal 1993: 114–33. [Google Scholar] [CrossRef]
Table 1. Example 1: Comparing recursive and FFT methods for h = 1 , θ = 7 / r and various r , x M .
Table 1. Example 1: Comparing recursive and FFT methods for h = 1 , θ = 7 / r and various r , x M .
r = x M = 16 r = 32 , x M = 20 r = 64 , x M = 20 r = 128 , x M = 20
Rec. F S ( x M ) 0.2197370.3128450.3128450.312845
FFT F S F F T ( x M ) 0.2198840.3129090.3128550.312847
FFT time up to r 0.016 s0.124 s0.952 s9.484 s
A E 1.4743 ×  10 4 6.3771 ×  10 5 1.0251 ×  10 5 1.7488 ×  10 6
M a x . e r r 8.8571 ×  10 8 2.0810 ×  10 8 3.5244 ×  10 9 6.9685 ×  10 10
Table 2. Example 2: Relative performances of the two methods when varying r ( θ = 7 / r ).
Table 2. Example 2: Relative performances of the two methods when varying r ( θ = 7 / r ).
r = 16 r = 32 r = 64
Rec/FFT in time12130781
Table 3. Example 2: Comparing recursive and FFT methods for h = 1 , θ = 7 / r and various r , x M .
Table 3. Example 2: Comparing recursive and FFT methods for h = 1 , θ = 7 / r and various r , x M .
r = x M = 16 r = x M = 32 r = x M = 64 r = 128 , x M = 70
Rec. F S ( x M ) 0.800350.915430.964360.96804
FFT F S F F T ( x M ) 0.800390.915440.964360.96804
A E 4.3580 ×  10 5 1.4294 ×  10 5 3.8798 ×  10 6 9.0937 ×  10 7
M a x . e r r 7.9393 ×  10 7 1.8276 ×  10 7 4.1642 ×  10 8 9.6893 ×  10 9
Table 4. Example 2: Comparing recursive and FFT methods for h = 1 , r = 64 and various θ .
Table 4. Example 2: Comparing recursive and FFT methods for h = 1 , r = 64 and various θ .
RecursionFFT no tilt. FFT tilt . θ = 5 / r FFT tilt . θ = 7 / r FFT tilt . θ = 9 / r
F S ( x M ) 0.964360.968630.964390.964360.96436
A E 4.2693 ×  10 3 2.8485 ×  10 5 3.8798 ×  10 6 5.4867 ×  10 6
M a x . e r r 4.5823 ×  10 5 3.0770 ×  10 7 4.1642 ×  10 8 2.7813 ×  10 8
Table 5. Example 3: Comparing recursive and FFT methods for h = 1 , θ = 7 / r and various r , x M .
Table 5. Example 3: Comparing recursive and FFT methods for h = 1 , θ = 7 / r and various r , x M .
r = x M = 16 r = x M = 32 r = x M = 64 r = 128 , x M = 70
Rec. F S ( x M ) 0.490440.721910.887010.90087
FFT F S F F T ( x M ) 0.490550.722000.887050.90088
A E 1.0650 ×  10 4 8.3639 ×  10 5 3.5553 ×  10 5 6.4368 ×  10 6
M a x . e r r 1.6674 ×  10 7 3.2871 ×  10 8 6.8457 ×  10 9 1.5338 ×  10 9

Share and Cite

MDPI and ACS Style

Vernic, R. On the Evaluation of the Distribution of a General Multivariate Collective Model: Recursions versus Fast Fourier Transform. Risks 2018, 6, 87. https://doi.org/10.3390/risks6030087

AMA Style

Vernic R. On the Evaluation of the Distribution of a General Multivariate Collective Model: Recursions versus Fast Fourier Transform. Risks. 2018; 6(3):87. https://doi.org/10.3390/risks6030087

Chicago/Turabian Style

Vernic, Raluca. 2018. "On the Evaluation of the Distribution of a General Multivariate Collective Model: Recursions versus Fast Fourier Transform" Risks 6, no. 3: 87. https://doi.org/10.3390/risks6030087

APA Style

Vernic, R. (2018). On the Evaluation of the Distribution of a General Multivariate Collective Model: Recursions versus Fast Fourier Transform. Risks, 6(3), 87. https://doi.org/10.3390/risks6030087

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop