1. Introduction
In property and casualty insurance, the loss payment delay related to claim reporting and handling can be significant. For example, an injury may not be reported by the victim until years after the accident or exposure that caused it, especially for latent diseases. Reported claims often remain unpaid for long periods, since court backlogs, discovery procedures, and legal negotiations delay settlement. Therefore, it is customary to categorize incurred claims into Incurred But Not Reported (IBNR), Reported But Not Settled (RBNS), and Settled (S), etc.
Insurers can only observe claims that have been reported, yet they are required to reserve funds for all incurred claims. Therefore, methods to estimate the distribution of incurred losses based on the insurer’s information about paid and/or reported amount have been studied extensively in the actuarial literature. For example, Arjas (1989) [
1] laid out a structure for modelling the claim reporting/handling process as a marked point process, where each incurred claim (point) is associated with a random mark that carries information about how it is categorized into different stages of settlement over time. The author showed how to use martingale theory to estimate IBNR claims based on information included in reported claims. Norberg (1993) [
2] assumed that claims occur according to a time–inhomogeneous Poisson process and that the time from claim occurrence to reporting and time from claim reporting to settlement follow arbitrary distributions. It was showed that the IBNR, RBNS, and Settled (S) claims all follow compound Poisson distributions and that they are independent. For a more recent treatment, one is referred to Chapter 8 of Mikosch (2009) [
3].
Instead of assuming certain probability distributions for the claim reporting/handling delays, the claim reporting/handling process may be represented by a Markov chain. This idea was proposed in Hachemeister (1980) [
4], and was followed by, for example, Hesselager (1994) [
5], who assumed that the claims arrive according to a Poisson process and derived the moments of the IBNR and RBNS claims.
As was pointed out in Hesselager (1994) [
5] and Norberg (1993) [
2], when it is assumed that the claims occur according to a Poisson process, due to the properties of marked Poisson processes, the amount of claims in different settlement stages (such as IBNR, RBNS) are independent - a statement that has not been tested empirically. The purpose of the current paper is not to argue whether the assumption of independencies among the claims in different stages of the settlement process is reasonable or not. Instead, we propose a general model that allows interdependencies among claims in difference stages of the settlement process. As argued in Neuhaus (2004) [
6], since claim development between two valuation dates comprises two separate types of development: changes in the assessment of reported incurred claims, and reports of new claims that are received by the insurer, it could be advantageous to distinct between the cost of reported claims and the cost of unreported claims in claim reserving. Information on their dependency will be important when aggregating both costs to estimate the variation of the total claim development.
Specifically, we assume that claims occur according to a Markovian arrival process (MAP) (see for example, Asmussen 2003 [
7]), and that an incurred claim goes through some stages of a claim reporting/handling process according to a Markovian law. We derive formulas for the joint distribution and the joint moments for the amount of INBR, RBNS and Settled claims. Since the MAP is quite general and includes Poisson processes, renewal processes with phase–type inter–arrival times and Markov modulated Poisson processes as special cases, our model may be used to evaluate how different claim occurring/handling processes may affect the volatility of the IBNR, RBNS, and Settled claims and their interdependencies. As an example, in
Section 5, we assume that the claim occurrence frequency is influenced by an external environment, which evolves according to a Markov chain, so the claims arrive according to a Markov modulated Poisson process. Using the results developed in
Section 3 and 4, we show that in this case the amount of IBNR and RBNS claims are more volatile than in the case when claims arrive according to a homogeneous Poisson process. In addition, the amount of IBNR and RBNS claims are stochastically dependent.
Methodologically, we will follow Willmot (1990) [
8], which recognized the connections between the distribution of the number of IBNR claims and the number of customers in an infinite server queue, where the service time of the queue is analogous to the time needed to report a claim. This allows us to make use of the results on the distribution of the number of customers in an infinite server queue with Markovian arrivals in the queueing literatures (see for example, Ramaswami and Neuts 1980 [
9] and Masuyama and Takine 2002 [
10]).
The remaining parts of this paper are organized as follows.
Section 2 introduces the claim occurrence and settlement processes;
Section 3 derives an equation that characterizes the joint distribution of the amount of the claims in different stages of the settlement process;
Section 4 derives formulas for the joint moments;
Section 5 presents numerical examples;
Section 6 concludes.
2. The Model
2.1. The Claim Incurral Process
We assume that claims occur according to a MAP with representation . That is, let be an irreducible continuous time Markov process with states, initial distribution γ, intensity matrix , and limiting distribution π. A transition in may be accompanied by the occurrence of a claim. As such, the intensity matrix has the decomposition , where is the matrix of intensities of state changes without arrivals and that of state changes with arrivals. Claims that arrive with the transitions are assumed to have probability distribution function , density function , kth moments and moment generating function (mgf) . In addition, we let denote a matrix with the th element for let be a matrix with the th element , let be a matrix with the th element for
2.2. The Claim Reporting and Handling Process
As in Hesselager (1994) [
5], the claim reporting and handling process is modelled by a continuous time Markov chain
with state space
. Let
, where
,
, and
denote subsets containing IBNR, RBNS and Settled states respectively. The state 0 is assumed to be absorbing, which means that there is no further development after a claim is settled.
An incurred claim is initially assigned to a state j in the set of transient states with probability . The matrix of the rates of transitions among the transient states is denoted by and the vector of rates of transitions from transient states to the absorbing state is denoted by . Then, the total time a claim spends in the claim reporting and handling process follows a phase–type distribution with representation , where .
In fact, the model structure requires that both the claim reporting and the claim handling time follow phase type distributions. Since it is well–known that the family of phase–type distributions is dense in the set of all positive-valued distributions (Asmussen 2003) [
7], the current model can be used to approximate other models that directly assume certain claim reporting/handling delay distributions.
In addition, the proposed model is quite flexible in that one can add more states into the state space S to represent additional features of the claim reporting/handling process. For example, in insurance practice, a settled claim can be reopened. Reopened claims can be included in the model, for example, by introducing a state “pre-settled” in the subset , where a claim in the “pre-settled” state can move to the state 0 (Settled) with certain probability, but it also may move back to some other state in , representing a reopened claim.
3. The Joint Distribution of Claims in Different States
Let
,
, denote the amount of claims incurred during time interval
and is at stage
of claim reporting/handling process. Let
be a matrix of conditional probabilities with the
th element
Let
where
, be the moment generating function (mgf) of
For
, let
and
denote the time of occurrence and the size of the
ith claim respectively. Then we have that
where
denotes an indicator function, which takes value 1 if the argument(s) is true and 0 otherwise.
For
where for
,
is the probability that a claim that occurs at time
is at stage
k of the claim reporting/handling process at time
t. Because of the assumed Markovian structure of the claim reporting/handling process, for
,
=
, where
denotes the
kth column of an identity matrix, and
.
It follows by using the law of total probability that
where
Combining Equations (2) and (4), we have
Using similar arguments as in the derivation of Theorem 3.1 in Masuyama and Takine (2002) [
10], Equation (6) leads to
The proof of the Theorem is provided in the
appendix of the current paper.
Remark 1. It can be seen from Equation (7) that
where
is the
kth element of
and
is the
kth column vector of a
identity matrix.
Intuitively, considering what may occur during a small time interval
, we have
Letting yields Equation (9).
4. The Joint Moments
For
and
, let
be the
lth moment of
conditional on
. Let
denote the vector of conditional moments with the
ith element being
. Let
denote
.
Then, differentiating both sides of Equation (7) with respect to
and rearranging, we obtain for
where
and
is a matrix with
th element
for
.
In particular,
which has the solution
For
,
, and
, let
be a vector having the
ith element
Then by differentiating both sides of Equation (7), we have
where
.
Noticing that by Equation (8),
So Equation (14) can be simplified to
In particular, let
denote
, then it satisfies
Remark 2. Let
be the
lth moment of the amount of IBNR claims conditional on
. Since
, we have
This can be calculated by making use of formula (15).
The unconditional moments of
are obtained by pre–multiplying the vector of conditional moments by the initial distribution
γ of the claim arrival process. When the claim arrival process is stationary,
i.e.,
, the calculation of the unconditional moments simplifies because
and if we pre–multiply the moment formulas (11) and (15) by
π, the first term on right hand side of the equations becomes zero. Particularly, for the first moments, we have that
which gives the simple formulas
and
When
, we have for
where the term
is in fact the expected amount of time a claim stays in stage
k before settlement (absorption).
Remark 3. Let denote the vector of conditional moments of the total amount of incurred claims during , regardless of the claim reporing/handling status. Then it can be calculated by setting in equation (11). In the equilibrium case, we have that This result will be used in the next section.
Remark 4. Similar to Section 8 of Ramaswami and Neuts (1980) [
9], it can be shown that when
, the distribution of
has an asymptotical limit and the joint moments
for
converges to a finite vector. This fact is actually illustrated in the numerical examples in the next section.
Remark 5. Since the calculation of the joint moments of only requires the moments of the claim sizes – the exact form of the claim size distribution is not needed. In the following numerical examples, exponential claim sizes are assumed for presentational convenience only.
5. Numerical Examples
This example considers three cases of the claim arrival process and illustrates how they affect the moments of the amount of IBNR and RBNS claims and their dependency. In the following, a random variable following an exponential distribution with rate λ is said to follow an distribution.
Case 1: Claims arrive according to a Poisson process with inter claim arrival times following an
distribution. The claim sizes follow an
distribution. In terms of the MAP representation, we have
Case 2: The claims arrive according to a Markov modulated Poisson process and claim sizes are modulated by the states of the underlying Markov process. Specifically, assume that an external environment evolves according to a continuous time Markov chain
with a state space
, where the two elements standing for normal and risky environment respectively. The environment process is assumed to have the infinitesimal generator
where
and
. The equilibrium distribution of the environment states is given by
which means that in the long run, with
chance, the environment is normal and with
chance the environment is risky. In the normal environment
N, the claim occurrence rate is
and the claim sizes follow an
distribution. In the risky environment
R, the claim occurrence rate is
and the claim sizes follow an
distribution.
Assuming that the claim arrival process is stationary, the MAP representation is given by
Case 3: This case is similar to Case 2, but with different parameter values. Here it is assumed that the environment process has state space
and have the infinitesimal generator
where
and
. So the equilibrium distribution of the environment states is given by
In the normal environment
N, the claim occurrence rate is
and the claim sizes follow an
distribution. In the risky environment
R, the claim occurrence rate is
and the claim sizes follow an
distribution.
Assuming that the claim arrival process is stationary, the MAP representation is given by
For all three cases, the expected values of the amount of claims incurred in the time interval are the same and have the value .
The parameter values for the environment process in Case 2 and 3 are designed to study how the environment affects the claim incurring and reporting process. In general, assuming
, then a large ratio
indicates that the arrival process is “bursty”and thus more volatile. See for example, Neuts and Li (1997) [
11] for a discussion of the burstiness of Markovian arrival processes. With the above setup, The standard deviations (SD) of the amount of claims incurred in the time interval
were calculated using the method pointed out in remark 3 and their values are found to be
, 117, and
for case 1, 2 and 3 respectively. Obviously, the claim arrival process of case 2 is the most volatile and case 1 is the least volatile.
In all three cases, the claim reporting/handling process
is assumed to have three states
, with
,
and
. Therefore, both the time from claim occurrence to reporting and the time from claim reporting to settlement follow Exponential distributions. Assuming that they have rate
and 1 respectively, then we have
A claim has to be reported before being handled, so
The mean, standard deviation and the coefficient of variation of the amount of the IBNR, RBNS and Settled claims for the three cases are computed using the formulas developed in
Section 4. The values are plotted in
Figure 1, from which it can be seen that the mean amount of IBNR, RBNS and Settled claims coincide in the three cases, however the standard deviations and the values of the coefficient of variation are very different. For case 1, IBNR and RBNS losses are independent because of the Poisson arrival assumption. However, as shown in
Figure 2, the amount of IBNR and RBNS claims in case 2 and 3 are correlated, with the correlation affected by the burstiness of the claim arrival process.
6. Conclusion and Discussions
In this paper, we propose and analyze a model of insurance claim occurrence, reporting and handling process based on Markovian arrival processes. This model generalizes the commonly used models assuming Poisson claim arrivals. It enables us to evaluate how the claim occurrence process may affect the volatilities and interdependencies of the amount claims in different stages of the loss reporting/handling process.
The model can be generalized in many ways. For example, in the current model, the claim severity is fixed at the time when it occurs and are independent of the claim reporting and setting process, the full amount is paid when a claim is settled. However, in reality, claim severity could change during the claim settling process due to reassessment. In fact, in the model introduced by Huynh
et al. (2015) [
12], it is assumed that the claim size may change during the claim setting (investigation) process. The Markovian model proposed in this paper could be generalized to consider this. For example, one could assume that each claim of size
y is modified by a random factor
with distribution function
during its stay in the RBNS phases. With such modifications, Equation (3) becomes
Thus Equation (8) becomes
With this, the calculation of the moments can be carried out is a similar way as in
Section 4.
It should be pointed out that parameter estimation for MAP processes is much more complicated than for a Poisson process. How to estimate parameters for the MAP claim arrival process based on observation of reported/paid losses is an interesting future research topic.