1. Introduction
In the real world, the time until an event occurs is typically considered to be a continuous variable. However, in many applications, recorded observations have a discrete nature because they are represented by discrete values. When there are many different possible values, any distinction between the continuous or discrete nature of a variable becomes negligible. Nonetheless, in certain application areas, time is often modelled as a discrete variable with relatively few possible values. This is particularly true for actuarial models, for which typically a cohort of people, either real or hypothetical, is followed over time, and events such as deaths are recorded per year. However, if a person leaves the group for reasons other than death, or other than the specific cause of death under investigation, then right-censoring occurs. In such cases, time is recorded as the person’s age at the time of the event, making time a discrete variable.
For discrete-time data, approaches like those considered in this paper can be visualised as a table with
k discrete-time points, say
. The data include the number of events and the number of right-censored individuals at each discrete-time point, excluding time
where no events or right-censoring are assumed to have occurred. At any given time, we consider how many people are alive; that is, how many people have survived that time, so this is effectively Bernoulli data; then, we look ahead and assess how many people will be alive in the future. The actuarial estimator is a nonparametric method for estimating the survival function which explicitly restricts attention to the discrete-time, with the possible inclusion of right-censored data [
1,
2,
3].
In this paper, we take a similar approach, but we look at it from a predictive perspective using nonparametric predictive inference (NPI), which is inherently predictive. NPI is a statistical method that relies on only a few assumptions based on Hill’s assumption
[
4] and uses imprecise probabilities to measure uncertainty [
5,
6]. Additionally, NPI has been adapted to accommodate various types of data and a wide range of applications. For example, NPI has been used with Bernoulli data [
7,
8], data containing right-censored observations [
9,
10], bivariate data [
11], multinomial data [
12,
13], and circular data [
6]. The NPI approach has also been developed for right-censored observations in survival data [
14]. NPI for Bernoulli data [
7] will be utilised to develop the NPI alternative to the actuarial estimator, based on the assumption of non-informative right censoring [
9,
15].
The paper is organized as follows.
Section 2 provides a brief introduction to the actuarial estimator of the survival function.
Section 3 provides an overview of NPI and its application to Bernoulli and right-censored data. In
Section 4, we present NPI as an alternative to the actuarial estimator for right-censored data. It proposes lower and upper probabilities for the event that all future observations are greater than a specific discrete time
.
Section 5 focuses on the proposed NPI-based discrete-time reliability function by providing lower and upper probabilities for the event that at least
x out of
m future observations will survive at discrete time
. In
Section 6, we apply the proposed method to system reliability using survival signatures [
16,
17] combined with NPI for Bernoulli data [
7]. Finally, we conclude with some remarks in
Section 7.
2. Actuarial Estimator of the Survival Function
In a discrete-time setting, a common nonparametric method for estimating the survival function is the actuarial estimator. To introduce the actuarial estimator, we first consider
n individuals alive at time
. Let
be positive, exchangeable, and discrete random variables, of which their discrete lifetimes are assumed to be independent and identically distributed, that take values at discrete-time points
, where
, with
. We refer to the event of interest as ‘death’, but it can be any time-related event of interest; for example, in reliability, it will typically be a failure event. The discrete-time hazard function at a specific time
is defined as the conditional probability that a randomly selected individual,
,
, will experience the event of interest at time
given that this individual did not experience the event prior to
such that
Let
be the number of individuals who died at time
and let
be the number of individuals whose lifetimes are right-censored at time
. Let
be the number of individuals known to be at risk (still alive and uncensored) at time
, that is
. It is common to assume that all individuals are at risk at
, so
. Then, the discrete-time hazard function,
, at a discrete time
can be estimated by the actuarial estimator [
1,
2,
3], as
The survival function at time
is defined as
; note that
. The survival function
can be estimated in terms of the actuarial estimator
for
, by
In
Section 4, we will explore an alternative to the actuarial estimator under the NPI methodology, using NPI for Bernoulli data [
7]. First, a brief introductory overview of NPI, including NPI for Bernoulli data, is provided in
Section 3.
3. Nonparametric Predictive Inference (NPI)
Nonparametric predictive inference (NPI) is a frequentist statistical method which requires only a few assumptions, enabled by the use of imprecise probabilities to quantify uncertainty [
5,
6]. NPI is based on Hill’s assumption
[
4]. Assume that
are real-valued absolutely continuous and exchangeable random quantities. Let the ordered observed values of
be denoted as
. To simplify notation, let
and
. These
n observations divide up the real-line into
intervals
, where
. Based on
n observations, the assumption
[
18] is that the probability that the next future observation
is equally likely to fall in each open interval
, for all
, so
The assumption
alone is insufficient for constructing precise probabilities for many events of interest, but it is still useful to derive bounds for probabilities. Repeated application of the assumptions
enables predictive inference for
future observations. These assumptions imply that all orderings of the
m future observations among the
n data observations are equally likely. Based on these assumptions, Coolen [
7] introduced NPI for Bernoulli observations, using an assumed latent variable representation for successes and failures as values on the real line separated by a threshold.
Assume that there are
exchangeable Bernoulli trials with failure and success as possible outcomes for each trial, and data containing
s successes in
n trials. Let
denote the random number of successes in trials 1 to
n, and
denote the random number of successes in trials
to
. Coolen [
7] presented general formulae for NPI lower and upper probabilities for any event of interest involving
. The attention here is limited to the results needed in this paper.
Coolen [
7] derived the NPI lower and upper probabilities for the event that all
m future trials are successes, given data consisting of
s successes observed in
n trials, for
. The NPI lower probability for this event is
and the corresponding NPI upper probability is
Based on the general results by Coolen [
7], Aboalkhair [
19] derived formulae for the NPI lower and upper probabilities for the event that there are at least
r successes in
m future trials, given
s successes observed in
n trials. The NPI lower probability for this event is
and the corresponding NPI upper probability is
The nature of
results in NPI being a frequentist statistical methodology [
4,
5,
20], which can be interpreted in a way similar to that of posterior predictive methods within Bayesian statistics but without any prior information being included [
7,
21]. Hill [
20] provides detailed discussion of
including comparison with nonparametric Bayesian methods, and Hill [
21] presented a formal justification of
within the Bayesian context. This justification, however, is a rather complicated splitting process under finite exchangeability. It is more natural to consider NPI, based on
, as a frequentist statistics methodology based on assumed exchangeability of all observations. The exchangeability assumption implies that all orderings of observations are equally likely, and
-based inference keeps this property for the orderings of future observations among data observations. This is a relatively weak assumption, but it excludes scenarios with known trends in data, e.g., time series, or other clear patterns in the data which would undermine the assumption that all orderings of the observations are equally likely.
4. NPI-Based Discrete-Time Survival Function
In this section, we introduce an NPI-based alternative to the actuarial estimator for the survival function. We introduce it in two steps: first, we derive the NPI lower and upper probabilities for the conditional survival function, and then we derive the corresponding survival function. In doing so, we need to consider the interdependence between the m future observations; however, we first need to introduce some notation.
Let be positive, exchangeable, and discrete random variables that take values at discrete-time points , where , with . We define for the start of the study when all individuals survived, and . Let be the number of individuals known to be alive at time . Also, let represent the number of observed events at time , and represent the number of censored events at time . We assume that the censored observations occur at discrete times , where . Then, the number of individuals at risk at time , denoted by , is computed by . Therefore, the number of individuals at risk at time , , will decrease at subsequent discrete times. Furthermore, let , for , be the event times for the individuals in the risk set at time .
Now, let us consider for the time of event of the ith future individual, for . We consider the event of interest that all m future observations survive a specific discrete time given that they survived the earlier discrete time . This event can be denoted as
We consider the survival of all m future observations at time as exchangeable with the survival of the individuals in the risk set at that discrete time. So, we assume that the random quantities , with respect to the event , , are exchangeable with with respect to the event for , where are the event times for the individuals in the risk set at time .
The NPI lower and upper probabilities for the event
can be derived by utilising NPI for Bernoulli data [
7] via Equations (
5) and (
6), respectively. This can be performed by considering the number of individuals known to be alive at time
,
, out of the number of individuals at risk at time
,
. Thus, the NPI lower probability for this event is
and the corresponding NPI upper probability for this event is
We now consider the event that the
m future observations will all exceed
, that is
. The NPI lower and upper probabilities for this event can be expressed in terms of the NPI conditional lower and upper probabilities in Equations (
9) and (
10), respectively, at all earlier times
, as follows
For the special case when we have one future observation
(i.e.,
), the NPI lower and upper probabilities for the event
can be directly calculated from Equations (
11) and (12), respectively, as
The NPI lower and upper probabilities for the event
, as presented in Equations (
11) and (12), take into account the dependence among all these future observations when there is limited information in the form of
n observations in the data set. It is of interest to see the effect of taking this dependence carefully into account. For this reason, we will compare our method with the results one would obtain if, mistakenly, when interested in
m future observations, one would use the NPI lower and upper probabilities for the event
, presented in Equations (
13) and (
14), raised to the power of
m, i.e.,
. This will be demonstrated in Example 2 by studying the impact of ignoring the interdependence between the
m future observations, but first, Example 1 is provided to demonstrate our method.
Example 1. We will start with a simple example involving observations, which are available at discrete times , for . At each time point, we have the number of observed events, ; the number of censored individuals, ; the number of individuals known to be alive at time , ; and the number of individuals at risk at time , . It is important to note that is computed differently than , for example, but (see Section 2). The data are shown in the first six columns in Table 1. The probability of the hazard function, , at a discrete time can be estimated by using the actuarial estimator with Equation (2). Then, the estimated probability of surviving , for , is derived using Equation (3). These results are presented in the seventh and eighth columns of Table 1. Next, we apply the NPI alternative to the actuarial estimator, leading to the NPI lower and upper probabilities for the event , as given by Equations (11) and (12), respectively. These are calculated for the discrete-time points , , , and , for different numbers of future observations, i.e., for . It is worth noting that at the start of the study at time , no events or censorings have been recorded, so . Based on the results in Table 1, we observe that the difference between the NPI upper probability and the NPI lower probability is quite small at time for all considered numbers of future observations and becomes larger later on. This increase in difference is influenced by two effects: fewer individuals in the risk set at later times , , and , and the products of lower and upper probabilities are taken such that each term (i.e., time point) adds to the imprecision. When we compare the results from our proposed method for future observation with those resulting from estimating the survival function based on the actuarial estimator, we find that the values, based on using the actuarial estimator, fall between our NPI lower and upper probabilities for , but they are closer to the upper probability values.
Example 2. The dataset used in this example was also utilised by Berkson and Gage [22] as well as by Lawless [23] and Yan [15]. It comprises 374 observations, wherein 95 are right-censored, and the remaining are event times measured at 10 discrete times in years. The dataset is summarised in the first three columns of Table 2. By using Equations (11) and (12), we obtain the NPI lower and upper probabilities for the event for future observations at specific time points. The results are summarised in Table 2. To understand the impact of considering the dependence among future observations, we compare our results for five future observations with those obtained by erroneously considering only the first future observation raised to the power of 5, i.e., . Due to the positive dependence among the future observations, , , , , and , our correct NPI lower and upper probabilities for the event are greater than those obtained using the mistaken approach (taking the lower and upper probabilities for raised to the power of 5). Although the imprecisions (differences between the upper and lower probabilities) are small, they would become more noticeable for more than five future observations due to the positive dependence among all future observations.
5. NPI-Based Discrete-Time Reliability Function
In this section, we are introducing NPI lower and upper probabilities for the event that at least x out of m future observations will survive at discrete time . Let denote the number of future observations out of m that survive at discrete time . Given Bernoulli trials, with observations surviving at time , we aim to derive the NPI lower and upper probabilities for the event , where x can take values in the set .
The NPI upper probability for the event
is derived by utilising Equation (
8), as
The terms on the right-hand side of Equation (
15) are all derived by applying Equation (
8), where for the terms involving
the data consist of
Bernoulli trials, with
observations surviving at time
, leading to
where
future observations. It is important to point out that for the case
, the NPI upper probability for the event
is equal to 0.
The NPI lower probability for the event
, for
is derived by utilising Equation (
7), as
The terms on the right-hand side of Equation (
17) are all derived by applying Equation (
7), where for the terms involving
the data consist of
Bernoulli trials, with
observations surviving at time
, leading to
where
future observations. It should be remarked that the NPI lower probability for the event
for the case
is equal to 0.
It is worth noting that the lower and upper probabilities for the event when are both equal to 1, regardless of the values of y. Therefore, only the results for will be reported hereafter.
Example 3. In this example, we will illustrate the method presented in Section 5 using a simple example involving nine observations, available at discrete times for (data are summarized in Table 3). Table 3 shows the NPI lower and upper probabilities for the event , where and , with . For , the NPI lower and upper probabilities are equal to 1 for all and at all , due to the fact that no future observation out of y will survive at discrete time . Note that some cells in Table 3 are empty due to the calculation of probabilities for the event that at least x out of y future observations will survive at discrete time . From Table 3, we can also observe that at a specific discrete time , the NPI lower and upper probabilities decrease in x when everything else is constant and increase in y when everything else is constant. Meanwhile, Table 4 presents the NPI lower and upper probabilities for the event for future observations, again the lower and upper probabilities are equal to 1 when . From Table 4, we can see that the difference between the NPI lower and upper probabilities decreases in x while everything else is held constant at each discrete time . Without any further added assumptions, the values of the NPI lower probabilities at are 0 for , whereas the NPI upper probabilities are positive. 6. Application to System Reliability Using Survival Signatures
In
Section 5, we derived the NPI lower and upper probabilities for the event that at least
x out of
m future observations will survive at discrete time
. In this section, we will utilise these probabilities to assess system reliability, considering single or multiple types of components, using survival signatures. Essentially, the results from
Section 5 will be employed to derive lower and upper probabilities for the discrete-time system reliability event
, where
denotes the random failure time of the system. We will combine the concept of survival signature [
16,
24] with the proposed method in
Section 5. First, we will give a brief overview of survival signatures, where the values of the survival signatures are assumed to be given. Then, we will demonstrate the application of the proposed methods to the reliability of some discrete-time systems with both single and multiple types of components.
6.1. The Survival Signature
The signature has been introduced to evaluate the reliability of systems consisting of only one type of component and is used to model the structure of a system, separating this from the random failure times of the components [
17]. The NPI method is used in order to learn about the components within the system, based on data consisting of failure times for components that are exchangeable with those within the system. We therefore assume that such data are available, such as those obtained from testing or previous use of the components [
16,
17]. Following the literature, the assumption of exchangeability is often replaced by the stronger assumption of independent and identically distributed
component failure times [
25]. Taking into account a system consisting of
m components with exchangeable failure times, Samaniego [
26,
27] introduced the system signature as a tool for reliability assessment for systems consisting of components of a single type. However, the use of signatures becomes very complicated in the case of quantifying the reliability of systems with multiple types of components. Coolen and Coolen–Maturi [
24] introduced an alternative concept called the ’survival signature’. The idea of the survival signature is to generalise the signature to systems with multiple types of components. When quantifying the reliability of systems with only one type of component, the survival signature is closely related to the signature [
16,
24]. The NPI methodology has been introduced for system reliability using the survival signature via lower and upper survival functions for the failure time
of a system consisting of multiple types of components [
16], combined with NPI for Bernoulli data [
7].
For a system with m exchangeable components, we need to consider the state vector taking into account that for each i, if the ith component functions, then , otherwise when the ith component does not function. For all possible state vectors , the following structure function is defined as , so that if the system functions and if the system does not function. Throughout this section, the system is assumed to be coherent, which means that the structure function must not be decreasing in any of the components of , and this leads to the fact that the functioning of the system cannot be improved by worse performance of one or more of its components. Furthermore, we assume that the system functions if all its components function, so , and the system fails if all its components fail, so .
For a system consisting only of
m exchangeable components, the survival signature, denoted by
, for
, is defined as the probability that the system functions given that precisely
l of its components function [
24]. For coherent systems,
is an increasing function of
l, and we assume that
and
. There are
state vectors
with precisely
l components
, so with
; the set of these state vectors is denoted by
. Inspired by the
assumption which has been considered for the failure times of the
m components, all these state vectors are equally likely to occur [
24]. Thus, the survival signature
can be achieved as follows [
24]
Let
represent the number of components in the system with a single type that functions at time
. So, the probability that the system functions at time
is
For a system consisting of
types of components, the survival signature, denoted by
for
, is defined as the probability that a system functions given that precisely
of its components of type
k function, for each
. There are
state vectors
with precisely
of its
components
; so, with
, we denote the set of these state vectors for components of type
k by
. In addition, let
denote the set of these state vectors for the whole system for which
,
. Inspired by the
assumption which has been considered for the failure times of the
components of type
k, all these state vectors
are equally likely to occur. Thus, the survival signature
can be achieved as follows [
24].
Let
represent the number of components of type
k in the system which function at time
. So, the probability that the system functions at time
is
Assuming that the failure times of components of different types are independent, while the exchangeability is assumed for the failure times of components of the same type [
16], the survival function for
can be written as
6.2. Discrete-Time System Reliability
In this section, we will apply the proposed method to system reliability in the case of discrete time. We will combine the concept of the survival signature
(as reviewed in
Section 6.1) with the results obtained in
Section 5 to present lower and upper probabilities for the event
of a system reliability that consists of both single type and multiple types of components. These lower and upper probabilities represent the survival functions at discrete-time points
.
At a specific time
, let
represent the number of components for which test failure data are available, and let
represent the number of components that failed at time
. Therefore,
is the number of components of this type that are still functioning at time
[
16,
17]. Additionally, let
denote the number of components in the system out of f
m that are still functioning at a discrete time
.
We obtain the NPI lower and upper probabilities for the event that
for a system consisting of a single type of components, using the survival signature
combined with the proposed method in
Section 5 as follows
and
where
and
are derived from Equations (
16) and (
18), respectively, so
and
We now consider a system consisting of
types of components with
components of
, with
. For a specific time
, let
denote the number of components of type
k for which test failure data are available, and let
denote the numbers of components that failed at time
; therefore,
is the number of components of type
k that are still functioning at time
[
16,
17]. The failure times of components of different types are assumed to be independent, while failure times of components of the same type are assumed to be exchangeable [
16]. Let
denote the number of components of type
k in the system out of
that are still functioning at a discrete time
,
.
The NPI lower and upper probabilities for the event
of a system consisting of multiple types of components using the survival signature
combined with the proposed method in
Section 5 are as follows
and
where
and
for
are derived from Equations (
16) and (
18), respectively, thus
and
Next, we will apply the results presented above to discrete-time system reliability, which consists of a single type of component (see Example 4) and multiple types of components (see Example 5).
Example 4. The system depicted in Figure 1 is utilised in this example and was also utilised by Coolen and Coolen–Maturi [28]. We are examining the reliability of a discrete-time system with exchangeable components, presenting the survival signature values as follows: , , , , , and . We will analyse two datasets of different sizes, one with observations and the other with observations. These datasets include failure events and right-censored observations for discrete times to . Table 5 presents NPI lower and upper probabilities for at these discrete times, based on the survival signature values as provided and the results in Section 6.2. Upon comparing the results in Table 5, it is evident that the imprecisions (the difference between the upper and lower probability) for both sample sizes are minimal at time and increase as we progress to later times, owing to fewer observations in the risk set. Additionally, the differences between the lower and upper probabilities for with observations are generally smaller compared to those with observations. So, the imprecision for decreases as the dataset size increases, i.e., as we have more data available. Example 5. In this example, we are examining a system with types of components, types 1 and 2, depicted in Figure 2. Coolen et al. [24] utilised this system to demonstrate NPI for the system survival time. The survival signature for this system can be found in Table 6. We are focusing on the data provided in Table 7 for the two types with components, and each type has 10 observations, i.e., , including failure events and right-censored observations, for discrete times , , and . The table also includes the NPI lower and upper probabilities for at discrete times , , and , based on the given survival signature values and the results in Section 6.2. When considering the NPI approach for real-valued data, it is typical for the lower probability value for in a specific interval to be less than or equal to the upper probability value for in the next interval. This is evident in the results of [29]. However, the NPI for the discrete-time approach indicates that this may not always be the case, as observed in the results of Table 7 where , and in the results of Table 5 where . Many of the findings presented in this paper suggest that, for discrete-time cases, this discrepancy may arise due to multiple failures occurring between discrete-time points.