Nonparametric Predictive Inference for Discrete Lifetime Data

Coolen, Frank P. A.; Coolen-Maturi, Tahani; Mahnashi, Ali M. Y.

doi:10.3390/math12223514


by Frank P. A. Coolen ¹,*, Tahani Coolen-Maturi ¹ and Ali M. Y. Mahnashi ²

¹ Department of Mathematical Sciences, Durham University, Durham DH1 3LE, UK

² Department of Mathematics, College of Science, Jazan University, Jazan 45 142, Saudi Arabia

* Author to whom correspondence should be addressed.

Mathematics 2024, 12(22), 3514; https://doi.org/10.3390/math12223514

Submission received: 30 July 2024 / Revised: 2 October 2024 / Accepted: 28 October 2024 / Published: 11 November 2024

(This article belongs to the Special Issue Reliability Analysis and Stochastic Models in Reliability Engineering)


Abstract

This paper presents nonparametric predictive inference for discrete lifetime data. While lifetimes are mostly treated as continuous random variables in statistics, there are scenarios where time observations are recorded as discrete values, for example in actuarial science, where lifetimes are often recorded as integers in years. The presented method provides lower and upper probabilities for a variety of events of interest involving discrete lifetimes, with examples provided for illustration. Furthermore, the discrete-time situation is considered for inference on the reliability of systems, with discrete-time data for components of different types, using the survival signature to combine inferences on the components’ reliability into a quantification of the overall system reliability.

Keywords: discrete lifetime data; nonparametric predictive inference; survival signature

MSC: 62G99

1. Introduction

In the real world, the time until an event occurs is typically considered to be a continuous variable. However, in many applications, recorded observations have a discrete nature because they are represented by discrete values. When there are many different possible values, any distinction between the continuous or discrete nature of a variable becomes negligible. Nonetheless, in certain application areas, time is often modelled as a discrete variable with relatively few possible values. This is particularly true for actuarial models, for which typically a cohort of people, either real or hypothetical, is followed over time, and events such as deaths are recorded per year. However, if a person leaves the group for reasons other than death, or other than the specific cause of death under investigation, then right-censoring occurs. In such cases, time is recorded as the person’s age at the time of the event, making time a discrete variable.

For discrete-time data, approaches like those considered in this paper can be visualised as a table with k discrete-time points, say

t_{1} < t_{2} < \dots < t_{k}

. The data include the number of events and the number of right-censored individuals at each discrete-time point, excluding time

t_{0}

where no events or right-censoring are assumed to have occurred. At any given time, we consider how many people are alive, that is, how many people have survived that time, so these are effectively Bernoulli data; we then look ahead and assess how many people will be alive in the future. The actuarial estimator is a nonparametric method for estimating the survival function which explicitly restricts attention to discrete time, with the possible inclusion of right-censored data [1,2,3].

In this paper, we take a similar approach, but we look at it from a predictive perspective using nonparametric predictive inference (NPI), which is inherently predictive. NPI is a statistical method that relies on only a few assumptions based on Hill’s assumption

A_{(n)}

[4] and uses imprecise probabilities to measure uncertainty [5,6]. Additionally, NPI has been adapted to accommodate various types of data and a wide range of applications. For example, NPI has been used with Bernoulli data [7,8], data containing right-censored observations [9,10], bivariate data [11], multinomial data [12,13], and circular data [6]. The NPI approach has also been developed for right-censored observations in survival data [14]. NPI for Bernoulli data [7] will be utilised to develop the NPI alternative to the actuarial estimator, based on the assumption of non-informative right censoring [9,15].

The paper is organized as follows. Section 2 provides a brief introduction to the actuarial estimator of the survival function. Section 3 provides an overview of NPI and its application to Bernoulli and right-censored data. In Section 4, we present NPI as an alternative to the actuarial estimator for right-censored data. It proposes lower and upper probabilities for the event that all future observations are greater than a specific discrete time

t_{j}

. Section 5 focuses on the proposed NPI-based discrete-time reliability function by providing lower and upper probabilities for the event that at least x out of m future observations will survive at discrete time

t_{j}

. In Section 6, we apply the proposed method to system reliability using survival signatures [16,17] combined with NPI for Bernoulli data [7]. Finally, we conclude with some remarks in Section 7.

2. Actuarial Estimator of the Survival Function

In a discrete-time setting, a common nonparametric method for estimating the survival function is the actuarial estimator. To introduce the actuarial estimator, we first consider n individuals alive at time

t_{0}

. Let

X_{1}, X_{2}, \dots, X_{n}

be positive discrete random variables representing the lifetimes of these individuals, assumed to be independent and identically distributed (and hence exchangeable), that take values at discrete-time points

t_{j}

, where

j = 1, \dots, k

, with

t_{1} < t_{2} < \dots < t_{k}

. We refer to the event of interest as ‘death’, but it can be any time-related event of interest; for example, in reliability, it will typically be a failure event. The discrete-time hazard function at a specific time

t_{j}

is defined as the conditional probability that a randomly selected individual,

X_{i}

,

i = 1, \dots, n

, will experience the event of interest at time

t_{j}

given that this individual did not experience the event prior to

t_{j}

such that

h_{t_{j}} = P (X_{i} = t_{j} | X_{i} \geq t_{j})

(1)

Let

d_{t_{j}}

be the number of individuals who died at time

t_{j}

and let

c_{t_{j}}

be the number of individuals whose lifetimes are right-censored at time

t_{j}

. Let

{\overset{`}{n}}_{t_{j}}

be the number of individuals known to be at risk (still alive and uncensored) at time

t_{j}

, that is

{\overset{`}{n}}_{t_{j}} = {\overset{`}{n}}_{t_{j - 1}} - d_{t_{j - 1}} - c_{t_{j - 1}}

. It is common to assume that all individuals are at risk at

t_{0}

, so

{\overset{`}{n}}_{t_{0}} = n

. Then, the discrete-time hazard function,

h_{t_{j}}

, at a discrete time

t_{j}

can be estimated by the actuarial estimator [1,2,3], as

{\hat{h}}_{t_{j}} = \frac{d_{t_{j}}}{{\overset{`}{n}}_{t_{j}}}

(2)

The survival function at time

t_{j}

is defined as

S_{t_{j}} = P (X \geq t_{j})

; note that

S_{t_{0}} = P (X \geq 0) = 1

. The survival function

S_{t_{j}}

can be estimated in terms of the actuarial estimator

{\hat{h}}_{t_{l}}

for

l = 1, \dots, j - 1

, by

{\hat{S}}_{t_{j}} = \prod_{l = 1}^{j - 1} (\frac{{\overset{`}{n}}_{t_{l}} - d_{t_{l}}}{{\overset{`}{n}}_{t_{l}}}) = \prod_{l = 1}^{j - 1} (1 - {\hat{h}}_{t_{l}})

(3)
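The recursion for the risk set together with Equations (2) and (3) can be sketched in a few lines of Python. This is a minimal illustration only: the function name `actuarial_estimator` and the data in the usage lines are hypothetical and are not taken from the paper.

```python
def actuarial_estimator(n, deaths, censored):
    """Return (hazards, survival) for t_1..t_k following Equations (2) and (3).

    deaths[j-1] and censored[j-1] hold d_{t_j} and c_{t_j}; n is the number
    of individuals at risk at t_0. survival[j-1] estimates P(X >= t_j).
    """
    hazards, survival = [], []
    at_risk = n      # no deaths or censorings at t_0, so the risk set at t_1 is n
    surv = 1.0       # empty product: the estimate of P(X >= t_1) is 1
    for d, c in zip(deaths, censored):
        h = d / at_risk if at_risk > 0 else float("nan")
        hazards.append(h)          # Equation (2)
        survival.append(surv)      # Equation (3): product over l = 1..j-1
        surv *= 1.0 - h            # fold in t_j for the next time point
        at_risk -= d + c           # risk set at the next time point
    return hazards, survival


# Hypothetical data: 10 individuals, three discrete time points.
hazards, survival = actuarial_estimator(10, deaths=[1, 2, 1], censored=[0, 1, 2])
```

Note that the survival estimate at a time point uses only the hazards at strictly earlier time points, in line with the definition of the survival function as the probability of reaching that time point.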

In Section 4, we will explore an alternative to the actuarial estimator under the NPI methodology, using NPI for Bernoulli data [7]. First, a brief introductory overview of NPI, including NPI for Bernoulli data, is provided in Section 3.

3. Nonparametric Predictive Inference (NPI)

Nonparametric predictive inference (NPI) is a frequentist statistical method which requires only a few assumptions, enabled by the use of imprecise probabilities to quantify uncertainty [5,6]. NPI is based on Hill’s assumption

A_{(n)}

[4]. Assume that

X_{1}, X_{2}, \dots, X_{n}, X_{n + 1}

are real-valued absolutely continuous and exchangeable random quantities. Let the ordered observed values of

X_{1}, \dots, X_{n}

be denoted as

x_{1} < x_{2} < \dots < x_{n}

. To simplify notation, let

x_{0} = - \infty

and

x_{n + 1} = \infty

. These n observations divide up the real-line into

n + 1

intervals

I_{j} = (x_{j}, x_{j + 1})

, where

j = 0, 1, \dots, n

. Based on n observations, the assumption

A_{(n)}

[18] is that the next future observation

X_{n + 1}

is equally likely to fall in each open interval

(x_{j}, x_{j + 1})

, for all

j = 0, 1, \dots, n

, so

P (X_{n + 1} \in (x_{j}, x_{j + 1})) = \frac{1}{n + 1} \quad \text{for all } j = 0, 1, \dots, n

(4)

The assumption

A_{(n)}

alone is insufficient for constructing precise probabilities for many events of interest, but it is still useful to derive bounds for probabilities. Repeated application of the assumptions

A_{(n)}, A_{(n + 1)}, \dots, A_{(n + m - 1)}

enables predictive inference for

m \geq 1

future observations. These assumptions imply that all orderings of the m future observations among the n data observations are equally likely. Based on these assumptions, Coolen [7] introduced NPI for Bernoulli observations, using an assumed latent variable representation for successes and failures as values on the real line separated by a threshold.

Assume that there are

n + m

exchangeable Bernoulli trials with failure and success as possible outcomes for each trial, and data containing s successes in n trials. Let

X_{1}^{n}

denote the random number of successes in trials 1 to n, and

X_{n + 1}^{n + m}

denote the random number of successes in trials

n + 1

to

n + m

. Coolen [7] presented general formulae for NPI lower and upper probabilities for any event of interest involving

X_{n + 1}^{n + m}

. The attention here is limited to the results needed in this paper.

Coolen [7] derived the NPI lower and upper probabilities for the event that all m future trials are successes, given data consisting of s successes observed in n trials, for

s \in {0, 1, \dots, n}

. The NPI lower probability for this event is

\underset{̲}{P} (X_{n + 1}^{n + m} = m | X_{1}^{n} = s) = \prod_{i = 1}^{m} \frac{s + i - 1}{n + i}

(5)

and the corresponding NPI upper probability is

\bar{P} (X_{n + 1}^{n + m} = m | X_{1}^{n} = s) = \prod_{i = 1}^{m} \frac{s + i}{n + i}

(6)
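Equations (5) and (6) are simple products, so they can be evaluated exactly with rational arithmetic. The following is a minimal sketch; the function name `npi_all_success_bounds` is hypothetical and chosen here only for illustration.

```python
from fractions import Fraction


def npi_all_success_bounds(n, s, m):
    """NPI lower and upper probability that all m future trials are successes,
    given s successes in n observed trials (Equations (5) and (6))."""
    if not (0 <= s <= n and m >= 1):
        raise ValueError("need 0 <= s <= n and m >= 1")
    lower = Fraction(1)
    upper = Fraction(1)
    for i in range(1, m + 1):
        lower *= Fraction(s + i - 1, n + i)   # factor of Equation (5)
        upper *= Fraction(s + i, n + i)       # factor of Equation (6)
    return lower, upper


# Illustrative values: 2 successes in 4 trials, 2 future trials.
lower, upper = npi_all_success_bounds(4, 2, 2)   # Fraction(1, 5), Fraction(2, 5)
```

With no observed successes the lower probability is 0, and with no observed failures the upper probability is 1, which reflects how little the data then say about the event.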

Based on the general results by Coolen [7], Aboalkhair [19] derived formulae for the NPI lower and upper probabilities for the event that there are at least r successes in m future trials, given s successes observed in n trials. The NPI lower probability for this event is

\underset{̲}{P} (X_{n + 1}^{n + m} \geq r | X_{1}^{n} = s) = 1 - {\binom{n + m}{n}}^{- 1} \left[ \sum_{\ell = 0}^{r - 1} \binom{s + \ell - 1}{s - 1} \binom{n - s + m - \ell}{n - s} \right]

(7)

and the corresponding NPI upper probability is

\bar{P} (X_{n + 1}^{n + m} \geq r | X_{1}^{n} = s) = {\binom{n + m}{n}}^{- 1} \left[ \binom{s + r}{s} \binom{n - s + m - r}{n - s} + \sum_{\ell = r + 1}^{m} \binom{s + \ell - 1}{s - 1} \binom{n - s + m - \ell}{n - s} \right]

(8)
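Equations (7) and (8) translate directly into binomial coefficients. The sketch below is restricted to data with at least one observed success, an assumption made here only to avoid binomial coefficients with negative arguments; the function name `npi_at_least_bounds` is hypothetical.

```python
from math import comb


def npi_at_least_bounds(n, s, m, r):
    """NPI lower and upper probability of at least r successes in m future
    trials, given s successes in n trials (Equations (7) and (8)).

    Sketch restricted to 1 <= s <= n so that every binomial coefficient
    has non-negative arguments.
    """
    if not (1 <= s <= n and m >= 1 and 0 <= r <= m):
        raise ValueError("need 1 <= s <= n, m >= 1 and 0 <= r <= m")
    total = comb(n + m, n)

    def term(l):
        # summand shared by Equations (7) and (8)
        return comb(s + l - 1, s - 1) * comb(n - s + m - l, n - s)

    lower = 1 - sum(term(l) for l in range(r)) / total
    upper = (comb(s + r, s) * comb(n - s + m - r, n - s)
             + sum(term(l) for l in range(r + 1, m + 1))) / total
    return lower, upper
```

For r = m the two bounds reduce to the products in Equations (5) and (6), which gives a convenient consistency check.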

The nature of

A_{(n)}

results in NPI being a frequentist statistical methodology [4,5,20], which can be interpreted in a way similar to that of posterior predictive methods within Bayesian statistics but without any prior information being included [7,21]. Hill [20] provides detailed discussion of

A_{(n)}

including comparison with nonparametric Bayesian methods, and Hill [21] presented a formal justification of

A_{(n)}

within the Bayesian context. This justification, however, is a rather complicated splitting process under finite exchangeability. It is more natural to consider NPI, based on

A_{(n)}

, as a frequentist statistics methodology based on assumed exchangeability of all observations. The exchangeability assumption implies that all orderings of observations are equally likely, and

A_{(n)}

-based inference keeps this property for the orderings of future observations among data observations. This is a relatively weak assumption, but it excludes scenarios with known trends in data, e.g., time series, or other clear patterns in the data which would undermine the assumption that all orderings of the observations are equally likely.

4. NPI-Based Discrete-Time Survival Function

In this section, we introduce an NPI-based alternative to the actuarial estimator for the survival function. We introduce it in two steps: first, we derive the NPI lower and upper probabilities for the conditional survival function, and then we derive the corresponding survival function. In doing so, we need to consider the interdependence between the m future observations; however, we first need to introduce some notation.

Let

X_{1}, X_{2}, \dots, X_{n}

be positive, exchangeable, and discrete random variables that take values at discrete-time points

t_{j}

, where

j = 1, \dots, k

, with

t_{1} < t_{2} < \dots < t_{k}

. We define

n_{t_{0}} = n

for the start of the study, when all individuals are alive, and

t_{k + 1} = \infty

. Let

n_{t_{j}}

be the number of individuals known to be alive at time

t_{j}

. Also, let

d_{t_{j}}

represent the number of observed events at time

t_{j}

, and

c_{t_{j}}

represent the number of right-censored observations at time

t_{j}

. We assume that the censored observations occur at discrete times

t_{j}

, where

j = 1, 2, \dots, k

. Then, the number of individuals at risk at time

t_{j}

, denoted by

{\hat{n}}_{t_{j}}

, is computed by

{\hat{n}}_{t_{j}} = n_{t_{j - 1}} - c_{t_{j}}

. Therefore, the number of individuals at risk at time

t_{j}

,

{\hat{n}}_{t_{j}}

, cannot increase from one discrete time to the next. Furthermore, let

X_{l}^{t_{j}} > t_{j}

, for

l = 1, \dots, {\hat{n}}_{t_{j}}

, be the event times for the individuals in the risk set at time

t_{j}

.

Now, let us consider

X_{n + i}

for the time of event of the ith future individual, for

i = 1, 2, \dots, m

. We consider the event of interest that all m future observations

X_{n + i}

survive a specific discrete time

t_{j}

given that they survived the earlier discrete time

t_{j - 1}

. This event can be denoted as

⋂_{i = 1}^{m} {X_{n + i} > t_{j} | X_{n + i} > t_{j - 1}} .

We consider the survival of all m future observations at time

t_{j}

as exchangeable with the survival of the

{\hat{n}}_{t_{j}}

individuals in the risk set at that discrete time. So, we assume that the random quantities

X_{n + 1}, X_{n + 2}, \dots, X_{n + m}

, with respect to the event

X_{n + i} > t_{j}

,

i = 1, \dots, m

, are exchangeable with

X_{1}^{t_{j}}, X_{2}^{t_{j}}, \dots, X_{{\hat{n}}_{t_{j}}}^{t_{j}}

with respect to the event

X_{l}^{t_{j}} > t_{j}

for

l = 1, \dots, {\hat{n}}_{t_{j}}

, where

X_{l}^{t_{j}}

are the event times for the individuals in the risk set at time

t_{j}

.

The NPI lower and upper probabilities for the event

⋂_{i = 1}^{m} {X_{n + i} > t_{j} | X_{n + i} > t_{j - 1}}

can be derived by utilising NPI for Bernoulli data [7] via Equations (5) and (6), respectively. This can be performed by considering the number of individuals known to be alive at time

t_{j}

,

n_{t_{j}}

, out of the number of individuals at risk at time

t_{j}

,

{\hat{n}}_{t_{j}}

. Thus, the NPI lower probability for this event is

\underset{̲}{P} (⋂_{i = 1}^{m} {X_{n + i} > t_{j} | X_{n + i} > t_{j - 1}}) = \prod_{i = 1}^{m} \frac{n_{t_{j}} + i - 1}{{\hat{n}}_{t_{j}} + i}

(9)

and the corresponding NPI upper probability for this event is

\bar{P} (⋂_{i = 1}^{m} {X_{n + i} > t_{j} | X_{n + i} > t_{j - 1}}) = \prod_{i = 1}^{m} \frac{n_{t_{j}} + i}{{\hat{n}}_{t_{j}} + i}

(10)

We now consider the event that the m future observations will all exceed

t_{j}

, that is

⋂_{i = 1}^{m} {X_{n + i} > t_{j}}

. The NPI lower and upper probabilities for this event can be expressed in terms of the NPI conditional lower and upper probabilities in Equations (9) and (10), respectively, at the time points

t_{1}, t_{2}, \dots, t_{j}

, as follows

\begin{matrix} \underset{̲}{P} (⋂_{i = 1}^{m} {X_{n + i} > t_{j}}) = \prod_{ℓ = 1}^{j} (\prod_{i = 1}^{m} \frac{n_{t_{ℓ}} + i - 1}{{\hat{n}}_{t_{ℓ}} + i}) \end{matrix}

(11)

\begin{matrix} \bar{P} (⋂_{i = 1}^{m} {X_{n + i} > t_{j}}) = \prod_{ℓ = 1}^{j} (\prod_{i = 1}^{m} \frac{n_{t_{ℓ}} + i}{{\hat{n}}_{t_{ℓ}} + i}) \end{matrix}

(12)

For the special case when we have one future observation

X_{n + 1}

(i.e.,

m = 1

), the NPI lower and upper probabilities for the event

X_{n + 1} > t_{j}

can be directly calculated from Equations (11) and (12), respectively, as

\underset{̲}{P} (X_{n + 1} > t_{j}) = \prod_{l = 1}^{j} \frac{n_{t_{l}}}{{\hat{n}}_{t_{l}} + 1}

(13)

\bar{P} (X_{n + 1} > t_{j}) = \prod_{l = 1}^{j} \frac{n_{t_{l}} + 1}{{\hat{n}}_{t_{l}} + 1}

(14)
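Equations (11) and (12), with Equations (13) and (14) as the special case of one future observation, can be sketched as a single pass over the time points. The sketch assumes that the number known to be alive at a time point equals the number at risk minus the observed events at that time point, and that all n individuals are alive at the start; the function name `npi_survival_bounds` and the data are hypothetical.

```python
def npi_survival_bounds(n, deaths, censored, m=1):
    """Lower and upper probability that all m future observations exceed
    t_j, for j = 1..k (Equations (11) and (12))."""
    bounds = []
    alive = n                    # number alive at the start of the study
    lower = upper = 1.0
    for d, c in zip(deaths, censored):
        at_risk = alive - c      # at risk at t_j: alive at t_{j-1} minus censored at t_j
        alive = at_risk - d      # known to be alive at t_j (assumption stated above)
        for i in range(1, m + 1):
            lower *= (alive + i - 1) / (at_risk + i)
            upper *= (alive + i) / (at_risk + i)
        bounds.append((lower, upper))
    return bounds


# Hypothetical data: 10 individuals, three discrete time points.
bounds_m1 = npi_survival_bounds(10, deaths=[1, 2, 1], censored=[0, 1, 2], m=1)
```

Each entry of the returned list is a pair of lower and upper probabilities for one time point, so the widening of the interval over time can be read off directly.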

The NPI lower and upper probabilities for the event

⋂_{i = 1}^{m} {X_{n + i} > t_{j}}

, as presented in Equations (11) and (12), take into account the dependence among all these future observations when there is limited information in the form of n observations in the data set. It is of interest to see the effect of taking this dependence carefully into account. For this reason, we will compare our method with the results one would obtain if, mistakenly, when interested in m future observations, one would use the NPI lower and upper probabilities for the event

X_{n + 1} > t_{j}

, presented in Equations (13) and (14), raised to the power of m, i.e.,

{[\underset{̲}{P}, \bar{P}]}^{m} (X_{n + 1} > t_{j})

. This will be demonstrated in Example 2 by studying the impact of ignoring the interdependence between the m future observations, but first, Example 1 is provided to demonstrate our method.
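The effect of ignoring the dependence can be checked numerically with a small sketch that computes both the joint bounds of Equations (11) and (12) and the single-observation bounds of Equations (13) and (14) raised to the power m. The function name `joint_vs_naive_bounds` and the data are hypothetical, and the sketch again assumes that the number alive at a time point is the number at risk minus the observed events.

```python
def joint_vs_naive_bounds(n, deaths, censored, m):
    """Return ((joint_lower, joint_upper), (naive_lower, naive_upper)) at the
    last time point: Equations (11)-(12) versus Equations (13)-(14) to the power m."""
    joint_lo = joint_up = single_lo = single_up = 1.0
    alive = n
    for d, c in zip(deaths, censored):
        at_risk = alive - c
        alive = at_risk - d
        single_lo *= alive / (at_risk + 1)           # Equation (13)
        single_up *= (alive + 1) / (at_risk + 1)     # Equation (14)
        for i in range(1, m + 1):
            joint_lo *= (alive + i - 1) / (at_risk + i)   # Equation (11)
            joint_up *= (alive + i) / (at_risk + i)       # Equation (12)
    return (joint_lo, joint_up), (single_lo ** m, single_up ** m)


# Hypothetical data: 10 individuals, three time points, m = 5 future observations.
joint, naive = joint_vs_naive_bounds(10, [1, 2, 1], [0, 1, 2], m=5)
```

Because the number alive never exceeds the number at risk, each factor of the joint products is at least as large as the corresponding single-observation factor, so the joint bounds are never below the naive ones.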

Example 1.

We will start with a simple example involving

n = 9

observations, which are available at discrete times

t_{j}

, for

j = 1, 2, 3, 4

. At each time point, we have the number of observed events,

d_{t_{j}}

; the number of censored individuals,

c_{t_{j}}

; the number of individuals known to be alive at time

t_{j}

,

n_{t_{j}}

; and the number of individuals at risk at time

t_{j}

,

{\hat{n}}_{t_{j}}

. It is important to note that

{\hat{n}}_{t_{j}}

is computed differently than

{\overset{`}{n}}_{t_{j}}

, for example,

{\hat{n}}_{t_{2}} = 7

but

{\overset{`}{n}}_{t_{2}} = 8

(see Section 2). The data are shown in the first six columns in Table 1.

The discrete-time hazard function,

h_{t_{j}}

, at a discrete time

t_{j}

can be estimated by using the actuarial estimator with Equation (2). Then, the estimated probability of surviving

t_{j}

, for

j = 1, 2, 3, 4

, is derived using Equation (3). These results are presented in the seventh and eighth columns of Table 1.

Next, we apply the NPI alternative to the actuarial estimator, leading to the NPI lower and upper probabilities for the event

⋂_{i = 1}^{m} {X_{9 + i} > t_{j}}

, as given by Equations (11) and (12), respectively. These are calculated for the discrete-time points

t_{1}

,

t_{2}

,

t_{3}

, and

t_{4}

, for different numbers of future observations, i.e., for

m \in {1, 3, 10, 15}

. It is worth noting that at the start of the study at time

t_{0}

, no events or censorings have been recorded, so

\underset{̲}{P} (⋂_{i = 1}^{m} {X_{9 + i} > t_{0}}) = \bar{P} (⋂_{i = 1}^{m} {X_{9 + i} > t_{0}}) = 1

.

Based on the results in Table 1, we observe that the difference between the NPI upper probability and the NPI lower probability is quite small at time

t_{1}

for all considered numbers of future observations and becomes larger later on. This increase in difference is influenced by two effects: fewer individuals in the risk set

{\hat{n}}_{t_{j}}

at later times

t_{2}

,

t_{3}

, and

t_{4}

, and the products of lower and upper probabilities are taken such that each term (i.e., time point) adds to the imprecision.

When we compare the results from our proposed method for

m = 1

future observation with those resulting from estimating the survival function based on the actuarial estimator, we find that the

{\hat{S}}_{t_{j}}

values, based on using the actuarial estimator, fall between our NPI lower and upper probabilities for

X_{10} > t_{j}

, but they are closer to the upper probability values.

Example 2.

The dataset used in this example was also utilised by Berkson and Gage [22] as well as by Lawless [23] and Yan [15]. It comprises 374 observations, of which 95 are right-censored and the remainder are event times recorded at 10 discrete time points (in years). The dataset is summarised in the first three columns of Table 2.

By using Equations (11) and (12), we obtain the NPI lower and upper probabilities for the event

⋂_{i = 1}^{m} {X_{n + i} > t_{j}}

for

m \in {1, 2, 3, 10}

future observations at specific time points. The results are summarised in Table 2.

To understand the impact of considering the dependence among future observations, we compare our results

[\underset{̲}{P}, \bar{P}] (⋂_{i = 1}^{5} {X_{374 + i} > t_{j}})

for five future observations with those obtained by erroneously considering only the first future observation

(X_{375} > t_{j})

raised to the power of 5, i.e.,

{[\underset{̲}{P}, \bar{P}]}^{5} (X_{375} > t_{j})

. Due to the positive dependence among the future observations,

X_{375}

,

X_{376}

,

X_{377}

,

X_{378}

, and

X_{379}

, our correct NPI lower and upper probabilities for the event

⋂_{i = 1}^{5} {X_{374 + i} > t_{j}}

are greater than those obtained using the mistaken approach (taking the lower and upper probabilities for

X_{375} > t_{j}

raised to the power of 5). Although the imprecisions (differences between the upper and lower probabilities) are small, they would become more noticeable for more than five future observations due to the positive dependence among all future observations.

5. NPI-Based Discrete-Time Reliability Function

In this section, we introduce NPI lower and upper probabilities for the event that at least x out of m future observations will survive at discrete time

t_{j}

. Let

N_{t_{j}}

denote the number of future observations out of m that survive at discrete time

t_{j}

. Given

{\hat{n}}_{t_{j}}

Bernoulli trials, with

{\hat{n}}_{t_{j}} - d_{t_{j}}

observations surviving at time

t_{j}

, we aim to derive the NPI lower and upper probabilities for the event

N_{t_{j}} \geq x

, where x can take values in the set

{0, 1, \dots, m}

.

The NPI upper probability for the event

N_{t_{j}} \geq x

is derived by utilising Equation (8), as

\bar{P} (N_{t_{j}} \geq x) = \sum_{y = x}^{m} \bar{P} (N_{t_{j}} \geq x | N_{t_{j - 1}} = y) [\bar{P} (N_{t_{j - 1}} \geq y) - \bar{P} (N_{t_{j - 1}} \geq y + 1)]

(15)

The terms on the right-hand side of Equation (15) are all derived by applying Equation (8), where for the terms involving

N_{t_{j - 1}}

the data consist of

{\hat{n}}_{t_{j - 1}}

Bernoulli trials, with

{\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}

observations surviving at time

t_{j - 1}

, leading to

\begin{aligned} & \bar{P} (N_{t_{j - 1}} \geq y) - \bar{P} (N_{t_{j - 1}} \geq y + 1) = \\ & \quad {\binom{{\hat{n}}_{t_{j - 1}} + m}{{\hat{n}}_{t_{j - 1}}}}^{- 1} \binom{({\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}) + y}{{\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}} \binom{{\hat{n}}_{t_{j - 1}} - ({\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}) - 1 + m - y}{{\hat{n}}_{t_{j - 1}} - ({\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}) - 1} \end{aligned}

(16)

where

y \in {0, 1, \dots, m}

denotes the number of future observations that survive at the previous discrete time. It is important to point out that for the case

y = m

, the NPI upper probability for the event

N_{t_{j - 1}} \geq y + 1

is equal to 0.

The NPI lower probability for the event

N_{t_{j}} \geq x

, for

x \in {0, 1, \dots, m}

is derived by utilising Equation (7), as

\underset{̲}{P} (N_{t_{j}} \geq x) = \sum_{y = x}^{m} \underset{̲}{P} (N_{t_{j}} \geq x | N_{t_{j - 1}} = y) [\underset{̲}{P} (N_{t_{j - 1}} \geq y) - \underset{̲}{P} (N_{t_{j - 1}} \geq y + 1)]

(17)

The terms on the right-hand side of Equation (17) are all derived by applying Equation (7), where for the terms involving

N_{t_{j - 1}}

the data consist of

{\hat{n}}_{t_{j - 1}}

Bernoulli trials, with

{\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}

observations surviving at time

t_{j - 1}

, leading to

\begin{aligned} & \underset{̲}{P} (N_{t_{j - 1}} \geq y) - \underset{̲}{P} (N_{t_{j - 1}} \geq y + 1) = \\ & \quad {\binom{{\hat{n}}_{t_{j - 1}} + m}{{\hat{n}}_{t_{j - 1}}}}^{- 1} \binom{({\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}) + y - 1}{({\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}) - 1} \binom{{\hat{n}}_{t_{j - 1}} - ({\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}}) + m - y}{{\hat{n}}_{t_{j - 1}} - ({\hat{n}}_{t_{j - 1}} - d_{t_{j - 1}})} \end{aligned}

(18)

where

y \in {0, 1, \dots, m}

denotes the number of future observations that survive at the previous discrete time. It should be remarked that the NPI lower probability for the event

N_{t_{j - 1}} \geq y + 1

for the case

y = m

is equal to 0.

It is worth noting that the lower and upper probabilities for the event

N_{t_{j}} \geq x

when

x = 0

are both equal to 1, regardless of the values of y. Therefore, only the results for

x \in \{1, \dots, m\}

will be reported hereafter.
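The bracketed weights in Equations (15) and (17) are differences of successive lower or upper probabilities, so they can be obtained numerically from Equations (7) and (8) without any closed form. The sketch below does this for a single time point, with n standing for the number at risk and s for the number surviving at that time point. It is restricted to at least one survivor, and the function names are hypothetical.

```python
from math import comb


def at_least_bounds(n, s, m, r):
    """Lower and upper probability of at least r survivors among m future
    observations (Equations (7) and (8)); both are 0 when r > m."""
    if r > m:
        return 0.0, 0.0
    total = comb(n + m, n)

    def term(l):
        return comb(s + l - 1, s - 1) * comb(n - s + m - l, n - s)

    lower = 1 - sum(term(l) for l in range(r)) / total
    upper = (comb(s + r, s) * comb(n - s + m - r, n - s)
             + sum(term(l) for l in range(r + 1, m + 1))) / total
    return lower, upper


def successive_differences(n, s, m):
    """Differences P(N >= y) - P(N >= y + 1) for y = 0..m, computed separately
    for the lower and the upper probabilities (requires 1 <= s <= n)."""
    lower_w, upper_w = [], []
    for y in range(m + 1):
        lo_y, up_y = at_least_bounds(n, s, m, y)
        lo_next, up_next = at_least_bounds(n, s, m, y + 1)
        lower_w.append(lo_y - lo_next)
        upper_w.append(up_y - up_next)
    return lower_w, upper_w


# Illustrative values: 4 at risk, 2 survivors, 2 future observations.
lower_w, upper_w = successive_differences(4, 2, 2)
```

Each list of differences sums to 1, because the probability of at least zero survivors is 1 and the probability of more than m survivors is 0. This gives a quick check on any closed-form expression for the weights.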

Example 3.

In this example, we will illustrate the method presented in Section 5 using a simple example involving nine observations, available at discrete times

t_{j}

for

j = 1, 2, 3, 4

(data are summarized in Table 3).

Table 3 shows the NPI lower and upper probabilities for the event

N_{t_{j}} \geq x | N_{t_{j - 1}} = y

, where

x \in {0, 1, 2, 3}

and

y \in {0, 1, 2, 3}

, with

x \leq y

. For

x = 0

, the NPI lower and upper probabilities are equal to 1 for all

y \in {0, 1, 2, 3}

and at all

t_{j}

, because the event that at least zero out of y future observations survive is certain at any discrete time

t_{j}

. Note that some cells in Table 3 are empty because the probabilities are only calculated for x not exceeding y, that is, for the event that at least x out of y future observations will survive at discrete time

t_{j}

. From Table 3, we can also observe that at a specific discrete time

t_{j}

, the NPI lower and upper probabilities decrease in x when everything else is constant and increase in y when everything else is constant.

Meanwhile, Table 4 presents the NPI lower and upper probabilities for the event

N_{t_{j}} \geq x

for

x \in {1, 2, 3}

future observations; again, the lower and upper probabilities are equal to 1 when

x = 0

. From Table 4, we can see that the difference between the NPI lower and upper probabilities decreases in x while everything else is held constant at each discrete time

t_{j}

. Without any further added assumptions, the values of the NPI lower probabilities at

t_{4}

are 0 for

x \in {1, 2, 3}

, whereas the NPI upper probabilities are positive.

6. Application to System Reliability Using Survival Signatures

In Section 5, we derived the NPI lower and upper probabilities for the event that at least x out of m future observations will survive at discrete time

t_{j}

. In this section, we will utilise these probabilities to assess system reliability, considering single or multiple types of components, using survival signatures. Essentially, the results from Section 5 will be employed to derive lower and upper probabilities for the discrete-time system reliability event

T_{S} > t_{j}

, where

T_{S}

denotes the random failure time of the system. We will combine the concept of survival signature [16,24] with the proposed method in Section 5. First, we will give a brief overview of survival signatures, where the values of the survival signatures are assumed to be given. Then, we will demonstrate the application of the proposed methods to the reliability of some discrete-time systems with both single and multiple types of components.

6.1. The Survival Signature

The signature was introduced to evaluate the reliability of systems consisting of only one type of component; it models the structure of a system separately from the random failure times of the components [17]. The NPI method is used to learn about the components within the system, based on data consisting of failure times for components that are exchangeable with those within the system. We therefore assume that such data are available, for example from testing or previous use of the components [16,17]. In the literature, the assumption of exchangeability is often replaced by the stronger assumption of independent and identically distributed

(iid)

component failure times [25]. For a system consisting of m components with exchangeable failure times, Samaniego [26,27] introduced the system signature as a tool for reliability assessment for systems consisting of components of a single type. However, the use of signatures becomes very complicated when quantifying the reliability of systems with multiple types of components. Coolen and Coolen-Maturi [24] introduced an alternative concept called the ‘survival signature’, which generalises the signature to systems with multiple types of components. When quantifying the reliability of systems with only one type of component, the survival signature is closely related to the signature [16,24]. The NPI methodology has been introduced for system reliability using the survival signature via lower and upper survival functions for the failure time

T_{S}

of a system consisting of multiple types of components [16], combined with NPI for Bernoulli data [7].

For a system with m exchangeable components, we need to consider the state vector

\underset{̲}{x} = (x_{1}, x_{2}, \dots, x_{m}) \in {0, 1}^{m}

taking into account that for each i, if the ith component functions, then

x_{i} = 1

, otherwise

x_{i} = 0

when the ith component does not function. For all possible state vectors

\underset{̲}{x}

, the following structure function is defined as

ϕ : {0, 1}^{m} \to {0, 1}

, so that

ϕ (\underset{̲}{x}) = 1

if the system functions and

ϕ (\underset{̲}{x}) = 0

if the system does not function. Throughout this section, the system is assumed to be coherent, which means that the structure function

ϕ (\underset{̲}{x})

must not be decreasing in any of the components of

\underset{̲}{x}

, and this leads to the fact that the functioning of the system cannot be improved by worse performance of one or more of its components. Furthermore, we assume that the system functions if all its components function, so

ϕ (1) = 1

, and the system fails if all its components fail, so

ϕ (0) = 0

.

For a system consisting only of m exchangeable components, the survival signature, denoted by

Φ (l)

, for

l = 1, \dots, m

, is defined as the probability that the system functions given that precisely l of its components function [24]. For coherent systems,

Φ (l)

is an increasing function of l, and we assume that

Φ (0) = 0

and

Φ (m) = 1

. There are

(\binom{m}{l})

state vectors

\underset{̲}{x}

with precisely l components

x_{i} = 1

, so with

\sum_{i = 1}^{m} x_{i} = l

; the set of these state vectors is denoted by

S_{l}

. Under the iid assumption for the failure times of the m components, all these state vectors are equally likely to occur [24]. Thus, the survival signature

Φ (l)

can be computed as follows [24]

Φ (l) = {(\binom{m}{l})}^{- 1} \sum_{\underset{̲}{x} \in S_{l}} ϕ (\underset{̲}{x})

(19)
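Equation (19) can be evaluated by enumerating all state vectors of a small system. The following sketch uses a hypothetical three-component system (component 1 in series with a parallel pair formed by components 2 and 3); the function names are illustrative only.

```python
from itertools import product
from math import comb


def survival_signature(m, phi):
    """Survival signature for l = 0..m of a system with m exchangeable
    components and structure function phi (Equation (19))."""
    working = [0] * (m + 1)
    for x in product((0, 1), repeat=m):
        working[sum(x)] += phi(x)      # count functioning states with l ones
    return [working[l] / comb(m, l) for l in range(m + 1)]


def series_parallel(x):
    """Hypothetical structure: component 1 in series with (2 parallel 3)."""
    return int(x[0] and (x[1] or x[2]))


signature = survival_signature(3, series_parallel)   # [0.0, 0.0, 2/3, 1.0]
```

Full enumeration visits 2^m state vectors, so this approach is only practical for small systems.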

Let

C (t) \in {0, 1, \dots, m}

represent the number of components in the single-type system that function at time

t > 0

. So, the probability that the system functions at time

t > 0

is

P (T_{S} > t) = \sum_{l = 0}^{m} Φ (l) P (C (t) = l)

(20)
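To illustrate Equation (20), the sketch below assumes precise iid component reliabilities, so that the number of functioning components follows a binomial distribution. This is a simplifying assumption made only for the illustration and is not the NPI approach of this paper. The signature values belong to a hypothetical three-component system with component 1 in series with a parallel pair.

```python
from math import comb


def system_survival(signature, p):
    """P(T_S > t) from Equation (20), assuming each of the m components
    functions at time t independently with probability p."""
    m = len(signature) - 1
    return sum(
        phi_l * comb(m, l) * p ** l * (1 - p) ** (m - l)
        for l, phi_l in enumerate(signature)
    )


# Survival signature of the hypothetical series-parallel system, l = 0..3.
signature = [0.0, 0.0, 2 / 3, 1.0]
reliability = system_survival(signature, 0.9)
```

For this structure the result agrees with the direct calculation p(1 - (1 - p)^2), which shows how the signature separates the system structure from the component behaviour.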

For a system consisting of

K \geq 2

types of components, the survival signature, denoted by

Φ (l_{1}, \dots, l_{K})

for

l_{k} = 0, \dots, m_{k}

, is defined as the probability that a system functions given that precisely

l_{k}

of its components of type k function, for each

k \in {1, 2, \dots, K}

. There are

(\binom{m_{k}}{l_{k}})

state vectors

{\underset{̲}{x}}^{k}

with precisely

l_{k}

of its

m_{k}

components

x_{i}^{k} = 1

; so, with

\sum_{i = 1}^{m_{k}} x_{i}^{k} = l_{k}

, we denote the set of these state vectors for components of type k by

S_{l}^{k}

. In addition, let

S_{l_{1}, \dots, l_{K}}

denote the set of these state vectors for the whole system for which

\sum_{i = 1}^{m_{k}} x_{i}^{k} = l_{k}

,

k \in {1, 2, \dots, K}

. Under the

iid

assumption for the failure times of the

m_{k}

components of type k, all these state vectors

{\underset{̲}{x}}^{k} \in S_{l}^{k}

are equally likely to occur. Thus, the survival signature

Φ (l_{1}, \dots, l_{K})

can be computed as follows [24]:

Φ (l_{1}, \dots, l_{K}) = [\prod_{k = 1}^{K} {(\binom{m_{k}}{l_{k}})}^{- 1}] \times \sum_{\underset{̲}{x} \in S_{l_{1}, \dots, l_{K}}} ϕ (\underset{̲}{x})

(21)
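
Equation (21) can likewise be evaluated by enumeration, choosing which components of each type function. The sketch below assumes a hypothetical 4-component system with two types, (1 parallel 3) in series with (2 parallel 4), where components 1 and 2 are of type 1 and components 3 and 4 are of type 2; all names are illustrative.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb

def survival_signature_multi(phi, types):
    """Equation (21); types[i] is the 0-based type label of component i."""
    K = max(types) + 1
    # Positions of the components of each type, so m_k = len(members[k]).
    members = [[i for i, t in enumerate(types) if t == k] for k in range(K)]
    signature = {}
    for ls in product(*(range(len(mk) + 1) for mk in members)):
        total = 0
        # Choose which l_k components of each type function.
        for choice in product(*(combinations(mk, l) for mk, l in zip(members, ls))):
            working = set().union(*choice)
            x = tuple(1 if i in working else 0 for i in range(len(types)))
            total += phi(x)
        count = 1
        for mk, l in zip(members, ls):
            count *= comb(len(mk), l)
        signature[ls] = Fraction(total, count)
    return signature

def phi(x):
    # Hypothetical system: (1 parallel 3) in series with (2 parallel 4).
    return max(x[0], x[2]) * max(x[1], x[3])

sig = survival_signature_multi(phi, [0, 0, 1, 1])
print(sig[(1, 1)])  # 1/2
```

With one functioning component of each type, the system works only if the two functioning components sit in different parallel pairs, which happens in 2 of the 4 equally likely configurations.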

Let

C_{k} (t) \in {0, 1, \dots, m_{k}}

represent the number of components of type k in the system which function at time

t > 0

. So, the probability that the system functions at time

t > 0

is

P (T_{S} > t) = \sum_{l_{1} = 0}^{m_{1}} \dots \sum_{l_{K} = 0}^{m_{K}} Φ (l_{1}, \dots, l_{K}) P (⋂_{k = 1}^{K} {C_{k} (t) = l_{k}})

(22)

Assuming that the failure times of components of different types are independent, while failure times of components of the same type are assumed to be exchangeable [16], the survival function for

T_{S}

can be written as

P (T_{S} > t) = \sum_{l_{1} = 0}^{m_{1}} \dots \sum_{l_{K} = 0}^{m_{K}} Φ (l_{1}, \dots, l_{K}) \prod_{k = 1}^{K} P ({C_{k} (t) = l_{k}})

(23)
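
The product form in Equation (23) can be illustrated numerically. The sketch below assumes, purely for illustration, that components of type k function independently with probability p_k at time t, so that each C_k(t) is binomial. The signature is that of a hypothetical system, (1 parallel 3) in series with (2 parallel 4), with components 1, 2 of type 1 and components 3, 4 of type 2; the probabilities 0.9 and 0.8 are arbitrary.

```python
from itertools import product
from math import comb

def binom_pmf(l, m, p):
    return comb(m, l) * p**l * (1 - p) ** (m - l)

def system_survival_multi(signature, m, p):
    """Equation (23): m[k] components of type k, each functioning with probability p[k]."""
    total = 0.0
    for ls in product(*(range(mk + 1) for mk in m)):
        weight = 1.0
        for l, mk, pk in zip(ls, m, p):
            weight *= binom_pmf(l, mk, pk)
        total += signature[ls] * weight
    return total

# Survival signature of the hypothetical two-type system described above.
SIG = {(0, 0): 0, (0, 1): 0, (0, 2): 1,
       (1, 0): 0, (1, 1): 0.5, (1, 2): 1,
       (2, 0): 1, (2, 1): 1, (2, 2): 1}

p1, p2 = 0.9, 0.8
via_signature = system_survival_multi(SIG, (2, 2), (p1, p2))
# Direct calculation: each parallel pair contains one component of each type.
direct = (1 - (1 - p1) * (1 - p2)) ** 2
print(via_signature, direct)
```

Both values should equal 0.9604 up to floating-point rounding.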

6.2. Discrete-Time System Reliability

In this section, we will apply the proposed method to system reliability in the case of discrete time. We will combine the concept of the survival signature

Φ (l)

(as reviewed in Section 6.1) with the results obtained in Section 5 to present lower and upper probabilities for the event

T_{S} > t_{j}

for systems consisting of a single type or of multiple types of components. These lower and upper probabilities bound the system survival function at the discrete-time points

t_{j}

.

At a specific time

t_{j}

, let

{\hat{n}}_{t_{j}}

represent the number of components for which test failure data are available, and let

d_{t_{j}}

represent the number of components that failed at time

t_{j}

. Therefore,

{\hat{n}}_{t_{j}} - d_{t_{j}}

is the number of components of this type that are still functioning at time

t_{j}

[16,17]. Additionally, let

N_{t_{j}} \in {0, 1, \dots, m}

denote the number of components in the system out of m that are still functioning at a discrete time

t_{j}

.

We obtain the NPI lower and upper probabilities for the event that

T_{S} > t_{j}

for a system consisting of a single type of components, using the survival signature

Φ (l)

combined with the proposed method in Section 5 as follows

\underset{̲}{P} (T_{S} > t_{j}) = \sum_{ℓ = 0}^{m} Φ (ℓ) \bar{D} (N_{t_{j}} = ℓ)

(24)

and

\bar{P} (T_{S} > t_{j}) = \sum_{ℓ = 0}^{m} Φ (ℓ) \underset{̲}{D} (N_{t_{j}} = ℓ)

(25)

where

\bar{D} (N_{t_{j}} = ℓ)

and

\underset{̲}{D} (N_{t_{j}} = ℓ)

are derived from Equations (16) and (18), respectively, so

\begin{matrix} \bar{D} (N_{t_{j}} = ℓ) & = \underset{̲}{P} (N_{t_{j}} \geq ℓ) - \underset{̲}{P} (N_{t_{j}} \geq ℓ + 1) \\ & = {(\binom{{\hat{n}}_{t_{j}} + m}{{\hat{n}}_{t_{j}}})}^{- 1} (\binom{({\hat{n}}_{t_{j}} - d_{t_{j}}) + ℓ - 1}{({\hat{n}}_{t_{j}} - d_{t_{j}}) - 1}) (\binom{{\hat{n}}_{t_{j}} - ({\hat{n}}_{t_{j}} - d_{t_{j}}) + m - ℓ}{{\hat{n}}_{t_{j}} - ({\hat{n}}_{t_{j}} - d_{t_{j}})}) \end{matrix}

(26)

and

\begin{matrix} \underset{̲}{D} (N_{t_{j}} = ℓ) & = \bar{P} (N_{t_{j}} \geq ℓ) - \bar{P} (N_{t_{j}} \geq ℓ + 1) \\ & = {(\binom{{\hat{n}}_{t_{j}} + m}{{\hat{n}}_{t_{j}}})}^{- 1} (\binom{({\hat{n}}_{t_{j}} - d_{t_{j}}) + ℓ}{({\hat{n}}_{t_{j}} - d_{t_{j}})}) (\binom{{\hat{n}}_{t_{j}} - ({\hat{n}}_{t_{j}} - d_{t_{j}}) + m - ℓ - 1}{{\hat{n}}_{t_{j}} - ({\hat{n}}_{t_{j}} - d_{t_{j}}) - 1}) \end{matrix}

(27)
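
The closed-form binomial expressions in Equations (26) and (27), combined with Equations (24) and (25), can be sketched in a few lines of Python. The function names are hypothetical. The helper `gbinom` uses the boundary convention that the binomial coefficient of (-1, -1) equals 1; this is an assumption made here to cover the cases where no test component survives or none fails at the time point, and it is chosen so that the probability masses sum to one. The inputs are the survival signature of Example 4 and the first and last rows for n = 10 in Table 5.

```python
from math import comb

def gbinom(a, b):
    """Binomial coefficient with the assumed boundary convention C(-1, -1) = 1."""
    if a == -1 and b == -1:
        return 1
    if b < 0 or b > a:
        return 0
    return comb(a, b)

def d_bar(l, m, n_hat, d):
    """Closed form of Equation (26), used for the lower probability."""
    s = n_hat - d  # test components still functioning at t_j; note n_hat - s = d
    return gbinom(s + l - 1, s - 1) * gbinom(d + m - l, d) / comb(n_hat + m, n_hat)

def d_under(l, m, n_hat, d):
    """Closed form of Equation (27), used for the upper probability."""
    s = n_hat - d
    return gbinom(s + l, s) * gbinom(d + m - l - 1, d - 1) / comb(n_hat + m, n_hat)

def npi_system_bounds(signature, n_hat, d):
    """Equations (24) and (25) for a system with a single type of components."""
    m = len(signature) - 1
    lower = sum(signature[l] * d_bar(l, m, n_hat, d) for l in range(m + 1))
    upper = sum(signature[l] * d_under(l, m, n_hat, d) for l in range(m + 1))
    return lower, upper

PHI = [0, 0, 0.6, 0.9, 1, 1]  # survival signature given in Example 4

lower1, upper1 = npi_system_bounds(PHI, n_hat=10, d=2)  # t_1, n = 10
lower5, upper5 = npi_system_bounds(PHI, n_hat=1, d=1)   # t_5, n = 10
print(round(lower1, 4), round(upper1, 4))  # 0.8811 0.9426
print(round(lower5, 4), round(upper5, 4))  # 0.0 0.5833
```

These values agree with the corresponding entries of Table 5, which suggests the sketch matches the formulas as intended, including the boundary case in which no test component is still functioning.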

We now consider a system consisting of

K \geq 2

types of components with

m_{k}

components of type

k \in {1, 2, \dots, K}

, with

\sum_{k = 1}^{K} m_{k} = m

. For a specific time

t_{j}

, let

{\hat{n}}_{t_{j}}^{k}

denote the number of components of type k for which test failure data are available, and let

d_{t_{j}}^{k}

denote the number of components of type k that failed at time

t_{j}

; therefore,

{\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k}

is the number of components of type k that are still functioning at time

t_{j}

[16,17]. The failure times of components of different types are assumed to be independent, while failure times of components of the same type are assumed to be exchangeable [16]. Let

N_{t_{j}}^{k} \in {0, 1, \dots, m_{k}}

denote the number of components of type k in the system out of

m_{k}

that are still functioning at a discrete time

t_{j}

,

k = 1, 2, \dots, K

.

The NPI lower and upper probabilities for the event

T_{S} > t_{j}

of a system consisting of multiple types of components using the survival signature

Φ (ℓ_{1}, \dots, ℓ_{K})

combined with the proposed method in Section 5 are as follows

\underset{̲}{P} (T_{S} > t_{j}) = \sum_{ℓ_{1} = 0}^{m_{1}} \dots \sum_{ℓ_{K} = 0}^{m_{K}} Φ (ℓ_{1}, \dots, ℓ_{K}) \prod_{k = 1}^{K} \bar{D} (N_{t_{j}}^{k} = ℓ_{k})

(28)

and

\bar{P} (T_{S} > t_{j}) = \sum_{ℓ_{1} = 0}^{m_{1}} \dots \sum_{ℓ_{K} = 0}^{m_{K}} Φ (ℓ_{1}, \dots, ℓ_{K}) \prod_{k = 1}^{K} \underset{̲}{D} (N_{t_{j}}^{k} = ℓ_{k})

(29)

where

\bar{D} (N_{t_{j}}^{k} = ℓ_{k})

and

\underset{̲}{D} (N_{t_{j}}^{k} = ℓ_{k})

for

ℓ_{k} \in {0, 1, \dots, m_{k}}

are derived from Equations (16) and (18), respectively, thus

\begin{matrix} \bar{D} (N_{t_{j}}^{k} = ℓ_{k}) & = \underset{̲}{P} (N_{t_{j}}^{k} \geq ℓ_{k}) - \underset{̲}{P} (N_{t_{j}}^{k} \geq ℓ_{k} + 1) \\ & = {(\binom{{\hat{n}}_{t_{j}}^{k} + m_{k}}{{\hat{n}}_{t_{j}}^{k}})}^{- 1} (\binom{({\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k}) + ℓ_{k} - 1}{({\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k}) - 1}) (\binom{{\hat{n}}_{t_{j}}^{k} - ({\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k}) + m_{k} - ℓ_{k}}{{\hat{n}}_{t_{j}}^{k} - ({\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k})}) \end{matrix}

(30)

and

\begin{matrix} \underset{̲}{D} (N_{t_{j}}^{k} = ℓ_{k}) & = \bar{P} (N_{t_{j}}^{k} \geq ℓ_{k}) - \bar{P} (N_{t_{j}}^{k} \geq ℓ_{k} + 1) \\ & = {(\binom{{\hat{n}}_{t_{j}}^{k} + m_{k}}{{\hat{n}}_{t_{j}}^{k}})}^{- 1} (\binom{({\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k}) + ℓ_{k}}{({\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k})}) (\binom{{\hat{n}}_{t_{j}}^{k} - ({\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k}) + m_{k} - ℓ_{k} - 1}{{\hat{n}}_{t_{j}}^{k} - ({\hat{n}}_{t_{j}}^{k} - d_{t_{j}}^{k}) - 1}) \end{matrix}

(31)
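
For multiple component types, Equations (28) and (29) weight each survival signature value by a product of per-type probability masses. The following sketch is one way to compute this; the function names are hypothetical, and the convention that the binomial coefficient of (-1, -1) equals 1 is an assumption made here for the boundary cases. The signature is taken from Table 6 and the data from the rows for the first and third time points in Table 7.

```python
from fractions import Fraction
from itertools import product
from math import comb

def gbinom(a, b):
    """Binomial coefficient with the assumed boundary convention C(-1, -1) = 1."""
    if a == -1 and b == -1:
        return 1
    if b < 0 or b > a:
        return 0
    return comb(a, b)

def d_bar(l, m, n_hat, d):
    """Closed form of Equation (30) for one component type."""
    s = n_hat - d  # test components of this type still functioning at t_j
    return gbinom(s + l - 1, s - 1) * gbinom(d + m - l, d) / comb(n_hat + m, n_hat)

def d_under(l, m, n_hat, d):
    """Closed form of Equation (31) for one component type."""
    s = n_hat - d
    return gbinom(s + l, s) * gbinom(d + m - l - 1, d - 1) / comb(n_hat + m, n_hat)

def npi_bounds_multi(signature, m, data):
    """Equations (28) and (29); m[k] = m_k and data[k] = (n_hat, d) for type k."""
    lower = upper = 0.0
    for ls in product(*(range(mk + 1) for mk in m)):
        w_low = w_up = 1.0
        for l, mk, (n_hat, d) in zip(ls, m, data):
            w_low *= d_bar(l, mk, n_hat, d)
            w_up *= d_under(l, mk, n_hat, d)
        lower += float(signature[ls]) * w_low
        upper += float(signature[ls]) * w_up
    return lower, upper

# Survival signature from Table 6 (m_1 = m_2 = 3).
PHI = {(l1, l2): Fraction(0) for l1 in range(4) for l2 in range(4)}
PHI.update({(1, 2): Fraction(1, 9), (1, 3): Fraction(3, 9),
            (2, 2): Fraction(4, 9), (2, 3): Fraction(6, 9)})
PHI.update({(3, l2): Fraction(1) for l2 in range(4)})

lower1, upper1 = npi_bounds_multi(PHI, (3, 3), [(9, 2), (10, 3)])  # t_1 in Table 7
lower3, upper3 = npi_bounds_multi(PHI, (3, 3), [(2, 2), (2, 2)])   # t_3 in Table 7
print(round(lower1, 4), round(upper1, 4))  # 0.55 0.7118
print(round(lower3, 4), round(upper3, 4))  # 0.0 0.1478
```

The printed values match the corresponding rows of Table 7. The nested sum has (m_1 + 1) ... (m_K + 1) terms, so this direct enumeration is intended for systems with a modest number of components per type.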

Next, we apply the results presented above to the discrete-time reliability of a system consisting of a single type of component (see Example 4) and of a system consisting of multiple types of components (see Example 5).

Example 4.

The system depicted in Figure 1 is used in this example and was also used by Coolen and Coolen-Maturi [28]. We examine the reliability of a discrete-time system with

m = 5

exchangeable components, whose survival signature values are as follows:

Φ (0) = 0

,

Φ (1) = 0

,

Φ (2) = 0.6

,

Φ (3) = 0.9

,

Φ (4) = 1

, and

Φ (5) = 1

. We will analyse two datasets of different sizes, one with

n = 10

observations and the other with

n = 20

observations. These datasets include failure events and right-censored observations for discrete times

t_{1}

to

t_{5}

. Table 5 presents NPI lower and upper probabilities for

T_{S} > t_{j}

at these discrete times, based on the survival signature values as provided and the results in Section 6.2.

Upon comparing the results in Table 5, it is evident that the imprecision (the difference between the upper and lower probabilities) is, for both sample sizes, smallest at time

t_{1}

and increases as we progress to later times, owing to fewer observations in the risk set. Additionally, the differences between the lower and upper probabilities for

T_{S} > t_{j}

with

n = 20

observations are generally smaller compared to those with

n = 10

observations. So, the imprecision for

T_{S} > t_{j}

decreases as the dataset size increases, i.e., as we have more data available.

Example 5.

In this example, we are examining a system with

K = 2

types of components, types 1 and 2, depicted in Figure 2. Coolen et al. [24] utilised this system to demonstrate NPI for the system survival time. The survival signature for this system can be found in Table 6. We are focusing on the data provided in Table 7 for the two types with

m_{1} = m_{2} = 3

components, and each type has 10 observations, i.e.,

n_{1} = n_{2} = 10

, including failure events and right-censored observations, for discrete times

t_{1}

,

t_{2}

, and

t_{3}

. Table 7 also includes the NPI lower and upper probabilities for

T_{S} > t_{j}

at discrete times

t_{1}

,

t_{2}

, and

t_{3}

, based on the given survival signature values and the results in Section 6.2.

When considering the NPI approach for real-valued data, it is typical for the lower probability value for

X_{n + 1} > t

in a specific interval to be less than or equal to the upper probability value for

X_{n + 1} > t

in the next interval. This is evident in the results of [29]. However, the NPI for the discrete-time approach indicates that this may not always be the case, as observed in the results of Table 7 where

\underset{̲}{P} (T_{S} > t_{1}) > \bar{P} (T_{S} > t_{2})

, and in the results of Table 5 for the dataset with 20 observations, where

\underset{̲}{P} (T_{S} > t_{3}) > \bar{P} (T_{S} > t_{4})

. Many of the findings presented in this paper suggest that, for discrete-time cases, this discrepancy may arise due to multiple failures occurring between discrete-time points.

7. Concluding Remarks

This paper introduced an alternative predictive approach to the actuarial estimator in the context of discrete-time data. The proposed NPI method provides lower and upper probabilities for the event that all future observations survive at a discrete-time point. The method, based on NPI for Bernoulli data, was further developed to derive the NPI lower and upper probabilities for the event that a specific number of future Bernoulli trials survive out of multiple future trials considered. Additionally, this development has been applied to system reliability with single and multiple types of components at discrete-time points, in conjunction with the survival signature. The methods presented in this paper can be applied to various problems in system reliability; in particular, their use to support decisions in practical scenarios leads to interesting topics for future research.

Author Contributions

Methodology, F.P.A.C., T.C.-M. and A.M.Y.M.; Formal analysis, F.P.A.C., T.C.-M. and A.M.Y.M.; Investigation, F.P.A.C., T.C.-M. and A.M.Y.M.; Writing—original draft, F.P.A.C., T.C.-M. and A.M.Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Singer, J.D.; Willett, J.B. It’s about time: Using discrete-time survival analysis to study duration and the timing of events. J. Educ. Stat. 1993, 18, 155–195.
2. Allison, P.D. Discrete-time methods for the analysis of event histories. Sociol. Methodol. 1982, 13, 61–98.
3. Masyn, K.E. Discrete-Time Survival Mixture Analysis for Single and Recurrent Events Using Latent Variables. Ph.D. Thesis, University of California, Los Angeles, CA, USA, 2003. Available online: https://www.statmodel.com/download/masyndissertation.pdf (accessed on 29 July 2024).
4. Hill, B.M. Posterior distribution of percentiles: Bayes’ theorem for sampling from a population. J. Am. Stat. Assoc. 1968, 63, 677–691.
5. Augustin, T.; Coolen, F.P.A. Nonparametric predictive inference and interval probability. J. Stat. Plan. Inference 2004, 124, 251–272.
6. Coolen, F.P.A. On nonparametric predictive inference and objective Bayesianism. J. Logic Lang. Inf. 2006, 15, 21–47.
7. Coolen, F.P.A. Low structure imprecise predictive inference for Bayes’ problem. Stat. Probab. Lett. 1998, 36, 349–357.
8. Coolen, F.P.A.; Coolen-Schrijner, P. Nonparametric predictive subset selection for proportions. Stat. Probab. Lett. 2006, 76, 1675–1684.
9. Coolen, F.P.A.; Yan, K.J. Nonparametric predictive inference with right-censored data. J. Stat. Plan. Inference 2004, 126, 25–54.
10. Coolen, F.P.A.; Yan, K.J. Nonparametric Predictive Comparison of Two Groups of Lifetime Data. In Proceedings of the 3rd International Symposium on Imprecise Probabilities and Their Applications, Carlton Scientific, Lugano, Switzerland, 14–17 July 2003; pp. 148–161.
11. Coolen-Maturi, T.A.; Coolen, F.P.A.; Muhammad, N. Predictive inference for bivariate data: Combining nonparametric predictive inference for marginals with an estimated copula. J. Stat. Theory Pract. 2016, 10, 515–538.
12. Baker, R. Multinomial Nonparametric Predictive Inference: Selection, Classification and Subcategory Data. Ph.D. Thesis, University of Durham, Durham, UK, 2010. Available online: https://maths.durham.ac.uk/stats/people/fc/thesis-RB.pdf (accessed on 29 July 2024).
13. Coolen, F.P.A.; Augustin, T. Learning from multinomial data: A nonparametric predictive alternative to the Imprecise Dirichlet Model. In Proceedings of the ISIPTA 4th International Symposium on Imprecise Probabilities and Their Applications, Pittsburgh, PA, USA, 20–23 July 2005; pp. 125–134.
14. Janurová, K.; Briš, R. A nonparametric approach to medical survival data: Uncertainty in the context of risk in mortality analysis. Reliab. Eng. Syst. Saf. 2014, 125, 145–152.
15. Yan, K.J. Nonparametric Predictive Inference with Right-Censored Data. Ph.D. Thesis, Durham University, Durham, UK, 2002. Available online: https://maths.durham.ac.uk/stats/people/fc/thesis-KJY.pdf (accessed on 29 July 2024).
16. Coolen, F.P.A.; Coolen-Maturi, T.; Al-Nefaiee, A.H. Nonparametric predictive inference for system reliability using the survival signature. Proc. Inst. Mech. Eng. Part O J. Risk Reliab. 2014, 228, 437–448.
17. Al-Nefaiee, A.H. Nonparametric Predictive Inference for System Failure Time. Ph.D. Thesis, University of Durham, Durham, UK, 2014. Available online: https://maths.durham.ac.uk/stats/people/fc/thesis-AAN.pdf (accessed on 29 July 2024).
18. Hill, B.M. Bayesian nonparametric prediction and statistical inference. In Bayesian Analysis in Statistics and Econometrics; Springer: New York, NY, USA, 1992; pp. 43–94.
19. Aboalkhair, A.M. Nonparametric Predictive Inference for System Reliability. Ph.D. Thesis, University of Durham, Durham, UK, 2012. Available online: https://maths.dur.ac.uk/stats/people/fc/thesis-AA.pdf (accessed on 29 July 2024).
20. Hill, B.M. De Finetti’s Theorem, Induction, and Bayesian nonparametric predictive inference (with discussion). In Bayesian Analysis in Statistics and Econometrics; Springer: Berlin/Heidelberg, Germany, 1988; pp. 211–241.
21. Hill, B.M. Parametric models for A(n): Splitting processes and mixtures. J. R. Stat. Soc. Ser. B 1993, 55, 423–433.
22. Berkson, J.; Gage, R.P. Calculation of survival rates for cancer. Mayo Clin. 1950, 25, 270–286.
23. Lawless, J.F. Statistical Models and Methods for Lifetime Data; Wiley: New York, NY, USA, 1982.
24. Coolen, F.P.A.; Coolen-Maturi, T. Generalizing the signature to systems with multiple types of components. In Complex Systems and Dependability; Springer: Berlin/Heidelberg, Germany, 2013; pp. 115–130.
25. Samaniego, F.J. System Signatures and Their Applications in Engineering Reliability; Springer: Berlin/Heidelberg, Germany, 2007.
26. Samaniego, F.J. On closure of the IFR class under formation of coherent systems. IEEE Trans. Reliab. 1985, 34, 69–72.
27. Navarro, J.; Samaniego, F.J.; Balakrishnan, N.; Bhattacharya, D. On the application and extension of system signatures in engineering reliability. Nav. Res. Logist. 2008, 55, 313–327.
28. Coolen, F.P.A.; Coolen-Maturi, T. Predictive inference for system reliability after common-cause component failures. Reliab. Eng. Syst. Saf. 2015, 135, 27–33.
29. Coolen-Maturi, T.; Mahnashi, A.M.; Coolen, F. Nonparametric Predictive Inference for Two Future Observations with Right-Censored Data. Math. Methods Stat. 2024, in press.

Figure 1. System with a single type of

m = 5

components for Example 4.

Figure 2. System with 2 types of components for Example 5.

Table 1. The actuarial estimator for the survival function and the NPI lower and upper probabilities for the event

⋂_{i = 1}^{m} {X_{9 + i} > t_{j}}

,

m \in {1, 3, 10, 15}

, Example 1.

								$m = 1$		$m = 3$		$m = 10$		$m = 15$
$t_{j}$	$d_{t_{j}}$	$c_{t_{j}}$	${\hat{n}}_{t_{j}}$	$n_{t_{j}}$	${\overset{`}{n}}_{t_{j}}$	$1 - {\hat{h}}_{t_{j}}$	${\hat{S}}_{t_{j}}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$
$t_{1}$	1	0	9	8	9	0.889	0.889	0.8000	0.9000	0.6667	0.9167	0.4211	0.9474	0.3333	0.9583
$t_{2}$	2	1	7	5	8	0.750	0.667	0.5000	0.6750	0.3333	0.7333	0.1238	0.8359	0.0758	0.8712
$t_{3}$	2	1	4	2	5	0.600	0.400	0.2000	0.4050	0.0952	0.5238	0.0177	0.7165	0.0080	0.7795
$t_{4}$	0	1	1	1	2	1.000	0.400	0.1000	0.4050	0.0238	0.5238	0.0016	0.7165	0.0005	0.7795

Table 2. NPI lower and upper probabilities for

⋂_{i = 1}^{m} {X_{374 + i} > t_{j}}

,

m \in {1, 2, 5, 10}

and

{[\underset{̲}{P}, \bar{P}]}^{5} (X_{375} > t_{j})

(Example 2).

					$m = 1$		$m = 2$		$m = 5$		$m = 10$		${[\underset{̲}{P}, \bar{P}]}^{5} (X_{375} > t_{j})$
$t_{j}$	$d_{t_{j}}$	$c_{t_{j}}$	${\hat{n}}_{t_{j}}$	$n_{t_{j}}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$	${[\underset{̲}{P}]}^{5}$	${[\bar{P}]}^{5}$
$t_{1}$	90	0	374	284	0.757	0.760	0.574	0.578	0.2513	0.2557	0.0644540	0.0667235	0.2486	0.2536
$t_{2}$	76	0	284	208	0.553	0.557	0.306	0.311	0.0527	0.0549	0.0029253	0.0031739	0.0517	0.0536
$t_{3}$	51	0	208	157	0.415	0.421	0.173	0.178	0.0128	0.0138	0.0001793	0.0002070	0.0123	0.0132
$t_{4}$	25	12	145	120	0.341	0.349	0.117	0.123	0.0049	0.0055	0.0000269	0.0000336	0.0046	0.0051
$t_{5}$	20	5	115	95	0.280	0.289	0.079	0.084	0.0018	0.0022	0.0000040	0.0000055	0.0017	0.0020
$t_{6}$	7	9	86	79	0.254	0.266	0.065	0.071	0.0011	0.0014	0.0000016	0.0000025	0.0011	0.0013
$t_{7}$	4	9	70	66	0.236	0.251	0.056	0.063	0.0008	0.0011	0.0000008	0.0000014	0.0007	0.0010
$t_{8}$	1	3	63	62	0.229	0.247	0.053	0.061	0.0007	0.0010	0.0000006	0.0000012	0.0006	0.0009
$t_{9}$	3	5	57	54	0.213	0.234	0.046	0.055	0.0005	0.0008	0.0000003	0.0000008	0.0004	0.0007
$t_{10}$	2	5	49	47	0.200	0.225	0.040	0.051	0.0004	0.0006	0.0000002	0.0000005	0.0003	0.0006

Table 3. NPI lower and upper probabilities for the event

N_{t_{j}} \geq x | N_{t_{j - 1}} = y

with

x \leq y

.

							$x = 1$		$x = 2$		$x = 3$
$t_{j}$	$d_{t_{j}}$	$c_{t_{j}}$	$n_{t_{j}}$	${\hat{n}}_{t_{j}}$	${\hat{n}}_{t_{j}} - d_{t_{j}}$	$y$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$
$t_{1}$	1	0	8	9	8	1	0.8000	0.9000
						2	0.9455	0.9818	0.6545	0.8182
						3	0.9818	0.9955	0.8727	0.9545	0.5455	0.7500
$t_{2}$	2	1	5	7	5	1	0.6250	0.7500
						2	0.8333	0.9167	0.4167	0.5833
						3	0.9167	0.9667	0.6667	0.8167	0.2917	0.4667
$t_{3}$	2	1	2	4	2	1	0.4000	0.6000
						2	0.6000	0.8000	0.2000	0.4000
						3	0.7143	0.8857	0.3714	0.6286	0.1143	0.2857
$t_{4}$	1	1	0	1	0	1	0	0.5000
						2	0	0.6667	0	0.3333
						3	0	0.7500	0	0.5000	0	0.2500

Table 4. NPI lower and upper probabilities for the event

N_{t_{j}} \geq x

.

						$x = 1$		$x = 2$		$x = 3$
$t_{j}$	$d_{t_{j}}$	$c_{t_{j}}$	$n_{t_{j}}$	${\hat{n}}_{t_{j}}$	${\hat{n}}_{t_{j}} - d_{t_{j}}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$	$\underset{̲}{P}$	$\bar{P}$
$t_{1}$	1	0	8	9	8	0.9625	0.9955	0.7883	0.8832	0.4091	0.7500
$t_{2}$	2	1	5	7	5	0.8409	0.9432	0.5000	0.7318	0.1591	0.3500
$t_{3}$	2	1	2	4	2	0.5334	0.7834	0.1833	0.4334	0.0333	0.1333
$t_{4}$	1	1	0	1	0	0	0.5714	0	0.2571	0	0.0714

Table 5. NPI lower and upper probabilities for

T_{S} > t_{j}

, for the system in Figure 1, with

n = 10

and

n = 20

, Example 4.

	$n = 10$							$n = 20$
$t_{j}$	$d_{t_{j}}$	$c_{t_{j}}$	${\hat{n}}_{t_{j}}$	${\hat{n}}_{t_{j}} - d_{t_{j}}$	$\underset{̲}{P}$	$\bar{P}$	$\bar{P} - \underset{̲}{P}$	$d_{t_{j}}$	$c_{t_{j}}$	${\hat{n}}_{t_{j}}$	${\hat{n}}_{t_{j}} - d_{t_{j}}$	$\underset{̲}{P}$	$\bar{P}$	$\bar{P} - \underset{̲}{P}$
$t_{1}$	2	0	10	8	0.8811	0.9426	0.0615	4	0	20	16	0.9177	0.9465	0.0288
$t_{2}$	2	1	7	5	0.7765	0.8909	0.1144	4	2	14	10	0.8344	0.8921	0.0577
$t_{3}$	2	0	5	3	0.6190	0.8095	0.1905	3	1	9	6	0.7552	0.8559	0.1007
$t_{4}$	1	1	2	1	0.3857	0.7810	0.3953	2	2	4	2	0.4810	0.7333	0.2523
$t_{5}$	1	0	1	0	0	0.5833	0.5833	2	0	2	0	0	0.3857	0.3857

Table 6. Survival signature of the system in Figure 2 (Example 5).

$(ℓ_{1}, ℓ_{2})$	$Φ (ℓ_{1}, ℓ_{2})$	$(ℓ_{1}, ℓ_{2})$	$Φ (ℓ_{1}, ℓ_{2})$
$(0, 0)$	0	$(2, 0)$	0
$(0, 1)$	0	$(2, 1)$	0
$(0, 2)$	0	$(2, 2)$	4/9
$(0, 3)$	0	$(2, 3)$	6/9
$(1, 0)$	0	$(3, 0)$	1
$(1, 1)$	0	$(3, 1)$	1
$(1, 2)$	1/9	$(3, 2)$	1
$(1, 3)$	3/9	$(3, 3)$	1

Table 7. NPI lower and upper probabilities for

T_{S} > t_{j}

, for the system in Figure 2 with two types of components and

m_{1} = m_{2} = 3

, Example 5.

$t_{j}$	$d_{t_{j}}^{1}$	$c_{t_{j}}^{1}$	${\hat{n}}_{t_{j}}^{1}$	${\hat{n}}_{t_{j}}^{1} - d_{t_{j}}^{1}$	$d_{t_{j}}^{2}$	$c_{t_{j}}^{2}$	${\hat{n}}_{t_{j}}^{2}$	${\hat{n}}_{t_{j}}^{2} - d_{t_{j}}^{2}$	$\underset{̲}{P} (T_{S} > t_{j})$	$\bar{P} (T_{S} > t_{j})$
$t_{1}$	2	1	9	7	3	0	10	7	0.5500	0.7118
$t_{2}$	3	2	5	2	3	1	6	3	0.1412	0.3189
$t_{3}$	2	0	2	0	2	1	2	0	0	0.1478

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
