1. Introduction
Censoring has attracted considerable attention in reliability studies and life testing. Several kinds of censoring schemes have been introduced in the literature, among which type-I and type-II censoring are probably the two best known; see, for example, [1,2]. A life test under type-I censoring ends at a prespecified time, so the number of observed failures is random. By contrast, a life test under type-II censoring does not end until a prefixed number of failures has occurred.
However, type-I and type-II censoring have some constraints; for example, units cannot be withdrawn while the experiment is in progress. A more flexible and more general scheme, named the progressive type-II censoring scheme, was put forward to overcome this limitation: it allows the removal of units during the test. The scheme works as follows. Suppose $n$ units are placed on a life test. When the first failure occurs, $R_1$ of the $n-1$ surviving units are withdrawn from the experiment at random. Likewise, after the second failure, $R_2$ of the $n-R_1-2$ remaining surviving units are randomly removed. The experiment continues until the $m$th failure occurs, at which point the remaining $R_m=n-m-R_1-\cdots-R_{m-1}$ units are withdrawn from the test. The vector $R=(R_1,\ldots,R_m)$ is named the progressive type-II censoring scheme and must be fixed before the experiment. Many statistical inferences and studies involving progressive censoring have been carried out by several researchers; see, for example, [3,4,5,6].
An extension of the exponential distribution was suggested by [7] by introducing a shape parameter. Its probability density function (PDF) is defined by
$$f(x;\alpha,\lambda)=\alpha\lambda(1+\lambda x)^{\alpha-1}e^{\,1-(1+\lambda x)^{\alpha}},\qquad x>0, \qquad (1)$$
and its cumulative distribution function (CDF) is expressed as
$$F(x;\alpha,\lambda)=1-e^{\,1-(1+\lambda x)^{\alpha}},\qquad x>0, \qquad (2)$$
where $\alpha>0$ and $\lambda>0$ are the shape and scale parameters, respectively. We denote this distribution by NH($\alpha,\lambda$). In some other literature, the NH distribution is also named the extended exponential distribution; see, for example, [8,9].
Ref. [7] found that the NH distribution can provide suitable fits even to data sets containing the value zero. In addition, the behavior of the hazard rate function
$$h(x)=\frac{f(x)}{1-F(x)}=\alpha\lambda(1+\lambda x)^{\alpha-1}$$
depends on $\alpha$: it is increasing for $\alpha>1$ and decreasing for $\alpha<1$. When $\alpha=1$, the hazard rate becomes constant and (1) reduces to the one-parameter exponential distribution.
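As a quick illustration of these formulas, the following minimal Python sketch evaluates the PDF (1), the CDF (2), and the hazard rate under the parameterization above; the function names and the example values are ours, not part of the original analysis.

```python
import numpy as np

def nh_pdf(x, alpha, lam):
    """NH density, Equation (1)."""
    t = 1.0 + lam * np.asarray(x)
    return alpha * lam * t ** (alpha - 1.0) * np.exp(1.0 - t ** alpha)

def nh_cdf(x, alpha, lam):
    """NH distribution function, Equation (2)."""
    return 1.0 - np.exp(1.0 - (1.0 + lam * np.asarray(x)) ** alpha)

def nh_hazard(x, alpha, lam):
    """Hazard rate h(x) = f(x)/(1 - F(x)) = alpha*lam*(1 + lam*x)**(alpha - 1)."""
    return alpha * lam * (1.0 + lam * np.asarray(x)) ** (alpha - 1.0)

# alpha > 1: increasing hazard; alpha < 1: decreasing; alpha = 1: exponential(lam)
print(nh_hazard(np.linspace(0.1, 2.0, 5), alpha=0.5, lam=1.0))
```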
Some literature is available on statistical inference for the NH distribution. Concerning estimation of the unknown parameters, Ref. [10] discussed maximum likelihood and Bayes estimation based on progressive type-II censored samples with binomial removals. Ref. [11] then studied progressive type-II censored data: they computed the maximum likelihood estimates (MLEs) by the Newton-Raphson method and the Bayes estimates by the Markov Chain Monte Carlo method, and they also constructed interval estimates such as asymptotic confidence intervals (CIs) and highest posterior density (HPD) intervals. Several references have also emphasized order statistics of the NH distribution, such as [8,9]; both derived recurrence relations for the moments of order statistics, the former under progressive type-II right censoring and the latter for complete samples. Additionally, for the unknown parameters, Ref. [12] discussed MLEs and Bayesian estimates obtained by the Lindley approximation, together with non-Bayesian and Bayesian point and interval predictions for future records. Based on progressively first-failure censored NH data, Ref. [13] studied the MLEs and Bayes estimates of the two unknown parameters and of lifetime quantities such as the survival and hazard rate functions, and also suggested an optimal censoring scheme under different optimality criteria.
In this article, using progressive type-II censored samples, we discuss estimation and prediction problems for the NH distribution by several methods, and we compare their performance through a simulation study and a real rainfall data set. In Section 2, we use the Expectation-Maximization (EM) algorithm to compute the MLEs of the unknown parameters; based on the Fisher information matrix, we construct asymptotic intervals, and bootstrap intervals are introduced as well. In Section 3, Bayesian estimators of the unknown parameters are computed under two types of loss functions through both the Metropolis-Hastings (MH) algorithm and the Tierney and Kadane (TK) method; HPD credible intervals are then obtained from the sample generated by the MH algorithm. In Section 4, Bayesian predictive intervals and point predictions of a future sample are discussed. Finally, we compare and appraise the performance of the proposed techniques by carrying out a simulation study and analyzing a real data set in Section 5.
2. Maximum Likelihood Estimation
We assume that $n$ independent units following the NH distribution defined in (1) are placed on a life test. Under a progressive type-II censoring scheme $R=(R_1,\ldots,R_m)$, let $X=(X_{1:m:n},\ldots,X_{m:m:n})$ denote the censored sample of size $m$ from the NH distribution. In light of [3], the likelihood function of $\alpha$ and $\lambda$ is written as
$$L(\alpha,\lambda)=C\prod_{i=1}^{m}f(x_i)\bigl[1-F(x_i)\bigr]^{R_i}, \qquad (3)$$
where $C=n(n-R_1-1)\cdots(n-R_1-\cdots-R_{m-1}-m+1)$, and $x_i$ stands for the observed value of $X_{i:m:n}$. Ignoring the constant, the log-likelihood function is
$$\ell(\alpha,\lambda)=m\ln\alpha+m\ln\lambda+(\alpha-1)\sum_{i=1}^{m}\ln(1+\lambda x_i)+\sum_{i=1}^{m}(1+R_i)\bigl[1-(1+\lambda x_i)^{\alpha}\bigr]. \qquad (4)$$
As usual, to acquire the MLEs of $\alpha$ and $\lambda$, we solve the corresponding likelihood equations
$$\frac{\partial\ell}{\partial\alpha}=\frac{m}{\alpha}+\sum_{i=1}^{m}\ln(1+\lambda x_i)-\sum_{i=1}^{m}(1+R_i)(1+\lambda x_i)^{\alpha}\ln(1+\lambda x_i)=0, \qquad (5)$$
$$\frac{\partial\ell}{\partial\lambda}=\frac{m}{\lambda}+(\alpha-1)\sum_{i=1}^{m}\frac{x_i}{1+\lambda x_i}-\alpha\sum_{i=1}^{m}(1+R_i)x_i(1+\lambda x_i)^{\alpha-1}=0. \qquad (6)$$
However, these equations cannot be solved analytically in closed form, so numerical techniques such as the EM algorithm are suggested for calculating the desired MLEs. The EM algorithm was first introduced by [14] as a solution to missing or incomplete data problems.
Progressive type-II censored data are not complete, so they can be treated as a set of incomplete data. Let $Z$ denote the censored data and $X$ the observed data. It is worth noting that for $i=1,\ldots,m$, each $Z_i$ is a $1\times R_i$ vector $(Z_{i1},\ldots,Z_{iR_i})$. Consequently, the observed data $X$ and the censored data $Z$ together form the complete data set $V=(X,Z)$. Disregarding the additive constant, the corresponding log-likelihood function $\ell_c(V;\alpha,\lambda)$ for $V$ is
$$\ell_c=n\ln\alpha+n\ln\lambda+(\alpha-1)\sum_{i=1}^{m}\Bigl[\ln(1+\lambda x_i)+\sum_{k=1}^{R_i}\ln(1+\lambda z_{ik})\Bigr]+\sum_{i=1}^{m}\Bigl\{\bigl[1-(1+\lambda x_i)^{\alpha}\bigr]+\sum_{k=1}^{R_i}\bigl[1-(1+\lambda z_{ik})^{\alpha}\bigr]\Bigr\}, \qquad (7)$$
where $z_{ik}$ denotes a censored value of $Z_{ik}$ for $i=1,\ldots,m$ and $k=1,\ldots,R_i$.
The EM algorithm is composed of two parts named the E-step and the M-step. The E-step replaces the functions of the unobserved $z_{ik}$ in (7) by their conditional expectations given $Z_{ik}>x_i$, which yields the pseudo-log-likelihood function
$$\ell_s(\alpha,\lambda)=n\ln\alpha+n\ln\lambda+(\alpha-1)\sum_{i=1}^{m}\bigl[\ln(1+\lambda x_i)+R_iE_{1i}\bigr]+\sum_{i=1}^{m}\bigl[1-(1+\lambda x_i)^{\alpha}\bigr]+\sum_{i=1}^{m}R_i\bigl(1-E_{2i}\bigr), \qquad (8)$$
where
$$E_{1i}=E\bigl[\ln(1+\lambda Z)\mid Z>x_i\bigr],\qquad E_{2i}=E\bigl[(1+\lambda Z)^{\alpha}\mid Z>x_i\bigr],$$
the expectations being taken with respect to the NH distribution left-truncated at $x_i$ and evaluated at the current parameter estimates. As for the M-step, (8) is maximized over $\alpha$ and $\lambda$. We assume that the estimate of $(\alpha,\lambda)$ in the $j$th iteration is $(\alpha^{(j)},\lambda^{(j)})$. Thus, $(\alpha^{(j+1)},\lambda^{(j+1)})$ is obtained by maximizing the function
$$\ell_s\bigl(\alpha,\lambda\mid\alpha^{(j)},\lambda^{(j)}\bigr).$$
As the values of $\alpha^{(j)}$ and $\lambda^{(j)}$ are known, at first $\alpha^{(j+1)}$ is calculated by solving the equation
$$\frac{\partial\ell_s}{\partial\alpha}=0. \qquad (9)$$
Once $\alpha^{(j+1)}$ is obtained, we can also solve $\lambda^{(j+1)}$ from the equation
$$\frac{\partial\ell_s}{\partial\lambda}=0. \qquad (10)$$
Then, in the next iteration, $(\alpha^{(j+1)},\lambda^{(j+1)})$ serves as the updated value of $(\alpha^{(j)},\lambda^{(j)})$. The two steps are repeated until $\alpha^{(j)}$ and $\lambda^{(j)}$ converge to the MLEs. According to [15], the log-likelihood function is guaranteed to increase with each iteration. As a result, the EM algorithm almost always drives the likelihood function to a local maximum, regardless of the starting value in the parameter domain. When using the EM algorithm for the MLEs under progressive type-II censoring in Section 5, we take the MLEs derived from the complete samples as initial values.
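The exact E- and M-step expressions involve truncated NH expectations, but the fixed point the iteration converges to is simply the maximizer of the observed-data log-likelihood (4). The following minimal Python sketch therefore targets the same MLEs by maximizing (4) directly with a derivative-free optimizer; the helper names and the toy data are hypothetical.

```python
import numpy as np
from scipy.optimize import minimize

def nh_loglik(params, x, R):
    """Observed-data log-likelihood (4) under progressive type-II censoring."""
    alpha, lam = params
    if alpha <= 0 or lam <= 0:
        return -np.inf                      # outside the parameter domain
    t = 1.0 + lam * np.asarray(x)
    return (len(x) * (np.log(alpha) + np.log(lam))
            + (alpha - 1.0) * np.log(t).sum()
            + ((1.0 + np.asarray(R)) * (1.0 - t ** alpha)).sum())

def nh_mle(x, R, start=(1.0, 1.0)):
    """Numerical MLE of (alpha, lambda); the EM iteration converges to the same point."""
    res = minimize(lambda p: -nh_loglik(p, x, R), start, method="Nelder-Mead")
    return res.x

# Hypothetical censored sample and removal scheme, for illustration only
x = [0.12, 0.35, 0.47, 0.81, 1.26]
R = [2, 0, 1, 0, 2]
alpha_hat, lam_hat = nh_mle(x, R)
```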
2.1. Fisher Information Matrix
Let $\theta=(\alpha,\lambda)$ denote the unknown parameter vector. We assume that $V$ represents the complete data and $X$ represents the observed data. Then, let us denote the observed, missing, and complete information by $I_X(\theta)$, $I_Z(\theta)$, and $I_V(\theta)$, respectively. Based on the missing information principle of [16], we can obtain
$$I_X(\theta)=I_V(\theta)-I_Z(\theta), \qquad (11)$$
where $I_V(\theta)$ is obtained from the expectations of the negative second-order partial derivatives of (7) with respect to $\alpha$ and $\lambda$, and $I_Z^{(i)}(\theta)$ stands for the Fisher information matrix of a single observation censored at the $i$th failure, that is, of an observation from the NH distribution left-truncated at $x_i$. We note that these are two $2\times2$ matrices. The total missing information matrix is then
$$I_Z(\theta)=\sum_{i=1}^{m}R_i\,I_Z^{(i)}(\theta). \qquad (12)$$
Based on the two matrices above, we easily compute the observed information matrix from (11), and its inverse matrix as well. The MLE $\hat{\theta}=(\hat{\alpha},\hat{\lambda})$ is asymptotically normal, so we can write $\hat{\theta}\sim N\bigl(\theta,I_X^{-1}(\hat{\theta})\bigr)$. As a consequence, the $100(1-\gamma)\%$ asymptotic CIs are derived as
$$\Bigl(\hat{\alpha}\pm z_{\gamma/2}\sqrt{\hat{v}_{11}}\Bigr)\quad\text{and}\quad\Bigl(\hat{\lambda}\pm z_{\gamma/2}\sqrt{\hat{v}_{22}}\Bigr), \qquad (13)$$
where $\hat{v}_{11}$ and $\hat{v}_{22}$ are the two main-diagonal elements of $I_X^{-1}(\hat{\theta})$, and $z_{\gamma/2}$ is the upper $\gamma/2$ percentile of the standard normal distribution.
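As a sketch of how (13) can be computed in practice, the code below approximates the observed information by the negative Hessian of the log-likelihood (4) at the MLE, a finite-difference stand-in for the $I_V(\theta)-I_Z(\theta)$ decomposition (both estimate the same asymptotic covariance), reusing `nh_loglik` and `nh_mle` from the earlier sketch.

```python
import numpy as np
from scipy.stats import norm

def observed_info(x, R, theta_hat, h=1e-5):
    """Observed information: negative Hessian of log-likelihood (4) at the MLE."""
    nll = lambda p: -nh_loglik(p, x, R)
    p = np.asarray(theta_hat, float)
    H = np.empty((2, 2))
    for i in range(2):
        for j in range(2):
            ei, ej = np.eye(2)[i] * h, np.eye(2)[j] * h
            H[i, j] = (nll(p + ei + ej) - nll(p + ei - ej)
                       - nll(p - ei + ej) + nll(p - ei - ej)) / (4.0 * h * h)
    return H

def asymptotic_ci(x, R, theta_hat, gamma=0.05):
    """100(1-gamma)% CIs (13) from the inverse observed information."""
    V = np.linalg.inv(observed_info(x, R, theta_hat))
    z = norm.ppf(1.0 - gamma / 2.0)
    se = np.sqrt(np.diag(V))
    return [(theta_hat[k] - z * se[k], theta_hat[k] + z * se[k]) for k in range(2)]
```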
2.2. Bootstrap Confidence Intervals
As is known to us, the asymptotic CIs perform poorly in a sample with a small size. The bootstrap methods are more likely to provide more approximate confidence intervals. As a result, we introduce two CIs using the bootstrap methods. One is the percentile bootstrap (Boot-p) method which was put forward by [
17]. According to [
18], another is called the bootstrap-t (Boot-t) method. The Boot-p method and the Boot-t method are presented in Algorithms 1 and 2, respectively.
Algorithm 1 Boot-p method
Require: the number of bootstrap resamples $N$; the censoring scheme $R$; the confidence level $1-\gamma$; the progressive type-II censored sample $X$.
Ensure: the Boot-p CI for $\theta$, where $\theta$ can be $\alpha$ and $\lambda$.
1: Compute the MLE $\hat{\theta}$ from $X$;
2: for $i=1$ to $N$ do
3: Derive a bootstrap sample from the NH($\hat{\alpha},\hat{\lambda}$) distribution under $R$;
4: Calculate the MLE $\hat{\theta}^{(i)}$ based on the bootstrap sample;
5: end for
6: Sort $\hat{\theta}^{(1)},\ldots,\hat{\theta}^{(N)}$ ascendingly as $\hat{\theta}^{[1]}\le\cdots\le\hat{\theta}^{[N]}$;
7: Compute $L=\hat{\theta}^{[\lfloor N\gamma/2\rfloor]}$ and $U=\hat{\theta}^{[\lfloor N(1-\gamma/2)\rfloor]}$, where $\lfloor t\rfloor$ is the largest integer that does not exceed $t$;
8: return $(L,U)$.
Algorithm 2 Boot-t method
Require: the number of bootstrap resamples $N$; the censoring scheme $R$; the confidence level $1-\gamma$; the progressive type-II censored sample $X$.
Ensure: the Boot-t CI for $\theta$, where $\theta$ can be $\alpha$ and $\lambda$.
1: Compute the MLE $\hat{\theta}$ from $X$ and its estimated variance $\widehat{\mathrm{Var}}(\hat{\theta})$;
2: for $i=1$ to $N$ do
3: Derive a bootstrap sample from the NH($\hat{\alpha},\hat{\lambda}$) distribution under $R$;
4: Calculate the MLE $\hat{\theta}^{(i)}$ based on the bootstrap sample;
5: Compute the statistic $T^{(i)}=\bigl(\hat{\theta}^{(i)}-\hat{\theta}\bigr)\big/\sqrt{\widehat{\mathrm{Var}}\bigl(\hat{\theta}^{(i)}\bigr)}$;
6: end for
7: Sort $T^{(1)},\ldots,T^{(N)}$ ascendingly as $T^{[1]}\le\cdots\le T^{[N]}$;
8: Pick $T^{[\lfloor N\gamma/2\rfloor]}$ and $T^{[\lfloor N(1-\gamma/2)\rfloor]}$;
9: Compute $L=\hat{\theta}-T^{[\lfloor N(1-\gamma/2)\rfloor]}\sqrt{\widehat{\mathrm{Var}}(\hat{\theta})}$ and $U=\hat{\theta}-T^{[\lfloor N\gamma/2\rfloor]}\sqrt{\widehat{\mathrm{Var}}(\hat{\theta})}$;
10: return $(L,U)$.
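A possible implementation of Algorithm 1 is sketched below. Progressive type-II censored data are simulated directly from the definition of the scheme (generate $n=m+\sum R_i$ NH lifetimes through the inverse of (2), record each failure, and withdraw survivors at random); `nh_mle` is the MLE routine sketched in Section 2. All names and defaults are ours.

```python
import numpy as np

rng = np.random.default_rng(7)

def nh_quantile(u, alpha, lam):
    """Inverse of the NH CDF (2)."""
    return ((1.0 - np.log(1.0 - u)) ** (1.0 / alpha) - 1.0) / lam

def progressive_sample(alpha, lam, R):
    """Simulate a progressive type-II censored sample under scheme R."""
    n = len(R) + int(sum(R))
    pool = sorted(nh_quantile(rng.uniform(size=n), alpha, lam))
    obs = []
    for r in R:
        obs.append(pool.pop(0))                    # next observed failure
        for _ in range(r):                         # withdraw r survivors at random
            pool.pop(int(rng.integers(len(pool))))
    return np.array(obs)

def boot_p_ci(x, R, N=1000, gamma=0.05):
    """Percentile bootstrap CIs (Algorithm 1) for alpha and lambda."""
    a_hat, l_hat = nh_mle(x, R)
    boots = np.array([nh_mle(progressive_sample(a_hat, l_hat, R), R)
                      for _ in range(N)])
    boots.sort(axis=0)                             # column-wise ascending sort
    lo = int(np.floor(N * gamma / 2.0))            # order-statistic ranks of step 7
    hi = int(np.floor(N * (1.0 - gamma / 2.0)))
    return boots[lo - 1], boots[hi - 1]            # 0-based indices into sorted draws
```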
3. Bayesian Estimation
In order to choose the best Bayesian estimator, a loss function must be specified and used to represent the penalty associated with each possible estimator. Researchers usually use the symmetric squared error loss function. There is no doubt that it is reasonable to use the squared error loss function when the loss is essentially symmetric. However, for some estimation problems, the actual loss is often asymmetric. Therefore, we also need to choose an asymmetric loss function.
We consider Bayesian estimators under symmetric and asymmetric loss functions. We suppose that $\hat{\theta}$ is a Bayesian estimate of $\theta$. Firstly, the symmetric squared error loss (SEL) function is written as
$$L_S\bigl(\hat{\theta},\theta\bigr)=\bigl(\hat{\theta}-\theta\bigr)^2. \qquad (14)$$
Hence, under $L_S$, the Bayesian estimate of $\theta$ is the posterior mean,
$$\hat{\theta}_S=E(\theta\mid x). \qquad (15)$$
In light of [19], the balanced squared error loss (BSEL) function has the following form:
$$L_B\bigl(\hat{\theta},\theta\bigr)=\omega\bigl(\hat{\theta}-\theta_0\bigr)^2+(1-\omega)\bigl(\hat{\theta}-\theta\bigr)^2, \qquad (16)$$
where $0\le\omega<1$, and $\theta_0$ is a known estimate of $\theta$, such as the MLE. It is worth noting that when $\omega=0$, the BSEL function turns into the SEL function. Based on the BSEL function $L_B$, the Bayes estimator is calculated by
$$\hat{\theta}_B=\omega\theta_0+(1-\omega)E(\theta\mid x). \qquad (17)$$
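Since (17) is just a weighted average, a tiny sketch suffices; the function name and the choice of $\theta_0$ as the MLE are ours.

```python
def bsel_estimate(post_mean, theta0, w):
    """Bayes estimate (17) under the BSEL loss (16): a weighted average of a
    target estimate theta0 (e.g., the MLE) and the posterior mean."""
    return w * theta0 + (1.0 - w) * post_mean

# With w = 0 this reduces to the posterior mean, i.e., the SEL estimate (15).
```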
Let $X=(x_1,\ldots,x_m)$ denote a set of progressive type-II censored samples from the NH distribution. Suppose that the two independent parameters $\alpha$ and $\lambda$ follow gamma prior distributions,
$$\alpha\sim\mathrm{Gamma}(a,b),\qquad \lambda\sim\mathrm{Gamma}(c,d). \qquad (18)$$
Here, the prior information is incorporated by selecting values for the four hyperparameters $a$, $b$, $c$, and $d$. Hence, the joint prior density is expressed as
$$\pi(\alpha,\lambda)\propto\alpha^{a-1}\lambda^{c-1}e^{-b\alpha-d\lambda},\qquad \alpha>0,\ \lambda>0. \qquad (19)$$
Consequently, the joint posterior density is then obtained as
$$\pi(\alpha,\lambda\mid x)=k^{-1}\,\pi(\alpha,\lambda)\,L(\alpha,\lambda), \qquad (20)$$
where $k=\int_0^\infty\!\!\int_0^\infty\pi(\alpha,\lambda)L(\alpha,\lambda)\,d\alpha\,d\lambda$ denotes the normalizing constant.
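For later use with both approximation methods, here is a minimal sketch of the log of the unnormalized posterior (20), i.e., the log-likelihood (4) plus the log of the gamma priors (19); it reuses `nh_loglik` from Section 2, and the hyperparameter values are left to the user.

```python
import numpy as np

def log_posterior(params, x, R, a, b, c, d):
    """ln pi(alpha, lambda | x) up to the normalizing constant k in (20)."""
    alpha, lam = params
    if alpha <= 0 or lam <= 0:
        return -np.inf
    return (nh_loglik(params, x, R)
            + (a - 1.0) * np.log(alpha) - b * alpha   # Gamma(a, b) prior on alpha
            + (c - 1.0) * np.log(lam) - d * lam)      # Gamma(c, d) prior on lambda
```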
As we can see, the joint posterior distribution is analytically intractable, and the Bayesian estimators of functions of $\alpha$ and $\lambda$ involve ratios of two integrals. Therefore, to calculate Bayesian estimators under these two kinds of loss functions, two approximation methods are proposed in the next two subsections to cope with the corresponding ratios of integrals.
3.1. Tierney and Kadane Method
Based on the idea of [20], we take advantage of the TK method to calculate approximate Bayes estimates. The posterior expectation of a function $g(\alpha,\lambda)$ can be expressed as
$$E\bigl[g(\alpha,\lambda)\mid x\bigr]=\frac{\int_0^\infty\!\!\int_0^\infty e^{\,n\delta^{*}(\alpha,\lambda)}\,d\alpha\,d\lambda}{\int_0^\infty\!\!\int_0^\infty e^{\,n\delta(\alpha,\lambda)}\,d\alpha\,d\lambda}, \qquad (21)$$
where $\rho(\alpha,\lambda)=\ln\pi(\alpha,\lambda)$, $\ell(\alpha,\lambda)$ represents the log-likelihood function (4), and $\delta$ and $\delta^{*}$ are defined as
$$\delta(\alpha,\lambda)=\frac{1}{n}\bigl[\ell(\alpha,\lambda)+\rho(\alpha,\lambda)\bigr],\qquad \delta^{*}(\alpha,\lambda)=\delta(\alpha,\lambda)+\frac{1}{n}\ln g(\alpha,\lambda). \qquad (22)$$
Suppose that $(\hat{\alpha}_\delta,\hat{\lambda}_\delta)$ and $(\hat{\alpha}_{\delta^*},\hat{\lambda}_{\delta^*})$ respectively maximize the functions $\delta$ and $\delta^{*}$. Through the TK method, (21) is approximated as
$$\hat{E}\bigl[g(\alpha,\lambda)\mid x\bigr]=\sqrt{\frac{|\Sigma^{*}|}{|\Sigma|}}\exp\Bigl\{n\bigl[\delta^{*}\bigl(\hat{\alpha}_{\delta^*},\hat{\lambda}_{\delta^*}\bigr)-\delta\bigl(\hat{\alpha}_\delta,\hat{\lambda}_\delta\bigr)\bigr]\Bigr\}. \qquad (23)$$
Here, $\Sigma$ and $\Sigma^{*}$ are the negative inverse Hessians of $\delta$ and $\delta^{*}$ at their respective maxima. Note that $(\hat{\alpha}_\delta,\hat{\lambda}_\delta)$ and $|\Sigma|$ in (23) do not depend on the function $g$, while the other two quantities do. We observe that
$$\delta(\alpha,\lambda)=\frac{1}{n}\Bigl[(m+a-1)\ln\alpha+(m+c-1)\ln\lambda-b\alpha-d\lambda+(\alpha-1)\sum_{i=1}^{m}\ln(1+\lambda x_i)+\sum_{i=1}^{m}(1+R_i)\bigl[1-(1+\lambda x_i)^{\alpha}\bigr]\Bigr].$$
Then $(\hat{\alpha}_\delta,\hat{\lambda}_\delta)$ is solved from the system of equations
$$\frac{\partial\delta}{\partial\alpha}=\frac{1}{n}\Bigl[\frac{m+a-1}{\alpha}-b+\sum_{i=1}^{m}\ln(1+\lambda x_i)-\sum_{i=1}^{m}(1+R_i)(1+\lambda x_i)^{\alpha}\ln(1+\lambda x_i)\Bigr]=0,$$
$$\frac{\partial\delta}{\partial\lambda}=\frac{1}{n}\Bigl[\frac{m+c-1}{\lambda}-d+(\alpha-1)\sum_{i=1}^{m}\frac{x_i}{1+\lambda x_i}-\alpha\sum_{i=1}^{m}(1+R_i)x_i(1+\lambda x_i)^{\alpha-1}\Bigr]=0.$$
The relevant second partial derivatives, needed for $\Sigma$, are
$$\frac{\partial^2\delta}{\partial\alpha^2}=\frac{1}{n}\Bigl[-\frac{m+a-1}{\alpha^2}-\sum_{i=1}^{m}(1+R_i)(1+\lambda x_i)^{\alpha}\bigl[\ln(1+\lambda x_i)\bigr]^2\Bigr],$$
$$\frac{\partial^2\delta}{\partial\lambda^2}=\frac{1}{n}\Bigl[-\frac{m+c-1}{\lambda^2}-(\alpha-1)\sum_{i=1}^{m}\frac{x_i^2}{(1+\lambda x_i)^2}-\alpha(\alpha-1)\sum_{i=1}^{m}(1+R_i)x_i^2(1+\lambda x_i)^{\alpha-2}\Bigr],$$
$$\frac{\partial^2\delta}{\partial\alpha\,\partial\lambda}=\frac{1}{n}\Bigl[\sum_{i=1}^{m}\frac{x_i}{1+\lambda x_i}-\sum_{i=1}^{m}(1+R_i)x_i(1+\lambda x_i)^{\alpha-1}\bigl[1+\alpha\ln(1+\lambda x_i)\bigr]\Bigr].$$
As for $\delta^{*}$, we first obtain $(\hat{\alpha}_{\delta^*},\hat{\lambda}_{\delta^*})$ from the analogous system in which the partial derivatives of $\frac{1}{n}\ln g(\alpha,\lambda)$ are added, and the corresponding second-order derivatives follow in the same way. Because $(\hat{\alpha}_{\delta^*},\hat{\lambda}_{\delta^*})$ and $|\Sigma^{*}|$ in (23) depend on the function $g$, we respectively take $g(\alpha,\lambda)=\alpha$ and $g(\alpha,\lambda)=\lambda$ when proceeding with the above computation steps. Therefore, the Bayes estimates through the TK method can be calculated as
$$\hat{\alpha}_{TK}=\sqrt{\frac{|\Sigma^{*}_\alpha|}{|\Sigma|}}\exp\Bigl\{n\bigl[\delta^{*}_\alpha\bigl(\hat{\alpha}_{\delta^*},\hat{\lambda}_{\delta^*}\bigr)-\delta\bigl(\hat{\alpha}_\delta,\hat{\lambda}_\delta\bigr)\bigr]\Bigr\},\qquad \hat{\lambda}_{TK}=\sqrt{\frac{|\Sigma^{*}_\lambda|}{|\Sigma|}}\exp\Bigl\{n\bigl[\delta^{*}_\lambda\bigl(\hat{\alpha}_{\delta^*},\hat{\lambda}_{\delta^*}\bigr)-\delta\bigl(\hat{\alpha}_\delta,\hat{\lambda}_\delta\bigr)\bigr]\Bigr\}.$$
In the same way, the corresponding Bayesian estimates are also calculated based on the BSEL function. Here, we do not go into details for the sake of brevity.
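A compact numerical rendering of (23) is sketched below. Since the factor $n$ cancels between $|\Sigma^{*}|/|\Sigma|$ and the exponent when $\delta$ and $\delta^{*}$ are both scaled by it, the code works directly with $h=\ell+\rho$ (the `log_posterior` sketch above); the Hessians are taken by central differences, and all function names are ours.

```python
import numpy as np
from scipy.optimize import minimize

def _hessian(f, p, h=1e-4):
    """Central-difference Hessian of a scalar function f at the point p."""
    H = np.empty((2, 2))
    for i in range(2):
        for j in range(2):
            ei, ej = np.eye(2)[i] * h, np.eye(2)[j] * h
            H[i, j] = (f(p + ei + ej) - f(p + ei - ej)
                       - f(p - ei + ej) + f(p - ei - ej)) / (4.0 * h * h)
    return H

def tk_posterior_mean(g_log, log_post, start):
    """Tierney-Kadane approximation (23) of E[g | x]; g_log(p) = ln g(p)."""
    f1 = lambda p: -log_post(p)
    f2 = lambda p: -(log_post(p) + g_log(p))
    p1 = minimize(f1, start, method="Nelder-Mead").x   # maximizer of delta
    p2 = minimize(f2, start, method="Nelder-Mead").x   # maximizer of delta*
    S1 = np.linalg.inv(_hessian(f1, p1))               # Sigma
    S2 = np.linalg.inv(_hessian(f2, p2))               # Sigma*
    return (np.sqrt(np.linalg.det(S2) / np.linalg.det(S1))
            * np.exp(-f2(p2) + f1(p1)))

# e.g. TK estimate of alpha under SEL:
# tk_posterior_mean(lambda p: np.log(p[0]),
#                   lambda p: log_posterior(p, x, R, a, b, c, d), [1.0, 1.0])
```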
3.2. Metropolis-Hastings Algorithm
As we can see from the previous subsection, the TK method cannot provide interval estimates for the unknown parameters. Thus, we introduce another method, the MH algorithm, to obtain approximate Bayesian point and interval estimates. The MH algorithm is a very general Markov Chain Monte Carlo technique, first suggested by [21] and later extended based on the idea of [22]. From their studies, we know that the MH algorithm can derive random samples from a known target distribution regardless of how complex it is.
A bivariate normal distribution is treated as the proposal distribution for $\alpha$ and $\lambda$. Samples of the unknown parameters drawn from the posterior density are then used to calculate Bayesian point estimates and HPD interval estimates. We present the main steps of the MH algorithm in Algorithm 3.
To ensure convergence and to reduce the effect of the initial values $(\alpha^{(0)},\lambda^{(0)})$, we discard the first $N_0$ draws as burn-in and compute estimates from the remaining $N-N_0$ samples. Therefore, under the SEL function, the Bayesian estimates are calculated as
$$\hat{\theta}_{MH}=\frac{1}{N-N_0}\sum_{i=N_0+1}^{N}\theta^{(i)}, \qquad (24)$$
where $\theta^{(i)}$ denotes the $i$th draw of $\theta$ from Algorithm 3, and $\theta$ can be $\alpha$ and $\lambda$.
Likewise, we also calculate the corresponding Bayes estimates based on the BSEL function. In addition, the $100(1-\gamma)\%$ HPD credible intervals are constructed according to [23]. We sort the retained samples in ascending order as $\theta^{[1]}\le\theta^{[2]}\le\cdots\le\theta^{[N-N_0]}$. Then the $100(1-\gamma)\%$ Bayes credible intervals of $\theta$ are
$$\Bigl(\theta^{[j]},\ \theta^{[\,j+\lfloor(1-\gamma)(N-N_0)\rfloor\,]}\Bigr),\qquad j=1,\ldots,(N-N_0)-\bigl\lfloor(1-\gamma)(N-N_0)\bigr\rfloor. \qquad (25)$$
The shortest interval among (25) is taken as the HPD interval of $\theta$.
Algorithm 3 MH algorithm
Require: the joint posterior distribution $\pi(\alpha,\lambda\mid x)$; the initial values $(\alpha^{(0)},\lambda^{(0)})$; the number of simulation times $N$; the variance-covariance matrix $\Sigma_p$.
Ensure: the $N$ samples $(\alpha^{(i)},\lambda^{(i)})$, $i=1,\ldots,N$;
1: for $i=1$ to $N$ do
2: Generate $(\alpha^{*},\lambda^{*})$ from the bivariate normal $N_2\bigl((\alpha^{(i-1)},\lambda^{(i-1)}),\Sigma_p\bigr)$;
3: Calculate $r=\min\bigl\{1,\ \pi(\alpha^{*},\lambda^{*}\mid x)\big/\pi(\alpha^{(i-1)},\lambda^{(i-1)}\mid x)\bigr\}$;
4: Derive a figure $u$ that follows a Uniform$(0,1)$ distribution;
5: if $u\le r$ then
6: $(\alpha^{(i)},\lambda^{(i)})\leftarrow(\alpha^{*},\lambda^{*})$;
7: else
8: $(\alpha^{(i)},\lambda^{(i)})\leftarrow(\alpha^{(i-1)},\lambda^{(i-1)})$;
9: end if
10: end for
11: return $(\alpha^{(i)},\lambda^{(i)})$, $i=1,\ldots,N$.
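The following sketch implements Algorithm 3 together with the burn-in averaging in (24) and the shortest-interval search in (25); it reuses `log_posterior` and works on the log scale for numerical stability. The proposal covariance, burn-in size, and seed are illustrative choices of ours.

```python
import numpy as np

def mh_sampler(log_post, start, prop_cov, N=20000, seed=1):
    """Algorithm 3: random-walk MH with a bivariate normal proposal."""
    rng = np.random.default_rng(seed)
    chain = np.empty((N, 2))
    cur = np.asarray(start, float)
    cur_lp = log_post(cur)
    for i in range(N):
        prop = rng.multivariate_normal(cur, prop_cov)
        lp = log_post(prop)
        if np.log(rng.uniform()) <= lp - cur_lp:   # accept with prob min(1, ratio)
            cur, cur_lp = prop, lp
        chain[i] = cur
    return chain

def hpd_interval(draws, gamma=0.05):
    """Shortest interval among the candidates in (25)."""
    s = np.sort(draws)
    k = int(np.floor((1.0 - gamma) * len(s)))
    widths = s[k:] - s[: len(s) - k]
    j = int(np.argmin(widths))
    return s[j], s[j + k]

# chain = mh_sampler(lambda p: log_posterior(p, x, R, a, b, c, d),
#                    [1.0, 1.0], 0.01 * np.eye(2))
# burn = chain[5000:]                       # discard N0 = 5000 draws
# alpha_sel = burn[:, 0].mean()             # Bayes estimate (24)
# alpha_hpd = hpd_interval(burn[:, 0])      # 95% HPD interval
```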
5. Data Analysis and Simulation Study
To demonstrate practical applications of the proposed estimation and prediction techniques, in the next two subsections we compare and appraise their performance by analyzing a set of real data and by carrying out a simulation study, respectively.
5.1. Data Analysis
We consider a set of real data on the total annual rainfall (unit: inches) during January from 1880 to 1916, previously studied by [12]. The data are listed in Table 1.
To analyze the real data, we first calculate the MLEs through the EM algorithm and then compute the Akaike information criterion (AIC), the Bayesian information criterion (BIC), and the Kolmogorov–Smirnov (K–S) statistic for goodness-of-fit testing. For comparison, some other life distributions, namely the Generalized Exponential, Chen, and Inverse Weibull distributions, are subjected to the same goodness-of-fit tests. Their PDFs are given below ($\alpha,\lambda>0$, $x>0$), respectively.
- (1) The PDF of the Generalized Exponential distribution is
$$f(x)=\alpha\lambda\bigl(1-e^{-\lambda x}\bigr)^{\alpha-1}e^{-\lambda x}.$$
- (2) The PDF of the Chen distribution is
$$f(x)=\alpha\lambda x^{\lambda-1}e^{x^{\lambda}}\exp\bigl\{\alpha\bigl(1-e^{x^{\lambda}}\bigr)\bigr\}.$$
- (3) The PDF of the Inverse Weibull distribution is
$$f(x)=\alpha\lambda x^{-(\alpha+1)}e^{-\lambda x^{-\alpha}}.$$
Table 2 presents the test results for these distributions. Smaller K–S, AIC, and BIC values, together with a larger log-likelihood value, indicate a better-performing model. The table therefore shows that the NH distribution provides the most suitable fit to this real data set. For further illustration, two plots based on the MLEs are provided in Figure 1: the first contains the fitted CDFs of the Nadarajah-Haghighi, Generalized Exponential, Chen, and Inverse Weibull distributions for the real data set, and the second shows the histogram of the real data together with the fitted PDFs of these four distributions.
Next, we obtain two sets of censored data under two different progressive type-II censoring schemes, $R_1$ and $R_2$, where the short notation $(a^{*k})$ denotes that the value $a$ is repeated $k$ times in the scheme. The two corresponding sets of observed data are presented in Table 3 and Table 4.
Then, we derive statistical inferences for the two data sets. For the unknown parameters, Table 5 reports the TK and MH estimates under different loss functions together with their MLEs. Table 6 tabulates interval estimates of $\alpha$ and $\lambda$, including the 95% asymptotic CIs, 95% bootstrap CIs, and 95% HPD intervals. Additionally, in Table 7, for the one-sample situation, we tabulate the point predictions and the 95% predictive intervals for the $k$th experimental unit censored at the $i$th failure. Furthermore, Table 8 includes the point predictions and the 95% predictive intervals for the first five future observations of size 25 and 30 under the two-sample framework, respectively.
5.2. Simulation Study
We implement a simulation study to compare the numerical results and evaluate the proposed methods. Using different censoring schemes, we first generate progressive type-II censored samples from an NH($\alpha,\lambda$) distribution, where $\alpha$ and $\lambda$ are the chosen true values. Next, the EM algorithm is applied to compute the MLEs. Afterwards, values are assigned to the hyperparameters $a$, $b$, $c$, and $d$, and the Bayesian estimators under the two types of loss functions are calculated by the TK and MH methods. We take the mean-squared error (MSE) into account for comparing the Bayes estimates from the different methods. The MSEs of $\hat{\alpha}$ and $\hat{\lambda}$ have the following forms
$$\mathrm{MSE}(\hat{\alpha})=\frac{1}{Q}\sum_{q=1}^{Q}\bigl(\hat{\alpha}_q-\alpha\bigr)^2,\qquad \mathrm{MSE}(\hat{\lambda})=\frac{1}{Q}\sum_{q=1}^{Q}\bigl(\hat{\lambda}_q-\lambda\bigr)^2,$$
where $Q$ is the number of simulation replications, and $\hat{\alpha}_q$ and $\hat{\lambda}_q$ denote the estimates of $\alpha$ and $\lambda$ from a given method at the $q$th replication.
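The MSE computation above can be organized as in the following sketch, which reuses `progressive_sample` and `nh_mle` from the earlier sections; the true values, scheme, and $Q$ shown here are illustrative, not the settings of the actual study.

```python
import numpy as np

alpha0, lam0, Q = 0.8, 1.2, 1000   # hypothetical true values and replication count
R = [2, 0, 1, 0, 2]                # hypothetical censoring scheme

est = np.empty((Q, 2))
for q in range(Q):
    x_sim = progressive_sample(alpha0, lam0, R)   # simulate one censored sample
    est[q] = nh_mle(x_sim, R)                     # estimate (alpha, lambda)
mse_alpha, mse_lam = ((est - [alpha0, lam0]) ** 2).mean(axis=0)
print(f"MSE(alpha) = {mse_alpha:.4f}, MSE(lambda) = {mse_lam:.4f}")
```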
Different values of $\omega$ in the BSEL function are considered as a measure of its influence on the Bayes estimates. In addition, all the $(n,m)$ combinations and censoring schemes are shown in the tables, where $m$ denotes the number of observed units and $n$ the number of units in the complete sample.
In Table 9, we tabulate the average estimates and the MSE values of $\alpha$ and $\lambda$ under diverse censoring schemes. For each $(n,m)$ with each censoring scheme, the first and second rows show the average estimates of $\alpha$ and their MSEs, and the third and fourth rows show the average estimates of $\lambda$ and their MSEs. This table demonstrates that the estimates computed through the MH algorithm perform better than those obtained by the TK method. Additionally, the Bayes estimates almost always outperform the corresponding MLEs.
In Table 10, under prespecified sample sizes and censoring schemes, we construct the various interval estimates of the unknown parameters along with their coverage probabilities (CPs) and average lengths (ALs). For each $(n,m)$ with each censoring scheme, the first and second rows respectively show the results for $\alpha$ and $\lambda$. For further illustration, we also provide four plots in Figures 2 and 3, where the numbers 1 to 12 on the x-axis denote the schemes from top to bottom in the table. Figure 2 shows the ALs of the different intervals of $\alpha$ and $\lambda$ under the different schemes, and Figure 3 shows the corresponding CPs.
As we can see, the ALs of the HPD intervals are the shortest, followed by the bootstrap intervals, while the asymptotic intervals are the longest. As for the CPs, the asymptotic intervals attain the highest coverage, followed by the HPD intervals, whereas the CPs of the bootstrap intervals often lie below the nominal level. Therefore, considering both AL and CP, the asymptotic and bootstrap intervals are inferior to the HPD intervals.
In Table 11, we compute the point predictions and 95% predictive intervals for different censored observations using the MH algorithm. The predictive intervals show that when more than one unit is censored at a time, the AL of the prediction interval for a unit censored at an early stage is shorter than that for a unit censored at a late stage. However, when just one unit is censored at each of many stages, the later the stage, the smaller the AL of the interval prediction. The ALs of the prediction intervals also tend to shorten as the number of observed samples grows.
In Table 12, under the two-sample prediction framework, we tabulate the point predictions and 95% predictive intervals of the first, third, and fifth future observations of total size 5. It is worth noting that the lengths of these interval predictions tend to become longer as $j$ increases. As a consequence, only small values of $j$ are taken into consideration when the real data are analyzed.
6. Conclusions
This article provides classical and Bayesian estimation for the Nadarajah-Haghighi distribution using progressive type-II censored samples. For the unknown parameters, we compute the maximum likelihood estimates through an Expectation-Maximization algorithm. Under two different loss functions, the balanced squared error and squared error loss functions, we calculate approximate Bayes estimates using both the Metropolis-Hastings algorithm and the Tierney and Kadane method. In addition, we construct confidence intervals, including asymptotic and bootstrap intervals, and, using the sample derived from the Metropolis-Hastings algorithm, we also obtain the highest posterior density intervals. Furthermore, point and interval prediction of a future sample is considered in both the one-sample and two-sample situations. In the end, we compare and appraise the performance of the proposed techniques through a simulation study, and one set of real rainfall data is analyzed for further explanation.
Overall, the numerical studies show that the proposed methods work well. For instance, compared with the maximum likelihood estimates and the Bayes estimates obtained by the Tierney and Kadane method, the Bayes estimates obtained by the Metropolis-Hastings algorithm are closer to the true values of the unknown parameters, and their MSEs are small. In addition, Table 7 shows that the true values of the censored samples all fall within the predictive intervals.
Additionally, the methods provided in this article can be extended to other life distributions.