1. Introduction
The reliability evaluation method of the product is essentially a mathematical statistical method of specific censored data under specific distribution occasions. Normal distribution, exponential distribution and Weibull distribution are often used to describe the life distribution of products. Among them, Weibull distribution is the most flexible one. Weibull distribution was first proposed by Swedish scientist Weibull. The Weibull distribution model can be regarded as a generalization of the exponential distribution model. The introduction of shape parameters makes it more accurate than the exponential distribution model in describing the products whose failure rate increases or decreases with time. In addition, by adjusting the shape parameters, Weibull distribution is also capable of describing the normal distribution approximately. Many studies show that the life distribution of most mechanical products approximately follows Weibull distribution. Wang [
1] proposed a new inference for constant-stress accelerated life tests with Weibull distribution. Wang [
2], Joarder [
3] and Kundu [
4] proposed MLE methods for Weibull parameters. Denecke [
5], Tan [
6] and Krishnamoorthy [
7] studied the method of solving lower confidence limit for Weibull distribution.
Reliability life test is an important way to collect life data. If a test stops before all samples fail, we call it censoring test, and the corresponding test data is truncated sample. The censoring test can be classified into four types, namely conventional Type-II censoring, progressive Type-II censoring [
8], conventional Type-I censoring and progressive Type-I censoring [
9].
With the rapid development of science and technology, products with long lifetime and high reliability are emerging. As a result, in many life tests only the heavily censored data are available for reliability estimation. Under some special conditions, including products with high reliability and very few test samples, no test data or only very few failure data can be obtained, which challenges the traditional reliability evaluation methods [
10,
11].
Jiang [
12] studied the problem of reliability estimation with zero failure data, in which the Bayesian prior distribution model was constructed by using the “kernel” idea. Jiang [
13] developed a hybrid censoring index to quantitatively describe censoring characteristics of a data set, proposed a novel parameter estimation method based on information extracted from censored observations, and evaluated the accuracy and robustness of the proposed method through a numerical experiment. Han [
14,
15,
16] reviewed the reliability assessment methods under zero-failure data conditions, and summarized the corresponding advantages, disadvantages and application ranges. Li [
17,
18] proposed a reliability assessment method with revised confidence limits. It can effectively avoid the aggressive phenomenon of reliability estimation results. The hierarchical Bayesian method generalizes the current Bayesian method to provide the ability to perform parameter point estimation and confidence interval estimation at the same time, which effectively improves the credibility of the results [
19]. Based on the modification of the hierarchical Bayesian estimation method, the E-Bayesian estimation method was proposed. Compared with the hierarchical Bayesian method, the E-Bayesian estimation method simplifies the mathematical model and improves the calculation efficiency [
20,
21,
22]. Jia [
23] proposed an improved method based on the Bayesian inference and least-squares method, and the four existing methods were com-pared with this method in terms of applicability, precision, efficiency, robustness, and simplicity.
The current E-Bayesian estimation methods mainly focus on point estimation of parameters, which do not solve the problem of confidence interval estimation. If the calculation method of confidence interval estimation is different from point estimation, the credibility of the results will reduce [
24,
25]. In addition, most E-Bayesian estimation methods assume that the range of failure probability is (0, 1), it does not consider the relationship between the failure probability of each truncation time, which causes the range of failure probability values to be too large, and affects the accuracy. Furthermore, in the process of solving the distribution parameters by using the distribution curve, the weight of the weighted least square method used does not consider the role of the failure sample, which makes the obtained reliability higher. In order to overcome the above problems, we propose a new reliability assessment method based on the E-Bayesian. Firstly, based on the E-Bayesian estimation and the distribution curve method, we develop a confidence interval method. Secondly, we use the unevenness of the distribution function and the function characteristics to determine the value range of the failure probability. Thirdly, improve the original weighted least square method with a new weight function, which consider the number of failed samples.
2. Heavily Censored Data Model
Due to time and cost constraints, for many products their life tests are often terminated before all units fail, and thus produce the so-called censored data. In the test, due to the high reliability of the product and the large number of units, the test observation time is relatively short. Therefore, in the collected data, the ratio of faults to the entire unit is usually less than 0.5 or very small. This collected data is called heavily censored data [
13]. A model for heavily censored data is established, and it is the foundation for subsequent product reliability index evaluation.
The Weibull distribution is a common random distribution and widely used in modeling lifetime distribution for machinery and electronic products [
26,
27]. If we set the shape parameter of Weibull distribution to be 1, it reduces to an exponential distribution, which is only suitable to model electronic lifetime with the property of memoryless. Under the condition of heavily censored data, it is assumed that the product life
follows a distribution function
, where there are
(
) samples in the timing truncation test, a total of
time timing truncation tests are performed, and the end time of each timing truncation test is used
to indicate, there are
samples that terminate the test at
, and
.
Let be the number of failed samples during , and be the corresponding failure time. Here, and . Thus indicates the total number of failed samples in the test. The samples are numbered from 1 to at each end of the timing truncation test. After all timing truncation tests are completed, the life number data are collected corresponding to the sample number . Apparently, reduces to the special case of zero-failure life test.
In addition, , where consist of two parts, the first part is the number of samples still participating in the test at , the second part is failure samples at the internal . Setting as the failure probability at each censoring moment. It is obvious that at time t = 0, .
Here, the distribution curve method is used to estimate the distribution parameters. The estimator of the failure probability at each truncation moment is the precondition, and it is the core link used by the distribution curve method.
4. E-Bayesian Point Estimation of Reliability
In order to obtain distribution parameter estimates, points
first need to be obtained, and then a distribution curve based on the points needs to be assigned. As the exact value of the failure probability
cannot be obtained during the calculation, the estimation value
is used to describe the failure probability
. After all the
are obtained, the distribution curve method is used to form a distribution curve based on these points
. The current commonly used method for fitting a distribution curve is the weighted least square method. Reference [
19] using the weighted least squares method to fit the distribution curve, the weight is
. This method only considers the influence of the number of test samples and test time on the estimated value of the distribution parameter, and ignores the role of failed samples in the value of the test, so the method can only be used to estimate the distribution parameter of the sample without failure data. For test samples containing failure data, the method has obvious shortcomings. In order to consider the number of test samples, test time, and number of failed samples in the distribution curve, the original weighted least square method is improved. An improved least square method is proposed, the improved weight is
, and the method is used to fit the distribution curve.
Let
,
,
, and then Equation (11) is transformed into
With the improved weighted least squares method, the point estimates of distribution parameters
and
are obtained by fitting
, and then the point estimate of reliability is obtained. By minimizing the square of fitting errors, we have
where
,
.
For the derivative calculation of Equation (21), let
,
,
, and
, giving
Taking the estimated value of the distribution parameters obtained from Equation (22) into Equation (19), the point estimation
under the Weibull distribution can be obtained
5. E-Bayesian Confidence Interval Estimation
Currently, all applications of the E-Bayesian estimation method focus on the point estimation of parameters, and no further research about parameter confidence interval calculations on this basis. For the estimation of parameter confidence intervals, the other methods should be used, such as the optimal confidence limit method [
32] and bootstrap method [
33]. Because the methods used for the parameter confidence interval estimation and the point estimation are different, the result will have “disjointed” problem. In fact, on the basis of the E-Bayesian estimation method and the distribution curve method, the confidence interval of the parameters can be obtained.
Under the confidence level value
, the first step is to estimate upper confidence value
of the failure probability
. Based on the poster distribution density function of
in Equation (6) and the E-Bayesian estimate of the failure probability
in Equation (10),
is defined as
Similar to the estimation of upper confidence limit for the hierarchical Bayesian estimation method, Based on the definition of the upper confidence limit, we have
Since and , Equation (25) is solved by the binary search method, and then the upper confidence limit of failure probability is obtained. Based on the linearization process to Equation (18), and are obtained.
According to the improved weighted least square method, the failure probability upper confidence limit curve can be obtained by fitting points
. From Equation (21), we have
As
has a limited value range and it has a small effect on the reliability index, so the confidence interval of
is not assumed.
is the point estimation value of
, and then taking
into fitting the curve with
, obtaining the point estimation value
. The next step is to obtain the lower confidence limit
of the estimation value
. Taking
into Equation (26), we have
The lower confidence limit
of
can be obtained when
have the smallest value:
The point estimation value
and lower confidence limit
are taken into the Weibull reliability calculation formula, and then the lower confidence limit of reliability
at confidence level
can be obtained at any time
, it is
Similarly, the upper confidence limit of the failure probability is
The upper confidence limit of the failure rate function is
6. Simulation Verification
In order to verify the accuracy of the proposed method, simulation experiments were used to generate simulation samples. Based on the generated simulation samples, the classic estimation method [
34], the hierarchical Bayesian estimation method [
19], and the E-Bayesian estimation method are used to evaluate the reliability of the sample. The results are compared from multiple aspects such as robustness, average time consumption and distance from the true value. Zhang [
35] describes the process of generating simulation samples:
(1) Setting the true value of the shape parameter and scale parameter in the Weibull distribution, and then the failure time is randomly generated through MATLAB, that is, there are experimental samples from the simulation experiment.
(2) uniformly distributed random numbers , .
(3) The truncation level CL is determined, based on that, () failure times from all are arbitrarily selected, denoted as . The remaining failure time is represented by .
(4) is replaced by , which is used to indicate zero-failure truncation time .
(5) Based on the value of and , the truncation time is grouped and the minimum truncation time is taken as the truncation time of the censored experiment of the group.
For this simulation, 11 samples were selected for 5 censored experiments. In the Weibull distribution, the shape parameter
and scale parameter
. Taking the truncation level CL = 0.1, the simulation samples were generated, they were shown in
Table 1.
From Equation (14), when , the upper limit of needs to be determined, since even in a censored test, the value of will continuously change with time . It will increase the calculation complexity and cumulative error, so generally the value of is set to 1 and the value range of failure probability is .
Based on the data in
Table 1, the classical estimation method [
34], the hierarchical Bayesian estimation method [
19], and the E-Bayesian estimation method proposed in this study were used to solve the failure probability estimates. The calculation results of the failure probability at time
are shown in
Table 2.
Table 2 gives the failure probability estimation results of three different methods. The estimation range under different
values and average running time are also compared in the table. Compared with [
19], although the robustness of the proposed method is slightly lower, and the algorithm efficiency is significantly improved. Furthermore, compared with [
19,
34], our method has higher calculation accuracy due to modify the value range of failure probability.
is used in subsequent calculation, and reliability
calculated by the method proposed in this study is compared with
and
which are calculated by methods proposed in [
19,
34] respectively, the results are shown in
Figure 1.
In the
Figure 1, Method 1 is the hierarchical Bayesian estimation method [
19], Method 2 is the classic estimation method [
34], and Method 3 is the E-Bayesian reliability evaluation method proposed in this study. From
Figure 1, it can be seen that E-Bayesian reliability assessment methods with heavily censored data have the closest estimated results to the true values. The classic estimation method [
34] does not consider the effect of censored time on the failure probability in the timing truncation test, which makes the reliability overestimated. In the process of determining the failure probability value range, the hierarchical Bayesian estimation method [
19] fixed the interval to [0, 1], without considering the sequence relationship between the failure probability values. The determination of the failure probability value range is too conservative. In addition, in the process of solving the distribution parameters using the weighted least squares method, the weight model does not consider the effect of the number of failed samples on the weight value. Due to the above reasons, the obtained reliability estimation value is too large.
In order to solve the lower confidence limit of reliability, the first step is to calculate the upper confidence limit
of failure probability
under the confidence level of 0.9, and then getting the lower confidence limit of the reliability parameter. As reference [
34] does not further analyze the confidence interval of the reliability parameter, the method proposed in this study is only compared with the method in [
19]. The corresponding lower confidence limit (LFM) comparison results are shown in
Figure 2.
In
Figure 2, Method 1 is the hierarchical Bayesian estimation method [
19] and Method 3 is the E-Bayesian reliability evaluation method proposed in this study. From the comparison results, it can be seen that the method proposed in this study is better than the hierarchical Bayesian method proposed in [
19].
8. Conclusions
In this paper, an improved E-Bayesian estimation method based on Weibull distribution is proposed. Compared with the traditional ones, the main contribution is twofold: (1) We extend the application range of E-Bayesian estimation from zero failure data to extreme few failure data; (2) A novel confidence interval estimation approach for E-Bayesian is provided.
Simulation results show that although the proposed method sacrifices few robustness, it outperforms the traditional ones in the following aspects: (1) Compared with the hierarchical Bayesian estimation method, the E-Bayesian estimation method proposed in this study has simpler mathematical models and calculations. The efficiency is significantly improved. (2) The value range of the failure probability is improved by using the unevenness and function characteristics of the distribution function, and the improved weighted least square method is used to solve the distribution parameters. Compared with the hierarchical Bayesian estimation method, the E-Bayesian estimation method proposed in this study has higher calculation accuracy.
Finally, the method proposed in this study is applied to the data analysis of the sun gear timing truncation test of a transmission mechanism and GPS receiver. It further verified that the method proposed in the study is more conducive to practical application in engineering. The results show that the heavily censored data based on E-Bayesian estimation method can solve the reliability evaluation problem under the condition of non-failure data and few failure data. Therefore, the method proposed in this paper has a wider range of engineering applications.
Reliability assessment methods for heavily censored data has not been studied thoroughly and there are still many issues to be resolved. In practice, we can only have very few failure data of products, it is difficult to collect product performance data. In addition, due to the limited test time and samples, it is generally impossible to collect sufficient life data. In the future, we will pay more attention to the combination of life data and other reliability data, as well as the utilization of historical data.