5.1. Novel Reliability-Evaluation Model
In the reliability-evaluation method, firstly, the Weibull distribution parameters for failures at each grade are estimated. In the estimation, the membership degree is introduced to reflect the probability distribution under the corresponding grade. During the data fitting, the value of
that corresponds with
can be expressed by,
Then, parameters
and
of the Weibull distribution for grade-
failures are calculated based on linear fitting, and based on this, the failure distribution function can be defined considering the influence effect of the failure on the system function and performance. This is relevant to the failure grade
and its membership degree
. The distribution function is expressed as follows:
where the term
represents the evaluation standard value of the failure grade.
Finally, the reliability function of grade-
failures is obtained by
. The overall reliability-evaluation model can be expressed by
In summary, the proposed failure-grade fuzzy-evaluation method is used to characterize the multiple states of failures, and the modified sample expansion is the basis of the failure data analysis and the reliability modeling. After the failure grading and sample expansion, the new failure data (including the original sample and the newborn sample) are further handled in the following ways, aiming to ensure the accuracy of the distribution-parameter estimation.
- (1)
Failures with a membership degree of zero at each grade were eliminated. Since the membership degree of the failure at the grade is less than or equal to one and may be zero, the failures with a membership degree of zero at each grade are eliminated for the ease of the reliability modeling.
- (2)
Small sample data cases at some grades after expansion were re-handled, as in Figure 8. In detail, according to the analysis of the distribution of failure-grade index, it approximately follows a positive skewed distribution; therefore, the failure samples at some edge grades may still be small samples even after expansion. In order to avoid the serious deviation phenomenon caused by the small-sample cases at some grades, in this work, the failures at the grade with small sample data (in previous works, when the sample size K was less than 30, it was regarded as the small-sample case) are automatically incorporated into the failure data at the lower or higher grades, starting at the lowest and highest grades, respectively, until the sample size at each grade satisfies the sample-size requirement.
5.2. Example Analysis
The machine tool and the cooling system are further used as the example cases of this analysis, and their original failure data have different features. For example, the original data of the machine tool only have 13 recordings of failures, so they belong to the small sample data. Although the original data of the cooling system have 67 failures, there still exist small-sample cases under some grades after failure grading. Two example cases can better verify the feasibility and sample adaptation of the proposed reliability-evaluation method. Moreover, in each example case, the evaluation results based on the methods that consider and do not consider the failure grading are compared to show the necessity of considering multi-state failure, and the influences of the expansion randomness, the expansion capacity, and grade-index values on the reliability-evaluation result are analyzed to verify the effectiveness of the proposed model.
(1)
Example case 1 (taking the machine tool as the research object). Since the sample size of the machine tool is small, the sample-expansion capacity is taken by 50. The newborn sample is obtained through expanding five times; then, it is used to generate five failure samples with different expansion capacities (
). During the five expansions, the errors for time between failures are 0.0022, 0.0043, 0.0065, 0.0093 and 0.0012, and the correlation coefficients for the failure-grade index are 0.9567, 0.9688, 0.9748, 0.9969 and 0.9529, respectively, which indicates that the newborn data do not deviate from the distribution rule of the original data. Finally, the failure samples including 263 failures are obtained by combining them with the original 13 failures. Based on the proposed methods, the failure samples at all grades can be obtained, and their sizes are 45, 175, 185, 87 and 32, which are all larger than 30.
Figure 9 shows a comparison of reliability curves based on the methods before and after considering failure grading (
): traditional and novel reliability-evaluation. The difference between the two curves is also calculated and plotted. Based on the comparison, it is known that the reliability-evaluation value becomes obviously after considering failure grading, and it has the tendency of increasing gradually and then basically not changing over time. The largest value of the difference is 0.3128. This result indicates that the proposed method can increase the accuracy of the reliability evaluation through considering the impact degree of failures on the system running.
In order to reveal the influence mechanism of the modeling process on the evaluation result, the above simulation is repeated ten times. Then, the evaluation results obtained by ten simulations are compared. The maximum and minimum fluctuation values of the evaluation are used to show the stability of the algorithm, and they are obtained by calculating the differences between values at the same time from ten simulations and taking the smallest and largest value from all values. In this case, they are 0.0801 and 0.0112, respectively. Moreover, the evaluation results based on different expansion capacities (
) are compared. Sample sizes at all grades under different expansion capacities are listed in
Table 6, wherein
are sample sizes at five grades, respectively. The comparison shows that the small-sample case more easily occurs at some failure grades when the expansion capacity is small.
Moreover, the comparison of reliability curves based on different values of expansion capacity (
) is shown in
Figure 10. From this figure, the following conclusions can be obtained, (1) from the reliability comparison based on the failure data with and without incorporating small sample data at some grades (as shown in
Figure 6, and corresponding to the cases
and
), it is found that the reliability value under the case
is obviously smaller than that under the case
, which means that although the small-sample incorporation is necessary due to the existence of the small sample, sometimes the handling way will largely influence the evaluation accuracy of the reliability level; (2) the reliability value of the machine tool decreases over the expansion capacity, which indicates that the capacity has the major influence on the evaluation result, and the more larger the sample size, the closer the evaluation result is to the real reliability level, for example, the results under cases
and
are very close; (3) in summary, although the increase in the expansion capacity will influence the calculation efficiency of the evaluation result, its value should be large enough to make the sample data at each grade is sufficient to ensure that the result does not deviate from the real level in practical applications.
Figure 11 shows the comparison of reliability curves based on different values of failure-grade index (
) when
, that is, the grade indexes of all failures are taken as a certain value. From the result, the following conclusions can be obtained, (1) the evaluation result is obviously affected by the failure-grade index, and it decreases over the index value, which indicates the novel model is effective in reliability evaluation; (2) the larger the failure grade, the higher the decline rate of the reliability over the time, for example, the evaluation value at
decreases by 16.33% compared with that at
, the decline percentages when
are 33.45%, 51.42%, 70.28% and 90.13% respectively.
(2)
Example case 2 (taking the cooling system as the research object). Compared with example case 1, the original sample size is larger; therefore, the sample-expansion capacity is set to 200 and the expansion is performed four times. Each expansion ensures that the error and the coefficient of correlation satisfy the given threshold constraints. Then, firstly, four failure samples with a capacity of 200 are obtained. Secondly, through the generation method shown in
Figure 12, six failure samples with a capacity of 400 are obtained, and the small sample at Grade-1 is caused because the original failures at Grade-1 are fewer. Sample sizes at all grades of different failure samples are listed as
Table 7. From the table, we may observe that the sample sizes at all grades when
all satisfy the large sample requirement, and the failure-number distribution at different grades is generally the same.
Figure 13 shows a comparison of reliability curves based on different failure samples; it shows that the curves are concentrated, and their maximum difference is 0.0787. This comparison result explains the usability of the proposed methods for failure grading and reliability evaluation.
Figure 14 shows a comparison of reliability curves based on different values of the failure-grade index for sample 1. From the comparison, we can draw conclusions as follows: (1) the failure-grade index has a major influence on the evaluation result, and the larger the failure-grade index, the smaller the reliability-evaluation value, which indicates the effectiveness of the proposed method; (2) in all cases, the reliability-evaluation value decreases rapidly with time at the beginning and then flattens out after
; and (3) compared with example case 2, the reliability curve in example case 1 declines more gently, mainly because the mean time between failures (MTBF) of the cooling system is smaller than that of the machine tool, based on the original sample data of two systems listed in
Table 2 and
Table 3. The observed value of the MTBF can be calculated by
. The calculated MTBF values of the machine tool and the cooling system are 619.41 h and 330.63 h, respectively. Therefore, the decline rate of the reliability curve of the cooling system is higher than that of the machine tool, which means the tendency of the reliability curve is consistent with the practical experience. Moreover, when the time is less than
, the larger the failure-grade index, the higher the decline rate of the reliability over time, and when the time is larger than
, the change tendency tends to be flat.