Next Article in Journal
Triplet Contrastive Learning for Aspect Level Sentiment Classification
Next Article in Special Issue
Queueing Inventory System in Transport Problem
Previous Article in Journal
Detection and Mitigation of GNSS Spoofing Attacks in Maritime Environments Using a Genetic Algorithm
Previous Article in Special Issue
Fluctuation Analysis of a Soft-Extreme Shock Reliability Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Control of Degrading Units through Threshold-Based Control Policies

by
Dmitry Efrosinin
1,2,* and
Natalia Stepanova
3
1
Institute of Stochastics, Johannes Kepler University Linz, 4040 Linz, Austria
2
Department of Information Technologies, Faculty of Mathematics and Natural Sciences, Peoples’ Friendship University of Russia (RUDN University), 117198 Moscow, Russia
3
Laboratory N17, Trapeznikov Institute of Control Sciences of RAS, 117997 Moscow, Russia
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(21), 4098; https://doi.org/10.3390/math10214098
Submission received: 29 September 2022 / Revised: 18 October 2022 / Accepted: 28 October 2022 / Published: 3 November 2022
(This article belongs to the Special Issue Stochastic Modeling and Applied Probability, 2nd Edition)

Abstract

:
Optimal control problems are applied to a variety of dynamical systems with a random law of motion. In this paper we show that the random degradation processes defined on a discrete set of intermediate degradation states are also suitable for formulating and solving optimization problems and finding an appropriate optimal control policy. Two degradation models are considered in this paper: with random time to an instantaneous failure and with random time to a preventive maintenance. In both cases, a threshold-based control policy with two thresholds levels defining the signal state, after which an instantaneous failure or preventive maintenance can occur after a random time, and a maximum number of intermediate degradation states is applied. The optimal control problem is mainly solved in a steady-state regime. The main loss functional is formulated as the average cost per unit of time for a given cost structure. The Markov degradation models are used for numerical calculations of the optimal threshold policy and reliability function of the studied degrading units.

1. Introduction

In recent decades, due to the rapid development of sensor technologies, data transmission technologies, and monitoring systems, the tasks of controlling the aging and degradation of technical and biological objects have received additional impetus for research. Some wearing and aging models were in the focus of many investigators in the framework of shock and damage models. An excellent review and contribution of the earlier papers devoted to the topic can be found in the studies by Esary et al. [1], Kopnov and Timashev [2], and the references therein. Murphy and Iskandar [3] have studied the application of a control policy with two parameters to the degradation process associated with shocks. In Singpurwalla [4], the hazard potential notion as a random life resource has been introduced and considered. Then, this notion was investigated in different directions in Singpurwalla [5]. The aging and degradation models suppose the study of systems with gradual failures for which multi-state reliability models were elaborated (for the history and bibliography see, e.g., Lisniansky and Levitin [6]). Rykov and Dimitrov [7] proposed the model for complex hierarchical reliability system. Rykov and Efrosinin [8] studied the controllable degradation unit as the fault tolerance unit. Generalized birth and death processes as degradation models were considered in Rykov [9]. A degrading unit with random life resources, which operates until complete failure occurs, was analyzed by Rykov and Efrosinin [10]. Giorgio et al. [11] proposed a Markov model for the gradual deterioration process, which progressively degrades until a complete failure occurs. The regenerative variant of the splitting method to estimate the failure probability in a controllable degradation process was provided in Borodina et al. [12].
Let the mechanical or biological system contain a degrading unit. Degradation processes under study are assumed to have observable states or there is some observable measure parameter that can be associated with a process, e.g., signal of acoustic emission, ultrasonic method for the detection of hidden defects, measures of the gravimetric analysis, and electromagnetic flaw detection. The degradation process is broken down into discrete stages, the dwell times of which are random. Two types of systems are of special interest. In first case the degrading unit has a random time to an instantaneous failure while in the second case the degrading unit has a random time to a preventive repair. In both cases the unit is of multiple uses, i.e., after a complete failure it can be repaired. After the repair, the degradation process starts again in an absolutely new state. The unit is supplied by a monitoring system and by a controller. The monitoring system gives the information about the current degradation state and, based on this information, the controller makes a decision about a necessity to perform the preventive repair until the last degradation state is achieved. The control problem is studied in a stationary regime. The control principles of such degradation systems may be easily explained by virtue of the following examples. Some of them were introduced in Kopnov [13].
Corrosion process of a unit with protective covering. Let a unit subjected to corrosion have a protective covering, which decreases during operation, and make it possible to trace its thickness, see Figure 1. The problem is to find the optimal value of the initial thickness and the thickness when the covering should be renewed.
Damage process due to the fatigue crack growth. Let a unit fail due to the fatigue crack growth, as illustrated in Figure 2. The unit can be supplied by a fatigue gauge that can be e.g., as a plane notched pattern. This gauge works with the unit and reflects its damage accumulation process. The problem is to find the optimal value of the gauge’s crack when stopping and replacement are implemented. The recovery of of such a unit can be also performed by means of the welding.
Wear of the tool of machine-tools. The wear of the machine tools is correlated with thermo-emf of the pair “cutter-blank”. A monitoring system can be installed, and the optimal value of the measured voltage can be determined to prevent failures and outputs of poor quality.
Wear of a plane bearing. The bearing is a critical unit of a piece of metallurgical equipment, hence the optimal maintenance problem of a plain bearing must be discussed. The restoration cost of the failed system is higher than the inspection and preventive replacement cost. In Ref. [14], the authors investigate the bearing shell wear process as a degradation one.
Discharge of an external load. If the degradation process is associated with external loads, and a unit fails when the load exceeds a failure level, then partial or full discharge of loads is another example of the controllable damage process.
In this paper we consider degradation systems operating under a threshold-based policy with threshold levels ( m , n ) . Here, the first threshold level m specifies a state, where a controller receives a signal that after a random time for one type of degenerating process an instantaneous failure may occur or for another type a preventive repair may take place. The second threshold n stands for the maximal number of gradual failure states before hitting a complete failure state. In corrosion processes the pair ( m , n ) defines the appropriate thickness of the protective covering and the thickness where a signal should be generated. In a damage process the pair defines the maximal size of the crack and the corresponding signal size. The proposed approach for optimization of the degradation process has a number of advantages. The optimal control policy depends on only two parameters, which can be calculated relatively easily. The performance and reliability characteristics of the degraded units are obtained explicitly. Assuming that the input random variables belong to parametric families of distributions, then, in practice, for a degradation process with discrete phases, the unknown parameters of the distributions can be easily estimated from statistical data.
The paper is organized as follows. In Section 2, we describe the mathematical model of the degrading unit with a random time to an instantaneous failure and derive the performance and reliability characteristics. In Section 3, we develop the analysis for the mean losses and reliability function for the model of the degrading unit with random time to a preventive repair. Some illustrative numerical examples are discussed in Section 4.
In order to make it easier to understand the description of the mathematical models and the corresponding results, we have summarized the main notations together with their descriptions in Table 1.

2. Degrading Unit with an Instantaneous Failure

2.1. Mathematical Model

Consider a degradation process { X ( t ) } t 0 with a discrete set of states E = { 0 , 1 , , m , , n , F n + 1 } , where n stands for the maximal possible number of intermediate degradation states up to a complete failure state F. Until a degradation state m with 0 m < n , which is referred to as a signal state, is reached, only a gradual deterioration of the unit can take place. After reaching state m both another gradual failure and an instantaneous failure can occur, leading the deteriorating unit to a complete failure state F. The diagram of possible realization of the process { X ( t ) } t 0 is illustrated in Figure 3.
The following notations are used in the figure: S i j = T i + T i + 1 + + T j 1 —time to reach state j E from state i E , where T i is a random variable of a sojourn time in state i E , V is a random time to an instantaneous failure after entering the state m, U i is a random repair time after a failure in state i E . After the degradation process has reached the signal state m, two events are possible, according to a specified model. Either instantaneous failure occurs after a random time V, or there is a transition to the last degradation phase and a subsequent failure in time S m n + 1 . After an instantaneous failure, the unit is restored in a random period of time U x ( V ) , where x ( V ) E is a current gradual degradation state at the moment of the instantaneous failure. After restoration, the unit becomes as good as new, i.e., the degradation process starts again from state 0. For distribution functions of introduced random variables, we use the following notations:
F i ( t ) = P [ T i t ] , F i j * ( t ) = P [ S i j t ] , j > i , i , j E , G ( t ) = P [ V t ] , B i ( t ) = P [ U i t ] , i E , F i i + 1 * ( t ) = F i ( t ) , F i j * ( t ) = F i j 1 * F j ( t ) = 0 t F i j 1 * ( t v ) d F j ( v ) .
As a recovery control we consider a two-level control policy ( m , n ) to repair the system given that the signal state is m with a total number of degradation states n. We further introduce an additional cost structure associated with the functioning of the deteriorating unit: c i —a repair cost per unit of time after a complete failure in state i E \ { F } , c r —a fixed cost due to a complete failure after passing all degradation states, c R —a fixed cost due to an instantaneous failure, c e i —operational cost per unit of time in state i E .

2.2. Regenerative Process with Costs

The degradation process { X ( t ) } t 0 can be represented in the form of a regenerative process whose regenerative moments are the moments of visiting the signal state m, as shown in Figure 3. Denote by E [ Y m , n ] the average duration of a regenerative cycle, and by E [ Z m , n ] the average cost in this cycle for the policy ( m , n ) . We distinguish between two types of regenerative cycles, where the random variables Y m , n and Z m , n can be represented in the following way,
Y m , n = V + U x ( V ) + S 0 m , if S m j 1 V < S m j , S m n + 1 + U n + S 0 m , if S m n + 1 < V ,
Z m , n = c x ( V ) U x ( V ) + c R + i = 0 x ( V ) c e i H i + c e n + 1 H n + 1 , if S m j 1 V < S m j , c n U n + c r + i = 0 n + 1 c e i H i , if S m n + 1 < V ,
where H i is a random holding time in state i in a regenerative cycle.
Proposition 1. 
The average duration of a regenerative cycle for the threshold policy ( m , n ) is derived from a relation
E [ Y m , n ] = E [ min { V , S m n + 1 } ] + E [ S 0 m ] + E [ U n ] ( 1 p F ) + E [ U x ( V ) ] p F ,
where p F = P [ S m n + 1 > V ] is the probability of an instantaneous failure in one regenerative cycle.
Proof. 
From the definition of the regenerative cycle (1) and the law of the total probability we get
E [ Y m , n ] = j = m + 1 n E [ Y m , n | S m j 1 V S m j ] P [ S m j 1 V S m j ] + E [ Y m , n | S m n + 1 < V ] P [ S m n + 1 < V ] = E [ V + U x ( V ) + S 0 m ] j = m + 1 n P [ S m j 1 V S m j ] + E [ S m n + 1 + U n + S 0 m ] P [ S m n + 1 < V ] = E [ V ] P [ S m n + 1 > V ] + E [ S m n + 1 ] P [ S m n + 1 < V ] + E [ S 0 m ] + E [ U n ] P [ S m n + 1 > V ] + E [ U x ( V ) ] P [ S m n + 1 < V ] ,
which obviously implies the relation (3). □
Proposition 2. 
The average cost per unit of time in a regenerative cycle is derived from a relation
E [ Z m , n ] = c n E [ U n ] + c r + i = m n c e i E [ H i ] ( 1 p F ) + ( E [ c x ( V ) U x ( V ) ] + c R + E [ i = m x ( V ) c e i H i ] ) p F + i = 0 m 1 c e i E [ H i ] + c e n + 1 E [ H n + 1 ] .
Proof. 
The relation (4) follows directly from the definition (2) and the law of the total probability. □
From the theory of regenerative process it is known that the average cost per unit of time g m , n under the control policy ( m , n ) can be expressed as
g ( m , n ) = E [ Z m , n ] E [ Y m , n ] ,
where the failure probability p F and expectations can be calculated by
p F = 0 ( 1 F m n + 1 * ( t ) ) d G ( t ) , E [ min { V , S m n + 1 } ] = 0 ( 1 G ( t ) ) ( 1 F m n + 1 * ( t ) ) d t , E [ min { V , S m n + 1 } ] = 0 ( 1 G ( t ) ) ( 1 F m n + 1 * ( t ) ) d t , E [ S 0 m ] = i = 0 m 1 E [ T i ] = i = 0 m 1 0 ( 1 F i ( t ) ) d t , E [ U i ] = 0 ( 1 B i ( t ) ) d t , E [ U x ( V ) ] = j = m n E [ U j ] 0 P [ X ( t ) = j | X ( 0 ) = m ] d G ( t ) , E [ c x ( V ) U x ( V ) ] = j = m n c j E [ U j ] 0 P [ X ( t ) = j | X ( 0 ) = m ] d G ( t ) , E [ H i ] = 0 P [ X ( t ) = i , t < Y m , n ] d t , E [ i = m x ( V ) c e i H i ] = j = m n i = m j c e i E [ H i ] 0 P [ X ( t ) = j | X ( 0 ) = m ] d G ( t ) .
In addition to the average cost per unit of time, an analysis of the degradation process can include a calculation of the average time r ( m , n ) = E [ T ] to the first failure of the unit given that the initial state is X ( 0 ) = 0 ,
r ( m , n ) = E [ min { V , S m , n + 1 } ] + E [ S 0 , m ] ,
the steady-state probability π F ( m , n ) = lim t P [ X ( t ) = F ] of the failure state F,
π F ( m , n ) = E [ U n ] ( 1 p F ) + E [ U x ( V ) ] p F E [ Y ] ,
and the availability of the unit a ( m , n ) = lim t P [ X ( t ) F ] ,
a ( m , n ) = 1 π F ( m , n ) .
The optimization of functions (5)–(9) with respect to the policy ( m , n ) allow us to find the corresponding optimal values,
g ( m , n ) min m , n , g ( m , n ) min m , n with a constraint r ( m , n ) > b 0 , r ( m , n ) max m , n a ( m , n ) max m , n π F ( m , n ) min m , n .

2.3. Markov Degradation Model

In a special case of random variables with exponential distributions, T i E ( λ i ) , i E \ { F } , U i E ( μ i ) , i { m , , n } and V E ( ν ) , we explicitly get the expectations
E [ S 0 m ] = i = 0 m 1 1 λ i , E [ S m n + 1 ] = i = m n 1 λ i , E [ U i ] = 1 μ i , i = m , , n , E [ H i ] = 1 λ i , i = 0 , 1 , , m 1 .
To derive the expressions for p F and other expectations in (6), consider an auxiliary death process describing a continuous-time Markov chain { Y ( t ) } t 0 with a set of states E Y = { m , , n , F } and an absorption in state F. Denote by p j ( t ) = P [ Y ( t ) = j | Y ( 0 ) = m ] the transition probability of the death process at time t given that the initial condition is p m ( 0 ) = P [ Y ( 0 ) = m ] and p j ( 0 ) = P [ Y ( 0 ) = j ] = 0 for j m , j E Y . The probability p j ( t ) can be derived from the system of Kolmogorov’s differential equations. Applying the Laplace transform to this system and following [15], we obtain the following statement.
Proposition 3. 
The probabilities p j ( t ) satisfy the system of differential equations
p m ( t ) = λ m p m ( t ) , p i ( t ) = λ i p i ( t ) + λ i 1 p i 1 ( t ) , i = m + 1 , , n , p n + 1 ( t ) = λ n p n ( t ) ,
which has the unique solution
p j ( t ) = i = m j 1 λ i i = m j 1 e λ i t λ i φ j ( λ i ) ,
where φ ( x ) = ( x + λ m ) ( x + λ j ) , in particular
F m n + 1 * ( t ) = P [ S m n + 1 t ] = 1 p n + 1 ( t ) = 1 i = m n λ i i = m n e λ i t λ i φ n ( λ i ) .
Corollary 1. 
From Proposition 3 it follows that
p F = i = m n λ i i = m n ν ( ν + λ i ) λ i ϕ n ( λ i ) , E [ min { V , S m n + 1 } ] = i = m n λ i i = m n 1 ( ν + λ i ) λ i ϕ n ( λ i ) , E [ U x ( V ) ] = j = m n 1 μ j i = m j 1 λ i i = m j 1 ν ( ν + λ i ) λ i ϕ j ( λ i ) , E [ c x ( V ) U x ( V ) ] = j = m n c j μ j i = m j 1 λ i i = m j 1 ν ( ν + λ i ) λ i ϕ j ( λ i ) , E [ H i ] = 1 ν + λ i 1 j = m i 1 λ j j = m i 1 ν ( ν + λ j ) λ j ϕ i ( λ j ) , m i n , E [ H n + 1 ] = ( 1 p F ) 1 μ n + p F E [ U x ( V ) ] .
Proof. 
These expressions follow directly from (6) and the convolution property in the case of independent random variables. □
Denote by R ( t ) = P [ T > t ] the reliability function of the degradation process.
Proposition 4. 
The reliability function R ( t ) is of the form
R ( t ) = i = 0 n λ i i = m n j = 0 m 1 e ( λ i + ν ) t e λ j t λ i ( λ j λ i ν ) ϕ n ( λ i ) ψ m 1 ( λ j ) ,
where
ψ m 1 ( x ) = ( x + λ 0 ) ( x + λ m 1 ) , φ n ( x ) = ( x + λ m ) ( x + λ n ) .
Proof. 
The life time T of the degradation unit is defined as T = S 0 m + min { V , S m n + 1 } and then, due to Proposition 3, we get
P [ T > t ] = P [ S 0 m + min { V , S m n + 1 } > t ] = 0 t P [ min { V , S m n + 1 } > y t ] f m n + 1 * ( y ) d y = 0 t ( 1 G ( t y ) ) ( 1 F m n + 1 * ( t y ) ) f 0 m * ( y ) d y .
After substituting the distribution functions G ( t ) , F m n + 1 * ( t ) and the probability density function f 0 m * ( t ) = j = 0 m 1 λ j j = 0 m 1 e λ j t ψ m 1 ( λ j ) into the last expression after some algebra, we get
P [ T > t ] = i = 0 n λ i i = m n j = 0 m 1 1 λ i ϕ n ( λ i ) ψ m 1 ( λ j ) 0 t e ( λ i + ν ) ( t y ) λ i y d y .
Then, the subsequent integration leads to the expression (12). □

2.4. Numerical Examples

Let us consider a system with a maximal possible number of gradual degradation states n 14 , where a signal state 0 m < n . We assume the following monotonicity for the ordering of the values of system parameters, namely, μ i > μ i + 1 , i = m , , n , λ i < λ i + 1 , i = 0 , , n . Moreover let ν < < λ i , i = 0 , , n , in this case we assume that the instantaneous failure is a quite rare event, comparing the transitions over the graduate degradation states. We allow the parameters of the degradation process to take the following values by taking the above-mentioned considerations into account: ν = 0.001 ,
{ λ 0 , , λ 14 } = { 0.009 , 0.010 , 0.012 , 0.015 , 0.026 , 0.038 , 0.058 , 0.065 , 0.093 , 0.125 , 0.163 , 0.206 , 0.253 , 0.306 , 0.364 } , { μ 0 , , μ 14 } = { 1.650 , 1.642 , 1.618 , 1.578 , 1.523 , 1.450 , 1.362 , 1.258 , 1.138 , 1.002 , 0.850 , 0.682 , 0.498 , 0.298 , 0.082 } .
Further we fix the costs at the following values:
c i = 2 , i E \ { F } , c e i = 0.1 , i E , c r = 5 , c R = 10 .
Example 1. 
This example presents the results of calculating the optimal threshold policy ( m * , n * ) , which minimizes the average cost per unit of time g ( m , n ) defined in (5) as a function of different costs. One type of cost is set to be varied, while all other parameters take the fixed values as defined above.
In Figure 4, we display the optimal policy ( m * , n * ) as the repair cost c i (the figure labeled by “a”) and the operational cost c e i (the figure labeled by “b”) per unit of time varies. The blue cycles tag the values for the optimal signal state m * , while the magenta cycles tag the optimal values for the number of gradual failures n * . For example, if c i = 3 , the optimal policy is equal to ( m * , n * ) = ( 7 , 8 ) , and if c i = 9 , then ( m * , n * ) = ( 4 , 6 ) . If c e i = 0.3 , then ( m * , n * ) = ( 1 , 5 ) , and if c e i = 0.9 , then ( m * , n * ) = ( 1 , 3 ) . As can be seen from the figure, both thresholds are quite sensitive to an increase in the repair cost, and both thresholds are decreasing monotonously. At the same time, an increase in the operating costs also leads to a decrease in the signal threshold, but the total number of degradation phases has little sensitivity to an increase in this type of costs. The influence of the fixed costs c r and c R is shown in Figure 5 (the figure labeled respectively by “a” and “b”). The figure illustrates, that, e.g., for c r = 2 , the optimal policy ( m * , n * ) = ( 7 , 8 ) , and for c r = 6 , ( m * , n * ) = ( 8 , 9 ) .
We observe here non-monotonic changes in thresholds as the cost after a complete failure c r increases. For example, for c r = 2 , the optimal policy ( m * , n * ) = ( 7 , 8 ) , for c r = 6 , ( m * , n * ) = ( 8 , 9 ) , and for c r = 9 , we get ( m * , n * ) = ( 3 , 8 ) . In the figure, where the fixed cost c R due to an instantaneous failure increases, the optimal thresholds increases again monotonously. For example, if c R = 6 , the ( m * , n * ) = ( 1 , 6 ) , and, if c R = 18 , the ( m * , n * ) = ( 8 , 9 ) .
Example 2. 
In this example we compare the different optimization problems specified in (10). In Table 2, we evaluate the optimal thresholds n * for different optimization problems given that the signal threshold takes a fixed value m = 4 . The parameters of the deteriorating unit take on the values given at the beginning of the section.
As can be seen from these calculations, depending on the optimizing function, the solution for the optimal threshold policy differs considerably. Thus, when formulating the optimization problem, it is also necessary to pay close attention to the possible constraints on the values of other quantitative characteristics of the degrading unit.
Example 3. 
In this example we analyze the effect of the optimal policy ( 4 , n * ) evaluated with respect to optimization problems; see Cases 1–5 enumerated in Table 2, and the intensity of the instantaneous failure ν on the reliability function R ( t ) in (12).
In Figure 6, the intensity ν = 0.001 (the figure labeled by “a”) and ν = 0.005 (the figure labeled by “b”). We report the numerical examples to show the reliability function R ( t ) for the optimal threshold policies ( m * , n * ) . We can point out that for ν = 0.001 , Cases 5 and 2 correspond respectively to the boundary cases of the most and less reliable degrading unit. The curves of other cases are located between these boundaries, which coincides with the results from Table 2. If ν = 0.005 increases, we observe that in all cases the unit becomes less reliable, moreover, the curves for Cases 2, 3, and 5 coincide. In this figure, the most reliable model corresponds to Case 1. This result does not contradict the fact that the model in Case 5 maximizes the mean time to failure, since the signal threshold m is fixed in the calculation data and optimization is only based on the value of the second threshold n.

3. Degrading Unit with a Partial Preventive Repair

In this section we consider a deteriorating unit that degrades according to a degradation process { X ( t ) } t 0 , whose trajectory is illustrated in Figure 7.
The notations associated with this model are the same as previously. According to this model, the degradation process has the set of space E X as before. After reaching the signal state m, two events are possible: either there is a partial preventive repair in a random time V, which occurs in a degradation state x ( V ) with a state-dependent repair time U x ( V ) , or in time S m n + 1 , where there is a transition to a complete failure state and where the unit can be completely recovered in a random time U n . After a preventive repair in state x ( V ) , we assume that the unit may not be so good as a new one, and it returns to the gradual failure state x ( V ) l 0 , where l = n m .

3.1. Regenerative Process with Costs

The proposed degradation process { X ( t ) } t 0 can be treated again as a regenerative one, where the hitting times of the signal state m are the regenerative moments. In a similar was as before, we distinguish between two types of cycles with and without a complete failure within it. The random variables Y m , n and Z m , n are then of the form,
Y m , n = V + U x ( V ) + S x ( V ) l m , if S m j 1 V < S m j , S m n + 1 + U n + S 0 m , if S m n + 1 < V ,
Z m , n = c x ( V ) U x ( V ) + c R + i = x ( V ) l x ( V ) c e i H i , if S m j 1 V < S m j , c n U n + c r + i = 0 n + 1 c e i H i , if S m n + 1 < V .
Let us denote by p F = P [ S m n + 1 V ] the probability of a complete failure in a regenerative cycle. Then, the following statement holds.
Proposition 5. 
The average duration E [ Y m , n ] and average cost E [ Z m , n ] in a regenerative cycle satisfies the relations
E [ Y m , n ] = E [ min { V , S m n + 1 } ] + ( E [ S 0 m ] + E [ U n ] ) ( 1 p F ) + ( E [ U x ( V ) ] + E [ S x ( V ) l m ] ) p F , E [ Z m , n ] = c n E [ U n ] + c r + i = 0 n + 1 c e i E [ H i ] ( 1 p F ) + E [ c x ( V ) U x ( V ) ] + c R + E [ i = x ( V ) l x ( V ) c e i H i ] p F ,
where
p F = P [ S m n + 1 V ] = 0 F m n + 1 * ( t ) d G ( t ) ,
and
E [ i = x ( V ) l x ( V ) c e i H i ] = j = m n i = j l m 1 c e i E [ H i ] + i = m j c e i E [ H i ] 0 P [ X ( t ) = j | X ( 0 ) = m ] d G ( t ) .
Proof. 
The proof of this statement is based on relations (13) and (14), and it is similar to that applied to the first model with instantaneous failures. The details are omitted here. □

3.2. Mean Time to Failure

As there are preventative repairs in this model, the random time T to complete failure will consist of other components. Let us introduce the following notations: T F —the random time from the beginning of the regenerative cycle until the moment of a complete failure if it occurs, T N F —the random duration of the regenerative cycle without a complete failure, and N—the random number of cycles to complete failure. Since the complete failure occurs in each regenerative cycle with the probability p F , the number of cycles N has a geometric distribution, i.e.,
P [ N = k ] = ( 1 p F ) k 1 p F , k 1 , E [ N ] = 1 p F .
Then the mean time to the first complete failure r ( m , n ) = E [ T ] is of the form.
Proposition 6. 
The mean time to the first complete failure r ( m , n ) given that the initial state is X ( 0 ) = 0 satisfies the relation,
E [ T ] = E [ S 0 m ] + E [ T N F ] ( E [ N ] 1 ) + E [ T F ] = = E [ S 0 m ] + E [ min { V , S m n + 1 } ] 1 p F + ( E [ U x ( V ) ] + E [ S x ( V ) l m ] ) 1 p F 1 .
Proof. 
Due to the structure of the random variable T and the law of the total probability, we have,
r ( m , n ) = E [ T ] = E [ S 0 m ] + k = 1 E [ T | N = k ] P [ N = k ] = E [ S 0 m ] + E [ T N F ] k = 1 ( k 1 ) ( 1 p F ) k 1 p F + E [ T F ] = E [ S 0 m ] + E [ T N F ] 1 p F 1 + E [ T F ] .
Consistent with the definition introduced for random variables T F and T N F , we obtain E [ T F ] = E [ S m n + 1 ] and E [ T N F ] = E [ V ] + E [ U x ( V ) ] + E [ S x ( V ) l m ] . By substituting these expectations into the last expressions and subsequent taking into account that E [ min { V , S m n + 1 } ] = E [ S m n + 1 ] p F + E [ V ] ( 1 p F ) after some simple algebra, we get the relation (15). □
Now we will briefly discuss the problem of calculating the reliability function R ( t ) = P [ T > t ] . Let us assume that the complete failure of the system is a rare event. Further, we introduce the following notation: T N —the random time from the beginning of the regenerative cycle up to the time moment of the first complete failure within the Nth cycle. Due to the limiting theorem for the sojourn time of ergodic regenerative processes (see, e.g., [16]) the following statement holds.
Proposition 7. 
Let the failure in one regenerative cycle be a rare event, i.e., the number of regeneration cycles N is significantly large. If E [ T N F q ] < for some q > 1 , then for each t 0 the following asymptotic property can be derived,
R ( t ) lim N P [ T N > t ] = lim N e 1 E [ T N ] t = e p F E [ T N F ] ,
where
E [ T N ] = 1 p F E [ T N F ] , N .
In assumption of the Markov property, the reliability function of the degradation unit with an arbitrary p F can be calculated explicitly. The calculation is performed by means of conditional Laplace transforms f ˜ i ( s ) = 0 f i ( t ) e s t d t , R e [ s ] 0 , of the probability density function for the residual life time f i ( t ) d t = P [ T [ t , t + d t ) | X ( 0 ) = i ] given that the initial state is i. Obviously, in absorbing state F the conditional Laplace transform f ˜ F ( s ) = 1 .
Proposition 8. 
The Laplace transform R ˜ ( s ) of the reliability Function R ( t ) satisfies the relation,
R ˜ ( s ) = 1 s 1 q 0 ( s ) i = m n σ i ( s ) 1 i = m n τ i ( s ) q i l ( s ) j = m i 1 σ j ( s ) ,
where
q i ( s ) = j = i m 1 λ j s + λ j , i = 0 , , m , τ i ( s ) = ν μ i ( s + μ i ) ( s + λ i + ν ) , i = m , , n , σ i ( s ) = λ i s + λ i + ν , i = m , , n .
Proof. 
The time T is a duration commencing the initial state of the degradation process 0 and ends when the degrading unit visits the complete failure state F. The first-step analysis is employed for the Markov chain with an absorption in state F to get the system of equations for the conditional Laplace transforms f ˜ i ( s ) , i E X .
f ˜ i ( s ) = λ i s + λ i f ˜ i + 1 ( s ) , i = 0 , , m 1 , f ˜ i ( s ) = λ i s + λ i + ν f ˜ i + 1 ( s ) + ν μ i ( s + μ i ) ( s + λ i + ν ) f ˜ i l ( s ) , i = m , , n 1 , f ˜ n ( s ) = λ n s + λ n + ν + ν μ n ( s + μ n ) ( s + λ n + ν ) f ˜ n l ( s ) .
In (18), the equality for state i includes the term at ν , which represents the transition with a rate ν to a preventive maintenance state, from which the unit moves to a newer gradual failure state i l after an exponentially distributed time with a parameter μ i . Solving recursively equations (18) and using for convenience the notations q i ( s ) , τ i ( s ) and σ i ( s ) from the statement, we obtain explicit solutions for this system. In particular for the function f ˜ 0 ( s ) after some simple algebra we have
f ˜ 0 ( s ) = q 0 ( s ) i = m n σ i ( s ) 1 i = m n τ i ( s ) q i l ( s ) j = m i 1 σ j ( s ) .
The statement further follows from the fact that the Laplace transform R ˜ ( s ) and the conditional Laplace transform f ˜ 0 ( s ) satisfy the relation
R ˜ ( s ) = 1 s 1 f ˜ 0 ( s ) .
The inverse transformation of the function R ( s ) is required to get R ( t ) . Differentiation of the function R ˜ ( s ) with respect to the parameter s at point s = 0 results in the mean life time to a complete failure, i.e.,
E [ T ] = d d s f ˜ 0 ( s ) | s = 0 .

3.3. Numerical Examples

We next present the numerical examples involving the optimal policy ( m * , n * ) for different optimization problems and the numerical inversion of the transform R ˜ ( s ) . In the following examples we assume that n 14 , 0 m n , and l = n m m . Moreover, μ i > μ i + 1 , i = m , , n , λ i < λ i + 1 , i = 0 , , n , where λ i , μ i and the costs c i , c e i , c r and c R take the same values as for the previous model. We further fix ν = 0.10 .
Example 4. 
In Figure 8, we calculate the optimal policy ( m * , n * ) as the cost c i (the figure labeled by “a”) and the cost c e i (the figure labeled by “b”) varies. For example, the figure for the repair cost c i shows that if c i = 1 , then the optimal policy is ( m * , n * ) = ( 6 , 11 ) , and if c i = 3 , then we obtain ( m * , n * ) = ( 5 , 8 ) . From the figure for the operating cost c e i , we can see that if c e i = 0.2 , 0.6 , 1.2 , then the optimal policy is respectively equal to ( m * , n * ) = ( 5 , 9 ) , ( 4 , 8 ) , ( 1 , 2 ) . Thus, the values of the optimal thresholds decrease monotonically with an increase in the cost c i , while we no longer observe such monotonicity with an increase in the cost c e i . Moreover, at high operating costs, it becomes advantageous to limit the number of intermediate degradation states to a small number, i.e., by the policy ( 1 , 2 ) , so that the unit can be repaired to an almost new state after a complete failure.
The influence of the repair costs c r and c R is illustrated in Figure 9 (the figure labeled respectively by “a” and “b”). Here we observe, for example, that if c r = 1 , 6 , the policies ( m * , n * ) = ( 4 , 7 ) , ( 6 , 10 ) are optimal, and if c R = 5 , 15 , we then get the optimal policies ( m * , n * ) = ( 6 , 10 ) , ( 5 , 0 ) . In both cases the changes are monotonic, although the optimal control appears to be less sensitive to an increase in the cost c R .
Example 5. 
Table 3 summarizes the results of the calculations of optimal threshold levels for the different optimization criteria proposed in (10). In this case, just as in the previous degradation model, we obtained completely different optimal thresholds for different optimization criteria. The highest mean time to failure is naturally given by optimizing the function r ( m , n ) , i.e., r ( 4 , 8 ) = 948.61 . However, we obtained the shortest mean time to failure by minimizing the steady-state probability of a complete failure state π F ( n , m ) , i.e., π F ( 5 , 6 ) = 436.15 , which is not an obvious result.
Example 6. 
In Figure 10, we take ν = 0.05 (the figure labeled by “a”) and ν = 0.10 (the figure labeled by “b”). The control policy in both cases is ( m , n ) = ( 7 , 14 ) . The other parameters take on the values defined above and display the real reliability function R ( t ) as a numerical inversion of the Laplace transform R ˜ ( s ) from (17), together with the asymptotic function (16). The asymptotic function is defined respectively by R ( t ) = e 0.0048 t with E [ T N F ] = 57.31 , p F = 0.2775 , and R ( t ) = e 0.0014 t with E [ T N F ] = 70.19 , p F = 0.0982 .
We observe here that in the figure, which corresponds to a lower value for the probability of complete failure in a regeneration cycle p F , the asymptotic curve better approximates the corresponding real reliability function.

4. Conclusions

We investigated two types of degradation processes on a set of intermediate degradation states with the random time to an instantaneous failure and the random time to a preventive repair. In both cases, the rationality of using a two-threshold control policy to optimize a range of performance and reliability characteristics of degraded systems was confirmed. Thus, when modeling and putting into operation the technical units subject to a progressive process of degradation, it is possible to use the results of this work to provide the monitoring system with information about the optimal signal state and the optimal maximum number of discrete intermediate degradation states. In the case of the death Markov process, the reliability characteristics of the presented degradation units can be calculated analytically.

Author Contributions

Conceptualization, D.E.; Formal analysis, D.E. and N.S.; Methodology, D.E.; Software, N.S.; Writing—review & editing, D.E. All authors have read and agreed to the published version of the manuscript.

Funding

Open Access Funding by the University of Linz. This paper has been supported by the RUDN University Strategic Academic Leadership Program (recipient D. Efrosinin).

Data Availability Statement

Not applicable.

Acknowledgments

The authors acknowledge the financial support by the Johannes Kepler University Linz. This paper has been supported by the RUDN University Strategic Academic Leadership Program (recipient D. Efrosinin).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Esary, J.D.; Marshall, A.W.; Proshan, F. Shock models and wear processes. Ann. Probab. 1973, 1, 627–649. [Google Scholar] [CrossRef]
  2. Kopnov, V.A.; Timashev, S.A. Optimal death process control in two-level policies. In Proceedings of the 4th Vilnius Conference on Probability Theory and Statistics, Vilnius, Lithuania, 24–29 June 1985; Volume 4, pp. 308–309. [Google Scholar]
  3. Murphy, D.N.P.; Iskandar, B.P. A new shock damage model: Part II—Optimal maintenance policies. Reliab. Eng. Syst. Saf. 1991, 31, 211–231. [Google Scholar] [CrossRef]
  4. Singpurwalla, N.D. The hazard potential: Introduction and overview. J. Am. Stat. Assoc. 2006, 101, 1705–1717. [Google Scholar] [CrossRef]
  5. Singpurwalla, N.D. Reliability and Risk; Wiley & Sons Ltd.: Chichester, UK, 2006. [Google Scholar]
  6. Lisnuansky, A.; Levitin, G. Multi-State System Reliability. Assessment, Optimization and Application; World Scientific: Hackensack, NJ, USA; London, UK; Singapore; Hong Kong, China, 2003. [Google Scholar]
  7. Rykov, V.; Dimitrov, B. On multi-state reliability systems. In Proceedings of the Seminar Applied Stochastic Models and Information Processes, Barcelona, Spain, 8–13 September 2002; pp. 128–135. [Google Scholar]
  8. Efrosinin, D.; Rykov, V. On reliability control of Fault Tolerance Units. In Proceedings of the Forth International Conference on Mathematical Methods in Reliability, Santa Fe, NM, USA, 21–25 June 2004. [Google Scholar]
  9. Rykov, V. Generalized birth and death processes and their application to aging models. Autom. Remote Control 2006, 3, 103–120. [Google Scholar]
  10. Rykov, V.; Efrosinin, D. Degradation models with random life resources. Commun. Stat. Theory Methods 2010, 39, 398–407. [Google Scholar] [CrossRef]
  11. Giorgio, M.; Guida, M.; Pulcini, G. An age- and state-dependent Markov model for degradation processes. IIE Trans. 2011, 43, 621–632. [Google Scholar]
  12. Borodina, A.; Efrosinin, D.; Morozov, E. Application of Splitting to Failure Estimation in Controllable Degradation System. In DCCN 2017: Distributed Computer and Communication Networks; Communications in Computer and Information Science; Vishnevskiy, V., Samouylov, K., Kozyrev, D., Eds.; Springer: Cham, Switzerland, 2017; Volume 700. [Google Scholar]
  13. Kopnov, V.A. Optimal degradation process control by two-level policies. Reliab. Eng. Syst. Saf. 1999, 66, 1–11. [Google Scholar] [CrossRef]
  14. Kopnov, V.A.; Kanajev, E.I. Optimal control limit for degradation process of a unit modeled as a Markov chain. Reliab. Eng. Syst. Saf. 1994, 43, 29–35. [Google Scholar] [CrossRef]
  15. Gnedenko, B.V. The Questions of the Mathematical Theory of Reliability; Radio and Communications: Moscow, Russia, 1983. (In Russian) [Google Scholar]
  16. Solovyev, A.D. Asymptotic behaviour of the time to the first occurrence of a rare event. Eng. Cybern. 1971, 9, 1038–1048. [Google Scholar]
Figure 1. Corrosion process.
Figure 1. Corrosion process.
Mathematics 10 04098 g001
Figure 2. Fatigue crack growth.
Figure 2. Fatigue crack growth.
Mathematics 10 04098 g002
Figure 3. Realization of a degrading process under threshold policy ( m , n ) .
Figure 3. Realization of a degrading process under threshold policy ( m , n ) .
Mathematics 10 04098 g003
Figure 4. ( m * , n * ) versus c i (a) and c e i (b) for the model with an instantaneous failure.
Figure 4. ( m * , n * ) versus c i (a) and c e i (b) for the model with an instantaneous failure.
Mathematics 10 04098 g004
Figure 5. ( m * , n * ) versus c r (a) and c R (b) for the model with an instantaneous failure.
Figure 5. ( m * , n * ) versus c r (a) and c R (b) for the model with an instantaneous failure.
Mathematics 10 04098 g005
Figure 6. R ( t ) for ν = 0.001 (a) and ν = 0.005 (b).
Figure 6. R ( t ) for ν = 0.001 (a) and ν = 0.005 (b).
Mathematics 10 04098 g006
Figure 7. Realization of a degrading process under threshold policy ( m , n ) with l = n m .
Figure 7. Realization of a degrading process under threshold policy ( m , n ) with l = n m .
Mathematics 10 04098 g007
Figure 8. ( m * , n * ) versus c i (a) and c e i (b) for the model with a preventive repair.
Figure 8. ( m * , n * ) versus c i (a) and c e i (b) for the model with a preventive repair.
Mathematics 10 04098 g008
Figure 9. ( m * , n * ) versus c r (a) and c R (b) for the model with a preventive repair.
Figure 9. ( m * , n * ) versus c r (a) and c R (b) for the model with a preventive repair.
Mathematics 10 04098 g009
Figure 10. Real and asymptotic reliability functions R ( t ) for ν = 0.05 (a) and ν = 0.10 (b).
Figure 10. Real and asymptotic reliability functions R ( t ) for ν = 0.05 (a) and ν = 0.10 (b).
Mathematics 10 04098 g010
Table 1. List of the main notations used in the paper.
Table 1. List of the main notations used in the paper.
VariableDescription
{ X ( t ) } t 0 controllable degradation process
( m , n ) two-threshold policy with a signal state m and a number of degradation states n
Eset of states of the degradation process
E Y set of states of an auxiliary Markov death process
T i random sojourn time in state i
S i j time to reach state j from state i
U i random repair time in state i
Vrandom time to instantaneous failure (in Model 1) or to a preventive repair (in Model 2)
Y m , n random length of the regenerative cycle given that the control policy is ( m , n )
Z m , n random cost in a regenerative cycle for the policy ( m , n )
H i random time from Y m , n when X ( t ) = i
c i repair cost per unit of time is state i
c e i operational cost per unit of time in state i
c r fixed cost due to a complete failure
c R fixed cost due to an instantaneous failure (in Model 1) or due to a maintenance repair (in Model 2)
Table 2. Comparison of different optimization problems for the model with an instantaneous failure.
Table 2. Comparison of different optimization problems for the model with an instantaneous failure.
Case NOpt. Problem n * g ( 4 , · ) p F ( 4 , · ) π F ( 4 , · ) r ( 4 , · )
1 g ( 4 , n ) max n 80.11480.14840.0016442.83
2 p F ( 4 , n ) min n 50.11570.09720.0015391.67
3 π F ( 4 , n ) min n 60.11630.12040.0015414.82
4 g ( 4 , n ) max n 130.11910.17650.0040470.94
r ( 4 , n ) > 470
5 r ( 4 , n ) max n 140.15490.18140.0221475.87
Table 3. Comparison of different optimization problems for the model with a preventive repair.
Table 3. Comparison of different optimization problems for the model with a preventive repair.
Case Nr.Opt. Problem ( m * , n * ) g ( · , · ) p F ( · , · ) π F ( · , · ) r ( · , · )
1 g ( m , n ) min m , n (3, 6)0.12790.18710.0030848.44
2 p F ( m , n ) min m , n (2, 4)0.12800.16360.0037626.16
3 π F ( m , n ) min m , n (5, 6)0.24260.57180.0020436.15
4 g ( m , n ) min m , n (4, 8)0.13170.25350.0031948.61
r ( m , n ) > 900
5 r ( m , n ) max m , n (4, 8)0.13170.25350.0031948.61
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Efrosinin, D.; Stepanova, N. Optimal Control of Degrading Units through Threshold-Based Control Policies. Mathematics 2022, 10, 4098. https://doi.org/10.3390/math10214098

AMA Style

Efrosinin D, Stepanova N. Optimal Control of Degrading Units through Threshold-Based Control Policies. Mathematics. 2022; 10(21):4098. https://doi.org/10.3390/math10214098

Chicago/Turabian Style

Efrosinin, Dmitry, and Natalia Stepanova. 2022. "Optimal Control of Degrading Units through Threshold-Based Control Policies" Mathematics 10, no. 21: 4098. https://doi.org/10.3390/math10214098

APA Style

Efrosinin, D., & Stepanova, N. (2022). Optimal Control of Degrading Units through Threshold-Based Control Policies. Mathematics, 10(21), 4098. https://doi.org/10.3390/math10214098

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop