1. Introduction
In a conventional vacation queue, the servers (human or machine) suspend their service duties and leave for a vacation for a given amount of time before returning to service. There are various reasons for the server’s vacation, one of which is the deterioration of the service rate. In some systems, the decision on the vacation is instantaneous (e.g., due to unexpected human sickness or machine failure), while in other systems, the vacation is pre-planned based on the system policies (e.g., labor regulations for a human server or the technical maintenance of a mechanical server). Vacation queueing models can be applied to a large variety of real-world stochastic systems and have therefore attracted interest from many researchers. The idea of vacation queuing was introduced by Levy and Yechiali [
1]; in this model, during the idle time of a server in an M/G/1 queue, the server performs a secondary task. The duration of time that the server spends on the secondary task is considered to be a random variable with a known distribution function. Levy and Yechiali [
1] proposed two vacation queue models. In the first model, the server, upon returning to the initial queue, stays in the system even if there are no customers, and waits for the next arrival (single vacation policy). In the second model, the server takes another vacation if no customers are in the system at the time of returning to the initial queue (multiple vacations policy). In more recent surveys, Refs. [
2,
3] discussed additional types of models for vacation queueing systems, each with a different service policy. According to an
exhaustive service policy, the server returns from vacation after a random duration and starts to serve customers until becoming idle again (and leaving for the next vacation period). According to the
gated service policy, only customers who arrive during the vacation are served upon the server’s return from a vacation, and the server leaves for another vacation after serving only these customers. In the
limited-service policy, the server serves a predefined number of customers and leaves for a vacation after serving that number. If the queue is empty upon the return to service, the server waits for the first customer to arrive and continues according to the limited queue policy. Ibe and Isijola [
4] considered a system with two vacation types. One type of vacation follows a busy service period, while the other type follows a relaxed service period in which the server is idle for part of the time. This model is based on human as well as technical behaviors. It considers the case of human servers, where the server requires a longer recuperation period after an exhaustive service period compared with a more relaxed service period in which the customer arrival rate is relatively low. In the case of a technical server (e.g., a manufacturing CNC machine or an automatic packaging machine), the service time after exhaustive use is likely to be longer than that after limited use or none at all due to the more significant wear and tear of the machine. For further discussion on vacation queues, see, e.g., [
5,
6,
7,
8].
Traditionally (as introduced by Levi and Yechiali [
1]), the server’s vacation time is used for performing ancillary duties while the main service is suspended. Thus, vacation policies do not improve the main queueing system’s efficiency and may even impair it. In particular, the need to wait for a server to complete ancillary duties may increase customers’ waiting times. According to the extensive research conducted by Polas et al. [
9], waiting time has a significant and implicit impact on customer satisfaction (see also [
10,
11,
12]). Hanukov et al. [
13] suggested an alternative approach (instead of vacations) for the utilization of a server’s idle time, which can improve the main queueing system’s efficiency. On one hand, all these papers presume that an idle server reduces the overall efficiency of the system. On the other hand, many researchers have pointed out that the effectiveness of non-stop work diminishes over time, while it increases after a rest period. In his book, Pang [
14] claimed that, in the context of human servers, “
rest is an essential component of working well and working smart” and outlined several studies demonstrating that rest not only enables physical recuperation and the opportunity to think and innovate, but also increases overall productivity. Similarly, Weber et al. [
15] stated that “
When individuals undertake too many tasks, they actually are less efficient. It has been scientifically proven that idle time is required for developing creativity”. The effect of extended non-stop working hours on the performance of resident physicians in hospitals has been analyzed [
16], and as a result, the actual working period of resident physicians between rest periods has been reduced. Another study on the effects of driving time and rest time on the performance of commercial bus and truck drivers concluded that a 15 min recuperation period, following a 2 h continuous driving task, allowed drivers to maintain a safe driving performance [
17].
Likewise, a machine might process a task at a much slower rate because of wear and tear, thus needing time to be serviced or repaired. To this end, the authors of [
18] considered maintenance control models in which the deterioration of the performance of a machine is modeled as a discrete-time Markov chain. At each state transition in the chain, the machine is either left untouched for the next service or it is repaired/replaced with another machine. If the machine is repaired, service is suspended during the repair period, while if the machine is replaced, services continue with the replacement machine.
To the best of our knowledge, Eisen [
19] was the first to introduce the term “
deteriorating server”. In his model, the mean service time increases after each customer is served, but due to the complexity of the problem, he focused on simple cases where the number of states is relatively small (e.g., three levels of service rates: normal, slow, and down) in order to obtain an analytical solution. For larger problems, Eisen proposed using simplified ergodic Markovian tools and selected arbitrary constants. In a more recent study, Kaufman and Lewis [
20] considered a production machine maintenance policy modeled on a single-server queue. The machine is subject to deterioration that results in slower service rates and, eventually, in failure. They proposed a maintenance policy that considers the state of the machine (server) and the number of customers (jobs) in the queue. They also presented a repair model in which the costs of the server are fixed and a replacement model in which variable costs are considered. They showed that in general, the optimal maintenance policies have a switching curve structure that is monotonous in the server state. Yang et al. [
21] considered maintenance in an
M/
M/1 queue which is subject to random shocks that cause a sharp deterioration of the service rate, and they proposed a preventative maintenance policy. Fitouhi and Nourelfath [
22] addressed the problem of integrating noncyclical preventive maintenance and tactical production planning for a single machine. Given a set of products that must be produced in lots during a specified finite planning horizon, the maintenance policy suggests possible preventive replacements at the beginning of each production planning period to achieve minimal repairs and machine failures. In a recent study conducted by Huang et al. [
23], the authors considered two types of queues, with each queue characterized by different exponential service requirements and serviced by a single server with a deteriorating service rate. In each state, the server can choose which queue to serve or to undergo a maintenance procedure. There are costs associated with waiting customers and with the cost of maintenance (the problem is derived from the semi-conductor manufacturing industry). The authors showed that a fixed maintenance policy is optimal for general arrival and server state processes, and a joint scheduling and maintenance policy is optimal when the maintenance policy is independent of the queue lengths, using a semi-Markov decision process (SMDP) model. Bouslah et al. [
24] addressed the problem of maintenance control for production lines, where the reliability of each machine depends on the prevention of failures in other machines. As an illustration of this problem, they studied a two-machine line model. Choudhary et al. [
25] also considered the deterioration of the server’s efficiency during service, which was recovered through a vacation of a random time duration. For clarification,
Table 1 summarizes previous studies that address the deterioration of the server.
An additional stream of research that is related to our work is the so-called ‘queues with modulated service rate’ (also called ‘queues in a random environment’), according to which the service system operates in different environments, and in each environment, the server serves customers at a different service rate. Examples of such a system include traffic flow control that is affected by scheduled or unscheduled incidents, and computer systems in which the processing rate depends on the applications running in parallel (see, e.g., [
26]). The studies in this stream assume that the service rate of a specific service depends on the current environment and is independent of prior services. In contrast to these models, in our work, the current service’s execution is affected by prior services, such that the service rate of the current service depends on the following factors: (i) the number of services that have already been provided by the server in the work period; (ii) the service rates of the previous services provided by the server; and (iii) the time that the server spent in the vacation period before starting the new working period. This model is formulated and used to provide a scheme for determining the optimal allocation of the server’s active (in service) and idle (in vacation) time in order to increase the service system’s efficiency, which is the main goal of our work.
Another possible vacation policy was introduced by Servi and Finn [
27], who proposed the ‘working vacation’ scheme, where the server changes the service rate during the vacation period (instead of no service at all, as in the previous models). Bouchentouf et al. [
28] extended the working vacation scheme to the M/M/c/N feedback queueing system with breakdowns and customers’ impatience. They developed an optimization problem based on expected cost function. Do et al. [
29] considered the
M/
M/1 retrial queue with strategic customers and working vacation. They investigated the optimal and the Nash equilibrium strategies and compared different scenarios regarding information availability. For further discussion on working vacation models, see, e.g., [
30,
31,
32].
In this paper, we introduce a variation of the conventional vacation queueing policy that takes into account and quantifies the effect of the server’s rest and recuperation on the system’s effectiveness. According to the proposed model, the effectiveness of the server deteriorates during service and is increased during the recuperation period. Accordingly, the main server leaves the service from time to time for recuperation, and is temporarily replaced during the recuperation period by a secondary server (who/which is less effective). The main server returns to service upon the conclusion of the recuperation period. The paper focuses on the time when the main server leaves service and the length of the recuperation period. While Choudhary et al. [
25] assumed that the working and vacation durations are exogenous variables that cannot be controlled by the system, in reality the system’s manager can control these variables (e.g., by controlling the vacation duration) in many service systems. Thus, in the current work, we extend this model, taking the working and vacation durations as endogenous variables, and provide a scheme for the system’s manager to optimize the vacation policy.
We consider two types of systems: (i) a system in which the server’s efficiency level deteriorates to 0 after a long time, meaning that departing for a vacation is mandatory; and (ii) a system with a deteriorating service rate that, after a long time, reaches a certain (low) level, meaning that, theoretically, the server can operate for an infinite period of time without vacations. As mentioned above, in a large number of practical situations, the rate of service in a service system may decrease randomly. For example, a human worker (e.g., a cashier at a supermarket checkout, an officer at an airline check-in counter, or a worker on an assembly line) is likely to work slower when s/he is tired; likewise, a machine (e.g., a computer, a manufacturing machine, or an automatic food-packaging machine) might process a task at a much slower rate as a result of wear and tear. Thus, a worker needs time to recuperate, and a machine needs time to be serviced or repaired, during which he/she/it can be replaced with a temporary worker or machine.
Given the proposed queuing system described above, we address the following research questions:
How many customers should be served before releasing the main server for a vacation?
How long should the server spend on vacation in each cycle in order to minimize the customers’ mean sojourn time in the system?
How does utilizing the vacation period to recover the server’s efficiency affect the customer waiting time?
The main contributions of the paper are as follows:
- (i)
We constructed a two-stage service system with a proposed vacation policy as a quasi-birth-and-death (QBD) process and developed quantitative performance measures.
- (ii)
We developed a methodology for determining the optimal number of customers to be served during the working period and the optimal mean time of the vacation period.
- (iii)
We show that the main server’s service rate may be much higher than the temporary server’s rate at the optimal switching point.
- (iv)
We show the percentage of improvement achieved using the suggested vacation policy.
The paper is organized as follows:
Section 2 describes the model formulation, and
Section 3 provides a general economic analysis using a benchmark system. In
Section 4, we compare the performance of the proposed model with that of a regular
M/
M/1 model with a deteriorating server.
Section 5 provides concluding remarks.
Note that, although each of the variables and parameters used in this paper is defined and described when required, a comprehensive detailed list of notations is provided in
Appendix A.
2. Model Formulation
Here, we consider a service system with a single server. Customers arrive according to a Poisson process with rate . The server’s efficiency level (hereafter EL) deteriorates after each service is provided. The server provides n services and then leaves for a vacation recovery period, irrespective of the number of customers currently present in the system. The duration of the vacation follows an exponential distribution with rate . During the vacation, the server’s EL increases in time. Let m be the mean EL when the server returns to the system (this can be smaller or larger than 1). Let , , be the server’s mean EL when providing the service, where after the server’s return from a vacation and provision of the first service in the current cycle, and just before the server leaves for a vacation (the last service in the work period). Let be the rate of the service provided by the server, where μ is the nominal service rate. Accordingly, , and so . During the vacation, the main server (hereafter MS) is replaced with a temporary server (hereafter TS). The duration of a service during the vacation period, during which these services are performed by the TS, is exponentially distributed with rate . When the MS returns from the vacation, he/she/it joins the system even if there are no customers. If, upon returning from the vacation, the MS finds that the TS is providing a service to a customer, the MS waits inside the system until the service’s completion and then replaces the TS.
The system’s manager aims to use the vacation as a tool for improving efficiency for future customers, which is accomplished at the expense of current customers. Thus, the system’s manager intentionally sets the vacation policy, which involves setting both (i) the number of customers served by the MS in each work period (
n) and (ii) the mean vacation time (
). Such a control structure, in which a decision is made regarding the mean vacation time, is in line with prior works (see, e.g., [
33]). Note that
can be described by the values
n and
m (the greater the reduction in the mean EL that needs to be regained during the vacation (
), the longer the required vacation will be). Thus, for a convenient presentation of the analysis,
m and
n are considered as the system’s decision variables. An example based on the explicit expression of the mean vacation time as a function of
n and
m is presented in the next section.
In order to analyze the system in a steady state, it is formulated as a quasi-birth-and-death (QBD) process. The system’s state is defined by a two-dimensional vector . L denotes the number of customers in the system in the steady state, and S denotes the MS’s status in the steady state, as follows: (i) if S is equal to any value between 1 and n, S denotes the step number of the service, which is given by either the number of customers served since the last vacation (when the server is active in the system) or the step number of the next service (when the server is idle in the system and waits for the next customer). (ii) If S equals ‘v’, this indicates that the MS is on vacation; (iii) if S equals 0, this indicates that the MS has just returned from a vacation and is waiting in the system for the completion of the TS’s service. We define the steady-state joint probability distribution function of the two-dimensional Markovian process as , ; .
In order to construct the infinitesimal generator matrix Q of the corresponding QBD process, the system states are arranged in the following order: , .
Then,
where the matrix
is of the order
,
is of the order
,
is of the order
, and
,
, and
, are each of the order
. These matrices are given as follows:
, where
I is the identity matrix:
For each system’s level
i, we define the corresponding probability vector as
and
,
. Then, the total probability vector of all the system states is given by
. Let
be a column vector with all its entries being equal to one. The system’s balance equations are given by
Theorem 1. The system’s stability condition is given by Proof of Theorem 1. Let
and let
be the unique solution of
Then, according to Neuts [
34], the condition for stability is given by
Solving the set in Equation (4) and substituting in Equation (5) proves the claim. □
Let
R be the rate matrix satisfying
Then, the steady-state probabilities of the queueing system are calculated (see Neuts [
34]) by
where
and
are calculated using some of the balance equations along with the normalization equation, specifically by
For a given
m and
n, let
be the mean number of customers in the system, let
be the mean number of customers in the queue, let
be the mean sojourn time of a customer in the system, and let
be the mean waiting time of a customer in the queue.
is given by
Then,
is given by
According to Little’s law, and .
Let
be the portion of time for which service is provided by the TS, and let
be the portion of time for which service is provided by the MS. Let
be an
-dimensional column vector and let
be an
-dimensional column vector. Then,
is given by
and consequently,
. Let
be the portion of time that the MS is on vacation, and let
be an
-dimensional column vector. Then,
is given by
Let
be the portion of time that the MS is available in the system after returning from vacation but is waiting for the TS to complete the current service, and let
be an
-dimensional column vector. Then,
is given by
and consequently,
.
3. Economic Analysis
In this section, we provide a numerical study demonstrating how to determine the optimal values of
n, the number of services in each cycle before the MS leaves for a vacation, and
m, the mean EL of the MS at the point of returning to the system. We define
, the server’s EL, as a decreasing function of time
t, which deteriorates during the
service. In line with [
35], we assume the following exponential structure of
, implying a diminishing marginal return of the server’s EL deterioration during the service:
where
is the deterioration rate. According to Equation (14),
and
. Due to the exponentiality assumption of the service rate, the deterioration of the efficiency does not affect the service rate during a specific service episode, i.e., the service rate of a particular service is constant during the actual provision of service to a specific customer, which is determined by the EL at the beginning of the service. After a service is completed, the EL for the next customer is updated according to Equation (14). Since the
service is exponentially distributed with the rate
, the
is recursively given by
Then,
is recursively given by
where
is the maximal service rate in a cycle (which is the service rate of the first customer after the main server returns from the vacation), and
is the minimal service rate in a cycle (which is the service rate for the last customer in the cycle before the main server leaves for a vacation). Thus, the main server leaves for a vacation with a mean EL equal to
According to Equation (16), (i)
monotonically deteriorates with
k, and (ii) the service rate approaches zero when the number of services in the cycle approaches infinity,
. We prove the latter property in the following proposition.
Proposition 1. The service rate (given in Equation (16)) approaches zero when the number of services in the cycle approaches infinity, .
Proof of Proposition 1. Denote the limit as c. Then,
. Solving leads to . □
As mentioned, while on a vacation, the main server recovers according to a time-dependent recovery function, and therefore the EL grows. The main server’s mean EL upon returning to the system,
m, depends on the mean EL at the beginning of the vacation
and the mean time spent on vacation
. We assume that
m is given by the following function:
This function is used because (i)
, i.e., if the server does not take a vacation, the mean EL remains as
, and (ii)
, i.e.,
m approaches a certain constant efficiency value,
, when the mean duration of the vacation is sufficiently long. As described in the previous section, initially, the system’s manager decides on the number of customers served in each work period (
n) and the mean vacation time
. Then,
m can be calculated as a function of
n and
, as given in Equation (17). However, for convenient presentation of the analysis, we consider
m and
n as the system’s decision variables and then calculate
as a function of
m and
n. Accordingly, by isolating
in Equation (17), we obtain
Let be the return rate from the vacation as a function of m and n, and let be the service rate of the service as a function of m and n, where m and n, as mentioned above, are the system’s decision variables.
We use the following parameter values for a base illustrative example: Consider a service system in which customers arrive with a rate
. For the service rates, we set
, which allows us to calculate the MS’s service rate for each service
and we use
to represent the TS’s service rate. During the MS’s active period, the EL deteriorates at a rate of
(see Equation (16)), and during the vacation, the EL grows at a rate of
(see Equation (17)). Finally, we set the maximal EL to
. These values are used because they satisfy the stability condition, which is given in Theorem 1 for a large range of
m and
n values, therefore enabling us to analyze the system’s performance for a wide range of parameters.
Figure 1 and
Table 2 present the values of the customers’ mean sojourn time,
, for a wide range of combinations of
with
, where the best value (i.e., the minimal value) of
, denoted as
, is underlined and printed in bold in
Table 2. The values in
Table 2 are obtained by substituting the corresponding values of
m and
n into
, where
is calculated using Equation (9). According to the best solution, the main server serves
customers in each cycle and then leaves for a vacation. The main server returns from vacation when their/its mean EL reaches
. Thus, in the optimal setup, according to Equation (18), the server is on vacation for
min, and the customers’ mean sojourn time in the system is
min.
As mentioned above,
Figure 1 depicts the value of the sojourn time
for various
m and
n values. As shown here,
is a smooth convex function with a minimum value at
(approximately 72% of the maximal value of 1.8) for a wide range of values of
n. The gradient of
remains relatively small for small variations in
of approximately ±11%, but it is more sensitive to variations in
in the extreme region
. An interesting observation based on this analysis is that the duration of the vacation period (expressed by
m) has a stronger effect on the mean sojourn time of customers in the system than the number of services before the time of leaving for a vacation
.
Figure 2 shows how the MS’s service rate deteriorates during the operating time, compared with the constant service rate of the TS. Note that when the servers are switched (i.e., when
), the MS’s service rate is still much higher than that of the TS (28 vs. 5). This result indicates that switching servers is beneficial for the system, even at the expense of a significant temporary reduction in the current service rate due to the TS’s inferior performance. Intuitively, the switching point should take place when the MS’s service rate is close to the TS’s service rate or even drops below it. However, as shown in the figure,
the ideal switching point is when the MS has considerable ‘spare’ capabilities compared with the TS. This can be explained by the recovery period of the main server during the vacation. Note that, as previously mentioned, the service rate of the MS deteriorates in an exponential manner during the work period. The illusion of a linear behavior in
Figure 2 is due to the fact that the figure shows a close-up view of the functions within a narrow window of data. For a wider view, see
Appendix B.
Sensitivity Analysis
In order to investigate the effects of the system’s parameters on the optimal decisions
and
on the optimal sojourn time
, we use the numerical example given at the beginning of this section as a base case and calculate
,
, and
for various values of the system’s parameters (
,
,
b, and
), changing the value of a single parameter each time. The results are presented in
Figure 3,
Figure 4,
Figure 5,
Figure 6,
Figure 7,
Figure 8,
Figure 9 and
Figure 10.
Figure 3 shows the results of
for different values of
(the service rate of the TS). As expected, the best sojourn time
is reduced with the increase in the TS’s service rate
(a better temporary server). The effect of
on
is given by a monotonous exponential function, as its effect becomes smaller when
is larger than 15 (50% of the nominal value of
). Assuming that the cost of the TS is proportional to the service rate value, based on these results, it is sufficient to select a TS with a service rate of 50–60% of the MS’s nominal service rate.
Figure 4 shows the simultaneous effects of the TS’s service rate
on
,
and the fraction of time for which the MS is on vacation,
. The conclusions from this figure are as follows: (i) when
increases, the optimal number of services provided before leaving for a vacation,
, decreases at an exponential rate, and (ii) the optimal mean EL at the points of the MS’s return,
, increases linearly with
. Moreover, the figure shows that when
increases, the
increases (see, e.g., the switching point from
to
, where
increases and
remains constant). However, interestingly, a decrease in
does not affect the
(see, e.g., the switching point from
to
, where
decreases and
remains constant). This result can be explained as follows: if the MS performs fewer services, he/she/it stays in the system for less time; on the other hand, when the MS leaves for a vacation, his/her/its EL is higher and, consequently, the MS needs less time to reach the return level
. Thus, the decrease in
has two opposite effects on the
, and as a result, the
does not change. To study this phenomenon in greater depth,
Table 3 presents the
for various values of
and
. The table shows that for small values of
n (
), an increase in
n has a positive effect on the
. However, for larger values of
n , the
almost remains constant with
n. Moreover,
Table 3 shows that the
increases very slightly with
n, and then
decreases very slightly, which can also be explained by the two opposite effects of
n on the
, as described above.
Following the results shown in
Figure 4c and
Table 3, we compared the values regarding the portion of time that the main server is on vacation (
) with several labor regulations and policies related to the work conditions of cashiers in supermarkets in general, and to breaks during cashiers’ shifts in particular. The regulations and policies for the rest periods of cashiers in supermarkets vary from country to country and from company to company, depending on the country’s jurisdiction and laws and specific company policies. However, there are some general guidelines and common practices. In most countries and companies, rest periods for cashiers are scheduled at regular intervals, and although the specific timing and duration of the breaks vary, they are typically in the range of 10 to 15 min every 3–4 h (the figures are taken from job descriptions for supermarket cashiers in several Western countries), which reflect
values (the portion of time in which the MS is on vacation) in the range of 0.041 (10 min break every 4 h) to 0.083 (15 min break every 3 h). According to our model and the results given above, the portion of the server’s rest period for the given system parameters varies between 0.023 and 0.393, with the best value of
for
and
(see
Table 2). Based on these results, the rest period of cashiers in supermarkets should be 17 min every 3 h or 23 min every 4 h. However, our model does not consider other factors, such as financial considerations (costs of the MS and the TS, the average profit from each customer, and the transition times between servers). The addition of all these factors to our model could benefit decision makers in determining working policies (in this case, supermarket cashiers) in order to optimize the system’s performance.
Figure 5 and
Figure 6 illustrate the effect of the MS deterioration rate
on the entire system.
Figure 5 shows that, as can be expected,
increases linearly with the MS’s deterioration rate.
Figure 6 shows that
decreases at a diminishing rate with the increase in
, and
only slightly decreases with the increase in
. The question that arises is how an increase in
affects the
(the MS’s efficiency upon leaving for a vacation). Obviously, on the one hand, an increase in
has a negative effect on
, but on the other hand,
decreases with
, which has positive effect on
.
Figure 6 also shows that
decreases with the increase in
. This means that the main server compensates for the increasing loss of efficiency by reducing the number of services in each cycle; however, this loss is not totally compensated for.
Figure 7 and
Figure 8 illustrate the effects of changes in the recovery rate
b on the system’s parameters.
Figure 7 shows that, as can be expected,
decreases linearly with the increase in the MS’s recovery rate.
Figure 8 shows that both
and
are almost entirely robust to changes in
b. This result can be explained by the vacation time, which acts as a function of
b (see Equation (18)).
Finally,
Figure 9 and
Figure 10 present the effects of various customers’ arrival rates
on the system. As can be expected,
Figure 9 shows that
increases linearly with the increase in the customers’ arrival rate, while
Figure 10 shows that
is totally robust to changes in
, and
slightly increases with the increase in
. Interestingly, the figure shows that
slightly increases with
. This result can be explained as follows: as more customers arrive and enter the system, the MS is less idle during the time in which he/she/it is in the system and thus spends less time in the system.