1. Introduction
Queueing theory is a branch of applied mathematics that studies waiting lines or queues. It finds widespread applications in various fields, including telecommunications, computer networks, manufacturing systems, and service industries. Two server queueing models are particularly significant, as they offer insights into scenarios where multiple servers handle arriving customers or tasks. While traditional queueing models assume independent arrival and service processes, in many practical situations these processes are interconnected in a way that can significantly impact system performance and behaviour. The study of queues with interdependent arrival and service processes helps in understanding how these mutual influences shape queue dynamics, waiting times, resource utilisation, and overall system efficiency.
1.1. Literature Review of Queues with Interdependence in Arrival and Service Processes
Mitchell et al. [
1] investigated an M/M/1 queue in which the arrival and service processes are linked through correlated random variables, conforming to a bivariate exponential distribution. This approach sheds light on how correlations between customer service times and the time between their arrival impact the system. Courtois et al. [
2] generalized the M/G/1 queuing process by assuming both the arrival and service rates to be essentially arbitrary functions of the number of customers currently using the system. In addition, they assumed that the amount of service demanded by a customer is conditioned by the queue length at the moment service is begun for that customer. Sengupta [
3] investigated a single-server semi-Markovian queue with a first-come-first-served policy in which both the arrival and service processes follow a semi-Markov process. Boxma et al. [
4] considered a storage model with a dual interpretation: as a distinctive queueing model featuring interdependence between service requests and subsequent inter-arrival times, and as a fluid production/inventory model operating within a two-state random environment.
Muller et al. [
5] studied a queueing system characterised by partial correlation, focusing on the interplay between the quantity of work (service time) introduced by a customer and the subsequent inter-arrival time, which the authors assumed to be interdependent. The focus of the study revolved around a distinctive single-server multi-station alternating queue configuration in which the interplay between preparation and service times involves intricate auto- and cross-correlation patterns. Their investigation was carried out across two distinct scenarios. Vlasiou et al. [
6] addressed a configuration involving a single-server multi-station alternating queue characterised by interrelated preparation and service times and exhibiting both auto- and cross-correlations. Adan et al. [
7] presented a comprehensive investigation into a single-server queue system in which both the inter-arrival times and the service times depend on a common discrete time Markov chain. This model represents an extension of the well-known MAP/G/1 queue, incorporating interdependencies between inter-arrival and service times. Iyer et al. [
8] analysed queueing models characterised by joint density of the inter-arrival and service times. These models involve a mixture of joint densities, and find natural applications in scenarios involving a single server catering to multiclass populations through a shared queue. Additionally, this approach can help to model interdependencies between inter-arrival and service times in the context of queue control models. Buchholz et al. [
9] focused on a queueing system in which the inter-arrival times exhibit correlation and the service times are correlated with the inter-arrival times. They established a compelling connection between this correlated queueing model and the MMAP[K]/PH[K]/1 queue, a well-studied structure to which matrix geometric algorithms are readily applicable. Kim et al. [
10] analysed the wait time distribution in an M/M/1 queue in which the inter-arrival time between the
n-th and
-th customers along with the service time of the
n-th customer are modelled as correlated random variables characterised by Downton’s bivariate exponential distribution. Krishnamoorthy et al. [
11] examined a queueing system with finite Markov-dependent arrival and service batch sizes, and additionally considered correlated successive inter-arrival and service time durations. Dai et al. [
12] investigated a unique category of correlated queue in which the service duration of an arriving customer hinges upon the inter-arrival time separating them from the previous customer.
Moiseev et al. [
13] examined a two-stage queuing system with infinite servers in which customers arrive according to a Markovian Arrival Process (MAP). In this system, the service times at the first stage and the probability of transitioning to the second stage depend on the type of request, which is determined by the state of the underlying Markov chain of MAP when the request arrives. The authors’ analysis of this model utilized the multi-dimensional dynamic screening method.
1.2. Literature Review of Queues with Multiservers
The literature abounds with extensive research concerning queueing models that involve multiple servers and operate under the assumption of independent arrival and service processes. Kleinrock [
14] laid the foundation for analysing queueing systems with multiple servers. The earliest works focused on fundamental models characterised by independent customer arrivals and exponential service durations. These pioneering studies offered essential insights into the behaviour of queueing systems, particularly in the context of systems featuring multiple servers. These foundational investigations laid the groundwork for subsequent advancements in the field. Cohen [
15] played a crucial role in advancing this trajectory by expanding the analytical framework to accommodate batch arrivals and service times. This extension was a pivotal step, contributing to a more comprehensive understanding of the dynamics of queueing systems with multiple servers. Neuts et al. [
16] addressed a notable challenge by highlighting the analytical complexity of queueing systems involving more than two heterogeneous servers. They aptly observed that in such scenarios the steady-state behaviour of the system can only be explored through algorithmic approaches. Building upon this observation, Krishna Kumar et al. [
17] studied an M/M/2 queueing system characterised by heterogeneous servers.
There are a number of recent papers dealing with two-server queueing systems, including [
18,
19,
20,
21,
22,
23,
24,
25,
26,
27,
28], to which interested readers may refer.
A semi-Markov process is both a Markov renewal process and a generalization of the Markov process. Unlike Markov renewal processes, which focus on the number of visits to each state, a semi-Markov process considers random variables describing the time spent in each state before transitioning. In discrete-time Markov processes, transitions occur after fixed unit time intervals; in continuous-time Markov processes, transitions follow exponential distributions for the time spent in each state. A semi-Markov process resembles a Markov process in terms of state-to-state transitions; however, it is set apart by the time spent in each state before moving to the next state being a random variable with parameter(s) that depend on both the state it is currently in and the one to be visited next.
In a semi-Markov process, changes of the occupied states take place according to a Markov chain rule; nevertheless, a random amount of time between changes is required. Suppose that a process can be in any one of N states and that each time it enters state it remains there for a random amount of time before transitioning to state j with probability . The state j that is to be visited next is chosen at the time that state i is visited. The sojourn time in i depends on both i and j. Let the time until the transition from i to j have a distribution , and let denote the state at time t; then, is called a semi-Markov processes.
To define a semi-Markov process, it is necessary to have a finite state space Markov chain (MC). This MC can be one-dimensional or higher, and must be finite. Associated with the MC is its initial probability vector (IPV) and one-step transition probability matrix (TPM). The semi-Markov process evolves as follows. First, choose an initial state j according to the IPV. A particle occupies this state. At this moment, the next state (say, k) to be visited by the particle is chosen; this is sampled using the one-step TPM. The sojourn time of the particle in j is distributed based on the mean time in j as . At the end of this time, the particle transits to k; at this epoch, the next state (say, m) to be visited is chosen using the one-step TPM. In k, the sojourn time has a mean value . This process continues; the continuous time process is referred to as a semi-Markov process. The sojourn time distribution can be exponential or any general distribution (with the non-negative part of the real line as support). In this paper, the sojourn time distribution is taken as the exponential distribution.
We consider a two-server queueing system in which the arrival and service processes are interdependent. Numerous studies in the existing literature have investigated the interdependence among random variables and processes through various analytical methods. In this paper, we analyse the interdependence among arrival and service processes using a semi-Markov approach. This is a new approach first introduced by Achyutha Krishnamoorthy in 2021 (see [
29,
30]). The procedure is as follows. Suppose that
n processes evolve in an interdependent fashion, as explained below. Assume that these
n processes are generated through transitions in the states of
n finite state space-distinct Markov chains. They may or may not have absorbing states. Considering the product space of these Markov chains, a Markov chain is imposed on this product space. This chain has an initial probability vector and a one-step transition probability matrix. The combined process evolves through transitions in the elements of the product space. We may assume that these transitions are governed by a semi-Markov rule, with the exception of the sojourn time in each stage|state (
n-tuples) being exponentially distributed based on a parameter with dependence on the current state and the one to be visited next (as governed by the one-step transition probability matrix on the product space Markov chain).
In [
31], the authors examined two distinct queueing models. Model I used a single-server working vacation queueing system in which the arrival and service processes were interdependent. The evolution of the arrival and service processes occurs through transitions within the product space of two separate Markov chains of orders
m and
n, respectively. The transitions in the product spaces of these Markov chains are governed by a semi-Markov rule, with sojourn times in the states governed by an exponential distribution with parameters depending on the state it is in and the state to be visited next, which are determined according to the Markov chain rule. During working vacation periods, service is delivered at a reduced rate. The duration of these vacation intervals follows an exponential distribution with the parameter
. In contrast, the second model is based on independent arrival and service processes following phase-type distributions with representations (
,
T) of order
m and (
,
of order
n, respectively. The service time during normal operation is the phase-type distribution indicated above, whereas that during working vacation is a phase-type distribution with representation (
,
.
In the present paper, we analyse a queueing system with two servers having interdependent arrival and service processes. The evolution of these processes is governed by transitions in the product space of three Markov chains. The transitions in this Markov chain follow a semi-Markov rule and the exponential distribution governs the sojourn times in the states.
1.3. Motivation
In queueing theory literature, many papers have been developed to analyse arrival and service processes which are mutually independent with the exception that service can be rendered only when the server and at least one customer to receive the service are available. A few papers have considered successive inter-arrival times and/or successive service times to be correlated, for example, Markovian or batch Markovian arrival/ service processes. Even fewer papers have considered the interdependence of arrival and service processes. For example, the inter-arrival times
between the
and
n-th customers and the service time
of the
n-th customer were correlated for
by Sengupta [
3,
32]. Van Houdt [
33] subsequently filled certain gaps in the work of Sengupta [
3]. In all these papers, the authors made specific distributional assumptions with respect to the arrival and service times. Thus, the question arises as to whether it is necessary to specify the distributions of the inter-arrival and service times of customers. It is this question that led us to design the presented approach to multi-server queues.
Thus, in this paper we make no assumptions about the inter-arrival and service time distribution. More importantly, no such assumption can be meaningfully made in the context of modelling this interdependence. There are at most three papers [
29,
30,
31] that have appeared in the stochastic modelling literature employing this new approach, and no cases involving more than one server have been considered under this framework in the extant literature. Thus, the methodology presented here is quite novel. This procedure can be extended to queues of a finite number with more than two servers. The dimension increases by one as the number of servers increases.
Markov chains provide the simplest dependence among a sequence of random variables. In the present modelling, we assume that arrival and service rendered by the two servers evolve within the product space of three Markov chains: one each for arrival, service by server 1, and service by server 2. Thus, our approach is completely new, especially in service systems with multiple servers.
1.4. Practical Applications
In the correlated inter-arrival time and service time, we can use the following example: with being the sequence of independent and identically distributed inter-arrival times and the sequence of service times of successive customers, the stress is on the correlation, that is, whether the correlation is positive, negative, or zero. A negative correlation indicates that if the inter-arrival time is large, the corresponding service time turns out to be comparatively small, and vice versa. In the positive correlation case, this indicates a “direct proportion” between the two. In our model of interdependence, quick transitions in the arrival phases indicate faster arrival. Accordingly, the servers can accelerate their service speed or, equivalently, their service rate. In the correlated arrival and service processes indicated above, the n-th customer to arrive receives service at a future time after arrival, as the server can be busy at the customer’s arrival epoch. While the situation in our model is the same, the reflection of the interdependence of arrival and service is seen simultaneously. This helps to accelerate or decelerate the rate of service provided by the servers. These kinds of adjustments can be seen in the production process; if customers arrive at a fast rate, the production needs to be adjusted to ensure that the queue size does not blow up.
In large airports, when non-domestic flights flights land the passengers are required to proceed to the immigration section. A queue of such passengers is formed; at the head of the queue, the person who manages the distribution of passengers directs them to different immigration counters depending on the slots available at each counter. The processing of each passenger passes through distinct stages in a forward direction. With only slight modifications, the model discussed in the paper can be implemented to clear long queues of passengers efficiently.
Another practical application of a two-server queueing model with interdependent arrival and service processes can be found in scenarios involving distributed data processing or computing. In such scenarios, data packets or tasks arrive at a network of servers, and the rate of arrival and processing time can be influenced by the current state of the servers in the network. The arrival rate of data packets may be influenced by the current number of idle or busy servers. For example, if both servers are idle, the arrival rate may increase, as more data processing capacity is available. Conversely, if both servers are busy, the arrival rate might decrease as new data packets wait to be processed. In such a scenario, the service rate for each server can depend on its current workload.
The salient features of the model discussed in this paper are as follows:
We introduce a new approach to analysing two-server queueing systems with interdependent arrival and service processes.
The presented analysis does not involve the Kronecker product/sum, unlike the queueing models with independent arrival and service processes analysed using matrix geometric methods.
Notations and abbreviations used in this paper:
: continuous-time Markov chain.
: level-independent quasi-birth and death.
: column vector of 1s of appropriate order.
: quasi-birth and death.
The remainder of the paper is organized as follows: in
Section 2, the model is mathematically formulated; in
Section 3, we perform a steady-state analysis of the studied queuing model after establishing the stability conditions of the system; in
Section 4, performance measures are computed and presented; finally, numerical illustrations of the two models are discussed in
Section 5.
2. Model Description and Mathematical Formulation
Consider a queueing system with two heterogeneous servers and in which the arrival and service processes are interdependent. When a customer arrives with both servers idle, server provides service. When both servers are occupied, an arriving customer joins an infinite capacity queue. When the system becomes empty, both servers remain idle. The arrival process passes through several stages, including back and forth. An arriving customer who finds a free server enters for service immediately. The service process provided by server has stages, while that provided by server has stages. These proceed in the forward direction. The arrival process has m stages. Three Markov chains govern the arrival and service processes. Service provided by is governed by the Markov chain and service provided by is governed by the Markov chain . The state space of the Markov chain is , while that of the Markov chain is . The arrival process is governed by a Markov chain with state space . We can consider the Markov chain on the product space with state space .
The absorbing states of this Markov chain are . Changes in the first coordinate due to transitions indicate service phase changes provided by server , those in the second coordinate indicate service phase changes provided by server , and those in the third coordinate indicate arrival phase changes. The transitions are interdependent in the sense that the sojourn time in any state depends on this state as well as the one (say, ) to be visited next. This sojourn time is assumed to be exponentially distributed with parameter . Because the transitions are interdependent, within a short interval it is possible for none, one, two, or even all coordinates to change with positive probability when a transition occurs. The state space indicated above is for this general case; however, we assume here that at most one change takes place with positive probability. This assumption leads to an infinitesimal generator which is highly sparse. Let the initial probability vector of the arrival process be , where . We can further simplify the assumption that the service processes for both servers start from stage 1 and move in the forward direction only, that is, the service processes are in the order and , respectively. The initial probability vectors of the service processes of both the servers are and component vectors, respectively. Further, to ensure that arrivals do not occur too quickly, we assume that .
2.1. The QBD Process
The model described in
Section 2 can be studied as a Level-Independent Quasi-Birth–Death (LIQBD) process (see Latouche et al. [
34]). First, we define the following notations. At time
t, let:
be the number of customers in the system.
be the the phase of the service provided by server .
be the phase of the service provided by server .
be the phase of arrival.
Here, is an LIQBD process with state space
= . In the absence of customers, no service can be provided; this is indicated by the * in the position of the service coordinates (the third and fourth coordinates of the 4-tuple). If only one customer is in the system, either or is idle, when is idle it is represented by and when is idle it is represented by , where .
2.2. Transitions
The transitions are described in
Table 1.
The infinitesimal generator of the CTMC is
Here, is a square matrix of order m which contains the transition rates within level 0, is a matrix of order which contains transition rates from level 0 to level 1, is a matrix of order which contains transition rates from level 1 to level 0, is a square matrix of order which contains the transition rates within the level 1, is a matrix of order which contains transition rates from level 1 to level 2, is a matrix of order which contains transition rates from level 2 to level 1, represents transition rates from level n to level for , represents transition rates within the level n for , and represents transition rates from the level n to level for . All of these are square matrices of order .
The matrix
is provided by
which is a square matrix of order
m where
=
;
.
We define the matrix
F as
which is a square matrix of order
m.
Then, can be expressed as .
We write
as
which is a square matrix of order
m.
We define the matrix
as
which is a square matrix of order m.
Then, .
We define
which as a square matrix of order
m, where
as a square matrix of order
m with
;
as a square matrix of order
m, where
and
as a square matrix of order
m with
.
Using these, we can write
Let
where
i varies from 1 to
.
is a square matrix of order
m.
Let
where
j varies from 1 to
.
is a square matrix of order
m.
is a matrix of order .
is a square matrix of order .
With these notations,
takes the form
Let
be a square matrix of order
m for
. Furthermore, let
be a matrix of order
with
.
We define
as a square matrix of order
m for
.
is a square matrix of order .
We define
as a square matrix of order
m with
.
In the above, = for with .
Take
which is a square matrix of order
m with
Let be a square matrix of order with .
Let be a square matrix of order m for
Let be a square matrix of order with .
Using the above notations,
can be written as
Let be a square matrix of order for .
Using these,
can be expressed as
We define
as follows:
where
j varies from 1 to
.
is a square matrix of order
m.
Then, we can write
which is a square matrix of order
with
.
These help us to represent
as follows:
Next, we proceed to compute the stability condition of the system and obtain the system state probability vectors. From each of these vectors, the system occupying a specific state can be computed.
5. Numerical Illustration
In this section, we provide four numerical illustrations of our model.
We take and . The state space of the arrival process is , where the transient states are 1,2 and the absorbing state is 3. The state space of the service process provided by server 1 () is , where the transient states are 1,2 and the absorbing state is 3. The state space of the service process provided by server 2 () is , where the transient states are 1,2,3 and the absorbing state is 4. Then, the state space of the Markov chain = . The absorbing states of are: .
We assume that at most one coordinate change in a transition has a positive probability. In the absence of customers, no service can be provided, as indicated by a * symbol in the position of the service coordinates. Here, = , . The case of more than one coordinate change in a single transition (only up to three being possible in the present case, as we have the service stages provided by each server as first two coordinates and the changes in the arrival phases as the last coordinate) can be considered in a similar fashion. Consequently, we obtain a much less sparse transition rate matrix.
5.1. Illustration 1
Using the transition rates shown in
Table 2 and
Table 3, we obtain the values of the performance measures as follows.
5.2. Illustration 2
Using the transition rates shown in
Table 4 and
Table 5, we obtain the values of the performance measures are as follows.
5.3. Illustration 3
Using the transition rates shown in
Table 6 and
Table 7, we obtain the values of the performance measures as follows.
5.4. Illustration 4
Using the transition rates shown in
Table 8 and
Table 9, we obtain the values of the performance measures as follows.
From the above numerical illustrations, it can be concluded that when the transition rates increase, the values of , , , , and increase, while the values of , , and decrease as the transition rates increases.