1. Introduction
Network slicing is an essential functionality of future fifth-generation (5G) New Radio (NR) cellular systems [
1,
2]. It can allow resources to be allocated and isolated for different traffic classes or mobile virtual network operators (MVNOs) on the same physical network infrastructure, e.g., mobile physical network operators (MPNOs) [
3]. In both cases, traffic aggregates may have different quality of service (QoS) requirements in terms of throughput, latency, and drop probability as stated in the International Telecommunications Union Radiocommunication Sector (ITU-R) M.2410 [
4].
QoS provisioning to traffic slices needs to be provided in an end-to-end way [
5,
6]. The critical part is the Radio Access Network (RAN), where time-varying radio channel conditions may deteriorate statically provisioned performance guarantees [
7,
8]. In addition, according to the unified opinion of the 3rd Generation Partnership Project (3GPP), ITU-R, and the Groupe Speciale Mobile Association (GSMA) [
9,
10], the utilized slicing scheme should satisfy two contradictory requirements—traffic isolation and efficient usage of radio resources. The former requirement can be implemented by utilizing static or dynamic resource allocation policies [
11,
12], while the latter can be implemented via traffic prioritization schemes [
13,
14]. The joint dynamics of schemes accounting for both types of requirements is rather complex. To this end, mathematical models are needed to understand the trade-offs between user-centric key performance indicators (KPI), such as session loss and pre-emption probabilities, and system-centric ones, such as the efficiency of system resource utilization.
1.1. Related Work
There have been a number of studies addressing the question of network slicing at the air interface. Specifically, Ref. [
15] focuses on two key features of slicing: traffic isolation and automated management. In doing so, a pre-emption-based prioritization (PP) scheme, ”merging” the resource allocation and traffic prioritization schemes, is proposed. To evaluate or estimate the so-called PP scheme, a queuing system model analyzing the functioning of a single base station (BS) accommodating multiple services with different QoS requirements is given. Concretely, this paper considers each service-oriented slice to be assigned an overall share of radio resources, including guaranteed ones utilized by neighboring slices. As one key result, the proposed PP scheme can achieve 100% gain in terms of blocking probabilities with respect to a predefined baseline. In addition, Ref. [
11] investigates the following features of slicing: flexible priority-based traffic isolation, fair QoS-aware resource allocation, and efficient usage of radio resources. The authors proposed a slicing scheme bolstering the traffic isolation and maintaining the efficient usage of radio resources, and represented it by utilizing a queuing model with three 5G services having uniform data rate requirements at one BS. This slicing scheme takes advantage of the complete partitioning and complete sharing policies’ key features. In practice, this paper considers the users of each service-oriented slice to be ensured a minimum data rate, with the possibility to achieve higher data rates whenever radio resources are free. The proposed slicing scheme can achieve 90% gain in terms of average user satisfaction index, and reduce the session drop probability by an order of magnitude dependent on the baseline.
Authors in [
13] studied the industrial deployment of 5G with simultaneous support of enhanced mobile broadband (eMBB) and ultra-reliable low-latency communication (URLLC) services. The effectiveness of coexistence strategies is achieved via explicit prioritization. In addition, the authors consider service strategies in which URLLC traffic can be offloaded onto device-to-device (D2D) connections with and without explicit reservation of a fraction of resources for direct connections. As a result, the authors concluded that a D2D-aware strategy, in which the BS explicitly reserves resources for direct connection, is significantly superior to strategies in which no explicit reservation is used, as well as the strategy without support for D2D connections.
Recently, studies suggesting the use of machine learning (ML) for network slicing have started to appear. Specifically, Ref. [
16] utilizes deep reinforcement learning (DRL) technology and proposes a hybrid hard–soft slicing framework, guaranteeing service level agreements (SLAs) and maximizing the spectrum efficiency given some isolation constraints. Technically, the paper considers the users of each service-oriented slice to be able to utilize the radio resources of a newly configured service-oriented slice neighboring existing ones. As the main result, the SLAs can be guaranteed all the time with the proposed hybrid slicing able to achieve near-optimal performance in terms of SLA satisfaction ratio, isolation degree, and spectrum efficiency. As the main drawback, the proposed slicing cannot satisfy mixed SLAs such as latency and reliability. The authors in [
17] investigate the impact of traffic in one slice on QoS parameters experienced by the traffic in another slice. They develop a data-driven slicing and allocation model by using ML algorithms, where resources between network slices are intelligently redistributed in accordance with prescribed QoS parameters. The study in [
18] considers an experimental 5G network prototype with the ability to configure radio resources for network slices using ML solutions based on real-time performance metrics. The obtained results confirm that the ML-based approaches outperform the traditional ones and improve the utilization of resources while guaranteeing the QoS parameters.
1.2. Contributions
In this paper, by utilizing the tools of the queuing theory, we formulate and solve a general slicing RAN problem for K traffic aggregates utilizing both resource reservation and pre-emptive-priority service discipline. In the considered system, each slice is assigned a certain dedicated share of radio resources such that , where C is the overall amount of radio resources. The shared pool of resources, is regulated by the pre-emptive priority service procedure to improve the degree of statistical multiplexing. The system is then solved for user- and system-centric KPI, session drop/pre-emption probability as well as system resource utilization, by utilizing the queuing-theoretic formalism. The proposed system allows to satisfy inherently contradictory requirements of having a strong degree of isolation between traffic classes and high efficiency of resource usage. The performance of the proposed system is demonstrated by utilizing a three-class slicing scheme serving an ultra-reliable low-latency service, an enhanced mobile broadband service, and a massive machine-type communication (mMTC) service at the same radio interface.
The main contributions of our study are the following:
Mathematical framework for mixed reservation- and priority-based network slicing at the radio interface along with parameterization by accounting for wireless channel specifics that include numerous special traffic isolation strategies such as full reservation and priority;
Observation that the mixed reservation- and priority-based strategy allows extremely high resource utilization to be maintained, approaching 95% in overloaded system conditions, while still providing a strong degree of traffic isolation;
Observations that short-lived lightweight low-priority traffic, such as mMTC, is best suited for the mixed reservation–priority strategy while elastic eMBB traffic benefits more from the full reservation strategy in terms of both drop and pre-emption probabilities.
The rest of the paper is organized as follows. First, in
Section 2 we introduce our system model. The mathematical model is formalized and solved for performance metrics of interest in
Section 3. Numerical results of the coexistence of three services, URLLC, eMBB, and mMTC, are presented in
Section 4. Conclusions are drawn in the last section.
3. Mathematical Model
The behavior of the queueing system specified in
Section 2 can be described by the
K-dimensional continuous-time Markov chain (CTMC)
, where
captures the number of
k-type requests in the system at time
t. Denote
as the maximum number of
k-type sessions in service, then
,
, is the number of
k-type sessions that are currently in the system,
. Additionally, let us denote
as the maximum guaranteed number of
k-type sessions in service. The considered process is defined over the following state space
Due to the partially elastic nature of the considered sessions, the amount of resources,
,
, available to the
k-type sessions is equally distributed between them and depends on the state
. That is, we have
3.1. Model without Pre-Emption
Let us start with a partial sharing strategy with a non-pre-emptive priority. In the state space
we can identify three important sets, namely
,
, and
,
. The former set,
,
, called the “accepting” set, contains all the states in which sessions are accepted to the system and is provided by
The second set of states
,
, called the “loss” set, is a set of system states in which arriving sessions are dropped,
Finally, in the accepting set
, we can select a subset
, a set of system states in which the arriving sessions will be accepted to the system using
PRBs, i.e.,
Consider now the process of session admission to the system. Particularly, when a new session arrives to the system, the following may happen:
If (i) upon arrival of the new session of type k there are more than , PRBs available, and (ii) the current amount of sessions in service is smaller than , this session is accepted at the system;
In any other case, the session is rejected.
Denote by
the steady-state probability distribution of the CTMC
. Since the considered stochastic process
is reversible, by solving the system of local balance equations,
recursively, we obtain the steady-state probability distribution of the system in the product-form (
14), that is,
where
,
.
Once the steady-state distribution vector is found, we may proceed to determine the performance measures of the system:
Drop probability of
k-type session,
, is given by
Average amount of resources occupied by
i-type sessions,
, is given by
The fraction of utilized resources,
U, is provided as
3.2. Model with Pre-Emption
Consider now the system with the pre-emptive priority service. The behavior of this queuing system is similar to the previous one in the sense that the considered process is defined over the same state space
as in (
8). We assume that the services have different priorities: the highest priority is for the first type of services, the lowest is for
K-type of services. Priority service is implemented in such a way that in the case of insufficient resources in the system to provide the
k-type service with a minimum requirement, the service of one or more lower priority sessions in the shared pool of resources
c could be pre-empted.
Figure 2 illustrates the algorithm for selecting sessions that could be pre-empted when a new session arrives to the system. We assume that
—the numbers of sessions that could be pre-empted. Given the initial condition
, consider the steps of the algorithm.
Step 1. Check if there are enough resources for arriving at j-type session. If the condition is met, the session is accepted. If the condition is not met, go to Step 2.
Step 2. Check if the service of the i-type session in the shared pool c can be pre-empted. If the condition is met, increase by one and go to Step 1. If the condition is not met, go to Step 3.
Step 3. Check if there are sessions in the system with a higher priority . If the condition is met, go to Step 2 with . If the condition is not met, the session is rejected.
Similarly to the previous model, we also define accepting, loss, and accepting with the maximum number of PRB sets of system states, as
,
, and
,
, respectively. The first set,
, is defined as
the second set,
, is provided by
and the last set,
, is defined in (
12).
For pre-emptive service, the system of these sets should also be complemented with the pre-emption set of states,
,
, where a
k-type session is accepted to the system, causing the pre-emption of lower priority sessions. This set is given by
Consider now the process of session admission at the system. Particularly, when a new session arrives to the system, the following may happen:
If (i) upon arrival of the new k-type session there are more than , PRBs available, and (ii) the current amount of k-type sessions in service is smaller than , this session is accepted to the system;
If (i) the arriving k-type session, , observes less than free PRBs in the system, and (ii) the current amount of k-type sessions in service is smaller than , and (iii) there are more than PRBs occupied by lower priority sessions in the shared pool of resources, c, the session is admitted to the system causing the pre-emption of lower priority sessions;
In any other case, the session is rejected.
By utilizing the rules specified above, we can fully characterize the stochastic process
describing the service of URLLC and eMBB sessions with the partial reservation strategy and pre-emptive priority. The arbitrarily chosen “central” state of the process is shown in
Figure 3.
As opposed to the non-pre-emptive system, the CTMC
is non-reversible, implying that no product-form solution is available for this system. In this case, the steady-state distribution can be estimated numerically by solving the following set of linear equations:
where
A is an infinitesimal generator having elements
defined in (
21), with the shorthand notation ∗ provided by
Once the steady-state distribution vector is obtained, one may proceed with the performance measures of the considered system that can be expressed as follows:
Drop probability of
k-type session,
, is given by
The pre-emption probability,
, that is, the probability that arbitrarily chosen sessions are dropped during ongoing service when
k-type session is accepted to the system
Note that
,
U are defined in (
16) and (
17), respectively.
4. Numerical Results
In this section, we proceed to provide an illustrative example of the proposed mixed reservation- and priority-based traffic coexistence strategy. Specifically, we consider three services that have to be supported by the 5G NR air interface—URLLC, eMBB, and mMTC. According to the 5G International Mobile Telecommunications-2020 (IMT-2020) requirements specified in [
4] and network slicing recommendations in 3GPP Technical Specification (TS) 23.501 and TS 38.300, URLLC should receive the highest priority and be served as there are absolutely no other types of traffic in the system reaching the drop probability of
. On the other hand, mMTC requirements are the loosest out of all these three types of services, requiring that no more than 1% of traffic is not delivered within 10 s. The eMBB service is characterized by the balanced requirements. Thus, in our numerical example, we assign the priorities accordingly.
The default system parameters utilized to produce the reported results in this section are shown in
Table 1. Note that here, we explicitly compare two slice isolation schemes: (i) full reservation, where the whole set of resources,
PRBs, is equally divided between the slices, and (ii) partial reservation with priorities, where a set of resources is allocated to the shared pool. Note that in both cases, URLLC traffic is well isolated from the rest of the traffic and, thus, we will only consider performance degradation experienced by eMBB and mMTC.
We start with
Figure 4 with the two critical metrics of interest, eMBB and mMTC session drop,
,
, and pre-emption probabilities,
,
, for two considered schemes as a function of the offered traffic load of URLLC,
. Note that the full reservation scheme is indicated by
PRBs, while the mixed reservation- and priority-based is indicated by
. In the latter case, there is the shared pool of nine PRBs that are shared between all types of traffic. By analyzing the session drop probability demonstrated in
Figure 4a, one may observe that the mixed reservation- and priority-based mechanism outperforms full reservation in terms of the mMTC drop probability for low and moderate values of the
. For both eMBB and mMTC traffic, the system is characterized by two regimes: (i) up until approximately
, the mixed scheme performs better, (ii) after
, the full reservation scheme outperforms the mixed one. The rationale is that prior to
both eMBB and mMTC traffic efficiently utilize the shared pool of resources available. However, when the offered traffic load of highest priority URLLC sessions further increases, they start to occupy the shared pool of resources, aggressively leading to the performance loss of eMBB and mMTC traffic.
The price one has to pay for better session drop probability in the mixed reservation- and priority-based scheme is higher session pre-emption probability demonstrated in
Figure 4b for the same system parameters. Here, we see that the session pre-emption probability is lower for the full reservation scheme across almost the whole range of the URLLC-offered traffic load
. Notably, the mMTC traffic suffers less with the corresponding probabilities being quite close to each other. The rationale here is that short-lived lightweight traffic is better suited for priority service due to fine granularity in terms of the number of sessions that need to be interrupted to accommodate the arriving higher priority session. However, the elastic eMBB traffic suffers most with the difference reaching three times at approximately
. Note that the pre-emption of a rare but long eMBB session has much higher negative impact compared to mMTC sessions, thus making the full reservation scheme better suited for the former type of traffic.
We finally proceed with the resource utilization of the system demonstrated in
Figure 5 for the same set of input parameters as in
Figure 4, as a function of URLLC offered traffic load,
, and eMBB maximum requested rate,
. By analyzing the results presented in
Figure 5a, we observe that across the whole range of
the mixed strategy shows consistently better results by approximately 10%. Notably, in overloaded conditions with a certain level of isolation supported, the resource utilization is just 5% off from 100%. In fact, this is the best side of the considered mixed strategy. Qualitatively similar results are also observed in
Figure 5b, where resource utilization of the system as a function of the eMBB maximum requested rate
is shown. Here, we might observe that as the system starts to be overloaded, the mixed scheme allows 95% of utilization to be reached, while the full reservation schemes remain at approximately 80%, resulting in slightly higher gains of around 15%.
5. Conclusions
In this paper, motivated by the need for simple mathematical models for RAT network slicing, we proposed and solved the model for mixed reservation- and priority-based traffic coexistence. The proposed framework aims to provide the balance between traffic isolation and efficient use of resources and also includes numerous service strategies as its special cases including full reservation, priority-only strategies, as well as mixed strategy without pre-emption. The system is formulated and solved by utilizing the tools of the queuing theory. We also demonstrated how the input parameters can be related to the wireless channel specifics including antennas and propagation models.
Our numerical results demonstrate that the proposed mixed reservation- and priority-based strategy allows resource utilization to be improved up to 95% while still providing isolation between traffic types in highly overloaded conditions. Compared to the full reservation strategy, the gains are in the range 10–15% across a wide range of system parameters. The mixed scheme is also better in terms of session drop probabilities for low and moderate offered traffic load of the highest priority sessions. However, elastic eMBB traffic benefits more from the full reservation strategy in terms of both drop and pre-emption probabilities.
One of the important advantages of the proposed approach is the simplicity of implementation, since for one of the models the steady-state probability distribution can be obtained by solving the system of local balance equations in a product form. However, for the model with pre-emption, the product-form solution is not available, so the steady-state distribution can only be estimated numerically by solving a system of linear equations. In addition, we considered 5G NR, which may operate in a mmWave or sub-6 GHz band. The proposed model is general and does not depend on the type of propagation environment and operational frequency.
In our future studies, the performance of the proposed algorithm on specific traffic sets and per-slice QoS/QoE (quality of Eexperience) will be evaluated. Moreover, propagation models that are closer to reality will also be utilized.