1. Introduction
Various elements are enabled in the development progress beyond 5G, including artificial intelligence (AI) and ubiquitous connectivity. Telecommunication intelligence (specifically in 6G networks) refers to the ability of a self-contained ecosystem, the self-awareness of state conditions, and optimal appropriate response reactions [
1]. Since 6G technology delivers extended distributed intelligence and control mechanisms, artificial intelligence (AI) and machine learning (ML) can benefit the control and optimization processes. For example, the radio function in the access layer is efficiently controlled and coordinated using ML. ML enhances the gNodeB (gNB) functions in bearer coordination, interference management, and radio resource allocation [
2,
3,
4].
The progression toward 6G promotes innovation in telecommunication services and functionalities, in which a blueprint comprising potential use cases is set [
5]. This blueprint includes in-depth and new use cases, producing numerous traffic taxonomies in an infrastructure. The blueprint also improves the handling mechanism for traffic classifications in an infrastructure. Generally, a 5G network using the Quality of Flow Index (QFI) method ensures a granular traffic classification. This outcome enhances the Quality of Service (QoS) approach in the previous generation. Nevertheless, the upcoming 6G network is more holistic due to the end-to-end network flexibility. This flexibility enables the network to adapt to traffic behaviors along the entire connection regardless of the terrestrial type [
5].
The increasing number of telecommunication devices is directly proportional to the growing spatial demand for these services. This observation necessitates the provision of viable services to every populated area. Given that network adoption can increase in the forthcoming years, higher transport network variations concerning their selection and deployment can be observed. One example of addressing the less populated areas (rural land areas) is through network adoption by providing offshore services. The 5G Infrastructure Public–Private Partnership (5G PPP) introduced a vision to have an architecture design as a single access network comprising terrestrial networks (TNs) and non-terrestrial networks (NTNs) to advance ubiquitous connectivity [
1]. An example of NTN technology is satellite technology. Satellite ubiquity aptitude demonstrates its presence for quick and accessible options for service provisioning.
The advancement of technology towards 6G is encouraged by diverse use cases and supported by advanced transport networks. Nonetheless, there are missing areas in integrating the heterogeneity of use cases and transport networks that could lead to more efficient infrastructure. This study proposed an intelligent steering mechanism for regrouping the data that shared the same attributes and matched the transport network properties. The data or traffic in this study were reorganized and regrouped into a total number of available transport networks arranged in a particular service area. Each data group was redirected towards optimum-matched transport technologies compared to the conventional traffic-splitting method. For example, in an urban area where the current terrestrial transport network is reaching capacity, expansion works are not time- and operationally feasible; therefore, integrating a non-terrestrial network is the most viable alternative, and steering the traffic to alternative paths is a feasible approach. Therefore, the most viable alternative is integrating an NTN technology and redirecting the traffic to alternative paths. Hence, utilizing existing satellite and cable TV is an option in which these two technologies are dynamically provisioned and readily available [
6,
7,
8].
This paper defines an optimization problem that aims to maximize overall resource utilization by steering pertinent traffic types to appropriate transport networks that resolve unnecessary issues such as traffic delays and congestion. The process assigned relevant traffic clusters to appropriate transport networks, using machine learning (ML) and mixed integer linear programming (MILP) techniques. Subsequently, the resource assignment method was manipulated to determine the optimal assignment between the cluster and the transport network. The objectives of this study are as follows:
- i.
Introduce a traffic steering scheme that optimizes the coexistence of heterogeneous transport networks.
- ii.
Demonstrate the comparative measures for traffic steering using ML and MILP against traffic classifications.
- iii.
Demystify the factors contributing to the cluster-transport assignment.
The remainder of this study is organized as follows:
Section 2 refers to several research studies related to this study.
Section 3 details the foreseeable issues discovered by these studies.
Section 4 discusses the framework for the traffic-to-transport network assignment proposal.
Section 5 highlights the findings and analysis of the observations from the simulation scenario incorporated in the framework. Finally,
Section 6 describes the conclusion and future works.
2. Related Works
The future of the 6G transport network anticipates the possibility of multiple co-existing transport network technologies. Meanwhile, the 5G network optimally utilizes an optical transport network to connect the network elements. Given that the developed optical transport networks have made significant progress in facilitating various use cases, the optical network is extensively used for hyperscale applications in data center network architecture. This process adheres to the strict requirements of Data Center Interconnects (DCIs) [
9]. A study by Ruffini et al. reported a state-of-the-art passive optical network (PON) technology (NG-PON2) [
10]. The study effectively performed the convergence process in a single PON infrastructure. Conversely, Mitra et al. described that the optical network deployment involved significant concerns regarding land acquisition and long deployment time [
11].
There are circumstances where, geographically, it would not be optimal to lay the fiber networks in greenfield or brownfield areas. Hence, the most practical solution to address the increasing service demand is redirecting the traffic to alternative transport network technology (non-terrestrial network options). The software-defined network (SDN) and network function virtualization (NFV) technology also encourages software function developments in a white box, providing significant advantages to distributed nodes with steering (or load balance) and gateway functions to other transport network technologies. A study by Tirmizi et al. denoted the recent works in hybrid satellite and terrestrial fields, which provided twofold values [
12] concerning the workability of the satellite transport network, and also revealed the issues concerning the necessary coordination steps in understanding the respective transport network technology properties and synthesizing the two technologies. These circumstances can inevitably result in the coexistence of heterogeneous transport networks, requiring an effective intelligent mechanism. Therefore, the coverage capability of satellite technology renders it a feasible option for transport network service accessibility. The Data Over Cable Service Interface Specification (DOCSIS) cable infrastructure is another technology option with an extensive footprint in residential areas [
5]. Consequently, this study maximized the three coexistence lists of transport network technology by harnessing the ML ability and the utilization of an optimization solver to yield intelligent mechanisms.
Another study by Tirmizi et al. surveyed the findings from an integration process involving TN and NTN transport technologies [
12]. The study highlighted several challenges in adopting satellite technology, including resource management, the mechanism to offload traffic, and path selection or routing. Considering the satellite network attributes and the obligation to sustain traffic QoS, steering the right traffic to the right transport network was the underlying factor in these challenges. Each stipulated challenge was elaborated by establishing a mechanism for utilizing hybrid networks (satellite and terrestrial) to achieve the most effective configuration. The study also reported that using the traditional approach to handle traffic in the hybrid network or adopting a distinctive resource management approach from these two networks could generate inefficient resource utilization and compromise network performance. Therefore, future network generation should emphasize improved connectivity, including various technologies to support the exponential growth of data, devices, and use cases [
1,
5,
7]. A novel mechanism is needed to maximize and achieve optimal integration in a heterogeneous network infrastructure due to the demands concerning future ideas and the unique attributes of integrating multiple transport network technologies.
Numerous studies have investigated this condition in single-domain and multi-domain transport technologies. Two studies by Giambene et al. and Lubna et al. effectively contributed to the concept of this study [
6,
13]. Lubna et al. studied links connecting two distinctive sender and receiver nodes [
14]. The study computed an optimum path based on various link attributes such as bandwidth, delay, and loss. Subsequently, the optimal link from the computing process decided to schedule the traffic to the link to carry the low latency and high data rate traffic. This study used a similar approach in which the selected transport network acquired specific bandwidth, delay, and error rate values. In contrast, the optimal matching attributes between the traffic and link determined the link or transport selection. An intelligent pre-path selection process was also necessary for the node, comprising various traffic types and terrestrial and transport network integration. This process divided the traffic into respective transport networks, and each transport network type exclusively handled similar traffic patterns within its domain.
Although traffic engineering studies concerning path selections and routing decisions have existed for many years, the incoming technology can slightly modify the current states of 5G and 6G heterogeneous networks, routing strategy, and optimal path selection. Hence, integrating NTNs into the infrastructure is essential to present the network adoption benefits [
15]. The coexistence of TNs and NTNs initiates the exploration of the mechanisms to redirect certain traffic types to designated transport networks [
15]. Moreover, effectively directing traffic to various transport networks requires determining the optimal approach to distribute the traffic load across different transport network types. This necessity occurs as each traffic possesses unique attributes that define its properties. For example, satellite constellations demonstrate quick resolution to infrastructure, service provisioning, and capacity planning constraints. Even though the satellite network is optimal for delivering efficient service coverage, the longer round-trip time constraint cannot be eliminated. Hence, intelligent mechanisms are vital in the infrastructure to comprehensively comprehend the traffic load properties and the transport network attributes. This process determines the most suitable traffic-transport allocation and assignment to optimize end-to-end efficiency.
Several studies have investigated 6G networks across multiple layers. One approach is to pursue the capability to enable a disaggregated infrastructure and achieve an optimal operational state by implementing distributed service provisioning mechanisms [
16]. Therefore, the end-to-end design of the next-generation network must be adapted to meet emerging demands. Numerous studies have implemented AI and traffic engineering methods to cope with the robust requirements supporting future networks. The advancement of ML facilitated the expansion of AI, which became one of its fundamental components. Furthermore, the innovation breakthrough in end-to-end operation and efficiency in the infrastructure has led to numerous enhancements. A study by Tang et al. performed a thorough and chronological survey, analyzing many ML use case types in 5G infrastructure [
17]. The study involved the elements at the network access, routing, congestion control, and application layers.
Introducing newer use cases expands the traffic classifications for 6G networks. A study by Alevizaki et al. developed a distributed control mechanism using evolution game theory [
16]. This mechanism resolved the traffic flow assignment issues from the UE. Hence, future network generation anticipates a highly dynamic state of traffic behavior. The pursuit of minimizing costs and optimizing resources are fundamental principles for ensuring the sustainability of operators or service providers. Likewise, a study by Oughton et al. emphasized a survey principle and listed the financial metrics that led to operators adopting resource-sharing mechanisms [
18].
The diversity of use cases led to the translation to traffic classification, followed by the QoS Flow Index (QFI) method setting up the specific treatment of each packet in the infrastructure. From another perspective, the generated traffic from every UE has a range of characteristics set as a type of use case, which is made based on size and critical properties like latency and error rate. Nevertheless, certain applications (single-use cases) have attributes overlapping with other use cases. For example, virtual reality (VR) is classified as enhanced mobile broadband (eMBB) and ultra-reliable low-latency communication (URLLC) (subject to operator definition). In some scenarios, a casual internet user produces low and non-critical traffic sharing attributes similar to a massive machine type communication (mMTC) use case traffic. These circumstances forage the investigation to demystify the interrelation between user equipment (UEs) in the serving area; hence, this study opted to manipulate the unsupervised machine learning to produce a cluster within it and find the classifications amongst the UEs that shared similar characteristics.
A study by Schotten documented a comprehensive list of prospective use cases following the introduction of 6G networks [
19]. The study revealed multiple novel use cases, such as holographic communication, extended reality (XR), tactile internet, and haptic communication [
20,
21]. These use cases highlighted the extreme infrastructure requirements to deliver the products [
20].
Figure 1 depicts the novel use cases for 6G shared multi-attributes. The hologram and XR use cases were effectively performed under low latency and substantial bandwidth provisions. Meanwhile, tactile internet demonstrated high reliability, security, and availability while occasionally requiring high bandwidth capacity [
19]. Tactile internet also required the cross attributes of capacity, reliability, and low latency connectivity. Notably, Schotten examined and classified the 6G use cases by identifying the overlapping attributes shared by three 5G use cases [
19].
Typically, the current system for splitting traffic employs various approaches, such as the round-robin scheduling method and a split ratio method that proportions traffic to available path options. A study by Lubna et al. applied a multipath scheduler to multiple transport links by classifying traffic types based on low- and high-latency traffic [
14]. The study did not discover any relevant scheduling or splitting traffic mechanism research based on the attributes of specific transport network technology mentioned earlier. An alternative method proposed by Giambene et al. involved splitting the traffic into designated satellite and terrestrial paths based on both link conditions [
6]. This method did not consider the assigned traffic type to the transport. Nonetheless, traffic and transport network conditions require optimal resource management in actual network conditions. Infrastructure-related resource management efficiency and network performance can be increased by directing pertinent traffic to appropriate links, specifically in highly densely populated traffic. Hence, more comprehensive splitting mechanisms are essential to comprehend and assign traffic intelligently to the transport network.
New challenges are observed because various services and use cases require the most efficient operating infrastructure. A study by Goścień demonstrated that traffic engineering leveraged the adaptiveness of a network to traffic behavior [
22]. The study classified the traffic types into four categories, representing real traffic observation from real networks. In addition, the study mitigated the bandwidth-blocking probability and implemented an optimization mechanism to resolve the routing and channel resource assignment. This algorithm computed the resources of a total number of light paths in one of the connecting nodes in the network and measured the channel usage size. The algorithm also determined an optimum state that translated into a relocation scheme to address the bandwidth-blocking probability. The optimization scheme improved the overall traffic assignment or allocation and surpassed the traditional method. Nevertheless, the relocation policy enforced in the study was among the same optical transport network technology types.
A recent growth of interest in the ML application for the traffic engineering sector has been observed. A study by Fan et al. summarized the ML applicability for traffic classification with a high accuracy of classification outcomes [
23]. The study concluded that a more accurate type was critical for assigning and managing traffic for pertinent network applications. Thus, this classification was vital for optimizing resource allocation instead of relying on traditional methods (payload and port-based). Considering that the 6G transport network encompasses heterogeneous technologies, numerous ML techniques have been applied to various network circumstances concerning path and node selection. This process necessitates the efficient management of heterogeneous transport networks.
Another study by Mendoza et al. created an environment containing wired ethernet and satellite links [
24]. The study used this environment to experiment with the end-to-end traffic engineering application in a 5G environment. Better network visibility was presented when SDN was integrated into wired and satellite links, enabling routing computation and path restoration. The SDN in the infrastructure assisted in populating path options and determining the desired traffic path. The study also showcased the SDN in implementing traffic engineering, enabling the selection of the most efficient route for certain traffic types. Meanwhile, the performance of the user datagram protocol (UDP) traffic transport using a satellite link was optimal within the satellite link capacity. Only a minimum traffic amount with a high-bandwidth channel and higher link capacity was permitted in the forwarding transmission control protocol (TCP) traffic path using a satellite link. Consequently, the SDN features for incorporating technology into terrestrial networks were feasible in the study. On the contrary, conducting additional traffic classification research could be advantageous to comprehensively understand resource management in an infrastructure.
Recent studies in integrating satellite–terrestrial networks focused on enabling computing functions in satellite ecosystems. Ref. [
25] focused on enabling double-edge computing functions to reduce processing time when using satellite transport networks. As a result, the study in [
25] computes a value that determines the cache for content delivery and a deterministic value to offload tasks in both resource nodes in terrestrial and satellite. The output from [
25] demonstrates the ability of the network to react based on data on the ground and in space for higher efficiency than conventional architecture. Thus, it enabled the reduction of processing time and delay, which are crucial for future 6G with terrestrial and non-terrestrial heterogeneity. Another study proved the use case of machine learning in Multi-Access Edge Computing (MEC) [
26] and explored the scaling mechanism, creating a dynamic scaling capability based on demand and producing a sufficient number of virtual network function (VNF) nodes to cope and maintain the QoS.
In the traffic engineering framework, the study of [
27] demonstrated the capability to locate the optimal placement of VNF to determine highly efficient service function chaining (SFC) in a network function virtualization (NFV) environment using MILP and approximation solution. The findings from [
27] indicate the optimum method to address strict SFC routing policy, the efficient management of VNF usability, and the identification of an algorithm that works efficiently in specific scenarios. The study in [
28] demonstrated deep reinforcement learning (DRL), a subsection in machine learning frameworks. The authors of [
28] explained the on-demand routing mechanism in wide area networks (WANs) of 6G. The specific DDPG was used to determine the routing action from the study. Ref. [
28] included various factors to improve the routing action by computing the state via the routing agent. The optimal routing path was computed from the reward values learned from the routing agent that collects delay, jitter, and loss ratio. Each parameter was multiplied with weightage values generated from the links’ congestion, thus significantly reducing the delay, jitter, and loss ratio.
Since traffic engineering-based studies contribute to developing a state-aware infrastructure, highlighting Intent-Based Networks (IBNs) is crucial for developing the 6G network. Several ML and SDN NFV-related studies have influenced a few aspects, such as the IBN innovation, the model developed from the learning process, and the dynamic configuration ability from the SDN capability. This outcome enables the infrastructure to possess real-time configuration, dynamic network topology adaptation, and efficient management [
29]. A study by Velasco et al. assessed the ML, ML orchestrator, and pipeline to address the optical communication parameters in enabling network automation and IBN architecture [
30]. Generally, IBNs are pivotal for producing a distributed architecture [
16]. An ML pipeline is necessary at a network edge to make decisions, which involves evaluating the network condition and implementing SDN-based actions. Thus, Velasco et al. classified the automation of optical networks that managed and assigned light paths or channels in optical links regarding traffic behavior [
30]. Notably, the study emphasized network adaptation enablement, where it monitors the network and reacts to maintain the threshold of the network’s performance set by the network operator. Conversely, the study focused on traffic capacity variations and predicted link accommodations by ML and SDN regardless of traffic classification.
Thus, this study complements the numerous methods with a novel approach to splitting traffic based on its classifications, assigning each to appropriate transport network technologies. Subsequently, the applicability of using IBN with an ML pipeline was explored. The ML pipeline produced a traffic classification output, and the MILP generated path selection decisions for the traffic in the heterogeneous transport network. A specific transport network technology type was also selected in this study, and the detailed properties of each transport network technology were applied in the learning steps. The intelligent mechanisms were aligned and feasible with the IBN architecture, and the ML enabled the orchestrator to use a suitable model. Therefore, this process computed a proper configuration for the network.
4. Proposed ML Techniques, MILP Framework, and Methodology
4.1. Network Model
Figure 3 highlights that this study operates within the defined network model, with K numbers of transport network technology (connectivity alternatives between the access layer node and the core node). Even though each transport technology possessed various links to form mesh connectivity in optical and DOCSIS cables, the technology was treated as a single connectivity line. This observation was similar to the inter-satellite links from satellite technology. The transport network to the provided list mentioned in the previous section was scoped using a K value of 3. Meanwhile, the different annotations of arrow colors and directions indicating the downlink (DL) and uplink (UL) traffic represented the traffic classifications. Specifically, the code color represented the traffic types exchanged inside the infrastructure.
Figure 3a,b illustrate the difference in traffic classification treatment between the conventional and the proposed intelligent traffic engineering for 6G architectures.
The traffic classification in the conventional architecture was treated with different priorities and scheduling. Nonetheless, the proposed architecture in this study complemented the approach by adding preliminary traffic segregation works before accessing the transport networks. The post-clustering and assignment process indicates that a group of packets that shares the same attributes is steered to optimal-matched transport network attributes and subsequently undergoes a conventional traffic prioritization process. This proposed architecture mission aimed to maximize end-to-end efficiency by enabling relevant packets to be steered into optimal paths following their respective transport network technologies. Equation (1) describes the calculation of the total volume of packets,
, from the
i-th index of packets that sums up the total traffic volume in an instance,
.
where
Ui(
t) is the packet in the instance, and
umn(
t) represents a packet based on their traffic classification types denoted by m, representing traffic classification types ranging from 1 to 5 and the
n-th index of packets.
T(
t) equals the total number of packets in an instance in the respective m category, where
n ranges from 1 to
N.
T(
t) underwent unsupervised learning, a clustering technique to group the total UE and traffic per instance.
4.2. Clustering Model
The clustering process transformed the packets from T(t) into the aggregated volume of packets within clusters, denoted as Dj(t), where j is the cluster label set to equal the number of transport network technologies equal to three, represented as j ϵ {C#1, C#2, C#3}. Thus, Tj(t) = DC#1(t) + DC#2(t) + DC#3(t). Meanwhile, the series of instances, t, exhibited different Dj(t) values over time based on the dataset and clustering output. Subsequently, a variation of traffic types in a cluster, umn(t,j), determines the value Dj(t). Each Dj(t) was analyzed to produce the total downlink (DL) or uplink (UL) denoted by Vdl/ul, minimum error rate, εmin, and packet delay budget, βmax, values in forming Dj(t):(Vdl/ul, εmin, βmax). Each instance in the format in the clusters, Dj(t), incorporated the total sums of packets handled by the transport network. Given that the number of transport network technologies is K, λk is the respective denotation of K transport network technology. Moreover, a similar format, λk:(VT-dl/ul, εT-min, βT-max), represents the transport network technology, where VT-dl/ul is the capacity of the transport network, εT-min is the error rate, and βT-max is the packet trip time.
4.3. Clustering Process
This study examined the functionality of unsupervised ML under clustering algorithms. The clustering algorithms were conducted with unlabeled data, in which the attributes of each data point were analyzed, and the similarity in a cluster was exhibited. An emulator that emulated the generated inbound and outbound data transacting from UE to gNB and the core (including vice versa) was used to derive the data for this study. These data included the Reference Signal Received Power (RSRP), DL, UL, and QFI values. A clustering algorithm was employed to learn from each data point and generate a vector value, Dj(t), representing a point’s attributes or properties. Subsequently, the process combined all points with similar attributes into clusters.
A different perspective on the resource assignment was also obtained based on different input types, and this study performed traffic clustering based on the use cases of the traffic. Three use cases were considered in this study: eMBB, URLLC, and mMTC. These use cases were represented with Fi(t), where i = [eMBB, URLLC, mMTC]. The datasets were classified into three categories, resulting in a format that resembled the previous cluster format, Fi(t):(Vdl/ul, εmin, βmax). This process asserted uniform traffic types within the group, contradicting the clustering process performed using the ML clustering technique. The clustering process then had to undergo another unsupervised learning technique for detailed execution.
The dimensionality reduction algorithm produced two-dimensional data from the multivariate data. Before clustering, this process reshaped the data into a vector value using the t-distributed stochastic neighboring embedding (T-SNE) method. The data were filtered based on the referenced signal receive power (RSRP) values to exclude the farthest UE from the serving radio unit (RU). Only the UE inside the serving area was considered in the subsequent stages (step 3 in
Figure 4). Finally, the filtered list of downlink and uplink values with QoS Flow ID (QFI) underwent the T-SNE transformation process to create an unlabeled dataset before the data execution for the clustering process (step 4 in
Figure 4). The multivariate data from the prior processes underwent initial steps of scaling and transformation to produce a standardized scale to enable the balance dominance of every dataset field before applying T-SNE to reduce the multivariate dataset into a single unlabeled value.
A clustering process was performed using Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) to identify patterns from the T-SNE transformation output. The clustering process identified the attributes of each point and grouped them into three clusters:
Tj(
t) =
DC#1(
t) +
D C#2(
t) +
DC#3(
t). Each point (represented as UE) was assigned a cluster label (C#1, C#2, and C#3) after the clustering procedure was completed, indicating the group of each point (UE) (steps 5, 6, and 7 in
Figure 4).
Subsequently, each label on a UE was stored and recorded for future data training and learning processes. The post-clustering process then shifted to the resource assignment stages. Steps 8 and 9 were the preprocessing stages before the resource assignment step. During this stage, the data were collected and converted into a matrix format. The transport network data comprised capacity information, round trip time, and packet loss rate, representing a transport network attribute value in a particular instance, λk:(Vdl/ul, εmin, βmax).
4.4. Matching Process
The MILP was employed to determine the optimal resource assignment based on the cluster transport network assignment. Each value from the UE cluster and the transport network was checked similarly during the resource assignment process. Step 9 in
Figure 4 implies the iteration process of the similarity check between each cluster and the transport network properties or attributes.
xij is the decision variable where the value of ‘
i’ represents the clusters assigned to the available ‘
j’ transport network as resources. Hence,
xij 1 to determine the output of the cluster transport assignment.
xij ∈ {0,1},
i = the transport networks,
j = the clusters
This study evaluated the assignment output by matching clusters with transport networks with qualities compatible with the cluster. Similarly, the same processes were applied to examine the corresponding matching assignment on the clusters formed by various UE types (see
Figure 5). Consequently, three significant clusters were created from the eMBB, URLLC, and mMTC UE types, and the same matching assignment output using MILP was evaluated. The process also considered clustering each UE type in an instance. Furthermore, the total DL or UL size, maximum packet delay budget values, and minimum packet loss rate of the cluster were recorded and represented as cluster attributes. Lastly, similarity checks were performed between the attributes, clusters, and transport networks.