1. Introduction
Devices are utilizing more wireless networks due to the rising deployment of the Internet of Things (IoT). For the IoT to maintain its full connectivity potential, a communication infrastructure that will serve as a backbone network is required. Visible light communication (VLC) offers an attractive and economical communication platform for IoT systems and has rapidly evolved as a potential alternative to the congested traditional radio frequency (RF) since it uses low-cost light-emitting diodes and photodiodes for both illumination and communication [
1]. Hence, the VLC system saves the extra power that is required in the traditional RF communication system. The VLC system occupies a spectrum between 380 nm to 750 nm, corresponding to a frequency spectrum of 430 THz to 790 THz [
2]. The VLC system is encouraging for several applications and has been applied in different environments such as indoor [
3,
4], outdoor [
5], underwater [
6], and underground [
7] scenarios since it is capable of providing high data rates with limited security issues compared to the RF system with lower bandwidth and security challenges.
For the indoor VLC system, a directed LoS link is the simplest configuration, where the channel of transmission is defined by a highly directed angle of beaming of the transmitter and a narrow FoV of the receiver. The directed LoS links do not interfere with noise from ambient light sources since a narrow FoV is usually used as the transmitter and are not prone to multipath propagation effects, therefore exhibiting a high transmission rate. However, the LoS links offer limited channel coverage area, making it difficult to provide extended coverage and mobility in the indoor environment. This is due to the need for the alignment between the transmitter and receiver. More so, the links experience blockages caused by obstructions from people’s movement and presence of nearby obstacles. As a result, there is a shadowing effect, which reduces the received signal strength (RSS) in the shadowed region and lowers the quality of communication. Moreover, blockage and shadowing issues in the indoor VLC system can be alleviated using a non-directed LoS link or a dispersed topology [
8]. Unlike directed LoS links, these configurations employ wide beam transmitters, wide FoV receivers, and surface dispersions to expand the coverage region and achieve mobility in the indoor environment. The non-directed LoS link depends on several reflections from object surfaces, allowing a PD to detect considerable amount of the transmitted light from various directions. On the other hand, diffused design does not involve establishing LoS link (non-directed NLoS) or aligning the wide beam transmitter with the receiver since the emitted light beams are reflected from several surfaces. Nevertheless, non-directed LoS and diffused configuration utilize greater transmit power and are more susceptible to multipath propagation effects such as multipaths-induced dispersions, occurring in the form of ISI. Consequently, degrading the achievable data rate of the links.
Some articles, such as [
9,
10], have focused on environmental modeling and theoretical analysis of the shadowing effect using different scenarios. Moreover, integrating the VLC system with the RF system helps to avoid service disruption and ensure continuous transmission while maintaining a high data rate in the VLC system [
10]. However, a major issue in a hybrid VLC–RF system is the decision to perform either a horizontal handover (in a homogeneous system) or vertical handover (VHO) (in a heterogeneous system). This is because a user device is able to switch from an AP with reduced coverage and/or degraded channel quality to another with stronger connection. For VHO decisions, a PVHO algorithm based on two fundamental VHO schemes, IVHO and DVHO for hybrid VLC and LTE systems, was presented in [
11]. In the IVHO scheme, MTs switch immediately during channel disruption while the DVHO scheme sets fixed dwelling time as the duration of short interruption to avoid ping-pong effects. As an improvement to the static DVHO scheme, [
12] introduced a CAD-VHO scheme for the VLC-RF system. Note that excessive switching between the APs may result in increased signaling overhead and latency.
As discussed in [
13], MT activities are key contributors to VLC disruption, which often result from user mobility and follow predictable patterns. These patterns can be modeled by parameters such as RSS, location, time of day, and duration. In addition, the duration of the VLC–LoS link blockage is usually for a short period of time, requiring minimal handover delay time before service is restored. Hence, it is possible to model and make VLC link interruption prediction using the MT activity record. The use of machine learning in the handover decision process was investigated in [
14,
15,
16,
17]. In [
14], a dynamic coefficient training through ANN was adopted to provide reliable communication for group mobility and improve the handover decision process. More so, an LVQNNs technique for maintaining steady communication based on the RSS indicator, data rate criterion, service cost, and statistics of the MT’s speed was suggested in [
15]. The authors in [
16] presented a context-aware VHO in a heterogeneous WiMax and WiFi system. The Whale optimization algorithm was implemented with the neural network to ensure effective handover prediction using the RSS indicators. In [
17], a VHO algorithm was presented for heterogeneous wireless networks based on multi-attributes and NN. The download rate of the integrated network was utilized as a prediction target, while the appropriate wireless network for VHO decision was selected by the NN algorithm. This method presented a tradeoff between the handover success rate and the number of wireless networks in the integrated system. In addition, fuzzy prediction is a well-known intelligent method for enhancing VHO decisions in the heterogeneous networks. This approach depends on mobility prediction to compute the declination ratio of expected signal output to the actual signal output. A fuzzy controlled VHO system was developed for heterogeneous networks in [
18]. The authors utilized a multi-criteria approach to address network selection issues. In addition, the fuzzy-logic-based technique was utilized to obtain seamless VHO in [
19]. The handover technique reduced ping-pong effects while improving the overall performance for mobile users in the network. Similarly, a vertical handover decision algorithm was proposed for next-generation networks using a fuzzy predictive model in [
20]. The scheme utilized a multi-layer feed forward network to determine the user’s next position and a multiple-attribute access network selection function to obtain the most likely access network.
Furthermore, hidden Markov model (HMM)-based HHP schemes for user mobility in wireless networks were presented in [
21,
22,
23]. These schemes were developed to provide horizontal handover between two or more APs in the same network. The authors in [
21] employed the HMM to predict RSS values from observed users’ movement data. A fuzzy controller module was further utilized to classify the handover decision before deciding whether or not to execute the handover. In addition, an approach to improve the handover decision in femtocell networks based on HMM was suggested in [
22]. The HMM predicted the next AP position based on the geographical locations of the previous and current users. Similarly, [
23] utilized the HMM scheme to predict service eNodeB in the long-term evolution (LTE) network according to user mobility for improved communication. The handover approach relied on trajectory features to trigger handovers between two non-adjacent APs. The model exploited a large number of hidden states, leading to more inaccurate sequences and increased computational complexity.
Moreover, other studies have considered formulating heterogeneous handover prediction as MDP problems. In [
24], a VHO scheme for heterogeneous VLC and WiFi networks was formulated as an MDP problem. The technique adopted a dynamic approach to achieve a trade-off between the cost of switching and the need for delay. The scheme made VHO decisions according to the length of the queue and the state of the wireless channel. A VHO scheme to enhance the user’s QoE in an indoor VLC environment was investigated in [
25]. Likewise, the scheme was formulated as an MDP problem that maximizes the QoE of users and reduces handover delay. The decision to perform VHO was based on the current location of the user and the mode of transmission, which could either be a VLC or an RF channel. In [
26], a VHO technique for an indoor hybrid VLC–Femto system was considered based on candidate decisions. The technique utilized AHP and CG to manage an MADM framework and for matching different types of traffic. The rationality of each decision was computed by combining the results from AHP and CG to select the one with a higher degree. A larger number of hidden states was exploited by the models and thus leading to extra inaccurate sequences and increased computational complexity.
In this article, an automated vertical handover prediction algorithm for a heterogeneous VLC–WiFi system is developed to improve communication in the indoor environment. The HMM is employed as a tool to model the disruption (OFF) and continuation (ON) of the VLC–LoS link. The proposed VHO utilizes the RSS dataset of the VLC and WiFi, containing RSS values that correspond to the MTs’ location. RSS values are time-varying due to the blockage in the VLC–LoS download link and the instability of the WiFi system. Unlike other existing HMM methods, effective RSS values are obtained from the huge RSS datasets using a PCA method, thus reducing the computational complexity of the HMM. The extracted features are then restructured into window sizes at different locations, which are employed with the HMM to maximize the handover decision-making process. The results of this paper are unique since the proposed VHO reduces excessive switching in a short period of time known as the ping-pong effect and yields an enhanced dwell time compared to existing state-of-the-art VHO schemes.
The remainder of this article is arranged in the following manner.
Section 2 discusses the system model, method of data collection and feature extraction and basic HMM principles.
Section 3 demonstrates the design of the dual HMMs for predicting the next RSS value and optimizing handover decisions.
Section 4 presents performance evaluation of the proposed VHO scheme based on HMM. The Section evaluates the HMM prediction accuracy, the number of handovers in the system, and the dwell time.
Section 5 concludes the article.
2. Methodology
2.1. VLC Network Model
In this article, a direct VLC–LoS propagation link is modeled due to its power efficiency dominance over the NLoS reflected links. The VLC channel quality relies on LoS transmission and cannot guarantee continuous transmission when there is a blockage and/or the MT is not in the VLC range. We consider the VLC system in an indoor environment with LED bulbs (transmitter), having a mixture of 475 THz, 535 THz, and 638 THz. Each LED, having 30 V peak voltage and 300 mA driving current is placed at the ceiling center in a 3 m × 3 m space. The VLC receiver is a low-cost (S5971) Silicon PIN PD, which offers good temperature tolerance, fast response, and wideband characteristics at a low bias. The LED is situated at m above the upward facing PD attached to a user’s device.
Suppose a flat frequency response around the DC transmitter and receiver. The DC gain of the LoS channel is computed as [
8]:
where
is the Euclidean distance between the AP and
r-th user,
represents the Lambertian radiant intensity,
is the radiation angle related to the axis normal to the VLC AP plane, and
represents the incidence angle with reference to the axis normal to the PD. Note that the PD’s FoV =
is determined by the maximum incident angle. More so,
A is the PD’s physical area,
denotes the order of Lambertian emission, while
and
are the optical filter and concentrator gains, respectively. Consequently, the relationship between the LED-transmitted power (
) and optical power of the receiver (
) is obtained as [
8]:
In addition to the indoor LoS channel model, optical channel noise is considered in the network model. Optical background noise sources in the indoor VLC systems include ambient light noise, signal and ambient light induced noise in the PD, and thermal noise produced by the TIA [
8]. In this study, the VLC channel is modeled as a linear optical AWGN channel. The PD current
I is given as [
27]:
where
denotes the photo sensitivity of the PD, ⊗ symbol represents convolution, and
N is the noise, which is the combination of the shot noise and thermal noise, that is:
The signal and ambient light induced shot noise variance is given by [
13]:
where
q is the electronic charge,
R represents the PD responsivity,
denotes the received noise power,
defines the received optical power, and
B indicates the receiver bandwidth. On the other hand, the thermal noise variance is expressed as [
13]:
where
K denotes the Boltzmann’s constant,
T represnts the absolute temperature, and
defines the TIA feedback resistance.
Table 1 shows the VLC system parameters used in this study. Similar to [
13], a value of
is allocated to both the direction of the LED, defined by a semiangle of the power,
, and the direction of the PD, specified by its FoV. In addition, constant values are assigned to
and
to avoid their dependence on the incidence angle. A fluorescent lamp that operates at 50 Hz is used to represent the artificial ambient light noise, while a visible passband filter with 660 nm wavelength is placed in front of the PD to prevent other wavelenghts. The PD functions at a wavelength of 660 nm and converts measured light to electrical current at a cutoff frequency of 100 MHz. Furthermore, the low noise TIA (AD8015) amplifies the input current from the PD to an output voltage signal. Other undesired noise components of the transmitted optical signal are filtered using an electrical low pass Bessel filter. The demodulator is used to recover the original transmitted data.
2.2. VLC–WiFi Network Model
We consider an indoor environment consisting of a hybrid VLC–WiFi system.
Figure 1 shows a typical indoor scenario where one VLC AP is placed on the ceiling in each room and a single WiFi AP (always on) is deployed in a corner of the rooms. The VLC has low coverage of about 2–3 m but provides high bandwidth. However, the WiFi provides lower bandwidth and covers a wider area (overlapping the VLC AP) in the apartment. Suppose several users holding multi-mode MTs randomly move around in the apartment. A vertical handover is expected to be prompted whenever there is a blockage or disruption in the VLC link of the MTs.
This study focuses on the downlink transmission in the heterogeneous VLC–WiFi network. This implies that VLC is used as the preferred downlink since it has higher bandwidth. Moreover, the WiFi network is designated for uplink transmission and can also serve as a backup if the VLC–LoS downlink channel is blocked or when the VLC is out of range. This is because the WiFi provides a wider coverage and can penetrate opaque objects.
In a real-world scenario, the location of both the VLC and WiFi APs can be irregular. However, for the purpose of simplicity and to avoid interference among the VLC APs, we suggest the arrangement shown in
Figure 1. A CSMA/CA [
28] is used to prevent interference between WiFi APs. More so, it is essential to allocate resources efficiently in a real-time VLC–WiFi system, especially when the receiving system is saturated. Moreover, it is assumed in this study that data can be delayed if there is an interruption or/and shadowing during VLC–LoS downlink transmission. The delayed data will be re-sent when the VLC channel becomes available. Otherwise, communication can proceed over the available WiFi network. As a result, an automated VHO prediction scheme is introduced to effectively manage the decision-making process in the hybrid system. This is accomplished by minimizing both switching costs and optimizing the dwell time of the VLC–WiFi system.
2.3. Mobility and Blockage Model
Similar to [
21], RSS values were measured by an MT moving in the indoor apartment at a speed between 1 m/s and
m/s. The measurements were collected at a sampling rate of 100 Hz for a duration of 500 s from 6 history locations over short distances. The user mobility was simulated using the random waypoint model [
29], which ensures that users travel in a straight line from one waypoint to another and that the waypoints are chosen at random in the space. Note that the measured RSS values are directly related to the distances between the MT and the APs. This implies that the closer an MT is to the AP, the stronger the RSS values and vice versa. Strong RSS values were produced when the MT was within 1 m of the VLC AP and there was no blockage on the LoS link. On the other hand, the weak RSS values were obtained when the distance between the MT and the VLC AP was greater than 2 m and the VLC–LoS connection was blocked. This implies that the RSS values are time-varying, and the dimension of the datasets determines the computational complexity of the HMM.
VLC–LoS blockages, such as human crossing, can be modeled using the occurrence and occupation rates over time [
10]. The occurrence rate signifies the total number of blockages that occur over a sampling interval, while the occupation rate is the number of times that LoS blockages occur in the network. The blockage typically results in VLC signal attenuation, leading to a significant decrease (negative peak) in the value of the RSS readings. In this study, the LoS blockage event is modeled using the rate of occupation since the event is assumed to follow the Poisson point process and be evenly distributed between 0 and 1.
Consequently, effective RSS values were extracted from the RSS datasets using the PCA method. The reduced feature set may be readily observed and processed using various machine learning methods such HMM while preserving as much data as practicable. Other discriminant analysis methods that have been adopted with HMM include LDA or FDA [
30] and DMD [
31]. Moreover, the PCA is a multivariate method with low computing complexity for reducing the dimensionality of a huge dataset by transforming variables to smaller components while keeping the information in the larger dataset [
32]. The mathematical process for the PCA entails the following steps:
- (1)
Normalization. This ensures that each sampling point in a continuous signal contributes evenly to feature analysis. Generally, a significant discrepancy between the large and small RSS values in observation variables of the dataset produces bias at the output. Therefore, normalizing the variables of the dataset mitigates such bias. Supposing
is a RSS vector containing
i sampling points at a specific location
l, the normalized signal is given as:
where
and
are the mean and standard deviation of
, respectively.
- (2)
Covariance Matrix Computation. Note that the observation variables in the RSS dataset have a high correlation since the dataset contains repeated values at some MT locations. Thus, the covariance matrix,
, is computed to identify variations between the input data set and the mean. The
is a
-dimensional symmetric matrix that contains the covariances for all possible pairs of observation variables. Suppose that the RSS dataset,
C, has variables
, the covariance matrix of
C is given by:
The diagonal elements represent the variances of each observation vector, and the elements of are symmetric with respect to the main diagonal.
- (3)
Eigenvectors and Eigenvalues Computation. The PCs of the RSS dataset are produced using eigenvectors
and eigenvalues
, which are determined from the covariance matrix. SVD is a well known linear algebraic approach for obtaining
and
. The SVD method decomposes
in Equation (
8) to yield unitary matrices,
and
, as well as a diagonal matrix
with the same dimension as
and non-negative diagonal components in decreasing order, such that:
Note that entries in the diagonal of correspond to while the columns of represent , which are sorted in order of the (from highest to lowest). Thus, the PCs are obtained in order of importance as .
- (4)
Feature Vector Construction. Supposing that
k,
components have the largest variance, that is, the most information based on the PCA, the
k-dimensional matrix is expressed as:
Hence, the feature vector is obtained as:
The extracted features, known as effective RSS,
(containing the most information) are reshaped to smaller time window (
w) at various locations for the purpose of selecting the required observation frame for HMM, given by:
The choice of window size is critical to the performance of HMM since it helps to maintain the probability distribution of the transition matrix in the model. Suppose a very small time window is selected, the observation frame becomes too small, which may not be sufficient for the model to make a fair prediction about the RSS variation’s pattern. On the other hand, selecting a very large time window will result in a large observation frame that may contain several user mobility changes in the indoor environment. Therefore, we test different window sizes to select the one that maximizes the MT’s movement in a considerable observation frame using simulation.
2.4. Hidden Markov Model
The HMM is a class of PGM with state (hidden) variables, which are estimated through a sequence of output (observable) variables. The HMM is a flexible tool that can easily predict continuous RSS values and classify the handover decision of MTs in the indoor environment. For a sequence of N hidden states, each state of the model is represented as the locations of MT in the building at a particular time t. More so, the MT’s movement history at a state is depicted by a sequence of M observations (emissions) . This signifies the possible RSS values received by the MT.
Furthermore, for a given sequence of observable variables, the HMM estimates three important parameters , where is the initial start probability, is a transition probability matrix representing the probabilities of switching from state i to j at a given time, and is is a emission distribution matrix, referring to the probability of obtaining observation symbols from a specific state j. To maximize the occurrence of RSS values, is obtained by estimating the ML of the HMM parameters from the sequence of the received RSS values using the Baum–Welch algorithm.
3. RSS Prediction Based on HMM
Figure 2 depicts a typical HMM as an automated scheme for classifying and predicting the next RSS values of the hybrid VLC–WiFi system. This is in contrast to the method in [
21], which uses the HMM with fuzzy controller module as a model for prediction.
The RSS dataset is divided into two parts:
and
, which are used for model training and testing, respectively.
Figure 3 shows the time series of the hybrid VLC–WiFi RSS dataset used for training. As mentioned earlier, the VLC signal attenuation is caused by blockage and user mobility. The dataset samples are manually annotated and categorized into three RSS intervals: high, medium, and low intervals as depicted in
Table 2. Subsequently, the labeled dataset is used to train the HMM.
Two Gaussian HMMs: “VLC-AP” () and “WiFi-AP” () models are formed to represent the VLC and WiFi APs, respectively. The initial parameters of each model are given as and . Here, the sequence of observation O for each model is developed to include frames of 3 time slots, which restricts the monitoring of user motion to between 3 and 5 m. Thus, O for each category of an RSS indicator is used to train the appropriate HMM.
3.1. HMM Training
Training an HMM entails deriving the best parameters of
and
to optimize the probability of occurrence of
O and the joint probability of
O and
S, given by:
where
is the current estimated parameters,
is the parameters to be optimized,
is the conditional probability distribution of
O,
represents the probability of an MT being in state
, and
depicts the transitional probability of an MT moving from state
i to
j. To solve the optimization problem, an EM algorithm [
33] is applied, which iteratively estimates the ML of independent model parameters in the following steps:
Step 1—Expectation (E): To evaluate
, it suffices to compute
and
using HMM dependency since several terms vanish from the form of
in Equation (
13). Therefore, the posterior state probability value
is obtained from a given sequence of RSS indicators using independence of PGM, which is further decomposed to forward
and backward
terms such as:
Both terms can be efficiently computed using the forward and backward algorithm. Furthermore, the transitional probability of an MT moving from a state to another is decomposed using the Bayes’ rule as:
Step 2—Maximization (M): Here,
is optimized with respect to
. Moreover, each parameter in the categories 1, 2, and 3 in Equation (
13) is independent and thus can be optimized separately. To maximize
and
, we introduce Lagrange functions
and
considering the constraints that
and
,
. Suppose the derivative of
with respect to
is equal to 0, we have:
In addition, let the derivative of
with respect to
be equal to 0:
Note that the RSS dataset is continuous and follows a normal distribution. Hence, the emission distribution
for each AP is a Gaussian distribution, and the probability density function (pdf) is given by:
where
,
. In addition,
is maximized by defining the Lagrange function
such that its derivative with respect to
is equal to zero. Hence, the estimated parameters are obtained as:
and
Subsequently, the trained
and
models are concatenated to a single model
using their respective estimated parameters. The trained initial state probability is given as
, the trained transition probability matrix
is specified by a block diagonal matrix containing
and
(see [
34]), and the trained emission distribution is represented as
. Thus,
as shown in Algorithm 1.
Algorithm 1:Combined HMM. |
|
3.2. Prediction Algorithm
The prediction algorithm is obtained based on the trained HMM probabilities
and the observation sequence
of the RSS testing dataset.
is first converted to Gaussian variables
by computing the weight of each RSS data point in relation to the number of states using
. Thereafter, the V-Alg [
35] updates
using parameters
and
at time
t. Hence, the most probable next AP can be identified from the corresponding hard decision (best path) of the updated
. To allow an unbiased handover decision from the VLC AP to WiFi AP, and vice versa, equal switching probability is given in the transition matrix for the V-Alg. Consequently, the best path value (
) derived from observed MT mobility at time
t is compared to the average number of states
such that the vertical handover decision is classified as follows:
This classification helps to prevent the ping-pong effect, which may occur due to the MT mobility and/or if the VLC LoS is blocked for a short period of time. The process of prediction is summarized in Algorithm 2.
Algorithm 2:Prediction Algorithm |
|
3.3. Model Performance Evaluation
The performance of the proposed HMM-based predictive VHO algorithm is analyzed using the RSS measurements and parameters described in
Section 2. The VLC–WiFi dataset was visually inspected using MATLAB signal analyzer software. A
portion of the dataset is used for training, while the remaining
is set aside for validation and testing. The training samples were classified as VLC RSS and WiFi RSS, resulting in the formation of dual HMMs for both APs. The HMMs are ergodic forms, with each having two mixtures and three states. Therefore, the sequence of best paths
, each ending in 6 state at time
t, is obtained for the dual model
. Paths [4–6] and [1–3] were created during the user’s mobility and/or LoS blockage, corresponding to the different RSS levels of
and
, accordingly. This implies that the paths {4; 1}, {5; 2}, and {6; 3} reflect the low, medium, and high RSS values, respectively.
The performance of the proposed HMM-based VHO technique is compared with the exiting techniques in the literature based on the following evaluation metrics:
Accuracy [
36]:
where true positive is the number of times the output of HMM corresponds with the actual identified RSS values, while the true negative is the number of times the HMM predicts the wrong RSS values exactly as the actual wrong RSS value.
Mixed absolute percentage error [
37]:
where
M is the number of predicted points,
is the actual RSS value, and
is the predicted RSS values. These metrics are used to obtain the difference between the actual measured value and the predicted value of the model.