1. Introduction
Positioning technology is considered essential technology to monitor resources and manpower in today’s world. The automation of positioning technology in a real-time environment is a growing need in current industries. Positioning technology such as the Global Navigation Satellite System is available for outdoor or open environments. It is very accurate and has been used extensively. However, there is a lack of common and reliable positioning technology for an indoor environment [
1,
2,
3]. Location intelligence using positioning technology can create live maps and Apps for monitoring and tracking. The decision makers can identify opportunities of business growth, employee safety and efficiency by analysis of data from location intelligence tools. Therefore, there is a high demand for accurate location tracking tools for an indoor environment [
4,
5,
6].
There are a number of indoor positioning technologies available today. They deliver indoor localization and they are broadly classified into three categories: wireless-signal-based techniques, vision-based techniques and other techniques [
4]. In the wireless-signal-based techniques, the system uses various parameters, such as Received Signal Strength Indicator (RSSI), time of flight, time of arrival, time difference of flight, time difference of arrival and channel state information, to predict the position of mobile devices connected to the wireless system. In vision-based techniques, the system utilises computer vision techniques with the support of various types of cameras to predict the position of mobile devices connected to the system. Among wireless-based techniques, RSSI-based fingerprinting is popular in the literature as it is less complex and requires no additional hardware [
7,
8,
9,
10,
11]. In recent times, there has been a significant growth in the use of Wi-Fi technology in residential, industrial and commercial settings, which has also contributed to adoption of RSSI-based fingerprinting techniques in indoor localization. Therefore, several attempts have been made to implement indoor localization using Wi-Fi, fingerprinting, machine learning and Received Signal Strength (RSS) values [
4,
5]. There are three categories of fingerprint methods, namely, deterministic, probabilistic and machine-learning methods, where the first two methods incur significant computational cost. A machine-learning approach can be more computationally efficient and popular [
12], and therefore we use a machine learning approach in this paper.
1.1. Motivation
Initially, Wireless Access Point (WAP) is implemented with a single frequency band. The recent trend is to use multiple frequency bands in WAP, as the 802.11 standard comes with several distinct radio frequency bands. Hence, the industry utilises multiple frequency bands to improve bandwidth, speed and stability in Wi-Fi. Therefore, the future trend is to utilise multiple frequency bands in a single WAP device [
13]. To go with the natural progression of Wi-Fi technology trend, it is imperative to utilise multiple frequency bands on location tracking using Wi-Fi. Therefore, more research is needed to see the effects of dual or multiple frequency bands on Wi-Fi based location prediction.
1.2. Contributions
The significant contributions of this paper are listed below:
Analysis of the use of dual frequency bands (2G and 5G) to improve accuracy of positioning prediction in comparison with single frequency band.
Analysis of the effects on accuracy of positioning prediction by varying the location of Test Point (TP), fingerprint coverage and grid size.
Results comparable with the existing literature are obtained by using significantly less Reference Points (RPs) per area.
The remaining part of this paper is organised as follows:
Section 2 provides a brief review on Wi-Fi-based fingerprinting methods using machine learning, followed by a research gap and research questions.
Section 3 presents the experimental details, including experimental location, RSS measurement process, performance metrics, system modeling, machine learning algorithms and tools used in this study. Results of the experimental study are presented in
Section 4, followed by the concluding remarks and future directions in
Section 5. All of the acronyms used in this paper are presented in abbreviations.
2. Related Works
This section provides a brief literature review on Wi-Fi-based positioning techniques using fingerprinting and machine learning. The mapping of RSSI values displays the random fluctuations in radio frequency signal in the wireless environment due to its time-varying nature [
14]. The quality of the received signal in time-varying wireless channels can be measured by RSSI or RSS using a mobile device. One of the main challenges of using Wi-Fi for indoor location prediction is that there is a huge amount of variability and impairment in the channels due to partitions, indoor objects and walls [
4,
5,
15,
16,
17].
Fingerprint-based localization is also known as a radio-map-based method or scene analysis [
18], where we create a map of RSSI or RSS values in the offline phase. This map is used to predict the location of mobile devices with measured RSSI or RSS in an online phase. RSSI or RSS values of received signal from Wi-Fi stations can be measured using an application on a mobile device. The localization prediction depends on the measurement methods used while developing the radio map [
4]. One of the issues with the fingerprint is the time and cost required for collecting large amounts of data, but there are alternative methods [
19] available for efficient data collection.
A review on indoor localization techniques and technologies can be found in [
5], in which they perform an extensive review of different techniques, technologies used and indoor localization systems. In [
4], they comprehensively review an indoor localization system using Wi-Fi and fingerprint with machine learning techniques. This paper also provides a comprehensive discussion on the applications of indoor localization, operation of Wi-Fi-based indoor localization, machine learning techniques applied in indoor localization and fingerprinting techniques.
The experimental results and performance analysis of a Wi-Fi based localization system using machine-learning algorithms such as K-Nearest Neighbour (KNN), Support Vector Machine (SVM), Decision Tree (DT) and Random Forest (RF), are presented in [
12]. The experiment is conducted in a hall of size of 14.15 m × 3.77 m, with six WAPs inside the hall. RSSI measurement is conducted using a custom-built Android based application on a Galaxy Note 10.1. It is unclear which frequency band is used in Wi-Fi in this experiment. In addition, there are six WAPs in a small hall, which is an unrealistic scenario. The Principal Component Analysis (PCA) technique is utilised to reduce the correlation between the grid points and the performance of the proposed methods is presented in terms of cumulative error distribution function, mean, variance and root mean square error of distance error.
In [
20], the authors propose a grid search, based on PCA and SVM, for indoor localization on a single floor of 340 m
. The experiment is conducted in a single floor, with sixteen WAPs uniformly distributed, and WAP is operated at 2.4 GHz. RSS measurement is conducted using a TL-WN823N USB wireless network adapter. It is unclear what type of mobile devices are used to collect the RSS, and also there are sixteen WAPs in a small area, which is an unrealistic scenario. The PCA technique is utilised to extract the radio map, and the performance of the proposed method is presented in terms of mean square error in localization.
The authors of [
21] propose a bisecting k-mean-based fingerprint indoor localization technique, and present their results in terms of Accuracy (LA)and Average Distance Error (ADE). The experiment is conducted in a corridor of a floor using WirelessMon application. The application is used to detect WAP and measure RSSI values from a maximum of seventeen WAPs. It is unclear which frequency band is used in this experiment. According to the authors, the bisecting k-mean-based fingerprint indoor localization technique performs better than the k-mean-based technique.
The High Adaptability Indoor Localization (HAIL) technique which uses machine learning is presented in [
22]. The proposed technique uses absolute RSS and relative RSS values and a back-propagated neural network. The authors propose a separate fingerprint for each class of device (device dependence) for better accuracy. This is a compelling consideration, but it may be costly and impractical. In [
23], the authors introduce a simulation-based location tracking system using Wi-Fi, fingerprinting, a weighted fuzzy matching algorithm and a particle swarm optimization algorithm.
In [
24], the authors formulate a positioning problem as a pattern recognition problem, in which they use a simplified Bag-of-Features-based technique to transform raw RSS values into a robust feature vector. The model is validated with both simulated and real-time experiments. Using the grid size of
m
, they achieve a mean localization error of 1.5 m, but the authors do not mention the frequency band used in the experiment.
The authors of [
25] propose a localization solution based on the particle swam optimization technique, which is a random optimization technique that originated from the foraging behaviours of birds. The authors use a grid size of
m
for RPs in the offline phase to prepare the fingerprint data and grid size of
m
for TPs to use as test data. The frequency band used was 2.4 GHz. The performance of the proposed technique is compared with four classical machine-learning algorithms (KNN, SVM, LR, RF) and the average localization error achieved with the proposed techniques is 2.0817 m, which is better than the performance of the classical methods.
In [
14], the authors develop dual frequency (2.4 and 5 GHz) RSSI fingerprint and propose a hybrid RSSI fingerprint classification model, which follows the Canadian Institute For Advanced Research (CIFAR)-10 model framework based on image classification. The dual RSSI data samples on RPs are converted to image data, and the image is used as an input for the training model. The classification output from CIFAR-10 model is used to predict the position of the mobile device. The experiment is performed at an office of 2600 m
with seven dual-frequency WAPs. Their study presents the results in terms of mean, standard deviation, root mean square value and maximum error.
The authors of [
26] analyse the effect of the location of the beacon; however, they do not discuss the affect of grid size and location of RPs. They use the
m
grid and Deep Learning (DL) methods and achieve accuracy of less than 3.3 m at the 90th percentile and a mean distance error of 1.67 m. In [
27], the authors study the effect of different grid sizes of 1 × 1, 1.5 × 1.5, 2 × 2, 2.5 × 2.5 and 3 × 3 m
, and find that smaller grid size (1 × 1) m
is better for accuracy, but considering other parameters, they recommend (2.5 × 2.5) m
for indoor localization. A recent paper [
28] discusses the effects of grid size on the accuracy of prediction errors in localization. Though they get better accuracy with a smaller grid size, they recommend choosing a grid size of 1.5 × 1.5 m
for practical purposes. They have proposed a Generative-Adversarial-Network-based DL scheme for multi floor localization architecture. Similarly, ref. [
29] discusses grid size, but argues that if there are large number of WAPs available, then grid size is less influential. Having large number of WAPs is good for an experimental or simulation setting, but not in a practical setting, which indicates that grid size is an important consideration for the performance. However, there is no clear agreement on the optimum grid size for Wi-Fi fingerprinting.
2.1. Research Gap
According to the IEEE 802.11 standard, Wi-Fi can broadcast on several license-exempt frequency bands, including 2.4 GHz, 5 GHz and 6 GHz, and these frequency ranges have specific properties. These frequency bands offer different data rates, bandwidth and coverage according to propagation characteristics of frequency band. The higher-frequency bands (5 GHz and 6 GHz) provide a higher bandwidth, but have a smaller coverage area, whereas the 2.4 GHz band provides a larger coverage area but has a smaller bandwidth. Therefore, modern Wi-Fi stations use multiple frequency bands to exploit these characteristics.
Our literature review indicates that there is limited research work conducted on indoor localization using dual frequency (2.4 GHz and 5 GHz) and classical machine-learning algorithms.
Table 1 summarises the technique and frequency band(s) used in indoor positioning literature. Most of the proposed systems for indoor localization in the current literature either use a 2.4 GHz frequency band [
20,
25] or do not discuss the use of specific frequency band [
12,
15,
21,
22,
23]. We found only one paper that uses dual frequencies for indoor localization [
14], and their study was based on image classification using the CIFAR-10 model. In this study, the indoor localization is conducted using images as inputs. These review findings clearly indicate that there is a need for further research on how the use of dual-frequency bands can impact indoor location prediction. Thus, we plan to analyse the effect of dual frequency on location prediction using fingerprinting and regression-based machine learning algorithms to address the existing research gap.
Our review indicates that there is no clear agreement on grid size for the Wi-Fi fingerprint, despite it being an important parameter in localization. Therefore, we also plan to study the effect of grid size on the position prediction and location accuracy. Additionally, we also analyse how the RP coverage affects the performance of location prediction.
2.2. Research Questions
This research paper focuses on answering the following three research questions:
Does the accuracy of the location prediction of a mobile device increase by using dual frequencies available at Wi-Fi stations?
Which machine-learning algorithm performs better in terms of location prediction in this new experimental scenario?
How do fingerprint training grid size, training points and test points location affect the prediction accuracy?