1. Introduction
As wireless networks and smart phones have become widespread, indoor localization systems (ILSs) started receiving much attention. While tracking the location of the user’s devices, indoor localization systems can provide the location information to the service client and the service providers in an indoor environment. For example, in an art gallery, indoor localization systems can provide the location of devices so that visitors can obtain a description of the artwork they are currently viewing. In addition, gallery operators can place artworks based on statistical information of the user’s location.
Especially, Wi-Fi is commonly used for indoor localization because the wireless access point (WAP) information can be used without additional hardware [
1]. To predict the location of user, Wi-Fi-based indoor localization systems generally use the received strength signal(RSS) values of user’s device captured by multiple WAPs. Here, RSS is a measurement of the power present in a received radio signal. However, their usage is limited because Wi-Fi-based indoor localization systems show low performance in environments with natural noise such as shading and multiple path fading [
2].
As a representative method to overcome the performance degradation of Wi-Fi-based indoor localization systems under the environment with natural noise, some studies proposed a method using channel state information (CSI) that contained more location-related information than RSS. Even though CSI can improve the localization accuracy of the Wi-Fi-based indoor localization systems, the usage in practical applications is limited because it requires the modification of the device [
3,
4].
To address such practical issue, studies on removing natural noise using traditional filters such as moving average filter and particle filter are proposed [
5,
6,
7]. As shown in
Figure 1A, after removing the natural noise using filters, the user’s location is estimated through a heuristic classification algorithm such as decision tree (DT) and random forest (RF). Moreover, most recent studies try to apply deep learning techniques to estimate the user’s location by learning the characteristics of the RSS values and natural noise as shown in
Figure 1B [
8,
9,
10]. Let us note that the performance of both methods significantly decrease under the environment with artificial noise caused by media access control (MAC) spoofing attack [
11]. Here, MAC spoofing attack is an attack that attackers spoof their MAC address into MAC address of user’s device to perform a man-in-the-middle (MITM) attack.
In this paper, we introduce a specific MAC spoofing attack scenario, where the localization accuracy of the state-of-the-art indoor localization methods decreases. In this attack scenario, after spoofing the user’s MAC address, an adversary sends his signal to wireless access points (WAPs) distant from the actual location of the user as if his signal comes from a normal user’s device. As a result, RSS values with the same one as the user’s device ID are captured even at locations where the user device is not actually located. Such results can cause severe budget losses on discount stores which are sensitive to small rearrangement the interior.
To deal with this attack scenario using the artificial noise, we propose a new deep learning-based indoor localization method whose overall operation are shown in
Figure 1C. Different from the previous deep learning-based methods in
Figure 1B, the RF-based filter is applied to remove artificial noise before feeding into the deep learning model. The RF-based filter learns noise patterns of MAC spoofing attack. After identifying whether the RSS value includes artificial noise generated by MAC spoofing attack or not, the RF-based filter removes the artificial noises.
Main contributions of this paper can be summarized as follows: (1) After analyzing the problem of the previous indoor localization systems, we introduce a possible attack scenario decreasing their localization accuracy; (2) We propose a new deep learning-based indoor localization method using RF-based filter to show the good localization accuracy under the environment with artificial noise; (3) From the experimental results with multi-building, multi-floor dataset [
12], we show that the deep learning-based indoor localization method shows better localization accuracy against MAC spoofing attack than the state-of-the-art deep learning-based indoor localization method.
Since we applied a Random Forest filter to remove artificial noise generated by MAC spoofing attack, this paper is similar to the work of Alotaibi et al [
13]. However, in this work, Alotaibi et al. did not consider RSS time-series information. Different from the work of Alotaibi et al. by considering RSS time-series information, we consider when fake user’s signals are captured by the AP in a space different from the actual user during the time the indoor localization system estimates the user’s indoor location.
The rest of this paper is organized as follows. In
Section 2, we introduce the existing works related to Wi-Fi-based indoor localization systems. In
Section 3, we show some preliminary experimental results for designing the proposed indoor localization method. After describing the details of the proposed MAC spoofing attack scenario and deep learning-based indoor localization method in
Section 4, we evaluate the performance of the proposed deep learning-based indoor localization method from the experimental results using a multi-building and multi-floor indoor localization dataset in
Section 5. Finally, we conclude the paper in
Section 6.
2. Related Work
According to the data collected from user’s mobile devices, the Wi-Fi-based indoor localization systems are categorized into using a CSI and using a raw RSS in general. In 2017, Hao Chen et al. proposed a convolutional neural network (CNN)-based Wi-Fi localization algorithm using a time-frequency matrix organized from CSI. After converting complex number information of CSI into a feature image, they showed about 91% accuracy through a CNN model consisting of three convolutional layers and two fully connected layers [
3]. Shangqing Liu et al. used CNN to extract the relationship between the channel information of CSI and the number of people in the multi-human environment. Also, they used long short term memory (LSTM) model to analyze the dependence between the number of people and CSI. This method showed average accuracy by as much as 86.4% under the environment with five or more people [
4].
However, Wi-Fi-based indoor localization systems using CSI have the limitation that the existing device driver should be modified. As an alternative, the Wi-Fi-indoor localization systems using raw RSS from WAP have been widely studied [
14]. The indoor localization systems using raw RSS are mainly categorized into two groups: (1) filtering-based approach [
5,
6,
7,
15,
16]; (2) deep learning-based approach [
8,
9].
The filtering-based methods remove noise before estimating the user’s location by using a classifier. Henri Nurminen et al. proposed running a light-weight fallback filter in the background of real-time particle filter and forward-backward recursions-based smoother for 2D, 3D door positioning [
5]. Bodhibrata Mukhoopadhyay et al. suggested estimating a particular location using mode values of RSS and removing high frequency noise using moving average filter to improve indoor position accuracy [
16]. Zhu Nan et al. proposed a new particle filter based on Rao Blackwellized particle filter (RBPF) to address the issue that WAPs could be sparse and short range [
6]. In addition to improving performance using filters, the classification system for indoor positioning has been advanced through machine learning techniques. Rafał GÓRAK et al. proposed a modified random forest algorithm for indoor localization system [
15]. They also showed that the indoor localization system worked without errors even in situations where some WAPs were turned off. Sunmin Lee et al. proposed a system that estimated the indoor location of smart watch devices using random forest, and used basic service set identifier (BSSID) as well as RSS to address the problem of similar signal strength [
7]. The location-based radio map and BSSID list-based radio map is also used to reduce the number of comparison.
The deep learning-based methods train noise itself, such as shading and fading. Kim et al. proposed a hierarchical deep neural network (DNN) architecture consisting of a stacked autoencoder for multi-label classification of building, floor, and location [
9]. By appending autoencoder models according to the number of buildings and floors, their DNN architecture for multi-building and multi-floor indoor localization can cover a large-scale complex of many buildings. To address the interference of moving objects and co-channel interference, Qiwu Zhu et al. proposed an ensemble model consisting of fuzzy classifier and multi layer perceptron(MLP) at the indoor parking localization [
17]. They showed high accuracy through experiments using indoor parking lots in real shopping mall. Mai Ibrahim et al. presented a CNN-based method for indoor localization from multi-building and multi-floor dataset [
8]. Their method showed 100% accuracy for building and floor prediction by using RSS time-series information. However, the performance of the deep learning-based methods decreases when fake RSS tuples artificially generated from active attacks such as MAC spoofing attack are injected into fingerprint DB.
3. Preliminary
In this section, we introduce the experimental environment used at Mai Ibrahim et al.’s method [
8], which is a state-of-the-art CNN-based indoor localization using RSS time-series data, and shows the measured prediction accuracy using a public RSS dataset, called UjiIndoorLoc [
12]. We also introduce the limitation of the deep learning-based method under a threat model through active attacks such as MAC spoofing attack. Such a limitation motivated us to design the proposed deep learning-based indoor localization method using RF filter.
In
Figure 2, we show an example which shows the overall operation of how to predict user’s indoor location using RSS values in photo exhibition. Let us consider a user in front of the eagle photo whose position ID is one. First, the user’s mobile device sends the RSS values together with the phone ID(: 1). Then, nearby WAPs, i.e., WAP(A) and WAP(B), capture RSS. Second, WAPs (A) and (B) send the captured RSS and phone ID to a server(: 2). Third, after collecting RSS values from two WAPs and storing in the fingerprint DB, a server predicts position ID 1 matched with user’s location(: 3). Fourth, a server sends the narrative information related to position ID 1 to user’s mobile device(: 4, 5).
As described in [
8], we generate a feature image using
T number of RSS tuples with the same phone ID stored in the fingerprint DB for
S seconds. For example, let us assume that
T is set into 3 and RSS tuples are collected from 6 WAPs. As shown in
Figure 3a, three raw time-series RSS tuples with phone ID 1 are used to generate a feature image, whose size is
. Using the feature image, we train and test the deep learning-based method.
In practice, after implementing the experimental environment used at Mai Ibrahim et al.’s method [
8], we measured the prediction accuracy of indoor localization using UjiIndoorLoc dataset [
12], which has RSS values collected from 520 WAPs. From
S and
T values set into 60 and 3 respectively, we create the feature image whose size is
, i.e.,
(# of WAPs)
. Also, we train and test the deep learning-based method using 6453 number of training data and 138 number of test data, respectively. When training CNN model composed of two convolutional layers, we set the parameters into: ReLU, Softmax, Adam and MaxPooling for activation function, output layer activation function, optimizer, and pooling method, respectively [
18,
19]. Also, both the kernel size and the stride are set to 2. As a result of training with 30 epochs, the CNN model showed the indoor location prediction accuracy by as much as 94.93%. When
T was set to 4, the accuracy were improved up to 100%.
Let us note that by targeting on the environment shown in
Figure 2, as shown in
Figure 3b, an adversary can generate a manipulated RSS tuple including artificial noise with the same phone ID. By injecting a fake RSS tuple(red-colored tuple) into fingerprint DB through active attack such as MAC spoofing attack, the adversary can cause the wrong prediction from a deep learning-based indoor localization method. That is, such manipulation results in poor indoor localization prediction accuracy. As shown in
Figure 4, with the MAC spoofing attack, a user who requests some location information receives the wrong information from server. To prevent attacks from such a threat, refining RSS tuples stored in the fingerprint DB is necessary before generating the feature image. As an efficient method to eliminate artificial noise generated from such a threat, we propose a deep learning-based indoor localization method using RF filter.
6. Conclusions
Indoor localization systems have been implemented using various technologies such as Wi-Fi, RFID, Bluetooth, and so on. Among them, Wi-Fi technology is commonly used for indoor localization systems because it does not need any additional hardware. The existing Wi-Fi-based indoor localization systems use filtering or deep learning techniques to remove natural noise such as shading and fading. However, the previous methods are vulnerable to artificial noise generated from active attacks such as MAC spoofing attack. In this paper, we introduced a MAC spoofing scenario which generates the artificial noise injected data. In our MAC spoofing scenario, an adversary sends his signal to WAPs distant from the actual location of the user as if his signal comes from user’s device. The evaluation results show the prediction accuracy of the previous Wi-Fi-based indoor localization system without filtering decreased from 94.93% to 2.27%. To address the performance degradation problems due to the artificial noise, we also proposed a new deep learning-based indoor localization method using RF filter. The RF filter in proposed deep learning-based indoor localization method learns noise patterns of MAC spoofing attack to identify and remove artificial noises. From the experimental results using a public dataset, we showed that the proposed indoor localization method increased the prediction accuracy from 2.27% to 95.31% under the artificial noise injection attack.