1. Introduction
In recent years, with the development and maturity of satellite navigation systems, it is no longer difficult to achieve high-precision outdoor positioning. However, due to the complex indoor positioning environment, with numerous obstructions, it is difficult to obtain more accurate location information from targets in an indoor environment, which is also a hot research topic.
At present, the common indoor positioning technologies are mainly Wi-Fi, Bluetooth, infrared, ultra-wideband, Radio Frequency Identification (RFID), ZigBee, etc., and mainly divided into two service providers: base station and inertial position [
1]. Commercial technologies basically use wireless communication base station solutions, such as Wi-Fi and RFID. RFID technology has the characteristics of a long reading distance, strong penetration ability, anti-pollution, high efficiency, and large amount of information [
2]. In this paper, we propose an RFID three-dimensional indoor positioning scheme based on deep learning. Radio Frequency Identification (RFID) technology has the advantages of no contact, permanent storage, strong readability, etc. It can effectively support indoor positioning. Using the unique identification characteristic of the tag to the object, the RFID positioning system obtains the position information of the electronic tag according to the signal sent by the electronic tag received by the reader. While saving costs, it can obtain more authentic and effective target location information. RFID is widely used in various scenes, such as smart libraries, logistics warehouses, shopping malls, etc.
RFID system is mainly composed of hardware components and software components [
3]. The hardware components mainly include a reader and RFID tags. The RFID tags are placed on an identified object. The reader generally includes a high-frequency module, a control unit, and a coupling element. Interactive data are inquired between the reader and the RFID tags through radio frequency to realize short-distance communication. The data is sent to a computer through the software components.
RFID positioning technology is divided into absolute positioning and relative positioning [
4]. Absolute positioning refers to the physical absolute positioning of a designated target, while relative positioning refers to determine the positional relationship among the targets. In specific indoor applications, relative positioning and absolute positioning are applied to different scenes. The absolute positioning error is relatively larger, and the approximate position of the object in the coordinate system can be obtained through positioning, while the relative positioning makes it easy to get the positional relationship between different labels, so by attaching the labels to the objects, we can detect and correct the position of the wrong sequence objects.
In real life, RFID indoor localization technology is widely used in warehouse goods inventory, book placement order inspection, etc. We often need to locate a large number of targets, and the data we get for positioning is also very large. For deep learning, the more data used for training, the more accurate its final predictions can be. Therefore, the combination of deep learning and indoor positioning technology can effectively extract high-level and abstract features from the original data, allowing the computer to interpret and predict the data, thus improving the positioning efficiency.
In this paper, a scheme called 3DLRA—combining deep learning with RFID positioning technology—is proposed. The antenna groups are used to obtain the information of the tag to be tested and the reference tag. Through the variation characteristics of RSSI and Phase, the similarity between the tag to be tested and the reference tag is calculated to obtain the absolute position of the tag to be tested on a two-dimensional plane. Then, deep learning is used to mine the characteristics and laws of the data, thus further obtaining the Z-axis position information of the tag with higher accuracy. The relative positioning and absolute positioning are combined to obtain the three-dimensional positioning information of the tag to be tested.
In the following, the research status of the RFID positioning technology is descripted in
Section 2; the whole system architecture and model is built in
Section 3; the whole experiment is described and analyzed in
Section 4; our system is compared with other typical RFID positioning methods in
Section 5; and the summary of this paper is given in the last section.
2. Related Work
Today’s RFID indoor positioning schemes can be divided into one-dimensional absolute or relative positioning, two-dimensional planar positioning, and three-dimensional spatial positioning. Among them, three-dimensional spatial positioning is a difficult and hot topic in today’s research and has great research value. The following part will elaborate and analyze the existing research results.
The one-dimensional positioning scheme can obtain the absolute position or relative position relation of the target object, and is suitable for the management and recording of goods in assembly line situations. In the indoor environment, because the position of the target object will change, the relative positioning method is usually determined by the position information of the target object. The traditional relative positioning method STPP [
5], based on the space–time phase profile, analyzes the spatial order of the tag through the phase curve. The PRDL [
6] method proposed by Shen et al. breaks through the bottleneck of high density tags, combines deep learning with relative positioning, and improves the relative positioning accuracy. The HMRL [
7] method uses the change in the tag signal caused by human movement to realize high-precision relative positioning.
The two-dimensional positioning scheme can obtain the x–y coordinates of the target, which is suitable for plane navigation technology, etc. The two-dimensional positioning can be divided into ranging positioning and non-ranging positioning. The basic ranging methods include time of arrival (TOA), time difference of arrival (TDOA), angle of arrival (AOA), and received signal strength (RSSI) [
8]. For the first time, the LANDMARC [
9] system introduces a reference tag, and uses the “K-Neighbor” algorithm to weigh the coordinates of the tags to obtain the position information of the target. The VIRE [
10] method improves the LANDMARC method by adding a virtual reference tag and setting a threshold filter tag to simplify the calculation process. BVIRE improved the VIRE algorithm and proposed the concept of a boundary virtual reference tag, which improved the positioning accuracy of the boundary tag to be tested [
11]. The ANTspin [
12] is introduced into the rotating antenna to dynamically collect the signal strength information of the tag, and the target position information is obtained through calculation. Fu proposed a combination of RFID phase and laser-based clustering to use particle filters to locate the moving object [
13]. The LF [
14] RFID system uses an LF magnetic field for reliable positioning in multipath and non-line-of-sight environments.
The three-dimensional positioning scheme can obtain the spatial position information of the target, which is beneficial for the progress of indoor navigation technology and makes people’s life more convenient. The APM [
15] scheme handles distance estimation parameters based on minimum interrogation power and multilateral measurements, while the APAA [
15] scheme supports reader mobility and promotes a highly dense tag environment. The 3DinSAR [
16] method uses a holographic atlas to obtain target position information through the phase difference of tags with different heights. Although this method can realize real-time perception, the calculation is complicated. The VLM three-dimensional localization algorithm [
17] is based on virtual tags and topological constraints, but this algorithm has blind spots, which may lead to localization failure. The Tagspin [
18] method allows tags to rotate uniformly on the edge of the rotating disk, enabling fast and high-precision positioning. When estimating the received power, multidimensional scaling (MDS) [
19] considers path loss and shielding effects, and implements 3D positioning of active RFID tags. The Active–Passive algorithm is based on the Nelder–Mead nonlinear optimization method, which can accurately locate the x-axis and y-axis positions of the tags, but the z-axis positioning accuracy is relatively bad [
20].
4. System Implementation
4.1. Hardware and Software
We mainly used an Impinj-R420 UHF-RFID reader, equipped with five E9208PCRNF UHF antennas and a set of H47 passive tags. The reader was connected to the same Ethernet with the computer through a network cable, and the reader got the data through Tagsee. The computer we used was configured with an Intel Core i7-9750H, memory 8G, and graphics card GTX 1650 4G. The program is programmed with Python 3.6. The Python packages required for the program to run were tensorflow cpu1.1.0, keras2.1.2, numpy1.2.10, pandas0.2.0, and matplotlib1.2.0.
4.2. Environmental Deployment
There are many types of books in the library, and manual sorting takes a lot of time and effort. Therefore, RFID positioning technology is commonly used to manage books. In summary, we set the library as an environmental scene, and obtain the bookshelves that you want to find by 3D positioning. With the layer position of the books on the bookshelf, this RFID three-dimensional positioning helps book managers to find books, thereby saving manpower. This method is also applicable to other scenarios, such as warehouse logistics. Considering that the radio frequency in the UHF band is relatively sensitive to the environment, especially metal, we used wooden bookshelves in our experiments.
According to the LANDMARC system, we know that the density of the reference tags will affect the performance of the x–y plane positioning. We have placed a different number of reference tags on each layer of the bookshelf and conducted multiple sets of experiments, as shown in
Table 1. Finally, we found that placing three reference tags in each layer is the most reasonable, which can not only save costs, but also obtain a higher positioning accuracy.
Figure 6 shows the experimental scenario we set up, in which five antennas and two bookshelves were placed. Each bookcase is shown in
Figure 7. Three layers of passive tags are placed on the bookshelf, and 20 tags are tested on each layer. The distance between the tags to be tested was 2 cm. There were 18 reference tags in total. Three reference tags were placed on each layer of each shelf. They were arranged on both sides and in the middle. On each layer of the shelf, the reference label was placed on the inner side, and the label to be tested was placed on the outer side. The antenna height was 95 cm, and the shelf height was 25 cm. The five antennas were divided into two groups: Antennas 1–4 were Group A, and Antenna 5 was Group B; Group A antennas were used to obtain the absolute position information of the x–y plane of the tag to be tested, and the Group B antenna was used to obtain the z-axis relative position information of the tag to be tested.
4.3. Data Collection
Group A used four antennas to read the information of the reference tags and the tags to be tested at the same time. Each time it reads for 20 s, and each tag can read up to 250 sets of data by one of the antennas, and saved to realize the positioning of the absolute position of the tag to be tested in the x-y plane. Group B uses one antenna to read the information of the tags. The antenna remains stationary, and one person moves between the bookshelf and Antenna 5. In the original experiment, the distance between the antenna and the tags on the bookshelf was 38.5 cm; the person moved 180 cm, and the person passed along the tag sequence at a uniform speed. The movement time was about 12 s. Each time, an average of 60 tags were collected, and each tag had about 100 sets of data. In order to effectively process the data, the data of the tags were complemented with 150 pieces by using the method of zero padding, as data backup.
4.4. Data Processing
We imported the collected data of Group A into the absolute positioning model of the system. In the smart library, because we need to know which shelf a book is on, we put a reference tag on each shelf. Through the RSSI read by each reader, the similarity between the tag on the book and all reference tags was obtained, and we computed the minimum correlation to integrate these similarities. The specific method is mentioned in
Section 3.3.1. Then, the similarity between the tag and each shelf is obtained. We determined which shelf the tag is on by comparing the similarities.
We imported the collected data of Group B into the relative positioning model of the system. The input data of the neural network was (n, 3, 1,150), and the length of the input data was 150. After getting the neural network model, we brought the test data into the model and get the confidence of the layer position information of the tags. The theoretical position information of the tags of the third layer was (0,0,1); the actual location differs from the information confidence. The maximum value of the confidence sequence of the tag was taken as the predicted position information of the actual layer of the tag.
4.5. Analysis of 3D Positioning Accuracy
We defined the absolute accuracy and relative accuracy. Absolute accuracy indicates the proportion of books with the correct absolute position on the x–y plane to the total number of books. The relative accuracy indicates the proportion of books with correct relative positions in the z-axis direction to the total number of books. Considering that the requirements for absolute accuracy and relative accuracy of the tags are different in different scenarios, we used a weighted algorithm to express the final 3D positioning accuracy. We stipulated that R_A represents relative accuracy and A_A represents absolute accuracy. Suppose the proportion of relative accuracy is a, and the proportion of absolute accuracy is b, and a + b = 1. Then, the three-dimensional positioning accuracy is expressed as follows:
In the smart library scenario, by comparing different segmentation weights, multiple experiments were performed, and each segmentation weight was subjected to 50 experiments, and the experimental accuracy was analyzed, as shown in
Table 2. Under the condition that the absolute positioning and the relative positioning are balanced, it was finally found that the weighted operation was performed at a ratio of 8:2, and the obtained 3D positioning accuracy is the highest and the most objective and true. The highest 3D positioning accuracy obtained was 96.264%. In the following calculation, we replaced “b” with 0.2 and replaced “a” with 0.8 in Equation (11). The three-dimensional positioning accuracy is expressed as follows:
In addition, we verified the performance of the system in a horizontal position. The deployment of the experimental scenario remains unchanged. We moved the tags to be tested to collect about 150 pieces of test data.
Figure 8 is the CDF graph of the absolute positioning error. In the figure, the blue curve indicates that the average positioning error on the x-axis is 3.82 cm, and the pink curve indicates that the average positioning error on the y-axis is 8.35 cm. Combining the average positioning error of the x-axis and y-axis, the average positioning error of the two-dimensional plane is 10.02 cm. The error was better than many existing solutions; e.g., AOA (70 cm) [
8], Landmarc (30 cm) [
9], and Vire (14 cm) [
10].
4.6. Robustness Analysis
When implementing the system, we determined the distance between the antenna and the tag sequence, the speed of human movement, and the height of the antenna. In actual applications, these parameters are not determined. This requires us to explore the robustness of the final model and try to improve the stability of the model. We changed the distance between the antenna and the tag sequence, the moving speed of the person, and the height of the antenna; then, we collected multiple sets of data for testing, and observed how the changes in these parameters affect the positioning accuracy.
• Distance between antenna and tags
In the experiment, the distance between Antenna 5 and the tags was 38.5 cm. On this basis, the distance was increased or decreased to analyze the change of the relative positioning accuracy. The results are shown in
Table 3. We first conducted experiments in the range of increasing or decreasing 1–5 cm. Considering the power of the antenna, we set the maximum distance between the antenna and the tags to 53.5 cm, and the closest distance to 23.5 cm. Within the set variation range, the overall accuracy was greater, above 0.9. Therefore, the system is more robust in terms of the distance between the antenna and the tags.
• Speed of moving
In the experiment, we set the speed of human movement between eleven and twelve centimeters per second, and adjusted it based on this. Since the speed of human movement cannot be accurately controlled, we increased the number of experiments to ensure the objectivity of the result. As shown in
Table 4, the results show that the overall accuracy is high and the model is robust.
• Antenna height
In the experiment, we set the height of the antenna to 95 cm, and adjusted it based on this. It is considered that if the change of the antenna height is small, it has almost no effect on the accuracy, so the antenna height change range, as shown in
Table 5, was set. After experimental verification, we can see that the system is robust.
5. System Comparison
Based on the experimental scenes, we used the methods mentioned in the following table to test the positioning accuracy. Finally, we found that PRDL has a higher accuracy in one-dimensional positioning, combining with deep learning. Using similar values, HMRL has a higher accuracy in two-dimensional positioning. LANDMARC has high accuracy in the two-dimensional positioning by using the reference label. ANTspin uses a rotating antenna combined with deep learning to achieve a high accuracy three-dimensional positioning. Active–Passive locates objects in 3D space by using RFID tags and readers. VLM provides fine-grained localization accuracy in 3D positioning based on connectivity information. 3DLRA combines the characteristics mentioned above achieving a higher accuracy in three-dimensional positioning.
Below is a comparison of our 3D positioning method with existing classic RFID positioning solutions. “√” in
Table 6 indicates that the positioning method has the characteristics of this column.
From the analysis of the above table, we can see that the RFID 3D positioning scheme we designed has certain advantages over previous studies. Our positioning system does not require moving antennas, which can save manpower and costs. In addition, we used deep learning technology in combination with absolute positioning and relative positioning to further mine data features, thereby obtaining a higher positioning accuracy.