1. Introduction
Location information of a pedestrian is recognized as being very important for emergency rescue and location-based services and has been estimated and provided based on various methodologies [
1]. To this end, Global Positioning System (GPS)/Global Navigation Satellite System (GNSS) is the most common method in outdoor environments [
2], with mobile/wireless network-based wireless location technology also used in various ways [
3,
4,
5,
6]. However, GPS/GNSS can normally only be used outdoors and wireless network-based positioning is generally only available indoors or in outdoor areas where infrastructure is installed. On the other hand, wireless positioning based on a mobile communication network is most suitable when considering the availability of positioning in all indoor and outdoor spaces. Wireless positioning techniques include Cell-ID, time of arrival [
7,
8], time-difference of arrival [
9], fingerprinting [
10,
11], etc. Among them, a fingerprinting or a trilateration technique can be used for positioning using signal strength information [
12]. In order to construct an efficient database in the fingerprinting technique or to implement a trilateration technique, a Signal Propagation Model (SPM) and location information of the base stations are basically required in the Long Term Evolution (LTE) system [
13,
14]. This information can be also used to create a fingerprint database. However, this information is managed as the property of the mobile carrier and is not provided free of charge to positioning technology developers or location-based service providers. This is the first motivation of this study.
The power of the signal transmitted from the antenna of each base station is attenuated as it propagates through space. In addition, the power is more attenuated as the signal penetrates structures such as buildings. In urban areas, therefore, SPMs are geographically different due to different spatial structures. Because of this, analyzing the Reference Signal Received Power (RSRP) measured at several locations at the same distance from a base station reveals that the signal variance is very large. Therefore, to develop LTE signal-based positioning technology, different SPMs must be used at different base stations [
15,
16]. This is the second motivation of this study.
Considering this situation, the following problem exists: the location information of the base station is required for Signal Propagation Modelling (SPMing) but is not provided by the mobile carrier. Here, SPMing means regressing the coefficients of a SPM. Even if the RSRP measurements can be converted into distance information using the SPM provided in the literature, it is impossible to accurately estimate the location of the base station based on the existing trilateration method because of the large variance of the RSRP measurements [
17]. This is the third motivation of this study.
With these motivations, in this paper, a technique that can simultaneously perform SPMing and base station positioning is proposed. From now on, this technique is called Simultaneous SPMing and base station Positioning (SMaP). First, multiple Virtual Locations (VLs) for base stations are created in the service area. The VLs can be easily created in a grid structure akin to building a fingerprint database within the service area. Based on the assumption that there is a base station in each VL, SPMing is performed using the location-based RSRP measurements. The residuals between the output of the estimated SPM and the RSRP measurements are then calculated. One VL with the minimum Sum of the Squared Residuals (SSR) value can be extracted. This VL is determined as the location of the base station. In addition, the SPM estimated based on the VL determined as the location of the base station is selected as the SPM suitable for the surrounding area of the corresponding base station.
The SPMs estimated based on the VLs far from the actual location of the base station do not properly reflect the attenuation characteristics of RSRP over distance. In this case, the calculated SSR values are large. Therefore, it is determined that the hypothesis that the VL with the minimum SSR value is the closest to the actual location of the base station among the multiple VLs is appropriate, and this hypothesis is to be verified through the experimental results.
Experiments were performed to verify the performance of the proposed technique. The RSRP measurements with GPS/GNSS-based location information are collected on the road through a vehicle equipped with a signal acquisition device, Nemo Handy. Among the collected signals, the proposed technique is applied after extracting the signal from Korea Telecom (KT), a mobile communication company. The actual location information of base stations in the service area was supported by KT for research purposes. Experimental analysis shows that the proposed technique can simultaneously perform SPMing and base station positioning. It also shows that the hypothesis that the VL with the minimum SSR value is closest to the actual location of the base station is true.
The rest of this paper is organized as follows. The characteristics of the RSRP measurements are described in
Section 2, and SMaP is explained in
Section 3. In
Section 4, experimental analysis is conducted and, finally,
Section 5 concludes this paper.
3. Simultaneous SPMing and Base Station Positioning
In order to perform SMaP, functions and procedures are described in
Figure 3. First, location-based RSRP measurements are obtained through a signal measurement device and SMaP is performed based on this. Based on
Figure 3, the idea of this paper to perform SMaP is described sequentially.
Step−A Searching Area Setting:
Figure 4a shows the location information of signals acquired from the KT base station of Physical Cell Identity (PCI)−144 in Seocho−dong, Seoul, Korea. The locations at which the measurements were obtained are satellite-based augmentation system-based differential GPS information. Based on these measurements, a searching area is set for the positioning of the base station. In this figure, the blue dots are the locations of the RSRP obtained through a vehicle test, and it can be seen that measurements were obtained only in limited areas such as roads. The red dashed square shows the searching area.
Step−B Virtual Locations Creation: VLs of a base station in the searching area are generated in the form of grid points,
where
is the latitude and longitude coordinates of the
jth VL (
) and
is the number of VLs.
In the searching area, the VLs of the base station are created in the form of grid points. In this study, a total of 1218 VLs were created by setting the interval of the grid points to 40 m, as shown in
Figure 4b. The interval of the grid points was determined experimentally, and detailed explanations are given through the experimental results in the next section.
Equation (3) can be rewritten as follows:
where
where
s is the total number of RSRP measurements and the superscript
j is the number of each VL.
and
are the latitude and longitude coordinates of the
jth VL and the
ith RSRP measurement acquisition location, respectively.
and
are the Earth radius in the latitude and longitude directions, respectively, and can be calculated as [
29]:
where
m and
are the equatorial radius and eccentricity of the Earth’s ellipsoid, respectively.
Step−C SPMing: SPMing is performed based on the created VLs. Assuming that the base station is at each virtual location, the two coefficients and in (9) are estimated by the least squares method as in (6). This process is repeatedly performed based on all VLs, and the estimated coefficients are stored as follows: and , where .
Step−D Calculate the Sum of Squared Residuals: When this process is completed, the SSR for each VL is calculated as follows:
where
is calculated as (10).
Step−E Minimum SSR-based Base Station Positioning: Finally, the position of the base station is determined as follows:
where
It is the hypothesis of this paper that the VL with the minimum SSR value is closest to the base station among all VLs. Therefore, it is the main idea of this paper to estimate the VL with the minimum SSR value as the location of the base station according to this hypothesis. Proof of this hypothesis is carried out through experimental analysis in the next Section.
4. Experimental Analysis
Experiments were conducted to verify the performance of the proposed SMaP. As shown in
Figure 4, the test site is the center of Seoul, Korea, and the target mobile carrier is KT. As shown in
Figure 5a, a smartphone S10 equipped with Nemo Handy, an LTE signal acquisition app from Keysight Technologies, was mounted in one vehicle and data was collected while driving for about 3 h on the road in the service area. This data includes downlink information of the LTE system and DGPS-based location information from which the information is obtained. An example of the signal acquired from Nemo Handy is shown in
Figure 5b. This figure shows the number of the serving PCI and RSRP information of the received signals, and the information was saved as a file with the extension nmf.
Figure 5c shows the urban environment seen from the vehicle during the signal acquisition experiment. From this figure, it can be seen that the signal propagation environment is not good because the tall buildings are attached to each other in the urban area, but signal propagation is easy in the road direction.
Among the collected data, only the data corresponding to Band−8 (downlink center frequency is 954.3 MHz and bandwidth is 10 MHz) was extracted through parsing the saved file. After extracting each piece of RSRP information and time-synchronized GPS/GNSS-based location information together, SMaP was performed.
Experimental analysis is performed in the following order. First,
Figure 6 shows that the minimum SSR value according to the VL has adequate regularity to prove the hypothesis that the VL with the minimum SSR value presented in this paper is close to the true location of the base station. In this figure, it is also shown that only the result of SPMing performed on the VL with the minimum SSR value is regressed in the form of (2). By showing in
Figure 7 that the estimated location of the base station is near the true location as a result of the SMaP performed based on this data, it is confirmed that the hypothesis presented in this paper is highly likely to be correct. Another hypothesis other than this one can be considered. That is, the position having the highest power among the acquired RSRP information may be estimated as the position of the base station. However, if there is a repeater,
Figure 7d shows that this method cannot accurately estimate the location of the base station. By showing in
Figure 8 the results of SMaP performed based on 6 PCI data different from the PCI shown in
Figure 7, it can be sure that the hypothesis presented in this paper is correct. As a result of SMaP, it can be seen in
Figure 8g that not only the location of the base station but also the coefficients of the SPM can be accurately estimated. A detailed description of each figure is as follows.
Figure 6 shows the signal processing of SMaP based on the measurements of the PCI–144 base station, shown in
Figure 4, among the extracted data. The algorithm was run on 1218 VLs with a total of 4127 measurements. SPMing is performed after assuming that the base station is in each VL. The algorithm processing time took 9.7 s in Matlab.
Figure 6a shows the SSR according to the VL. From this figure, it can be seen that SSR is calculated as a different value for each VL and there is one VL with the minimum SSR value.
Figure 6b shows the reconstructed distance–RSRP information based on this VL. Here, the distance was calculated through (10). From this figure, it can be confirmed that the RSRP measurements according to distance are distributed in the form of a log function.
Figure 6c shows the distance–RSRP information reconstructed by assuming an arbitrary VL, other than the VL with the minimum SSR value, as the base station. From these figures, if there is an error in the location information of the base station, it can be inferred that a large error will occur in SPMing due to the incorrect distance–RSRP measurement information formed based on this error. Because of this, the calculated SSR has a large value. Therefore, it can be confirmed that it is appropriate to determine the VL with the minimum SSR value obtained through
Figure 6a as the location of the base station.
Figure 7 shows the results of performing SMaP and the comparison technique. In
Figure 7a, the green circle is the actual location of the base station and the red x is the estimated location. The solid line in
Figure 7b is the estimates of RSRP according to distance based on the SPM estimated using the measurements. Measurements obtained at a distance of more than 500 m from the base station are mostly signals propagated through roads. The power of these signals is greater than the signals propagating toward the building, not the road. Sometimes, signals from a repeater are received in this area, as can be seen in
Figure 7d. The signal provided by the repeater is the same as that provided by the base station, and the PCI information included in the signal is the same as the neighboring base station PCI. Therefore, this signal is mistaken as a signal transmitted from a base station rather than a repeater, and the RSRP information of the signal received at a long distance from the base station has a large value, which adversely affects SPMing. Therefore, only signals acquired within 500 m are used for SPMing.
Figure 7a shows the result of the base station positioning based on the SSR denoted in
Figure 6a. Among all the VLs, the SSR calculated at one location has the smallest value, and this location was selected as the location of the corresponding base station. The estimation error was calculated as 31.52 m. That is, it can be confirmed that the location of the base station is accurately estimated because an error occurs within the interval of the grid points. In addition,
Figure 7c shows the fingerprint DB constructed at 40 m intervals based on the measurements. In this figure, it can be seen that a larger RSRP is being measured near the base station. However, it is not possible to determine the location with the strongest RSRP as the location of the base station because it may not be possible to obtain the measurements at the exact location of the base station by vehicle collection.
Additionally,
Figure 7d,e supports this explanation. This figure shows the fingerprint DB constructed based on the measurements of two different base stations. In the case of PCI–186, it seems that there are several base stations with the same PCI based on the signal strength information alone. This is because there are repeaters in certain buildings. As a result, it is possible to obtain a measurement with a large RSRP value even at a location far from the actual base station. In the case of PCI–176, it can be seen that the base station is near the place where the RSRP value is large. Since the base station is installed on the building, there are many multipath signals in the vicinity. As a result, the largest RSRP measurement is taken near the base station, not the location of the base station. For these reasons, determining the location with the largest RSRP measurement as the location of the base station can lead to large errors.
Figure 8 shows the results of base station positioning and SPMing of 6 base stations, and
Table 1 and
Table 2 summarize the experimental results. First, it can be seen from
Table 1 that the positioning accuracy of base stations varies according to the interval of grid points. However, it is not possible to know the correlation of accuracy according to the interval. The reason is that the error characteristic of the signal of the LTE system varies depending on the surrounding environment, and the surrounding environment of each base station is different. Through this experiment, in this paper, the interval of the grid points is set to 40 m. In
Figure 8a–f, the blue dots show the locations where the measurements were obtained, the green circle shows the true location of the base station, and the red x shows the base station location estimated through SMaP when the interval of the grid points is set to 40 m.
Figure 8a is the case where there are several repeaters, as shown in
Figure 7d, and
Figure 8c is the case where the measurements are spread widely around the base station, as shown in
Figure 8b. In the rest, since the base station is installed near the road, the measurements are long distributed along the road and are distributed within a measurable distance toward the side of the road. In all cases, it can be confirmed that the locations of the base stations were accurately estimated through the SMaP proposed in this paper.
What makes this possible is in the SPMing shown in
Figure 8g. The dotted line is the result of SPMing using all the measurements, and the solid line is that using only the measurements within 500 m from the VL. In the cases of PCI–186, 58, and 156, if SPMing is performed using both the signals transmitted from the repeaters and the signals propagated along the road, it is impossible to estimate the SPM that properly reflects characteristics of the signal propagation around the base station. However, by using only the measurements within a certain coverage, it is possible to estimate the SPM that relatively accurately reflects the signal propagation characteristics around the base station. The SPMing accuracy and the positioning accuracy of the base station have a close relationship. That is, when the SPMing is correct, the position of the base station can be accurately determined, and when the position of the base station is correct, the accurate SPM can be obtained. However, since the interval of the VLs is 40 m, it does not greatly affect base station positioning unless the SPMing is significantly wrong, such as with PCI–186.
The position indicated by a triangle in
Figure 8a–f is the location with the maximum power among the measurements and was estimated as the location of the base station. The estimation errors are shown in
Table 2. In the case of PCI–186, the problem is that the power of the repeater signal is the largest among the measurements, resulting in a very large positioning error. Other than that, the positions were estimated near the base stations. However, as described above, the base stations were installed on buildings or on power poles, and the measurements were acquired on the road. Thus, the signal with the maximum power among the acquired signals cannot be obtained right in front of the base station. In addition, signal synthesis by multipath signals occurs frequently in urban areas. Therefore, the positioning errors are relatively large.
Based on this result, it can be confirmed that the proposed method can accurately estimate the location of the base stations and perform SPMing at the same time only with measurements obtained in a limited area in an urban area. When SMaP is completed in this way, it is possible to establish the location information of all base stations in the service area and the SPM for the corresponding base station signals. This information can be used to generate data for various services. For example, it is possible to fill the gaps in the fingerprint DB. As shown in
Figure 7c, if the DB is constructed with only measurements, there may be places where there is no data even around the base station. The information on these places can be filled in by estimation using the location information and SPM of the base station estimated based on the SMaP proposed in this paper. Through this, it can be expected that the performance of fingerprinting-based smartphone positioning can be improved [
17].