3.1. Problem Statement
Consider a two-dimensional region of space, where there is a sensor network given by a set of nodes $N = \{N_1, N_2, \ldots, N_{m+n}\}$, which consists of $m$ anchor nodes and $n$ sensor nodes. The coordinates of these nodes can be described with definition (1):

$$N_i = (x_i, y_i), \quad i = 1, \ldots, m+n. \tag{1}$$
In WSN $N$, the positions of the $m$ anchor nodes $N_i \in A$, $A = \{N_i \mid i = 1, \ldots, m\}$, are known, while the positions of the $n$ sensor nodes $N_j \in B$, $B = \{N_j \mid j = m+1, \ldots, m+n\}$, are unknown. In a multi-hop network, the hop counts between pairs of nodes are already known. The Euclidean distance from node $N_i$ to $N_j$ ($i \neq j$) is given by Equation (2):

$$d_{ij} = \sqrt{(x_i - x_j)^2 + (y_i - y_j)^2}. \tag{2}$$
The hop count from node $N_i$ to $N_j$ ($i \neq j$) is denoted $h_{ij}$. Taking Figure 1a as an example, in such a multi-hop network $d_{ij}$ is roughly proportional to $h_{ij}$, i.e., $d_{ij} \propto h_{ij}$. Thus, the localization problem can be formulated as

$$f = \operatorname*{arg\,min}_{f} \sum_{N_i, N_j \in A} \big(d_{ij} - f(h_{ij})\big)^2, \qquad \hat{d}_{kt} = f(h_{kt}), \tag{3}$$

where $N_i, N_j \in A$, $N_k \in B$, and $h_{kt}$ denotes the hop count between node $N_k$ and node $N_t$ ($N_t \in A \cup B$). Assume that the distances and hop counts from $N_i$ to the other anchors are represented by the vectors $\mathbf{y}_i$ and $\mathbf{x}_i$, respectively. Our aim is to learn a function $f: X \to Y$ from the $m$ training examples $\{(\mathbf{x}_i, \mathbf{y}_i)\}_{i=1}^{m}$, where $\mathbf{x}_i \in X$ and $\mathbf{y}_i \in Y$. The input space $X$ (hops) plays the role of the independent variable, and the output space $Y$ (distances) that of the dependent variable. The function $f$ is represented by a linear model

$$f(\mathbf{x}) = \mathbf{w}^{\mathsf{T}}\mathbf{x}, \tag{4}$$

where $\mathbf{w}$ is an $m \times 1$ vector that contains the coefficients of the linear function. Usually, linear least squares (LLS) is used to determine $\mathbf{w}$. It uses the vertical distances between the observed values $y_i$ and the predictions $f(\mathbf{x}_i)$, which are known as the residuals $r_i = y_i - f(\mathbf{x}_i) = y_i - \mathbf{w}^{\mathsf{T}}\mathbf{x}_i$. In the LLS case, the sum of the squared residuals is minimized, which in matrix form is

$$E(\mathbf{w}) = (\mathbf{y} - X\mathbf{w})^{\mathsf{T}}(\mathbf{y} - X\mathbf{w}) = \|\mathbf{y} - X\mathbf{w}\|_2^2, \tag{5}$$

with $X$ being the $m \times m$ design matrix that stacks all of the hop-count vectors, and $\mathbf{y}$ the $m \times 1$ vector collecting the distance values:

$$X = \begin{bmatrix} \mathbf{x}_1^{\mathsf{T}} \\ \vdots \\ \mathbf{x}_m^{\mathsf{T}} \end{bmatrix}, \qquad \mathbf{y} = \begin{bmatrix} y_1 \\ \vdots \\ y_m \end{bmatrix}, \tag{6}$$

where each row corresponds to one input/output example. The coefficients are obtained by minimizing Equation (5):

$$\hat{\mathbf{w}} = \operatorname*{arg\,min}_{\mathbf{w}} \|\mathbf{y} - X\mathbf{w}\|_2^2. \tag{7}$$

The minimizer of problem (7) is

$$\hat{\mathbf{w}} = (X^{\mathsf{T}}X)^{-1}X^{\mathsf{T}}\mathbf{y}. \tag{8}$$

Thus, to make a prediction for a novel input $\mathbf{x}_{\text{new}}$, we only need to know the model and its parameters $\hat{\mathbf{w}}$:

$$\hat{y}_{\text{new}} = f(\mathbf{x}_{\text{new}}) = \hat{\mathbf{w}}^{\mathsf{T}}\mathbf{x}_{\text{new}}. \tag{9}$$
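To make Equations (5)–(9) concrete, the following minimal NumPy sketch fits $\hat{\mathbf{w}}$ and predicts a distance for a new hop-count vector. The arrays are invented placeholders rather than data from the paper, and `np.linalg.lstsq` is used in place of the explicit inverse in Equation (8) for numerical stability.

```python
import numpy as np

# Invented example: each row of X is one anchor's hop-count vector x_i,
# and y[i] is the corresponding observed distance y_i (Equation (6)).
X = np.array([[0., 1., 2.],
              [1., 0., 1.],
              [2., 1., 0.]])
y = np.array([0., 25., 48.])

# Equations (7)-(8): least-squares fit of the coefficient vector w.
w_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

# Equation (9): prediction for a novel hop-count vector x_new.
x_new = np.array([1., 1., 2.])
print(w_hat @ x_new)
```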
3.2. Localization Algorithm
The proposed LSRR-LA consists of three parts: measurement, training, and localization.
Part A (Measurement): Assume that $\mathbf{h}_i = [h_{1,i}, \ldots, h_{m,i}]^{\mathsf{T}}$, $i = 1, \ldots, m$, is the hop-count vector from anchor $N_i \in A$ to the other anchors; we can thus describe the hop-count matrix of all anchors as $H_1 = [\mathbf{h}_1, \ldots, \mathbf{h}_m]$. Accordingly, the Euclidean distance vector and matrix between anchors can be represented as $\mathbf{d}_i = [d_{1,i}, \ldots, d_{m,i}]^{\mathsf{T}}$, $i = 1, \ldots, m$, and $D_1 = [\mathbf{d}_1, \ldots, \mathbf{d}_m]$, respectively. Any sensor node $N_j \in B$ has a hop-count vector to all anchors, $\mathbf{h}_j = [h_{1,j}, \ldots, h_{m,j}]^{\mathsf{T}}$, $j = m+1, \ldots, m+n$. Consequently, the matrix $H_2 = [\mathbf{h}_{m+1}, \ldots, \mathbf{h}_{m+n}]$ collects the hop-count vectors from all of the sensor nodes to all of the anchors.
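As an illustration of Part A, the short NumPy sketch below assembles $D_1$ from anchor coordinates via Equation (2) and stacks hop-count vectors into $H_1$ and $H_2$. The coordinates and hop counts are invented placeholders; in a real deployment the hop counts would be obtained from the network (e.g., by flooding), not hand-written.

```python
import numpy as np

# Hypothetical coordinates of m = 3 anchors in a 2-D region.
anchors = np.array([[0., 0.],
                    [40., 0.],
                    [0., 30.]])

# D1[i, j] = Euclidean distance between anchors i and j (Equation (2)).
diff = anchors[:, None, :] - anchors[None, :, :]
D1 = np.sqrt((diff ** 2).sum(axis=-1))

# Hypothetical hop counts: columns of H1 are the anchor-to-anchor
# vectors h_1..h_m; columns of H2 are the sensor-to-anchor vectors.
H1 = np.array([[0., 2., 2.],
               [2., 0., 3.],
               [2., 3., 0.]])
H2 = np.array([[1., 2.],
               [2., 1.],
               [2., 3.]])
```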
Referring to Formula (10), we express the hop-count matrix $H$ (or the distance matrix $D$) by splitting it into four blocks:

$$H = \begin{bmatrix} H_1 & H_2 \\ H_3 & H_4 \end{bmatrix}, \qquad D = \begin{bmatrix} D_1 & D_2 \\ D_3 & D_4 \end{bmatrix}, \tag{10}$$

where $H_1$ (resp. $D_1$) is the $m \times m$ block for the anchors versus themselves, $H_3$ (resp. $D_3$) is the $n \times m$ block for the sensors versus the anchors, and $H_4$ (resp. $D_4$) is the $n \times n$ block for the sensors versus themselves. By symmetry, $H_2 = H_3^{\mathsf{T}}$ and $D_2 = D_3^{\mathsf{T}}$. Note that $H_1$, $H_3$, and $H_4$ are known, while $D_3$ and $D_4$ are unknown. The goal is to predict $D_3$ (or $D_2$) from $H$ and $D_1$.
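In code, the block partition of Formula (10) is plain array slicing; the sketch below uses the placeholder sizes $m = 3$, $n = 2$ from the previous snippet.

```python
import numpy as np

# Placeholder full hop-count matrix for m = 3 anchors and n = 2 sensors;
# anchor rows/columns come first, then sensor rows/columns.
m, n = 3, 2
H = np.arange((m + n) ** 2, dtype=float).reshape(m + n, m + n)

H1 = H[:m, :m]  # anchors vs. anchors
H2 = H[:m, m:]  # anchors vs. sensors (H2 == H3.T for symmetric hop counts)
H3 = H[m:, :m]  # sensors vs. anchors
H4 = H[m:, m:]  # sensors vs. sensors
```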
Part B (Training): We now have the input matrix $H_1$ and the output matrix $D_1$, with which we can train the linear model proposed in Formula (4). In contrast to (4), however, the coefficient vector $\mathbf{w}$ becomes an $m \times m$ matrix $W$, and the outputs are stored in the matrix $D_1$. Thus, the LLS solution becomes

$$\hat{W} = (H_1^{\mathsf{T}}H_1)^{-1}H_1^{\mathsf{T}}D_1. \tag{11}$$
In Formula (11), the range of variation of the distance matrix $D_1$ differs from that of the hop-count matrix $H_1$. If the raw data were used directly, the results would be distorted by this difference in scale. We can eliminate this effect by a preprocessing operation such as normalization, so that $D_1$ and $H_1$ share the same scale and are equally emphasized. In this paper, the normalized matrices of $H_1$ and $D_1$ are denoted by $\bar{H}_1$ and $\bar{D}_1$, respectively.
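This excerpt does not specify the normalization scheme, so the helper below assumes simple min-max scaling as one plausible, invertible choice; the inverse is needed again in Part C to map $\bar{D}_3$ back to $D_3$.

```python
import numpy as np

def normalize(M):
    """Min-max scale a matrix to [0, 1]; also return the (min, max)
    pair needed to reverse the operation later."""
    lo, hi = M.min(), M.max()
    return (M - lo) / (hi - lo), (lo, hi)

def denormalize(M_bar, params):
    """Reverse operation of normalize()."""
    lo, hi = params
    return M_bar * (hi - lo) + lo
```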
When calculating the model parameter $W$, we use the data of the anchor nodes. If we include more anchor nodes, we fit the training data more accurately, which means that the model minimizes the training errors more effectively. However, minimizing the training errors is not our goal; we want the model to estimate the locations of the sensor nodes accurately, that is, the model should generalize. With many anchors, fitting the full model without penalization results in large prediction intervals, and an LLS regression estimator may not uniquely exist. In order to avoid over-fitting, it is necessary to add a regularization or penalty term to the model [21]. In addition, because the LLS estimates depend upon $(\bar{H}_1^{\mathsf{T}}\bar{H}_1)^{-1}$, there can be problems in computing $W$ if $\bar{H}_1^{\mathsf{T}}\bar{H}_1$ were singular or nearly singular. In this paper, a term $kI$ ($k > 0$) is added to the matrix $\bar{H}_1^{\mathsf{T}}\bar{H}_1$ to remedy this problem. This method is also called Tikhonov regularization, the most commonly used regularization of ill-posed problems. The estimated $W$ can then be described as

$$\hat{W} = (\bar{H}_1^{\mathsf{T}}\bar{H}_1 + kI)^{-1}\bar{H}_1^{\mathsf{T}}\bar{D}_1. \tag{12}$$

Here, the matrix $I$ is an identity matrix. With the regularization term, the model can effectively avoid over-fitting. The scalar $k$ in the formula is called the hyper-parameter, and it balances the training errors against the regularization term [22]. Considering that $\bar{H}_1^{\mathsf{T}}\bar{H}_1 + kI$ may still be ill-conditioned when $k$ is very small, we simply set the value of the parameter $k$ to 0.01 in the later simulation part.
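A minimal sketch of the regularized solve in Equation (12), under the same min-max normalization assumption as above; `np.linalg.solve` is used rather than forming the matrix inverse explicitly.

```python
import numpy as np

def minmax(M):
    """Min-max scale M to [0, 1] (the assumed normalization)."""
    return (M - M.min()) / (M.max() - M.min())

def train(H1, D1, k=0.01):
    """Equation (12): solve (H1b^T H1b + k I) W = H1b^T D1b for W,
    where H1b and D1b are the normalized input/output matrices."""
    H1b, D1b = minmax(H1), minmax(D1)
    A = H1b.T @ H1b + k * np.eye(H1b.shape[1])
    return np.linalg.solve(A, H1b.T @ D1b)
```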
Part C (Localization): Now, the normalized hop-count matrix $\bar{H}_3$ and the coefficient matrix $\hat{W}$ are used to estimate the normalized matrix $\bar{D}_3$:

$$\bar{D}_3 = \bar{H}_3\hat{W}. \tag{13}$$

After obtaining the result of Equation (13), we can derive $D_3$ by reversing the normalization of $\bar{D}_3$, and then the trilateration method or the maximum likelihood method is used to estimate the coordinates of the unknown nodes [23].
The pseudo-code of LSRR-LA is illustrated in Algorithm 1.
Algorithm 1: LSRR-LA
Input: $H$: hop-count matrix of the anchors and sensors; $D_1$: distance matrix for the anchors versus themselves; $k$: hyper-parameter.
Output: estimated locations of the sensors.
1. Normalize matrices $H_1$ and $D_1$, yielding $\bar{H}_1$ and $\bar{D}_1$;
2. Calculate the mapping matrix $\hat{W}$ by Equation (12);
3. Normalize matrix $H_3$, yielding $\bar{H}_3$;
4. Calculate matrix $\bar{D}_3$ by Equation (13);
5. Derive $D_3$ by reversing the normalization of $\bar{D}_3$;
6. Estimate the coordinates by using the maximum likelihood method.
For the specific details of step 6 in Algorithm 1, the reader can refer to [23].
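The details of step 6 are deferred to [23]; as one standard realization (an illustrative stand-in, not necessarily the exact estimator of [23]), the sketch below linearizes the range equations by subtracting the first anchor's equation, which cancels the quadratic term, and solves the result by least squares.

```python
import numpy as np

def multilaterate(anchors, dists):
    """Estimate a 2-D position from anchor coordinates (m x 2) and
    estimated anchor distances (m,). Subtracting the first anchor's
    range equation removes the quadratic ||p||^2 term, leaving a
    linear system solved in the least-squares sense."""
    A = 2.0 * (anchors[1:] - anchors[0])
    b = (dists[0] ** 2 - dists[1:] ** 2
         + (anchors[1:] ** 2).sum(axis=1)
         - (anchors[0] ** 2).sum())
    p, *_ = np.linalg.lstsq(A, b, rcond=None)
    return p

# Sanity check with invented anchors and exact distances.
anchors = np.array([[0., 0.], [40., 0.], [0., 30.], [40., 30.]])
true_p = np.array([10., 12.])
dists = np.linalg.norm(anchors - true_p, axis=1)
print(multilaterate(anchors, dists))  # approximately [10. 12.]
```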