1. Introduction
Advances in sensor technology popularize battery-powered wireless sensor networks (WSNs) in a large number of industrial areas including vehicle traffic monitoring, smart factories, internet-of-things devices, and public safety networks to name a few [
1,
2]. As sophisticated functional features are necessary to achieve a high level of the automation for conveniences, additional wireless sensors are accommodated for covering an increasing demand of the sensing data. On top of a sophisticated network formation, sensors are subject to a variety of physical environments where the cost of intervening in the sensor network is massive [
3,
4]. Therefore, it is indispensable to extend the lifetime of the WSN.
There have been a number of efforts to maximize the longevity of the WSN, such as mobile relays, optimal deployment of sensor nodes, and energy harvesting [
5]. In terms of routing and clustering, low-energy adaptive clustering hierarchy (LEACH) protocol has played a trailblazing role in the minimization of the energy consumption in the network [
6]. This protocol uses a hierarchical clustering-based routing strategy which forms multiple clusters of nodes, each with a single cluster head (CH) node. In each group, member nodes transfer their information to the CH node, and the CH node is responsible for aggregating whole received information and forwarding it to the base station (BS). Besides single-hop clustering, there are several multi-hop hierarchical clustering based protocol. Power-efficient gathering in sensor information systems (PEGASIS) forms chains of sensor nodes such that each node transfers the data back and forth with adjacent nodes [
7]. Hybrid energy-efficient distributed clustering (HEED) considers the residual energy and the proximity to adjacent nodes for the selection of the CH node [
8]. In addition, energy-efficient uneven clustering (EEUC) protocol partitions the set of sensor nodes into unequal sized clusters and uses multi-hop routing for inter-cluster communication to save the energy of the CH node located near the BS [
9].
For simplicity of the LEACH protocol, several variations have been developed for the minimization of the network energy consumption. In LEACH-C [
6], a central coordinator exists to control the cluster formation. By collecting all required network information, the central coordinator provides a solution for the optimized clusters. In LEACH-CE [
10], a cluster head selection has been improved. Upon the formation of a cluster, the member node with the highest remaining energy is selected as a CH node. This enables to distribute the energy consumption load uniformly over the member nodes. In LEACH-CKM [
11], a
K-means (or
K-medians) clustering algorithm has been adopted to enhance the clustering performance. With a sophisticated energy-efficient cluster formation, the WSN extends its lifetime.
However, aforementioned centralized protocols impose several practical requirements. They require accurate location information of all sensor nodes in the network. Additional hardware with localizing features, e.g., GPS, result in an increase in the cost and power consumption for sensor devices. Furthermore, the collection of the location information of individual nodes and the distribution of the clustering decision by the network coordinator become a signaling burden to the network. To cope with the practical issues, LEACH-AP has been proposed as a distributed solution [
12]. LEACH-AP develops a distributed algorithm based on affinity propagation [
13]. Hence, the location information of sensor nodes is not necessary and the signaling overhead becomes minimal. In addition, it outperforms the existing centralized counterparts in terms of the network lifetime. However, there are some drawbacks of LEACH-AP. LEACH-AP simplifies an energy consumption model of CH nodes for mathematical tractability [
12]. The assumption that the sensing area is apart away from the BS and transmission energy dominates the energy consumption of CH nodes ignores the impact of the dynamic cluster size on the energy consumption of CH nodes in the simplified model. Thus, variable energy consumption for CH nodes has not been properly included, thereby resulting in an occasional deviation from the guaranteed optimality. In addition, LEACH-AP assumes that the sensing data within a cluster is closely correlated and thus all sensing data collected at the CH node are perfectly compressed. This assumption means that the size of the aggregated data at CH nodes is the same as the sensing data of each sensor node. In reality, the correlation between the sensing data of neighboring sensor nodes varies. The size of the aggregated data at CH nodes also differs depending on the number of data and their correlation, and may degrade the performance of the simplified model. However, revising the simplified energy consumption model of CH nodes to reflect a new feature renders the model highly nonlinear.
To resolve the issue, this work develops an energy consumption-minimizing strategy called LEACH-XMP (LEACH-eXtended Message-Passing) based on an enhanced technique of the message- passing algorithm [
14]. Unlike the previous approach in [
12], the LEACH-XMP adopts a sophisticated but realistic nonlinear energy consumption model for CH nodes without oversimplification. The resulting solution is very efficient in improving the cluster formation and in the lifetime maximization of whole WSN. The main contributions of this paper are summarized as follows:
Generalized energy consumption model: This paper adopts a realistic energy consumption model of a sensor node. The model considers the transmission power dissipation and digital-processing power related to data decoding and aggregation. We capture the impact of the dynamic cluster size on the digital-processing power of a CH node which has been unaddressed in the previous studies. In addition, the data compression rate based on the correlation of the collected data is considered. It influences the transmission power dissipation of a CH node since an inaccurate energy consumption model of CH nodes incurs a suboptimal cluster formation. None of the previous works have properly addressed this issue to the best of the authors’ knowledge.
Distributed algorithm: The new energy consumption model renders the optimization formulation highly nonlinear. We propose a message-passing approach that can tackle this nonlinearity very efficiently. We construct a graphical model that corresponds to the WSN of interest and develop an efficient distributed algorithm based on it. The advantages of the proposed algorithm include that (i) the algorithm is distributed by nature; (ii) computational load for each node is minimal; and (iii) the algorithm converges rapidly with only a small number of message-passing iteration. The advantages characterize the usefulness of the proposed algorithm for practical implementations.
Performance: The performance of the proposed algorithm is verified with extensive simulation results. The proposed algorithm outperforms all the previous approaches in terms of energy consumption rate, total delivered data, and network lifetime. In particular, the proposed algorithm improves the network lifetime up to compared to existing LEACH protocols in the network lifetime. Based on the realistic energy consumption model, the proposed algorithm provides an efficient solution for different network sizes. As the network size grows, the improvement also increases. This is one of the prominent main benefits of the proposed algorithm.
This paper is organized as follows:
Section 2 describes the system model of generalized LEACH and
Section 3 specifies the algorithm of LEACH-XMP. Subsequently,
Section 4 provides simulation results of proposed algorithm. Finally,
Section 5 draws the conclusion.
2. System Model
Consider a WSN that deploys a set of
N sensor nodes denoted by
(see
Figure 1). A single BS exists in the network and receives environmental sensing data from sensor nodes. Sensor nodes initially broadcast their identification number and remaining energy information. The received signal strength is used to estimate the distances from neighboring nodes. The LEACH protocol [
6] proceeds with set-up phase and steady-state phase. In the set-up phase, clusters are constructed by grouping sensor nodes, where time-slot-based scheduling is also determined within each cluster. In the following steady-state phase, sensing packets from sensor nodes are collected at their CH nodes and then forwarded to the BS.
We desire to maximize the lifetime of the WSN. Such a goal is analogous to maximizing the remaining energy of the WSN. The remaining average total energy of sensors the WSN is monitored to maximize on a regular basis. Let
be the
k-th time instant where the measurement from sensors is reported to the BS. At time instant
, all sensors measure their remaining energy along with the sensing data and share such information with neighbors. If at least one of their neighbors no longer work, they reconfigure the WSN to maximize the remaining energy. We assume that all sensors have the same amount of the initial energy denoted by
. Let
and
denote the remaining energy of sensor
j and the average remaining energy of live nodes in the WSN at time instant
, respectively. Thus, it holds that
for all sensors. Also, the set of live sensors at time instant
is denoted by
. Once the WSN is configured such that all sensors have their own roles in the network, sensor
j consumes the energy
depending on its role during the interval
. The remaining energy of sensor
j is given by
and the the average remaining energy of live sensors is given by
Hence, the maximization of the total remaining energy becomes equivalent to the minimization of the energy (corresponding to the second term in (
2)) for the communication at each reporting interval.
The energy consumption of a sensor node consists of the transmission energy dissipation and the digital-processing energy consumption proportional to the amount of the processed data [
6]. The transmission power dissipation depends on the amount of processed data and the propagation loss. To quantify this, we let
,
, and
denote signal-processing energy per bit, transmission energy per bit in free-space propagation, and the transmission energy per bit in multipath propagation, respectively. The energy consumption of a single sensor varies from the wireless propagation model. According to the energy consumption model in [
6], the free-space propagation becomes dominant when the transmission distance is small, while the multipath propagation prevails in the opposite case. A pairwise transmission is assumed to convey
b bits of sensing data. The energy consumption of node
i transmitting to node
j is denote by
and is given by
where
is the threshold distance associated with the point where the energy consumptions by multipath and free-space propagation are identical. A CH node is responsible for processing and aggregating the received data into a single packet to transfer to the BS. In addition to the energy dissipation of
per bit for the signal reception and transmission, a CH node consumes
per bit to aggregate own received data. For an
n-member cluster, the CH node transmits the collected information of
bits to the BS. In previous works [
6], it is assumed that the CH node can compress the collected information of
bits into a single data packet of a constant length
b to transmit to the BS. However, in practice, the aggregated information may not be compressed into a sufficiently compact data packet. The number of bits required for representing the information can increase according to the relationship among the data from distinct nodes. To characterize it, we consider a generalized model by considering the correlation parameter between different data, denoted by
. This parameter, taking on a value from
, is, in fact, associated with the best rate which an individual member signal can be compressed with. Let
denote the number of bits that the CH node of an
n-member cluster can obtain by compressing
n different data of
b bits. Under the assumption of uniform
among all data from individual sensors,
is evaluated by the inclusion-exclusion principle as a nonlinear function of
n, which can be expressed as
Note that, if
is zero, i.e., all measurements are independent of each other and share no common information,
becomes
. On the other hand, if
is one, i.e., all measurements are identical and only one measurement is enough,
becomes
b, which corresponds to the assumption made in previous works [
6]. This function increases sublinearly with respect to
n but decreasing with respect to
as depicted in
Figure 2. Thus, the transmitting energy cost per bit of CH node
j which is apart from the BS by the distance
amounts to
We include additional requirement for the eligibility of a CH node [
12]. Only a node that holds
is eligible for the CH selection. Here,
can be acquired by consensus-based techniques [
15] without global notification of the state of individual nodes.
For clustering formation, several decision tasks are fulfilled such that the number of clusters and their CH nodes are identified and member nodes are affiliated to one of those clusters. Each node decides its own role between CH and member node by considering the corresponding energy consumptions as similarity measures used in the clustering problem. Also, once a node is chosen as a CH node, it cannot become a member node of another cluster. These decisions boil down to two requirements: (i) Each node chooses its CH node. To become a CH node, the node should choose itself as its CH node; (ii) once a node is chosen by other nodes to act as a CH node, it cannot choose another node as its CH node. For concrete mathematical formulation, we introduce binary variable to indicate the membership of node i, such that indicates node i selects node j as a CH node. Once node i selects node j as its CH node, node j should choose itself as own CH node, i.e., . Let and be the set of neighboring nodes around node j and the number of neighboring nodes connected to CH node j (excluding CH node j), respectively. It holds that . We can reformulate the energy consumption of CH node j as
Note that unlike the simplified model in [
12], Equation (
6) captures a nonlinear energy consumption model of a CH node. The resulting formulation of the WSN is given by
where
is the number of neighbors around node
j. The first term of the objective function in (
7a) corresponds to the energy consumption of member nodes and the second term corresponds to the energy consumption of CH nodes. The first constraint indicates that each member can select only one CH node and the second constraint dictates that the selected node chooses itself as a CH node of the affiliated cluster. Note that solving the optimization in (
7a)–(
7d) is a computationally demanding task since the problem is inherently of combinatorial nature in addition to high nonlinearity of objective function and constraints.
4. Simulation Results
The simulation is conducted using standard Python 3 with NumPy and SciPy libraries. In this simulation, 100 sensor nodes are initially scattered at random in a 100 m × 100 m square region in between (0, 0) and (100, 100). Simulation parameters are set referring to [
6] and summarized
Table 1. We refer to the interval between consecutive cluster reformations as a single round, which amounts to 7 days. The predetermined cluster number required by the existing algorithms for comparison is set to
[
6]. Results are averaged over 100 random drops of wireless sensors. The lifetime of the WSN is calculated by the number of live nodes, which is evaluated at each round that consists of the set-up phase and the steady-state phase. The network lifetime and total amount of transferred data are measured by evaluating the number of rounds until the death of the last node.
We compare the performance of the proposed LEACH-XMP with existing LEACH-based protocols including LEACH-C, LEACH-CE, LEACH-CKM, and LEACH-AP. Since the existing techniques assume that the collected data can be perfectly compressed at CH nodes, i.e.,
for fair comparison, which is the special case of the proposed LEACH-XMP.
Figure 4 compares the ratio of live nodes over time, while
Figure 5 compares the total amount of data received at the BS over time in case A where the BS is located at (50, 175). As time elapses, the number of live nodes decreases. LEACH-XMP provides
,
,
, and
increase in the network lifetime as compared to LEACH-C, LEACH-CE, LEACH-CKM, and LEACH-AP, respectively. Accordingly, LEACH-XMP transfers the largest amount of sensing data to the BS among the LEACH variations.
Table 2 compares the computational complexity of LEACH-XMP and existing LEACH variations. In LEACH-C and LEACH-CE, the cluster formation is fulfilled by a simulated annealing algorithm [
6]. At each iteration of the simulated annealing algorithm, the roles of the partial set of nodes are interchanged to improve a temporal solution, and the resulting computational complexity per iteration is low, i.e.,
. However, a simulated annealing algorithm typically requires at least thousands of iterations for convergence,
t. Specifically, we set
in our simulations considering its slow convergence. On the other hand, our simulations results show that LEACH-CKM requires relatively small number of iterations, e.g., less than 10 iterations in most cases. In general, the performance of the
K-medians algorithm adopted in LEACH-CKM is strongly affected by the initial choice of the value of
K and CH nodes. Thus, the algorithm requires at least dozens of runs with different initial points to find more improved solutions. Since our simulations set the total number of runs for LEACH-CKM to 30, the overall computational complexity of LEACH-CKM increases significantly. At each iteration of message-passing, LEACH-AP requires
operations all over the network, while LEACH-XMP requires
operations since LEACH-XMP involves the sorting operation in
message calculation, which is slight increase in complexity. Note that LEACH-XMP efficiently finds a near optimal solution without multiple runs of different initialization setups and rapidly converges within dozens of iterations as will be confirmed in the following simulations results. Overall, LEACH-XMP has comparable complexity over other LEACH variations.
Figure 6 shows the impact of the BS position and energy distribution. Five simulation scenarios A-E are considered as listed in
Table 3. Five columns of each case correspond to LEACH-C, LEACH-CE, LEACH-CKM, LEACH-AP, and LEACH-XMP, respectively. Three stacked bars in each column are associated with the times elapsed until 50%, 90%, and 99% of the nodes are out of operation, respectively. In all cases, LEACH-XMP outperforms in terms of the lifetime of network, irrespective of the position of the BS. In particular, the 50%-node-survival time is enhanced considerably for the proposed algorithm. This shows that LEACH-XMP sophisticatedly optimizes nonlinear energy consumption which has not been properly handled previously.
Figure 7 shows how the total energy consumption of the network on a single round varies with the number of sensor nodes. As the network grows, the total energy consumption increases gradually. As compared to existing LEACH variations, message-passing-based techniques, i.e., LEACH-AP and LEACH-XMP, provide energy-efficient solutions. Furthermore, the performance improvement of the proposed LEACH-XMP over LEACH-AP becomes noticeable as the network size increases, in that the proposed LEACH-XMP finds a more energy-efficient solution than LEACH-AP from the consideration of the generalized energy consumption model for CH nodes.
Figure 8 shows how the average number of clusters changes depending on network size. As the network grows, the average number of clusters increases in both LEACH-AP and LEACH-XMP. However, the proposed algorithm maintains a small number of clusters comparing to LEACH-AP. LEACH-AP uses a simplified energy consumption model by ignoring non-linear terms. Thus, it normally underestimates the energy consumption of CH nodes, creating more number of clusters. By adopting a realistic energy consumption model, the proposed LEACH-XMP can identify an efficient solution for dynamic cluster sizes in large-scale networks. This explains the total energy consumption performance in
Figure 7.
We now consider a general case of
, i.e., non-ideal compression at CH nodes. According to (
4) and
Figure 2, the compressed data size to be delivered from CH nodes to the BS varies depending on
and cluster size. The compressed data size dominates the energy consumption of CH nodes and the overall cluster formation.
Figure 9 shows how the fraction of live nodes reduces as time elapses in the simulation. In all regimes of the
value, the remaining energy of nodes gradually become depleted and the nodes are excluded from the network. If the cluster formation is identical, a low value of
leads to the increase in energy consumption of CH as expected from (
6). However, the lifetime of the network still extends even in a low
regime. The clustered results from LEACH-XMP offset the increment of
. CH nodes are relieved from the burden to deliver a large amount of the data compressed with a very low rate and saves the remaining energy for survival. Thus, LEACH-XMP outperforms in a general case of
as compared to previously proposed protocols which fail to compensate the effect by lower
. If
,
increases quickly as
Figure 2 and the cluster formation alone cannot completely compensate for this effect.
Figure 10 illustrates how the average number of clusters varies with the rate of the data compression. The average number of clusters gradually increases with
. With small
, the amount of compressed data at CH nodes is large, causing the energy consumption burden to the CH nodes. Accordingly, the network becomes energy-efficient when it maintains a lower number of clusters. By contrast, the amount of the compressed data at CH nodes is small with a large
. This implies a lower energy consumption burden to the CH nodes, thereby populating more clusters in the network.
Figure 11 shows how fast the proposed LEACH-XMP converges in various network sizes. In all ranges of the network size, the proposed LEACH-XMP converges at the latest within 35 iterations. In general, a larger network size requires slightly longer convergence time. However, the performance difference becomes indistinguishable after 10 to 20 iterations. With the low signaling overhead, the proposed LEACH-XMP is beneficial for practical implementations.