1. Introduction
In wireless sensor networks, sensor nodes are generally not rechargeable or replaceable since they are deployed in large numbers; furthermore, the deployed areas are usually disaster areas that are difficult to reach. Thus, considering that sensor nodes are usually powered by limited energy resources, improving the lifetime of the network has always been a primary objective of wireless sensor networks.
Many efficient routing protocols have been proposed to extend the lifetime of wireless sensor networks [1-5]. Clustering-based approaches have especially attracted a great deal of interest because these approaches can achieve high energy efficiency and, at the same time, increase the network scalability. Typically, in clustering-based approaches, cluster heads are elected by local competitions. Usually the highest-energy nodes in the neighborhood are elected as cluster heads. Then, each nonhead node picks a cluster head with which to form a cluster. After clustering is finished, cluster heads aggregate the sensed data from the member nodes and forward it to the sink [6-8]. To optimize the transmission energy consumption, cluster heads eliminate duplicates and redundancies before the actual transmission.
Clustering-based approaches usually employ a multi-hop communication model, which is proven to be more efficient than direct communication model, to deliver the sensed data from the data sources to the sink. However, in such approaches, cluster heads closer to the sink tend to deplete their energy resources much faster than the other cluster heads due to equal clustering. More specifically, cluster heads placed closer to the sink are usually involved in many more traffic relays, and thus, they tend to deplete their energy resources much faster than other cluster heads (hot-spot problem). This may result in partitioning the sensor network as well as reducing the sensing coverage. To address this hot-spot problem, energy-efficient unequal clustering (EEUC) has been proposed in [9]. In EEUC, the cluster size decreases as the distance from the cluster head to the sink decreases. As a result, in EEUC, clusters closer to the sink have smaller cluster sizes than the other clusters. More specifically, EEUC places more, smaller clusters closer to the sink so that they consume less energy for sensing, aggregating, and intra-cluster communications. This unequal clustering may result in fewer burdens on specific cluster heads as well by selecting different routes for delivering data. However, EEUC is less effective for sensor networks with random distributions. EEUC achieves longer lifetimes by making a tradeoff between cluster energy consumption (i.e., sensing and intra-cluster transmissions) and energy consumption for inter-cluster communications. However, EEUC does not guarantee that a smaller-sized cluster will have a smaller node density in a randomly distributed sensor network. Placing small clusters in a crowded area may make EEUC less effective. To address this problem, we propose a novel approach, called CACD (clustering algorithm considering node distribution) that adjusts the size of each cluster considering the node density. Furthermore, CACD automatically adjusts the node density of each cluster on a per-cluster energy-consumption basis. In CACD, we first compute the optimized number of cluster heads for a given sensor network and then, adjust the size and the node density of each cluster according to the distance between the sink and the cluster head. Simultaneously, we estimate the energy level that each cluster requires for sensing and intra-cluster communication to allow for further adjustments. In addition, to keep the sensor network balanced, we partially reconstruct the cluster structures unlike previous methods that rebuild cluster structures at the beginning of each round. The reminder of this paper is organized as follows. In Section 2, we briefly review previous work for wireless sensor network. Section 3 gives the description of the proposed CACD approach. In Section 4, performance evaluation and analysis are given by simulation results. Finally, we conclude in Section 5.
2. Related Work
2.1 LEACH and LEACH-C
Low-energy adaptive clustering hierarchy (LEACH) is the first clustering-based approach for wireless sensor networks [10]. LEACH operates on a round-to-round basis and elects a set of cluster heads by a localized competition at the beginning of each round. To evenly distribute the energy dissipation among all the nodes, LEACH rotates the cluster heads among the nodes. In LEACH, each round has two phases: the set-up phase and the steady-state phase as shown in Fig. 1. At the beginning of the setup phase, each node decides whether or not to be a cluster head with a certain possibility, pi(t):
where N is the total number of nodes, k is the number of clusters and r is the number of rounds that have passed. Ci(t) = 0, if node i has been a cluster head in most recent (r mod N/k) rounds and 1 otherwise. Thus, Eq. (1) ensures that all nodes are cluster heads the same number of times.
Round composition of LEACH.
After the cluster head election, each cluster head for the current round broadcasts an ADV message using the CSMA (carrier sense multiple access) protocol to the rest of the nodes. Each non-cluster head node that receives the ADV message decides the cluster to which it belongs for the round, according to the strength of the ADV message. After each non-cluster head node has decided on its cluster, it informs its cluster head that it will be a member of the cluster by sending a Join-REQ message. Then, the cluster head sets up a TDMA (time division multiple access) schedule and broadcasts the schedule to the member nodes. In steady-state phase, sensor nodes send their data to the cluster heads during their assigned time slots. To prevent wasted energy, sensor nodes turn off their radio except during their allocated time slots. Cluster heads aggregate the sensed data and forward it to the sink using the CDMA module. Unfortunately, LEACH does not guarantee well-established cluster structures. Since LEACH elects the cluster heads according to residual energy, its cluster structure is highly dependent on the distribution of the sensor nodes. To address this problem, a centralized LEACH clustering algorithm (LEACH-C) has been proposed. However, because LEACH-C assumes that the sink knows the locations of the sensor nodes, this approach is more expensive and requires more transmission cost to form a cluster structure.
2.2 EEUC
Clustering-based algorithms for wireless sensor networks generally use a multi-hop communication model to forward data to the sink. This indicates that cluster heads closer to the sink are involved in many more traffic relays, and thus, tend to deplete their energy resources much faster than other cluster heads. To address this problem, EEUC proposes the unequal clustering scheme shown in Fig. 2. EEUC adjusts the cluster size based on the distance between a cluster head and the sink, such that clusters closer to the sink are smaller. More specifically, EEUC decides the cluster size by Eq.(2).
where Rcomp0 is the pre-defined maximum cluster size. dmax and dmin denote the maximum and minimum distances between the sink and a cluster head, respectively. d(si, sink) indicates the distance between the cluster head,si and the sink, and c is a constant coefficient between 0 and 1. According to Eq. (2), the size of a cluster varies from (1-c)Rcomp0 to Rcomp0. In EEUC, clusters closer to the sink consume less energy for activities inside the clusters (i.e. sensing, aggregating and intra-cluster communications) and thus can afford more traffic relays. However, EEUC becomes less effective for randomly distributed sensor networks because it does not guarantee that a cluster with a smaller cluster size has a smaller node density for such sensor networks. It is obvious that small clusters with a large node density make the unequal clustering-based approach less effective.
EEUC (unequal clustering scheme).
In addition, for wireless sensor networks with random distributions, EEUC may have clusters that are not involved in any traffic relays due to their small size, as shown in Fig. 3. In Fig. 3, clusters depicted by the dashed line are not involved in any data transmissions and thus the cluster heads will consume much less energy. To extend the network lifetime, it is more desirable for such clusters to be larger. This motivates us to design a protocol with partial re-construction. In our protocol, clusters that are involved in fewer inter-cluster communications are allowed to have more member nodes for optimizations. A more detailed explanation is given in Subsection 3.3.
3. The CACD Scheme
The CACD approach proposed in this paper has three phases: set-up, data-transmit, and re-construct. In the set-up phase, CACD elects a set of cluster heads, and each non-cluster head node chooses a cluster head to form a cluster. Similar to EEUC, CACD adjusts the node density of each cluster according to its distance from the sink. Sensor nodes transmit data to their cluster heads, and then the cluster heads aggregate and forward the data to the sink, using a multi-hop communication model in the data-transmit phase. In the re-construct phase, if certain clusters consume much more energy during the set-up and data-transmit phases, CACD balances the energy consumption by partially reconstructing those clusters. Fig. 4 shows how each round of CACD is organized. Please note that after re-construct phase, we omit for the next several rounds the set-up phase that is quite costly in order to fully utilize the well-established cluster structure as shown in Fig. 4. In Fig. 4, set-up phase is replaced with re-construct phase in round 2 and for the following rounds, the set-up phase is omitted and thus, the result of re-construct phase is maintained. By maintaining a well-established cluster structure and omitting the expensive set-up phase, CACD preserves more energy for inter-cluster communication. More operational details will be described in the next subsections. Without a loss of generality, we assume the following properties about the sensor network:
• The sink is stationary and has unlimited energy resources.
• Each sensor node is capable of measuring its distance from the sink according to the strength of receiving signal.
• Sensor nodes are stationary and have similar capabilities (processing, communication, energy resources).
• Sensor nodes can control their transmission power level according to the distance.
3.1 Set-Up Phase
In the set-up phase, CACD elects a set of cluster heads and then, forms clusters with the remaining nodes. CACD employs the hybrid energy-efficient distributed (HEED) cluster-head selection mechanism [11]. With a given initial percentage of cluster heads, Cprob, which is set to 5%, HEED rotates the cluster heads based on their residual energy. Eq. (3) is given for cluster head selection.
where CHCHprob indicates the probability of becoming a cluster head. Eresidual and Emax are the estimated remaining energy and the maximum energy of a node, respectively. After the cluster-head election, CACD forms clusters using the ADV and Join messages. The sink that overhears the ADV message estimates the distance between a cluster head and the sink according to the signal strength and assigns proper cluster sizes and member nodes to the cluster heads. With this information, each cluster head forms a cluster. The number of member nodes assigned to a cluster follows Eq. (4).
where, optimal sensor node (OSN) is the number of sensor nodes that should be included in a cluster while N and CN are the number of sensor nodes and cluster heads for the sensor network, respectively. P is a constant to adjust the node density of each cluster according to the distance from the sink. For effective assignment of P, CACD follows Eq. (5).
where, d(si, sink) is the distance between a cluster head, si, and the sink. F indicates the size of the sensor field, while v is a user-defined constant to assign the proper number of member nodes to a cluster. If a cluster has more member nodes than OSN, the cluster is partitioned or passes member nodes to neighboring clusters to build a more energy-efficient cluster structure. We define the threshold, M, for cluster partitioning as follows:
Figs. 5 and 6 give a running example of a cluster that requires partitioning. As shown in Fig. 5, cluster B has more sensing nodes than M (=9); thus, cluster B should be partitioned to balance the energy dissipation. For partitioning, the two most-distant nodes are elected as new cluster heads and the new cluster heads form clusters using the ADV and Join messages, as mentioned above. Fig. 6 shows the result of the partitioning procedure. In Fig. 6, cluster B is partitioned into two clusters B’ and D. As shown in the running example, CACD balances the energy consumption of each cluster by limiting the maximum number of member nodes in each cluster. Even though the partitioning procedure generates more energy-efficient cluster structures, the procedure can be expensive. Accordingly, CACD applies a simply node transmission procedure to the clusters that have member nodes more or less than OSN for further optimization. In order for all clusters to have as many member nodes as their OSN, those clusters with member nodes more or less than their OSN send OVER or LESS message, respectively to the neighboring cluster heads. The OVER message and LESS message have information on excess and shortage of member nodes. If a cluster with more member nodes (CO) and a cluster with less member node (CL) are one-hop neighbor, some member nodes of CO that are close to CL are moved to CL so that all clusters have member nodes as many as their OSN can, Please note that the maximum number of member nodes varies according to the distance from the sink. Thus, the results of CACD may be similar to that of EEUC in that the clusters close to the sink tend to be small. However, CACD guarantees more energy-efficient cluster structures even for crowded areas, by limiting the maximum number of nodes in a cluster, which varies according to the distance from the sink.
Unbalanced cluster structure (before split).
Balanced cluster structure (after split).
3.2 Data-Transmit Phase
After finishing the set-up phase, cluster heads create TDMA schedules according to the number of member nodes, and pass the schedules to their member nodes. Then, the member nodes transfer their data to the cluster heads during their assigned time slots. After the cluster heads aggregate the data, they forward it to the sink, using CDMA to prevent data collisions. When delivering the data, if the distance between the sending and receiving nodes is less than d0, CACD uses the Friis free-space model. Otherwise, the two-ray ground propagation model is used.
3.3 Re-construct Phase
During the set-up and data-transmit phases, the cluster heads estimate the energy consumed for building cluster structures and for aggregating and transmitting data to the sink (Eo). In the re-construct phase, cluster heads send Eo to the sink, which estimates the average energy cost (AEC) and re-construct energy cost (REC) of each cluster. Then, the sink sends the AEC and REC to the cluster heads. REC is a threshold for triggering the re-construct phase.
where Etotal indicates the total energy consumption of the sensor network and CN′ is the number of cluster heads. If a cluster head (Ct) has Eo larger than REC, Ct broadcasts a reconstruct message to its neighboring cluster heads to re-construct the cluster. Please note that in Eq. (8), the constant coefficient 1.5 can be varied by simulation conditions and thus should be carefully decided by experimental results. If cluster heads (Cr), whose Eo is less than AEC, exist, the member nodes of Ct that are close to Cr become member nodes of Cr. Then, Cr recreates the TDMA schedule and sends it to the member nodes. After the re-construct phase, the set-up phase is omitted for the next k round to utilize this reconstructed cluster structure, as shown in Fig. 4. A more detailed explanation for the best setting of k will be given in Section 4. If all neighboring clusters have Eo larger than AEC, the re-construct phase is aborted and CACD begins the next round so that the entire cluster structure can be re-built in the set-up phase.
CACD algorithm
The above algorithm gives a sketch of CACD. In line 1, we elect the cluster heads according to the mechanism used in HEED. Then, CACD builds a cluster structure in the remainder of the algorithm. While building the cluster structure, a cluster whose member nodes are more than M is partitioned in lines 5-7. When a cluster head has more member nodes than OSN, the cluster head finds clusters that are able to have more member nodes, and passes its member nodes to them in lines 8-18. In line 19, each cluster decides whether to trigger the re-construction phase. If Eo > REC, the cluster head triggers re-construct phase by broadcasting a reconstruct message in line 20 and waits for the cluster-extend message. If the cluster head receives the cluster-extend message, the re-construct phase is triggered in line 22. Otherwise, CACD simply moves to the next round so that the entire cluster structure is re-built in the set-up phase in line 24. If a cluster whose Eo is less than AEC receives a reconstruction messages, it broadcasts a cluster-extend message and triggers the re-construction phase in lines 26-31.
4. Performance Evaluation and Analysis
In this section, we evaluate the performance of our approach, CACD by comparing it against LEACH, HEED, and EEUC. We use Network Simulator 2 (NS2; http://www.isi.edu/nsnam/ns) for simulations and, without a loss of generality, the energy consumption model used in the simulations is the first-order radio model (Eq. (9)), which is also used for LEACH and EEUC.
ETX in Eq. (9) indicates the energy consumed by transmitting data. Eelec represents the energy consumed for processing one bit of data, while t and d represent the data size and the transmission distance, respectively. εamp is the energy required to amplify the signal. When d < d0, we use εfs, which is from the Friis free space model. Otherwise, εamp from the two-ray ground propagation model is used. ERX represents the energy consumed when receiving data. The required d0 here is defined by Eq. (10).
where L is the system loss factor, which is not related to the wave propagation while λ is the wave length of the carrier signal. ht and hr are the heights above the ground of the transmitting and receiving antennas, respectively.
L = 1 indicates no loss in the system hardware. ht = hr = 1.5 m, and λ is related to the carrier frequency. If we use a carrier frequency of 914 MHz, λ is as follows:
Using these parameters, do = 86.2 m. The simulation parameters are given in Table 1. All of the alternatives are compared under the same setup, and we use the scenario generator of NS2 to generate randomly deployed sensor nodes, as shown in Fig. 7.
Random deployment of sensor nodes.
The impact of k on the network lifetime of CACD.
The first set of our simulation is to find the best value for k (Section 3.3). By omitting the set-up phase for k rounds after the re-construct phase, CACD preserves more energy for intra-cluster communication and thus can extend the lifetime of the sensor networks. In Fig. 8, we count the rounds until the first node dies, while k varies from 1 to 10. As shown in the figure, CACD shows poor performance when k is less than 3. This is because, with such small k values, CACD cannot fully utilize the benefit of the re-construct phase. More specifically, during the re-construct phase, CACD produces a more energy-efficient cluster structure, but consumes extra energy in the process. Thus, in order to compensate for the extra energy and preserve energy for inter-cluster communications, CACD omits the set-up phase and maintains the result of the re-construct phase for the next k rounds. However, Fig. 8 shows that CACD cannot achieve these goals with such small k values. In addition, CACD becomes less effective when k is greater than 8. This is because maintaining a cluster structure for such a long period causes a rapid depletion of the cluster heads’ energy resources. This rapid depletion makes our approach less effective, as shown in Fig. 8. As a result, in order to optimize the CACD’s performance, we omit the set-up phase for the next 6 rounds after re-construct phase, which is the best setting of k, as shown in Fig. 8.
The next set of our simulation is to study the effectiveness of CACD. We evaluate and compare the energy consumption of each sensor node as shown in Fig. 9. Fig. 9 shows the average energy consumption of each round for the four alternatives. CACD is up to 3 times more energy efficient than LEACH, and up to 1.6 times more energy efficient than HEED and EEUC. LEACH’s poor performance is because it does not take into account the node density and the size of each cluster. Furthermore, the direct communication between each cluster head and the sink highly contributes to the poor performance.
HEED outperforms LEACH mainly due to the multi-hop communication model for forwarding data, but may suffer from the hot-spot problem since cluster heads closer to the sink are burdened with forwarding data [12]. This hot-spot problem accelerates the energy consumption of neighboring nodes and, in turn, may result in partitioning the sensor network as well as reducing the sensing coverage, which is undesirable for wireless sensor networks.
As shown in Fig. 9, EEUC consumes less energy than HEED due to its distance-aware clustering approach, but may become less effective for a wireless sensor network with a random distribution. CACD extends EEUC by considering the node density of each cluster and successively reduces the energy consumption of each round, as shown in Fig. 9.
Average energy consumption per round.
The number of alive sensor nodes over time.
Fig. 10 shows the number of sensor nodes that are still alive over the simulation time of the four alternatives. CACD clearly improves the lifetime of the network over LEACH, HEED, and EEUC.
When forming cluster structures, other alternatives do not consider the node density and energy consumption of each cluster and may create unbalanced cluster structures. These unbalanced cluster structures may waste unnecessary energy while processing and transmitting data. In contrast, CACD forces a cluster to have fewer member nodes than OSN, which varies according to the distance from the sink. This property of CACD guarantees more balanced energy load over the sensor network, even if the sensor network has a random distribution.
The last set of our experiments is to observe the tradeoff resulting from keeping the cluster structures. Fig. 11 shows energy consumptions of cluster heads on CACD and HEED. EEUC selects cluster heads considering the energy level of each node as well but we compare CACD only with HEED because detailed selection algorithm is not given in the paper. As shown in the figure, CACD reports that cluster heads consume more energy than HEED until 2 ROUND due to set-up phase and re-construct phase but after the round, CACD consumes less energy since CACD maintains the cluster structure for 7 rounds. The cluster heads maybe more burdened in the CACD but the stable cluster structure minimizes the burdens and makes CACD more energy efficient considering the overall energy consumption.
Average energy consumption of each cluster over ROUNDs.
5. Conclusion
In this paper, we proposed the CACD approach that aimed to prolong the lifetime of wireless sensor networks, which are usually powered by limited energy resources. The CACD approach creates cluster structures that can balance energy consumption, even for randomly distributed wireless sensor networks, by considering the node density and the energy consumption of each cluster. Analysis and performance evaluations showed that the CACD approach is up to three times more energy-efficient than previous methods.
As future work, we plan to extend our approach by exploring the cooperative communication model [13,14] to create a more energy-efficient cluster structure.