1. Introduction
Nanotechnology enables the design and manufacture of nano-sensors with sensing, data storing, computing and communication capabilities [
1]. These nano-sensors are able to sense events at the nano-scale, which is different from those in classical wireless sensor networks (WSNs). Wireless NanoSensor Networks (WNSNs) are new types of networks combing nanotechnology and sensor networks, which have broad application prospects in health monitoring, damage detection, biomedicine and military defense [
2].
Recent simulation studies from [
3] show that these nanosensors can communicate in Terahertz (0.1–10 THz) band using a new nano-material called graphene as parts of the transmission antenna. Terahertz band communication is considered to be the key technology to satisfy ultra-high-speed wireless communication, because it can provide very large transmission rate, up to Gb/s or even higher. Furthermore, nano-devices take advantage of the peculiarities of terahertz wave, such as the narrow beam and good directivity which can be used to detect and precisely position smaller targets [
4]. On this basis, ref. [
5] proposed a simple and effective modulation scheme of nano-devices called TS-OOK (Time Spread On-Off Keying), which is based on the exchange of femtosecond-long pulses spread in time. But in a TS-OOK communication system, the probability of collision increases with the increase of the number of transmitters [
6]. Due to the high density of nodes in WNSNs, when multiple nano-devices send messages to the same target nano-node at the same time, if one symbol conflicts, it will cause conflicts in each symbol until the end of the first packet. There will be catastrophic collisions [
7] that are unacceptable in many applications.
Considering the Terahertz band and characteristics of nano-devices in WNSNs, such as high propagation losses and very large distance-dependent bandwidth [
7,
8], new Media Access Control (MAC) protocols are required to regulate the channel access and transmission sequence between nodes. Recently, several MAC protocols have been proposed for WNSNs. Jornet et al. proposed and analysed a Physical Layer Aware MAC Protocol for Terahertz Electromagnetic Nanonetworks (PHLAME) [
8]. To solve the energy consumption and reliability, PHLAME chooses the optimal value of code weight and repetition. A novel MAC protocol was proposed in [
9] for cluster-based WNSNs, which is divided into two phases: the selection of a master node and the data transmission. An energy harvesting-aware and lightweight MAC protocol was proposed in [
10], the protocol made the lifetime of WNSNs unlimited. Then, [
11] introduced a Distributed Receiver-Initiated Harvesting-aware MAC for Nanonetworks (DRIH-MAC), which is designed to operate in a distributed network topology. But there are still hidden terminal problems in the distributed model. The Timing Channel for Nanonetworks (TCN) MAC protocol in [
12] exploits the timing channel for transmissions. But the energy efficiency of TCN is not evaluated on simulation platforms. Rikhtegar et al. [
13] proposed an energy efficient MAC protocol for Terahertz electromagnetic WNSNs (EEWNSN-MAC), which exploits the hierarchical structure and combines the contention free scheme, time division multiple access (TDMA) with a clustering algorithm for communication between nano-nodes. However, EEWNSN-MAC does not take the dynamic changes in transmitted packets of each nano-node and the energy harvesting system into consideration. Sami et al. [
14] proposed a cooperative MAC protocol called cooperative cognitive TDMA (CC-TDMA) for cognitive networks, which guarantees the quality of service (QoS) required by the primary network. The unlicensed users obtain greater opportunity for data transmission, thus increasing their performance. However, the performance of the protocol in the terahertz band cannot be predicted. A load-aware dynamic TDMA protocol (LA-TDMA) [
15], which realized the dynamic allocation of data transmission time slots on the basis of TDMA according to terahertz channel characteristics, number of nano-nodes and respective traffic volume of nodes. LA-TDMA ensures that data is transmitted without conflict, but it does not consider the priority difference of data. Cao et al. proposed the MAC protocol of individual domain network with high throughput and low delay [
16]. Based on IEEE 802.15.3c [
17], the high-throughput low-delay MAC (HTLD-MAC) protocol adopts the slot reservation mechanism based on channel quality and the adaptive confirmation mechanism. Nodes choose the block confirmation mechanism or immediate confirmation mechanism according to the channel quality, which can reasonably allocate slot resources, reduce data delay and improve network throughput. Mohrehkesh et al. proposed a MAC protocol based on receiving node initiation and energy acquisition, called RIH-MAC [
18]. The main goal of this protocol is to reduce energy consumption. In addition, this protocol adopts the handshake process initiated by receiving node rather than the handshake process initiated by sending node. However, the new problem is that the protocol cannot solve the problem of hidden terminals in ad-hoc networks, and the energy collection process takes a long time.
Since a very limited amount of electricity can be stored in nano-batteries and it’s difficult to replace or charge the nano-batteries in most application scenarios [
19], the energy limitation of nano-devices becomes one of the main challenges in the design of MAC protocols. Energy harvesting technology can continuously replenish energy for batteries, which makes sense in ensuring the sustainability of the system. A nano-scale energy harvesting system is proposed in [
20], which converts the harvested vibration energy, sound energy and electromagnetic energy into electric energy. The distributed energy harvesting TDMA (DEH-TDMA) protocol in [
21] introduces a piezoelectric nano-energy harvesting system due to the limited energy of nano-nodes. However, this protocol only makes decisions from the perspective of the nano-node itself and does not comprehensively consider the state information of other nodes in the network. Therefore, the decision results obtained by solving the MDP model are only the local optimal strategy.
In this paper, a centralized energy harvesting-based TDMA protocol, called CEH-TDMA is proposed. This protocol examines the data transmission process from a global perspective, where the nano-controller regulates the channel access and allocates time slots for all nano-nodes. The rest of this paper is organized as follows.
Section 2 introduced the symbol model. Next, a Markov decision process (MDP) model for the proposed CEH-TDMA is built and an optimal slot strategy is solved in
Section 3, and the establishment steps of CEH-TDMA is described in
Section 4. In
Section 5, we present simulation results for the performance of TDMA, LA-TDMA and DEH-TDMA based on average end-to-end delay, average remaining energy and number of transmitted packets. Conclusions are made in
Section 6.
3. MDP Model for CEH-TDMA
The CEH-TDMA protocol adopts a frame structure shown in
Figure 1. The nano-node informs the nano-controller of the status information including the number of data packets in the buffer area and the remaining energy quantity, and the nano-controller constructs a specific MDP model according to the information and solves the global optimal strategy, and then the result is broadcast to the nano-node. At this time, the nano-node obtains the decision result of the nano-controller from the time slot scheduling packet, and then occupies the corresponding time slot to transmit data. We will establish the system state, action space, state transition probability matrix and award function of the MDP model for the CEH-TDMA protocol respectively, considering the calculation complexity of the MDP model and in order to maximize the data transmission volume of the entire network, an approximate time slot allocation strategy is obtained by approximate solution.
3.1. System State Space
Since the nano-controller examines the data transmission process of WNSNs from a global perspective, the system state is the joint state of all nano-nodes. Then in frame , the system state space is , where represents the state of the nano-node , where is the total number of nano-nodes in WNSNs.
In the
frame, the state
of the nano-node
is the joint state consisting of the number of packets in the buffer
and the remaining energy
, which can be expressed as:
where
is the maximum number of packets in the buffer,
is the minimum energy required for normal operation of nano-nodes, which is set to the energy consumed in sending and receiving a packet,
is the maximum energy stored in the nano-battery.
In order to facilitate the calculation, normalized energy is adopted, where the remaining energy of the nano-node is mapped to the number of packets that can be sent under the current energy constraint:
where
,
respectively represents the number of packets that the nano-node can send when the remaining energy reaches
and the maximum
,
is the energy consumption of sending
packets.
is a rounding operation.
For self-powered nano-node
, the number of packets in the buffer and the remaining energy can be expressed as:
where
and
respectively denotes the number of packets in the buffer at the beginning of the
,
frame,
and
respectively denotes the remaining energy at the beginning of the
,
frame.
is the time length of the
frame,
and
is the number of packets sensed from the external environment and the energy acquired from the energy harvesting system during
, respectively.
and
are both binary parameters, where
means the nano-node
is active and participates in data transmission in the
frame; otherwise
. The number of transmitted packets is denoted by
,
is the energy consumption of sending
packets in the
frame. The number of sensed packets
, and the harvested energy
.
3.2. Action Space
In the CEH-TDMA protocol, the nano-controller determines the channel access of each nano-node according to the state space established in the previous section. Therefore, the action space is a joint space including the action of each nano-node, denoted by , where is the action space of one nano-node in the WNSNs. According to the above analysis of where indicates that the node in the current frame enters the dormant state due to lack of available energy or no data arrival in the buffer, indicates that the nano-battery has sufficient energy to transmit data.
3.3. State Transition Probability
In general, the data arrival and energy harvesting process of nano-nodes are independent in energy harvesting-based WNSNs. Therefore, it can be assumed that the state transition probability of packet and energy are also independent, so that the system state transition probability of nano-nodes can be obtained by solving the energy state transition probability and the packet state transition probability, respectively.
Firstly, the amount of data transmitted and energy consumption by nano-node
when taking the appropriate action
under its current state
can be written as:
where
and
respectively denotes the number of transmitted packets and energy consumption in the
frame,
and
respectively denotes the number of packets in the buffer and the number of packets that can be transmitted with the remaining energy
in the current frame,
is the number of packets that can be transmitted in the slot block shown in
Figure 1. In order that nano-nodes can send packets to the nano-controller as many as possible, the value of T is set to the maximum amount of data transmission in the current frame, which is jointly determined by
and
, that is, the maximum number of packets in the buffer and the number of packets that can be sent when the remaining energy reaches the maximum.
The number of packets arriving in the buffer and the energy arriving in the nano-battery during state transition can be expressed as:
where
and
respectively denotes the number of transmitted packets in (10) and the energy consumption in (11), the round-up operator
is adopted in the above two formulas since the number of packets and energy are all integers and nano-nodes are expected to transmit data as much as possible with sufficient energy in CEH-TDMA.
Substituting (13) into the packet arrival model described by (3), we can get the packet state transition probability .
Similarly, substituting (14) into the energy harvesting model described by (2), we can easily get the energy state transition probability .
So, the system state transition probability of the node can be expressed as the product of the packet state transition probability and the energy state transition probability [
23]:
where
indicates the state transition probability of the packet of node
, and
represents the energy state transition probability of node
.
Since the nano-node needs to consume a certain amount of energy to transmit the state information and receive a slot scheduling packet broadcast by the nano-controller, and the calculation of the energy state transition probability involves the data transmission amount and corresponding energy consumption of the node in the frame, which can be defined as:
where
represents the energy consumption of node
in frame
,
and
represent the state information and the number of bits of the slot scheduling packet, respectively,
represents the amount of transmitted data in the frame
,
and
respectively represent the transmission and reception energy consumption of per bit.
It is assumed that each nano-node in WNSNs is individually aware of signals from the external environment, so the data arrival between nodes is independent of each other. Since the energy harvesting rate is strongly related to the vibration source and the environment, the energy harvesting process between nano-nodes has a certain correlation. However, CEH-TDMA aims to obtain a dynamic TDMA protocol based on the number of packets in the buffer and the remaining energy allocation slots, studying the correlation of energy harvesting rates requires a large amount of field test data, so the CEH-TDMA protocol assumes the process of collecting energy from the external environment by the nano-nodes is independent of each other, so that the energy state transitions between the nodes are also independent of each other. The system state is the joint state space composed of the states of all the nano-nodes, so that the system state transition probability is:
3.4. Award Function
In order to maximize the network throughput and reduce the energy consumption by solving the established MDP model, the award function of each nano-node can be written as:
where
is the award function obtained after nano-node
takes action
under stat
,
is the average packet arrival rate,
is the length of a frame,
is the average number of packets arriving within a frame,
and
are respectively the amount of transmitted data and energy consumption within the current frame and is calculated by (10) and (11), respectively,
represents the maximum energy consumption for sending and receiving packets in the current frame. In addition, the normalized amount of transmitted data and energy consumption are adopted to analyze the award function.
The system award is the sum of the each nano-node’s award, specifically expressed as:
where
represents the income function obtained by the behaviour of the nano-node
taking the behaviour
in the state
.
3.5. Approximate Solution of MDP Model
The time slot allocation strategy of the CEH-TDMA protocol is to establish and solve the MDP model according to the real-time state of each nano-node by the nano-controller, and finally determine the global optimal channel access mode. The nano-node only needs to know the decision result from the time-slot scheduling package broadcast by the nano-controller instead of participating in the decision-making process, which greatly reduces the calculation and memory pressure of the nano-node. However, the protocol needs to consider the state information of all nodes in the network, which increases the complexity of the MDP model built above. Hence we will propose an approximate solution algorithm to reduce the computational complexity, so that the CEH-TDMA protocol can be applied to a wide range of practical scenarios.
The nano-controller can count the number of source nodes
in the current frame
under the condition of the known decision behavior space
, and set the number of time slot blocks in the data transmission phase to
. In the frame structure shown in
Figure 1, in the data transmission phase, the number of time slot blocks varies with the number of source nano-nodes, and each time slot block contains
time slots and remains unchanged. Considering that in WNSNs based on energy harvesting, the amount of data transmitted by a nano-node in the current frame is limited by its residual energy, so a measure function is defined to determine time slot allocation during the data transfer phase.
where
represents the number of data packets that the nano-node
wants to transmit in frame
, and is also the basis for the nano-controller to dynamically allocate time slots.
and
are the number of packets and residual energy sent by the nano-node to the nano-controller in the slot application phase,
indicates the energy consumption of the nano-node transmitting status information and the receiving time slot scheduling packet, and
indicates the transmission energy consumption per unit bit.
Based on the time slot measurement function defined above and the greedy solution algorithm, we propose a method shown in Algorithm 1 for approximate solving the MDP model. First, the nano-controller can obtain their status information when they successfully receive the time slot request from the nano-node. Then, the time slot measure value of each node is calculated according to the formula (20), and the descending order is performed. Then, for a nano-node with a slot measurement value of 0, the nano-controller makes a decision that the time slot is not allocated, and the other nodes occupy T time slot blocks in descending order of the time slot measurement value, thereby ensuring a large amount of data and the node with higher remaining energy preferentially transmits data. Finally, the nano-controller broadcasts the decision behavior set and the slot allocation information to all the nano-nodes in the form of a time slot scheduling packet, and records the system state and behavior space at this time, so that the next time the same node state occurs, the nano-controller can directly obtain the corresponding decision result by searching, thereby eliminating the process of solving.
Algorithm 1: Approximate solution for centralized MDP |
input: buffer and energy level of each nanonode
output: a set of actions compute slot measuring function according to Equation (20) sort in descending order, enabling for each element in do if then else end if end for return |
The above approximate solution algorithm needs to calculate the slot measure values of all nano-nodes and sort them in descending order, so the algorithm complexity can be expressed as . When , the algorithm complexity is , when , the algorithm complexity is .