1. Introduction
Recently, the Internet of Things (IoT) has drawn substantial attention from a research standpoint. In the upcoming years, more and more devices will be connected to the IoT, with the estimated number of connected devices according to [
1] rising to a whopping 38.6 and 50 billion by 2025 and 2030, respectively. The future of the Internet will consist of heterogeneously connected devices. A new type of network, Low Power Wide Area Networks (LPWANs), have enabled the communication between electronic devices and sensors through the Internet to improve and facilitate everyday life. The infrastructure for such a network is very lightweight and its unique characteristics of high communication range and rapid deployment support plenty of smart and intelligent applications [
2], such as smart cities, environmental metering, forest and sea status surveillance, healthcare, logistics, and so on.
A very important element of LPWANs, especially for this work, is long-range (LoRa) transmission. It is a very promising and innovative long-distance wireless data transmission technology operating on the unlicensed ISM frequency band. Due to LoRa’s unique characteristics, it allows the communication of thousands of devices, at distances ranging from a few kilometers inside the urban web to tens of kilometers outside it, using a single gateway [
3]. Moreover, the cost of the required equipment is particularly low, thus making this technology suitable for large-scale IoT applications.
The network consists of two basic device types, the end device (ED) and the sink node (SN) or base station. The former is a simple device with sensors attached on it, gathering information, and transmitting them back to the base station. The latter is a device in charge of controlling the whole network. It sends information between the ED and a central network server. Additionally, it has the capability to simultaneously support thousands of EDs at the same time.
The proposed algorithm makes optimized use of the hardware that was used in the work of [
4], adding backwards communication from the EDs with the cluster head (CH) in favor of a more efficient LoRa Medium Access Control (MAC) protocol. The latter is a unique type of device dividing the total number of EDs into smaller groups, named clusters, and making sure that all of them are synchronized concerning the time slot each one will transmit its data to the SN. At the beginning of each cycle, the SN requests the data from the EDs by sending a LoRa message to the CH. Upon this message reception, the latter broadcasts the calculated transmission program to initiate the data transmission from the EDs via the wake-up radio (WuR) message. As the CH knows the distance to the SN for all the EDs, it divides them into two groups: those that are closer to the SN and have a lower spreading factor (SF), and those that are further, which assigns them a higher SF. As a result, EDs with a lower SF will also have lesser latency, and EDs that have higher SF are guaranteed that their transmissions will have improved range and higher probability of successful reception. Furthermore, the proposed protocol achieves a higher degree of slot utilization compared to a classic TDMA, as EDs that do not have any data to report inform the cluster of this fact, and as a result the remaining EDs adaptively reschedule their transmission time for a faster cycle, less latency, and zero empty slots.
The remainder of this paper is organized as follows:
Section 2 overviews the related research on this field;
Section 3 offers insights pertaining to the network architecture used in this work; the operation of the proposed algorithm is analyzed in detail in
Section 4; and simulation results are demonstrated in
Section 5. Finally,
Section 6 concludes the paper.
2. Related Work
As the number of devices in an LPWAN increases, the carrier-sensing mechanism becomes less effective at reliably detecting channel activity, negatively affecting network performance. Therefore, LPWAN technologies such as Long-Range Wide Area Networks (LoRaWANs), built on top of LoRa, have been based on the pure-ALOHA protocol for uplink communication. ALOHA is an asynchronous protocol where the EDs communicate when they have data ready to send, either scheduled or event-driven. It is simple, lacks synchronization between the devices, and has minimum communication overhead. All these characteristics fit perfectly with the sporadic communication demands of applications with low traffic. Plenty of studies have been conducted to analyze the viability of this standard in terms of latency, reliability, and throughput, with most of them concluding that although the protocol performs adequately under a light load, it suffers from uplink traffic congestion in cases of heavy traffic as the number of network devices increases [
5,
6,
7,
8]. Slotted-ALOHA, a variant of the pure-ALOHA protocol, has been proposed as an alternative to enhance the overall system performance.
A frequent tactic used to improve battery life for most IoT devices is to resort to duty-cycling mechanisms, where the device periodically wakes up from sleep mode to retrieve new data. Although this approach allows for great power savings, it has its limitations. As the device wakes up based on a time schedule, it means that in some cases it will wake up even if there are no data for transmission, thus wasting energy and time. Additionally, if long sleep intervals are being used, then data latency increases. Trying to eliminate the above weaknesses, Piyare et al. [
4] suggested an ingenious receiver-initiated On-Demand Time Division Multiple Access (TDMA) communication protocol for IoT applications to boost both the latency and the energy efficiency of standard LoRa architectures. Furthermore, ultra-low power WuRs, which are capable of continuously listening to the wireless channel and activate the system when a specific signal referred to as the wake-up beacon (WuB) is detected, are introduced in this work. WuR is the ideal solution for removing limitations such as long sleep intervals or idle listening posed by duty cycles, which lead to increased delay and energy expenditure, respectively.
Considering the aforementioned remarks, in this work, a new communication scheme is proposed for networks using the LoRa protocol, based on the standard TDMA approach. While keeping the collision-free characteristic of the TDMA protocol along with its simplicity, this work focuses on the node’s ability to communicate if it does not have a packet for transmission. Nodes continue to be synchronized based on the WuR [
4]. In addition, after a time slot assignment to every node, whenever a node does not have data to report, it sends a LoRa message to the SH, and the time slot scheduling is reformed for the remaining nodes.
The proposed algorithm combines long-range communication with ultra-low power WuRs, achieving lower communication latency in heterogeneous long/short range networks when compared to typical TDMA implementations from the literature. With respect to [
4] implementation, the main objective of this work is to further reduce the mean data latency, aiming at applications that have limited time response frames such as utility networks (water, electricity, and gas) which will greatly benefit from the aforementioned characteristics in case of an emergency such as water and gas leakages in addition to electrical network failures and many more. By giving the EDs the ability to communicate with the CH if they do not have a packet, the system reforms the time slots for the remaining EDs and saves time that otherwise would have been wasted. In the literature, several other architectures were already proposed (e.g., Industrial LoRa, RT-LoRa, RS-LoRa) where the beacon transmitted by the SN is used by the ED to dynamically (and automatically) choose the correct SF as a function of the distance from the SN [
9,
10,
11]. A comparison of the proposed protocol with existing solutions is shown in
Table 1. It should be mentioned that while the on-demand solution as presented herein does not use channel selection but only SF selection, an updated version for channel and SF selection is based primarily in a simple amendment of the existing WuR message used to inform about the SF. Utilizing additional bits in the aforementioned message, the system could exploit higher number of concurrent transmissions, serving a higher number of nodes at the same time.
3. Network Architecture
This section describes in detail the architecture of the network that was adopted, as shown in
Figure 1. To begin with, we are referring to a heterogeneous network consisting of three different types of nodes: the EDs, the CH, and the SN. The SN is the most important type, being unique in the network. It is the node where all the data are being gathered. The EDs are nodes equipped with different kinds of sensors that collect data and transmit them to the SN, and lastly the CH is the means of communication between the EDs and the SN. In the network, the EDs are split into one or more clusters, with every one of them having its own CH. Both CHs and EDs are equipped with a LoRa and a WuR transceiver [
12,
13,
14,
15]. Contrariwise, the SN is equipped only with a LoRa transceiver.
As a way to improve energy efficiency, reduce data latency and crashes, and integrate with IoT scalability, in this work we opted for the use of an ultra-low power wake-up radio transceiver. These devices (which mostly utilize the band below 1 KHz) can monitor the channel in a continuous mode, consume microwatts of power for addressing and wake-up operations of the major radio system. Exploiting the capabilities of these devices, we avoided false wake-ups, scheduled the main operation of the LoRa transceiver, and organized the nodes according to the proposed algorithm (vide infra). This type of communication is based solely on the activation of the system via interruptions whenever a specific signal referred to as a wake-up beacon is detected from the wake-up transmitter (WuTX). As a result, since the WuR can always be on, each node is additionally equipped with such a device and can operate in a purely asynchronous manner, activating the main radio on demand, without requiring continuous transmission. Upon reception of the WuB signal from the sender node, it immediately activates the node’s LoRa transceiver so as to transmit its available data via the LoRa-reserved spectrum.
Since the WuR transceiver has a very low power consumption, it does not require continuous transmission, with direct effect on network life, as shown in
Section 5. To allow for a power-efficient receiver implementation, the data are sent using on–off keying (OOK) modulation. The SN uses LoRa to send commands to the CHs
and receive data from the EDs
. The CHs use a WuR message to synchronize every ED in the cluster and inform them of their time slot start
. Additionally, they use LoRa to receive a flag signal from an ED when it does not have a packet to transmit on the current cycle
. Then, the CH resends a WuR signal to update all the still-waiting EDs of their new time slot start.
4. Operation of Proposed Approach
In the TDMA implementation by Piyare et al. in [
4], they consider 9 end nodes (1–9), and every ED works with the same SF without considering its distance from the SN. This results in a significant difference in mean data latency, especially in cases where the larger SFs are being used for all EDs instead of a smaller one for some of the EDs. Moreover, for every ED a specific time slot is allocated to transmit its data over LoRa to the SN. The SN sends a message to the CH over LoRa, and then the latter sends a signal over WuR to all EDs and calculates the start of each time slot from the
WuBArrivalTime as:
This results in significant delays in scenarios where not every ED has a packet to transmit, because the time slots cannot be updated dynamically, and the network waits for an empty transmission slot. The slot size is determined by computing the time-on-air (
) using the above equation (see
Appendix A) for the LoRa data packet depending on the payload size, with a pre-defined guard time (
) of 6 ms, and
, which is the ID of each ED. The guard time guarantees that the window is large enough for the transmission and compensates for clock drift, which may be detrimental with an increasing number of EDs. Finally, the EDs start transmitting the data packets to the SN over the LoRa module, according to the slot schedule, as illustrated.
Firstly, the proposed algorithm (
Figure 2) is aware in advance of the distance of every one of the 9 EDs and the CH from the SN, so it can use the most efficient SF possible (
Table 2). We consider that in every scenario there will be at best two different SFs. That is true because we are aware of the operational range of LoRa being about 20 km. If we divide the LoRa range in six discrete SF zones (from 7 to 12), every zone has a radius larger than the previous one by about 3.333 km, and considering the range of WuR, which is 3 km, we end up with two SF groups that all the EDs in a cluster belong to. The first group will have from none to all the EDs and will operate with
, and the second group will have the rest of the EDs and will operate with
(
Figure 3). With that change we manage a smaller mean data latency, especially with the larger SFs due to the difference in time-on-air (ToA).
Additionally, the algorithm can dynamically calculate and set new starting points for every ED’s time slot when one or more of them do not have a packet to transmit. The SN sends a LoRa signal to the CH, and then the latter sends a message over WuR an informing the EDs about their scheduled transmission slots. The aforementioned WuR message has a different structure from a typical LoRa message, and as referenced above, it utilizes different radio technology. Recent works in this field have shown that WuR transceivers have the additional ability to transfer useful payload apart from the addressing information [
12,
13,
14]. At the beginning of each cycle, as the CH knows the exact position of each node, it calculates the appropriate SF that should be used for message transmission for each of them. Nodes that are closer to the SN are directed to use the lower SF and the ones that are further away from the SN will use the higher one. Under this scheme, for every geographical configuration of nodes around the CH (balanced or not), the system can take advantage of the shorter ToA for the lower SF and the increased reliability for higher SF in relation to the distance. In order to achieve this, bit-stream coding is used in a dual-mix mode (as string and as binary number), to inform the nodes about their initial transmission time schedule and to alert the existing nodes of any empty transmission slots reported from the nodes.
Upon reception of this 10bit stream part (shown in
Figure 4, which is additional to a typical WuR addressing message (shown in
Figure 5), each node is able to calculate its transmission time slot by the following scheme:
The first bit (a1) is used to distinguish the initial transmission program from any additional corrective message that informs the nodes about an unused transmission slot. The code for the initial schedule is translated as 1, while 0 means that the following bits are decoded as a binary number, standing for the node that will not use its time slot to transmit any data;
The other 9 bits are used from the nodes, in the case of the initial program, to decode their transmission schedule. For each of the following 9 bits, each node sets its SF according to the appropriate bit. For example, ED1 uses the b1 bit, while node 9 uses the b9. A bit value of 1 means the node should use the higher of the two SFs while bit value of 0 indicates the use of the lower SF in each cycle;
Whenever the first bit (a1) is equal to 0, the following bits are translated as a 9-bit binary number. The useful range for this number is from 1 to 8, as it stands for the node ID that will not transmit during its assigned time slot (there is no need to inform about the 9th node). Upon reception, each node adapts its transmission schedule and transmits its data one time slot sooner.
The above scheme ensures that each node is aware of the transmission schedule of all the other nodes in the cluster. Initially, each node calculates its time slot, also knowing the schedule of all the other nodes. Consequently, when a node sends a LoRa message to inform the CH with
that it will not make use of its assigned slot, the CH broadcasts a WuB message containing the ID of the node that will not report data. This message is transmitted via the WuR circuitry and being a broadcast message, it is ensured that all the nodes will receive it at the same time. Therefore, each node, knowing the reported node’s
SF, reschedules its time slot. Correspondingly, the system utilizes all the available time slots to report the data, having direct results in network performance. Using a direct relation from the addressing of the WuB used in [
4], the additional 10 bits that are used from the proposed protocol result in an additional 9.41 ms (17 ms + 9.41 ms = 26.41 ms) time as the total size is 3 bytes and an additional 2 bits (2 bytes used for addressing and 1 byte plus 2 additional bits for the enhanced WuB message). However, that process is activated only when the EDs use
, because otherwise the duration of the ED’s time slots is shorter than the update period. So, in that situation the network does not make any changes and just operates as the one in [
4].
5. Simulation Results
This section presents the simulation results for the Distance-Dependent TDMA protocol, to assess its improvement in mean data latency over the implementation of [
4]. The parameters for the simulation runs are presented in
Table 3. There was some energy efficiency testing as well. It should be noted here that the mean data latency pertains to the round-trip time latency, which in turn is computed as the time difference between the SN transmitting the initial LoRa message to the CH and receiving all LoRa messages from the EDs. The energy measurements are based on [
4] as well, using a 1200 mAh lithium polymer battery at 3.3 V and the power consumption numbers that Piyare uses. Both protocols have been implemented using Python 3.8 (Python Software Foundation, Wilmington, DE, USA) on a Jupyter Notebook (4.9.1, Professor Fernando Pérez, UC Berkeley, CA, USA) running Windows 11 with AMD Ryzen 5 5600 H. Two network architectures were simulated at four different network load situations. The network shape in
Figure 6 is mutual for both networks and only the distances of the nodes vary (
Table 4 and
Table 5). Both networks were tested with packets following: (i) Uniform distribution, (ii) Normal distribution, (iii) Binomial distribution and (iv) Poisson distribution.
The uniform distribution is the one that Piyare et al. use in [
4], and it basically means that every ED has a packet for the SN on every cycle. This is a very unrealistic scenario, so more lifelike distributions were tested. For the normal distribution the parameters used were
and
, for the binomial distribution
and
was used, and for the Poisson distribution
(see
Appendix B). These distributions were used to set how many EDs have a packet to transmit and feeds them with messages one at a time (EDs do not have any buffer as in the referenced system).
For both networks, the results were basically the same, but in different scales. When using the uniform distribution and changing the number of EDs in the network, we observed that until we reached 5 EDs the two approaches had the exact same results. From that point we faced lower data latency using our approach, due to the use of a smaller SF exploiting the ability of our algorithm to find the most suitable SF for every ED based on its distance from the SN (
Figure 7 and
Figure 8).
On the other hand, if we use the uniform distribution again but with a fixed number of 9 EDs in the network and our testing variable is the number of packets transmitted per cycle, we notice that [
4] has a steady mean data latency regardless of the changes, as the algorithm cannot sense if an ED has a packet or not, in contrast to our approach, which has a linear increase thanks to ED and CH communication (
Figure 9 and
Figure 10).
By using the other three types of distribution (normal, binomial and Poisson), we can clearly distinguish the advantages of the proposed protocol. In every scenario, the mean data latency is lower for any given number of EDs, especially after more than 5 EDs are added to the network where the selection of the proper SF takes place. The biggest difference is shown using the Poisson distribution, as the mean number of packets on every cycle is about 3, followed by the normal distribution which has a 4.5 packet-per-cycle rate and lastly the binomial distribution with 6.5 packets per cycle. On the top-left corner of each time graph we can discern how many cycles with each number of packets have been assigned for every distribution in a 9-ED network (
Figure 11,
Figure 12,
Figure 13,
Figure 14,
Figure 15 and
Figure 16).
Looking at the mean data latency of the two tested networks side by side, we notice that the gap between [
4] and our approach is greater in the second network, which uses bigger SFs. This is because the ToA does not increase linearly with every SF increment:
Lastly, we compared the energy efficiency of this approach over Piyare’s. The results presented below were exported using a unique method. We used the battery capacity of 1200 mAh operating at 3.3 V, exactly like Piyare. That gives us a 3.96 Wh battery. In the testing method we took the average power consumption of all 9 EDs of the network and we constantly ran simulations. There were no pauses, meaning that at the end of each cycle the SN immediately started a new one, and we measured how many hours the average ED would last for every distribution until its battery ran out. We can clearly distinguish (
Figure 17) that in every scenario our protocol manages to outperform Piyare’s approach concerning the endurance of the network in a continuous mode. If we consider that each cycle will happen with an inter-packet interval of 10 s, as Piyare used for his simulations, the improvement is significant.