1. Introduction
Smart grid is the energy infrastructure for smart cities, telecommunications, networks, and the computing industry. It upgrades traditional power grid systems with state-of-the-art information and communication technologies, such as wireless sensor network (WSN) techniques and the Internet of Things particularly designed for the smart grid industry (SG-IoT). In fact, SG-IoT heavily relies on WSN, which is characterized by a variety of distinct performance metrics, such as transmission rate, signal coverage, energy consumption (EC), and network lifetime [
1]. Among all these metrics, it is of particular importance to place emphasis on the EC (defined as the number of joules consumed per successfully transmitted bit) of wireless sensors, because the energy supply requirements of sensors are stringent in WSNs (i.e., very limited energy supply) and all the other performance metrics can be affected by the EC characteristic of sensors. To elaborate a little further, in many application scenarios of the smart grid industry, it is often inconvenient or unsafe for humans to work in the deployment site and the lifetime of sensors is often expected to be over several years. Therefore, low EC is of vital importance for these scenarios.
By contrast, in wireless communication systems that operate with the support of power grid infrastructure, it is more appropriate to invoke energy efficiency (EE), which is typically defined as the number of bits successfully transmitted per joule. This concept is at the heart of green communications, a vision globally recognized for reducing the Carbon footprint produced by the networking sector, especially in the era of 5G, 5G-Advanced, and 6G [
2]. Obviously, EE is the reciprocal of EC. Extensive studies have been devoted to optimizing the EE of wireless networks in the past decade. For instance, in [
3], based on the fractional programming framework, the joint power and subcarrier allocation problem was solved for maximizing the EE of a multi-user, multi-relay, single-cell orthogonal frequency-division multiple access (OFDMA) cellular network composed of single-antenna nodes. Afterwards, system models that are more complicated were considered: the joint transmit and receive beamforming-based multi-user, multi-relay, multi-input multi-output (MIMO)-OFDMA cellular networks [
4,
5]; the multi-cell single-antenna OFDMA networks [
6]; the partial/full interference alignment-based multi-user, multi-relay, multi-cell MIMO-OFDMA networks [
7]; the massive MIMO-aided, multi-pair, one-way decode-and-forward relay system [
8]; and the fully connected
K-user interference channel with each user having either a single antenna or multiple antennas [
9]. Additionally, the EE of wireless networks that are delay-sensitive was also studied by maintaining statistical quality-of-service QoS guarantees in OFDMA networks [
10] and by considering the uplink ultra-reliable low-latency communication (URLLC) traffic in the MIMO-aided grant-free access [
11] of 5G and its beyond. In [
12], secrecy-energy efficient hybrid beamforming schemes were designed for a satellite-terrestrial integrated network in order to maximize the achievable secrecy-EE while satisfying the signal-to-interference-plus-noise ratio (SINR) constraints of both the earth stations and the cellular users; further, in [
13], the secrecy-energy efficient beamforming in multibeam satellite systems was investigated with the metric of signal-to-leakage-plus-noise ratio (SLNR).
Since wireless sensors are typically powered by batteries, it is more appropriate to use EC in the context of wireless sensors. EC is closely related to the selected modulation scheme. Firstly, this selection may influence the type of electronic components utilized, such as a power amplifier (PA) or analog-to-digital converter (ADC), because different modulation schemes may require different circuit designs and implementations. Secondly, the specific choice of modulation schemes also affects the number of bits transmitted in a single symbol duration that consumes a certain amount of energy. Thirdly, different modulation schemes may incur different packet error rates (PERs), which influence the number of retransmissions that also consume energy and are necessary for successful packet delivery between any pair of wireless sensors. Therefore, it is important to investigate the impact of different modulation schemes on the EC and identify the most appropriate scheme for WSNs of SG-IoT.
Prior research mainly focused on studying the EC of modulation schemes that are sensitive to the nonlinearity of PAs. More specifically, in [
14], the EC minimization problems corresponding to
M-ary quadrature amplitude modulation (MQAM) and multiple frequency-shift keying (MFSK) were studied. In [
15,
16], the authors studied the relationship between the total EC per successfully transmitted information bit and the transmission distance while assuming different modulation methods, such as binary phase-shift keying (BPSK), quadrature phase-shift keying (QPSK), and 16QAM. They also studied the average signal-to-noise ratio (SNR) values required to achieve the optimal EC. In [
17], the transmission power of MQAM was optimized by using a particular model to achieve the minimum EC. In [
18], the EC per successfully transmitted bit for modulation techniques including binary frequency-shift keying (BFSK), BPSK, QPSK, 16QAM, and 64QAM was studied under various channel conditions. However, these modulation techniques require the use of linear PAs, which results in lower energy utilization efficiency. In contrast, constant envelope modulation techniques are insensitive to the nonlinearity of PAs; thus, they constitute a promising solution to improving the energy utilization efficiency. However, there is a scarcity of research focusing on the impact of constant envelope modulation techniques on the achievable EC in the context of WSNs underpinning SG-IoT.
Against the above backdrop, in this paper, we endeavor to investigate the impact of a constant envelope modulation technique, i.e., continuous phase modulation (CPM), on the EC of sensor nodes in WSNs suitable for SG-IoT. Our novel contributions are summarized as follows.
We establish a realistic power consumption model through the analysis of circuit power consumption, transmission power consumption, and reception power consumption on a point-to-point communication link; in particular, we consider three operation modes of the sensors, including sleeping mode, transient mode, and active mode.
Based on the above power consumption model and a typical automatic repeat request (ARQ)-based wireless transmission protocol, the EC incurred by successfully sending a single information bit is numerically evaluated under different configurations of CPM parameter values. In particular, we consider different waveform pulses of the CPM, including the rectangular pulse, rising cosine pulse, and GMSK pulse, for comprehensive coverage. We also investigate the impact of the distance between the transmitter and the receiver, the impact of the received SNR, the impact of the modulation order, and the average number of transmissions required for sending a single packet, under various modulation schemes considered.
We compare the EC per successfully transmitted bit of the CPM with that of conventional non-constant envelope modulation methods, such as offset quadrature phase-shift keying (OQPSK) used in the Zigbee standard and QAM modulation supported by the current 5G standard. Our simulation results and analysis demonstrate that CPM enjoys a significantly lower EC than OQPSK and 16QAM in the scenario considered, which is valuable for the standard evolution of beyond 5G tailored for the important use case of low-power SG-IoT.
2. The EC Model
To analytically determine the amount of energy consumed when a single bit is transmitted without error, an EC model needs to be established. We make the assumption that each packet transmitted in the forward direction induces an error-free feedback packet in the reverse direction, which acknowledges the successful reception of the data packet or requests for retransmission.
2.1. Packet Structure
In wireless communication systems, the general format of the physical layer packet structure is shown in
Figure 1 and consists of three parts: a pilot code for clock synchronization, a packet header specifying the configuration of transmission parameters, and a data payload carrying the transmitted data.
We assume that the entire packet uses the same modulation and that the symbol error probability (SEP) is determined by the received SNR
and the modulation scheme adopted. We also assume that symbol errors are independently and identically distributed (i.i.d.) and no channel coding is used, i.e., no redundant bits are added (redundant bits increase EC). If a symbol is erroneously detected at the receiver, the entire packet will be retransmitted until all the symbols in the packet are correctly detected at the receiver. The packet error probability (PEP) can be expressed as a function of the SEP, the packet length
L, and the number of bits per symbol
, i.e.,
where
M denotes the size of the modulation constellation.
Hence, by utilizing the packet length
L and the probability that a packet is successfully transmitted, i.e.,
, the average amount of data successfully delivered per transmission duration of a packet is formulated as
Then, upon assuming that the feedback signal indicating whether a retransmission is needed or not is reliably transmitted on the reverse link, the average number of transmissions required for successfully delivering a packet is given by
2.2. Basics of CPM
CPM is an attractive modulation scheme for WSNs underpinning SG-IoT because its carrier phase is modulated in a continuous manner and it is typically implemented as a constant-envelope waveform, i.e., the transmitted carrier power is constant. The phase continuity requires a relatively small percentage of the power to occur outside of the intended band (e.g., low fractional out-of-band power), leading to high spectral efficiency. Meanwhile, the constant envelope yields excellent power/energy efficiency. However, the primary drawback of CPM is the high implementation complexity required for an optimal receiver.
For systems that employ CPM, the transmitted signal at time instant
t can be expressed as [
19]
where
E is the symbol energy,
T is the symbol interval,
is the carrier frequency, and
is an arbitrary constant initial phase shift that can be set to zero without loss of generality when coherent transmission is considered. In addition,
is the time-varying information-carrying phase formulated as
where
is an infinitely long sequence of uncorrelated
M-ary data symbols, each having one of the values from the alphabet
with equal probability
;
is a sequence of modulation indices defined as
, with
being the peak frequency deviation. When
for all
k, the modulation index remains fixed for all symbols. When the modulation index changes from one symbol to another, the signal is called multi-
h CPM, with
varying in a cyclic manner.
is some normalized waveform shape that represents the baseband phase response (i.e., phase pulse) and is obtained from the frequency pulse
by
If the duration of is equal to the symbol interval T, namely, for , the modulated signal is called full-response CPM. If the duration of is larger than the symbol interval T, namely, for , the modulated signal is called partial-response CPM.
Suppose the length of the frequency pulse
in terms of the number of symbol intervals is
N. Thus,
yields full-response CPM. If
is selected as a rectangular pulse, namely,
then for a full-response CPM, we have
It is evident that the performance of CPM is influenced by certain parameters, including but not limited to M, , N, and the frequency pulse . Note that by choosing different pulse shapes and varying M, , and N, an infinite variety of CPM signals may be generated, each with its unique characteristics and performance.
For a CPM signal, the error rate performance can be derived based on the maximum-likelihood sequence detection (MLSD) receiver, which is conventionally computed using the Viterbi Algorithm (VA). Specifically, for a given CPM scheme, we have
According to Anderson’s seminal work on digital phase modulation [
20],
denotes the total number of feasible paths that satisfy the constraint of the minimum Euclidean distance
within the observation interval on the CPM phase grid. The value of
increases with the modulation order
M. Both
and
depend on critical parameters including
M,
,
N, and the pulse shaping function
.
2.3. Circuit Power Consumption
In wireless communication systems, a significant portion of energy is dedicated to signal transmission and reception circuits, which are mainly composed of the baseband (BB) digital signal processing unit and the radio frequency (RF) signal processing unit, as shown in
Figure 2. To elaborate a little further, the BB signal processing unit mainly includes source coding/decoding, pulse shaping, channel coding/decoding, digital modulation/demodulation, channel estimation, synchronization, and so on. For a wireless sensor, since the data rate requirement is usually low, the BB symbol rate is also low. Meanwhile, typically, no computation-intensive signal processing techniques, such as multi-user detection and iterative decoding, are used in an energy-constrained wireless sensor; hence, the BB power consumption is significantly smaller than the RF circuit power consumption.
A typical model of an RF signal processing unit, also known as an RF chain, is shown in
Figure 3 [
14,
16,
21,
22,
23]. Specifically, on the transmitter side, the BB signal is first converted to an analog signal by the digital-to-analog converter (DAC). Then, the analog signal is filtered by the low-pass filter and upconverted by the mixer, whose output is then filtered again, amplified by the power amplifier (PA), passed through the duplexer, and finally transmitted to the wireless channel. On the receiver side, the RF signal is sequentially filtered, amplified by the low-noise amplifier (LNA), cleaned by the anti-aliasing filter, downconverted by the mixer, filtered again before passing through the intermediate frequency amplifier (IFA) that has an adjustable gain, and finally converted back to a digital signal by the analog-to-digital converter (ADC). Note that the mixers operate with the aid of the local oscillator (LO) and, among all the RF components, the PA and LNA usually have much higher power consumption than the others.
Accordingly, the total power consumption during transmission can be expressed as
where
and
represent the power consumption of the transmitter’s BB processing unit and RF processing unit excluding the PA, respectively, and both of them can be regarded as constant values that are collectively denoted by
;
and
are defined as the power consumption of the PA and the transmit power, respectively, both of which are functions of the transmission distance
d upon assuming adaptive power control and are related via
; and
and
represent the drain efficiency and the peak-to-average power ratio (PAPR) of the PA, respectively [
18].
Similarly, the total power consumption of the receiver is expressed as
where
and
represent the power consumption of the receiver’s BB processing unit and RF processing unit excluding LNA, respectively, and both of them can be regarded as constant values;
represents the power consumption of the LNA, which is also constant upon assuming that the LNA is appropriately designed and biased, so that necessary sensitivity is provided for reliably receiving, demodulating, and decoding a minimum power signal. Hence,
,
, and
are collectively denoted by the constant
.
2.4. Transmission Power Consumption
Due to the path-loss, scattering, reflection, and other phenomena in the wireless channel, a certain amount of energy is inevitably lost during the transmission of electromagnetic waves that carry data symbols, thus resulting in transmission energy dissipation. The path-loss depends on several factors, such as the distance between the transmitter and receiver, the frequency of the signal, the type of antennas used, and the environment through which the signal propagates.
When the signal with a transmit power
is propagated through the wireless channel, the received power
at the receiver can be formulated as
according to Friis’ transmission equation. This expression characterizes the dependency between the received power with respect to several parameters. Specifically, the constant
depends on the transmit antenna gain
, the receive antenna gain
, and the carrier wavelength
. Additionally, the path-loss exponent is
. It is noted that the received power is inversely proportional to the distance
d raised to the power of
, and the received SNR
is proportional to
divided by
. These relationships collectively describe how the aforementioned parameters influence the received power and the received SNR [
18]:
where
W represents the transmission bandwidth,
denotes the power spectral density of the baseband-equivalent additive white Gaussian noise (AWGN), and they are primary determinants of
. Furthermore, the noise figure of the RF front-end of the receiver, denoted as
, and any additional noise or interference, represented by the link margin term
, can also impact the received SNR
.
Accordingly, by substituting Equation (
12) into Equation (
13), the relationship between the transmit power
, the communication distance
d, the received SNR
, and other parameters can be quantitatively expressed as
where we have
.
2.5. EC per Successfully Transmitted Bit
As mentioned in
Section 2, we assume that each packet transmitted in the forward direction is matched by an error-free feedback packet in the reverse direction in order to guarantee reliable transmission. Both directions of transmissions consume energy. The above transmission process, usually incorporating retransmissions, continues until the entire packet of the forward direction is correctly decoded at the receiver. Additionally, we assume that the sensor node transceiver circuitry works in a multi-mode manner: (1) when there are data to transmit or receive, all circuits of the sensor work in the active mode; (2) when neither transmission nor reception are needed, the circuits of the sensor enter sleep mode by default, which uses the minimum possible power (small enough to be negligible) to ensure that the circuits can be activated when necessary; (3) when the sensor is in the period of switching from sleep mode to active mode, it is in a transient mode that also consumes non-negligible energy. Note that the transient duration from active mode to sleep mode is sufficiently short to be neglected but the start-up process from sleep mode to active mode can be slow.
Therefore, for a single round-trip transmission (forward direction transmission and reverse direction feedback) on a point-to-point communication link, the total EC of the system can be divided into two parts: the EC of the forward direction transmission and the EC of the reverse direction feedback. For the forward direction transmission, the EC is composed of the start-up energy consumption
in transient mode (both the transmitting node and the receiving node may be in sleep mode initially), the transmitter energy consumption
in active mode, and the receiver energy consumption
in active mode. Therefore, the EC of the forward direction transmission is expressed as
where
is the transmission duration for sending a data packet in the forward direction and
is the corresponding duration of signal processing at the receiver of the forward direction.
Additionally, the EC of the reverse direction transmission is expressed as
where
is the total power consumption for transmission of a feedback packet in the reverse direction by the receiving node of the forward direction and
is the corresponding time duration. Note that
may be different from
of the forward direction because a feedback packet may have a different transmission rate and reliability requirements compared with a data packet.
is the duration of processing the feedback packet at its receiver (i.e., the transmitting node of the forward direction) in active mode.
Based on the above analysis, we obtain the EC per successfully transmitted bit as
where
is the transmit power for the feedback packet transmission and
is the average number of retransmissions.
3. CPM Parameter Selection
As mentioned in
Section 2.2, the main parameters that affect the performance of CPM are
M,
,
N, and the frequency pulse shaping function
, whilst the minimum squared Euclidean distance
also depends on these parameters. Therefore, it is necessary to determine how these parameters influence the EC per successfully transmitted bit.
In principle, it is possible to implement an infinite number of different CPM signals using various combinations of design parameters. However, for practical implementation, we must consider the trade-off between the achievable performance and the cost incurred. It is well known that partial-response CPM signals usually have better spectral efficiency than full-response CPM signals. However, the computational complexity of the optimal MLSD receiver exponentially increases with
N, which is the length of
in terms of the number of symbol intervals. In this study, we focus on the partial-response CPM with a moderate value of
. The modulation order
M also significantly influences the computational complexity and the requirements for demodulation devices [
24], which can lead to high EC. Therefore, small values, such as
, 4, 8, 16, etc., are generally chosen. We assume the modulation index
, which is also an important parameter and has a complex functional relationship with the minimum squared Euclidean distance
[
24,
25]. Smaller values of
h result in narrower bandwidth, more concentrated signal energy, and narrower transition bands. However, they also make the phase variations less obvious and can increase the complexity of demodulation decisions. For single index modulation,
or
is commonly used. Finally, the pulse shaping function
usually utilizes rectangular pulse (REC), rising cosine pulse (RC), and Gauss minimum-phase shift-keying pulse (GMSK) [
26].
Given the imperative to minimize EC for sensors, the present study focuses on CPM signals that can be implemented with simple devices and require low computational complexity. Accordingly, CPM signals with three different
functions are selected for investigation, assuming
,
, and
. The values of
under these parameter configurations are calculated with the methods given in [
24,
25] and listed in
Table 1.