1. Introduction
Energy efficiency is a fundamental metric for all battery-powered devices, such as wireless sensor and actuator network (WSAN) nodes, whose most power-hungry subsystem is usually the RF transceiver. A wake-up receiver (WuRx) is an always-on ultra-low-power receiver which constantly monitors the channel and wakes the node up at the reception of a communication request in order to overcome the trade-off between power consumption and node latency [
1,
2]. WuRxs can be classified depending on their application range. Short-range WuRxs are fully passive and achieve a communication distance limited to few centimeters or meters. Medium-range WuRxs are used in applications requiring a range of, at most, 100 m, their power consumption typically being in the nanowatt range. Long-range WuRxs consume microwatts and can receive packets from kilometers away.
A typical WuRx architecture is composed of two subsystems: an analog front-end (AFE) and a baseband logic. The AFE turns the RF input’s OOK-modulated signal into a stream of bits, whereas the baseband logic generates the wake-up signal upon reception of the correct codeword [
3,
4,
5,
6,
7,
8,
9]. State-of-the-art ultra-low-power WuRxs use oversampling techniques to overcome the phase alignment problem between the received data and internal clock and, with the aim of limiting the power consumption to a few nanowatts, typically use ring or relaxation oscillators. The frequency accuracy of such oscillators is poor (only a few percent), thus limiting the maximum receivable packet length from 8 bits to 63 bits. In Reference [
8], to increase the WuRx sensitivity and minimize the number of false wake-ups due to the noise and uncertainty of the clock frequency, a 2x oversampling scheme and a relaxation oscillator were employed, and an optimal 16 bit code was designed. This was unlike the applications in [
8,
9], which proposed a WuRx able to receive a set of different and longer packets (63 bits). This feature would enable it to also transmit encrypted data, which is a key issue for enhancing the security of WSANs [
9,
10,
11,
12]. The solution proposed in [
9] included a ring oscillator and a 4x oversampling architecture designed to tolerate 13 errors in the received packet, which implied a higher number of false wake-ups compared with [
8]. Similar oversampling techniques were employed in recently proposed WuRxs [
12,
13]. To allow the WuRx to receive long packets with no constraints in terms of false wake-ups, as proposed in [
3], it is possible to employ oversampling circuits in which data sampling is performed using crystal oscillators. They ensure excellent frequency stability and the capability of receiving long data. This is carried out at the cost of a power consumption far above tens of nanowatts, which is not affordable for ultra-low-power WuRxs. As an alternative, it is possible to employ clock and data recovery (CDR) circuits based on Phase Locked Loops (PLLs) [
14]. They ensure phase and frequency alignment between the received data and the clock, with a power consumption far lower than that required by crystal oscillators. However, PLLs need long preamble times (tens of bit times) to settle the clock frequency according to the received data rate, which is not acceptable in the case where the WuRx must also be employed for burst communications. A WuRx with an injection-locked oscillator (ILO) CDR, which guarantees lower preamble times, was proposed in [
15]. Nevertheless, [
15] needed Manchester encoding for the received data, which implies a halving in the data rate to prevent the ILO from going back to its free running mode due to the absence of data transitions. Recently, [
16] proposed a wake-up and data receiver in which the sampling time selection was achieved through a digitally programmable interface, while frequency control was carried out using a frequency-locked loop (FLL). Similar to [
14], it required a non-negligible time to set the clock frequency.
To overcome the issues related to the power consumption, maximum packet length, false wake-up tolerance, preamble time and data encoding, in [
11], a nanowatt WuRx suitable for receiving infinite bits in addition to a codeword targeting medium-range applications at a 1 kbps data rate was proposed. The phase alignment between the received data and the clock was carried out in [
11] using a CDR based on a gated oscillator (GO), which guaranteed a short preamble time and ultra-low power consumption. A similar synchronization scheme was employed in [
17], where the limitation on the maximum number of equal consecutive bits was not taken into account.
In this paper, we present an implementation in STMicroelectronics 90 nm BCD technology and the experimental results of a nanowatt WuRx, enabling the transmission of long codes based on the GO-CDR architecture we proposed in [
11] and including an AFE with MOSFETs operating in the subthreshold region and a calibration circuit for the GO-CDR, which allows the WuRx to process data containing even long streams of equal consecutive bits. The target is a 1 kbps data rate for an OOK-modulated input RF signal.
The remainder of this paper is organized as follows.
Section 2 describes the proposed WuRx architecture with special emphasis on the baseband logic.
Section 3 and
Section 4 present the circuit design and the implementation choices, respectively.
Section 5 shows the measurement results, and finally,
Section 6 concludes the paper.
2. Wake-Up and Data Receiver Architecture
The proposed WuRx is shown in
Figure 1. The always-on AFE was clockless (i.e., it did not need an oscillator), while the baseband logic required a clock to sample the incoming data. This allowed the WuRx to operate in two phases. During Phase 1, the baseband logic was off, whereas the AFE was active. Phase 2 started upon recognition of the first 0-to-1 transition of the message, occurring at the first transition of the AFE output signal. The baseband logic was turned on, and the incoming bitstream was compared with the stored codeword. This approach allowed us to reduce the power consumption of the node if the specific application was characterized by long idle periods, since the baseband logic was off most of the time [
16].
The AFE was composed of an external lumped component input matching network (IMN) followed by an envelope detector (ED) and a comparator, both of which were integrated on the chip.
As indicated in
Figure 1, the proposed data-startable baseband logic included [
11] (1) a GO-CDR, (2) a control logic with addressing capabilities (CL) to generate the wake-up signal and control signals for GO-CDR and (3) a bias and calibration (BC) circuit for the GO-CDR.
As illustrated in
Figure 2a, the purpose of the CDR circuit was to provide a clock to the CL to correctly sample on the positive edges a delayed version of the input data (DDin). As shown in
Figure 2b, ideally, the sampling edges would be placed at the center of each bit time (
). The circuit was composed of three sections: a delay block (DB), an edge detector implemented through an Exclusive NOR (EXNOR) gate and the GO. The EXNOR gate was fed with the data signal, Din, and its delayed version, DDin, resulting in a pulse of the gate signal at each Din transition. When Gate = 1, the GO was in free running mode with a frequency
, while with Gate = 0, it was blocked in a predefined state. When Gate switched from 0 to 1, the GO generated the positive edge of Clock, ideally after
, thus allowing it to clear any phase error accumulated up to that time, even if the free-running clock frequency was not precisely matched to the data rate (
). Therefore, the only constraint of this architecture is on the maximum number of equal consecutive bits (
Nm) that can be correctly sampled.
Nm can be calculated, imposing that no bit is sampled twice (which can occur if
) or not sampled at all (which can occur if
). Defining
, a simplified analysis carried out assuming a start-up time of zero for the oscillator and neglecting the Clock jitter, leads to the following constraints:
This results in
[
11].
In case a Manchester code is employed, which contains a transition in each bit time (i.e., Nm = 2) at the cost of halving the data rate compared with the standard binary encoding, the equation leads to α < 0.2. Such a frequency error upper limit is easily achievable in integrated ultra-low-power oscillators.
To avoid the use of a Manchester code with its associated limitations on the data rate, the proposed architecture included a bias and calibration circuit for the GO-CDR, which reduced α to negligible values and then allowed the WuRx to process data containing long sequences of equal consecutive bits.
5. Measurement Results
The fabricated chip was mounted on a board using a chip-on-board wiring technique, as shown in
Figure 8a.
Figure 8b shows the measurement setup employed for the performance evaluation of the proposed wake-up and data receiver. It included an RF generator for the RF input signal and its OOK modulation. An STM32 Nucleo board (Main Nucleo in
Figure 8b) was used for the generation of the bitstream, programming the SIPO register, processing the output bits generated by the WuRx and managing the calibration cycle. An additional STM32 Nucleo board, as described below, was used to characterize the impact of the gated-oscillator CDR on the WuRx sensitivity.
The input impedance at the SMA connector was characterized by means of a vector network analyzer (VNA) in the 10 MHz–1.5 GHz range (see
Figure 9). The resonance frequency clearly visible around 1.1 GHz was due to the wire inductance and the input capacitance (2.95 pF), which could be ascribed mainly to the pad, as verified by means of an extracted lumped element equivalent circuit. Indeed, in the present implementation, a standard analog pad was used, which needed to be replaced by a low-capacitance RF pad in the final implementation. Due to these limitations, the present prototype did not address the implementation of the input matching network (IMN). Consequently, all measurements shown hereafter were performed with a 50 Ω resistor soldered parallel to the input of the ED and using a commercial coaxial impedance adapter (see
Figure 8b), thus providing a unity gain IMN. Since the AFE response is independent on the RF carrier frequency, all measurements were performed using the 868 MHz European ISM band carrier frequency.
For the sake of completeness, IMNs for different carrier frequencies were designed using the extracted input impedance lumped element model to estimate the obtainable IMN voltage gain. The simulated IMNs were based on an L-shaped inductor-capacitor (LC) stage using inductances with quality factor Q = 80 [
21]. The simulated IMN gains at 100 MHz, 433 MHz and 868 MHz were 24.8 dB, 17.3 dB and 8.3 dB, respectively. The simulated IMN gains needed to be added to the measured circuit sensitivity to obtain the projected WuRx total sensitivity.
First, functional tests were performed to verify correct operation. Then, systematic measurements were accomplished to characterize the missed detection rate (MDR) and the false alarm rate (FAR). Finally, the capability of the WuRx to receive long sequences of data was investigated, and the performance of the bias and calibration circuit was analyzed.
The functional tests revealed problems with the data-startable baseband logic, which operated correctly only for a supply voltage ranging from 0.3 V to 0.5 V (i.e., lower than the nominal 0.6 V). To investigate the precise origin of this unexpected problem, post-layout transistor-level simulations were carried out for different supply voltage values. The simulation results revealed the occurrence of ringing phenomena caused by interline capacitances between the O3 and Clock signals in
Figure 4, which had been underestimated by the extractor. The problem could be suppressed by lowering the supply voltage. Therefore, all measurements shown hereafter have been performed with a 0.4 V supply for the baseband logic.
Figure 10 shows the sample measured waveforms in response to a packet composed of a 3-bit preamble (100) followed by a 16-bit string matching the stored codeword (1011101101010011). This measurement was performed with a −34 dBm RF input sequence at 1 kbps, with a 0.5% clock frequency error (
) measured after calibration. The curves demonstrate that the ED output was the correct envelope of the modulated RF signal, the generated clock sampled the DDin accurately and the baseband logic correctly generated the wake-up pulse.
MDR measurements were performed to evaluate the sensitivity of the WuRx. The MDR is the ratio between the number of missed wake-ups and the total number of sent packets. To evaluate it, the Nucleo was employed to generate 10,000 equal 19-bit packets (identical to the one reported in
Figure 10) separated by 100 ms from each other and to count the number of wake-up pulses. To investigate the impact of the GO-CDR on the sensitivity of the WuRx, an additional Nucleo (see
Figure 8b), synchronized and running in parallel with the main one, was employed to decode the AFE output (Din, see
Figure 1) with an external precisely timed clock and to compare the received stream with the one transmitted by the main Nucleo. The difference between the MDRs computed by the two Nucleo boards was a measure of how far the proposed GO-CDR affected the WuRx sensitivity. The MDR results are reported in
Figure 11. The measurements were performed by changing the power of the input RF signal and adjusting the AFE comparator threshold accordingly with a 0.5% GO free-running frequency error measured after calibration. Measurements were repeated for correlator thresholds equal to 16/16, 15/16 and 14/16.
The Nucleo dedicated to decoding the AFE output was programmed consistently. The input power corresponding to MDR = 10−3, when the received data was processed by GO-CDR, was PIN = −35.75 dBm for the 16/16 case and PIN = −36 dBm for the 14/16 and 15/16 cases. The Nucleo that decoded the Din with an external clock counted an MDR = 10−3 for PIN = −36.25 dBm for all correlator thresholds. Therefore, the use of the proposed GO-CDR circuit affected the sensitivity of the WuRx at MDR = 10−3 for 0.5 dBm. The same measurement procedure was repeated with different codewords by varying the number of consecutive zeros and ones, the correlator threshold and the codeword length. The measured MDR = 10−3 was always found for PIN = −35.75 dBm which, as for the aforementioned measurements, was affected by GO-CDR for 0.5 dBm.
In the 16/16 case, the total sensitivity at MDR = 10−3 referred to the input of the IMN, which included the projected IMN voltage gain, as explained above, of −60.5 dBm, −53 dBm and −44 dBm at 100 MHz, 433 MHz and 868 MHz, respectively.
To measure the false alarm rate (FAR), which is defined as the number of false wake-ups per hour due to the noise present in the receiver, the input of the coaxial impedance adapter was closed on a 50 Ω resistance. Typically, a FAR ≤ 1/h is considered acceptable [
5]. The Nucleo was used for counting the number of false wake-ups. The correlator was programmed with the 14/16 threshold, the AFE comparator threshold V
THR was set to the value corresponding to P
IN = −35.75 dBm, and the clock frequency error measured after calibration was 0.5%. Measurements were performed for 24 h time windows, resulting in zero overall false wake-ups.
To evaluate the WuRx capability to receive long sequences of data, additional MDR measurements were performed by sending 3174 equal 63-bit packets (for a total of 199,962 transmitted bits) separated by 100 ms from each other. All the transmitted packets contained a sequence of 20 consecutive ones. As reported in
Section 4, the 63-bit packet length was limited in the present prototype by the chosen maximum timeout value of the baseband logic (see
Figure 5). To perform these measurements, the output stream of the baseband logic (DDin) was sampled by the main Nucleo, using the clock generated by GO-CDR with a 0.5% frequency error after calibration (see
Figure 8b). As for the previous MDR measurements on 16-bit codewords, an additional Nucleo was employed to decode the AFE output with an external clock and then to compare the received stream with the one transmitted by the first Nucleo. Measurements were repeated with thresholds on the received bits equal to 63/63 and 58/63. In case the received sequence was processed by GO-CDR, an MDR = 10
−3 was found for P
IN = −35 dBm and P
IN = −35.5 dBm for the 63/63 and 58/63 cases, respectively. When the received sequence was decoded off-chip by the external MCU clock, an MDR = 10
−3 was found for P
IN = −36.25 dBm in either threshold case. Therefore, the use of the on-chip clock degraded the WuRx sensitivity by 1.25 dBm. This measured packet sensitivity differed from the 16-bit code sensitivity in
Figure 11 by 0.75 dBm, thus demonstrating the GO-CDR capability to also process long data streams. These results lead to the conclusion that the sensitivity is limited by the AFE.
Measurements were repeated in the 63/63 threshold case by varying the number of consecutive zeros and ones from 1 to 63 bits. In any case an MDR = 10−3 was found with PIN = −35 dBm.
Finally, measurements were performed to test the bias and calibration circuit (
Figure 7), supplying the GO with the nominal V
DD = 0.6 V. The main Nucleo was used to generate the reference clock (Clock_ref) for the frequency detector and manage the control signals (start_calib, end_calib) (see
Figure 8b). The current Ibias was set to get an initial frequency error between −20% and +20% relative to the nominal frequency (1 kHz), and a calibration cycle was performed for each value of Ibias.
Figure 12 shows the measured mean frequency error post-calibration, evaluated over 2000 clock periods. The maximum frequency error after calibration was limited to 0.5%, which was consistent with the simulation results. In these conditions, the GO-CDR was tested in terms of the maximum number of equal consecutive bits (
Nm). To perform this measurement, 63-bit packets, characterized by a variable number of zeros and ones, were provided to GO-CDR (i.e., excluding the AFE) by the Nucleo. The same Nucleo was used to sample the DDin using the clock generated by GO-CDR. The measurements revealed
Nm = 63 bits, thus demonstrating that GO-CDR was able to process packets even in case where they were made of all zeros or ones. With a 1% clock frequency error,
Nm decreased to 50 bits. These results were consistent with both Equations (1) and (2) and the simulation results, which projected
Nm~100 bits and 50 bits with α = 0.005 and 0.01, respectively (i.e., far above the maximum packet length of the WuRx). Furthermore,
Nm was not affected by the noise in the GO. The clock rms jitter was found to be 3 µs, thus revealing that
Nm was only affected by the free-running GO-CDR frequency error.
6. Discussion and Conclusions
This paper presented a nanowatt WuRx which enabled nodes to receive long data streams in addition to a wake-up codeword. It included an always-on clockless AFE and a data-startable baseband logic based on a gated oscillator clock and data recovery (GO-CDR) circuit. GO-CDR ensured phase alignment between the received data and clock with nanowatt power consumption, thus avoiding the use of power-hungry PLLs or crystal oscillators. Any free-running frequency mismatch between the GO and bitrate did not limit the number of receivable bits, but rather only the maximum number of receivable equal consecutive bits (Nm). To overcome this limitation, the proposed system included a frequency calibration circuit.
The proposed architecture was fabricated in STMicroelectronics 90 nm BCD technology. The circuit was supplied with 0.6 V, and the overall power consumption, excluding the calibration circuit, was 12.8 nW during the rest state and 17 nW at a 1 kbps data rate. Measurements on the GO-CDR calibration circuit revealed that, starting from a ±20% initial error, the maximum free-running frequency error after calibration was ±0.5%. In these conditions, the GO-CDR correctly sampled packets even if they were made of all zeros or ones. In the same conditions, with a 100 MHz RF carrier 1 kbps OOK modulated input, a 10−3 missed detection rate (MDR) with a −60.5 dBm sensitivity (including the projected input matching network gain) was measured, transmitting 16-bit codewords and tolerating 0 errors. The WuRx sensitivity was mainly limited by the AFE. A comparison with an experimental setup where sampling and correlation were performed by an external MCU with precise clock showed that the GO-CDR reduced the WuRx sensitivity by 0.5 dBm. Furthermore, it has been verified through measurements that WuRx received, with MDR = 10−3, 63-bit packets, even if they were made of all zeros or ones, with a 0-bit error tolerance and a −59.8 dBm sensitivity (including the projected input matching network gain). In this case, the GO-CDR affected the sensitivity for 1.25 dBm. Finally, the WuRx false alarm rate (FAR) was measured for 24 h time windows, resulting in zero overall false wake-ups.
Table 1 summarizes the system performance and compares it with other state-of-the-art WuRxs reported in the literature. When we compare the Figure-of-Merit (FoM), which is conventionally defined to take into account the sensitivity normalized to the bitrate and the power consumption, it can be observed that our implementation provided similar performance compared to other state-of-the-art WuRxs. However, it must be remarked that the sensitivity is determined essentially by the AFE, which is not the main focus of this paper. Therefore, we do not comment further on this point.
Table 1 shows that the proposed WuRx provides state-of-the-art performance in terms of the maximum packet length, error tolerance and maximum number of equal consecutive bits. Oversampling techniques, such as those in [
5] and [
9], exhibit limitations on the maximum packet length (11 bits and 63 bits, respectively) but do not set a constraint on
Nm. It must be noticed that [
9] showed the only WuRx which achieved the same packet length as the wake-up and data receiver we propose (i.e., 63 bits). In Reference [
9], a 13-bit error tolerance was accepted, while in our implementation, the same packet length was achieved with 0 errors and was only limited by the timeout register size. Furthermore, in [
9], the sensitivity was evaluated with MDR = 20 × 10
−3 and FAR < 1/h, while as reported above, we characterized the performance of the proposed WuRx with more stringent constraints (i.e., MDR = 10
−3 and FAR = 0).
In conclusion, we believe that the proposed scheme is well suited for ultra-low-power WuRxs with the capability to receive long streams.