4.1. Acoustic Sensor
Existing research on pipe leak-detection techniques based on deep learning technology has assumed a scenario where power is continuously supplied, allowing for the transmission of raw acoustic sensor data [
16,
17]. However, this study aims to minimize the amount of data transmitted from low-power wireless acoustic sensor modules to reduce power consumption. Previous studies have primarily utilized acoustic signals within the audible frequency range (below 20 kHz) to detect pipe leaks [
18]. However, in power plants and industrial plants, considerable noise is generated within this audible frequency range, making it challenging to discern leak (anomalous) signals using this method. Therefore, this research applies a technique for detecting pipe leaks using signals in the 20 kHz to 100 kHz range, in accordance with international standard specifications (Class II) [
19].
For low-power wireless sensing, it is crucial to address the issue of reduced acoustic signal-sensing capabilities arising from the use of low-power acoustic sensors and the power consumption associated with wireless communication. To mitigate these issues, we have enhanced signal-detection capabilities through the use of signal amplification. Furthermore, by converting the signal into the spectrum of a Fourier transform, we have successfully represented the characteristics of the signal with a minimal amount of data, thereby drastically reducing the volume of data required for wireless communication.
The low-power sensor module developed for acoustic signal collection is described in
Figure 5. The analog acoustic signals collected from the MEMS microphone (low power mode supply current: 75 µA, sensitivity: −38 dBV/Pa @ 94 dB SPL 1 kHz, frequency response: 100 Hz~80 kHz) are amplified and then converted into digital signals through an analog-to-digital conversion circuit (with a 100 kHz microphone, the sampling rate is 256 kHz). The sampled digital signals are transformed into frequency domain data through a one-dimensional discrete Fourier transform.
The one-dimensional discrete Fourier transform for the discrete function
f(
x), where
x = 0, 1, 2, …,
N − 1, is given by Equation (3),
where
and
.
The digital signals are collected at a high sampling rate of 256 kHz, which means 256,000 samples are taken per second. These collected signals are then divided into smaller segments. Each segment represents a time duration of 1/250 of a second. For each segment, we use 1024 of these samples to perform a one-dimensional discrete Fourier transform. Following the application of Equation (2), the magnitude of each Fourier transform is calculated, and the average values are obtained across 16 segments. The absolute values, representing the magnitudes of the frequency functions, are the spectra from the Fourier transform. This method allows for obtaining average spectral values across the 0–128 kHz frequency range at intervals of 0.25 kHz (equal to 256 kHz divided by 1024). According to the ASTM E 1002-05 standard [
20], only the average spectrum for the frequency range of 20 kHz to 100 kHz is output, excluding information from both the audible and higher frequency bands. Consequently, the final data obtained through the low-power acoustic sensor module, as depicted in
Figure 5, consist of average spectral values of the Fourier transform across the 20 kHz to 100 kHz frequency range at intervals of 0.25 kHz, totaling 320 data points. To reduce the data volume, the average spectral values from the Fourier transform are converted into 4 bytes float type per frequency. Consequently, the amount of data transmitted in a single session from the leak-detection sensor module, including an 8-byte header, amounts to (320 + 8) × 4 = 1312 bytes. If data were transmitted without refinement at a sampling rate of 256 kHz, the volume of data per transmission would amount to 256,000 × 4 = 1,024,000 bytes, or 1000 kilobytes. Therefore, the method proposed in this study reduces the volume of data transmitted by a factor of 1/800 compared to transmitting unrefined data.
4.2. Numerically Controlled Oscillator (NCO)
For the development of low-power acoustic sensing modules, it is essential to operate the frequency at which critical components, such as signal amplifiers and signal converters, function within a lower frequency band. This approach is necessitated by the fact that lower operating frequencies result in reduced current consumption. By bringing the operational frequency of these components to a lower band, it is possible to significantly decrease power usage, thereby enhancing the efficiency and sustainability of the acoustic sensing module. This strategy is crucial in extending the battery life and operational duration of wireless sensing devices, making them more practical for being used as edge sensors. In this case, the necessary mixing frequency signal is generated using the NCO, which produces a square wave rather than a sine wave in our project for simplicity and efficiency. For a square wave signal with a 50% duty cycle, Fourier analysis reveals the presence of odd harmonics such as the 3rd, 5th, 7th, etc. Due to these harmonics of the NCO, the mixing process also outputs signals that coincide with similar frequency vibrations. This overlap makes it challenging to isolate and observe signals within specific frequency bands. To address this issue, we filter the signals above the third harmonic out prior to the mixing process in the original signal. Also, the MCU operates at low power when set to a lower clock frequency and operating voltage. Considering the stable operation of the NCO and the data-processing capabilities, the clock frequency is set to 1 MHz, and it operates at approximately 60 µA from a 1.8 V power supply. This configuration optimizes the energy efficiency of the system while ensuring adequate performance for the required tasks. NCO operates by adding a fixed integer to its value at every clock cycle and altering the output signal when overflow occurs. By changing this integer, the frequency of the output signal can be adjusted. The CPU is incorporated with a 20-bit accumulator, which determines the output frequency based on the state of the 20-bit accumulator as in Equation (4):
In this context, Fosc corresponds to the system clock frequency of 1 MHz. The use of 221 instead of 220 is because the state of the NCO needs to undergo two changes to complete one cycle. The addition value (Vadd; fixed integer), being an integer, introduces a maximum jitter of 1/Fosc, resulting in a deviation from the 50% duty cycle of the generated signal, which consequently produces even harmonics. For the purpose of low power consumption in this system, a 1 MHz system clock was utilized to generate a 5 kHz signal with a maximum duty cycle error of 1%. The magnitude of the 2nd harmonics is approximately 0.01 relative to the magnitude of the generated signal, which is about the noise level when AI is learning or making judgments. The BPF range is 30–80 kHz. The 2nd harmonics fall within this range between 30 and 40 kHz. Even if mixing occurs, the signal itself is very small, and multiplying the small signal by 0.01 for the 2nd harmonics makes it negligible compared to the frequency components we aim to analyze. Even when the 2nd harmonics fall within the BPF range, they are generally smaller than the noise levels found in datasets typically used by AI. Additionally, the 2nd harmonics frequency can be further analyzed using the fundamental frequency.
4.3. Amplifier, Bandpass Filter, Mixer and Lowpass Filter
Amplifiers, Bandpass Filters (BPFs), Mixers and Lowpass Filters (LPFs) are fundamental components that work together to manipulate and refine signals to meet our technical requirements.
Amplifier increases the power of the collected signals, boosting its voltage without altering its original frequency or waveform. This unit amplifies the output signal to detect relative changes, as it is difficult to distinguish between normal and anomalous signals based on the absolute magnitude of the sensor output, which is very small. This enhancement is critical to ensure that the signal can be processed effectively before it is fed to the mixer or ADC in the next stage.
Bandpass filter is responsible for selecting frequencies within the range of our target to pass through while blocking frequencies outside that range. It is used to isolate frequency components of interest from a broader spectrum of frequencies. The microphone’s response range is from 100 Hz to 80 kHz, and we have removed the audible frequency band (100 Hz to 20 kHz) to focus on the 20 kHz to 80 kHz range. The center frequency of this range is (20 + 80)/2 = 50 kHz. We utilized a band-pass filter (BPF) with a center frequency of 50 kHz and a bandwidth of 50 kHz. This filter features a ripple of 0.2 dB and is designed based on a 6th-order Chebyshev filter topology. The Chebyshev filter is chosen for its ability to provide a sharper roll-off and more selective frequency response, making it particularly suitable for applications requiring precise frequency discrimination. This configuration ensures that the filter effectively isolates the desired signal components around the center frequency while minimizing the influence of out-of-band noise.
Mixer combines two input frequencies to produce signals with output frequencies that are the sum
f1 +
f2 and difference
f1 −
f2 of the original input frequencies. This operation is known as frequency mixing [
21]. Frequency mixing enables shifting signals from one frequency band to another to facilitate further processing or transmission.
Lowpass filter allows signals below a certain frequency to pass while attenuating signals above that frequency. We employed a low-pass filter (LPF) with a cutoff frequency of 125 Hz, which corresponds to half of our sampling frequency of 250 Hz. This filter features a ripple of 0.2 dB and is based on a 6th-order Chebyshev design. The choice of a Chebyshev filter allows providing better attenuation of frequencies beyond the cutoff point. This configuration effectively attenuates higher frequency components while preserving the integrity of the signal within the desired frequency range, thereby enhancing the signal-to-noise ratio and improving overall signal clarity.
Figure 6 illustrates the signal that has passed through a bandpass filter, depicting a specific frequency band. For example, this predefined frequency band could range from 20 kHz to 100 kHz, which constitutes the first frequency band. In instances where the target frequency for monitoring is 38 kHz, a numerically controlled oscillator can be configured to generate a signal at this specific frequency of 38 kHz.
Figure 7 depicts the transformation of signals using the mixer to convert the predefined frequency band between 20 kHz and 100 kHz, into the second frequency band by mixing it with the target frequency of 38 kHz generated by the NCO. The mixer can convert two input frequencies into a signal with a frequency that is the sum of the input values. For instance, a 20 kHz signal that has passed through the bandpass filter is added to the 38 kHz frequency from the NCO, resulting in the shifted frequency of 58 kHz. Consequently, the signal, originally spanning 20 kHz to 100 kHz before mixing, is transformed into the frequency range of 58 kHz to 138 kHz after mixing.
Additionally, the mixer can also convert the input frequencies into a signal with a frequency that corresponds to the difference between them. For example, a 20 kHz signal can be reduced by 38 kHz to result in −18 kHz (this negative value indicates the shift below zero frequency, which may be theoretically considered but not practically useful), and a 100 kHz signal reduced by 38 kHz shifts to 62 kHz. Thus, the signals originally between 20 kHz and 100 kHz can also transform to span from −18 kHz to 62 kHz after mixing. Therefore, the signals processed by the mixer may overlap in the frequency bands from −18 kHz to 62 kHz and 58 kHz to 138 kHz. Moreover, any signal within the original 20 kHz to 100 kHz band at the specific frequency of 38 kHz, when reduced by the 38 kHz of the NCO, transforms to 0 Hz, indicating the baseline or null frequency shift corresponding to the pre-mixing monitoring target frequency of 38 kHz.
Figure 8 illustrates the use of the lowpass filter to extract signals from the frequency range of −18 kHz to 138 kHz following mixing, isolating the third frequency band defined by the threshold frequency of 100 Hz or lower. Given that the threshold frequency is set at 100 Hz, the extracted third frequency band encompasses signals ranging from 0 Hz to 100 Hz.
Figure 9 displays the expanded view of the frequency band from 0 Hz to 100 Hz after passing through the lowpass filter. As previously mentioned, 0 Hz corresponds to the pre-mixing monitoring target frequency of 38 kHz. Additionally, 100 Hz can correspond to a pre-mixing frequency of 38.1 kHz. Moreover, according to Equation (1), since
, 100 Hz can also correspond to a pre-mixing frequency of 37.9 kHz. Therefore, processing and analyzing signals within the 0 Hz to 100 Hz frequency band post-mixing is equivalent to handling signals within the 37.9 kHz to 38.1 kHz frequency band pre-mixing.
Consequently, by utilizing amplifiers and filters designed for handling the 100 Hz frequency band instead of those required for the 38 kHz frequency band, it is possible to reduce the power consumption of the wireless monitoring device significantly.
4.4. Microcontroller Unit (MCU), Transmitter and ADC
MCU serves as the central processing unit within the system. In our project, we use 8-bit PIC series with MPLAB X IDE 6.20 platform, consuming 37 μA operating at 1 MHz. After the signal has been conditioned by the previous stage components, the MCU performs several key functions: It executes algorithms to further analyze or modify the signal by digital filtering, error checking and complex data transformations later in the edge server. Based on the processed data, the MCU makes decisions or calculations, such as detecting anomalies or triggering specific actions. In determining the presence of anomalous signals, we set the evaluation criterion to involve analyzing the spectral data between 30 kHz and 45 kHz. The methodology entails summing the spectral data values within this range and computing their average. Given that the data are sampled at intervals of 250 Hz, this results in a total of 320 samples. The threshold for anomaly detection is established based on the average value of the background noise under experimental conditions. For example, a signal can be programmed to be classified immediately as anomalous if its average deviates by 20% or 30% from the background noise average. Otherwise, the signal is considered normal. This criterion allows for a systematic and quantitative assessment of signal integrity, aiding in the accurate identification of potential issues. It also manages the operation of other components within the system, such as adjusting the frequency of the NCO or the operational parameters of the ADC and wireless transmitter.
Transmitter outputs the data to the server. Once the signal is processed and prepared actions are determined, the wireless transmitter’s role is to communicate the data or decisions to other devices or systems. This includes sending processed data, alerts or control messages over wireless protocols to the edge server, and the cloud server.
Analog to Digital Converter (ADC) is crucial for converting the conditioned analog signal into the digital format so that the MCU can process. It converts the analog signal received from the LPF into a digital signal by sampling the signal at 250 Hz rate and quantizing the amplitude into digital values.
This orchestrated operation of amplifiers, filters, A/D converter, NCO, MCU and transmitters is what enables remote, reliable and safe monitoring and control of our target NPP system.