1. Introduction
An electrical power system (EPS) encompasses subsystems such as generation, transmission, distribution, the market, and users. An EPS is among the most complex systems built by mankind; as the costs of communication systems and sensors have fallen rapidly, millions of sensors have been installed in all of its areas. The electricity consumption of EPSs is managed by measurement and control devices with suitable levels of communication, resilience, reliability, and safety, and with the capacity to adjust to loads that can vary frequently. A smart grid manages all interactions between physical and computational components and systems that operate in parallel. For the system's management, not only must electrical variables be considered but also commercial models, economic opportunities, technologies, and regulatory policies, resulting in a new smart electrical network based on distributed systems, such as energy management systems (EMS), demand response management systems (DRMS), advanced distribution management systems (ADMS), advanced metering infrastructures (AMI), distributed energy resource management systems (DERMS), etc. [1]. These high-penetration monitoring systems, depending on the number of measurement points and the sampling rate, can generate zettabytes (ZB) of data that must be processed, transmitted, and stored. By the end of 2022, the amount of data generated in China by the IoT was estimated to reach 10 ZB [2].
Related Work
This section presents the state-of-the-art in big data generated in electrical systems, together with the most important methodologies used for the compression of electrical signals in power quality management:
Signal compression is classified into two approaches according to data loss: lossy compression and lossless compression. In many works, both are combined to improve compression ratios. Lossless compression is based on Delta, Run-Length, Lempel–Ziv–Welch, and Huffman encoders. The compression ratios obtained are lower than those obtained with lossy compression, which eliminates noisy or redundant data [3,4]. For lossy compression, the most widely used techniques are orthogonal transforms such as the Discrete Cosine Transform (DCT), the Discrete Fourier Transform (DFT), and the Wavelet Transform (WT). The signals are decomposed according to their frequencies, and a threshold determines the information retained for analysis [5].
In the research presented in [5], the authors applied a wavelet transform and windowing to remove repeated signal segments. Among the electrical signals analyzed were flicker, sag, and swell, and the compression ratio was close to 800:1.
In the research shown in [3], the authors presented a compression method for electrical signals based on the Fourier transform, its components, and an adaptive threshold obtained through mathematical morphology. The compression ratio obtained was 33.33:1.
The authors in [6] studied the big data architecture developed for the electrical power system. The paper proposed the use of technologies to manage big data (cloud computing, the Internet of Things, mobile internet, artificial intelligence, and blockchain), introducing power big data as a new asset of Chinese electricity companies.
In research on an electrical power emergency warning mechanism based on meteorological big data [7], the authors presented protection mechanisms against natural disasters, implemented as early detection systems that use measurement equipment and generate large amounts of data.
The researchers in references [8,9] described the structure of a metering system and the working principles of smart meters, presenting a big data analysis along with data collection and data security.
In the work presented in [10] on a reliability evaluation of electric power communication networks based on big data, the authors presented a communication network designed to transmit large amounts of information while complying with reliability standards.
The authors in paper [11] used greedy algorithms such as Orthogonal Matching Pursuit (OMP), which builds a dictionary of orthogonal bases that represent the original signal sparsely. The best results were obtained with a maximum compression ratio of 14:1.
In reference [12], the authors addressed the compression of power quality signals such as harmonics, flicker, and transients. The compression obtained using the DTCWT was 84% for voltage sags, 88% for voltage swells, 83% for flicker, 69% for transients, and 20% for harmonics.
In reference [13], the researchers implemented improvements in the classification and compression of power quality signals. The main results were improvements in segmentation and classification, with compression ratios of 25:1 and a performance of 56% compared to traditional compression techniques.
The authors in reference [14] presented a method for restoring signals with information losses in power transmission lines using compressed sensing techniques such as Basis Pursuit, Matching Pursuit, and Orthogonal Matching Pursuit. Signals were restored using a dictionary containing the most relevant information about them. These techniques allowed signals with 30% of samples randomly lost to be recovered in times between 1 and 10 s.
In reference [15], the authors proposed a signal compression technique based on the Regularization Sparsity Adaptive Matching Pursuit (RCoSaMP) algorithm using measurements from power quality loggers. The stored data size was about 9.25 MB and, after compression, 16.22 kB; using a new detector, a 72:1 compression ratio was obtained.
The researchers in reference [16] analyzed the big data that must be transmitted by communication equipment to HMI control centers. The authors proposed a lossy compression technique called FF0; the compression results ranged from 65% to 99%.
The authors in reference [17] presented a technique for denoising and lossy compression of signals using the wavelet transform, with Shannon entropy used to calculate the basis of a signal. The results were close to 91% compression with high RTE, NMSE, and COR rates.
The authors in reference [18] proposed a signal compression method applied to electrical power quality using genetic algorithms and neural networks. The signals were obtained from digital fault recorders (DFRs) and digital protection units, and the best compression ratio obtained was 24:1.
In reference [19], the researchers proposed an electrical signal compression method based on the detection of anomalies in hierarchical blocks of samples and subsamples, applying a simple iterated delta filter and a streaming data pipeline. The best result obtained was 45.58:1.
The authors in references [20,21] proposed a lossy compression method for electrical signals that uses a Fourier transform to find the harmonic components in each cycle of a signal, builds a matrix of the spectral variations of the signals, and eliminates the repeated components of the signal matrix. The best result reported was 10,780:1. This result is inconsistent with the rest of the literature review, considering that the lossy compression is applied based on thresholds on the frequencies obtained by the Fourier transform. Furthermore, the Fourier transform presents problems with time-varying signals, since it does not provide the windowing that wavelets do. For this reason, the best result considered in this research was 570:1.
This article is organized as follows. Section 2 presents the formulation of the problem. Section 3 presents the simulation and the results. Section 4 analyzes the results of the model. Finally, Section 5 presents the conclusions of the research.
2. Problem Formulation
As shown in the above summary of the state-of-the-art in big data, there is a clear trend towards installing measurement equipment at all levels of electrical systems, from generation to the consumer, generating enormous amounts of information. The electrical signals are sampled at high frequencies and stored in vectors whose domain is expressed in terms of time, frequency, or power and whose range is expressed in terms of the values of voltage, current, etc. The resolution of the signal depends on the quantization bits of the analogue-to-digital converters (ADCs), forming vectors of discrete samples.
The inclusion of numerous metering devices in the residential, commercial, and industrial sectors generates reliable electrical networks with a low environmental impact and with financial benefits, supporting informed decisions based on a real-time electricity market.
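As a simple illustration, such a vector can be built with the sampling setup later used in Section 3 (the 60 Hz fundamental and the 170 V peak are assumptions made for this example):

% Illustrative sampled voltage vector: 200 kHz for 0.33 s, as in Section 3.
% The 60 Hz fundamental and 170 V peak amplitude are assumed for the example.
fs = 200e3;                   % sampling frequency in Hz
t  = (0:1/fs:0.33 - 1/fs)';   % time vector with 66,000 samples
x  = 170 * sin(2*pi*60*t);    % one phase of a voltage waveform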
2.1. Lossy Compression
The answer to the problem of the use of lossy compression was proposed based on the representation of signals on their orthogonal bases, such that $f \in L^{p}(\mathbb{R})$ for $0 < p < \infty$. In numerical analysis, wavelets are used as a tool for solving partial differential equations and for applying linear operators to arbitrary functions, with applications in sound engineering, image processing, and signal processing.
An orthonormal wavelet basis for $L^{2}(\mathbb{R})$ is a family of functions:

$$\psi_{j,n}(t) = \frac{1}{\sqrt{2^{j}}}\,\psi\!\left(\frac{t - 2^{j} n}{2^{j}}\right), \quad (j,n) \in \mathbb{Z}^{2}$$

By translation and dilation of the mother wavelet $\psi$, any function $f \in L^{2}(\mathbb{R})$ can be expressed as a function of the wavelet $\psi_{j,n}$:

$$f = \sum_{j \in \mathbb{Z}} \sum_{n \in \mathbb{Z}} \langle f, \psi_{j,n} \rangle\, \psi_{j,n}$$

Maintaining the $L^{2}(\mathbb{R})$ equality, the wavelet coefficients are calculated by the scalar products:

$$\langle f, \psi_{j,n} \rangle = \int_{-\infty}^{+\infty} f(t)\, \psi_{j,n}^{*}(t)\, dt$$

The biorthogonal wavelet for $f \in L^{2}(\mathbb{R})$ can be represented as:

$$f = \sum_{j \in \mathbb{Z}} \sum_{n \in \mathbb{Z}} \langle f, \tilde{\psi}_{j,n} \rangle\, \psi_{j,n}$$

Let $\mathcal{D} = \{\phi_{\gamma}\}_{\gamma \in \Gamma}$ be a dictionary of vectors having unit $\ell^{2}$-norm, i.e., $\|\phi_{\gamma}\| = 1$ for all $\gamma \in \Gamma$. This dictionary is supposed to be complete, which means that it includes $K$ linearly independent vectors that define a basis of the signal space $\mathbb{R}^{K}$.
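As an illustration, the following minimal MATLAB sketch decomposes a signal on the biorthogonal basis used later in Algorithm 1 (a column signal vector x and the Wavelet Toolbox are assumed):

% Six-level biorthogonal wavelet decomposition of a signal x
% ('bior1.1' basis, as in Algorithm 1; Wavelet Toolbox assumed).
level = 6;
[C, L] = wavedec(x, level, 'bior1.1');  % coefficients and bookkeeping vector
A = appcoef(C, L, 'bior1.1', level);    % approximation coefficients at level 6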
2.2. Compressive Sampling Matching Pursuit (CoSaMP)
For vectors that represent sparse and compressible signals in $\mathbb{R}^{N}$, the $\ell_{0}$ "quasinorm" is defined as:

$$\|x\|_{0} = |\mathrm{supp}(x)| = \#\{\, j : x_{j} \neq 0 \,\}$$

A signal $x$ is $s$-sparse when $\|x\|_{0} \leq s$. Real signals can be transformed into compressible signals, which means that their entries decay rapidly when sorted by magnitude. As a result, compressible signals closely approximate sparse signals, for example, on orthonormal bases such as a Fourier or wavelet basis.
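As a small illustration (the coefficient vector c is made up for the example), the quasinorm and an s-sparse approximation can be computed as follows:

% l0 "quasinorm" and s-sparse approximation of a coefficient vector.
c = [0; 2.1; 0; 0; -0.7; 0; 0.05; 0; 1.3; 0];  % made-up example vector
l0 = nnz(c);                          % ||c||_0: number of nonzero entries
s = 2;                                % target sparsity
[~, ord] = sort(abs(c), 'descend');   % order entries by magnitude
cs = zeros(size(c));
cs(ord(1:s)) = c(ord(1:s));           % keep only the s largest entries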
CoSaMP is a greedy pursuit algorithm that integrates ideas from combinatorial algorithms to ensure speed and to provide accurate error bounds. The most important information of the signals is stored in the discrete vector $s$, whose values are the weights of the atoms. The reconstruction of the signal is carried out with an alternative technique to Nyquist sampling, since the vector $s$ is a representation of the original signal with sparse data. To reconstruct the original signal $x$, a vector $s$ is required that is the sparse representation of the original signal, multiplied by a dictionary matrix $\Phi$, whose columns are the orthogonal bases of the original signal. The number of rows of the dictionary is much smaller than the number of columns, fulfilling $M \ll N$, plus an approximation error $e$. Finally, $x = \Phi s + e$, where $\|s\|_{0} \leq k$.
For a given precision parameter $\eta$, the CoSaMP algorithm produces a $2s$-sparse approximation $a$ that satisfies:

$$\|x - a\|_{2} \leq C \cdot \max\!\left\{ \eta,\; \frac{1}{\sqrt{s}}\,\|x - x_{s/2}\|_{1} + \|e\|_{2} \right\}$$

where $x_{s/2}$ is a best $(s/2)$-sparse approximation to $x$. The running time is $O(L \cdot \log(\|x\|_{2}/\eta))$, where $L$ bounds the cost of a matrix–vector multiplication with $\Phi$. The working storage is $O(N)$.
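The paper does not give an explicit listing of the solver, so the following is a minimal MATLAB sketch of CoSaMP after Needell and Tropp; the function name and the stopping parameters maxIter and tol are illustrative, not the authors' exact implementation:

function s = cosamp(Phi, y, k, maxIter, tol)
% Minimal CoSaMP sketch: proxy identification, support merging,
% least squares, and pruning, as described above.
%   Phi : M-by-N sensing/dictionary matrix
%   y   : M-by-1 measurement vector
%   k   : target sparsity (number of atoms)
    N = size(Phi, 2);
    s = zeros(N, 1);                        % current sparse estimate
    r = y;                                  % current residual
    for it = 1:maxIter
        proxy = Phi' * r;                   % form the signal proxy
        [~, idx] = maxk(abs(proxy), 2*k);   % 2k largest proxy entries
        T = union(find(s), idx);            % merge with current support
        b = zeros(N, 1);
        b(T) = Phi(:, T) \ y;               % least squares on merged support
        [~, top] = maxk(abs(b), k);         % prune to the k largest entries
        s = zeros(N, 1);
        s(top) = b(top);
        r = y - Phi * s;                    % update the residual
        if norm(r) <= tol * norm(y)         % halt when the residual is small
            break
        end
    end
end

For example, given measurements y = Phi*x of a 3-sparse vector x with Phi = randn(20, 52), the call s = cosamp(Phi, y, 3, 50, 1e-6) recovers an estimate of x.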
2.3. Reconstruction Quality Metrics
The compression process has metrics that allow analyzing the quality of the processed signals, taking the reconstructed signal $\hat{x}$ and comparing it with the original signal $x$. The two vectors must have the same dimension, that is, $x, \hat{x} \in \mathbb{R}^{N}$, so that they have the same number of samples representing the values of the signal in discrete time. The equations that allow determining the quality of a signal are presented next:

Normalized mean-squared error (NMSE): the result between the original signal and the reconstructed signal must be as close to 0 as possible. The equation is presented below:

$$\mathrm{NMSE} = \frac{\|x - \hat{x}\|_{2}^{2}}{\|x\|_{2}^{2}}$$

Correlation (COR): the result between the original signal and the reconstructed signal must be as close to 1 as possible, where the operator "$\cdot$" is the inner product of the vectors. The equation is shown as follows:

$$\mathrm{COR} = \frac{x \cdot \hat{x}}{\|x\|_{2}\,\|\hat{x}\|_{2}}$$

Percentage of retained energy (RTE): the result between the original and the reconstructed signal should be as close to 100% as possible. The equation is shown as follows:

$$\mathrm{RTE} = 100 \cdot \frac{\|\hat{x}\|_{2}^{2}}{\|x\|_{2}^{2}}$$
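Assuming x (original) and xr (reconstructed) are column vectors of equal length, the three metrics translate directly into MATLAB:

% Quality metrics between the original signal x and the reconstruction xr.
nmse = norm(x - xr)^2 / norm(x)^2;         % NMSE: best when close to 0
cor  = dot(x, xr) / (norm(x) * norm(xr));  % COR: best when close to 1
rte  = 100 * norm(xr)^2 / norm(x)^2;       % RTE: best when close to 100%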
2.4. Proposed Algorithms
This research proposed three algorithms. Algorithm 1 was developed to extract signal characteristics such as the zero-crossing index, the maximum amplitude, and the number of samples per period, thus allowing any type of signal with different sampling rates or amplitudes to be compressed. The first step is the acquisition of data from any measurement device. The data can contain different signals, such as the voltage and current measurements of three phases at the beginning and end of a transmission line along with time, for a total of 13 signals that are arranged in a matrix called Electrical Signal "ES".
For Algorithm 1 to be executed dynamically regardless of the number of signals, the number of rows and columns of the matrix is calculated, and the values are stored in Row Electrical Signal "RES" and Column Electrical Signal "CES". The second step is the extraction of signal characteristics such as the zero-crossing indices, whose values are stored in Start Zero-Crossing "SZC" and Final Zero-Crossing "FZC". Identifying the zero-crossing indices allows the algorithm to quantify the number of samples per cycle "NS" and to determine the maximum amplitude of each signal "ML" and its location index "IL".
The third step is to determine the orthogonal bases of each signal using the six-level biorthogonal wavelet transform. "C" contains the wavelet decomposition, and "L" contains the number of coefficients per level. The approximation coefficients at level N are then calculated using the wavelet decomposition structure [C,L]; the result is the compressed signal "CS".
It must be taken into consideration that the execution time of Matching Pursuit depends on the number of samples of the original signal, and its time complexity in big O notation grows accordingly. It is for this reason that the signal is first compressed by applying a wavelet transform before applying compressed sensing, obtaining low compression and reconstruction times. The fourth step separates the time vector from the signals by taking the first and last time values along with the size of the vector and storing them in the Compression Time "CT".
Next, the size of the new Compressed Data CD matrix that contains only the signal samples is calculated, storing the values in the Row Compressed Data “RCD” and Column Compressed Data “CCD”. The fifth step is the same as that performed in the second step. The sixth step creates an identity matrix of size NS, which is the number of samples per cycle, and stores it in the Identity Matrix “ID”, and then creates the transposed Discrete Cosine Transform matrix of size NS and stores it in the variable Psi. With these two matrices, the Phi sensing matrix is formed. Next, values are assigned to certain parameters that are required by Matching Pursuit.
Algorithm 1 Feature extraction and orthogonal bases
1: Step 1: Acquire data from a database file
2: ES = load('filename')
3: [RES, CES] = size(ES)
4: Step 2: Feature extraction
5: for i = 1:CES
6:   SZC(i) = ES(:,i) < 0 and ES(:+1,i) > 0
7:   FZC(i) = ES(:,i) > 0 and ES(:+1,i) < 0
8:   NS(i) = SZC(i) - FZC(i)
9: end for
10: for i = 1:CES
11:   [ML(i), IL(i)] = max(ES(SZC(i):FZC(i), i))
12: end for
13: Step 3: Orthogonal bases
14: for i = 1:CES
15:   [C, L] = wavedec(ES(:,i), level, 'bior1.1');
16:   CS(:,i) = appcoef(C, L, 'bior1.1');
17: end for
18: Step 4: Split the time vector from the data
19: CT = [ES(1,1) ES(end,1) RES]
20: CD = CS(:,2:end)
21: [RCD, CCD] = size(CD)
22: Step 5: Feature extraction
23: for i = 1:CCD
24:   SZC(i) = CD(:,i) < 0 and CD(:+1,i) > 0
25:   FZC(i) = CD(:,i) > 0 and CD(:+1,i) < 0
26:   NS(i) = SZC(i) - FZC(i)
27: end for
28: for i = 1:CCD
29:   [ML(i), IL(i)] = max(CD(SZC(i):FZC(i), i))
30: end for
31: Step 6: Compressed sensing
32: ID = eye(NS)
33: Psi = dctmtx(NS)'
34: Phi = [ID Psi]
35: k = 4
36: SV = CompressiveSamplingMatchingPursuit(Phi, k)
37: CSMP = SV(CD)
38: Data.CT = CT
39: Data.CSMP = CSMP
For example, the minimum number of atoms (variable k) necessary to carry out an acceptable reconstruction is three, compared to the fifty-two samples per cycle of the signal. Next, the Compressive Sampling Matching Pursuit algorithm is instantiated with the Phi matrix and the value of k and is stored in the solver variable SV. The signal compression is performed by entering each cycle of the signal into the SV solver, and the result is a sparse vector of size 1 × 52, in which only one value is present and the remaining fifty-one are zeros; these vectors are stored in the Compressed Signal Matching Pursuit CSMP variable. Finally, in order not to store an array of zeros, the values obtained, together with their position indices and the time, are stored in an array of arrays, which is a structure called Data.
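A possible sketch of this storage step is shown below; the field names pos and vals are illustrative (Algorithm 1 stores the equivalent CSMP result directly):

% Store only the nonzero atoms plus the compressed time information,
% instead of a vector that is mostly zeros (illustrative field names).
[pos, ~, vals] = find(CSMP(:));  % positions and values of nonzero atoms
Data.CT   = CT;                  % [start time, end time, sample count]
Data.pos  = pos;                 % atom positions within each cycle
Data.vals = vals;                % atom amplitudes
save('Data.mat', 'Data')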
Algorithm 2 presents the steps that must be performed to reconstruct the signal. The first step is the acquisition of the data from the Data structure. It must be taken into account that the only values stored in CSMP are the few atom values and the indices that represent their locations in the matrix of zeros in each period of the signal. The second step is the signal reconstruction.
The reconstruction of each period is conducted by multiplying CSMP by the Phi matrix, thus obtaining the CS signal. If the application requires the same amount of data as the original signal, ES, interpolation must be performed, thus obtaining a reconstructed signal with the same size as the original signal, which is stored in Signal Recovery SR. The CT values are the start time, end time, and number of samples of the original signal, which allow the creation of a time vector equal to the original one. Finally, the signal quality metrics NMSE, COR, and RTE are calculated between the original signal and the reconstructed one.
Algorithm 2 Reconstruction
1: Step 1: Data acquisition
2: load Data
3: CT = Data.CT
4: CSMP = Data.CSMP
5: Step 2: Reconstruction
6: CS = Phi * CSMP
7: SR = interpft(CS, CT(3));
8: TR = linspace(CT(1), CT(2), CT(3))
9: NMSE
10: COR
11: RTE
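Putting both algorithms together for a single cycle, a hedged end-to-end sketch (reusing the illustrative cosamp function from Section 2.2 and an assumed NS-by-1 cycle vector named cycle; dctmtx requires the Image Processing Toolbox) could look as follows:

% End-to-end sketch for one 52-sample cycle (illustrative only).
NS   = 52;                                   % samples per compressed cycle
Phi  = [eye(NS) dctmtx(NS)'];                % sensing dictionary (Algorithm 1)
s    = cosamp(Phi, cycle, 4, 50, 1e-6);      % k = 4 atoms; cycle is NS-by-1
rec  = Phi * s;                              % reconstructed cycle (Algorithm 2)
nmse = norm(cycle - rec)^2 / norm(cycle)^2;  % quality check from Section 2.3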
3. Results
The power quality variables analyzed were swell, sag, flicker, triphasic fault, and steady state. For the measurement of very fast transients, such as flicker or atmospheric discharges of short duration, impulses between 50 ns and 1 ms must be captured. The established sampling frequency was 200 kHz for 0.33 s, generating 66,000 samples per phase. The matrix formed for each electrical disturbance had a size of 66,000 × 4 and contained the time along with the measurements of the three phases R, S, and T; its disk size was 1,889,931 bytes. The equipment on which the entire signal compression process was carried out was a laptop with an Intel(R) Xeon(R) E-2176M CPU @ 2.70 GHz and 64 GB of RAM; the GPU of the graphics card was used for parallel processing.
Figure 1 shows, in the upper-left part, different types of electrical faults, where the signals are represented in the time domain and displayed by cycles. The number of samples for each signal was 52 per cycle. The remaining panels show the results of the proposed algorithms as the index k, the number of atoms in the signal, is modified: the upper-right part presents the reconstruction with k = 1, the lower-left part with k = 5, and the lower-right part with k = 10. In each case, the indicated minimum number of atoms was sufficient to reconstruct the original signal with a certain degree of error.
The results obtained are presented below. Figure 2, Figure 3, and Figure 4 show the compression results for the steady-state, swell, flicker, triphasic fault, and sag signals. The index k was varied from 1 to 10. When k = 1, the optimization process calculated the single most representative atom; as the index k increased, the number of atoms increased in steps of one. It can be seen that as the index k increased, the RTE, NMSE, and COR metrics improved, the compression times (TC) and restoration times were reduced due to the shorter optimization process, and the final file size increased while the compression ratio decreased.
Figure 2 shows the statistical results of the 50 simulations performed using different power quality signals and by varying the number of atoms. The retained energy percentage reached optimal levels from k = 4, as shown in the results, with an RTE between 98.2% and 99.7%.
Figure 3 shows the similarity degree of the statistical results between the original and reconstructed signals. The correlation of the 50 simulations carried out is presented, and the optimal levels were achieved from k = 5, as shown in the results, with a COR between 99.6% and 99.8%.
Figure 4 shows the statistical results of the degree of the normalized mean-square error between the original signal and the reconstructed signal. The NMSE of the 50 simulations carried out is presented, and the optimal levels were achieved from k = 4, as shown in the results, with an NMSE between 0.0028% and 0.017%.
Table 1 shows the average compression and reconstruction times, the final size of the signal, the compression percentage, and the average compression ratio of the different power quality signals, with the number of atoms ranging from one to ten.
5. Conclusions
This research presented a methodology that used compressed sensing techniques and contributed to the processes of measurement, transmission, processing, and storage of information, thanks to the high compression levels achieved. The algorithms proposed in this research compressed electrical power quality signals with ratios of up to 2216:1, although at that level the quality indicators were poor. To improve the indicators, we recommend using a minimum value of k = 3; even so, all the results achieved by other researchers up to the year 2022 were surpassed.
To reconstruct a signal that contained fifty-two samples per cycle, a minimum of one atom was required. The atom was the most representative value of the signal and was obtained by applying the Compressive Sampling Matching Pursuit technique. Each signal contained different values and positions of the atoms. Atom positions depend on changes that might happen in the signal under analysis.
Finally, the compression level is inversely proportional to the RTE, NMSE, and COR signal quality indicators. As can be seen in the results, at the highest compression, with k = 1 and a compression ratio of 2216:1, the correlation was 70.00%; on the other hand, with k = 3 and a lower compression ratio of 1024:1, the correlation was 99.07%.
As future work, it is proposed to use the algorithms presented in this investigation with windowing, combining lossy and lossless compression to further improve the RTE, NMSE, and COR compression indicators.