Next Article in Journal
A Three-Dimensional Method for Analysis of the Body Mode of Classical Guitars Using a Laser Displacement Sensor
Previous Article in Journal
BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improved Non-Negative Matrix Factorization-Based Noise Reduction of Leakage Acoustic Signals

1
State Key Laboratory of Silicate Materials for Architecture, Wuhan University of Technology, Wuhan 430070, China
2
School of Naval Architecture, Ocean and Energy Power Engineering, Wuhan University of Technology, Wuhan 430070, China
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(16), 5146; https://doi.org/10.3390/s24165146
Submission received: 17 June 2024 / Revised: 29 July 2024 / Accepted: 4 August 2024 / Published: 9 August 2024
(This article belongs to the Section Physical Sensors)

Abstract

:
The detection of gas leaks using acoustic signals is often compromised by environmental noise, which significantly impacts the accuracy of subsequent leak identification. Current noise reduction algorithms based on non-negative matrix factorization (NMF) typically utilize the Euclidean distance as their objective function, which can exacerbate noise anomalies. Moreover, these algorithms predominantly rely on simple techniques like Wiener filtering to estimate the amplitude spectrum of pure signals. This approach, however, falls short in accurately estimating the amplitude spectrum of non-stationary signals. Consequently, this paper proposes an improved non-negative matrix factorization (INMF) noise reduction algorithm that enhances the traditional NMF by refining both the objective function and the amplitude spectrum estimation process for reconstructed signals. The improved algorithm replaces the conventional Euclidean distance with the Kullback–Leibler (KL) divergence and incorporates noise and sparse constraint terms into the objective function to mitigate the adverse effects of signal amplification. Unlike traditional methods such as Wiener filtering, the proposed algorithm employs an adaptive Minimum Mean-Square Error-Log Spectral Amplitude (MMSE-LSA) method to estimate the amplitude spectrum of non-stationary signals adaptively across varying signal-to-noise ratios. Comparative experiments demonstrate that the INMF algorithm significantly outperforms existing methods in denoising leakage acoustic signals.

1. Introduction

Different gases stored and transported in pressure vessels or pipelines are integral to modern life and industry. These gases range from natural gas used for heating and cooking to industrial gases like oxygen and nitrogen essential for various manufacturing processes. As the equipment ages and third-party damage occurs, gas leaks have become increasingly prominent, posing risks of environmental pollution and threats to property and personal safety. A common culprit is sulfur hexafluoride (SF6), used extensively in electrical substation transformers due to its excellent insulating properties. However, leaks of SF6 compromise the safety performance of electrical equipment and contribute to air pollution. This potent greenhouse gas has a global warming potential significantly higher than CO2, making its release into the atmosphere particularly concerning. Thus, effective gas leak detection is crucial for mitigating such hazards.
Traditional gas leak detection methods are categorized into mechanism-driven and data-driven approaches. Mechanism-driven approaches involve constructing predictive models based on the gas’s physical properties, such as vacuum, laser imaging, and infrared detection methods [1,2,3]. These methods are often constrained by their low detection accuracy, high cost, and impracticality for real-time field testing. In contrast, data-driven methods significantly depend on the type of input signal—whether vibration, pressure, or acoustic signals—which greatly influences detection outcomes [4,5,6]. This dependency on signal type makes data-driven methods more adaptable to various environmental conditions and potentially more effective in complex scenarios. Notably, acoustic signal-based inspections offer non-contact measurement, crucial for ensuring staff health and safety during gas leak detection. This feature not only protects the personnel from potential hazards but also prevents any disturbance to the infrastructure during inspection. Acoustic signals also span a broad frequency range from infrasound to ultrasound, encapsulating extensive fault information. This wide range allows for the detection of minute anomalies in the system, thereby increasing the sensitivity of the detection process. However, collected acoustic signals frequently include background noise, complicating the classification and identification processes and thereby impacting leak detection accuracy. The presence of such noise can mask subtle leak signals, making early detection difficult and sometimes leading to false positives or negatives. Consequently, developing an effective noise reduction algorithm for leak sound is essential for enhancing detection reliability.
Acoustic signal noise reduction is a crucial challenge in the field of acoustics, which aims to separate clean target signals from noise-contaminated signals. Acoustic signal noise reduction methods can be divided into traditional noise reduction methods and machine learning-based noise reduction methods, reflecting the principle and development history of noise reduction. Traditional acoustic noise reduction methods are mainly based on digital signal processing techniques and include approaches such as spectral subtraction [7], the Wiener filtering method [8], the statistical modeling method [9], and the subspace method [10]. However, these methods often rely on certain assumptions or minimal use of a priori information of pure acoustic and noisy signals, which results in diminished effectiveness against non-stationary noise. Audio noise reduction can be considered a supervised learning problem, and an increasing number of experts and scholars are employing machine learning methods to enhance the effectiveness of audio noise reduction. Machine learning methods can be further divided into those based on traditional machine learning models [11] and those based on deep learning models [12]. Deep learning-based modeling methods require larger computational resources and sufficient acoustic signals to train an excellent noise reduction model. However, gas leakage represents a type of low-probability event, making it challenging to collect ample leakage data and noise data for training. Consequently, noise reduction methods based on traditional machine learning models, owing to their lesser requirements for computational resources and data volume, are more suitable for scenarios such as gas leak detection.
The non-negative matrix factorization (NMF)-based noise reduction algorithm is a commonly utilized machine learning noise reduction algorithm that acquires different signal bases of pure and noisy signals by training the noisy audio and pure audio separately and then using the different signal bases to separate the noisy signals. It efficiently utilizes the a priori information of the noise-containing signal to capture the amplitude spectrum of the pure signal and the noisy signal at the time of training. Subsequently, it can be weighted as a linear combination of basis vectors by modeling them as negative activation coefficients.
Since the NMF-based noise reduction algorithm requires pure signals and noisy signals as a priori knowledge for training, the algorithm performance will be dramatically degraded when the disparity between the training data and the noisy signals is substantial and the trained basis matrix cannot effectively represent the noisy signals. Consequently, Zhang Lijun et al. [13] proposed an RNMF algorithm by incorporating a noise constraint term into the objective function to address the random errors generated during the decomposition of non-negative matrices. He Wei et al. [14] introduced an SRNMF algorithm by adding a sparse regularization term to the coefficient matrix in the objective function to manage the sparsity. However, both approaches utilize the Euclidean distance as the objective function to measure the error before decomposition and after reconstruction. For the high-energy amplitude of the background noise signal, it is prone to cause the noise to be amplified and compromise the algorithm’s performance. Moreover, the prevailing NMF-based noise reduction algorithms primarily employ Wiener filtering and other basic estimations of the amplitude spectrum of the signal, leading to issues such as inaccurate estimation of the changing non-stationary noise and inability to adaptively estimate the amplitude spectrum.
To address the above problems, this paper proposes an improved non-negative matrix factorization (INMF) noise reduction algorithm. The proposed INMF-based noise reduction method takes Kullback–Leibler (KL) divergence as the objective function and introduces noise terms and sparse constraint terms in the objective function, which is superior to the ones with Euclidean distance. Moreover, an improved adaptive Minimum Mean-Square Error-Log Spectral Amplitude (MMSE-LSA) sub-algorithm is embedded to estimate the amplitude spectrum of the reconstructed signal, compared with the current NMF series noise reduction algorithm using simple estimation methods such as Wiener filtering. The experimental results show that the proposed INMF noise reduction algorithm has a better noise reduction effect on the leakage acoustic signal.

2. INMF-Based Noise Reduction Algorithm

2.1. INMF Algorithm

With the development of NMF, it has been extensively applied to and refined across various fields. Zhang Lijun et al. [13] and He Wei et al. [14] modified it by using Euclidean distance as the objective function; however, this method tends to exacerbate errors caused by anomalies. Grady et al. formulated the objective function under KL divergence dispersion by deriving the coefficient matrix H and the basis matrix W , assuming that the anomalous noise follows a Poisson distribution [15]. Nevertheless, when applying non-negative matrix decomposition to process acoustic signal data, the actual algorithm performance often deviates significantly from theoretical predictions due to the random nature of the errors that may arise. Consequently, the noise term E is introduced here to mitigate the impact of these issues, i.e., X W H + E . The interference of the coefficient matrix H and the basis matrix W by noise can be effectively reduced by the introduced noise term, and the objective function is expressed as Equation (1):
D K L X , X ^ , E = k = 1 K l = 1 L ( x e ) k , l l g ( x e ) k , l x ^ k , l ( x e ) k , l + x ^ k , l
where D K L ( X , X ^ , E ) is the objective function under KL divergence, X is the amplitude spectrum of the noisy signal, X ^ is the amplitude spectrum of the noisy air leakage signal after noise reduction, E is the introduced noise term, K is the frequency point of the noisy air leakage signal, L is the time frame of the noisy air leakage signal, x X , e E , and l g is a standard notation for the logarithm.
In order to enhance the sparsity of Equation (1), we consider adding a parametric constraint term to the noise term. Since the solution of L 0 parametrization is difficult, L 1 parametrization is added as the noise term. Therefore, the objective function of optimization is Equation (2).
m i n W , H , E D K L X , X ^ , E + λ E 1   s .   t .   W 0 , H 0
where E 1 = i = 1 K l = 1 L e k , l and λ is the trade-off coefficient that controls the weight of E sparsity and reconstruction error, which can control the weight of E .
Given the sparse nature of time and frequency domains in the acoustic signal, the sparsity factor is crucial in determining the sparsity of the coefficient matrix H , thereby enabling the basis matrix W to serve as a perfect basis [16]. Consequently, the sparsity constraint term for the coefficient matrix H is proposed to be included in Equation (2) to enhance control over the sparsity of H and the distortion of the audio signal, thus refining the objective function to become Equation (3).
k = 1 K l = 1 L ( x e ) k , l l g ( x e ) k , l x ^ k , l ( x e ) k , l + x ^ k , l + λ E 1 + γ H 1
where γ ≥ 0 is the sparsity factor, which can control the sparsity of the coefficient matrix.
When the threshold operator is introduced, the convex optimization problem at update time can be solved by fixing W and H to continuously update the noise term added by the optimization [17]. Letting it be unbounded by the noise outlier form can improve the robustness of the algorithm. The soft threshold function s o f t λ ( ) can be expressed as Equation (4).
s o f t λ ( x ) = x λ , x > λ x + λ , x < λ 0 , O t h e r  
where x R and λ > 0 are used as thresholds. Equation (4) can be applied to matrices and vectors.
Given that there is no unique optimal solution for the objective function, the values of W , H , and E are established as Equations (5)–(7) by normalizing W and H , maintaining constant objective function values, and then optimizing Equation (3) via the gradient descent method.
W W X E H T 1 K × L H T
H H W T X E W T 1 K × L + γ
E s o f t λ X W H
where denotes the multiplication in the decomposition matrix, T denotes the transposition process, and 1 K × L R K × L denotes the matrix with all element values of 1.

2.2. Improved Adaptive MMSE-LSA Algorithm

The MMSE-LSA algorithm is a method for joint estimation using a priori and a posteriori signal-to-noise ratios. In this paper, the value estimated for the acoustic signal differs because the acoustic signal is non-stationary. Benjebbour et al. determine the a priori S/N ratio by introducing a conditioning factor α , which defines a fixed range for α and secures a more accurate empirical value by conducting several experiments [18]. However, this experimental approach to ascertaining the conditioning factor is not ideally suited for situations with varying audio signal-to-noise ratios. In contrast, the adaptive MMSE-LSA algorithm modifies the optimal value of the adjustment factor α at different stages of non-stationary noise in response to the variation in non-stationary noise, thus achieving enhanced results.
The MMSE-LSA algorithm obtains the estimated value of the denoised audio by minimizing Equation (8). | X ( n , k ) | can be expressed as Equation (9), and the estimate of the pure signal can be expressed as Equation (10).
E { ( l n | X ( n , k ) | l n X ^ ( n , k ) ) 2 }
| X ( n , k ) | = G ( n , k ) · | Y ( n , k ) |
G ( n , k ) = ξ ( n , k ) ξ ( n , k ) + 1 e x p 1 2 k e t t d t
In Equation (9), G ( n , k ) is the estimated value of the air leakage signal after noise reduction. In Equation (10), ξ ( n , k ) is the a priori signal-to-noise ratio of the n th frame in the audio signal at the k th frequency point, and the signal-to-noise ratio is defined in Equation (11).
ξ ( n , k ) = E | X ( n , k ) | 2 λ d 2 ( n , k )
From Equation (10), it can be seen that the a priori signal-to-noise ratio must be found first in order to estimate the pure audio signal. Therefore, the a priori signal-to-noise ratio has a direct impact on the audio noise reduction effect. The estimation of the a priori signal-to-noise ratio can be expressed as Equation (12).
ξ ^ ( n , k ) = α | X ^ ( n 1 , k ) | 2 λ d 2 ( n 1 , k ) + ( 1 α ) P [ γ ( n , k ) 1 ]
In Equation (12), | X ^ ( n 1 , k ) | is the pure signal spectral amplitude estimated at the k th component of the n 1 frame, λ d 2 ( n 1 , k ) is the estimated noise signal amplitude spectrum of the n 1 frame at the k th component, and α is the weighting factor.
P [ x ] = x x 0 0 x < 0
Equation (13) represents the half-wave rectification function, which is estimated using the minimum mean-square error (MMSE) in order to make the difference between ξ ^ ( n , k ) and ξ ( n , k ) as small as possible.
G = E ( ξ ^ ( n , k ) ξ ( n , k ) ) 2 | ξ ^ ( n 1 , k ) |
Substituting Equation (12) into Equation (14), Equation (15) is obtained.
G = α 2 ( n , k ) ( ξ ^ ( n 1 , k ) ξ ( n 1 , k ) ) 2 + ( 1 α ( n , k ) ) 2 ( ξ ( n , k ) + 1 ) 2
Find the derivative of G and let the partial derivative of G be 0. At this point, the optimal solution of α is obtained as Equation (16).
α o p t = 1 1 + ξ ( n , k ) ξ ( n 1 , k ) ξ ( n , k ) + 1 2
In Equation (16), ξ ¯ ( n , k ) = P { ξ ( n , k ) 1 } is adopted, instead of the unknown ξ ( n , k ) , for E { ξ ¯ ( n , k ) } ξ ( n , k ) . The value of α ( n , k ) converges to 1 when the a posteriori signal-to-noise ratio changes, and ξ ^ ( n , k ) changes when the value of α ( n , k ) is small, thus achieving an adaptive effect.

2.3. Principle of INMF Noise Reduction Algorithm

The flowchart of the noise reduction is shown in Figure 1. The proposed INMF algorithm in this paper is applied to the noise reduction of acoustic signals in two main steps: one is the supervised learning training process, and the other is the audio noise reduction process, as shown in Figure 1.
In the training process, the information of the pure air leakage signal and the interference noise signal are each captured using the short-time Fourier transform (STFT). Subsequently, the amplitude spectra of the pure air leakage signal and the interference noise signal are used as the target matrices, with V L   ≥ 0 and V N   ≥ 0. Following this, the aforementioned INMF algorithm is employed to iterate through Equations (5)–(7), with the number of iterations established at 100 to decompose the amplitude spectrum of the air leakage signal into the W L dictionary matrix and the amplitude spectrum of the interference noise into the W N dictionary matrix. The dictionary matrix of the air leakage signal is depicted in Figure 2a, where the columns of the dictionary matrix are termed dictionary atoms, as illustrated in Figure 2b. These dictionary atoms are non-negative functions of the frequency and can be linearly combined with the corresponding coefficients in real time. The decomposed W L and W N are combined to form a joint dictionary matrix W = W L W N , utilizing the joint dictionary as a priori information for the training process.
In the audio noise reduction process, the amplitude spectrum of the signal containing the interference noise is first obtained using the short-time Fourier transform, and the joint dictionary matrix W obtained using this amplitude spectrum and the training process is used as the input parameter of Equation (6); then, W is kept constant and updated iteratively using Equations (6) and (7) until the objective function reaches convergence and terminates. At the end of the iteration, the amplitude spectra of the pure air leakage audio signal V ^ L and the interference noise signal V ^ N are estimated and can be derived as Equation (17).
X W H = W L W N H L H N T = V ^ L + V ^ N = W L H L + W N H N
At the end of the iterative update iteration, the improved adaptive MMSE-LSA algorithm described above is used as the gain function G ~ , i.e.,
G ~ = W L H L / W L H L + W N H N
The pure air leakage audio amplitude spectrum, denoted as X ~ L = G ~ * X , can be estimated in real time. This approach overcomes the limitations of traditional methods such as Wiener filtering, which cannot effectively estimate signals that change in real time. Finally, by utilizing the invariance of the noise-containing frequency phase, the noise-reduced time-domain air leakage signal is obtained through the inverse short-time Fourier transform (ISTFT).

3. Experiments and Analysis of Results

3.1. Experimental Parameter Setting

The Source-to-Distortion Ratio (SDR) quantifies the overall quality of both air leakage signal distortion and noise reduction, which is frequently used to evaluate the performance of different noise reduction algorithms. SDR is measured in dB, and a higher value of SDR indicates less residual noise and a more pronounced noise reduction effect [19]. The Perceptual Evaluation of Speech Quality (PESQ) is an evaluation metric widely used to assess the performance of noise reduction, and a higher PESQ value signifies enhanced signal quality after noise reduction [20].
In the experiments conducted at Wuhan University of Technology, the pure gas leakage audio was sourced from recordings made in a semi-anechoic chamber using simulated gas leakages, alongside typical disturbance noises from a field substation environment, including wind, rain, birdsong, and cicada chirping. The sampling frequency was set at 44.1 kHz. The complete leakage process was captured, yielding 40 segments of 15 s audio clips for both the gas leak and each type of noise used for training. The Wavfile and Soundfile libraries facilitated the mixing of pure air leakage data with noise data, synthesizing noisy air leakage data at signal-to-noise ratios of −10 dB, −5 dB, 0 dB, 5 dB, and 10 dB. The noise reduction algorithm employed a Hamming window function with a frame length of 512 points, a frame shift of 128 points, and a maximum of 100 iterations.

3.2. Experimental Results and Analysis

To ascertain the values of the INMF trade-off factor λ and the sparsity factor γ, experiments are conducted using noise-containing air leakage data. The sum of the SDR assesses the impact of λ and γ on noise reduction. Figure 3 illustrates how λ and γ influence the source distortion rate at an input signal-to-noise ratio of 0 dB.
As illustrated in Figure 3, the average SDR of the source distortion rate gradually increases before stabilizing under various background noise conditions with an increase in the trade-off coefficient λ . The value of λ significantly influences the performance of the improved non-negative matrix decomposition. After a comprehensive analysis, optimal parameters were identified at λ = 1 .6 and γ = 0.05. Consequently, to further evaluate the performance of the enhanced noise reduction algorithm, subsequent experiments utilized these parameters.
To evaluate the effectiveness of the INMF objective function and the adaptive LSA-MMSE estimation of the magnitude spectrum within the proposed INMF noise reduction algorithm, several comparisons were conducted. The INMF algorithm was utilized as the objective function, alongside comparisons with the traditional Wiener filter for amplitude spectrum estimation (termed Half INMF (HINMF)), the VMD algorithm (which is noted for its efficacy in denoising non-stationary noise [21]), the RNMF algorithm [13], and the SRNMF algorithm [14].
Table 1 and Table 2 display the comparison of SDR and PESQ values across four typical background noises at varying signal-to-noise ratios. The performance metrics derived from different noise reduction algorithms vary considerably. At high signal-to-noise ratios, VMD is more effective than RNMF in reducing wind noise but less effective compared to SRNMF, HINMF, and INMF, which offer deeper improvements. Specifically, VMD performs suboptimally against fluctuating non-stationary noises such as birdsong compared to the refined NMF-based algorithms, which excel by effectively leveraging training samples and enhancing noise reduction in non-stationary environments. Among the four NMF algorithms—RNMF, SRNMF, HINMF, and INMF—HINMF incorporates noise terms and sparse constraints in its objective function, significantly improving the preservation of acoustic characteristics and reducing performance degradation from anomalies. Conversely, INMF, which builds upon HINMF by adding adaptive MMSE-LSA for real-time amplitude spectrum estimation, further enhances algorithm robustness and noise reduction effectiveness. As signal-to-noise ratios increase, the relative advantage of INMF over other algorithms diminishes, yet it maintains a commendable noise reduction capability. Overall, INMF demonstrates superior noise reduction at low signal-to-noise ratios, with a reduced but still notable advantage at high ratios.
To visually illustrate the differential effects of each algorithm on noise reduction, a time–frequency diagram is employed to showcase the noise reduction performance of each algorithm at a signal-to-noise ratio of 0 dB and under typical non-stationary noise conditions such as birdsong. The results are shown in Figure 4.
An analysis of Figure 4 reveals that the target signal is a broadband signal contaminated by typical non-stationary noise, which is irregularly distributed across the broadband spectrum of the air leakage signal. A comparative analysis indicates that VMD and RNMF exhibit substantial residual noise, suggesting weaker denoising capabilities. Conversely, SRNMF, HINMF, and INMF demonstrate stronger removal of non-stationary noises like birdsong, with INMF showing the least residual noise and the most effective noise reduction.

4. Discussion

This study focuses on the development and evaluation of an improved non-negative matrix factorization (INMF) algorithm for noise reduction in leakage acoustic signals. While the proposed method demonstrates superior performance in reducing non-stationary noise, it has several limitations. Firstly, the algorithm requires a higher computational cost due to its complexity, which may not be suitable for real-time applications with limited processing power. Secondly, the algorithm’s effectiveness in various real-world scenarios needs further validation, as the current study primarily uses synthetic noise data for evaluation. Finally, the performance of the proposed method was evaluated using synthetic noise data, which may not fully represent the variability and complexity of real-world noise environments. Future work should include comprehensive testing with real-world data and a thorough analysis of computational costs.

5. Conclusions

An INMF-based noise reduction algorithm for leakage acoustic signals is proposed in this paper, which addresses the problems of the current NMF-based noise reduction algorithms and related variants. The algorithm enhances the objective function of traditional non-negative matrix decomposition by incorporating noise terms and sparse constraint terms, thus mitigating the amplification of noise anomalies during decomposition. Optimal trade-off coefficients and sparse factors are determined through experimentation. Additionally, an adaptive MMSE-LSA method is introduced for estimating the amplitude spectrum, which adjusts to variations in non-stationary noise across different signal-to-noise ratios, enabling adaptive estimation of the amplitude spectrum. Comparative experiments confirm that the INMF algorithm offers superior denoising performance across various environmental noises and signal-to-noise ratios.

Author Contributions

Y.Y., Y.H., Y.W. and Z.C. contributed to this study’s conception and design. The first draft of the manuscript was written by Y.H. and Y.Y.; Y.Y., Y.H., Y.W. and Z.C. commented on previous versions of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was financially supported by the Hubei Key Research and Development Program of China (2022BAA099).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Feng, T.; Chen, L.; Sun, W.; Zhao, L.; Dong, M.; Wang, X.; Guo, W.; Cheng, Y. Development of a new very low helium gas flow generation and measurement system. Vacuum 2022, 206, 22–35. [Google Scholar] [CrossRef]
  2. Wainner, R.T.; Aubut, N.F.; Laderer, M.C.; Frish, M.B. Scanning, standoff TDLAS leak imaging and quantification. In Next-Generation Spectroscopic Technologies X; SPIE: Bellingham, WA, USA, 2017; Volume 10210, pp. 29–39. [Google Scholar]
  3. Hagen, N. Survey of autonomous gas leak detection and quantification with snapshot infrared spectral imaging. J. Opt. 2020, 22, 23–34. [Google Scholar] [CrossRef]
  4. Stajanca, P.; Chruscicki, S.; Homann, T.; Seifert, S.; Schmidt, D.; Habib, A. Detection of Leak-Induced Pipeline Vibrations Using Fiber-Optic Distributed Acoustic Sensing. Sensors 2018, 18, 2841. [Google Scholar] [CrossRef] [PubMed]
  5. Li, H.; Li, H.; Pei, H.; Li, Z. Leakage detection of HVAC pipeline network based on pressure signal diagnosis. Build. Simul. 2019, 12, 617–628. [Google Scholar] [CrossRef]
  6. Yu, L.; Li, S.Z. Acoustic emission (AE) based small leak detection of galvanized steel pipe due to loosening of screw thread connection. Appl. Acoust. 2017, 120, 5–9. [Google Scholar] [CrossRef]
  7. Thiemann, J.; Kabal, P. Low distortion acoustic noise suppression using a perceptual model for speech signals. In Proceedings of the Speech Coding, 2002, IEEE Workshop Proceedings, Ibaraki, Japan, 9 October 2002; pp. 172–174. [Google Scholar]
  8. Chen, J.; Benesty, J.; Huang, Y.; Doclo, S. New insights into the noise reduction Wiener filter. IEEE Trans. Audio Speech Lang. Process. 2006, 14, 1218–1234. [Google Scholar] [CrossRef]
  9. Ephraim, Y.; Malah, D. Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator. IEEE Trans. Acoust. Speech Signal Process. 1984, 32, 1109–1121. [Google Scholar] [CrossRef]
  10. Li, H.; Chu, L.; Lu, J.; Liu, Q.; Li, F.; Zhang, K. SVD-VMD algorithm and its application in leak detection of natural gas pipeline. Pet. Sci. Technol. 2022, 41, 230–255. [Google Scholar] [CrossRef]
  11. Kim, G.; Lu, Y.; Hu, Y.; Loizou, P.C. An algorithm that improves speech intelligibility in noise for normal-hearing listeners. J. Acoust. Soc. Am. 2009, 126, 1486–1494. [Google Scholar] [CrossRef]
  12. Qian, K.; Zhang, Y.; Chang, S.; Yang, X.; Florêncio, D.; Hasegawa-Johnson, M. Speech Enhancement Using Bayesian Wavenet. In Proceedings of the Interspeech, Stockholm, Sweden, 20–24 August 2017. [Google Scholar]
  13. Zhang, L.; Chen, Z.; Zheng, M.; He, X. Robust non-negative matrix factorization. Front. Electr. Electron. Eng. China 2011, 6, 192–200. [Google Scholar] [CrossRef]
  14. He, W.; Zhang, H.; Zhang, L. Sparsity-regularized robust non-negative matrix factorization for hyperspectral unmixing. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2016, 9, 4267–4279. [Google Scholar] [CrossRef]
  15. O’Grady, P.D.; Pearlmutter, B.A. Discovering speech phones using convolutive non-negative matrix factorisation with a sparseness constraint. Neurocomputing 2008, 72, 88–101. [Google Scholar] [CrossRef]
  16. Chung, H.; Plourde, E.; Champagne, B. Regularized NMF-based speech enhancement with spectral components modeled by gaussian mixtures. In Proceedings of the 2014 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), Reims, France, 21–24 September 2014; pp. 1–6. [Google Scholar]
  17. Hale, E.T.; Yin, W.; Zhang, Y. Fixed-point continuation for l 1-minimization: Methodology and convergence. SIAM J. Optim. 2008, 19, 1107–1130. [Google Scholar] [CrossRef]
  18. Benjebbour, A.; Saito, Y.; Kishiyama, Y.; Li, A.; Harada, A.; Nakamura, T. Concept and practical considerations of non-orthogonal multiple access (NOMA) for future radio access. In Proceedings of the 2013 International Symposium on Intelligent Signal Processing and Communication Systems, Naha, Japan, 12–15 November 2013; pp. 770–774. [Google Scholar]
  19. Venkataramani, S.; Higa, R.; Smaragdis, P. Performance Based Cost Functions for End-to-End Speech Separation. In Proceedings of the 10th Asia-Pacific-Signal-and-Information-Processing-Association Annual Summit and Conference (APSIPA ASC), Honolulu, HI, USA, 12–15 November 2018. [Google Scholar]
  20. Al-Othmani, A.Z.; Manaf, A.A.; Zeki, A.M.; Almaatouk, Q.; Aborujilah, A.; Al-Rashdan, M.T. Correlation Between Speaker Gender and Perceptual Quality of Mobile Speech Signal. In Proceedings of the 14th International Conference on Ubiquitous Information Management and Communication (IMCOM), Sungkyunkwan Univ, Taichung, Taiwan, 3–5 January 2020. [Google Scholar]
  21. Fang, C.; Chen, Y.; Deng, X.; Lin, X.; Han, Y.; Zheng, J. Denoising method of machine tool vibration signal based on variational mode decomposition and Whale-Tabu optimization algorithm. Sci. Rep. 2023, 13, 1505. [Google Scholar]
Figure 1. Flowchart of the INMF-based noise reduction.
Figure 1. Flowchart of the INMF-based noise reduction.
Sensors 24 05146 g001
Figure 2. INMF learning dictionary for air leakage audio signal. (a) Dictionary matrix. (b) Dictionary atoms.
Figure 2. INMF learning dictionary for air leakage audio signal. (a) Dictionary matrix. (b) Dictionary atoms.
Sensors 24 05146 g002
Figure 3. Average SDR values of noise reduction under different noises when the input signal-to-noise ratio is 0 dB. (a) Wind sound signal. (b) Rain signal. (c) Birdsong signal. (d) Cicada sound signal.
Figure 3. Average SDR values of noise reduction under different noises when the input signal-to-noise ratio is 0 dB. (a) Wind sound signal. (b) Rain signal. (c) Birdsong signal. (d) Cicada sound signal.
Sensors 24 05146 g003aSensors 24 05146 g003b
Figure 4. Time and frequency diagram of noise reduction of each algorithm with bird noise at 0 dB input signal-to-noise ratio. (a) Noisy signal. (b) VMD. (c) RNMF. (d) SRNMF. (e) HINMF. (f) INMF. (g) Noiseless signal.
Figure 4. Time and frequency diagram of noise reduction of each algorithm with bird noise at 0 dB input signal-to-noise ratio. (a) Noisy signal. (b) VMD. (c) RNMF. (d) SRNMF. (e) HINMF. (f) INMF. (g) Noiseless signal.
Sensors 24 05146 g004
Table 1. Output SDR/dB under different noise environments.
Table 1. Output SDR/dB under different noise environments.
Input SNR/dBNoiseVMDRNMFSRNMFHINMFINMF
−10Wind−4.13−4.27−3.28−1.47−1.44
Rain−3.62−2.97−2.62−2.32−2.02
Bird−4.14−2.78−2.66−1.40−1.32
Cicada−4.55−4.25−3.90−2.33−1.88
−5Wind0.790.751.251.241.23
Rain−1.15−0.750.430.550.52
Bird−0.050.250.890.771.02
Cicada−1.77−0.960.370.350.40
0Wind5.865.526.876.856.83
Rain4.55.066.436.456.49
Bird3.575.496.036.276.82
Cicada3.574.896.106.316.5
5Wind8.348.249.459.049.23
Rain8.158.208.358.358.46
Bird8.429.309.779.829.95
Cicada7.128.08.248.388.43
10Wind13.3813.2014.4213.8714.02
Rain11.2410.3811.0112.0512.43
Bird11.2712.5213.8413.9714.25
Cicada10.2510.2312.0712.4712.85
Table 2. Output PESQ/dB under different noise environments.
Table 2. Output PESQ/dB under different noise environments.
Input SNR/dBNoiseVMDRNMFSRNMFHINMFINMF
−10Wind1.121.101.351.471.69
Rain1.071.091.211.391.45
Bird1.051.101.181.341.41
Cicada1.091.131.251.351.50
−5Wind1.431.411.631.791.77
Rain1.381.401.521.741.71
Bird1.341.391.481.651.72
Cicada1.371.411.541.670.40
0Wind1.731.691.972.092.08
Rain1.661.681.821.982.06
Bird1.651.721.771.992.04
Cicada1.681.731.641.692.05
5Wind2.022.372.632.732.96
Rain1.972.352.582.662.87
Bird1.952.282.552.602.88
Cicada1.962.302.542.632.85
10Wind2.232.582.822.913.08
Rain2.182.532.782.853.05
Bird2.152.492.752.843.01
Cicada2.202.522.742.863.02
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yu, Y.; Hu, Y.; Wang, Y.; Cai, Z. Improved Non-Negative Matrix Factorization-Based Noise Reduction of Leakage Acoustic Signals. Sensors 2024, 24, 5146. https://doi.org/10.3390/s24165146

AMA Style

Yu Y, Hu Y, Wang Y, Cai Z. Improved Non-Negative Matrix Factorization-Based Noise Reduction of Leakage Acoustic Signals. Sensors. 2024; 24(16):5146. https://doi.org/10.3390/s24165146

Chicago/Turabian Style

Yu, Yongsheng, Yongwen Hu, Yingming Wang, and Zhuoran Cai. 2024. "Improved Non-Negative Matrix Factorization-Based Noise Reduction of Leakage Acoustic Signals" Sensors 24, no. 16: 5146. https://doi.org/10.3390/s24165146

APA Style

Yu, Y., Hu, Y., Wang, Y., & Cai, Z. (2024). Improved Non-Negative Matrix Factorization-Based Noise Reduction of Leakage Acoustic Signals. Sensors, 24(16), 5146. https://doi.org/10.3390/s24165146

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop