Figure 1.
Flowchart of mono encoder. (TF, Time-to-Frequency; PCA, Principle Component Analysis).
Figure 1.
Flowchart of mono encoder. (TF, Time-to-Frequency; PCA, Principle Component Analysis).
Figure 2.
Flowchart of mono decoder. (iPCA, inverse Principle Component Analysis; FT, Frequency-to-Time).
Figure 2.
Flowchart of mono decoder. (iPCA, inverse Principle Component Analysis; FT, Frequency-to-Time).
Figure 3.
Scheme of PCA-based coding method. (PCA, Principle Component Analysis).
Figure 3.
Scheme of PCA-based coding method. (PCA, Principle Component Analysis).
Figure 4.
Compression rate for different format of matrix.
Figure 4.
Compression rate for different format of matrix.
Figure 5.
Example for matrix construction (“value” means the value of cells in original matrix, “column” means the column of original matrix, and “row” means the row of original matrix).
Figure 5.
Example for matrix construction (“value” means the value of cells in original matrix, “column” means the column of original matrix, and “row” means the row of original matrix).
Figure 6.
Subjective test results for different number of frames.
Figure 6.
Subjective test results for different number of frames.
Figure 7.
Multi-frame in encoder. (PCA, Principle Component Analysis; Rd, reduced-dimensional).
Figure 7.
Multi-frame in encoder. (PCA, Principle Component Analysis; Rd, reduced-dimensional).
Figure 8.
Multi-frame in decoder. (iPCA, inverse Principle Component Analysis; Rd, reduced-dimensional).
Figure 8.
Multi-frame in decoder. (iPCA, inverse Principle Component Analysis; Rd, reduced-dimensional).
Figure 9.
Bits allocation for reduced-dimensional matrix (darker color means more bits needed).
Figure 9.
Bits allocation for reduced-dimensional matrix (darker color means more bits needed).
Figure 10.
An example rotation matrix (“value” means the average value of cells in rotation matrices, “column” means the column of rotation matrix, and “row” means the row of rotation matrix).
Figure 10.
An example rotation matrix (“value” means the average value of cells in rotation matrices, “column” means the column of rotation matrix, and “row” means the row of rotation matrix).
Figure 11.
Bit allocation for rotation matrix (darker color means more bits needed; white color means no bits).
Figure 11.
Bit allocation for rotation matrix (darker color means more bits needed; white color means no bits).
Figure 12.
Flowchart of stereo encoder. (MDCT, Modified Discrete Cosine Transform; PCA, Principle Component Analysis; IC, Interaural Coherence; ILD, Interaural Level Difference).
Figure 12.
Flowchart of stereo encoder. (MDCT, Modified Discrete Cosine Transform; PCA, Principle Component Analysis; IC, Interaural Coherence; ILD, Interaural Level Difference).
Figure 13.
Flowchart of stereo decoder. (MDCT, Modified Discrete Cosine Transform; iPCA, inverse Principle Component Analysis; IC, Interaural Coherence; ILD, Interaural Level Difference).
Figure 13.
Flowchart of stereo decoder. (MDCT, Modified Discrete Cosine Transform; iPCA, inverse Principle Component Analysis; IC, Interaural Coherence; ILD, Interaural Level Difference).
Figure 14.
Subjective MOS of high-relation stereo signal. (MOS, Mean Opinion Score).
Figure 14.
Subjective MOS of high-relation stereo signal. (MOS, Mean Opinion Score).
Figure 15.
Subjective MOS of low-relation stereo signal.
Figure 15.
Subjective MOS of low-relation stereo signal.
Figure 16.
Test results for spatial sense.
Figure 16.
Test results for spatial sense.
Figure 17.
Relationship between quality and bitrate.
Figure 17.
Relationship between quality and bitrate.
Figure 18.
Results of MUSHRA test. (PCA_2 represents the PCA-based codec signal that is transmitted over two channels separately (75 kbps), PCA_PS+ represents PCA-based codec signal with improved parametric stereo (55 kbps), PCA_PS represents PCA-based codec signal with traditional parametric stereo (56 kbps), G.719 represents G.719 codec signal with traditional parametric stereo (56 kbps), anchor represents anchor signal, HE_AACv2 represents HE-AACv2 signal (55 kbps), and reference represents hidden reference signal).
Figure 18.
Results of MUSHRA test. (PCA_2 represents the PCA-based codec signal that is transmitted over two channels separately (75 kbps), PCA_PS+ represents PCA-based codec signal with improved parametric stereo (55 kbps), PCA_PS represents PCA-based codec signal with traditional parametric stereo (56 kbps), G.719 represents G.719 codec signal with traditional parametric stereo (56 kbps), anchor represents anchor signal, HE_AACv2 represents HE-AACv2 signal (55 kbps), and reference represents hidden reference signal).
Figure 19.
MUSHRA score of per item test. (PCA_2 represents the PCA-based codec signal that is transmitted over two channels separately (75 kbps), PCA_PS+ represents PCA-based codec signal with improved parametric stereo (55 kbps), PCA_PS represents PCA-based codec signal with traditional parametric stereo (56 kbps), G.719 represents G.719 codec signal with traditional parametric stereo (56 kbps), anchor represents anchor signal; HE-AACv2 represents HE-AACv2 signal (55 kbps), and hidden reference material has been removed. 1–6 represents different test materials).
Figure 19.
MUSHRA score of per item test. (PCA_2 represents the PCA-based codec signal that is transmitted over two channels separately (75 kbps), PCA_PS+ represents PCA-based codec signal with improved parametric stereo (55 kbps), PCA_PS represents PCA-based codec signal with traditional parametric stereo (56 kbps), G.719 represents G.719 codec signal with traditional parametric stereo (56 kbps), anchor represents anchor signal; HE-AACv2 represents HE-AACv2 signal (55 kbps), and hidden reference material has been removed. 1–6 represents different test materials).
Figure 20.
The spectrogram of the signal restored by three components.
Figure 20.
The spectrogram of the signal restored by three components.
Figure 21.
The spectrogram of the signal restored by four components.
Figure 21.
The spectrogram of the signal restored by four components.
Figure 22.
The spectrogram of the signal restored by five components.
Figure 22.
The spectrogram of the signal restored by five components.
Table 1.
PCA ALGORITHM.
Algorithm: PCA (Principle Component Analysis) |
---|
(i) Obtain matrix by columns: |
(ii) Zero-mean columns in to get matrix X |
(iii) Calculate C = |
(iv) Calculate eigenvalues and eigenvectors of C |
(v) Use eigenvector to construct according to the eigenvalue |
(vi) Select the first k columns of to construct the rotation matrix |
(vii) Complete dimensionality reduction by |
Table 2.
Quantization bit for reduced-dimensional matrix.
Table 2.
Quantization bit for reduced-dimensional matrix.
Principal Component | Bits Needed (bit/per Point) |
---|
First principal component | 3 |
Second principal component | 3 |
Third principal component | 2.5 |
Fourth principal component | 1.5 |
Fifth principal component | 0.45 |
Sixth principal component | 0.45 |
Table 3.
Quantization bits for rotation matrix.
Table 3.
Quantization bits for rotation matrix.
Region | Bits Needed (bit/per Point) |
---|
The first region | 4 |
The second region | 3 |
The third region | 2 |
The fourth region | 2 |
The fifth region | 0.5 |
The sixth region | 0.5 |
The seventh region | 0 |
Table 4.
Subband division.
Table 4.
Subband division.
index | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
start | 0 | 100 | 200 | 300 | 400 | 510 | 630 | 760 |
index | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 |
start | 900 | 1040 | 1200 | 1380 | 1600 | 1860 | 2160 | 2560 |
index | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 |
start | 3040 | 3680 | 4400 | 5300 | 6400 | 7700 | 9500 | 12,000 |
index | 24 | 25 | | | | | | |
start | 15,500 | 19,880 | | | | | | |
Table 5.
Codebook for ILD. (ILD, Interaural Level Difference).
Table 5.
Codebook for ILD. (ILD, Interaural Level Difference).
index | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
ILD | −20 | −15 | −11 | −8 | −5 | −3 | −1 | 0 |
index | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 |
ILD | 1 | 3 | 5 | 8 | 11 | 13 | 15 | 20 |
Table 6.
Codebook for IC. (IC, Interaural Coherence).
Table 6.
Codebook for IC. (IC, Interaural Coherence).
index | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
IC | 1 | 0.94 | 0.84 | 0.6 | 0.36 | 0 | −0.56 | −1 |
Table 7.
Bitrate allocation in encoder.
Table 7.
Bitrate allocation in encoder.
Module | Bitrate |
---|
Reduced-dimensional matrix | 35 kbps |
Rotation matrix | 5 kbps |
First principal component | 10 kbps |
Spatial parameters and side information | 5 kbps |