1. Introduction
In conventional wireless communication systems, beamforming techniques are optional as the signal to noise power ratio (SNR) of received signals is enough for nearly error-free communication with modern channel coding techniques. Furthermore, techniques to increase the usable frequency bandwidth and the number of independent streams were more efficient for the data rate than techniques for high received SNR. However, the resources of the frequency band have been exhausted with the massive growth of wireless devices such as smartphones, IoT machines, and the wireless infrastructure of cities. The mMTC deployment of 5G was considered for the case that massive wireless connections exist. Furthermore, measurements have shown that wireless channels cannot provide independent spatial paths proportional to the number of antennas [
1,
2,
3]. To accommodate the traffic of massive wireless devices, many wireless systems consider the use of millimeter-wave (mmWave) frequency bands over 30 GHz. The one feature of mmWave frequency bands is high path loss. Due to the high path loss, beamforming becomes essential to wireless systems that use the millimeter frequency bands.
For beamforming of mmWave frequency bands, massive MIMO systems have been studied [
4,
5]. In massive MIMO systems, the antenna number of a transmitter is larger than four times the total antenna number of receivers. The systems can provide nearly optimal SNR gain with zero-forcing (ZF) beamforming. However, the same number of RF chains with the antenna number is an infeasible constraint. To ease the constraint, hybrid beamforming where analog and digital beamforming are combined appeared [
4,
5,
6,
7,
8,
9]. Analog beamforming uses only phase shifters to make beamforming gain by combining the multiple same signals coherently at the desired direction. In hybrid beamforming, the gain of analog beamforming complements the loss of digital beamforming gain that is caused by reduced RF chains. As analog beamforming is conducted by phase shifters, the signal processing is modeled by a matrix that the elements are complex values of unit modulus. The constraint of the analog beamforming matrix makes the joint optimization of the beamforming matrices a non-convex problem. There have been many studies concerning the non-convex optimization in flat and selective fading channel [
5,
7,
8,
9]. Most of those studies have used alternate optimization assuming perfect channel state information at a transmitter (CSIT). However, in a real environment, it is difficult that a transmitter acquires the estimated channel matrices due to feedback overhead and almost wireless systems use a codebook of precoding matrices for feedback [
10,
11,
12,
13]. In the real systems, wireless transceivers that adopt hybrid beamforming use a protocol for beam management to determine analog beams [
14,
15,
16]. The protocols determine the best beam pair for each link among beams of a predefined codebook and the selected indices of the pair are delivered to the transmitter as feedback information. Although the protocols provide realistic ways for hybrid beamforming, the feedback of precoding matrices does not allow the joint optimization of the hybrid beamforming structure.
Another promising technique to accommodate the growth of wireless devices is multi-user MIMO (MU-MIMO) [
17,
18]. Wireless systems that use MU-MIMO precoding can transmit independent data streams to multiple receivers stably in the same time-frequency resource block. By MU-MIMO technique, wireless systems can achieve better throughput than that of single-user MIMO (SU-MIMO) technique. However, for stable transmission, suppression of interference between receivers is essential. If high-dimensional CSIT is satisfied, the inter-user interference (IUI) is suppressed efficiently by MU-MIMO precoding. The feedback of precoding matrices is inefficient for suppressing dynamic IUI. Imperfect precoding severely degrades the performance of MU-MIMO systems [
12,
13].
To realize beamforming and MU-MIMO systems sufficiently, the improvement of channel estimation and feedback techniques is important. For channel estimation of hybrid beamforming systems, many schemes have been proposed in frequency-division duplex (FDD) and time-division duplex (TDD) environments [
19,
20,
21,
22,
23,
24,
25]. In FDD systems, uplink and downlink channels use different bands. Therefore, for CSIT, the receiver must allocate some resources of an uplink channel to deliver channel information. The recent trend that increases the number of antennas has intensified the feedback overhead. In TDD systems, uplink and downlink channels use the same frequency band. By exploiting the reciprocity, CSIT for downlink can be achieved from pilot signals of the uplink channel. As CSIT can be achieved from the uplink channel, the overhead for CSIT is proportional to the number of receivers. However, channel estimation by using the reciprocity is not always possible due to various configurations of transmission and reception modes. Therefore, to cope with the complex configuration, TDD systems also need the feedback of channels for CSIT.
In massive MIMO systems, generally, the antenna number of a transmitter is much larger than the total antenna number of receivers. By the rate of the antenna numbers, the least-square (LS) method does not provide reasonable performance for channel estimation. In the mmWave frequency bands, the wireless channel shows sparsity in the angle domain as most signals of multiple paths are absorbed and removed easily by the surrounding environment [
1,
3]. In the case of MIMO channel, the sparsity is also observed since the received signals of multiple antennas propagate through similar paths [
3]. By exploiting the sparsity, compressed sensing (CS) can provide reasonable performance for channel estimation although the rate of antenna numbers is high [
19,
20,
21,
22,
26,
27,
28]. The schemes can be classified into three groups. The schemes of the first group have focused on the design of beam sweeping vectors to acquire qualified measurements for CS [
19,
26], and the schemes of the second group provided methods to reduce the computational load of the conventional OMP operation [
21,
22,
26,
28]. The schemes of the last group utilized structure of channel basis to improve the accuracy of channel estimation [
20,
27,
28]. The topics are important subjects of CS-based channel estimation. However, the schemes did not provide a practical feedback method of the estimated channel. Another category of CS-based channel estimation is hierarchical search (HS). HS schemes have focused on reduction of overhead for the measurements [
23,
24,
25]. To reduce the overhead, the proposed schemes in [
23,
24,
25] optimized search regions for the measurements by the feedback of search results. However, the schemes need multiple reports for channel estimation within a coherent time block. The closed-loop can be a high overhead for MU-MIMO systems due to protocol for multiple receivers and these schemes also did not provide a feedback method for the estimated channel.
Some estimation methods adopted machine learning (ML) schemes [
25,
29,
30,
31]. The ML-based methods also focused on estimation performance and shown better performance than that of the CS-based methods. Among the proposed systems, the authors of [
30,
31] considered feedback schemes of receivers. The feedback schemes designed pilots of a transmitter jointly with the structure of receivers. However, the joint designs only consider low-dimensional baseband channels.
2. Contributions and Notations
In this paper, a feedback scheme is proposed to reduce the feedback overhead of high-dimensional RF channels and improve the quality of the quantized estimated channel. The estimation and recovery of channels are performed by CS.
To reduce the feedback overhead, only sparse vectors are delivered for each subcarrier with a common sparse basis of all subcarriers. The matrix for the common sparse basis is delivered as indices of a pre-shared codebook. In this paper, the elements of the codebook are assumed as column vectors of the discrete Fourier transform (DFT) matrix. The sparse vectors are quantized by a low-dimensional codebook. The dimension of the codebook is only proportional to the estimated sparsity of the wireless channel. As the quantization is performed on only the sparse vectors, the reduction and quality improvement of feedback information is achieved.
The proposed scheme also uses the nonlinear codebook to improve the quality of CSIT. The sparse vectors of the common basis are calculated by the OMP method. After the calculation, elements of the sparse vectors are sorted by order of magnitudes. The nonlinear codebook is generated by considering the property of the sparse vectors. The quantized vectors by the nonlinear codebook can achieve lower error than that of a linear codebook with the same codebook size.
By the feedback of the proposed scheme, a transmitter can recover more accurate high-dimensional channels than recovered channel by linearly quantized pilots with the same codebook size. The recovered high-dimensional channels allow the transmitter to optimize jointly the analog and digital beamformer. Furthermore, a higher sum rate can be achieved than that of feedback using a codebook of precoding matrices.
This paper uses the lower-case letters for scalars, lower-case bold letters for vectors, and upper-case bold letters for matrices. Further, for a-by-b matrix of complex elements, || || for norm operation, and for diagonalization of a vector are used. For transpose and conjugate transpose of matrix , and are used. means the element of at the a-th low and the b-th column. means the b-th column vector of .
3. System Model
In this paper, a multi-user hybrid beamforming system is considered. In the system, it is assumed that beam sweeping is used to acquire CSI for beamforming.
Figure 1 shows the beam sweeping operation of the multi-user hybrid beamforming system. The system includes a transmitter and multiple receivers. The transmitter is consisted of digital beamformer following OFDM modulator and analog beamformer.
,
, and
are the number of independent data streams, RF chains, and antennas for transmission, respectively. In the structure,
is set to hold a condition that
. To consider MU-MIMO transmission, the same number of receivers are assumed with
. Each receiver consists of an analog combiner with
antennas following a RF chain. At the same time as the beam sweeping operation the transmitter sweeps channel with
beams. The receivers also sweep the channel with
beams. The measurements by the beam sweeping are as follows:
where
and
are vectors for the beam sweeping at the receivers and the transmitter.
is a channel matrix of frequency domain between the transmitter and the
u-th receiver.
k is an index of a OFDM subcarrier.
is a noise matrix at the
u-th receiver. The elements of
represent the measurements by the beam pairs. After the beam sweeping operation of the transmitter and the receivers, the best beam pairs of each receiver can be selected from the measurements and used for analog beamforming. Furthermore, the digital beamformer can use pre-shared precoding matrices based on baseband channels of the selected beam pairs.
For the channel model, a three-dimensional statistical channel model is considered. In the time domain, the channel matrix
of the model is calculated as follows:
where
and
are response matrices at the transmitter and the receiver sides.
is a diagonal matrix that reflects distortion of amplitude and phase for transmitted signals.
, and
means number of propagation paths in channel of the
u-th receiver and the number of scatterers in the
-th path.
and
are complex and real gains of the
-th scatterer in the
-th path.
and
are response vectors at the transmitter and the receiver sides.
and
are azimuth and elevation through the
-th scatterer of the
-th path, respectively. The response vector for azimuth
and elevation
is as follows:
where
and
are vectors for three-dimensional direction and position of the
p-th antenna element, respectively.
is wavelength of the transmitted signals. After demodulation, the channel matrix should be considered in frequency domain. The channel matrix of frequency domain is calculated from the matrix of time domain as follows,:
4. Hybrid Beamforming Based on CSI
In multi-user hybrid beamforming systems, a received signal of each receiver is expressed as follows:
where
is a signal for the
u-the receiver and
.
P and
are average power for transmission and a constant for normalization.
is also viewed as beamforming gain in massive MIMO systems.
is a analog combiner of the
u-th receiver for Rx beamforming.
and
are analog and digital Tx beamforming vectors for the
u-th receiver. For
, spectral efficiency is calculated using signal to interference plus noise ratio (SINR) as follows [
9]:
where
In (
16),
is the part that belongs to the desired signal and
is the part that includes interference and noise.
is baseband channel of the
u-th receiver. The transmitter should determine
and
for each receiver to maximize the sum rate as follows:
However, in most modern wireless systems, searching for the optimal matrices is impossible as the transmitter can not acquire or . In the systems, the transmitter only acquires and the quantized version of from feedback of the each receiver. The receivers acquire the information from received baseband pilots that are transmitted by beam sweeping operation. For multi-user transmission, the transmitter determines analog and digital beamforming matrices separately based on the feedback information.
If a transmitter can acquire or , the transmitter can determine the analog and digital beamforming vectors to maximize beamforming gain and suppress IUI of . The example is a conventional beam-steering method. The beam-steering method is a hybrid beamforming method that determines analog beamformer as the reported best beam and digital beamformer as the ZF matrix for .
5. Quantized Channel Feedback Based on Non-Linear Quantization of Sparse Vectors
Originally, CS was invented to recover an original sparse signal from small measurements of the signal. Typically, most of natural signals show high sparsity. Thus, CS can achieve high performance when compressing and recovering the signals. In the case of the mmWave channel, sparsity is observed in angle domain. After beam sweeping operation and OFDM demodulation, formulation of CS for compressive channel estimation is as follows:
,
, and
are the measurements by Tx beam sweeping, a sparse basis matrix, and a sparse vector.
means magnitude of noise. When Rx beam sweeping is performed,
is determined as follows:
where
is the selected vector among the column vectors of
.
K means the number of OFDM subcarriers.
of the receivers that do not support Rx beam sweeping is just a measurement vector by only Tx beam sweeping. For
and
, various matrices can be adopted [
19,
20,
21,
22]. In this paper, discrete Fourier transform (DFT) matrix is used for Tx and Rx beam sweeping. Typically, random matrices for measurements achieve high recovery performance at compressive sensing due to incoherence with the sparse basis. However, the random matrices spread power and significantly decrease SNR of the measurements. When DFT matrices are used for the measurements,
and
are calculated as follows:
where
In the matrices, and must be equal to and , respectively. In other words, the number of time slots for beam sweeping is proportional to the number of antennas. This can be a large overhead for initial access. Furthermore, DFT matrix can be used for only linear array antenna. However, the design of the sweeping matrices is not a focus of this paper and DFT matrices can provide qualified measurements for compressive channel estimation.
There are several methods to find the solution of (
20). The non-convex minimization with the zero-norm condition for the sparse vector provides the best solution but the complexity is impractically high. Among the methods, OMP is the simplest method. OMP is a greed algorithm that finds the best basis matrix and the correspond sparse coefficients iteratively. In this paper, OMP is used to find basis matrix and sparse vectors of channel. The formulation for OMP is as follows [
19]:
In (
27), the condition for the sparse vector is relaxed for lower complexity. In the OMP algorithm, the elements of the sparse vector are found by iterative search from the largest element. As the sparse vector has few non-zero elements, the sparse vector can be used for feedback information to reduce the feedback overhead.
5.1. Channel Feedback Using Common Sparse Basis
Measurements by
can be represented as follows:
where
and
are a basis matrix and a vector of coefficients for the basis. The first term of (
28) shows the similar structure with
. In other words,
and
can be mapped to
and
. Furthermore, note that
is independent of
k. Therefore, the channel coefficients of the every subcarrier can be recovered using the common
. The only feedback information of the each subcarrier is
except
.
In channel recovery with the common basis, the quality of the selected basis is a significant factor of recovery performance. Incorrect selection severely degrades the performance of channel recovery, and this is caused by noise. To search for the common basis efficiently, MMSE-OMP is used. As MMSE-OMP considers the effect of the noise, the quality of the recovered channel at a low SNR environment can be improved. In the process, the common basis is searched from a conjugated DFT matrix. Algorithm 1 shows the search process. In the algorithm,
,
, and
represent a constant for normalization, a matrix of the selected basis, and a estimated sparse vector of the
k-th subcarrier, respectively. As a result of Algorithm 1,
and
are found from the received measurements. As
is a submatrix of the DFT matrix for Tx beam sweeping and independent on
k, uplink resources for CSI feedback can be significantly saved. Furthermore,
can be quantized for the feedback by using a codebook that the dimension of the codebook is only dependent on the sparsity of wireless channel. Due to the sparsity of the mmWave wireless channels, the codebook can be searched more efficiently than a codebook for quantizing high-dimensional matrices.
Algorithm 1 Search of Common Sparse Basis and Corresponding Sparse Vectors |
fordo end for fordo end for
|
5.2. Nonlinear Codebook Generation
To recover the channel matrix at the transmitter, the matrix for the common basis and the sparse vectors must be delivered to the transmitter. In the case of the basis matrix, the selected columns by Algorithm 1 can be delivered in the form of indices. For the sparse vectors, quantization is an inevitable process. The sparse vectors of OMP show a property that the elements of the vector are sorted according to the magnitude. By considering the order of the magnitude, quantization can be performed more effectively than simple linear quantization. It is well known that the Grassmannian manifold provides an optimal codebook for complex unit vectors. However, it is difficult to use the Grassmannian manifold method for conditional vector space. In this paper, conditional RVQ is used for the quantization of conditional vector space and the nonlinear codebook is generated by the LBG algorithm in conditional vector space [
32].
Figure 2 shows an example of a codebook for conditional RVQ generated by the LBG algorithm.
In
Figure 2, green and blue dots mean unselected and selected vectors by a specific condition, respectively. The codebook is generated from the selected vectors. The red dots mean the vectors of the codebook. The generated codebook of
Figure 2 is calculated in the conditional vector space that the angle between the purple vector and the arbitrary green vector is lower than
.
Algorithm 2 details generation of a codebook for the sparse vectors. In Algorithm 2, the condition is as follows:
where the column vectors of
constitute space that sparse vectors of wireless channel can be observed. By the condition, the vectors of the space are filtered. Then, the filtered column vectors of
are grouped into
according to distances with the vectors of the codebook.
B means the size of the codebook. The vectors of the codebook are calculated by averaging the column vectors of the each group.
calculates an average vector of the groups. The calculation of the codebook is repeated until convergence is observed.
of Algorithm 2 is a small number close to zero. The output of Algorithm 2 is the codebook
and
is used to quantize the estimated sparse vector
as follows:
In (
32), only phase of
is quantized. The magnitude can be quantized more efficiently than the phase as the dimension is one. Therefore, this paper considers the case that magnitude and phase are quantized separately.
Algorithm 2 Codebook Generation for Conditional RVQ |
fordo if then end if end for whiledo for do for do end for end for for do end for end while
|
6. Simulation Results
This section shows simulation results for the NMSE of the recovered channels and a sum rate of multi-user downlink transmission. In the results, the proposed quantized channel feedback (QCF) scheme is compared with the linear quantized feedback scheme and the beam-steering method based on perfect CSIT. Parameters for simulation environment are presented in
Table 1. To reflect the sparsity of the angle domain, the number of multi-path and scatterers are set to small numbers. The sparsity of the angle domain is shown in an environment that the number of propagation paths is close to one [
3,
28]. Furthermore, azimuth spread of departure (ASD), elevation spread of departure (ESD), azimuth spread of arrival (ASA), and elevation spread of arrival (ESA) are set to 3 degrees for the sparsity. For NMSE, the normalized error between the perfect channels and recovered channels at a transmitter is calculated as follows:
where
and
are the perfect channel and the recovered channel at the transmitter. In the simulation for a sum rate, the number of receivers is set to four and the number of RF chains are set to the same number of receivers except for one case.
In
Figure 3, NMSE measurements according to codebook size are presented. QCF-L means a QCF scheme using the linear codebook that is generated by the LBG algorithm in complex unit vector space. For feedback, the two schemes use the same linear codebook. The results of
Figure 3 indicate that channel recovery using the common basis achieves the same NMSE with channel recovery using the selective basis. Therefore, there is no penalty to use the common sparse basis for basis of all OFDM subcarriers. When beamforming is used at the receivers, the same results are observed.
Figure 4 and
Figure 5 also show NMSE measurements according to codebook size. In
Figure 4 and
Figure 5, QCF schemes using the linear codebook and the proposed codebook are compared. QCF-NL means the QCF scheme using the proposed nonlinear codebook. In the figures, the schemes use the same common basis for channel recovery. Common observation of
Figure 4 and
Figure 5 is that the nonlinear codebook achieves better performance than that of the linear codebook with the same codebook size. Specifically, systems can reduce 1.5∼2 bits per subcarrier to achieve the same performance with the non-linear codebook. Furthermore, the performance gap of the two codebooks decreases with the increase of the codebook size. In the case of
, the gap is smaller than the gap of
.
Figure 6 and
Figure 7 show NMSE measurements according to SNR. For QCF schemes, a codebook of 6 bits is used. From the results, the nonlinear codebook shows better performance than the linear codebook and the performance gap is larger with the beamforming of the receivers. In the interval of 0 ∼ 5 dB SNR, the QCF schemes experience a lower effect of the noise than CS-OMP schemes. It seems that the basis selection of MMSE-OMP provides robustness to the noise even at the channel recovery of the transmitter.
Figure 8 and
Figure 9 show measurements of a sum rate according to SNR. For the measurements, the number of receivers is set to four and the same SNR is assumed for all receivers. The beam-steering method uses the same number of RF chains with the number of receivers and the beams that are directed to the true elevation of departure (EOD) and the true azimuth of departure (AOD) for each receiver. For digital beamforming, ZF beamformer is used in all measurements. Furthermore, a fully connected structure is used for all hybrid beamforming. For the hybrid beamforming of the QCF schemes, the method of [
9] is used to optimize jointly the analog and digital beamformers. As the proposed systems deliver compressed versions of the estimated channel matrices instead of indices for precoding matrices, a transmitter can adopt any beamforming schemes that use the channel matrix. In the results, the nonlinear codebook achieves a better sum rate than that of the linear codebook. The gap of the performance increases with SNR and is constant in the interval that the additional improvement for MNSE is not observed. The similar results are shown in the case of
. However, the slightly larger gap is observed at a low SNR environment than that of the case of
.
Figure 10 shows measurements of the case that the number of RF chains is twice the number of the receivers. For the beam-steering method, the same number of RF chains is used with the number of the receivers as the joint optimization cannot be performed. When the number of RF chains is twice the number of receivers, hybrid beamforming can achieve optimal beamforming gain with the joint optimization [
8]. Although the QCF schemes use imperfect CSI, the schemes show a higher sum rate than that of the beam-steering method based on perfect CSIT. However, the improvement of the sum rate decreases drastically after the SNR of 10 dB. Furthermore, the improvement by the Rx beamforming decreases with the increase of SNR. It is can be thought that the sum rate is limited by imperfect suppression of the IUI. Therefore, the limited sum rate can be improved by the more accurate estimation of sparse vectors.