1. Introduction
Multiple-input multiple-output (MIMO) systems achieve high data rate transmission and substantial gains in channel capacity over a rich-scattering environment [
1,
2], which makes very high spectral efficiency available for the emerging wireless standard IEEE 802.11n/ac [
3], etc. Moreover, if a powerful channel code, such as turbo codes [
4], can be serially concatenated with this architecture, a kind of MIMO system, i.e., the turbo space-time system, is obtained in order to achieve near-capacity performance [
5,
6].
Endowed with the turbo learning principle, iterative detection (inner decoder) and decoder (outer decoder) receivers for the turbo space-time system [
5], detecting and decoding, by exchanging soft bits of information mutually, can approximately approach optimal performance with reduced complexity [
5,
6].
Although the maximum a posteriori probability (MAP) MIMO detector could attain superior performance, it has a complexity that grows exponentially with the product of the number of transmit antennas and the modulation order, which, in reality, precludes it from implementation.
Therefore, three major types of sub-optimal soft-input soft-output (SISO) MIMO detectors, i.e., the SISO sphere decoder (SD) [
6,
7,
8], SISO interference cancellation (IC) detectors [
9,
10,
11], and SISO decision-feedback (DF) detectors [
12,
13], have attracted the most attention. Among them is SISO list sphere detection with a probabilistic radius tightening [
7] and SISO sphere detection with perfect node enumeration [
8], which, taking into account the characteristic of the extrinsic information estimates, can achieve the upper bound of the diversity [
14,
15]. However, the smaller the radius, the more computationally efficient the method of the SISO SDs would be. Nevertheless, only a sphere of relatively large radius and list size would ensure SISO SDs to provide near-optimal detection, which results in an unobviously-reduced complexity comparable to that of the MAP detection algorithm.
With recent advances in IC and DF, more and more novel achievements, such as SISO minimum-mean-square error (MMSE)-based successive interference cancelation receivers with a novel strategy [
9]; an area-efficient MMSE with parallel interference cancellation (PIC) detector [
10]; the MMSE-PIC-based SISO detector with Neumann series expansion to reduce the total complexity [
11]; a novel low-complexity decision-feedback detection algorithm with constellation constraints [
12]; a soft input, soft output, and soft feedback (SIOF) detector [
13]; etc., in this area have been applied to modify the strategy for mitigating error propagation (EP) for this kind of suboptimum detector.
Previously, it was shown that the QR decomposition [
16] of the channel matrix led to a simple successive detection algorithm with reduced complexity. In this way, we extend this idea to a SIOF detector and propose an iterative QR-based soft feedback segment interference cancellation (QRSFSIC) detection and decoder algorithm.
Furthermore, the originally-proposed turbo codes are generated by two parallel recursive systematic convolutional encoders separated by an interleaver and using an iterative SISO decoder [
4]. Nevertheless, using Reed–Muller (RM) [
4,
17], instead of convolutional codes as component codes, RM turbo codes, with a simple block interleaver, not only give very good performance for high code rates, but also avoid the problem of trellis termination. In this paper, it is serially concatenated with MIMO architecture to constitute a novel kind of MIMO system, named as the RM turbo space-time system.
The proposed algorithm is composed of an iterative QRSFSIC detector (inner decoder) and a trellis-based MAP RM turbo decoder (outer decoder), which mitigates the multi-stream interference and obtains the desired information bit by exchanging information between inner and outer decoders. It, based on QR, not only embeds SFSIC, but also merges RM-turbo codes with minimal trellis representation into the QRSFSIC detector and decoder. In the first place, it forms the sufficient statistics for the MMSE estimate according to QR decomposition-based soft feedback successive interference cancellation, which saves computation. Second, the signal that originated from the symbols of the reliable segment is iteratively cancelled with the QRSFSIC, which guarantees the optimal detection sorting in case of the risk of EP and, thus, leads the proposed scheme to approaching the upper bound of the diversity gain. Finally, RM-turbo codes with minimal trellis make it feasible for an iterative trellis-based MAP RM turbo decoder, resulting in the global optimum, corresponding to the transmitted data, with lower complexity.
Throughout this paper we adopt the following notational conventions. Boldface capitals and lower-case letters symbolize matrices and vectors, respectively. Furthermore, ( )T, ( )H, and ( )−1 represent the transpose, Hermitian transpose, and inverse, respectively. Finally, indices [m, l] are used to mean the lth bit position of the constellation symbol for the mth transmit antenna while and denote and , respectively, and the subscript k:l means that the range of index is from k to l.
2. Reed–Muller Space-Time Turbo System Model
Figure 1 presents the configuration of a Reed–Muller space-time turbo system with a receiver based on a QR-based SFSIC detection concatenated with an iterative trellis-based MAP RM turbo decoder. In this system, there are
Nt transmit antennas and
Nr receive antennas. Let
Mb be the number of symbols in the complex constellation
C. The user’s information bits
u are first encoded into the Reed–Muller turbo code stream by the outer encoder, which is then bit-interleaved using an offline-designed pseudo-random interleaver referred to as Π. Followed by de-multiplexing, the high-speed RM turbo code stream transfers into low-speed
Nt parallel sub-vectors. In the end, each sub-vector is independently mapped into the transmitted vector constellation symbol
x whose entries
are chosen from
C, where
is a symbol transmitted out of the
mth transmit antenna at the
ith interval. The received
Nr × 1 vector can be represented by:
where
H denotes the
Nr ×
Nt channel matrix, the elements of which are independent and identically distributed (i.i.d.) with the zero-mean unit-variance complex Gaussian distribution.
is a
Nr × 1 complex white Gaussian noise with zero mean and a covariance of
, where
I is an
Nr ×
Nr identity matrix.
In order to obtain the most likely information vector, we apply the statistical features among all components of the received signals. By doing this, the multi-stream interference from other transmit antennas is mitigated, and different types of detectors are also acquired.
The MAP algorithm selects the most likely encoded bit with respect to all combinations of transmitted signals by maximizing the log-likelihood ratio (LLR) [
4,
18] as given by:
where
is the set of all possible RM turbo-coded bit vectors, the
mth bit position of the constellation symbol for the
kth transmit antenna, which is fixed with
.
The Log MAP algorithm must calculate
, where
Γ is the code constraint length sequence corresponding to a sequence of modulated symbols to be transmitted during the
ith interval, the metric being expressed in Equation (2), and the vector that yields the largest metric being selected. It is a considerably complicated NP-complete (nondeterministic polynomial time-complete) problem when
NtΓ is large. Although the bit error rate (BER) of it is considerably low, its computational complexity is unacceptable in reality. To reduce the computational complexity, an alternative approach is adopted. By employing the sphere decoder technique, the list sphere decoder [
6,
7] and other improved search vector versions [
8] that are composed of the constellation points are limited to a sphere of radius C centered at the received signal. These approaches aim to implement suboptimum detection, with its complexity depending on the size of the candidate vector set to be searched for detections per information bit. However, it is still a rather complicated NP-complete problem when the size of the candidate vector set is large. Furthermore, the SIOF detectors are proposed to avoid EP and reduce the complexity. It is to further devised that the suboptimum decoders, which are based on the LLR function, have both lower computational complexities and fast convergence rates, that serve as the motivation to write this paper.
4. Simulation Results and Discussion
In this section, a simulation investigation which, in terms of a time-domain Monte Carlo method, was designed in MATLAB (Matrix Laboratory) R2010a under the Windows 7 Ultimate 64-bit operating system, using a computer with a 3.4-GHz Intel(R) Core(TM) i7-2600K CPU, is carried out to show the performance of the proposed scheme in this context. We use three iterative detection schemes, i.e., an iterative QR-based soft feedback segment interference cancellation detector and decoder (IQRSFSICDD), an iterative soft input-soft output and soft feedback detector and decoder (ISIOFDD), and an iterative minimum mean square error parallel interference cancellation detector and decoder (IMMSEPICDD). The simulation results are described in two aspects, i.e., the performance of ISISOICSPDD over a number of iterations between the detector and the decoder loop (the major iteration) and the bit error rate (BER), which, counted by comparisons between the estimated bits and the transmitted bits, is measured as a function of signal-to-noise ratio SNR (signal-to-noise ratio), which is followed by a discussion of the diversity order of the proposed scheme.
Let us consider that, in a Rayleigh fading channel, there is an RM space-time turbo system, shared by
Nt transmitters and
Nr receivers. The length of the frame is 1000 symbols and the transmitted bits for each SNR is larger than 10/BER, which is consistent with the requirements of the Monte Carlo method. The transmitters employ the 4-ary quadrature amplitude modulation (4-QAM) with Gray mapping. An overall code rate R = 121/240, or R = 676/988 outer channel RM turbo code with the same RM (2, 4), or an RM (3, 5) [
17,
18] code in each dimension is used.
In
Figure 1, the pseudo-random interleaver (∏) is used not only to de-correlate the fading channel and maximize the diversity order of the system, but also to eliminate the correlation in the sequence of the RM turbo coded bits, which is crucial for the proposed algorithm. The number of iterations of the RM turbo decoder (outer iteration) is limited to 5.
Figure 3 depicts the BERs of the IQRSFSICDD versus SNR values over a range of iterations, with the transmitters using an R = 121/240 outer channel RM turbo code and the 4-QAM for
Nt =
Nr = 3, and an R = 676/988 outer channel RM turbo code accompanied by the 4-QAM for
Nt =
Nr = 4, respectively.
For the graphic clarification, not all of the simulation results are shown and the number of the iteration being one means that there is no soft information exchanging between the iterative IQRSFSIC detector and minimal trellis-based MAP RM turbo decoder. From
Figure 3, we can obtain the following observation:
Originally, in the relatively lower input SNR interval, the BER of the proposed algorithm is almost not favorable to the numbers of detector and decoder iterations being executed.
Gradually, with the increase in the input SNR, the BERs between successive iterations could be distinguished.
Then, at a certain input SNR or, more accurately, an input SNR threshold, the BER curves go into a waterfall region with their characteristic sharp drop, where the proposed algorithm would converge to zero BER within a finite number of iterations.
Figure 3 has also shown that though the greater number of iterations would lead to the lower bit error rate, the gradient of the BER with respect to the number of iterations for the proposed algorithm does not obviously decrease when the iteration size is increased from 4 to 5. Henceforth, we fix the iterations to 5 as a fundamental trade-off between the BER and computational complexity of the proposed algorithm in all our following simulations.
Figure 4 represents the BERs of three iterative detection schemes versus SNR, with the transmitters using the R = 121/240 outer channel RM turbo code and the 4-QAM for
Nt =
Nr = 3, and the R = 676/988 outer channel RM turbo code accompanied by the 4-QAM for
Nt =
Nr = 4, respectively.
The BER of the proposed scheme is not only a few orders of magnitude lower than those of the IMMSEPICDD, but also lower than that of the ISIOFDD. However, when the source employs the R = 121/240 outer channel RM turbo code, though the BER of the proposed scheme is somewhat lower than that of the ISIOFDD, the difference between them is not significant. This is because, in this circumstance, the information coded with the lower overall code rate, i.e., R = 121/240, the performances of these two detection schemes are almost characterized by the error protection (UEP) property of the shortened codes. However, when the overall code rate is higher and the number of transmit antennas is increased, the higher diversity order makes the proposed scheme achieve the best performance. These conclusions also coincide with the results provided by [
4,
17] for the RM turbo code since both of the proposed scheme and the ones in [
4,
17] are endowed with stronger component codes, i.e., the RM codes, as extended Hamming codes, to obtain additional coding gain though the scenario for them to operate are far from each other. Moreover, according to
where
Pe is the bit error rate [
20] and SNR is the signal-to-noise ratio, we can discover that the diversity gain of the proposed scheme approximates diversity order 9 and 16 [
20] as the SNR increases. Again, this demonstrates that the diversity order of the proposed algorithm is
Nt ×
Nr. These conclusions are also consistent with the results provided by [
14,
15] for the turbo MIMO systems. Its performance is superior to all other schemes considered in this context.