1. Introduction
Two-thirds of the Earth’s surface corresponds to the sea, demanding connectivity in the new paradigm of the Fourth Industrial Revolution, namely, to interconnect Internet of Things (IoT) devices. Underwater acoustic (UWA) communications is the solution to solve this gap. However, due to the high level of multipath [
1], impulsive noise, and low carrier frequencies, the bit rates available in UWA communications are low [
2]. This limitation can be mitigated by employing techniques such as block transmission techniques, such as orthogonal frequency division multiplexing (OFDM) or single carrier with frequency-domain equalization (SC-FDE), as well as multiple input–multiple output (MIMO) systems, as adopted for the Fifth Generation of Cellular Communications (5G) [
3].
MIMO systems bring added value in terms of capacity and diversity gains. However, such gains are conditioned on the existence of spatially uncorrelated channels. Due to the lower carrier frequencies employed in UWA scenarios, such condition is rarely fulfilled [
4]. Since a certain level of channel correlation between signals of different antennas exists in real scenarios, such configuration is here considered.
The complexity of the UWA scenarios is defined by their non-homogeneity, being dependent on the following factors: pressure, temperature, and salinity. The propagation speed of the sound is affected by each of these factors [
5]. Several expressions have been developed empirically to calculate sound propagation speed in UWA scenarios. Wilson defined in [
5] the following expression widely utilized:
where
c stands for the propagation speed,
the temperature in degrees Celsius,
S the salinity in parts per thousands, and
the depth in meters.
Although there is complexity of propagation of sound in underwater scenarios, it is known that the increase of the number of antenna elements that comprise MIMO systems corresponds to a technique widely used to increase the capacity gains [
6,
7,
8]. However, this demands higher signal processing levels associated with linear receivers, such as the zero forcing (ZF). Therefore, a solution to simplify the complexity relies on the use of sub-optimal receivers, such as equal gain combining (EGC), maximum ratio combining (MRC) [
6,
7], and iterative block—decision feedback equalization (IB-DFE), being studied in this article.
Low-density parity-check (LDPC) has been adopted by 3GPP as an encoding scheme for the 5G standard [
9,
10]. Before, LDPC codes were adopted for the long-term evolution (LTE), as well as for the World Interoperability for Microwave Access (WiMAX) standard. Due to its outstanding performance, LDPC codes have been utilized in other scenarios, such as in UWA communications.
Impulsive noise is typically experienced in low frequencies, consisting of short-duration noisy pulses (from a few microseconds up to milliseconds) [
11,
12,
13]. This is caused by sea life activity in the underwater environment, which can be substantially above the additive white Gaussian noise (AWGN).
Previous works considered using the block transmission technique OFDM, in UWA communications, such as [
14]. The work published in [
11] focuses on impulsive noise superimposed on OFDM signals, including mitigation techniques. Furthermore, previous works have also considered the combination of OFDM, and MIMO systems applied to UWA communications [
15,
16]. Moreover, as compared to [
17], the research published in this article considers LDPC codes and impulsive noise, which is typical of UWA communications. Moreover, channel estimation, using training sequences, is also adopted to make the results of this complex system more realistic.
This article studies the performance of UWA communications, using MIMO communication systems, for different levels of channel correlations between adjacent antenna elements of the MIMO configurations, with different receiver types, using LDPC codes, and using SC-FDE transmission. Instead of studying individual schemes, this article studies such a complex system, also considering impulsive noise, LDPC codes, and channel estimation.
This article is organized as follows:
Section 2 describes the system and signal characterization,
Section 3 analyses the performance results, and
Section 4 concludes the article.
2. System and Signal Characterization
This article considers a multi-layer MIMO system, which requires a number of
R receiving antennas equal to or higher than
T transmitting antennas. It is considered that each transmitting antenna sends a different flow of symbols. On the other hand, the number of
R receiving antennas is responsible for providing diversity. SC-FDE signals are assumed [
18], associated with quadrature phase shift keying (QPSK) modulation.
As depicted in
Figure 1, the
nth transmitted block, of
N data symbols, sent by the
tth antenna is denoted as
, while the received block by the
rth antenna is denoted as
. The mapping between the time domain signal and the frequency-domain signal for the transmitted block is defined as
, i.e., by performing the discrete Fourier transform (DFT) of the time-domain block. Similar mapping is assumed for the received block as
.
After removing the cyclic prefix, and assuming a cyclic prefix longer than the overall channel impulse response of each channel, using the matrix-vector representation, the received frequency-domain signal comes:
where
stands for the frequency-domain transmitted data symbols, where
denotes the
channel matrix for the
kth subcarrier, with
th element
, and where
denotes the channel frequency response for the
kth subcarrier (assumed invariant during the transmission of a given block). Note that the mapping between the channel time and frequency domains is defined by
. Moreover,
is the frequency-domain block channel noise for that subcarrier.
2.1. System and Signal Model for the Receivers
A very efficient receiver commonly associated with SC-FDE schemes [
18] is the IB-DFE. Such an iterative receiver uses feedforward and feedback coefficients to process the signals in the frequency domain, reaching a performance typically much better than that of a non-iterative receiver. IB-DFE can be viewed as turbo equalization [
8,
19].
The ZF receiver tends to be complex because it requires the computation of the pseudo-inverse of the channel matrix, for each frequency component. This article avoids this complexity by implementing the m-MIMO using MRC and EGC receivers, simplifying its processing. Furthermore, these receivers are iterative, being, in this study, associated with SC-FDE transmissions.
Assuming a non-iterative receiver, the estimated frequency-domain data symbols
comes:
Depending on the algorithm,
can be computed as
for the ZF, as
for the MRC, and as
for the EGC [
20].
As described, the ZF receiver is a linear algorithm that applies the pseudo-inverse of the channel’s frequency response, for each frequency component of the channel. Therefore, the level of complexity and computation is very high, which also translates to high battery consumption. Moreover, the ZF is very efficient in removing the intersymbol interference but has the disadvantage of presenting noise enhancement when utilized in post-processing. Therefore, it tends to degrade the performance for average to high noise levels. On the other hand, the MRC and EGC tend to mitigate these limitations due to their simplicity but generate some residual interference generated in the decoding process, especially for moderate values of
. This can be mitigated by employing an iterative receiver that implements the following function [
20]:
where the interference cancelation matrix
can be computed as [
20]:
and where
is an
identity matrix and where the
th elements of the matrix
are defined as
for the MRC and
for the EGC (i.e., they have absolute value 1 and phase identical to the corresponding element of the matrix
).
2.2. Channel Estimation
This article assumes that training sequences (pilots) are utilized to perform the channel estimation. The channel frequency response is defined as [
21]:
where
denotes the training sequence transmitted by the
tth transmitting antenna
,
stands for the signal at the
rth receiving antenna
(
TS means training sequence), and
stands for the power (variance) of the training sequences. In this estimation, it is assumed that that the training sequences associated with different transmitting antennas are orthogonal, leading to
. This leads to
, where
stands for the channel estimation error, being Gaussian-distributed, with zero-mean, defined as
with
and with
, as defined in [
21].
In order to improve the channel estimates, the following enhancement can be employed [
22,
23]:
where
if the
nth time-domain sample is inside the cyclic prefix, and 0 otherwise.
2.3. Impulsive Noise
Impulsive noise [
11,
12,
13] is typically experienced in low frequencies, consisting of short-duration noisy pulses. In the underwater environment, this is caused by sea life activity, which can be as high as 40 dB above the AWGN.
Impulsive noise can be acceptable in analog communications, being, however, disruptive in digital communications, originating a burst of errors. Therefore, the AWGN model is not well suited to model an environment characterized by impulsive noise. The Markov chain is an important model to characterize the impulsive noise, which is closer to the real environment [
13]. The Markov chain is a model with memory because the next state depends on the actual state [
13]. The Markov chain has four different models, known as binary state model, Markov–Middleton model, partitioned Markov chain, and second-level Markov chain [
13]. As can be seen from
Figure 2, the binary state model has two states (0, 1), which best characterize the states of impulsive noise. The state
represents the channel free of impulsive noise (good channel), while the
represents the opposite (bad channel). The probability of transitions from
to is given by the values
[
13].
3. Simulation Results and Analysis
The performance of the UWA communication, in terms of bit error rate (BER), is evaluated making use of Monte Carlo simulations, associated with SC-FDE block transmission technique and m-MIMO. The error probability, using BER as performance index, was evaluated as a function of
, where
is the energy of the transmitted bits, and
is the one-sided power spectral density of the noise. A block length of
N = 256 QPSK symbols was assumed. LDPC codes of length 32,400 were adopted, with a code rate of ½. This corresponds to 64,800 coded bits, whose sequence is generated as defined in [
24]. A severe Rayleigh fading channel was considered with 20 uncorrelated equal power paths. This corresponds to a highly demanding channel, which can be viewed as a worst-case scenario for underwater propagation. The high number of multipaths and having all of them with the same average power makes this channel very destructive in terms of the creation of intersymbol interference, which can be viewed as the most disruptive cause that limits UWA communications. Channel estimation with training sequences is assumed. Multi-layer MIMO is assumed (spatial multiplexing). Except for the ZF, the other receivers are iterative, canceling the residual interference generated in the decoding algorithm. Nevertheless, while the ZF is much more computationally demanding, the other receivers are not. We considered four iterations of the iterative receivers, as the performance improvement was almost negligible beyond four iterations. As defined in
Section 2.3, the Markov chain, binary state model, was assumed to simulate the impulsive noise (in addition to other impairments, such as fading and AWGN noise), typically present in underwater scenarios. We considered
and
. Moreover, we have defined the following probabilities: Prob(Good->Bad) = 0.1; Prob(Good->Good) = 0.9; Prob(Bad->Bad) = 0.2; Prob(Bad->Good) = 0.8, while in [
11] a probability of
was assumed (i.e., Prob(Good->Bad). Note that Prob(Good->Bad) stands for the probability of transition from Good to Bad state. Impulsive noise [
11,
12] may vary from symbol to symbol. Moreover, we assumed a variance of the impulsive noise (when in a bad state) 20 dB higher than the variance of AWGN noise.
Table 1 presents a list of baseline simulations utilized in the different graphics of this section.
Figure 3 (baseline 1) shows the performance results for UWA communications with and without impulsive noise, with 4 × 32 MIMO, without LDPC codes, for four different receivers: the ZF, MRC, EGC, and IB-DFE. The matched filter bound (MFB) curve is a way to measure the channel modeled by the sum of delayed and independently Rayleigh-fading rays, which can be viewed as a lower bound. This graphic considers that the channel correlation between adjacent antenna elements of the MIMO system does not exist (correlation 0). Therefore, results with ideal channel estimation are considered in this graphic. As expected, the results obtained with impulsive noise are much worse than those only with AWGN noise for all the receivers. Nevertheless, it is known that the marine environment is characterized by the existence of impulsive noise generated by sea life activity, making this scenario more realistic.
When comparing the different receiver types, it is viewed that the MRC is the one that tends to achieve the best performance (while being less computational demanding than the ZF), while the EGC achieves the worst performance. The ZF and IB-DFE present the same results (these curves are superimposed, with and without impulsive noise). It is worth noting that such superposition of the IB-DFE over the ZF curves occurs in all graphics of this article. The Monte Carlo simulation considers the transmission of a high number sequence of bits, varying the noise and channel conditions in each sequence, and evaluating the number of corrupted bits. The BER is the quotient between the number of corrupted bits and the number of transmitted bits. The simulation time for this graphic was limited to approximately 6 h.
Figure 4 (baseline 2) shows the performance results for UWA communications with and without LDPC codes, with 4 × 32 MIMO, with impulsive noise, with ideal channel estimation, and without channel correlation between adjacent antenna elements. As expected, the use of LDPC codes corresponds to a performance improvement for all different receiver types. Furthermore, such performance improvement obtained with the LDPC codes is of the order of 5 dB. From these results, we can conclude that the LDPC codes, adopted for 5G communications, are also well suited for UWA communications. The simulation time using LDPC codes increases drastically, as compared to the uncoded scenario. This results from the fact that the required processing is higher and because the BER is lower. With a lower BER, the number of transmitted bits needs to be higher, such that the obtained BER can be viewed as an average value. The simulation time using LDPC codes was around 36 h.
Figure 5 (baseline 4) shows the performance results for UWA communications without channel correlation versus correlation 0.3 between adjacent antenna elements, for 4 × 32 MIMO, channel estimation, impulsive noise, and LDPC codes. As can be seen, having a channel correlation of 0.3 between antennas leads to a very low performance degradation relating to the configuration without correlation. Although of such low performance degradation, this configuration is more realistic because the UWA communications consider carrier frequencies of the order of 15 kHz, whose wavelength is too high. It is known that a minimum distance of around 3 to 4 wavelengths is required between adjacent antenna elements to assure uncorrelated signals, which is difficult to achieve in such a UWA scenario. As previously described, while the results without LDPC are of the order of 6 h, the simulation time using LDPC codes was around 32 h.
Figure 6 (baseline 5) shows the performance results for UWA communications with and without LDPC codes, with 4 × 32 MIMO, with impulsive noise, with ideal channel estimation, and with channel correlation 0.3 between adjacent antenna elements. Similar to the results without correlation (
Figure 4), the use of LDPC codes corresponds to a performance improvement for all different receiver types, of the order of 5 dB. Nevertheless, above 25 dB, the MRC with LDPC codes degrade relating to the MRC without LDPC codes. The simulation time with channel correlation is the same as those without correlation (it varies manly with and without LDPC codes).
Figure 7 (baseline 6) shows the performance results for UWA communications with channel correlation 0.3 versus 0.5 and 0.7 between adjacent antenna elements, for 4 × 32 MIMO, channel estimation (pilots), impulsive noise, and LDPC codes. As expected, increasing the level of channel correlation leads to a performance degradation. Moreover, the performance degradation observed when we switch from correlation 0.3 into 0.5 is moderate. Nevertheless, such performance degradation increases drastically when we switch from correlation 0.5 into 0.7, especially for the MRC and EGC. It is viewed that the MRC and EGC receivers degrade heavily when we increase the channel correlation. It is worth noting that the channel correlation is a result of a reduced separation between antenna elements. An uncorrelated channel requires a typical separation of 3 to 5 wavelengths. In UWA communications, where carrier frequencies of the order of 15 kHz are employed, such separation is difficult to achieve. Therefore, studying the performance with different values of channel correlation is important to evaluate the different receivers. Naturally, the channel correlation depends on the carrier frequency and distance, making it worth studying the performance as a function of different channel correlation values, rather than of the distance.
Figure 8 (baseline 7) shows the performance results for UWA communications with and without LDPC codes, with 4 × 32 MIMO, impulsive noise, ideal channel estimation, and channel correlation 0.5 between adjacent antenna elements. Similar to results without correlation (
Figure 4) and results with correlation 0.3 (
Figure 6), the use of LDPC codes corresponds to a performance improvement for all different receiver types, except above 25 dB and for the MRC and EGC receivers. The performance improvement obtained with LDPC codes is of the order of 5 dB.
Figure 9 (baseline 8) shows the performance results for UWA communications with 4 × 32 versus 4 × 256 MIMO, LDPC codes, impulsive noise, ideal channel estimation, and channel correlation 0.3 between adjacent antenna elements. Since MIMO multi-layer transmission was employed (spatial multiplexing), the number of transmitting antennas corresponds to the number of parallel flows of data, while the number of receiving antennas provides diversity. Therefore, 4 × 256 MIMO has a diversity eight times higher than 4 × 32. This translates into an improvement of performance for all receiver types. In the scenario of 4 × 32 MIMO, we observe that the MRC performance tends to degrade above 25 dB, while the ZF and IB-DFE performances are very close to the MFB. Nevertheless, with 4 × 256 MIMO, the MRC performs very close to the MFB and similar to the ZF and IB-DFE (these curves are almost superimposed). This occurs because the level of residual interference mitigated by the iterative receiver of the MRC is more accurately estimated and canceled due to the higher level of diversity provided by the 4 × 256 MIMO. The simulation time of 4 × 256 MIMO and LDPC codes is around 48 h, while 4 × 32 MIMO and LDPC codes it corresponds to around 36 h.
4. Conclusions
This article publishes the results of a study of LDPC-coded UWA communications, with m-MIMO and SC-FDE transmission technique, for four different receiver types: ZF, MRC, EGC, and IB-DFE. It was viewed that the IB-DFE tends to achieve the best overall performance, while the level of computational demand is highly reduced as compared to ZF, which also translates into less battery consumption. It was also viewed that a channel correlation of 0.3 or 0.5 between adjacent antenna elements could still be accepted because their performance degradation, as compared with the scenario without channel correlation, is moderate. Regardless of such performance degradation, this configuration is more realistic because the UWA communications consider carrier frequencies of the order of 15 kHz, whose wavelengths are too high. It is known that a minimum distance of around 3 to 4 wavelengths is required between adjacent antenna elements to ensure uncorrelated signals, which is difficult to achieve in such a UWA scenario. It was also viewed that increasing the correlation between adjacent antenna elements to 0.7 makes the system almost unacceptable, as it heavily degrades the performance.
It was shown that increasing the number of receiving antennas of the MIMO system leads to a more accurate estimation and cancelation of the residual interference of the iterative receivers (MRC and IB-DFE), improving their performances closer to the MFB. This occurs because this article considers spatial multiplexing MIMO (multi-layer transmission), where the number of receiving antennas corresponds to the diversity order.
Finally, it can be concluded that for the UWA environment, a system composed of LDPC codes associated with m-MIMO, using SC-FDE signals, and with the low complexity reached with the MRC receiver, makes such a composed system a good combination to achieve future evolutions of UWA communications, even in the presence of impulsive noise.