1. Introduction
The capacity demand in fifth-generation (5G) wireless communication and beyond is facing major challenges, particularly in terms of “last mile” transmission. Line-of-sight (LOS) propagation paths are a bottleneck on the ground unless a significant number of base stations (BS) are deployed, while satellite systems have capacity limitations. To effectively provide high data rates, massive numbers of antennas, and consistent coverage, innovative technologies are required [
1,
2]. The high-altitude platform (HAP) is an emerging solution, operating at an amplitude of up to 20–40 km to provide communication facilities that can achieve the best features in both terrestrial and satellite systems. Relative to the terrestrial cellular base station (BS), the HAP has a wide coverage area and is more flexible and non-polluting. Furthermore, when compared with satellite communication technologies, it consumes far less energy and has low propagation, which means greater QoS for real-time users [
3]. Massive MIMO is a technology used on HAPs that can improve energy efficiency and data rates by deploying a massive number of antennas, allowing multiple users to be served with the same time and frequency resources [
4].
Massive MIMO technology allows high data rate transmissions by using the high number of antennas on the HAPs. Moreover, the enormous number of antennas in an array can provide sufficient gain by precoding, which can overcome the free space path loss [
5]. However, in conventional fully digital (CFD) MIMO systems, each antenna requires a dedicated energy-intensive RF chain, which consumes a significant amount of energy (approximately 250 mW per RF chain). The energy consumption of CFD systems is too high due to the massive number of RF chains [
6]. This allows researchers to consider hybrid (analog-to-digital) MIMO systems, which significantly reduce the number of RF chains by switching some operations to an analog domain [
7].
A fully digital precoding technique demands a precise radio frequency (RF) chain consisting of a digital-to-analog converter (DAC), mixer, filter, and power amplifier for each antenna. When a large number of transmitting antennas is employed in the massive MIMO scenario, this results in prohibitively high hardware costs as well as significant energy consumption [
8]. To solve this issue, recently, finite-resolution PS schemes have been developed. In this scheme, finite-resolution PSs are used directly instead of high-resolution phase shifters. This can lower the energy consumption of a PS network without loss of performance, but it still demands a large number of PSs, each of which consumes a significant amount of energy [
9].The other scheme is to use the switch network instead of the PS network [
10]. This can considerably minimize hardware costs and energy utilization, but it has a noticeable performance loss.
Few studies have specifically addressed the shortcomings of power consumption, SE, and achievable sum rates for HAP networks, in contrast to the extensive research that has been undertaken in the spectrum sharing and theoretical analysis of HAP [
11,
12,
13,
14].
Due to the enormous amount of power consumed by base stations (BSs), energy efficiency (EE) has been viewed as an essential criterion for the development of future communication networks, both from an economic and environmental point of view. In this connection, the authors in [
15] jointly optimized the transmission beamforming and covariance matrix in a MISO system in order to maximize the equivalent efficiency (EE). The authors of [
16,
17] addressed the EE maximization problem for terrestrial-aerial networks and proposed two array signal-processing approaches based on Dinkelbach’s transformation in order to achieve suboptimal solutions. However, fully connected HP requires an extremely large number of active antennas and RF chains, which causes the existing techniques, such as those in [
16,
18,
19], to have an extremely high level of hardware power consumption, which in turn results in a low level of system EE. In [
20], the authors established beamforming techniques aimed at the mmWave HAP system. These algorithms were built for planar arrays that had thousands of antenna elements. The work in [
21] presents HP for HAP massive MIMO systems in order to obtain the RF and the baseband precoder with limited RF chains. The authors applied duality to exploit the relation between the RF precoder and the statistical CSI, which is complicated to tackle. In [
22], the authors proposed two stages of outer precoding design with the assumption of a zero-precoding (ZF) inner pre-beamformer and iterative algorithm to achieve the sum rate result for HAP. The authors of [
23] used NOMA-based HAP communications and multiple antennas to satisfy the connectivity, dependability, and high data rate needs of 5G-and-beyond applications. Furthermore, a user selection and correlation-based user corresponding method for a NOMA-based multi-user HAPS system were proposed. In [
24,
25], the authors designed an outer beamformer to reduce the dimensional statistical eigen mode of the users and user grouping algorithm for HAP massive MIMO systems. The work in [
26,
27] proposed HAP to perform the beamforming technique. For this, the authors realized the interference alignment method for achieving the maximum sum rate of HAPs, respectively. In [
28], the authors realized a decreased channel state information (CSI) overhead in a sub-connected RF precoding scheme, but this is incompatible with HAP massive MIMO systems. In [
29], the initial RF precoder selection from a discrete codebook and the proposed algorithm iteration were utilized together to develop the RF precoder, avoiding an exhaustive search while maintaining a high level of complexity. The published literature did not focus on reducing the complexity of the architecture or energy consumption of the system, while in our proposed machine learning technique, we use limited RF chains and sub-connected HP with low-bit PSs for HAP massive MIMO systems, which reduces the power consumption and complexity and improves the overall sum rate, energy, and spectral efficiency (SE).
In the proposed work, we examined energy-efficient HP sub-connected one- and two-bit PSs for HAP massive MIMO systems, and our main contributions are the following:
We investigate the energy consumption analysis to unveil the fact that the HPs based on a sub-connected low-bit PS architecture is substantially lower than that consumed by HR-PS-based HP. Moreover, utilizing one-bit PSs results in a slight and constant array gain loss.
We propose 1- and 2-bit sub-connected PSs for an HAP massive MIMO system with significantly reduced hardware cost, complexity, and energy consumption in the Rician fading channel. The sub-connected HP design is expected to be easier to deploy and be more energy efficient. The sub-connected architecture is more practicable for antenna deployment due to its lower cost and lower hardware complexity.
We propose ACE-based optimization with low-bit PSs to optimize the achievable sum rate, EE, and SE. First, our scheme randomly generates several candidate elements according to the probability distribution (PD) in HP, and then it weights them according to their sum rates, thereby improving the HP elements’ PD by decreasing the cross-entropy. By repeating this process, we ultimately generate an HP with a sufficiently high probability close to the optimized level.
We further examine the simulation results to show the fact that the HPs based on a sub-connected low-bit PS architecture can achieve near-optimum EE and sum rates compared with the traditional schemes.
The rest of this paper is arranged accordingly.
Section 2 is about the energy consumption analysis of different hybrid precoding schemes.
Section 3 is a brief description of the channel and system model, presenting ideas and steps for algorithms, ACE-based HPs with M-bit PSs, and complexity analysis. The experiment and discussion are in
Section 4. The conclusions are in
Section 5.
Notations: Lowercase and boldface is used to denote the vector, and uppercase and boldface is used to denote the matrix, while letters without boldface denote scalars. denotes a transpose, denotes an inversion, denotes an absolute operator, denotes a conjugate transpose, denotes the Frobenius norm, denotes a pseudo inversion, ⊗ denotes the Kronecker product, and denotes an identity matrix.
2. Energy Consumption Analysis of Different Hybrid Precoding Schemes
Here, we discuss a fully connected, high-resolution phase shifter (e.g., 4 bits) hybrid precoding-based architecture in which every RF chain is connected to each antenna through the network. The high-resolution (HR) PS-based hybrid precoding energy consumption can be represented as follows:
In Equation (1),
P represents the total transmission power,
is the power consumed by the RF chain,
depicts the energy consumed by the PSs,
represents the baseband power consumption,
N shows the number of PSs, and
represents the number of RF chains. The energy consumed by the HR-PSs is (
= 40 mW) for the 4-bit phase shifters. The HR-PS-based HP architecture can attain the achievable sum-rate, but it requires a large number of PSs. As stated in [
30], the power consumption by HR-PSs is quite high. Thus, to overcome this problem, a switch-based HP was introduced in [
7], where instead of HR-PSs, switches were used, which could also decrease the hardware cost. Nevertheless, this switch-based solution cannot actually obtain the array gain of mmWave massive MIMO systems. We describe the system in [
7] as follows:
where
is the power consumed by the switches, which is much lower than the HR-PSs (e.g.,
= 5 mW) [
31]. However, this design cannot achieve the array gain since only the
antennas are active simultaneously, resulting in sum rate performance degradation.
In a sub-connected one-bit PS-based HP architecture, in which each RF chain is attached to the subantennas of an array with
, the power consumption is much lower than in a fully connected architecture [
7], and it provides the better trade-off between the two precoding schemes. The energy consumed by a sub-connected HP architecture can be express as
It is noted that energy consumed by an HP architecture sub-connected with one or two bits is low (
= 5 mW). Therefore, compared with HR-PSs, the sub-connected PS energy consumption is too low. Aside from that, the proposed sub-connected PS HP architecture is able to use all antennas to achieve the optimal array gain compared with SW-based HP [
32]. Compared with the fullly connected HP architecture, the sub connected HP architecture with one- or two-bit PSs has a significantly lower number of phase shifters from
to
, which will have more benefits and will excite the PSs. It can save energy and compensate the insertion loss of PSs. In addition, it also has less complexity due to its simplicity in its architecture and configurations.
3. Channel and System Model
We adopted a multi-user HAP massive MIMO system where an HAP was equipped with a uniform planner array (UPA) with C number of antennas in each column, while the number of antennas in each row was
, as shown in
Figure 1, and each RF chain was connected to a subset of the antenna serving K single-antenna users, as shown in
Figure 2. The received signal at the kth user can be expressed as follows:
where
denotes the transmission signal of a size
, while
is of a size
and
is of a size
for the analog and digital precoder, respectively. In addition,
represents the additive white Gaussian noise vector with a distribution
.
denotes the overall downlink channel matrix of a size
, where
is the channel vector for the kth user, which can be expressed as
where
represents the Rician factor,
is the large scaling factor for the kth user,
represents the distance between the HAP and the kth user, as shown in
Figure 1, and
is the carrier wavelength, while
and
are the line-of-sight (LOS) components and non-line-of-sight (NLOS) components, respectively, which can be expressed as [
33]
where
,
, the angles
and
are the angles of departure (AoDs), d is the distance between two adjacent antenna elements, and
is the correlation matrix of the user K, which can be represented as follows:
where
,
,
, in which
and
are the mean AoDs,
,
is the zero-order Bessel function of the first kind,
is the mean angle of departure (AoD) of user K, and
is the angular spread.
Furthermore, we need to explain the hardware constraints imposed by the proposed low-bit-based sub-connected HP, which differ from those imposed by traditional architectures. The first constraint is that the analog precoder
must be a block diagonal matrix rather than a full matrix:
where the analog weighting vector for the nth sub antenna array is
with a size
. These components have the same amplitude but different phases. All N nonzero elements
should belong to
Our main objective is to develop an efficient hybrid precoders and to maximize the sum rate R, which can be written as
s.t.
where
ℜ represents the set of all possible analog precoders that satisfy the constrains in Equations (9) and (10). The sum rate can be expressed as
where
represents the signal-to-interference-plus-noise ratio (SINR) for the kth user, which can be expressed as follows:
Here,
represents the noise power, and
represents the kth column of
. It is worth mentioning that the constraints in Equations (9) and (10) on the analog precoder
are non-convex. This makes Equation (
12) incredibly difficult to solve. The number of possible
values is finite, since all nonzero elements belong to the constraint in Equation (
10). In order to tackle this challenging problem, we propose a machine learning (ML) adaptive cross-entropy optimization (ACE) with a low-bit, PS-based HP scheme.
3.1. Proposed ACE-Based Hybrid Precoding
To overcome the non-convex problem mentioned in Equations (9) and (10), we first decouple the joint design of the analog and digital precoders. As all of the analog precoder’s N nonzero elements belong to Equation (
10), the number of possibilities
is finite. As a result, we can consider the problem in Equation (
11) to be a non-coherent combining problem. We can select an
candidate first and compute the optimal
with an efficient channel matrix
without non-convex constraints. After searching for all possible
values, we can find the optimum analog precoder
and digital precoder
. Unfortunately, such an exhaustive search involves filtering through all possible
combinations, requiring excessively high complexity as the number of antennas N required is typically higher in massive MIMO systems, which in this case is N = 64,
. To tackle this challenging problem, we proposed an ACE algorithm with a one- or two-bit PS-based HP scheme, which is the advanced version of the CE algorithm [
34]. In the CE algorithm, all elite contributions are treated as equal. Logically, when updating the PD, the elite contributions with better values should be more important. Therefore, if we can weight the elites as per their objective values, then we can expect better results. In ACE, multiple candidates’ HP schemes are randomly generated based on the PD of the element in the HP. Then, they are weighted according to their sum rates for the calculated candidate HP. To enhance the probability distribution (PD) of an element
, we reduce the CE between the two PDs.
We formulated the non-zero elements in as an vector at the beginning, which can be expressed as , and the probability function as the vector. The probability is denoted by , where is the nth element of . First, we initialized as after setting the initial values of the parameters, namely the parameter for the ACE-based HP with sub-connected low-bit PSs. The updated method has the steps presented below.
In step 3, we produce the Z candidate analog precoder
, which depends upon the probability distribution function
, and in step 4, we calculate the digital precoding
based on the efficient channel
. We employed a zero-forcing digital precoding scheme with low complexity and near-optimal performance in this paper, which can be expressed as
where
represents the power-normalized factor.
In step 5, we calculate the achievable sum rate by putting
and
in Equation (
12). The achievable sum rates will be sorted in descending order in step 6. In step 7, the elite
values can be obtained with the highest sum rate. These candidates are utilized to update the probability distribution
and minimize the CE, which is equivalent to [
34]
where
is the probability to output
as stated in Equation (16). All elites have the same effects, and this leads to decreased performance. In order to fix this problem, we propose adaptively weighting the elites depending upon their sum rates. Thus, an auxiliary parameter U, which denotes the average of all sum rates, is introduced.
The weight
of the elite
can be calculated in step 8 based on
, and Equation (
15) can be modified to become
where
is the probability to calculate the
. Notice that
, the nth element
of
, is the Bernoulli random variable, where
has a probability
, and similarly,
has a probability
. Thus, we have
By putting Equation (
19) into Equation (
18), the first-order derivatives based on
from Equation (
18) is given as follows:
For Equation (20), when set to zero,
is updated in step 9 as follows:
We can further simplify Equation (
21) to avoid the local optimum:
where
in iteration
is a smooth size for the step. We used steps 3–10 for the predefined iteration. Finally, the optimal analog
and digital precoding
are obtained.
3.2. ACE-Based Hybrid Precoding with M-Bit PS
We present a more general case (i.e., m-bit PSs) for the proposed ACE-based HP. We first generated the parametric sample distribution, which produced the applicant alternatives for the next iterations. The simple method of producing a random sample
is to draw independently from
. Here, each sample belongs to a discrete distribution
, and the mth quantized process of a set M is chosen to be
. The elite candidates should then change the potential to minimize the CE:
where
is given by
The indicator function is
if the statement is valid (i.e., = 1); otherwise,
= 0. In addition,
, in which
refers to the set containing all possible analog precoders which satisfy the given constraint in Equation (9). Here, it is given that a single PS can be allocated to only one quantized process. The sum of every constrained probability is equal to one, and we implement the Lagrange multiplier to fulfill the constraints:
When we take the first derivative of Equation (25) with respect to the probability
and then set the outcome to zero, we obtain the following result:
Finally, we add Equation (26) for
, and we modify the probability as given below:
3.3. Complexity Analysis
According to Algorithm 1, the difficulty of the proposed ACE-based HP sub-connected low-resolution bit PS scheme initiates steps 4, 5, and 9. As in step 4, the effective channel matrices
for each candidate solution
Z, as well as the related digital precoder
, must be calculated as in Equations (15) and (16). As a result, this component has a complexity of
. In step 5, the sum rate for each candidate solution is computed. Here, we employ the traditional ZF precoder, which reduces the SINR of each user for each candidate to
. According to Equation (
22), step 9 is the updating step of
, which composes the complexity of
. After I iterations, the overall computational cost of the proposed ACE-based HP sub-connected low-resolution bit system PS is
. Meanwhile,
I and
Z are not essentially so big. This enables the complexity of the ACE-based HP sub-connected low-resolution bit PS to be reasonable and comparable to some current methods.
Algorithm 1 ACE-based hybrid precoding with sub-connected phase shifters (PSs) |
Input: Rician Channel-Matrix H, Iterations I, Candidates Z, Elites , Smoothing Step size Output: Analog Precoder , Digital Precoder - 1.
initialization i = 0, - 2.
for - 3.
Generate Z candidates randomly as based on - 4.
Calculate Z digital precoder based on effective channel by (15) - 5.
Calculate the sum-rate by (12) - 6.
Sorting in descending order as - 7.
Select elite as - 8.
Calculate weight for each elite according to their sum-rate - 9.
Update the probability according to weight and - 10.
end for
|
4. Experiment and Discussion
In this section, we will discuss the effectiveness of the ACE-based optimization that was proposed with low-bit PSs in terms of the achievable sum rate, EE, and SE. The following are the primary simulation parameters. We used the assumption that the users were dispersed in a 20-km radius under the HAP circle, and the height of the HAP was 20 km. It operated at a frequency of 2.4 gigahertz. The number of transmitting antennas , the transmitting power P was set to 20 dB, and we set the bandwidth to 10 MHz. The Rician factor =10, and the transmitting antenna array was a UPA with antenna spacing . The number of RF chains was four. Furthermore, the number of candidate solutions, the number of elite candidates, and the number of iterations were set to Z = 200, = 40, and I = 20, respectively.
Figure 3 presents that the proposed ACE sub-connected low-bit PS-based HP obtained a higher sum rate than the technique that is currently being used for HAP massive MIMO systems. For the ACE-Algorithm 1, the values were the same as above. The zero-forcing (ZF) precoding system is designed for use with the fully connected PS architecture, whereas the hybrid precoding system for conventional analog selection (AS) is designed for the switch-based architecture. When compared with CE, the ACE-based HP method has only one more step, but this step has a minimally increased level of complexity. The proposed ACE algorithm is, therefore, more effective. In
Figure 3, we can see that the sum rate of the proposed ACE-based low-bit PSs was significantly higher than that of traditional AS-based HP and CE-based HP, and the difference between the ZF precoding and ACE-based HP remained constant. Nevertheless, the gap can be closed if the PSs continue to increase in terms of their resolution.
Figure 4 compares the sum rates that can be achieved by the proposed ACE-based HP with sub-connected low-bit PSs to the sum rates that can be achieved by the existing solutions for the HAP massive MIMO system. We used 8 for the number of RF chains and 8 for the number of users K, while the number of antennas C = A = 12. The remaining parameters are the same ones that were discussed above. It is evident from this that the proposed solution with sub-connected low-bit PSs was capable of achieving a significantly higher sum rate than the conventional HP, CEO-based HP with one-bit PSs, and fully connected ZF-based HP gap. In addition, a noticeable performance gap may be seen between the proposed solutions, which used 1-bit and 2-bit PSs, and the HR-based full-connected HP, which used 4-bit PSs. However, this gap could be narrowed by increasing the resolution of the PSs.
Figure 5 presents the energy efficiency attained by the proposed ACE-based HP with sub-connected PSs and some existing HP schemes mentioned above. The number of RF chains was equal to the user count K, and it varied from 1 to 32, while the other parameters were similar to those in
Figure 2. In accordance with [
35], the energy efficiency was calculated as the ratio of the sum rate to the total power consumption:
The practical values for power consumption are as follows. The energy consumed by the sub-connected architecture where
p = 0.03 W for the baseband power consumption is
= 0.2 W, the power consumed by the RF chain
= 0.3 W, the one-bit PS power consumption
= 0.005 W, and the typical valves of PS power consumption are 0.015 W, 0.045 W, 0.06 W, and 0.078 W for PSs of 3, 4, 5, and 6 bits, respectively [
36,
37]. Our proposed ACE-based HP scheme with sub-connected PSs attained a high value of energy efficiency when compared with other existing HPs, as mention above.
Figure 6 presents that the proposed ACE-based algorithm with low-bit PSs obtained improved SE in comparison with other sub-connected schemes, especially when the number of users was not extremely large. The number of users
K could range from 1 to 32, and the RF chains had the same configuration as the users. The bandwidth was fixed at 10 MHz, and the other parameters were comparable to those shown in
Figure 4. In addition, it can be seen that when
K equaled eight, the proposed ACE with two-bit PSs had a greater spectral efficiency than other sub-connected schemes, with the exception of the ZF precoding fully connected method. The reason for this was that when the number of users and RF chains increased, the number of PSs also increased, leading to a sharp rise in the total number of PSs in systems that were PS-based. In practice, the utilization of PSs with high resolutions that need a significant amount of energy is not appropriate. The gap between the proposed algorithm and the ZF-based HP is constant behavior. This can be reduced by increasing the resolution of the PSs.
Finally, the impact of imperfect CSI was noticed in the proposed ACE-based HP, where
represents the imperfect CSI as in [
38]:
where
∈
represents the CSI accuracy,
is the original channel matrix, and
is the error matrix whose elements are distributed according to an independent and identically distributed distribution
.
Figure 7 depicts the ACE-based HP sum rates for various CSI scenarios when
= 10 and
and
K = 4. We considered a number of flawed CSI cases with varying values in addition to a perfect CSI scenario. CSI accuracy is not a factor in the proposed ACE-based HP with one-bit PSs. The proposed scheme’s sum rate, which was reached with a value of
= 0.8, was quite close to the perfect CSI situation. Furthermore, even when the CSI accuracy was extremely low, as when
= 0.4, the ACE-based proposal could still achieve 80 percent of the sum rate.