Next Article in Journal
Log Likelihood Spectral Distance, Entropy Rate Power, and Mutual Information with Applications to Speech Coding
Next Article in Special Issue
Sum Capacity for Single-Cell Multi-User Systems with M-Ary Inputs
Previous Article in Journal
On Generalized Stam Inequalities and Fisher–Rényi Complexity Measures
Previous Article in Special Issue
Channel Coding and Source Coding With Increased Partial Side Information
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Capacity and the Optimal Sum-Rate of a Class of Dual-Band Interference Channels †

Department of Electrical and Computer Engineering, University of Waterloo, ON N2L 3G1, Canada
*
Author to whom correspondence should be addressed.
This paper is an extended version of our paper published in the 2016 IEEE International Symposium on Information Theory, Barcelona, Spain, 10–15 July 2016.
Entropy 2017, 19(9), 495; https://doi.org/10.3390/e19090495
Submission received: 29 June 2017 / Revised: 8 September 2017 / Accepted: 11 September 2017 / Published: 14 September 2017
(This article belongs to the Special Issue Multiuser Information Theory)

Abstract

:
We study a class of two-transmitter two-receiver dual-band Gaussian interference channels (GIC) which operates over the conventional microwave and the unconventional millimeter-wave (mm-wave) bands. This study is motivated by future 5G networks where additional spectrum in the mm-wave band complements transmission in the incumbent microwave band. The mm-wave band has a key modeling feature: due to severe path loss and relatively small wavelength, a transmitter must employ highly directional antenna arrays to reach its desired receiver. This feature causes the mm-wave channels to become highly directional, and thus can be used by a transmitter to transmit to its designated receiver or the other receiver. We consider two classes of such channels, where the underlying GIC in the microwave band has weak and strong interference, and obtain sufficient channel conditions under which the capacity is characterized. Moreover, we assess the impact of the additional mm-wave band spectrum on the performance, by characterizing the transmit power allocation for the direct and cross channels that maximizes the sum-rate of this dual-band channel. The solution reveals conditions under which different power allocations, such as allocating the power budget only to direct or only to cross channels, or sharing it among them, becomes optimal.

Graphical Abstract

1. Introduction

Current technology such as 4G (e.g., [1]) is rapidly becoming inadequate to support the exponential growth in wireless traffic [2]. Moreover, the potential for improvement of network throughput in 4G is limited due to the shortage of spectrum in the incumbent microwave band (i.e., carrier frequencies below 6 GHz). Thus, methods to tackle the ever growing amount of mobile traffic has become a key research area (see [3] and references therein). Several new technologies are being considered, among which employing additional spectrum in the 28–300 GHz frequency range, often referred to as the millimeter wave (mm-wave) band, seems to be a promising solution to the problem of spectrum scarcity [3,4]. Specifically, integrating additional spectrum from the mm-wave band to complement transmissions in the microwave band is poised to play a central role in the functioning of 5G networks.
Transmission in the mm-wave band is distinctly different from that in the microwave band. Due to higher operating frequencies, omnidirectional transmission in the mm-wave band is subject to much higher absorption and power loss [5] compared to that in the microwave band, and thus a transmitter needs to employ beamforming with highly directional antenna arrays to counter this loss and reach its receiver [2]. However, beamforming constrains most of the transmission energy to the line-of-sight (LOS) component and very few, if any, significant non-LOS components exist [6,7]. Thus, transmission in this band is highly directional and point-to-point. Such mm-wave channels can support high data rates due to their vast bandwidth, but nevertheless are prone to blockage and absorption [8] due to their point-to-point nature. In contrast, microwave links are much more reliable due to rich scattering and diffraction, but cannot support as high rates as the mm-wave links. Thus, in a dual-band setting, conventional traffic and control information can be reliably communicated over microwave links, and high data-rate traffic can be sent through mm-wave links [2,7,9,10,11,12,13,14,15].
Most studies on microwave and mm-wave dual-band transmission have focused on how to improve network layer performance metrics of cellular access or backhaul networks by using the high-bandwidth, highly directional mm-wave links [9,13]. For example, the authors in [7] posed the problem of optimal resource allocation in a cellular setting in the dual microwave and mm-wave band, and showed that certain network level performance parameters, e.g., the number of simultaneously supported users and the link connection probability, are vastly improved with their proposed solution. Similarly, the authors in [11] study a two-tier cellular network where the 60 GHz band is used to create point-to-point directional links, and the 70 GHz band is used to establish long range connections just as the microwave band is used in our work, and propose a hybrid scheme involving both bands that improves the network throughput. In a more related article [14], the authors characterize the benefits of beamforming over a point-to-point dual band multi-antenna channel, and study the performance of a hybrid adaptive queueing scheme over both bands that maximizes the delay-constrained throughput.
Recent works on joint transmission in the microwave band and mm-wave band [14,16,17,18,19,20] indicate that it is possible to isolate the transmissions in the microwave band from that in the mm-wave band, and communicate over the two bands simultaneously. For example, the authors in [14] designed a queue-based scheme that transmits from a single transmitter to a single receiver simultaneously over the 3 GHz microwave band and the 30 GHz mm-wave band using beamforming, and conducted successful practical experiments in this dual-band setup. In addition, Intel has recently announced the production of a dual-band modem that supports both sub-6 GHz and 28 GHz bands [16]. Moreover, recent works on resource allocation in the microwave and mm-wave dual-band setting in [19,20], and their variant in [21], show that simultaneous transmission in both microwave and mm-wave bands are indeed feasible, and are gaining acceptance as an architecture for cellular access in 5G.
We study here the performance of a two-transmitter two-receiver ( 2 × 2 ) dual-band interference channel from an information theoretic perspective, where a transmitter communicates to its respective receiver over the microwave band and the mm-wave band simultaneously. In the microwave band, each receiver observes the superposition of signals from both transmitters as in a conventional single band Gaussian interference channel (GIC) [22]. However, as the mm-wave channels are considered to be highly directional, a transmitter in this band is well-modeled as being able to transmit towards one intended receiver, while causing negligible to no interference to the other receiver [6]. This raises the question: to which receiver should a transmitter in the mm-wave band transmit? In this 2 × 2 GIC, the transmitters in the mm-wave band can transmit from: (a) the first transmitter ( T x 1 ) to the first receiver ( R x 1 ), and the second transmitter ( T x 2 ) to the second receiver ( R x 2 ); (b) T x 1 to R x 2 , and T x 2 to R x 1 ; (c) T x 1 and T x 2 to R x 2 ; or (d) T x 1 and T x 2 to R x 1 . We focus here on the first two cases where the transmitters in the mm-wave band either transmit (a) from T x 1 to R x 1 , and from T x 2 to R x 2 , i.e., in the direct channels; or (b) from T x 1 to R x 2 , and from T x 2 to R x 1 , i.e., in the cross channels; or (c) share the spectrum between the two modes. We denote the resulting channels by Direct-Link IC (DLIC), Cross-Link IC (CLIC) and Direct-and-Cross-Link IC (DCLIC), respectively, and study their capacity.
The capacity of the conventional single band GIC [22] has been characterized when it has strong [22] or very strong interference [23]. We know that, under strong interference, encoding the message at each transmitter with independent Gaussian distributed codewords, and decoding both the desired and the interfering messages at each receiver is the optimal strategy [22]. However, if the GIC has weak interference [24], the capacity and the optimal strategy is still unknown in general.
The 2 × 2 parallel Gaussian IC (PGIC) consists of several orthogonal parallel channels (sub-channels) such that a 2 × 2 conventional GIC operates in each sub-channel without interfering with that in other sub-channels [25]. The optimal strategies for the PGIC are not known in general; however, its capacity was characterized in [26] when the GIC in each sub-channel has strong interference. In a related study, the capacity of the ergodic fading GIC was characterized in [27] when each fading state has strong interference. In the dual-band GIC considered here, the number of channel uses in the microwave band and the mm-wave band may differ. We model this with a bandwidth mismatch factor (BMF) in the system model. Note that the dual-band GIC considered here is a special case of the ergodic fading GIC in [27] if one identifies each fading state as a different sub-channel. In the special case that the bandwidth mismatch factor between the microwave and mm-wave bands is 1, it is also a special case of the PGIC in [26].
Moreover, the studies in [26,27] show that if every sub-channel (or fading state) has strong interference, the capacity is achieved by encoding jointly over all sub-channels and decoding messages from both transmitters. Encoding independently over each sub-channel of a PGIC is suboptimal in general [28], except for the GIC in the very weak (noisy) interference regime [29]. In fact, joint encoding over all sub-channels generally achieves better rates as it can potentially offset the weak interference in one or more sub-channels if the other sub-channels have strong interference [28].
In this sequel, we study the capacity of the DCLIC and the CLIC. First, we present a useful result that decomposes the capacity of the DCLIC into that of the underlying CLIC and the set of direct channels. This result shows that the capacity of the DCLIC can be established if the capacity of a corresponding CLIC is known. Hence, we focus on the CLIC next. In particular, we consider two specific classes, the strong CLIC and the weak CLIC, where the underlying GIC in the microwave band has strong and weak interference, respectively, and characterize sufficient conditions on the channel parameters under which their capacity is established.
Resource allocation techniques that maximize the sum-rate or throughput of interference channels have been investigated throughly (e.g., see [25,30,31]) as they indicate how to allocate resources in practice. The DCLIC models a basic multiuser network over the microwave and mm-wave dual-band, whose performance is likely to be dominated by the mm-wave channels due to their large bandwidths [9]. Thus, it is useful to understand how to optimize the performance of the DCLIC over the parameters in the mm-wave band. Therefore, we study the power allocation in the direct and cross channels in the mm-wave band that maximizes the sum-rate of the DCLIC. We derive the optimal powers in closed form, and study how the channel parameters influence the optimal power allocation.
The contributions of this paper are summarized as follows:
  • We show that the capacity region of the DCLIC can be decomposed into the capacity region of the underlying CLIC and two non-interfering direct links in the mm-wave band. This illustrates that the cross channels are actively involved in characterizing the capacity of the CLIC, whereas the direct channels improve the rates of individual users.
  • We characterize the capacity of the strong CLIC, and observe that the strong interference condition in the microwave band is sufficient to characterize the capacity.
  • For the weak CLIC, we characterize sufficient channel conditions under which its capacity is established. This shows that even if the GIC in the microwave band has weak interference, adequately strong cross channels in the mm-wave band are sufficient to characterize the capacity.
  • We characterize the optimal power allocation in the direct and cross channels that maximizes the sum-rate of the DCLIC, and study channel conditions under which the optimal power allocation either assigns the entire power budget to a specific subset of channels, or shares the power budget among all channels. We establish a direct relation between the channel parameters and the optimal powers, from which we observe the following:
    -
    The optimal power allocation distributes the power budget among the direct and cross channels following two properties: a waterfilling-like property and a max-min property.
    -
    When the power budget is sufficiently small, the optimal allocation assigns power to either both direct channels, or both cross channels and at most one direct channel.
    -
    Due to the max-min property, the optimal allocation imposes a maximum limit on the cross-channel powers. When the power budget exceeds a certain threshold, the limit on the cross-channel powers are reached, and all additional increments to the power budget are then added only to the direct channels that do not have such limits.
    -
    If the underlying GIC in the microwave band has very strong interference, the optimal power allocation assigns the power budget entirely to the direct channels.
    -
    If the channel parameters satisfy one of the following criteria, then transmitting only in the direct channels is approximately optimal, in the sense that the difference between the sum-rates resulting from allocating to only direct channels and allocating optimally in all channels, is negligibly small: (a) the transmit powers in the underlying GIC in the microwave band is very small; or (b) the cross channel gains in the mm-wave band are very large.
The rest of the paper is organized as follows. We define the system model in Section 2. We present the decomposition result on the DCLIC and the capacity of the strong CLIC in Section 3. We present the capacity result on the weak CLIC in Section 4. In Section 5, we formulate the optimum sum-rate problem and discuss its solution, and we conclude in Section 6.
Notation: We denote sets in calligraphic (e.g., A k ), except the sets of reals, positive reals, and positive integers, which are denoted by R , R + and Z + , respectively. Vectors are in bold (e.g., x ), and x 0 denotes that each element of x is in R + , where 0 is the zero vector. We denote by E ( Y ) the statistical expectation of a random variable (RV) Y, and by X N ( μ , σ 2 ) an RV following the Gaussian distribution with mean μ and variance σ 2 . We also denote an n-length vector ( X 1 , X 2 , , X n ) by X n , the empty set by ∅, log 2 x by log x , and define C ( x ) : = 1 2 log ( 1 + x ) .

2. System Model

First, we define the discrete memoryless (DM) model of the DCLIC, and from that we define the Gaussian models. In the DCLIC, a bandwidth mismatch between the first (microwave) band and the second (mm-wave) band may exist. Thus, we assume that during n accesses of the first band, the second band is accessed n 1 ( n ) times as cross channels, and n 2 ( n ) times as direct channels. We model this by two bandwidth mismatch factors (BMF) α k : = lim n n k ( n ) / n , k = 1 , 2 . Thus, the total normalized channel uses in the second band is asymptotically α : = α 1 + α 2 , which is shared between the cross channels ( α 1 ) and direct channels ( α 2 ). For ease of exposition, we denote n k ( n ) by n k , k = 1 , 2 .
The 2 × 2 DM interference channel is defined by ( ( X k , Y k ) k = 1 2 , p ( y 1 , y 2 | x 1 , x 2 ) ) where X k and Y k are the discrete input and output alphabets of user k, k = 1 , 2 , and p ( y 1 , y 2 | x 1 , x 2 ) is the set of channel transition probabilities [32] (Chapter 6.1). We define the 2 × 2 DM DCLIC similarly by the tuple, ( ( X k , X ^ k , X ¯ k , Y k , Y ^ k , Y ¯ k ) k = 1 2 , p ) , where X k and Y k are the input and output alphabets of the interference channel in the first band, X ¯ k and Y ¯ k are the input and output alphabets for the T x k to R x k direct channels in the second band, k = 1 , 2 , and X ^ 1 and Y ^ 2 (respectively X ^ 2 and Y ^ 1 ) are the input and output alphabets for the T x 1 to R x 2 (resp. T x 2 to R x 1 ) cross channel in the second band. The joint probability mass function (pmf) of the DCLIC decomposes as
p ( x 1 n , x ^ 1 n 1 , x ¯ 1 n 2 ) p ( x 2 n , x ^ 2 n 1 , x ¯ 2 n 2 ) = 1 n p ( y 1 , y 2 | x 1 , x 2 ) = 1 n 1 p ( y ^ 1 | x ^ 2 ) p ( y ^ 2 | x ^ 1 ) = 1 n 2 p ( y ¯ 1 | x ¯ 1 ) p ( y ¯ 2 | x ¯ 2 ) .
We define a ( 2 n R 1 , 2 n R 2 , n , n 1 , n 2 ) code for the DCLIC to consist of (a) two uniformly distributed and independent message sets, M 1 : = { 1 , , 2 n R 1 } and M 2 : = { 1 , , 2 n R 2 } , respectively, for T x 1 and T x 2 ; (b) two encoding functions, ϕ 1 and ϕ 2 , respectively for T x 1 and T x 2 such that ϕ k : M k X k n × X ^ k n 1 × X ¯ k n 2 , k = 1 , 2 ; and (c) two decoding functions, ψ 1 and ψ 2 , respectively, for R x 1 and R x 2 such that ψ k : Y k n × Y ^ k n 1 × Y ¯ k n 2 M k , k = 1 , 2 .
We define the decoding probability of error at R x k by P e , k n : = Pr ψ k ( Y k n , Y ^ k n 1 , Y ¯ k n 2 ) M k , k = 1 , 2 . We say that ( R 1 , R 2 ) is an achievable rate pair of the DCLIC, if there exists a sequence of ( 2 n R 1 , 2 n R 2 , n , n 1 , n 2 ) codes such that n k α k n , and P e , k n 0 as n , n k for k = 1 , 2 . The capacity region of the DCLIC is defined as the closure of the set of all achievable rate tuples.
Following the above notations, the GIC in the first band of the Gaussian DCLIC is modeled as in [22]. The signals received at R x 1 and R x 2 are given by
Y 1 = X 1 + a 21 X 2 + Z 1 ,
Y 2 = X 2 + a 12 X 1 + Z 2 , = 1 , , n ,
where X k , Y k R , a k m R are coefficients of the channels from T x k to R x m , k m { 1 , 2 } ( a k k 2 are normalized to 1 as in [22]), and Z k N ( 0 , 1 ) are i.i.d. noise, k = 1 , 2 . In addition, the codewords now satisfy the average power constraint, 1 n = 1 n X k 2 Q k , k = 1 , 2 . The cross links in the second band of the DCLIC are point-to-point, and are modeled as
Y ^ 2 = c 12 X ^ 1 + Z ^ 2 ,
Y ^ 1 = c 21 X ^ 2 + Z ^ 1 , = 1 , , n 1 ,
where X ^ k , Y ^ k R , c k m R are coefficients of the channels from T x k to R x m , k m { 1 , 2 } , and Z ^ k N ( 0 , 1 ) are i.i.d. noise. The codewords satisfy the average power constraint, 1 n 1 = 1 n 1 X ^ k 2 P ^ k , k = 1 , 2 . The direct links in the second band are similarly modeled as
Y ¯ 1 = d 1 X ¯ 1 + Z ¯ 1 ,
Y ¯ 2 = d 2 X ¯ 2 + Z ¯ 2 , = 1 , , n 2 ,
where X ¯ k , Y ¯ k R , d k R are the direct channel coefficients, Z ¯ k N ( 0 , 1 ) are i.i.d. noise, k = 1 , 2 , and the codewords satisfy the average power constraint, 1 n 2 = 1 n 2 X ¯ k 2 P ¯ k , k = 1 , 2 , as well. We present the Gaussian DCLIC in Figure 1. We define a ( 2 n R 1 , 2 n R 2 , n , n 1 , n 2 ) code for the Gaussian DCLIC from that of the DM DCLIC, by choosing all input and output variables to be in R , and by imposing the average power constraints on the codewords, X k n , X ^ k n 1 and X ¯ k n 2 , k = 1 , 2 , defined above.
Next, the Gaussian CLIC is defined from the Gaussian DCLIC by imposing the restrictions, n 2 = 0 and X ¯ k = Y ¯ k = , k = 1 , 2 , such that the channel outputs in the CLIC are described by (1)–(4), respectively. A ( 2 n R 1 , 2 n R 2 , n , n 1 ) code for CLIC is defined from a ( 2 n R 1 , 2 n R 2 , n , n 1 , n 2 ) code of the Gaussian DCLIC with the above-mentioned restrictions.
For ease of exposition, we define two classes of the CLICs: we say that a CLIC is a strong CLIC or a weak CLIC, if the underlying GIC in the first band has strong interference (i.e., a 12 2 1 , a 21 2 1 ) or weak interference (i.e., a 12 2 < 1 , a 21 2 < 1 ) , respectively. Moreover, a symmetric CLIC is a CLIC where a 2 : = a 12 2 = a 21 2 , c 2 : = c 12 2 = c 21 2 , Q 1 = Q 2 , and P ^ 1 = P ^ 2 . In addition, a symmetric DCLIC is a DCLIC with an underlying symmetric CLIC, and d 2 = d 1 2 = d 2 2 , P ¯ 1 = P ¯ 2 .
Finally, the Gaussian model for the DLIC are defined from that of the DCLIC by imposing the restrictions, n 1 = 0 and X ^ k = Y ^ k = , k = 1 , 2 , and thus the channel outputs are described by (1), (2), (5) and (6), respectively. In the sequel, we focus on the Gaussian model of the DCLIC and the CLIC.

3. Decomposition Result on the Capacity of the DCLIC

Recall that, in the DCLIC, there are n channel uses in the first band, n 1 cross channel uses in the second band, and n 2 direct channels uses in the second band. We show below that the capacity of the DCLIC can be decomposed into the capacity of the underlying CLIC, complemented by the direct channels that are used to transmit individual user information to their respective receivers.
Theorem 1.
The capacity region of the DCLIC with BMFs α 1 and α 2 is given by the set of all nonnegative rate tuples ( R 1 , R 2 ) that satisfy the decomposition
R 1 = r 1 + α 2 C ( d 1 2 P ¯ 1 ) , R 2 = r 2 + α 2 C ( d 2 2 P ¯ 2 ) ,
where ( r 1 , r 2 ) is an achievable rate tuple in the underlying CLIC with BMF α 1 .
Proof. 
The proof is relegated to Appendix A. ☐
Therefore, an achievable rate pair in the DCLIC with BMFs α 1 and α 2 consists of a rate pair achievable in the CLIC with BMF α 1 , and the rate pair achieved in the direct channels, ( α 2 C ( d 1 2 P ¯ 1 ) , α 2 C ( d 2 2 P ¯ 2 ) ) . The capacity of the DCLIC can thus be characterized from that of the underlying CLIC. Hence, we focus on the CLIC next. In particular, we consider two specific classes of the CLIC, the strong CLIC and the weak CLIC. First, we present the capacity of the strong CLIC.
Lemma 1.
The capacity region of the strong CLIC with BMF α 1 is given by the set of all nonnegative rate tuples ( R 1 , R 2 ) that satisfy
R 1 C ( Q 1 ) ,
R 2 C ( Q 2 ) ,
R 1 + R 2 C ( Q 1 + a 21 2 Q 2 ) + α 1 C ( c 21 2 P ^ 2 ) ,
R 1 + R 2 C ( Q 2 + a 12 2 Q 1 ) + α 1 C ( c 12 2 P ^ 1 ) .
The proof of Lemma 1 follows from that of [27] in a straightforward manner. Hence, we omit the proof here, and discuss only the key idea. In the strong CLIC, the GIC in the first band has strong interference. Additionally, in the second band, the cross-channel gains are positive and the direct-channel gains are zero, which results in a GIC with strong interference. Hence, the strong CLIC is a parallel GIC with strong interference in both bands. The capacity is thus achieved by encoding jointly over both bands and decoding messages from both transmitters as in [27].
Moreover, the capacity of the DCLIC with strong underlying CLIC, which follows from Theorem 1 and Lemma 1, is characterized below.
Corollary 1.
The capacity region of the DCLIC with BMFs α 1 and α 2 that has a strong underlying CLIC is given by the set of all nonnegative rate tuples ( R 1 , R 2 ) that satisfy
R 1 C ( Q 1 ) + α 2 C ( d 1 2 P ¯ 1 ) ,
R 2 C ( Q 2 ) + α 2 C ( d 2 2 P ¯ 2 ) ,
R 1 + R 2 C ( Q 1 + a 21 2 Q 2 ) + α 1 C ( c 21 2 P ^ 2 ) + α 2 C ( d 1 2 P ¯ 1 ) + α 2 C ( d 2 2 P ¯ 2 ) ,
R 1 + R 2 C ( Q 2 + a 12 2 Q 1 ) + α 1 C ( c 12 2 P ^ 1 ) + α 2 C ( d 1 2 P ¯ 1 ) + α 2 C ( d 2 2 P ¯ 2 ) .
Furthermore, if the DLIC, where the second band is used as direct channels only, has strong underlying GIC, the capacity region follows from Corollary 1 with α 1 = 0 and α 2 = α .

4. Capacity of the Weak CLIC

Now, consider the weak CLIC where the underlying GIC has weak interference. Recall that in a conventional GIC with weak interference, decoding both messages is suboptimal in general. However, in the weak CLIC, if messages are encoded jointly over both bands and the cross channels in the second band are sufficiently strong, then decoding both messages is indeed optimal. We characterize the sufficient conditions below.
Lemma 2.
Provided the channel parameters of the CLIC with BMF α 1 satisfy a 12 2 < 1 , a 21 2 < 1 , as well as the conditions
( 1 + Q 1 ) ( 1 + Q 2 ) ( 1 + a 12 2 Q 1 ) ( 1 + c 12 2 P ^ 1 ) α 1 ,
( 1 + Q 1 ) ( 1 + Q 2 ) ( 1 + a 21 2 Q 2 ) ( 1 + c 21 2 P ^ 2 ) α 1 ,
the following inequalities hold
I ( X 1 n ; Y 1 n | X 2 n ) I ( X 1 n ; Y 2 n ) + n α 1 C ( c 12 2 P ^ 1 ) ,
I ( X 2 n ; Y 2 n | X 1 n ) I ( X 2 n ; Y 1 n ) + n α 1 C ( c 21 2 P ^ 2 ) ,
for all X 1 n and X 2 n with product distributions p ( x 1 n ) p ( x 2 n ) on R n × R n , n Z + , which satisfy the average power constraints, 1 n = 1 n X k 2 Q k , k = 1 , 2 .
Proof. 
We prove only (18) as (19) follows similarly. From the system model for the first band in (1) and (2), we have
I ( X 1 n ; Y 1 n | X 2 n ) I ( X 1 n ; Y 2 n ) = h ( X 1 n + Z 1 n ) h ( Z 1 n ) h ( X 2 n + a 12 X 1 n + Z 2 n ) + h ( X 2 n + Z 2 n ) ( a ) h ( X 2 n + Z 2 n ) h ( Z 2 n ) + h ( X 1 n + Z 1 n ) h ( a 12 X 1 n + Z 2 n ) = ( b ) I ( X 2 n ; X 2 n + Z 2 n ) { h ( X 1 n + Z 1 n + Z 3 n ) h ( X 1 n + Z 1 n ) } n log | a 12 | ( c ) n C ( Q 2 ) I ( Z 3 n ; Z 3 n + Z 1 n + X 1 n ) n log | a 12 | = ( d ) n C ( Q 2 ) = 1 n h ( Z 3 | Z 3 n ) h ( Z 3 | Z 3 n , Z 3 n + Z 1 n + X 1 n ) n log | a 12 | ( e ) n C ( Q 2 ) = 1 n h ( Z 3 ) h ( Z 3 | Z 3 + Z 1 + X 1 ) n log | a 12 | = ( f ) n C ( Q 2 ) = 1 n I ( Z 3 ; Z 3 + Z 1 + X 1 ) n log | a 12 | ( g ) n C ( Q 2 ) = 1 n I ( Z 3 ; Z 3 + Z 1 + X 1 * ) n log | a 12 | = ( h ) n C ( Q 2 ) + = 1 n 1 2 log Q 1 + 1 Q 1 + 1 / a 12 2 n log | a 12 | ( i ) n C ( Q 2 ) + n 2 log 1 n = 1 n Q 1 + 1 1 n = 1 n Q 1 + 1 / a 12 2 n log | a 12 | ( j ) n 2 log ( 1 + Q 1 ) ( 1 + Q 2 ) ( 1 + a 12 2 Q 1 ) ( k ) α 1 n 2 log ( 1 + c 12 2 P ^ 1 ) ,
where (a) follows from unconditioning, h ( X 2 n + a 12 X 1 n + Z 2 n ) h ( X 2 n + a 12 X 1 n + Z 2 n | X 2 n ) = h ( a 12 X 1 n + Z 2 n ) , and since h ( Z 1 n ) = h ( Z 2 n ) ; (b) follows since 1 / a 12 2 > 1 , and thus h ( a 12 X 1 n + Z 2 n ) = h ( a 12 X 1 n + Z 1 n ) = h ( X 1 n + Z 1 n / a 12 ) + n log | a 12 | = h ( X 1 n + Z 3 n + Z 2 n ) + n log | a 12 | , where Z 3 N ( 0 , 1 / a 12 2 1 ) , are i.i.d.; (c) follows since the first term in b is maximized by X 2 N ( 0 , Q 2 ) , i.i.d.; (d) follows from the chain rule; (e) follows since Z 3 are i.i.d., and since unconditioning does not reduce entropy; (f) follows by replacing the i.i.d. RVs, Z 3 and Z 1 , with Z 3 N ( 0 , 1 / a 12 2 1 ) and Z 1 N ( 0 , 1 ) , respectively; (g) follows by invoking the worst additive noise result of [33] to every mutual information term inside the summation of (f), where X 1 * N ( 0 , Q 1 ) , with Q 1 : = E ( X 1 2 ) , and 1 n = 1 n Q 1 Q 1 ; (h) follows from the differential entropy of Gaussian RVs; (i) follows since the log function is concave in { Q 1 } = 1 n ; (j) follows since 1 n = 1 n Q 1 Q 1 ; and (k) follows from condition (16). ☐
We characterize the capacity of the weak CLIC under the conditions of Lemma 2 below.
Theorem 2.
The capacity region of the weak CLIC with BMF α 1 that satisfies the conditions in (16) and (17) is given by the set of all nonnegative rate tuples ( R 1 , R 2 ) that satisfy
R 1 C ( Q 1 ) ,
R 2 C ( Q 2 ) ,
R 1 + R 2 C ( Q 1 + a 21 2 Q 2 ) + α C ( c 21 2 P ^ 2 ) ,
R 1 + R 2 C ( Q 2 + a 12 2 Q 1 ) + α C ( c 12 2 P ^ 1 ) .
Proof. 
The proof is relegated to Appendix B. ☐
The result shows that, in the weak CLIC, where sufficient interference forwarding is not possible through the underlying weak GIC, if the cross links in the second band are sufficiently strong, it is possible to forward enough interference by encoding jointly over both bands such that decoding both messages becomes optimal. Hence, these conditions can be classified as being able to push the receivers to the “strong interference regime over both bands”.
Next, we illustrate the relationship between the channel parameters in (16) and (17) with an example of a symmetric weak CLIC, where (16) and (17) imply the same condition. We denote by c min 2 : = ( ( ( 1 + Q 1 ) ( 1 + Q 2 ) / ( 1 + a 2 Q 1 ) ) 1 / α 1 1 ) / P ^ 1 , and α 1 , min : = log ( ( 1 + Q 1 ) ( 1 + Q 2 ) / ( 1 + a 2 Q 1 ) ) / log ( 1 + c 2 P ^ 1 ) , respectively the minimum c 2 and the minimum α 1 required to achieve (16), and show the interplay between a 2 , c min 2 and α 1 , min for Q 1 = Q 2 = P ^ 1 = 1 .
In Figure 2a, we plot c min 2 against a 2 ( 0 , 1 ) for α 1 { 0.5 , 1 , 2 } . Note that c min 2 reduces monotonically as a 2 or α 1 increases. This follows since, if a 2 increases, then the first band forwards more interference, and, if α 1 increases, then the pre-log factor of the cross-channel capacity increases. In either case, smaller c min 2 is required to achieve (16). Similarly in Figure 2b, we plot α 1 , min against a 2 ( 0 , 1 ) for c 2 { 0.5 , 1 , 2 } , and note that α 1 , min reduces as a 2 or c 2 increases. Finally, in Figure 2c, we depict the set of cross-channel gains ( a 2 and c 2 ) of a symmetric CLIC with Q 1 = Q 2 = P ^ 1 = 1 , α 1 = 2 , and partition it depending on whether the capacity has been characterized in each set. Note that the capacity for the set where a 2 < 1 and c 2 < c m 2 has not been characterized yet.

5. The Optimal Sum-Rate Problem

In this section, we study the power allocation scheme over the direct and cross channels in the DCLIC that maximizes its sum-rate. Recall from Corollary 1 that, if the underlying GIC of the DCLIC has strong interference (i.e., a 12 2 1 and a 21 2 1 ), then the sum-rate of the DCLIC is known for all values of the remaining channel parameters, and, in particular, for all transmit powers in the second band. Therefore, we pose this problem for the class of DCLICs with strong underlying GIC (see [A1] below). Also recall that the normalized bandwidth ( α ) of the second band is shared between the direct channels ( α 2 ) and the cross channels ( α 1 ). We denote the fraction of α alloted to the direct and cross channels by β and β ¯ , respectively, where β : = α 2 / α and β ¯ : = 1 β = α 1 / α , and β , β ¯ ( 0 , 1 ) . Thus, β provides a trade-off between the bandwidths in the direct channels ( β α ) and the cross channels ( β ¯ α ).
We formulate the problem under the following assumptions:
  • the underlying GIC of the DCLIC has strong interference, but not very strong interference, 1 a 12 2 < 1 + Q 1 and 1 a 21 2 < 1 + Q 2 ;
  • the underlying GIC of the DCLIC satisfies: Q 2 + a 12 2 Q 1 = Q 1 + a 21 2 Q 2 ;
  • β and β ¯ are fixed a priori;
  • the transmission power in the direct channel ( p k ) and cross channel ( q k ) from transmitter k ( T x k ) satisfy the constraint, β p k + β ¯ q k = P , k = 1 , 2 , where P is the power budget.
In [A1], we assume that the underlying GIC does not have very strong interference, since the power allocation in that case is trivial (see Section 5.4). For ease of exposition, we assume in [A2] that the underlying GIC receives equal power in both its receivers. Note that the class of GICs that satisfies [A2] also contains the symmetric GICs. Moreover, the analysis under [A2] reveals enough insight such that it can be extended to the general case (see Remark 1).
In [A3], β is assumed to be fixed a priori and known. This models practical constraints in many wireless networks where dynamically allocating the bandwidth may not be feasible or straightforward [34]. In [A4], we assume that power budget (P) in both transmitters are the same. There is no loss of generality in this assumption as the relative difference between the power budgets of the first and the second user can be absorbed into the channel gains.

5.1. Problem Formulation and Solution

For a fixed power allocation ( p 1 , q 1 , p 2 , q 2 ) in the mm-wave channels, we denote the sum-rate achievable at R x 1 and R x 2 in (14) and (15) by Σ 1 and Σ 2 , and the interference-free sum-rate given by the sum of individual rates in (12) and (13) by Σ , and present them below
Σ 1 : = A 1 + α β ¯ 2 log ( 1 + c 21 2 q 2 ) + α β 2 log ( 1 + d 1 2 p 1 ) + log ( 1 + d 2 2 p 2 ) ,
Σ 2 : = A 2 + α β ¯ 2 log ( 1 + c 12 2 q 1 ) + α β 2 log ( 1 + d 1 2 p 1 ) + log ( 1 + d 2 2 p 2 ) ,
Σ : = A + α β 2 log ( 1 + d 1 2 p 1 ) + log ( 1 + d 2 2 p 2 ) ,
where A 1 : = 1 2 log ( 1 + Q 1 + a 21 2 Q 2 ) , A 2 : = 1 2 log ( 1 + Q 2 + a 12 2 Q 1 ) and A : = 1 2 log ( 1 + Q 1 ) ( 1 + Q 2 ) . Furthermore, they satisfy A 1 = A 2 < A due to [A2] and [A1]. Therefore, a necessary and sufficient condition for R to be an achievable sum-rate of the DCLIC is R min { Σ 1 , Σ 2 , Σ } , for some power allocation ( p 1 , q 1 , p 2 , q 2 ). The optimization problem that maximizes R over the transmit powers ( p 1 , q 1 , p 2 , q 2 ) is then
[ P 1 ] maximize R
subject to : R Σ 1 ,
R Σ 2 ,
R Σ ,
β p 1 + β ¯ q 1 = P ,
β p 2 + β ¯ q 2 = P ,
( p 1 , q 1 , p 2 , q 2 , R ) 0 .
Note that [ P 1 ] is a convex optimization problem, since its objective function R is linear, the equality constraints (30) and (31) are affine, and the inequality constraints (27)–(29) are convex. Furthermore, [ P 1 ] satisfies Slater’s condition [35] (Chapter 5.2.3), and thus it can be solved using the Karush–Kuhn–Tucker (KKT) conditions [35] (Chapter 5.5.3). We relegate the details to Appendix C.
It is well known that optimal power allocation in parallel Gaussian point-to-point channels follows the Waterfilling (WF) property. Due to this, if the power budget is sufficient small, it is allocated entirely to the “strongest” sub-channel, and, as the power budget is increased, power is allocated to the other “weaker” sub-channels, in addition to the strongest one (see Chapter 3.4.3 in [32]). In the ensuing discussion, it becomes clear that the optimal power allocation in [ P 1 ] (hereby referred to as “the optimal allocation”) has two noticeable properties: a WF-like property, due to which it assigns power to the cross and direct channels following a WF-like allocation, and a max-min property, due to which it increases the minimum of the sum-rate constraints.
Due to its WF-like property, the optimal allocation assigns the entire power budget (P) to only a subset of all the direct and cross channels, depending on channel conditions that indicate whether the direct channels are stronger than the cross channels or vice versa. Moreover, if P is sufficiently increased, it becomes optimal to allocate power to the remaining set of channels. In addition, since the objective of the problem is to maximize min { Σ 1 , Σ 2 , Σ } , the optimal allocation assigns powers in such a way that minimizes or eliminates any difference between Σ 1 , Σ 2 and Σ . We observe that, due to this property (max-min property), the optimal allocation imposes a maximum limit on the cross-channel powers, which is unlike WF in [32] where no such limit exists.
We study the optimal allocation by partitioning the entire set of channel parameters and P into disjoint sets ( S ( . ) ), such that the optimal allocation can be classified according to the power levels in the direct and cross channels in each set. Without loss of generality, we present the optimal allocation under c 21 2 > c 12 2 in Table 1, and study it in detail. In this case, only fours sets, S D , S C , S C D and S sat , are sufficient. The power allocation under c 21 2 < c 12 2 can be readily obtained from Table 1 by swapping the indices 1 and 2, and the case with c 21 2 = c 12 2 is briefly discussed in Section 5.3.
For notational convenience, we express the optimal powers and the conditions of the sets S ( . ) in Table 1 in terms of the following functions:
g 1 ( P ) : = d 1 2 c 12 2 ( 1 + P d 1 2 / β ) + d 2 2 c 21 2 ( 1 + P d 2 2 / β ) ,
g 2 ( P ) : = 1 + c 12 2 P / β ¯ d 1 2 c 12 2 + 1 P ( c 21 2 c 12 2 ) / β + c 21 2 / d 2 2 ,
g 3 ( P ) : = β γ P c 21 2 + β ¯ + β c 21 2 / d 2 2 β ¯ γ + β γ P c 12 2 + β ¯ + β c 12 2 / d 1 2 β ¯ γ ,
F ( P ) : = E 1 + E 2 ( E 1 E 2 ) 2 + 4 β 2 E 1 E 2 / ( 2 β ¯ ( 1 + β ) ) 1 ,
E k : = P c k m 2 + β ¯ + β c k m 2 / d k 2 , k m { 1 , 2 } ,
γ : = ( 1 + Q 1 ) ( 1 + Q 2 ) 1 + Q 1 + a 21 2 Q 2 1 / α β ¯ ,
and P 4 * : = β ¯ ( γ 1 ) / c 12 2 . We relegate the details of the derivation to Appendix C.
In the following, we use “the optimal allocation” and OA 1 interchangeably to refer to the optimal power allocation for [ P 1 ] under c 21 2 > c 12 2 , and discuss some interesting characteristics of OA 1 .
First, the condition g 1 ( P ) < 1 in S D implies that the direct channels are “stronger” than the cross channels, in the sense that, for sufficiently small P, the sum-rate achieved from allocating P only to both direct channels is larger than that achieved from any other subset of channels. Thus, following its WF-like property, OA 1 allocates P entirely to the direct channels, i.e., p 1 = p 2 = P β , and zero power to the cross channels, i.e., q 1 = q 2 = 0 . This allocation also achieves Σ 2 = Σ 1 , which is consistent with the max-min property of OA 1 .
Second, the condition g 2 ( P ) < 1 in S C implies that the cross channels are “stronger” than the direct channels, in the sense that, for sufficiently small P, the sum-rate achieved from allocating P to both cross channels and a direct channel is larger than that achieved from only direct channels. Note that, unlike in S D , OA 1 needs to allocate power to a direct channel in addition to both cross channels in S C , since allocating P entirely to the cross channels causes an imbalance between Σ 1 and Σ 2 ( Σ 2 < Σ 1 ) due to c 21 2 > c 12 2 , which violates the max-min property. Therefore, OA 1 shares P among the cross and direct channels from T x 2 to preserve Σ 2 = Σ 1 . Moreover, the condition P < P 4 * ensures that power in the cross channels have not yet reached their maximum limits.
Third, as P is increased, OA 1 shares P among all channels in S C D . As P increases, the additional benefits from transmitting in a particular subset of channels (either direct channels as in S D , or cross and direct channels as in S C ) begins to diminish. Thus, following its WF-like property, OA 1 starts sharing P among all channels. In addition, OA 1 shares P in such a way that preserves Σ 2 = Σ 1 .
Finally, if P is increased sufficiently, OA 1 follows the allocation in S sat , where the cross-channel powers have reached their maximum limits, q k , sat : = ( γ 1 ) / c k m 2 , k m { 1 , 2 } . Therefore, as P increases further, all subsequent increments of P is allotted to only direct channels. We now say that the cross channels are saturated, in the sense that allocating more power beyond these limits does not improve the sum-rate. Such limits for the cross channels in OA 1 is unlike the WF allocation in [32].
The cross channels become saturated due to the max-min property of OA 1 . Recall that, in S C D , OA 1 allocates powers to all channels, which increase as P increases. The increases in p 1 and p 2 results in the increase of Σ 1 , Σ 2 and Σ by an equal margin. However, an increase in q 1 only increases Σ 2 , and an increase in q 2 only increases Σ 1 . Now, note that, due to its max-min property, OA 1 preserves Σ 1 = Σ 2 in S D , S C , and S C D . However, there may exist a gap between Σ 1 = Σ 2 and Σ . As q 1 and q 2 are increased, this gap reduces and finally becomes zero in S sat . At this point, both cross channels become saturated simultaneously. If any more power is allocated to either q 1 or q 2 , a suboptimal sum-rate R = Σ < min { Σ 1 , Σ 2 } will result. Therefore, OA 1 maintains q k = q k , sat and diverts all additional power to the direct channels.
Note that once, in S sat , the sum-rates achieved by joint decoding ( Σ 1 and Σ 2 ) become equal to the sum of interference-free user rates ( Σ ), which is somewhat similar to the behavior of the GIC under very strong interference. At this point, all additional increments in P are allocated to the direct channels to increase the individual user rates.
In addition, note that the sets S ( . ) form a partition due to their construction using the optimal Lagrange multipliers (defined in Appendix C). Thus, the conditions in Table 1 are mutually exclusive.
Moreover, these conditions can be equivalently described in terms of three critical powers, P k * R + , defined by g k ( P k * ) = 1 , where g k ( P ) is defined in (33)–(35) for k { 1 , 2 , 3 } . Specifically, the conditions of S D , S C , and S sat are given by P < P 1 * , P < min { P 2 * , P 4 * } , and P > max { P 3 * , P 4 * } , respectively. In addition, we note that the direct channels are “stronger” than the cross channels in the sense specified above, if P 1 * > 0 . Similarly, if P 2 * > 0 , the cross channels are “stronger” than the direct channels. Furthermore, if 0 < P 3 * < P 4 * < P 2 * , the cross channels are said to be “much stronger”, in the sense that OA 1 continues allocating power to the cross channels as in S C until they become saturated, and only after that it assigns power to both direct channels.
We note from the mutual exclusiveness of S D and S C that, if P 1 * > 0 , then no P 2 * R + exists that satisfies g 2 ( P 2 * ) = 1 . This shows that, if the direct channels are stronger, the allocation in S C is suboptimal for any P > 0 . Similarly, if P 2 * > 0 , then no P 1 * R + exists such that g 1 ( P 1 * ) = 1 , and thus the allocation in S D is suboptimal for any P > 0 .

5.2. The Waterfilling-Like Nature of the Optimal Power Allocation

Now, we characterize how OA 1 adapts the power allocation, as the power budget (P) increases and crosses the critical thresholds P k * , k { 1 , 2 , 3 } . We say that OA 1 follows the sequence A 1 A 2 A 3 where A l { S D , S C , S C D , S sat } , if OA 1 allocates power as in A 1 for sufficiently small P, and then adapts the powers according to the allocation in A 2 and A 3 as P increases. In this regard, we note that OA 1 follows one of the three sequences, as explained below and illustrated graphically in Figure 3.
  • If P 1 * > 0 : OA 1 follows the sequence S D S C D S sat (denoted by [S1]). Since the direct channels are stronger, OA 1 allocates all of P to them as in S D when P is sufficiently small (i.e., P < P 1 * ). However, as P increases, the additional benefit from transmitting only in the direct channels decreases, and thus, when P P 1 * , OA 1 begins transmitting in both cross and direct channels as in S C D . This allocation follows from its WF-like property, and remains optimal for all P 1 * P P 3 * . Finally, when P > P 3 * , the max-min property of OA 1 comes into effect, and thus the cross channels become saturated and OA 1 starts following the allocation in S sat . Note that, in this case, the saturation threshold for P is P sat = P 3 * .
  • If P 2 * > 0 but 0 < P 3 * < P 4 * < P 2 * is not satisfied: OA 1 follows the sequence S C S C D S sat ([S2]). This case is similar to [S1] above, except for the fact that now the cross channels are stronger. Hence, OA 1 transmits in the cross channels and the direct channel with gain d 2 2 as in S C when P is sufficiently small (i.e., P < min { P 2 * , P 4 * } ). Next, following its WF-like property, OA 1 starts transmitting in all the direct and cross channels as in S C D when P min { P 2 * , P 4 * } . Finally, when P > max { P 3 * , P 4 * } , the cross channels become saturated, and OA 1 follows the allocation in S sat thereon. The saturation threshold for P in this case is P sat = max { P 3 * , P 4 * } .
    Note that, whenever P 2 * > 0 , OA 1 follows [S2] irrespective of how P 2 * , P 3 * , and P 4 * compare, except in two cases: (a) 0 < P 3 * < P 4 * < P 2 * , where OA 1 follows [S3], described next, and (b) 0 < P 4 * < P 3 * < P 2 * , which is infeasible as they violate the mutual exclusiveness of S C and S sat .
  • If 0 < P 3 * < P 4 * < P 2 * : OA 1 follows the sequence S C S sat ([S3]). In this case, the cross channels are much stronger than the direct channels. Hence, similar to [S2], OA 1 allocates power to both cross channels and a direct channel as in S C when P is sufficiently small (i.e., 0 P < P 4 * ). As P increases and P P 4 * , the cross channels become saturated, and OA 1 begins assigning powers to all channels as in S sat . Interestingly, in this case, OA 1 skips S C D . This shows that, since the cross channels are much stronger, it is optimal to allocate power as in S C until they become saturated at P sat = P 4 * , beyond which the allocation in S sat becomes optimal.
We illustrate the characteristics of OA 1 with two numerical examples where we choose the following parameters, a 12 2 = a 21 2 = 1.5 , Q 1 = Q 2 = 5 , α = 2 , β = 0.5 . First, we illustrate an example of [S1] by additionally choosing d 1 2 = 2 , d 2 2 = 3 , c 12 2 = 1 , c 21 2 = 1.5 , such that the direct channels are stronger than the cross channels in the sense that P 1 * = 0.619 and P 2 * = 0.75 R + . In Figure 4a, we plot the optimal powers against the power budget P, and note the following: (i) when 0 P < P 1 * = 0.619 , OA 1 allocates P entirely to the direct channels as in S D , and thus p 1 and p 2 increase with P, and q 1 = q 2 = 0 ; (ii) when P 1 * P P 3 * = 2.73 , OA 1 allocates power to all channels as in S C D , and thus q 1 and q 2 also increase with P; (iii) finally, when P > P 3 * , OA 1 follows S sat where the cross channels become saturated simultaneously, and all increments of P are added to p 1 and p 2 .
We depict the resulting constraints Σ 1 , Σ 2 , and Σ in Figure 4b. First, note that OA 1 preserves R = Σ 1 = Σ 2 for all P. However, there exists a gap between Σ 1 = Σ 2 and Σ in S D and S C D . Specifically, in S D , the gap remains constant ( A A 1 ); in S C D , it reduces gradually as OA 1 transmits in the cross channels, and, in S sat , it becomes zero as OA 1 achieves R = Σ 1 = Σ 2 = Σ , as expected.
Next, we illustrate an example of [S2] with the channel gains, d 1 2 = 0.5 , d 2 2 = 1 , c 12 2 = 1.5 , c 21 2 = 3 , such that the cross channels are stronger than the direct channels in the sense that P 2 * = 0.22 and P 1 * = 0.21 R + . In Figure 5a, we plot the optimal powers against P, and observe the following: (i) when P < P 2 * = 0.22 , OA 1 follows the allocation in S C , and thus p 2 , q 1 and q 2 increase with P, whereas p 1 = 0 ; (ii) when P 2 * P P 3 * = 0.97 , OA 1 allocates power to all channels as in S C D , and thus p 1 now increases with P; (iii) finally, when P > P 3 * , OA 1 follows S sat , and thus the cross channels become saturated simultaneously, as expected in [S2]. In Figure 5b, we plot the sum-rate constraints, and note that OA 1 achieves R = Σ 1 = Σ 2 in all the sets. In addition, the gap between Σ 1 = Σ 2 and Σ is gradually offset as OA 1 transmits in the cross channels in S C and S C D , and it finally becomes zero in S sat , as expected. We omit an example of [S3] ( S C S sat ), which is somewhat similar to [S2].
Remark 1.
Recall that OA 1 presents the solution for [ P 1 ] under c 12 2 < c 21 2 and [A2] (implying A 1 = A 2 ). If c 12 2 < c 21 2 remains unchanged, and [ P 1 ] is formulated under a more general assumption [A2G]; where A 2 < A 1 , then seven disjoint sets are needed to describe the optimal allocation (now denoted by OA G ). The conditions and power allocations of these sets are found by solving the corresponding KKT conditions in a similar way to that of [ P 1 ] under [A2] in Appendix C. Hence, their explicit derivation is omitted for brevity. Of the seven sets, the first four sets, S ^ l , l { D , C , C D , sat } , are counterparts of the corresponding sets S l in Table 1, in the sense that the power allocation in S ^ l is similar to that in S l as discussed below. The remaining three sets S ^ k , k = 1 , 2 , 3 are “new” in the sense that the power allocation in these sets do not resemble that in any of the sets S ( . ) in Table 1.
The power allocation in the first four sets S ^ l are similar to that of S l , l { D , C , C D , sat } in that (a) when the power budget (P) is sufficiently small, OA G allocates power to only both direct channels (in S ^ D ), or both cross channels and one direct channel (in S ^ C ) depending on whether the direct or the cross channels are “strong” as in S D or S C before; (b) as P is increased OA G allocates power to all channels (in S ^ C D ) as in S C D ; and (c) when P is sufficiently large, both cross channels become saturated simultaneously, and OA G allocates all additional increments of P to only the direct channels (in S ^ sat ) as in S sat before. Note that the optimal powers and the critical powers of S ^ ( . ) differ slightly from that of S ( . ) due to the general assumption [A2G]. For conciseness, we omit the conditions, and present the optimal powers in Table 2.
Here γ k : = ( ( 1 + Q 1 ) ( 1 + Q 2 ) / ( 1 + Q l + a l k 2 Q k ) ) 1 / α β ¯ , l k { 1 , 2 } , and F ^ ( P ) and η are defined in (39) and (40) below
F ^ ( P ) : = E 1 + η E 2 ( E 1 η E 2 ) 2 + 4 β 2 η E 1 E 2 / ( 2 β ¯ ( 1 + β ) ) 1 ,
η : = 1 + Q 1 + a 21 2 Q 2 1 + Q 2 + a 12 2 Q 1 1 / α β ¯ > 1 ,
with E 1 , E 2 , defined in (37). Note that, under [A2] ( η = 1 ), the powers in Table 2 simplifies to that in Table 1.
Next, we present the optimal powers and the critical powers of the three new sets in Table 3, where
P 5 * : = β ¯ ( η 1 ) c 12 2 , P 6 * : = β ¯ 1 d 1 2 1 c 12 2 , P 7 * : = β 1 c 12 2 1 d 1 2 , P 8 * : = η β ¯ c 12 2 β d 1 2 , f 3 ( P ) : = d 2 2 c 21 2 ( 1 + d 2 2 P / β ) + β η P c 12 2 + β ¯ + β c 12 2 / d 1 2 β ¯ η .
The power allocation for the new sets can be understood by following the WF-like and max-min properties of OA G . For illustration, we explain the allocations in S ^ 1 and S ^ 2 . In S ^ 1 , the cross channels are stronger than the direct channels. Thus, due to its WF-like property, OA G preferably allocates as much of P to the cross channels as possible. However, under [A2G] ( A 2 < A 1 ) and c 12 2 < c 21 2 , allocating power to both cross channels results in Σ 2 < Σ 1 , and does not increase min { Σ 2 , Σ 1 } as much as possible. This results in a suboptimal sum-rate. Therefore, OA G allocates p 1 = 0 , q 1 = P / β ¯ , which increases only Σ 2 , and assigns p 2 = P / β , q 2 = 0 , which represents the allocation in S ^ 1 .
Furthermore, as P increases and P > max { P 6 * , P 7 * } , additional benefits from transmitting only in the cross channel with gain c 12 2 reduces, and thus OA G starts allocating a fraction of P to the direct channel with gain d 1 2 , due to its WF-like property. Thus, OA G follows the power allocation in S ^ 2 , which remains optimal as long as P < P 8 * . The power allocation in S ^ 3 can be interpreted similarly.
The important insight is that OA G can be explained with the WF-like and max-min properties as in OA 1 , and it does not reveal any new fundamental properties.

5.3. Optimum Power Allocation in the Symmetric DCLIC

We briefly discuss the optimum allocation for the symmetric DCLIC, where a 2 : = a 12 2 = a 21 2 , c 2 : = c 12 2 = c 21 2 , d 2 : = d 1 2 = d 2 2 , and Q 1 = Q 2 . Due to symmetry, considering only symmetric power allocation of the form ( p , q , p , q ) is sufficient, and does not cause loss of generality. Moreover, any feasible (symmetric) power allocation achieves Σ 1 = Σ 2 , rendering the constraint R Σ 2 in (28) redundant. The sum-rate optimization problem in this case can be formulated and solved as in [ P 1 ] , and is omitted here for brevity. In addition, we denote the optimal power allocation for this case by OA S .
In this case, OA S can be described by the power allocation in four disjoint sets, S ¯ l , l { D , C , C D , sat } , which are counterparts of the sets S l in Table 1. The conditions and optimal powers in these sets are given in Table 4, and they can be derived following the same procedure as in Appendix C, and thus is omitted here.
Moreover, the critical powers are now given by
P ¯ 1 * : = β 2 c 2 1 d 2 , P ¯ 2 * : = β ¯ 2 1 d 2 2 c 2 , P ¯ 3 * : = ( 1 + β ) γ ¯ c 2 β d 2 β ¯ c 2 , P ¯ 4 * : = β ¯ ( γ ¯ 1 ) c 2 ,
where γ ¯ : = ( ( 1 + Q 1 ) 2 / ( 1 + Q 1 + a 2 Q 1 ) ) 1 / α β ¯ > 1 .
OA S has the same WF-like and max-min properties as that of OA 1 . In fact, a rudimentary inspection of the conditions reveals that OA S follows one of the three possible sequences of sets depending on the channel gains, as before:
  • If c 2 2 d 2 (direct channels are “stronger”): OA S follows the sequence S ¯ D S ¯ C D S ¯ sat . It transmits only in the direct channels as in S ¯ D when P < P ¯ 1 * , then transmits in all channels as in S ¯ C D when P ¯ 1 * P P 3 * , and finally starts following the allocation in S ¯ sat when P > P ¯ 3 * .
  • If 2 d 2 < c 2 < 2 d 2 γ ¯ (cross channels are “stronger”): OA S follows the sequence S ¯ C S ¯ C D S ¯ sat . It transmits only in the cross channels as in S ¯ C when P < P ¯ 2 * , then transmits in all channels as in S ¯ C D when P ¯ 2 * P min { P 3 * , P 4 * } , and finally follows S ¯ sat when P > min { P 3 * , P 4 * } .
  • If c 2 2 d 2 γ ¯ (cross channels are much “stronger”): OA S follows the sequence S ¯ C S ¯ sat . It follows S ¯ C when P < P ¯ 4 * , and then follows S ¯ sat when P P 4 * , while skipping S ¯ C D altogether.
Thus, OA S allocates all of P to either both direct or both cross channels if P is small, and then shares P among all channels as P increases. In addition, when P is increased beyond the saturation threshold, the cross channels become saturated. Note that, due to symmetry, OA S needs to transmit in both cross channels only in S ¯ C to preserve Σ 1 = Σ 2 , unlike in S C , where OA 1 additionally transmits in a direct channel.
We note the following, which are revealed due to symmetry. First, if c 2 < 2 d 2 , the direct channels are considered to be stronger than the cross channels. Similarly, if c 2 > 2 d 2 , the cross channels are considered to be stronger. Moreover, in the special case of c 2 = 2 d 2 , neither the direct nor the cross channels are stronger than the other, and thus allocating P entirely to only direct or cross channel would be suboptimal. Therefore, OA S transmits in all channels as in S C D when P 0 . Finally, the condition c 2 2 d 2 γ implies that the cross channels are much stronger than the direct channels. Thus, it is optimal to transmit only in the cross channels as in S ¯ C until they become saturated, at which point the allocation in S ¯ sat becomes optimal. Note that these conditions follow trivially from the corresponding conditions of [S1], [S2], and [S3] in Section 5.2 by applying symmetry.
Furthermore, due to symmetry, one can now directly observe that the conditions of the sets in Table 4 are mutually exclusive. For example, consider the conditions of S ¯ D , c 2 < 2 d 2 and P < P ¯ 1 * , and note that they violate the second condition of S ¯ C (since P ¯ 2 * < 0 ), the first condition in S ¯ C D (since P < P ¯ 1 * ), and the second condition in S ¯ sat (trivially follows from γ ¯ > 1 ).
In Figure 6, we depict the set of cross and direct channel gains ( c 2 and d 2 ) of a symmetric DCLIC with parameters a 2 = 1.5 , Q 1 = 5 , α = 2 , β = 0.5 , and partition it depending on whether the direct or the cross channels are stronger.

5.4. Discussion and Insights

Now, we discuss some insights obtained from the power allocation in OA 1 in Table 1.
First, if the underlying GIC in the first band has very strong interference, i.e., a 12 2 1 + Q 1 and a 21 2 1 + Q 2 , OA 1 allocates the power budget (P) entirely to direct channels as in S D for all values of P. Note that, under very strong interference, we have Σ min { Σ 1 , Σ 2 } for any feasible power allocation. Therefore, the sum-rate is maximized by maximizing Σ , which is achieved by allocating all of P to the direct channels.
Second, if the transmit powers in the underlying GIC of the strong CLIC ( Q 1 and Q 2 ) are small, i.e., Q 1 0 , Q 2 0 , then allocating P entirely to the direct channels as in S D , is approximately optimal for all values of P, in the sense that, the difference between the sum-rates achieved with the allocation in S D and that in OA 1 is very small. Specifically, since Q 1 0 , Q 2 0 , the gap between Σ and Σ 1 at P = 0 is very small. As P is sufficiently increased, OA 1 starts transmitting in all channels as in S C D . However, since the gap is very small, only a small fraction of P needs to be allocated to the cross channels to offset the gap, and thus almost all of P is redirected to the direct channels. The resulting allocation closely resembles that in S D .
We can attribute this behavior of OA 1 to the fact that the GIC with strong interference ( a 12 2 1 , a 21 2 1 ) and small transmit powers ( Q 1 0 , Q 2 0 ) approaches the very strong interference regime ( a 12 2 1 + Q 1 , a 21 2 1 + Q 2 ), where allocating P entirely to the direct channels as in S D is optimal for all P.
Third, if the transmit powers in the underlying GIC of the strong CLIC are large, i.e., Q 1 , Q 2 1 , OA 1 allocates power in all channels as in S C D for all but a relatively small range of P. Recall that even if OA 1 transmits in the direct or cross channels as in S D or S C for small P, eventually when P P 1 * or P P 2 * , OA 1 allocates power in all channels as in S C D , where the gap between Σ 1 and Σ starts to reduce. However, if Q 1 , Q 2 1 , this gap is very large, and thus very large P is needed to offset the gap as in S sat . Therefore, for all moderate P, OA 1 allocates power to all channels following S C D .
Fourth, if the cross channel gains in the second band are very large, allocating P entirely to the direct channels as in S D is approximately optimal for all P, in the sense described in the second point above. This follows since with very large c 12 2 and c 21 2 , OA 1 needs to allocate only a very small fraction of P to close the gap between Σ 1 and Σ , and thus redirects the remaining of P to the direct channels. The resulting allocation thus closely resembles that in S D .
Finally, when β 1 , the DCLIC can be approximated by the DLIC, and thus allocating P entirely to only the direct channels is approximately optimal. On the other hand, when β 0 , the DCLIC can be approximated by the CLIC where allocating P entirely to the cross channels is approximately optimal. However, since the cross channels do not increase Σ , the sum-rate becomes limited to Σ when P is sufficiently large, and does not increase anymore.

6. Conclusions

Motivated by simultaneous transmission in the microwave and mm-wave bands, we considered a special class of dual band 2 × 2 GIC. Due to their small wavelength and the use of highly directional antenna arrays, the mm-wave band can be used to form highly directional point-to-point links, allowing interference free reception at a designated receiver. For this model, we first showed that the capacity region of a DCLIC can be decomposed into the capacity of a CLIC complemented by direct channels. Moreover, we considered two specific classes of the CLIC with strong and weak underlying GIC, and obtained sufficient conditions on the channel gains under which the capacity region is characterized.
We characterized the power allocation over the mm-wave channels that maximizes the sum-rate of the DCLIC. Channel conditions have been characterized that determine how power is allocated to the channels depending on whether the direct or the cross channels are “strong”. In particular, the optimal allocation has a WF-like property due to which, when the power budget is small, it assigns the power budget entirely to either both direct channels, or both cross channels and at most one direct channel. As the power budget is increased, the optimal allocation eventually assigns power to all channels. It also has a max-min property, and, as a result, imposes a maximum limit on the cross-channel powers. This power allocation evaluates the impact of high bandwidth point-to-point mm-wave channels on the sum-rate performance, and thus can be useful in allocating resources in practice.
Potential future work includes analyzing other multi-user networks that operate with dual microwave and mm-wave bands.

Acknowledgments

Parts of this work was presented at the 2016 IEEE International Symposium on Information Theory, Barcelona, Spain, 10–15 July 2016. This work was supported by the Natural Sciences and Engineering Research Council of Canada.

Author Contributions

Both Subhajit Majhi and Patrick Mitran conceived of the idea of a dual-band GIC. Subhajit Majhi derived the results and completed writing the manuscript, with Patrick Mitran providing guidance on improving the quality of the presentation of the manuscript. All authors have read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
GICGaussian interference channel
CLICCross-Link interference channel
DLICDirect-Link interference channel
DCLICDirect-and-Cross-Link interference channel
PGICParallel Gaussian interference channel
BMFBandwidth mismatch factor
DMDiscrete memoryless
RVRandom variable
WFWaterfilling
KKTKarush–Kuhn–Tucker

Appendix A. Proof of Theorem 1

Proof. 
Outer bound: We derive the bounds on R 1 and R 2 as follows. If transmitter T x k transmits the message M k , k = 1 , 2 , we have from Fano’s inequality
n R 1 I ( X 1 n , X ^ 1 n 1 , X ¯ 1 n 2 ; Y 1 n , Y ^ 1 n 1 , Y ¯ 1 n 2 ) + n ϵ n = ( a ) I ( X 1 n , X ^ 1 n 1 ; Y 1 n , Y ^ 1 n 1 ) + I ( X ¯ 1 n 2 ; Y 1 n , Y ^ 1 n 1 | X 1 n , X ^ 1 n 1 ) + I ( X ¯ 1 n 2 ; Y ¯ 1 n 2 | Y 1 n , Y ^ 1 n 1 ) + I ( X 1 n , X ^ 1 n 1 ; Y ¯ 1 n 2 | X ¯ 1 n 2 , Y 1 n , Y ^ 1 n 1 ) + n ϵ n = ( b ) I ( X 1 n , X ^ 1 n 1 ; Y 1 n , Y ^ 1 n 1 ) + I ( X ¯ 1 n 2 ; Y ¯ 1 n 2 | Y 1 n , Y ^ 1 n 1 ) + n ϵ n = I ( X 1 n , X ^ 1 n 1 ; Y 1 n , Y ^ 1 n 1 ) + = 1 n 2 h ( Y ¯ 1 | Y ¯ 1 1 , Y 1 n , Y ^ 1 n 1 ) h ( Y ¯ 1 | Y ¯ 1 1 , X ¯ 1 n 2 , Y 1 n , Y ^ 1 n 1 ) + n ϵ n ( c ) I ( X 1 n , X ^ 1 n 1 ; Y 1 n , Y ^ 1 n 1 ) + = 1 n 2 h ( d 1 X ¯ 1 + Z ¯ 1 ) h ( Z ¯ 1 ) + n ϵ n ( d ) I ( X 1 n , X ^ 1 n 1 ; Y 1 n , Y ^ 1 n 1 ) + n 2 C ( d 1 2 P ¯ 1 ) + n ϵ n ,
where (a) follows from the chain rule; (b) follows since the second term of (a) is zero due to the Markov chain (MC) X ¯ 1 n 2 ( X 1 n , X ^ 1 n 1 ) ( Y 1 n , Y ^ 1 n 1 ) , and the fourth term of (a) is zero due to the MC ( Y 1 n , Y ^ 1 n 1 , X 1 n , X ^ 1 n 1 ) X ¯ 1 n 2 Y ¯ 1 n 2 ; (c) follows since unconditioning does not reduce entropy, and also from the memoryless Gaussian model of the direct channels; and (d) follows since X ¯ 1 N ( 0 , P ¯ 1 ) , i.i.d., maximizes the mutual information. Bounding R 2 similarly, and since n 2 α 2 n , we have
R 1 r 1 + α 2 C ( d 1 2 P ¯ 1 ) , R 2 r 2 + α 2 C ( d 2 2 P ¯ 2 ) ,
where r k : = sup X k n , X ^ k n 1 1 n I ( X k n , X ^ k n 1 ; Y k n , Y ^ k n 1 ) , k = 1 , 2 , for product distribution k = 1 2 p ( x k n , x ^ k n 1 ) , with n 1 α 1 n .
Achievability: We will code over t blocks of symbols together. We choose ( n , n 1 , n 2 ) , and define U k : = ( X k n , X ^ k n 1 ) , k = 1 , 2 , where p ( U 1 , U 2 ) = p ( U 1 ) p ( U 2 ) , and define U ¯ k : = X ¯ k n 2 , k = 1 , 2 and choose X ¯ k N ( 0 , P ¯ k ) , i.i.d., for = 1 , , n 2 . Suppose T x k transmits M k M k , k = 1 , 2 . Thus, to encode M k over t blocks for the underlying CLIC, we generate 2 t n R k i.i.d. sequences U k t ( M k ) , one for each M k M k , where U k t ( M k ) are distributed according to p ( u k t ) = = 1 t p ( u k , ) = = 1 t p x k , ( 1 ) n + 1 n x ^ k , ( 1 ) n 1 + 1 n 1 , k = 1 , 2 . Similarly, for the two direct channels, we generate 2 t n R k i.i.d. sequences U ¯ k t ( M k ) , distributed as = 1 n 2 t p ( x ¯ k ) , where x ¯ k N ( 0 , P ¯ k ) , for k = 1 , 2 , and = 1 , , n 2 .
Thus, to transmit M k , k = 1 , 2 , U 1 t ( M 1 ) and U 2 t ( M 2 ) are transmitted through the CLIC, and U ¯ 1 t ( M 1 ) and U ¯ 2 t ( M 2 ) are transmitted through the two direct channels in the second band. Upon receiving the sequences ( Y k n t , Y ^ k n 1 t ) and Y ¯ k n 2 t , R x k employs joint typical decoding to estimate the transmitted message as in [32] (Chapter 4.3). The probability of decoding error vanishes with t if
R k < 1 n I ( X k n , X ^ k n 1 ; Y k n , Y ^ k n 1 ) + n 2 n C ( d k 2 P ¯ k ) , k = 1 , 2 ,
which matches the upper bound on R 1 and R 2 , as n 2 α 2 n , and n . Finally, the capacity region of the CLIC with BMF α is defined as the closure of the union of all sets of achievable rate pairs ( R 1 , R 2 ) that satisfy (A2) with n 2 = 0 , where the union is taken over all product distributions, p ( x 1 n , x ^ 1 n 1 ) p ( x 2 n , x ^ 2 n 1 ) , and for all n , n 1 Z + , such that n α 1 n . ☐

Appendix B. Proof of Theorem 2

Proof. 
Outer bound: We derive the bound on R 1 first. Assuming that the transmitter T x k transmits the message M k , k = 1 , 2 , we have from Fano’s inequality
n R 1 I ( X 1 n , X ^ 1 n ; Y 1 n , Y ^ 1 n ) + n ϵ n I ( X 1 n , X ^ 1 n ; Y 1 n , Y ^ 1 n , X 2 n , X ^ 2 n ) + n ϵ n = ( a ) I ( X 1 n , X ^ 1 n ; Y 1 n , Y ^ 1 n | X 2 n , X ^ 2 n ) + n ϵ n = ( b ) I ( X 1 n , X ^ 1 n ; Y 1 n | X 2 n , X ^ 2 n ) + I ( X 1 n , X ^ 1 n ; Y ^ 1 n | X 2 n , X ^ 2 n , Y 1 n ) + n ϵ n = ( c ) I ( X 1 n ; Y 1 n | X 2 n ) + n ϵ n ( d ) = 1 n I ( X 1 ; Y 1 | X 2 ) + n ϵ n ( e ) n C ( Q 1 ) + n ϵ n ,
where (a) follows since ( X 1 n , X ^ 1 n ) ( X 2 n , X ^ 2 n ) ; (b) follows from the chain rule; (c) follows from the first term in (b) due to the MCs X ^ 2 n X 2 n Y 1 n and ( X ^ 1 n , X ^ 2 n ) ( X 1 n , X 2 n ) Y 1 n , and since the second term in (b) is zero due to ( X 1 n , X ^ 1 n , X 2 n , Y 1 n ) X ^ 2 n Y ^ 1 n ; (d) follows since unconditioning does not reduce entropy, and also from the memoryless Gaussian model; and (e) follows from the Gaussian system model, and by choosing X 1 N ( 0 , Q 1 ) i.i.d., which maximizes the mutual information. We similarly bound R 2 by, R 2 C ( Q 2 ) .
Next, we bound the sum-rate. We have from Fano’s inequality
n R 1 + n R 2 I ( X 1 n , X ^ 1 n ; Y 1 n , Y ^ 1 n ) + I ( X 2 n , X ^ 2 n ; Y 2 n , Y ^ 2 n ) + n ϵ n I ( X 1 n , X ^ 1 n ; Y 1 n , Y ^ 1 n , X 2 n , X ^ 2 n ) + I ( X 2 n , X ^ 2 n ; Y 2 n , Y ^ 2 n , X 1 n , X ^ 1 n ) + n ϵ n = ( a ) I ( X 1 n ; Y 1 n | X 2 n ) + I ( X 2 n ; Y 2 n | X 1 n ) + n ϵ n ( b ) I ( X 1 n ; Y 2 n ) + I ( X 2 n ; Y 2 n | X 1 n ) + n α 1 C ( c 12 2 P ^ 1 ) + n ϵ n = I ( X 1 n , X 2 n ; Y 2 n ) + n α 1 C ( c 12 2 P ^ 1 ) + n ϵ n ( c ) = 1 n I ( X 1 , X 2 ; Y 2 ) + n α 1 C ( c 12 2 P ^ 1 ) + n ϵ n ( d ) n C ( a 12 2 Q 1 + Q 2 ) + n α 1 C ( c 12 2 P ^ 1 ) + n ϵ n ,
where the first term in (a) follows from the steps (a)–(c) of (A3), and the second term follows similarly; (b) follows from (18) in Lemma 2; (c) follows since unconditioning does not reduce entropy, and also from the memoryless Gaussian model; and (d) follows from choosing X k N ( 0 , Q k ) , k = 1 , 2 , i.i.d., which maximizes the mutual information. Hence as n , and thus ϵ n 0 , we obtain the bound in (23). The other bound in (22) is found similarly. Thus, (20)–(23) provides an outer bound on the capacity region of the CLIC.
Achievability: We fix n and n 1 , and choose independent input distributions for the codewords of two users in each channel, i.e., p ( x 1 , x 2 ) = p ( x 1 ) p ( x 2 ) , and p ^ ( x ^ 1 , x ^ 2 ) = p ^ ( x ^ 1 ) p ^ ( x ^ 2 ) . The transmitter T x k encodes a message, M k M k into two codewords, x k n ( M k ) and x ^ k n 1 ( M k ) , generated according to the i.i.d. distributions, = 1 n p k ( x k ) and = 1 n 1 p ^ k ( x ^ k ) , k = 1 , 2 . The codewords, x k n ( M k ) and x ^ k n 1 ( M k ) , are then transmitted through the IC in the first band and the cross channels in the second band, respectively. Each receiver then estimates both messages, ( M 1 , M 2 ) , from the signals observed over both bands as in a multiple access channel. We use standard error analysis technique [32] (Chapter 4.5) to find that the probability of decoding error becomes arbitrarily small as n , if the rates achieved in the receiver R x k satisfies
R k I ( X k ; Y k | X m ) , R m I ( X m ; Y k | X k ) + α 1 I ( X ^ m ; Y ^ k ) , R 1 + R 2 I ( X 1 , X 2 ; Y k ) + α 1 I ( X ^ m ; Y ^ k ) ,
for k m { 1 , 2 } . Moreover, these achievable rates are maximized by using i.i.d. Gaussian inputs, X k * N ( 0 , Q k ) and X ^ k * N ( 0 , P ^ k ) , k = 1 , 2 . Finally, observe that (18) and (19) in Lemma 2 are valid for the independent Gaussian inputs X 1 * and X 2 * with n = 1 , which implies that the individual rates must satisfy, R k I ( X k * ; Y k * | X m * ) I ( X k * ; Y m * ) + α 1 C ( c k m 2 P ^ k ) I ( X k * ; Y m * | X m * ) + α 1 C ( c k m 2 P ^ k ) , where Y k * is the channel output with the Gaussian inputs X k * , k = 1 , 2 . Hence, the individual rates are limited to R k I ( X k * ; Y k * | X m * ) = C ( Q k ) , k m { 1 , 2 } , which matches the bound in (20) and (21). In addition, the sum-rate constraints in (A5) above match the sum-rate bounds in (22) and (23). Thus, the outer bound is achieved with independent Gaussian inputs. ☐

Appendix C. Derivation of the Optimal Power Allocation

The KKT Conditions: We show that [ P 1 ] is convex, and formulate its KKT conditions. Denote by x : = ( p 1 , q 1 , p 2 , q 2 , R ) R + 5 a feasible point, satisfying the constraints in [ P 1 ] . Now, the objective of [ P 1 ] is equivalent to minimizing R , which is linear, and the equality constraints (30) and (31) are affine. Next, we note that the constraints (27)–(29) are convex. To illustrate, we define G 1 ( x ) : = R A 1 α β ¯ κ 2 ln ( 1 + c 21 2 q 2 ) α β κ 2 ln ( 1 + d 1 2 p 1 ) α β κ 2 ln ( 1 + d 2 2 p 2 ) corresponding to (27) with κ : = 1 / ln 2 , and derive its Hessian, 2 G 1 ( x ) : = α κ 2 diag β d 1 4 / ( 1 + d 1 2 p 1 ) 2 , 0 , β d 2 4 / ( 1 + d 2 2 p 2 ) 2 , β ¯ c 21 4 / ( 1 + c 21 2 q 2 ) 2 , 0 , where D = diag [ a 1 , , a m ] is a diagonal matrix with elements a 1 , , a m . Note that 2 G 1 ( x ) is positive semidefinite, and thus (27) is convex. Likewise, (28) and (29) are found to be convex. In addition, (30) and (31) and x 0 imply that the feasible set is compact for given P > 0 and β ( 0 , 1 ) . Hence, [ P 1 ] is a convex problem over a compact set. Furthermore, [ P 1 ] satisfies Slater’s condition [35] (Chapter 5.2.3), since the point x ˜ : = P β ¯ ϵ / β , ϵ , P β ¯ ϵ / β , ϵ , A 1 is strictly feasible for sufficiently small ϵ > 0 . Therefore, [ P 1 ] can be solved using the KKT conditions [35] (Chapter 5.5.3).
We define the Lagrange multipliers { λ i } i = 1 3 , corresponding to (27)–(29), { μ i } i = 1 2 , corresponding to (30) and (31), and { ρ i } i = 1 5 , corresponding to the nonnegativity constraints, ( p 1 , q 1 , p 2 , q 2 , R ) 0 . The Lagrangian is then
L = R + λ 1 ( R Σ 1 ) + λ 2 ( R Σ 2 ) + λ 3 ( R Σ ) + μ 1 β p 1 + β ¯ q 1 P + μ 2 β p 2 + β ¯ q 2 P ρ 1 p 1 ρ 2 q 1 ρ 3 p 2 ρ 4 q 2 ρ 5 R ,
where Σ 1 , Σ 2 and Σ , are defined in (24)–(26) with A 1 = A 2 .
With a slight abuse of notation, we denote the optimal primal variable by ( p 1 , q 1 , p 2 , q 2 , R ) , and the optimal Lagrange multipliers by ( λ 1 , λ 2 , λ 3 , μ 1 , μ 2 , ρ 1 , ρ 2 , ρ 3 , ρ 4 , ρ 5 ) . The optimal variables satisfy the KKT conditions below
λ 1 + λ 2 + λ 3 = 1 ,
ρ 1 = β μ 1 α κ 2 d 1 2 1 + d 1 2 p 1 ,
ρ 2 = β ¯ μ 1 α κ 2 c 12 2 λ 2 1 + c 12 2 q 1 ,
ρ 3 = β μ 2 α κ 2 d 2 2 1 + d 2 2 p 2 ,
ρ 4 = β ¯ μ 2 α κ 2 c 21 2 λ 1 1 + c 21 2 q 2 ,
β p 1 + β ¯ q 1 = P ,
β p 2 + β ¯ q 2 = P ,
R Σ 1 0 ,
R Σ 2 0 ,
R Σ 0 ,
λ 1 ( R Σ 1 ) = 0 ,
λ 2 ( R Σ 2 ) = 0 ,
λ 3 ( R Σ ) = 0 ,
ρ 1 p 1 = 0 ,
ρ 2 q 1 = 0 ,
ρ 3 p 2 = 0 ,
ρ 4 q 2 = 0 ,
( p 1 , q 1 , p 2 , q 2 , R , λ 1 , λ 2 , λ 3 , ρ 1 , ρ 2 , ρ 3 , ρ 4 ) 0 ,
where we used the fact that ρ 5 = 0 since R A 1 > 0 . Next, in order to characterize the power allocation in OA 1 , where [A2] (implying A 1 = A 2 ) and c 12 2 < c 21 2 apply, we partition the set of optimal Lagrange multipliers.
Partitioning the Set of the Optimal Lagrange Multipliers: First, note that ρ 1 and ρ 2 do not satisfy ρ 1 > 0 and ρ 2 > 0 , as this implies p 1 = q 1 = 0 , which violates (A12). Similarly, ρ 3 and ρ 4 do not satisfy ρ 3 > 0 and ρ 4 > 0 , as this violates (A13). Therefore, we partition the set of ( ρ 1 , ρ 2 ) 0 into the disjoint subsets, B 1 : = { ( ρ 1 , ρ 2 ) : ρ 1 > 0 , ρ 2 = 0 } , B 2 : = { ( ρ 1 , ρ 2 ) : ρ 1 = 0 , ρ 2 > 0 } , and B 3 : = { ( ρ 1 , ρ 2 ) : ρ 1 = 0 , ρ 2 = 0 } . Similarly, we partition the set ( ρ 3 , ρ 4 ) 0 into three disjoint subsets, C k , k = 1 , 2 , 3 . Therefore, any tuple ( ρ 1 , ρ 2 , ρ 3 , ρ 4 ) must be in one of the nine resulting cases, B k C l , k , l = 1 , 2 , 3 . For conciseness, we denote the tuples ( ρ 1 , ρ 2 , ρ 3 , ρ 4 ) and ( λ 1 , λ 2 , λ 3 ) by the vectors ρ and λ , respectively.
We similarly partition the set of λ 0 into 5 subsets: D 1 : = { λ : λ 1 > 0 , λ 2 = 0 , λ 3 0 } , D 2 : = { λ : λ 1 = 0 , λ 2 > 0 , λ 3 0 } , D 3 : = { λ : λ 1 = 0 , λ 2 = 0 , λ 3 > 0 } , D 4 : = { λ : λ 1 > 0 , λ 2 > 0 , λ 3 = 0 } , and D 5 : = { λ : λ 1 > 0 , λ 2 > 0 , λ 3 > 0 } . Note that when λ D 1 , it satisfies either (a) λ 1 > 0 , λ 2 = 0 , λ 3 > 0 , or (b) λ 1 > 0 , λ 2 = 0 , λ 3 = 0 . Specifically, in (a) (A14) and (A16) are tight, and thus R = Σ 1 = Σ < Σ 2 , and in (b) only (A14) is tight, and thus R = Σ 1 < min { Σ 2 , Σ } . Thus, in D 1 , we have Σ 1 Σ , Σ 1 < Σ 2 . Similarly, we note the following: in D 2 , we have Σ 2 Σ , Σ 2 < Σ 1 ; in D 3 , we have Σ < min { Σ 1 , Σ 2 } ; in D 4 , we have Σ 2 = Σ 1 < Σ , and in D 5 , we have Σ 2 = Σ 1 = Σ .
Now, we characterize “rules” ( R m ) that determines whether any ρ B k C l and λ D j jointly satisfy the KKT conditions (i.e., they are compatible) or not (i.e., incompatible).
  • R 1 : For any k { 1 , 3 } , l { 1 , 2 , 3 } , if ρ B k C l then λ D j , j = 1 , 3 .
  • R 2 : For any k { 1 , 2 , 3 } , l { 1 , 3 } , if ρ B k C l then λ D j , j = 2 , 3 .
  • R 3 : If ρ B 1 C 2 then λ D j , j = 2 , 4 , 5 .
  • R 4 : If ρ B 2 C 1 then λ D j , j = 1 , 4 , 5 .
  • R 5 : If ρ B 2 C 3 then λ D j , j = 1 , 4 , 5 .
  • R 6 : If ρ B 3 C 2 then λ D j , j = 2 , 4 , 5 .
  • R 7 : If ρ B 2 C 2 then λ D j , j = 1 , 2 , 3 , 5 .
We provide below an outline of how { R m } m = 1 7 follow. For brevity, we indicate the compatibility (incompatibility) between B k C l and D j by (×) in Table A1, along with the conditions [Cm], m = 1 , , 4 , ( R m , m = 1 , , 7 ) that make them compatible (incompatible). Note that [Cm], m = 1 , , 4 , are given in (A25)–(A28) in the next two pages.
Table A1. Compatibility of B k C l and D j .
Table A1. Compatibility of B k C l and D j .
B k C l D 1 Cond. D 2 Cond. D 3 Cond. D 4 Cond. D 5 Cond.
B 1 C 1 × R 1 × R 2 × R 1 × c 12 2 < c 21 2 × c 12 2 < c 21 2
B 1 C 2 × R 1 × R 3 × R 1 × R 3 × R 3
B 1 C 3 × R 1 × R 2 × R 1 [C1][C3]
B 2 C 1 × R 4 × R 2 × R 2 × R 4 × R 4
B 2 C 2 × R 7 × R 7 × R 7 [C2]× R 7
B 2 C 3 × R 5 × R 2 × R 2 × R 5 × R 5
B 3 C 1 × R 1 × R 2 × R 2 × c 12 2 < c 21 2 × c 12 2 < c 21 2
B 3 C 2 × R 1 × R 6 × R 1 × R 6 × R 6
B 3 C 3 × R 1 × R 2 × R 1 [C4][C3]
R 1 follows from a specific relation between ρ 2 and λ 2 as follows. Suppose ρ B k C l as in R 1 . In B k , we have ρ 2 = 0 , and in D j we have λ 2 = 0 for j , k { 1 , 3 } . Now, ρ 2 = λ 2 = 0 implies μ 1 = 0 from (A9), resulting in ρ 1 = α κ d 1 2 / ( 2 ( 1 + d 1 2 p 1 ) ) < 0 from (A8), which contradicts ρ 1 0 in B k . Thus, λ D j , j = 1 , 3 . R 2 follows from a similar relation between ρ 4 and λ 1 .
Now, consider R 3 where ρ B 1 C 2 satisfy ρ 1 > 0 , ρ 2 = 0 , ρ 3 = 0 , ρ 4 > 0 . This results in p 1 = 0 , q 1 = P / β ¯ , p 2 = P / β , q 2 = 0 , following (A20)–(A23) and (A12) and (A13). This further implies Σ 1 < Σ 2 , which contradicts the implications of λ D j , j = 2 , 4 , 5 . Therefore, λ D j , j = 2 , 4 , 5 . R 4 follows similarly where the roles of ρ 1 , ρ 3 , and Σ 1 are exchanged with that of ρ 2 , ρ 4 , and Σ 2 , respectively.
In R 5 , ρ B 2 C 3 satisfy ρ 1 = 0 , ρ 2 > 0 , ρ 3 = 0 , ρ 4 = 0 , which results in p 1 = P / β , q 1 = 0 , p 2 > 0 , q 2 > 0 , from (A20)–(A23) and (A12) and (A13). Since q 1 = 0 and q 2 > 0 , we have Σ 2 < Σ 1 that contradicts the implications of λ D j , j = 1 , 4 , 5 as in R 4 above. The assertion in R 6 follows similarly by exchanging the roles of ρ 1 , ρ 2 , and Σ 1 with ρ 3 , ρ 4 , and Σ 2 , respectively.
In R 7 , ρ B 2 C 2 satisfy ρ 1 = 0 , ρ 2 > 0 , ρ 3 = 0 , ρ 4 > 0 , which results in p 1 = P / β , q 1 = 0 , p 2 = P / β , q 2 = 0 , from (A20)–(A23) and (A12) and (A13). This implies Σ 1 = Σ 2 < Σ , which is consistent with the implication of λ D 4 only.
Power Allocation in OA 1 : We associate the five compatible subsets in Table A1 (denoted by √) to the four sets S ( . ) in Table A1 according to the resulting power allocations: channel parameters and P are in (a) S C if ρ B 1 C 3 , λ D 4 ; (b) S D if ρ B 2 C 2 , λ D 4 ; (c) S sat if ρ B 3 C 3 or B 1 C 3 , λ D 5 ; and (d) S C D if ρ B 3 C 3 , λ D 4 . Next, we characterize the conditions, [Cm] , m = 1 , , 4 , in terms of the channel parameters, which give the conditions of sets S ( . ) .
  • S C : Since ρ B 1 C 3 , they satisfy ρ 1 > 0 , ρ 2 = 0 , ρ 3 = 0 , ρ 4 = 0 , which implies p 1 = 0 , q 1 = P / β ¯ from (A12), (A20) and (A21), and p 2 > 0 , q 2 > 0 from (A22) and (A23). In addition, λ D 4 implies Σ 1 = Σ 2 from (A17) and (A18), which, from the expressions of Σ 1 and Σ 2 in (24) and (25) gives c 21 2 q 2 = c 12 2 P / β ¯ . Thus, we have q 2 = P c 12 2 / ( c 21 2 β ¯ ) , and therefore, p 2 = P ( 1 c 12 2 / c 21 2 ) / β from (A13). Note that P > 0 and c 12 2 < c 21 2 are sufficient for ( q 1 , p 2 , q 2 ) 0 . Moreover, λ 3 = 0 implies Σ 2 < Σ , resulting in P < β ¯ ( γ 1 ) / c 12 2 , i.e., P < P 4 * where γ , defined in (38), is γ > 1 due to assumption [A1]. Next, from ρ 3 = 0 , ρ 4 = 0 and (A10) and (A11), we have λ 1 = ( 1 + P c 12 2 / β ¯ ) / ( P ( c 21 2 c 12 2 ) / β + c 12 2 / d 2 2 ) . In addition, from ρ 2 = 0 and (A8), the condition for ρ 1 > 0 is λ 2 > d 1 2 ( 1 + P c 12 2 / β ¯ ) / c 12 2 . Since λ 3 = 0 , from (A7) we have λ 2 + λ 1 = 1 , which subsequently gives d 1 2 ( 1 + P c 12 2 / β ¯ ) / c 12 2 + ( 1 + P c 12 2 / β ¯ ) / ( P ( c 21 2 c 12 2 ) / β + c 12 2 / d 2 2 ) < 1 , i.e., g 2 ( P ) < 1 as in Table 1. Thus, the condition of S C is
    [ C 1 ] : P < P 4 * , g 2 ( P ) < 1 .
  • S D : Since ρ B 2 C 2 , they satisfy ρ 1 = 0 , ρ 2 > 0 , ρ 3 = 0 , ρ 4 > 0 , and imply p 1 = P / β , q 1 = 0 , p 2 = P / β , q 2 = 0 , following (A20)–(A23) and (A12) and (A13). In addition, λ D 4 implies Σ 1 = Σ 2 < Σ from (A17)–(A19), for which assumption [A1] is sufficient. Next, using ρ 1 = ρ 3 = 0 and (A8)–(A11) the sufficient conditions for ρ 2 > 0 and ρ 4 > 0 are found to be λ 2 < d 1 2 / ( c 12 2 ( 1 + d 1 2 P / β ) ) and λ 1 < d 2 2 / ( c 21 2 ( 1 + d 2 2 P / β ) ) respectively. Since λ 3 = 0 , and thus λ 1 + λ 2 = 1 from (A7), the bounds on λ 1 and λ 2 are combined, which gives d 1 2 / ( c 12 2 ( 1 + d 1 2 P / β ) ) + d 2 2 / ( c 21 2 ( 1 + d 2 2 P / β ) ) > 1 , i.e., g 1 ( P ) > 1 as in Table 1. Finally, P > 0 is sufficient for ( p 1 , p 2 ) 0 . Thus, the condition of S D is
    [ C 2 ] : g 1 ( P ) > 1 .
  • S sat : Since ρ B 3 C 3 , they satisfy ρ 1 = ρ 2 = ρ 3 = ρ 4 = 0 , and imply ( p 1 , q 1 , p 2 , q 2 ) 0 , following (A20)–(A23). In addition, λ D 5 implies Σ 1 = Σ and Σ 2 = Σ from (A17)–(A19), which gives q 1 = γ 1 c 12 2 and q 2 = γ 1 c 21 2 , respectively, where γ > 1 . Thus, we have p 1 = P β β ¯ ( γ 1 ) β c 12 2 and p 2 = P β β ¯ ( γ 1 ) β c 21 2 from (A12)–(A13). Note that the condition for p 1 > 0 is P > β ¯ ( γ 1 ) / c 12 2 , i.e., P > P 4 * , and it is also sufficient for p 2 > 0 due to c 21 2 > c 12 2 .
    Now, from ρ = 0 and (A8)–(A11), and using the expressions of p l , q l , l = 1 , 2 , above we find λ 1 = γ β / ( P c 21 2 + β c 21 2 / d 2 2 β ¯ ( γ 1 ) ) , and λ 2 = γ β / ( P c 12 2 + β c 12 2 / d 1 2 β ¯ ( γ 1 ) ) . We also note that P > P 4 * is sufficient for λ l > 0 , l = 1 , 2 . Next, to ensure λ 3 > 0 , λ 1 and λ 2 must satisfy λ 1 + λ 2 < 1 , which gives the condition g 3 ( P ) < 1 as in S sat in Table A1. In addition, note that γ > 1 is sufficient for ( q 1 , q 2 ) 0 .
    Finally, the case with ρ B 1 C 3 only differ from that with B 3 C 3 in that now ρ 1 > 0 , and thus p 1 = 0 . We note that P = P 4 * is sufficient for p 1 = 0 . We also note that the other condition, which follows from the conditions on λ 1 and λ 2 as in B 3 C 3 , is expressed by evaluating g 3 ( P ) < 1 above at P = P 4 * . Therefore, the conditions of the two cases are combined as
    [ C 3 ] : P P 4 * , g 3 ( P ) < 1 .
  • S C D : Since ρ B 3 C 3 , they satisfy ρ = 0 , and imply ( p 1 , q 1 , p 2 , q 2 ) 0 , following (A20)–(A23. In addition, λ D 4 imply Σ 1 = Σ 2 . Next, from ρ = 0 , (A8)–(A11), (A12) and (A13), we find α κ 2 μ 1 ( β + β ¯ λ 2 ) = P + β ¯ c 12 2 + β d 1 2 , and α κ 2 μ 2 ( β + β ¯ λ 1 ) = P + β ¯ c 21 2 + β d 2 2 . Also, since Σ 1 = Σ 2 , we have from (24) and (25), λ 2 μ 2 c 12 2 = λ 1 μ 1 c 21 2 . In addition, since λ 3 = 0 we have λ 1 = 1 λ 2 . Combining these conditions, we get a quadratic equation of λ 2 , A ^ λ 2 2 B ^ λ 2 + C ^ = 0 , where A ^ : = β ¯ ( E 1 E 2 ) , B ^ : = E 1 E 2 + 2 β E 2 , and C ^ : = β E 2 , with E 1 : = P c 12 2 + β ¯ + β c 12 2 d 1 2 and E 2 : = P c 21 2 + β ¯ + β c 21 2 d 2 2 as defined in (37). One of its roots, λ 2 ( 1 ) : = ( B ^ + B ^ 2 4 A ^ C ^ ) / 2 A ^ , is infeasible as it violates (A7) and the nonnegativity of λ 2 , respectively, when A ^ > 0 and A ^ < 0 . Therefore, the valid solution is λ 2 = λ 2 ( 2 ) : = ( B ^ B ^ 2 4 A ^ C ^ ) / 2 A ^ . Next, from (A9) and substituting λ 2 ( 2 ) in α κ 2 μ 1 ( β + β ¯ λ 2 ) = P + β ¯ c 12 2 + β d 1 2 , and with some algebraic simplification, we have q 1 = F ( P ) / c 12 2 where F ( P ) : = E 1 + E 2 ( E 1 E 2 ) 2 + 4 β 2 E 1 E 2 2 β ¯ ( 1 + β ) 1 , and E 1 and E 2 defined in (37). Finally, from the mutual exclusiveness of the sets S ( . ) , the condition of S C D is given by
    [ C 4 ] : complement of ( [ C 1 ] [ C 2 ] [ C 3 ] ) .

References

  1. LTE-A 4G Solution, Stockholm, Sweden. Available online: https://www.ramonmillan.com/documentos/bibliografia/4GEricsson.pdf (accessed on 13 September 2017).
  2. Pi, Z.; Khan, F. An introduction to millimeter-wave mobile broadband systems. IEEE Commun. Mag. 2011, 49, 101–107. [Google Scholar] [CrossRef]
  3. Rappaport, T.S.; Sun, S.; Mayzus, R.; Zhao, H.; Azar, Y.; Wang, K.; Wong, G.N.; Schulz, J.K.; Samimi, M.; Gutierrez, F. Millimeter wave mobile communications for 5G cellular: It Will Work! IEEE Access 2013, 1, 335–349. [Google Scholar] [CrossRef]
  4. Akdeniz, M.R.; Liu, Y.; Samimi, M.K.; Sun, S.; Rangan, S.; Rappaport, T.S.; Erkip, E. Millimeter Wave Channel Modeling and Cellular Capacity Evaluation. IEEE J. Sel. Areas Commun. 2014, 32, 1164–1179. [Google Scholar] [CrossRef]
  5. Qingling, Z.; Li, J. Rain Attenuation in Millimeter Wave Ranges. In Proceedings of the IEEE 7th International Symposium on Antennas, Propagation and EM Theory, Guilin, China, 26–29 October 2006; pp. 1–4. [Google Scholar]
  6. Singh, S.; Mudumbai, R.; Madhow, U. Interference analysis for highly directional 60-GHz mesh networks: The case for rethinking medium access control. IEEE/ACM Trans. Netw. 2011, 19, 1513–1527. [Google Scholar] [CrossRef]
  7. Qiao, J.; Shen, X.S.; Mark, J.W.; Shen, Q.; He, Y.; Lei, L. Enabling device-to-device communications in millimeter-wave 5G cellular networks. IEEE Commun. Mag. 2015, 53, 209–215. [Google Scholar] [CrossRef]
  8. Collonge, S.; Zaharia, G.; Zein, G.E. Influence of the human activity on wide-band characteristics of the 60 GHz indoor radio channel. IEEE Trans. Wirel. Commun. 2004, 3, 2396–2406. [Google Scholar] [CrossRef] [Green Version]
  9. Zheng, K.; Zhao, L.; Mei, J.; Dohler, M.; Xiang, W.; Peng, Y. 10 Gb/s Hetsnets with Millimeter-wave communications: Access and networking - challenges and protocols. IEEE Commun. Mag. 2015, 53, 222–231. [Google Scholar] [CrossRef]
  10. G Radio Access: Requirements, Concept and Technologies. Available online: https://gsacom.com/paper/docomo-5g-radio-access-requirements-concept-and-technologies/ (accessed on 13 September 2017).
  11. Mehrpouyan, H.; Matthaiou, M.; Wang, R.; Karagiannidis, G.K.; Hua, Y. Hybrid millimeter-wave systems: A novel paradigm for hetnets. IEEE Commun. Mag. 2015, 53, 216–221. [Google Scholar] [CrossRef]
  12. Majhi, S.; Mitran, P. On the capacity of a class of dual-band interference channels. In Proceedings of the IEEE International Symposium on Information Theory, Barcelona, Spain, 10–15 July 2016; pp. 2759–2763. [Google Scholar]
  13. Feng, W.; Li, Y.; Jin, D.; Su, L.; Chen, S. Millimetre-wave backhaul for 5G networks: Challenges and solutions. Sensors 2016, 16, 892. [Google Scholar] [CrossRef] [PubMed]
  14. Morteza, H.; Koksal, C.E.; Shroff, N.B. Dual Sub-6 GHz—Millimeter Wave Beamforming and Communications to Achieve Low Latency and High Energy Efficiency in 5G Systems. Available online: https://arxiv.org/pdf/1701.06241.pdf (accessed on 13 September 2017).
  15. Niu, Y.; Li, Y.; Jin, D.; Su, L.; Vasilakos, A.V. A survey of millimeter wave communications (mmWave) for 5G: Opportunities and challenges. Wirel. Netw. 2015, 21, 2657–2676. [Google Scholar] [CrossRef]
  16. Intel Announces World’s First Global 5G Modem. Available online: https://newsroom.intel.com/newsroom/wp-content/uploads/sites/11/2017/01/5G-modem-fact-sheet.pdf (accessed on 13 September 2017).
  17. Ali, A.; González-Prelcic, N.; Heath, R.W. Estimating millimeter wave channels using out-of-band measurements. In Proceedings of the 2016 Information Theory and Applications Workshop, La Jolla, CA, USA, 31 January–5 February 2016; pp. 1–6. [Google Scholar]
  18. Ali, A.; Prelcic, N.; Robert, W.; Heath, J. Millimeter Wave Beam-Selection Using Out-of-Band Spatial Information. Available online: https://arxiv.org/pdf/1702.08574.pdf (accessed on 13 September 2017).
  19. Elshaer, H.; Kulkarni, M.N.; Boccardi, F.; Andrews, J.G.; Dohler, M. Downlink and uplink cell association with traditional macrocells and millimeter wave small cells. IEEE Trans. Wirel. Commun. 2016, 15, 6244–6258. [Google Scholar] [CrossRef]
  20. Semiari, O.; Saad, W.; Bennis, M. Joint millimeter wave and microwave resources allocation in cellular networks with dual-mode base stations. IEEE Trans. Wirel. Commun. 2017, 16, 4802–4816. [Google Scholar] [CrossRef]
  21. Rebato, M.; Boccardi, F.; Mezzavilla, M.; Rangan, S.; Zorzi, M. Hybrid spectrum sharing in mmwave cellular networks. IEEE Trans. Cogn. Commun. Netw. 2017, 3, 155–168. [Google Scholar] [CrossRef]
  22. Sato, H. The capacity of the Gaussian interference channel under strong interference. IEEE Trans. Inf. Theory 1981, 27, 786–788. [Google Scholar] [CrossRef]
  23. Carleial, A. A case where interference does not reduce capacity. IEEE Trans. Inf. Theory 1975, 21, 569–570. [Google Scholar] [CrossRef]
  24. Kramer, G. Outer bounds on the capacity of Gaussian interference channels. IEEE Trans. Inf. Theory 2004, 50, 581–586. [Google Scholar] [CrossRef]
  25. Chung, S.T.; Cioffi, J. The capacity region of frequency-selective Gaussian interference channels under strong interference. IEEE Trans. Commun. 2007, 55, 1812–1821. [Google Scholar] [CrossRef]
  26. Choi, S.W.; Chung, S.Y. On the separability of parallel Gaussian interference channels. In Proceedings of the IEEE International Symposium on Information Theory, Seoul, South Korea, 28 June–3 July 2009; pp. 2592–2596. [Google Scholar]
  27. Sankar, L.; Shang, X.; Erkip, E.; Poor, H.V. Ergodic fading interference channels: Sum-capacity and separability. IEEE Trans. Inf. Theory 2011, 57, 2605–2626. [Google Scholar] [CrossRef]
  28. Cadambe, V.; Jafar, S. Parallel Gaussian interference channels are not always separable. IEEE Trans. Inf. Theory 2009, 55, 3983–3990. [Google Scholar] [CrossRef]
  29. Shang, X.; Chen, B.; Kramer, G.; Poor, H. Noisy-interference sum-rate capacity of parallel Gaussian interference channels. IEEE Trans. Inf. Theory 2011, 57, 210–226. [Google Scholar] [CrossRef]
  30. Yu, W.; Ginis, G.; Cioffi, J.M. Distributed multiuser power control for digital subscriber lines. IEEE J. Sel. Areas Commun. 2002, 20, 1105–1115. [Google Scholar] [CrossRef]
  31. Chen, H.; Ma, Y.; Lin, Z.; Li, Y.; Vucetic, B. Distributed power control in interference channels with QoS constraints and RF energy harvesting: A game-theoretic approach. IEEE Trans. Veh. Technol. 2016, 65, 10063–10069. [Google Scholar] [CrossRef]
  32. Gamal, A.E.; Kim, Y.H. Network Information Theory; Cambridge University Press: Cambridge, UK, 2012. [Google Scholar]
  33. Diggavi, S.N.; Cover, T.M. The worst additive noise under a covariance constraint. IEEE Trans. Inf. Theory 2006, 47, 3072–3081. [Google Scholar] [CrossRef]
  34. Sankar, L.; Liang, Y.; Mandayam, N.B.; Poor, H.V. Fading multiple access relay channels: Achievable rates and opportunistic scheduling. IEEE Trans. Inf. Theory 2011, 57, 1911–1931. [Google Scholar] [CrossRef]
  35. Boyd, S.; Vandenberghe, L. Convex Optimization; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
Figure 1. System model of the Gaussian DCLIC, which consists of an underlying GIC in the microwave band and the set of direct channels and cross channels in the mm-wave band.
Figure 1. System model of the Gaussian DCLIC, which consists of an underlying GIC in the microwave band and the set of direct channels and cross channels in the mm-wave band.
Entropy 19 00495 g001
Figure 2. In (a,b), we plot c min 2 and α 1 , min , respectively. In (c), the channel gains of a symmetric CLIC is partitioned based on whether its capacity has been characterized in each set.
Figure 2. In (a,b), we plot c min 2 and α 1 , min , respectively. In (c), the channel gains of a symmetric CLIC is partitioned based on whether its capacity has been characterized in each set.
Entropy 19 00495 g002
Figure 3. Due to its WF-like property, OA 1 follows one of the three sequences depending on P k * . The saturation levels in the cross channels are due to its max-min property.
Figure 3. Due to its WF-like property, OA 1 follows one of the three sequences depending on P k * . The saturation levels in the cross channels are due to its max-min property.
Entropy 19 00495 g003
Figure 4. Optimal power allocation that follows the sequence [S1] S D S C D S sat . (a) optimal power allocation ( p 1 , q 1 , p 2 , q 2 ) ; (b) the resulting sum-rate constraints.
Figure 4. Optimal power allocation that follows the sequence [S1] S D S C D S sat . (a) optimal power allocation ( p 1 , q 1 , p 2 , q 2 ) ; (b) the resulting sum-rate constraints.
Entropy 19 00495 g004
Figure 5. Optimal power allocation that follows the sequence [S2] S C S C D S sat . (a) optimal power allocation ( p 1 , q 1 , p 2 , q 2 ) ; (b) the resulting sum-rate constraints.
Figure 5. Optimal power allocation that follows the sequence [S2] S C S C D S sat . (a) optimal power allocation ( p 1 , q 1 , p 2 , q 2 ) ; (b) the resulting sum-rate constraints.
Entropy 19 00495 g005
Figure 6. The set of all c 2 and d 2 is partitioned depending on whether the cross or the direct channels are “stronger”.
Figure 6. The set of all c 2 and d 2 is partitioned depending on whether the cross or the direct channels are “stronger”.
Entropy 19 00495 g006
Table 1. Optimal power allocation and conditions of sets in OA 1 .
Table 1. Optimal power allocation and conditions of sets in OA 1 .
SetOptimal PowersCondition
S D p 1 = P β , q 1 = 0 , p 2 = P β , q 2 = 0 , g 1 ( P ) > 1
S C p 1 = 0 , q 1 = P β ¯ , p 2 = P ( c 21 2 c 12 2 ) β c 21 2 , q 2 = c 12 2 c 21 2 P β ¯ , P < P 4 * , g 2 ( P ) < 1
S sat p 1 = P β β ¯ ( γ 1 ) β c 12 2 , q 1 = γ 1 c 12 2 , p 2 = P β β ¯ ( γ 1 ) β c 21 2 , q 2 = γ 1 c 21 2 , P P 4 * , g 3 ( P ) < 1
S C D p 1 = P β ¯ q 1 β , q 1 = F ( P ) c 12 2 , p 2 = P β ¯ q 2 β , q 2 = c 12 2 c 21 2 q 1 , complement of other conditions
Table 2. Optimal power allocation in the first four sets in OA G .
Table 2. Optimal power allocation in the first four sets in OA G .
SetOptimal Powers
S D p 1 = P β , q 1 = 0 , p 2 = P β , q 2 = 0 ,
S C p 1 = 0 , q 1 = P β ¯ , p 2 = P c 21 2 + β ¯ ( P c 12 2 + β ¯ ) / η β c 21 2 , q 2 = P c 12 2 / β ¯ + 1 η η c 21 2 ,
S sat p 1 = P β β ¯ ( γ 1 ) β c 12 2 , q 1 = γ 1 1 c 12 2 , p 2 = P β β ¯ ( γ 2 1 ) β c 21 2 , q 2 = γ 2 1 c 21 2 ,
S C D p 1 = P β ¯ q 1 β , q 1 = F ^ ( P ) c 12 2 , p 2 = P β ¯ q 2 β , q 2 = 1 + c 12 2 q 1 η c 21 2 1 c 21 2 ,
Table 3. Optimal power allocation and conditions of sets in OA S .
Table 3. Optimal power allocation and conditions of sets in OA S .
SetOptimal PowersCondition
S ^ 1 p 1 = 0 , q 1 = P β ¯ , p 2 = P β , q 2 = 0 , P P 5 * , P < P 6 *
S ^ 2 p 1 = P + β ¯ c 12 2 β ¯ d 1 2 , q 1 = P + β d 1 2 β c 12 2 , p 2 = P β , q 2 = 0 , P > P 6 * , P > P 7 * , P < P 8 *
S ^ 3 p 1 = P β β ¯ ( η 1 ) β c 12 2 , q 1 = η 1 c 12 2 , p 2 = P β , q 2 = 0 , P > P 5 * , P P 8 * , f 3 ( P ) > 1
Table 4. Optimal power allocation and conditions of sets in OA S .
Table 4. Optimal power allocation and conditions of sets in OA S .
SetOptimal PowersCondition
S ¯ D p = P β , q = 0 , P < P ¯ 1 *
S ¯ C p = 0 , q = P β ¯ , P < P ¯ 2 * , P < P ¯ 4 *
S ¯ C D p = 2 1 + β P + β ¯ c 2 β ¯ 2 d 2 , q = 1 1 + β P + β d 2 2 β c 2 , P P ¯ 1 * , P P ¯ 2 * , P P ¯ 3 *
S ¯ sat p = P β β ¯ ( γ ¯ 1 ) β c 2 , q = γ ¯ 1 c 2 , P P ¯ 4 * , P > P ¯ 3 *

Share and Cite

MDPI and ACS Style

Majhi, S.; Mitran, P. On the Capacity and the Optimal Sum-Rate of a Class of Dual-Band Interference Channels. Entropy 2017, 19, 495. https://doi.org/10.3390/e19090495

AMA Style

Majhi S, Mitran P. On the Capacity and the Optimal Sum-Rate of a Class of Dual-Band Interference Channels. Entropy. 2017; 19(9):495. https://doi.org/10.3390/e19090495

Chicago/Turabian Style

Majhi, Subhajit, and Patrick Mitran. 2017. "On the Capacity and the Optimal Sum-Rate of a Class of Dual-Band Interference Channels" Entropy 19, no. 9: 495. https://doi.org/10.3390/e19090495

APA Style

Majhi, S., & Mitran, P. (2017). On the Capacity and the Optimal Sum-Rate of a Class of Dual-Band Interference Channels. Entropy, 19(9), 495. https://doi.org/10.3390/e19090495

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop