2. System Model
We investigate an uplink massive MIMO-OFDM system characterized by
M transmission or service antennas at the BS and
K IoT devices. Notably, the number of IoT devices, denoted as
K, significantly exceeds the number of service antennas, represented by
M. Assuming sufficient guard interval such that inter-symbol interference is eliminated, the received signal vector for the
nth subcarrier, denoted as
, can be effectively expressed in a conventional manner within the spatial domain vector framework. Receiver (RX) processing is commonly employed to mitigate inter-user interference (IUI). By employing a RX processing vector for the
kth UE or IoT device, denoted as
, the received signal can be expressed as follows:
where
is the channel vector,
is the allocated uplink power, and
is the additive white Gaussian noise (AWGN) vector. It has mean zero and variance
(i.e.,
). In the rich scattering situation, the channel becomes uncorrelated Rayleigh fading which can be represented as
, where
is the large-scale fading coefficient for
ith IoT device, encapsulating the path loss characteristic. Concurrently, the small-scale fading dynamics are captured through a Gaussian statistical model, which accurately reflects the rapid fluctuations in signal amplitude due to multipath scattering. The system adheres to a power constraint for the uplink signal transmission, formalized mathematically as follows:
where the operation ∘ denotes component-wise multiplication, manifesting in the operation on signal vector
, which comprises the transmitted signals from all
K IoT devices formatted as
. This constraint ensures that the expected infinite norm of the component-wise product of the conjugated signal vector and the signal vector itself remains bounded, thereby regulating the maximum power transmitted by the IoT devices in the uplink channel.
The PAPR in the OFDM time domain signal can show the level of variation for the OFDM signal, and the signal is generally oversampled to catch the PAPR of the analog signal. In OFDM systems, the PAPR of the time-domain signal serves as an indicator of the extent of signal fluctuation. To accurately represent the PAPR characteristic of the corresponding analog signal, the OFDM signal is typically subjected to oversampling. This process is imperative for capturing the true essence of the signal’s dynamics, as it allows for a more precise approximation of the analog signal’s PAPR by examining its discrete counterpart at a higher resolution. The PAPR of the oversampled OFDM signal associated with the
kth UE can be expressed as follows:
where
is the oversampled OFDM signal in time domain, and
J is the FFT size,
L is the oversampling factor, where generally
. In the uplink signal, each IoT device performs the clipping operation. After clipping, the output signal is:
where
is the
tth time domain input OFDM signal to the clipping process,
is the maximum permissible amplitude of signal, and if the input signal amplitude is higher than
, the amplitude is limited to
.
represents the phase of the input OFDM signal, and regardless of the clipping process, it is not changed. To perform the clipping process, we need to define the level of clipping. The level of clipping can be set based on the definition of the clipping ratio (CR),
:
where
is the average power of input OFDM signal. Once the clipping process is conducted in the time domain, the clipping noise is added to all the subcarriers of the OFDM signal in the frequency domain. It is known that the clipped signal vector in the time domain is [
24]:
where
is the attenuation factor vector and
is the distortion noise vector in time domain.
can be estimated and approximately recovered. Due to this reason, we will focus on the clipping noise in this paper. It is noteworthy that the compensation of the attenuation factor causes noise enhancement, but it can be neglected in the region of interest that is relatively high
.
The RS is heavily reused to support massive IoT devices simultaneously. In this context, we adopt the notation
with double subscript to denote the
nth IoT device within the
gth group for the sake of simplicity. Here,
G signifies the total number of groups, and
N denotes the quantity of IoT devices in each group. To mitigate interference within each group, mutually orthogonal RS are deployed. However, across different groups, the reuse of RS leads to potential RS contamination. We organize all IoT devices into an
matrix, where
, accommodating the requisite number of groups based on the total count
K of IoT devices:
Then, from (
1), after the clipping process in the time domain, the received signal at
nth UE in
gth group is shown as:
DS denoting the desired signal, and various sources of distortion and interference that can degrade the quality of this signal. CDDS represents the clipping distortion directly associated with the desired signal, arising when the signal’s amplitude exceeds a predefined threshold. IFSRS refers to interference caused by other UEs transmitting over the same RS as the UE of interest. Correspondingly, CDSRS symbolizes the clipping distortion emanating from these co-channel UEs. Furthermore, IFDRS describes interference from UEs that are allocated different RSs than the UE of interest. CDDRS captures the clipping distortion introduced by these cross-channel UEs. RN represents the received noise in the system, incorporating both external and internal noise sources affecting the signal reception. DSOUC stands for the desired signal as transmitted over an unknown channel, indicating the uncertainties and variations in the propagation environment that can affect the signal’s integrity. The notation is introduced to specify the clipping distortion noise observed in the frequency domain, attributed to the nth IoT devices within the gth group. The term specifies the initial uplink transmission power from a UE, while denotes the uplink power control factor applied to the nth IoT devices within the gth group to manage their transmission power effectively. This comprehensive framework highlights the multifaceted nature of signal quality assessment in communication systems, addressing both signal distortions and interferences alongside operational parameters like power control.
A block diagram for (
8) is shown in
Figure 1.
With the appropriate oversampling and enough number of subcarriers,
is [
23]:
where
is the complementary error function and
is used to reflect the nonlinearity of oversampling. We use
when
[dB] is larger than 3 dB. If
[dB] is smaller than 3 dB, we use
.
3. Determination of Parameters
The expedient determination of system parameters plays a critical role in ensuring its optimal functionality, particularly in environments requiring minimal delays. This section elucidates the strategies for parameter identification and outlines methodologies aimed at enhancing EE within the realms of massive MIMO-OFDM systems that incorporate extensive IoT connectivity. A foundational step involves delineating the SINR for the massive MIMO-OFDM framework. In this context, the reduction of the PAPR emerges as a pivotal concern, necessitating the adoption of clipping schemes. The expression in (
8) reveals the complexity introduced by interference and clipping distortion components. To articulate the SINR formulation accurately, it becomes imperative to ascertain the power or variance associated with each constituent in (
8). Consequently, leveraging the foundation laid out in (
8), the SINR can be expressed as presented in (
10):
where
The TDD approach is commonly employed to mitigate RS overhead. Within this framework, the number of resource elements allocated for each coherence interval is given by
, where
represents the coherence time and
denotes the coherence bandwidth. Within each coherence interval,
slots are dedicated to uplink RS transmission, while the remaining
slots are allocated for uplink data transmission. Consequently, the overhead incurred by uplink RS amounts to
. Then, the uplink SE is:
where
signifies the uplink SINR for the
ith UE,
represents the count of simultaneously supported IoT devices for uplink transmission, and
and
denote the proportions of downlink and uplink data transmission, respectively.
In scenarios where there is a substantial number of IoT devices compared to the available service antennas (i.e.,
), zero-forcing (ZF) processing fails to offer substantial performance enhancements [
19]. Hence, in this paper, we employ MR processing.
Then, (
10) can be further derived as:
where
The derivation is in
Appendix A. Here,
is the channel estimation quality indicator (CEQI) for
nth UE in
gth group with MMSE estimation, and the value of
is between zero and one [
19,
25,
26,
27]:
With an appropriate uplink channel inverse power control, we can choose
, and then all the RX SNRs become the same:
Typically, RS power is boosted for stable channel estimation. Thus, (
12) is further simplified with the following approximation:
Then, we can simplify (
12) as (
16),
We assume that the same clipping ratio is applied to all IoT devices.
Observation 1. In the scenario of massive MIMO supporting extensive IoT connectivity, under appropriate power control, the CEQI can be effectively approximated by the reciprocal of the RS reuse factor, denoted as , where G represents the RS reuse factor.
With enough allocation of TX power
, from (
16), we can formulate SINR as follows:
Surprisingly, the SINR can be expressed in a significantly simplified form, which will be demonstrated to align closely with the outcomes of simulations in the subsequent section. The numerator originates from the channel gain, while the first term of the denominator accounts for inter-user interference (IUI), the second term relates to inter-group interference (IGI) resulting from RS collision, and the third term corresponds to the clipping distortion contributed by all IoT devices.
Observation 2. The channel gain is observed to be directly proportional to . Meanwhile, the interference scales with K, and the clipping distortion exhibits a dependency on both and K, highlighting the intricate relationship between system parameters and network performance.
Utilizing (
16) and (
17), we can now formulate two EE metrics using (
16) and (
17). Researchers often define EE as a rate that is divided by power consumption. For analytical tractability, it is pivotal to articulate a simplified yet effective power consumption model. We only consider the transmission power consumption of UEs, and represented it as
, where
is the uplink PA power consumption and
is the rest of power consumption [
28]. The relation between
and
is
, where
is the power efficiency of uplink PA.
Utilizing (
16), we derive the expression for
, as delineated in (
18):
where
. Here,
B indicates the signal bandwidth. This particular EE metric is referred to as Model I. We assume that
for simplicity. Next, using (
17), we formulate (
19) and we call this EE metric model II:
In the context of massive MIMO, antenna selection can be executed by establishing the number of service antennas, denoted as
M. It has been established that random antenna selection can yield satisfactory performance with a sufficient number of antennas in massive MIMO [
29]. Given the EE threshold,
, and other parameters, the determination of
M for antenna selection is based on the closed-form expression (
20). This scenario applies when utilizing Model I.
Additionally, for antenna selection, a simplified approach can be adopted by leveraging the approximation model introduced in Equation (
19). This model facilitates the determination of
M, as outlined in Equation (
21). This pertains to the scenario when employing Model II:
K and
D can also be formulated in a similar manner, which are shown as follows:
Regarding the parameters
and
, it is not feasible to derive their values using Model II, as this model does not incorporate
and
within its framework. Therefore, in such instances, only Model I is employed for determining these parameters. The respective expressions for
and
are denoted as (
26) and (
27):
Observation 3. Model II does not provide valid expressions for and ; therefore, it is necessary to utilize model I to determine these parameters.
Using (
26), it is straightforward to get the allowable coverage to satisfy the EE threshold. Utilizing the ETSI path loss model, the estimated coverage
is:
The derived closed-form expressions for the parameter estimations are summarized in
Table 1, where
.
In the downlink scenario, the process is almost similar, but it is crucial to note that the power consumption model for the downlink differs from that of the uplink. The simplified downlink power consumption model is expressed as: , where denotes the fixed power consumption that is not proportionate to M. Essentially, the power consumption is directly proportional to the number of service antennas, M, indicating that an increase in M does not invariably lead to advantageous outcomes.
4. Numerical Results and Discussion
In this section, we present the numerical results. The simulation parameters are given in
Table 2.
We use 20 MHz system bandwidth, 50 ms coherence time, and 180 kHz coherence bandwidth. As a path loss model, we use the ETSI urban-macro path loss model of 2 GHz carrier frequency.
In
Figure 2, we present the relationships between EE and several important parameters,
M,
K,
, and
. Simulation results are depicted as red ‘∗’ dotted lines. The theoretical analysis aligns well with the simulation results.
Figure 2a illustrates EE versus
M when
. Assuming 3GPP-based systems, there are 168 available resource symbols in 1 ms, totaling 8400 available resource symbols in 50 ms [
30,
31,
32,
33,
34,
35]. Half of the available resource symbols are used for uplink RS at the maximum, resulting in
with
ms. Since
when
ms,
in this case. We utilize
W. As
M increases, EE also increases. In the power consumption model presented in this paper,
accounts for a larger portion than
; hence, with high
, the EE performance is insufficient. There is little difference in EE performance between
= 3 dB to 8 dB.
Figure 2b displays the EE versus
K when
. With high
K, both IUI and clipping distortion noise increase due to heavy RS reuse. In the denominator of EE, power consumption also rises as
K increases. It is important to note that RS reuse commences at
.
Remark 1. EEdemonstrates an increasing trend with the augmentation of M. Conversely, an exponential decline inEEis observed upon the initiation of RS reuse as K increases.
Figure 2c illustrates the relationship between EE and
for the case of
and
. The optimal values of
exist and they are subject to change with variations in
M and
K. The performance of
is contingent upon the values of
M and
K. In the power consumption model employed in this study, light clipping may prove beneficial in enhancing the EE.
Figure 2d depicts the variation of EE with respect to
for the scenario where
and
. There exists a specific criterion for achieving sufficient EE or SE performance based on the value of
. In the given model, in order to maximize EE, the value of
should exceed 0.1 mW to attain satisfactory performance.
Remark 2. Optimal points exist for both ν and , and these points can vary depending on different situations. Specifically, if falls below a certain threshold, neither the EE nor the SE performances are deemed acceptable.
In
Figure 3, we illustrate the parameters derived from closed-form expressions, with simulation results depicted through red asterisk (‘∗’) dotted lines. Model I employs the formulation provided in (
18), whereas Model II utilizes the approximation of the SINR as outlined in (
19). The comparison between these models and the simulation results demonstrates a noteworthy concordance, indicating the reliability of the approximations used in Model II as the exact formulations of Model I.
Figure 3a illustrates the relationship between
M and
, along with the corresponding EE based on (
20) and (
21). Here, we set
and
= 2 Mbps/W. Both model I and model II cases satisfy the required EE thresholds. The determined value of
M exhibits an inverse characteristic to the EE versus
, as demonstrated in
Figure 2a. Furthermore,
Figure 3b depicts the relationship between
K and
, along with the associated EE based on (
22) and (
23), with
= 2 Mbps/W. The determined value of
K demonstrates similar behavior to the EE versus
relationship shown in
Figure 2b.
Remark 3. The established value of M exhibits an inverse relationship with the EE versus ν, while the determined value of K demonstrates a similar characteristic to the EE versus ν.
Figure 3c displays
versus
M, along with the associated EE based on (
24) and (
25), with
and
= 2 Mbps/W. Once
D is determined,
can be directly derived using the inverse function of (
9). In this scenario,
decreases as
M increases. The EE values fulfill the entire range of
M, and decreasing
consistently leads to reduced power consumption. Moreover,
Figure 3d illustrates
versus
K, and the related EE based on Equations (
24) and (
25), with
and
= 0.5 Mbps/W. Here, we lower the EE threshold to accommodate the complete range of
K. As
K increases, the reuse of resource becomes more intensive, necessitating a reduction in the EE threshold.
Figure 3e,f present
versus
K and
M, along with the associated EEs based on (
26). In this case, there is no approximated expression as the approximated model does not encompass
and
. The permissible coverage can be readily determined based on (
26).
Remark 4. Coverage determination can be guided by (26). To enhance coverage, it is advisable to augment M. Conversely, an increase in K leads to a reduction in coverage. This delineates a clear strategy for adjusting system parameters to achieve desired coverage objectives, emphasizing the trade-off between K and M in network design. In
Figure 4, we illustrate the variations in EE,
,
,
, and the total clipping distortion noise (CDN) across specified parameters through a three-dimensional plot. This visualization provides a comprehensive overview of how these key metrics interact within the system’s parameter space. For the detailed analysis of total CDN,
Figure 4g,h employ the following equation to quantify its variation with respect to other parameters, offering deeper insights into the impact of system configuration on clipping distortion noise:
It is observed that are proportional to the M, K, and .
Remark 5. The total noise resulting from clipping distortion exhibits a direct proportionality to M, K, and ν.
In
Figure 5, we display the cumulative distribution function (CDF) of EE. The dotted line represents the scenario where we determined the allowable coverage
using
= 1.5 Mbps/W,
,
. We utilize (
28) to obtain
= 480 m. With the determined
, the average EE is calculated to be 1.5 Mbps/W. On the other hand, the solid line corresponds to the case where we determined the value of
M with
= 2 Mbps/W,
. Using (
20), we obtain
M = 513. With the determined value of
M, the average EE is calculated to be 2 Mbps/W. It is important to note that
represents the average EE of all IoT devices, with 50% of IoT devices having a higher EE than the threshold and 50% having a lower EE than the threshold. It is crucial to consider that in order to set the EE to prevent an outage, there must be sufficient EE margin. For instance, in the case of the dotted line, the EE threshold should be set below 1 Mbps/W to fully prevent the EE outage. Similarly, in the case of the solid line, the EE threshold should be set below 1.5 Mbps/W to fully prevent the EE outage.
Remark 6. The EE threshold represents the average EE that meets the EE requirements for 50% of IoT devices. To prevent an outage, it should be established with an adequate EE margin.
We analyze the computational complexity of the proposed scheme, presenting the number of required multiplications, additions, and divisions, summarized in
Table 3. In the table,
denotes the number of multiplications,
the number of additions, and
the number of divisions. As mentioned, it indicates that model I generally demands a higher computational complexity compared to model II.
The computational complexity of the proposed scheme is bounded, even with increases in network size, as the necessary parameters can be efficiently derived from closed-form equations. For a LUT-based approach, where precalculated values are stored in memory, complexity can be approximated as , where n is the data size. Since finding parameters that meet a given EE threshold is inherently nonlinear, a LUT is an effective approach for identifying near-optimal values. Iterative search methods may offer comparable complexities. Techniques such as binary search, jump search, interpolation search, and Fibonacci search all provide different complexity trade-offs. For instance, the jump search operates at , while binary search and Fibonacci search achieve . Although these algorithms have lower complexity, they still scale with data size n.
To bridge theoretical insights with practical and commercially viable implementations of massive MIMO, we refer current commercial deployments in 5G and early-stage 6G that incorporate similar EE-optimized configurations to enhance system capacity and sustainability. In commercially deployed 5G New Radio systems, massive MIMO configurations are frequently utilized to improve throughput and connectivity, especially in high-density urban areas. For example, 3GPP standards support 168 resource elements in 1 ms TDD frames, with a maximum coherence interval of 8400 resource elements over a 50 ms duration. This setup enhances energy-efficient communication by optimizing resource allocation, employing dynamic beamforming techniques, and maximizing SE in multiuser environments. Many vendors integrate comparable energy-aware configurations into their BSs, utilizing adaptive antenna arrays to optimize both coverage and power consumption. For instance, in the 3.5 GHz band, several vendors employ 64T64R massive MIMO configurations. Based on our optimized parameters, adjusting the number of active antennas according to traffic load (e.g., reducing from 64 to 32 during low-demand periods) can yield significant EE improvements with minimal impact on performance. Emerging 6G commercial trials further emphasize these energy-efficient parameters, particularly to support IoT and ultra-reliable low-latency communication requirements. Vendors are exploring EE-focused beamforming strategies in experimental massive MIMO setups, targeting extended IoT device battery life while maximizing coverage range. These advancements reflect an industry shift towards implementing adaptive, resource-efficient massive MIMO configurations that align with our proposed EE optimization framework.
The findings of this work provide insights that are directly applicable to the design and optimization of massive MIMO networks, particularly in dense IoT environments where EE and system scalability are paramount. Several practical applications can benefit from the optimized strategies and metrics developed in this work. The methodologies we propose for high EE can be directly applied to the design and optimization of next-generation wireless networks. By calibrating operational parameters such as the number of service antennas and coverage extents, network operators can enhance performance while minimizing energy consumption. The simplified EE models derived in our work demonstrate remarkable applicability in real-world scenarios. This allows network planners to make informed decisions that balance EE with service quality in diverse environments, such as urban areas with high IoT device density. Energy consumption becomes a critical concern in IoT deployments, and our work provides actionable insights that contribute to sustainable practices in technology development. This is particularly relevant for industries aiming to reduce their carbon footprint while maintaining operational efficiency. The derived EE metrics allow network operators in urban and smart city applications to optimize the number of service antennas based on device density and required coverage area. This approach can be used in practice to significantly reduce power consumption while maintaining service quality across large numbers of IoT devices, such as sensors and smart meters. By dynamically adjusting antenna arrays, network planners can achieve high EE levels without sacrificing connectivity, which is essential for sustainable urban infrastructure. The clipping-based PAPR reduction techniques explored in this work can be directly implemented in the signal processing units of BS equipment. These methods offer equipment manufacturers guidelines for developing more power-efficient PAs tailored to dense IoT networks. In industrial applications, such as smart manufacturing and logistics, the ability to support a high density of IoT devices within a limited coverage area is critical. The heavy RS reuse strategies discussed in this work can be applied to allocate spectral resources effectively in these settings, allowing for simultaneous connectivity of numerous devices while minimizing interference and conserving power. The closed-form solutions for SINR can aid in configuring network settings that ensure reliability and energy savings in environments that require high device density. Given the increased regulatory focus on reducing the carbon footprint of telecommunication networks, the EE improvements demonstrated in this work can contribute to sustainable practices in massive MIMO deployments. By reducing power requirements through optimized operational parameters, network operators can achieve compliance with environmental standards, particularly in regions where green energy policies are enforced. The proposed methods align with initiatives aimed at reducing overall energy consumption, helping telecommunication providers meet energy targets while supporting growing IoT infrastructure. These applications showcase the relevance and utility of our findings across various real-world scenarios, highlighting the practical value of EE-centric designs in massive MIMO for IoT expansion.
Building upon the findings of this work, several promising avenues for future research emerge to advance the application and understanding of EE in massive MIMO systems, especially within the rapidly evolving IoT landscape. Integrating machine learning techniques offers a dynamic approach to EE management, enabling adaptive optimization of parameters such as antenna activity, power allocation, and resource blocks in response to real-time network conditions. This adaptive approach can enhance system performance and resilience under varying IoT loads. Additionally, as distributed architectures like cell-free massive MIMO gain prominence, applying the EE framework in these contexts is a valuable next step. Examining the effects of user-centric clustering, distributed beamforming, and cooperative signal processing on EE is particularly relevant for scalable IoT applications. Furthermore, exploring the intersection of EE optimization with emerging technologies—such as millimeter-wave communication, network slicing, and edge computing within massive MIMO for IoT—offers a forward-looking research direction. Extending this EE framework to accommodate heterogeneous IoT environments, where devices exhibit diverse capabilities, energy constraints, and quality of service requirements, would further strengthen its adaptability and practical impact.