1. Introduction
For the past several years, mobile devices (MDs), such as smartphones, handheld game consoles, and vehicle multi-media computers, have become virtually ubiquitous and an increasing number of new mobile applications, such as augmented reality (AR), image processing, natural language processing, face recognition, and interactive gaming, have emerged and been the focus of considerable attention [
1,
2]. These mobile applications are typically latency-sensitive, computation-intensive, and have high energy consumption characteristics. However, under the constraint of physical size, MDs have limited resources, which restricts their battery life and computational capacities [
3,
4]. Recent studies have shown that mobile-edge computation offloading (MECO) technology provides a promising opportunity for effectively overcoming the hardware limitations and energy consumption problems of MDs, by offloading computing-intensive tasks to adjacent clouds at the edges of mobile networks.
In particular, mobile-edge computing (MEC) offers cloud computing capabilities at the very edge of the mobile networks by deploying MEC servers with sufficient computational resources at base stations (BSs), which thus improves the computational efficiency and reduces the latency, and it has drawn significant attention from both academia and industry [
5]. The paradigm shifts from cloud computing to MEC can effectively reduce the backhaul latency and energy consumption, as well as support a more flexible infrastructure in a more cost-effective way. For example, the work in [
6] used unmanned aerial vehicles (UAVs) to help device-to-device (D2D) wireless networks [
7]. Furthermore, MEC together with virtual machine (VM) migration can effectively increase the scalability while reducing service delay [
8,
9]. Owing to these advantages, MEC has attracted extensive research attention from various aspects.
Because computational performance and energy consumption are competing for resources and are critical for mobile users [
10], effective computation offloading schemes have become prominent for MEC systems. Basically, a decision on computation offloading may result in binary offloading and partial offloading, which are closely related to the application model/type. It determines whether full or partial offloading is applicable, what could be offloaded, and how [
3]. A highly integrated or relatively simple task cannot be partitioned and has to be executed as a whole either locally at the mobile device or offloaded to the MEC server, called binary offloading. In practice, many mobile applications are composed of multiple procedures/components, making it possible to partition the program into two parts with one executed at the mobile device and the other offloaded for edge execution, called partial offloading. Although the former is easier to implement, for a very large dataset, by offloading critical subtasks to the MEC servers, partial offloading can help to reduce the latency and energy consumption on the local devices more flexibly and effectively [
3]. However, partial offloading is a very complex process affected by different factors, i.e., whether the applications can be divided into offloadable parts or not; the offloadable part may differ in the amount of data and required computation; how to decide which parts could be offloaded to the MEC; the components of applications that need input from some others; and parallel offloading may not be applicable. Fortunately, these factors have been discussed and studied in many works [
11,
12,
13,
14,
15,
16,
17,
18,
19].
For the offloading strategy, Bi et al. [
20] studied the binary scheme in the wireless powered MEC network that consisted of one server and several users, where the binary policy was adopted for maximizing the weighted sum computational rate. Zhang et al., in [
21], used the auction theory to model the matching relationship between the MEC server and the MDs, so as to offload tasks to the optimal MEC server. Al-Shuwaili et al., in [
22], jointly allocated communication and computational resources to minimize the total MD energy consumption under latency constraints by successive convex approximation. In [
23], by considering the data arrival time instants and computational deadlines, You et al. proposed an energy-efficient resource management policy for MECO systems and formulated an optimization strategy that minimized the total mobile energy consumption. For partial offloading, the work in [
24] investigated MEC systems with one energy harvesting (EH) device and proposed an effective dynamic computation offloading algorithm to minimize the execution cost. In [
25], You et al. studied the energy saving partial computation offloading problem, for a multiuser MECO system based on time-division multiple access (TDMA) and orthogonal frequency-division multiple access (OFDMA). In [
26], Wang et al. also considered the energy consumption minimization problem in a scenario with one MEC server and multiple users. They also employed wireless power transfer to further alleviate the burden on battery usage. Furthermore, the work in [
27] implemented a cooperative communication system that had three nodes, in which one of them acted as the helper for relaying and computing. Lastly, the work in [
28] considered both the fronthaul and backhaul link transmission and offloading in a small-cell architecture.
However, some of above literature only focused on the performance indicator of latency, while this paper aims at minimizing the energy consumption, which is critical for power-limited devices, especially for a device with lower battery energy. Furthermore, in the problem formulation, the previously mentioned works [
25,
26] only considered the local computing time and transmission time, but the server computing time was neglected; thus, they fell short of accuracy for scenarios in which the MDs needed to offload computation-intensive jobs to MEC servers with limited computing capacity. Moreover, most of the works performed so far performed the management aspects, the experimental evaluation of energy savings associated with offloading, and/or the definition of an offloading criterion that considered the energy cost of the radio interface (e.g., LTE or WiFi), but without optimizing the energy cost of the data transfer according to the current channel conditions. Notice, however, that depending not only on the application, but also on the current channel conditions, the best strategy concerning the offloading process may be different. Considering minimizing the energy consumption at the MD for delay-constrained offloading jobs, an optimization problem can be formulated to allocate the computation and communication resources jointly in the described MECO system.
In this paper, we pursue an energy-efficient MEC offloading design considering the partial and binary strategies, aiming to minimize the energy consumption of the mobile application while ensuring that the computational task can be successfully finished with a predefined hard limit of execution time. We derive the analytical solution for the single-MD to the MEC case, which can be generalized to the multi-MD case in later research. Our work presents some main differences from previous works. First, we introduce an offloading model that includes energy consumption at MD and processing time at the server side, while these offloading parameters were neglected in previous works such as [
24,
27]. Second, instead of considering that the application is run either totally in the cloud or totally in the MD, we include an optimization variable as the quantity of data to be processed on each side to allow a partial trade-off between local computing and offloading. Finally, in contrast with previous works, our approach allows for adapting the transmission strategy to the current channel state as perceived by the MD in the uplink.
The main technical contributions of this paper are as follows.
Based on the offloading model above, we formulate an offloading-optimization problem that minimizes the MD energy usage while ensuring the task is completed within a prescribed deadline, by jointly optimizing the transmitting time and offloading ratio.
We transform the latency-constrained problem into a two-stage optimization problem, which can be analytically solved using standard convex optimization techniques. In the solution, a channel condition threshold is derived above which full offloading is the optimal decision, whereas below the threshold, partial offloading is performed to trade off between the time and energy cost of offloading. For the partial offloading policy, the optimal transmission time and offloading ratio are derived in closed form expression.
This paper also discusses in detail various practical aspects of the offloading strategy, including the conditions under which total offloading or non offloading is optimal, the minimum admissible latency constraint that renders the problem feasible, and how the system parameters affect the offloading decision, including the energy efficiency of MD, the task complexity, and the computing capacity of local devices.
The remainder of this paper is organized as follows. In
Section 2, the communication and computational model in mobile execution and MEC execution are presented, respectively. In
Section 3, the optimization problem is formulated and solved for optimal partial offloading, while binary offloading decisions are also discussed under different channel conditions. The effects on the offloading strategy of a number of system parameters are analyzed in detail in
Section 4. Finally, some simulation results and conclusions are provided in
Section 5 and
Section 6, respectively.
2. System Model
As shown in
Figure 1a, the MEC server is a computing device installed at a wireless access station. The MD is provided with wireless access to the computational resources located in proximal servers. By assigning the computing-intensive applications to the BS, it can help mobile users improve the computing performance. The pioneering literature has extensively studied mobile networking and mobile-edge computing models (e.g., [
23,
29,
30,
31]), which provide useful insights and make performance analysis tractable. Based on these results, we adopted a canonical model that captured the essentials of a typical mobile application. Although it is possible to build a system model to depict the various aspects of MEC in detail, such a model could be too complex to be treated analytically and is difficult for practical systems. Specifically, in this paper, the mobile application is abstracted into a profile with three parameters [
22,
23], including:
Input data size : the number of data bits as the input to the application;
Required CPU cycles : the number of CPU cycles required to complete the application;
Application completion deadline : the maximum latency, before which the application should be completed.
For the partial offloading policy, let represent the ratio of the offloaded data to the full data size, and the offloaded data size of the mobile application can be denoted as . We consider the application scenario to be quasi-static, where the environment at MD will not change during the computation offloading period.
Notice that the input data size , the required CPU cycles , and the application completion deadline may have an impact on the energy consumption of mobile applications. With more data input, more CPU cycles, and/or a more stringent completion deadline, the energy consumption will be higher. Therefore, it will be beneficial to offload computation-intensive tasks to the MEC servers.
2.1. Communication Model
We first introduce the communication model between the MD and the MEC platform. The computation offloading policy was carried out based on energy consumption and completion time. The transmission power for the MD is denoted as
, and
represents the channel gain of the BS. The uplink data rate
for computation offloading of the MD is given by:
where
and
I represent the power spectrum density of additive white Gaussian noise and the interference signal, respectively. Letting
B be the bandwidth of the channel, at the MEC, the received signal power can be denoted by a function of data rate
r:
which is monotonically increasing and convex for
. The offloading transmission rate can be denoted as:
where
is the transmission time of the MD for offloading the input data size
. Then, the transmission power
can be calculated by combining (
2) and (
3):
2.2. Computation Model
(1) Local computing: With the local computing approach, the MD executes the application locally on the MD. Let
denote the computational capability (i.e., CPU cycles per second) of the MD, and accordingly, the completion time for local computing is defined as:
For the local computation energy, we have:
where
is the consumed energy per CPU cycle for the MD. Besides,
indicates the energy efficiency of the MD.
(2) Edge computing: With the edge computing approach, the MD is allowed to offload its computational load to a nearby MEC server. Subsequently, the server performs the offloaded task and feeds the result back to the MD. Therefore, in the overall offloading, the following three consecutive phases are included: (i) transmitting phase, (ii) computing phase, and (iii) receiving phase. According to the communication model, the transmission time and energy consumption of the MD sending its computational load to the MEC server are calculated by:
and:
The execution time for the computing load
on the MEC server is computed by:
where
denotes the computational capacity of the MEC server.
For the edge computing approach, the completion time and energy consumption of the mobile application are formulated as:
and:
where
and
denote, respectively, the time and energy required at the MD side when receiving the computational outcome from the MEC server. As in the existing works [
32,
33], the receiving time
and the receiving energy
can be ignored because for many applications, such as face recognition and car barrier detection, the data size of the result is relatively much smaller than the size of the input. In this paper, we consider the energy consumption on the MD side, while our future work will consider the execution energy consumption of the MEC server.
Generally, we can assume that the server has higher computational capacity than the mobile device, i.e.,
. Therefore, if a job is fully offloaded to the MEC server, the time saved in the computation phase can be expressed as:
2.3. Partial Offloading Model
Next, we introduce the partial offloading model. As in [
22,
33], we apply a linear model in this paper and the MD enables programs to process computing tasks sequentially. Here,
is the offloading ratio, which is the proportion of the task data offloaded to the MEC server, as shown in
Figure 1b. The offloading data rate is
, the offloading transmit power
, and the partial offloading time
. Therefore, the total energy consumption of the MD includes both local computing consumption and partial offloading consumption, which is expressed as:
Because we consider the computational performance as well, the completion time of the MD can be denoted as:
The completion time above also includes the partial offloading time and the local computing time, which are proportional to the original full offloading time and local-computing time, respectively.
By the definitions of and , the overall energy consumption of the MD is , for the task completed duration .
3. Joint Optimization of the Communication and Computation Resources with Partial Offloading
Based on the offloading model established in the previous section, we are now ready to optimize an offloading policy. Our goal was to find a policy that satisfies the hard delay constraints with minimal energy consumption. The steps were carried out by analytical means, and solutions were formulated under different channel conditions and computing profiles.
3.1. Optimization Formulation
Now, we present the optimization problem of minimizing the energy consumption at the MD while satisfying the hard delay constraint. As a first step, the total energy is a monotonically decreasing function of transmit time, as shown in Lemma 1:
Lemma 1. For any fixed, the total energyis monotonically decreasing with.
Lemma 1 proves that energy can be saved by extending the full offloading time under a fixed offloading ratio, till reaching the time constraint.
The optimization objective is to find the offloading policy to minimize the total energy consumption for the MD. It can be noted that
in (
13) and
in (
15) are affine functions of
. To simplify the problem, we let
and
be the optimization variables, and the optimal
can be calculated from
by Equation (
4). Hence, the energy consumption minimization problem for the MD is formulated as:
The constraint in (
16) specifies that the total completion time of the application bits is bounded by the maximum affordable latency
.
It is shown from (
16) that the primal optimization problem is affected by multiple parameters from both the communication and computational aspects. Of the parameters, the channel gain
is crucial because it determines the quality of the wireless channel and, hence, the cost of application offloading in terms of energy.
3.2. Offloading Policy in Good Channel Conditions
In good channel conditions, a high-data-rate link can be established and maintained at a low energy cost. In addition, the MEC servers have greater computing capacity compared to the local MDs. Therefore, it is of interest to determine analytically when to perform full offloading to take advantage of high-performance servers and good channel conditions. We have the following assertion:
Assertion 1. There exists channel gain thresholdthat, for a channel gain better than threshold, full offloading withis energy-optimal under any latency constraint, and: To satisfy the hard latency constraint at minimum energy cost, one can substitute
into (
15) and apply Lemma 1. The optimal transmission time
for
is given by:
and
can be computed using (
4).
3.3. Offloading Policy for the Channel Condition below the Threshold
On the other hand, let us consider the case
. In that situation, it takes more energy to overcome the propagation loss to offload a task to the MEC server (
). The objective function of (
16) has the following features:
and:
It can be observed that Problem (
16) is non-convex and hence is challenging to solve directly.
Assertion 2. Problem (16) can be transformed into a two stage optimization problem, by iteratively finding the optimaland. Proof. Since
is equivalent to
, one can first solve
for any specific value of
and, next, substitute the optimal
into (
16) to find the optimal
. Therefore, in the first sub-problem (
23), we optimize the offloading ratio
for a given
. In the second sub-problem (
29), we aim to find the optimal
, by substituting
into the original problem. □
The first sub-problem is given by:
Therefore, we define function
:
It is obvious that is convex and non-decreasing monotonically with respect to , since under the condition where . Furthermore, from the latency constraint, we can find , under the default condition and (When , it costs more energy offloading data to remote servers than local computing. Therefore, if , the local computing is optimal; otherwise, partial offloading is energy-optimal.).
Hence, for a given
, the optimal
is given by:
Next, we substitute (
25) into the main problem and solve
. We define:
and its second-order derivative is:
Therefore,
is convex with respect to
. From all the above, the composite function
is convex [
34] (chapter 3). By substituting (
25) into (
16),
we can now solve the convex sub-problem that minimizes
, thanks to the fact that
is convex. The optimal
is to be found by solving the second sub-problem:
Solution to the Sub-Problem
Next we solve the convex problem (
29) using standard convex optimization techniques, as shown in the following.
Assertion 3. The optimalis computed by the expression below:whereis the Lambert Wfunction [
35].
Proof. First, recall that
in (
12) is the difference between the time that the task is executed locally and remotely. By letting the first-order derivative of the objective function of (
29) be zero, we have:
By denoting
and based on Equation (
34),
u satisfies:
We can finally further derive that:
However, recall that the condition
has already been discussed in
Section 3.2, resulting in full offloading.
For
, by referring to the definition of the Lambert W function [
35],
u in Equation (
38) is obtained as follows:
Therefore, the optimal time readily follows. □
The optimal transmitting power
can be derived from Equation (
4) accordingly. Finally,
in (
25) can be redescribed as:
Overall, the solution in (
30) and (
39) indicates that if the channel is not good enough, offloading the computing tasks to the remote MEC will have a high energy cost; thus, the partial offloading strategy is the preferred decision.
Combining the above subsections, the optimal solution of the primal problem (
16) is finally presented as (
40). For channel conditions better than the threshold
defined in (
17), full offloading is carried out; for channel
, partial offloading is performed, and the offloading ratio
and transmission time
are optimally chosen to minimize the total energy cost while satisfying the hard time constraint
. Finally, the transmission power
can be derived from
by (
4).
3.4. Special Cases of Full Offloading and Non Offloading
Here, we proceed to analyze the special cases of the optimization problem, to determine under what conditions the binary decisions of full offloading or non offloading can be performed.
3.4.1. Optimality Condition of Total Offloading
Here, we investigate the conditions under which the optimum is to process all the application bits at the MEC server, i.e., . The condition is twofold: (1) should be feasible, and (2) .
According to the completion time constraint, the first condition (1) holds when , and , i.e., the time required to transmit the task to the MEC server plus the time for remote processing should not be larger than the maximum latency constraint.
Finally, the sufficient condition (2) can be expanded as:
This condition holds when
and
, which is equivalent to the situation where it will take less energy to offload the application, as discussed in
Section 3.2.
3.4.2. Optimality of Non Offloading
Here, we provide the necessary conditions under which the optimal solution is to process all task bits locally at the MD, i.e., . These conditions are twofold: (1) should be feasible, and (2) .
According to the completion time constraint, the first condition (1) holds only if i.e., the time for executing all the application bits locally at the MD should not violate the latency constraint.
On the other hand, we see from Equation (
19) that
is equal to
at
and is continuous within a certain interval containing
. These two characteristics allow us to state that
is constant (i.e., not dependent on
) within a certain interval containing
; therefore, the second condition (2) holds only when:
This condition indicates that if offloading within a time constraint costs more energy than local computing, the MD should rely on its own resources. This condition holds in the situation where
, and the proof is left for
Appendix B.
To summarize the results of the above subsections, the offloading policy decisions are listed in
Table 1, under different channel conditions and latency constraints.
5. Simulation Results
The following provides simulation results to illustrate the performance of the proposed offloading strategy. We first consider the scenario where the wireless picocell base station had a coverage radius of 500 m and the base station antenna was located inside the building. According to the path loss model for the residential environment, the path loss model is expressed as:
where
GHz is the carrier frequency;
is the distance between the mobile user and the wireless base station;
(dB) is the floor penetration factor for the ITU-Rmodel (13.2) [
36], and the number of floors
. We considered the computational task to be generated by a face recognition application, and the task profile refers to [
37,
38]. The computation profile refers to [
38,
39]. Moreover, as the mobile devices are evolving to become smarter and the resolution of the images and videos is getting higher, we set the data size for the task to
KB. The simulation parameters are summarized in
Table 2. These parameters were used as the default in the subsequent simulations unless explicitly specified. Beforehand, we calculated
s,
s,
s,
J.
The channel gain threshold was calculated from
Table 2 where
, according to Equation (
17). The relationship between channel gain and path loss was
; thus, the MEC’s full offloading coverage radius
m, which is denoted as the effective coverage in the rest of the paper.
In addition, we also list the numerical values of MEC’s full offloading coverage radius
for situations of different channel interference strengths and bit-wise complexity
, as shown in
Table 3. For an MEC system, developers can use these values to choose the right location of the MEC servers so as to make full use of their computing resources.
Figure 6 shows the relationship between transmission time and transmission energy consumed by the MD. Note that the energy spent was
. As the transmission time increased, it required less energy for the MD to transmit task data to the MEC server. In addition, when the MD got closer to the BS, the channel condition became better, i.e., the channel gain improved, and it cost the MD less energy to offload the computing task.
Next, for
, we verified that the full offloading policy was optimal in terms of minimizing energy consumption, as shown in
Figure 7.
Figure 7a shows the energy consumption for different distances between the BS and MD in the MEC-effective coverage of
. As shown in the figure, for
, the full offloading policy was optimal for saving energy, which corresponded to the optimum solution of (
16). In such a situation and
, full offloading energy was always lower than local computing energy. Moreover, only the task data size, the task complexity, and the CPU computing capability of MDs could affect the energy consumption and latency of local computing. Therefore, the changes in the distance between the MD and BS or the task completion time requirement did not influence the performance of local computing.
Figure 7b shows the energy savings as a percentage for the case of full offloading. Note that in the case of full offloading, the offloading time is
. As shown in the figure, if the latency deadline became less hard, and the distance from the BS decreased (the channel gain increased), the percentage of energy savings would increase, as expected.
In the situation in which the MD was out of the MEC-effective coverage for which , partial offloading was preferred, and we continued to evaluate performance of the energy-optimal policy under various settings.
Figure 8 demonstrates that the proposed partial offloading strategy was optimal for minimizing the energy consumption under the hard delay constraint, and the offloading fraction
increased as the latency deadline became more stringent. As shown in the figure, the total energy consumption increased as the distance from the BS increased.
Figure 9 also shows the energy and time consumption for different distances between the BS and MD for the case of partial offloading. For comparison, full offloading and local computing are included in
Figure 9. When
, full offloading consumed more energy than partial offloading and local computing under the same delay constraint, as shown in
Section 3. Moreover, the offloading energy increased as the distance from the BS increased, as shown in
Figure 9a. We can observe from
Figure 9b that the full offloading time was less than the partial offloading and local computing time. In fact, to meet the time constraint of
, it was necessary for the MD to distribute a part of the computing load to the MEC server; thus, the partial offloading strategy always cost more energy than local computing, as shown in
Figure 9a. Additionally, to minimize the energy consumption at the MD, the proportion of the offloaded application decreased with increasing distance. Therefore, more time was consumed on local computing. To compensate for the additional time loss of local computing, the offloading data rate
must be increased, yielding a shorter full offloading time
, as shown in
Figure 9b.
Figure 10 shows the percentage of a task to be remotely processed at the MEC server versus the distance from the BS. The task offloading percentage decreased as the distance from the BS increased, as expected. Note that as the latency constraint became tighter, the offloading percentage increased, and the percentage was also affected by the distance from the BS, which coincided with the optimal solution in (
39).