1. Introduction
The rise of CubeSats has revolutionized the space industry, providing a cost-effective and efficient means to conduct a wide range of scientific and technological missions. Since the first CubeSat was launched in 2003 [
1], these small satellites have gained significant popularity due to their compact size, reduced mass, and relatively short development timelines compared to traditional satellites. The adoption of CubeSats is driven largely by their use of commercial-off-the-shelf (COTS) components, which substantially lower the barriers to entry for space missions, making them accessible to academic institutions, small companies, and emerging space-faring nations. CubeSats have been successfully deployed in various applications, including Earth observation [
2,
3], communication [
4], air traffic management [
5], ionospheric science [
6], active orbital debris removal [
7], wildfire detection [
8], gravity wave detection [
9], and even interplanetary exploration [
10].
Despite the advantages offered by CubeSats, their operational deployment poses several challenges, particularly in achieving precise orbital maneuvers. CubeSats are typically launched as secondary payloads, which means that their insertion orbits are not specifically tailored to the mission requirements but rather subject to the constraints of the primary payload [
11]. This results in significant orbital variances that need to be corrected for the CubeSat to achieve its designated orbital elements. Such corrections are crucial, especially in missions involving constellations or formations of multiple CubeSats [
12,
13], where precise coordination and positioning are imperative.
Recent work in the CubeSat domain has mainly explored constellations and control methods. For instance, Jaffer et al. optimized CubeSat constellations for air traffic monitoring through ADS-B signals, demonstrating the potential of small satellite constellations in Earth observation and traffic management applications [
14]. Similarly, Gu et al. [
15] proposed an extended 2D map for satellite coverage analysis considering elevation-angle constraints, which enhances the accuracy of coverage calculations. Bai et al. presented a multi-stage constant-vector thrust control approach for finite-thrust Lambert transfers, contributing to more efficient orbital maneuver planning in [
16]. Additionally, Zhou et al. [
17] introduced a parametric formation control method for nanosatellite constellations, emphasizing cooperative observation missions such as those targeting the China Space Station.
One of the primary challenges in orbital maneuvering for CubeSats is the limitation in the onboard fuel capacity. Due to their small size, CubeSats can only carry a limited amount of propellant, which necessitates highly efficient maneuvering strategies to conserve fuel. Moreover, the propulsion systems available for CubeSats generally offer low thrust levels, requiring prolonged operational durations to achieve the necessary changes in velocity (
) for orbit correction [
18]. This is further compounded by the limited electric power generated by CubeSats, which restricts the continuous operation of propulsion systems and, consequently, the ability to perform extensive maneuvers. Moreover, it is not practical to use complex control methods such as MPC [
19,
20] in CubeSats due to onboard processor limitations.
To address the challenge of limited fuel availability onboard small satellites, various orbit insertion and maneuvering techniques have been proposed in the literature to reduce dependency on fuel consumption for orbital corrections. One prominent research area involves utilizing aerodynamic forces to control satellite orbits. This approach typically involves deploying large panels to generate drag forces in specific directions, thereby influencing the satellite’s trajectory [
21]. For example, the potential of using aerodynamic forces for CubeSat orbit control has been thoroughly examined in [
22]. Shao et al. [
23] extended this concept to satellite rendezvous scenarios by leveraging aerodynamic forces for orbital adjustments. Similarly, differential drag-based rendezvous maneuvers of multiple spacecraft under
perturbation were analyzed in [
24]. The control of satellite formations using lift and drag forces under the influence of
perturbation is further explored in [
25]. Varna and Kumar [
26] designed a sliding mode controller to manage multiple satellite formation flying using aerodynamic forces. In a similar vein, Prez and Bevilacqua [
27] developed a Lyapunov-based adaptive feedback control strategy to regulate satellite orbits through aerodynamic drag. The use of lift forces for satellite formation control is also discussed in [
28]. Moreover, Omar and Bevilacqua [
29] proposed a time-optimal collision avoidance strategy using aerodynamic drag, which was derived from an analytical solution and iteratively refined using high-fidelity orbit propagators.
In addition to aerodynamic control, several other auxiliary maneuvers have been proposed to further enhance orbit corrections. These include exploiting solar radiation pressure [
30,
31], third-body perturbations [
32], the Moon’s gravitational influence [
33,
34], magnetic disturbances [
35,
36], tethered system [
37], and electromagnetic forces [
38,
39]. However, the effectiveness of these methods is highly dependent on the specific orbital regime and the relative positions of the Earth, Moon, and Sun.
In addition to fuel and power constraints, the reliance on COTS components introduces variability in manufacturing accuracy and the performance of onboard systems. These variabilities can lead to deviations from the expected performance, further complicating the task of precise orbital maneuvering [
40]. The components, while cost-effective, may not always meet the standards required for space operations, leading to unmodeled dynamics and uncertainties during the maneuver execution.
To address these challenges, this paper presents an innovative approach that combines advanced orbital mechanics with state-of-the-art machine learning techniques. Central to this approach is the development of the
-optimized sequence, a maneuvering strategy that leverages the
perturbation—caused by the Earth’s equatorial bulge (oblateness)—to reduce the amount of propellant required for orbit corrections. The
perturbation is a dominant force in low Earth orbits (LEOs) [
41], where most CubeSats operate, and can be exploited to assist in achieving desired orbital changes with minimal fuel consumption. The
-optimized sequence is designed specifically to operate within the constraints of CubeSat missions, optimizing fuel usage while maintaining the desired orbital parameters.
While higher-order zonal harmonics (e.g.,
and
) and lunar-solar gravitational effects can be incorporated into orbital maneuver planning, these techniques are more suited for higher orbits, long-duration missions, or missions around other planets and asteroids [
42,
43]. In LEO, the
perturbation is the dominant force, and focusing on it provides significant fuel savings with reduced computational complexity compared to more complex perturbation-based techniques. For CubeSats with limited onboard resources, the
-based approach strikes an optimal balance between efficiency and simplicity.
The Earth’s gravitational field is not perfectly spherical due to the planet’s equatorial bulge, resulting in a perturbation known as the
effect. This oblateness, quantified via the
harmonic coefficient, significantly influences the motion of satellites in LEOs. The
perturbation induces secular changes in orbital elements, most notably in the right ascension of the ascending node (
), the argument of perigee (
), and, to a lesser extent, the inclination. These secular variations are crucial in orbit determination and long-term orbital planning. In particular, the
perturbation causes a precession of the orbital plane, with the rate of change in
being directly related to the satellite’s altitude and inclination. Recent studies [
44,
45] have explored the intricate dynamics resulting from Earth’s oblateness, offering insights into both short- and long-term effects on satellite orbits. Vallado [
46] provided a comprehensive overview of the fundamental effects of
on satellite dynamics, with detailed mathematical derivations that highlight its importance in mission design and trajectory correction. Similarly, Montenbruck and Gill [
47] discussed practical methods for accounting for
perturbation in satellite control and navigation systems, offering both theoretical insights and applied strategies for satellite operators. Additionally, Hoots incorporated
effects in the SGP4 propagation model in [
48], which is widely used in satellite tracking and prediction, further emphasizing the critical role of Earth’s oblateness in satellite mission planning and operations. These references collectively underscore the necessity of accounting for
perturbation in the development of efficient and accurate satellite control strategies.
The inherent uncertainties and unmodeled dynamics in CubeSat operations necessitate a more adaptive and resilient maneuvering approach. To address these challenges, machine learning (ML) emerges as a promising solution, offering the ability to adapt to and mitigate the effects of unmodeled dynamics and uncertainties [
49]. Various studies [
50,
51,
52] have demonstrated the effectiveness of ML techniques in different contexts, highlighting their potential for improving the accuracy and reliability of complex systems under real-world conditions. Within the spectrum of ML approaches, long short-term memory (LSTM) networks are often considered due to their capability to handle sequential data and long-term dependencies [
53,
54]. However, LSTM networks demand substantial computational resources, which exceed the capabilities of CubeSats, making them impractical for onboard use [
55]. To this end, the paper introduces a novel two-level ML framework that leverages a gated recurrent unit (GRU) model [
56], an alternative to LSTM, designed to enhance the accuracy and reliability of orbital maneuvers within the constraints of CubeSat platforms. The GRU is particularly well-suited for this application because it efficiently processes time-sequenced data, such as the maneuvering sequences required in orbital operations, while maintaining a lower computational burden compared to LSTM [
57].
In the proposed method, the -optimized sequence, as the higher level, is responsible for designing and optimizing the maneuver sequence. At the lower level of the proposed framework, the GRU model monitors discrepancies between the planned maneuver sequence and the actual outcomes observed during execution to detect possible disturbances and misalignments. It then correctly generates the best maneuver to reach the waypoint generated by the high-level controller, aiding in the initialization with the high-level controller. By continuously adapting to the errors and uncertainties that arise from factors such as COTS component variability and external perturbations, the GRU adjusts the maneuvering sequence in real-time, ensuring that the CubeSat remains on its intended trajectory with enhanced precision. The interplay between the two levels of the ML framework enables the system to iteratively refine the maneuvers, achieving a level of precision that would be difficult to attain with traditional methods alone.
The incorporation of ML into the maneuvering strategy offers several key advantages. ML algorithms, particularly those that are capable of learning and adapting in real-time, are well-equipped to handle the complexities of space dynamics, which are characterized by non-linearities, time-varying behaviors, and uncertainties [
58]. The GRU model’s ability to process temporal sequences and adjust its predictions based on new data makes it particularly effective in dealing with the dynamic nature of orbital maneuvers. Furthermore, the ML approach reduces the reliance on extensive pre-mission simulations and modeling, allowing for more flexible and responsive mission operations.
This paper provides a comprehensive analysis of the proposed approach, detailing the development and implementation of the -optimized sequence and the two-level ML framework. The methodology is validated through numerical simulations, which demonstrate the effectiveness of the -optimized sequence in reducing fuel consumption and the capability of the ML framework to enhance maneuver accuracy under uncertain conditions. The results underscore the potential of integrating ML with traditional orbital mechanics to advance the state of CubeSat mission planning and execution.
This paper makes several key contributions to the field of CubeSat orbital maneuvering:
Introduction of the -optimized sequence: A maneuvering strategy that leverages the perturbation to reduce fuel consumption in orbit corrections. This approach is specifically tailored to the constraints of CubeSat missions, enhancing fuel efficiency while maintaining precise orbital parameters.
Development of a two-Level ML framework: The paper presents an innovative two-level machine learning approach which improves the accuracy and reliability of CubeSat maneuvers. The augmented GRU model adapts to real-time deviations and uncertainties, while the low-level controller dynamically optimizes the maneuver sequence.
This paper introduces several key innovations in the field of CubeSat orbital maneuvering. Unlike traditional methods that rely solely on pre-planned sequences or computationally expensive optimization techniques, our approach combines the natural perturbative effects of the Earth’s oblateness with a machine learning-based controller that can adapt in real-time. The use of the -optimized sequence for fuel efficiency, combined with the GRU-based low-level control for real-time adaptability, allows our method to outperform existing techniques in both fuel consumption and maneuver accuracy, especially under uncertain conditions. These advancements mark a significant contribution to the field of small satellite control, making it more accessible and practical for CubeSat missions.
The structure of the paper is organized as follows: In
Section 2, the orbital dynamics and the mathematical model of the problem are thoroughly presented, establishing the theoretical foundation necessary for understanding the subsequent developments.
Section 3 introduces the ML method, focusing on the architecture and operation of the two-level ML framework, including the GRU model and its role. In addition, it presents the detailing of the
-optimized sequence’s formulation and the mechanics of leveraging the
perturbation to minimize fuel consumption in orbit corrections. In
Section 4, simulation results are presented, showcasing the effectiveness of the
-optimized sequence and the ML framework in practical scenarios, along with a comprehensive analysis of the improvements in maneuver accuracy and fuel efficiency. Finally,
Section 5 concludes the paper with a summary of the findings, discussing the implications of the proposed approach for future CubeSat missions and potential avenues for further research.
3. Integrated Control Methodology for CubeSat Orbital Maneuvering
This chapter presents an integrated control methodology designed to address the challenges of precise orbital maneuvering for CubeSats, considering the uncertainties and disturbances that affect the system during operations. The control framework consists of two interconnected layers: a high-level control layer that utilizes the -optimized sequence to generate waypoints for reaching the terminal point and a low-level control layer based on a GRU network to design maneuvers for reaching each waypoint, considering environmental uncertainties and control misalignments. This combined approach enables precise trajectory corrections while minimizing fuel consumption and compensating for the effects of unmodeled dynamics.
3.1. Overview of the Two-Level Control Architecture
The proposed control architecture for CubeSat orbital maneuvering consists of two interconnected layers: a high-level control layer and a low-level control layer. At the core of this system is the -optimized sequence, implemented within the high-level control layer. This sequence calculates an optimal maneuver plan based on the desired final orbital parameters, the initial orbital state, and the available time window for executing the maneuvers. The inputs to this controller include the initial orbital parameters , the final desired parameters , and the time window for maneuver completion. The output is a detailed maneuver plan specifying the timing and magnitude of each maneuver across five distinct steps, along with the expected orbital parameters at the end of each step.
While the -optimized sequence effectively calculates the maneuver sequence, it does not account for external disturbances or manufacturing and control errors inherent in the CubeSat’s hardware. These unmodeled dynamics can cause deviations between the actual CubeSat behavior and the expected results from the -optimized sequence. To address this, a low-level control layer is introduced, designed to monitor the performance of the high-level maneuvers and calculate real-time adjustments. This layer is based on a GRU network, which helps the system reach each waypoint generated by the -optimized sequence. The GRU network utilizes the initial maneuver generated by the -optimized sequence as a warm start to calculate appropriate adjustments, considering disturbances and uncertainties in the environment, ensuring the CubeSat reaches its destination safely and accurately.
The GRU network is trained offline using simulated data generated from the -optimized sequence and the satellite’s dynamic model. During training, the GRU learns the satellite’s maneuvering behavior and how to adjust the required and timing to reach each waypoint, compensating for unknown disturbances and potential control misalignments. The inputs to the GRU include the CubeSat’s current state and the waypoint generated by the high-level control layer. The GRU’s output provides the necessary adjustments in and timing to ensure the CubeSat accurately follows the desired trajectory, despite any real-time uncertainties.
The proposed two-level architecture is highly adaptive, enabling precise orbital corrections even in the presence of uncertainties. The block diagram of the system is illustrated in
Figure 3, showing the interactions between the high-level controller (green color), low-level controller (blue color), Kalman filter (red color), and the CubeSat dynamics (purple color). The combined operation of these components allows the CubeSat to execute the optimal maneuver sequence while continuously adapting to real-time conditions. It should be noted that all parameters and designations introduced in the diagram are defined and discussed in detail in the following subsections.
3.2. High-Level Control Layer
The high-level control layer is mainly responsible for generating the waypoints to reach the final target point and detailed maneuver commands in the environment without uncertainties. This layer focuses on implementing the orbital maneuvers in a fuel-efficient manner. The maneuvers are divided into in-plane and out-of-plane maneuvers, each targeting specific orbital elements.
3.2.1. In-Plane Maneuver Theory
In-plane maneuvers primarily modify the semi-major axis
and the argument of latitude
, as these generally require less
compared to out-of-plane maneuvers that adjust the inclination
and the right ascension of the ascending node
. The first in-plane maneuver is typically used to adjust the semi-major axis
. From a linear approximation of Gauss’s variational equations (Equation (1a)), the maneuver for modifying
is derived as follows:
where
is the change in velocity in the along-track direction, and
is the propulsion burn time. In this equation,
and
represent the initial and final semi-major axes, respectively, and
is the desired change in the semi-major axis. The maneuver is executed by applying a controlled thrust over a specific burn time
.
A more complex in-plane maneuver is required to adjust the argument of latitude
while keeping
unchanged. A double-thrust solution is used to achieve this, which involves performing a pair of along-track maneuvers that cancel each other’s effects on
while modifying
. The change in
is given by:
where
represents the number of revolutions between the two thrust impulses, and
and
are the initial and final arguments of latitude, respectively. This maneuver assumes the CubeSat is in a circular orbit and targets precise control over the argument of latitude without affecting the semi-major axis.
3.2.2. Out-of-Plane Maneuver Theory
Out-of-plane maneuvers are necessary for modifying the inclination
and the right ascension of the ascending node
. These maneuvers typically require larger
compared to in-plane maneuvers. According to Gauss’s equations (Equation (1c) and (1d)), the maneuvers for adjusting
and
can be derived as follows:
where
is an integer. The first equation represents the required velocity change for adjusting the inclination
while keeping
constant, and the second equation adjusts
while keeping
constant. These maneuvers are typically performed at specific points in the orbit, as indicated by the values of
, to maximize efficiency.
A combined out-of-plane maneuver can be used to simultaneously modify both
and
using a single thrust at a specific argument of latitude
, which is determined by equating the right-hand sides of Equation (7). The optimal argument of latitude is:
Substituting
into (7) gives the required velocity change for the combined maneuver:
In practice, if the CubeSat’s propulsion system cannot provide the necessary thrust instantaneously, the maneuver must be executed over a longer burn time. In this case, the continuous variation in
and
is given using the following integrals:
where
is the propulsion burn time.
Figure 4 illustrates the relative error of the plane change maneuvers derived from Equation (7) compared to Equation (10) as a function of
for a sun-synchronous orbit at 786 km altitude. While Equation (7) assumes a single thrust maneuver, longer burn times,
, reduce the accuracy due to the linear approximation and the continuous movement of the satellite during the maneuver, which is taken into account in Equation (10). The results show that while the relative error gradually increases with longer burn times, it remains below 1% for
s. If a smaller maximum burn time,
, is selected to reduce error, multiple consecutive maneuvers are required to achieve the desired changes. For each thrust, the optimal
and the required velocity change
are recalculated, and the burn time is set to
, where
is given by:
If , the maneuver is continued in the next orbital revolution with updated values for and .
3.2.3. Orbital Maneuver Planning
For transferring a CubeSat into its desired final orbit, two options, as maneuvering sequences, are presented in this section. The first maneuvering option is the classic sequence, which utilizes the maneuvers outlined in the previous section for the orbit correction, Equations (5)–(9), without employing the beneficial effects of perturbation. Note that in the rest of the paper, the orbital elements of the initial, transfer, and final orbits are designated by the subscripts “0”, “t”, and “f”, respectively. In addition, it is assumed that each maneuvering sequence starts from the time epoch and completes at .
The
classic sequence first modifies
and
simultaneously using a
maneuver, where
and
. Then, it utilizes an in-plane maneuver of
to correct
. Finally, the sequence performs
to change
to the desired value, where
represents the argument of latitude of CubeSat before starting the maneuver. It should be noted that because of changing
between two impulses of
,
is changed due to the
perturbation, and as a result, the final
may be different from
, which should be compensated for using a pure right ascension of the ascending node modification maneuver,
. The drift angle of
with respect to the desired orbit during the
maneuver,
, can be obtained using a first-order Taylor series approximation of Gauss’ Equation (1d) as follows:
where
denotes the time duration between the pair of
impulses. Thus, the total required
for the
classic sequence can be obtained as:
The
-optimized sequence attempts to take benefits from the
perturbation by placing the CubeSat on a transfer orbit with appropriate
and
, so that the
perturbation decreases the error in
and
with respect to the desired orbit over the maneuvering period. If the initial and transfer orbits have a similar right ascension of the ascending node at the epoch,
, the necessary condition for convergence of
to the desired value, using the
perturbation, is:
which can be substituted into Gauss’ Equation (1d) to find
as:
Considering the orbital period of the transfer and desired orbits, the necessary condition to achieve the desired value of the argument of latitude,
, can be found as:
where
is the number of revolutions of the CubeSat on the transfer orbit. Moreover,
represents the variation of the argument of latitude of the CubeSat on the transfer orbit with respect to the desired orbit due to the
perturbation, which can be calculated using a first-order Taylor series approximation of the Gauss’s Equation (1f), as:
Equation (17) describes the variation of the argument of latitude due to the perturbation during the transfer orbit. This equation accounts for the secular effects of perturbation, which arise from the Earth’s equatorial bulge, and how these effects influence the CubeSat’s orbital elements over time. The first part of Equation (17) considers the secular change in the argument of latitude caused by the change in semi-major axis between the final orbit, , and the transfer orbit, . The second part captures the effect of the perturbation due to the inclination change, , on the argument of latitude. The equation shows how these two factors contribute to the drift in the argument of latitude over the maneuvering period, demonstrating the cumulative impact of both the semi-major axis and inclination changes on the orbital parameters.
Figure 5 shows the diagram of the
-optimized sequence. The sequence begins with an in-plane maneuver of
to change the semi-major axis of the orbit (the red arrow). Then, the inclination angle of the transfer orbit is provided by a pure inclination change maneuver of
(the blue arrow). When the CubeSat is inserted in the appropriate transfer orbit, the argument of latitude and right ascension of the ascending node slowly drift to the desired values, because of the
perturbation (the purple arrow). Then, the CubeSat should perform the maneuver of
over the last orbital revolution of the maneuvering period when the argument of latitude exactly matches the desired value (the orange arrow). Finally, a pure inclination change maneuver of
should be performed at
to reach the desired orbit (the green arrow). It is worth noting that since out-of-plane maneuvers are performed at
, the argument of latitude is not affected by the maneuvers. The total required
of the sequence can be obtained as:
3.3. Kalman Filtering for Input Estimation
In the proposed control architecture, a continuous-time Kalman filter is employed to estimate the pointing accuracy,
, which represents the uncertainties in the thrust direction during the CubeSat’s orbital maneuvers. Accurate estimation of
is crucial for the low-level control layer, allowing the GRU to dynamically adjust the maneuver sequence in response to real-time conditions. The Kalman filter is designed to estimate the hidden states (unmeasured properties) of the system, even if the measurements are imprecise and uncertain [
61]. The pointing accuracy is measured by measuring the difference of the actual input forces,
, and the expected input force,
, represented by
For this purpose, the Kalman filter’s responsibility is to estimate
. We augment the state-space model by including the input vector as part of an extended state vector. The augmented state vector,
, is defined as:
The dynamics of the augmented state vector are governed by:
where
is the state transition matrix derived from the original system dynamics, and
represents the process noise associated with the dynamics of both the state and input vectors. Specifically,
is structured as:
where the zero matrices reflect the assumption that the input dynamics are driven by process noise rather than deterministic processes. The matrix
represents the Jacobian of the nonlinear function with respect to the state, and it is defined as
. This matrix captures the local linearization of the system dynamics around the current state estimate
. It reflects how small changes in the state vector
impact the system dynamics. However, the matrix
in Equation (2), which represents the control input’s effect on the system, does not need to be linearized. Since
is already a function of
, and
evolves over time,
is naturally time-varying and can be directly treated as
.
The Kalman filter operates by predicting the evolution of the augmented state vector
over time, based on the state transition model. The filter also predicts the error covariance, which quantifies the uncertainty in the state and input estimates. The prediction step involves the following equations:
where
is the predicted state vector,
is the predicted error covariance matrix, and
represents the process noise covariance matrix.
As new measurements of the state vector
are obtained, the Kalman filter updates the state estimates. The measurement model links the observed states to the augmented state vector through a measurement matrix,
, which extracts the state vector from the augmented state. The measurement model is expressed as:
where
is the measurement vector, and
is the measurement noise. In this context, the measurement matrix,
, is given by:
The Kalman gain,
, is computed to optimally combine the predicted state and the new measurements. The gain is calculated as [
62]:
where
is the measurement noise covariance matrix. The Kalman gain adjusts the weight given to the measurement versus the model prediction in the state update.
The state vector
, which includes the input vector
, is updated continuously as new measurements become available. The state update equation is:
The updated error covariance matrix is computed as:
Through this continuous-time Kalman filtering process, the filter provides real-time estimates of the actual input forces , compensating for discrepancies between commanded and actual inputs caused by uncertainties and disturbances. These estimates are then used to refine the control strategy, allowing for more precise adjustments to the CubeSat’s maneuvering sequence.
3.4. Low-Level Control Layer
While the high-level control layer, utilizing the -optimized sequence, generates an optimal maneuver plan under ideal conditions, it is not immune to external disturbances and unmodeled dynamics. These factors can cause deviations from the planned trajectory, necessitating real-time corrections. To address this, we propose the integration of a GRU-based low-level control layer. This layer is designed to provide adaptive maneuver adjustments, ensuring the CubeSat reaches its target orbit accurately despite environmental uncertainties. The GRU is a type of recurrent neural network (RNN) specifically designed to handle sequential data and capture dependencies over time, making it an ideal choice for processing orbital state data and predicting necessary maneuver adjustments. Unlike traditional RNNs, GRUs are computationally efficient and mitigate the vanishing gradient problem, which is crucial given the limited computational resources available on CubeSats.
The GRU network inputs include:
Current orbital state : A vector representing the CubeSat’s current orbital parameters.
Waypoint : The next target waypoint generated by the high-level -optimized sequence, representing the desired state in the maneuver plan.
Pointing accuracy: The pointing accuracy of the CubeSat that is estimated by the Kalman filter.
Historical state data: A sequence of previous orbital states and control inputs, enabling the GRU to maintain context and predict future deviations.
The outputs of the GRU include:
Adjusted : The necessary correction to the delta velocity that compensates for any deviations from the planned trajectory.
Adjusted timing: The timing adjustments required to execute the maneuvers at the optimal moments, ensuring precise control despite disturbances.
The GRU network consists of several key components: the update gate, reset gate, candidate hidden state, and final hidden state. The update gate controls how much of the previous hidden state should be retained, while the reset gate determines how much of the past information should be forgotten when calculating the candidate hidden state.
For each time step,
, the update gate,
is calculated as:
where
is the sigmoid activation function,
is the input at time step
,
is the hidden state from the previous time step,
and
are the weight matrices, and
is the bias term.
The reset gate,
, which controls how much of the previous hidden state to use in calculating the candidate hidden state, is given by:
Using the reset gate, the candidate hidden state,
, is computed as follows:
where
denotes element-wise multiplication, and
is the hyperbolic tangent activation function. The candidate hidden state,
, represents the new information that could be added to the final hidden state.
The final hidden state,
, which captures the updated information for the current time step, is given by:
The hidden state,
, encapsulates the learned knowledge from both the past and current time steps, making it suitable for predicting adjustments needed in future maneuvers. The GRU’s output
provides the necessary adjustments for
and maneuver timing as
where
is the maneuver timing adjustment.
The GRU is trained offline using a dataset generated through simulations. These simulations model the CubeSat’s dynamics under various conditions using STK including different types of disturbances such as atmospheric drag, gravitational perturbations, and thruster misalignments. Simulated scenarios are created where the CubeSat executes a maneuver plan generated by the
-optimized sequence. The simulations introduce disturbances that cause deviations from the ideal trajectory. The GRU is trained in a supervised manner, with the objective of minimizing the mean squared error (MSE) between the GRU’s predicted adjustments and the adjustments calculated from the simulations. The MSE loss function effectively penalizes inaccurate predictions, guiding the GRU to learn the correct mapping from the inputs to the necessary adjustments. To prevent overfitting and ensure robust performance across a wide range of conditions, dropout is applied during training. The GRU’s performance is validated using a separate dataset, comprising scenarios not seen during training, to test its ability to generalize to new and unexpected disturbances.
where
is the predicted adjustment at time
,
is the target adjustment based on the simulation,
is the number of training samples, and
is a positive of the normalized weight.
Once trained, the GRU is deployed onboard the CubeSat as part of the low-level control layer. During the mission, the GRU operates in real-time, continuously receiving updates on the CubeSat’s current state and comparing it with the planned trajectory. Based on this information, the GRU provides instantaneous adjustments to the maneuver plan, specifically adjusting the ΔV and timing for each waypoint generated by the high-level controller. This allows the CubeSat to maintain its desired trajectory, even in the presence of unmodeled dynamics and environmental uncertainties.
4. Experiment Result and Discussion
This section evaluates the performance of the proposed control strategy through numerical simulations. The first part investigates the effectiveness of the -optimized sequence in orbital corrections. The second part assesses the proposed approach by analyzing the training process and comparing the results against those obtained using an LSTM-based method.
4.1. Performance Evaluation of the -Optimized Sequence
The performance of the proposed
-optimized sequence is assessed through an orbit correction scenario involving a 10 kg CubeSat equipped with a 0.1 N continuous thruster, capable of a maximum burn time of 7 min per maneuver. The scenario assumes an initial time epoch of
days and a total maneuvering period of
days. Two cases are considered, each with initial deviations in the orbital elements relative to a target circular sun-synchronous orbit at an altitude of 786 km. The orbital deviations for both cases are presented in
Table 1.
Figure 6 compares the evolution of orbital element deviations between the actual and desired CubeSat positions for case A using both the
classic and
-optimized sequence. The results show that with the
-optimized sequence,
is initially increased to achieve a sufficient rate of change in
, allowing
to converge steadily to zero by the end of the sequence, using the
perturbation while maintaining the transfer orbit inclination
. Additionally, the deviations in
and
are concurrently reduced to zero by the sequence’s conclusion. It can be seen in
Figure 7, where the miss distance is compared, that the CubeSat gradually approaches the desired position, achieving a final positional accuracy better than 1.2 km. The remaining positional error is attributed to the truncation in the Taylor series expansion used in Gauss’ equations and the fact that the thrust application is continuous rather than impulsive.
For the classic sequence, the figures show that and are modified simultaneously at the initial epoch. However, due to the influence of the perturbation, slowly increases, necessitating an additional maneuver, , at the end of the sequence to correct the right ascension. Despite this correction, the final positional accuracy is slightly worse than in the -optimized sequence, with an error of approximately 2 km relative to the desired position. The larger final deviation in the classic sequence is primarily due to the last maneuver, , which also introduces a slight change in , causing the CubeSat to drift from the desired position. This final maneuver is needed because, during the correction maneuver, it is assumed that both and remain constant; however, in practice, changes due to the perturbation. The longer the correction maneuver takes, the more adjustment is needed for at the end, leading to greater deviations in and, consequently, a larger positional error.
In contrast, in the -optimized sequence, the correction in is naturally achieved by the perturbation itself. Given sufficient time, eventually converges to its desired value, resulting in a more accurate final position.
Beyond improving positional accuracy, the
-optimized sequence also outperforms the
classic sequence in terms of fuel consumption for case A. The total required
for the
-optimized sequence is 24
, compared to 91
for the
classic sequence. However, the
-optimized sequence does not always guarantee lower fuel consumption.
Figure 8 depicts the required
for cases A and B as a function of the maneuver period
, comparing the
classic and
-optimized sequence. For case A, the
classic sequence consumes less
than the
-optimized sequence when
days, while for case B, the
classic sequence is more fuel-efficient when
days.
As the maneuver period increases, the required generally decreases. However, for case A using the -optimized sequence, the stabilizes for days because during this period, resulting in a constant value of + . In this scenario, when days, providing a sufficient rate of change for to converge to zero by the end of the sequence. For days, the increasing difference leads to a larger . In case B, where , the decreases monotonically with the -optimized sequence, approaching a minimum value (equivalent to ) as approaches infinity.
In general, if the admissible maneuvering period is limited, using the -optimized sequence increases the fuel consumption, and as a result, the classic sequence is practically reasonable to be chosen. If the maneuvering period can be sufficiently large, the effect of perturbation on can be utilized for reducing the fuel consumption using the -optimized sequence.
4.2. Performance Evaluation of the Machine Learning Approach
The performance of the proposed machine learning approach for adjusting orbital maneuvers is evaluated by training the GRU network using a comprehensive dataset generated in a simulated environment. The database was created by modeling various orbital scenarios and disturbances using the Systems Tool Kit (STK) 12.3 and MATLAB 2024a. These scenarios include different initial orbital conditions, various levels of atmospheric drag, solar radiation pressure, gravitational perturbations from the Moon and Sun, and other environmental factors. The database is designed to cover a wide range of possible situations that the CubeSat might encounter during its mission, providing a robust foundation for training the machine learning model.
To ensure the model’s reliability and generalizability, a five-fold cross-validation approach was employed during the training process. This method divides the dataset into five subsets, using four subsets for training and one for validation in each iteration. The model is trained over multiple iterations, with the process repeated five times, ensuring that each subset is used for validation once. This approach helps prevent overfitting and ensures that the model performs well on unseen data.
Through simulations, the best hyperparameters for the GRU network were selected. These hyperparameters include the number of layers, the number of neurons in each layer, the learning rate, the batch size, and the dropout rate. The selected hyperparameters, which provided the optimal balance between accuracy and computational efficiency, are presented in
Table 2.
The structure of the GRU network used in this study is as follows: the network consists of three layers, each containing 128 neurons. The ReLU (rectified linear unit) activation function is applied to introduce non-linearity into the network, allowing it to model complex relationships between the inputs and outputs. A dropout rate of 0.3 is used to prevent overfitting by randomly disabling 30% of the neurons during each training iteration. The Adam optimizer, known for its efficiency and adaptability in training deep neural networks, is employed to minimize the loss function.
Additionally, the Kalman filter parameters critical for achieving accurate state estimation are meticulously configured. The measurement noise covariance matrix () and the process noise covariance matrix () are set with all diagonal values at 0.001 and 0.01, respectively. These values reflect the precision of onboard sensors and the expected low measurement noise, along with accounting for uncertainties in the dynamics and external disturbances. The calibration of these matrices was determined through iterative testing, ensuring robust state estimations under various operational conditions.
The training process was monitored by evaluating the loss function, which measures the error between the predicted and actual outcomes.
Figure 9 shows the normalized loss for both training and validation sets as a function of the number of iterations. The plot demonstrates that the model converges after approximately 130 iterations, with the loss function reaching a stable minimum. This indicates that the GRU network effectively learns the relationships between the input features and the necessary adjustments for orbital maneuvering, ensuring accurate predictions.
In addition to evaluating the model’s accuracy, the computational efficiency of the GRU-based approach was compared with a long short-term memory (LSTM) network, another popular RNN architecture used for time-series predictions. The comparison focuses on the total computational time required for training and the time taken to make predictions during operation.
Table 3 presents the computational time values for both methods.
The results indicate that the GRU network is significantly faster than the LSTM network in both training and prediction phases. The primary reason for this difference is the GRU’s simpler architecture. The GRU network uses fewer gates than the LSTM, reducing the computational complexity. Specifically, the GRU has two gates (update and reset) compared to the LSTM’s three gates (input, forget, and output), which simplifies the computations involved in updating the hidden state. Additionally, the GRU’s lower memory footprint further contributes to its faster performance, making it more suitable for onboard implementations where processing power and energy consumption are limited.
To further evaluate the performance of the trained system, a Monte Carlo simulation was conducted, focusing on the scenario described in
Section 4.1. In this simulation, the initial altitude of the CubeSat was varied randomly within the range of 700 to 800 km to assess the robustness of the system under varying initial conditions. Additionally, the pointing accuracy of the CubeSat was modeled as a normal distribution with a covariance of
degrees, simulating the uncertainties in the thrust direction.
The accuracy of the system was evaluated by measuring the miss distance, defined as the minimum distance between the CubeSat and the target orbit during the maneuvering period, as a function of . The Monte Carlo simulation was performed for both the trained system, which combines the -optimized sequence with the GRU and Kalman filter, and for a system that relies solely on the -optimized sequence without the GRU (acting as the low-level controller).
Figure 10 presents the miss distance as a function of
for the system (including median, interquartile range (IQR), and outliers) using only the
-optimized sequence. As shown in the figure, the miss distance increases significantly with
. For
degrees, the miss distance is already around 20 km. When
degrees, the miss distance exceeds 4000 km. The primary reason for this dramatic increase is that, due to uncertainties in the thrust direction, the semi-major axis of the orbit cannot be effectively controlled, leading to a gradual shift of the CubeSat away from the target. As
increases to 20 degrees, the CubeSat fails to approach the target in many cases, demonstrating the limitations of relying solely on the
-optimized sequence under high uncertainty.
In contrast,
Figure 11 illustrates the miss distance versus
for the proposed combined system that integrates the
-optimized sequence with the GRU and Kalman filter. The results clearly demonstrate that the GRU, aided by the Kalman filter, effectively compensates for uncertainties in the thrust direction. Although the miss distance does increase with
, the system maintains a much lower miss distance even at higher values of
. For example, with
degrees, the average miss distance remains around 10 km, which is within acceptable limits for most missions. This stark contrast highlights the enhanced performance of the combined system, which not only leverages the fuel-saving benefits of the
perturbation but also offers robust and fault-tolerant control through the GRU and Kalman filter.
The results of the Monte Carlo simulation underscore the advantages of integrating the -optimized sequence with advanced machine learning techniques. This combination not only capitalizes on the natural perturbations to reduce fuel consumption but also provides a robust solution capable of maintaining accuracy under significant uncertainties. The GRU’s ability to adapt to real-time data and the Kalman filter’s precise input estimation contribute to a system that is both efficient and resilient, making it highly suitable for CubeSat missions in dynamic and uncertain environments.
The performance of the two-level control architecture is influenced by the frequency of updates between the high-level -optimized sequence and the low-level GRU-based controller. Through sensitivity analysis, we observed that increasing the update frequency enhances the system’s ability to respond to disturbances in real time, improving positional accuracy. However, even with reduced update frequency, the GRU-based controller compensates for the delay, though with a slight decrease in accuracy under highly dynamic conditions. A moderate update frequency, such as once per orbital period, provides an optimal balance between computational demands and control precision.