1. Introduction
The attention on electric vertical takeoff and landing (eVTOL) aircraft has grown significantly because eVTOL is suitable for missions that require flexible maneuverability and precise control, such as urban air mobility (UAM) [
1,
2]. UAM represents a safe and efficient air transportation system where everything from small package delivery drones using unmanned aerial vehicles (UAVs) to passenger-carrying air taxis through eVTOL aircraft can operate above populated areas [
3,
4]. UAVs have broad applications, including the sixth-generation communication using UAVs and an aerial base station for supporting the Internet of Things deployment in remote and disaster areas [
5]. Hua et al. [
6] proposed 3D non-stationary modeling for UAV-to-ground communications and managed to demonstrate its validity and practicality against measured results. eVTOL aircraft’s unique characteristics of precise delivery, lower cost, and reduced noise have motivated significant developments [
7,
8]. Conceptions under development can be categorized into a few major aircraft types, including lift+cruise (such as Aurora Flight Sciences eVTOL [
9]) and tilt-wing (such as Airbus
Vahana [
10]).
Among the aforementioned eVTOL types, the tilt-wing configuration enables the aircraft to combine the flexibility of a helicopter for vertical takeoff and landing with the efficiency of airplanes during cruising. Thus, the transition optimization on tilt-wing eVTOL aircraft attracts special interest since transition has major effects on the success of flight tasks [
11]. Pradeep and Wei [
12,
13] put an emphasis on the formulation of multiphase optimal control problems with energy consumption index for tilt-wing and multirotor eVTOL vehicles. The proposed multiphase optimal control problem formulation and the numerical solution allowed an eVTOL air taxi to fulfill the specified required arrival time while attaining the most energy-efficient trajectory for arrival. This capability played a vital role in enabling safe and efficient future operations of eVTOL aircraft, facilitating passenger transportation and cargo delivery. Chauhan and Martins [
14] constructed an Airbus
Vahana [
10] model and optimized its takeoff-to-cruise trajectory with the objective of minimizing energy consumption. They concluded that the optimal takeoff trajectory involved stalling the wings or flying near the stall angle of attack. Moreover, in the absence of acceleration constraints, the optimized trajectories involved a rapid transition to forward flight, followed by climbing at a relatively constant speed, and then accelerating to the desired cruising speed. When considering passenger comfort and incorporating acceleration constraints, the transition, climb, and acceleration phases exhibited more gradual and less distinct behavior, as expected. However, completed work on transition optimization as well as conventional engineering design optimizations rely on iterative simulation model evaluations, which prohibit real-time decision making.
Thus, surrogate models have emerged as an effective alternative for fast interactive decision making in lieu of time-consuming simulation models [
15,
16]. Surrogate-based design optimization is a methodology that leverages surrogate models to approximate the behavior of design candidates. To elaborate, surrogate models are mathematical models that are trained using a limited number of data points obtained from computationally expensive simulations or physical experiments. These surrogate models are then used in optimization algorithms to efficiently make predictions, explore design space, and find optimal solutions. Surrogate-based design optimization is advantageous over conventional simulation-based optimization for computational efficiency [
17,
18], the effective exploration of high-dimensional design spaces [
19,
20], multi-objective optimization capabilities [
21,
22], sensitivity analysis [
23], and uncertainty analysis [
24,
25,
26].
The Gaussian process (GP) [
27] is one of the most commonly used surrogate models due to its capability and flexibility for modeling and predicting unknown functions based on observed data [
28]. The concept of a multi-output GP (MOGP) has emerged to further handle problems with multiple outputs [
29,
30]. By extending the concept of a GP, an MOGP is a surrogate modeling technique that allows for the joint modeling of multiple correlated outputs and the simultaneous capture of their dependencies. In particular, by jointly modeling the outputs by estimating the covariance matrix (also known as the correlation function), an MOGP leverages the shared information among the outputs, which reduces the overall modeling complexity. For more mathematical and surrogate modeling details, please refer to
Section 2.3. Meanwhile, deep learning surrogates are going through revolutionary developments and pushing forward cutting-edge research in the design optimization community. Thelen et al. [
31] computed design variable derivatives by analytically linking objective definition, mesh, and geometry, and they demonstrated the derivatives through the multi-fidelity Broyden–Fletcher–Goldfarb–Shannon algorithm for high-dimensional aerodynamic and aeroelastic design optimizations. Tao and Sun [
32] developed a multi-fidelity surrogate-based optimization framework based on deep belief networks. The results for airfoil and wing designs under uncertainty indicated that the multi-fidelity surrogate model performed well by significantly improving optimization efficiency. Renganathan et al. [
33] performed aerodynamic design optimization by fusing deep neural networks (DNN) and GP as one surrogate to incorporate the predictive power of DNN and the confidence interval of GP. Thus, the DNN-GP surrogate enabled relatively high-dimensional Bayesian optimization on aerodynamic design and outperformed adjoint-based optimization in terms of efficiency.
Even with these developments, surrogate modeling still cannot completely address the “curse of dimensionality”, which necessitates dimensionality reduction techniques for further improvements. O’Leary-Roseberry et al. [
34] constructed adaptive residual networks on reduced-dimensional space exploited by principal component analysis to directly predict optimal designs based on design requirements. They successfully demonstrated outstanding performance over full-space feed-forward networks on aerodynamic wing design cases. Meanwhile, a prosperous family of machine learning models, generative adversarial networks (GAN), was developed as generative models [
35] with the feature of automatic design space reduction. The competition between a generator and a discriminator allows them to generate new data that follow the same patterns as the training data. Therefore, when fed with realistic airfoil or wing design shapes, a GAN model enables implicit design space dimensionality reduction by filtering out unrealistic shapes and generating only realistic designs [
36]. Du et al. [
37] developed the B-spline-based generative adversarial networks (BSplineGAN) for intelligent airfoil parameterization with the UIUC airfoil database as training data. BSplineGAN automatically reduced the design space while maintaining sufficient shape variability, which was verified by fitting optimizations to arbitrary UIUC airfoils. They constructed DNN surrogates on the reduced space exploited by BSplineGAN and verified predictive performance on aerodynamic design cases in a fast, interactive manner.
Derived from the original GAN, GAN variants also handle regression tasks. Du and Martins [
38] introduced a novel multi-fidelity surrogate modeling architecture, super-resolution GAN (SRGAN), for predicting airfoil pressure distributions based on low-fidelity counterparts. Specifically, the SRGAN generator generated super-resolution pressure distributions, while the discriminator aimed to distinguish between pressure distributions of the generated super-resolution shapes and the high-resolution data set. Thus, training the generator minimized the difference between the super-resolution shapes and corresponding high-fidelity (i.e., high-resolution) data and maximized the similarity of super-resolution with the high-fidelity data. The results showed that the SRGAN outperformed low-fidelity simulations and direct DNN by accurately capturing the locations and magnitudes of strong high-fidelity shocks. A conditional generative adversarial networks (cGAN) incorporates additional conditioning information into the generative process [
39]. The goal of a cGAN is to generate samples that not only resemble the real data but also adhere to specified conditions. The conditioning information serves as a guide for the generator to generate samples that align with the desired conditions. Aggarwal et al. [
40] conducted experiments demonstrating the effectiveness of a cGAN model for regression problems. The cGAN was successfully demonstrated on a real-world ailerons data set for an F-16 airplane. The cGAN managed to predict the control input on the aircraft’s ailerons, which was described by 40 continuous inputs. Ye et al. [
41] proposed a novel GAN-based regression model (regGAN), adopting a combined loss function on a mean squared error (MSE) and a binary cross-entropy (BC) loss, which showed outstanding predictive performance on frying oil deterioration when provided with time and temperature as input parameters.
Prior research has demonstrated the potential of machine learning surrogates, including GAN variants. The SRGAN and cGAN exhibited predictive potential, while the regGAN realized predictions from physical input space to output space. We demonstrate the regGAN performance in predicting optimal eVTOL takeoff trajectories directly based on design requirements and compare results with the MOGP and cGAN surrogates. We summarize the contribution of this paper as follows. First, we develop and introduce the regGAN surrogate into the takeoff trajectory design area for the first time. Second, we realize the surrogate-based optimal takeoff trajectory design for eVTOL aircraft to fill the lack of research study in this field. Third, we introduce the inverse mapping concept (from design requirements, including design constraints and flight condition parameters, directly into optimal designs) to eVTOL trajectory design for the first time.
We organize the rest of the paper as follows.
Section 2 introduces the optimization framework and simulation models used in this work, followed by MOGP, cGAN, and regGAN setups for surrogate modeling. We demonstrate the regGAN as well as other surrogates on eVTOL optimal takeoff trajectory predictions in
Section 3. We end this paper with conclusions in
Section 4.
3. Results and Discussion
In this section, we formulate the optimization problem and vary the operating parameter bounds to consider various takeoff scenarios. We showcase and compare the optimal takeoff trajectory predictions by MOGP, cGAN, and regGAN surrogates with simulation-based optimal counterparts. The MOGP represents the traditional surrogates (such as polynomial chaos expansions) and shows promising capability in handling multiple outputs. The cGAN makes use of the GAN architecture for regression tasks that share similar principles as the regGAN. Note that the training and testing data sets are generated by simulation-based optimal designs, and the testing accuracy was calculated based on the testing data set. In addition, we do not consider buildings, although the application scenario is set up for UAM, since we mainly focus on the UAM scope of precise transportation/delivery through optimal energy-efficient takeoff designs.
Table 2 formulates the trajectory optimization problem. The objective is to minimize the electrical power consumed to reach a minimum vertical displacement of 305 m and a minimum horizontal speed of 67 m/s. The design requirements (i.e., design constraints and flight condition parameters) include an angle of attack constraint
deg, maximum acceleration magnitude
(
g is gravitational acceleration), propeller-induced velocity factor
, electrical and mechanical loss factor
, and wing size factor
. Note that the
constraint is a concept to consider for passenger comfort for future real-world transportation, although eVTOL aircraft currently have not been widely applied for such tasks yet. The design variables are the time-sequence electrical power (
) and wing angle to vertical (
), both of which have 21 cubic curve control points and a total takeoff time (
). As mentioned in
Section 2, we use the open-source
Dymos package within
OpenMDAO.
Figure 5 shows the optimal takeoff trajectory profiles, which verify the
Dymos package in terms of the time history of design variables and takeoff conditions.
3.1. MOGP Surrogate Modeling
We use 1000 random Latin hypercube sampling (LHS) points as training data for MOGP surrogate modeling and 300 LHS testing samples to verify predictive performance.
Table 3 shows the mean testing accuracy for each design variable group using different kernel and mean functions. The results show that the SE kernel function has better overall predictive performance than the Matérn kernel. The SE-based MOGP predicts the
,
, and
at mean testing accuracies of 99.5%, 92.5%, and 94.7%, respectively. The prediction difference between the constant and linear mean functions is negligible.
We chose an arbitrary case to further reveal the predictive performance of each MOGP surrogate.
Table 4 shows the design requirements for the visualization case.
Table 5 shows the testing accuracy on design variables for MOGP models 1–4. We compare the predicted optimal trajectory profiles by MOGP models 1–4 in
Figure 6, where all models achieve similar predictive accuracies. The testing accuracies, shown in
Table 5, are all over 90%; however, visualization exhibits obvious discrepancies between surrogate-based and simulation-based optimal trajectories. The results indicate that MOGP surrogates intend to use higher power early but not the maximum power. The early power usage results in higher thrust and greater acceleration in the early takeoff phase by MOGP surrogates than by the simulation-based optimal design. The acceleration by the surrogate-based optimal design in the early phase even violates the maximum acceleration constraint. So, we conclude that the MOGP cannot realize a sufficient accuracy level using 1000 training samples.
3.2. cGAN Surrogate Modeling
We implement one single-noise cGAN model and compare the predictive performance of this model against the
Dymos simulation-based optimal trajectory as well as the MOGP predictions shown in
Section 3.1. The model architecture and hyperparameter settings are as mentioned in
Section 2.5.3. Here, we train the model with MSE and BC loss functions separately. To enable the regression feature with the cGAN, we take the mean value of 100 random noise inputs for each set of design requirements.
Table 6 displays the mean and standard deviation of the testing accuracy for cGAN models. The mean of both cGAN models shows that the cGAN has an overall
accuracy. The low standard deviation of the cGAN model represents the robustness of the cGAN model. Both BC-based and MSE-based cGAN surrogates outperform the MOGP in terms of accuracy.
We compare and visualize an arbitrary set of predicted results among MSE-based cGAN, BC-based cGAN, and MOGP surrogates using the same design requirements as
Table 4.
Table 7 presents the testing accuracy of the cGAN models for the visualization case.
Figure 7 shows the optimal takeoff trajectory profiles for the cGAN models, with BC and MSE having similar predictive accuracy. The results indicate that the cGAN outperforms the MOGP’s predictive accuracy in this visualization case. Specifically, the cGAN captures the general trend of ground truth well due to the predictive power of the DNN surrogate as well as the addition of the discriminator to drive prediction matching training data set shapes. However, there are still some unexpected wiggles when we look at the acceleration profile. In addition, the cGAN approximates ground truth based on an average of a number of predictions (100 predictions in this work) over the same set of input parameters (design requirements in this work), where the prediction may vary slightly due to Monte Carlo properties. Hence, we develop and introduce the regGAN surrogate for further predictive improvement as follows.
3.3. regGAN Surrogate Modeling
The predictive performance of the regGAN surrogates is compared against
Dymos simulation-based optimal trajectory predictions as well as MOGP and cGAN surrogates. The same 1000 training samples and 300 testing samples are used for regGAN model training and verification, respectively. The architecture and the hyperparameters are introduced in
Section 2.5.3. We first focus on the regGAN surrogate with a single MSE loss function by setting a zero weight on BC in the combined loss (CL1) (
Table 8). The table shows that the regGAN CL1 model reaches 99.5% accuracy with robust predicted results, which can be recognized by the low standard deviation of testing accuracy. To further compare results, we implement regGAN on the same visualization case in
Table 4.
Table 9 shows that the regGAN has a mean testing accuracy of over 99.5% in the visualization case.
Figure 8 shows that the optimal takeoff trajectory profiles predicted by regGAN match
Dymos results closer than the cGAN model BC and MSE and MOGP model 2, as expected.
To investigate the regGAN surrogate’s performance, we utilized a combination of two loss functions during the model training.
Table 10 shows the loss weights together with the mean and standard deviation of testing accuracy for each regGAN model. The results indicate that all regGAN CL models have over 99.6% mean testing accuracy and high predictive robustness, as revealed by the low standard deviations.
Table 11 shows that the testing accuracy by regGAN models CL2–4 on the visualization case agrees well with the corresponding mean testing accuracy.
Figure 9 shows that all predicted trajectories by regGAN models CL2–4 match the reference profiles, meaning that an accuracy of 99.6% is reliable in lieu of simulation models. regGAN models CL2–4 slightly outperform regGAN model CL1, but the matches towards reference profiles are at a comparable level.
We also conducted a parametric study for different combined loss weights to further investigate regGAN performance. We use a different number of training samples, ranging from 50, 100, 200, 400, 600, 800, to 1000.
Table 12 indicates that using 100 and 200 training samples is able to obtain the above with an overall 97% accuracy. Note that when provided with 50 training samples, CL1 has the lowest mean testing accuracy and highest standard deviation time
prediction, while CL2, CL3, and CL4 have better overall predictive performance. Moreover, regGAN model CL2 (
and
) achieves over 99.5% accuracy starting with 400 training samples, while all the other regGAN surrogates require at least 800 training samples. In addition, regGAN model CL2 consistently shows lower predictive standard deviations (such as 0.251%, 0.257%, and 0.145% on
,
, and
) on the testing data set, further confirming a better and more robust predictive performance. Based on the visualization case, a mean testing accuracy of over 99.5% can be considered sufficiently accurate with negligible differences.
In sum, the MOGP surrogates could not match ground truths or capture the general trend of optimal takeoff trajectories well using 1000 training samples, which may be due to the Gaussian assumption of GP series models. The cGAN surrogates achieve better performance over MOGP using the 1000 training samples, with a closer match towards the general trend of ground truth labels, mainly because of the guidance of the discriminator for generating similar data patterns as the training data set. The regGAN surrogates outperform MOGP and cGAN since regGAN is trained with a combined loss function of MSE and BC adversarial losses. The BC adversarial loss leads regGAN to handle the general trend of observations, while the MSE loss directly drives the match between predictions and observations.
4. Conclusions
In this paper, we investigated surrogate-based optimal takeoff trajectory predictions for electric vertical takeoff and landing (eVTOL) drones within the scope of urban air mobility. We developed the regression generative adversarial network (regGAN), which outperformed the Gaussian process (MOGP) and the conditional generative adversarial network (cGAN) by achieving over 99.5% accuracy. We summarize the main contribution of this work as follows.
First, we implemented the surrogate-based inverse mapping concept into eVTOL optimal trajectory design for the first time. Specifically, surrogate models took design requirements as input and predicted optimal trajectories. We realized fast interactive eVTOL takeoff trajectory design without running any optimizations since the trained surrogates directly predicted optimal trajectories. However, reducing training costs is essential since each training sample requires a simulation-based trajectory design optimization.
Second, we introduced the MOGP, a representative traditional surrogate model, into the eVTOL takeoff trajectory design for rapid predictions. The results showed that the MOGP with a square exponential kernel function could accurately capture the inverse mapping using 1000 training samples. We then implemented a cGAN for eVTOL inverse mapping since cGAN also makes use of the GAN architecture for regression tasks. The results revealed that the cGAN achieved over 98% generalization accuracy in predicting optimal designs using the same 1000 training samples as the MOGP, which means the cGAN outperformed the MOGP on predictive performance (around 92% accuracy). In addition, the visualization case verified that the cGAN could match the general trend of optimal designs well with actual observations from Dymos but missed detailed features.
Third, we introduced the regGAN into takeoff trajectory design for the first time and achieved over 99.6% accuracy in predicting optimal design variables with the same 1000 training samples as the MOGP and cGAN. By varying the weights of different loss functions, the regGAN could achieve over 99.6% accuracy. Moreover, results indicated that the best regGAN surrogate architecture consistently achieved over 99.5% accuracy if provided with 400 or more random training samples. This confirmed the outstanding predictive performance and potential generality of the regGAN.
In future work, we are planning to explore and develop other novel deep learning architectures in regGAN. In addition, the simulation models used in this work are not high-fidelity models but effective for describing the physics; we will increase the fidelity of simulation models in future work, which may lead to a higher training cost for surrogate modeling. Moreover, we will consider takeoff time as another constraint to make sure the total takeoff will not take unreasonable time.