Machine-Learning-Enabled Foil Design Assistant

Kostas, Konstantinos V.; Manousaridou, Maria

doi:10.3390/jmse11071470

Open AccessArticle

Machine-Learning-Enabled Foil Design Assistant

by

Konstantinos V. Kostas

^1,*

and

Maria Manousaridou

²

¹

Department of Mechanical & Aerospace Engineering, School of Engineering and Digital Sciences, Nazarbayev University, Kabanbay Batyr Ave. 53, Astana 010000, Kazakhstan

²

School of Engineering and Digital Sciences, Nazarbayev University, Kabanbay Batyr Ave. 53, Astana 010000, Kazakhstan

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2023, 11(7), 1470; https://doi.org/10.3390/jmse11071470

Submission received: 19 June 2023 / Revised: 10 July 2023 / Accepted: 11 July 2023 / Published: 23 July 2023

(This article belongs to the Special Issue Machine Learning and Modeling for Ship Design)

Download

Browse Figures

Versions Notes

Abstract

:

In this work, supervised Machine Learning (ML) techniques were employed to solve the forward and inverse problems of airfoil and hydrofoil design. The forward problem pertains to the prediction of a foil’s aerodynamic or hydrodynamic performance given its geometric description, whereas the inverse problem calls for the identification of the geometric profile exhibiting a given set of performance indices. This study begins with the consideration of multivariate linear regression as the base approach in addressing the requirements of the two problems, and it then proceeds with the training of a series of Artificial Neural Networks (ANNs) in predicting performance (lift and drag coefficients over a range of angles of attack) and geometric design (foil profiles), which were subsequently compared to the base approach. Two novel components were employed in this study: a high-level parametric model for foil design and geometric moments, which, as we will demonstrate in this work, had a significant beneficial impact on the training and effectiveness of the resulting ANNs. Foil parametric models have been widely used in the pertinent literature for reconstructing, modifying, and representing a wide range of airfoil and hydrofoil profile geometries. The parametric model employed in this work uses a relatively small number of parameters, 17, to describe uniquely and accurately a large dataset of profile shapes. The corresponding design vectors, coupled with the foils’ geometric moments, constitute the training input from the forward ML models. Similarly, performance curves (lift and drag over a range of angles of attack) and their corresponding moments make up the input for the models used in the inverse problem. The effect of various training datasets and training methods in the predictive power of the resulting ANNs was examined in detail. The use of the best-performing ML models is then demonstrated in two relevant design scenarios. The first scenario involved a software application, the Design Foil Assistant, which allows real-time evaluation of foil designs and the identification of designs exhibiting a set of given aerodynamic or hydrodynamic parameters. The second case benchmarked the use of ML-enabled, performance-based design optimization against traditional foil design optimization carried out with classical computational analysis tools. It is demonstrated that a user-friendly real-time design assistant can be easily implemented and deployed with the identified models, whereas significant time savings with adequate accuracy can be achieved when ML tools are employed in design optimization.

Keywords:

parametric models; design assistant; design optimization; neural networks; airfoils; hydrofoils

1. Introduction

Apart from the ship’s hull, a series of other free-form functional surfaces based on foil profiles, such as hydrofoils, stabilizers, rudders, and propeller blades, plays a significant role in naval architecture and marine engineering. Therefore, being in a position to involve their design and shape optimization in the early ship design stages will have an inarguably positive impact on ships’ and marine structures’ performance characteristics, energy efficiency, and environmental footprints. Functional surfaces in ship design possess complex geometries, and therefore, advanced modeling and analysis techniques are needed to appropriately represent, evaluate, and modify their shapes. In this work, we focused on hydrofoil-based surfaces, such as the ones mentioned above, and we aimed to employ a high-level parametric model for their modeling and representation. High-level parametric modeling permits the accurate description of complex and intricate shapes encountered in diverse fields such as naval architecture, ocean engineering, turbo-machinery, and automotive design [1,2]. Parametrization and parametric modeling for foil shapes (airfoils and hydrofoils) have been extensively studied in the pertinent literature. Masters et al. [3,4] presented a comprehensive review and comparison of the different parametrizations employed in the representation of foil profile shapes over the last few years. At the same time, several computational methods exist in the literature for the evaluation of an airfoil’s or hydrofoil’s performance. For example, panel methods have been widely used in the aerodynamic and hydrodynamic analysis of airfoils and hydrofoils. These numerical techniques permit the development of efficient software tools for computing flow characteristics around complex geometries and facilitating the design and optimization processes. The pioneering work of Hess and Smith [5] in the 1960s laid the foundation for panel methods in airfoil analysis. Their approach, known as the Hess–Smith panel method, employed constant-strength doublets as the fundamental unknowns on the panels. Subsequent studies by Drela and Giles [6] introduced the vortex panel method, incorporating vortex singularities on the panels to capture flow features such as leading-edge separation and trailing-edge vortices. Several enhancements have been proposed over the years to improve the accuracy and efficiency of such methods. These include modifications to account for viscous effects, unsteady flow conditions, and dynamic stall phenomena. The development of higher-order panel methods, such as the higher-order vortex panel method and the panel-vortex method, has allowed for more-accurate predictions of flow separation and wake dynamics. Additionally, higher-order boundary element methods (BEMs), based on B-splines or NURBS representations for the geometry and/or the solution, have been presented for the analysis of flow around marine propellers; see, e.g., Lee and Kerwin [7], Kim et al. [8], Gao and Zou [9], along with high-order advanced BEMs for flows around foils and wings based on isogeometric analysis; see, for example, [10,11]. Panel methods have been extensively validated against experimental data and more-advanced computational fluid dynamics (CFD) techniques. Numerous studies have demonstrated the capability of panel methods to accurately predict airfoil and hydrofoil characteristics, including lift, drag, and flow separation, and they have been applied in a wide range of engineering fields, including aircraft design, wind turbine analysis, ship hydrodynamics, and underwater vehicles; see [12,13] for an overview of classical and modern methods. While panel methods offer significant advantages, they have also certain limitations [14,15]. They assume potential flow and neglect viscous effects, making them less-suitable for situations involving high Reynolds numbers or complex flow phenomena. However, for the initial design stages and types of applications we are targeting in this work, they are certainly adequate. Nevertheless, we should note that this is a rather restrictive limitation in our study, which restricts its usage solely to the early design stages. A more-general and -appropriate approach would be to include the results of advanced CFD simulations, which would expand the applicability of our proposed method, as well as allow the ML-enabled techniques to perform to their full potential by obviating the errors and uncertainties inherent to engineering-based CFD simulations and results; see, for example, the relevant discussions in [16,17].

Since both foil parametric modeling and computational analysis of foil performance are topics that have been extensively studied and matured over the last decades, we need to go beyond them to reach our overarching goal of including hydrofoil design and optimization in the early stages of ship design. To this end, we propose the employment of supervised machine learning techniques for the generation of artificial neural network (ANN) models that would be able to address in real-time and with adequate accuracy two design problems: (a) the prediction of the aerodynamic or hydrodynamic performance of a given foil design and (b) the identification and construction of the foil design that exhibits a desired, user-defined performance profile. The ultimate goal here was to cost-effectively explore the design space and easily identify appropriate designs in the conceptual design phase. At the same, we would like to be in a position to perform preliminary design optimization and identify plausible and near-optimal solutions before reaching the detailed analysis design stage. In this work, we assumed that a sufficiently accurate foil parametric model, with a relatively small number of parameters, is appropriate for the representation and construction of foil profiles, while the lift and drag coefficients, over a range of angles of attack, constitute the performance profiles, which are utilized in the early design stages. Therefore, ML model design and training were performed on the basis of the corresponding geometric and performance features with the resulting ANNs being subsequently utilized in the development of a “Foil Design Assistant” application, which allows designers to easily and effectively explore a vast design space, as is presented and discussed in Section 3. Finally, the same models were used to conduct performance-based design optimization, and as demonstrated in the same section, they can be easily employed in the early design stages with results that are adequate and comparable to classical performance-based optimization procedures.

Before proceeding with a detailed description of the proposed approach, we will briefly present here the current status regarding the employment of ML models and techniques in foil design and performance optimization and identify the gaps in the pertinent literature that we aspire to cover. Machine learning techniques have gained significant attention in recent years for their potential to enhance the design and performance prediction of airfoils and hydrofoils. ML models, such as neural networks and support vector machines, coupled with global optimization algorithms, have been successfully employed for airfoil and hydrofoil design optimization. For example, Hui et al. [18] presented a Convolutional Neural Network (CNN) to predict the pressure distribution of airfoils for subsonic speeds. In their work, they employed the signed distance function to represent foil profiles, and the developed CNN was trained on these pictorial representations. In addition, surrogate modeling, deep learning techniques, and other approaches have been also used; see [19,20] for some recent relevant works and [21] for a state-of-the-art review and current trends in aerodynamic shape optimization and ML. Additionally and although the pertinent literature is less extensive when considering ML implementations for the inverse problem, there has been a relatively small number of research works aiming to predict the shape of foils by the given performance data. For example, Kharal and Saleem [22] presented a method to predict the airfoil shape when its pressure distribution is given. In their work, they employed a parametric model based on PARSEC and Bézier-PARSEC parametrizations (see [23,24]) to represent shapes with a reduced set of parameters. This selection bears some similarity to our proposed approach, although we avoided the use of pressure distributions as the input data since they can be hardly acquired or described at the early design stages. Finally, in the context of high-dimensional simulation-driven functional-surface-optimization problems, designers can greatly benefit from the application of design-space-dimensionality-reduction methods. In such approaches, one assesses the variability presented within the given design space and subsequently reduces its dimensionality by the employment of pertinent methods. For example, Diez et al. [25] introduced a method based on the Karhunen–Loéve expansion to assess shape modification variability and establish a latent space with a reduced dimension for the shape modification vector of ship hulls; see also [26], which built upon this approach for a wider range of functional surface designs. Generative Adversarial Networks (GANs) have been also used in the pertinent literature to model the forms of airfoils and hydrofoils. For example, Chen et al. in [27] demonstrated the effectiveness of their approach in enhancing the optimization results when compared to classical foil parameterization techniques, and Du et al. in [28] combined B-spline shape representations with GANs to construct a generative tool for foil design. However, the lack of prior knowledge regarding the proper number of modes in dimension reduction strategies may lead to inappropriate results when only a low number of modes is incorporated, while, on the other hand, using too many modes undermines the benefits gained by dimensionality reduction. Li et al. [29] developed a set of marginal functions to represent the link between dominant foil modes and higher-order modes derived from the University of Illinois Urbana-Champaign (UIUC) airfoils database [30]. These functions effectively eliminated unwanted airfoils from the design space. However, defining acceptable constraint boundaries ahead of time is frequently difficult, and there was no guarantee that the margin-based validity functions did not also eliminate perfectly valid foil designs by accident. Therefore, such methods are not without drawbacks since they may fail to fully maintain shape complexity and geometric structure, resulting in reduced-dimension subspaces that lack the ability to generate sufficiently diverse and/or valid shapes during shape optimization. In summary, ML-based methods can explore vast design spaces, considering multiple parameters and constraints, to identify optimal configurations that maximize performance metrics such as the lift-to-drag ratio, along with other performance criteria and design constraints. Several studies, as the ones mentioned above, have demonstrated improved designs compared to traditional methods, enabling the discovery of novel and efficient geometries. By training on large datasets of experimental or computational data, ML algorithms can extract and learn complex relationships between design parameters and performance metrics. These models can then provide quick estimations of lift, drag, pressure distribution, or flow separation behavior, eliminating the need for costly and time-consuming wind tunnel experiments or numerical simulations. Studies have explored the integration of ML methods with traditional approaches, such as panel methods or CFD, to combine the strengths of both techniques. ML models can be even trained on CFD data to accelerate simulations or replace computationally expensive parts of the analysis, while still capturing the essential physics.

This paper is divided in two main sections: Section 2, where the employed tools and methods are discussed, and Section 3, containing the achieved results and comparisons. Specifically, we begin by describing the parametric model used for the representation of foil designs (see Section 2.1), followed by the presentation of the foil performance evaluation procedure in Section 2.2. We then discuss the geometric moments (see Section 2.3) that will be used to augment the training datasets, and we proceed with their construction in Section 2.4. Section 2 concludes with the description of the ML models’ training in Section 2.5. The constructed models’ comparison is presented and discussed in Section 3.1, followed by the presentation of the developed “Foil Design Assistant” application in Section 3.2. Section 3 concludes with a comparison of performance-based design optimization conducted with and without regression models in Section 3.3. The paper concludes with a summary of the achieved results and a brief discussion of future research directions in Section 4.

2. Materials and Methods

Although several ML-enabled approaches have been widely used in the pertinent literature to predict foil performance and employ these predictions in design optimization, some characteristics of the works presented in Section 1 require extensively large datasets or equally extensive precomputed results from advanced computational approaches, which adversely affect their application in the early design stages. The four major hindrances we would like to address in this work relate to:

The lengthy design vectors or matrices (high-dimensional initial design spaces) that are commonly found in dimensionality reduction approaches and ML models based on pattern matching and foil shape discretizations;
The existence of a non-negligible percentage of invalid designs in latent spaces produced by dimensionality-reduction techniques;
The requirements of huge training labeled datasets;
The low-level performance characteristics, such as pressure distributions, which are not convenient inputs during the early design stages.

These were addressed by employing a low-dimensional, robust, and accurate foil parametric model for the foil geometry description coupled with simple performance indicators and the lift and drag profiles over a range of angles of attack, which permit a user-friendly and intuitive definition of geometry or performance requirements by an engineer even at the early design stages. An obvious drawback in this setup is the lack of meta-information commonly used in advanced physics-informed neural networks (PINNs), which embed foil physics and are able to more-accurately predict foil performance and, consequently, enhance design optimization; see, for example, some recent relevant PINNs used in foil performance prediction in [31,32]. To address this last issue, we opted to include the integral characteristics (geometric moments) of foil profiles, which are commonly correlated with the performance criteria and can, therefore, provide a physics-informed flavor without requiring expensive calculations, as was recently demonstrated by Shah et al. in [26].

The employed regression models require a rich space of foil profile designs,

D

, with the corresponding label data, i.e., performance profiles, residing in a second set,

P

. The former set in this work corresponds to the foil profile geometric descriptions, and it comprises the design vectors,

v \in V \subseteq R^{17}

, which correspond to the parameters of the selected parametric model. This initial design vector was augmented, as we will describe in the sequel, by an additional vector of foil geometric moments. With regard to the latter set,

P

contains the matching foil performance profiles, which are mainly encoded with a 18-component vector. The first nine components are the values of the lift coefficient,

C_{L}

, for nine angles of attack, whereas the remaining nine components correspond to the drag coefficient,

C_{D}

, for the same angles of attack. Once again, moments were also used to augment this second vector. For all models considered, angles between 0

^{\circ}

and 8

^{\circ}

with a 1

^{\circ}

step were used for the

C_{L}

and

C_{D}

calculations. Note that these two sets were used as the input or output training sets depending on whether we were modeling the forward or inverse design problem.

2.1. Foil Parametric Model

One of the main objectives in this work was to employ a foil parametric model that can robustly and accurately describe a large family of designs with a small set of parameters. This was performed to avoid well-known problems with high-dimensional geometric descriptions and the effect of foil design parametrizations in design space representation and ML models’ training; see [18]. A hydrofoil parametric model, firstly introduced in [10] and later expanded in [33] to accurately represent a wider range of foil designs, constituted the basis of the parametric model used in this work. As mentioned in the introductory section, a wide variety of methods have been proposed in the pertinent literature for the parametrization of airfoils and hydrofoils, and the works of Masters et al. [3,4] presented a comprehensive review and extensive comparison for a big number of them. As reported in [33], the 17-parameter parametric model we adopted here permits the accurate representation, within the Kulfan tolerance [34,35], of 100% of the NACA dataset in [4] and more than 80% of the foil designs in the UIUC database [30]. As a side note, we should mention here that the calculations were conducted with the modified Kulfan tolerance proposed in [4], which involves an inflation of the computed error and, therefore, is stricter than the original Kulfan tolerance. The typical wind tunnel tolerance, also known as the Kulfan tolerance, corresponds to the maximum allowable geometric deviation between two foil designs, which does not alter their performance evaluation in wind tunnel experiments. Specifically, the initial Kulfan tolerance is defined as

z^{e r r o r} < \{\begin{matrix} 4 \times 10^{- 4} & if \frac{x}{c} \leq 0.2 \\ 8 \times 10^{- 4} & if \frac{x}{c} > 0.2 \end{matrix},

where c denotes the foil’s chord length and x the longitudinal position of the measured geometric deviation. This definition was modified in [4] to

{\hat{z}}^{e r r o r} < \{\begin{matrix} 2 \times z^{e r r o r} & if \frac{x}{c} \leq 0.2 \\ z^{e r r o r} & if \frac{x}{c} > 0.2 \end{matrix}

(1)

resulting in a single requirement, i.e.,

{\hat{z}}^{e r r o r} < 8 \times 10^{- 4}

. Furthermore, the employed parametric model was specifically designed to accommodate the specific needs of design optimization, i.e., relatively small number of parameters, encoded in the so-called design or shape vector, along with a high accuracy in the representation of the profile shapes, which coincides with the requirements for the ML model training we posed in this work. Finally, the parametric model’s robustness stems from the fact that all control point movement constraints are embedded in the construction of the parametric model with the parameters being nondimensionalized in

[0, 1]

and all combinations of the parameter values in the unit interval corresponding to valid profile designs with no self-intersections, unwanted oscillations, or any other invalid geometric feature.

The employed parametric model generates foil instances with a shape vector of 17 parameters, with each instance being represented by a cubic NURBS curve (see [36]), defined as follows:

C (t; v) = (x (t; v), y (t; v)) : = \sum_{i = 0}^{12} b_{i} (v) R_{i, 4} (t), t \in [t_{3}, t_{13}],

(2)

where

{R_{i, 4} (u)}_{i = 0}^{12}

is the set of rational B-spline bases of order 4 (degree 3) defined over the knot sequence,

{t_{0}, t_{1}, t_{2}, . . . . ., t_{16}}

and possessing non-negative weights

w_{i}

,

i = 0, 1, 2, \dots, 12

, while

b_{i} (v)

are the corresponding control points determined by the parameter vector

v

. The parameters’ definition is graphically presented in Figure 1 with their descriptions and types of effect on the foil shape, i.e., global or local, being summarized in Table 1. Finally, the shape modification effect of each parameter on the foil profile is individually depicted for each parameter in Figure 2. The solid thick line in Figure 2 corresponds to the foil instance with

v_{i} = 0.5, i = 1, \dots, 15, v_{16} = 0.001, v_{17} = - 0.001

, whereas the red-colored foil instances correspond to varying parameter values,

v_{i} = 0, 0.1, 0.2, \dots, 1

, for each of the first fifteen parameters. The interested reader may refer to [33] for more details on the construction and performance of the adopted parametric model.

The incorporation of the parametric model in the development and training of the ANNs used in this work has additional benefits in the behavior of these ML models when predicting the foil shape or its performance. Specifically, as we will demonstrate in Section 3, the parametric model limits the user to producing foil designs that are both valid and within the boundaries of the employed design space, which safeguards us against highly erroneous predictions, which may occur from ML methods [37,38], as well as a design exploration that goes far beyond the training set boundaries [39]. Moreover, it offers an easy way of checking the validity of shapes returned by the inverse design model, since its output must be a valid parametric model shape vector. Another important positive impact coming from the employment of the parametric model is the indirect dimensionality reduction, since we can describe complex shapes with a small set of parameters, and even more importantly, we can do that in a way that does not generate invalid designs in the reduced-dimension design space. This is a clear advantage of our approach, since common dimensionality-reduction techniques cannot eliminate the existence of invalid designs in the latent spaces; see the relevant discussions in [26,40].

2.2. Foil Performance Analysis

The lift and drag coefficients, for each design instance, were estimated with the aid of the well-known XFOIL (https://web.mit.edu/drela/Public/web/xfoil/ accessed on 10 May 2023) computational package, which is a popular tool used in assessing the performance of subsonic airfoils and hydrofoils. XFOIL employs a fast panel method with a fully coupled viscous–inviscid interaction method used in the ISES code developed by Drela and Giles; see [6,41]. XFOIL requires a polygonal approximation of the foil profile, and thus, for each foil design in our dataset, we generated, through the employed parametric model, 200 points that closely approximated the foil’s shape. Specifically, in most cases, 160 points, with a curvature-based distribution that minimizes the distance between the actual curve and the polygonal approximation, sufficed to produce convergent and accurate performance results. However, we decided to increase that number to 200 to avoid inaccuracies and potential problems with more-complex foil profile geometries. A matlab (https://www.mathworks.com/products/matlab.html accessed on 10 May 2023) script was generated to read the design vectors for all foil designs in the dataset, save a polygon approximation, and finally, automatically call XFOIL to compute the lift and drag coefficients for all angles of attack in

{0, 1, 2, \dots, 8} °

with a Reynolds number equal to

Re = 8 \times 10^{6}

. We should also note here that we increased the viscous solution iteration limit (ITER) in XFOIL so that up to 50 Newton iterations were executed when assessing convergence. This was a significant step, which helped us avoid missing results and non-converging calculations with some more-intricate foil designs. A sample XFOIL output for the calculations described here is depicted in Figure 3. As a final note, we should also add here that a small number of performance calculations (less than 3% of the foil designs in the database) failed for one or two angles of attack. In such cases, we performed a cubic spline interpolation with the remaining

C_{L}

and

C_{D}

values and estimated the missing value(s) with the resulting spline curve.

2.3. Geometric Moments

The geometry and performance information captured by the parameters of the parametric model and the lift/drag coefficients may not be sufficient to fully capture the relation between design and performance, which can result in inadequate regression or generally inaccurate prediction models. In the area of performance-based design optimization, it is not uncommon to embed, in the design vector, information about the quantities of interest and/or incorporate knowledge about the underlying physics in the ML model, as, for example, in the case of physics-informed neural networks; see [31,42,43]. However, these approaches tend to be computationally intensive and require detailed analysis, which imposes an additional computational burden, which is especially unwelcome in the early design stages. Motivated by the work of Shah et al. [26], we attempted to enrich the design and performance description with the integral properties of the foil shape geometry and performance profiles, respectively. Specifically, for the case of the foil’s shape, the geometric information, encoded in the parameters’ vector, was coupled with foil shape moments, leading to an enhanced design vector, which, as will be demonstrated, can offer a more-accurate description of the shape’s inherent characteristics and its relation to the performance metrics. Thus, the incorporation of geometric moments acts as an indirect way of incorporating quantities that are correlated to performance criteria in a cost-effective manner. In other words, while most performance criteria (e.g., lift or drag) require generally costly simulations, these quantities are generally dependent on geometric moments, which can be easily computed with a significantly lower computational cost.

Geometric moments for performance profiles do not necessarily incorporate any inherent information about the foil design, but they can still help the model better predict relations between performance characteristics and the corresponding foil geometry. Therefore, the geometric moments of the lift and drag coefficient profiles were also considered for the inverse design problem.

Assuming now that

Ω

corresponds to the 2D domain bounded by the foil profile curve

C (t; v) = \partial Ω

, we may compute its

r^{t h}

-,

r = p + q

, order moments by:

\begin{matrix} M_{(r)} = M_{p, q} = & \int_{- \infty}^{+ \infty} \int_{- \infty}^{+ \infty} x^{p} y^{q} ρ (x, y) d Ω, \\ p, q \in {0, 1, 2, \dots}, \end{matrix}

(3)

where

ρ (x, y) = 1

when

(x, y) \in Ω

and 0 otherwise. Obviously, the 2D domain area corresponds to the 0^th-order moment, i.e.,

| Ω | = M_{(0)} = M_{0, 0}

, and its centroid can be calculated as

c = (c_{x}, c_{y}) = \frac{M_{(1)}}{M_{(0)}} = (\frac{M_{1, 0}}{M_{0, 0}}, \frac{M_{0, 1}}{M_{0, 0}})

. A proper combination of geometry and its moments allows the construction of a vector that captures the shape’s inherent characteristics and can serve as its unique identifier, the so-called signature vector in [44].

For the foil profile case, we may compute the Geometric Moments (GMs) with the two-dimensional version of the divergence theorem, i.e., Green’s theorem. Specifically, using Green’s theorem and the representation of the foil profile in (2), we may equivalently write:

\begin{matrix} M_{0, 0} = \frac{1}{2} \int_{t_{k - 1}}^{t_{n + 1}} x (t) \frac{d y (t)}{d t} - y (t) \frac{d y (t)}{d t} d t, \\ M_{1, 0} = \frac{1}{2} \int_{t_{k - 1}}^{t_{n + 1}} x {(t)}^{2} \frac{d y (t)}{d t} d t, & M_{0, 1} = - \frac{1}{2} \int_{t_{k - 1}}^{t_{n + 1}} y {(t)}^{2} \frac{d x (t)}{d t} d t, \\ M_{2, 0} = \frac{1}{3} \int_{t_{k - 1}}^{t_{n + 1}} x {(t)}^{3} \frac{d y (t)}{d t} d t, & M_{0, 2} = - \frac{1}{3} \int_{t_{k - 1}}^{t_{n + 1}} y {(t)}^{3} \frac{d x (t)}{d t} d t, \\ M_{1, 1} = \frac{1}{2} \int_{t_{k - 1}}^{t_{n + 1}} x {(t)}^{2} y (t) \frac{d y (t)}{d t} d t, & M_{1, 1} = - \frac{1}{2} \int_{t_{k - 1}}^{t_{n + 1}} y {(t)}^{2} x (t) \frac{d x (t)}{d t} d t, \end{matrix}

(4)

and so on.

For the lift and drag coefficient profiles, we constructed a spline interpolating the computed values, and we can then proceed with the calculation of the corresponding moments under the interpolant

y = f (x)

, where x corresponds to the angle of attack and y to the coefficient values. Note that we used the signed area here, since we may have foil shapes with negative lift coefficients, and therefore, using the signed area would be a more-meaningful performance feature to differentiate between positive- and negative-lift-producing designs. Hence, for the performance profiles, we may calculate the moments as follows:

\begin{matrix} M_{0, 0} = \int_{0}^{8} f (x) d x, M_{1, 0} = \int_{0}^{8} x y d x, M_{0, 1} = \int_{0}^{8} \frac{y^{2}}{2} d x, \\ M_{2, 0} = \int_{0}^{8} x^{2} y d x, M_{0, 2} = \int_{0}^{8} \frac{y^{3}}{3} d x, M_{1, 1} = \int_{0}^{8} x \frac{y^{2}}{2} d x, \end{matrix}

(5)

and so on.

The performance criteria considered in this work, i.e., the lift and drag coefficients, are invariant to translation and uniform scaling, whereas the moments described above change value with Translation (T) and Scaling (S). Therefore, in the general case, we would need to employ TS-invariant Geometric Moments (TS-iGM), as described in Xu and Li in [45] where the authors derived Translation, Scale, and Rotation invariant Geometric Moments (TSR-iGM). However, in our dataset, we only considered foil designs with chords aligned with the longitudinal axis (x-axis), fixed placement, and normalization to [0,1], which permitted us to use the moments derived above without introducing any biases; therefore, we may skip the use of moment invariants.

2.4. Training Datasets

As previously mentioned, the basis for the construction of the foil design space, and subsequently the resulting training datasets, was the publicly available UIUC database of foil designs; see [30]. This database comprises approximately 1500 foil profiles encoded via point sets of varying lengths, i.e., the number of points on the profile. The first step in the construction of the design dataset is a preprocessing step and pertains to ordering points to one common orientation and normalization of the foil shapes so that all of them lie on the x-axis and have a chord length equal to one unit. Subsequently, we determined the parameter values of the foil parametric model, which reconstruct each foil profile within the modified Kulfan tolerance; see Equation (1). This was done by solving the following minimization problem:

\begin{matrix} Find v^{*} : & ∥ p_{i}^{(κ)} - C (t; v^{*}) ∥_{\infty} = min {∥ p_{i} - C (t; v) ∥}_{\infty}, i = 1, \dots, m^{(κ)} \\ subject to : & 0 \leq v_{j} \leq 1, j = 1, \dots, 15 \\ v_{16} = {TE}_{1} = p_{1, y}^{(κ)}, v_{17} = {TE}_{2} = p_{m^{(κ)}, y}^{(κ)}, \\ ∥ p_{i}^{(κ)} - C (t; v^{*}) ∥_{\infty} \leq 8 \times 10^{- 4}, \end{matrix}

(6)

where

m^{(κ)}

is the number of points,

{p_{i}^{(κ)}}_{i = 1}^{m^{(κ)}}

, defining the

κ

-th foil in the UIUC database, and

p_{1, y}^{(κ)}, p_{m^{(κ)}, y}^{(κ)}

are the ordinates of the first and last foil point (Recall that a preprocessing step was performed to ensure that all foil points were stored in a counterclockwise orientation with the first point being the trailing edge point of the upper foil side), respectively.

We solved the minimization problems appearing in Equation (6) using a sequential quadratic programming technique implemented in matlab with an initial design vector resulting from a rough estimation of the design parameters based on the input profile points. This resulted in approximately 1300 design vectors that approximated the corresponding number of foils within the modified Kulfan tolerance. The identified parameter vectors were saved to form the initial design dataset

D_{0}

. If we now performed a Principal Component Analysis (PCA) [46] of

D_{0}

and use the first two principal components to visualize the designs in

D_{0}

, we obtained a picture with isolated designs or clusters of designs with relatively big gaps among them. Although this did not directly correspond to the distance between the high-dimensional design vectors, it provided us with an indicative picture of the distribution of the foil designs in the design space. Additionally, given the number of parameters and performance targets we planned to work with, the number of design vectors was not sufficiently large to ensure a good fit by standard regression or neural network models. To address these issues, we generated 10 design variations in the neighborhood of each design in

D_{0}

by varying the identified parameter values

\pm 10 %

around their initial value. In this way, we obtained a significantly larger dataset with fewer gaps between designs, as can been seen by the blue dots in Figure 4. This figure was produced by performing PCA of the augmented set and visualizing both the initial and additional designs (variations) in the same picture with the help of the first two principal components. As one may easily observe in Figure 4, the red circles, corresponding to

D_{0}

, are isolated or form clusters of designs with relatively big distances between them, whereas the included variations, blue dots, fill in most gaps and construct a significantly richer dataset that we denote as

D_{1}

. The next and final step in the process of design dataset construction was performed with the help of Equation (4) and pertained to the calculation of moments,

M_{(0)}

to

M_{(2)}

, for each design in

D_{1}

. These moments were subsequently added to form the final version of

D_{1}

.

The initial performance dataset,

P_{0}

, was generated by assessing the performance of each design in

D_{1}

with XFOIL, as described in Section 2.2, and storing the resulting

C_{L}

and

C_{D}

values in

P_{0}

. As in the case of the design dataset, the performance dataset was augmented with the moments for the performance curves,

C_{L}

and

C_{D}

, using Equation (5), producing the final performance dataset

P_{1}

. We should note here that a small number of designs failed to produce complete performance profiles (cases with more than a couple of

C_{L}

and/or

C_{D}

values missing), and they were excluded from both

D_{1}

and

P_{1}

. Therefore, the final datasets, which were used in the regression and training of the neural network models, comprise 12,615 designs with their design vectors and moments in

D_{1}

and the corresponding number of performance and performance moment values in

P_{1}

. In summary, the matrix structures corresponding to

D_{1}

and

P_{1}

are depicted in Table 2. We should also note here that we posed certain limitations to the type and number of performance inputs. If we were to use different values or even a different number of angles of attack for each foil design, we would probably need to revert to a Recurrent Neural Network (RNN), which would be able to handle variable-length sequences of inputs. This would become even more important if we were to consider more nuanced performance profiles, including pressure distributions or other low-level performance metrics.

2.5. Regression Models

For both cases, i.e., forward and inverse design problems, we considered 3 different regression models, which were compared with a range of different training datasets:

A Multivariate Linear Regression (MLR) [47,48], which constitutes our basis (reference) model;
Two families of feedforward Artificial Neural Networks (ANNs) trained with (a) Levenberg–Marquardt (LM) back-propagation, and (b) LM with Bayesian Regularization (BR); see Table 3 for more details.

As a reference model, for both problems, we employed the multivariate normal regression of either the 18-dimensional performance responses (

P_{0}

) on the design matrix corresponding to the 17 design parameters (

D_{0}

), or vice versa for the inverse design problem. The model is formulated as

y_{i} = X_{i} β + e_{i}, i = 1, \dots, n,

where n is the number of samples, 12,615 in our case,

y_{i}

is the 18- or 17-dimensional vector of responses,

X_{i}

is the design vector of predictor variables, and

β

corresponds to the resulting regression coefficients with

e_{i}

denoting the vector of error terms. Note that we added a column of ones in matrix

X

, formed by the 12,615

X_{i}

vectors, to include constant terms (intercepts) in the regression. The root-mean-squared error for the prediction of the performance values with MLR is reported in Table 4, whereas the corresponding MLR error for the inverse problem is found in Table 5.

The remaining models were two-layer feedforward neural networks with an input layer, a hidden layer, and an output layer. For the hidden layers, hyperbolic tangent sigmoid activation functions were used, whereas linear transfer functions were employed in the output layer, as we performed regression in all cases. We should also note here that positive linear (poslin, also known as ReLU—Rectified Linear Unit in the pertinent literature) and logistic sigmoid transfer functions were tested in the hidden layers with inferior results in all tested cases. For the majority of the trained models, we observed that 20 neurons in the hidden layers produced good results, but for some of the employed training datasets, this number was raised to 25 or 30, as can be seen in Table 3. Specifically, we performed model training tests starting with 10 neurons, in the hidden layers, and went up to 70 neurons with a step of 5 neurons. For all cases, the results showed an improvement from 10 to 20 or 30 neurons, followed by a decline in performance after 30 or 35 neurons. As mentioned above, two training approaches were used, LM back-propagation and LM with Bayesian regularization, with “LM” included in the model name in the former case and “BR” for the latter one; see Table 3. Additionally, the scaled conjugate gradient algorithm was used in the training, but we did not include it in our comparisons, as it was outperformed across all tests by the other two training methods.

LM is a standard approach used in training ANNs, and the following steps are applied in its implementation:

Network weights’ and biases’ initialization.
Input data are fed into the network, and the activation function values for each neuron in the hidden and output layers are calculated.
The error between the network’s predicted output and the actual target output is computed using a suitable error metric, the mean-squared error (MSE) in our case.
The error is then propagated backwards to adjust the weights and biases. For this, the Levenberg–Marquardt algorithm is used to optimize the network parameters. The LM algorithm combines the benefits of fast convergence from the Gauss–Newton algorithm and the robustness of steepest descent. It dynamically adjusts a parameter known as the damping factor during training, controlling the step sizes taken in the weight space.
The weights and biases of the network are updated using LM.
Steps 2 to 5 are repeated till convergence has been achieved (or when a maximum number of iterations has been reached).

Bayesian regularization updates the weight and bias values based on the same LM optimization algorithm. It then minimizes a linear combination of squared errors and weights and determines the correct combination so as to produce a network that generalizes well, i.e., does not over-fit the given training data. This was performed by extracting a validation set from the training data and monitoring the model’s performance to prevent over-fitting. Bayes’ theorem was used to combine the prior distributions with the likelihood to obtain the posterior distributions. The linear combination of the squared errors and weights was modified accordingly to achieve good generalization qualities. The interested reader may refer to [49,50] for a more-detailed description of the Bayesian regularization process. We should also note here that an additional test set, separate from the validation set mentioned above and corresponding to 15% of the datasets, was set aside, for all models, and used at the end to confirm the generalization qualities of the trained models. With regard to the training set employed in LM training, a 5-fold cross-validation was utilized to avoid over-fitting. In all cases presented in this work, coefficients of determination,

R^{2}

, exceeding 90% for both the training and test sets were achieved. Note that the value reported here for the training set corresponds to the average value from the 5-fold cross-validation.

Before we proceed with the presentation of the achieved results, we need to also briefly discuss the use of different datasets in the training of the models included in Table 3. Specifically, we used a wide range of combinations, starting from the simplest,

D_{0}

and

P_{0}

, dataset pair, to the full datasets,

D_{1}

and

P_{1}

, described in Table 2. This was performed to assess the effect of full or partial moments’ inclusion in either the design or performance datasets along with the effect of transferring components from the input to the output sets and vice versa. The area of the foil profile is one such component example, as it can be considered either as the input feature or the output target. Table 3 presents a summary of the trained models along with the composition of the input and output sets that were used for training.

3. Results and Discussion

We begin our presentation with the achieved results and prediction accuracy for the regression models described in the previous section, and we then move on to present two applications employing the best-performing ML models in the preliminary foil design and shape optimization. The first application was a “foil design assistant” software prototype, which can be used by a designer, in the early design stages, to interactively assess the performance of the foil designs he/she intuitively modifies or determine the foil profile that achieves the performance characteristics that he/she may set. The second application demonstrates the use of the developed ML models in foil shape optimization and compares them to the corresponding well-established performance-based shape optimization procedures.

3.1. Regression and ML Models’ Performance

Table 4 summarizes the achieved Root-Mean-Squared Error values (RMSEs) and their normalized counterparts (nRMSEs) for all forward regression models, whereas the corresponding summary for the inverse case is reported in Table 5. The calculation of the RMSE and nRMSE was performed for each model and the performance coefficients, as shown below:

\begin{matrix} RMSE = & \sqrt{\frac{\sum_{i = 1}^{n} {(y_{i} - y_{i}^{*})}^{2}}{n}}, \end{matrix}

(7)

\begin{matrix} nRMSE = & \frac{\sqrt{\frac{\sum_{i = 1}^{n} {(y_{i} - y_{i}^{*})}^{2}}{n}}}{y_{max} - y_{min}}, \end{matrix}

(8)

where

y_{i}

correspond to the dataset values,

y_{i}^{*}

to the model-predicted ones, and n to the number of components times the samples in the dataset. One may easily observe from Table 4 that the best-performing model for the case of

C_{L}

prediction was the model trained on the full

D_{1}

dataset and the

C_{L}

-based part of the

P_{1}

dataset (

C_{L}

values and moments) with foil areas added also to it. Moreover, the reference model, i.e., multivariate linear regression, was clearly outperformed by all trained ANNs. For the case of

C_{D}

prediction, we obtained slightly better results if we excluded the second moments from the input dataset, as can be seen from the same table. However, if we observe Figure 5a,b, which depict the maximum errors in the predictions, we can argue that the best performance was achieved by the “M0-2_BR” model for both the

C_{L}

and

C_{D}

predictions, as it minimized the maximum error exhibited in the prediction of both performance metrics. Another interesting observation, from the same histograms, is that both the

C_{L}

and

C_{D}

predictions improved as we inserted more moments in the input descriptors, therefore confirming our hypothesis that the inclusion of geometric moments improves the accuracy of the trained models as it aids in better capturing the relation between design and performance.

However, our results were not that clear when considering the performance of the studied regression models in the inverse design problem. For the inverse problem case,

D_{1}

and

P_{1}

swapped places, with the former becoming the output set and the latter taking the place of the input set. The first observation pertained to the performance of the reference model, which was only slightly worse when compared to the ANNs, as one may observe in Table 5. Furthermore, the shape prediction accuracy, at least when measured with the maximum deviation in the parameter values, does not look very promising in the histogram in Figure 6a. However, this finding was somewhat misleading, as for example, big differences in a parameter such as the tangent angle at the leading edge (see Table 1 and Figure 2) may result in only slight foil shape differences. If we now take into account such differences in the parameters’ effects, we may use the mean error instead, shown in Figure 6b, which was more representative of the actual foil shape prediction’s accuracy. As in the forward design problem, the inclusion of moments (this time, the moments of the performance curves were considered) had a beneficial effect on the prediction accuracy, but it was not as pronounced as it was for the forward design case. This can be justified by the fact that there was no direct physical correlation between the performance moments and the shape of a foil. Nevertheless, their inclusion had a non-negligible effect in the training process, and they, therefore, helped us achieve a better “discriminatory ability” of the corresponding ANNs.

3.2. Foil Design Assistant

The first application of the identified best-performing ANNs was the design and implementation of a software tool prototype, “FoilDesignAssistant”, which aims to aid designers, in the early design stages, to quickly assess the performance of a wide range of foil shapes and/or identify the foil shape that possesses the performance criteria set by them. The objectives set for this tool were as follows:

Provide an interactive and intuitive interface for modifying a foil’s profile.
Assess in real-time the foil’s performance characteristics.
Compare the initial and modified design with respect to their performance characteristics.
Provide an interactive and intuitive interface for defining a foil’s performance characteristics.
Determine in real-time the foil profile that exhibited the user-defined foil performance characteristics.
Limit the user input to information that is commonly available at the early design stages.

Figure 7 depicts the interface of the prototype software application developed in matlab. The tool aids the designer in both foil design cases, i.e., forward and inverse problems, using the ML models that were previously identified for their fidelity, i.e., the pair of M0-2_BR ANNs for the prediction of

C_{L}

and

C_{D}

and M0, M0-2_

C_{L} and C_{D}

(M0-2

C_{L} and C_{D}

) for the foil design parameters’ identification. The mode of operation is determined by the “design mode” check button depicted below the “Foil” design window in Figure 7. When this button is checked, the forward problem is considered, whereas when it is unchecked, the model for the inverse design problem is employed. The interface comprises three graph components: (a) the “Foil” graph (upper left corner in Figure 7, (b) the “Lift Coefficient” graph (upper right side in Figure 7), and (c) the “Drag Coefficient” graph below it in the same figure. Below the “Foil” graph, a series of sliders corresponding to the parameters of the employed parametric model are presented, whereas the input fields for defining the required performance characteristics are placed below the performance graphs at the right side. Additionally, a drop-list button below the “Drag Coefficient” graph is used to initialize the current design into one of the well-known foil designs (e.g., NACA family profiles and others) and an export functionality at the lower right part of the interface. The export function allows saving the current design (blue-colored design) as a NURBS curve or a point set with a user-selected number of points and discretization type. We subsequently describe the software tool’s operation in the two modes:

Forward design mode: The user interactively modifies the foil design by adjusting the parametric model parameter values via the sliders on the interface window. These parameters correspond to the already-described parameters in Table 1. Whenever a user modifies a value, the current design (blue curve) is updated in all graphs in real-time. At the same time, the initial design (black curve) is depicted in all graphs for facilitating the comparisons. Note here that the user only specifies the foil parametric model parameters, while all moments required as the input to the M0-2_BR model are automatically calculated via the parametric model embedded in the application.
Inverse design mode: In this mode, the user may specify one or all of the input fields residing on the right side of the interface. Specifically, CL0 and CD0 are used to specify the desired values of $C_{L}$ and $C_{D}$ at a 0°angle of attack, while CL8 and CD8 specify the desired values of the same coefficients at an 8°angle of attack for the sought-for foil profile. Furthermore, an intermediate performance point at the xCLint angle for $C_{L}$ and xCDint for $C_{D}$ may by provided, with the corresponding coefficient values being specified in CLint and CDint, respectively. Finally, the user may also specify the desired foil area using the “Foil area” input field. As soon as the user clicks on the “Find Design” button, two splines using the specified points for the $C_{L}$ and $C_{D}$ performance profiles are computed, and the results are fed into the inverse ML model, along with the foil area and all required performance profile moments. Similar to the forward design mode, all graphs are updated in real-time with the current design being depicted with blue color and the initial one with black.

Based on the description above, we satisfied (a) the requirement of interactivity by automatically, and in real-time, updating all relevant information and graphs, (b) the requirement of intuitiveness by employing the high-level parameters, with physical meaning, for the determination of the profile design, and finally, (c) the accuracy objective achieved by incorporating the best-performing ML models for both design problems. As a final note, we need to also mention here that one may easily substitute the ML models employed in the tool with any of the models presented in Section 2.5, and the required collection of user-determined input and output, along with the calculation of additional moments, is automatically adjusted to match the requirements of selected models, as described in Table 3.

3.3. Foil Shape Optimization

In this subsection, a comparison between a traditional performance-based shape optimization approach and the optimization setup enabled by the trained ANNs is presented. For this comparison, we intentionally picked an initial design for which the trained models provided a relative inaccurate performance prediction, as can be seen in Figure 8b. As one can observe in Figure 5b, the max error in the

C_{D}

prediction for the great majority of the designs was below 2%, whereas for the selected design, it reached 20% or even above. Specifically, we considered a one-meter chord length foil design (ID7910) with a sharp trailing edge (TE at

(1, 0)

), which is defined by the initial design vector,

v_{0} = (v_{0, 1}, v_{0, 2}, \dots, v_{15, 0}) \in V \subset R^{15}

, shown in Table 6. The corresponding foil profile shape is depicted in Figure 8a. Furthermore, we considered that the foil operates at a 3°angle of attack with a Reynolds number equal to

8 \times 10^{6}

, and we aimed to maximize the lift-over-drag ratio while maintaining a constant foil area. Finally, we limited the design space by permitting a maximum parameter value deviation (Each parameter value lies in

[0, 1]

, and hence,

\pm 5 %

corresponds to

\pm 0.05

) of

\pm 5 %

around the initial value for each parameter. Therefore, the shape optimization problem can be formulated as follows:

\begin{matrix} Find v^{*} : & \frac{C_{D} (v^{*})}{C_{L} (v^{*})} = min_{v \in V_{1}} \frac{C_{D} (v)}{C_{L} (v)} \\ subject to : & | A (v) - A_{0} | \leq ϵ, A_{0} = 0.1368 m^{2}, \\ V_{1} = {v \in V : v_{i, 0} - 0.05 \leq v_{i} \leq v_{i, 0} + 0.05, \forall v_{i}, i = 1, \dots, 15}, \end{matrix}

(9)

where

ϵ = 0.001

is the allowable deviation from the initial enclosed area of the selected foil profile (ID7910).

In the traditional performance-based shape optimization approach, we used the employed parametric model for generating foil design instances using a design vector with the model’s first 15 parameters; recall that we considered a sharp trailing edge at

(1, 0)

, and therefore,

{TE}_{1} = {TE}_{2} = 0

. The lift and drag coefficients, for each design instance, were once again estimated with the aid of the well-known XFOIL computational package (https://web.mit.edu/drela/Public/web/xfoil/ accessed on 10 May 2023), which, as mentioned previously, is a popular tool used in assessing the performance of subsonic airfoils. XFOIL employs a fast panel method with a fully coupled viscous/inviscid interaction method used in the ISES code developed by Drela and Giles; see also [6,41]. Obviously, when the trained ANNs are employed, no simulations are necessary, but we may still need to use the parametric model for the moments’ calculation. Specifically, in this benchmark example, we employed the M0-2_BR ANN, which produced the most-accurate prediction results; see Table 4 and Figure 5. The optimization algorithm employed for both cases was Pattern Search [51,52,53], a derivative-free global optimization algorithm, with the initial design (ID7910) being its starting point for both cases.

Figure 9 depicts the initial design (thin dashed line) along with the resulting optimized foil shapes for the traditional performance-based shape optimization with XFOIL (thick solid line) and the ML-enabled shape optimization (thick dashed line). The two optimized designs, although not identical, were well aligned and differed only slightly overall, with a more-visible deviation at the aft part of the upper foil side. In Table 7, we report the parametric model parameters corresponding to the initial design (Row 1) and the two optimized profiles in Rows 2 (traditional) and 3 (ML-based). One may easily observe from the same table that the difference between the two optimized designs was mainly in the parameters

{Xl}_{max}

and

s_{u 1}

, which, as expected, affected the part, which is visibly different between the two shapes. Performancewise, the shape resulting from the traditional optimization approach exhibited better performance, whereas the result from the ML-enabled optimization, although significantly improved when compared to the initial design, lacked slightly in performance when compared to the former optimized design. We summarize these results in Table 7, where the lift and drag coefficients along with the ratio, foil area, and required computational time are reported for each case. Note that all results in this table were evaluated with XFOIL so that they are directly comparable. The ML-enabled optimization achieved a 42.3% increase in lift followed by a small (4.58%) increase in drag, which resulted in a 26.5% improvement of the drag-to-lift ratio. The traditional approach outperformed these results, with the corresponding values being 57.9%, 1.92%, and 35.6%, respectively. However, if we look at the last column in Table 7, we can see where the ML-based approach really shined. Specifically, the computation time required dropped by approximately 85% when compared to the traditional approach. In addition, if one observes the parameter values identified by the ML-based approach in Table 6, we can easily assume that, by running a fast gradient-based optimization algorithm, such as steepest descent or Newton’s method, we may acquire the results achieved by the traditional optimization approach. Indeed, we used matlab’s fmincon function with the Sequential Quadratic Programming algorithm and reached the traditional approach’s results in just two iterations. Therefore, in summary, the ML-based approach can achieve relatively accurate and comparable (or even identical results, if we apply a secondary optimization step) with only

\frac{1}{7}

of the computational time required for the traditional approach, and this, as indicated in this example, occurred even when we started from a design that was not accurately captured by the trained ANN model.

4. Conclusions and Future Work

In this work, we demonstrated a successful application of Artificial Neural Network models in the forward and inverse foil design problems for early foil design stages. This was accomplished by integrating a foil parametric model for an intuitive, accurate, and compact representation of foil designs and an appropriately prepared rich dataset of realistic foil designs. The incorporation of the parametric model in the development of the model served also as a safeguard against invalid input, highly inaccurate predictions, and design exploration that went far beyond the training dataset. Specifically, the parametric model was constructed in way that guaranteed the construction of valid foil shapes for the complete range of its parameter values with the employed instances distributed in the design space, as indicated in Figure 4. At the same time, the output of the models used in the inverse design problem needed to produce valid shape vectors, which permitted us to easily identify erroneous predictions. In addition, geometric moments were incorporated in both the input and output datasets and were shown to significantly improve the accuracy of the ANNs in the prediction of the foil’s performance. Similarly, the inclusion of performance moments was shown to enhance the predictions for the inverse design problem, although their effect was less pronounced. The developed ANNs were compared against a reference linear regression model to benchmark their performance, and a wide range of variations with regard to their training procedure and the composition of the training datasets was examined to identify the best-performing models. Finally, the best-performing models were used to demonstrate the proposed approach in a real-time, intuitive “foil design assistant” software prototype, with a performance-based shape optimization example. The results obtained from the ML-enabled shape optimization were comparable to the traditional foil shape optimization approaches with the additional benefit of requiring only a fraction of the time needed for traditional approaches.

Although we demonstrated good prediction accuracy with the developed procedures, there were several limitations that we would need to address in the future. One of these limitations pertained to the usage of fixed performance metrics, i.e., the lift and drag coefficients for a fixed number of angles of attack, which limited the applicability of our approach to the early design stages. We could potentially employ recurrent neural networks and/or 1D convolutional neural networks to include variable-sized performance input and lower-level performance metrics, such as pressure distributions, which would expand the applicability of our approach beyond the early design stages. In a similar manner, the assessment of foil performance using panel methods was a restrictive limitation in our work, as it was noted in the introductory section. As we have now established the beneficial impact of the geometric moments’ inclusion, as well as the dimensionality reduction and robustness, which was attributed to the employed parametric model, we may move forward and include high-fidelity CFD simulations for the foil performance assessment, which would obviously expand the applicability of the proposed design tools. Moreover, we plan to conduct a study in the near future regarding the use of the developed Foil Design Assistant to assess its impact on the design effectiveness of students and professionals assigned foil design assessment and optimization tasks. Finally, we plan to extend this approach for the 3D case with wind turbine and propeller blades being the next possible objects of interest. Apart from the obvious tasks of extending the type of ML models and constructing appropriate datasets, the inclusion of geometric moment invariants should be also considered to alleviate the need for fixed placements and design normalization, which might not be possible for general functional surfaces.

Author Contributions

Conceptualization, K.V.K.; methodology, K.V.K. and M.M.; software, K.V.K. and M.M.; validation, K.V.K. and M.M.; formal analysis, K.V.K.; resources, K.V.K.; data curation, K.V.K. and M.M.; writing—original draft preparation, K.V.K. and M.M.; writing—review and editing, K.V.K. and M.M.; supervision, K.V.K.; project administration, K.V.K.; funding acquisition, K.V.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work received funding from Nazarbayev University, Kazakhstan, under the Faculty Development Competitive Research Grants Program 2022–2024: “Shape Optimization of Free-form Functional surfaces using isogeometric Analysis and Physics-Informed Surrogate Models—SOFFA-PHYS”, Grant Award No. 11022021FD2927.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data required to reproduce the findings in this work and the software application are available from Dr. Konstantinos Kostas, upon email request ([email protected]).

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of the data; in the writing of the manuscript; nor in the decision to publish the results.

References

Harries, S.; Abt, C.; Brenner, M. Upfront CAD—Parametric modeling techniques for shape optimization. In Advances in Evolutionary and Deterministic Methods for Design, Optimization and Control in Engineering and Sciences; Springer: Cham, Switzerland, 2019; pp. 191–211. [Google Scholar]
Kostas, K.; Ginnis, A.; Politis, C.; Kaklis, P. Ship-hull shape optimization with a T-spline based BEM–isogeometric solver. Isogeometric Analysis Special Issue. Comput. Methods Appl. Mech. Eng. 2015, 284, 611–622. [Google Scholar] [CrossRef] [Green Version]
Masters, D.A.; Poole, D.J.; Taylor, N.J.; Rendall, T.; Allen, C.B. Impact of Shape Parameterisation on Aerodynamic Optimisation of Benchmark Problem. In Proceedings of the 54th AIAA Aerospace Sciences Meeting, San Diego, CA, USA, 4–8 January 2016. [Google Scholar] [CrossRef] [Green Version]
Masters, D.A.; Taylor, N.J.; Rendall, T.C.S.; Allen, C.B.; Poole, D.J. Geometric Comparison of Aerofoil Shape Parameterization Methods. AIAA J. 2017, 55, 1575–1589. [Google Scholar] [CrossRef]
Hess, J.; Smith, A. Calculation of potential flow about arbitrary bodies. Prog. Aerosp. Sci. 1967, 8, 1–138. [Google Scholar] [CrossRef]
Drela, M.; Giles, M. Viscous-inviscid analysis of transonic and low Reynolds number airfoils. AIAA J. 1987, 25, 1347–1355. [Google Scholar] [CrossRef]
Lee, C.; Kerwin, J. A B-spline higher-order panel method applied to two-dimensional lifting problem. J. Ship Res. 2003, 47, 290–298. [Google Scholar] [CrossRef]
Kim, G.; Lee, C.; Kerwin, J. A B-spline based higher-order panel method for the analysis of steady flow around marine propellers. Ocean. Eng. 2007, 34, 2045–2060. [Google Scholar] [CrossRef]
Gao, Z.; Zou, Z. A NURBS based high-order panel method for three-dimensional radiation and diffraction problems with forward speed. Ocean. Eng. 2008, 35, 1271–1282. [Google Scholar] [CrossRef]
Kostas, K.; Ginnis, A.; Politis, C.; Kaklis, P. Shape-optimization of 2D hydrofoils using an Isogeometric BEM solver. Comput.-Aided Des. 2017, 82, 79–87. [Google Scholar] [CrossRef] [Green Version]
Chouliaras, S.; Kaklis, P.; Kostas, K.; Ginnis, A.; Politis, C. An Isogeometric Boundary Element Method for 3D lifting flows using T-splines. Comput. Methods Appl. Mech. Eng. 2021, 373, 113556. [Google Scholar] [CrossRef]
Katz, J.; Plotkin, A. Low-Speed Aerodynamics, 2nd ed.; Cambridge Aerospace Series; Cambridge University Press: Cambridge, UK, 2001. [Google Scholar] [CrossRef]
Versteeg, H.; Malalasekera, W. An Introduction to Computational Fluid Dynamics: The Finite Volume Method, 2nd ed.; Pearson: London, UK, 2007. [Google Scholar]
Smith, M.J.C.; Wilkin, P.J.; Williams, M.H. The Advantages of an Unsteady Panel Method in Modelling the Aerodynamic Forces on Rigid Flapping Wings. J. Exp. Biol. 1996, 199, 1073–1083. [Google Scholar] [CrossRef]
Shen, H.; Liu, Y.; Chen, B.; Lu, Y. Control-relevant modeling and performance limitation analysis for flexible air-breathing hypersonic vehicles. Aerosp. Sci. Technol. 2018, 76, 340–349. [Google Scholar] [CrossRef]
Wang, S.; González-Cao, J.; Islam, H.; Gómez-Gesteira, M.; Guedes Soares, C. Uncertainty estimation of mesh-free and mesh-based simulations of the dynamics of floaters. Ocean. Eng. 2022, 256, 111386. [Google Scholar] [CrossRef]
Mishra, A.A.; Mukhopadhaya, J.; Iaccarino, G.; Alonso, J. Uncertainty Estimation Module for Turbulence Model Predictions in SU2. AIAA J. 2019, 57, 1066–1077. [Google Scholar] [CrossRef] [Green Version]
Hui, X.; Bai, J.; Wang, H.; Zhang, Y. Fast pressure distribution prediction of airfoils using deep learning. Aerosp. Sci. Technol. 2020, 105, 105949. [Google Scholar] [CrossRef]
Wang, L.; Xu, J.; Luo, W.; Luo, Z.; Xie, J.; Yuan, J.; Tan, A.C. A deep learning-based optimization framework of two-dimensional hydrofoils for tidal turbine rotor design. Energy 2022, 253, 124130. [Google Scholar] [CrossRef]
Du, Q.; Liu, T.; Yang, L.; Li, L.; Zhang, D.; Xie, Y. Airfoil design and surrogate modeling for performance prediction based on deep learning method. Phys. Fluids 2022, 34, 015111. [Google Scholar] [CrossRef]
Li, J.; Du, X.; Martins, J.R. Machine learning in aerodynamic shape optimization. Prog. Aerosp. Sci. 2022, 134, 100849. [Google Scholar] [CrossRef]
Kharal, A.; Saleem, A. Neural networks based airfoil generation for a given Cp using Bezier–PARSEC parameterization. Aerosp. Sci. Technol. 2012, 23, 330–344. [Google Scholar] [CrossRef]
Sobieczky, H. Parametric Airfoils and Wings. In Recent Development of Aerodynamic Design Methodologies. Notes on Numerical Fluid Mechanics; Fujii, K., Dulikravich, G., Eds.; Springer Vieweg+Teubner Verlag: Berlin, Germany, 1999; Volume 65. [Google Scholar]
Derksen, R.; Rogalsky, T. Bezier-PARSEC: An optimized aerofoil parameterization for design. Adv. Eng. Softw. 2010, 41, 923–930. [Google Scholar] [CrossRef]
Diez, M.; Campana, E.F.; Stern, F. Design-space dimensionality reduction in shape optimization by Karhunen–Loève expansion. Comput. Methods Appl. Mech. Eng. 2015, 283, 1525–1544. [Google Scholar] [CrossRef]
Khan, S.; Kaklis, P.; Serani, A.; Diez, M.; Kostas, K. Shape-supervised Dimension Reduction: Extracting Geometry and Physics Associated Features with Geometric Moments. Comput.-Aided Des. 2022, 150, 103327. [Google Scholar] [CrossRef]
Chen, W.; Chiu, K.; Fuge, M.D. Airfoil design parameterization and optimization using Bézier generative adversarial networks. AIAA J. 2020, 58, 4723–4735. [Google Scholar] [CrossRef]
Du, X.; He, P.; Martins, J.R. A B-spline-based generative adversarial network model for fast interactive airfoil aerodynamic optimization. In Proceedings of the AIAA Scitech 2020 Forum, Orlando, FL, USA, 6–10 January 2020; p. 2128. [Google Scholar]
Li, J.; Bouhlel, M.A.; Martins, J.R. Data-based approach for fast airfoil analysis and optimization. AIAA J. 2019, 57, 581–596. [Google Scholar] [CrossRef]
UIUC Applied Aerodynamics Group. UIUC Airfoil Coordinates Database. 2023. Available online: https://m-selig.ae.illinois.edu/ads/coord_database.html (accessed on 10 February 2023).
Sun, Y.; Sengupta, U.; Juniper, M. Physics-informed deep learning for simultaneous surrogate modeling and PDE-constrained optimization of an airfoil geometry. Comput. Methods Appl. Mech. Eng. 2023, 411, 116042. [Google Scholar] [CrossRef]
Oshima, E.; Lee, N.; Gharib, M.; Lee, V.; Khodadoust, A. Development of a physics-informed neural network to enhance wind tunnel data for aerospace design. In Proceedings of the AIAA SCITECH 2023 Forum, National Harbor, MD, USA, 23–27 January 2023. [Google Scholar] [CrossRef]
Kostas, K.; Amiralin, A.; Sagimbayev, S.; Massalov, T.; Kalel, Y.; Politis, C. Parametric model for the reconstruction and representation of hydrofoils and airfoils. Ocean. Eng. 2020, 199, 107020. [Google Scholar] [CrossRef]
Kulfan, B.; Bussoletti, J. Fundamental Parametric Geometry Representations for Aircraft Component Shapes. In Proceedings of the 11th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference, Portsmouth, VA, USA, 6–8 September 2006. [Google Scholar]
Kulfan, B.M. Universal Parametric Geometry Representation Method. J. Aircr. 2008, 45, 142–158. [Google Scholar] [CrossRef]
Piegl, L.; Tiller, W. The Nurbs Book, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 1997. [Google Scholar]
Nguyen, A.; Yosinski, J.; Clune, J. Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 427–436. [Google Scholar] [CrossRef] [Green Version]
Mishra, A.A.; Edelen, A.; Hanuka, A.; Mayes, C. Uncertainty quantification for deep learning in particle accelerator applications. Phys. Rev. Accel. Beams 2021, 24, 114601. [Google Scholar] [CrossRef]
Lakshminarayanan, B.; Pritzel, A.; Blundell, C. Simple and Scalable Predictive Uncertainty Estimation Using Deep Ensembles. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, Long Beach, CA, USA, 4–9 December 2017; pp. 6405–6416. [Google Scholar]
Khan, S.; Goucher-Lambert, K.; Kostas, K.; Kaklis, P. ShipHullGAN: A generic parametric modeller for ship hull design using deep convolutional generative model. Comput. Methods Appl. Mech. Eng. 2023, 411, 116051. [Google Scholar] [CrossRef]
Drela, M. XFOIL: An Analysis and Design System for Low Reynolds Number Airfoils. In Low Reynolds Number Aerodynamics; Mueller, T., Ed.; Lecture Notes in Engineering; Springer: Berlin/Heidelberg, Germany, 1989; Volume 54. [Google Scholar]
Ang, E.; Ng, B.F. Physics-Informed Neural Networks for Flow Around Airfoil. In Proceedings of the AIAA SCITECH 2022 Forum, San Diego, CA, USA, 3–7 January 2022. [Google Scholar] [CrossRef]
Wong, B.; Damodaran, M.; Khoo, B.C. Physics-Informed Machine Learning for Inverse Airfoil Shape Design. In Proceedings of the AIAA AVIATION 2023 Forum, San Diego, CA, USA, 12–16 June 2023. [Google Scholar] [CrossRef]
Bronstein, A.M.; Bronstein, M.M.; Kimmel, R. Numerical Geometry of Non-Rigid Shapes; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
Xu, D.; Li, H. Geometric moment invariants. Pattern Recognit. 2008, 41, 240–249. [Google Scholar] [CrossRef]
Jolliffe, I.T. Principal Component Analysis, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar] [CrossRef]
Little, R.J.A.; Rubin, D.B. Statistical Analysis with Missing Data, 3rd ed.; Seriers in Probability and Statistics; Wiley: Hoboken, NJ, USA, 2019. [Google Scholar]
Meng, X.L.; Rubin, D.B. Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika 1993, 80, 267–278. [Google Scholar] [CrossRef]
MacKay, D.J.C. Bayesian Interpolation. Neural Comput. 1992, 4, 415–447. [Google Scholar] [CrossRef]
Dan Foresee, F.; Hagan, M. Gauss-Newton approximation to Bayesian learning. In Proceedings of the International Conference on Neural Networks (ICNN’97), Houston, TX, USA, 12 June 1997; Volume 3, pp. 1930–1935. [Google Scholar] [CrossRef]
Audet, C.; Dennis, J.E. Analysis of Generalized Pattern Searches. SIAM J. Optim. 2002, 13, 889–903. [Google Scholar] [CrossRef] [Green Version]
Lewis, R.M.; Torczon, V. A Globally Convergent Augmented Lagrangian Pattern Search Algorithm for Optimization with General Constraints and Simple Bounds. SIAM J. Optim. 2002, 12, 1075–1089. [Google Scholar] [CrossRef] [Green Version]
Lewis, R.M.; Shepherd, A.; Torczon, V. Implementing Generating Set Search Methods for Linearly Constrained Minimization. SIAM J. Sci. Comput. 2007, 29, 2507–2530. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Foil instance of the parametric model employed in this work. The 17 parameters (marked on the figure above) are used for the parametric definition of the 13 control points

{b_{i}}_{i = 0}^{12}

determining the foil profile.

Figure 1. Foil instance of the parametric model employed in this work. The 17 parameters (marked on the figure above) are used for the parametric definition of the 13 control points

{b_{i}}_{i = 0}^{12}

determining the foil profile.

Figure 2. Effect of the first 15 parametric model parameters on the foil profile. In each subplot, one parameter varies while the others are kept constant. The remaining 2 parameters,

{TE}_{1}

and

{TE}_{2}

, determine the trailing edge type, i.e., blunt or sharp, and the distance between the two foil sides at the TE.

Figure 2. Effect of the first 15 parametric model parameters on the foil profile. In each subplot, one parameter varies while the others are kept constant. The remaining 2 parameters,

{TE}_{1}

and

{TE}_{2}

, determine the trailing edge type, i.e., blunt or sharp, and the distance between the two foil sides at the TE.

Figure 3. Example of XFOIL output for a foil instance in the employed dataset. alpha corresponds to the angle of attack, whereas CL and CD correspond to the lift and drag coefficients, respectively.

Figure 4. Foil designs plotted in the 2D design space with coordinates corresponding to the first two principal components; red circles depict the initial UIUC database foil profiles and blue dots the constructed variations.

Figure 5. Histograms of model predictions for the forward foil design problem. (a) Histogram of the max errors in the prediction of the

C_{L}

values for all forward ANNs described above. (b) Histogram of the max errors in the prediction of the

C_{D}

values for all forward ANNs described in Section 2.5.

Figure 5. Histograms of model predictions for the forward foil design problem. (a) Histogram of the max errors in the prediction of the

C_{L}

values for all forward ANNs described above. (b) Histogram of the max errors in the prediction of the

C_{D}

values for all forward ANNs described in Section 2.5.

Figure 6. Histograms of model predictions for the inverse foil design problem. (a) Histogram of the max errors in the prediction of the design variable values for all inverse problem ANNs described in Section 2.5. (b) Histogram of the mean errors in the prediction of the design variable values for all inverse problem ANNs described in Section 2.5.

Figure 7. Foil Design Assistant application, main interface.

Figure 8. Selected design for optimization along with the corresponding lift (blue) and drag (red) coefficients for an angle of attack range in

[0, 8] °

. (a) Foil shape; (b)

C_{L}

and

C_{D}

performance—actual and predicted.

Figure 8. Selected design for optimization along with the corresponding lift (blue) and drag (red) coefficients for an angle of attack range in

[0, 8] °

. (a) Foil shape; (b)

C_{L}

and

C_{D}

performance—actual and predicted.

Figure 9. Optimized foil profiles compared with the initial design (thin dashed line). The ML-based optimization result is depicted with a thick dashed line, while the optimized design using XFOIL is depicted with a thick solid line.

Table 1. Parameters of the employed parametric model. All parameters are defined in

[0, 1]

.

Table 1. Parameters of the employed parametric model. All parameters are defined in

[0, 1]

.

No.	Parameter	Description	Type/Effect
1	$u_{max}$	maximum distance of upper side from chord	upper side/global
2	${Xu}_{max}$	longitudinal position of maximum distance	upper side/global
3	$a_{u}$	upper side, angle at trailing edge	upper side back/local
4	$t_{u 1}$	bulkiness for front part of upper side	upper side front/local
5	$t_{u 2}$	bulkiness for front-to-mid transition of upper side	upper side front/local
6	$s_{u 1}$	bulkiness and convexity control (mid-to-back transition of upper side)	upper side back/local
7	$s_{u 2}$	bulkiness and convexity control (back part of upper side)	upper side back/local
8	$l_{max}$	maximum distance of lower side from chord	lower side/global
9	${Xl}_{max}$	longitudinal position of maximum distance	lower side/global
10	$a_{l}$	lower side, angle at trailing edge	lower side back/local
11	$t_{l 1}$	bulkiness for front part of lower side	lower side front/local
12	$t_{l 2}$	bulkiness for front-to-mid transition of lower side	lower side front/local
13	$s_{l 1}$	bulkiness and convexity control (mid-to-back transition of lower side)	lower side back/local
14	$s_{l 2}$	bulkiness and convexity control (back part of lower side)	lower side back/local
15	$a$	tangent angle at upper/lower sides joining point	both sides/local
16	${TE}_{1}$	vertical distance from (1,0)—ordinate of 1st point	upper side/local
17	${TE}_{2}$	vertical distance from (1,0)—ordinate of last point	lower side/local

Table 2. Structure of design and performance datasets.

Dataset $D_{1}$
	1	2	…	17	18	19	…	23
1	$v_{1}^{1}$	$v_{2}^{1}$	…	$v_{17}^{1}$	$M_{0, 0}^{1}$	$M_{1, 0}^{1}$	…	$M_{1, 1}^{1}$
2	$v_{1}^{2}$	$v_{2}^{2}$	…	$v_{17}^{2}$	$M_{0, 0}^{2}$	$M_{1, 0}^{2}$	…	$M_{1, 1}^{2}$
⋮	…							⋮
12,615	$v_{1}^{12,615}$	$v_{2}^{12,615}$	…	$v_{17}^{12,615}$	$M_{0, 0}^{12,615}$	$M_{1, 0}^{12,615}$	…	$M_{1, 1}^{12,615}$
Dataset $P_{1}$
	1	…	9	10	…	18	19	…	25	26	…	31
1	$C_{L} {(0)}^{1}$	…	$C_{L} {(8)}^{1}$	$C_{D} {(0)}^{1}$	…	$C_{D} {(8)}^{1}$	$M_{0, 0}^{1, C_{L}}$	…	$M_{1, 1}^{1, C_{L}}$	$M_{0, 0}^{1, C_{D}}$	…	$M_{1, 1}^{1, C_{D}}$
2	$C_{L} {(0)}^{2}$	…	$C_{L} {(8)}^{2}$	$C_{D} {(0)}^{2}$	…	$C_{D} {(8)}^{2}$	$M_{0, 0}^{2, C_{L}}$	…	$M_{1, 1}^{2, C_{L}}$	$M_{0, 0}^{2, C_{D}}$	…	$M_{1, 1}^{2, C_{D}}$
⋮	…											⋮
12,615	$C_{L} {(0)}^{12,615}$	…	$C_{L} {(8)}^{12,615}$	$C_{D} {(0)}^{12,615}$	…	$C_{D} {(8)}^{12,615}$	$M_{0, 0}^{12,615, C_{L}}$	…	$M_{1, 1}^{12,615, C_{L}}$	$M_{0, 0}^{12,615, C_{D}}$	…	$M_{1, 1}^{12,615, C_{D}}$

Table 3. Description of forward and inverse regression models. The numbers in curly brackets for the input and output sets correspond to the columns of the datasets

D_{1}

and

P_{1}

described in Table 2.

Table 3. Description of forward and inverse regression models. The numbers in curly brackets for the input and output sets correspond to the columns of the datasets

D_{1}

and

P_{1}

described in Table 2.

Forward ANN Models
Model Name	Description	Input	Output
LM	2-layer feedforward with 20 nodes, Levenberg–Marquardt (LM) back-propagation	17 parameters, ${1 - 17}_{D_{1}}$	9 values, $C_{L}$ or $C_{D}$ , ${1 - 9}_{P_{1}}$ or ${10 - 18}_{P_{1}}$
BR	2-layer feedforward with 20 nodes, LM with Bayesian regularization	17 parameters, ${1 - 17}_{D_{1}}$	9 values, $C_{L}$ or $C_{D}$ , ${1 - 9}_{P_{1}}$ or ${10 - 18}_{P_{1}}$
LM_M0	2-layer feedforward with 20 nodes, LM back-propagation	17 parameters, ${1 - 17}_{D_{1}}$	19 values, $C_{L}, C_{D} and M 0$ , ${1 - 18}_{P_{1}} \cup {18}_{D_{1}}$
BR_M0	2-layer feedforward with 20 nodes, LM with Bayesian regularization	17 parameters, ${1 - 17}_{D_{1}}$	19 values, $C_{L}, C_{D} and M 0$ , ${1 - 18}_{P_{1}} \cup {18}_{D_{1}}$
M0_LM	2-layer feedforward with 20 nodes, LM back-propagation	18 parameters, ${1 - 18}_{D_{1}}$	9 values, $C_{L}$ or $C_{D}$ , ${1 - 9}_{P_{1}}$ or ${10 - 18}_{P_{1}}$
M0_BR	2-layer feedforward with 20 nodes, LM with Bayesian regularization	18 parameters, ${1 - 18}_{D_{1}}$	9 values, $C_{L}$ or $C_{D}$ , ${1 - 9}_{P_{1}}$ or ${10 - 18}_{P_{1}}$
M0-1_BR	2-layer feedforward with 20 nodes, LM with Bayesian regularization	20 parameters, ${1 - 20}_{D_{1}}$	9 values, $C_{L}$ or $C_{D}$ , ${1 - 9}_{P_{1}}$ or ${10 - 18}_{P_{1}}$
M0-1C_BR	2-layer feedforward with 25 nodes, LM with Bayesian regularization	22 parameters, ${1 - 20, \frac{19}{18}, \frac{20}{18}}_{D_{1}}$	9 values, $C_{L}$ or $C_{D}$ , ${1 - 9}_{P_{1}}$ or ${10 - 18}_{P_{1}}$
M0-2_BR	2-layer feedforward with 25 or 30 nodes, LM with Bayesian regularization	25 parameters, $D_{1} \cup {\frac{19}{18}, \frac{20}{18}}_{D_{1}}$	9 values, $C_{L}$ or $C_{D}$ , ${1 - 9}_{P_{1}}$ or ${10 - 18}_{P_{1}}$
Inverse ANN models
Model Name	Description	Input	Output
$C_{L}$ only	2-layer feedforward with 20 nodes, LM with Bayesian regularization	9 $C_{L}$ values, ${1 - 9}_{P_{1}}$	17 parameters, ${1 - 17}_{D_{1}}$
$C_{D}$ only	2-layer feedforward with 20 nodes, LM with Bayesian regularization	9 $C_{D}$ values, ${10 - 18}_{P_{1}}$	17 parameters, ${1 - 17}_{D_{1}}$
$C_{L} and C_{D}$	2-layer feedforward with 20 nodes, LM with Bayesian regularization	18 values ( $C_{L} a n d C_{D}$ ), ${1 - 18}_{P_{1}}$	17 parameters, ${1 - 17}_{D_{1}}$
M0 $C_{L} and C_{D}$	2-layer feedforward with 20 nodes, LM with Bayesian regularization	19 values, ${1 - 18}_{P_{1}} \cup {18}_{D_{1}}$	17 parameters, ${1 - 17}_{D_{1}}$
M0,M0-2_ $C_{D}$	2-layer feedforward with 20 nodes, LM with Bayesian regularization	25 values, ${1 - 18, 26 - 31}_{P_{1}} \cup {18}_{D_{1}}$	17 parameters, ${1 - 17}_{D_{1}}$
M0,M0-2_ $C_{L}$	2-layer feedforward with 20 nodes, LM with Bayesian regularization	25 values, ${1 - 24}_{P_{1}} \cup {18}_{D_{1}}$	17 parameters, ${1 - 17}_{D_{1}}$
M0,M0-2_ $C_{L} and C_{D}$	2-layer feedforward with 20 nodes, LM with Bayesian regularization	32 values, $P_{1} \cup {18}_{D_{1}}$	17 parameters, ${1 - 17}_{D_{1}}$

Table 4. Performance of forward ANNs and multivariate linear regression in the prediction of the lift and drag coefficients.

Model	BR_M0	BR	LM	LM_M0	M0_LM	M0_BR	M0-1_BR	M0-1C_BR	M0-2_BR
RMSE ( $C_{L}$ )	0.0390	0.0386	0.0449	0.0417	0.0429	0.0395	0.0345	0.0335	0.0293
nRMSE ( $C_{L}$ )	0.0208	0.0206	0.0240	0.0223	0.0229	0.0211	0.0184	0.0179	0.0156
RMSE ( $C_{D}$ )	0.0036	0.0020	0.0023	0.0036	0.0025	0.0028	0.0018	0.0018	0.0021
nRMSE ( $C_{D}$ )	0.0084	0.0045	0.0053	0.0083	0.0057	0.0065	0.0042	0.0042	0.0049
Multivariate linear regression					RMSE ( $C_{L}$ )	nRMSE ( $C_{L}$ )	RMSE ( $C_{D}$ )	nRMSE ( $C_{D}$ )
					0.1065	0.0569	0.0047	0.0109

Table 5. Performance of inverse problem ANNs and multivariate linear regression in the prediction of the design parameters.

Model	$C_{L}$ only	$C_{D}$ only	$C_{L} & C_{D}$	$M 0 C_{L} & C_{D}$	$M 0 - 2 C_{D}$ ¹	$M 0 - 2 C_{L}$ ¹	$M 0 - 2 C_{L} & C_{D}$ ¹
RMSE	0.1770	0.1795	0.1691	0.1638	0.1637	0.1633	0.1635
Multivariate linear regression					RMSE
					0.1770

¹ Note that we have suppressed the preceding “M0” from these models’ name for reasons of simplicity.

Table 6. Design parameters for the initial and optimized designs.

Design	$u_{\max}$	${Xu}_{\max}$	$a_{u}$	$t_{u 1}$	$t_{u 2}$	$s_{u 1}$	$s_{u 2}$	${TE}_{1, 2}$ ¹
initial	0.3071	0.5976	0.1645	0.4882	0.4652	0.4191	0.5818	0,0
optimized	0.3569	0.6445	0.1762	0.5371	0.5150	0.4680	0.5857	0,0
ML-optimized	0.3570	0.5994	0.1780	0.5286	0.5056	0.3695	0.5811	0,0
	$l_{\max}$	${Xl}_{\max}$	$a_{l}$	$t_{l 1}$	$t_{l 2}$	$s_{l 1}$	$s_{l 2}$	$a$
initial	0.2877	0.6095	0.3597	0.5393	0.4473	0.5051	0.7436	0.5751
optimized	0.2377	0.5596	0.4096	0.4895	0.3974	0.4553	0.6967	0.6137
ML-optimized	0.2470	0.5643	0.4097	0.4906	0.4081	0.4551	0.6998	0.5251

¹ All designs have a sharp trailing edge at

(1, 0)

.

Table 7. Design optimization results.

Design	Lift Coefficient ( $C_{L}$ )	Drag Coefficient ( $C_{D}$ )	Drag-Over-Lift ( $\frac{C_{D}}{C_{L}}$ )	Area	Optimization Time ¹
Initial	0.4050	0.00677	0.0167	0.1368	-
Optimized	0.6356	0.00690	0.0109	0.1363	2139.2 s
Difference	$57.9 %$	$1.92 %$	$35.6 %$	$0.36 %$	100%
ML-optimized	0.5763	0.00708	0.0123	0.1367	312.3 s
Difference	$42.3 %$	$4.58 %$	$26.5 %$	$0.10 %$	14.6%

¹ Optimization was a performed on a standard notebook with an Intel(R) Core(TM) i7-9750H CPU @ 2.60 GHz and 16 Gb of RAM running matlab 2023a on MS Windows 11.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kostas, K.V.; Manousaridou, M. Machine-Learning-Enabled Foil Design Assistant. J. Mar. Sci. Eng. 2023, 11, 1470. https://doi.org/10.3390/jmse11071470

AMA Style

Kostas KV, Manousaridou M. Machine-Learning-Enabled Foil Design Assistant. Journal of Marine Science and Engineering. 2023; 11(7):1470. https://doi.org/10.3390/jmse11071470

Chicago/Turabian Style

Kostas, Konstantinos V., and Maria Manousaridou. 2023. "Machine-Learning-Enabled Foil Design Assistant" Journal of Marine Science and Engineering 11, no. 7: 1470. https://doi.org/10.3390/jmse11071470

APA Style

Kostas, K. V., & Manousaridou, M. (2023). Machine-Learning-Enabled Foil Design Assistant. Journal of Marine Science and Engineering, 11(7), 1470. https://doi.org/10.3390/jmse11071470

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine-Learning-Enabled Foil Design Assistant

Abstract

1. Introduction

2. Materials and Methods

2.1. Foil Parametric Model

2.2. Foil Performance Analysis

2.3. Geometric Moments

2.4. Training Datasets

2.5. Regression Models

3. Results and Discussion

3.1. Regression and ML Models’ Performance

3.2. Foil Design Assistant

3.3. Foil Shape Optimization

4. Conclusions and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI