1. Introduction
Given its central role in energy systems, the optimization of motor performance emerges as paramount, influencing the efficacy of the entire energy system [
1,
2].
Currently, prevalent modeling approaches for motor optimization encompass both model-driven and data-driven methodologies. Model-driven modeling relies on fundamental electromagnetic principles to construct motor models [
3,
4,
5,
6], while data-driven methods leverage observed motor performance data and employ statistical and machine learning techniques to unveil system patterns and relationships [
7,
8,
9]. Employing data-driven methodologies to establish mathematical models for motors offers advantages such as reduced computational complexity and robust generalization capabilities [
5,
10,
11]. Moreover, data-driven techniques harness machine learning methodologies to better capture the intricacies of complex systems [
12].
Data-driven approaches have yielded significant advancements in the realm of motor optimization. A multi-tiered optimization strategy, integrating fuzzy reasoning with sequential Taguchi methodology, has demonstrated enhanced optimization precision for IPMSMs [
13]. A fuzzy sequential Taguchi method is utilized to achieve efficient and robust design optimization of fault tolerance for five-phase in-wheel motors of electric vehicles [
14]. Employing a combination of particle swarm optimization with a response surface method has proven effective in optimizing output torque and cogging torque of transverse flux permanent magnet motors [
15]. Online data-driven methodologies leveraging dual-loop optimization have streamlined the optimization process for permanent magnet linear synchronous motors [
16]. Optimization efforts for interior permanent magnet synchronous motors within hub motors, utilizing a Kriging model based on Latin hypercube sampling, have achieved a broader speed range and reduced cogging torque [
17]. The optimization of average thrust and thrust ripple for double-layered flux-switching permanent magnet motors was successfully realized through a hybrid approach, integrating random forest optimization of hyperparameters with non-dominated sorting genetic algorithm-II (NSGA-II) [
18]. The utilization of a proxy model based on DNN has facilitated the optimization of rotor torque and mechanical stress for motors [
19].
Data-driven motor modeling also presents some noteworthy issues. Reference [
20] highlights that the complexity of the motor optimization process increases with multidimensional design variables, thus necessitating a reduction in design space. Reference [
21] points out the curse of dimensionality in data-driven modeling with high-dimensional data, manifested in sparse data distributions and the trend toward the edges of the space. To ensure the accuracy of regression models, reference [
22] achieves effective sparsity structures by developing a G-group-level parameter inference GRIP test through assumed formulations.
The experimental cost of obtaining samples for data-driven motor optimization is relatively high, often resulting in limited data scale [
23]. Reference [
24] indicates that when modeling with small sample sizes and high-dimensional data, estimates of model parameters may be biased, leading to reduced prediction accuracy. Reference [
25] discusses the influence of data dimensionality and sample size on the performance of machine learning models. Reference [
26] suggests that machine learning methods are an effective means of addressing challenges in modeling with small data samples.
In engineering practice, the selection of suitable Pareto frontier optimal solutions requires consideration of multiple factors, including technical feasibility and cost-effectiveness. It entails not only guidance from scientific theories but also practical experience and skills [
27]. Reference [
28] emphasizes the potentially vast size of the Pareto solution set, underscoring the critical importance of judiciously selecting Pareto solutions for the quality and practicality of optimization outcomes. Reference [
29] introduces a quantitative approach to evaluating Pareto solutions by establishing the average variability of objective function values associated with adjacent Pareto solutions. Additionally, reference [
30] employs the R-method to rank Pareto optimal solutions.
Given the complexity of electric motors as intricate systems and the constraints posed by experimental costs, optimizing electric motors using data-driven approaches presents the characteristics as multivariate, multi-objectivity, and small dataset size. Prior research underscores the importance of conducting sensitivity analyses on motor design variables to mitigate the curse of dimensionality during modeling processes. In practical terms, owing to limited dataset sizes and high experimental costs, ensuring that the experimental points adequately cover the design space under constrained scales is imperative to ensure modeling accuracy. Additionally, attention must be paid to selecting Pareto-optimal solutions, as this impacts the quality and practicality of optimization outcomes. When addressing electric motor optimization problems, a comprehensive consideration of multiple factors is essential to derive reliable and practical optimization solutions.
Within this study, we aim to explore a data-driven approach for optimizing motor design, capitalizing on the inherent advantages of data-driven methodologies to enhance various facets of motor performance, including efficiency, output torque, and induced electromotive force. To elucidate and substantiate our proposed methodology, we initially focus on a permanent magnet synchronous motor as our optimization target, with efficiency, average torque, and harmonic distortion of induced electromotive force serving as our optimization objectives. Leveraging pre-existing knowledge, we conduct sensitivity analyses on key motor parameters, retaining sensitive variables as design parameters to effectively streamline the design space. Subsequently, we employ Latin hypercube sampling based on the Morris–Mitchell criterion to design the experimental space, ensuring comprehensive spatial coverage of the dataset. Following this, we utilize support vector regression (SVR) to construct the motor model based on a relatively modest dataset. Lastly, we integrate a multi-objective particle swarm optimization algorithm to yield non-dominated solutions. Given the complexities and constraints inherent in engineering implementations, we adopt the principle of proximity to select Pareto-optimal solutions and compare performance metrics both pre- and post-optimization. The flowchart of this study is delineated in
Figure 1.
2. Prototype Model
This paper focuses on optimizing a surface-mounted PMSM, targeting efficiency, average torque, and harmonic distortion of the electromotive force as optimization objectives. The motor parameters are outlined in
Table 1.
Finite element analysis was conducted using ANSYS Maxwell 2020R1 software. Motor parameters were inputted to construct the computer model. The motor operates at a synchronous speed of 1500 rpm with a power angle of 8.7°. It has a rated phase voltage of 220 V, winding resistance of 10 Ω, and winding self-inductance of 5.13 mH. Simulation time spans 200 ms, with a 2 ms time step.
Figure 2a illustrates the motor model within ANSYS Maxwell, while
Figure 2b displays the mesh partition, indicating reasonable division.
Figure 2c exhibits magnetic flux lines, showcasing satisfactory closure and symmetry with minimal leakage flux.
Figure 2d demonstrates magnetic flux density contours, with peak density near the air gap reaching approximately 2 T.
Figure 3a portrays phase current waveforms, reflecting good symmetry and stable amplitude across the three phases. Lastly,
Figure 3b depicts phase voltage waveforms of induced electromotive force, also demonstrating favorable symmetry and stable amplitude across phases.
4. Model Establishment
In data-driven motor optimization design, data processing determines the accuracy of the model and the effectiveness of the optimization. Through the improved Latin hypercube sampling method, a test space uniformly distributed in three-dimensional space was obtained. The data were input into ANSYS Maxwell. and a series of corresponding values of and were obtained. These data are used as training sets, and the support vector regression algorithm is used to construct the functional relationships between and , respectively.
4.1. Design of Experimental Space for Modeling
The objective of experimental space design is to adequately capture the features of the system model using a limited number of sample data. This includes understanding the coupling relationship between variables and the mapping between design variables and target variables. Therefore, it is imperative that the experimental points effectively cover the experimental space to ensure the precision of fitting the objective function by the machine learning model.
Latin hypercube sampling (LHS) is a stratified sampling method that efficiently distributes samples throughout the experimental space, requiring fewer samples than traditional random sampling methods [
39]. However, in ordinary Latin hypercube sampling, random arrangement of sample points may lead to suboptimal spatial coverage. To address this, the maximum–minimum distance criterion was introduced to enhance the spatial coverage of sampling points [
40].
Morris and Mitchell extended the maximum–minimum distance criterion and proposed the
criterion, also known as the Morris–Mitchell criterion [
41]. An experimental design is called
. An optimal design should satisfy the following conditions:
A smaller value of indicates better test sample space filling performance. In a specific test plan, the distance between any two sample points is sorted to create a list along with the corresponding index list . Here, represents different distance values, represents the number of point pairs with distance , s is the total number of different distance values, and p is a positive integer setting as 50.
The training set size required for this study is 200. Utilizing the Latin hypercube sampling method based on the Morris–Mitchell criterion, we acquired a set of 200 sampling points with good coverage in the three-dimensional space. By inputting the data from these sampling points into ANSYS Maxwell for analysis, we can derive both the training and testing sets for the model. The spatial distribution of the sampling points is depicted in
Figure 5.
4.2. Modeling Based on Support Vector Regression
Selecting support vector regression (SVR) to model the objective function of the motor is recommended. Support vector machines typically outperform other models in small sample datasets [
42,
43], efficiently handle data in high-dimensional space, and address linearly inseparable situations through kernel functions, showcasing strong generalization capabilities. The fundamental concept of SVR is to identify a hyperplane that minimizes the distance between sample points and the hyperplane within a specified tolerance.
For a given training dataset
, where
represents the input feature and
is the corresponding output, the objective of support vector regression (SVR) is to determine a function
that can predict the output
for a new input
. The model expression for SVR is as follows:
Here,
w is the weight vector,
b is the bias (intercept), and ⟨⋅,⋅⟩ represents the inner product. SVR utilizes a loss function to quantify the prediction error. In SVR, the loss function typically includes an error term along with a penalty term:
The tolerance (ξ) in support vector regression (SVR) refers to the acceptable range of data points that can be considered as support vectors. It is a crucial parameter used to determine the level of deviation allowed for data points to be classified as support vectors. The mathematical expression for SVR incorporates this tolerance to identify the support vectors within the specified range.
where
C is the regularization coefficient and
N is the number of samples. The goal of this optimization problem is to find a maximum margin hyperplane while ensuring that all sample points satisfy the constrain.
At the same time, in order to achieve nonlinear fitting, kernel skill needs to be used. The SVR algorithm in this article uses a Gaussian kernel (RBF kernel). The expression of Gaussian kernel is as follows:
where
defines the range of influence of a single sample, with smaller values indicating a smaller impact and larger values indicating a broader influence range. The parameter settings for the SVR algorithm in this study are presented in
Table 6.
4.3. Analysis of the Effect of Models Fitting
Utilizing support vector regression (SVR), the functions for
, and
were fitted. The fitting performance is visually demonstrated in
Figure 6. The test set data shows that all three functions effectively capture the patterns. Analysis from
Table 7 reveals that the root mean square error (RMSE) for all three functions is minimal, while the
Score is high, indicating a strong fitting performance.
6. Conclusions
Through this study, we have explored a data-driven approach to motor optimization, aiming to tackle the challenges posed by multiple objectives, variables, and small sample sizes. Our objective is to achieve more accurate data-driven modeling and optimize motor performance effectively.
Our methodology involves a targeted selection of key parameters through sensitivity analysis, leading to a reduction in the dimensionality of the design variable space. This step enhances optimization efficiency and accuracy. Additionally, we employ Latin hypercube sampling based on the Morris–Mitchell criterion to systematically design experimental spaces for establishing optimization models. Leveraging support vector regression, we develop a more precise surrogate model of the motor based on a small sample size. Finally, considering the practical difficulties and constraints of engineering implementation, we adopt a nearest-neighbor approach to select a set of Pareto front optimal solutions.
Utilizing the approach detailed in this study, we targeted a permanent magnet synchronous motor for optimization, focusing on efficiency, average output torque, and harmonic distortion of the induced electromotive force. Through our optimization methodology, we achieved a notable 6.80% increase in average output torque and successfully reduced the harmonic distortion of the induced electromotive force by 59.5% while maintaining near-constant motor efficiency.
In conclusion, this study presents a method and technology for motor optimization design, offering robust support for enhancing motor performance and achieving more efficient energy utilization.