1. Introduction
Wind energy, as the protagonist of energy transformation in response to energy shortages and the environmental pollution crisis, has attracted worldwide attention for its development and utilization. Nowadays, with the increasing installed capacity of wind power, the development of wind power still faces the serious phenomenon of wind abandonment and power limitation [
1]. Despite the tremendous advantages of grid-connected wind power, due to the uncertainty and intermittent nature of wind power, its ever-increasing installed capacity poses tremendous challenges to the power system [
2]. Improving the accuracy of wind speed and power prediction is one of the most important means of solving the above problems. Accurate power prediction is a prerequisite for the accurate grid-connected dispatch of wind power. It can not only help power dispatchers make accurate and effective decisions to reduce the adverse impact of wind power on the entire power grid but also greatly reduce the cost of wind power, bringing enormous economic benefits to the power market [
3].
Many research methods on high reliability and efficiency have been introduced in the related research of wind forecasting, which can be divided into the mathematical statistics method [
4,
5,
6,
7,
8], the physical model method [
9,
10,
11,
12,
13] and other methods [
14,
15,
16,
17,
18] according to the model principle.
Mathematical statistical prediction methods usually use historical data to build statistical models and infer future values by observing the relationships among data. Contrary to physical forecasting methods, mathematical methods require a great deal of historical data to train statistical models. The traditional statistical method commonly uses a large amount of historical data to model through the steps of pattern recognition, parameter estimation and model tests to determine a mathematical model containing a time series and then derive the prediction model [
19]. For example, Miyoshi et al. [
20] combined simulation with real-world sensor data using statistical mathematics and the dynamic system theory to achieve wind speed prediction using numerical weather prediction (NWP). However, due to the difficulty of setting and estimating parameters of high-order mathematical models, traditional statistical methods are often used as reference models with the improvement of prediction accuracy [
21]. With the development of computer science and data processing technology, the artificial network prediction (ANN) algorithm has been developed and is very effective at predicting wind speed, which has strong self-learning, self-organization and adaptive ability [
22]. Madhiarasan et al. [
23] employed hidden neuron estimation in a multi-layer perceptron network to predict wind speed. The developed neural network model has a compact structure, which can improve forecasting accuracy and accelerate the convergence rate. The accuracy of the mathematical prediction method depends heavily on the number of sample data, which results in a large amount of data calculation in the early stage. If the dimension of the prediction parameters is high, the calculation speed of the mathematical prediction is greatly reduced. Another obvious defect is that the mathematical statistical model generally fails to accurately predict extreme weather conditions that are not included in the historical database, which can lead to wind farms failing to respond in time when complex or special weather conditions occur [
24].
As the most primitive and reliable prediction method, physical statistical forecasting is usually based on Numerical Weather Prediction (NWP), which uses basic physical principles such as the law of mass conservation, momentum conservation and energy conservation to forecast wind speed [
25]. In recent years, related commercial companies have developed many models based on numerical weather prediction which have been widely used in commercial projects [
26,
27,
28]. The Global Forecasting System (GFS) developed by the U.S. Environmental Monitoring Center has the widest application range. It mainly uses a series of meteorological data (including temperature, pressure, distribution of obstacles or surface roughness, etc.) to achieve global forecasting by solving complex mathematical models [
27]. With the development of computer science, CFD has been widely adopted as an important tool in various fields, such as physics and medicine, making an indelible contribution to promoting scientific development and improving human healthcare [
29,
30]. Most physical models are based on CFD to simulate atmospheric flow. In the real wind field, solving hydrodynamic equations and obtaining wind speed distribution with high spatial resolution requires much calculation time. Although the reliability of the results can be guaranteed, high accuracy also results in long calculation times and high requirements for computer performance, so it is not suitable for application in wind speed prediction scenarios that require a fast response [
31].
Considering the defects of both models, the technical route of PT (Process Tomography) can be introduced into wind field reconstruction. Wind field reconstruction can be regarded as the application of PT technology in the field of wind resource utilization, i.e., only a small amount of real-time measurement data can be applied to realize the overall reproducibility of physical information, such as the wind direction and wind speed, by solving inverse problems. The inverse problem is usually solved by using sparse measurement data to obtain a large amount of original data. If the characteristics of the original data can be obtained before the calculation starts, the accuracy and efficiency of reconstruction will be greatly improved. The application of reconstruction in the field of short-term wind prediction addresses the uncertainty of statistical methods and large computational loads of physical methods, but the application of reduced models is urgently needed. With the development of the theory of computational mathematics, the research on reduced-order models has been quite in-depth. POD and Tucker decomposition, with their convenience and high precision, have become the most mature means of order reduction in many models and have been widely used in the field of matrix and tensor [
32]. For example, Jiang et al. [
33] employed POD to combine sparse sensor observations with the CFD database to quickly estimate the airflow field. Qin et al. [
34] employed singular value decomposition (SVD) and principal component analysis (PCA) to reduce the dimensions of the wind speed database and combined sparse sensor measurements to achieve high-precision wind field reconstruction. Zhang et al. [
35] proposed a sensor and CFD database fusion technology based on Tucker decomposition, which achieved the purpose of obtaining a 3D wind field from sparse measurements.
Although these approaches have been extensively studied in different fields, they are all based on steady-state physical fields. Significantly, the steady-state condition is unreasonable in real wind farms, especially for completely unknown incoming flows. In addition, in previous studies, the systematic comparison and analysis of POD and Tucker decomposition in 2D and 3D wind field reconstructions have not been enough, especially when analyzing the reconstruction efficiency and impact of the CFD database on reconstruction results.
Motivated by the above problems, the main research goals of the present paper contain two parts: First, we verified the reliability of POD and Tucker decomposition in a multi-dimensional unsteady wind field reconstruction. Second, we explored the factors affecting reconstruction errors to optimize the reconstruction results of multi-dimensional unsteady wind fields.
To achieve these research goals, an unsteady environment is created by changing wind speed inlet conditions, and the POD and Tucker decomposition can be employed for 2D and 3D unsteady wind field reconstructions, respectively. In addition, based on the given wind field environment, the factors influencing the results of reconstruction can be systematically analyzed. Through simulation and detailed discussion, both methods enabled the accurate and fast reconstruction of multi-dimensional unsteady wind fields, providing a new direction in which the instability of statistical models and the time-consuming physical model can be solved.
The chief originality of this study can be summarized as follows: Firstly, the combination of offline CFD simulation and reduced-order model can greatly save calculation time while ensuring the accuracy of reconstruction, which creatively combine wind field reconstruction with short-term wind forecasting, addressing the uncertainty of statistical methods and large computational loads of physical methods. Secondly, different from the previous studies on steady wind field reconstruction, the methods presented in this paper provide a solution for reconstruction of an unsteady wind field, especially for completely unknown incoming flows, greatly enhancing the practical application value of wind field reconstruction in short-term wind prediction. In the wind power industry, the presented methods can be used in the wind energy resource evaluation, wind turbine performance diagnosis, and optimization of wind turbine arrangements.
3. Reconstruction Results Based on CFD Wind Field Database
3.1. Construction of Simulated Wind Field Environment
To validate the reliability of the proposed methods in unsteady complex wind fields, a wind tunnel model with a complex terrain was simulated using hemispherical and sinusoidal models.
Figure 2 shows the isometric diagrams of the wind tunnel in detail. Additionally,
Figure 3 shows the specific dimensions of the geometric model and wind tunnel. The wind speed at the entrance of the wind field could be considered uniformly distributed, and the wind direction was defined as the angle between the wind speed direction and the positive half-axis of the X-axis.
The specific dimensions of the analyzed spaces are shown in
Figure 3, with a length of 400 m, a width of 300 m, and a height of 100 m. In addition, a simulated mountainous environment consisting of obstacle models is shown in the figure. The bottom radius and height of the hemisphere was 15 m, and the bottom center coordinate was (−30 m, 0 m). The sinusoidal model was formed through the rotation of a sinusoidal curve around the central axis, with a bottom radius of 20 m, and the center position of the bottom was symmetrical with the hemisphere along the y-axis. The distance between the sinusoidal model and the hemispherical model was 25 m.
3.2. Construction of Simulation Database
In this study, the most critical step was the establishment of a wind speed database, from which sufficient database information can ensure accurate feature extraction. In this paper, the two-dimensional wind speed distribution was represented by the plane above the model, and the 3D space of the area was expressed by five stacked planes. As described in
Section 2.1 and
Section 2.2, the wind speed distribution simulated by CFD with different inlet wind speeds were stored in the matrix database
U and the 4-D tensor database
V, respectively. A continuity equation and momentum conservation equation were used to simulate the velocity distribution of the wind fields. The corresponding equations are as follows:
Momentum equation:
where
is the fluid density,
,
, and
are the velocity component at coordinates (
x, y, z) at time t, respectively.
is the pressure, and
is the unit mass force.
is the dynamic viscosity. Although the three-dimensional transient Navier-Stokes equation can be used to describe turbulence, its non-linear application cannot accurately describe the time-dependent fluid change characteristics. In engineering applications, turbulence is the main factor causing the change of mean flow field and more complex models are needed to describe the overall flow effect. Therefore, the turbulent model based on Reynolds averaging Navier-Stokes (RANS) equations is required to solve the flow field. In RANS, the instantaneous velocity values are decomposed into time-averaged velocity and fluctuation components.
The above equation is substituted into the continuity equation and momentum equation for time-averaged calculation, and the continuity equation and momentum equation for time-averaged flow are obtained:
In order to close the control equation, the number of additional turbulent flows introduced needs to be the same as the number of additional differential equations solved. According to the number of additional differential equations solved, eddy viscosity modes can generally be divided into four categories: zero equation mode, half equation mode, one equation mode, and two equation mode. The standard model in the two-equation model is widely used in engineering flow field calculations due to its advantages of wide applicability, low computational cost, and reasonable accuracy.
For boundary conditions, the inlet type was chosen to be the ‘velocity-inlet’, and prepared UDF was imported, with the outlet set to the ‘pressure-outlet’. Referring to previous CFD calculations on slip conditions and cavitation, since the wind field above 20 m above the obstacle model is considered in this paper, which is beyond the scope of the boundary layer and does not involve the wind speed distribution around the wall, neither the obstacle model nor the wind tunnel wall in this study involves slip and cavitation [
45]. Moreover, the turbulence intensity and viscosity ratio of the inlet were set to 5% and 10%, respectively.
In this study, wind field databases were constructed with different inlet boundary conditions, and the sine equation was used to define the variation in the inlet speed with time:
where
v is the inlet speed and
v0 is the initial speed. The velocity was assumed to be uniform in the inlet section. Based on the above equation, the difference in the inlet speed was mainly reflected by two aspects: a different initial speed and a different inlet angle. Therefore, in this study, the wind speed database was constructed by changing the initial speed and the inlet angle. Three groups of the initial velocity and incident angle were set at inlet boundary conditions, which were 5 m/s, 45°; 10 m/s, 90°, and 15 m/s, 135°, respectively.
In the CFD calculation, the time step was set to 0.1 s; in total, 500-time steps were set, and the data were saved once at every 0.2 s interval. After the calculation was complete, the wind velocity distribution in the 8 s–16 s time interval of 41 moments was selected to form a database under boundary conditions. Finally, the three databases were combined into a complete wind field database. The dimensions of the matrix database U could be represented as , where n represented the selected measurement points, and N was the number of snapshots. The values of n and N were 10,000 (100 points in the x-direction and 100 points in the y-direction) and 123, respectively. The selected plane was 22 m high and 120 m × 100 m in size. In addition, the dimensions of the tensor database V were , where and represented the measurement points selected along the X and Y direction, respectively, while and N represented the number of selected planes and snapshots, respectively. The values of , , and N were 100, 100, 5, and 123, respectively. The five planes were 22 m, 25 m, 30 m, 35 m, and 40 m in height and 120 m × 100 m in size. The reconstruction results and error analysis were based on the two databases.
3.3. CFD Calculation
Before the reconstruction calculation, two important preparations needed to be completed: mesh division and fluent parameter setting.
The meshing strategies used in CFD calculations can be roughly divided into two categories: structured meshing and unstructured meshing. The characteristic of structured meshing is that the connection relationship between each node and adjacent points is fixed and implicit in the generated grid, so it is not necessary to specifically set data to confirm the relationship between nodes and adjacent points. The main advantages of structured meshing are as follows: (1) Fast generation of a grid; (2) the quality of the generated grid is good; (3) the data structure is simple; (4) most fitting methods for surface or space removal are parameterized or spline interpolation, and the area is smooth and closer to the actual model; (5) it is easy to achieve the boundary fitting of the region, which is suitable for the calculation of fluid field. Unstructured grid generation technology is mainly used to compensate for the lack of structured grids that cannot solve the mesh generation of arbitrary shapes and connected regions. However, the generation technology of unstructured grids is relatively complex, and currently only the automatic generation technology of planar triangles is relatively mature. The structure of the wind tunnel model in this study is relatively simple, and the geometric shape of the obstacle model is relatively regular. In structured meshing strategies, quadrilateral meshes are generally used for two-dimensional geometry and hexahedral meshes for three-dimensional geometry. Therefore, the hexahedral structured mesh is chosen as the meshing strategy, and the mesh generation tool is ICEM CFD. The mesh was encrypted around the models and boundary, and “o-block” was performed near the hemispherical and sinusoidal models. The partitioned locally encrypted hexahedral structured grid and boundary layer grid are shown in
Figure 4.
In addition, the mesh independence test is crucial in the mesh division. The mesh independence test can be stated as a mesh convergence study obtained through identical models with different elements, including coarse elements (small number of elements) and fine elements (large number of elements) [
46]. It aims to select computational models with an optimal number of elements, and not too many, in order not to burden the computational load whilst still being able to provide accurate results [
47,
48]. In the mesh independence test, 2,694,420 units were considered the standard. For comparison, three dense-to-sparse grids were designed: 1,133,561, 546,184, and 22,385. Five planes with 50,000-speed values were selected for comparison. The relative errors between the calculated speed values of the three groups of mesh and those of 269,420 units are shown in
Table 1. Obviously, when the number of units reached 546,184, the relative error decreased to about 0.3%. If the number of units increased to 1,133,561, the error could be further reduced, but the calculation cost would still be multiplied. Therefore, considering both the accuracy and time cost, 546,184 elements were selected for fluent calculation. The corresponding number of nodes and edges is 571,284 and 194, respectively.
After importing the drawn mesh into the fluent, the calculation parameters were set in detail as follows: The ‘Pressure-Based’ method and ‘Unsteady’ were selected as the solver. The solution method and gradient were chosen as ‘PISO’ and ‘Least Squares Cell Based’, respectively. Momentum, turbulent kinetic energy, and turbulent dissipation rate were all selected as the ‘Second-Order Upwind’. The time step was set to 0.1 s; in total, 500-time steps were set, the case and data were automatically saved once at every 0.2 s interval, and the convergence residuals were set to 1 × 10
−3. Finally, to determine an appropriate turbulence model, three boundary conditions and five typical turbulence models (standard K-ω, SST K-ω, realizable K-ε, RNG K-ε and standard K-ε) are selected. Based on the standard K-ε, the calculation results of the other four models at 18.6 s are shown in
Table 2. It can be concluded from
Table 2 that the calculation results of the five models are very similar. However, standard K-ε has the advantages of low computational requirements, strong computational stability and fast convergence speed. Thus, standard K-ε was chosen as the viscous model.
3.4. Presentation of Reconstruction Results
After the construction of the wind field database was complete, the two-dimensional and three-dimensional wind fields were reconstructed using the database in this section. To verify the reliability of the reconstruction methods, three boundary conditions were designed to validate the reconstruction, which were: 13 m/s, 30°; 8 m/s, 90°; and 17 m/s, 150°. The wind speed distribution at 18.6 s was selected as the reconstruction validation and result presentation. The reconstruction results were presented mainly from a comparison between the reconstructed nephograms and the CFD nephograms, as well as the reconstruction error of the three boundary conditions.
Summarizing the two reconstruction methods, the number of basis vectors and the dimension of core tensor, the number of sensors and the sensor measurement noise need to be determined before calculation. The number of basis vectors was determined to be 10, and the core tensor dimensions were set to [10, 10, 5, 10]. To facilitate the display of sensor locations in nephograms, 20 sensor data were selected as measurements in this study, which only accounted for 0.2% and 0.04% of the total data of the 2D and 3D wind fields, respectively. Finally, the measurement noise
was defined by Gaussian noise:
Here,
is the standard deviation, and
N is the normal distribution function in relation to
, and the noise level can be defined by
:
The noise level was set to 10% in this section. To ensure accurate results, 1000 calculations were performed for each reconstruction to obtain an average. In this section, the results of the reconstruction are presented to preliminarily verify the feasibility of the proposed method. The factors affecting reconstruction accuracy are discussed in the next section.
After determining the relevant calculation parameters, the velocity nephograms and the relative reconstruction error were obtained by calculation. In this study, the reconstruction errors of the absolute velocity u and the component velocity in three directions, i.e., the component velocity ux in the X direction, the component velocity uy in the Y direction, and the component velocity uz in the Z direction, are shown. As a representative, the two- and three-dimensional nephograms of the absolute velocity under three boundary conditions are shown.
Figure 5a–f and
Figure 6a–c show the 2D and 3D nephograms of reconstructed and simulated velocity distribution, respectively. The reconstructed nephograms in
Figure 5 can clearly restore the distribution characteristics of wind speed, and the range of velocity variation is highly consistent with the preset CFD wind field. In addition, it can be seen from
Figure 6 that the 20 sensors were uniformly distributed on five planes, and the reconstructed nephograms obtained were highly consistent with the nephograms calculated by CFD. Notably, in order to ensure the reliability of the reconstruction results, the sensor locations under the three boundary conditions were fixed during the reconstruction process.
A large number of nephograms in
Figure 5 and
Figure 6 have preliminarily validated the reliability of the two reconstruction methods.
Table 3 and
Table 4 further explore the accuracy of reconstruction in different dimensions and the reconstruction errors of different velocity components. Four conclusions can be drawn by analyzing the reconstruction errors in
Table 3 and
Table 4: (1) The reconstruction errors of absolute velocity with the three boundary conditions are quite low (all lower than 0.8%). It is important to note that only 20 measurements were available that fully proved the reliability of the two reconstruction methods in their respective application fields. (2) In 2D and 3D reconstruction, the error of
uz was the largest, since the absolute value of
uz was too small, which resulted in the inaccurate feature extraction of
uz database. (3) The reconstruction error in the X-direction of the boundary condition 8 m/s, 90° was generally large since the component velocity in the X direction was small when the incidence angle of the wind was 90°; this is similar to the reason analyzed in conclusion 2. However, simulated wind fields are often idealized, whereas the wind direction in the real wind field environment is random, and a completely vertical wind direction angle does not exist. (4) The amount of reconstructed data for the 3D wind field was five times that of the 2D wind field, but the error of absolute speed was lower. This proves the superiority of Tucker’s decomposition when dealing with high-dimensional data and illustrates the importance of efficient feature extraction when solving inverse problems.
In this section, the applications of the two reconstruction methods in two-dimensional and three-dimensional unstable wind field reconstruction are verified by computational simulation. Although computational simulation research is the basis/preliminary study before experimental studies, such research has the advantages of lower cost and faster results compared with experimental testing [
49,
50,
51]. However, it should be acknowledged that the lack of independent experimental validation is the limitation of the present study and it would be more convincing if the present study could be validated by performing an independent experimental study or by comparing with previous experimental/identical studies [
52,
53,
54]. In general, the comparison of nephograms validated the fact that the proposed methods could accurately restore the distribution characteristics of the wind field. Moreover, from the error analysis, it can be concluded that although the reconstruction errors of the multi-dimensional wind field and multi-component velocity are different, the variation in errors was similar, and the reconstruction results of absolute velocity were excellent, which fully verifies the reliability of the proposed method for the fast and accurate reconstruction of 2D and 3D wind fields based on sparse sensors.
4. Discussion and Analysis
The results in the previous section are based on fixed solution conditions, e.g., the number of basis vectors, the 4-mode dimension of the core tensors, the noise level, the database dimensions, etc. In this section, the factors affecting the reconstruction results are systematically analyzed to improve the reconstruction accuracy of both methods.
4.1. Presentation of Reconstruction Errors at All Time Steps
In
Section 3.4, only the characteristics of wind speed distribution and reconstruction errors at 18.6 s are shown. Therefore, before discussing the factors influencing the reconstruction results, this section first reconstructs the wind field distribution at all times (i.e., from 16.2 s to 20 s) and integrates the reconstruction errors.
Figure 7a–c and
Figure 8a–c show the relative error of
u,
ux, uy, and
uz with three boundary conditions using POD and Tucker. Both figures are double Y-axis diagrams, from which the following conclusions can be drawn:
(1) Under different boundary conditions, the reconstruction results of absolute velocity and the X-direction component velocity are satisfactory, but the reconstruction results of the Y-direction and Z-direction are poor, mainly because the absolute values of component velocity in these two directions are too small, the distribution gradient is large, and the wind of component direction almost only exists around the obstacle model, which brings great difficulty to the feature extraction of the database.
(2) By comparing the three figures in the same group, it was found that the result of speed reconstruction with the boundary condition (8 m/s 90°) was worse than the other boundary conditions, and the error fluctuated greatly with the change in the inlet speed. This was mainly due to the insufficient convergence of the fluent calculation results, thus affecting the quality of the built database, which showed the important role of the pre-calculated database quality in wind field reconstruction.
(3) By comprehensive comparison with
Figure 7 and
Figure 8, the amount of reconstruction data processed by Tucker decomposition was larger, but the reconstruction effect for the absolute speed was better than POD. This is due to the fact that after rearranging the matrix database into tensors in multi-direction, multi-plane and multi-time order, the data between the dimensions were more closely connected, and the wind speed distribution characteristics, including spatial-temporal information correlation, could be obtained more effectively.
(4) Interestingly, the reconstruction error curves shown in the two figures show a strong periodicity with a period of approximately 2 s; this appeared to be inextricably linked with the inlet speed. Therefore, in order to explore the relationship between the inlet speed variation and error period,
Figure 9 shows the curves of inlet speed over time with three boundary conditions. Comparing this with
Figure 7,
Figure 8 and
Figure 9, we found the distribution characteristics of the error period. The period of error variation was roughly the same as that of speed variation, but the period of the peak and trough was exactly the opposite, i.e., the reconstruction error decreased with increasing speed and increased with decreasing speed. Essentially, this conformed to the rule of physical field reconstruction based on the feature extraction method, i.e., the more stable and powerful the physical field was, the easier it was to be reconstructed efficiently.
4.2. Influence of Basis Vectors and Core Tensor Dimension on Reconstruction Performance
In the reconstruction calculation, the basis vectors and 4-mode dimensions had a profound impact on the calculation time and reconstruction accuracy, as it affected the solution of the modulus coefficient in Equations (11) and (22). Therefore, in this section, the number of basis vectors, 4-mode dimensions and noise level are combined to explore the reconstruction results under different conditions at 18.6 s to determine the optimal number of eigenvectors.
First, with fixed noise levels, the effects of the basis vectors and the core tensor dimension on the results need to be discussed. Notably, the reason behind choosing the 4-mode dimension for exploration is that it directly affects the dimension of the module coefficient
b, which, in turn, affects the reconstruction accuracy and computation time. In this section, the number of basis vectors and 4-mode dimensions was determined to be 1–20. The number of sensors was 20, and the noise level was 10%. Finally, the reconstruction error curves of POD and Tucker decomposition were obtained. As shown in
Figure 10, when the noise level was fixed, the influence of this change in the basis vector and the 4-mode dimension on the error was almost the same. In addition, as the basis vector and 4-mode dimension increased, the reconstruction error decreased significantly, but when the dimension increased to five, the error showed an upward trend. This is because increasing the number of eigenvectors results in a higher dimension of Φ and T, while the eigenvectors corresponding to the minimum eigenvalue are usually associated with noise, so discarding it can significantly improve the reconstruction accuracy.
After exploring the influence of different numbers of eigenvectors under the same noise level on the reconstruction error, it was necessary to further verify the robustness of the proposed reconstruction methods. In this section, the noise level increased from 1% to 20%, with an interval of 1%. Meanwhile, the number of basis vectors and 4-mode dimensions was set to 2, 4, 6, 8, and 10. It was assumed that noise only existed in the measurements and not in the pre-computed database, and the corresponding reconstruction error results are shown in
Figure 11. From this, the following conclusions can be drawn:
Obviously, the relative reconstruction error increased significantly with the increase in the noise level. However, the error changed most smoothly when two eigenvectors were taken, although the error was the largest. Moreover, for larger 4-mode and basis vectors, the curve rose more sharply with the increase in the noise level. This feature illustrates that the reconstruction errors with larger basis vectors and 4-mode are more sensitive to noise, which confirms the conclusion of
Figure 10. In addition, a turning point can be found in
Figure 11, which shows that when the noise level is higher than 5%, larger basis vectors and 4-mode result in poorer reconstruction results, beyond which the effect of the eigenvectors on the errors reverses. In general, the increase in basis vectors and 4-mode dimensions was beneficial to the reconstruction result, but such an excessive increase may enable the reconstruction error to be more sensitive to noise. Notably, referring to
Figure 11, when the measurement noise level was 10%, it appeared that the number of basis vectors and the 4-mode dimension should be determined to be six. Significantly, although the noise level reached 20%, the reconstruction error was still within an acceptable range (mostly below 1%). From this perspective, both reconstruction methods based on POD and Tucker decomposition had excellent noise resistance and could be regarded as a filter.
4.3. Influence of Simulation Database Construction on Reconstruction Performance
In the reconstruction calculation and error analysis of this paper, one critical factor has not been considered, i.e., the construction of a wind field database. In the simulation, data acquisition and database construction were easy to achieve, but in the actual wind field environment, due to the limitation of calculation time and measurement costs, an ideal wind field database was difficult to obtain. Therefore, at the end of this chapter, the impact of different database construction methods on the reconstruction error is discussed, which is of great significance for the application of the proposed reconstruction methods in the real wind field.
Based on the three different initial speed and incident angles set up in
Section 3.2, five matrix databases and five tensor databases are outlined in this section. The construction principles are as follows: (1) The wind velocity distribution calculated by the three boundary conditions was taken as a database separately, and the two databases were named
and
, respectively, in the order of 45°, 5 m/s, 90°, 10 m/s, and 135°, 15 m/s; (2). The wind speed distributions from 13.2 s to 16 s of the three databases were spliced together to form a low-dimensional version of the fused database, named
,
; (3) All the calculated wind velocity distributions were sliced together to form a full-scale wind field database named
,
. Then, based on the newly established five matrices and tensor databases, the reconstruction errors of absolute velocity at 18.6 s in the three boundary conditions were calculated using the proposed reconstruction methods. Notably, the number of sensors used in the calculation, the noise level, the basis vectors, and the 4-mode dimension were all in accordance with
Section 3.4. Detailed calculation results are shown in
Table 5 and
Table 6, and the following conclusions can be drawn:
(1) Obviously, the corresponding reconstruction results of the database with a single boundary condition were far inferior to that of the mixed database, which demonstrates the necessity of building multiple types of databases.
(2) By analyzing the reconstruction errors corresponding to , and , it was found that the wind direction angle had a higher influence on the reconstruction results than the initial velocity, i.e., the database features that were constructed needed to be as close to the actual wind field as possible to produce better reconstruction results.
(3) By comparing and , it was found that the reconstruction errors based on a hybrid database and full-size database were almost the same, but the calculation time of the two databases was quite different (the calculation time based on is three times that of ), which indicates that in the wind field environment with prominent characteristics of wind speed distribution, the constructed database needs to contain rich speed distribution information under different boundary conditions rather than multi-time data, so as to improve the reconstruction efficiency.
(4) Finally, by comprehensively comparing the reconstruction results of both tables, it was found that the reconstruction results based on Tucker’s decomposition were significantly better than those based on POD, which proves the superiority of Tucker decomposition when extracting multi-dimensional tensor data features and demonstrates its lower dependence on the database.
Therefore, the following factors need to be considered: the wind speed distribution under multiple boundary conditions should be included as much as possible because the multi-category database construction has more advantages than the database with single-entry boundary conditions in the reconstruction calculation. Additionally, in the construction of a hybrid database, redundant database information needs to be removed in consideration of time costs.
At the end of this section, the reconstruction time with a different number of sensors based on POD and Tucker’s decomposition was discussed, which was crucial for the reconstruction of the unsteady wind field. Based on the boundary condition 90°, 8 m/s, four groups of sensors (5, 10, 15, 20) were designed to discuss the time required for reconstruction, and the results are shown in
Table 7. With the increase in the number of sensors, the time required for reconstruction also increased accordingly. In addition, since 3D wind data were five times larger than 2D wind data, the reconstruction time based on Tucker’s decomposition was about five times that based on POD. When the number of sensors was compressed, such as five, the reconstruction time came close to 0.2 s and the time interval of data acquisition. This is of great significance in the reconstruction of the unsteady wind field. Because the reconstruction time can be further compressed when the computer performance is strong enough, it is possible to synchronize data acquisition with wind field reconstruction, which could promote the effective application of this method for short-term wind prediction.
4.4. Comparison of the Similar Methods
To further verify the superiority of POD and Tucker decomposition in wind field reconstruction, two existing reduced-order models are introduced: NMF (Nonnegative Matrix Factorization) [
55] and HOSVD (Higher-Order Singular Value Decomposition) [
56]. NMF is applied to the reconstruction of 2D wind speed distributions and compared with POD. The reconstruction result of HOSVD is compared with Tucker decomposition. Taking the three boundary conditions in
Section 3.4 an example, the reconstruction errors of absolute velocity for the five methods are shown in
Table 8 and
Table 9.
From
Table 8, although both methods perform well in reconstructing absolute velocity, the relative reconstruction errors based on POD are the smallest. This is because although the principle of NMF is similar to POD, NMF is better at the non-negative matrix field in image processing, which guarantees that the extracted base vector is non-negative at the expense of losing some of the original information in the database. Therefore, POD is more advantageous in controlling the error of two-dimensional wind field reconstruction. From
Table 9, the reconstruction results obtained by Tucker decomposition are significantly better than those obtained by HOSVD. This is because HOSVD is more difficult to control decomposition accuracy in tensor decomposition. In addition, the computational complexity of HOSVD is significantly higher than Tucker decomposition, which requires more time costs. Tucker decomposition has advantages over HOSVD in both calculation accuracy and time cost, so it is more suitable for reduced-order calculation of a wind field database. In general, the superiority of POD and Tucker decomposition in 2-D and 3-D wind field reconstruction is verified by comparing the reconstruction errors of similar reduced-order models.
5. Conclusions
In the present paper, two innovative wind field reconstruction methods combining CFD and reduced order model were explored based on a tailored library of features extracted from the matrix and tensor databases. In this study, 2D and 3D wind field reconstruction algorithms were developed based on POD and Tucker decomposition, respectively. Both procedures expertly and quickly obtained wind speed information for the entire area for reconstruction using extremely limited sensor measurements, which are of great significance due to their application in ultra-short-term wind prediction. A simulation and detailed discussion were performed to validate the practicability of the algorithms. The major findings are as follows: Firstly, the reconstruction results of three boundary conditions validate that the proposed reconstruction algorithms could accurately restore the distribution characteristics of the unsteady wind field based on sparse sensors. Secondly, rearranging the matrix database into the tensor can make the data between the dimensions more closely connected, and the spatial-temporal information correlation can be obtained more effectively. Thirdly, appropriately increasing the basis vectors and 4-mode dimension reduces the reconstruction error, while excessive eigenvectors reduce the noise resistance of the system and affect the reconstruction accuracy. Fourth, in the wind field environment with prominent characteristics of wind speed distribution, the constructed database needs to contain rich speed distribution information under different boundary conditions rather than multi-time data. However, in the more complex wind field environment, diverse boundary conditions and a large number of instantaneous wind speed data are necessary. Finally, by controlling the number of sensors and improving computer performance, the reconstruction time can be reduced to the time interval of data acquisition so that data acquisition can be synchronized with wind field reconstruction, which is greatly significant in the reconstruction of the unsteady wind field.
In general, the advantage of the proposed unsteady wind field reconstruction methods is that they can greatly save calculation time (i.e., realize the synchronization of data acquisition and wind field reconstruction) while ensuring the accuracy of reconstruction, addressing the uncertainty of statistical methods and large computational loads of physical methods, which can be used in the fields of wind energy resource evaluation, wind turbine performance diagnosis and optimization of wind turbine arrangement, etc. However, it should be acknowledged that the shortcomings of the proposed algorithm are still prominent. For example, for more complex terrains and more variable entrance boundary conditions, the feasibility of the reconstruction methods needs to be verified. Moreover, in order to achieve ultra-short-term wind prediction using unsteady wind field reconstruction, there are still many studies that need to be carried out, such as experimental validation and the construction of databases using real wind field measurements. Therefore, in future research, the reliability of the reconstruction algorithms will be verified using wind tunnel experiments and real wind field data.