1. Introduction
Turbulence is a hazardous weather phenomenon and is the main cause of weather-related reported aircraft incidents and accidents [
1]. The impact of the turbulence depends on the eddy magnitude and size, aircraft type specific parameters (such as weight, wing span, airspeed, etc.), and the phase of the flight during which the turbulence occurs. Although there are a wide range of eddy sizes in the atmosphere, aircraft mainly respond to eddies having sizes in the order of tens to hundreds of meters. Eddies of this size can be created by an energy cascade [
2], i.e., a larger eddy in the atmosphere breaks up and forms a smaller eddy, which eventually dissipates as heat.
Forecasting turbulence is of great interest for scientific research. A number of Numerical Weather Prediction (NWP)-based turbulence forecast techniques have been developed in recent decades [
3,
4,
5]. Because the scale of interest for turbulence forecasting is in the order of hundreds of meters, which is outside the current scope of state-of-the-art operational NWP, these turbulence forecasts are often still diagnostic based, i.e., forecasting the weather quantities that are prone to create an eddy of the right size to result in an impact with an aircraft [
5]. As a result, parameters in these forecast techniques often need to be “calibrated” using turbulence observations.
Observations of turbulence fall into two categories, i.e., subjective and objective observations. Subjective observations mainly take the form of pilot reports. Although turbulence experienced by the pilot must be of the right eddy size to result in impact, such observations are highly aircraft-type dependent. As aircraft response decreases with wing loading and increases with altitude and airspeed [
1,
6], the same atmospheric eddy can result in a drastically different impact for different aircraft. The same atmospheric eddy may be reported as causing severe turbulence by pilots of smaller aircraft, whereas pilots of large aircrafts would barely feel its effect. A previous study found that pilot reports are considered inadequate for observing turbulence if used alone [
7].
Turbulence can also be measured objectively, independent of the aircraft type, in turbulence metrics such as the cubic root of the eddy dissipation rate (EDR) [
8] and derived equivalent vertical gust velocity (DEVG) [
9,
10,
11,
12]. Both ICAO and WMO adopted EDR as the official turbulence reporting metric. Two notable EDR estimation algorithms exist in the literature, i.e., the algorithm developed by the National Aerospace Laboratory (NLR) in the Netherlands (the NLR Algorithm), which can estimate EDR using data recorded in the quick assess recorder (QAR) [
13], and the algorithm developed by NCAR (the NCAR Algorithm) [
14,
15], which was adopted by Boeing and is implemented on Boeing aircraft.
Although both algorithms estimate the EDR from estimates of the vertical component of wind in the flight data, differences exist in the underlining assumptions of the two algorithms. Differences in the EDR estimates, if any, can easily be misinterpreted as differences in the turbulence climatology of geographical regions due to differences in the spatial coverage of the data. The NCAR Algorithm provides a more global coverage of the EDR estimates because it is implemented in the onboard software of Boeing aircraft in a number of large airlines, and the EDR data can be shared among airlines through the IATA Turbulence Aware Program. By comparison, the EDR estimates from the NLR Algorithm are more local because the algorithm is applied on flight data recorded in the QAR, which is more dependent on the operating region of the airline. The difference in the EDR estimates can also lead to sub-optimal tuning of parameters in NWP turbulence forecasts, which can impact the confidence of the users in the forecast products.
This article compares the estimated EDR values between the two algorithms. The article is organized as follows:
Section 2 provides a brief description of the two algorithms; the comparison results are shown in
Section 3, with highlights of the differences between the two EDR estimates, followed by a discussion of the results in
Section 4. The article is summarized in
Section 5.
2. Methods and Data Sets
Although both NCAR and NLR Algorithms estimate the EDR from the wind, there are subtle differences between the two methods. The differences are discussed below.
2.1. Brief Description of the NLR Algorithm
The NLR Algorithm is a wind-based algorithm that calculates the EDR (
) using:
where
is a low-pass filtered airspeed that represents the average flying speed, and
is the running standard deviation of the band-pass-filtered vertical wind variations computed with cut-off frequencies
and
, which correspond to 0.1 and 2 Hz in this study. The sliding window has a length of 10 s. It is noted that the aircraft QAR does not directly record wind data and that the wind is a minimum variance estimate from the data available in the QAR. The vector of the wind data
, and hence the vertical component of the wind, is estimated by:
where the inertial speed vector
is in the Earth reference frame and is estimated by the ground speed and track angle. The aerodynamic speed vector
is estimated by aerodynamic speed, angle of attack, and sideslip angle, and is originally calculated in the aircraft body reference frame, which needs to be transformed back to the Earth reference frame using Euler angles. Interested readers should refer to [
13] for details.
2.2. Brief Description of the NCAR Algorithm
The NCAR Algorithm is a vertical wind-based algorithm that calculates the EDR (
) using:
where the summation is done over the frequency domain with
frequencies,
is the power spectrum of the vertical wind component,
is the bias-correction term, and
is a reference model spectrum in which the von Karman spectral model valid for an incompressible isotropic turbulent velocity field is used. The vertical wind component is estimated in a manner similar to that shown in Equation (2) except that the sideslip angle is assumed to be 0. The NCAR Algorithm is implemented by Boeing on the B777 aircraft and the authors do not have access to the details of the implementation.
The functional forms of both NCAR and NLR Algorithms match well in terms of their dependence on the vertical wind speed. However, the reference model spectrum and the assumption regarding the sideslip angle differ, and are potentially some of the sources of the differences observed in the comparison study below.
2.3. Vertical Acceleration Experienced by the Aircraft
Because pilots report the severity of turbulence based on their subjective feeling of the degree of shaking during a flight, we also examined the vertical acceleration recorded by the QAR to see how well it correlates with the EDR estimates. In this study, the root-mean-square of Acceleration (RMS-g) was computed from the B777 vertical acceleration recorded in a 10 Hz data frequency. To avoid underestimating the severity of short events while reducing the variations in the calculation, a running 5 s averaging window was used in the computation of RMS-g [
16].
2.4. Data Sets in the Comparison Study
The NCAR Algorithm runs on the onboard computer of the B777 fleet of Cathay Pacific and outputs through a real-time data downlink. There is thus a trade-off between data volume and data representativeness. In the current implementation, the NCAR Algorithm outputs, at most, every minute for its 1 min mean and peak EDR estimates from its internal data. Regarding the NLR Algorithm, because the software we use runs in a post-analysis mode, there is no limitation due to the bandwidth of the real-time data downlink and the software outputs its EDR estimates in a 4 Hz data frequency. For any particular flight, significantly more data exists from the NLR Algorithm than from the NCAR Algorithm. To match the output data for the two algorithms on the flights, we down-sampled the output from the NLR Algorithm. For each datapoint from the NCAR Algorithm, we computed the past 1 min peak EDR estimates from the NLR Algorithm of the same time. The peak EDR estimates were used, due to their higher sensitivity, to highlight any discrepancy between the two algorithms. For the vertical acceleration, the past 1 min peak RMS-g was computed from the 10 Hz vertical acceleration data recorded by the B777 over the same time period.
4. Discussion
Figure 3 shows a close match in terms of the frequency distribution of the EDR estimates. Compared with the frequency distribution of the EDR estimates from the NCAR Algorithm, the frequency distribution of the EDR estimates from the NLR Algorithm is slightly biased to the right. However, this bias is inconsequential for the reporting of turbulence from the pilots because the turbulence severity at which this bias exists is well below the threshold for reporting any significant turbulence. There is also a slight excess in the relative frequency, compared with the NLR EDR, for the NCAR EDR estimates for EDR values around 0.1, and the origin of this excess is yet to be determined.
The frequency distribution of EDR in the atmosphere is expected to follow a log-normal distribution [
15]:
However, the frequency distributions of EDR estimates from both of the algorithms shown in
Figure 3b are positively skewed, whereas they are expected to be normally distributed when plotted in log scale for the horizontal axis. Hence, both EDR estimates in our dataset do not fit well to Equation (4). The derivation from the normal distribution is expected due to the bias towards higher EDR estimates caused by the sampling and reporting strategy, i.e., the inclusion of only turbulent flights in the study and the enhanced EDR reporting frequency for the NCAR Algorithm when the flight encountered turbulence. The values of
and
estimated by fitting the data to Equation (4) using Powell’s method are −3.59(−3.41) and 1.11(0.90), respectively, for the NCAR(NLR) algorithm, and are roughly comparable with those in [
15].
The scatter plot in
Figure 4 shows a good correlation between the two EDR estimates. By partitioning the EDR estimates according to the altitude and locations as shown in
Figure 5, it was found that most of the disagreements between the two EDR estimates are recorded at lower altitudes (below 10,000 ft). The mean and root-mean-squared differences between the two EDR estimates for the lower altitude were found to be 0.073 and 0.140, respectively, compared with 0.031 and 0.063 for the whole dataset. For the EDR estimates in the middle and high altitude (above 10,000 ft) shown in
Figure 5a,b, the distribution of the points is approximately symmetric over the diagonal line.
In addition, the scatter plots shown in
Figure 5a,b closely resemble each other. The
z score for testing the statistical significance for differences in the correlation is given by:
where
and
are the correlation and sample size of the two samples, respectively. This value was 5.10 (corresponding to a
p-value of
in a two-tailed test) when applied to testing the difference in the correlation shown in these two figures. However, if we limit the dataset and include only those data captured during turbulence (estimated EDR ≥0.1 from both algorithms), the corresponding
z score drops to around 0.23 (corresponding to a
p-value of 0.82 in a two-tailed test) because the sample size drops to almost
of that of the original. The current differences in the correlation between the two algorithms mainly result from those small values of EDR estimates.
For EDR estimates of low levels (≤10,000 ft in altitude), it is clear from
Figure 5c that the NLR Algorithm often estimates a higher EDR than the NCAR Algorithm. The same observations can also be deduced from
Figure 5d, which focus on a single airport (Hong Kong International Airport). These low-level data are recorded during the ascent or descent of the flight. Aircraft usually operate in different settings when ascending or descending, i.e., with a higher flap and more frequent active maneuvering associated with pitching and rolling of the aircraft, than during en route flight. This difference in the aircraft operation settings may be attributed to the greater difference between the two EDR estimates at low levels. However, there are also only a limited number of flights included in this study, and no low-level en route flight is included in the dataset; hence, it is not possible to pinpoint the source of these differences.
The EDR estimates from both algorithms show similar correlations with the RMS-g (
Figure 6) for the datasets as a whole, and the dependence of their correlations with the airspeed and altitude. For the data shown in
Figure 7, among the five airspeed/altitude categories, the lowest one contains flight data recorded during the ascending and descending portion of the flight. It is in this data category that both EDR estimates showed the weakest correlation between the EDR estimates and the RMS-g, due to the larger variation in airspeed/altitude within this category of data. Although the relationship between the EDR, airspeed, altitude, and RMS-g is non-linear, if a multiple linear model is fitted for either EDR estimate + airspeed, or EDR estimate + altitude, against RMS-g, the coefficients of the linear model for the predictors would be similar if we compared the models fitted with NCAR and NLR EDR estimates. The fitted linear models would also suggest an increase in the RMS-g with an increase in airspeed or altitude, matching the theoretical prediction. It is not possible to fit a linear model using each of the three variables of EDR estimate, airspeed, and altitude as the predictors due to the strong correlation between airspeed and altitude.
5. Conclusions
In this study, we compared the EDR estimates from the NLR and NCAR Algorithms. In general, the EDR estimates from the two algorithms showed a good match with each other. However, the correlation of the two EDR estimates showed a dependence on both altitude and location. A further study with a larger dataset is required to pinpoint the source of these differences. A larger dataset can also allow for further partitioning of the dataset.
The NLR Algorithm is also used for estimating the EDR from QAR data recorded by other aircraft types, such as the Airbus A320 and A330. Because these aircraft usually fly shorter routes than the B777, the coverage of the EDR estimates from these aircraft would be different from those from the B777. Thus, differences between the two algorithms should be considered before pooling the EDR estimates for analysis, especially for analyzing low-level turbulence where the differences between the two algorithms is the greatest.