Large Sample Comparison of Parameter Estimates in Gamma Raindrop Distributions

Johnson, Roger W.; Kliche, Donna V.

doi:10.3390/atmos11040333


by Roger W. Johnson ¹,* and Donna V. Kliche ²

¹ Department of Mathematics, South Dakota School of Mines & Technology, Rapid City, SD 57701, USA

² Atmospheric and Environmental Sciences, South Dakota School of Mines & Technology, Rapid City, SD 57701, USA

* Author to whom correspondence should be addressed.

Atmosphere 2020, 11(4), 333; https://doi.org/10.3390/atmos11040333

Submission received: 12 February 2020 / Revised: 25 March 2020 / Accepted: 28 March 2020 / Published: 29 March 2020

(This article belongs to the Special Issue Measurement and Modeling of the Precipitation Particle Size Distribution)


Abstract:

Raindrop size distributions have been characterized through the gamma family. Over the years, quite a few estimates of these gamma parameters have been proposed. The natural question for the practitioner, then, is what estimation procedure should be used. We provide guidance in answering this question when a large sample size (>2000 drops) of accurately measured drops is available. Seven estimation procedures from the literature were examined: five method of moments procedures, maximum likelihood, and a pseudo maximum likelihood procedure. We show that the two maximum likelihood procedures provide the best precision (lowest variance) in estimating the gamma parameters. Method of moments procedures involving higher-order moments, on the other hand, give rise to poor precision (high variance) in estimating these parameters. A technique called the delta method assisted in our comparison of these various estimation procedures.

Keywords:

raindrop size distribution; gamma distribution; method of moments; maximum likelihood; delta method; simulation; correlation

1. Introduction

Knowledge of the raindrop size distribution (RSD) is essential in the retrieval of rainfall properties using radar remote sensing techniques and in understanding and describing the microphysical processes involved in the formation of precipitation. In particular, gamma RSDs have been used, among other investigations, to examine the time evolution of rainfall [1], characterize and quantify relations between such quantities as rainfall rate, radar reflectivity, and liquid water content [2,3,4,5], and help distinguish between stratiform and convective precipitation [6]. Throughout this article, we suppose that the RSD follows a gamma distribution. While the gamma distribution is not always appropriate, it often is (see e.g., [7] and [8]). Given a sample of raindrops, a variety of estimates have been proposed over the years for how this data may be used to estimate the gamma parameters. Among these, method of moments estimates are the most popular as the various moments may be understood as physical quantities [9]. For example, total drop surface area is related to the second moment, total drop volume or liquid water content to the third moment, and the reflectivity factor to the sixth moment. Another estimation procedure is maximum likelihood [10,11,12,13]. In this article, we compared seven different sets of gamma parameter estimates: five method of moments estimates [1,2,3,4,5,6,9,12], maximum likelihood estimates [10,11,12,13], and pseudo-maximum likelihood estimates [14], and did so only for the case of a “large” (>2000) sample of drop sizes, which were accurately measured across the full spectrum of size. All disdrometers are limited by truncation or quantization effects, and hence the ideal RSD, which is assumed in this article, will never be measured. However, a close approximation may be feasible by combining multiple disdrometers that enable sampling of both small and large drops. 
While not yet widely available, the technology to measure drop size quite accurately across nearly the full drop size spectrum is now available and will continue to improve over time. Presently, meteorological particle spectrometers may be used to measure small drop size (100 microns to 1.5 mm) with a resolution of 50 microns [15]. Additionally, third generation 2D-video disdrometers may be used to measure larger drops (0.7 mm and larger) with a resolution of 170 microns [15,16]. The large sample estimate comparisons in this article, then, may be viewed as exact in the presence of perfect drop size information or as a near approximation using the best current technology for recording drop size.

As it turns out, all seven sets of gamma parameter estimates presented below are correct on average in the case of large sample sizes; that is, they are unbiased. How do we decide which of the seven estimation procedures should be used? To settle this issue, we could look at the variabilities, specifically the variances (or their square roots, the standard deviations), to help us choose between estimates. Small variability, of course, is desirable. A technique sometimes referred to as the delta method from the engineering and statistical literature [17,18,19] was used to establish the lack of bias and the variances in all but the case of maximum likelihood.

2. Raindrop Size Distribution and Parameter Estimates

Use of the gamma distribution was proposed by [4,20], among others, as it often gives an appropriate description of the natural variations of the observed RSDs. For the present study, we represent the raindrop sizes as a gamma distribution function where n(D) represents the number of raindrops per unit diameter interval and per unit volume of air

n (D) = N_{T} \cdot f (D; μ, λ)    (1)

and the gamma density is given by

f (D) = f (D; μ, λ) = \frac{λ^{μ + 1}}{Γ (μ + 1)} D^{μ} e^{- λ D}, D > 0    (2)

following the notation in [21]. Here, N_{T} is the total drop number concentration; μ is the shape parameter; and λ is the rate parameter (in mm^{-1} if drop size is in mm). Additionally, Γ refers to the gamma function

Γ (α) \equiv \int_{0}^{\infty} x^{α - 1} e^{- x} d x, for α > 0

which may be thought of as a generalized factorial function since Γ (n) = (n - 1)! for n a positive integer. Assuming a gamma density for atmospheric samples is equivalent to assuming a gamma density for surface samples, albeit with modified shape and rate parameters, under standard models of terminal raindrop fall velocities as a function of size. Further details may be found in [22] (Section 8).
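As a quick numerical illustration of Equations (1) and (2), the sketch below evaluates the gamma density and checks that it integrates to one. The function names and the simple trapezoid rule are illustrative choices, not part of the original study.

```python
import math


def gamma_rsd_density(d, mu, lam):
    """Gamma density f(D; mu, lam) of Equation (2); zero for D <= 0."""
    if d <= 0:
        return 0.0
    # Work on the log scale so that Gamma(mu + 1) cannot overflow.
    log_f = ((mu + 1) * math.log(lam) - math.lgamma(mu + 1)
             + mu * math.log(d) - lam * d)
    return math.exp(log_f)


def number_density(d, n_t, mu, lam):
    """n(D) = N_T * f(D; mu, lam) of Equation (1)."""
    return n_t * gamma_rsd_density(d, mu, lam)


def trapezoid_integral(func, lower, upper, steps=20000):
    """Plain trapezoid rule, adequate for a smooth density on a finite range."""
    h = (upper - lower) / steps
    total = 0.5 * (func(lower) + func(upper))
    for i in range(1, steps):
        total += func(lower + i * h)
    return total * h
```

For example, `trapezoid_integral(lambda d: gamma_rsd_density(d, 2, 5), 0.0, 10.0)` should be very close to one, since the density mass beyond 10 mm is negligible for these parameter values.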

In the maximum likelihood (ML) and pseudo-maximum likelihood gamma parameter estimates to follow, we will need to refer to the digamma function

ψ (x) \equiv \frac{d}{d x} \ln Γ (x)

as well as its derivative

ψ' (x) = \frac{d}{d x} ψ (x),

sometimes referred to as the trigamma function (see, e.g., [23]).

The gamma parameter estimates we compared are listed in Table 1. As a number of these estimates are obtained by the method of moments (MM), some moment notation is appropriate. Given measured raindrop size values D_{1}, D_{2}, \dots, D_{n}, let

m_{j} = \frac{1}{n} \sum_{i = 1}^{n} D_{i}^{j}

denote the j^{th} sample moment, with n the total number of raindrops. In Table 1, the MMrst notation indicates that the method of moments is implemented using moments of order r, s, t, which take distinct values from 0 to 6. In what follows, we will use m_{1} and \bar{D} (the sample mean raindrop diameter) interchangeably. The weighting of moments using division by n above is arbitrary for the method of moments estimates to be discussed, so, for example, volume-weighted moments may be used instead.
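The sample moments above are straightforward to compute. The following sketch also shows one way to simulate drop diameters from the gamma density of Equation (2) with the Python standard library; the helper names are hypothetical, and the simulation is only an illustration, not the procedure used in the article.

```python
import random


def sample_moment(drops, j):
    """j-th sample moment m_j = (1/n) * sum(D_i ** j)."""
    return sum(d ** j for d in drops) / len(drops)


def simulate_drops(n, mu, lam, rng):
    """Draw n diameters from the gamma density of Equation (2).

    random.gammavariate takes a shape and a *scale* argument, so the
    shape is mu + 1 and the scale is 1 / lam (lam being a rate).
    """
    return [rng.gammavariate(mu + 1, 1.0 / lam) for _ in range(n)]
```

With `rng = random.Random(0)` and `drops = simulate_drops(20000, 2, 5, rng)`, the first sample moment `sample_moment(drops, 1)` should land near the population mean (μ + 1)/λ = 0.6.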

Various combinations of three of these moments have commonly been used by atmospheric scientists to estimate the gamma parameters μ, λ, N_{T}. Among these, we find use of the zeroth, first, and second moments [1]; the second, third, and fourth moments [9,12]; the second, fourth, and sixth moments [2,3]; and the third, fourth, and sixth moments [4,5,6]. Table 1 lists the parameter estimates for these four different combinations of moments along with the use of the first, second, and third moments. For ease of presentation, the method of moments estimates for μ and λ in Table 1 are expressed in terms of values of an auxiliary variable a appearing in the last column. The estimates in the last two rows of Table 1 do not make use of an auxiliary variable; to indicate this, these rows have an asterisk in the a column. We point out that each a value in Table 1 is at least one (note the (a − 1) denominators) because of Hölder’s inequality [24] (Equation (3.2.10)), with a equal to one only in the pathological case where all the drop sizes are identical.

Table 1 also includes two other types of gamma parameter estimates: (i) maximum likelihood (ML) estimates discussed, for example, in [11,12,13], and (ii) the pseudo maximum likelihood estimates by Ye and Chen in [14], who determined the maximum likelihood equations for a more encompassing generalized gamma distribution and then specialized these to the case of an “ordinary” gamma. The ML estimate for the shape parameter is obtained by numerically solving the equation

\ln [\bar{D} / {(\prod_{i = 1}^{n} D_{i})}^{1 / n}] = \ln ({\hat{μ}}_{ML} + 1) - \frac{Γ' ({\hat{μ}}_{ML} + 1)}{Γ ({\hat{μ}}_{ML} + 1)}    (3)

(see e.g., [12]). The maximum likelihood estimate of the rate parameter is then given by

{\hat{λ}}_{ML} = ({\hat{μ}}_{ML} + 1) / \bar{D} .

Table 1 only lists estimates of the gamma parameters μ and λ, partly because the large sample variance expressions for {\hat{N}}_{T} are especially complicated when using moment estimators, and partly because large sample maximum likelihood theory is developed only for estimates of density parameters (i.e., parameters appearing within just the density function f in Equation (1)).
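Equation (3) has no closed-form solution, so a root finder is needed. The sketch below is one possible approach, assuming a bisection search and a hand-written digamma function (recurrence plus the standard asymptotic series, since the Python standard library does not provide one). The function names, search bracket, and iteration count are illustrative assumptions.

```python
import math


def digamma(x):
    """psi(x) for x > 0: recurrence psi(x) = psi(x + 1) - 1/x, then the
    asymptotic series once the argument is at least 6."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x
        x += 1.0
    inv2 = 1.0 / (x * x)
    # psi(x) ~ ln x - 1/(2x) - 1/(12x^2) + 1/(120x^4) - 1/(252x^6)
    result += (math.log(x) - 0.5 / x
               - inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252)))
    return result


def solve_shape(s, lo=1e-6, hi=1e6, iterations=200):
    """Solve ln(alpha) - psi(alpha) = s for alpha = mu + 1 by bisection.

    The left-hand side decreases in alpha, so a value above s means
    alpha is still too small.
    """
    for _ in range(iterations):
        mid = math.sqrt(lo * hi)  # geometric midpoint of the bracket
        if math.log(mid) - digamma(mid) > s:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)


def ml_estimates(drops):
    """Return (mu_hat, lam_hat) from Equation (3) and the rate formula."""
    n = len(drops)
    mean = sum(drops) / n
    mean_log = sum(math.log(d) for d in drops) / n
    s = math.log(mean) - mean_log  # ln(arithmetic mean / geometric mean)
    if s <= 0:
        raise ValueError("identical drop sizes: ML estimate undefined")
    mu_hat = solve_shape(s) - 1.0
    return mu_hat, (mu_hat + 1.0) / mean
```

A simple consistency check is to feed `solve_shape` the value ln(3) − ψ(3); it should then return a shape argument close to 3, that is, μ close to 2.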

3. Large Sample Behavior of the Estimates

For large sample sizes, each of the estimates in Table 1 is approximately normal and unbiased, with variance as given in Table 2. We used the delta method, described in detail in Appendix B, to establish these results for all but the maximum likelihood procedure, whose large sample results are given, for example, in [10]. It is not surprising that the delta method conclusions require large sample sizes as the delta method may be viewed as a multivariate version of the central limit theorem. For small samples, biases will occur and the variances will be larger than those stated in Table 2. To exemplify our large sample results more precisely, consider the method of moments estimate of, say, the shape parameter μ using moments 2, 3, and 4, from Table 1. Here,

{\hat{μ}}_{MM234} = \frac{4 - 3 a}{a - 1}, where a = \frac{m_{2} m_{4}}{m_{3}^{2}}

If E ({\hat{μ}}_{MM234}) denotes the expected value or mean of the estimated shape parameter {\hat{μ}}_{MM234} and Var ({\hat{μ}}_{MM234}) denotes its variance, the claim is that

E ({\hat{μ}}_{MM234}) ≅ μ

and

Var ({\hat{μ}}_{MM234}) ≅ \frac{1}{n} \cdot \frac{2 (μ + 3) (μ + 4) (μ^{2} + 19 μ + 72)}{(μ + 1) (μ + 2)}

where μ is the true value of the shape parameter in the RSD. The ≅ symbol indicates that the ratio of the left- and right-hand sides approaches one as the sample size increases. At the end of this section, more will be said about the sample size n needed for the large sample approximations in Table 2 to hold closely. For now, we mention that a sample size of about 2000 suffices for several of the estimates.
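To make the MM234 example concrete, the sketch below computes the estimates from a list of drop diameters using the shape estimate (4 − 3a)/(a − 1), the form for which the population value a = (μ + 4)/(μ + 3) returns μ exactly (see Appendix B). The rate estimate is written as (μ̂ + 3) m₂/m₃, which follows from the ratio of the third to the second gamma moment; that specific form is an assumption here, because the rows of Table 1 are not reproduced in this text. The function names are hypothetical.

```python
def mm234_estimates(drops):
    """Method of moments estimates of (mu, lam) from moments 2, 3, and 4.

    Fails with ZeroDivisionError when all drop sizes are identical
    (a = 1), the pathological case noted in the text.
    """
    n = len(drops)
    m2 = sum(d ** 2 for d in drops) / n
    m3 = sum(d ** 3 for d in drops) / n
    m4 = sum(d ** 4 for d in drops) / n
    a = m2 * m4 / m3 ** 2
    mu_hat = (4 - 3 * a) / (a - 1)
    # Assumed rate estimate: E(D^3) / E(D^2) = (mu + 3) / lam.
    lam_hat = (mu_hat + 3) * m2 / m3
    return mu_hat, lam_hat


def mm234_shape_variance(mu, n):
    """Large-sample variance of the MM234 shape estimate stated above."""
    return (2 * (mu + 3) * (mu + 4) * (mu ** 2 + 19 * mu + 72)
            / ((mu + 1) * (mu + 2)) / n)
```

For instance, with μ = 2 and n = 2000 the variance expression evaluates to 570/2000 = 0.285, a standard deviation of roughly 0.53 for the shape estimate.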

As stated in the Introduction, when choosing between several estimates, each of which is unbiased, we will generally prefer estimates with smaller variance (or smaller standard deviation, the square root of the variance). As the variance expressions in Table 2 are fairly complicated, it is best to compare the variabilities graphically.

We started by examining the estimates of the shape parameter μ. The estimate standard deviations which follow from Table 2 (by taking square roots) are all of the form 1 / \sqrt{n} multiplied by a function of (just) the shape parameter μ. In Figure 1, we take the ordinates to be \sqrt{n} multiplied by the standard deviation to compare the large sample variabilities of all of the estimates listed in Table 1. To clarify, the curve displayed in Figure 1 associated with the MM123 estimate of the shape parameter, for example, has ordinates

\sqrt{2 (μ + 2) (μ + 3) (μ + 6) / (μ + 1)} .

From Figure 1, we observe the moment estimates of the shape parameter incorporating the higher-order moments are least desirable as they have the highest variabilities, and the Ye/Chen and maximum likelihood estimates are most desirable as they have the lowest variabilities. The MM012 method has the smallest variability among the method of moments estimates and, in fact, compares favorably with the Ye/Chen and maximum likelihood methods.
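The ordinates plotted in Figure 1 can be tabulated directly from the variance expressions that appear in this text. The sketch below covers only the four method of moments procedures whose shape variances are written out here (MM123 above, MM234 in Section 3, and MM246 and MM346 in Appendix A); the function name and the string keys are illustrative.

```python
import math


def scaled_sd_shape(method, mu):
    """sqrt(n) times the large-sample standard deviation of mu_hat."""
    if method == "MM123":
        v = 2 * (mu + 2) * (mu + 3) * (mu + 6) / (mu + 1)
    elif method == "MM234":
        v = (2 * (mu + 3) * (mu + 4) * (mu ** 2 + 19 * mu + 72)
             / ((mu + 1) * (mu + 2)))
    elif method == "MM246":
        v = (2 * (mu + 3) * (mu + 4) * (mu + 5) * (mu + 6) * (2 * mu + 15)
             * (2 * mu ** 3 + 75 * mu ** 2 + 739 * mu + 2196)
             / ((mu + 1) * (mu + 2) * (2 * mu ** 2 + 18 * mu + 39) ** 2))
    elif method == "MM346":
        v = (6 * (mu + 4) * (mu + 5) * (mu + 6)
             * (3 * mu ** 4 + 154 * mu ** 3 + 2433 * mu ** 2
                + 15482 * mu + 34800)
             / ((mu + 1) * (mu + 2) * (mu + 3) * (3 * mu + 16) ** 2))
    else:
        raise ValueError("unknown method: " + method)
    return math.sqrt(v)
```

Looping over `("MM123", "MM234", "MM246", "MM346")` at a fixed μ gives a quick numerical view of how the variability grows once higher-order moments enter the estimate.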

The Ye/Chen and maximum likelihood curves are practically identical and appear as a single curve in Figure 1. Consequently, only the single key entry of “ML/YC” is given in the Figure 1 legend. No attempt is made here to analytically quantify how close these two standard deviation curves are, but this is largely a consequence of

{[x ψ' (x)]}^{2} ≅ 1 + ψ' (x)

being a near identity over much of the region x > - 1 .
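The near identity can be probed numerically. The sketch below assumes a hand-written trigamma function (recurrence plus the standard asymptotic series, as the Python standard library has none) and reports the ratio of the two sides at positive arguments; the function names are hypothetical.

```python
def trigamma(x):
    """psi'(x) for x > 0: recurrence psi'(x) = psi'(x + 1) + 1/x^2, then
    the asymptotic series once the argument is at least 6."""
    result = 0.0
    while x < 6.0:
        result += 1.0 / (x * x)
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    # psi'(x) ~ 1/x + 1/(2x^2) + 1/(6x^3) - 1/(30x^5) + 1/(42x^7) - 1/(30x^9)
    result += (inv + 0.5 * inv2
               + inv * inv2 * (1.0 / 6
                               - inv2 * (1.0 / 30
                                         - inv2 * (1.0 / 42 - inv2 / 30))))
    return result


def near_identity_ratio(x):
    """Ratio [x psi'(x)]^2 / (1 + psi'(x)); values near one support the claim."""
    t = trigamma(x)
    return (x * t) ** 2 / (1.0 + t)
```

Evaluating `near_identity_ratio` at a few points such as 1, 3, and 10 shows the two sides agreeing to within a few percent, with the agreement tightening as the argument grows.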

Now we turn to the estimates of the rate parameter λ. It will again be easiest to compare estimates graphically. All estimate standard deviations that follow from Table 2 (by taking square roots) are of the form λ / \sqrt{n} multiplied by a function of (just) the shape parameter μ. In Figure 2, we take the ordinates to be \sqrt{n} / λ multiplied by the standard deviation to compare the large sample variabilities of all of the estimates listed in Table 1. To clarify, the curve displayed in Figure 2 associated with the MM123 estimate of the rate parameter, for example, has ordinates

\sqrt{\frac{2 μ^{2} + 23 μ + 52}{(μ + 1) (μ + 2)}} .

As with the shape parameter, we see from Figure 2 that the moment estimates of the rate parameter incorporating the higher-order moments are least desirable as they have the largest variabilities, and the Ye/Chen and maximum likelihood estimates are the most desirable as they have the smallest variabilities. The MM012 method has the smallest variability among the method of moments estimates and compares favorably with the Ye/Chen and maximum likelihood methods.

The large sample variabilities given in Table 2 are attained, in a limiting sense, as the sample size grows. A natural question, then, is how large the sample size must be for these variabilities to hold to good approximation. A careful analysis would involve looking at both of the estimates \hat{μ} and \hat{λ} across each of the seven different estimation procedures, with the behavior undoubtedly depending on the true parameter values of μ and λ. For our purposes here, we only very briefly discuss how empirical values of the standard deviations, obtained through simulated observations from a gamma RSD, match up to values from Table 2 for large samples in two cases: μ = 2, λ = 5 and μ = 5, λ = 13 (essentially the values used in [22]).

For a sample size of 2000 and both pairs of shape and rate parameter values just stated, we found the relative errors in the standard deviations for the MM012, MM123, ML, and Ye/Chen \hat{μ} and \hat{λ} estimates (but not the MM234, MM246, and MM346 estimates) to vary from about 1% to 3%. For the MM234 estimates with a sample size of 2000, the relative errors for the standard deviations of the shape and rate parameters were each about 1% in the case μ = 5, λ = 13, but each about 10% in the case μ = 2, λ = 5. Sample sizes larger than 2000 are needed for the variabilities listed for MM246 and MM346 in Table 2 to be reasonably accurate.

Upon increasing the sample size to 10,000, the relative errors for the standard deviations of the MM246 and MM346 shape and rate estimates were each about 5% in the case μ = 5, λ = 13 and each about 15% in the case μ = 2, λ = 5. Upon increasing the sample size to 100,000, these relative errors decreased to at most 0.8% in the case μ = 5, λ = 13 and to 4% to 5% in the case μ = 2, λ = 5 for both of these method of moments procedures.
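A comparison of this kind can be sketched with a small Monte Carlo experiment. The code below is not the simulation used in the article; it is a minimal illustration, assuming the MM234 shape estimate (4 − 3a)/(a − 1) and its large-sample variance from Section 3, with a hypothetical function name, replication count, and seed.

```python
import math
import random


def mm234_shape(drops):
    """MM234 shape estimate (4 - 3a)/(a - 1) with a = m2 * m4 / m3^2."""
    n = len(drops)
    m2 = sum(d ** 2 for d in drops) / n
    m3 = sum(d ** 3 for d in drops) / n
    m4 = sum(d ** 4 for d in drops) / n
    a = m2 * m4 / m3 ** 2
    return (4 - 3 * a) / (a - 1)


def empirical_vs_theory(mu, lam, n, reps, seed=0):
    """Return (empirical SD, large-sample SD) of the MM234 shape estimate."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(reps):
        # gammavariate uses shape mu + 1 and scale 1 / lam.
        drops = [rng.gammavariate(mu + 1, 1.0 / lam) for _ in range(n)]
        estimates.append(mm234_shape(drops))
    mean = sum(estimates) / reps
    empirical_sd = math.sqrt(
        sum((e - mean) ** 2 for e in estimates) / (reps - 1))
    theory_sd = math.sqrt(
        2 * (mu + 3) * (mu + 4) * (mu ** 2 + 19 * mu + 72)
        / ((mu + 1) * (mu + 2)) / n)
    return empirical_sd, theory_sd
```

A call such as `empirical_vs_theory(5, 13, 2000, 200)` returns the two standard deviations side by side. With only a few hundred replications the empirical value itself carries sampling noise of several percent, so many more replications would be needed to resolve relative errors as small as those quoted above.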

In practice, raindrop size samples are likely to be observed under stable conditions, corresponding to having fixed μ and λ parameter values, only over rather short time intervals, perhaps on the order of minutes. Multiple disdrometers would then be needed to collect sample sizes as large as, say, 2000 drops. For more moderate sample sizes, the variabilities listed in Table 2 and displayed in Figure 1 and Figure 2 should not, of course, be fully trusted. However, the ranking of estimates according to variability given in Table 2 and Figure 1 and Figure 2 should persist to some degree as the sample size decreases and can be investigated through simulation. Such simulation for less than large sample sizes, however, is beyond the scope of this article.

4. Discussion and Conclusions

We used the delta method, a result known in the engineering and statistical literature, as a viable approach in deciding which of several gamma parameter estimates may be used when raindrop sizes can be accurately measured over the full spectrum of size. With the aid of the delta method, the estimates of the gamma shape and rate parameters listed in Table 1 were found, for large sample sizes, to be normal and unbiased with variances given in Table 2. Given several unbiased estimates, it is natural to prefer estimates with small variance.

If a method of moments procedure is to be used, we determined that, to reduce estimate variance, gamma parameter estimates based on the smallest order moments feasible should be preferred. We say ‘feasible’ as, for example, the lowest order moments might not always be reliable. Wind, for instance, can affect the reliability of the first moment. For some discussion related to this see, for example, [3,26,27].

We determined the maximum likelihood and pseudo maximum likelihood estimates of Table 1 have the smallest variances among the seven estimation procedures examined when estimating the gamma shape and rate parameters, but the MM012 method of moments procedure is very nearly as good in terms of variance.

When using instrumentation that does not accurately measure drop size over the full spectrum of drop size, then the variance results of Table 2 only hold approximately for large sample sizes. To compare gamma parameter estimates in this situation (or in the case of small sample size), samples of drop sizes from a gamma with known values of the shape and rate parameter may be randomly generated. Using these randomly generated samples, estimates may be compared in terms of their bias and variance. Such comparisons, for truncated and binned disdrometer data, were performed in [22] where we also suggested that the smallest order moments feasible be used when estimating gamma shape and rate parameters by method of moments.

Author Contributions

Conceptualization and Validation, R.W.J. and D.V.K.; Methodology, Software, and Formal Analysis, R.W.J.; Writing—original draft preparation, R.W.J.; Writing—Review & Editing, D.V.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

We dedicate this article in honor and in memory of our friend and colleague, Paul Smith. Discussions with Paul helped lead to the development of this article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Additional Table 2 Entries

Here are the additional entries that did not easily fit into Table 2:

Var ({\hat{μ}}_{MM246}) = \frac{1}{n} \cdot \frac{2 (μ + 3) (μ + 4) (μ + 5) (μ + 6) (2 μ + 15) (2 μ^{3} + 75 μ^{2} + 739 μ + 2196)}{(μ + 1) (μ + 2) {(2 μ^{2} + 18 μ + 39)}^{2}}

Var ({\hat{λ}}_{MM246}) = \frac{λ^{2}}{n} \cdot \frac{16 μ^{8} + 1096 μ^{7} + 28612 μ^{6} + 393920 μ^{5} + 3207215 μ^{4} + 16016320 μ^{3} + 48257292 μ^{2} + 80562249 μ + 57222072}{2 (μ + 1) (μ + 2) (μ + 3) (μ + 4) {(2 μ^{2} + 18 μ + 39)}^{2}}

Var ({\hat{μ}}_{MM346}) = \frac{1}{n} \cdot \frac{6 (μ + 4) (μ + 5) (μ + 6) (3 μ^{4} + 154 μ^{3} + 2433 μ^{2} + 15482 μ + 34800)}{(μ + 1) (μ + 2) (μ + 3) {(3 μ + 16)}^{2}}

Var ({\hat{λ}}_{MM346}) = \frac{λ^{2}}{n} \cdot \frac{(μ + 5) (μ + 6) (18 μ^{4} + 1041 μ^{3} + 17250 μ^{2} + 111820 μ + 252064)}{(μ + 1) (μ + 2) (μ + 3) (μ + 4) {(3 μ + 16)}^{2}}

Appendix B. Large Sample Calculations and the Delta Method

The large sample results for the maximum likelihood estimation given in Table 2 appear elsewhere in the literature (see, for example, [28] and [29] (Equations (38) and (55))). The remaining large sample results in Table 2 all follow from the technique called the delta method [17,18,19]. The delta method details for the Ye and Chen estimates appear in [14]. As the remaining estimates in Table 1 are all method of moments estimates, and the delta method is perhaps not well-known, it seems appropriate to illustrate this technique for one of the method of moments procedures in Table 1. We did so for the MM234 estimates. We chose this particular combination of moments in large part because the calculations are a bit easier and the process is more transparent. With this example, it will be clear, in principle, how to compute the large sample properties for the other method of moments estimates in Table 1.

Before doing so, we mention what the delta method amounts to in the one-dimensional case. Readers may recognize this under the phrase “propagation of errors.” Letting θ = E (D) denote the expected value or mean drop size, we have

g (\bar{D}) ≅ g (θ) + g' (θ) (\bar{D} - θ)

where g is a function of a single variable and g' denotes its derivative. We conclude the normality of g (\bar{D}) as a consequence of that of \bar{D} (technically, use the fact that the right-hand side above is an affine transformation of \bar{D}) and that g (\bar{D}) has mean

E [g (θ) + g' (θ) (\bar{D} - θ)] = g (θ) + g' (θ) E [(\bar{D} - θ)] = g (θ) + g' (θ) \cdot 0 = g (θ)

Furthermore, if the variance of the drop sizes is represented as σ^{2} = Var (D), then g (\bar{D}) has variance

Var [g (θ) + g' (θ) (\bar{D} - θ)] = Var [g' (θ) (\bar{D} - θ)] = {[g' (θ)]}^{2} \cdot Var (\bar{D} - θ) = {[g' (θ)]}^{2} \cdot Var (\bar{D}) = {[g' (θ)]}^{2} σ^{2} / n

To summarize the expected value and variance calculations above, we have

E (g (\bar{D})) ≅ g (E (D)) and Var (g (\bar{D})) ≅ {[g' (E (D))]}^{2} \cdot Var (\bar{D})

with the large sample normality of g (\bar{D}) following from that of \bar{D} by the central limit theorem.
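The one-dimensional recipe is short enough to write as code. The sketch below applies it to the illustrative choice g = ln for gamma drop sizes, where the mean is (μ + 1)/λ and the variance is (μ + 1)/λ²; the function names and the choice of g are assumptions made for this example only.

```python
import math


def gamma_mean_var(mu, lam):
    """Mean and variance of a drop size with the gamma density (2)."""
    return (mu + 1) / lam, (mu + 1) / lam ** 2


def delta_method_1d(g, g_prime, theta, sigma2, n):
    """Large-sample mean and variance of g(D_bar).

    theta is E(D), sigma2 is Var(D), and n is the sample size, so the
    returned variance is [g'(theta)]^2 * sigma2 / n.
    """
    return g(theta), g_prime(theta) ** 2 * sigma2 / n
```

With `theta, sigma2 = gamma_mean_var(2, 5)` and `delta_method_1d(math.log, lambda x: 1.0 / x, theta, sigma2, 2000)`, the variance simplifies to 1/(n(μ + 1)), which does not depend on λ.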

The general delta method deals with a function of several arguments, each of which is approximately normal. The large sample mean and variance expressions of the delta method are generalizations of the example of propagation of errors above. The large sample normality depends on the fact that each of the sample moments m_{j} is approximately normal, which is a consequence of the central limit theorem.

For the MM234 method of moments estimates, the general delta method concludes for large samples that the gamma parameter estimates given by

[\begin{matrix} {\hat{μ}}_{234} \\ {\hat{λ}}_{234} \end{matrix}] \equiv [\begin{matrix} g_{1} (m_{2}, m_{3}, m_{4}) \\ g_{2} (m_{2}, m_{3}, m_{4}) \end{matrix}]

(we drop the MM on the 234 subscript) are approximately normal with corresponding mean

[\begin{matrix} E ({\hat{μ}}_{234}) \\ E ({\hat{λ}}_{234}) \end{matrix}] ≅ [\begin{matrix} g_{1} (E (D^{2}), E (D^{3}), E (D^{4})) \\ g_{2} (E (D^{2}), E (D^{3}), E (D^{4})) \end{matrix}]    (4)

and with covariance matrix for {\hat{μ}}_{234}, {\hat{λ}}_{234} given by the expression

\frac{1}{n} A Σ A^{t}

where A is the Jacobian matrix

A = {[\begin{matrix} \frac{\partial g_{1}}{\partial m_{2}} & \frac{\partial g_{1}}{\partial m_{3}} & \frac{\partial g_{1}}{\partial m_{4}} \\ \frac{\partial g_{2}}{\partial m_{2}} & \frac{\partial g_{2}}{\partial m_{3}} & \frac{\partial g_{2}}{\partial m_{4}} \end{matrix}] |}_{(m_{2}, m_{3}, m_{4}) = (E (D^{2}), E (D^{3}), E (D^{4}))}    (5)

and Σ is the covariance matrix of the moments used

Σ = [\begin{matrix} Cov (D^{2}, D^{2}) & Cov (D^{2}, D^{3}) & Cov (D^{2}, D^{4}) \\ Cov (D^{3}, D^{2}) & Cov (D^{3}, D^{3}) & Cov (D^{3}, D^{4}) \\ Cov (D^{4}, D^{2}) & Cov (D^{4}, D^{3}) & Cov (D^{4}, D^{4}) \end{matrix}]    (6)

The full calculation is best carried out using a computer algebra system that includes basic matrix computation (we used Maple 18 [30]), but we provide a few details showing how the computation proceeds.

To carry out the above calculations, we require the population moments of raindrop size for our gamma distribution setting. Using the gamma density in Equation (2), these are given by

E (D^{k}) = \int_{0}^{\infty} D^{k} f (D) d D = \frac{(μ + 1) (μ + 2) \dots (μ + k)}{λ^{k}}

for positive integer values of k. To determine, for example, the large sample mean of {\hat{μ}}_{234} (the top entry in Equation (4)) we evaluate, from Table 1,

{\hat{μ}}_{234} = \frac{4 - 3 a}{a - 1} where a = \frac{m_{2} m_{4}}{m_{3}^{2}}

at (m_{2}, m_{3}, m_{4}) = (E (D^{2}), E (D^{3}), E (D^{4})). Straightforward calculation shows that a evaluates to (μ + 4) / (μ + 3), giving (4 - 3 a) / (a - 1) = μ. That is,

E ({\hat{μ}}_{234}) ≅ μ .
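The "straightforward calculation" can be checked exactly with rational arithmetic. The sketch below, with hypothetical function names, builds the population moments from the product formula above and confirms that a reduces to (μ + 4)/(μ + 3) and that (4 − 3a)/(a − 1) returns μ.

```python
from fractions import Fraction


def population_moment(k, mu, lam):
    """E(D^k) = (mu + 1)(mu + 2)...(mu + k) / lam^k as an exact fraction."""
    value = Fraction(1)
    for j in range(1, k + 1):
        value *= (Fraction(mu) + j) / Fraction(lam)
    return value


def mm234_shape_at_population(mu, lam):
    """Return (a, shape) with the sample moments replaced by E(D^k)."""
    e2 = population_moment(2, mu, lam)
    e3 = population_moment(3, mu, lam)
    e4 = population_moment(4, mu, lam)
    a = e2 * e4 / e3 ** 2
    return a, (4 - 3 * a) / (a - 1)
```

For μ = 2 and λ = 5, `mm234_shape_at_population(2, 5)` gives a = 6/5 and a shape value of exactly 2; the rate parameter cancels out of a, as expected.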

We now turn to the covariance matrix for our estimates. Starting with the A matrix, we illustrate the calculation of the 1,3 element (i.e., the entry in the first row and third column of the A matrix in Equation (5)). Note that

\frac{\partial g_{1}}{\partial m_{4}} = \frac{\partial}{\partial m_{4}} [\frac{4 - 3 a}{a - 1}] = \frac{\partial}{\partial m_{4}} [\frac{4 - 3 m_{2} m_{4} / m_{3}^{2}}{m_{2} m_{4} / m_{3}^{2} - 1}] = - \frac{m_{2} m_{3}^{2}}{{[m_{2} m_{4} - m_{3}^{2}]}^{2}}

Evaluating this at

\begin{aligned} (m_{2}, m_{3}, m_{4}) & = (E (D^{2}), E (D^{3}), E (D^{4})) \\ & = (\frac{(μ + 1) (μ + 2)}{λ^{2}}, \frac{(μ + 1) (μ + 2) (μ + 3)}{λ^{3}}, \frac{(μ + 1) (μ + 2) (μ + 3) (μ + 4)}{λ^{4}}) \end{aligned}

and simplifying gives

\frac{- λ^{4}}{(μ + 1) (μ + 2)} .

Additionally, to illustrate the calculation of an entry in the covariance matrix Σ of the moments, the 1,2 element in Equation (6) is

\begin{aligned} Cov (D^{2}, D^{3}) & = E (D^{2} D^{3}) - E (D^{2}) E (D^{3}) \\ & = E (D^{5}) - E (D^{2}) E (D^{3}) \\ & = \frac{(μ + 1) (μ + 2) (μ + 3) (μ + 4) (μ + 5)}{λ^{5}} - \frac{(μ + 1) (μ + 2)}{λ^{2}} \cdot \frac{(μ + 1) (μ + 2) (μ + 3)}{λ^{3}} \\ & = \frac{6 (μ + 1) (μ + 2) {(μ + 3)}^{2}}{λ^{5}} \end{aligned}

By symmetry, this is also the 2,1 element, Cov (D^{3}, D^{2}) = E (D^{3} D^{2}) - E (D^{3}) E (D^{2}), of the Σ matrix.

After a considerable amount of calculation (again, with the help of Maple [30]), we found the large sample covariance matrix of {\hat{μ}}_{234}, {\hat{λ}}_{234} to be given by

\frac{1}{n} A Σ A^{t} = \frac{1}{n} [\begin{matrix} \frac{2 (μ + 3) (μ + 4) (μ^{2} + 19 μ + 72)}{(μ + 1) (μ + 2)} & \frac{2 λ (μ + 4) (μ^{2} + 21 μ + 84)}{(μ + 1) (μ + 2)} \\ \frac{2 λ (μ + 4) (μ^{2} + 21 μ + 84)}{(μ + 1) (μ + 2)} & \frac{λ^{2} (μ + 4) (2 μ^{2} + 47 μ + 201)}{(μ + 1) (μ + 2) (μ + 3)} \end{matrix}]

The large sample variances of {\hat{μ}}_{234} and {\hat{λ}}_{234} appear along the diagonal (the 1,1 and 2,2 elements, respectively).
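The matrix product can also be reproduced without a computer algebra system for specific parameter values. The sketch below is an exact-arithmetic check, not the Maple computation of the article. It assumes the rate estimate g₂ = m₂ / (m₃ (a − 1)), which is algebraically the same as (μ̂ + 3) m₂ / m₃ but is an assumed form, since the rows of Table 1 are not reproduced in this text. Function names are hypothetical.

```python
from fractions import Fraction


def moment(k, mu, lam):
    """E(D^k) for the gamma density (2), as an exact fraction."""
    value = Fraction(1)
    for j in range(1, k + 1):
        value *= (mu + j) / lam
    return value


def mm234_covariance(mu, lam):
    """n times the large-sample covariance matrix of (mu_hat, lam_hat)."""
    mu, lam = Fraction(mu), Fraction(lam)
    orders = (2, 3, 4)
    e = {k: moment(k, mu, lam) for k in range(2, 9)}
    m2, m3, m4 = e[2], e[3], e[4]
    a = m2 * m4 / m3 ** 2
    # Partial derivatives of a = m2 * m4 / m3^2 with respect to m2, m3, m4.
    da = (a / m2, -2 * a / m3, a / m4)
    # g1 = (4 - 3a)/(a - 1), so dg1/da = -1/(a - 1)^2 (first row of A).
    row1 = [-d / (a - 1) ** 2 for d in da]
    # Assumed g2 = m2 / (m3 (a - 1)); differentiate via its logarithm.
    g2 = m2 / (m3 * (a - 1))
    direct = (1 / m2, -1 / m3, Fraction(0))
    row2 = [g2 * (direct[i] - da[i] / (a - 1)) for i in range(3)]
    jac = [row1, row2]
    # Sigma entries: Cov(D^r, D^s) = E(D^(r+s)) - E(D^r) E(D^s).
    sigma = [[e[r + s] - e[r] * e[s] for s in orders] for r in orders]
    return [[sum(jac[i][r] * sigma[r][s] * jac[j][s]
                 for r in range(3) for s in range(3))
             for j in range(2)] for i in range(2)]
```

For μ = 2 and λ = 5 this returns the entries 570, 650, 650, and 1515/2, which agree with the closed-form matrix above evaluated at the same parameter values.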

Tangential to the focus of this article, we can also determine the large sample (Pearson) correlation coefficient between {\hat{μ}}_{234} and {\hat{λ}}_{234} using, by definition,

ρ = \frac{Cov (\hat{μ}, \hat{λ})}{\sqrt{Var (\hat{μ}) Var (\hat{λ})}}

(the 1,2 or 2,1 element divided by the square root of the product of the diagonal elements). After some calculation, we found the large sample squared value of the correlation coefficient to be given by

ρ^{2} = \frac{2 {(μ^{2} + 21 μ + 84)}^{2}}{(2 μ^{2} + 47 μ + 201) (μ^{2} + 19 μ + 72)} = 1 - \frac{(μ + 5) (μ^{2} + 15 μ + 72)}{(2 μ^{2} + 47 μ + 201) (μ^{2} + 19 μ + 72)}

indicating, in general, very high correlation values between the shape and rate estimates (e.g., ρ above 0.987 for μ larger than zero, regardless of the value of λ).
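The two expressions for ρ² are easy to compare numerically. The sketch below, with hypothetical function names, evaluates both forms and takes the positive square root, which is appropriate here because the covariance entry is positive for λ > 0.

```python
import math


def mm234_correlation(mu):
    """Large-sample correlation between the MM234 shape and rate estimates."""
    num = 2 * (mu ** 2 + 21 * mu + 84) ** 2
    den = (2 * mu ** 2 + 47 * mu + 201) * (mu ** 2 + 19 * mu + 72)
    return math.sqrt(num / den)


def mm234_correlation_alt(mu):
    """Same quantity from the '1 minus a remainder' form of rho squared."""
    den = (2 * mu ** 2 + 47 * mu + 201) * (mu ** 2 + 19 * mu + 72)
    return math.sqrt(1 - (mu + 5) * (mu ** 2 + 15 * mu + 72) / den)
```

Evaluating `mm234_correlation(0)` gives a value just above 0.987, consistent with the bound quoted in the text.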

References

1. Smith, J.A. Marked point process model of raindrop-size distributions. J. Appl. Meteorol. 1993, 32, 284–296.
2. Ulbrich, C.W.; Atlas, D. Rainfall microphysics and radar properties: Analysis methods for drop size spectra. J. Appl. Meteorol. 1998, 37, 912–923.
3. Vivekanandan, J.; Zhang, G.; Brandes, E. Polarimetric radar estimators based on a constrained gamma drop size distribution model. J. Appl. Meteorol. 2004, 43, 217–230.
4. Ulbrich, C.W. Natural variation in the analytical form of the raindrop size distribution. J. Appl. Meteorol. Climatol. 1983, 22, 1764–1775.
5. Kozu, T.; Nakamura, K. Rainfall parameter estimation from dual-radar measurements combining reflectivity profile and path-integrated attenuation. J. Atmos. Ocean. Technol. 1991, 8, 259–270.
6. Tokay, A.; Short, D.A. Evidence from tropical raindrop spectra of the origin of rain from stratiform versus convective clouds. J. Appl. Meteorol. 1996, 35, 355–371.
7. Brawn, D.; Upton, G. On the measurement of atmospheric gamma drop-size distributions. Atmos. Sci. Let. 2008, 9, 245–247.
8. Johnson, R.W.; Kliche, D.V.; Smith, P.L. Modeling raindrop size. J. Stat. Educ. 2015, 23.
9. Smith, P.L. Raindrop size distributions: exponential or gamma–Does the difference matter? J. Appl. Meteorol. 2003, 42, 1031–1034.
10. Rice, J. Mathematical Statistics and Data Analysis, 2nd ed.; Wadsworth, Inc.: Belmont, CA, USA, 1995; pp. 261–265.
11. Mallet, C.; Barthes, L. Estimation of gamma raindrop size distribution parameters: Statistical fluctuations and estimation errors. J. Atmos. Ocean. Technol. 2009, 26, 1572–1584.
12. Kliche, D.V.; Smith, P.L.; Johnson, R.W. L-moment estimators as applied to gamma drop size distributions. J. Appl. Meteorol. Climatol. 2008, 47, 3117–3130.
13. Johnson, R.W.; Kliche, D.V.; Smith, P.L. Comparison of estimators for parameters of gamma distributions with left-truncated samples. J. Appl. Meteorol. Climatol. 2011, 50, 296–310.
14. Ye, Z.; Chen, N. Closed-form estimators for the gamma distribution derived from likelihood equations. Am. Stat. 2017, 71, 177–181.
15. Thurai, M.; Bringi, V.; Gatlin, P.N.; Petersen, W.A.; Wingo, M.T. Measurements and modeling of the full rain drop size distribution. Atmosphere 2019, 10, 39.
16. Thurai, M.; Gatlin, P.; Bringi, V.N.; Petersen, W.; Kennedy, P.; Notaroš, B.; Carey, L. Toward completing the raindrop size spectrum: Case studies involving 2D-video disdrometer, droplet spectrometer, and polarimetric radar measurements. J. Appl. Meteorol. Climatol. 2017, 56, 877–896.
17. Lawless, J.F. Statistical Models and Methods for Lifetime Data, 2nd ed.; John Wiley & Sons, Inc.: New York, NY, USA, 2003; pp. 539–540.
18. Doob, J.L. The limiting distributions of certain statistics. Ann. Math. Stat. 1935, 6, 160–169.
19. Cramér, H. Mathematical Methods of Statistics; Princeton University Press: Princeton, NJ, USA, 1946.
20. Willis, P.T. Functional fits to some observed drop size distributions and parametrization of rain. J. Atmos. Sci. 1984, 41, 1648–1661.
21. Chandrasekar, V.; Bringi, V.N. Simulation of radar reflectivity and surface measurements of rainfall. J. Atmos. Ocean. Technol. 1987, 4, 464–478.
22. Johnson, R.W.; Kliche, D.V.; Smith, P.L. Maximum likelihood estimation of gamma parameters for coarsely binned and truncated raindrop size data. Q. J. R. Meteorol. Soc. 2014, 140, 1245–1256.
23. Olver, F.W.J.; Lozier, D.W.; Boisvert, R.F.; Clark, C.W. (Eds.) NIST Handbook of Mathematical Functions; Cambridge University Press: Cambridge, UK, 2010; pp. 136–144.
24. Abramowitz, M.; Stegun, I.A. (Eds.) Handbook of Mathematical Functions; Dover Publications, Inc.: New York, NY, USA, 1972; p. 11.
25. Bowman, K.O.; Shenton, L.R. Properties of Estimators for the Gamma Distribution; Marcel Dekker, Inc.: New York, NY, USA, 1988; p. 30.
26. Rinehart, R.E. Out-of-level instruments: Errors in hydrometeor spectra and precipitation measurements. J. Appl. Meteorol. Climatol. 1983, 22, 1404–1410.
27. Nešpor, V.; Krajewski, W.F.; Kruger, A. Wind-induced error of raindrop size distribution measurement using a two-dimensional video disdrometer. J. Atmos. Ocean. Technol. 2000, 17, 1483–1492.
28. Brawn, D. Adapted GLM gamma parameter estimates for drop size distributions. Atmos. Sci. Let. 2015, 16, 386–390.
29. Miura, K. An introduction to maximum likelihood estimation and information geometry. Inter. Infor. Sci. 2011, 17, 155–174.
30. Maplesoft, version 2018.2; Waterloo Maple Inc.: Waterloo, Canada, 2018.

Figure 1. Comparison of the large sample shape parameter estimate variabilities.

Figure 2. Comparison of the large sample rate parameter estimate variabilities.

Table 1. Selected gamma shape and rate parameter estimates from the literature. Corresponding references appear in the first column. The estimates in all but the last two rows use an auxiliary variable a, which is defined in the last column. The last two rows do not use an auxiliary variable, so that column is marked with an asterisk.

Approach	Shape estimate $\hat{μ}$	Rate estimate $\hat{λ}$	Auxiliary variable a
MM012 [1]	$\frac{2 - a}{a - 1}$	$\frac{m_{0}}{m_{1}} \frac{1}{a - 1}$	$\frac{m_{0} m_{2}}{m_{1}^{2}}$
MM123	$\frac{3 - 2 a}{a - 1}$	$\frac{m_{1}}{m_{2}} \frac{1}{a - 1}$	$\frac{m_{1} m_{3}}{m_{2}^{2}}$
MM234 [9,12]	$\frac{4 - 3 a}{a - 1}$	$\frac{m_{2}}{m_{3}} \frac{1}{a - 1}$	$\frac{m_{2} m_{4}}{m_{3}^{2}}$
MM246 [2,3]	$\frac{(11 - 7 a) + \sqrt{a^{2} + 14 a + 1}}{2 (a - 1)}$	$\sqrt{\frac{2 m_{2}}{m_{4}} \cdot \frac{2 (a + 1) + \sqrt{a^{2} + 14 a + 1}}{{(a - 1)}^{2}}}$	$\frac{m_{2} m_{6}}{m_{4}^{2}}$
MM346 [4,5,6]	$\frac{(11 - 8 a) + \sqrt{8 a + 1}}{2 (a - 1)}$	$\frac{m_{3}}{m_{4}} \frac{3 + \sqrt{8 a + 1}}{2 (a - 1)}$	$\frac{m_{3}^{2} m_{6}}{m_{4}^{3}}$
Ye/Chen [14]	$- 1 + \frac{\bar{D}}{[\frac{1}{n} \sum_{i = 1}^{n} D_{i} \ln (D_{i}) - \bar{D} \cdot \frac{1}{n} \sum_{i = 1}^{n} \ln (D_{i})]}$	$\frac{1}{[\frac{1}{n} \sum_{i = 1}^{n} D_{i} \ln (D_{i}) - \bar{D} \cdot \frac{1}{n} \sum_{i = 1}^{n} \ln (D_{i})]}$	*
ML [12]	${\hat{μ}}_{M L}$ not expressible in closed form (see Equation (3) in the text)	$\frac{{\hat{μ}}_{M L} + 1}{\bar{D}}$	*
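The closed-form rows of Table 1 can be sketched in a few lines of Python. This is an illustrative sketch rather than the authors' code: the function names are hypothetical, the moments are taken as $m_k = \frac{1}{n} \sum_i D_i^k$ (any common normalization cancels in the ratios used), and the drop diameters are assumed to follow a gamma distribution with shape μ + 1 and rate λ, which is consistent with the ML row $\hat{λ} = ({\hat{μ}}_{M L} + 1)/\bar{D}$. The simulated sample is for demonstration only.

```python
import math
import random


def moment(drops, k):
    """k-th sample moment m_k = (1/n) * sum(D_i ** k)."""
    return sum(d ** k for d in drops) / len(drops)


def mm_consecutive(drops, j):
    """MM estimates from moments j, j+1, j+2 (j = 0, 1, 2: MM012, MM123, MM234)."""
    mj, mj1, mj2 = moment(drops, j), moment(drops, j + 1), moment(drops, j + 2)
    a = mj * mj2 / mj1 ** 2                      # auxiliary variable a
    mu_hat = ((j + 2) - (j + 1) * a) / (a - 1)
    lam_hat = (mj / mj1) / (a - 1)
    return mu_hat, lam_hat


def mm246(drops):
    """MM246 row of Table 1."""
    m2, m4, m6 = (moment(drops, k) for k in (2, 4, 6))
    a = m2 * m6 / m4 ** 2
    root = math.sqrt(a * a + 14 * a + 1)
    mu_hat = ((11 - 7 * a) + root) / (2 * (a - 1))
    lam_hat = math.sqrt((2 * m2 / m4) * (2 * (a + 1) + root) / (a - 1) ** 2)
    return mu_hat, lam_hat


def mm346(drops):
    """MM346 row of Table 1."""
    m3, m4, m6 = (moment(drops, k) for k in (3, 4, 6))
    a = m3 ** 2 * m6 / m4 ** 3
    root = math.sqrt(8 * a + 1)
    mu_hat = ((11 - 8 * a) + root) / (2 * (a - 1))
    lam_hat = (m3 / m4) * (3 + root) / (2 * (a - 1))
    return mu_hat, lam_hat


def ye_chen(drops):
    """Ye/Chen row of Table 1."""
    n = len(drops)
    d_bar = sum(drops) / n
    mean_d_log_d = sum(d * math.log(d) for d in drops) / n
    mean_log_d = sum(math.log(d) for d in drops) / n
    denom = mean_d_log_d - d_bar * mean_log_d
    return -1 + d_bar / denom, 1 / denom


def simulate_drops(n, mu, lam, seed=0):
    """Assumption: diameters are gamma with shape mu + 1 and rate lam."""
    rng = random.Random(seed)
    # random.gammavariate takes (shape, scale); scale = 1 / rate.
    return [rng.gammavariate(mu + 1, 1 / lam) for _ in range(n)]


if __name__ == "__main__":
    drops = simulate_drops(20000, mu=2.0, lam=3.0)
    estimates = {
        "MM012": mm_consecutive(drops, 0),
        "MM123": mm_consecutive(drops, 1),
        "MM234": mm_consecutive(drops, 2),
        "MM246": mm246(drops),
        "MM346": mm346(drops),
        "Ye/Chen": ye_chen(drops),
    }
    for name, (mu_hat, lam_hat) in estimates.items():
        print(f"{name:8s} mu_hat = {mu_hat:7.3f}   lambda_hat = {lam_hat:7.3f}")
```

The ML row is omitted here because ${\hat{μ}}_{M L}$ requires a numerical root-find of Equation (3), which is not reproduced in this part of the article.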

Table 2. Large sample variances of the shape and rate parameter estimates shown in Table 1.

Approach	Shape parameter: $\mathrm{Var}(\hat{μ})$	Rate parameter: $\mathrm{Var}(\hat{λ})$
MM012	$\frac{1}{n} \cdot 2 (μ + 1) (μ + 2)$	$\frac{λ^{2}}{n} \cdot \frac{(2 μ + 5)}{(μ + 1)}$
MM123	$\frac{1}{n} \cdot \frac{2 (μ + 2) (μ + 3) (μ + 6)}{(μ + 1)}$	$\frac{λ^{2}}{n} \cdot \frac{(2 μ^{2} + 23 μ + 52)}{(μ + 1) (μ + 2)}$
MM234 (Appendix B)	$\frac{1}{n} \cdot \frac{2 (μ + 3) (μ + 4) (μ^{2} + 19 μ + 72)}{(μ + 1) (μ + 2)}$	$\frac{λ^{2}}{n} \cdot \frac{(μ + 4) (2 μ^{2} + 47 μ + 201)}{(μ + 1) (μ + 2) (μ + 3)}$
MM246	Complicated; listed in Appendix A
MM346	Complicated; listed in Appendix A
Ye/Chen	$\frac{1}{n} \cdot (μ + 1) [μ + {(μ + 1)}^{2} ψ' (μ + 1)]$	$\frac{λ^{2}}{n} \cdot (1 + (μ + 1) ψ' (μ + 1))$
ML	$\frac{1}{n} \cdot \frac{(μ + 1)}{[(μ + 1) ψ' (μ + 1) - 1]}$	$\frac{λ^{2}}{n} \cdot \frac{ψ' (μ + 1)}{[(μ + 1) ψ' (μ + 1) - 1]}$
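To compare the closed-form rows of Table 2 numerically, the variance expressions can be evaluated directly. The sketch below is an illustration under stated assumptions, not the authors' computation (reference [28] indicates they used Maple). The helper names are hypothetical, and `trigamma` is a simple stdlib-only approximation of ψ′ built from the recurrence ψ′(x) = ψ′(x + 1) + 1/x² and the standard asymptotic series. The functions return n · Var($\hat{μ}$) and n · Var($\hat{λ}$)/λ², so the sample size and λ drop out of the comparison.

```python
import math


def trigamma(x):
    """Approximate psi'(x) for x > 0 (recurrence plus asymptotic series)."""
    if x <= 0:
        raise ValueError("x must be positive")
    total = 0.0
    while x < 6.0:                      # psi'(x) = psi'(x + 1) + 1 / x**2
        total += 1.0 / (x * x)
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    # 1/x + 1/(2x^2) + 1/(6x^3) - 1/(30x^5) + 1/(42x^7) - 1/(30x^9)
    series = inv + inv2 / 2 + inv * inv2 * (
        1 / 6 - inv2 * (1 / 30 - inv2 * (1 / 42 - inv2 / 30)))
    return total + series


def shape_factors(mu):
    """n * Var(mu_hat) for the closed-form rows of Table 2 (mu > -1)."""
    t = trigamma(mu + 1)
    return {
        "MM012": 2 * (mu + 1) * (mu + 2),
        "MM123": 2 * (mu + 2) * (mu + 3) * (mu + 6) / (mu + 1),
        "MM234": 2 * (mu + 3) * (mu + 4) * (mu ** 2 + 19 * mu + 72)
                 / ((mu + 1) * (mu + 2)),
        "Ye/Chen": (mu + 1) * (mu + (mu + 1) ** 2 * t),
        "ML": (mu + 1) / ((mu + 1) * t - 1),
    }


def rate_factors(mu):
    """n * Var(lambda_hat) / lambda**2 for the same rows (mu > -1)."""
    t = trigamma(mu + 1)
    return {
        "MM012": (2 * mu + 5) / (mu + 1),
        "MM123": (2 * mu ** 2 + 23 * mu + 52) / ((mu + 1) * (mu + 2)),
        "MM234": (mu + 4) * (2 * mu ** 2 + 47 * mu + 201)
                 / ((mu + 1) * (mu + 2) * (mu + 3)),
        "Ye/Chen": 1 + (mu + 1) * t,
        "ML": t / ((mu + 1) * t - 1),
    }


if __name__ == "__main__":
    mu = 2.0   # illustrative shape value, not taken from the article
    shape, rate = shape_factors(mu), rate_factors(mu)
    for name in shape:
        print(f"{name:8s} shape variance / ML = {shape[name] / shape['ML']:8.2f}   "
              f"rate variance / ML = {rate[name] / rate['ML']:8.2f}")
```

Ratios above 1 indicate a less precise estimate than ML in the large-sample limit. The MM246 and MM346 rows are left out because their expressions are given only in Appendix A.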

(1) The variance expressions for the Ye/Chen and ML estimates are all well-defined and positive for values of the shape parameter larger than −1. It is known, for example, that $ψ'(x) > 0$ and $x ψ'(x) - 1 > 0$ for $x > 0$ from [25] (Equation (1.46)).
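Both inequalities can also be seen from the series representation of the trigamma function. The short argument below is a standard one and is offered only as a sketch; it is not necessarily the derivation given in [25].

```latex
\psi'(x) \;=\; \sum_{k=0}^{\infty} \frac{1}{(x+k)^{2}} \;>\; 0, \qquad x > 0,
\\[4pt]
\frac{1}{(x+k)^{2}} \;>\; \int_{x+k}^{x+k+1} \frac{dt}{t^{2}}
\quad\Longrightarrow\quad
\psi'(x) \;>\; \int_{x}^{\infty} \frac{dt}{t^{2}} \;=\; \frac{1}{x}
\quad\Longrightarrow\quad
x\,\psi'(x) - 1 \;>\; 0 .
```

Taking x = μ + 1 with μ > −1 shows that the denominators $(μ + 1) ψ' (μ + 1) - 1$ in the ML row of Table 2 are strictly positive.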

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

