1. Introduction
As is well-known, the Poisson distribution is equidispersed, which limits its applicability in situations where overdispersion or underdispersion is present. To overcome this limitation, Conway and Maxwell [
1] introduced a generalization of the Poisson distribution which can exhibit both, the under- and overdispersion. Recently, with a view to extend the scope of applications of some other distributions such as the binomial by Borges et al. [
2] and negative hypergeometric by Roy et al. [
3], there have been heightened interest in generating COM type generalizations of other distributions. For example, Borges et al. [
2] proposed an extension of the binomial distribution called COMP–Binomial distribution and studied many of its properties. Subsequently, Chakraborty and Ong [
4] developed a COM–Poisson type generalization of the negative binomial distribution with many applications. In this paper, we develop a COM type generalization of the hypergeometric distribution, called the COM–Hypergeometric (COM-H). It is formulated in the spirit of the CMPB distribution introduced recently by Borges et al. [
2]. This is a three-parameter extension of the hypergeometric distribution which exhibits both over- and underdispersion, often encountered in count data. This is in contrast to the hypergeometric distribution which is always underdispersed. The COM-H distribution can be used in industrial quality control, reliability analysis, and for capture–recapture sampling in estimating wildlife populations. We compare graphically the probability mass functions (pmf) of the hypergeometric and the COM-H distributions for various choices of parameters to display the flexibility this model provides over the usual hypergeometric distribution. We also investigate the log-concavity and log-convexity of its pmf and use these to characterize its failure rate. We develop statistical inference for its parameters. In particular, we develop the maximum likelihood estimators for the parameters of the COM-H model and investigate some of their properties such as the bias, mean squared error (MSE), and coverage probabilities by a Monte Carlo simulation. We also develop the likelihood ratio test for the shape parameter and investigate its asymptotic power.
The paper is organized as follows. In
Section 2, we introduce the COM-H model by introducing a shape parameter. In
Section 3, we compare the shapes of pmfs of the COM-H and hypergeometric distributions for various choices of the parameters. In
Section 4, we express the probability-generating function (pgf) of the COM-H distribution in terms of the generalized hypergeometric series when the shape parameter is a positive integer. We show that in that case, the COM-H model is a member of the Kemp family of distributions and hence, shares various properties of this general family. In
Section 5, we discuss the relationships between its moments and inverse moments and in
Section 6, we express it as a weighted hypergeometric distribution. In
Section 7, we discuss its asymptotic approximations by CMPB and COM–Poisson (see [
1]) distributions. In
Section 8, we characterize the behavior of its failure rate in terms of the log-concavity and log-convexity of its pmf. In
Section 9, we develop the maximum likelihood estimators for its parameters and investigate their behavior by simulation. In that section, we also develop a likelihood ratio test to test if the underlying model is hypergeometric versus COM-H and investigate its power by simulation.
2. Formulation of COM–Hypergeometric Distribution
Consider a finite population of
N items with a fraction
of defective items. Then, the population has
defective items and
nondefective items. Suppose, we take a random sample of size
n without replacement from the
N items. Then, the number of defective items in the sample, denoted by the random variable
X, has the hypergeometric distribution with the pmf
The mean and variance of this distribution are, respectively,
The hypergeometric distribution is underdispersed since it is readily seen that
. Here, we develop a more flexible version of the hypergeometric distribution which can accommodate both under- and overdispersion. The proposed model, called the COM–hypergeometric (COM-H) distribution, with a shape parameter
, has its pmf as
where
We shall also denote this distribution by COM-H
when the underlying parameters are of interest. For
= 1, we have the usual hypergeometric distribution. The values of
> 1 correspond to underdispersion (the mean is greater than variance) while values of
< 1 represent overdispersion (the variance exceeds the mean) with respect to the hypergeometric distribution. For
→
, the pmf is concentrated at 0 and
n. In
Section 3 below, we present graphical comparisons of the pmf of the COM-H distribution for various choices of
, keeping other parameters fixed (see
Figure 1 and
Figure 2).
3. Graphical Comparison of Hypergeometric and COM-H Distributions
In
Figure 1 and
Figure 2, we present some graphs to compare the pmfs of the hypergeometric and COM-H distributions for different values of
.
Figure 1 compares the graphs for positive integer values of
, while
Figure 2 compares them for negative integer values of
. We observe that as the positive
value increases, there is a reduction in the variance of the distribution as compared to that of the hypergeometric distribution, which happens due to the concentration of probabilities around the mean as
increases. The dispersion in the model increases compared to that of the hypergeometric distribution for large negative values of
, as can be seen in
Figure 2. This happens since the probabilities are more dispersed away from the mean as compared to those of the hypergeometric distribution for large negative values of
.
4. Moments and Probability-Generating Function of COM-H (for Positive Integer )
Here, we express the probability-generating function (pgf) of the COM-H in the form of a generalized hypergeometric function for the case when is a positive integer. This facilitates the derivation of the factorial moments and hence other moments of the distribution. Let us first introduce some functions and notation which are used throughout this paper:
Pochhammer’s symbol or rising factorial:
Generalized hypergeometric function:
Then, the following theorem gives an expression of the pgf of the COM-H distribution in terms of the generalized hypergeometric function.
Theorem 1. The pgf of the COM-H distribution is given by where γ is a positive integer.
Proof. In the pmf of the COM-H distribution in (
2), the denominator can be written in terms of the generalized hypergeometric function as follows:
The pgf of
X, given by
, can then be written as
as required. □
Remark 1. The kth order derivative of is given by (using the formula from Abramowitz and Stegun [5], ): The first and second factorial moments of
X are then obtained by setting
, 2 and
. Thus, we obtain
and
Since , the variance of X can be obtained readily in terms of the hypergeometric series as .
When
, the above results reduce, respectively, to the mean and variance of the hypergeometric distribution. These can be verified as follows:
Using the formula (see Abramowitz and Stegun [
5])
the last term in (
3) can be written as
Therefore, , upon simplification, which is the mean of the hypergeometric distribution.
Since, for
,
using formula (
4) here again, the variance can be simplified to
which is the variance of the hypergeometric distribution.
5. The COM-H as a Member of Kemp’s Family of Distributions when Is an Integer
For the Kemp family of distributions (see Tripathi and Gurland [
6], Johnson et al. [
7], and Kemp and Kemp [
8]), the ratio of successive probabilities is given by
Using this result, the following theorem gives the ratio of successive probabilities of the COM-H distribution.
Theorem 2. The ratio of successive probabilities of the COM-H distribution can be written in the form of the ratio of probabilities of Kemp Type families of distributions.
Proof. The ratio of successive probabilities of the COM-Hypergeometric distribution can be written in the form of Kemp Type
families of distributions as follows:
or
which we can use in the COM-H model with
numerator factors,
, and
denominator factors
.
Hence, the theorem. □
6. COM-H as a Weighted Version of Hypergeometric Distribution
The COM-H model can be expressed as the weighted version of the hypergeometric distribution, with the weighted pmf expressed in the form
where
is the expectation with respect to the distribution
and
is the weight function. In Equation (
5), let X be a standard hypergeometric random variable with probability mass function (pmf)
, where
1 as in Equation (
1).
This weighted distribution concept is similar to that in Kokonendji et al. [
9], for the weighted Poisson distribution (WPD). These authors related the log-convexity and log-concavity property of the weight function to the underdispersion and overdispersion characteristics of the COM–Poisson distribution. For this purpose, they in fact represented the COM–Poisson distribution as a weighted Poisson distribution. They also discussed the concept of pointwise duality between two weighted Poisson distributions and illustrated it with an example of the COM–Poisson and WPD family considered earlier by Castillo and Peŕez-Casany [
10].
Definition 1 (Kokonendji et al. [
9])
. Let and be two positive Poisson weight functions. Then, the two corresponding WPDs are said to be pointwise dual (simply dual) if They show that if w is a positive Poisson weight function and for a given x, if is log-concave, then that WPD is also log-concave. The dual pair of WPDs is said to be closed if one of the distributions is overdispersed, and the other is underdispersed. This property makes the two distributions as a closed dual pair of the WPD family. In a similar vein, the COM-H distribution in Equation (
2) can be written in the form of a weighted hypergeometric distribution as
where the hypergeometric weight function (HWF) is given by
.
We discuss two important results of this weight function. The first result relates the log-concavity (log-convexity) of to the weighted hypergeometric distribution (WHD). The second result is about the duality of the WHD.
Theorem 3. Let w be a positive HWF. If for given x, is log-concave, then the corresponding COM-H, represented as a weighted hypergeometric distribution (WHD), is log-concave as well.
Proof. Let
w be the positive HWF. We can then associate the log-concavity of
, ∀
x, with that of the WHD. Consider
as the
for a given
x, then
We have
which follows, since
For the WHD pmf to be log-concave, we need
Thus, we can state that if the HWF is log-concave, then the corresponding WHD is log-concave. Hence, the theorem. □
We now investigate the duality of the WHD based on the HWF, which has the following practical importance. The dual pair of WHDs is closed with respect to the property that the WHD is exhibiting overdispersion and underdispersion for the pair of weight functions
and
with
respectively. Notice that the hypergeometric distribution can also be considered as a WHD family and it forms a “self-dual” [
9] with the weight function
,
. With this weight function, we can see that
The pointwise duality in the WHD family is displayed in
Figure 3 below.
Figure 3 shows the closed dual pair of WHDs. At
and
, the weight functions give
. When the value of
the WHD is log-concave (underdispersed), and the HWF also shows log-concavity with
. Similarly, for the value of
, the WHD is log-convex (overdispersed), and the HWF also shows log-convexity.
7. Convergence of COM-H to COM–Poisson and COMP–Binomial
As is well known, the hypergeometric distribution approaches the binomial distribution, which in turn approaches the Poisson distribution under certain limiting conditions. In this section, we show that similar relationships hold between the COM-H, COMP–binomial, and COM–Poisson distributions.
Theorem 4. The COM-H distribution (γ,n,,N) approaches the COMP–binomial distribution, when with (n,p) as fixed. Subsequently, the COMP–binomial distribution approaches to the COM–Poisson distribution (λ) when and , and N are large such that > n, with remaining fixed.
Proof. First, let us divide the numerator and the denominator of the COM-H pmf in (
2) by
. Then, the numerator can be written as
When
and
N are large, we have, for a fixed
x,
when
x is small relative to
N. Using a similar argument, we have
Thus, we can approximate the COM-H pmf by the COMP–binomial form as
Similarly, the COM-H distribution can be shown to approach the COM–Poisson distribution when we take
with
,
N large, and
. Thus,
Hence, the theorem. □
8. Failure Rate
The monotonicity of the failure rate plays an important role while modeling failure time in reliability studies. As in, Gupta and Tripathi [
11], we characterize the monotonicity of the failure rate in terms of the log-concavity and log-convexity of the distribution. We use the ratio of two consecutive probabilities to determine the log-concavity and log-convexity of the distribution. Let
then,
The following result characterizes the monotonicity of the failure rate of the COM-H distribution.
Theorem 5. The COM-H distribution is log-concave for positive γ, and thus has a nondecreasing failure rate (IFR) for .
Proof. For the COM-H distribution, we have
and
Now, for positive
, we have
and since
it can be seen that
for
. Thus, we can say that when
, the distribution is log-concave and has a nondecreasing failure rate (IFR). Hence, the theorem. □
Remark 2. For , the COM-H distribution does not show monotonicity for the failure rate.
The behavior of the failure rate function,
(see Xie et al. [
12]), where
, is displayed in
Figure 4. We observe that this model has a nondecreasing failure rate for different positive integer values of
. The failure rate increases when the positive value of
increases but only after a certain value of
x, as can be seen in
Figure 4.
Figure 5 displays graphs of the failure rate function for negative values of
. It can be seen that, in this case, the failure rate does not have a definite pattern such as a DFR. In this case, the failure rate of the COM-H distribution exhibits a bathtub shape.
9. Inference for COM-H Model
In this section, we discuss the maximum likelihood estimation (MLE) of
when
p is known and unknown. The MLE of
N is based on the modality of the pmf of the COM-H distribution, while the final estimate needs to be obtained by the use of numerical methods. Let
be a random sample of size
m from the COM-H distribution. Then, from the pmf in (
2), the likelihood function is given by
and the corresponding log-likelihood is given by
9.1. MLE of (with p Known)
The likelihood equation with respect to
, obtained from (
9), can be solved by the Newton–Raphson iterative method. Specifically, to evaluate
numerically, we begin with an initial value
and update
as
until convergence is attained according to some desired criterion, where
with
and
In the case when both
and
p are unknown, the parameters need to be estimated by maximizing the log-likelihood function in (
9) with respect to both parameters, as detailed later in
Section 9.2.
9.2. Maximum Likelihood Estimation of Both and p
For the estimation of both and p in the COM-H model, we proceed with the maximum likelihood estimation keeping N fixed. We used the optim function from the stats package in R programming language for this purpose.
Asymptotic Confidence Intervals for and p
Next, we investigated the performance of the MLEs of
and
p, which we denoted as
and
, respectively, in terms of their asymptotic bias, standard errors, and coverage probabilities, based on 1000 Monte Carlo simulations when both parameters were unknown. The asymptotic normality of the MLEs was used to construct asymptotic confidence intervals for
and
p. The asymptotic standard errors of
and
(the MLEs of
and
p) were obtained from the observed Fisher information matrix through the Hessian matrix. The square root of the diagonal elements of the inverse of Hessian matrix gave the estimates of the standard errors,
and
, of the MLEs of
and
p, respectively. Thus, the asymptotic
confidence intervals for
and
p were
Table 1 and
Table 2 provide the MLE, s.e., bias, root-mean-square error (RMSE), and coverage probabilities of
and
confidence intervals based on 1000 Monte Carlo simulations. Here, we used
,
and
,
, respectively. The results in these tables reveal that when
has a large negative value
, the bias and the standard error of its estimate are both large. The MLE of
p, on the other hand, behaves well with a small bias and s.e., and with coverage probabilities closer to the chosen nominal levels.
9.3. Likelihood Ratio Test for the Significance of
As mentioned earlier, the COM-H distribution reduces to the ordinary hypergeometric distribution with parameters
and
N when
. Since the shape parameter
controls the under- and overdispersion of the distribution, we can use the likelihood ratio test (LRT) to test the null hypothesis
against the alternative
:
. In this section, we investigate the behavior of the statistical power of this test by means of simulations. Let
be a random sample of size
m from the pmf
, where
, and denote the joint distribution by
. The restricted MLE of
is
(under the null hypothesis) and the unrestricted MLE of
is given by
under the alternative. The log-likelihood function of the COM-H distribution, when
is the parameter of interest, is given by
. The likelihood ratio test of Neyman and Pearson [
13] for testing
is given by
By comparing the log-likelihood function under the null and alternative hypotheses, this test gives evidence whether the deviation of one model from the other is statistically significant. The LR test statistic is
Under
, the asymptotic distribution of LR is that of a chi-squared random variable with one degree of freedom (d.f). We assessed the performance of this test by conducting a power study with Monte Carlo simulations. For this purpose, we chose a significance level
of 5% and
and the sample sizes
m = 100 (small), 500 (moderate), and 1000 (large). The departure from the null hypothesis could be explained by an effect size
, which served as the index of departure. We chose values of
= −0.5, 0.5, 1.2, 1.4, 1.6, and 2 with the corresponding effect sizes
and 1, respectively. The results of the simulation study are presented in
Table 3. We fixed
,
, and took
p =
in
Table 3. The statistical power greatly depended on
m and the effect size. It was observed that the power of the LR test did not vary much for different values of
p. The larger the value of
m and the effect size, the higher the power of the test. Moreover, the introduction of
was useful when the amount of underdispersion in the dataset was different from that of the hypergeometric distribution.
10. Discussion
We introduced a Conway–Maxwell type hypergeometric distribution and its salient characteristics in this paper. In subsequent publications, we will show some of its applications in fault detection for software reliability, feature it in acceptance sampling methods for quality control, and describe it through operating characteristic curves, where the hypergeometric distribution is widely used. Furthermore, we will propose parameter estimation techniques for fitting the COM-H distribution and compare it with current estimation methods.
Author Contributions
Conceptualization, S.R. and R.C.T.; Methodology, S.R. and N.B.; Validation, R.C.T.; Resources, R.C.T.; Writing—original draft, S.R.; Writing—review & editing, R.C.T.; Supervision, R.C.T. and N.B. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
The authors declare that no data is used.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Shmueli, G.; Borle, S.; Minka, P.T.; Kadane, J.B.; Boatwright, P. A useful distribution for fitting discrete data: Revival of the Conway-Maxwell-Poisson distribution. J. R. Stat. Soc. Ser. C 2005, 54, 127–142. [Google Scholar] [CrossRef]
- Borges, P.; Rodrigues, J.; Balakrishnan, N.; Bazan, J. A COM-Poisson type generalization of the binomial distribution and its properties and application. Stat. Probab. Lett. 2014, 87, 158–166. [Google Scholar] [CrossRef]
- Roy, S.; Tripathi, R.C.; Balakrishnan, N. A Conway Maxwell Poisson type generalization of the negative hypergeometric distribution. Commun. Stat. Theory Methods 2019, 49, 2410–2428. [Google Scholar] [CrossRef]
- Chakraborty, S.; Ong, S.H. A COM-Poisson type generalization of the negative binomial distribution. Commun. Stat. Theory Methods 2016, 45, 4117–4135. [Google Scholar] [CrossRef]
- Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions; Dover: New York, NY, USA, 1972. [Google Scholar]
- Tripathi, R.C.; Gurland, J. A general family of discrete distributions with hypergeometric probabilities. J. R. Stat. Soc. Ser. B 1977, 39, 349–356. [Google Scholar] [CrossRef]
- Johnson, N.L.; Kemp, A.W.; Kotz, S. Univariate Discrete Distributions, 3rd ed.; Wiley: New York, NY, USA, 2005. [Google Scholar]
- Kemp, C.D.; Kemp, A.W. Generalized hypergeometric distributions. J. R. Stat. Soc. Ser. B 1956, 18, 202–211. [Google Scholar] [CrossRef]
- Kokonendji, C.C.; Mizere, D.; Balakrishnan, N. Connections of the Poisson weight function to over-dispersion and under-dispersion. J. Stat. Plan. Inference 2007, 138, 1287–1296. [Google Scholar] [CrossRef]
- Castillo, J.; Perez-Casany, M. Weighted Poisson distributions for overdispersion and underdispersion situations. Ann. Inst. Stat. Math. 1998, 50, 567–585. [Google Scholar] [CrossRef]
- Gupta, P.L.; Gupta, R.C.; Tripathi, R.C. On the monotonic properties of discrete failure rates. J. Stat. Plan. Inference 1997, 65, 255–268. [Google Scholar] [CrossRef]
- Xie, M.; Gaudoin, O.; Bracquemond, C. Redefining failure rate function for discrete distribution. Int. J. Reliab. Qual. Saf. Eng. 2002, 9, 275–285. [Google Scholar] [CrossRef]
- Neyman, J.; Pearson, E.S. On the use and interpretation of certain test criteria for purposes of statistical inference. Biometrika 1928, 20, 175–240. [Google Scholar]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).