Next Article in Journal
Explicit Runge–Kutta Numerical Manifold Method for Solving the Burgers’ Equation via the Hopf–Cole Transformation
Previous Article in Journal
A Conjugate Linearly Polarized Light Wave Along an Optical Fiber with the Berry Phase Model and Its Magnetic Trajectories According to the Conjugate Frame
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Discretization of the Weibull-G Family of Distributions: Properties, Parameter Estimates, and Applications of a New Discrete Distribution

Department of Statistics, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
*
Author to whom correspondence should be addressed.
Symmetry 2024, 16(11), 1519; https://doi.org/10.3390/sym16111519
Submission received: 8 October 2024 / Revised: 6 November 2024 / Accepted: 11 November 2024 / Published: 13 November 2024
(This article belongs to the Section Mathematics)

Abstract

:
In this article, we introduce a new three-parameter distribution called the discrete Weibull exponential (DWE) distribution, based on the use of a discretization technique for the Weibull-G family of distributions. This distribution is noteworthy, as its probability mass function presents both symmetric and asymmetric shapes. In addition, its related hazard function is tractable, exhibiting a wide range of shapes, including increasing, increasing–constant, uniform, monotonically increasing, and reversed J-shaped. We also discuss some of the properties of the proposed distribution, such as the moments, moment-generating function, dispersion index, Rényi entropy, and order statistics. The maximum likelihood method is employed to estimate the model’s unknown parameters, and these estimates are evaluated through simulation studies. Additionally, the effectiveness of the model is examined by applying it to three real data sets. The results demonstrate that, in comparison to the other considered distributions, the proposed distribution provides a better fit to the data.

1. Introduction

The use of statistical distributions is crucial in fitting various types of data. Traditional distributions do not always capture the diverse shapes and behaviors found in natural data, leading to the development of new families of distributions to accommodate more complex data. On the other hand, discrete distributions are essential for measuring quantities (e.g., the lifetime of products) on a non-continuous scale, prompting researchers to introduce new discrete distributions. These distributions, however, have limitations in modeling complex data, encouraging the development of more flexible options. As a result, interest has increased in discretizing existing continuous models to achieve a better fit for a larger range of data sets. A statistical method relying on creating new discrete G classes of probability models from existing continuous classes has recently been derived. For instance, Ref. [1] presented and defined the discrete Gompertz-G family of distributions following [2]. A new G family of continuous distributions, named the exponential generalized-G family, was presented in [3]. Following [4], Ref. [5] introduced and discussed the discrete Rayleigh G family of distributions, whereas [6] suggested a discrete analogue of the Weibull-G family.
Recently, Ref. [7] presented a new method, called transformed-transformer, to generate new families of distributions. They chose the weight function W ( G ( x , δ ) ) = l o g ( 1 G ( x , δ ) ) , where G ( x , δ ) is a cumulative distribution function of any baseline distribution with parameter vector δ , in order to introduce the Weibull-G, beta-exponential-G, and gamma-G families of distributions.
In this paper, W ( G ( x , δ ) ) = l o g ( 1 G ( x , δ ) ) , which has been discussed in [7,8,9], is specifically considered. The cumulative distribution function (CDF) of the Weibull-G distribution is defined as
F ( x , α , β , δ ) = 0 l o g ( 1 G ( x , δ ) ) α β t β α 1 e ( t β ) α d t = 1 e l o g ( 1 G ( x , δ ) ) β α ,
with corresponding probability density function (PDF):
f ( x ; α , β , δ ) = α β g ( x , δ ) 1 G ( x , δ ) l o g ( 1 G ( x , δ ) ) β α 1 e l o g ( 1 G ( x , δ ) ) β α .
The survival function (SF) and the hazard rate function (HRF) are, respectively, given by
S ( x ; α , β , δ ) = e l o g ( 1 G ( x , δ ) ) β α ,
h ( x ; α , β , δ ) = α β g ( x , δ ) 1 G ( x , δ ) l o g ( 1 G ( x , δ ) ) β α 1 .
Over the last few decades, several discretized forms of continuous distributions have been derived to model different discrete data sets. The most notable discretization approach is the survival discretization method proposed by [10]. It is a novel technique that utilizes the survival function of any continuous distribution to develop a new discrete distribution. The probability mass function of a discrete distribution has been defined in [10] as follows:
P ( X = x ) = S ( x ) S ( x + 1 ) , x = 0 , 1 , 2 , ,
where S ( x ) is the survival function of the continuous distribution, defined as
S ( x ) = P ( X x ) = 1 F ( x ; Θ ) ,
where F ( x ; Θ ) is a CDF of the continuous distribution and Θ is a vector of parameters.
This approach has recently been extended to define some new discrete families of probability distributions. For example, Ref. [1] introduced the discrete Gompertz-G family of distributions and studied the discrete Gompertz exponential, discrete Gompertz–Weibull, and discrete Gompertz–inverse Weibull distributions; Ref. [5] studied the discrete Rayleigh family of distributions and proposed the discrete Rayleigh–Weibull distribution; Ref. [11] presented the odd Perks-G family of distributions and presented the discrete odd Perks exponential distribution; Ref. [12] introduced a discrete exponential generalized-G family of distributions and introduced the discrete exponential generalized Weibull distribution; and Ref. [13] proposed a discrete analogue of the odd Weibull-G family of distributions and studied the discrete odd Weibull–Geometric and the discrete odd Weibull–Inverse Weibull distributions.
Therefore, the main focus of this paper is to utilize the discretization technique proposed in [10] to construct a new discrete distribution that is more flexible in fitting data. To illustrate this, in this study, we use the survival discretization technique to convert the continuous Weibull-G (W-G) family to a discrete family, named the discrete Weibull-G (DW-G) family, in order to develop a three-parameter distribution, which is called the discrete Weibull exponential (DWE) distribution.
Starting with the continuous Weibull-G family proposed in [7], and utilizing the discretization approach from [10], the CDF of THE discrete Weibull-G (DW-G) family can be formulated as follows:
F ( x ; α , β , δ ) = 1 e l o g ( 1 G ( x + 1 , δ ) ) β α .
and the corresponding SF of the DW-G family can be derived as follows:
S ( x ; α , β , δ ) = e l o g ( 1 G ( x + 1 , δ ) ) β α .
Therefore, the probability mass function (PMF) of the DW-G family can be expressed as follows:
f ( x ; α , β , δ ) = e l o g ( 1 G ( x , δ ) ) β α e l o g ( 1 G ( x + 1 , δ ) ) β α .
and based on Equations (8) and (9), the HRF can then be written as follows:
h ( x ; α , β , δ ) = 1 e l o g ( 1 G ( x + 1 , δ ) ) β α e l o g ( 1 G ( x , δ ) ) β α .
The remainder of this article is organized as follows: Section 2 introduces the DWE distribution as well as its survival and hazard rate functions. Section 3 explores some fundamental statistical properties of the DWE distribution. Section 4 discusses the estimation of its distribution parameters using the method of maximum likelihood estimation (MLE). Section 5 supplies a simulation study in order to evaluate the estimates. Section 6 provides three real data applications which exemplify the adequacy of the proposed distribution. Finally, Section 7 concludes this paper.

2. Discrete Weibull Exponential Distribution (DWE)

If X is the exponential random variable with parameter θ > 0 , then its CDF is given by
G ( x ) = 1 e θ x , x > 0 , θ > 0 .
Substituting G ( x ) in Equations (7) and (9), the DWE distribution has the following CDF and PMF, respectively:
f ( x ; α , β , θ ) = e θ x β α e θ ( x + 1 ) β α ,
F ( x ; α , β , θ ) = 1 e θ ( x + 1 ) β α .
Moreover, the SF and HRF are, respectively, given as
S ( x ) = e θ x β α ,
h ( x ) = 1 e θ ( x + 1 ) β α e θ x β α .
Some PMF and HRF plots for the DWE model generated under selected parameter values are shown in Figure 1 and Figure 2. From Figure 1, it can be seen that the PMF plots of the DWE distribution exhibit various shapes, which include symmetric, right skewed, increasing, decreasing, J-shaped, and reversed J-shaped curves. Additionally, Figure 2 reveals that the HRF of the DWE distribution presents a variety of shapes. In particular, it can be increasing, increasing–constant, decreasing, uniform, or monotonically increasing. These observations strongly suggest the great flexibility of the DWE distribution in fitting data.

3. Statistical Properties of the DWE Distribution

Some properties of the DWE distribution are detailed in this section, including the quantile function, moments, moment-generating function, dispersion index, coefficient of variation, Rényi entropy, and order statistics.

3.1. The Quantile Function and the Median

The q t h quantile of DWE distribution can be obtained by
x q = β θ log 1 1 q 1 α 1 ,
where . is the ceiling function, which returns the smallest integer greater than or equal to its argument.
In particular, if we set q = 0.5, we obtain the value of the median of the proposed distribution; in particular, the median M is given as:
M = β θ { log ( 2 ) } 1 α 1 .

3.2. Moments

The rth moment of a random variable X, which follows the DWE distribution, can be calculated as
μ r ̀ = E ( x r ) = x = 0 x r f ( x ) = x = 1 [ x r ( x 1 ) r ] e θ x β α ,
where f ( x ) is the PMF of DWE distribution.
Therefore,
μ 1 ̀ = E ( x ) = x = 1 e θ x β α ,
μ 2 ̀ = E ( x 2 ) = x = 1 ( 2 x 1 ) e θ x β α ,
μ 3 ̀ = E ( x 3 ) = x = 1 ( 3 x 2 3 x + 1 ) e θ x β α ,
μ 4 ̀ = E ( x 4 ) = x = 1 ( 4 x 3 6 x 2 + 4 x 1 ) e θ x β α .
Thus, the variance of the DWE distribution is obtained as follows:
σ 2 = μ 2 ̀ μ 1 ̀ 2 = x = 1 ( 2 x 1 ) e θ x β α x = 1 e θ x β α 2 .
Moreover, the skewness and kurtosis can be found, respectively, as follows:
s k e w n e s s ( s k ) = μ 3 ̀ 3 μ 2 ̀ μ 1 ̀ + 2 μ 1 ̀ 2 σ 2 3 2 ,
k u r t o s i s ( K ) = μ 4 ̀ 4 μ 3 ̀ μ 1 ̀ + 6 μ 2 ̀ μ 1 ̀ 2 3 μ 1 ̀ 4 ( σ 2 ) 2 .

3.3. The Moment-Generating Function

Assume that X is a non-negative random variable (RV), which follows the DWE distribution. Then, the moment-generating function of the RV X can be derived as follows:
M x ( t ) = E ( e t x ) = x = 0 e t x p ( x ) = x = 0 e t x [ e θ x β α e θ ( x + 1 ) β α ] = 1 + x = 1 [ e t x e t ( x 1 ) ] e θ x β α .

3.4. The Dispersion Index and Coefficient of Variation

The dispersion index (DSI) of the DWE distribution is defined as
D S I = x = 1 ( 2 x 1 ) e θ x β α x = 1 e θ x β α 2 x = 1 e θ x β α .
Furthermore, the coefficient of variation (COV) of the DWE distribution is calculated as follows
c o v ( x ) = v a r ( x ) 1 2 m e a n = x = 1 ( 2 x 1 ) e θ x β α x = 1 e θ x β α 2 0.5 x = 1 e θ x β α .
Some numerical results for the mean, variance, skewness (sk), kurtosis (K), DSI, and COV are presented in Table 1.
From Table 1, The following is clear: As θ increases under fixed values of α and β , the mean and variance decrease, whereas the sk and K increase. As β increases under fixed values of θ and α , the mean, variance, sk, and K increase. As α increases under fixed values of θ and β , the mean, variance, sk, and K decrease. In addition, the DSI can be less or greater than 1, which means that it is suitable for representing under-or over-dispersed phenomena.

3.5. The Rényi Entropy

The Rényi Entropy (RE), proposed in [14], is a fundamental entropy function. It is obtained as follows:
γ R ( ρ ) = 1 1 ρ log x = 0 f ρ ( x ) ,
where ρ > 0 and ρ 0 .
Substituting Equation (12) into the entropy function, we obtain
γ R ( ρ ) = 1 1 ρ log x = 0 e θ x β α e θ ( x + 1 ) β α ρ .

3.6. The Order Statistic

Assume that x 1 , x 2 , . . . , x n are randomly sampled from the DWE distribution and let X r : n denote the rth order statistic.
Consider F r ( x ; α , β , θ ) ; the CDF of the ith order statistic for a random sample x 1 , x 2 , , x n from the DWE distribution is given by
F r : n = i = r n n i ( F ( x ) ) i [ 1 F ( x ) ] n i .
using the binomial expansion equation,
( 1 x ) n i = j = 0 n i ( 1 ) j n i j x j ,
and, substituting Equation (13) into Equation (19), the CDF of the order statistic can be presented as
F r : n = i = r n n i ( F ( x ) ) i [ 1 F ( x ) ] n i
= i = r n j = 0 n i ( 1 ) j n i n i j ( F ( x ) ) i ( F ( x ) ) j
= i = r n j = 0 n i ( 1 ) j n i n i j ( F ( x ) ) i + j
= i = r n j = 0 n i ( 1 ) j n i n i j ( 1 e θ ( x + 1 ) β α ) i + j .
The corresponding PMF of the rth order statistic can be obtained by applying the formula discussed in [15]:
f r ( z ) = F r ( z ) F r ( z 1 ) .
Thus,
f r ( x ; α , β , θ ) = i = r n j = 0 n i ( 1 ) j n i n i j ( 1 e θ ( x + 1 ) β α ) i + j ( 1 e θ x β α ) i + j .
In particular, setting r = 1 and r = n in Equation (26), we can obtain the PMFs of the minimum and maximum order statistics, respectively.

4. Parameter Estimation for DWE Distribution

The MLE approach was used to estimate the parameters for the DWE distribution. The steps are as follows: First, define the log-likelihood function. Second, consider the three parameters θ , β , and α , and differentiate the log-likelihood function with respect to each parameter. Finally, set the derivative equations to zeros. The likelihood function L ( x ; θ , β , α ) for the DWE distribution is given as follows
L ( X ; θ , β , α ) = i = 1 n F ( X i ) = i = 1 n e θ x β α e θ ( x + 1 ) β α .
Then, the log-likelihood function for the DWE distribution can be defined as
( θ , β , α ; x ) = i = 1 n log e θ x β α e θ ( x + 1 ) β α .
The derivatives of Equation (27), with respect to the DWE distribution parameters, are obtained as follows:
θ ( θ , β , α ; x ) = α β α θ α 1 i = 1 n ( x i + 1 ) α e θ x i + 1 / β α x i α e θ x i / β α e θ x i / β α e θ x i + 1 / β α ,
β ( θ , β , α ; x ) = α β α + 1 θ α i = 1 n x i α e θ x i / β α ( x i + 1 ) α e θ ( x i + 1 ) / β α e θ x i / β α e θ x i + 1 / β α ,
α ( θ , β , α ; x ) = θ β α i = 1 n ( x i + 1 ) α ln θ x i + 1 / β e θ x i + 1 / β α x i α ln θ x i / β e θ x i / β α e θ x i / β α e θ x i + 1 / β α .
Hence, the MLEs of the parameters θ , β , and α can be obtained by equating Equations (28)–(30) to zero, and solving them either analytically or numerically, using a method such as the Newton–Raphson iteration method. Moreover, using an R function like optim and nim, the estimators can automatically be obtained by maximizing Equation (27).

5. Simulation Study

In this section, three cases are presented to examine the performance of the MLEs of the DWE distribution parameters. Some values were considered for the parameters, as follows:
Case I:
θ = 0.5 , β = 0.6 , α = 1.1 .
Case II:
θ = 0.2 , β = 0.5 , α = 2.4 .
Case III:
θ = 0.05 , β = 0.4 , α = 1.3 .
The mean square error (MSE) was applied to evaluate the MLE, ω ^ for every parameter. The number of iterations for each case in the simulation was nsim = 10,000. The mean square error (MSE) can be calculated as
M S E ( ω ^ ) = i = 1 n s i m ( ω i ^ ω t r ) 2 n s i m .
The Monte Carlo simulation method was used. The MLEs of the parameters and their corresponding MSEs are listed in Table 2.
From Figure 3, Figure 4 and Figure 5, and Table 2, the MSE clearly becomes smaller as the sample size increases, and the MLEs become nearer to the true parameters.

6. Application

In this section, the flexibility of the DWE distribution is illustrated by fitting three real data sets using six different distributions, in addition to the proposed DWE distribution. The PMFs for the six distributions used in the comparison are as follows:
(1)
Geometric Distribution (Geom)
f ( x ) = ( 1 θ ) x 1 θ ; 0 < θ < 1 .
(2)
Poisson distribution (Pois)
f ( x ) = λ x e x p ( λ ) x ! , λ > 0 , x = 0 , 1 , 2 , . .
(3)
Discrete Weibull–Geometric Distribution (DWGeom) [16]:
f ( x ) = ( 1 p ) [ ρ x α ρ ( x + 1 ) α ] ( 1 p ρ x α ) [ 1 p ρ ( x + 1 ) α ] ; 0 < p < 1 , 0 < ρ < 1 , α > 0 .
(4)
Discrete fréchet (dfréchet) [17]:
f ( x ) = e x p β x + 1 α e x p β x α ; α , β > 0 .
(5)
Discrete extended odd Weibull exponential distribution (DEOWE) [18]:
f ( x ) = 1 + β e λ x 1 α 1 β 1 + β e λ ( x + 1 ) 1 α 1 β ; α , β , λ > 0 .
(6)
Discrete Weibull Distribution (DWD) [19]:
f ( x ) = q y β q ( y + 1 ) β ; y = 0 , 1 , 2 , . . , β > 0 , q ( 0 , 1 ) .
The parameters of fitted distributions were estimated using the maximum likelihood method (i.e., maximizing the log-likelihood). Goodness-of-fit (GOF) criteria, including the Corrected Akaike Information Criterion (AICc) [20], Akaike’s Information Criterion (AIC) [21], Bayesian Information Criterion (BIC) [22], and Hannan–Quinn Information Criterion (HQIC) [23], were used to discriminate the best model. Plots were generated for comparison between the DWE distribution and the other distributions, and the Kolmogorov–Simornov (K-S) test was conducted to determine the p-value for each distribution. The formulas for the GOF criteria are listed below:
A I C = 2 + 2 h ,
A I C c = A I C + 2 h ( h + 1 ) n h 1 ,
B I C = 2 + h l o g ( n ) ,
H Q I C = 2 + 2 h l o g ( l o g ( n ) ) ,
where is the log-likelihood function evaluated for the MLEs, h is the number of estimated parameters for the model, and n is the sample size.

6.1. First Data Set

The first data set is presented in [24]. It contains 26 observations that indicate the failure times for a specific product. This information has also been used in [11,25]. Figure 6 and Figure 7 show the p-p plot of the fitted distributions and the empirical CDFs for this data set. A summary of the MLE results for the parameters, along with the SEs, log-likelihood, AICc, AIC, BIC, and HQIC for each distribution is provided in Table 3.

6.2. Second Data Set

The second data set was reported in [26] and provides 61-day COVID-19 data recorded between 13 June and 12 August of 2021 in Italy. This data set includes newly reported cases on a daily basis. Figure 8 and Figure 9 show the p-p plots of the fitted distributions and the empirical CDFs for the second data set, respectively. A summary of the results of the MLEs of the parameters along with the SEs, the log-likelihood, AICc, AIC, BIC, and HQIC for each distribution is provided in Table 4.

6.3. Third Data Set

The third data set was reported in [27], and provides 101 observations. It includes the fatigue life (at 18 cycles per second) of 6061-T6 aluminum coupons that were cut parallel to the direction of rolling and oscillated. Figure 10 and Figure 11 show the p-p plots of the fitted distributions and the empirical CDFs for the third data set, respectively. A summary of the results of the MLEs of the parameters along with the SEs, the log-likelihood, AICc, AIC, BIC, and HQIC for each distribution is provided in Table 5.
From Table 3, Table 4 and Table 5, it is evident that the DWE distribution outperformed the other distributions when considering the AICc, AIC, BIC, and HQIC. In addition, Figure 6, Figure 8 and Figure 10, as well as Figure 7, Figure 9 and Figure 11 support this conclusion.

7. Conclusions

In this work, discretization of a continuous family of distributions was studied. This discretized family of distributions is called the discrete Weibull-G family of distributions, and the three-parameter DWE distribution is presented as a member of this family. Both the PMF and HRF of the DWE distribution have shapes that appear to be promising for capturing the various behaviors of data. Furthermore, some of the statistical properties of this distribution were studied. Its three parameters were estimated using the maximum likelihood method. Three cases with different sample sizes and different DWE parameter values were used to assess the performance of the MLEs of the DWE distribution parameters. Three real data sets were utilized to examine the efficiency of the DWE distribution, in comparison with various commonly used distributions. The presented figures demonstrate the usefulness of the proposed distribution. The obtained results indicate that the proposed DWE distribution represents a highly flexible distribution with good potential for modeling real data.

Author Contributions

Conceptualization, D.A. and H.K.; methodology, A.B. and D.A.; software, A.B.; validation, D.A. and H.K.; formal analysis, A.B.; investigation, A.B., D.A. and H.K.; resources, A.B.; data curation, A.B.; writing—original draft preparation, A.B.; writing—review and editing, D.A. and H.K.; visualization, A.B.; supervision, D.A. and H.K.; project administration, A.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Eliwa, M.; Alhussain, Z.; El-Morshedy, M. Discrete Gompertz-G family of distributions for over-and under-dispersed data with properties, estimation, and applications. Mathematics 2020, 8, 358. [Google Scholar] [CrossRef]
  2. Steutel, F.W.; Van Harn, K. Infinite Divisibility of Probability Distributions on the Real Line; CRC Press: Boca Raton, FL, USA, 2003. [Google Scholar]
  3. Yousof, H.M.; Majumder, M.; Jahanshahi, S.; Masoom Ali, M.; Hamedani, G. A new Weibull class of distributions: Theory, characterizations and applications. J. Stat. Res. Iran JSRI 2018, 15, 45–82. [Google Scholar] [CrossRef]
  4. Bourguignon, M.; Silva, R.B.; Cordeiro, G.M. The Weibull-G family of probability distributions. J. Data Sci. 2014, 12, 53–68. [Google Scholar] [CrossRef]
  5. Aboraya, M.; M. Yousof, H.; Hamedani, G.; Ibrahim, M. A new family of discrete distributions with mathematical properties, characterizations, Bayesian and non-Bayesian estimation methods. Mathematics 2020, 8, 1648. [Google Scholar] [CrossRef]
  6. Ibrahim, M.; Ali, M.M.; Yousof, H.M. The discrete analogue of the Weibull G family: Properties, different applications, Bayesian and non-Bayesian estimation methods. Ann. Data Sci. 2021, 10, 1069–1106. [Google Scholar] [CrossRef]
  7. Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
  8. Alzaatreh, A.; Ghosh, I. On the Weibull-X family of distributions. J. Stat. Theory Appl. 2015, 14, 169–183. [Google Scholar] [CrossRef]
  9. Cordeiro, G.M.; M. Ortega, E.M.; Ramires, T.G. A new generalized Weibull family of distributions: Mathematical properties and applications. J. Stat. Distrib. Appl. 2015, 2, 13. [Google Scholar] [CrossRef]
  10. Kemp, A.W. Classes of Discrete Lifetime Distributions. Commun. Stat.—Theory Methods 2004, 33, 3069–3093. [Google Scholar] [CrossRef]
  11. Elbatal, I.; Alotaibi, N.; Almetwally, E.M.; Alyami, S.A.; Elgarhy, M. On Odd Perks-G Class of Distributions: Properties, Regression Model, Discretization, Bayesian and Non-Bayesian Estimation, and Applications. Symmetry 2022, 14, 883. [Google Scholar] [CrossRef]
  12. Eliwa, M.S.; El-Morshedy, M.; Yousof, H.M. A Discrete Exponential Generalized-G Family of Distributions: Properties with Bayesian and Non-Bayesian Estimators to Model Medical, Engineering and Agriculture Data. Mathematics 2022, 10, 3348. [Google Scholar] [CrossRef]
  13. El-Morshedy, M.; Eliwa, M.; Tyagi, A. A discrete analogue of odd Weibull-G family of distributions: Properties, classical and Bayesian estimation with applications to count data. J. Appl. Stat. 2022, 49, 2928–2952. [Google Scholar] [CrossRef]
  14. Rényi, A. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics; University of California Press: Berkeley, CA, USA, 1961; Volume 4, pp. 547–562. [Google Scholar]
  15. Arnold, B.C.; Balakrishnan, N.; Nagaraja, H.N. A First Course in Order Statistics; Wiley: New York, NY, USA, 1992. [Google Scholar]
  16. Jayakumar, K.; Babu, M.G. Discrete Weibull geometric distribution and its properties. Commun. Stat.—Theory Methods 2018, 47, 1767–1783. [Google Scholar] [CrossRef]
  17. Nadarajah, S.; Lyu, J. New discrete heavy tailed distributions as models for insurance data. PLoS ONE 2023, 18, e0285183. [Google Scholar] [CrossRef]
  18. Nagy, M.; Almetwally, E.M.; Gemeay, A.M.; Mohammed, H.S.; Jawa, T.M.; Sayed-Ahmed, N.; Muse, A.H. The new novel discrete distribution with application on covid-19 mortality numbers in Kingdom of Saudi Arabia and Latvia. Complexity 2021, 2021, 7192833. [Google Scholar] [CrossRef]
  19. Augusto Taconeli, C.; Rodrigues de Lara, I.A. Discrete Weibull distribution: Different estimation methods under ranked set sampling and simple random sampling. J. Stat. Comput. Simul. 2022, 92, 1740–1762. [Google Scholar] [CrossRef]
  20. Hurvich, C.M.; Tsai, C.L. Regression and time series model selection in small samples. Biometrika 1989, 76, 297–307. [Google Scholar] [CrossRef]
  21. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  22. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  23. Hannan, E.J.; Quinn, B.G. The determination of the order of an autoregression. J. R. Stat. Soc. Ser. B Methodol. 1979, 41, 190–195. [Google Scholar] [CrossRef]
  24. Gacula, M., Jr.; Kubala, J. Statistical models for shelf life failures. J. Food Sci. 1975, 40, 404–409. [Google Scholar] [CrossRef]
  25. Nassar, M.; Kumar, D.; Dey, S.; Cordeiro, G.M.; Afify, A.Z. The Marshall–Olkin alpha power family of distributions with applications. J. Comput. Appl. Math. 2019, 351, 41–53. [Google Scholar] [CrossRef]
  26. Almetwally, E.M.; Abdo, D.A.; Hafez, E.; Jawa, T.M.; Sayed-Ahmed, N.; Almongy, H.M. The new discrete distribution with application to COVID-19 Data. Results Phys. 2022, 32, 104987. [Google Scholar] [CrossRef] [PubMed]
  27. Birnbaum, Z.W.; Saunders, S.C. Estimation for a family of life distributions with applications to fatigue. J. Appl. Probab. 1969, 6, 328–347. [Google Scholar] [CrossRef]
Figure 1. PMF plots of the DWE model.
Figure 1. PMF plots of the DWE model.
Symmetry 16 01519 g001
Figure 2. HRF plots of the DWE model.
Figure 2. HRF plots of the DWE model.
Symmetry 16 01519 g002
Figure 3. The MSE of θ ^ , β ^ , α ^ for the DWE (0.5, 0.6, 1.1).
Figure 3. The MSE of θ ^ , β ^ , α ^ for the DWE (0.5, 0.6, 1.1).
Symmetry 16 01519 g003
Figure 4. The MSE of θ ^ , β ^ , α ^ for the DWE (0.2, 0.5, 2.4).
Figure 4. The MSE of θ ^ , β ^ , α ^ for the DWE (0.2, 0.5, 2.4).
Symmetry 16 01519 g004
Figure 5. The MSE of θ ^ , β ^ , α ^ for the DWE (0.05, 0.4, 1.3).
Figure 5. The MSE of θ ^ , β ^ , α ^ for the DWE (0.05, 0.4, 1.3).
Symmetry 16 01519 g005
Figure 6. p-p plots for the DWE and other considered distributions for the first data set.
Figure 6. p-p plots for the DWE and other considered distributions for the first data set.
Symmetry 16 01519 g006
Figure 7. Plots of the empirical CDFs of the DWE and other considered distributions for the first data set.
Figure 7. Plots of the empirical CDFs of the DWE and other considered distributions for the first data set.
Symmetry 16 01519 g007
Figure 8. p-p plots of the DWE and other considered distributions for the second data set.
Figure 8. p-p plots of the DWE and other considered distributions for the second data set.
Symmetry 16 01519 g008
Figure 9. Plots of the empirical CDFs of the DWE and other considered distributions for the second data set.
Figure 9. Plots of the empirical CDFs of the DWE and other considered distributions for the second data set.
Symmetry 16 01519 g009
Figure 10. p-p plots of the DWE and other considered distributions for the third data set.
Figure 10. p-p plots of the DWE and other considered distributions for the third data set.
Symmetry 16 01519 g010
Figure 11. Plots of the empirical CDFs of the DWE and other considered distributions for the third data set.
Figure 11. Plots of the empirical CDFs of the DWE and other considered distributions for the third data set.
Symmetry 16 01519 g011
Table 1. Numerical results for mean, variance (var), skewness (sk), kurtosis (K), dispersion index (DSI), and coefficient of variation (COV).
Table 1. Numerical results for mean, variance (var), skewness (sk), kurtosis (K), dispersion index (DSI), and coefficient of variation (COV).
θ β α MeanvarSkKDSICOV
0.90.80.80.636131.48203652.62349711.673082.1789321.764302
1.00.80.80.536511.15627652.69235912.161141.9552281.625485
1.10.80.80.467580.96184872.75516012.460551.9869281.709372
1.20.80.80.405720.79070062.82756212.820222.1183632.330601
1.30.80.80.350360.64225562.86849013.084211.1948902.651182
1.40.80.80.313170.54643062.91554713.277301.5656572.502524
1.50.80.80.274630.45997403.00700313.822241.3926222.460666
1.60.80.80.244850.39377043.05003913.981681.1114852.028944
1.70.80.80.220990.34268843.10534314.222922.1447813.195817
0.70.92.50.638390.33189480.21505452.2736800.63973061.0325782
0.71.02.50.763100.38383900.22299472.5914190.48852160.7450759
0.71.12.50.893390.44712070.24318852.7089280.49707600.7233519
0.71.22.51.019030.51340840.25751462.7585250.44994500.6674504
0.71.32.51.141100.58355640.26296622.7817590.39572190.6228665
0.71.42.51.274590.66692390.26753512.7821060.50707070.6369117
0.71.52.51.397790.74826410.27378002.7960960.50049790.5936862
0.71.62.51.526710.83639440.28109062.7998820.47979800.5476073
0.71.72.51.655140.93562850.28490572.7995890.66959920.6572667
0.50.50.90.655771.27871462.2734389.4179772.04055041.719686
0.50.51.00.581570.91623122.0256037.9604261.51805731.442058
0.50.51.10.536700.72776651.8319576.8413251.48426291.466665
0.50.51.20.499810.59271571.6576375.9295151.24757381.564041
0.50.51.30.468780.49698401.4987455.1754200.94372291.835876
0.50.51.40.451800.43688851.3518884.5021360.89159891.474663
0.50.51.50.431610.38657971.2424264.0344420.78002241.316580
0.50.51.60.418770.35073321.1333003.5923760.76504171.289626
0.50.51.70.406920.32446791.0491933.2426910.81066481.373050
Table 2. Simulation results for the DWE MLEs and MSEs with different sample sizes for the three considered cases.
Table 2. Simulation results for the DWE MLEs and MSEs with different sample sizes for the three considered cases.
Sample SizeParameterCase ICase IICase III
MLEMSEMLEMSEMLEMSE
n = 30 θ 0.50745280.0038824140.20251472.214930 × 10−40.051383925.946656 × 10−5
β 0.59640330.0028175210.50157743.806611 × 10−50.401951446.627253 × 10−6
α 1.19297660.0876456682.52986291.904817 × 10−11.359385944.455467 × 10−2
n = 100 θ 0.50251150.00109402150.20113776.642254 × 10−50.050588121.721367 × 10−5
β 0.59898360.00076265120.50138011.236618 × 10−50.401712443.818087 × 10−6
α 1.12418680.01790103812.44089134.721245 × 10−21.318162451.151682 × 10−2
n = 200 θ 0.50105360.00054438310.20076823.185856 × 10−50.050298128.172484 × 10−6
β 0.59964540.00038195480.50146587.656253 × 10−60.401496832.769827 × 10−6
α 1.11142040.00851556572.41740742.289356 × 10−21.308452165.572183 × 10−3
n = 500 θ 0.50067240.00021126960.20066831.355192 × 10−50.050263563.346773 × 10−6
β 0.59982960.00014737790.50125363.362149 × 10−60.401403772.338581 × 10−6
α 1.10451330.00330424062.40754698.797369 × 10−31.303856042.169917 × 10−3
Table 3. MLEs (with their corresponding SEs in parentheses ) and GOF criteria for the first data set.
Table 3. MLEs (with their corresponding SEs in parentheses ) and GOF criteria for the first data set.
DistributionsDWEGEOMPoisDWGeomDFrechetDEOWEDW
Parameter estimation θ ^  = 0.0505 θ ^  = 0.0233 λ ^  = 42.884 p ^  = 0.1166 β ^  = 35.906 α ^  = 2.6158 q ^  = 0.9941
(0.0995)(0.0045)(1.2843)(0.3936)(2.1594)(0.2543)(0.0037)
β ^  = 2.4137 ρ ^  = 0.9959 α ^  = 3.4671 β ^  = 0.6271 β ^  = 1.3578
(4.751) (0.0021)(0.5085)(0.8295)(0.1654)
α ^  = 4.3467 α ^  = 1.4229 λ ^  = 0.0162
(0.6987) (0.1404) (0.0006)
−logL100.1117123.4158114.7385116.5324104.1828103.9409116.8963
AICc207.3144248.9983231.6436240.1557212.8872214.9728238.3143
AIC206.2235248.8316231.4769239.0648212.3655213.8819237.7925
BIC209.9978250.0897232.7350242.8391214.8817217.6562240.3087
HQIC207.3103249.1939231.8392240.1517213.0901214.9688238.5171
p-Value6.7108 × 10−11.2006 × 10−43.9658 × 10−23.6631 × 10−34.0413 × 10−15.5648 × 10−11.4349 × 10−3
Table 4. MLEs (with their corresponding SEs in parentheses ) and GOF criteria for the second data set.
Table 4. MLEs (with their corresponding SEs in parentheses ) and GOF criteria for the second data set.
DistributionsDWEGEOMPoisDWGeomDFrechetDEOWEDW
Parameter estimation θ ^  = 0.0362 θ ^  = 0.0442 λ ^  = 22.623 p ^  = 0.0211 β ^  = 14.604 α ^  = 7.8043 q ^  = 0.9925
(0.0836 )(0.0055)(0.609)(0.36163 )(1.2822)(1.5493)(0.0031)
β ^  = 0.9462 ρ ^  = 0.9923 α ^  = 1.5544 β ^  = 17.578 β ^  = 1.5259
(2.1857 ) (0.0044)(0.1407)(6.7639)(0.1236)
α ^  = 1.9434 α ^  = 1.5108 λ ^  = 0.0883
(0.1868) (0.1346) (0.012)
−logL235.5084249.8884349.2768238.6766245.6704248.2538238.3603
AICc477.4378501.8445700.6214483.7743495.5477502.9286480.9276
AIC477.0168501.7767700.5536483.3533495.3408502.5076480.7207
BIC483.3494503.8876702.6645489.6859499.5625508.8402484.9424
HQIC479.4986502.6040701.3809485.8351496.9953504.9894482.3752
p-Value8.3373 × 10−11.0404 × 10−33.245 × 10−44.3862 × 10−12.4088 × 10−16.9468 × 10−32.0653 × 10−1
Table 5. MLEs (with their corresponding SEs in parentheses) and GOF criteria for the third dataset.
Table 5. MLEs (with their corresponding SEs in parentheses) and GOF criteria for the third dataset.
DistributionsDWEGEOMPoisDWGeomDFrechetDEOWEDW
Parameter estimation θ ^  = 0.0165 θ ^  = 0.0146 λ ^  = 68.34 p ^  = 0.165 β ^  = 50.9829 α ^  = 5.4788 q ^  = 0.9952
(0.0188)(0.0014)(0.8267)(0.1901)(3.7091)(0.5196)(0.0013)
β ^  = 1.2586 ρ ^  = 0.9952 α ^  = 1.4642 β ^  = 11.866 β ^  = 1.2614
(1.4371) (0.0014)(0.0834)(1.8745)(0.0663)
α ^  = 3.2033 α ^  = 1.2306 λ ^  = 0.0163
(0.232921) (0.070508) (0.001101)
−logL454.2379521.7143678.2322509.1632513.9630511.3819503.4051
AICc914.72571045.46941358.50511024.57651032.04971029.01381010.9339
AIC914.47571045.42861358.46431024.32651031.92601028.76381010.8102
BIC922.29121048.03381361.06951032.14201037.13631036.57931016.0205
HQIC917.63881046.48291359.51871027.48951034.03471031.92691012.9189
P-Value6.5356 × 10−13.0345 × 10−126.5006 × 10−51.1408 × 10−91.895 × 10−62.9103 × 10−121.5569 × 10−9
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Balubaid, A.; Klakattawi, H.; Alsulami, D. On the Discretization of the Weibull-G Family of Distributions: Properties, Parameter Estimates, and Applications of a New Discrete Distribution. Symmetry 2024, 16, 1519. https://doi.org/10.3390/sym16111519

AMA Style

Balubaid A, Klakattawi H, Alsulami D. On the Discretization of the Weibull-G Family of Distributions: Properties, Parameter Estimates, and Applications of a New Discrete Distribution. Symmetry. 2024; 16(11):1519. https://doi.org/10.3390/sym16111519

Chicago/Turabian Style

Balubaid, Abeer, Hadeel Klakattawi, and Dawlah Alsulami. 2024. "On the Discretization of the Weibull-G Family of Distributions: Properties, Parameter Estimates, and Applications of a New Discrete Distribution" Symmetry 16, no. 11: 1519. https://doi.org/10.3390/sym16111519

APA Style

Balubaid, A., Klakattawi, H., & Alsulami, D. (2024). On the Discretization of the Weibull-G Family of Distributions: Properties, Parameter Estimates, and Applications of a New Discrete Distribution. Symmetry, 16(11), 1519. https://doi.org/10.3390/sym16111519

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop