Generalized Mixtures of Exponential Distribution and Associated Inference

Yang, Yaoting; Tian, Weizhong; Tong, Tingting

doi:10.3390/math9121371

Open AccessArticle

Generalized Mixtures of Exponential Distribution and Associated Inference

by

Yaoting Yang

¹,

Weizhong Tian

^2,*

and

Tingting Tong

³

¹

Department of Applied Mathematics, Xi’an University of Technology, Xi'an 710054, China

²

Department of Mathematical Sciences, Eastern New Mexico University, Portales, NM 88130, USA

³

Department of Mathematical Sciences, New Mexico State University, Las Cruces, NM 88003, USA

^*

Author to whom correspondence should be addressed.

Mathematics 2021, 9(12), 1371; https://doi.org/10.3390/math9121371

Submission received: 1 May 2021 / Revised: 31 May 2021 / Accepted: 10 June 2021 / Published: 13 June 2021

Download

Browse Figures

Versions Notes

Abstract

:

A new generalization of the exponential distribution, namely the generalized mixture of exponential distribution, is introduced. Some of its basic properties, such as hazard function, moments, order statistics, mean deviation, measures of uncertainly, and reliability probability, are studied. Three different estimation methods are investigated by the maximum likelihood estimator, least-square estimator, and weighted least-square estimator. The performances of the estimators are assessed by simulation studies. Real-world applications of the proposed distribution are explored, and data fitting results show that the new distribution performs better than its competitors.

Keywords:

generalized mixture of exponential distribution; reliability probability; maximum likelihood estimator; weighted least-square estimator

1. Introduction

Among the parametric models, exponential distribution is perhaps the most widely applied statistical distribution in several fields and plays an important role in the statistical theory of reliability and lifetime analysis. Based on this reason, statisticians have been interested in defining new classes of univariate distributions by adding one or more shape parameters to provide greater flexibility in modeling real data in many applied fields. Gupta and Kundu [1] studied the generalized exponential distribution and used it as an alternative to gamma or Weibull distribution in many situations. Gupta and Kundu [2] used the idea of Azzalini [3], introducing a new class of weighted exponential (WE) distributions, and Kharazmi et al. [4] extended it into the generalized weighted exponential (GWE) distribution. Nadarajah and Haghighi [5] discussed a new two-parameter generalization of the exponential distribution, which had its mode at zero and allowed increasing, decreasing, and constant hazard rates.

On the other hand, generalizations of exponentiated type distributions can be obtained from the class of generalized beta distributions, in particular after the works of Eugene et al. [6] Nadarajah and Kotz [7] introduced the beta exponential distribution, generated from the logit of a beta random variable. Barreto-Souza et al. [8] discussed beta generalized exponential distribution, which includes the beta exponential and generalized exponential distributions as special cases. A generalization of the exponentiated Frenchet distribution, called the beta Frenchet distribution, was studied by Barreto-Souza et al. [9]. Ristic and Balakrishnan [10] proposed the gamma exponential distribution generated by gamma random variables. Being of a similar methodology, many X-family exponential distributions were investigated recently. These include the Weibull exponential (WED) distributions which were introduced by Oguntunde et al. [11], Marshall–Olkin generalized exponential distributions which were defined by Ristic and Kundu [12], Kumaraswamy Marshall–Olkin exponential distributions which were given by George and Thobias [13], and generalized extended exponential-Weibull (GExtEW) distributions which were proposed by Shakhatreh et al. [14].

In this paper, a new class of generalized mixture exponential (GME) distribution is introduced, which has the exponential and WE distributions as its submodels. In order to motivate interest, let us first present the definition of the generalized skew normal distribution introduced by Kumar and Anusree [15]. A random variable Z is said to have a generalized skew normal distribution if its probability density function (pdf) is of the following form,

h (z, λ, β) = \frac{2}{2 + β} f (z) (1 + β F (λ z)),

where

f (z) = ϕ (z)

,

F (λ z) = Φ (λ z)

,

λ \in ℜ

and

β > - 2

. In fact, the correct values of

β

should be

β \geq - 1

, which has been discussed in Tian et al. [16].

The rest of the article is organized as follows. The GME distribution is introduced in Section 2. Some important properties of GME distributions, such as cumulative distribution function (cdf), hazard function, mean deviations, order statistics, measure of uncertainly, and reliability probability, are discussed in Section 3. Three different estimation methods are studied in Section 4. Simulations are conducted to investigate and compare the performances of the proposed estimation methods in Section 5. Two real data sets are analyzed for illustrating the usefulness of the proposed GME distribution in Section 6. Some conclusions are presented in Section 7.

2. Generalized Mixture Exponential Distribution

The GME distribution offers more flexible distributions with applications in lifetime modeling, which is defined as follows.

Definition 1.

A random variable X is said to have a GME distribution if its pdf is of the following form,

f (x; λ, α, β) = \frac{(α + 1) λ}{α + 1 + α β} e^{- λ x} [1 + β (1 - e^{- α λ x})], x > 0,

(1)

where

α > 0

is the scale parameter,

λ > 0

and

β \geq - 1

are the shape parameters, and we denote it as

X \sim G M E (λ, α, β)

.

Remark 1.

(i): For $β = 0$ , or $α \to 0$ , or $α \to \infty$ , $G M E (λ, α, β)$ is reduced into exponential distribution with parameter λ namely, $E (λ)$ .
(ii): For $β = - 1$ , $G M E (λ, α, β)$ is reduced into $E (λ (α + 1))$ .
(iii): For $β \to \infty$ , $G M E (λ, α, β)$ is reduced into $W E (λ, α)$ .

For different values of

λ, α, β

, the pdfs of

G M E (λ, α, β)

are presented in the Figure 1, which indicate that the GME distribution can generate distributions with various shapes.

Proposition 1.

The cdf, the survival function and the hazard function of

X \sim G M E (λ, α, β)

are given by

\begin{matrix} F (x; λ, α, β) & = & 1 + \frac{β e^{- (α + 1) λ x}}{α + 1 + α β} - \frac{(α + 1) (β + 1) e^{- λ x}}{α + 1 + α β}, \\ h (x; λ, α, β) & = & \frac{(α + 1) λ [1 + β (1 - e^{- λ α x})]}{(α + 1 + α β) + β (1 - e^{- λ α x})}, \\ S (x; λ, α, β) & = & \frac{(α + 1) (β + 1) e^{- λ x}}{α + 1 + α β} - \frac{β e^{- (α + 1) λ x}}{α + 1 + α β} . \end{matrix}

Proof of Proposition 1.

According to the Equation (1), we have

\begin{matrix} F (x; λ, α, β) & = \frac{λ (α + 1)}{α + 1 + α β} [(1 + β) \int_{0}^{x} e^{- λ t} d t - β \int_{0}^{x} e^{- λ α t} d t] \\ = 1 - e^{- λ x} + \frac{β}{α + 1 + α β} [e^{- λ (1 + α) x} - e^{- λ x}] . \end{matrix}

Therefore,

\begin{matrix} h (x; λ, α, β) & = \frac{f (x; λ, α, β)}{1 - F (x; λ, α, β)} = \frac{(α + 1) λ [1 + β (1 - e^{- λ α x})]}{(α + 1 + α β) + β (1 - e^{- λ α x})}, \end{matrix}

\begin{matrix} S (x; λ, α, β) & = 1 - F (x; λ, α, β) = \frac{(α + 1) (β + 1) e^{- λ x}}{α + 1 + α β} - \frac{β e^{- (α + 1) λ x}}{α + 1 + α β} . \end{matrix}

This ends the proof of Proposition 1. □

Figure 2 shows that the GME distribution produces flexible hazard rate shapes, such as decreasing, increasing, and stable.

3. General Properties of the GME Distribution

In what follows, we discuss various properties associated with the proposed distribution.

Proposition 2.

The shapes of density function of

X \sim G M E (λ, α, β)

can be characterized as follows,

(i): $f (x)$ is monotone decreasing, if $- 1 \leq β \leq 0$ or $0 < α β \leq 1$ ,
(ii): $f (x)$ is unimodal, if $α β > 1$ .

Proof of Proposition 2.

The derivatives of

f (x)

are obtained by Equation (1),

\begin{matrix} f^{'} (x) = \frac{λ^{2} (α + 1) e^{- λ x}}{α + 1 + α β} [\frac{β (α + 1)}{β + 1} e^{- α λ x} - 1] . \end{matrix}

(i): if $- 1 \leq β \leq 0$ , we have $\frac{λ^{2} (α + 1) e^{- λ x}}{α + 1 + α β} > 0$ , and $(α + 1) β \leq 0$ , thus,

$\frac{λ^{2} (α + 1) e^{- λ x}}{α + 1 + α β} [\frac{β (α + 1)}{β + 1} e^{- α λ x} - 1] < 0;$

if $0 < α β \leq 1$ , we get $e^{- α λ x} < 1$ and $\frac{β (α + 1)}{β + 1} < 1$ , thus, $[\frac{β (α + 1)}{β + 1} e^{- α λ x} - 1] < 0$ .
(ii): Setting $f^{'} (x) = 0$ , we have $x_{0} = - \frac{1}{α λ} l o g \frac{β + 1}{β + β α}$ , and $f^{''} (x_{0}) > 0$ . Thus, if $α β > 1$ , we have $f (x)$ is monotone increasing on $0 < x < x_{0}$ and monotone decreasing on $x > x_{0}$ . Thus, $f (x)$ is unimodal.

This ends the proof of Proposition 2. □

Remark 2.

The two properties in Proposition 2 are actually exclusive.

Proposition 3.

Let

X \sim G M E (λ, α, β)

, the moment generating function of X is

M_{X} (t) = \frac{λ (α + 1)}{α + 1 + α β} (\frac{1 + β}{λ - t} - \frac{β}{λ α + λ - t}), t < λ .

Proof of Proposition 3.

According to Equation (1) and the definition of moment generating function,

\begin{matrix} M_{X} (t) & = & \frac{λ (α + 1)}{α + 1 + α β} [(1 + β) \int_{0}^{\infty} e^{(t - λ) x} d x - β \int_{0}^{\infty} e^{(t - λ - λ α) x} d x] \\ = & \frac{λ (α + 1)}{α + 1 + α β} (\frac{1 + β}{λ - t} - \frac{β}{λ α + λ - t}), t < λ . \end{matrix}

This ends the proof of Proposition 3. □

Corollary 1.

Let

X \sim G M E (λ, α, β)

, the first four moments of X are

\begin{matrix} E [X] & = & \frac{{(1 + α)}^{2} (1 + β) - β}{λ (α + 1 + α β) (1 + α)}, E [X^{2}] = \frac{2 {(1 + α)}^{3} (1 + β) - 2 β}{λ^{2} (α + 1 + α β) {(1 + α)}^{2}}, \\ E [X^{3}] & = & \frac{6 {(1 + α)}^{4} (1 + β) - 6 β}{λ^{3} (α + 1 + α β) {(1 + α)}^{3}}, E [X^{4}] = \frac{24 {(1 + α)}^{5} (1 + β) - 24 β}{λ^{4} (α + 1 + α β) {(1 + α)}^{4}} . \end{matrix}

Proposition 4.

Let

X \sim G M E (λ, α, β)

and

μ = E [X]

, then the mean deviation about the mean of X is given by

D (μ) = \frac{2 (α + 1 + α β + β)}{λ (α + 1 + α β)} e^{- λ μ} - \frac{2 β}{λ (α + 1 + α β) (1 + α)} e^{- λ μ (1 + α)} .

Proof of Proposition 4.

According to Equation (1) and

D (μ) = E [|X - μ|]

, with

μ = E [X]

, we have

\begin{matrix} D (μ) & = \int_{0}^{μ} (μ - x) f (x) d x + \int_{μ}^{\infty} (x - μ) f (x) d x \\ = μ (\int_{0}^{μ} f (x) d x - \int_{μ}^{\infty} f (x) d x) + (- \int_{0}^{μ} x f (x) d x + \int_{μ}^{\infty} x f (x) d x) \\ = μ (2 F (μ) - 1) + (μ - 2 \int_{0}^{μ} x f (x) d x) \\ = \frac{2 β}{λ (α + 1 + α β) (1 + α)} (1 - e^{- λ μ (1 + α)}) - \frac{2 (α + 1 + α β + β)}{λ (α + 1 + α β)} (1 - e^{- λ μ}) + 2 μ \\ = \frac{2 (α + 1 + α β + β)}{λ (α + 1 + α β)} e^{- λ μ} - \frac{2 β}{λ (α + 1 + α β) (1 + α)} e^{- λ μ (1 + α)} . \end{matrix}

This ends the proof of Proposition 4. □

The entropy of a random variable is a measure of uncertainty, which is an important topic in the fields of communication theory, statistical physics, and probability theory. In the following, we study the entropy measures for

X \sim G M E (λ, α, β)

.

Proposition 5.

Let

X \sim G M E (λ, α, β)

, then the Shannon entropy, S(x), and Renyi entropy, R(x), of X are given by

\begin{matrix} S (x) & = & - log (\frac{λ (α + 1)}{α + 1 + α β}) - (1 + \frac{1}{α}) log (1 + β) + \frac{log (β)}{α} - \frac{log (u)}{α} - log (1 - u), \\ R_{γ} (x) & = & \frac{1}{1 - γ} [γ log (\frac{λ (α + 1)}{α + 1 + α β}) + (γ + \frac{γ}{α}) log (1 + β) - \frac{γ log (β)}{α} - log (λ α) \\ + log (B (\frac{β}{1 + β}; \frac{γ}{α}, γ + 1))], γ > 0 a n d γ \neq 1, \end{matrix}

respectively, where

B (z; a, b) = \int_{0}^{z} u^{a - 1} {(1 - u)}^{b - 1} d u

is the incomplete beta function.

Proof of Proposition 5.

For any

γ > 0

and

γ \neq 1

, the Reni entropy of X is defined as

\begin{matrix} R_{γ} (x) & = & \frac{1}{1 - γ} log [\int_{0}^{\infty} f^{γ} (x; λ, β, α) d x] \\ = & \frac{1}{1 - γ} log [{(\frac{λ (α + 1)}{α + 1 + α β})}^{γ} \int_{0}^{\infty} e^{- λ γ x} (1 + β {(1 - e^{- λ α x})}^{γ}) d x] \\ = & \frac{1}{1 - γ} log [{(\frac{λ (α + 1)}{α + 1 + α β})}^{γ} {(1 + β)}^{γ + \frac{γ}{α}} β^{- \frac{γ}{α}} \\ \times & \int_{0}^{\infty} {(\frac{β}{1 + β} e^{- λ α x})}^{\frac{γ}{α}} {(1 - \frac{β}{1 + β} e^{- λ α x})}^{γ} d x] \\ = & \frac{1}{1 - γ} [γ log (\frac{λ (α + 1)}{α + 1 + α β}) + (γ + \frac{γ}{α}) log (1 + β) - \frac{γ}{α} log (β) - log (λ α) \\ + log (B (\frac{β}{1 + β}; \frac{γ}{α}, γ + 1))] . \end{matrix}

The Shannon entropy

S (x)

is the limiting value of

R_{γ} (x)

as

γ \to 1

and, thus, the results are obtained.

This ends the proof of Proposition 5. □

In the next proposition, we study the probability that one of the two independent GME random variables exceeds the other, which is named as the reliability probability.

Proposition 6.

Suppose two independent random variables X and Y follow

G M E (λ, α, β)

, then the reliability probability is given by

\begin{matrix} P (X > Y) & = & \frac{{(1 + α)}^{2} {(1 + β)}^{2}}{2 {(1 + α + α β)}^{2}} + \frac{λ β^{2}}{2 {(1 + α + α β)}^{2}} \\ - & \frac{β (1 + β) {(1 + α)}^{2}}{(2 + α) {(1 + α + α β)}^{2}} - \frac{β (1 + β) (1 + α)}{(2 + α) {(1 + α + α β)}^{2}} . \end{matrix}

Proof of Proposition 6.

Let

Z = Y - X

and

X = X

, the joint density function of X and Z is obtained as

\begin{matrix} f (x, z; λ, β, α) & = \frac{λ^{2} {(1 + α)}^{2} {(1 + β)}^{2}}{{(1 + α + α β)}^{2}} e^{- λ (2 x + z)} - \frac{λ^{2} β (1 + β) {(1 + α)}^{2}}{{(1 + α + α β)}^{2}} e^{- λ [(1 + α) z + (2 + α) x]} \\ - \frac{λ^{2} β (1 + β) {(1 + α)}^{2}}{{(1 + α + α β)}^{2}} e^{- λ [z + (2 + α) x]} + \frac{λ^{2} β^{2} {(1 + α)}^{2}}{{(1 + α + α β)}^{2}} e^{- λ [(1 + α) z + (2 + 2 α) x]} . \end{matrix}

Therefore, the marginal density function of Z is

\begin{matrix} f (z; λ, β, α) & = \frac{{(α + 1)}^{2} {(1 + β)}^{2}}{{(1 + α + α β)}^{2}} e^{λ z} + \frac{λ β^{2}}{2 {(1 + α + α β)}^{2}} e^{λ (1 + α) z} \\ - \frac{β (1 + β) {(1 + α)}^{2}}{(2 + α) {(1 + α + α β)}^{2}} e^{λ z} - \frac{β (1 + β) (1 + α)}{(2 + α) {(1 + α + α β)}^{2}} e^{λ (1 + α) z} . \end{matrix}

Thus, the result is obtained by

P (X > Y) = P (Z < 0)

.

This ends the proof of Proposition 6. □

Order statistics are fundamental tools in non-parametric statistics and inference. In what follows, we derive an expression for the density function of the

r^{t h}

order statistic in a random sample size

n \geq r

from the GME distribution.

Proposition 7.

Suppose

X_{1}, X_{2}, \dots, X_{n}

is a random sample from

G M E (λ, α, β)

. Let

X_{1 : n} \leq X_{2 : n} \leq \dots \leq X_{n : n}

denote the corresponding order statistics. Then the pdf and cdf of

r^{t h}

order statistic,

X_{r : n}

,

1 \leq r \leq n

, are respectively,

\begin{matrix} f_{r : n} (x) & = & \frac{n!}{(r - 1)! (n - r)!} {[\frac{(1 + α) λ}{α + 1 + α β} (e^{- λ (1 + α) x} - e^{- λ x}) - e^{- λ x} + 1]}^{r - 1} \\ \times & {[\frac{(1 + α) λ}{α + 1 + α β} (e^{- λ x} - e^{- λ (1 + α) x}) + e^{- λ x}]}^{n - r} \frac{(α + 1) λ}{α + 1 + α β} e^{- λ x} [1 + β (1 - e^{- α λ x})], \\ F_{r : n} (x) & = & \sum_{l = r}^{n} \sum_{u = 0}^{n - r} {(- 1)}^{u} (\binom{n}{l}) (\binom{n - r}{u}) {[\frac{(1 + α) λ}{α + 1 + α β} (e^{- λ (1 + α) x} - e^{- λ x}) - e^{- λ x} + 1]}^{l + u} . \end{matrix}

Proof of Proposition 7.

It is well known that the pdf and cdf of

X_{r : n}

,

1 \leq r \leq n

are given by

\begin{matrix} f_{r : n} (x) & = & \frac{n!}{(r - 1)! (n - r)!} {[F (x)]}^{r - 1} {[1 - F (x)]}^{n - r} f (x), \\ F_{r : n} (x) & = & \sum_{l = r}^{n} (\binom{n}{l}) {[F (x)]}^{l} {[1 - F (x)]}^{n - l}, \end{matrix}

respectively. Thus, the results are derived straightly from Equation (1) and Proposition 1.

This ends the proof of Proposition 7. □

Proposition 8.

Let

X \sim G M E (λ, α, β)

, the quantile function of GME distribution,

x_{q}

, wherein

0 < q < 1

, can be obtained by solving the following equation,

β e^{- (α + 1) λ x_{q}} - (α + 1) (β + 1) e^{- λ x_{q}} = (q - 1) (1 + α + α β) .

(2)

We can see from Equation (2) that there is no closed form of the solution in

x_{q}

and, thus, we have to use numerical techniques to obtain the quantile.

The mean residual life (MRL) function plays a very important role in reliability engineering, survival analysis and many other fields. It represents the period from time t till the time of failure, and the MRL also represents the expected additional life length for a unit.

Proposition 9.

Let

X \sim G M E (λ, α, β)

, the MRL function of GME distribution, defined as

μ_{X} (t)

, is given by

μ_{X} (t) = \frac{{(1 + α)}^{2} (1 + β) e^{- λ t} - β e^{- (1 + α) λ t}}{{(1 + α)}^{2} (1 + β) λ e^{- λ t} - (1 + α) λ β e^{- (1 + α) λ t}}, t > 0 .

Proof of Proposition 9.

For

t > 0

, we have

\begin{matrix} μ_{X} (t) = E (X - t | X > t) & = & \frac{\int_{t}^{\infty} S (x; λ, α, β) d x}{S (t; λ, α, β)} \\ = & \frac{\int_{t}^{\infty} [\frac{(α + 1) (β + 1) e^{- λ x}}{α + 1 + α β} - \frac{β e^{- (α + 1) λ x}}{α + 1 + α β}] d x}{S (t; λ, α, β)}, \end{matrix}

where

S (\cdot)

is survival function of GME distribution. We know that

\begin{matrix} \int_{t}^{\infty} [\frac{(α + 1) (β + 1) e^{- λ x}}{α + 1 + α β} - \frac{β e^{- (α + 1) λ x}}{α + 1 + α β}] d x & = & \int_{t}^{\infty} \frac{(α + 1) (β + 1) e^{- λ x}}{α + 1 + α β} d x - \int_{t}^{\infty} \frac{β e^{- (α + 1) λ x}}{α + 1 + α β} d x \\ = & \frac{{(1 + α)}^{2} (1 + β) e^{- λ t} - β e^{- (1 + α) λ t}}{(1 + α + α β) (1 + α) λ} . \end{matrix}

Thus, the result is obtained.

This ends the proof of Proposition 9. □

4. Methods of Estimation

In this section, we consider the methods of maximum likelihood, least squares, and weighted least squares to estimate the unknown parameters,

θ = (λ, α, β)

, of the GME distribution. Suppose

x_{1}, x_{2}, \dots, x_{n}

is a random sample from

G M E (λ, α, β)

.

4.1. Maximum Likelihood Estimator

The method of maximum likelihood is the most frequently used method for parameter estimation. According to the Equation (1), the likelihood function is calculated as

L (λ, α, β | x_{1}, \dots, x_{n}) = {(\frac{(α + 1) λ}{α + 1 + α β})}^{n} e^{- λ \sum_{i = 1}^{n} x_{i}} \prod_{i = 1}^{n} [1 + β (1 - e^{- α λ x_{i}})] .

The log-likelihood function is given by

\begin{matrix} ℓ (λ, α, β | x_{1}, \dots, x_{n}) & = n [log (α + 1) + log (λ) - log (α + 1 + α β)] - λ \sum_{i = 1}^{n} x_{i} \\ + \sum_{i = 1}^{n} log [1 + β (1 - e^{- α λ x_{i}})] . \end{matrix}

(3)

We denote the first partial derivatives of (3) by

ℓ_{λ}

,

ℓ_{α}

and

ℓ_{β}

. Setting

ℓ_{λ} = 0

,

ℓ_{α} = 0

, and

ℓ_{β} = 0

, we have

\begin{matrix} ℓ_{λ} & = & \frac{n}{λ} - \sum_{i = 1}^{n} x_{i} + \sum_{i = 1}^{n} \frac{β α x_{i} e^{- α λ x_{i}}}{1 + β (1 - e^{- α λ x_{i}})} = 0, \\ ℓ_{α} & = & \frac{n}{α + 1} - \frac{n (1 + β)}{α + 1 + α β} + \sum_{i = 1}^{n} \frac{β λ x_{i} e^{- α λ x_{i}}}{1 + β (1 - e^{- α λ x_{i}})} = 0, \\ ℓ_{β} & = & - \frac{n α}{α + 1 + α β} + \sum_{i = 1}^{n} \frac{1 - e^{- α λ x_{i}}}{1 + β (1 - e^{- α λ x_{i}})} = 0 . \end{matrix}

The maximum likelihood estimator (MLE)

\hat{θ}

of the unknown parameters

θ

can be obtained by optimizing the log-likelihood function with respect to the involved parameters. Due to the non-linearity of these equations, the MLEs of parameters can be obtained numerically. These estimators can be easily obtained by using the functions from the statistical software R.

Fisher information is helpful to get the reference priors for the model parameters. In the following, we observe that the Fisher information is given by

\begin{matrix} I (θ) = - E [\begin{matrix} ℓ_{λ λ} & ℓ_{λ α} & ℓ_{λ β} \\ ℓ_{α λ} & ℓ_{α α} & ℓ_{α β} \\ ℓ_{β λ} & ℓ_{β α} & ℓ_{β β} \end{matrix}], \end{matrix}

where

\begin{matrix} ℓ_{λ λ} & = & - \frac{n}{λ^{2}} - \sum_{i = 1}^{n} \frac{β (1 + β) α^{2} x_{i}^{2} e^{- 2 α λ x_{i}}}{{[1 + β (1 - e^{- α λ x_{i}})]}^{2}}, \\ ℓ_{α α} & = & - \frac{n}{{(α + 1)}^{2}} + \frac{n {(1 + β)}^{2}}{{(1 + α + α β)}^{2}} - \sum_{i = 1}^{n} \frac{β (1 + β) λ^{2} x_{i}^{2} e^{- 2 α λ x_{i}}}{{[1 + β (1 - e^{- α λ x_{i}})]}^{2}}, \\ ℓ_{β λ} & = & \sum_{i = 1}^{n} \frac{α x_{i} e^{- α λ x_{i}}}{{[1 + β (1 - e^{- α λ x_{i}})]}^{2}} = ℓ_{λ β}, \\ ℓ_{β α} & = & \sum_{i = 1}^{n} \frac{λ x_{i} e^{- α λ x_{i}}}{{[1 + β (1 - e^{- α λ x_{i}})]}^{2}} = ℓ_{α β}, \\ ℓ_{λ α} & = & \sum_{i = 1}^{n} \frac{β (1 + β) (1 - α λ x_{i}) x_{i} e^{- α λ x_{i}} - β^{2} x_{i} e^{- 2 α λ x_{i}}}{[1 + β {(1 - e^{- α λ x_{i}})]}^{2}} = ℓ_{α λ}, \\ ℓ_{β β} & = & \frac{n α^{2}}{{(1 + α + α β)}^{2}} - \sum_{i = 1}^{n} \frac{{(1 - e^{- α λ x_{i}})}^{2}}{{[1 + β (1 - e^{- α λ x_{i}})]}^{2}} . \end{matrix}

4.2. Least-Square Estimator

Suppose

F (x_{(j)})

denotes the distribution function of the ordered random variables

x_{(1)} < \dots < x_{(n)}

. Denote the following function

h (λ, α, β) = \sum_{i = 1}^{n} {[F (x_{(i)}; λ, α, β) - \frac{i}{n + 1}]}^{2},

(4)

where

F (x; λ, α, β) = \frac{β}{α + 1 + α β} [e^{- λ (1 + α) x} - e^{- λ x}] - e^{- λ x} + 1

, and the least-square estimator (LS) of

θ

can be obtained by minimizing

h (λ, α, β)

. Therefore,

\hat{θ}

can be obtained by solving the following equations,

\begin{matrix} \frac{\partial h (λ, α, β)}{\partial λ} = \sum_{i = 1}^{n} \{2 Q_{i} \{\frac{β}{α + 1 + α β} [- (1 + α) C_{i} + B_{i}] + B_{i}\}\} = 0, \\ \frac{\partial h (λ, α, β)}{\partial α} = \sum_{i = 1}^{n} \{2 Q_{i} \{\frac{- (1 + β)}{{(α + 1 + α β)}^{2}} [e^{- λ (1 + α) X_{(i)}} - e^{- λ X_{(i)}}] - \frac{β λ}{α + 1 + α β} C_{i}\}\} = 0, \\ \frac{\partial h (λ, α, β)}{\partial β} = \sum_{i = 1}^{n} \{2 Q_{i} \{\frac{1 - α}{{(α + 1 + α β)}^{2}} [e^{- λ (1 + α}) X_{(i)} - e^{- λ X_{(i)}}]\}\} = 0, \end{matrix}

where

Q_{i} = \frac{β}{α + 1 + α β} [e^{- λ (1 + α) x_{(i)}} - e^{- λ x_{(i)}}] - e^{- λ x_{(i)}} + 1 - \frac{i}{n + 1}

,

B_{i} = x_{(i)} e^{- λ x_{(i)}}

and

C_{i} = x_{(i)} e^{- λ (1 + α) x_{(i)}}

.

4.3. Weighted Least-Square Estimator

The weighted least-square estimator (WLS) is an extension of LS and proposed by Swain et al. [17], which studied the WLS is obtained by minimizing the function,

W (λ, α, β) = \sum_{i = 1}^{n} \frac{{(n + 1)}^{2} (n + 2)}{i (n - i + 1)} {[F (x_{(i)}; λ, α, β) - \frac{i}{n + 1}]}^{2},

where

F (\cdot)

function has been given in Equation (4). Therefore, the WLS of

θ

can be obtained by

\begin{matrix} \frac{\partial W (λ, α, β)}{\partial λ} & = \sum_{i = 1}^{n} \{\frac{2 {(n - 1)}^{2} (n + 2)}{i (n - i + 1)} Q_{i} \{\frac{β}{α + 1 + α β} [- (1 + α) C_{i} + B_{i}] + B_{i}\}\} = 0, \\ \frac{\partial W (λ, α, β)}{\partial α} & = \sum_{i = 1}^{n} {\frac{2 {(n - 1)}^{2} (n + 2)}{i (n - i + 1)} Q_{i} {\frac{- (1 + β)}{{(α + 1 + α β)}^{2}} [e^{- λ (1 + α) x_{(i)}} - e^{- λ x_{(i)}}] \\ - \frac{β λ}{α + 1 + α β} C_{i}}} = 0, \\ \frac{\partial W (λ, α, β)}{\partial β} & = \sum_{i = 1}^{n} \{\frac{2 {(n - 1)}^{2} (n + 2)}{i (n - i + 1)} Q_{i} \{\frac{1 - α}{{(α + 1 + α β)}^{2}} [e^{- λ (1 + α) x_{(i)}} - e^{- λ x_{(i)}}]\}\} = 0, \end{matrix}

where

Q_{i}, B_{i}

and

C_{i}

,

i = 1, \dots, n

, are defined as above.

5. Simulation Studies

In this section, we assess the performance of the estimation methods proposed in the previous section by conducting several simulations for different sample sizes and values of the parameter.

As indicated in Proposition 1, the

F (x; λ, α, β)

there is used to generate pseudo-random numbers from the GME distribution. This technique is called the inverse transform method, which consists of the following steps:

(i): Generate a random number u from the standard uniform distribution in the interval [0,1].
(ii): Apply the numerical techniques to solve the equation $F (x) = u$ with given $λ, α, β$ .

We take the sample size

n = 50

,

100, 200, 300, 400, 500, 1000

for each simulation, and each sample was replicated

N = 1000

times. The values of parameter

θ = (1, 5, - 0.5), (2, 1, 1.5)

, and

(3, 10, 5)

are considered, respectively. All the results were computed using the R programming. The evaluation of the estimators are performed based on the average bias and the standard error (SE) for each single parameter, where

B i a s ({\hat{θ}}_{j}) = \frac{1}{N} \sum_{i = 1}^{N} ({\hat{θ}}_{j}^{(i)} - θ_{j})

,

S E ({\hat{θ}}_{j}) = \frac{1}{N} \sum_{i = 1}^{N} {({\hat{θ}}_{j}^{(i)} - θ_{j})}^{2}

, and

θ_{j}

is the

j^{t h}

component of

θ

. Moreover, the overall bias and mean squared error (MSE) of

\hat{θ}

are also considered, where the

B i a s (\hat{θ}) = \sum_{j = 1}^{3} B i a s ({\hat{θ}}_{j})

,

M S E (\hat{θ}) = \frac{1}{N} \sum_{i = 1}^{N} | | {\hat{θ}}^{(i)} - θ {| |}^{2}

, and

| | \cdot | |

is the Euclidean norm. The simulation results for different scenarios are given in the Table 1, Table 2 and Table 3.

From Table 1, Table 2 and Table 3, we find that the SE of all three estimator decrease as the sample size n increases and all estimators will tend to more accuracy when n is large. In addition, the estimated values obtained by the three estimator are close to the true values. Furthermore, the plots of bias of the simulated estimators of

λ

,

α

, and

β

, corresponding with different sample size n, are shown in Figure 3, Figure 4 and Figure 5, respectively.

From Figure 3, Figure 4 and Figure 5, we observe that the magnitude of bias of all estimators tends to zero as n grows, which means these estimators are asymptotically unbiased and consistent for the parameters. Thus, these estimator techniques perform well for estimating the parameters in the GME distribution. For further studying, we draw the plots of overall bias and MSE for

\hat{θ}

in Figure 6 and Figure 7.

Figure 6 and Figure 7 show the bias and MSE of

\hat{θ}

, and we can find that as n increases, the bias of

\hat{θ}

towards to zero, and the WLS always has the smallest value of MSE. The LS estimators have the largest MSE among the three considered estimators. Thus, we can conclude that WLS can be chosen as a more reliable estimator for the GME distribution.

6. Real Data Analysis

In this section, we use the weighted least-square estimator to analyze two real data sets for investigating the advantage of proposed GME distribution and compare it with some other distributions, including the exponential distribution distribution, WED distribution, GExtEW distribution, and GWE distribution, where the pdfs are given as follows.

(1): Exponential distribution: $E (λ)$

$f_{E} (x) = λ e^{- λ x}, x \geq 0, λ > 0 .$
(2): Weibull exponential distribution: $W E D (λ, α, β)$

$f_{W E D} (x) = α β (λ e^{- λ x}) [\frac{{(1 - e^{- λ x})}^{β - 1}}{{(e^{- λ x})}^{β + 1}}] exp \{- α {[\frac{1 - e^{- λ x}}{e^{- λ x}}]}^{β}\}, x > 0, α, β, λ > 0 .$
(3): Generalized extended exponential-Weibull distribution: $G E x t E W (λ, α, β, r, c)$

$f_{G E x t E W} (x) = c α (r β x^{r - 1} + λ) {(β x^{r} + λ x)}^{c - 1} e^{- {(β x^{r} + λ x)}^{c}}, x > 0, c, β, λ > 0 . r \in (0, \infty) ∖ {1} .$
(4): Generalized weighted exponential distribution: $G W E (λ, α, k)$

$f_{G W E} (x) = \frac{α}{B (1 / α, k + 1)} λ e^{- λ x} {(1 - e^{- λ α x})}^{k}, x > 0, α, λ > 0, k \in Z^{+} .$

6.1. Data Set 1: Waiting Times

This data set represents the waiting times (in minutes) before the service of 100 bank customers, which has been previously used by Ghitany et al. [18]. It can be seen in Appendix A.1 of Appendix A. Table 4 shows the parameter estimator results of the GME, E, WED, GExtEW, and GWE distributions for these data. The corresponding minus log-likelihood, Akaike information criterion (AIC), and Bayesian information criterion (BIC) are also presented. From Table 4, we find that the GME has the smallest values of all criteria for comparing all other distributions.

Figure 8 shows the fitted models for data set 1. The first subgraph of Figure 8 shows the fitted densities to the data set histogram and some estimated distributions, and the second subgraph displays the empirical distribution function for the data set and the estimated distributions. Both figures reveal that the GME distribution provides a qualified fit for the data set.

6.2. Date Set 2: Survival Times

The second data set represents the survival times of 121 patients with breast cancer obtained from a large hospital in a period from 1929 to 1938. This data set has recently been studied by Lee [19] and Tahir et al. [20]. The data set can be seen in Appendix A.2 of Appendix A. We compare the GME distribution with the E, WED, GExtEW, and GWE distributions. The estimated value of the parameters, AIC and BIC statistics of these distributions are listed in Table 5. It can be seen that GME distribution provides the best fit among these competing models. Figure 9 displays the fitted pdfs and cdfs of the GME, E, WED, GExtEW, and GWE distributions for data set 2, and suggests that the fit of the GME distribution is reasonable.

7. Conclusions

In this paper, we introduce a new lifetime distribution, GME distribution, and propose several statistical properties of it. As it is not feasible to compare these methods theoretically, we have studied several simulations to identify the most efficient estimation method for GME distribution. The simulation results show that weighted least-square estimator (WLS) is the best performing estimator in terms of MSE. That is, the weighted least-square estimation method is more feasible for estimating parameters in the GME distribution. Finally, two real data sets were analyzed to indicate the importance and flexibility of GME distribution in comparison to some existing lifetime distributions. In the future, the development of properties and proper estimation procedure of the bivariate model and multivariate generalization will be of interest, and more work is needed along that direction.

Author Contributions

W.T.: Conceptualization, Methodology, Validation, Investigation, Resources, Supervision, Project Administration, Visualization, Writing review and editing; Y.Y. and T.T.: Software, Formal analysis, Data curation, Writing—original draft preparation, Visualization. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Datasets are provided in the paper.

Acknowledgments

The authors would like to thank the editor and three anonymous referees for their careful reading of this article and for their constructive suggestions, which considerably improved this article.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

WE	weighted exponential distribution
GWE	generalized weighted exponential distribution
WED	Weibull exponential distribution
GExtEW	generalized extented exponential-Weibull distributions
GME	generalized mixture exponential distribution
MLE	maximum likelihood estimator
LS	least-square estimator
WLS	weighted least-square estimator

Appendix A. Data Set

Appendix A.1. Data Set 1

0.8 0.8 3.2 3.3 4.6 4.7 6.2 6.2 7.7 8 9.7 9.8 12.5 12.9 17.3 18.1 27 31.6 1.3 3.5 4.7 6.2 8.2 10.7 13 18.2 33.1 1.5 1.8 1.9 3.6 4 4.1 4.8 4.9 4.9 6.3 6.7 6.9 8.6 8.6 8.6 10.9 11 11 13 13.3 13.6 18.4 18.9 19 38.5 1.9 2.1 2.6 4.2 4.2 4.3 5 5.3 5.5 7.1 7.1 7.1 8.8 8.8 8.9 11.1 11.2 11.2 13.7 13.9 14.1 19.9 20.6 21.3 2.7 2.9 3.1 4.3 4.4 4.4 5.7 5.7 6.1 7.1 7.4 7.6 8.9 9.5 9.6 11.5 11.9 12.4 15.4 15.4 17.3 21.4 21.9 23.

Appendix A.2. Data Set 2

0.3 0.3 4.0 5.0 5.6 6.2 6.3 6.6 6.8 7.4 7.5 8.4 8.4 10.3 11.0 11.8 12.2 12.3 13.5 14.4 14.4 14.8 15.5 15.7 16.2 16.3 16.5 16.8 17.2 17.3 17.5 17.9 19.8 20.4 20.9 21.0 21.0 21.1 23.0 23.4 23.6 24.0 24.0 27.9 28.2 29.1 30.0 31.0 31.0 32.0 35.0 35.0 37.0 37.0 37.0 38.0 38.0 38.0 39.0 39.0 40.0 40.0 40.0 41.0 41.0 41.0 42.0 43.0 43.0 43.0 44.0 45.0 45.0 46.0 46.0 47.0 48.0 49.0 51.0 51.0 51.0 52.0 54.0 55.0 56.0 57.0 58.0 59.0 60.0 60.0 60.0 61.0 62.0 65.0 65.0 67.0 67.0 68.0 69.0 78.0 80.0 83.0 88.0 89.0 90.0 93.0 96.0 103.0 105.0 109.0 109.0 111.0 115.0 117.0 125.0 126.0 127.0 129.0 129.0 139.0 154.0.

References

Gupta, R.D.; Kundu, D. Generalized exponential distribution: Different method of estimations. J. Stat. Comput. Simul. 2001, 69, 315–337. [Google Scholar] [CrossRef]
Gupta, R.D.; Kundu, D. A new class of weighted exponential distributions. Statistics 2009, 43, 621–634. [Google Scholar] [CrossRef]
Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Kharazmi, O.; Mahdavi, A.; Fathizadeh, M. Generalized weighted exponential distribution. Commun. Stat. Simul. Comput. 2015, 44, 1557–1569. [Google Scholar] [CrossRef]
Nadarajah, S.; Haghighi, F. An extension of the exponential distribution. Statistics 2011, 45, 543–558. [Google Scholar] [CrossRef]
Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Commun. Stat. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
Nadarajah, S.; Kotz, S. The beta exponential distribution. Reliab. Eng. Syst. Saf. 2006, 91, 689–697. [Google Scholar] [CrossRef]
Barreto-Souza, W.; Santos, A.H.; Cordeiro, G.M. The beta generalized exponential distribution. J. Stat. Comput. Simul. 2010, 80, 159–172. [Google Scholar] [CrossRef] [Green Version]
Barreto-Souza, W.; Cordeiro, G.M.; Simas, A.B. Some results for beta Frenchet distribution. Commun. Stat. Theory Methods 2011, 40, 798–811. [Google Scholar] [CrossRef]
Ristic, M.M.; Balakrishnan, N. The gamma-exponentiated exponential distribution. J. Stat. Comput. Simul. 2012, 82, 1191–1206. [Google Scholar] [CrossRef]
Oguntunde, P.E.; Balogun, O.S.; Okagbue, H.I.; Bishop, S.A. The Weibull-exponential distribution: Its properties and applications. J. Appl. Sci. 2015, 15, 1305–1311. [Google Scholar] [CrossRef]
Ristic, M.M.; Kundu, D. Marshall-Olkin generalized exponential distribution. Metron 2015, 73, 317–333. [Google Scholar] [CrossRef]
George, R.; Thobias, S. Kumaraswamy Marshall-Olkin Exponential Distribution. Commun. Stat. Theory Methods 2019, 48, 1920–1937. [Google Scholar] [CrossRef]
Shakhatreh, M.K.; Lemonte, A.J.; Cordeiro, G.M. On the generalized extended exponential-Weibull distribution: Properties and different methods of estimation. Int. J. Comput. Math. 2020, 97, 1029–1057. [Google Scholar] [CrossRef]
Kumar, C.S.; Anusree, M.R. On a generalized mixture of standard normal and skew normal distributions. Stat. Probab. Lett. 2011, 81, 1813–1821. [Google Scholar] [CrossRef]
Tian, W.; Wang, C.; Wu, M.; Wang, T. The multivariate extended skew normal distribution and its quadratic forms. In Causal Inference in Econometrics; Springer: Cham, Switzerland, 2016; pp. 153–169. [Google Scholar]
Swain, J.J.; Venkatraman, S.; Wilson, J.R. Least-squares estimation of distribution functions in Johnson’s translation system. J. Stat. Comput. Simul. 1988, 29, 271–297. [Google Scholar] [CrossRef]
Ghitany, M.E.; Atieh, B.; Nadarajah, S. Lindley distribution and its application. Math. Comput. Simul. 2008, 78, 493–506. [Google Scholar] [CrossRef]
Lee, E.T. Statistical Methods for Survival Data Analysis; John Wiley: New York, NY, USA, 1992. [Google Scholar]
Tahir, M.H.; Mansoor, M.; Zubair, M.; Hamedani, G. McDonald log-logistic distribution with an application to breast cancer data. J. Stat. Theory Appl. 2014, 13, 65–82. [Google Scholar] [CrossRef] [Green Version]

Figure 1. The pdf curves for different parameters of

G M E (λ, α, β)

.

Figure 1. The pdf curves for different parameters of

G M E (λ, α, β)

.

Figure 2. The hazard function curves for different values of parameter in

G M E (λ, α, β)

.

Figure 2. The hazard function curves for different values of parameter in

G M E (λ, α, β)

.