Next Article in Journal
Optimality Conditions for Multiobjective Mathematical Programming Problems with Equilibrium Constraints on Hadamard Manifolds
Next Article in Special Issue
Approximation of the Statistical Characteristics of Piecewise Linear Systems with Asymmetric Damping and Stiffness under Stationary Random Excitation
Previous Article in Journal
Extension of Interval-Valued Fermatean Fuzzy TOPSIS for Evaluating and Benchmarking COVID-19 Vaccines
Previous Article in Special Issue
Influence of Binomial Crossover on Approximation Error of Evolutionary Algorithms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A More Flexible Asymmetric Exponential Modification of the Laplace Distribution with Applications for Chemical Concentration and Environment Data

Departamento de Matemáticas, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1270300, Chile
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(19), 3515; https://doi.org/10.3390/math10193515
Submission received: 11 August 2022 / Revised: 20 September 2022 / Accepted: 20 September 2022 / Published: 26 September 2022
(This article belongs to the Special Issue Probability, Stochastic Processes and Optimization)

Abstract

:
In this work, a new family of distributions based on the Laplace distribution is introduced. We define this new family by its stochastic representation as the sum of two independent random variables, one with a Laplace distribution and the other with an exponential distribution. Using a Monte Carlo simulation study, the statistical performance of the estimators obtained by the moments and maximum likelihood methods were empirically evaluated. We studied the coverage probabilities and mean length of the confidence intervals of the corresponding parameters based on the asymptotic normality of these estimators. This simulation study reported a good statistical performance of these estimators. Fits were made to three real data sets with the new distribution, two related to chemical concentrations and one to the environment, comparing it with three similar distributions given in the literature. We have used information criteria for the selection of models. These results showed that the exponentially modified Laplace model can be an alternative distribution to model skewed data with high kurtosis. The new approach is a contribution to the tools of statisticians and various professionals interested in modeling data with high kurtosis.

1. Introduction

There are several investigations that use the Laplace distribution to model data from certain fields based on an empirical fit using goodness-of-fit techniques. For example, in environmental problems, the Laplace distribution is used to analyze (or model) random variables that determine maximum pollution values and describe times of high pollution. In mining, the Laplace distribution is used to analyze the mineral content in soil samples [1,2].
However, not all data related to these types of problems have a symmetric behavior. For this reason, other distributions have been proposed that are capable of better modeling this type of data. In this sense, Agu and Onwukwe [3] presented the modified Laplace distribution ( M L ), Grushka [4] presented the exponentially modified Gaussian distribution ( E M G ) and Reyes et al. [5] presented the exponentially modified logistic distribution ( E M L O G ). One of the advantages of these new probability distributions obtained through mixtures is that the obtained distributions generally have longer tails than the base distribution, thus giving rise to better fits for empirical frequency distributions, [4,5].
Our research is based on the theory of probability distributions and based on the process of mixtures of probability distributions, it proposes a new parametric probability distribution using the Laplace distribution. The new distribution depends on three parameters and is obtained by adding two independent random variables: one with a Laplace distribution and the other with an exponential distribution. This distribution can be used as an alternative to some existing distributions. The density function of the new distribution is obtained using the stochastic representation Y = σ ( X + V ) + μ where X and V are independent random variables, such that X is standard Laplace distribution and V is exponentially distributed with parameter λ , where μ is the location parameter, σ is the scale parameter, and λ is the skewness parameter. This document is organized as follows: Section 2, in order to make this work self-contained, presents the probability distributions of the Laplace, exponential, modified Laplace, exponentially modified Gaussian, and exponentially modified logistic distributions with some characteristics of these that will be useful later. In Section 3, the exponentially modified Laplace probability distribution is constructed, obtaining the density and the main characteristics of the distribution. In Section 4, the methods of moments and maximum likelihood are presented to estimate the parameters of the distribution. A simulation study for the theoretical validation of the model is also presented. Section 5 shows a comparative analysis and a discussion of the results obtained by fitting the different data sets with the modified Laplace ( M L ), exponentially modified Gaussian ( E M G ), and exponentially modified logistic distributions ( E M L O G ) and the proposed exponentially modified Laplace distribution ( E M L ). Finally, in Section 6, conclusions are drawn from the work.

2. Preliminaries

The classical Laplace distribution (also known as Laplace’s first law) is a probability distribution, given by the density function
f ( x ; θ , s ) = 1 2 s e | x θ | s , x R
where < θ < and s > 0 are the location and scale parameters, respectively (Johnson et al. [6]), and we will denote it as X L ( θ , s ) . When the location parameter is equal to zero and the scale parameter is equal to one, then the standard Laplace distribution function is obtained, denoted by L ( 0 , 1 ) . The nth moment for a random variable X L ( 0 , 1 ) , is given by:
E ( X n ) = 1 2 n ! { 1 + ( 1 ) n } n = 1 , 2 ,
The continuous random variable, say X, is said to have an exponential distribution if it has the following probability density function:
f ( x ; λ ) = λ e λ x s i x > 0 0 s i x 0
where λ is called the rate of the distribution and will be represented as X e x p ( λ ) . The nth moment for a random variable X e x p ( λ ) is given by the following expression:
E ( X n ) = n ! λ n , n = 1 , 2 ,
Agu and Onwukwe [3] presented the modified Laplace distribution whose density function is given by
f X ( x ) = λ 2 σ 1 2 e x μ σ λ 1 e x μ σ , x μ λ 2 σ 1 1 2 e x μ σ λ 1 e x μ σ , x > μ
x R , which is denoted by X M L ( μ , σ , λ ) .
The p d f of a random variable with an exponentially modified Gaussian distribution E M G (Grushka [4]) is given by:
f Y ( y ; μ , σ , λ ) = λ 2 e λ 2 ( 2 y 2 μ λ σ 2 ) e r f c 2 μ + λ σ 2 y 2 σ 2 , x R
and is denoted as Y E M G ( μ , σ , λ ) , where e r f c ( z ) = 2 π z e t 2 d t .
A random variable X has a logistic distribution with location parameter α R and scale parameter β > 0 if its density function is:
f X ( x ; α , β ) = e ( x α ) / β β 1 + e ( x α ) / β 2 , x R
which is denoted as X L O G ( α , β ) . When the location parameter is 0 and the scale parameter is 1, then the standard logistic distribution function is obtained.
Reyes et al. [5], using the methodology given by [4], introduces the exponentially modified logistic distribution by the following stochastic representation:
Y = Z + T ,
where Z L O G ( α , β ) and T e x p ( 1 / β ) are random independent variables and are denoted by Y E M L O G ( α , β ) , transforming this into a more flexible distribution in terms of working with data that have high kurtosis. Its function is given by:
f Y y α , β = 1 β 2 e y α β 0 e 2 w β 1 + e y w α β 2 d w , < y <
and we denote as Y E M L O G ( α , β ) .

3. Exponentially Modified Laplace Distribution

In this section, the exponentially modified Laplace distribution ( E M L ) is presented using the Grushka methodology [4], considering the location and scale parameters. This distribution is obtained by substituting the normal distribution for the standard Laplace distribution in the stochastic representation. The flexibility of this new distribution allows better capture of outliers. We will start by deriving its density function.

3.1. Density Function

The exponentially modified Laplace distribution admits the following stochastic representation as
Y = σ ( X + V ) + μ ,
where X and V are independent random variables such that X L ( 0 , 1 ) and V exp ( λ ) , where μ is the location parameter, σ is the scale parameter, and λ is the skewness parameter, so we say that Y follows an exponentially modified Laplace distribution and is denoted by Y E M L ( μ , σ , λ ) .
Proposition 1.
Let Y be a random variable such that Y E M L ( μ , σ , λ ) . Then, its probability density function (pdf) is given by
f Y ( y ; μ , σ , λ ) = λ 2 σ ( λ 1 ) e y μ σ 2 λ + 1 e λ y μ σ , y > μ , λ 1 2 y μ σ + 1 4 σ e y μ σ , y > μ , λ = 1 λ 2 σ ( λ + 1 ) e y μ σ , y < μ
Proof. 
Using the stochastic representation in (3), we have
X L ( 0 , 1 ) f X ( x ) = 1 2 e | x | , < x < , V e x p ( λ ) f V ( v ) = λ e λ v , v > 0
and the Jacobian transformation approach, it follows that:
Y = σ ( X + V ) + μ W = V X = Y μ σ W V = W J = x y x w v y v w = 1 σ 1 0 1 = 1 σ .
Then,
f Y , W ( y , w ) = | J | f X , V y μ σ w , w f Y , W ( y , w ) = 1 σ f X y μ σ w f V ( w ) f Y ( y ) = 0 1 σ f X y μ σ w f V ( w ) d w f Y ( y ) = λ 2 σ 0 e λ w e y μ σ w d w , < y < ,
solving the integral, for λ 1 and λ = 1 , the result (4) is obtained.    □
Proposition 2.
If Y E M L ( μ , σ , λ ) and λ , then Y L ( μ , σ ) .
Proof. 
If λ in the density function given in (4), the result is obtained.    □
Figure 1 graphically illustrates the behavior of the density function of the exponentially modified Laplace distribution and the standard Laplace for different values of λ (upper), it is observed that as the parameter λ decreases, the tails become heavier. On the other hand, on the lower portion of the figure, the densities of the standard Laplace, modified Laplace, and exponentially modified Laplace distributions are plotted, in which greater flexibility is observed in the E M L model.
Proposition 3.
Let Y be a random variable such that Y E M L ( μ , σ , λ ) , then its cdf is given by
F Y ( t ; μ , σ , λ ) = λ 2 ( λ + 1 ) + λ 2 ( λ 1 ) 1 e t μ σ 2 λ λ + 1 1 e λ t μ σ , t > μ , λ 1 1 4 4 3 e t μ σ 2 t μ σ e t μ σ , t > μ , λ = 1 λ 2 ( λ + 1 ) e t μ σ , t < μ .
Proof. 
Using the definition of cdf, we have
F Y ( t ; μ , σ , λ ) = t λ 2 σ 0 e λ w e y μ σ w d w d y , < t < ,
solving the integral for, λ 1 and λ = 1 , the result (5) is obtained.    □
Corollary 1.
Let Y be a random variable such that Y E M L ( μ , σ , λ ) . Then, the reliability function defined as R ( y ) = P ( Y > y ) = 1 F Y ( y ) , y > 0 is given by
R ( y ) = 1 λ 2 ( λ + 1 ) λ 2 ( λ 1 ) 1 e t μ σ 2 λ λ + 1 1 e λ t μ σ , t > μ , λ 1 1 1 4 4 3 e t μ σ 2 t μ σ e t μ σ , t > μ , λ = 1 1 λ 2 ( λ + 1 ) e t μ σ , t < μ .
Proof. 
Using the reliability function definition R ( y ) and (5), the result is directly obtained.    □
Through Figure 2, we graphically illustrate the behavior of the cumulative distribution function (cdf) for the exponentially modified Laplace distribution. Compared to the standard Laplace distribution, it reflects a slower growth, implying a greater capture of outlier data.

3.2. Reliability Function Comparison of ML, EMLOG, EMG, and EML Distributions

The reliability function of a random variable Y indicates the probability that a variable exceeds the value of y. In this section, using Table 1, for a fixed value of λ = 0.7 , we make a brief comparison where it is observed that the tails of the E M L distribution are heavier than those of the M L , E M L O G , and E M G distributions.
Likewise, observing the graphical illustration represented in Figure 3, it can be seen that the tails of the E M L distribution are heavier than those of the M L , E M L O G , and E M G distributions.

3.3. Moments

The following proposition presents us with a formula that, with the use of numerical techniques, allows us to calculate the rth moment of an exponentially modified Laplace distribution.
Proposition 4.
If Y E M L ( μ , σ , λ ) , the rth moment of Y is given by:
μ r = E [ Y r ] = j = 0 r r j σ j μ r j k = 0 j j k k ! { 1 + ( 1 ) k ) } ( j k ) ! 2 λ j k
Proof. 
Using the stochastic representation given in (3), applying the binomial theorem and the moments of the standard Laplace and exponential distributions given in (1) and (2), respectively, the result is obtained.    □
Corollary 2.
Let Y E M L ( μ , σ , λ ) , then
μ 1 = σ λ + μ μ 2 = 2 σ 2 1 + 1 λ 2 + 2 σ μ λ + μ 2 μ 3 = 6 σ 3 λ 1 + 1 λ 2 + 6 σ 2 μ 1 + 1 λ 2 + 3 σ μ 2 λ + μ 3 μ 4 = 24 σ 4 1 + 1 λ 2 + 1 λ 4 + 24 σ 3 μ λ 1 + 1 λ 2 + 12 σ 2 μ 2 λ 1 + 1 λ 2 + 4 σ μ 3 λ + μ 4
Proof. 
Using Proposition 4 with r = 1 , 2 , 3 , 4 we obtain the results.    □
Corollary 3.
Let Y E M L ( μ , σ , λ ) . Then, the mean and variance are given, respectively, by
E ( Y ) = μ + σ λ V a r ( Y ) = σ 2 2 + 1 λ 2
Proof. 
Using μ 1 and μ 2 obtained in Corollary 2, and substituting in V ( Y ) = μ 2 ( μ 1 ) 2 , we obtain the results.    □
Corollary 4.
Let Y E M L ( μ , σ , λ ) , then the asymmetry and kurtosis coefficient of Y is given by
β 1 = 2 2 λ 2 + 1 3 2 β 2 = 24 λ 4 + 12 λ 2 + 9 2 λ 2 + 1 2
Proof. 
Using the standardized skewness and kurtosis coefficients of Y, the result is reached.    □
Figure 4 shows that the kurtosis coefficient for the distribution ( E M L ) takes values in the interval [ 5 , 9 ] , decreasing for values of λ between [ 0 , 1 ] and increasing for values greater than one.

4. Estimation

4.1. Moment Estimators

The following proposition shows analytic expressions for the moment estimators of μ , σ , and λ for the exponentially modified Laplace distribution ( E M L ).
Proposition 5.
Let y 1 , y 2 , , y n be a random sample from the distribution of random variable Y E M L ( μ , σ , λ ) , so that the moment estimators for θ = ( μ , σ , λ ) are obtained by solving the following numerical equation for μ:
μ 3 8 y ¯ μ 2 + 15 y ¯ 2 μ 6 y ¯ 3 3 y ¯ s 2 + y 3 ¯ = 0 ,
later, the moment estimator for σ is obtained by substituting the moment estimator for μ ( μ ^ M ), in the following equation:
σ ^ M = y 2 ¯ 2 y ¯ ( y ¯ μ ^ M ) μ ^ M 2 2
and finally, the estimator of moments for λ is obtained:
λ ^ M = σ ^ M y ¯ μ ^ M
where y ¯ , y 2 ¯ , y 3 ¯ , and s 2 are the sample moments, and sample variance, respectively.
Proof. 
Equating the first three population moments to the sample moments, we obtain:
y ¯ = σ λ + μ y 2 ¯ = 2 s 2 2 σ 2 + σ λ + y ¯ μ y 3 ¯ = 6 s 2 σ λ + μ + σ λ + y ¯ μ 2 ,
solving the system, we arrive at the result.    □

4.2. Likelihood Function

Consider a random sample of size n, y 1 , , y n , from the distribution E M L ( μ , σ , λ ) . So, the log-likelihood function for θ = ( μ , σ , λ ) T can be expressed as
( θ ) = n log λ n log 2 n log σ + n i = 1 log G ( y i , θ ) ,
where G ( y i , θ ) = 0 e λ w e y i μ σ w d w .
Maximum likelihood estimators (MLEs) were acquired maximizing the likelihood function given in (7). Since there is no analytical solution, we used the iterative numerical method “BFGS”, created by Byrd et al. [7]. The “BFGS” method is a limited-memory quasi-Newton method for approximating the Hessian matrix of the target distribution. This method allows us to numerically obtain the maximum likelihood estimates of the parameters of a distribution and their respective standard errors derived from the Hessian matrix.

4.3. Simulation Study

We used the Monte Carlo method to generate random numbers from the distribution E M L ( μ , σ , λ ) . The results obtained are a sequence of n random numbers that are stored inside an array that we call n vector. For this, we used 1000 samples of size 50, 100, 200 and 500, obtaining the estimates of the parameters by means of the moment and maximum likelihood methods. In addition, we analyze the standard deviation, average length of the confidence intervals, and the empirical coverage, for the parameters of the distribution, based on a 95% confidence level.
To develop the algorithm (Algorithm 1) we will use the following notation:
  • n: The length of the n vector.
  • Y: A random variable with the distribution E M L .
  • f Y ( y ) : The PDF of E M L .
  • L1: Number of samples of size n.
  • μ , σ , λ : Parameters.
Algorithm 1: Monte Carlo algorithm to generate random numbers from the E M L ( μ , σ , λ ) distribution
  • Start
    Input: f Y ( y ) , L 1 , n , μ , σ , λ .
    Output: n vector.
  • Generate a random variable X L ( 0 , 1 ) .
  • Generate a random variable V e x p ( λ ) .
  • Compute Y = X + V .
  • Since Y E M L ( μ , σ , λ ) , append y to n-vector.
  • Repeat steps 2–5 for each sample of size n obtained.
  • For each estimate, the 95% confidence interval is obtained and the length calculated. Additionally, the number of intervals containing the value of each parameter is counted. By obtaining the average of these 1000 values, the value a l i and the empirical coverage c are obtained.
  • end.
Table 2 contains the values of the estimates of the parameters, standard deviation, average interval length, and empirical coverage, based on a 95% confidence interval from simulations obtained by the method of moments for 1000 generated samples of size n = 50, 100, 200, and 500 from the population with distribution E M L ( μ , σ , λ ) . These estimates were obtained by solving the system of equations given in Proposition 5. Similarly, Table 3 shows the results of the simulation studies, illustrating the behavior of the MLEs. For each sample generated, MLEs are calculated numerically using the Newton–Raphson [8] procedure. In both tables, it can be seen that the simulations carried out by these methods show that the average estimates of the parameters are close to the proposed values. Additionally, the standard deviation and the average length of the interval decrease as the sample size increases. This is an expected result, since the ME and MLE are asymptotically consistent. On the other hand, the empirical coverage is adequate since it is close to 95%.

5. Three Illustrative Examples with a Real Data Set

In this section, three applications are presented in which the parameter estimators are obtained based on the maximum likelihood method (MLE) for ( μ , σ and λ ) through of the fitted models M L , E M L O G , E M G , and E M L to a set of real data. The numerical illustrations below are intended to show that the E M L model is an alternative to unimodal data modeling in different areas.

5.1. Illustrative Example 1

In our first illustration, the data set corresponds to the nickel content in soil samples analyzed at the Department of Mining (Department of Mines) of the University of Atacama, Chile, (see Appendix A, Table A1). Table 4 presents summary statistics for the data set of nickel content in soil samples, where γ 1 and γ 2 are the skewness and kurtosis coefficients of the sample, respectively. The moment estimators for these data are given by: θ ^ M = ( μ ^ M , σ ^ M , λ ^ M ) = ( 6.7497 , 5.0626 , 0.3412 ) .
Table 5 shows the maximum likelihood estimates and the standard deviations for the M L , E M L O G , E M G , and E M L models. In addition, we report the values of the Akaike [9] (AIC), Bayesian information criteria [10] (BIC), Akaike information criterion consistent [11] (CAIC), and Hannan–-Quinn information criterion [12] (HQIC). On the other hand, Figure 5 shows the histogram with estimated pdf. This indicates that the E M L model fits the data better than the M L , E M L O G , and E M G models. This result is supported by Figure 6 based on theoretical versus empirical (QQ) quantile plots.
Figure 5 presents the histogram of the data with adjustment of the modified Laplace, exponentially modified Laplace, exponentially modified Gaussian, and exponentially modified logistic (upper) distributions, fitted with the values of the maximum likelihood estimators of their parameters. Notice that the fitted exponentially modified Laplace distribution has heavier tails and a magnification of the upper tails of the soil nickel concentration data (lower).
On the other hand, Figure 6 shows the QQ plot of the fitted models. From these results, it can be seen that the exponentially modified Laplace distribution provided a better fit than the other distributions in consideration.

5.2. Illustrative Example 2

The second illustration is related to the neodymium content in soil samples analyzed at the Department of Mining (Department of Mines) of the University of Atacama, Chile (see Appendix A, Table A2). Table 6 presents summary statistics for the data set of the neodymium content in soil samples, where γ 1 and γ 2 are the skewness and kurtosis coefficients of the sample, respectively. The moment estimators for these data are given by: θ ^ M = ( μ ^ M , σ ^ M , λ ^ M ) = ( 4.2094 , 10.3030 , 0.5868 ) .
The modified Laplace, exponentially modified logistic, exponentially modified Gaussian, and exponentially modified Laplace distributions were fitted to the data set. Table 7 shows the maximum likelihood estimates of the parameters, with the corresponding standard deviations ( s d ) in parentheses, for the three mentioned distributions. The adjustment criteria (AIC, BIC, CAIC, and HQIC) indicate that the data fit better to the exponentially modified Laplace model, because they present a smaller or lower value.
Figure 7 shows the histogram plots and a magnification of the upper tails of the soil neodymium concentration data with the modified Laplace, exponentially modified logistic, exponentially modified Gaussian, exponentially modified Laplace, and distributions fitted with the maximum likelihood estimators of its parameters where the fit of outliers is best observed. In addition, Figure 8 shows the QQ plot of the fitted models, observing that the proposed model achieves a better capture of extreme values.

5.3. Illustrative Example 3

In this application, we used daily nitrogen concentration data obtained by chromatography [13]. Data are given in the Appendix A (Tabla Table A3). Table 8 presents summary statistics for the nitrogen concentration data set, where γ 1 and γ 2 are the sample skewness and kurtosis coefficients, respectively. Moment estimators for these data are given by: θ ^ M = ( μ ^ M , σ ^ M , λ ^ M ) = ( 0.0965 , 1.0965 , 2.0965 ) . Table 9 shows the maximum likelihood estimates for the parameters with their corresponding standard deviations ( s d ) in parentheses for the modified Laplace, exponentially modified logistic, exponentially modified Gaussian and exponentially modified Laplace distributions. The fit criteria used, AIC, BIC, CAIC and HQIC, indicate that the exponentially modified Laplace model fits the data better.
Figure 9 shows the histogram plots and a magnification of the lower tails of the nitrogen concentration data with the modified Laplace, exponentially modified logistic, exponentially modified Gaussian, and exponentially modified Laplace distributions fitted with the maximum likelihood estimators of its parameters where the fit of outliers is best observed. In addition, Figure 10 shows the QQ plot of the fitted models, observing that the proposed model achieves a better capture of extreme values.

6. Conclusions

In this paper, a new and more flexible distribution, called the exponentially modified Laplace distribution, has been proposed. We estimate the parameters of the model by the moment and maximum likelihood methods. Likewise, we apply information criteria to select the models and evaluate the goodness of fit of the new distribution compared to other similar distributions in the current literature. We performed a Monte Carlo simulation study to empirically assess the statistical performance of the estimates obtained. In addition, we study the standard deviations, the mean length of the confidence intervals, and the empirical coverage based on 95% confidence intervals. This simulation study reported a good statistical performance of these estimates. Three illustrations were made using data related to the chemical and environmental concentrations, comparing them with three similar distributions presented in the literature. The analyses reported a good performance of the new distribution compared to similar distributions, providing evidence that the proposed model is a good alternative for modeling skewed and high-kurtosis data. These results reported that the exponentially modified Laplace model can be an alternative to analyze this type of data. The new approach is a contribution to the tools of statisticians and various professionals interested in data modeling. From these applications, we have obtained useful information that can be used by professionals and users of statistics. A limitation of the proposed distribution is the loss of goodness of fit for data sets whose sample kurtosis is less than five. Some topics for future research based on this new distribution are related to the study of multivariate procedures, quantile regression, spatial methods, temporal methods, partial least squares, principal components, etc.

Author Contributions

Data curation, J.R.; formal analysis, J.R., M.A.R., P.L.C. and J.A.; investigation, J.R., M.A.R., P.L.C.; methodology, J.R., M.A.R., P.L.C. and J.A.; writing—original draft, J.R., M.A.R., P.L.C. and J.A.; writing—review and editing, M.A.R., P.L.C. and J.A.; funding acquisition, J.R., M.A.R. and J.A. All authors have read and agreed to the published version of the manuscript.

Funding

Research of J.R., M.A.R. and J.A. was supported by the Universidad de Antofagasta through project SEMILLERO UA 2022.

Data Availability Statement

The analyzed data are available in the Appendix A of the article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Nickel Data.
Table A1. Nickel Data.
23334466777881010111111
121213131414141414141415151515151616
161616161717171717191919191920202020
202021212121222323252528292930313232
334042424345464652545575109
Table A2. Neodymium Data.
Table A2. Neodymium Data.
47262922331671343127133684215529
25293618165018281629103171532333531
72893743293514252182649471914333521
253015272792633132043338252235313924
5010328219134682537212636327919
Table A3. Nitrogen Data.
Table A3. Nitrogen Data.
0.6070.6050.6060.6060.6090.6310.6170.626
0.6100.6110.6100.6060.6100.6120.6140.613
0.6140.6140.6150.6160.6160.6160.6160.615
0.6160.6160.6160.6180.6170.6170.6170.617
0.6170.6170.6170.6190.6190.6180.6180.622
0.6190.6200.6200.6190.6170.6160.6140.617
0.6110.6110.6120.6110.6120.6120.6120.613
0.6100.6120.6130.6140.6130.6120.6100.609
0.6130.6120.6160.6120.6110.6110.6130.609
0.6120.6120.6120.6050.6040.6150.6200.622
0.6170.6190.6210.6220.6300.6260.6160.617
0.6210.6230.6250.6260.6240.6180.6180.618
0.6210.6230.6250.6260.6240.6180.6180.618
0.6220.6230.6230.6080.6240.6200.6190.615
0.6110.6150.6120.6200.6230.6270.6280.625
0.6270.6280.6260.6270.6260.6250.6250.625
0.6240.6260.6270.6260.6280.6260.6190.618
0.6270.6260.6260.6270.6260.6260.6280.629
0.6270.6270.6270.6270.6250.6250.6290.623
0.6190.5730.5650.5850.5950.6080.6140.614
0.6120.6150.6160.6170.6150.6150.6150.614
0.6100.6100.6110.6110.6110.6120.6100.609
0.6110.6140.6170.6170.6200.6220.6190.618
0.6190.6220.6180.6190.6200.6190.6200.621
0.6170.6200.6210.6230.6260.6270.6260.626
0.6270.6260.6280.6260.6240.6240.6210.620
0.6210.6190.6210.6260.6270.6240.6220.622
0.6220.6220.6220.6250.6220.6210.6180.616
0.6210.6190.6230.6260.6250.6240.6190.620
0.6300.6290.6300.6310.6320.6240.6250.628
0.6230.6280.6260.6290.6280.6300.6180.607
0.6310.6300.6290.6300.6290.6310.6320.633
0.6250.6190.6190.6530.6240.6220.6450.619
0.6200.6190.6180.6200.6200.6210.6180.614
0.6170.6160.6160.6160.6150.6160.6170.616
0.6150.6170.6160.6140.6160.6170.6160.617
0.6180.6180.6190.6220.6220.6230.622

References

  1. Oosterbaan, R.J. Chapter 6, Frequency and Regression Analysis. In Drainage Principles and Applications, Publication 16; En Ritzema, H.P., Ed.; International Institute for Land Reclamation and Improvement (ILRI): Wageningen, The Netherlands, 1994; pp. 175–224. ISBN 90-70754-33-9. [Google Scholar]
  2. Burke, E.J.; Perry, R.H.; Brown, S.J. An extreme value analysis of UK drought and projections of change in the future. J. Hydrol. 2010, 388, 131. [Google Scholar] [CrossRef]
  3. Agu, F.I.; Onwukwe, E. Modified Laplace Distribution, Its Statistical Properties and Applications Asian. J. Probab. Stat. 2019, 4, 49277. [Google Scholar]
  4. Grushka, E. Characterization of Exponentially Modified Gaussian Peaks in Chromatography. Anal. Chem. 1972, 44, 1733–1738. [Google Scholar] [CrossRef] [PubMed]
  5. Reyes, J.; Venega, O.; Gómez, H.W. Exponentially Modified Logistic Distribution with Application to Mining and Nutrition. Appl. Math. Inf. Sci. 2018, 12, 1–8. [Google Scholar] [CrossRef]
  6. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; Wiley: New York, NY, USA, 1995. [Google Scholar]
  7. Byrd, R.H.; Lu, P.; Nocedal, J.; Zhu, C. A Limited Memory Algorithm for Bound Constrained Optimization. SIAM J. Sci. Comput. 1995, 16, 1190–1208. [Google Scholar] [CrossRef]
  8. Tjalling, J.Y. Historical development of the Newton-Raphson method. SIAM Rev. 1995, 37, 531–551. [Google Scholar]
  9. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  10. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  11. Bozdogan, H. Model selection and Akaike’s information criterion (AIC): The general theory and its analytical extensions. Psychometrika 1987, 52, 345–370. [Google Scholar] [CrossRef]
  12. Hannan, E.J.; Quinn, B.G. The Determination of the order of an autoregression. J. R. Stat. Soc. Ser. B 1979, 41, 190–195. [Google Scholar] [CrossRef]
  13. Torres de Young, S. Introduccion a la Cromatografia; Universidad Nacional de Colombia: Palmira, Colombia, 1994; 134p. [Google Scholar]
Figure 1. Graphical comparison of EML distributions with L for different values of λ (upper) and with ML and L (lower).
Figure 1. Graphical comparison of EML distributions with L for different values of λ (upper) and with ML and L (lower).
Mathematics 10 03515 g001
Figure 2. Comparison of the cdf of the E M L distribution (solid line) for λ = 2 (upper) and λ = 1 (lower) with the cdf of the distribution L (dashed line).
Figure 2. Comparison of the cdf of the E M L distribution (solid line) for λ = 2 (upper) and λ = 1 (lower) with the cdf of the distribution L (dashed line).
Mathematics 10 03515 g002
Figure 3. Comparison of the reliability function of the E M L distribution (solid line) for λ = 0.7 with the reliability function of the M L , E M L O G , and E M G distributions (dashed line, dotted line, dash-dotted line).
Figure 3. Comparison of the reliability function of the E M L distribution (solid line) for λ = 0.7 with the reliability function of the M L , E M L O G , and E M G distributions (dashed line, dotted line, dash-dotted line).
Mathematics 10 03515 g003
Figure 4. Graphical comparison of the kurtosis coefficient between the exponentially modified Laplace distribution (solid line), the exponentially modified Gaussian distribution (dashed line), and the exponentially modified logistic distribution (dotted line).
Figure 4. Graphical comparison of the kurtosis coefficient between the exponentially modified Laplace distribution (solid line), the exponentially modified Gaussian distribution (dashed line), and the exponentially modified logistic distribution (dotted line).
Mathematics 10 03515 g004
Figure 5. Histogram (upper) and tail (lower) for nickel concentration data set. Overlaid on top is the density E M L with parameters estimated via MLE (solid line), exponentially modified Gaussian density with parameters estimated via MLE (dotted line), exponentially modified logistic (dashed line), and modified Laplace (dash-dotted line).
Figure 5. Histogram (upper) and tail (lower) for nickel concentration data set. Overlaid on top is the density E M L with parameters estimated via MLE (solid line), exponentially modified Gaussian density with parameters estimated via MLE (dotted line), exponentially modified logistic (dashed line), and modified Laplace (dash-dotted line).
Mathematics 10 03515 g005
Figure 6. QQ plot for nickel concentration data set. The modified Laplace density (a), exponentially modified logistic density (b), exponentially modified Gaussian density (c), and exponentially modified Laplace density (d).
Figure 6. QQ plot for nickel concentration data set. The modified Laplace density (a), exponentially modified logistic density (b), exponentially modified Gaussian density (c), and exponentially modified Laplace density (d).
Mathematics 10 03515 g006
Figure 7. Histogram (upper) and tail (lower) for the neodymium concentration data set. The first graph shows the densities of the exponentially modified Laplace (solid line), Gaussian modified exponentially (dashed line), exponentially modified logistic (dotted line), and modified Laplace (dash-dotted line) distributions, with their parameters estimated by MLE.
Figure 7. Histogram (upper) and tail (lower) for the neodymium concentration data set. The first graph shows the densities of the exponentially modified Laplace (solid line), Gaussian modified exponentially (dashed line), exponentially modified logistic (dotted line), and modified Laplace (dash-dotted line) distributions, with their parameters estimated by MLE.
Mathematics 10 03515 g007
Figure 8. QQ plot for the neodymium concentration data set. The modified Laplace density (a), exponentially modified logistic density (b), exponentially modified Gaussian density (c), and exponentially modified Laplace density (d).
Figure 8. QQ plot for the neodymium concentration data set. The modified Laplace density (a), exponentially modified logistic density (b), exponentially modified Gaussian density (c), and exponentially modified Laplace density (d).
Mathematics 10 03515 g008
Figure 9. Histogram (upper) and tail (lower) for nitrogen concentration data set. The first graph shows the densities of exponentially modified Laplace (solid line), Gaussian modified exponentially (dashed line), exponentially modified logistic (dotted line) and modified Laplace (dash-dotted line) distributions, with their parameters estimated by MLE.
Figure 9. Histogram (upper) and tail (lower) for nitrogen concentration data set. The first graph shows the densities of exponentially modified Laplace (solid line), Gaussian modified exponentially (dashed line), exponentially modified logistic (dotted line) and modified Laplace (dash-dotted line) distributions, with their parameters estimated by MLE.
Mathematics 10 03515 g009
Figure 10. QQ plot for Nitrogen concentration data set. The density M L (a), E M L O G (b), E M G density (c), and E M L (d).
Figure 10. QQ plot for Nitrogen concentration data set. The density M L (a), E M L O G (b), E M G density (c), and E M L (d).
Mathematics 10 03515 g010
Table 1. Reliability function comparison for distributions M L , E M L O G , E M G , and E M L .
Table 1. Reliability function comparison for distributions M L , E M L O G , E M G , and E M L .
Distribution P ( Y > 2 ) P ( Y > 2.5 ) P ( Y > 3 ) P ( Y > 3.5 ) P ( Y > 4 ) P ( Y > 4.5 ) P ( Y > 5 )
M L 0.24440.15430.09580.05300.03610.02200.0134
E M L O G 0.28780.21160.15170.10650.07350.05010.0377
E M G 0.30730.22020.15610.11020.07750.05470.0385
E M L 0.32560.24490.18200.13390.09780.07110.0515
Table 2. ME simulation of 1000 iterations of the model E M L ( μ , σ , λ ) .
Table 2. ME simulation of 1000 iterations of the model E M L ( μ , σ , λ ) .
n μ σ λ μ ˜ sd ( μ ˜ ) ali ( μ ˜ ) c ( μ ˜ ) σ ˜ sd ( σ ˜ ) ali ( σ ˜ ) c ( σ ˜ ) λ ˜ sd ( λ ˜ ) ali ( λ ˜ ) c ( λ ˜ )
50010.30.03410.11890.466193.91.03410.11890.466193.90.33410.11890.466193.9
100010.30.01230.06500.254895.21.01230.06500.254895.20.31230.06500.254895.2
200010.30.00660.04110.161294.71.00660.04110.161294.70.30660.04110.161294.7
500010.30.00360.02450.095994.61.00360.02450.095994.60.30360.02450.095994.6
50010.7−0.12490.25891.015097.40.87510.25891.015097.40.57510.25891.015097.4
100010.7−0.11620.22930.898996.10.88380.22930.898996.10.58380.22930.898996.1
200010.7−0.07850.19010.745191.00.92150.19010.745191.00.62150.19010.745191.0
500010.7−0.04340.15400.603893.80.95660.15400.603893.80.65660.15400.603893.8
50011−0.10060.32081.257692.70.89940.32081.257692.70.89940.32081.257692.7
100011−0.03990.21740.852296.70.96010.21740.852296.70.96010.21740.852296.7
200011−0.01490.13730.538197.30.98510.13730.538197.30.98510.13730.538197.3
500011−0.00380.07600.297893.70.99620.07600.297893.70.99620.07600.297893.7
50011.2−0.05250.29841.169896.60.94750.29841.169896.61.14750.29841.169896.6
100011.2−0.01140.18270.716198.10.98860.18270.716198.11.18860.18270.716198.1
200011.20.00040.11180.438396.01.00040.11180.438396.01.20040.11180.438396.0
500011.2−0.00240.06370.249994.30.99760.06370.249994.31.19760.06370.249994.3
50−120.3−0.99020.06410.251294.72.00980.06410.251294.70.30980.06410.251294.7
100−120.3−0.99230.04620.181094.92.00770.04620.181094.90.30770.04620.181094.9
200−120.3−0.99580.03010.118194.72.00420.03010.118194.70.30420.03010.118194.7
500−120.3−0.99870.01850.072394.82.00130.01850.072394.80.30130.01850.072394.8
sd corresponds to the standard deviation, ali (average length of interval) is the average length of the confidence interval, and c the empirical coverage of the respective ME of the parameters, based on a 95% confidence interval.
Table 3. MLE simulation of 1000 iterations of the model E M L ( μ , σ , λ ) .
Table 3. MLE simulation of 1000 iterations of the model E M L ( μ , σ , λ ) .
n μ σ λ μ ^ sd ( μ ^ ) ali ( μ ^ ) c ( μ ^ ) σ ^ sd ( σ ^ ) ali ( σ ^ ) c ( σ ^ ) λ ^ sd ( λ ^ ) ali ( λ ^ ) c ( λ ^ )
50010.30.04700.52262.048693.60.93270.38851.522994.20.31750.20960.821696.2
100010.30.03260.34391.348094.20.98220.25340.993394.80.30810.11390.446394.9
200010.30.00930.23000.901594.50.99370.17460.684395.40.30350.07060.276995.1
500010.3−0.00610.14080.552195.00.99560.10870.426095.30.29990.04440.174094.6
50010.7−0.00230.38341.503194.50.94910.26111.023795.60.79740.56172.201794.8
100010.70.02550.28721.125794.10.96920.19130.749894.60.76070.39871.563095.7
200010.70.02610.20110.788294.91.00040.13960.547195.90.75270.27501.077996.1
500010.70.00840.12110.474795.20.99680.08290.324994.90.71300.12290.481694.8
50011.0−0.01490.37551.471895.40.91820.23690.928693.71.20521.03834.070193.8
100011.00.01760.28051.099794.70.96540.16970.665395.01.18740.82153.220493.7
200011.00.02480.21500.842894.60.98980.12660.496294.31.15280.62742.459594.8
500011.00.01060.13610.533794.70.99240.08420.330094.71.04730.32081.257496.5
50011.2−0.06440.34301.344494.80.89790.21390.838592.11.25770.94883.719595.5
100011.2−0.00130.27581.081294.60.95680.16290.638493.51.38210.90023.528793.1
200011.20.00900.21230.832294.90.98490.12000.470495.11.36750.72092.825994.2
500011.20.01230.13730.538394.50.99690.07920.310494.61.28310.42901.681995.5
50−120.3−0.88210.98993.880595.01.87770.73522.881993.80.31350.20270.794697.3
100−120.3−0.96010.67812.658296.11.94830.50541.981394.60.30710.11500.450795.5
200−120.3−0.96810.46531.823995.01.99170.33241.303094.90.30610.07020.275494.5
500−120.3−0.98830.28071.100494.61.99600.21970.861294.70.30210.04480.175594.9
sd corresponds to the standard deviation, ali (average length of interval) is the average length of the confidence interval, and c the empirical coverage of the respective EMV of the parameters, based on a 95% confidence interval.
Table 4. Summary Statistics for the Nickel Concentration Data Set.
Table 4. Summary Statistics for the Nickel Concentration Data Set.
n y ¯ s y γ 1 γ 2
8521.337216.63912.355911.1917
Table 5. Maximum likelihood estimators for M L , E M L O G , E M G , and E M L models for the soil nickel concentration data set, with their corresponding standard deviations in parentheses and comparison criteria AIC, BIC, CAIC, and HQIC.
Table 5. Maximum likelihood estimators for M L , E M L O G , E M G , and E M L models for the soil nickel concentration data set, with their corresponding standard deviations in parentheses and comparison criteria AIC, BIC, CAIC, and HQIC.
Parameter Estimates ML EMLOG EMG EML
μ ^ 11.0020 (0.0657)18.9149 (1.4048)10.0433 (0.0771)7.020 (0.0462)
σ ^ 11.6843 (1.21645)7.6833 (0.7231)9.0165 (0.0766)5.1452 (0.0258)
λ ^ 2.2279 (0.2418) 0.7810 (0.0610)0.3733 (0.0443)
AIC687.022699.207685.0531682.053
BIC694.385704.092692.381689.381
CAIC695.548705.092693.381690.381
HQIC690.168701.172688.001685.001
Table 6. Summary statistics for neodymium concentration data.
Table 6. Summary statistics for neodymium concentration data.
n y ¯ s y γ 1 γ 2
8635.103234.33073.884717.3951
Table 7. Maximum likelihood estimators for models M L , E M L O G , E M G , and E M L for the neodymium concentration data set in the soil, with their corresponding standard deviations in parentheses and comparison criteria AIC, BIC, CAIC, and HQIC.
Table 7. Maximum likelihood estimators for models M L , E M L O G , E M G , and E M L for the neodymium concentration data set in the soil, with their corresponding standard deviations in parentheses and comparison criteria AIC, BIC, CAIC, and HQIC.
Parameter Estimates ML EMLOG EMG EML
μ ^ 13.0001 (0.2804)29.0578 (2.1736)15.3836 (2.8609)10.44577 (0.0388)
σ ^ 18.9313 (1.8033)12.4762 (1.1937)17.9653 (1.0407)6.8136 (0.0091)
λ ^ 2.9883 (0.3412) 0.9147 (0.1363)0.2831 (0.0339)
AIC768.088802.523792.496763.294
BIC775.451807.432799.859770.567
CAIC776.451808.432800.859771.657
HQIC771.051804.499795.459766.257
Table 8. Summary statistics for nitrogen concentration data.
Table 8. Summary statistics for nitrogen concentration data.
n y ¯ s y γ 1 γ 2
3670.61890.0078−1.320512.4692
Table 9. Comparison of the maximum likelihood estimators for nitrogen concentration data between the M L , E M L O G , E M G , and E M L distributions with their corresponding standard deviations in parentheses and comparison criteria AIC, BIC, CAIC and HQIC.
Table 9. Comparison of the maximum likelihood estimators for nitrogen concentration data between the M L , E M L O G , E M G , and E M L distributions with their corresponding standard deviations in parentheses and comparison criteria AIC, BIC, CAIC and HQIC.
Parameter Estimates ML EMLOG EMG EML
μ ^ 0.6165 (0.0007)0.6192 (0.0003)0.6132 (0.0005)0.6147 (0.0005)
σ ^ 0.0065 (0.0003)0.0041 (0.0001)0.0064 (0.0002)0.0045 (0.0002)
λ ^ 1.5045 (0.1761) 1.0049 (0.0969)0.9616 (0.1368)
AIC−2549.257−2062.155−2465.067−2560.045
BIC−2530.541−2054.345−2453.351−2548.329
CAIC−2536.541−2053.344−2452.351−2547.329
HQIC−2544.602−2059.052−2460.412−2555.390
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Reyes, J.; Rojas, M.A.; Cortés, P.L.; Arrué, J. A More Flexible Asymmetric Exponential Modification of the Laplace Distribution with Applications for Chemical Concentration and Environment Data. Mathematics 2022, 10, 3515. https://doi.org/10.3390/math10193515

AMA Style

Reyes J, Rojas MA, Cortés PL, Arrué J. A More Flexible Asymmetric Exponential Modification of the Laplace Distribution with Applications for Chemical Concentration and Environment Data. Mathematics. 2022; 10(19):3515. https://doi.org/10.3390/math10193515

Chicago/Turabian Style

Reyes, Jimmy, Mario A. Rojas, Pedro L. Cortés, and Jaime Arrué. 2022. "A More Flexible Asymmetric Exponential Modification of the Laplace Distribution with Applications for Chemical Concentration and Environment Data" Mathematics 10, no. 19: 3515. https://doi.org/10.3390/math10193515

APA Style

Reyes, J., Rojas, M. A., Cortés, P. L., & Arrué, J. (2022). A More Flexible Asymmetric Exponential Modification of the Laplace Distribution with Applications for Chemical Concentration and Environment Data. Mathematics, 10(19), 3515. https://doi.org/10.3390/math10193515

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop