Next Article in Journal
The Stability of a Predator–Prey Model with Cross-Dispersal in a Multi-Patch Environment
Next Article in Special Issue
Bayesian and Non-Bayesian Inference to Bivariate Alpha Power Burr-XII Distribution with Engineering Application
Previous Article in Journal
Robust Semi-Infinite Interval Equilibrium Problem Involving Data Uncertainty: Optimality Conditions and Duality
Previous Article in Special Issue
Calibration Estimators with Different Types of Distance Measures Under Stratified Sampling in the Presence of Measurement Error
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Skew-Symmetric Generalized Normal and Generalized t Distributions

1
Department of Statistics, University of Pretoria, Pretoria 0002, South Africa
2
Department of Mathematics and Statistics, University of Neyshabur, Neyshabur, Iran
3
Statistical Center of Iran, Tehran, Iran
4
College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
*
Author to whom correspondence should be addressed.
Axioms 2024, 13(11), 782; https://doi.org/10.3390/axioms13110782
Submission received: 1 October 2024 / Revised: 1 November 2024 / Accepted: 4 November 2024 / Published: 13 November 2024

Abstract

:
In this paper, we introduce the skew-symmetric generalized normal and the skew-symmetric generalized t distributions, which are skewed extensions of symmetric special cases of generalized skew-normal and generalized skew-t distributions, respectively. We derive key distributional properties for these new distributions, including a recurrence relation and an explicit form for the cumulative distribution function (cdf) of the skew-symmetric generalized t distribution. Numerical examples including a simulation study and a real data analysis are presented to illustrate the practical applicability of these distributions.

1. Introduction

Azzalini [1] introduced the skew-normal distribution S N ( λ ) characterized by the following density function:
2 ϕ ( x ) Φ ( λ x ) , x R ,
where ϕ ( · ) represents the normal density function, and Φ ( · ) denotes the standard normal cumulative distribution function. The S N distribution has gained considerable attention due to its ability to capture asymmetry in data while preserving key characteristics of the normal distribution. Its flexibility has made it particularly useful in various fields, such as finance, environmental studies, and biomedical research.
Subsequently, Jamalizadeh et al. [2] proposed a two-parameter generalized SN distribution G S N ( λ 1 , λ 2 ) with the following density function:
ϕ G S N ( x ; λ 1 , λ 2 ) = 2 π cos 1 λ 1 λ 2 1 + λ 1 2 1 + λ 2 2 ϕ ( x ) Φ ( λ 1 x ) Φ ( λ 2 x ) , x R ,
where λ 1 and λ 2 are real numbers that enhance the model’s flexibility in capturing asymmetric data distributions. This two-parameter model effectively accommodates a wider range of skewness and kurtosis, offering more flexibility compared to its one-parameter counterpart.
Building on this, Jamalizadeh and Balakrishnan [3] introduced a three-parameter G S N distribution G S N ( λ 1 , λ 2 , ρ ) , which can be viewed as a special case of the unified multivariate skew-normal distribution introduced by Arellano-Valle and Azzalini [4]. The density function of G S N is defined as follows:
ϕ G S N ( x ; λ 1 , λ 2 , ρ ) = 2 π cos 1 ( ρ + λ 1 λ 2 ) 1 + λ 1 2 1 + λ 2 2 ϕ ( x ) Φ 2 ( λ 1 x , λ 2 x , ρ ) , x R ,
where Φ 2 represents the cumulative distribution function of the standard bivariate normal distribution with correlation ρ (with | ρ | < 1 ). This three-parameter model enhances the distribution’s capability to provide a more flexible fit for complex datasets and to accommodate dependencies between variables.
Remark 1.
In the special case where λ 1 = λ 2 = λ , the density function in (1) simplifies to the generalized normal distribution G N ( λ , ρ ) , given by
ϕ G N ( x ; λ , ρ ) = c ( λ , ρ ) ϕ ( x ) Φ 2 ( λ x , λ x , ρ ) , x R ,
where
c ( λ , ρ ) = 2 π cos 1 ( ρ λ 2 ) 1 + λ 2 ,
defines the normalization constant.
This distribution represents a symmetric distribution centered at zero as depicted in Figure 1. The capability of this distribution to retain symmetry while introducing elements of skewness makes it particularly valuable for statistical modeling applications.
Definition 1.
The family of skew-symmetric (-modulated) distributions is defined by the following density function [5]:
ϕ S S ( x ) = 2 f ( x ) Π ( ω ( x ) ) ,
where f ( · ) is a symmetric density function (symmetric about zero), ω ( · ) is an odd function, and Π : R [ 0 , 1 ] is a distribution function such that Π ( t ) + Π ( t ) = 1 .
This definition highlights the interplay between symmetry and skewness, enabling nuanced modeling of real-world phenomena. Azzalini and Regoli [6] explored various properties of skew-symmetric (-modulated) distributions, contributing significantly to the theoretical framework essential for practical applications. Several studies have investigated skew-symmetric distributions, including that of Nadarajah and Kotz [7], which introduced a family of skew-symmetric normal distributions characterized by the density function 2 ϕ ( x ) F ( α x ) , where α is a real constant and F ( · ) is an absolutely continuous distribution function with a symmetric density. By utilizing distribution functions such as normal, Student’s t, Laplace, logistic, and uniform distributions for F ( · ) , the authors demonstrated the versatility of skew-symmetric models across different contexts.
Gupta and Chang [8] examined a class of multivariate skew distributions, emphasizing the importance of skewness in multivariate data analysis. Meanwhile, Gomez et al. [9] studied a general family of skew-symmetric distributions generated by the normal distribution’s cumulative distribution function, further expanding the theoretical landscape of these distributions. Additionally, Nekoukhou and Alamatsaz [10] introduced a family of skew-symmetric Laplace distributions, which have practical applications in fields such as finance and risk management. Salehi and Azzalini [11] considered a Kotz-type distribution, where the tail weight and degree of peakedness is regulated by two parameters instead of a single one, and with a built symmetry-modulated Kotz-type distribution. They made statistical inference based on the likelihood function on three real data sets.
In this paper, we aim to introduce a three-parameter skew-symmetric generalized normal, and a four-parameter skew-symmetric generalized t distributions as two new flexible models with wider ranges of skewness. The remainder of this paper is structured as follows: Section 2 presents the skew-symmetric generalized normal distribution and discusses its key properties. Section 3 then introduces the skew-symmetric generalized t distribution, providing a recurrence relation and an explicit form for its cumulative distribution function (cdf). Section 4 offers numerical examples, including a simulation study and an analysis of real data. Finally, the paper concludes in Section 5.

2. Skew-Symmetric Generalized Normal Distribution

The three-parameter skew-symmetric generalized normal distribution, denoted as S S G N ( λ , ρ , α ) , is derived by substituting the symmetric density function f ( . ) from (2) into (4). In this formulation, we utilize the standard normal distribution function, represented as Π ( . ) , and define the weighting function ω ( x ) = α x . This approach allows us to capture the skewness and symmetry properties inherent in the distribution.
The density function for the S S G N is expressed mathematically as follows:
ϕ S S G N ( x ; λ , ρ , α ) = 2 c ( λ , ρ ) ϕ ( x ) Φ 2 ( λ x , λ x , ρ ) Φ ( α x ) , x R ,
where α R , λ R , and ρ ( | ρ | < 1 ) are shape parameters, and c ( λ , ρ ) is a normalization constant defined in (3). This formulation highlights the interplay between the parameters λ , ρ , and α , which together characterize the shape and behavior of the distribution.
In cases where ρ = 0 , the density function of the S S G N simplifies significantly, leading to the following expression:
ϕ S S G N ( x ; λ , α ) = 4 π cos 1 λ 2 1 + λ 2 ϕ ( x ) Φ ( λ x ) Φ ( λ x ) Φ ( α x ) , x R .
This simplification exposes the core structure of the distribution in the absence of the correlation parameter, facilitating a clearer analysis of the effects and roles of the remaining parameters.
The graphical representation of the density function of S S G N for various parameter values is illustrated in Figure 2. These plots provide valuable insights into how the parameters λ , ρ , and α influence the shape and characteristics of the S S G N . By examining these plots, one can observe the effects of skewness and kurtosis, which are critical in understanding the distribution’s behavior in practical applications.
Overall, the S S G N serves as a versatile model in statistical analysis, accommodating a range of data characteristics through its parameterization, and the visualizations further enhance our comprehension of its properties.
Remark 2.
The following results are readily obtained:
1. 
S S G N ( 0 , 0 , 0 ) N ( 0 , 1 )
2. 
S S G N ( 0 , 0 , α ) S N ( α )
3. 
S S G N ( λ , ρ , 0 ) G N ( λ , ρ )
4. 
S S G N ( λ , ρ , α ) S S G N ( λ , ρ , α ) (Thus, S S G N is not identifiable.)
5. 
If X S S G N ( λ , ρ , α ) , then X S S G N ( λ , ρ , α )
6. 
If X S S G N ( λ , ρ , α ) , then X = d U | α ( U ) > Z , where Z N ( 0 , 1 ) , U G N ( λ , ρ ) , and Z U .

Moments

In this section, we analyze the skewness and kurtosis of the three-parameter S S G N distribution. To facilitate this analysis, we first derive the moment-generating function (MGF) of the S S G N .
Theorem 1.
The moment-generating function of S S G N ( λ , ρ , α ) is given by
M ( t ; λ , ρ , α ) = 2 c ( λ , ρ ) exp t 2 2 Φ 3 λ t 1 + λ 2 , λ t 1 + λ 2 , α t 1 + α 2 , R ,
where
R = 1 ρ 12 ρ 13 1 ρ 13 1 = 1 ρ λ 2 1 + λ 2 λ α 1 + λ 2 1 + α 2 1 λ α 1 + λ 2 1 + α 2 1 .
Proof. 
To derive the moment-generating function, we start with the integral representation of the MGF:
M ( t ; λ , ρ , α ) = 2 c ( λ , ρ ) e t x ϕ ( x ) Φ 2 ( λ x , λ x , ρ ) Φ ( α x ) d x = 2 c ( λ , ρ ) e t 2 2 ϕ ( x t ) Φ 2 ( λ x , λ x , ρ ) Φ ( α x ) d x = 2 c ( λ , ρ ) e t 2 2 ϕ ( z ) Φ 2 ( λ ( z + t ) , λ ( z + t ) , ρ ) Φ ( α ( z + t ) ) d z = 2 c ( λ , ρ ) e t 2 2 E Φ 2 ( λ ( z + t ) , λ ( z + t ) , ρ ) Φ ( α ( z + t ) ) = 2 c ( λ , ρ ) e t 2 2 P ( Y 1 λ Z < λ t , Y 2 + λ Z < λ t , Y 3 α Z < α t ) = 2 c ( λ , ρ ) e t 2 2 Φ 3 λ t 1 + λ 2 , λ t 1 + λ 2 , α t 1 + α 2 , R ,
where ( Y 1 , Y 2 ) T follows a bivariate normal distribution N 2 ( 0 , 0 , 1 , 1 , ρ ) , which is independent of Y 3 and Z, where Z is independently and identically distributed as N ( 0 , 1 ) . □
The derivatives of the moment-generating function, evaluated at t = 0 , provide the moments of the S S G N ( λ , ρ , α ) . To aid in this process, we present the following lemma.
Lemma 1.
Let γ ,   β R p defined as γ = γ 1 , γ 2 , , γ p T and β = β 1 , β 2 , , β p T . Let Σ = σ i j denote a p × p positive definite covariance matrix. Furthermore, we assume that for i = 1 , 2 , , p , γ ,   β , and Σ are partitioned as follows:
γ = γ i γ i , β = β i β i , Σ = σ i i σ i i T σ i i Σ i i ,
then, for s R we have [12]
s Φ p γ s + β ; Σ = i = 1 p γ i σ i i ϕ γ i s + β i σ i i × Φ p 1 γ i γ i σ i i σ i i s + β i β i σ i i σ i i ; Σ i | i .
where γ i = γ 1 , , γ i 1 , γ i + 1 , , γ p T and Σ i | i = Σ i i σ i i σ i i T σ i i .
The first four moments of S S G N are expressed as follows:
E ( X ) = 2 c ( λ , ρ ) 2 π γ 1 π sin 1 ρ 13 + ρ 12 ρ 13 1 ρ 12 2 1 ρ 13 2 + γ 2 1 4 + 1 2 π sin 1 ρ 12 + ρ 13 2 1 ρ 13 2 ,
E ( X 2 ) = 1 λ 2 c ( λ , ρ ) ( 1 + ρ ) π ( 1 + λ 2 ) 1 ρ 2 + 2 λ 2 + 2 ρ λ 2 ,
E ( X 3 ) = 2 c ( λ , ρ ) 2 π γ 1 3 3 γ 1 π sin 1 ρ 13 + ρ 12 ρ 13 1 ρ 12 2 1 ρ 13 2 ( γ 2 3 3 γ 2 ) 1 4 + 1 2 π sin 1 ρ 12 + ρ 13 2 1 ρ 13 2 γ 1 π A ( γ 1 + γ 1 ρ 12 ) 1 ρ 12 2 B ( γ 2 γ 1 ρ 13 ) 1 ρ 13 2 + C γ 2 ( γ 1 γ 2 ρ 13 ) π 1 ρ 13 2 ,
E ( X 4 ) = 3 + c ( λ , ρ ) ( 1 + ρ ) π ( 1 + λ 2 ) 1 ρ 2 + 2 λ 2 + 2 ρ λ 2 3 λ 4 1 + λ 2 + λ 4 ( 1 + ρ ) ( 1 + λ 2 ) ( 1 + 2 λ 2 ρ ) 6 λ 2 ,
where γ 1 = λ 1 + λ 2 , γ 2 = α 1 + α 2 , and
A = 1 1 ρ 13 2 ( ρ 13 + ρ 12 ρ 13 ) 2 1 ρ 12 2 γ 2 γ 1 ρ 13 ( γ 1 + γ 1 ρ 12 ) ( ρ 13 + ρ 12 ρ 13 ) ( 1 ρ 12 2 ) ,
B = 1 1 ρ 12 2 ( ρ 13 + ρ 12 ρ 13 ) 2 1 ρ 13 2 γ 1 γ 1 ρ 12 + ( γ 2 γ 1 ρ 13 ) ( ρ 13 + ρ 12 ρ 13 ) ( 1 ρ 13 2 ) ,
C = 1 1 ρ 13 2 ( ρ 12 + ρ 13 2 ) 2 1 ρ 13 2 γ 1 + γ 2 ρ 13 ( γ 1 γ 2 ρ 13 ) ( ρ 12 + ρ 13 2 ) ( 1 ρ 13 2 ) .
The skewness and kurtosis of the S S G N can be derived from Equations (9)–(12) as follows:
C S = E ( X 3 ) 3 E ( X ) E ( X 2 ) + 2 E 3 ( X ) V a r 3 / 2 ( X ) ,
C K = E ( X 4 ) 4 E ( X ) E ( X 3 ) + 6 E 2 ( X ) E ( X 2 ) 3 E 4 ( X ) V a r 2 ( X ) 3 ,
where
V a r ( X ) = 1 λ 2 c ( λ , ρ ) ( 1 + ρ ) π ( 1 + λ 2 ) 1 ρ 2 + 2 λ 2 + 2 ρ λ 2 E 2 ( X ) .
The plots illustrating the skewness and kurtosis of S S G N for various parameter values are presented in Figure 3 and Figure 4, respectively.
As shown in Figure 3, the skewness of the S S G N increases with higher values of α and ρ , indicating a greater asymmetry in the distribution. Specifically, the maximum skewness occurs at λ = 3.46878 , resulting in a value of 1.2382 . In contrast, Figure 4 illustrates that the kurtosis initially decreases as | α | increases, before rising again. The peak kurtosis value is observed at 1.6648 for λ = 4.0195 . This behavior highlights the capacity of S S G N to model data with varying levels of asymmetry and peakedness, providing a flexible framework for statistical analysis.

3. Skew-Symmetric Generalized t Distribution

Jamalizadeh and Balakrishnan [3] defined a four-parameter generalized skew-t distribution, G S t ( ν , λ 1 , λ 2 , ρ ) , with the following density function:
f G S t ( x ; ν , λ 1 , λ 2 , ρ ) = 2 π cos 1 ( ρ + λ 1 λ 2 ) 1 + λ 1 2 1 + λ 2 2 g ( x ; ν ) G 2 λ 1 x 1 + ν ν + x 2 , λ 2 x 1 + ν ν + x 2 ; ρ , ν + 1 ,
where x R , g ( · ; ν ) is the density function of the t distribution with ν degrees of freedom, and G 2 ( · , · ; ρ , ν ) represents the distribution function of the standard bivariate t distribution with correlation ρ (where | ρ | < 1 ) and ν + 1 degrees of freedom.
Remark 3.
For the special case λ 1 = λ 2 = λ , the density function (16) reduces to G t ( ν , λ , ρ ) with the following density function:
f G t ( x ; ν , λ , ρ ) = c ( λ , ρ ) g ( x ; ν ) G 2 λ x 1 + ν ν + x 2 , λ x 1 + ν ν + x 2 ; ρ , ν + 1 , x R ,
where c ( λ , ρ ) is defined in (3).
This is a symmetric distribution, centered at 0, as illustrated in Figure 5.
The four-parameter skew-symmetric generalized t distribution, S S G t ( ν , λ , ρ , α ) , is obtained by substituting (17) into (4) as a symmetric density function f ( . ) , using the standard normal distribution function Π ( . ) and ω ( x ) = α x . The density function of S S G t is given by
f S S G t ( x ; ν , λ , ρ , α ) = 2 c ( λ , ρ ) g ( x ; ν ) G 2 λ x 1 + ν ν + x 2 , λ x 1 + ν ν + x 2 ; ρ , ν + 1 Φ ( α x ) , x R ,
where α R , λ R , ρ ( | ρ | < 1 ) are shape parameters, ν R + is the tail parameter, and c ( λ , ρ ) is defined in (3). When ρ = 0 , the density function of S S G t becomes
f S S G t ( x ; ν , λ , α ) = 4 π cos 1 λ 2 1 + λ 2 g ( x ; ν ) G λ x 1 + ν ν + x 2 ; ν + 1 G λ x 1 + ν ν + x 2 ; ρ , ν + 1 Φ ( α x ) .
The plots of the density function of S S G t for various parameter values are shown in Figure 6.
Remark 4.
The following results are readily obtained:
1. 
S S G t ( ν , 0 , 0 , 0 ) t ( ν )
2. 
S S G t ( ν , 0 , 0 , α ) S t ( ν , α )
3. 
S S G t ( ν , λ , ρ , 0 ) G t ( ν , λ , ρ )
4. 
S S G t ( ν , λ , ρ , α ) S S G t ( ν , λ , ρ , α ) (Thus, S S G t is not identifiable.)
5. 
If X S S G t ( ν , λ , ρ , α ) , then X S S G t ( ν , λ , ρ , α )
6. 
If X S S G t ( ν , λ , ρ , α ) , then X = d U | α ( U ) > Z , where Z N ( 0 , 1 ) , U G N ( λ , ρ ) , and Z U .
Remark 5.
If X S S G t ( ν , λ , ρ , α ) , then X = d W 1 / 2 Z , where W χ ν 2 / ν , Z S S G N ( λ , ρ , α ) , and W Z . Thus, the integral form of the cumulative distribution function (cdf) of the S S G t distribution is as follows:
F S S G t t ; ν , λ , ρ , α = E Φ S S G N t W ; λ , ρ , α = 2 π C ν 0 x ν 1 ϕ x Φ S S G N t x ν ; λ , ρ , α d x ,
where
C ν = 1 Γ ν 2 2 ν 1 / 2 .
Amiri et al. [13] obtained efficient recursive computational algorithms for multivariate t and multivariate unified skew-t distributions. Also, Salehi et al. [12] obtained recurrence relations for the cdf and the density function of the generalized skew two-piece skew-t distribution. Here, we intend to achieve to a recurrence relation for the cdf of the S S G t distribution from the integration form given by (20).
Theorem 2.
The following recurrence relation holds for all ν > 1 :
F S S G t t ; ν + 1 , λ , ρ , α = F S S G t ν 1 ν + 1 t ; ν 1 , λ , ρ , α + 2 c λ , ρ h t ; ν + 1 , ν 1 G 3 ν λ t ν + 1 + t 2 , ν λ t ν + 1 + t 2 , ν α t ν + 1 + t 2 ; ν , R * ,
where h t ; ν 1 , ν 2 = t ν 1 ν 2 g ν 2 ν 1 t ; ν 2 , G 3 ( · , · , · ; ν , R * ) stands for the cdf of the trivariate Student’s t distribution with ν degrees of freedom and the correlation matrix
R * = 1 ρ 0 ρ 1 0 0 0 1 .
Proof. 
From (20) and upon integrating by parts, the cdf of S S G t distribution with ν + 1 degrees of freedom is readily obtained as
F S S G t t ; ν + 1 , λ , ρ , α = 2 π C ν + 1 0 x ν ϕ x Φ S S G N t x ν + 1 ; λ , ρ , α d x = F S S G t ν 1 ν + 1 t ; ν 1 , λ , ρ , α + 2 π C ν + 1 0 x ν 1 ϕ x x Φ S S G N t x ν + 1 ; λ , ρ , α d x .
Now, the second part of the right-hand side (RHS) of (22) is simplified to
R H S = 2 π C ν + 1 t ν + 1 0 x ν 1 ϕ x ϕ S S G N t x ν + 1 ; λ , ρ , α d x = 2 π C ν + 1 2 c λ , ρ t ν + 1 0 x ν 1 ϕ x ϕ t x ν + 1 Φ 2 λ t x ν + 1 , λ t x ν + 1 ; ρ Φ α t x ν + 1 d x = c λ , ρ Γ ν 2 ν + 1 ν 1 / 2 π Γ ν + 1 2 t ν + 1 + t 2 ν / 2 × 2 π C ν 0 x ν 1 ϕ x Φ 2 λ t x ν + 1 + t 2 , λ t x ν + 1 + t 2 ; ρ Φ α t x ν + 1 + t 2 d x = 2 c λ , ρ h t ; ν + 1 , ν 1 × 2 π C ν 0 x ν 1 ϕ x Φ 3 ν λ t x ν ν + 1 + t 2 , ν λ t x ν ν + 1 + t 2 , ν α t x ν ν + 1 + t 2 ; R * d x = 2 c λ , ρ h t ; ν + 1 , ν 1 G 3 ν λ t ν + 1 + t 2 , ν λ t ν + 1 + t 2 , ν α t ν + 1 + t 2 ; ν , R * .
Remark 6.
From Theorem 2, the following results are respectively concluded for odd and even values of ν
F S S G t t ; ν , λ , ρ , α = F S S G t t ν ; 1 , λ , ρ , α + 2 c λ , ρ i = 1 ν 1 / 2 h t ; ν , 2 i 1 G 3 2 i λ t ν + t 2 , 2 i λ t ν + t 2 , 2 i α t ν + t 2 ; 2 i , R * ,
and
F S S G t t ; ν , λ , ρ , α = F S S G t 2 ν t ; 2 , λ , ρ , α + 2 c λ , ρ i = 2 ν / 2 h t ; ν , 2 i 2 G 3 2 i 1 λ t ν + t 2 , 2 i 1 λ t ν + t 2 , 2 i 1 α t ν + t 2 ; 2 i 1 , R * .
There is no explicit form for F S S G t t ; 1 , λ , ρ , α to be used as the starting point in (24). But an explicit form for f S S G t t ; 1 , λ , ρ , α is obtained as
f S S G t t ; 1 , λ , ρ , α = c λ , ρ π 1 + t 2 G 3 2 λ t 1 + t 2 , 2 λ t 1 + t 2 , 2 α t 1 + t 2 ; 2 , R * .
Also an explicit form for F S S G t t ; 2 , λ , ρ , α is as
F S S G t t ; 2 , λ , ρ , α = Φ S S G N 0 ; λ , ρ , α + c λ , ρ t c λ t 2 + t 2 , ρ Φ S S G N 0 ; λ t 2 + t 2 , ρ , α t 2 + t 2 .
Thus, a closed form for the cdf of the S S G t distribution is accessible.

Moments

According to Remark 5, the r t h moment of X S S G t ( ν , λ , ρ , α ) can be derived as follows:
E ( X r ) = E ( W r / 2 ) E ( Z r ) ,
where
E ( W r / 2 ) = ν 2 r / 2 Γ ν r 2 Γ ν 2 , ν > r .
Thus, the first four moments of S S G t can be obtained using the first four moments of S S G N in Equations (9)–(12). Consequently, the skewness and kurtosis of S S G t can be derived from Equations (13) and (14), respectively. The plots of skewness and kurtosis of S S G t for various parameter values are shown in Figure 7 and Figure 8, respectively.
As observed in Figure 7, along with the numerical optimization results, the skewness of S S G t increases with increasing α and ρ while decreasing with increasing ν . The maximum skewness occurs at λ = 3.8114 , with a value of 4.1821 . From Figure 8, the kurtosis increases with increasing | α | and ρ while decreasing with ν . The maximum kurtosis value is 22.1327 for λ = 4.5767 . Thus, the ranges of skewness and kurtosis of S S G t are wider than those of S S G N .

4. Numerical Illustration

For practical works, the distributions proposed so far in (5) and (18) must be supplied with a location (denoted by μ ) and a scale (denoted by σ ) parameters yielding S S G N ( μ , σ , λ , ρ , α ) and S S G t ( μ , σ , ν , λ , ρ , α , ν ) distributions, respectively. If we assume that the observations x = ( x 1 , , x n ) follow from the former distribution under independence conditions, then the log-likelihood function of ( μ , σ , λ , ρ , α ) is
l ( μ , σ , λ , ρ , α ) = n ln 2 c ( λ , ρ ) n ln σ n 2 ln ( 2 π σ 2 ) 1 2 σ 2 i = 1 n ( x i μ ) 2 + i = 1 n ln Φ 2 λ ( x i μ ) σ , λ ( x i μ ) σ , ρ + i = 1 n ln Φ α ( x i μ ) σ .
Similarly, for the S S G t ( μ , σ , ν , λ , ρ , α ) distribution, we have
l ( μ , σ , ν , λ , ρ , α ) = n ln 2 c ( λ , ρ ) n ln σ + n ln Γ ν + 1 2 n ln Γ ν 2 n 2 ln ( π ν ) ν + 1 2 i = 1 n ln 1 + ( x i μ ) 2 ν σ 2 + i = 1 n ln Φ α ( x i μ ) σ + i = 1 n ln G 2 λ x μ σ 1 + ν ν + x μ σ 2 , λ x μ σ 1 + ν ν + x μ σ 2 ; ρ , ν + 1 .
Maximization of the log-likelihoods given by (26) and (27) which must be performed by numerical techniques lead to the maximum likelihood estimates (MLEs) of the parameters. Using the R programming environment [14], we employ a combination of the global optimizer DEoptim [15] and the local optimizer optim (with the ’L-BFGS-B’ method), available in the DEoptim and stats R packages, respectively. DEoptim package is based on the Differential Evolution (DE) algorithm [16], and its significant performance as a global optimization algorithm on continuous numerical minimization problems has been extensively studied [17].

4.1. Simulation Study

In this section, we intend to carry out a brief simulation study in order to investigate the behavior of the MLEs of the parameters of S S G N ( μ , σ , λ , ρ , α ) distribution. To this end, we set some selected values as the true parameters, μ = 0 , σ = 1 , λ = 1 , α = 0.4 , ρ = 0.5 , and consider samples with different sizes, n = 100 , 200 , 500 , 1000 , 2000 , as the given observations. To generate samples from S S G N distribution we employ the acceptance–rejection algorithm using the stochastic representation given by Remark 2, part 6.
As the evaluation metrics measured for the estimators, the mean squared error (MSE) and bias are computed, and the results are summarized in Table 1. Moreover, Figure 9 shows the MSE of the parameters and the absolute value of bias for different values of n.
As it is observed from Figure 9, all of the MLEs are consistent but with different convergence rates. More specifically, the performance of the MLE of ρ for the small and medium sample sizes is not as good as those of other estimators. Therefore, we recommend using the distribution (6) instead of its complementary version in (5) when there is no significant difference in the Akaike information criteria (AICs) of these models for the given real data.

4.2. Real Data Analysis

To demonstrate the practical application of the distributions proposed so far, we examine a real dataset that includes the strength of carbon fibers [18] (see Table 2). Here, we also consider S N and S t distributions as the potential competitors of the distributions proposed so far. For fitting these distributions, we respectively employ the functions sn . mple and st . mple , available in the R package sn [19,20].
The MLEs of parameters, the corresponding standard error, log-likelihood, Akaike information criterion (AIC), Bayesian Information Criterion (BIC) and the p-value of the Kolmogorov–Smirnov (KS) test are reported in Table 3. According to the p-value of the KS test, the goodness-of-fits of all distributions are confirmed. However, as seen in Table 3, S S G N has the minimum AIC and BIC and thus provides the best fit for the data. The corresponding Q-Q plot of the S S G N model, along with the histogram of the data including the fitted curves, is shown in Figure 10.
The results also indicate that the S S G t distribution provides a good fit for the carbon fiber strength data as evidenced by its AIC value and the p-value from the KS test.

5. Conclusions

In this paper, we introduced the skew-symmetric generalized normal distribution ( S S G N ) and the skew-symmetric generalized t distribution ( S S G t ), extending the framework established by previous studies on skew-normal and skew-t distributions. We derived the density functions, moments, and important statistical properties of these distributions, demonstrating their flexibility in modeling asymmetric data. Moreover, a recurrence relation as well as an exact form for the cdf of the S S G t distribution were obtained. A brief simulation study was also conducted to investigate the behavior of the MLEs of the S S G N parameters. Then, a numerical illustration provided evidence of the practical applicability of the S S G N and S S G t distributions by fitting them to a real dataset concerning the strength of carbon fibers. The results indicated that the S S G N distribution outperformed its competitors, such as the skew-normal and skew-t distributions, in terms of the AIC and the KS test.

Author Contributions

Methodology, N.N.R., M.S., Y.M. and D.-G.C.; Writing—original draft, N.N.R., M.S. and Y.M.; Writing—review & editing, D.-G.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Acknowledgments

This work was based upon research supported in part by the RDP grant at the University of Pretoria, the National Research Foundation (NRF) of South Africa, Ref.: RA210106581084, grant No. 150170 and the South African DST-NRF-MRC SARChI Research Chair in Biostatistics (Grant No. 114613). The opinions expressed and conclusions arrived at are those of the authors and are not necessarily to be attributed to the NRF.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  2. Jamalizadeh, A.; Behboodian, J.; Balakrishnan, N. A two-parameter generalized skew-normal distribution. Stat. Probab. Lett. 2008, 78, 1722–1726. [Google Scholar] [CrossRef]
  3. Jamalizadeh, A.; Balakrishnan, N. Order statistics from trivariate normal and tν-distributions in terms of generalized skew-normal and skew-tν distributions. J. Stat. Plan. Inference 2009, 139, 3799–3819. [Google Scholar] [CrossRef]
  4. Arellano-Valle, R.B.; Azzalini, A. On the Unification of Families of Skew-normal Distributions. Scand. J. Stat. 2006, 33, 561–574. [Google Scholar] [CrossRef]
  5. Azzalini, A.; Capitanio, A. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew-t distribution. J. R. Stat. Soc. Ser. B 2003, 65, 367–389. [Google Scholar] [CrossRef]
  6. Azzalini, A.; Regoli, G. Some properties of skew-symmetric distributions. Ann. Inst. Stat. Math. 2012, 64, 857–879. [Google Scholar] [CrossRef]
  7. Nadarajah, S.; Kotz, S. Skewed distributions generated by the normal kernel. Stat. Probab. Lett. 2003, 65, 269–277. [Google Scholar] [CrossRef]
  8. Gupta, A.K.; Chang, F.C. Multivariate skew-symmetric distributions. Appl. Math. Lett. 2003, 16, 643–646. [Google Scholar] [CrossRef]
  9. Gomez, H.W.; Venegas, O.; Bolfarine, H. Skew-symmetric distributions generated by the distribution function of the normal distribution. Environmetrics 2007, 18, 395–407. [Google Scholar] [CrossRef]
  10. Nekoukhou, V.; Alamatsaz, M.H. A family of skew-symmetric-Laplace distributions. Stat. Pap. 2012, 53, 685–696. [Google Scholar] [CrossRef]
  11. Salehi, M.; Azzalini, A. On application of the univariate Kotz distribution and some of its extensions. METRON 2018, 76, 177–201. [Google Scholar] [CrossRef]
  12. Salehi, M.; Jamalizadeh, A.; Doostparast, M. A generalized skew two-piece skew-elliptical distribution. Stat. Pap. 2014, 55, 409–429. [Google Scholar] [CrossRef]
  13. Amiri, M.; Mehrali, Y.; Balakrishnan, N.; Jamalizadeh, A. Efficient recursive computational algorithms for multivariate t and multivariate unified skew-t distributions with applications to inference. Comput. Stat. 2022, 37, 125–158. [Google Scholar] [CrossRef]
  14. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2024; Available online: https://www.R-project.org/ (accessed on 30 September 2024).
  15. Ardia, D.; Mullen, K.; Peterson, B.; Ulrich, J.; Boudt, K. DEoptim: Global Optimization by Differential Evolution; R 259 Package Version 2.2-5; R Core Team: Vienna, Austria, 2020. [Google Scholar]
  16. Storn, R.; Price, K. Differential Evolution—A simple and efficient heuristic for global optimization over continuous 292 spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
  17. Price, K.; Storn, R.M.; Lampinen, J.A. Differential Evolution: A Practical Approach to Global Optimization; Springer Science and Business Media: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
  18. Badar, M.G.; Priest, A.M. Statistical aspects of fiber and bundle strength in hybrid composites. In Progress in Science and Engineering Composites; Hayashi, T., Kawata, K., Umekawa, S., Eds.; ICCM-IV: Tokyo, Japan, 1982; pp. 1129–1136. [Google Scholar]
  19. Azzalini, A. The R Package ‘sn’: The Skew-Normal and Related Distributions such as the Skew-t and the SUN (Version 2.1.1). 2023. Available online: http://azzalini.stat.unipd.it/SN/ (accessed on 30 September 2024).
  20. Azzalini, A.; Salehi, M. Some computational aspects of maximum likelihood estimation of the Skew- distribution. In Computational and Methodological Statistics and Biostatistics; Bekker, A., Chen, G., Ferreira, J., Eds.; Emerging Topics in Statistics and Biostatistics; Springer: Cham, Switzerland, 2020. [Google Scholar] [CrossRef]
Figure 1. The density function of G S N ( λ 1 , λ 2 , ρ ) for ρ = 0.8 .
Figure 1. The density function of G S N ( λ 1 , λ 2 , ρ ) for ρ = 0.8 .
Axioms 13 00782 g001
Figure 2. The density function of S S G N ( λ , ρ , α ) for some choices of the parameters.
Figure 2. The density function of S S G N ( λ , ρ , α ) for some choices of the parameters.
Axioms 13 00782 g002
Figure 3. The skewness of S S G N ( λ , ρ , α ) for the selected parameter values.
Figure 3. The skewness of S S G N ( λ , ρ , α ) for the selected parameter values.
Axioms 13 00782 g003
Figure 4. The kurtosis of S S G N ( λ , ρ , α ) for the selected parameter values.
Figure 4. The kurtosis of S S G N ( λ , ρ , α ) for the selected parameter values.
Axioms 13 00782 g004
Figure 5. The density function of G S t ( ν , λ 1 , λ 2 , ρ ) for ν = 2 , ρ = 0.8 .
Figure 5. The density function of G S t ( ν , λ 1 , λ 2 , ρ ) for ν = 2 , ρ = 0.8 .
Axioms 13 00782 g005
Figure 6. The density function of S S G t ( ν , λ , ρ , α ) for various parameter choices.
Figure 6. The density function of S S G t ( ν , λ , ρ , α ) for various parameter choices.
Axioms 13 00782 g006
Figure 7. The skewness of S S G t ( ν , λ , ρ , α ) for various parameter choices.
Figure 7. The skewness of S S G t ( ν , λ , ρ , α ) for various parameter choices.
Axioms 13 00782 g007
Figure 8. The kurtosis of S S G t ( ν , λ , ρ , α ) for various parameter choices.
Figure 8. The kurtosis of S S G t ( ν , λ , ρ , α ) for various parameter choices.
Axioms 13 00782 g008
Figure 9. The MSE and absolute bias of the MLEs of the SSGN’s parameters for μ = 0 , σ = 1 , λ = 1 , α = 0.4 , ρ = 0.5 , and different values of n.
Figure 9. The MSE and absolute bias of the MLEs of the SSGN’s parameters for μ = 0 , σ = 1 , λ = 1 , α = 0.4 , ρ = 0.5 , and different values of n.
Axioms 13 00782 g009
Figure 10. The histogram of the data and the fitted curves (left) and the Q-Q plot of S S G N (right).
Figure 10. The histogram of the data and the fitted curves (left) and the Q-Q plot of S S G N (right).
Axioms 13 00782 g010
Table 1. MLEs and the corresponding biases and MSEs.
Table 1. MLEs and the corresponding biases and MSEs.
n      μ      σ      λ      α      ρ
Estimate−0.07921.11851.2139 0.6989 0.1043
100Bias−0.07920.11850.2139 0.2989−0.3956
MSE 0.01220.05070.1307 0.1038 0.3129
Estimate−0.05751.06891.1192 0.4923 0.2130
200Bias−0.05750.06890.1192 0.0923−0.2869
MSE 0.00650.01800.0604 0.0875 0.2501
Estimate 0.03681.03041.0460 0.4343 0.3406
500Bias 0.03680.03040.0460 0.0343−0.1593
MSE 0.00510.00100.0028 0.0597 0.1509
Estimate−0.02671.02271.0239 0.4270 0.5493
1000Bias−0.02670.02270.0239 0.0270 0.0493
MSE 0.00400.00070.0021 0.0368 0.0057
Estimate−0.01181.00931.0090 0.3918 0.4787
2000Bias−0.01180.00930.0090−0.0081−0.0212
MSE 0.00370.00020.0008 0.0331 0.0038
Table 2. The strength of carbon fibers [18].
Table 2. The strength of carbon fibers [18].
1.901, 2.132, 2.203, 2.228, 2.257, 2.350, 2.361, 2.396, 2.397, 2.445, 2.454, 2.474, 2.518, 2.522, 2.525,
2.532, 2.575, 2.614, 2.616, 2.618, 2.624, 2.659, 2.675, 2.738, 2.740, 2.856, 2.917, 2.928, 2.937, 2.937,
2.977, 2.996, 3.030, 3.125, 3.139, 3.145, 3.220, 3.223, 3.235, 3.243, 3.264, 3.272, 3.294, 3.332, 3.346,
3.377, 3.408, 3.435, 3.493, 3.501, 3.537, 3.554, 3.562, 3.628, 3.852, 3.871, 3.886, 3.971, 4.024, 4.027,
4.225, 4.395, 5.020.
Table 3. MLEs (standard errors), log-likelihood, AIC, BIC and p-value of KS test.
Table 3. MLEs (standard errors), log-likelihood, AIC, BIC and p-value of KS test.
Distribution SSGN ( μ , σ , λ , α , ρ ) SN ( μ , σ , α ) St ( μ , σ , ν , α ) SSGt ( μ , σ , ν , λ , α , ρ )
μ ^ 2.27102.27092.27102.2710
(0.1063)(0.1667)(0.1406)(0.1237)
σ ^ 1.00051.00041.00041.0004
(0.0139)(0.0201)(0.0110)(0.0108)
λ ^ 0.3001--------0.5012
(0.0109)--------(0.0236)
α ^ 4.00184.41834.41823.9862
(0.0435)(0.0534)(0.0547)(0.0621)
ρ ^ −0.9899--------−0.9899
(0.0102)--------(0.0184)
ν ^ --------55,897.0701340.5657
--------(18.0547)(4.0534)
Log-likelihood−47.4183−55.9015−55.9015−47.0208
AIC104.8366117.8031119.8033106.0417
BIC115.5523124.2324128.3755118.9004
p-value0.45680.87710.87650.4732
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Nakhaei Rad, N.; Salehi, M.; Mehrali, Y.; Chen, D.-G. Skew-Symmetric Generalized Normal and Generalized t Distributions. Axioms 2024, 13, 782. https://doi.org/10.3390/axioms13110782

AMA Style

Nakhaei Rad N, Salehi M, Mehrali Y, Chen D-G. Skew-Symmetric Generalized Normal and Generalized t Distributions. Axioms. 2024; 13(11):782. https://doi.org/10.3390/axioms13110782

Chicago/Turabian Style

Nakhaei Rad, Najmeh, Mahdi Salehi, Yaser Mehrali, and Ding-Geng Chen. 2024. "Skew-Symmetric Generalized Normal and Generalized t Distributions" Axioms 13, no. 11: 782. https://doi.org/10.3390/axioms13110782

APA Style

Nakhaei Rad, N., Salehi, M., Mehrali, Y., & Chen, D. -G. (2024). Skew-Symmetric Generalized Normal and Generalized t Distributions. Axioms, 13(11), 782. https://doi.org/10.3390/axioms13110782

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop