Bivariate Power-Skew-Elliptical Distribution

Martínez-Flórez, Guillermo; Tovar-Falón, Roger; Gómez, Héctor W.

doi:10.3390/sym12081327

Open AccessArticle

Bivariate Power-Skew-Elliptical Distribution

by

Guillermo Martínez-Flórez

^1,2,†,

Roger Tovar-Falón

^1,† and

Héctor W. Gómez

^3,*,†

¹

Departamento de Matemáticas y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Montería 230027, Colombia

²

Programa de Pós-Graduação em Modelagem e Métodos Quantitativos, Universidade Federal de Ceará, Fortaleza 60020-181, Brazil

³

Departamento de Matemáticas, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Symmetry 2020, 12(8), 1327; https://doi.org/10.3390/sym12081327

Submission received: 4 July 2020 / Revised: 3 August 2020 / Accepted: 5 August 2020 / Published: 9 August 2020

(This article belongs to the Special Issue Symmetric and Asymmetric Distributions: Theoretical Developments and Applications II)

Download

Browse Figures

Versions Notes

Abstract

:

In this article, we introduce a power-skew-elliptical (PSE) distribution in the bivariate setting. The new bivariate model arises in the context of conditionally specified distributions. The proposed bivariate model is an absolutely continuous distribution whose marginals are univariate PSE distributions. The special case of the bivariate power-skew-normal (BPSN) distribution is studied in details. General properties of the BPSN distribution are derived and the estimation of the unknown parameters by maximum pseudo-likelihood is discussed. Further, a sandwich type matrix, which is a consistent estimator for the asymptotic covariance matrix of the maximum likelihood (ML) estimator is determined. Two applications for real data of the proposed bivariate distribution is provided for illustrative purposes.

Keywords:

skew-elliptical distribution; exponentiated distribution; maximum pseudo-likelihood; asymmetric data; bivariate distribution

1. Introduction

Family of distributions that unify the main characteristics of other families are not very common in distributions theory. In this sense, an asymmetric family widely used in many situations and different areas of knowledge is the skew-normal (SN) distribution of [1], that is characterized because it has a wide range of asymmetry. Another well-known family in this same area is the fractional order statistical distribution, also known as exponentiated distribution or power-normal distribution introduced by [2], which is characterized by having a wide range of kurtosis. The unification of these two families was studied by [3] and was called power-skew-normal distribution. The resulting family contains, such as special cases the normal, skew-normal and power-normal distributions, and the ranges of asymmetry and kurtosis are greater than any of these three families of distributions. This type of unification of distribution families is important in the literature of distribution theory because it introduces great flexibility to the resulting models.

This model was extended to the case of elliptical distributions and was considered the case of the Birnbaum-Saunders (BS) family by [4], resulting in a large unification of useful distributions for example, to model the lifetime in survival analysis or applications in reliability theory. The model is called Birnbaum-Saunders power skew elliptical (BSPSE), and it contains as special cases as a large number of extensions of the lifetime Birnbaum-Saunders model, for both elliptical distributions and skew-elliptical families. In the univariate case, this type of results suggests that multivariate distributions constructed of unifications of distributions, it will induce multivariate models with great flexibility.

In this paper, we study a new family of bivariate distributions whose conditional densities follow a power-skew-elliptical distribution, which extends the power-skew normal model to the case of bivariate distributions, becoming a new alternative to the existing asymmetric bivariate models in literature such as the bivariate skew-normal distribution by [5] and the conditionally specified bivariate skewed distribution of [6].

A brief description of the some elliptical distributions is presented below.

1.1. Elliptical Distributions

A continuous one-dimensional random variable is said to have an elliptical distribution, if its distribution function is symmetric with support in the real number set. Specifically, a random variable X has a symmetric distribution if its probability density function (PDF) is given by

f (x) = \frac{c}{η} g (z^{2}),

(1)

for some non-negative function

g (u)

, with

u > 0

and corresponds to the kernel of the PDF such that

\int_{0}^{\infty} u^{- \frac{1}{2}} g (u) d u = 1 / c

, where

z = (x - ξ) / η

and c is a normalizing constant. The function

g (\cdot)

is known as the density-generating function. An elliptically distributed random variable X with location and scale parameters

ξ

and

η

, respectively, and density-generating function, say g is denoted by

X \sim EC (ξ, η; g)

. If

ξ = 0

and

η = 1

, then X has spherical distribution, which is denoted as

X \sim EC (0, 1; g)

.

Properties of this family have been studied in [7,8,9,10,11], among others. Particular cases of the

X \sim EC (0, 1; g)

distribution are the Pearson type VII distribution, the type Kotz distribution, the Student-t distribution with

ν

degrees of freedom, the Cauchy distribution and the normal distribution, among others. The density-generating function of the generalized normal, Cauchy, Student-t, type I logistic, type II logistic and power exponential are, respectively, given by

g (u) = {(2 π)}^{- 1 / 2} exp (- u / 2)

,

g (u) = {π (1 + u)}^{- 1}

,

g (u) = ν^{ν / 2} B {(1 / 2, ν / 2)}^{- 1} {(ν + u)}^{- (ν + 1) / 2}

, where

ν > 0

and

B (\cdot, \cdot)

is the beta function,

g (u) = c exp (- u) {(1 + exp (- u))}^{- 2}

, where

c \approx 1.484300029

is the normalizing constant obtained from

\int_{0}^{\infty} u^{- 1 / 2} g (u) d u = 1

,

g (u) = exp (- \sqrt{u}) {(1 + exp (- \sqrt{u}))}^{- 2}

and

g (u) = c (k) exp (- \frac{1}{2} u^{1 / (1 + k)})

,

- 1 < k \leq 1

, where

c (k) = Γ (1 + (k + 1) / 2) 2^{1 + (1 + k) / 2}

.

1.2. Skew-Elliptical Distribution

An extension of the elliptical model to the asymmetric case is the standard elliptical asymmetric (skew-elliptical) model defined as

\begin{matrix} h_{Y} (y; λ) = 2 f (y) F (λ y); y, λ \in R, \end{matrix}

(2)

where

f (\cdot)

is given in Equation (1),

F (\cdot)

is its respective cumulative distribution function (CDF), and

λ

is an asymmetry parameter. We use the notation

Y \sim SE (0, 1; g, λ)

. The CDF for this model is given by

H_{Y} (y) = 2 \int_{- \infty}^{y} f (t) F (λ t) d t .

(3)

Skew-elliptical distributions are discussed in [12,13,14,15,16], among others.

To follow we present some distributions belonging to this family.

1.2.1. Skew-Normal Distribution

A particular case of model in Equation (3) is the SN distribution introduced by [1], which is obtained by letting

f (\cdot) = ϕ (\cdot)

and

F (\cdot) = Φ (\cdot)

, that is, the PDF and CDF of the standard normal distribution. The PDF and CDF of the SN model are given by

h (x) = f_{SN} (x) = 2 ϕ (x) Φ (λ x); x \in R,

(4)

and

H (x) = F_{SN} (x) = Φ (x) - 2 T (x, λ); x \in R,

(5)

respectively, where

T (\cdot, \cdot)

is the Owen’s function, see [17] for more details.

1.2.2. Skew-Student-t Distribution

The skew-Student-t (SST) distribution (or skew-Pearson type IV) has the PDF given by

\begin{matrix} h (x) = f_{SST} (x) = 2 \frac{Γ (\frac{v + 1}{2})}{\sqrt{2 π} Γ (\frac{v}{2})} {(1 + \frac{x^{2}}{v})}^{\frac{v + 1}{2}} [\frac{1}{2} + λ x Γ (\frac{v + 1}{2}) \frac{F_{2 - 1} (\frac{1}{2}, \frac{v + 1}{2}, - \frac{3}{2}, - \frac{λ^{2} x^{2}}{v})}{\sqrt{v π} Γ (\frac{v}{2})}], \end{matrix}

(6)

where

v > 0

represents the degrees of freedom and

F_{2 - 1} (\cdot, \cdot, \cdot, \cdot)

is the hypergeometric function, see [18].

1.2.3. Skew-Cauchy Distribution

A continuous random variable X following skew-Cauchy (SC) distribution has PDF given by

h (x) = f_{SC} (x) = \frac{2}{π} {(1 + x^{2})}^{- 1} [\frac{1}{2} + \frac{1}{π} arctan (λ x)] .

(7)

1.2.4. Skew-Logistic Distribution

The PDF of the skew-logistic (SLOG) distribution is given by

h (x) = f_{SLOG} (x) = \frac{2 exp (- x)}{{(1 + exp (- x))}^{2}} \frac{1}{1 + exp (- λ x)} .

(8)

1.2.5. Skew-Laplace Distribution

The skew-Laplace (SL) distribution has PDF given by

h (x) = f_{SL} (x) = \{\begin{matrix} \frac{1}{2} exp ((1 + λ) x), & if x < 0, \\ exp (- x) (1 - \frac{1}{2} exp (- λ x)), & if x \geq 0 . \end{matrix}

(9)

1.3. Power-Skew-Elliptical Distribution

An alternative to the SN distribution of [1], was studied by [2] by introducing the fractional order statistical model, also known as alpha-power (AP) model, which has PDF given by

f_{AP} (z; α) = α h (z) {H (z)}^{α - 1}, z \in R,

(10)

where

H (\cdot)

is an absolutely continuous CDF with PDF

h (\cdot)

, and

α > 0

is a parameter that controls the distributional shape. The case

H (\cdot) = Φ (\cdot)

is called the power-normal (PN) distribution and has PDF given by

f_{PN} (z; α) = α ϕ (z) {Φ (z)}^{α - 1}, z \in R,

(11)

The PN model is denoted by

Z \sim PN (α)

and is considered an alternative distribution for modeling data with asymmetry and kurtosis above (or below) the expected for the normal distribution. If PDF

h (\cdot)

in model (10) has the form as in Equation (2), then the model is called PSE and its PDF is given by

f_{PSE} (z; λ, α) = α h (z; λ) {H (z; λ)}^{α - 1}, z \in R .

(12)

We will use the notation

Z \sim PSE (0, 1; g, λ, α)

. The case of this family of distributions for

h (\cdot; λ) = f_{SN} (\cdot, λ)

and

H (\cdot; λ) = F_{SN} (\cdot, λ)

was studied by [3] and it is called power-skew-normal distribution which is denoted by

PSN (λ, α)

. Some contributions to this family have been made by [19,20,21,22], among other.

Some additional works on distributions include those of [23,24], which the possibility of applying the analytical expressions for the calculation of the correct detection probability of the signal time window at synchronization has been proved.

The rest of the paper is organized as follows: Section 2 we introduce the new bivariate power-skew-elliptical family of distributions, several properties are derived and we consider the ML method for estimating the model parameters. In Section 3, we study the particular case of the bivariate BPSN model. ML estimation of the model is discussed and a reparameterization of the BPSN model is presented and we derive the information matrix. In Section 4, two applications is presented illustrating the good performance of the approaches developed in the paper.

2. Bivariate Power-Skew-Elliptical Distribution

In this section, we extend the PSE model to the bivariate case, this new model is a great extension because it is the bivariate unification of two families of distributions, on the one hand, the skew-elliptical family and on the other hand, the alpha-power family. The unification will generate a distribution with great flexibility in both asymmetry and kurtosis.

For the construction of bivariate power-skew-elliptical (BPSE) family of distributions, we will use the approach discussed in [25] which is based on conditional distributions. According to [25] a two-dimensional random vector

(X_{1}, X_{2})

is conditionally specified, if for any random variable

X_{2}

, the random variable

X_{1} ∣ X_{2} = x_{2}

is a member of a parametric family. Suppose that the joint PSE distribution function

H_{BPSE} (x_{1}, x_{2})

, of the random vector

(X_{1}, X_{2})

is such that, the conditional distribution of

X_{1}

given

X_{2} = x_{2}

and the conditional distribution of

X_{2}

given

X_{1} = x_{1}

are members of the PSE family of distributions with respect to a Lebesgue measure. We denote this by writing

X_{1} ∣ X_{2} = x_{2} \sim {PSE}_{1} (θ_{1}; g, λ_{1}, \underset{̲}{ω} (x_{2}))

(13)

and

X_{2} ∣ X_{1} = x_{1} \sim {PSE}_{2} (θ_{2}; g, λ_{2}, \underset{̲}{τ} (x_{1})),

(14)

where

\underset{̲}{ω}, \underset{̲}{τ}

are positive dependence functions which are to be determined.

In such case, we have conditionals in a given exponential family and we can identify the corresponding joint density. We can argue as follows. If

h_{X_{1}} (x_{1})

and

h_{X_{2}} (x_{2})

are marginal densities for a joint PSE density

h_{BPSE} (x_{1}, x_{2})

with conditional densities given by Equations (13) and (14), then it follows that

\begin{matrix} h_{BPSE} (x_{1}, x_{2}) & = τ (x_{1}) h_{X_{1}} (x_{1}) h_{2} (x_{2}; λ_{2}) {H_{2} (x_{2}; λ_{2})}^{τ (x_{1}) - 1}, \\ = ω (x_{2}) h_{X_{2}} (x_{2}) h_{1} (x_{1}; λ_{1}) {H_{1} (x_{1}; λ_{1})}^{ω (x_{2}) - 1} . \end{matrix}

(15)

Following [26] the solutions for dependence functions are given by

ω (x_{2}) = α_{1} - α_{12} ln [H_{2} (x_{2}; λ_{2})]

(16)

and

τ (x_{1}) = α_{2} - α_{12} ln [H_{1} (x_{1}; λ_{1})],

(17)

with

α_{1}, α_{2}

, positive real constants and

α_{12} = α_{21} \geq 0

.

Then, by using theorems in [27] and ([28], Chap. 2), we have that

\begin{matrix} h_{BPSE} (x_{1}, x_{2}) & = c (\underset{̲}{λ}, \underset{̲}{α}) h_{1} (x_{1}; λ_{1}) h_{2} (x_{2}; λ_{2}) {H_{1} (x_{1}; λ_{1})}^{α_{1} - 1} {H_{2} (x_{2}; λ_{2})}^{α_{2} - 1} \\ \times exp {α_{12} ln [H_{1} (x_{1}; λ_{1})] ln [H_{2} (x_{2}; λ_{2})]}, \end{matrix}

(18)

where

λ_{1}, λ_{2} \in R

, the constants

α_{1}, α_{2} > 0

and

α_{12} \geq 0

in order to guarantee

\int_{R} \int_{R} h (x_{1}, x_{2}) d x_{1} d x_{2} < \infty

, and

c (\underset{̲}{λ}, \underset{̲}{α})

is a normalizing constant with

\underset{̲}{λ} = {(λ_{1}, λ_{2})}^{⊤}

and

\underset{̲}{α} = {(α_{1}, α_{2}, α_{12})}^{⊤}

.

The independence case, where the joint distribution is the product of two PSE densities, is followed by taking

α_{12} = 0

, with

c (\underset{̲}{λ}, \underset{̲}{α}) = α_{1} α_{2}

.

Different conditional bivariate skew-elliptical distributions can be obtained from the generating function

g (\cdot)

, such as presented above, skew-normal, skew-Cauchy, skew-Student-t, skew-logistic and skew-Laplace, among others. This new family, which we will denote by

{BPSE}_{g}

generates a large number of bivariate distributions according to generating function

g (\cdot)

, in addition to those already mentioned, one could also talk about the distributions: special case, type Kotz, Bessel, and the many representations of the Pearson family of distributions different to the Cauchy and the Student-t. Thus, we define a broad flexible family of asymmetric bivariate distributions.

According to Equations (16) and (17), it follows that conditional distributions are given by

\begin{matrix} h_{X_{1} ∣ X_{2}} (x_{1} ∣ x_{2}) = ω (x_{2}) h_{1} (x_{1}; λ_{1}) {H_{1} (x_{1}; λ_{1})}^{α_{1} - 1} exp {α_{12} ln [H_{1} (x_{1}; λ_{1}) ln [H_{2} (x_{2}; λ_{2})]]}, \end{matrix}

(19)

and

h_{X_{2} ∣ X_{1}} (x_{2} ∣ x_{1}) = τ (x_{1}) h_{2} (x_{2}; λ_{2}) {H_{2} (x_{2}; λ_{2})}^{α_{2} - 1} exp {α_{12} ln [H_{1} (x_{1}; λ_{1}) ln [H_{2} (x_{2}; λ_{2})]]},

(20)

and hence, it follows that (19) and (20) belong to the exponentiated families of densities (13) and (14), where

h_{i} (\cdot)

and

H_{i} (\cdot)

, for

i = 1, 2,

are known density and distribution functions, respectively, while the marginal densities are given by

\begin{matrix} h_{X_{1}} (x_{1}) = c (\underset{̲}{λ}, \underset{̲}{α}) \frac{h_{1} (x_{1}; λ_{1}) {H_{1} (x_{1}; λ_{1})}^{α_{1} - 1}}{α_{2} - α_{12} ln [H_{1} (x_{1}; λ_{1})]} \end{matrix}

(21)

and

\begin{matrix} h_{X_{2}} (x_{2}) = c (\underset{̲}{λ}, \underset{̲}{α}) \frac{h_{2} (x_{2}; λ_{2}) {H_{2} (x_{2}; λ_{2})}^{α_{2} - 1}}{α_{1} - α_{12} ln [H_{2} (x_{2}; λ_{2})]} \end{matrix}

(22)

It follows that the CDFs of the conditioned PDFs given in Equations (19) and (20) are given by

H_{X_{1} ∣ X_{2}} (x_{1} ∣ x_{2}) = {\{H_{1} (x_{1}; λ_{1})\}}^{α_{1} - α_{12} ln [H_{2} (x_{2}; λ_{2})]},

(23)

and

H_{X_{2} ∣ X_{1}} (x_{2} ∣ x_{1}) = {\{H_{2} (x_{2}; λ_{2})\}}^{α_{2} - α_{12} ln [H_{1} (x_{1}; λ_{1})]} .

(24)

The rth moment can be calculated by using

\begin{matrix} E [X_{1}^{r} ∣ X_{2} = x_{2}] & = (α_{1} - α_{12} ln [H_{2} (x_{2}; λ_{2})]) \int_{0}^{1} {[H_{1}^{- 1} (v_{1}; λ_{1})]}^{r} v_{1}^{α_{1} - 1 - α_{12} ln [H_{2} (x_{2}; λ_{2})]} d v_{1}, \\ E [X_{2}^{r} ∣ X_{1} = x_{1}] & = (α_{2} - α_{12} ln [H_{1} (x_{1}; λ_{1})]) \int_{0}^{1} {[H_{2}^{- 1} (v_{2}; λ_{2})]}^{r} v_{2}^{α_{2} - 1 - α_{12} ln [H_{1} (x_{1}; λ_{1})]} d v_{2} . \end{matrix}

where

H_{i}^{- 1} (\cdot; \cdot)

is the inverse function of the CDF

H_{i} (\cdot; \cdot)

,

i = 1, 2

. To study the correlation between

X_{1}

and

X_{2}

, we can compute the measure

ρ_{X_{1} X_{2}} = \frac{E [X_{1} X_{2}] - E [X_{1}] E [X_{2}]}{σ_{X_{1}} σ_{X_{2}}} = \frac{σ_{X_{1} X_{2}}}{σ_{X_{1}} σ_{X_{2}}},

where

\begin{matrix} σ_{X_{1} X_{2}} & = \int_{0}^{1} \int_{0}^{1} H_{1}^{- 1} (v_{1}; λ_{1}) H_{2}^{- 1} (v_{2}; λ_{2}) v_{1}^{α_{1} - 1} v_{2}^{α_{2} - 1} \\ \times [v_{1}^{- α_{12} ln (v_{2})} - \frac{1}{(α_{2} - α_{12} ln (v_{1})) (α_{1} - α_{12} ln (v_{2}))}] d v_{1} d v_{2}, \\ σ_{X_{1}}^{2} & = \int_{0}^{1} \frac{{[H_{1}^{- 1} (v_{1}; λ_{1})]}^{2} v_{1}^{α_{1} - 1}}{α_{2} - α_{12} ln (v_{1})} d v_{1} - {\{\int_{0}^{1} \frac{H_{1}^{- 1} (v_{1}; λ_{1}) v_{1}^{α_{1} - 1}}{α_{2} - α_{12} ln (v_{1})} d v_{1}\}}^{2}, \\ σ_{X_{2}}^{2} & = \int_{0}^{1} \frac{{[H_{2}^{- 1} (v_{2}; λ_{2})]}^{2} v_{2}^{α_{2} - 1}}{α_{1} - α_{12} ln (v_{2})} d v_{2} - {\{\int_{0}^{1} \frac{H_{2}^{- 1} (v_{2}; λ_{2}) v_{2}^{α_{2} - 1}}{α_{1} - α_{12} ln (v_{2})} d v_{2}\}}^{2} . \end{matrix}

Moreover, we can consider

{\hat{ρ}}_{X_{1} X_{2}} = \frac{{\hat{σ}}_{X_{1} X_{2}}}{{\hat{σ}}_{X_{1}} {\hat{σ}}_{X_{2}}} .

as an estimator of

ρ_{X_{1}, X_{2}}

.

Statistical Inference for the Bpse Model

The normalization constant

c (\underset{̲}{λ}, \underset{̲}{α})

in the joint PDF

h (x_{1}, x_{2})

makes difficult the parameters estimation by maximizing the likelihood function. As alternative, we will follow the proposal of [29] for the parameters estimation of multivariate distributions, that is, we will maximize the pseudo-likelihood function. The pseudo-likelihood function is defined as the product of conditional functions. In this case, as in the ML method, the logarithm of the product of conditional distributions is maximized and this eliminates the logarithm of the normalization constant

c (\underset{̲}{λ}, \underset{̲}{α})

within the estimation process, which, as in this case will contain multiple integrals in its structure. Another important characteristic of this estimation process is that the maximum pseudo-likelihood estimators vector of the model parameters is consistent and converges asymptotically to a multivariate normal distribution.

Hence, given a random sample of vectors

(x_{11}, x_{21}), (x_{12}, x_{22}), \dots, (x_{1 n}, x_{2 n}),

with bivariate joint distribution PSE, the pseudo-likelihood function based on the conditional densities of the BPSE distribution, is given by

L_{p} (\underset{̲}{β}) = h_{X_{1} ∣ X_{2}} (x_{1} ∣ x_{2}) h_{X_{2} ∣ X_{1}} (x_{2} ∣ x_{1}),

(25)

where

\underset{̲}{β} = (\underset{̲}{θ_{1}}, \underset{̲}{θ_{2}}, λ_{1}, λ_{2}, α_{1}, α_{2}, α_{12})

, with

\underset{̲}{θ_{1}}

and

\underset{̲}{θ_{2}}

being the parameters of the

h_{X_{1} ∣ X_{2}} (x_{1} ∣ x_{2})

and

h_{X_{2} ∣ X_{1}} (x_{2} ∣ x_{1})

distributions, respectively. Thus, the maximum pseudo-likelihood estimator of

\underset{̲}{β}

is defined as the value

\underset{̲}{β_{0}}

of

\underset{̲}{β}

which maximizes the pseudo-likelihood function.

The log-pseudo-likelihood function is defined as the logarithm of the pseudo-likelihood function and for BPSE model it is expressed by

\begin{matrix} ℓ_{P} (\underset{̲}{β}) & = \sum_{i = 1}^{n} \sum_{j = 1}^{2} ln (α_{j} - \sum_{\begin{matrix} k = 1 \\ k \neq j \end{matrix}}^{2} α_{j k} ln [H_{k} (x_{k i}; \underset{̲}{θ_{k}}, λ_{k})]) + \sum_{i = 1}^{n} \sum_{j = 1}^{2} ln [h_{j} (x_{j i}; \underset{̲}{θ_{j}}, λ_{j})] \\ + \sum_{i = 1}^{n} \sum_{j = 1}^{2} (α_{j} - 1) ln [H_{j} (x_{j i}; \underset{̲}{θ_{j}}, λ_{j})] \\ + \sum_{i = 1}^{n} \sum_{j = 1}^{2} \sum_{\begin{matrix} k = 1 \\ k \neq j \end{matrix}}^{2} α_{j k} ln [H_{j} (x_{j i}; \underset{̲}{θ_{j}}, λ_{j})] ln [H_{k} (x_{k i}; \underset{̲}{θ_{k}}, λ_{k})] . \end{matrix}

(26)

The pseudo-score function is defined as the partial derivatives of the log-pseudo-likelihood function with respect to each of the parameters model, this is denoted by

U_{p} (\underset{̲}{β}) = {(U_{p} (\underset{̲}{θ_{1}}), U_{p} (\underset{̲}{θ_{2}}), U_{p} (λ_{1}), U_{p} (λ_{2}), U_{p} (α_{1}), U_{p} (α_{2}), U_{p} (α_{12}))}^{⊤} .

In accordance with [30], the pseudo likelihood estimator

\underset{̲}{\tilde{β}}

of

\underset{̲}{β}

is consistent, asymptotically normally distributed with covariance matrix given by

Σ (\underset{̲}{\tilde{β}}) = \frac{1}{n} Γ^{- 1} (\underset{̲}{β}) Ψ (\underset{̲}{β}) Γ^{- 1} (\underset{̲}{β}) .

Cheng, C. and Riu, J. [31] consider a sandwich type estimator, which is consistent for the asymptotic covariance matrix to estimate

Σ (\underset{̲}{\tilde{β}})

which is given by

\hat{Σ} (\underset{̲}{\tilde{β}}) = \frac{1}{n} {\hat{Γ}}_{n}^{- 1} (\underset{̲}{\tilde{β}}) {\hat{Ψ}}_{n} (\underset{̲}{\tilde{β}}) {\hat{Γ}}_{n}^{- 1} {(\underset{̲}{\tilde{β}})}^{⊤},

where

{\hat{Γ}}_{n} (\underset{̲}{β}) = - \frac{1}{n} \sum_{i = 1}^{n} \frac{\partial}{\partial {\underset{̲}{β}}^{⊤}} U_{i} (\underset{̲}{β}) |_{\tilde{β}} and {\hat{Ψ}}_{n} (\underset{̲}{β}) = \frac{1}{n} \sum_{i = 1}^{n} U_{i} (\underset{̲}{β}) U_{i} {(\underset{̲}{β})}^{⊤} |_{\underset{̲}{\tilde{β}}},

with

U_{i} (\underset{̲}{β}) = \frac{\partial}{\partial β} ℓ_{P} (\underset{̲}{β})

, is the score vector for the pseudo-likelihood function.

The structure of the

Γ

and

Ψ

matrices are going to depend on the PDF in the BPSE model, specifically of the

g (\cdot)

generating function. The non-singularity of

Σ

must be treated in each particular case, as well as its possible solution in case of singularity of this matrix.

3. Bivariate Power-Skew-Normal Model

In this section, we study the bivariate PSE distribution when density functions

h_{1} (x_{1}; λ_{1})

and

h_{2} (x_{2}; λ_{2})

correspond to the SN density of [1], that is,

h_{i} (x_{i}; λ_{i}) = f_{SN} (x_{i}; λ_{i}) = 2 ϕ (x_{i}) Φ (λ_{i} x_{i}) and H_{i} (x_{i}; λ_{i}) = F_{SN} (x_{i}; λ_{i}) = Φ (x_{i}) - 2 T (x_{i}; λ_{i}) .

In this case, the BPSN probability density function is given by

\begin{matrix} h_{BPSN} (x_{1}, x_{2}) & = & c (\underset{̲}{λ}, \underset{̲}{α}) f_{SN} (x_{1}; λ_{1}) f_{SN} (x_{2}; λ_{2}) {F_{SN} (x_{1}; λ_{1})}^{α_{1} - 1} {F_{SN} (x_{2}; λ_{2})}^{α_{2} - 1} \\ \times exp {α_{12} ln [F_{SN} (x_{1}; λ_{1})] ln [F_{SN} (x_{2}; λ_{2})]}, \end{matrix}

(27)

with

λ_{1}, λ_{2} \in R

, and

α_{1}, α_{2}, α_{12} > 0 .

The normalization constant can be written by using the transformation

v_{j} = F_{SN} (x_{j}; λ_{j})

for

j = 1, 2

as

c (\underset{̲}{λ}, \underset{̲}{α}) = c (\underset{̲}{α}) = {(\int_{0}^{1} \int_{0}^{1} v_{1}^{α_{1} - 1} v_{2}^{α_{2} - 1} exp (α_{12} ln (v_{1}) ln (v_{2})))}^{- 1} .

This standard BPSN model will be denoted by

BPSN (λ_{1}, λ_{2}, α_{1}, α_{2}, α_{12})

. For

λ_{1} = λ_{2} = 0

the bivariate conditional exponentiated normal model, studied by [26] is obtained, while for

λ_{1} = λ_{2} = α_{12} = 0

, the bivariate joint distribution of independent power-skew-normal random variables is obtained; and for

λ_{1} = λ_{2} = α_{12} = 0

and

α_{1} = α_{2} = 1

, the bivariate joint distribution of independent normal random variables is obtained. Note that, if

α_{1} = α_{2} = 1

, then it would have a type of bivariate conditional SN distribution.

The location-scale extension of the BPSN model can be written as

\begin{matrix} h_{BPSN} (x_{1}, x_{2}) & = & \frac{c (\underset{̲}{λ}, \underset{̲}{α})}{η_{1} η_{2}} f_{SN} (z_{1}; λ_{1}) f_{SN} (z_{2}; λ_{2}) {F_{SN} (z_{1}; λ_{1})}^{α_{1} - 1} {F_{SN} (z_{2}; λ_{2})}^{α_{2} - 1} \\ \times exp {α_{12} ln [F_{SN} (z_{1}; λ_{1})] ln [F_{SN} (z_{2}; λ_{2})]}, \end{matrix}

(28)

where

z_{j} = (x_{j} - ξ_{j}) / η_{j}

, with

- \infty < ξ_{j} < \infty

and

η_{j} > 0

, for

j = 1, 2

. We will denote it by

BPSN ((ξ_{1}, η_{1}), (ξ_{2}, η_{2}), λ_{1}, λ_{2}, α_{1}, α_{2}, α_{12})

. Figure 1 presents the contour graphs of the BPSN model for some selected values of the parameters.

Table 1 shows the correlation coefficients of the BPSN model for values

λ_{1}

ranging from

- 2.5

to

2.5

and from

0.5

to

0.5

;

λ_{2}

ranging from

- 1.5

to

2.0

and from

0.5

to

0.5

;

α_{1} = 3.25

,

α_{2} = 2.75

and

α_{12} = 0.5

. It can be observed that BPSN model is very flexible in terms of correlation, since

ρ \in (- 0.9933, 1.0)

, which contains the correlation coefficients of the bivariate conditional exponentiated: normal, logistic and Student-t models, for some values of

α_{1} \in

(0.5,10),

α_{2} \in (0.25, 10)

and

α_{12} \in (0.5, 2.5)

, see [26], for more details. According to [26], for bivariate conditional exponentiated normal model with

α_{1}, α_{2} \in (0.4, 100)

and

α_{12} \in (0.2, 100)

, the range of possible values for the correlation coefficient is on interval

(- 0.8634, 0.9247)

. Likewise, the range of possible values for the correlation coefficient for the BPSN model contains the respective range of possible values of the bivariate conditional exponentiated model studied by [28] which is

(0.20, 0.60)

for

α_{12} \in (0, 1000)

, and finally this range also contains the range of possible values for the correlation coefficient of the conditionally specified bivariate skewed model of [6], which is

(- 0.63662, 0.63662)

.

3.1. Statistical Inference

We consider a random sample of vectors following a BPSN distribution. The corresponding log-pseudo-likelihood function for the parameter vector

\underset{̲}{β} = {((ξ_{1}, η_{1}), (ξ_{2}, η_{2}), λ_{1}, λ_{2}, α_{1}, α_{2}, α_{12})}^{⊤}

, is given by

\begin{matrix} ℓ_{P} (\underset{̲}{β}) & = \sum_{i = 1}^{n} ln (α_{1} - α_{12} ln [F_{SN} (z_{2 i}; λ_{2})]) + \sum_{i = 1}^{n} ln [f_{SN} (z_{1 i}; λ_{1})] \\ + \sum_{i = 1}^{n} ln (α_{2} - α_{12} ln [F_{SN} (z_{1 i}; λ_{1})]) + \sum_{i = 1}^{n} ln [f_{SN} (z_{2 i}; λ_{2})] \\ + \sum_{i = 1}^{n} (α_{1} - 1) ln [F_{SN} (z_{1 i}; λ_{1})] + \sum_{i = 1}^{n} (α_{2} - 1) ln [F_{SN} (z_{2 i}; λ_{2})] \\ - 2 \sum_{i = 1}^{n} α_{12} ln [F_{SN} (z_{1 i}; λ_{1})] ln [F_{SN} (z_{2 i}; λ_{2})], \end{matrix}

(29)

where

z_{j i} = (x_{j i} - ξ_{j}) / η_{j}

. Then, the pseudo-score function which is denoted by

U_{p} (\underset{̲}{β}) = {(U_{p} (ξ_{1}), U_{p} (η_{1}), U_{p} (ξ_{2}), U_{p} (η_{2}), U_{p} (λ_{1}), U_{p} (λ_{2}), U_{p} (α_{1}), U_{p} (α_{2}), U_{p} (α_{12}))}^{⊤} .

has elements given by

\begin{matrix} U_{P} (ξ_{j}) = & \frac{α_{12}}{η_{j}} \sum_{i = 1}^{n} \frac{W_{(1) j i}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} + \frac{1}{η_{j}} \sum_{i = 1}^{n} [z_{j i} - \sqrt{\frac{2}{π}} λ_{j} W_{(2) j i}] \\ - \frac{1}{η_{j}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] W_{(1) j i}, j = 1, 2, \end{matrix}

\begin{matrix} U_{P} (η_{j}) = & \frac{α_{12}}{η_{j}} \sum_{i = 1}^{n} \frac{z_{j i} W_{(1) j i}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} + \frac{1}{η_{j}} \sum_{i = 1}^{n} [z_{j i}^{2} - 1 - \sqrt{\frac{2}{π}} λ_{j} z_{j i} W_{(2) j i}] \\ - \frac{1}{η_{j}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] z_{j i} W_{(1) j i}, j = 1, 2, \end{matrix}

\begin{matrix} U_{P} (λ_{j}) = & \sqrt{\frac{2}{π}} \frac{α_{12}}{1 + λ_{j}^{2}} \sum_{i = 1}^{n} \frac{W_{(3) j i}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} + \sqrt{\frac{2}{π}} \sum_{i = 1}^{n} z_{j i} W_{(2) j i} \\ - \sqrt{\frac{2}{π}} \frac{1}{1 + λ_{j}^{2}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] W_{(3) j i}, j = 1, 2, \end{matrix}

U_{P} (α_{j}) = \sum_{i = 1}^{n} \frac{1}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} + \sum_{i = 1}^{n} ln (F_{SN} (z_{j i}; λ_{j})), j = 1, 2,

and

U_{P} (α_{12}) = \sum_{i = 1}^{n} \sum_{j = 1}^{2} \frac{- ln (F_{SN} (z_{j i}; λ_{j}))}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} - 2 \sum_{i = 1}^{n} ln (F_{SN} (z_{1 i}; λ_{1})) ln (F_{SN} (z_{2 i}; λ_{2}))

where

W_{(1) j i} = \frac{f_{SN} (z_{j i}; λ_{j})}{F_{SN} (z_{j i}; λ_{j})}

,

W_{(2) j i} = \frac{ϕ (\sqrt{1 + λ_{j}^{2}} z_{j i})}{f_{SN} (z_{j i}; λ_{j})}

,

W_{(3) j i} = \frac{ϕ (\sqrt{1 + λ_{j}^{2}} z_{j i})}{F_{SN} (z_{j i}; λ_{j})}

.

The solution to the system of non-linear equations

U_{p} (ξ_{1}) = 0, U_{p} (η_{1}) = 0, U_{p} (ξ_{2}) = 0, U_{p} (η_{2}) = 0, U_{p} (λ_{1}) = 0, U_{p} (λ_{2}) = 0, U_{p} (α_{1}) = 0, U_{p} (α_{2}) = 0, U_{p} (α_{12}) = 0

, leads to pseudo-likelihood estimates of the parameter vector of the BPSN model, this system must be solved by using iterative numerical algorithms.

For estimating the covariance matrix, we have from [31] that, thte

Ψ (\cdot)

matrix will be estimated from the pseudo-score function given above, while the

Γ (\cdot)

component of the

Σ (\cdot)

matrix, we will write it in the form

Γ = (γ_{β_{j} β_{j^{'}}}) = - \frac{1}{n} K

where

K = (κ_{β_{j} β_{j^{'}}})

is a matrix of second derivatives of the pseudo-likelihood function, with respect to the parameters of the model, the elements

κ_{β_{j} β_{j^{'}}}

of this matrix are presented in the Appendix A.

3.2. Reparameterization for the Bpsn Model

As well known in the literature of distributions theory, the SN model has a singular information matrix for

λ = 0

, however, it has been proposed to perform a reparameterization of the model parameters, [32], this problem is presented by several extensions of the SN model, including for example, the power-skew-normal model whose information matrix is singular for

λ = 0

and

α = 1

, here, [33] present a reparameterization of the models parameters for this case.

Another solution to the problem of the singularity of the SN model when

λ = 0

was presented by [1], which consists of a representation of the form

Y = μ + σ \frac{Z - E [Z]}{\sqrt{Var [Z]}}

, where

μ \in R

and

σ > 0

are parameters of the random variable Y, and

Z \sim SN (λ)

. This representation is called centered parametrization, since

E [Y] = μ

and

Var [Y] = σ^{2} .

The new representation has parameters vector

θ = {(μ, σ, γ_{1})}^{⊤}

, where

- 0.9953 \leq γ_{1} \leq 0.9953

represents the asymmetry coefficient of the random variable Y. Under this representation, the information matrix of the new parameters vector turns out to be non-singular. Thus, the information matrix is written in the form

D I_{λ} D^{⊤}

where

I_{λ}

is the information matrix of the model with parameters vector

{(ξ, η, λ)}^{⊤}

and

D

is a matrix of derivatives of the parameters vector

{(ξ, η, λ)}^{⊤}

with respect to the parameters vector

θ

. Under this reparameterization, we have the relationship

ξ = μ - c σ γ_{1}^{1 / 3}, η = σ \sqrt{1 + c^{2} γ_{1}^{2 / 3}} and λ = \frac{c γ_{1}^{1 / 3}}{\sqrt{b^{2} + c^{2} (b^{2} - 1) γ_{1}^{2 / 3}}},

(30)

with

b = \sqrt{\frac{2}{π}}

and

c = {2 / (4 - π)}^{1 / 3} .

Also it has that, when

λ \to 0,

the information matrix converges to the diagonal matrix

Σ_{c} = diag (σ^{2}, σ^{2} / 2, 6) .

This guarantees the existence and uniqueness of the MLE of

ξ

and

η

, for each fixed value of

λ

, see [1].

The BPSN model also inherits the singularity problem in the matrix

Γ (\cdot)

for values close to

λ_{1} = λ_{2} = 0

and

α_{1} = α_{2} = 1

, specifically, for values close

λ_{1} = 0

, the columns corresponding to the elements

k_{ξ_{1} β_{j}}

and

k_{λ_{1} β_{j}}

, where

β_{j}

is related to the rest of the parameters, are linearly dependent. The same way, it happens for values close

λ_{2} = 0 .

This situation also occurs in the pseudo-score function, leading to problems to guarantee the existence and uniqueness of the pseudo-estimated values. This leads to the non-existence of the inverse of the

Γ (\cdot)

matrix and therefore, the covariance matrix

Σ (\cdot)

can not be calculated.

Following [1], we define

Y_{j} = μ_{j} + σ_{j} \frac{Z_{j} - E [Z_{j}]}{\sqrt{Var [(Z_{j})]}}

for

j = 1, 2

, where

Z_{j} \sim SN (λ_{j})

, then we arrive to representation of the BPSN with parameters vector

\underset{̲}{θ} = {(μ_{1}, μ_{2}, σ_{1}, σ_{2}, γ_{j 1}, γ_{j 2}, α_{1}, α_{2}, α_{12})}^{⊤}

where, for

j = 1, 2

, it has

- 0.9953 \leq γ_{j 1} \leq 0.9953

. As showed by [1], this representation can be written in the form

Y_{j} = ξ_{j} + η_{j} Z

, for

j = 1, 2

, that is,

Y_{j} \sim SN (ξ_{j}, η_{j}, λ_{j})

where

ξ_{j} = μ_{j} - c σ_{j} γ_{j 1}^{1 / 3}, η_{j} = σ_{j} \sqrt{1 + c^{2} γ_{j 1}^{2 / 3}} and λ_{j} = \frac{c γ_{j 1}^{1 / 3}}{\sqrt{b^{2} + c^{2} (b^{2} - 1) γ_{j 1}^{2 / 3}}},

(31)

for

j = 1, 2

. We denote it by

{SN}_{c} (μ_{j}, σ_{j}, γ_{j 1})

. Here, the new bivariate centered power-skew-normal model (BPSN

_{c}

) is defined just as the PDF defined by model in Equation (28), where

ξ_{j},

η_{j}

and

λ_{j}

are defined as in Equation (31). Given the relationship in Equation (31), ML estimates for the vector

\underset{̲}{θ} = {(μ_{1}, μ_{2}, σ_{1}, σ_{2}, γ_{j 1}, γ_{j 2}, α_{1}, α_{2}, α_{12})}^{⊤}

can be obtained from the estimates of the original model, that is, the estimates of maximum pseudo-likelihood for

α_{1}, α_{2}

and

α_{12}

are the same as the model without reparametrizating, while for the parameters subvector

{(μ_{1}, μ_{2}, σ_{1}, σ_{2}, γ_{j 1}, γ_{j 2})}^{⊤}

, it can be obtained by using the inverse relationships of (31), that is, from:

μ_{j} = ξ_{j} + b σ_{j} δ_{j}, σ_{j} = η_{j} \frac{{[1 + η_{j} (1 - b^{2})]}^{1 / 2}}{{(1 + λ_{j}^{2})}^{1 / 2}}, γ_{j 1} = \frac{4 - π}{2} {(b λ_{j})}^{3} {[1 + λ_{j}^{2} (1 - b^{2})]}^{- 3 / 2},

(32)

where

δ_{j} = λ_{j} / \sqrt{1 + λ_{j}^{2}}

. From (32), if

λ_{j} \approx 0

, then

γ_{j 1} \approx 0

, where

γ_{j 1}

corresponds to the asymmetry coefficient of the random variable

Y_{j}

. In this way, it is important to analyze the magnitude of the sample asymmetry coefficient of each variable.

The covariance matrix of [31] is obtained in the same way as in BPSN model, however, it can be demonstrated that the covariance matrix in the BPSN

_{c}

model lets herself be written as

Σ_{c} (\underset{̲}{\tilde{θ}}) = \frac{1}{n} {(D Γ (\underset{̲}{β}) D^{⊤})}^{- 1} D Ψ (\underset{̲}{β}) D^{⊤} {(D Γ (\underset{̲}{β}) D^{⊤})}^{- 1} .

where

D = \frac{\partial \underset{̲}{β}}{\partial \underset{̲}{θ}}

.

It can be shown that

D

is a block-diagonal matrix, given by

D = diag (D_{1}, D_{2}, I_{3}),

where

I_{3}

is an identity matrix of dimension

3 \times 3

, and

D_{j}

for

j = 1, 2

is the derivative matrix of the parameter vector

{(ξ_{j}, η_{j}, λ_{j})}^{⊤}

with respect to the vector

{(μ_{j}, σ_{j}, γ_{1 j})}^{⊤}

. The elements of this matrix can be found in their general form in ([34], p. 68). When

γ_{1 j} \to 0

, then

D_{j} Γ (\underset{̲}{β_{j}}) D_{j}^{⊤}

tends to the diagonal matrix

diag (σ_{j}^{2}, σ_{j}^{2} / 2, 6)

which is non-singular, see [1,34].

According to [1], rewriting

\underset{̲}{β} = {(\underset{̲}{β_{1}}, \underset{̲}{β_{2}}, \underset{̲}{α})}^{⊤}

, where

\underset{̲}{β_{j}} = {(ξ_{j}, η_{j}, λ_{j})}^{⊤}

, it can be shown that the

D_{j} Γ (\underset{̲}{β_{j}}) D_{j}^{⊤}

sub matrices for

j = 1, 2

, of the

D Γ (\underset{̲}{β}) D^{⊤}

matrix are non-singular and therefore, the rows of the

D Γ (\underset{̲}{β}) D^{⊤}

matrix are linearly independent, that is, this matrix is invertible, which guarantees the existence of the

Σ_{c} (\underset{̲}{\tilde{θ}})

matrix. The estimator of the covariance matrix

Σ_{c} (\underset{̲}{\tilde{θ}})

is obtained by replacing

\underset{̲}{β}

and

\underset{̲}{θ}

by their respective estimators.

4. Numerical Illustrations

4.1. Illustration 1

We consider an illustration where we study the fit of the BPSN model for a data set studied in [35]. We pooled together, the 50 Iris-setosa data points, the 50 Iris-versicolor data points and the 50 Iris-virginica data points, to get a total sample size of

n = 150

. Descriptive statistics for the data set are presented in Table 2. Quantities

\sqrt{b_{1}}

and

b_{2}

correspond to sample asymmetry and kurtosis coefficients.

Table 2 shows that variables

x_{1}

and

x_{2}

present high asymmetry, so a BPSN model could fit this data set. Under normality assumption, hypothesis tests for the asymmetry coefficient of

x_{1}

and

x_{2}

, (

H_{0} : \sqrt{β_{j 1}} = 0

,

j = 1, 2 .

) show test statistics

7.716

and

7.815

, respectively, values well above the percentile of

χ_{(1)}^{2}

distribution, at level 5% whose value is

3.84

, which indicates that the asymmetry in each variable is very important. Likewise, the univariate normality tests of [36] show p-values of

0.0225

and

0.0202

, concluding that the distributions are asymmetric.

The bivariate normality tests of [37,38,39,40] yielded test statistics (with p-values in parentheses), of 2.8909 (0.0000), 9.3258 (0.0094) and 62.2958 (0.0000), respectively, hence, it is concluded that the bivariate observations vector does not follow a bivariate normal distribution. Thus, an asymmetric bivariate distribution, such as the BPSN model may be useful to fit this data set. Hence, the bivariate normal distribution is not a tenable model for the data under study, and an alternative model that is able to incorporate some degree of asymmetry would probably fit the data better. We fitted the conditional bivariate skew-normal model (see, [6]), the bivariate power-normal (BPN) model and the BPSN model. To compare fitted model, we make use of the Akaike Information Criterion (AIC), by [41] and the corrected AIC (CAIC) [42]. These measures are defined as follows

A I C = - 2 ℓ (\hat{θ}) + 2 p and C A I C = - 2 ℓ (\hat{θ}) + \frac{2 n (p + 1)}{n - p - 2} .

We used the optim fuction of statistical package [43] for fitting the bivariate model. To choose the initial values in the iterative estimation process, for

α_{1}, α_{2}, α_{12},

in the BPSN model we use the transformation

Y_{j} = - ln (F_{SN} (z_{1 i}; λ_{j}))

for

j = 1, 2

which yields the bivariate exponential conditionals model discussed in detail by [28].

f_{Y_{1} Y_{2}} (y_{1}, y_{2}) = k (α_{1}, α_{2}, α_{12}) exp (- α_{1} y_{1} - α_{2} y_{2} - α_{12} y_{1} y_{2})

(33)

This transformation leads to obtaining estimates by the method of the moments of

α_{1},

α_{2}

and

α_{12}

, which are consistent and asymptotically normal, see [28]. These estimators are given by:

{\tilde{α}}_{1} = \frac{\tilde{γ}}{{\bar{y}}_{1} (\tilde{γ} + I (\tilde{γ} - 1))}, {\tilde{α}}_{2} = \frac{\tilde{γ}}{{\bar{y}}_{2} (\tilde{γ} + I (\tilde{γ} - 1))} and {\tilde{α}}_{12} = \frac{\tilde{γ} (\tilde{γ} - 1)}{{\bar{y}}_{1} {\bar{y}}_{2} (\tilde{γ} + I (\tilde{γ} - 1))},

where

\tilde{γ} = \frac{I}{1 + ρ_{Y_{1} Y_{2}} I} c

with

ρ_{Y_{1} Y_{2}} = cor (y_{1}, y_{2})

and

I = cv (y_{1}) cv (y_{2}),

where cor is the usual Pearson correlation between

Y_{1} and Y_{2}

, and

cv (y) = \sqrt{S_{y}^{2}} / \bar{y}

.

For the initial points of

λ_{1},

λ_{2}

, and the location-scale parameters, we fit the univariate SN distributions by using the selm function of [43], the moment estimators of these parameters given in [44], and the values of the means and standard deviations of the bivariate normal model. Finally with the obtained values for

(({\tilde{ξ}}_{1}, {\tilde{η}}_{1}), ({\tilde{ξ}}_{2}, {\tilde{η}}_{2}), {\tilde{λ}}_{1}, {\tilde{λ}}_{2}, {\tilde{α}}_{1}, {\tilde{α}}_{2}, {\tilde{α}}_{12})

, the iterative estimation process is started.

Table 3 presents ML estimates, AIC and CAIC values for BCSN, BPN and BPSN models, which is the one corresponding to the best (smallest AIC or CAIC) model fitting, which clearly indicates a better fit for BPSN model. Contour plots for the BPN, bivariate conditional skew-normal (BCSN) and BPSN distributions are presented in Figure 2.

Initially, the BPN model is compared to the BPSN model by the hypothesis tests

H_{0} : (λ_{1}, λ_{2}) = (0, 0) versus H_{1} : (λ_{1}, λ_{2}) \neq (0, 0) .

by using the likelihood ratio statistic,

Λ = \frac{L_{BPN} (\hat{θ})}{L_{BPSN} (\hat{θ})}

where

L_{BPN} (\cdot)

and

L_{BPSN} (\cdot)

are the pseudo-likelihood function of the BPN ans BPSN model, respectively. We obtain

- 2 log (Λ) = - 2 (ℓ_{BPN} (\hat{θ}) - ℓ_{BPSN} (\hat{θ})) = 6.704

which is greater than the value of the

χ_{2, 95 %}^{2} = 5.99

. Then the BPSN model is a good alternative for fitting the data set. This result suggests the importance of the parameters

λ_{1}

and

λ_{2}

in the good fit of the BPSN model, the effect of these two parameters can be seen in the graph (c) of the Figure 2. The contour graphs in the Figure show that the BPSN model manages to better capture the distribution of the data set under consideration, since more points are contained within the contours of the BPSN distribution compared to the BCSN and BPN models.

Another hypothesis of special interest is the significance of the parameter vector

(α_{1}, α_{2}, α_{12}),

in this particular case there is interest in the hypothesis set

H_{0} : (α_{1}, α_{2}, α_{12}) = (1, 1, 0) versus H_{1} : (α_{1}, α_{2}, α_{12}) \neq (1, 1, 0)

So, under

H_{0},

we have the independent bivariate SN distribution for

(X_{1}, X_{2})

. An appropriate test follows by using a statistic of type Wald which follows from the asymptotic normality of the maximum pseudo-likelihood estimator

\underset{̲}{\hat{β}}

. This statistic can be defined as

W_{n} = {(A \underset{̲}{\hat{β}} - m)}^{⊤} {({\hat{Σ}}_{3} (\underset{̲}{\hat{β}}))}^{- 1} (A \underset{̲}{\hat{β}} - m),

where

{\hat{Σ}}_{3} (\underset{̲}{\hat{β}})

is a submatrix of

\hat{Σ} (\underset{̲}{\tilde{β}})

corresponding to the vector

(α_{1}, α_{2}, α_{12}),

A = (0_{3 \times 6} I_{3})

and

m = {(1, 1, 0)}^{⊤}

, which, under the null hypothesis follows a

χ^{2}

distribution with 3 degrees of freedom.

Thus, we obtained that

W_{n} = 1229.17

with

p_{v a l u e} = 0.0000

that is, the null hypothesis is rejected, indicating that the exponentiated component is significant in the model, thus, both components of the unification, skew and power, are significant in the good fit of the BPSN model.

The multivariate Kolmogorov-Smirnov test of goodness of fit proposed by [45], in special for the case of a bivariate distribution, which we denote by BKS (bivariate Kolmogorov-Smirnov), the statistic is given by

d_{n} = sup_{(x_{1}, x_{2}) \in R^{2}} |F_{n} (x_{1}, x_{2}) - F (x_{1}, x_{2})|

where

F_{n}

is the empirical distribution function of the sample, and

F

is some specified distribution function. When

F

distribution is unknown, the Kolmogorov-Smirnov statistic is defined by

d_{n} (F) = max \{D^{1}, D^{2}\},

where

D^{1} sup_{(x_{1}, x_{2}) \in R^{2}} |G_{n} (y_{1}, y_{2}) - y_{1} \times y_{2}|

by using the transformation

y_{1} = F_{X_{1}} (x_{1})

and

y_{2} = F_{X_{2} ∣ X_{1}} (x_{2} ∣ x_{1})

, and

D^{2} sup_{(x_{1}, x_{2}) \in R^{2}} |G_{n} (y_{2}, y_{1}) - y_{2} \times y_{1}|

by using the transformation

y_{2} = F_{X_{2}} (x_{2})

and

y_{1} = F_{X_{1} ∣ X_{2}} (x_{1} ∣ x_{2}),

where

G_{n}

is the empirical distribution function of the sample. For the special case of the BPSN model,

d_{n} (BPSN) = max \{0.05907079, 0.07485913\} = 0.07485913,

which is less than

0.1464

, which is the critical value of Table 1 given by [45], at level of 5%. Therefore, it is concluded that, the BPSN model fits well with the iris data set.

It should be noted that, the BPSN model is compared to other models, in particular a model of the skew-normal family of [1] proposed by [6], and a model of the power-normal family of [2] studied by [26]. Our proposal better fitted the data set studied in [35]. This allows us to conclude that the BPSN model is a viable alternative to those existing in the literature when the data set presents degrees of asymmetry that are not captured by the multivariate normal model.

4.2. Illustration 2

In the second application we use data on measurements on air-pollution variables recorded at 12:00 noon in the Los Angeles area on different days, available in [46]. For this application, we use the variables

x = W i n d

and

y = N O_{2}

. For air pollutant concentrations, it is usually assumed that the data are uncorrelated and independent and thus do not require the diurnal or cyclic trend analysis [47]. The concentration of average air pollutants has been used in epidemiological surveillance as an indicator of the atmospheric contamination and its associated adverse effects in humans, causing diseases such as bronchitis.

The bivariate normality test of Royston returns a test statistic value of 13.55147 with p-value = 0.001141065, whereas the generalized Shapiro-Wilk test for multivariate normality returns a test statistic value of

M V W = 0.94391

, with p-value = 0.01166 rejecting the hypothesis of normality of the observations vector. Thus, a model like the BPSN is an alternative to fit the vector of observations.

In this case, the maximum pseudo-likelihood estimates for the parameter vector is given by

{\hat{ξ}}_{1} = 5.8000 (0.3802),

{\hat{ξ}}_{2} = 2.8274 (0.6279),

{\hat{η}}_{1} = 4.9416 (0.5998),

{\hat{η}}_{2} = 6.1694 (0.6590),

{\hat{λ}}_{1} = - 1.8609 (0.1185),

{\hat{λ}}_{2} = - 0.5184 (0.1263),

{\hat{α}}_{1} = 10.14438 (5.0364124),

{\hat{α}}_{2} = 11.3732 (6.5075)

and

{\hat{α}}_{12} = 15.4635 (8.4410)

.

For the hypothesis

H_{0} : (α_{1}, α_{2}, α_{12}) = (1, 1, 0) versus H_{1} : (α_{1}, α_{2}, α_{12}) \neq (1, 1, 0)

nosotros obtuvimos que

W_{n} = 196.6627

with

p_{v a l u e} = 0.0000,

es decir, se rechaza la hipótesis nula, indicando que la componente exponenciada es significativa en el modelo. For the The multivariate Kolmogorov-Smirnov test of goodness of fit,

d_{n} (BPSN) = max {0.1619804, 0.1066137} = 0.1619804,

which is less than the values 0.2789 (for

n = 40

) and 0.2512 (for

n = 50

), then the BPSN model presents a good fit for the environmental pollution data in the city of Los Angeles. Contour plots for the bivariate PSN distributions is presented in Figure 3.

5. Concluding Remarks

In this article, on the basis of conditionally specified distributions, we have introduced a new bivariate PSE distribution which is very general, quite flexible and widely applicable. The new bivariate model is an absolutely continuous bivariate distribution whose marginals are univariate PSE distributions. We have derived several properties of the bivariate PSE distribution and special attention is centered in the particular case of de bivariate PSN distribution. The estimation of the unknown parameters of the new bivariate model is approached by using the proposal of [29] by maximization of the pseudo-likelihood function and the observed information matrix is determined. LR tests for some hypotheses of interest are also considered. As remarked, the new bivariate PSN model proposed in this article can be skewed and correlated, and therefore is much more flexible than other bivariate skew models available in the literature for analysing bivariate data. This is supported in the application to real data which is verified that the new bivariate PSN model provides consistently a better fit than the bivariate model proposed by [6].

Author Contributions

All authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

The research of H. W. Gómez was supported by SEMILLERO UA-2020 project, Chile. The research of G. Martínez-Flórez was supported by project: Distribuições de Probabilidade Mutivariadas Assimétricas e Flexíveis, Universidade Federal de Ceará, Fortaleza, Brazil.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The elements of the

K

matrix can be written as:

\begin{matrix} κ_{ξ_{j} ξ_{j}} = & \frac{α_{12}}{η_{j}^{2}} \sum_{i = 1}^{n} [\frac{z_{j i} W_{(1) j i} - \sqrt{\frac{2}{π}} λ_{j} W_{(3) j i} + W_{(1) j i}^{2}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} - \frac{α_{12} W_{(1) j i}^{2}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}}] \\ - \frac{1}{η_{j}^{2}} \sum_{i = 1}^{n} [1 + \sqrt{\frac{2}{π}} λ_{j}^{3} z_{j i} W_{(2) j i} + \frac{2 λ_{j}^{2}}{π} W_{(2) j i}^{2}] \\ - \frac{1}{η_{j}^{2}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] [z_{j i} W_{(1) j i} - \sqrt{\frac{2}{π}} λ_{j} W_{(3) j i} + W_{(1) j i}^{2}], \end{matrix}

\begin{matrix} κ_{ξ_{j} η_{j}} = & \frac{α_{12}}{η_{j}^{2}} \sum_{i = 1}^{n} [\frac{(z_{j i}^{2} - 1) W_{(1) j i} - \sqrt{\frac{2}{π}} λ_{j} z_{j i} W_{(3) j i} + z_{j i} W_{(1) j i}^{2}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} - \frac{α_{12} z_{j i} W_{(1) j i}^{2}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}}] \\ + \frac{1}{η_{j}^{2}} \sum_{i = 1}^{n} [- 2 z_{j i} + \sqrt{\frac{2}{π}} λ_{j} (1 - λ_{j}^{2} z_{j i}^{2}) W_{(2) j i} - \frac{2 λ_{j}^{2} z_{j i}}{π} W_{(2) j i}^{2}] \\ - \frac{1}{η_{j}^{2}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] [(z_{j i}^{2} - 1) W_{(1) j i} - \sqrt{\frac{2}{π}} λ_{j} z_{j i} W_{(3) j i} + z_{j i} W_{(1) j i}^{2}], \end{matrix}

\begin{matrix} κ_{ξ_{j} λ_{j}} = & \sqrt{\frac{2}{π}} \frac{1}{1 + λ_{j}^{2}} \frac{α_{12}}{η_{j}} \sum_{i = 1}^{n} [\frac{(1 + λ_{j}^{2}) z_{j i} W_{(3) j i} + W_{(1) j i} W_{(3) j i}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} - \frac{α_{12} W_{(1) j i} W_{(3) j i}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}}] \\ + \frac{1}{η_{j}} \sqrt{\frac{2}{π}} \sum_{i = 1}^{n} [(λ_{j i}^{2} z_{j i}^{2} - 1) W_{(2) j i} + \sqrt{\frac{2}{π}} λ_{j} z_{j i} W_{(2) j i}^{2}] \\ - \sqrt{\frac{2}{π}} \frac{1}{1 + λ_{j}^{2}} \frac{1}{η_{j}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] [(1 + λ_{j}^{2}) z_{j i} W_{(3) j i} + W_{(1) j i} W_{(3) j i}], \end{matrix}

κ_{ξ_{j} α_{j}} = \frac{- 1}{η_{j}} \sum_{i = 1}^{n} [\frac{α_{12}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}} + 1] W_{(1) j i},

κ_{ξ_{j} α_{12}} = \frac{1}{η_{j}} \sum_{i = 1}^{n} [\frac{α_{j}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}} + 2 ln (F_{SN} (z_{j^{'} i}; λ_{j}))] W_{(1) j i},

\begin{matrix} κ_{η_{j} η_{j}} = & \frac{α_{12}}{η_{j}^{2}} \sum_{i = 1}^{n} [\frac{(z_{j i}^{2} - 2) z_{j i} W_{(1) j i} - \sqrt{\frac{2}{π}} λ_{j} z_{j i}^{2} W_{(3) j i} + z_{j i}^{2} W_{(1) j i}^{2}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} - \frac{α_{12} z_{j i}^{2} W_{(1) j i}^{2}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}}] \\ \frac{1}{η_{j}^{2}} \sum_{i = 1}^{n} [1 - 3 z_{j i}^{2} + \sqrt{\frac{2}{π}} λ_{j} z_{j i} (2 - λ_{j}^{2} z_{j i}^{2}) W_{(2) j i} - \frac{2}{π} λ_{j}^{2} z_{j i}^{2} W_{(2) j i}^{2}] \\ - \frac{1}{η_{j}^{2}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] [(z_{j i}^{2} - 2) z_{j i} W_{(1) j i} - \sqrt{\frac{2}{π}} λ_{j} z_{j i}^{2} W_{(3) j i} + z_{j i}^{2} W_{(1) j i}^{2}], \end{matrix}

\begin{matrix} κ_{η_{j} λ_{j}} = & \sqrt{\frac{2}{π}} \frac{1}{1 + λ_{j}^{2}} \frac{α_{12}}{η_{j}} \sum_{i = 1}^{n} [\frac{z_{j i}^{2} (1 + λ_{j}^{2}) W_{(3) j i} + z_{j i} W_{(1) j i} W_{(3) j i}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} - \frac{α_{12} z_{j i} W_{(1) j i} W_{(3) j i}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}}] \\ - \sqrt{\frac{2}{π}} \frac{1}{η_{j}} \sum_{i = 1}^{n} z_{j i} W_{(2) j i} [1 - λ_{j}^{2} z_{j i}^{2} - \sqrt{\frac{2}{π}} λ_{j} z_{j i} W_{(2) j i}] \\ - \sqrt{\frac{2}{π}} \frac{1}{1 + λ_{j}^{2}} \frac{1}{η_{j}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] [z_{j i}^{2} (1 + λ_{j}^{2}) + z_{j i} W_{(1) j i}] W_{(3) j i}, \end{matrix}

κ_{η_{j} α_{j}} = - \frac{1}{η_{j}} \sum_{i = 1}^{n} [\frac{α_{12} z_{j i}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}} + z_{j i}] W_{(1) j i},

κ_{η_{j} α_{12}} = \frac{1}{η_{j}} \sum_{i = 1}^{n} [\frac{α_{j} z_{j i}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}} + 2 z_{j i} ln (F_{SN} (z_{j^{'} i}; λ_{j}))] W_{(1) j i},

\begin{matrix} κ_{λ_{j} λ_{j}} = & \sqrt{\frac{2}{π}} \frac{α_{12}}{{(1 + λ_{j}^{2})}^{2}} \sum_{i = 1}^{n} [\frac{[- λ_{j} (1 + λ_{j}^{2}) z_{j i}^{2} - 2 λ_{j} + \sqrt{\frac{2}{π}} W_{(3) j i}] W_{(3) j i}}{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]} - \sqrt{\frac{2}{π}} \frac{α_{12} W_{(3) j i}^{2}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}}] \\ + \sqrt{\frac{2}{π}} \sum_{i = 1}^{n} [- λ_{j} z_{j i}^{3} W_{(2) j i} - \sqrt{\frac{2}{π}} z_{j i}^{2} W_{(2) j i}^{2}] \\ - \sqrt{\frac{2}{π}} \frac{1}{{(1 + λ_{j}^{2})}^{2}} \sum_{i = 1}^{n} [(α_{j} - 1) - 2 α_{12} ln (F_{SN} (z_{j^{'} i}; λ_{j^{'}}))] [- λ_{j} (1 + λ_{j}^{2}) z_{j i}^{2} - 2 λ_{j} + \sqrt{\frac{2}{π}} W_{(3) j i}] W_{(3) j i}, \end{matrix}

κ_{λ_{j} α_{j}} = - \sqrt{\frac{2}{π}} \frac{1}{1 + λ_{j}^{2}} \sum_{i = 1}^{n} [\frac{α_{12}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}} + 1] W_{(3) j i},

κ_{λ_{j} α_{12}} = \sqrt{\frac{2}{π}} \frac{1}{1 + λ_{j}^{2}} \sum_{i = 1}^{n} [\frac{α_{j}}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}} + 2 ln (F_{SN} (z_{j^{'} i}; λ_{j}))] W_{(3) j i},

κ_{α_{j} α_{j}} = - \sum_{i = 1}^{n} \frac{1}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}},

κ_{α_{j} α_{12}} = \sum_{i = 1}^{n} \frac{ln (F_{SN} (z_{j i}; λ_{j}))}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}},

κ_{α_{12} α_{12}} = - \sum_{i = 1}^{n} \sum_{j = 1}^{2} \frac{{ln}^{2} (F_{SN} (z_{j i}; λ_{j}))}{{[α_{j} - α_{12} ln (F_{SN} (z_{j i}; λ_{j}))]}^{2}},

κ_{α_{j} α_{j^{'}}} = 0, κ_{α_{j} λ_{j^{'}}} = 0, κ_{α_{j} η_{j^{'}}} = 0, κ_{α_{j} ξ_{j^{'}}} = 0,

κ_{λ_{j} λ_{j^{'}}} = - \frac{4}{π} \frac{α_{12}}{(1 + λ_{j}^{2}) (1 + λ_{j^{'}}^{2})} \sum_{i = 1}^{n} W_{(3) j i} W_{(3) j^{'} i},

κ_{λ_{j} η_{j^{'}}} = - 2 \sqrt{\frac{2}{π}} \frac{1}{η_{j^{'}}} \frac{α_{12}}{1 + λ_{j}^{2}} \sum_{i = 1}^{n} z_{j^{'} i} W_{(3) j i} W_{(1) j^{'} i},

κ_{λ_{j} ξ_{j^{'}}} = - 2 \sqrt{\frac{2}{π}} \frac{1}{η_{j^{'}}} \frac{α_{12}}{1 + λ_{j}^{2}} \sum_{i = 1}^{n} W_{(3) j i} W_{(1) j^{'} i},

κ_{η_{j} η_{j^{'}}} = - \frac{2 α_{12}}{η_{j} η_{j^{'}}} \sum_{i = 1}^{n} z_{j i} z_{j^{'} i} W_{(1) j i} W_{(1) j^{'} i},

κ_{η_{j} ξ_{j^{'}}} = - \frac{2 α_{12}}{η_{j} η_{j^{'}}} \sum_{i = 1}^{n} z_{j i} W_{(1) j i} W_{(1) j^{'} i},

κ_{ξ_{j} ξ_{j^{'}}} = - \frac{2 α_{12}}{η_{j} η_{j^{'}}} \sum_{i = 1}^{n} W_{(1) j i} W_{(1) j^{'} i} .

References

Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Durrans, S.R. Distributions of fractional order statistics in hydrology. Water Resour. Res. 1992, 28, 1649–1655. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. Skew-normal alpha-power model. Statistics 2014, 48, 1414–1428. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Bolfarine, H.; Gómez, Y.M.; Gómez, H.W. An Unification of Families of Birnbaum-Saunders Distributions with Applications. Rev. Stat. Stat. J. 2020. Available online: https://www.ine.pt/revstat/pdf/ANUNIFICATIONOFFAMILIESOFBIRNBAUM-SAUNDERS.pdf (accessed on 8 August 2020).
Azzalini, A.; Dalla-Valle, A. The multivariate skew-normal distribution. Biometrika 1996, 83, 715–726. [Google Scholar] [CrossRef]
Arnold, B.C.; Castillo, E.; Sarabia, J.M. Conditionally specified multivariate skewed distributions. Sankhya Indian J. Stat. Ser. A 2002, 64, 206–226. [Google Scholar]
Arellano-Valle, R.; Bolfarine, H. On some characterizations of the T-Distribution. Stat. Probab. Lett. 1995, 25, 79–85. [Google Scholar] [CrossRef]
Cambanis, S.; Huang, S.; Simons, G. On the theory of elliptically contoured distributions. J. Multivar. Anal. 1981, 11, 368–385. [Google Scholar] [CrossRef] [Green Version]
Fang, K.T.; Kotz, S.; Ng, K.W. Symmetric Multivariate and Related Distributions, 3rd ed.; Chapman & Hall: London, UK, 1990. [Google Scholar]
Gupta, A.K.; Varga, T. Elliptically Contoured Models in Statistics; Kluwer Academic Publishers: Boston, MA, USA, 1993. [Google Scholar]
Kelker, D. Distribution theory of spherical distributions and location scale parameters generalization. Sankhya Indian J. Stat. Ser. A 1970, 32, 419–430. [Google Scholar]
Azzalini, A.; Capitanio, A. Statistical applications of the multivariate skew normal distribution. J. R. Stat. Soc. Ser. B 1999, 61, 579–602. [Google Scholar] [CrossRef]
Branco, M.D.; Dey, D.K. A general class of multivariate skew-elliptical distributions. J. Multivar. Anal. 2001, 79, 99–113. [Google Scholar] [CrossRef] [Green Version]
Genton, M.G.; Loperfido, N.M. Generalized skew-elliptical distributions and their quadratic forms. Ann. Inst. Stat. Math. 2005, 57, 389–401. [Google Scholar] [CrossRef] [Green Version]
Shushi, T. Generalized skew-elliptical distributions are closed under affine transformations. Stat. Probab. Lett. 2018, 134, 1–4. [Google Scholar] [CrossRef]
Adcock, C.; Azzalini, A. A Selective Overview of Skew-Elliptical and Related Distributions and of Their Applications. Symmetry 2020, 12, 118. [Google Scholar] [CrossRef] [Green Version]
Owen, D.B. Tables for computing bivariate normal probabilities. Ann. Math. Stat. 1956, 27, 1075–1090. [Google Scholar] [CrossRef]
Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; Wiley-Blackwell; John Wiley & Sons: Hoboken, NJ, USA, 1995; Volume 2. [Google Scholar]
Martínez-Flórez, G.; Pacheco, M.; Giraldo, R. Inference in log-alpha-power and log-skew-normal multivariate models. Commun. Stat. Theory Methods 2016, 45, 4397–4415. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Farias, R.B.A.; Moreno-Arenas, G. Multivariate log-Birnbaum-Saunders regression models. Commun. Stat. Theory Methods 2017, 46, 10166–10178. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Lemonte, A.J.; Salinas, H.S. Multivariate Skew-Power-Normal Distributions: Properties and Associated Inference. Symmetry 2019, 11, 1509. [Google Scholar] [CrossRef] [Green Version]
Lemonte, A.J.; Martínez-Flórez, G.; Moreno-Arenas, G. Multivariate Birnbaum-Saunders distribution: Properties and associated inference. J. Stat. Comput. Simul. 2016, 85, 374–392. [Google Scholar] [CrossRef]
Pljonkin, A.P. Features of the Photon Pulse Detection Algorithm in the Quantum Key Distribution System. In Proceedings of the 2017 International Conference on Cryptography, Security and Privacy, Goa, India, 15–17 December 2017; pp. 81–84. [Google Scholar]
Pljonkin, A.P. Vulnerability of the Synchronization Process in the Quantum Key Distribution System. Int. J. Cloud Appl. Comput. 2019, 9, 50–58. [Google Scholar] [CrossRef] [Green Version]
Arnold, B.C.; Castillo, E.; Sarabia, J.M. Conditionally specified distributions. In Lecture Notes in Statistics; Berger, J., Fienberg, J., Gani, J., Krickeberg, I., Singer, B., Eds.; Springer: New York, NY, USA, 1992; Volume 73. [Google Scholar]
Martínez-Flórez, G.; Arnold, B.C.; Bolfarine, H.; Gómez, H.W. The multivariate alpha-power model. J. Stat. Plan. Inference 2013, 143, 1244–1255. [Google Scholar] [CrossRef]
Arnold, B.C.; Strauss, D. Bivariate distributions with conditionals in prescribed exponential families. J. Roy. Stat. Soc. Ser. B 1991, 53, 365–375. [Google Scholar] [CrossRef]
Arnold, B.C.; Castillo, E.; Sarabia, J.M. Conditionally Specification of Statistical Models; Springer Series in Statistics; Springer: New York, NY, USA, 1999. [Google Scholar]
Besag, J. Statistical analysis of non-lattice data. J. Roy. Stat. Soc. Ser. D 1975, 24, 179–195. [Google Scholar] [CrossRef] [Green Version]
Arnold, B.C.; Strauss, D. Pseudolikelihood Estimation: Some Examples. Sankhya Indian J. Stat. Ser. B 1991, 53, 233–2435. [Google Scholar]
Cheng, C.; Riu, J. On Estimating Linear Relationships When Both Variables Are Subject to Heteroscedastic Measurement Errors. Technometrics 2006, 48, 511–519. [Google Scholar] [CrossRef]
Rotnitziky, A.; Cox, D.R.; Bottai, M.; Robins, J. Likelihood-based inference with singular information matrix. Bernoulli 2000, 6, 243–284. [Google Scholar] [CrossRef]
Salinas, H.S.; Gómez, H.W.; Martínez-Flórez, G.; Bolfarine, H. Skew-normal alpha-power model [Statistics 48(2014) 1414–1428]. Statistics 2018, 52, 950–953. [Google Scholar] [CrossRef]
Azzalini, A.; Capitanio, A. The Skew-Normal and Related Families, 1st ed.; Cambridge University Press: Cambridge, UK, 2014. [Google Scholar]
Fisher, R.A. The use of multiple measurements in taxonomic problems. Ann. Eugen. 1936, 7, 179–188. [Google Scholar] [CrossRef]
Anderson, T.W.; Darling, D.A. A test of goodness of fit. J. Am. Stat. Assoc. 1954, 49, 765–769. [Google Scholar] [CrossRef]
Doornik, J.A.; Hansen, H. An Omnibus Test for Univariate and Multivariate Normality. Oxf. Bull. Econ. Stat. 2008, 70, 927–939. [Google Scholar] [CrossRef]
Henze, N.; Zirkler, B. A Class of Invariant Consistent Tests for Multivariate Normality. Commun. Stat. Theory Methods 1990, 19, 3595–3617. [Google Scholar] [CrossRef]
Royston, J.P. Some Techniques for Assessing Multivarate Normality Based on the Shapiro-Wilk W. J. Roy. Stat. Soc. Ser. C 1983, 32, 121–133. [Google Scholar] [CrossRef]
Royston, J.P. Remark AS R94: A Remark on Algorithm AS 181: The W-test for Normality. J. Roy. Stat. Soc. Ser. C 1995, 44, 547–551. [Google Scholar] [CrossRef]
Akaike, H. A new look at statistical model identification. IEEE Trans. Autom. Contr. 1974, 19, 716–722. [Google Scholar] [CrossRef]
Cavanaugh, J.E. Unifying the derivations for the Akaike and corrected Akaike information criteria. Stat. Probab. Lett. 1997, 33, 201–208. [Google Scholar] [CrossRef]
R Development Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2019; Available online: http://www.R-project.org (accessed on 10 January 2020).
Pewsey, A. Problems of inference for Azzalini’s skew-normal distribution. J. Appl. Stat. 2000, 27, 859–870. [Google Scholar] [CrossRef]
Justel, A.; Peña, D.; Zamar, Z. A multivariate Kolmogorov-Smirnov test of goodness of fit. Stat. Probab. Lett. 1997, 35, 251–259. [Google Scholar] [CrossRef] [Green Version]
Johnson, R.A.; Wichern, D.W. Applied Multivariate Statistical Analysis, 6th ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2007. [Google Scholar]
Gokhale, S.; Khare, M. Statistical behavior of carbon monoxide from vehicular exhausts in urban environments. Environ. Model. Softw. 2007, 22, 526–535. [Google Scholar] [CrossRef]

Figure 1. Graphs of contorns for the BPSN model (a) BPSN ((0,1), (0,1), 1.25, 2.75, 2.25, 1.75, 0.75), (b) BPSE ((0,1), (0,1), −1.25, −2.75, 2.25, 1.75, 0.75) and (c) BPSE ((0,1), (0,1), 1.25, 2.75, 2.25, 3.75, 1.25).

Figure 2. Contour plot of bivariate distributions for Iris data set: (a) BCSN model, (b) BPN model and (c) BPSN model.

Figure 3. Contour plots for the bivariate PSN distributions.

Table 1. Correlation coefficient for the BPSN distribution.

$λ_{2} / λ_{1}$	−1.5	−1.0	−0.5	0	0.5	1.0	1.5	2.0
−2.5	0.8033	0.3730	−0.2686	−0.6699	−0.8444	−0.9252	−0.9680	−0.9933
−2.0	0.7518	0.3559	−0.2385	−0.6124	−0.7759	−0.8519	−0.8924	−0.9164
−1.5	0.6025	0.2984	−0.1661	−0.4625	−0.5938	−0.6556	−0.6888	−0.7087
−1.0	0.2105	0.1324	−0.0046	−0.1014	−0.1479	−0.1713	−0.1845	−0.1927
−0.5	−0.2853	−0.0921	0.1722	0.3245	0.3855	0.4117	0.4246	0.4317
0	−0.5511	−0.2190	0.2543	0.5385	0.6573	0.7105	0.7377	0.7534
0.5	−0.6643	−0.2756	0.2846	0.6244	0.7679	0.8327	0.8662	0.8856
1.0	−0.7183	−0.3035	0.2971	0.6631	0.8185	0.8888	0.9254	0.9467
1.5	−0.7478	−0.3193	0.3031	0.6834	0.8451	0.9186	0.9569	0.9791
2.0	−0.7657	−0.3291	0.3062	0.6951	0.8609	0.9362	0.9755	0.9985
2.5	−0.7776	−0.3358	0.3081	0.7026	0.8709	0.9476	0.9876	1.0000

Table 2. Summary statistics for the data set.

Variable	${\bar{x}}_{j}$	$s_{j}$	$\sqrt{b_{j 1}}$	$b_{j 2}$
$x_{1}$	5.843	0.828	0.308	−0.605
$x_{2}$	3.057	0.435	0.312	0.138

Table 3. Estimated parameters (standard errors), of the BCSN, BPN and BPSN models.

Estimate	BCSN	BPN	BPSN
${\hat{ξ}}_{1}$	5.867 (0.055)	3.746 (0.136)	4.119 (0.163)
${\hat{ξ}}_{2}$	3.055 (0.035)	1.655 (0.082)	2.572 (0.159)
${\hat{η}}_{1}$	0.794 (0.043)	1.384 (0.058)	1.417 (0.204)
${\hat{η}}_{2}$	0.438 (0.026)	0.808 (0.055)	2.146 (0.390)
${\hat{λ}}_{1}$	−0.224 (0.110)		11.147 (3.685)
${\hat{λ}}_{2}$			−3.200 (0.504)
${\hat{α}}_{1}$		9.358 (0.613)	2.127 (0.192)
${\hat{α}}_{2}$		14.746 (0.374)	18.260 (3.158)
${\hat{α}}_{12}$		2.671 (0.715)	5.016 (1.473)
AIC	555.10	551.71	549.00
CAIC	557.68	554.73	552.59

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Martínez-Flórez, G.; Tovar-Falón, R.; Gómez, H.W. Bivariate Power-Skew-Elliptical Distribution. Symmetry 2020, 12, 1327. https://doi.org/10.3390/sym12081327

AMA Style

Martínez-Flórez G, Tovar-Falón R, Gómez HW. Bivariate Power-Skew-Elliptical Distribution. Symmetry. 2020; 12(8):1327. https://doi.org/10.3390/sym12081327

Chicago/Turabian Style

Martínez-Flórez, Guillermo, Roger Tovar-Falón, and Héctor W. Gómez. 2020. "Bivariate Power-Skew-Elliptical Distribution" Symmetry 12, no. 8: 1327. https://doi.org/10.3390/sym12081327

APA Style

Martínez-Flórez, G., Tovar-Falón, R., & Gómez, H. W. (2020). Bivariate Power-Skew-Elliptical Distribution. Symmetry, 12(8), 1327. https://doi.org/10.3390/sym12081327

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Bivariate Power-Skew-Elliptical Distribution

Abstract

1. Introduction

1.1. Elliptical Distributions

1.2. Skew-Elliptical Distribution

1.2.1. Skew-Normal Distribution

1.2.2. Skew-Student-t Distribution

1.2.3. Skew-Cauchy Distribution

1.2.4. Skew-Logistic Distribution

1.2.5. Skew-Laplace Distribution

1.3. Power-Skew-Elliptical Distribution

2. Bivariate Power-Skew-Elliptical Distribution

Statistical Inference for the Bpse Model

3. Bivariate Power-Skew-Normal Model

3.1. Statistical Inference

3.2. Reparameterization for the Bpsn Model

4. Numerical Illustrations

4.1. Illustration 1

4.2. Illustration 2

5. Concluding Remarks

Author Contributions

Funding

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI