A New Generalization of the Student’s t Distribution with an Application in Quantile Regression

Reyes, Jimmy; Rojas, Mario A.; Arrué, Jaime

doi:10.3390/sym13122444

Open AccessArticle

A New Generalization of the Student’s t Distribution with an Application in Quantile Regression

by

Jimmy Reyes

,

Mario A. Rojas

and

Jaime Arrué

^*

Departamento de Matemáticas, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1270300, Chile

^*

Author to whom correspondence should be addressed.

Symmetry 2021, 13(12), 2444; https://doi.org/10.3390/sym13122444

Submission received: 14 November 2021 / Revised: 5 December 2021 / Accepted: 11 December 2021 / Published: 17 December 2021

Download

Browse Figures

Versions Notes

Abstract

:

In this work, we present a new generalization of the student’s t distribution. The new distribution is obtained by the quotient of two independent random variables. This quotient consists of a standard Normal distribution divided by the power of a chi square distribution divided by its degrees of freedom. Thus, the new symmetric distribution has heavier tails than the student’s t distribution and extensions of the slash distribution. We develop a procedure to use quantile regression where the response variable or the residuals have high kurtosis. We give the density function expressed by an integral, we obtain some important properties and some useful procedures for making inference, such as moment and maximum likelihood estimators. By way of illustration, we carry out two applications using real data, in the first we provide maximum likelihood estimates for the parameters of the generalized student’s t distribution, student’s t, the extended slash distribution, the modified slash distribution, the slash distribution generalized student’s t test, and the double slash distribution, in the second we perform quantile regression to fit a model where the response variable presents a high kurtosis.

Keywords:

generalization of the student’s t distribution; student’s t distribution; slash distribution; moments; maximum likelihood estimates

1. Introduction

The slash distribution is the result of the quotient of two independent random variables, one with a standard normal distribution and the other with a uniform distribution on the interval (0, 1), with the following stochastic representation

Y = σ (\frac{X}{U^{1 / q}}) + μ,

(1)

where

μ \in R

is the location parameter and

σ > 0

is the scale parameter and q is the parameter related to kurtosis. Will be denoted by

Y \sim S (μ, σ, q)

and its density function has the following expression

f_{Y} (y) = \frac{q 2^{\frac{q}{2} - 1}}{\sqrt{π} {|\frac{y - μ}{σ}|}^{q + 1}} [Γ (\frac{q + 1}{2}) - Γ (\frac{q + 1}{2}, \frac{{(y - μ)}^{2}}{2 σ^{2}})],

(2)

where

Γ (a) = \int_{0}^{\infty} t^{a - 1} e^{- t} d t

is the gamma function and

Γ (a, x) = \int_{x}^{\infty} t^{a - 1} e^{- t} d t

is the gamma function incomplete. This distribution presents heavier tails than the normal distribution, that is, it has more kurtosis. Properties of this family are discussed in Rogers and Tukey [1] and Mosteller and Tukey [2].

Maximum likelihood estimators for location and scale parameters are discussed in Kafadar [3]. Wang and Genton [4] described multivariate symmetrical and skew-multivariate extensions of the slash-distribution while Gómez et al. [5] (and Erratum in Gómez and Venegas, 2008) extend the slash distribution by introducing the slash-elliptical family; asymmetric version of this family is discussed in work of Arslan [6]. Genc [7] discussed a symmetric generalization of the slash distribution. More recently, Gómez et al. [8] utilize the slash-elliptical family to extend the Birnbaum–Saunders distribution.

In (1),

μ = 0

and

σ = 1

, we retrieve the standard slash distribution. What is more

q = 1

we obtain the canonical slash distribution. When q tends to infinity, the standard normal distribution is recovered.

When

U \sim e x p (2)

, in (1), the distribution obtained is called modified slash distribution studied by Reyes et al. [9]. Whose function of density is given by

\begin{matrix} f_{X} (x) = \frac{2}{\sqrt{2 π}} \int_{0}^{\infty} v^{\frac{1}{q}} e^{- \frac{1}{2} x^{2} v^{\frac{2}{q}} - 2 v} d v, q > 0, x \in R, \end{matrix}

(3)

and will be denoted by

X \sim M S (0, 1, q)

, where q is kurtosis parameter.

When

U \sim B (α, β)

and

q = 1

, in (1), the distribution obtained is called extended slash (ES) distribution studied by Rojas et al. [10]. Whose function of density is given by

\begin{matrix} f_{Y} (y; μ, σ, α, β) = \frac{1}{σ B (α, β)} \int_{0}^{1} ϕ ((\frac{y - μ}{σ}) t) t^{α} {(1 - t)}^{β - 1} d t \end{matrix}

(4)

is denoted as

Y \sim E S (μ, σ, α, β)

with

μ \in R

,

σ

,

α

,

β > 0

and

ϕ

denotes the pdf of the standard normal distribution (see Johnson et al. [11]) and

B (\cdot, \cdot)

denotes the beta function.

We will say that X has a student’s t distribution with

ν

degrees of freedom and with location parameter

μ

and scale parameter

σ

, which we will denote by

X \sim T (μ, σ, ν)

and you have a stochastic representation given by

\begin{matrix} X & = & σ \frac{W}{{(V / ν)}^{1 / 2}} + μ \end{matrix}

(5)

and continuous probability density function is given by

\begin{matrix} f_{X} (x) = \frac{Γ (\frac{ν + 1}{2})}{σ Γ (\frac{ν}{2}) \sqrt{ν π}} {[1 + \frac{1}{ν} {(\frac{x - μ}{σ})}^{2}]}^{\frac{ν + 1}{2}} \end{matrix}

(6)

with support on

(- \infty; \infty)

.

The moment’s order r of the random variable X with student’s t distribution can be explained by the function Gamma. If

X \sim T (0, 1, ν)

then

\begin{matrix} μ_{r} = E [X^{r}] = \frac{ν^{r / 2} a_{r / 2} 2^{r / 2} Γ (\frac{ν - r}{2})}{Γ (\frac{ν}{2})}, ν > r, \end{matrix}

(7)

where

a_{r / 2} = \int_{- \infty}^{\infty} x^{r} ϕ (x) d x

for r even, then

E [X] = 0

,

ν > 1

V (X) = \frac{ν}{ν - 2}

,

ν > 2

.

If

Y \sim T (μ, σ, ν)

then

\begin{matrix} E (Y^{r}) & = & \sum_{k = 0}^{r} (\begin{matrix} r \\ k \end{matrix}) σ^{k} μ^{r - k} μ_{k} . \end{matrix}

Rui Li-Saralees Nadarajah [12] makes a review of all the generalizations of the student’s t distribution published to date, where they show that the main motivation of these extensions is to model heavy tails or data with high kurtosis.

In the study of symmetric distributions with heavy tails El-Bassiouny et al. [13] present the generalized student’s slash t distribution. We will say that

X \sim G L S T (μ, σ, α, β, ν, q)

, with parameter

q > 0

, has pdf given by

\begin{matrix} f_{X} (x) = \frac{q Γ (\frac{r + 1}{2})}{σ \sqrt{π r} Γ (\frac{r}{2}) B (α, β)} \int_{0}^{1} w^{α q} {(1 - w^{q})}^{β - 1} {[1 + (\frac{x - μ}{σ}) \frac{w^{2}}{r}]}^{- \frac{r + 1}{2}} d w, q > 0, x \in R, \end{matrix}

(8)

where q is kurtosis parameter and

B (\cdot, \cdot)

denotes the beta function.

Another recent extension of the slash model was proposed by El-Morshedy, A. H. et al. [14]. These authors introduced the double slash (DSL) distribution with density function given by

\begin{matrix} f_{Y} (y) = q_{1} q_{2} \int_{0}^{1} [\int_{0}^{1} ϕ ((\frac{y - μ}{σ}) w t) t^{q_{1}} d t] w^{q_{2}} d w \end{matrix}

(9)

with

μ \in R

,

σ

,

q_{1}

and

q_{2} > 0

.

When

U \sim G a (2 β, β)

and

q = 1

, in (1), the distribution generalized modified slash distribution, denoted

G M S (μ, σ, β)

, studied by Reyes, J., Barranco-Chamorro, I., and Gómez, H. W. [15]. Whose function of density is given by

f_{Y} (y; μ, σ, β) = \{\begin{matrix} \frac{1}{σ \sqrt{8 π}} & if y = μ \\ \frac{2^{β / 2}}{\sqrt{2 π}} \frac{σ^{β + 1} β^{β + 2}}{{| y - μ |}^{β + 2}} U (1 + \frac{β}{2}, \frac{3}{2}, \frac{2 σ^{2} β^{2}}{{(y - μ)}^{2}}) & if y \neq μ, \end{matrix}

(10)

where

μ \in R

,

σ

,

β > 0

and

U (a, b, z) = \frac{1}{Γ (a)} \int_{0}^{\infty} t^{a - 1} {(1 + t)}^{b - a - 1} e^{- z t} d t,

(11)

is the confluent hypergeometric function of the second kind. Details about this function can be seen in Abramowitz and Stegun, p. 505.

With the motivation of finding a distribution that is a generalization of the student’s t distribution and that presents heavier tails than the distributions found so far in the literature, in this article, we introduce a new generalization of the student’s t distribution (GT) whose stochastic representation is given by

Y = σ \frac{W}{{(V / ν)}^{1 / q}} + μ,

(12)

where

W \sim N (0, 1)

,

V \sim χ_{(ν)}^{2}

are independent with

ν > 0

and

q > 0

and we will denote it as

Y \sim G T (μ, σ, ν, q)

.

The paper is organized as follows. In Section 2 the probability density function (pdf) is given and some properties of the

G T

distribution are presented and shows that the distribution student’s t is a particular case of the distribution

G T

. Additionally, moments of order r are obtained, including the kurtosis coefficient. In Section 3 derivation of the moment and maximum likelihood estimators are discussed. A simulation study is presented to illustrate the behavior of the estimator of the parameters

μ

,

σ

, and q, for

ν = 8

. Section 4 results of using the proposed model in two real applications are reported. Section 5 presents quantile regression. Section 6 presents the main conclusions.

2. The Generalized Student’s t Distribution

We present the generalized student’s t distribution with heavier tails compared to similar distributions. Initially we will present its density function.

2.1. Density Function

We will use the stochastic representation

\begin{matrix} Y & = & σ \frac{W}{{(V / ν)}^{1 / q}} + μ, \end{matrix}

(13)

where W is distributed standard normal, V is distributed chi square, with

ν

degrees of freedom, W and V are independent random variables,

μ

,

σ

are location and scale parameters, respectively,

ν

degrees of freedom and

q > 0

is the parameter related to the distribution kurtosis.

We use the notation

Y \sim G T (μ, σ, ν, q)

, and for the standard case, we denote

X \sim G T (0, 1, ν, q)

.

Proposition 1.

Let

Y \sim G T (μ, σ, ν, q)

. Then, the pdf of Y is given by

f_{Y} ((y; μ, σ, ν, q) = \{\begin{matrix} \frac{1}{σ 2^{(ν / 2)} ν^{1 / q} Γ (ν / 2) \sqrt{2 π}} \int_{0}^{\infty} t^{\frac{ν - 2}{2} + \frac{1}{q}} e^{- \frac{1}{2} [{(\frac{y - μ}{σ})}^{2} {(t / ν)}^{2 / q} + t]} d t & y \neq μ \\ \frac{Γ (\frac{ν}{2} + \frac{1}{q})}{σ {(ν / 2)}^{1 / q} Γ (ν / 2) \sqrt{2 π}} & y = μ . \end{matrix}

(14)

Proof.

Since W and V are two independent random variables, such that

W \sim N (0, 1)

and

V \sim χ_{(ν)}^{2}

, then the joint pdf of

(Y, T) = (σ W / {(V / ν)}^{1 / q} + μ, V)

is

f_{(Y, T)} (y, t, μ, σ, ν, q) = \frac{1}{σ 2^{(ν / 2)} ν^{1 / q} Γ (ν / 2) \sqrt{2 π}} t^{\frac{ν - 2}{2} + \frac{1}{q}} e^{- \frac{1}{2} [{(\frac{y - μ}{σ})}^{2} {(t / ν)}^{2 / q} + t]},

where

y \in R

and

t > 0

. By marginalizing the result follows immediately para

y \neq μ

. Doing

y = μ

the other expression is obtained. □

Corollary 1.

If

q = 1

in (14), then la fdp de Y is called the canonical generalized student’s t distribution.

f_{Y} (y; μ, σ, ν, 1) = \{\begin{matrix} \frac{{(\frac{y - μ}{σ})}^{- (\frac{ν}{2} + 2)} 2^{- 3 (1 + \frac{ν}{4})}}{σ \sqrt{2 π}} U [1 + \frac{ν}{4}, \frac{3}{2}, \frac{ν}{{(\frac{y - μ}{σ})}^{2}}] & y \neq μ \\ \frac{1}{σ \sqrt{2 π}} & y = μ, \end{matrix}

(15)

where

U (a, b, x) = \frac{1}{Γ (a)} \int_{0}^{\infty} e^{- x t} t^{a - 1} {(1 + t)}^{b - a - 1} d t

, it is called the second-class hypergeometric confluent function.

Proof.

If

q = 1

in (14), then la fdp de Y is

f_{Y} ((y; μ, σ, ν, 1) = \{\begin{matrix} \frac{1}{σ 2^{(ν / 2)} ν Γ (ν / 2) \sqrt{2 π}} \int_{0}^{\infty} t^{\frac{ν}{2}} e^{- \frac{1}{2} [{(\frac{y - μ}{σ})}^{2} {(t / ν)}^{2} + t]} d t & y \neq μ \\ \frac{1}{σ \sqrt{2 π}} & y = μ . \end{matrix}

(16)

Making

a = ν / 2

and

b = \frac{{(\frac{y - μ}{σ})}^{2}}{ν^{2}}

and making the change of variables

w = \frac{t}{4 a}

and applying the result obtained in Reyes et al. [9]

\int_{0}^{\infty} t^{a} e^{- (\frac{x^{2}}{2} t^{2} - 2 a t)} d t = \frac{a Γ (a + 1)}{2^{a / 2} x^{(a + 2)}} U [1 + \frac{a}{2}, \frac{3}{2}, \frac{a^{2}}{x^{2}}],

where

x = 2 (\frac{y - μ}{σ})

the result is obtained. □

Figure 1 on the left shows the PDFs of the generalized student’s t distribution for q = 1 compared to the Student’s t for

ν = 5

, the normal distribution, the generalized bar t distribution and the double bar distribution. In which, it can be seen that as the variable tends to ∞ to the right (or to the left), the new model captures more data than the other comparative distributions. Furthermore, it is observed that to the extent that q is smaller, the distribution has greater kurtosis.

2.2. Tails Comparison of GT and Student’s t Distributions

In this part, we perform a comparison of the upper tails between the

G T

distribution and student’s t distribution. For this, we consider the canonical version (

q = 1

) of

G T

distribution considering student’s t distribution with

ν = 5

degrees of freedom. Table 1 shows

P (Y > y)

for different values of y in the mentioned distributions. The

G T

distribution has tails much heavier than the student’s t distribution.

Remark 1.

Table 1 illustrates the fact that the generalized student’s t distributions have heavier tails than the tails of the student’s t distribution.

2.3. Compared GT Quantiles with T Quantiles

Figure 2 shows the quantile function of the generalized student’s t distribution compared to quantile function of student’s t for different values of q and

ν = 5

.

Proposition 2.

Let

Y \sim G T (0, 1, ν, q)

. Then an approximation of quantile p of Y is

y_{p} = \{\begin{matrix} \frac{t_{p}}{2 {(\frac{j_{p}}{ν})}^{\frac{q - 2}{2 q}}} [1 + {(\frac{j_{p}}{ν})}^{\frac{q - 2}{q}}] & q < 2 \\ \frac{t_{p}}{{(\frac{j_{p}}{ν})}^{\frac{q - 2}{2 q}}} & q > 2, \end{matrix}

where

t_{p}

and

j_{p}

denotes the quantiles p of student’s t and chi-square distribution whit ν degrees of freedom.

Proof.

Y = \frac{Z}{{(\frac{J}{ν})}^{\frac{1}{q}}} = \frac{Z}{{(\frac{J}{ν})}^{\frac{1}{2}}} \frac{{(\frac{J}{ν})}^{\frac{1}{2}}}{{(\frac{J}{ν})}^{\frac{1}{q}}} = T {(\frac{J}{ν})}^{\frac{2 - q}{2 q}}

⟹ y_{p} \approx t_{p} {(\frac{J_{p}}{ν})}^{\frac{2 - q}{2 q}}

.

S i

q < 2 ⟹ y_{p} \approx t_{p} [\frac{{(\frac{J_{p}}{ν})}^{\frac{2 - q}{2 q}} + {(\frac{J_{p}}{ν})}^{\frac{q - 2}{2 q}}}{2}]

.

S i

q > 2 ⟹ y_{p} \approx \frac{t_{p}}{{(\frac{J_{p}}{ν})}^{\frac{q - 2}{2 q}}}

. □

Figure 3 shows the quantiles of the generalized student’s t distribution compared to quantile of proposition 2 for values

q = 1

and

ν = 5

.

Properties:

If $q = 2$ then $y_{p} = t_{p}$ ;
if $ν \to \infty$ then $y_{p} = z_{p}$ where $z_{p}$ is the quantile p of standard normal distribution.

In Table 2 we present quantiles generalized student’s t for n degrees of freedom and q = 1.

2.4. Properties of the Generalized Student’s t Distribution

In this section, we present some properties of the generalized student’s t distribution.

Proposition 3.

Let

Y \sim G T (μ, σ, ν, q)

then

1.: $lim_{q \to \infty} f_{Y} (y; μ, σ, ν, q) = \frac{1}{σ} ϕ (\frac{y - μ}{σ})$ .
2.: If $Y | V = v \sim N (μ, v^{- 2 / q} σ^{2})$ and $V \sim χ_{(ν)}^{2}$ then $Y \sim G T (μ, σ, ν, q)$ .
3.: If $Y \sim G T (0, 1, ν, 2)$ , then, $Y \sim t_{(ν)}$ .

Proof.

Making q tend to infinity in representation (13), the result is immediately obtained;
$f_{Y} (y; μ, σ, ν, q) = \int_{0}^{\infty} ϕ (y; μ, v^{- 1 / q} σ) f_{V} (v) d v = \int_{0}^{\infty} \frac{v^{1 / q}}{σ} ϕ (\frac{y - μ}{σ v^{- 1 / q}}) f_{V} (v) d v$ . where $f_{V}$ es la fdp chi-square distribution with $ν$ degrees of freedom. The result follows using transformation $t = v^{1 / q}$ and direct integral computations;
Making $q = 2$ we obtain the density student’s with $ν$ degrees of freedom.

□

Remark 2.

Proposition 3 shows first that the generalized student’s t distribution contains the normal distribution as a special case (

q \to \infty

). Moreover, it also shows that the generalized student’s t distribution is a scale mixture between the normal and the chi-square distribution with ν degrees of freedom. The third property shows that for

q = 2

, the density function for the generalized student’s t coincides with the density function of the student’s t distribution with ν degrees of freedom.

2.5. Moments

In this subsection the moments of the generalized student’s t distribution are deduced.

Proposition 4.

Let

X \sim G T (0, 1, ν, q)

and

Y \sim G T (μ, σ, ν, q)

. Hence, for

r = 1, 2, 3, . . . .

and

q > 2 r / ν

, we have that

μ_{2 r} = E (X^{2 r}) = \frac{ν^{\frac{2 r}{q}} 2^{2 r \frac{q + 1}{q}} (2 r)! Γ (\frac{ν}{2} - \frac{2 r}{q})}{r! Γ (ν / 2)} μ_{2 r - 1} = E (X^{2 r - 1}) = 0

and

\begin{matrix} E (Y^{r}) & = & \sum_{k = 0}^{r} (\begin{matrix} r \\ k \end{matrix}) σ^{k} μ^{r - k} μ_{k} . \end{matrix}

Proof.

Representation (13) with

μ = 0

and

σ = 1

, and since W and V are independent, we have that

μ_{2 r} = E (X^{2 r}) = E ({(\frac{W}{{(V / ν)}^{1 / q}})}^{2 r}) = E (W^{2 r}) E ({(V / ν)}^{- 2 r / q}) .

Moreover, since

E ({(V / ν)}^{- 2 r / q}) = ν^{2 r / q} E (V^{- 2 r / q}) = ν^{2 r / q} 2^{- 2 r / q} \frac{Γ (\frac{ν}{2} - \frac{2 r}{q})}{2^{2 r / q} Γ (ν / 2)}, q > 2 r / ν

and

E (W^{2 r}) = \frac{(2 r)!}{2^{r} r!}

are even moments for the standard normal distribution, the second result follows directly by applying the formula to the stochastic representation (13). □

Corollary 2.

Let

Y \sim G T (μ, σ, ν, q)

, and hence,

E (Y) = μ a n d V a r (Y) = \frac{2 σ^{2} ν^{2 / q} 2^{2 \frac{q + 2}{q}} Γ (\frac{ν}{2} - \frac{2}{q})}{Γ (ν / 2)}, q > 4 / ν .

(17)

Proposition 5.

Let

Y \sim G T (μ, σ, ν, q)

, so that the coefficient of skewness and kurtosis are:

γ_{1} = 0

(18)

and

β_{2} = \frac{3 Γ (ν / 2) Γ (\frac{ν}{2} - \frac{4}{q})}{Γ^{2} (\frac{ν}{2} - \frac{2}{q})}, q > 8 / ν .

(19)

Proof.

The standardized coefficient of skewness and kurtosis are

γ_{1} = \frac{μ_{3} - 3 μ_{1} μ_{3} + 2 μ_{1}^{3}}{{(μ_{2} - μ_{1}^{2})}^{3 / 2}}

and

β_{2} = \frac{μ_{4} - 4 μ_{1} μ_{3} + 6 μ_{1}^{2} μ_{2} - 3 μ_{1}^{4}}{{(μ_{2} - μ_{1}^{2})}^{2}}

and the result follows after replacing the even moments derived in Proposition 4. □

Figure 4 shows the kurtosis the

G T

distribution compared with T distribution for different values of q and

ν = 8

.

It can be seen that the generalized student’s distribution has a greater kurtosis than the student’s distribution for q less than 2, then for data with high kurtosis, it would be recommended to use the generalized student’s distribution.

3. Inference

3.1. Moment Estimators

In the following proposition we present the moment estimators of

μ

,

σ

, and q for

ν = 8

.

Proposition 6.

Where

Y_{1}, \dots, Y_{n}

a random sample from the distribution of the random variable

Y \sim G T (μ, σ, ν, q)

, so that the moment estimators of

θ = (μ, σ, ν, q)

for

q > 1

are given by

{\hat{μ}}_{M} = \bar{Y}, {\hat{σ}}_{M} = {(\frac{Γ (ν / 2) S^{2}}{2 ν^{2 / {\hat{q}}_{M}} 2^{2 \frac{q + 2}{q}} Γ (\frac{ν}{2} - \frac{2}{{\hat{q}}_{M}})})}^{1 / 2} a n d γ_{2} = \frac{3 Γ (ν / 2) Γ (\frac{ν}{2} - \frac{4}{{\hat{q}}_{M}})}{Γ^{2} (\frac{ν}{2} - \frac{2}{{\hat{q}}_{M}})}, ν > \frac{8}{q} o ν > 8 a n d q < 1

where

\bar{Y}

, S and

γ_{2}

are the mean, standard deviation, and sample kurtosis coefficient.

Proof.

Using (17) it follows that

μ = E (Y) a n d σ^{2} = \frac{Γ (ν / 2) V a r (Y)}{2 ν^{2 / q} 4^{\frac{q + 2}{q}} Γ (\frac{ν}{2} - \frac{2}{q})}

(20)

replacing

γ_{2}

in (19) one obtains the numerical equation

γ_{2} = \frac{3 Γ (ν / 2) Γ (\frac{ν}{2} - \frac{4}{{\hat{q}}_{M}})}{Γ^{2} (\frac{ν}{2} - \frac{2}{{\hat{q}}_{M}})}

(21)

and solving (21) for

\hat{q}

and

\hat{ν}

one obtains

{\hat{q}}_{M}

and

{\hat{ν}}_{M}

. Further, replacing in (20) q by

{\hat{q}}_{M}

,

ν

by

{\hat{ν}}_{M}

,

E (Y)

by

\bar{Y}

and

V a r (Y)

by the sample variance

S^{2}

, we obtain the moment estimators

({\hat{μ}}_{M}, {\hat{σ}}_{M}, {\hat{ν}}_{M}, {\hat{q}}_{M})

for

(μ, σ, ν, q)

. □

3.2. Maximum Likelihood Estimation

Given a random sample

Y_{i} \sim G T (μ, σ, ν, q)

, for

i = 1, . ., n

, the log-likelihood function can be written as

l (μ, σ, ν, q) = - n l o g (σ) - \frac{n ν}{2} \log (2) - \frac{n}{q} l o g (ν) - n l o g (Γ (ν / 2)) - \frac{n}{2} l o g (2 π) + \sum_{i = 1}^{n} l o g G (y_{i})

(22)

where

G (y_{i}) = G (y_{i}; μ, σ, ν, q) = \int_{0}^{\infty} v^{\frac{ν - 2}{2} + \frac{1}{q}} e^{- \frac{1}{2} [{(\frac{y_{i} - μ}{σ})}^{2} {(\frac{v}{ν})}^{\frac{2}{q}} + v]} d v

and hence the maximum likelihood equations are given by

\begin{matrix} \sum_{i = 1}^{n} \frac{G_{1} (y_{i})}{G (y_{i})} & = & 0 \end{matrix}

(23)

\begin{matrix} \sum_{i = 1}^{n} \frac{G_{2} (y_{i})}{G (y_{i})} & = & \frac{n}{σ} \end{matrix}

(24)

\begin{matrix} \sum_{i = 1}^{n} \frac{G_{3} (y_{i})}{G (y_{i})} & = & \frac{n l o g (2)}{2} + \frac{n}{q ν} + \frac{n Ψ (ν / 2)}{2} \end{matrix}

(25)

\begin{matrix} \sum_{i = 1}^{n} \frac{G_{4} (y_{i})}{G (y_{i})} & = & - \frac{n l o g (ν)}{q^{2}} \end{matrix}

(26)

where,

G_{1} (y_{i}) = \frac{\partial}{\partial μ} G (y_{i})

,

G_{2} (y_{i}) = \frac{\partial}{\partial σ} G (y_{i})

,

G_{3} (y_{i}) = \frac{\partial}{\partial ν} G (y_{i})

.

G_{4} (y_{i}) = \frac{\partial}{\partial q} G (y_{i})

. The expressions for

G_{1} (y_{i})

,

G_{2} (y_{i})

,

G_{3} (y_{i})

and

G_{4} (y_{i})

should be given,

\begin{matrix} G_{1} (y_{i}) & = & \frac{1}{σ 2 ν^{\frac{1}{q}}} \int_{0}^{\infty} (y_{i} - μ) t_{i} (ν) d v \end{matrix}

(27)

\begin{matrix} G_{2} (y_{i}) & = & \frac{1}{σ 3 ν^{\frac{2}{q}}} \int_{0}^{\infty} {(y_{i} - μ)}^{2} t_{i} (ν) d v \end{matrix}

(28)

\begin{matrix} G_{3} (y_{i}) & = & \frac{1}{q σ 2 ν} \int_{0}^{\infty} [{\frac{v}{ν}}^{2 / q} {(y_{i} - μ)}^{2} + q σ^{2} ν \log (v) t_{i} (ν) d v \end{matrix}

(29)

\begin{matrix} G_{4} (y_{i}) & = & - \frac{1}{σ 2 q^{2}} \int_{0}^{\infty} [σ^{2} \log (v) - \log (v / q) {(v / q)}^{2 / q} {(y_{i} - μ)}^{2}] t_{i} (ν) d v, \end{matrix}

(30)

where

t_{i} (ν) = v^{\frac{ν - 2}{2} + \frac{1}{q}} e^{- \frac{1}{2} [{(\frac{y_{i} - μ}{σ})}^{2} {(\frac{v}{ν})}^{\frac{2}{q}} + v]}

.

Using numerical procedures Equations (27)–(30) can be solved.

Proposition 7.

Let

Y_{1}, \dots, Y_{n}

a random sample from the distribution of random variable

Y \sim G T (μ, σ, ν, q)

. Then,

Y = (\frac{\bar{Y} - μ}{S^{2 / q} σ^{1 - 2 / q}}) \sqrt{n} \sim G T (0, 1, ν, q)

(31)

Proof.

The random variable Z and T

Z = \frac{\bar{Y} - μ}{σ \sqrt{n}} \sim N (0, 1)

T = \frac{(n - 1) S^{2}}{σ^{2}} \sim χ_{(n - 1)}^{2}

then

Y = \frac{Z}{{(T / (n - 1))}^{1 / q}} \sim G T (0, 1, ν, q)

replacing the result is obtained. □

Proposition 8.

Let

Y_{1}, \dots, Y_{n}

a random sample from the distribution of random variable

Y \sim G T (μ, σ, ν, q)

. Then, a level

(1 - α)

confidence interval for the population mean is

[\bar{Y} - t_{1 - α / 2}^{'} \frac{S^{2 / q} σ^{1 - 2 / q}}{\sqrt{n}}, \bar{Y} + t_{1 - α / 2}^{'} \frac{S^{2 / q} σ^{1 - 2 / q}}{\sqrt{n}}],

where

t_{1 - α / 2}^{'}

is the percentile of order

1 - \frac{α}{2}

of GT distribution.

Proof.

The result is obtained from the previous proposition. □

3.3. Simulation Study

To generate random numbers from the

G T (μ, σ, 8, q)

distribution we will use the stochastic representation given in (13) and the following algorithm:

Simulate $Z \sim N (0, 1);$
Simulate $V \sim χ^{2} (ν);$
Compute $Y = σ \frac{Z}{{(V / ν)}^{1 / q}} + μ .$

It then follows that

Y \sim G T (μ, σ, ν, q)

.

Table 3 shows the parameter estimates obtained by the maximum likelihood method (MLE) through 1000 replicates of sizes 50, 100, 150, and 200 with their corresponding standard errors, mean length of the interval, and empirical coverage.

4. Two Illustrative Datasets

Illustrative Datasets 1

We consider the data that were first presented in Jander [16], from an entomology experiment. with respect to ants. A total of

n = 730

ants were individually placed in the center of an arena. The measurements correspond to the initial direction in which they moved relative to a visual stimulus in a 180 degree angle from zero direction, rounded to the nearest 10 grades. Figure 5 depicts the histogram of these data, including estimated densities under a T,

E S

,

M S

,

S G T

,

D S L

and

G T

model, using maximum likelihood. Figure 6 shows the qqplots for T,

E S

,

M S

and

G T

models. We use the AIC (Akaike Information Criterion), which penalizes the maximized likelihood function by the excess of model parameters (AIC = −2log(lik) + 2k, where k is the number of unknown parameters being estimated, see Akaike [17]). Table 4 shows the descriptive statistics of the database, while Table 5 presents the Kolmogorov -Smirnov (KSS) statistic, corresponding values for the four given models, which also indicates that the best fit is presented by the

G T

model. Table 6 shows a 95% confidence interval for the population mean using generalized Student’s t-quantiles. Moreover, Figure 7 depicts the empirical cumulative distribution function (cdf) and the estimated cdfs for T,

E S

,

M S

and

G T

models.

The estimators of moments for the dataset are:

${\hat{μ}}_{M} = 170.438$ ;
${\hat{σ}}_{M} = 47.551$ ;
${\hat{ν}}_{M} = 9.3458$ ;
${\hat{q}}_{M} = 0.4868$ ,

which will be used as starting points in obtaining the EMVs.

Figure 8 depicts the histogram of these data, including estimated densities under a

S G T

,

D S L

and

G T

model, using maximum likelihood. We use the Akaike information criterion (AIC) and Bayesian Information Criterion (BIC), see Schwarz [18], which is defined as (BIC =

- 2 l o g (l i k) + k l o g (n)

, where k is the number of estimated parameters and n is the sample size. Table 7 shows these results.

5. Quantile Regression

The quantile regression is used when the study objective focuses on the estimation of the different percentiles (such as the median) of a population of interest. An advantage of using quantile regression to estimate the median, rather than ordinary least squares regression current file (to estimate the mean), is that the quantile regression will be more robust in the presence of outliers. Quantile regression can be seen as a natural analogue in regression analysis when using different measures of central tendency and dispersion, in order to obtain a more complete and robust analysis of the data. Another advantage of this type of regression lies in the possibility of estimating any quantile, thus being able to assess what happens with extreme values of the population.

5.1. Quantile Regression Uni-Dimensional

Translating this concept of quantile to the regression line, we obtain the linear quantile regression.

If we assume that

Y_{i} = β_{0, τ} + β_{1, τ} X_{i} + ϵ_{i, τ},

\forall i ϵ (1, . . ., n)

with

τ ϵ (0, 1)

and that the conditional expected value is not necessarily zero, but the

τ

-ésimo quantile of the error with respect to the regressive variable is zero

(Q_{τ} (ϵ_{i, τ} / X) = 0)

, then the

τ

-ésimo quantile of

Y_{i}

with respect to X can be written as

Q_{τ} (Y_{i} / X) = β_{0, τ} + β_{1, τ} X_{i}

The estimates of

β_{0, τ}

y

β_{1, τ}

are found by

\hat{β_{τ}} = arg min_{β_{τ} ϵ ℜ^{2}} (\sum_{Y_{i} \geq A} τ | Y_{i} - β_{0, τ} - β_{1, τ} X_{i} | + \sum_{Y_{i} < A} (1 - τ) | Y_{i} - β_{0, τ} - β_{1, τ} X_{i} |),

(32)

being

β_{τ} = (β_{0, τ}, β_{1, τ})

y

A = β_{0, τ} + β_{1, τ} X_{i}

.

To estimate the parameters, the function described in the equation should be minimized. For this, there is a way to approach the minimization problem as a linear programming problem. This allows us to obtain the regression line for the value of a certain quantile. Therefore, the first of the limitations will be solved raised at the end of the previous section, for simple linear regression. Furthermore, since the quartiles have robust properties, it is also possible to solve the second of the limitations that arose with the classical regression line.

5.2. Quantile Regression Student’s t

In this case, in the regression equation

Y_{i} = β_{0, τ} + β_{1, τ} X_{i} + ϵ_{i, τ},

\forall i ϵ (1, . . ., n)

the response variable

Y \sim T (μ, σ, ν)

, it is possible to generate random numbers for the

T (μ, σ, ν)

distribution, which the parameters

μ

,

σ

and

ν

they are estimated using maximum likelihood for the data. Then, one way to obtain the quantiles of Y is using the stochastic representation.

Simulate $W \sim N (0, 1)$ ;
Simulate $T \sim χ^{2} (ν);$
Compute $Y_{1} = σ (\frac{W}{{(T / ν)}^{1 / 2}}) + μ .$

Using this new variable

Y_{1}

quantile regression is applied to the data

(X, Y_{1})

.

5.3. Quantile Regression Slash Logistic

In this case, in the regression equation

Y_{i} = β_{0, τ} + β_{1, τ} X_{i} + ϵ_{i, τ},

\forall i ϵ (1, . . ., n)

the response variable

Y \sim G S L O G (μ, σ, q)

, it is possible to generate random numbers for the

S L O G (μ, σ, q)

distribution, which the parameters

μ

,

σ

, and q they are estimated using maximum likelihood for the data. Then, one way to obtain the quantiles of Y is using the stochastic representation.

Simulate $W \sim U (0, 1)$ ;
Compute $T = μ + σ \log (\frac{W}{1 - W})$ ;
Simulate $U \sim U (0, 1)$ ;
Compute $Y_{2} = \frac{T}{U^{1 / q}}$ .

Using this new variable

Y_{2}

quantile regression is applied to the data

(X, Y_{2})

.

5.4. Quantile Regression Generalized Student’s t

In this case, in the regression equation

Y_{i} = β_{0, τ} + β_{1, τ} X_{i} + ϵ_{i, τ},

\forall i ϵ (1, . . ., n)

the response variable

Y \sim G T (μ, σ, ν, q)

, it is possible to generate random numbers for the

G T (μ, σ, ν, q)

distribution, which the parameters

μ

,

σ

,

ν

, and q they are estimated using maximum likelihood for the data. Then, one way to obtain the quantiles of Y is using the stochastic representation given in (13)

Simulate $W \sim N (0, 1)$ ;
Simulate $T \sim χ^{2} (ν);$
Compute $Y_{3} = σ (\frac{W}{{(T / ν)}^{1 / q}}) + μ .$

Using this new variable

Y_{3}

quantile regression is applied to the data

(X, Y_{3})

.

5.5. Application 2

We consider now data concerning the body mass index and Lean Body Mass of 202 Australian athletes. The data are available for download at http://azzalini.stat.unipd.it/SN/index.html (accessed on 15 October 2021). Table 8 shows statistics for these data for which the maximum likelihood estimators of (

β_{0}

,

β_{1}

) and its corresponding coefficients AIC and BIC fit models for data. are shown in Table 9 and Table 10, respectively.

In Figure 9 the quantile regression of the data is shown using the T,

S L O G

and

G T

models.

6. Discussion

We have introduced a new distribution called the generalized student’s t distribution (GT). The main idea is to replace the exponent

1 / 2

of the chi-square distribution by a exponent

1 / q

where

q > 0

is the kurtosis parameter. We consider the density function of the distribution and study some of its properties, as well as its moments. The parameter estimation was analyzed using the method of moments and maximum likelihood estimation. We present two illustrations, in the first a set of real data are studied where we show that the GT distribution fits the data better than the T, ES,

M S

,

S G T

, and

D S L

distributions. In the other application, we use quantile regression to fit a linear model to a paired dataset where the response variable shows high kurtosis where it is shown that the

G T

distribution fits better than the T and

S L O G

distributions to model the residuals.

Author Contributions

Data curation, J.R., M.A.R. and J.A.; formal analysis, J.R., M.A.R. and J.A.; investigation, J.R., M.A.R. and J.A.; methodology, J.R., M.A.R. and J.A.; writing—original draft, J.R., M.A.R. and J.A.; writing—review and editing, M.A.R. and J.A.; Funding Acquisition, J.R., M.A.R. and J.A. All authors have read and agreed to the published version of the manuscript.

Funding

Research of J.R., M.R. and J.A. was supported by Universidad de Antofagasta through project SEMILLERO UA 2021.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

The authors would like to thank the referee for his/her constructive suggestions that improved the final version of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Rogers, W.H.; Tukey, J.W. Understanding Some Long-Tailed Symmetrical Distributions. Stat. Neerl. 1972, 26, 211–226. [Google Scholar] [CrossRef]
Mosteller, F.; Tukey, J.W. Data Analysis and Regression; Addison-Wesley: Boston, MA, USA, 1977. [Google Scholar]
Kafadar, K.A. Biweight Approach to the One-Sample Problem. J. Am. Stat. Assoc. 1982, 77, 416–424. [Google Scholar] [CrossRef]
Wang, J.; Genton, M.G. The multivariate skew-slash distribution. J. Stat. Plan. Inference 2006, 136, 209–220. [Google Scholar] [CrossRef]
Gómez, H.W.; Quintana, F.A.; Torres, F.J. A New Family of Slash-Distributions with Elliptical Contours. Stat. Probab. Lett. 2008, 77, 717–725, Erratum in Gómez, H.W.; Venegas, O. Stat. Probab. Lett. 2008, 78, 2273–2274. [Google Scholar] [CrossRef]
Arslan, O. An Alternative Multivariate Skew-Slash Distribution. Stat. Probab. Lett. 2008, 78, 2756–2761. [Google Scholar] [CrossRef]
Genc, A.I. A Generalization of the Univariate Slash by a Scale-Mixture Exponential Power Distribution. Commun. Stat. Simul. Comput. 2007, 36, 937–947. [Google Scholar] [CrossRef]
Gómez, H.W.; Olivares-Pacheco, J.F.; Bolfarine, H. An Extension of the Generalized Birnbaum-Saunders Distribution. Stat. Probab. Lett. 2009, 79, 331–338. [Google Scholar] [CrossRef]
Reyes, J.; Gómez, H.W.; Bolfarine, H. Modified slash distribution. Statistics 2013, 47, 929–941. [Google Scholar] [CrossRef]
Rojas, M.A.; Bolfarine, H.; Gómez, H.W. An extension of the slash-elliptical distribution. Stat. Oper. Res. Trans. (SORT) 2014, 38, 215–230. [Google Scholar]
Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; Wiley: New York, NY, USA, 1988. [Google Scholar]
Li, R.; Nadarajah, S. A review of Student’s t distribution and its generalizations. Empir. Econ. 2020, 58, 1461–1490. [Google Scholar] [CrossRef] [Green Version]
El-Bassiouny, A.H.; El-Morshedy, M. The Univarite and Multivariate Generalized Slash Student Distribution. Int. J. Math. Its Appl. 2015, 3, 3547. [Google Scholar]
El-Morshedy, M.; EL-Bassiouny, A.H.; Tahir, M.H.; Eliwa, M.S. Univariate and Multivariate Double Slash Distribution. J. Stat. Appl. Probab. 2020, 9, 459–471. [Google Scholar]
Reyes, J.; Barranco-Chamorro, I.; Gómez, H.W. Generalized modified slash distribution with applications. Commun. Stat.-Theory Methods 2020, 49, 2025–2048. [Google Scholar] [CrossRef]
Jander, R. Die Optische Richtungsorientierung der RotenWaldameise (Formica rufa L.). Z. Vgl. Physiol. 1957, 40, 162–238. [Google Scholar] [CrossRef]
Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
Schwarz, G.E. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]

Figure 1. Generalized student’s pdf with

q = 1

(solid line), student’s for

ν = 5

pdf (dotted line), Normal pdf (dashed line),

G S L T

(dashed and dotted line) and

D S L

(thick dashed line) (left), and tails comparison (right).

Figure 1. Generalized student’s pdf with

q = 1

(solid line), student’s for

ν = 5

pdf (dotted line), Normal pdf (dashed line),

G S L T

(dashed and dotted line) and

D S L

(thick dashed line) (left), and tails comparison (right).

Figure 2. Quantile function of the generalized student’s t distribution compared to quantile function of the student’s t for

ν = 5

for

p = 0.975

(left) and

p = 0.95

(right).

Figure 2. Quantile function of the generalized student’s t distribution compared to quantile function of the student’s t for

ν = 5

for

p = 0.975

(left) and

p = 0.95

(right).

Figure 3. Densidad de

G T

evaluate in quantile theoretical compared to quantile, proposition 2 (upper), and qqplot (under).

Figure 3. Densidad de

G T

evaluate in quantile theoretical compared to quantile, proposition 2 (upper), and qqplot (under).

Figure 4. Kurtosis of the

G T

distribution compared with T distribution for

ν = 8

.

Figure 4. Kurtosis of the

G T

distribution compared with T distribution for

ν = 8

.

Figure 5. Histogram (left) and Comparison the tails (right) for ants dataset. Overlaid on top is the generalized student’s t density with parameters estimated via ML (solid line), the modified slash density (dashed line), the extended slash density (dotted line), the student’s t density (dashed line).

Figure 6. Q-q plots: student’s t (a), modified slash (b), extended slash (c), generalized student’s t (d).

Figure 7. Empirical cdf with estimated T c.d.f. (yellow color),estimated

M S

cdf (red color), estimated

E S

c.d.f. (green color), and estimated

G T

c.d.f. (blue color).

Figure 7. Empirical cdf with estimated T c.d.f. (yellow color),estimated

M S

cdf (red color), estimated

E S

c.d.f. (green color), and estimated

G T

c.d.f. (blue color).

Figure 8. Histogram (left) and comparison the tails (right) for ants dataset. Overlaid on top is the generalized student’s t density with parameters estimated via ML (solid line), the modified slash density (dashed line), the extended slash density (dotted line),the student’s t density (dashed line).

Figure 9. Quantile regression for BMI and LBM data with student’s t distribution (left), slash logistic distribution (center) and generalized student’s t distribution (right).

Table 1. Tails comparison GT distributions and student’s t distribution.

Distribution	$P (Y > 3)$	$P (Y > 4)$	$P (Y > 5)$	$P (Y > 10)$
$T (5)$	0.0150	0.0052	0.0021	0.0001
$G T (5)$	0.0301	0.0103	0.0041	0.0002

Table 2. Table of quantiles generalized student’s t for

ν

degrees of freedom and

q = 1

.

Table 2. Table of quantiles generalized student’s t for

ν

degrees of freedom and

q = 1

.

$ν$	${GT}_{0.60}$	${GT}_{0.70}$	${GT}_{0.80}$	${GT}_{0.90}$	${GT}_{0.95}$	${GT}_{0.975}$	${GT}_{0.99}$	${GT}_{0.995}$
1	0.330	0.727	1.419	3.467	7.798	17.074	47.159	100.682
2	0.289	0.620	1.091	2.052	3.371	5.252	9.096	13.578
3	0.277	0.587	1.002	1.749	2.628	3.710	5.583	7.453
4	0.271	0.571	0.960	1.619	2.334	3.149	4.442	5.631
5	0.267	0.562	0.936	1.546	2.176	2.861	3.891	4.791
6	0.265	0.556	0.920	1.499	2.078	2.687	3.569	4.314
7	0.263	0.551	0.909	1.467	2.011	2.570	3.358	4.006
8	0.262	0.548	0.900	1.443	1.962	2.486	3.209	3.792
9	0.261	0.545	0.894	1.425	1.925	2.422	3.098	3.634
10	0.260	0.543	0.889	1.410	1.896	2.373	3.012	3.513
11	0.260	0.542	0.884	1.398	1.872	2.333	2.944	3.418
12	0.259	0.540	0.881	1.389	1.853	2.300	2.888	3.340
13	0.259	0.539	0.878	1.380	1.836	2.273	2.842	3.276
14	0.258	0.538	0.875	1.373	1.823	2.250	2.803	3.221
15	0.258	0.537	0.873	1.367	1.811	2.230	2.769	3.175
16	0.258	0.536	0.871	1.362	1.800	2.213	2.740	3.135
17	0.257	0.536	0.870	1.357	1.791	2.197	2.715	3.100
18	0.257	0.535	0.868	1.353	1.783	2.184	2.692	3.070
20	0.257	0.534	0.865	1.346	1.769	2.161	2.655	3.018
21	0.257	0.534	0.864	1.343	1.763	2.152	2.639	2.996
22	0.256	0.533	0.863	1.340	1.758	2.143	2.624	2.976
23	0.256	0.533	0.862	1.338	1.753	2.135	2.611	2.958
24	0.256	0.532	0.861	1.335	1.748	2.127	2.599	2.942
25	0.256	0.532	0.861	1.333	1.744	2.121	2.588	2.927
26	0.256	0.532	0.860	1.331	1.740	2.114	2.577	2.913
27	0.256	0.532	0.859	1.329	1.737	2.109	2.568	2.900
28	0.256	0.531	0.859	1.328	1.734	2.103	2.559	2.888
29	0.256	0.531	0.858	1.326	1.730	2.098	2.551	2.877
30	0.256	0.531	0.858	1.325	1.728	2.094	2.544	2.867

Table 3. Simulation of 1000 iterations of the model

G T (μ, σ, 8, q)

.

Table 3. Simulation of 1000 iterations of the model

G T (μ, σ, 8, q)

.

n	$μ$	$σ$	q	$\hat{μ}$	$sd (\hat{μ})$	$ali (\hat{μ})$	$c (\hat{μ})$	$\hat{σ}$	$sd (\hat{σ})$	$ali (\hat{σ})$	$c (\hat{σ})$	$\hat{q}$	$sd (\hat{q})$	$ali (\hat{q})$	$c (\hat{q})$
50	0.5	1	1	0.4992	0.1665	0.6527	96.10	0.9958	0.1760	0.6899	94.80	1.1558	0.5080	1.9914	92.80
100				0.5018	0.1148	0.4502	94.50	1.0012	0.1237	0.4851	94.30	1.0961	0.3319	1.3009	94.20
150				0.5045	0.0965	0.3785	95.50	1.0016	0.0967	0.3791	95.20	1.0542	0.2576	1.0098	95.00
200				0.5003	0.0801	0.3138	95.80	1.0018	0.0822	0.3221	95.10	1.0442	0.1908	0.7481	94.70
50	1	1	1	1.0002	0.1649	0.6462	95.90	0.9963	0.1723	0.6756	94.90	1.1580	0.5084	1.9931	92.80
100				1.0007	0.1190	0.4664	95.10	1.0003	0.1277	0.5005	94.80	1.0948	0.3340	1.3094	94.10
150				1.0045	0.0966	0.3785	95.50	1.0016	0.0967	0.3790	95.20	1.0540	0.2575	1.0093	95.00
200				1.0003	0.0801	0.3138	95.80	1.0018	0.0822	0.3222	95.10	1.0442	0.1908	0.7479	94.70
50	1	2	1	0.9998	0.3311	1.2979	96.00	1.9893	0.3506	1.3743	94.90	1.1511	0.4939	1.9363	92.90
100				1.0037	0.2290	0.8978	94.60	2.0035	0.2467	0.9670	94.70	1.0964	0.3247	1.2729	93.80
150				1.0094	0.1929	0.7560	95.50	2.0043	0.1935	0.7584	94.90	1.0552	0.2548	0.9989	95.20
200				1.0004	0.1601	0.6276	95.80	2.0044	0.1639	0.6425	94.90	1.0445	0.1893	0.7420	94.70
50	1	3	1	0.9337	0.5396	2.1153	97.70	2.7991	0.8856	3.4714	93.60	1.0815	0.5529	2.1675	94.80
100				1.0042	0.3448	1.3516	94.70	3.0016	0.3818	1.4966	94.90	1.0970	0.3373	1.3222	94.10
150				1.0135	0.2893	1.1342	95.10	3.0082	0.2903	1.1381	95.10	1.0568	0.2563	1.0045	95.00
200				0.9997	0.2416	0.9472	95.70	3.0059	0.2630	1.0308	96.60	1.0456	0.1950	0.7643	95.20
50	0.5	0.5	1	0.5000	0.0825	0.3235	95.90	0.4986	0.0873	0.3422	94.70	1.1629	0.5318	2.0848	93.60
100				0.5003	0.0577	0.2261	94.70	0.5006	0.0620	0.2431	94.40	1.0954	0.3285	1.2878	94.10
150				0.5020	0.0482	0.1889	95.30	0.5007	0.0484	0.1899	94.80	1.0544	0.2573	1.0086	95.00
200				0.5000	0.0400	0.1567	96.00	0.5011	0.0412	0.1617	95.00	1.0445	0.1935	0.7585	94.60
50	1	0.5	1	1.0000	0.0825	0.3236	95.90	0.4987	0.0872	0.3420	94.70	1.1662	0.5413	2.1219	93.60
100				1.0004	0.0576	0.2260	94.60	0.5006	0.0620	0.2431	94.40	1.0960	0.3286	1.2880	94.10
150				1.0020	0.0482	0.1890	95.50	0.5008	0.0483	0.1895	94.70	1.0547	0.2572	1.0081	95.00
200				0.9999	0.0400	0.1567	95.90	0.5010	0.0414	0.1621	94.90	1.0461	0.1955	0.7663	94.70
50	1	0.5	0.5	1.0002	0.0695	0.2725	95.40	0.5045	0.1138	0.4462	94.70	0.5252	0.1355	0.5312	95.10
100				1.0005	0.0479	0.1879	94.00	0.5003	0.0798	0.3127	94.80	0.5131	0.0750	0.2941	94.80
150				1.0019	0.0389	0.1527	95.70	0.5022	0.0609	0.2388	94.10	0.5070	0.0587	0.2301	94.90
200				1.0003	0.0327	0.1281	95.20	0.5016	0.0544	0.2131	96.10	0.5070	0.0520	0.2038	94.60
50	0.5	0.5	0.5	0.5001	0.0695	0.2724	95.40	0.5040	0.1139	0.4467	94.70	0.5249	0.1355	0.5313	95.10
100				0.5007	0.0481	0.1885	94.20	0.5011	0.0800	0.3137	94.80	0.5135	0.0752	0.2949	94.80
150				0.5020	0.0390	0.1529	95.70	0.5021	0.0610	0.2393	94.20	0.5070	0.0587	0.2303	94.90
200				0.5001	0.0329	0.1290	95.10	0.5016	0.0544	0.2132	96.00	0.5073	0.0520	0.2038	94.60

s d

corresponds to the standard deviation, average length of interval (

a l i

) is the average length of the confidence interval and c the empirical coverage of the respective EMV of the parameters, based on a

95 %

confidence interval.

Table 4. Descriptive statistics the for dataset.

n	$\bar{X}$	S	$\sqrt{b_{1}}$	$b_{2}$
730	$176.438$	$62.6434$	$- 0.2057$	$4.6071$

Table 5. Parameter estimates, AIC and KSS values for T,

M S

,

E S

, and

G T

models for the ants dataset.

Table 5. Parameter estimates, AIC and KSS values for T,

M S

,

E S

, and

G T

models for the ants dataset.

Parameter	T	MS	ES	GT
$μ$	181.58 (1.265)	181.67 (1.217)	181.321 (0.094)	181.4824 (1.1466)
$σ$	26.142 (1.712)	16.7 (0.878)	1.336 (0.108)	33.4038 (1.5802)
$ν$	1.47 (0.134)			18.7203 (0.0029)
q		1.50 (0.034)		0.4085 (0.0013)
$α$			1.907 (0.094)
$β$			40.084 (4.719)
AIC	7928.448	7921.282	7914.642	7899.405
KSS	0.1174	0.0781	0.1000	0.0644
p-value	0.0005	0.0117	0.0007	0.4850

Table 6. The 95 percent confidence interval for the mean of dataset using T and

G T

quantiles T.

Table 6. The 95 percent confidence interval for the mean of dataset using T and

G T

quantiles T.

Distribution	Lower Limit	Upper Limit
T	170.5121	182.4633
$G T$	166.8242	186.1511

Table 7. Parameter estimates, AIC and BIC values for

G S L T

,

D S L

and

G T

models for the ants dataset.

Table 7. Parameter estimates, AIC and BIC values for

G S L T

,

D S L

and

G T

models for the ants dataset.

Parameter	$DSL$	$GSLT$	$GT$
$μ$	181.6341 (1.2443)	180.0680 (0.0169)	181.4824 (1.1466)
$σ$	11.9447 (1.10722)	2.5871 (0.0168)	33.4038 (1.5802)
$ν$		2.2523 (0.0168	18.7203 (0.0029)
$q_{1}$	1.6916 (0.2390)	0.4774 (0.0069)	0.4085 (0.0013)
$q_{2}$	1.6911 (0.2788)		0.4085 (0.0013)
$α$	12.9451 (0.0169)
$β$	28.0256 (0.0170)
AIC	7931.313	7915.774	7899.405
BIC	7949.745	7943.333	7902.14

Table 8. Summary statistics for dataset of the body mass index and Lean Body Mass of 202 Australian athletes.

Data	n	$\bar{W}$	$S_{W}$	$\sqrt{β_{1}}$	$β_{2}$
BMI	202	22.9264	2.8664	0.9395	5.1323
LBM	202	64.8767	13.0702	0.3558	2.7326

Table 9. Coefficients AIC and BIC fit models for dataset of the body mass index and Lean Body Mass of 202 Australian athletes for quantile regression student’s t (T), quantile regression slash logistic (SLOG) and quantile regression generalized student’s t (GT).

Coef.	T	SLOG	GT
AIC	915.309	1252.004	904.573
BIC	925.234	1261.928	914.498

Table 10. Parameter estimates and standard deviation values for quantile regression coefficients 50 student’s t (T) and generalized student’s t (

G T

) models for the dataset.

Table 10. Parameter estimates and standard deviation values for quantile regression coefficients 50 student’s t (T) and generalized student’s t (

G T

) models for the dataset.

Distribution	Coef.	Est.	SD	t-Value	$P (> \| t \|)$
T	$β_{0}$	17.5068	1.1938	14.6641	0.0000
T	$β_{1}$	0.0742	0.0172	14.6641	0.0002
$S L O G$	$β_{0}$	8.7411	1.6237	5.3818	0.0000
$S L O G$	$β_{1}$	0.2795	0.0279	9.9866	0.0000
$G T$	$β_{0}$	17.1050	1.2414	13.7781	0.0000
$G T$	$β_{1}$	0.0802	0.0172	4.6665	0.0001

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Reyes, J.; Rojas, M.A.; Arrué, J. A New Generalization of the Student’s t Distribution with an Application in Quantile Regression. Symmetry 2021, 13, 2444. https://doi.org/10.3390/sym13122444

AMA Style

Reyes J, Rojas MA, Arrué J. A New Generalization of the Student’s t Distribution with an Application in Quantile Regression. Symmetry. 2021; 13(12):2444. https://doi.org/10.3390/sym13122444

Chicago/Turabian Style

Reyes, Jimmy, Mario A. Rojas, and Jaime Arrué. 2021. "A New Generalization of the Student’s t Distribution with an Application in Quantile Regression" Symmetry 13, no. 12: 2444. https://doi.org/10.3390/sym13122444

APA Style

Reyes, J., Rojas, M. A., & Arrué, J. (2021). A New Generalization of the Student’s t Distribution with an Application in Quantile Regression. Symmetry, 13(12), 2444. https://doi.org/10.3390/sym13122444

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A New Generalization of the Student’s t Distribution with an Application in Quantile Regression

Abstract

1. Introduction

2. The Generalized Student’s t Distribution

2.1. Density Function

2.2. Tails Comparison of GT and Student’s t Distributions

2.3. Compared GT Quantiles with T Quantiles

2.4. Properties of the Generalized Student’s t Distribution

2.5. Moments

3. Inference

3.1. Moment Estimators

3.2. Maximum Likelihood Estimation

3.3. Simulation Study

4. Two Illustrative Datasets

Illustrative Datasets 1

5. Quantile Regression

5.1. Quantile Regression Uni-Dimensional

5.2. Quantile Regression Student’s t

5.3. Quantile Regression Slash Logistic

5.4. Quantile Regression Generalized Student’s t

5.5. Application 2

6. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI