1. Introduction
The proportional hazard distribution was presented as an extension of the distribution of the minimum of a random sample replacing
with
in a random variable
Z with probability density function (PDF)
f, and cumulative distribution function (CDF)
F. So, this distribution could be considered as a distribution of fractional order statistics ([
1]). Here, we start defining the proportional hazard (PHF) distribution, which was studied by [
2]; see also [
3] where inference under progressively type II right-censored sampling for the PHF distribution is considered. Let
F be a continuous CDF with PDF
, and hazard function
We say that
Z has a PHF distribution associated with
F and
f, if its PDF is of the form
where
is the shape parameter. We use the notation
to refer to this distribution. The CDF of the PHF distribution is given by
The hazard function of this model is
; that is, the proportional hazard function of the PDF
f. From Equation (
2), we can use the inversion method for generating a random variable with PHF distribution; that is, if
, i.e., a uniform distribution on the interval
, then the random variable
is distributed according to the PHF distribution with parameter vector
, where
denotes the inverse of
. The distributional properties of the PHF distribution, as well as inferential procedures and information matrix for the model parameters are studied by [
2].
The location-scale extension of model (
1) is obtained using the linear transformation
, where
is a location parameter,
is a scale parameter, and
. The PDF of the random variable
X is given by
If a random variable
X follows the model (
3), it is denoted by
. If
and
, where
and
detone the PDF and CDF of a standard normal distribution, we write
, and so we obtain the proportional hazard normal (PHN) distribution. Note that
, and so the normal distribution is a special case of the PHN distribution. Hence, it is more flexible than the normal model in terms of skewness and kurtosis, due to the inclusion of the shape parameter
. We have that the Kumaraswamy-G (KwG) distribution, proposed by [
4], is defined by the PDF
where
G is an arbitrary continuous CDF, and
. Also,
and
are additional shape parameters to that of
G. If
Z is a random variable with PDF (
4), we write
. It is worth stressing that when
, we obtain the
distribution, i.e.,
, and so this kind of distribution arises as a possible solution when we have asymmetric data.
The construction of multivariate distributions from copula theory can be carried out using the Sklar’s theorem. The way to obtain a multivariate distribution from the Sklar’s theorem is as follows: Let
be
p random variables with continuous CDFs
respectively. Then, according to Sklar’s theorem,
has a unique copula representation:
Moreover, it is well known that many dependence properties of a multivariate distribution depend only on the corresponding copula. Therefore, many dependence properties of a multivariate distribution can be obtained by studying the corresponding copula. By using Clayton copula, Ref [
5] proposed a bivariate extension of the power-normal distribution. Specifically, a bivariate random variable
has a bivariate power-normal distribution, denoted by
, if for
and
has the following joint CDF
where
is the parameter that controls the dependence in the Clayton copula. The Clayton copula is usually attributed to [
6], but it has also been studied by [
7]. For
, it is defined as
The first systematic study of this model was conducted by [
8], who interpreted the parameter
as a measure of the dependence between
u and
Therefore, the independence between the variables is obtained when
tends to zero. Obviously there are other forms of constructing multivariate distributions, and so new multitariate distributions have been proposed in the statistical literature. To mention a few, but not limited to, we refer the reader to the works by [
9,
10,
11,
12], among others.
It is worth emphasizing that the univariate PHF family of distributions has received significant attention over the last years in the statistical literature, mainly due to its flexibility in considering different models in its construction given by
f and
F; see expression (
1). On the other hand, multivariate extensions of this univariate family of distributions has been little explored. This paper fills this gap and provides a bivariate extension of this univariate distribution. The bivariate PHF distribution is quite simple and, hence, may be widely applied in analyzing bivariate real data in practice. Additionally, distributional properties of the bivariate PHF distribution are investigated in details. An important claim to introduce the bivariate PHF distribution relies on the fact that the practitioners will have new bivariate model to use in bivariate settings. Additionally, the formulae related with the new bivariate model are manageable and with the use of modern computer resources and its numerical capabilities, the bivariate PHF distribution may prove to be an useful addition to the arsenal of applied statisticians. We hope that the bivariate distribution introduced in this paper may serve as an alternative bivariate model to some well-known bivariate model available in the statistical literature. We also hope that the bivariate PHF distribution may work better (at least in terms of model fitting) than some bivariate distributions available in the literature in certain practical situations, although it cannot always be guaranteed. Here, the bivariate PHF family of distributions we introduce is obtained from a multivariate distribution which was constructed from Pareto copula (descried in the next section) coupled with PHF marginals. It is worth stressing that many types of copulas are available o construct multivariate distributions, namely: Clayton copula, Frank copula, Joe copula, and Gumbel copula, among others. Perhaps, the most common copula is the Clayton copula. In this paper, instead, we shall consider the Pareto copula to introduce the new bivariate model due to its simplicity and interesting properties (descried in the next section). In addition, the Pareto copula yields a simple form of the likelihood function, so it makes the estimation of the model parameters easy to deal with. In short, it is evident that few works about bivariate generalizations of the PHF distribution have been published in the statistical literature and, hence, a new, simple and tractable bivariate PHF extension of the univariate PHF distribution through Pareto copula is welcome.
The paper is organized as follows.
Section 2 presents the Pareto copula and discusses with details the structural properties derived from it. The bivariate proportional hazard distribution is proposed in
Section 3. Distributional properties of this bivariate family of distributions are also derived in this section. In
Section 4, we study the bivariate PHN distribution, where the univariate normal distribution is taken into account. Location-scale extension, as well as parameter estimation for the bivariate PHN distribution are also discussed in this section. An empirical appliction of the bivariate PHN distribution that considers real data is provided in
Section 5 for illustrative purposes. Finally,
Section 6 concludes the paper.
3. Bivariate Proportional Hazard Distribution
In what follows, the joint CDF of
,
, is constructed from the bivariate Pareto copula, with marginal distributions
and
, where
and
. Then, from the Sklar’s theorem, the joint CDF of
is given by
We have the following definition.
Definition 1. A bivariate random vector has a bivariate proportional hazard distribution, if its joint CDF is given bywhere , , and . Remark 1. We denote the bivariate proportional hazard distribution in Definition 1 by .
Remark 2. Let . Then, the joint PDF of has the form We have the following propositions.
Proposition 1. Let . We have that
- (i)
for
- (ii)
The CDF of , given , is - (iii)
The PDF of , given , is - (iv)
The joint survival function of is
Proposition 2. Let . We have that is stochastically decreasing in , and vice versa, for any value of , , and α.
Proof. Since Pareto copula is a concave function on for fixed , the result holds. □
The bivariate hazard rate of
and
was defined by [
14] and it is given by
Proposition 3. Let . We have that
- (i)
For fixed , is an increasing function of
- (ii)
For fixed , is an increasing function of
Proof. This result can be verified as follows. Let
and
. It can be shown that
where
and, hence, the result follows. □
Proposition 4. Let . We have that the Kendall’s tau correlation coefficient, Spearman’s rho and the medial correlation coefficient, are given by the expressions provided in (5). Proof. Kendall’s tau and Spearman’s rho correlation coefficients are invariant to increasing transformations, and so the resut follows. Note that is enough to take the transformation and which leads to the PDF Pareto copula. Since the medial coefficient is defined directly from the Pareto copula, the result is obtained directly. □
Remark 3. The upper and lower limits for are Proof. Since the Pareto copula has a lower tail dependence, the result holds. □
Proposition 5. The bivariate distribution has a lower tail dependence.
4. Bivariate Proportional Hazard Normal Distribution
When
, i.e., the standard normal CDF, we obtain the bivariate PHN distribution, denoted by
. The joint PDF of
is given by
where
,
,
and
. The joint CDF takes the form
The marginal distributions of
are
for
. The PDF of
, given
, is given by
and the CDF of
, given
, is
The joint survival function of
has the form
4.1. Location-Scale Extension
The family of BPHN distributions with location-scale parameters is defined as the joint distribution of
and
, where
,
and
for
The corresponding joint PDF is given by
where
is the location parameter,
is the scale parameter, and
We use the notation
to denote this location-scale extension.
Figure 1 and
Figure 2 display contour plots of the BPHN distribution for some parameter values.
4.2. Parameter Estimation
In what follows, we consider the issue of estimating the parameters of the BPHN distribution. Given the dependency structure induced by Pareto copula, we use the maximum likelihood (ML) method, and the two-stage method (see [
15]) to estimate the parameter vector
. Initially, as in [
5], we perform the following reparameterizations:
, and
, so that the interest is focused on the parameter vector
. Thus, for a sample of size
n from the
distribution, say
, the log-likelihood function is given by
where
The ML estimate of can be obtained from the direct maximization of the log-likelihood function with respect to the model parameters. Using the R programming language, specifically applying the optim function, the maximization of the log-likelihood function can be performed. Also, using the invariance property of the ML estimator, we can easily obtain the ML estimates and of and , respectively.
The score functions are
where
. The ML estimate
can also be obtained by solving simultaneously the nonlinear system of equations
Numerical approaches are required for solving the above nonlinear system of equations, which requires the specification of initial values. To obtain initial values for the model parameters to be used in the iterative process, we can use the two-stage estimation procedure proposed by [
15], since this procedure was defined for multivariate copula-based models. The first stage considers ML estimation from univariate marginals, while the second stage considers ML estimation of the dependence parameter with the other parameters held fixed from the first stage:
First stage. Considering that
for
. We have that the ML estimaties of
,
and
are obtained from the solutions of the nonlinear equations
where
The solution of the previous system of equations for each
j allows us to obtain the ML estimates
and
of
,
and
, respectively, for
.
Second stage. In this stage, we estimate the parameter
by replacing the unknown parameters with the ML estimates from the previous stage and then maximize the resulting log-likelihood function; that is, we need to maximize the function
where
so that we have the score function
The solution of the resulting equation
lead us to the estimate
of
.
We can also consider another alternative to obtain initial values for the model parameters by using the method proposed by [
16], where we initially normalize each variable to obtain a model that depends only on
,
and
; that is, we have the log-likelihood function
After doing
we obtain
where
Thus, the new profiled log-likelihood function is reached. After obtaining the estimates and of and , respectively, we then obtain an estimate for as . We can take as initial values for and the sample means, and for and the sample standard deviations, or we can replace , and in the original log-likelihood to obtain the initial values for , , and .
When
n is large, we have that
, where “
” means approximately distributed,
is the expected Fisher information matrix for
, and
is the Hessian matrix, whose elements are provided in the
Appendix A. Using the transformation method, we have that
, where
with
denoting its inverse, and
The elements of
are obtained in terms of numerical integration. On the other hand, the observed Fisher information matrix given by
can also be used for computing asymptotic standard errors for the ML estimates of the model parameters (see
Appendix A). After computing
, we obtain the observed Fisher information matrix under
-parametrization as
, where
. It is worth emphisizing that the observed information matrix can be computed numerically from standard maximization routines, which now provide the observed information matrix as part of their output. For example, one can use the R functions
optim or
nlm to compute the observed Fisher information matrix numerically.
4.3. Monte Carlo Simulation
In the following, we consider Monte Carlo simulation experiments to evaluate the performance of the ML estimation procedure discussed in the previous section to estimate the BPHN distribution parameters. It is worth emphasizing that is rather easy to generate random variates from the BPHN distribution. We consider the following steps to generate from distribution:
- Step 1.
Generate two independent random variates and , where means a uniform distribution on .
- Step 2.
Compute
where
is the standard normal quantile function.
- Step 3.
Define
and
. Let
. For each value of
, compute
Then, .
The Monte Carlo experiments were performed using the R programming language, and we consider 10,000 Monte Carlo replications. The performance of the ML procedure to estimate the BPHN distribution parameters was evaluated on the basis of the following quantities for each sample size: the empirical mean, and the empirical standard deviation, which are computed from 10,000 replications. The sample sizes we consider are
and 150. Without loss of generality, we consider
,
,
,
,
, and
,
,
,
,
,
and
. The emprical mean is listed in
Table 1, while the empirical standard deviation is listed in
Table 2. In general, from
Table 1, note that the ML estimates of
,
,
,
,
and
are close to their respective true values, while the parameter that controls the dependence in the Pareto copula,
, is overestimated, mainly as this parameter increases. As expected, the standard deviation of the ML estimates of all parameters decreases as the sample size increases (see
Table 2). In short, the Monte Carlo simulation experiments reveal that the ML method can be used effectively to estimate the BPHN parameters.
5. Real Data Application
Next, we consider an application of the BPHN distribution to real data for illustrative purposes. The data set contains two different measures of stiffness of each of the 30 boards. The considered stiffness measures are the Shock (
) and Vibration (
) of each of the 30 boards. The data were reported by [
17]. The first measurement involves sending a shock wave down the board, and the second measurement is determined while vibrating the board. All numerical computations provided in this were done by using the
R program.
We assume that
for
. To estimate the BPHN model parameters, we initially estimate the marginal PDFs corresponding to each variable. We thus obtain the following estimates:
,
,
,
,
, and
.
Figure 3a,b display the QQ-plots for the estimated marginal distributions. Note the goodness-of-fit of each estimated marginal distribution to the real data, which means that the BPHN distribution may be a good choice for modeling these data. Now, using the above estimates in the copula function, we obtain an estimate for the parameter
, that is,
. Finally, taking the above estimates as initial values for the iterative process, we obtain the following ML estimates (standard errors in parentheses):
,
,
,
,
,
, and
. Visual inspection of the contour plot in
Figure 3c confirms a satisfactory fit of the BPHN distribution to the data.
Next, for the sake of comparison, we also consider the following bivariate distributions to fit the current real data: the bivariate skew-normal conditional (BVSNC) distribution introduced by [
9], denoted by BVSNC
, and the bivariate Birnbaum-Saunders (BVBS) distribution studied by [
10], denoted by BVBS
. The ML estimates (standard errors in parentheses) of the BVSNC parameters are
and
, whereas the ML estimates (standard errors in parentheses) of the BVBS parameters are
and
. To compare the bivariate distributions, we make use of the Akaike information criterion (AIC), and Bayesian information criterion (BIC). The smaller the values of AIC and BIC, the better the fitted distribution to the real data.
Table 3 lists the AIC and BIC values for all bivariate distributions, which leads to the conclusion that the BPHN distribution is better than the other bivariate distributions to model the current bivariate real data.