Next Article in Journal
On the Fractal Measures and Dimensions of Image Measures on a Class of Moran Sets
Next Article in Special Issue
Finding the Best Dueler
Previous Article in Journal
Dynamics and Global Bifurcations in Two Symmetrically Coupled Non-Invertible Maps
Previous Article in Special Issue
RKHS Representations for Augmented Quaternion Random Signals: Application to Detection Problems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quantile Regression with a New Exponentiated Odd Log-Logistic Weibull Distribution

by
Gabriela M. Rodrigues
1,†,
Edwin M. M. Ortega
1,*,†,
Gauss M. Cordeiro
2,† and
Roberto Vila
3,†
1
Department of Exact Sciences, University of São Paulo, Piracicaba 13418-900, Brazil
2
Department of Statistics, Federal University of Pernambuco, Recife 50670-901, Brazil
3
Department of Statistics, University of Brasilia, Brasilia 70910-900, Brazil
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2023, 11(6), 1518; https://doi.org/10.3390/math11061518
Submission received: 13 February 2023 / Revised: 15 March 2023 / Accepted: 17 March 2023 / Published: 21 March 2023
(This article belongs to the Special Issue Probability Theory and Stochastic Modeling with Applications)

Abstract

:
We define a new quantile regression model based on a reparameterized exponentiated odd log-logistic Weibull distribution, and obtain some of its structural properties. It includes as sub-models some known regression models that can be utilized in many areas. The maximum likelihood method is adopted to estimate the parameters, and several simulations are performed to study the finite sample properties of the maximum likelihood estimators. The applicability of the proposed regression model is well justified by means of a gastric carcinoma dataset.

1. Introduction

For survival data, the outcome variable is usually the time until the occurrence of an event of interest. A common characteristic of this type of data is the presence of censoring, that is, when the event of interest is not observed for some subjects before the study is finished. Furthermore, this variable depends on one or more explanatory variables (covariables), which have characteristics of the sample under study. Cox’s proportional hazards and accelerated failure time (AFT) models are two common tools in time-to-event modeling. The first class of models has the strong assumption of proportional risks, which is often invalid, so the effects of the covariables on the risk function are examined which can lead to difficult interpretations. The second class assumes that an association exists between the predictors and the survival time, permitting a direct interpretation of the effects of the covariables on lifetimes.
Nevertheless, these methods can fail to capture the heterogeneity of the effects of the covariables. In this respect, the quantile regression (QR) (Koenker and Bassett [1]) can be an alternative to these models, enabling evaluation of the heterogeneous effects of the predictors via analysis of different quantiles. This method involves modeling the quantiles of the survival time and links them to the covariables, providing some advantages, such as:
  • Possible identification and inference under the heterogeneous effects of the covariables for different quantiles, thus supplying more complete information about the covariables and more flexibly controlling for the heterogeneity caused by them;
  • Flexibility regarding the assumption of proportional risks;
  • Provision of a direct interpretation of the results, that is, between the survival time and the covariables of interest;
  • Possible analysis of different quantiles, allowing identification of the different effects of the covariables on individuals with different risks; and
  • Robustness with respect to outliers in the regression models.
Originally, the QR methods are based on minimizing weighted absolute residuals [1] without any probability distribution, and the estimation of the parameters occurs by means of linear programming algorithms.
Although this approach is very flexible, some challenges such as: (i) the quantile crossing, that is, when two or more estimated quantile curves cross or overlap, causing difficulty in interpretability; and (ii) the drawback of the inability to apply parametric inference tools led to the search for other methods. Regarding the quantile crossing problem, we can verify alternative methods, such as: semiparametric models [2], the support vector (SV) regression approach [3], and a joint quantile estimation approach [4,5]. The weighted absolute residuals estimators coincide with the maximum likelihood estimators (MLEs), when the response follows a skewed Laplace distribution, so the initial association of a continuous distribution to the QR models was based on it (Koenker and Machado [6]).
In the context of censored data, an extensive bibliography can be mentioned, for example: Peng and Huang [7] developed a QR approach for survival data subject to conditionally independent censoring, Wang and Wang [8] proposed a locally weighted censored QR approach following the redistribution-of-mass idea and employed a local reweighing scheme. Zarean et al. [9] used the censored QR for determining overall survival and risk factors in esophageal cancer. Yang [10] presented a new approach for censored QR estimation, and Du et al. [11] developed estimation procedures for partially linear QR models, where some of the responses were censored by another random variable. Further, Xue et al. [12] addressed these limitations by using both simulated examples and data from National Wilms Tumor clinical trials to illustrate proper interpretation of the censored QR model and the differences and advantages of the model compared to the Cox proportional hazard model. Hong et al. [13] provided a practical guide for using QR for right-censored outcome data with covariates of low or high dimensionality, and De Backer et al. [14] studied a novel approach for the estimation of quantiles when facing potential right-censoring of the responses. Recently, De Backer et al. [15] investigated a new procedure for estimating a linear QR with possibly right-censored responses; Qiu et al. [16] considered the QR model for survival data with missing censoring indicators; Yazdani et al. [17] introduced the QR approach for modelling failure time and investigated the covariate effects for different quantiles; Peng [18] provided a comprehensive review of statistical methods for performing QR with different types of survival data; Hsu et al. [19] studied regression models for interval censored data using quantile coefficient functions via a set of parametric basis functions; He et al. [20] provided a unified analysis of the smoothed sequential estimator and its penalized counterpart for increasing dimensions in censored QR; and Wei [21] introduced a discussion about QR for censored data in haematopoietic cell transplant research. Note that all these articles cited in the QR with censored data did not use parametric models or use the skewed Laplace distribution (see [17]), whose estimators coincide.
Subsequently, other distributions were proposed by re-parameterizing them in terms of the quantile function (qf). Recent papers involving models for non-censored data based on other distributions can be mentioned: log-extended exponential-geometric [22]; Birnbaum–Saunders [23,24]; discrete generalized half-normal [25]; transmuted unit-Rayleigh [26]; unit-Burr-XII [27]; unit-Chen [28]; log-symmetric [29]; arcsecant hyperbolic Weibull [30]; and Dagum and Singh–Maddala [31] distributions. However, there is a relative lack in the literature of models for censored data in the parametric context: generalized Gompertz [32] and skew-t [33].
It is well known that the hazard rate function can assume different forms, which has led to the proposal of a large number of new distributions with the purpose of obtaining greater flexibility of data modeling, for example, Ref. [34]. In this sense, we introduce a QR regression model based on a reparameterized, exponentiated, odd log-logistic Weibull (EOLLW) distribution. It has two extra shape parameters, thus enabling the modeling of different forms of hazard rate functions, as well as data with positive or negative symmetric or asymmetric bimodal shapes, making it an alternative to the mixture models commonly used in the presence of bimodality. Another important feature of the new QR model is that it has as special cases: the exponentiated Weibull and odd log-logistic Weibull QR models. A detailed discussion of the theoretical foundations is given in analysis of survival data with concrete applications. The maximum likelihood method is adopted, and several simulations evaluate the behavior of these estimators under some scenarios. Additionally, we show that the model can establish functional relations of the covariables with other parameters, including scale and kurtosis, besides the quantile parameter.
The paper is structured as follows. Section 2 introduces a reparametrization of the EOLLW distribution based on quantiles. Section 3 addresses some mathematical properties. The proposed QR regression model, and some classic inference methods to estimate the parameters are addressed in Section 4. Some simulations are reported in Section 5. Section 6 provides a real application for the new regression model. Section 7 ends with a brief conclusion.

2. The Reparameterized EOLLW Distribution

Let G ( x ; η ) be a parent cumulative distribution function (cdf) and g ( x ; η ) = d G ( x ; η ) / d x be its associated probability density function (pdf), both functions of a parameter vector η . The cdf of the exponentiated odd log-logistic (EOLL-G) family is given by (Alizadeh et al. [35]) (for x R )
F ( x ; ν , λ , η ) = G ( x ; η ) ν λ G ( x ; η ) ν + [ 1 G ( x ; η ) ] ν λ ,
where ν > 0 and λ > 0 are two extra shape parameters.
The pdf corresponding to Equation (1) has the form
f ( x ; ν , λ , η ) = ν λ g ( x ; η ) G ( x ; η ) ν λ 1 [ 1 G ( x ; η ) ] ν 1 G ( x ; η ) ν + [ 1 G ( x ; η ) ] ν λ + 1 .
Henceforth, let X EOLL - G ( ν , λ , η ) be a random variable with density function (2).
The EOLL-G family reduces to the OLL-G class when λ = 1 (Gleaton and Lynch [36]), and to the exponentiated (Exp-G) family (Mudholkar et al., [37]) when ν = 1 . Clearly, it becomes the parent G ( x ; η ) when ν = λ = 1 .
The EOLLW distribution is defined from (2) by taking the parent Weibull
G ( x ; γ , σ ) = 1 exp x γ σ and g ( x ; γ , σ ) = σ γ σ x σ 1 exp x γ σ , x > 0 ,
respectively, where η = ( γ , σ ) , γ > 0 is a scale parameter, and σ > 0 is a shape parameter.
The cdf of the random variable X EOLLW ( ν , λ , γ , σ ) follows from Equations (1) and (3)
F ( x ; ν , λ , γ , σ ) = 1 exp ( x γ ) σ ν λ 1 exp ( x γ ) σ ν + exp ( x γ ) σ ν λ , x > 0 .
Based on Equations (2) and (3), the pdf of X becomes
f ( x ; ν , λ , γ , σ ) = ν λ σ x σ 1 exp ( x γ ) σ ν 1 exp ( x γ ) σ ν λ 1 γ σ 1 exp ( x γ ) σ ν + exp ( x γ ) σ ν λ + 1 , x > 0 .
The hazard rate function corresponding to (5) is h ( x ; ν , λ , γ , σ ) = f ( x ; ν , λ , γ , σ ) / [ 1 F ( x ; ν , λ , γ , σ ) ] .
By inverting (1), the qf of X reduces to
x = Q ( q ) = Q W q 1 / ( ν λ ) q 1 / ( ν λ ) + ( 1 q 1 / λ ) 1 / ν , 0 < q < 1 ,
where Q W ( q ) = G 1 ( p ; γ , σ ) ( p ( 0 , 1 ) ) is the qf of the Weibull distribution, namely
G 1 ( p ; γ , σ ) = γ [ log ( 1 p ) ] 1 / σ .
Thus, we rewrite the τ th quantile (6) as
x = Q ( q ) = γ log ( 1 q 1 / λ ) 1 / ν q 1 / ( ν λ ) + ( 1 q 1 / λ ) 1 / ν 1 / σ .
We can easily obtain the quartiles: first quartile (Q(0.25)), median (Q(0.5)), and third quartile (Q(0.75)).
We define a reparametrization of the pdf (5) as a function of the τ th quantile (6), where the scale γ becomes
γ = μ log ( 1 τ 1 / λ ) 1 / ν τ 1 / ( ν λ ) + ( 1 τ 1 λ ) 1 / ν 1 / σ ,
μ > 0 is the location, and τ ( 0 , 1 ) th is the quantile of X (assumed known).
By substituting (9) into Equation (4), the reparameterized cdf of X reduces to
F ( x ; ν , λ , μ , σ ) = 1 exp w ( x μ ) σ ν λ 1 exp w ( x μ ) σ ν + exp w ( x μ ) σ ν λ ,
where w ( τ , λ , ν ) = log ( 1 τ 1 λ ) 1 / ν / [ τ 1 / ( ν λ ) + ( 1 τ 1 / λ ) 1 / ν ] .
By simple differentiation, the reparameterized pdf of X has the form
f ( x ; ν , λ , μ , σ ) = ν λ σ x σ 1 w exp w ( x μ ) σ ν 1 exp w ( x μ ) σ ν λ 1 μ σ 1 exp w ( x μ ) σ ν + exp w ( x μ ) σ ν λ + 1 .
Henceforth, we redefine X EOLLW ( ν , λ , μ , σ , τ ) as a random variable with pdf (11), where τ ( 0 , 1 ) is fixed. Figure 1 displays plots of the pdf of X for some τ values, thus showing its asymmetry and bimodality.
The qf of X is obtained by replacing (9) in Equation (8)
x = Q ( q ) = μ log ( 1 q 1 / λ ) 1 / ν q 1 / ( ν λ ) + ( 1 q 1 / λ ) 1 / ν / log ( 1 τ 1 / λ ) 1 / ν τ 1 / ( ν λ ) + ( 1 τ 1 / λ ) 1 / ν 1 / σ , 0 < q < 1 .

3. Structural Properties

Some known properties of the reparameterized EOLLW distribution are given below:
(A)
Equation (11) gives lim x f ( x ; ν , λ , γ , σ ) = 0 . Furthermore (Rodrigues et al. [38]),
lim x 0 + f ( x ; ν , λ , γ , σ ) = , ν λ < 1 / σ , w / μ σ , ν λ = 1 / σ , 0 , ν λ > 1 / σ ,
where w ( τ , λ , ν ) is defined in Section 2.
(B)
The point ( x , f ( x ) ) is called a critical point of f if x is in the domain of f, and either f ( x ) = 0 or f ( x ) do not exist. Since f in (11) is differentiable, f ( x ) always exists in its domain. Differentiating f ( x ) in (11) and setting it equal to zero, its critical points are the roots of (Rodrigues et al. [38])
T ( x ) [ T ( x ) ] 2 = ( ν + 1 ) T ν ( x ) + ( 1 ν λ ) T ( x ) [ 1 + T ν ( x ) ] ,
where T ( x ) = G ( x ) [ 1 G ( x ) ] 1 , T ( x ) = g ( x ) [ 1 G ( x ) ] 2 , T ( x ) = g ( x ) [ 1 G ( x ) ] 2 + 2 g 2 ( x ) [ 1 G ( x ) ] 3 , and g ( x ) and G ( x ) are given in (3). Every critical point where f reaches a maximum (resp., minimum) value is called the mode (resp., minimum point). Using the intermediate value theorem, it is simple to verify that, for ν λ > 1 / σ , Equation (13) has at least one root in ( 0 , ) (see Appendix A).
(C)
If σ = 1 and ν > 0 is an integer, the pdf of X is (Rodrigues et al. [38])
  • decreasing or decreasing–increasing–decreasing for ν λ < 1 ;
  • unimodal for ν λ 1 .
Note that Figure 1a ( ν = 2 , λ = 0.4 ) shows the unimodality of the OELLW pdf when ν λ < 1 and σ = 4 > 1 . We emphasize that the values of the parameters in Figure 1a do not satisfy the hypothesis of the result C(2), so it does not contradict this one.
(D)
For 0 < ν 1 , the pdf of X is (Rodrigues et al. [38])
  • decreasing or decreasing–increasing–decreasing for ν λ < σ 1 ;
  • decreasing or uni/bimodal or decreasing–increasing–decreasing for ν λ = σ 1 .
(E)
If X EOLLW ( ν , λ , γ , σ ) and 0 < ν 1 , the pdf of X is uni- or bimodal for ν λ > σ 1 (Rodrigues et al. [38]).
Note that Figure 1b ( ν = 0.4 , λ = 0.90 ) and Figure 1c ( ν = 0.4 , λ = 1.5 ) are consistent with this result, because bimodality is obtained and ν λ > σ 1 is satisfied for both cases.
(F)
If D has the Type I Dagum distribution (Dagum [39]), say D DAGUM ( ν , 1 , λ ) , the cdf of X EOLL - G ( ν , λ , γ , σ ) in (10) can be written as
F ( x ; ν , λ , μ , σ ) = P [ D T ( x ) ] = P G 1 D 1 + D ; η x .
Consequently,
X = G 1 D 1 + D
is a stochastic representation for X.
(G)
The cdf (10) satisfies the identity
F ( x ) = P G 1 1 1 + B x ,
where B = 1 / D , D DAGUM ( ν , 1 , λ ) and G ( x ) is as given in Item (B).
(H)
We write A = { ( ν , λ ) ( 0 , ) 2 : ( 1 + ν ) t 2 ν + ν 2 ( 1 + λ ) t ν + 1 + [ ν 2 ( 1 + λ ) + ν ( 1 λ ) + 2 ] t ν + ( 1 ν λ ) > 0 , t > 0 } (see Rodrigues et al. [38]).
  • If ( ν , λ ) A and σ = 1 , the hrf of X is increasing.
  • Let ν 2 ( 1 + λ ) + ν ( 1 λ ) + 2 0 , ν λ > 1 , and ν > 0 be an integer. For example, take the ν 1 integer and λ > ν 1 .
    (a)
    If there exists 0 < x * < ν / γ such that h ( x * ) = 0 , the hrf of X has a bathtub (BT) shape.
    (b)
    If there does not exist 0 < x * < ν / γ such that h ( x * ) = 0 , the hrf of X is increasing.
  • Let ν 2 + 3 ν 1 > 0 , ν λ = 1 , and ν > 0 be an integer. For example, take 0 < ν < ( 13 3 ) / 2 and λ > n u 1 . Under the conditions of Item (a) [Item (b)], the hrf of X has a BT (increasing) shape.
(I)
The EOLLW density transitions from heavy-tailed to light-tailed (Rodrigues et al. [38]).

3.1. Other Properties

3.1.1. Existence of Moments

The tail of the density of X follows from (14),
P ( X x ) = P [ D T ( x ) ] , D DAGUM ( ν , 1 , λ ) ,
where T ( x ) = exp [ ( x / γ ) σ ] 1 . Markov’s inequality gives
P [ D T ( x ) ] E ( D ) T ( x ) = λ Γ ( 1 1 ν ) Γ ( 1 + 1 ν ) T ( x ) , ν > 1 .
Hence,
P ( X x ) λ Γ ( 1 1 ν ) Γ ( 1 + 1 ν ) T ( x ) , ν > 1 .
Having an upper bound on the tail of the distribution, we proceed to bound the moments of X. This will prove its existence.
Based on the known formula E ( X p ) = p 0 x p 1 P ( X x ) d x (for X 0 and p > 0 ), the inequality holds:
E ( X p ) ( 15 ) p λ Γ 1 1 ν Γ 1 + 1 ν 0 x p 1 T ( x ) d x , p > 0 , ν > 1 , = p λ Γ 1 1 ν Γ 1 + 1 ν γ p σ 0 y ( p / σ ) 1 exp ( y ) 1 d y ,
where y = exp [ ( x / γ ) σ ] .
From the Riemann zeta function ζ ( s ) = n = 1 n s = [ Γ ( s ) ] 1 0 z s 1 [ exp ( z ) 1 ] 1 d z , Re ( s ) > 1 , where Γ ( s ) = 0 x s 1 e x d x is the gamma function, we obtain
E ( X p ) p λ γ p σ Γ 1 1 ν Γ 1 + 1 ν Γ p σ ζ p σ < , p > σ .
Thus, for ν > 1 and p > σ , the existence of the pth moment of X (for any p > 0 ) is guaranteed.

3.1.2. Gini’s Mean Difference

Given the random variables X 1 , , X n , the Gini mean difference (GMD) is defined as
GMD n = 1 n 2 1 i < j n E ( | X i X j | ) ,
provided the involved expectations exist. The GMD is a very useful measure of variability in the presence of non-normality.
(a)
If X 1 , , X n is a sequence of independent and identically distributed (iid) random variables, the classical GMD (La Haye and Zizler [40]) is GMD = E ( | X 1 X 2 | ) .
From Proposition 3 of Vila et al. [41], the GMD for a random sample X 1 , , X n of the EOLLW model is
GMD = 0 1 ( 2 u 1 ) F X 1 1 ( u ) d u ,
where
F X 1 1 ( u ) = μ ln ( 1 u 1 / λ ) 1 / ν u 1 / ( ν λ ) + ( 1 u 1 / λ ) 1 / ν / ln ( 1 τ 1 / λ ) 1 / ν τ 1 / ( ν λ ) + ( 1 τ 1 / λ ) 1 / ν 1 / σ .
Note that analytically, the GMD (17) for the OELLW model is difficult to obtain. Vila et al. [41] provided the following upper bound ( 2 / 3 ) Var ( X 1 ) for the GMD.
(b)
If X EOLLW ( ν i , λ i , γ i , σ i ) , and X 1 , , X n is a sample not necessarily independent nor identically distributed, the following inequality for the GMD (16) follows from Vila et al. [41]
GMD n 1 n 2 1 i < j n ( Var ( X i ) Var ( X j ) ρ i , j ) 2 + Var ( X j ) ( 1 ρ i , j 2 ) + | ς | ,
where ς = E ( X i ) E ( X j ) and ρ i , j = Corr ( X i , X j ) , for i , j = 1 , , n .
Under constraints ν i > 1 and σ i < 1 , the moments of X i EOLLW ( ν i , λ i , γ i , σ i ) ( i = 1 , , n ) always exist (see Section 3.1.1). Then, the mean E ( X i ) , variance Var ( X i ) , and correlations ρ i , j = Corr ( X i , X j ) (for i , j = 1 , , n ) also exist. Hence, for both cases (a) and (b), we can deduce non-trivial upper bounds (then its existence) of the GMD for the EOLLW model.

4. The EOLLW QR Model for Censored Data

A new regression model is defined from the reparametrized EOLLW density (11), and two systematic components for the parameters μ i and σ i (for i = 1 , , n )
μ i ( τ ) = exp v i β 1 ( τ ) and σ i ( τ ) = exp v i β 2 ( τ ) ,
where β 1 ( τ ) = ( β 10 , β 11 , , β 1 p ) and β 2 ( τ ) = ( β 20 , β 21 , , β 2 p ) are unknown parameter vectors, and v i = ( v i 1 , , v i p ) is the explanatory variable vector. Thus, the heteroscedasticity is modeled via σ .
The EOLLW QR model is defined by Equations (11) and (18), where ν and λ are unknown constants, and it has as special models:
  • Rhe exponentiated Weibull (EW) QR model for ν = 1 ;
  • the odd log-logistic Weibull (OLLW) QR model for λ = 1 ;
  • and the Weibull QR model for ν = λ = 1 .
Consider a sample ( x 1 , δ 1 , v 1 ) , , ( x n , δ n , v n ) of independent observations, where each random response is defined by x i = min { X i , C i } , δ i = I X i C i (censoring indicator), where I ( · ) denotes the indicator function. We consider non-informative censoring and the observed lifetimes and censoring times are independent given v i . Let F and C be the sets of individuals for which x i is the lifetime or censoring time, respectively. Conventional likelihood estimation techniques can be applied here. The log-likelihood function for the vector θ = ( β 1 ( τ ) , β 2 ( τ ) , ν , λ ) from model (18) has the form
( θ ) = i F i ( θ ) + i C i ( c ) ( θ ) ,
where i ( θ ) = log [ f ( x i ) ] , i ( c ) ( θ ) = log [ S ( x i ) ] , f ( x i ) is the density (11), S ( x i ) = 1 F ( x i ) is the survival function, and F ( x i ) is the cdf (10) of X i . The total log-likelihood function for θ can be expressed as
( θ ) = r log ( ν λ w ) + i F log [ σ i ( τ ) ] + i F [ σ i ( τ ) 1 ] log ( x i ) + ν i F log ( u i ) + ( ν λ 1 ) i F log ( 1 u i ) i F σ i ( τ ) log [ μ i ( τ ) ] ( λ + 1 ) i F log ( 1 u i ) ν + u i ν + i C log 1 ( 1 u i ) ν λ [ ( 1 u i ) ν + u i ν ] λ ,
where
w ( ν , λ , τ ) = log ( 1 τ 1 λ ) 1 ν τ 1 ν λ + ( 1 τ 1 λ ) 1 ν , u i = exp w ( ν , λ , τ ) x i μ i ( τ ) σ i ( τ ) ,
and r is the number of uncensored observations (failures).
The gamlss package in R [42] is used to find the maximum likelihood estimate θ ^ of θ . This package comes from the general class of generalized additive models for location, scale and shape (GAMLSS) (Rigby and Stasinopoulos [43]). These models allow all parameters of a distribution to be modeled as a function of covariates, such as non-parametric, parametric and/or additive smooth functions. Furthermore, they do not have the restriction that the response distribution belongs to a given family such as the exponential family. The package basically has two algorithms: CG (Cole and Green [44]) and RS (Rigby and Stasinopoulos [43]), whose acronyms come from the names of the authors. These algorithms are stable and do not require precise initial values to guarantee convergence. For this reason, we work with the RS algorithm with initial values for β 1 ( τ ) and β 2 ( τ ) obtained from the fitted Weibull QR model ( ν = λ = 2 ). Compared to the CG algorithm, RS is faster for larger datasets and does not use the expected value of cross derivatives, which can be useful when these values are equal to zero. For more details of the algorithms, see [43].
The codes for the reparametric EOLLW distribution in the GAMLSS framework are available at https://github.com/gabrielamrodrigues/EOLLW_quantiles (accessed on 10 February 2023). Following this approach, different regression models can be constructed by incorporating non-parametric smoothing functions, random effects, or other additive terms to the predictors.
Under conditions that are fulfilled for the parameter vector θ in the interior of the parameter space but not on the boundary, the asymptotic distribution of n ( θ ^ θ ) is multivariate normal N 2 p + 2 ( 0 , K ( θ ) 1 ) , where K ( θ ) is the information matrix. The asymptotic covariance matrix K ( θ ) 1 of θ ^ can be approximated by the inverse of the ( 2 p + 2 ) × ( 2 p + 2 ) observed information matrix L ¨ ( θ ) . The approximate multivariate normal distribution N 2 p + 2 ( 0 , L ¨ ( θ ) 1 ) for θ ^ can be used in the classical way to construct approximate confidence regions for some parameters in θ .
We can use the likelihood ratio (LR) statistic for comparing some sub-models with the EOLLW QR model. We consider the partition θ = ( θ 1 T , θ 2 T ) T , where θ 1 is the subset of parameters of interest and θ 2 is the subset of remaining parameters. The LR statistic for testing the null hypothesis H 0 : θ 1 = θ 1 ( 0 ) versus the alternative hypothesis H 1 : θ 1 θ 1 ( 0 ) is given by w * = 2 { ( θ ^ ) ( θ ˜ ) } , where θ ˜ and θ ^ are the estimates under the null and alternative hypotheses, respectively. The statistic w is asymptotically (as n ) distributed as χ k 2 , where k is the dimension of the subset of parameters θ 1 of interest.
The standard maximum likelihood techniques can be adopted for the proposed regression, such as the quantile residuals ( q r i ) (Dunn and Smyth [45]), namely
q r i = Φ 1 ( 1 u ^ i ) ν ^ λ ^ [ ( 1 u ^ i ) ν ^ + u ^ i ν ^ ] λ ^ ,
where
u ^ i = exp w ^ ( ν ^ , λ ^ , τ ) x i μ ^ i ( τ ) σ ^ i ( τ ) , w ^ ( ν ^ , λ ^ , τ ^ ) = log ( 1 τ ^ 1 λ ^ ) 1 ν ^ τ ^ 1 ν ^ λ ^ + ( 1 τ ^ 1 λ ^ ) 1 ν ^ ,
μ ^ i ( τ ) = exp v i β ^ 1 ( τ ) , σ ^ i ( τ ) = exp v i β ^ 2 ( τ ) ,
and Φ ( · ) 1 is the inverse cumulative standard normal distribution.

5. Simulation Study

A simulation study is carried out to verify the accuracy of the MLEs in the EOLLW QR model for the quartiles τ = 0.25 , 0.50 and 0.75 , and approximate censoring percentages 0%, 10% and 50%. Just one covariate v 1 Binomial ( 1 , 0.5 ) is included in the systematic components:
μ i = exp ( β 10 + β 11 v 1 i ) , σ i = exp ( β 20 + β 21 v 1 i ) , ν i = exp ( β 30 ) , and λ i = exp ( β 40 ) ,
For each combination, N = 1000 replicas of sizes n = 100 , 300 and 500 are generated. The true values used are: β 10 = 1.5 , β 11 = 1.32 , β 20 = 0.5 , β 21 = 0.2 , β 30 = 1.1 and β 40 = 1.4 .
The inverse transformation method is used to generate the lifetimes x 1 , , x n from the EOLLW ( μ i , σ i , ν , λ , τ ) distribution, and the censoring times c 1 , , c n are determined from a uniform distribution ( 0 , k ) , where k controls the censoring percentages. For each scenario, the Average Estimates (AEs), Biases and Mean Square Errors (MSEs) of the MLEs are calculated from:
AE ( θ ^ ) = 1 N i = 1 N θ i ^ , Bias ( θ ^ ) = 1 N i = 1 N ( θ i ^ θ i ) , MSE ( θ ^ ) = 1 N i = 1 N ( θ i ^ θ i ) 2 ,
where θ ^ = ( β ^ 10 , β ^ 11 , β ^ 20 , β ^ 21 , β ^ 30 , β ^ 40 ) . The software R is used and Algorithm 1 presents the simulation steps.
Algorithm 1: Simulation study
Mathematics 11 01518 i001
Table 1, Table 2 and Table 3 report the findings. For all scenarios, the AEs converge to the true parameter values, and the biases and MSEs decrease when n increases. These facts indicate that the consistency of the estimators hold. In addition, this behavior is verified even for high censoring percentages. We also found the empirical coverage probabilities (CPs) corresponding to the 95% confidence intervals calculated from the simulations. Table 4 reports CPs values which approach to the nominal level.

6. Application to Gastric Cancer Data

Gastric cancer is the 5th most common cancer worldwide. There are more than one million new cases of this cancer every year, and it ranked as the 2nd leading cause of mortality from cancer in the world. We consider a survival dataset of patients suffering from gastric adenocarcinoma treated by surgery at Helsinki University Hospital in Finland [46] (available at https://doi.org/10.5061/dryad.hb62394, accessed on 29 November 2022 [47]), which contains 301 individuals with approximate censoring of 60%. Here we consider two covariables. The first corresponds to the classification of Lauren (Figure 2a). Various pathological classifications of the disease exist, but that of Lauren is the most common. Originally developed in the 1960s, the classification system adopted cell structural components to separate the patients in three types: well differentiated (non-cardia/intestinal), poorly differentiated (cardia/diffuse), and mixed disease [48]. Based on histology, the two leading types of gastric cancer are diffuse and intestinal [49]. These two types are reflected in the dataset. The second covariable corresponds to the presence of distant metastasis (M1 disease) (Figure 2b). Many patients diagnosed with gastric cancer present distant metastasis, implying a very poor prognosis, generally indicating prophylactic rather than curative treatment ([50,51]). The objective here is to verify the effects of the covariables in different quantiles, so as to obtain a more complete view of this dataset. Table 5 gives a descriptive summary, which includes the mean times, median times and times for the first and third quartiles. We can observe differences for the Lauren classification covariate: between the quantiles, the average time, and the Lauren 1 and Lauren 2 levels. However, we note subtle differences for the presence of distant metastases covariate. Then, the variables considered are ( i = 1 , , 301 ):
  • x i survival time (in years);
  • cens i : censoring indicator (0 = censored, 1 = observed);
  • v 1 i : Lauren classification (1 = intestinal, 2 = diffuse), defined by a dummy variable (0 = intestinal, 1 = diffuse);
  • v 2 i : Presence of distant metastases (pm) (1 = yes, 0 = no)

Regression Model

We compare the EOLLW QR model with the nested OLLW, Exp-W and Weibull models under three systematic components:
M 0 = μ ( τ ) = exp [ β 10 ( τ ) ] σ i ( τ ) = exp [ β 20 ( τ ) ] ;
M 1 = μ ( τ ) = μ ( τ ) = exp [ β 10 ( τ ) + β 11 ( τ ) v 1 i + β 12 ( τ ) v 2 i ] σ i ( τ ) = exp [ β 20 ( τ ) ] ;
M 2 = μ ( τ ) = exp [ β 10 ( τ ) + β 11 ( τ ) v 1 i + β 12 ( τ ) v 2 i ] σ i ( τ ) = exp [ β 20 ( τ ) + β 21 ( τ ) v 1 i + β 22 ( τ ) v 2 i ] .
We consider the following quantiles: τ = 0.10 , 0.25 , 0.50 , 0.75 and 0.90 . Table 6 reports the Akaike information criterion (AIC) values for the fitted QR regression models. The EOLLW QR model under structure M 2 gives the lowest values for these quantiles.
Table 7 gives three likelihood ratio (LR) statistics (p-values in parentheses), thus indicating that the EOLLW QR model under structure M 2 is better than the others. Thus, we can consider this model as the predictive model.
Figure 3 displays the MLEs and the corresponding confidence intervals along with the interval [ 0.01 , 0.99 ] , and Table 8 gives the MLEs and their standard errors (SEs) for the quantiles τ = 0.10 , 0.25 , 0.50 , 0.75 and 0.90 at the significance level of 5%. The following facts can be mentioned:
  • The effect of the Lauren classification 2 in comparison with 1 is decreasing along the quantiles and its confidence interval shows significant estimates for all quantiles. These results corroborate with those point quantiles reported in Table 8.
  • The effect of the presence of distant metastasis is rising along the quantiles. Its confidence interval includes zero in the interval [ 0.25 , 0.75 ) , thus indicating that the covariable is not significant for these quantiles. These results can be noted by the non-significant p-values for τ = 0.25 and 0.50 .
  • For the parameters β 21 and β 22 , the estimates are significant for both quantiles, thus indicating that those covariables influence the variability of the survival times.
  • The estimates corresponding to the shape parameters β 30 and β 40 are also significant for all quantiles.

Residual Analysis

Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8 provide the normal probability plots of the q r i ’s in Equation (20) under structure M 2 for some quantiles. They reveal that the EOLLW QR model is the best among the fitted models. Further, they approximately follow a standard normal distribution, thus indicating adequate fits. Figure 9 shows the index plot of the q r i ’s for the EOLLW QR model under structure M 2 . There are few points outside the interval [ 3 , 3 ] for both quantiles, and a random pattern around zero which show that these models are very adequate to the current data.

7. Concluding Remarks

We introduced a new quantile regression model for censored data based on the reparametrization of the exponentiated log-logistic odd Weibull (EOLLW) distribution in terms of quantiles with two systematic components. We presented some mathematical properties of the reparametrized EOLLW distribution. The proposed quantile regression model is an important extension of other regression models and can be a valuable addition to the survival analysis area. The new regression model also serves as a good alternative for the analysis of lifetime data and may be more flexible than the exponentiated Weibull, odd log-logistic Weibull and Weibull models. Several simulations were performed for different parameter settings, sample sizes and censoring percentages, to assess the accuracy of the maximum likelihood estimators. The usefulness of the new model was also proved by means of a gastric cancer dataset.

Author Contributions

Conceptualization, G.M.R., E.M.M.O., G.M.C., R.V.; methodology, G.M.R., E.M.M.O., G.M.C., R.V.; software, G.M.R., E.M.M.O., G.M.C., R.V.; validation, G.M.R., E.M.M.O., G.M.C., R.V.; formal analysis, G.M.R., E.M.M.O., G.M.C., R.V.; investigation, G.M.R., E.M.M.O., G.M.C., R.V.; data curation, G.M.R., E.M.M.O., G.M.C., R.V.; writing—original draft preparation, G.M.R., E.M.M.O., G.M.C., R.V.; writing—review and editing, G.M.R., E.M.M.O., G.M.C., R.V.; visualization, G.M.R., E.M.M.O., G.M.C., R.V.; supervision, G.M.R., E.M.M.O., G.M.C., R.V. All authors have read and agreed to the current version of the manuscript.

Funding

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior–Brasil (CAPES) (Finance Code 001).

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset was obtained from https://doi.org/10.5061/dryad.hb62394. We also made the data available at https://github.com/gabrielamrodrigues/EOLLW_quantiles.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Here, for the EOLLW model, we verify (11), Equation (13) has at least one zero under the restriction ν λ > 1 / σ .
Indeed, if G ( x ; γ , σ ) and g ( x ; γ , σ ) are as in (3), Equation (13) can be written as
L ( z ) 2 1 σ 1 exp ( z ) z ν [ exp ( z ) 1 ] ν λ [ exp ( z ) 1 ] ν + 1 1 = 0 , where z = x γ σ .
L’Hospital’s rule gives
lim z 0 + L ( z ) = 2 1 σ ν ( λ ) 1 = 1 + ν λ 1 σ > 0
and
lim z L ( z ) = ( ν + 1 ) < 0 ,
since ν λ > 1 / σ . Further, L is continuous in ( 0 , ) , and by the intermediate value theorem, there is a c ( 0 , ) such that L ( c ) = 0 . In other words, Equation (13) has at least one zero if ν λ > 1 / σ .

References

  1. Koenker, R.; Bassett, G., Jr. Regression quantiles. Econometrica 1978, 46, 33–50. [Google Scholar] [CrossRef]
  2. Gijbels, I.; Karim, R.; Verhasselt, A. Semiparametric quantile regression using family of quantile-based asymmetric densities. Comput. Stat. Data Anal. 2021, 157, 107–129. [Google Scholar] [CrossRef]
  3. Takeuchi, I.; Furuhashi, T. Non-crossing quantile regressions by SVM. Int. Jt. Conf. Neural Netw. 2004, 1, 401–406. [Google Scholar]
  4. Das, K.; Krzywinski, M.; Altman, N. Quantile regression. Nat. Methods 2019, 16, 451–452. [Google Scholar] [CrossRef] [PubMed]
  5. Liu, Y.; Wu, Y. Simultaneous multiple non-crossing quantile regression estimation using kernel constraints. J. Non Parametr. Stat. 2011, 23, 415–437. [Google Scholar] [CrossRef]
  6. Koenker, R.; Machado, J.A.F. Goodness of fit and related inference processes for quantile regression. J. Am. Stat. Assoc. 1999, 94, 1296–1310. [Google Scholar] [CrossRef]
  7. Peng, L.; Huang, Y. Survival analysis with quantile regression models. J. Am. Stat. Assoc. 2008, 103, 637–649. [Google Scholar] [CrossRef]
  8. Wang, H.J.; Wang, L. Locally weighted censored quantile regression. J. Am. Stat. Assoc. 2009, 104, 1117–1128. [Google Scholar] [CrossRef] [Green Version]
  9. Zarean, E.; Mahmoudi, M.; Azimi, T.; Amini, P. Determining overall survival and risk factors in esophageal cancer using censored quantile regression. Asian Pac. J. Cancer Prev. 2018, 19, 3081–3086. [Google Scholar] [CrossRef] [Green Version]
  10. Yang, X.; Narisetty, N.N.; He, X. A new approach to censored quantile regression estimation. J. Comput. Graph. Stat. 2018, 27, 417–425. [Google Scholar] [CrossRef]
  11. Du, J.; Zhang, Z.; Xu, D. Estimation for the censored partially linear quantile regression models. Commun. Stat. Simul. Comput. 2018, 47, 2393–2408. [Google Scholar] [CrossRef]
  12. Xue, X.; Xie, X.; Strickler, H.D. A censored quantile regression approach for the analysis of time to event data. Stat. Methods Med. Res. 2018, 27, 955–965. [Google Scholar] [CrossRef] [PubMed]
  13. Hong, H.G.; Christiani, D.C.; Li, Y. Quantile regression for survival data in modern cancer research: Expanding statistical tools for precision medicine. Precis. Clin. Med. 2019, 2, 90–99. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. De Backer, M.; Ghouch, A.E.; Van Keilegom, I. An adapted loss function for censored quantile regression. J. Am. Stat. Assoc. 2019, 114, 1126–1137. [Google Scholar] [CrossRef]
  15. De Backer, M.; El Ghouch, A.; Van Keilegom, I. Linear censored quantile regression: A novel minimum-distance approach. Scand. J. Stat. 2020, 47, 1275–1306. [Google Scholar] [CrossRef]
  16. Qiu, Z.; Ma, H.; Chen, J.; Dinse, G.E. Quantile regression models for survival data with missing censoring indicators. Stat. Methods Med. Res. 2021, 30, 1320–1331. [Google Scholar] [CrossRef]
  17. Yazdani, A.; Yaseri, M.; Haghighat, S.; Kaviani, A.; Zeraati, H. The comparison of censored quantile regression methods in prognosis factors of breast cancer survival. Sci. Rep. 2021, 11, 18268. [Google Scholar] [CrossRef]
  18. Peng, L. Quantile regression for survival data. Annu. Rev. Stat. Appl. 2021, 8, 413–437. [Google Scholar] [CrossRef]
  19. Hsu, C.Y.; Wen, C.C.; Chen, Y.H. Quantile function regression analysis for interval censored data, with application to salary survey data. Jpn. J. Stat. Data Sci. 2021, 4, 999–1018. [Google Scholar] [CrossRef]
  20. He, X.; Pan, X.; Tan, K.M.; Zhou, W.X. Scalable estimation and inference for censored quantile regression process. Ann. Stat. 2022, 50, 2899–2924. [Google Scholar] [CrossRef]
  21. Wei, B. Quantile regression for censored data in haematopoietic cell transplant research. Bone Marrow Transplant. 2022, 57, 853–856. [Google Scholar] [CrossRef] [PubMed]
  22. Jodrá, P.; Jiménez-Gamero, M.D. A quantile regression model for bounded responses based on the exponential-geometric distribution. Revstat Stat. J. 2020, 18, 415–436. [Google Scholar]
  23. Sánchez, L.; Leiva, V.; Galea, M.; Saulo, H. Birnbaum-Saunders quantile regression and its diagnostics with application to economic data. Appl. Stoch. Model. Bus. Ind. 2020, 37, 53–73. [Google Scholar] [CrossRef]
  24. Sánchez, L.; Leiva, V.; Galea, M.; Saulo, H. Birnbaum-Saunders quantile regression models with application to spatial data. Mathematics 2020, 8, 1000. [Google Scholar] [CrossRef]
  25. Gallardo, D.I.; Gómez-Déniz, E.; Gómez, H.W. Discrete generalized half-normal distribution with applications in quantile regression. SORT 2020, 44, 265–284. [Google Scholar]
  26. Korkmaz, M.C.; Chesneau, C.; Korkmaz, Z.S. Transmuted unit Rayleigh quantile regression model: Alternative to beta and Kumaraswamy quantile regression models. Sci. Bull. 2021, 83, 149–159. [Google Scholar]
  27. Korkmaz, M.M.; Chesneau, C. On the unit Burr-XII distribution with the quantile regression modeling and applications. Comput. Appl. Math. 2021, 40, 2–26. [Google Scholar] [CrossRef]
  28. Korkmaz, M.Ç.; Altun, E.; Chesneau, C.; Yousof, H.M. On the unit-Chen distribution with associated quantile regression and applications. Math. Slovaca 2022, 72, 765–786. [Google Scholar] [CrossRef]
  29. Saulo, H.; Dasilva, A.; Leiva, V.; de la Fuente-Mella, L.S. Log-symmetric quantile regression models. Stat. Neerl. 2022, 76, 124–163. [Google Scholar] [CrossRef]
  30. Korkmaz, M.C.; Chesneau, C.; Korkmaz, Z.Z.S. A new alternative quantile regression model for the bounded response with educational measurements applications of OECD countries. J. Appl. Stat. 2023, 50, 131–154. [Google Scholar] [CrossRef]
  31. Saulo, H.; Vila, R.; Borges, G.V.; Bourguignon, M.; Leiva, V.; Marchant, C. Modeling Income Data via New Parametric Quantile Regressions: Formulation, Computational Statistics, and pplication. Mathematics 2023, 11, 448. [Google Scholar] [CrossRef]
  32. Rodrigues, A.; Borges, P.; Santos, B. A Defective Cure Rate Quantile Regression Model for Male Breast Cancer Data. 2021. Preprint. Available online: https://arxiv.org/abs/2105.03699 (accessed on 19 December 2022).
  33. Morales, C.E.G.; Lachos, V.H.; Bourguignon, M. A skew-t quantile regression for censored and missing data. Stat 2021, 10, e379. [Google Scholar]
  34. Ozel, G.; Alizadeh, M.; Cakmakyapan, S.; Hamedani, G.G.; Ortega, E.M.; Cancho, V.G. The odd log-logistic Lindley Poisson model for lifetime data. Commun. -Stat.-Simul. Comput. 2017, 46, 6513–6537. [Google Scholar] [CrossRef]
  35. Alizadeh, M.; Tahmasebi, S.; Haghbin, H. The exponentiated odd log-logistic family of distributions: Properties and applications. J. Stat. Model. Theory Appl. 2020, 1, 29–52. [Google Scholar]
  36. Gleaton, J.U.; Lynch, J.D. Properties of generalized log-logistic families of lifetime distributions. J. Probab. Stat. Sci. 2006, 4, 51–64. [Google Scholar]
  37. Mudholkar, G.S.; Srivastava, D.K.; Kollia, G. A generalization of the Weibull distribution with application to the analysis of survival data. J. Am. Stat. Assoc. 1996, 91, 1575–1583. [Google Scholar] [CrossRef]
  38. Rodrigues, G.M.; Vila, R.; Ortega, E.M.M.; Cordeiro, G.M.; Serra, V. New Results and Regression Model for the Exponentiated Odd Log-Logistic Weibull Family of Distributions with Applications. 2022. Preprint. Available online: https://arxiv.org/abs/2203.14189 (accessed on 7 February 2023).
  39. Dagum, C. A model of income distribution and the conditions of existence of moments of finite order. Bull. Int. Stat. Inst. 1975, 46, 199–205. [Google Scholar]
  40. La Haye, R.; Zizler, P. The Gini mean difference and variance. Metron 2019, 77, 43–52. [Google Scholar] [CrossRef]
  41. Vila, R.; Balakrishnan, N.; Saulo, H. An Upper Bound and a Characterization for GINI’s Mean Difference Based on Correlated Random Variables. 2023. Preprint. Available online: https://arxiv.org/abs/2301.07229 (accessed on 7 February 2023).
  42. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022. [Google Scholar]
  43. Rigby, R.A.; Stasinopoulos, D.M. Generalized additive models for location, scale and shape. J. R. Stat. Soc. Ser. (Appl. Stat.) 2005, 54, 507–554. [Google Scholar] [CrossRef] [Green Version]
  44. Cole, T.J.; Green, P.J. Smoothing reference centile curves: The LMS method and penalized likelihood. Stat. Med. 1992, 11, 1305–1319. [Google Scholar] [CrossRef]
  45. Dunn, P.; Smyth, G. Randomized quantile residuals. J. Comput. Graph. Stat. 1996, 5, 236–244. [Google Scholar]
  46. Kasurinen, A.; Tervahartiala, T.; Laitinen, A.; Kokkola, A.; Sorsa, T.; Böckelman, C.; Haglund, C. High serum MMP-14 predicts worse survival in gastric cancer. PLoS ONE 2018, 13, e0208800. [Google Scholar] [CrossRef] [PubMed]
  47. Kasurinen, A.; Tervahartiala, T.; Laitinen, A.; Kokkola, A.; Sorsa, T.; Böckelman, C.; Haglund, C. Data from: High Serum MMP-14 Predicts Worse Survival in Gastric Cancer. Available online: https://doi.org/10.5061/dryad.hb62394 (accessed on 29 November 2022).
  48. Chen, Y.C.; Fang, W.L.; Wang, R.F.; Liu, C.A.; Yang, M.H.; Lo, S.S.; Wu, C.W.; Li, A.F.; Shyr, Y.M.; Huang, K.H. Clinico pathological variation of Lauren classification in gastric cancer. Pathol. Oncol. Res. 2016, 22, 197–202. [Google Scholar] [CrossRef]
  49. Lauren, P. The two histologic main types of gastric carcinoma: Diffuse and so-called intestinal type carcinoma. An attempt at a histoclinicalclassification. Acta Parhol. Microbid. Scan. 1965, 64, 31–49. [Google Scholar] [CrossRef] [PubMed]
  50. Dixon, M.; Mahar, A.L.; Helyer, L.K.; Vasilevska-Ristovska, J.; Law, C.; Coburn, N.G. Prognostic factors in metastatic gastric cancer: Results of a population-based, retrospective cohort study in Ontario. Gastric Cancer 2016, 19, 150–159. [Google Scholar] [CrossRef] [Green Version]
  51. Kwee, R.M.; Kwee, T.C. Modern imaging techniques for preoperative detection of distant metastases in gastric cancer. World J. Gastroenterol. WJG 2015, 21, 10502–10509. [Google Scholar] [CrossRef]
Figure 1. Plots of the pdf of X for some τ values: (a) ν = 2, λ = 0.4, μ = 5, σ = 4, (b) ν = 0.4, λ = 0.90, μ = 1.2, σ = 3, (c) ν = 0.4, λ = 1.5, μ = 1.2, σ = 2.9.
Figure 1. Plots of the pdf of X for some τ values: (a) ν = 2, λ = 0.4, μ = 5, σ = 4, (b) ν = 0.4, λ = 0.90, μ = 1.2, σ = 3, (c) ν = 0.4, λ = 1.5, μ = 1.2, σ = 2.9.
Mathematics 11 01518 g001
Figure 2. Kaplan-Meier survival curves for gastric cancer data: (a) Lauren classification; (b) Presence of distant metastases.
Figure 2. Kaplan-Meier survival curves for gastric cancer data: (a) Lauren classification; (b) Presence of distant metastases.
Mathematics 11 01518 g002
Figure 3. Point estimates and 95% confidence intervals for the parameters versus τ from the fitted ELLOW QR model under structure M 2 for the gastric cancer data.
Figure 3. Point estimates and 95% confidence intervals for the parameters versus τ from the fitted ELLOW QR model under structure M 2 for the gastric cancer data.
Mathematics 11 01518 g003
Figure 4. QQ plots for q r i ’s from some fitted regression models under structure M 2 and τ = 0.10 .
Figure 4. QQ plots for q r i ’s from some fitted regression models under structure M 2 and τ = 0.10 .
Mathematics 11 01518 g004
Figure 5. QQ plots for the q r i ’s from some fitted regression models under structure M 2 and τ = 0.25 .
Figure 5. QQ plots for the q r i ’s from some fitted regression models under structure M 2 and τ = 0.25 .
Mathematics 11 01518 g005
Figure 6. QQ plots for the q r i ’s from some fitted regression models under structure M 2 and τ = 0.50 .
Figure 6. QQ plots for the q r i ’s from some fitted regression models under structure M 2 and τ = 0.50 .
Mathematics 11 01518 g006
Figure 7. QQ plots for the q r i ’s from some regression models under structure M 2 and τ = 0.75 .
Figure 7. QQ plots for the q r i ’s from some regression models under structure M 2 and τ = 0.75 .
Mathematics 11 01518 g007
Figure 8. QQ plots for the q r i ’s from some regression models under structure M 2 and τ = 0.90 .
Figure 8. QQ plots for the q r i ’s from some regression models under structure M 2 and τ = 0.90 .
Mathematics 11 01518 g008
Figure 9. Index plots for the q r i ’s from some regression models under structure M 2 : (a) τ = 0.10 ; (b) τ = 0.25 ; (c) τ = 0.50 ; (d) τ = 0.75 ; (e) τ = 0.90 .
Figure 9. Index plots for the q r i ’s from some regression models under structure M 2 : (a) τ = 0.10 ; (b) τ = 0.25 ; (c) τ = 0.50 ; (d) τ = 0.75 ; (e) τ = 0.90 .
Mathematics 11 01518 g009
Table 1. Simulation results from the fitted EOLLW QR model for τ = 0.25 .
Table 1. Simulation results from the fitted EOLLW QR model for τ = 0.25 .
% θ True Value n = 100 n = 300 n = 500
AEsBiasesMSEsAEsBiasesMSEsAEsBiasesMSEs
0% β 10 1.501.50170.00170.00061.50100.00100.00021.50150.00150.0001
β 11 −1.32−1.3208−0.00080.0009−1.31950.00050.0003−1.3204−0.00040.0002
β 20 0.500.4713−0.02870.37440.4689−0.03110.08110.4661−0.03390.0647
β 21 0.200.20260.00260.02320.1983−0.00170.00720.1996−0.00040.0043
β 30 1.100.9322−0.16780.62861.14100.04100.15231.16060.06060.1208
β 40 1.402.17250.77252.25071.3820−0.01800.25941.3228−0.07720.1726
10% β 10 1.501.50190.00190.00061.50070.00070.00021.50120.00120.0001
β 11 −1.32−1.3213−0.00130.0010−1.31990.00010.0003−1.3205-0.00050.0002
β 20 0.500.51070.01070.35160.4687−0.03130.07750.4664−0.03360.0663
β 21 0.200.1998−0.00020.02510.1992−0.00080.00800.1955−0.00450.0046
β 30 1.100.8796−0.22040.62881.13190.03190.15171.16400.06400.1210
β 40 1.402.10300.70301.76951.42450.02450.31371.3198−0.08020.1713
50% β 10 1.501.50830.00830.00241.50150.00150.00061.50050.00050.0003
β 11 −1.32−1.3262−0.00620.0029−1.3208−0.00080.0007−1.3202−0.00020.0004
β 20 0.500.77970.27970.37050.50160.00160.17650.4869−0.01310.0922
β 21 0.200.1276−0.07240.07670.1830−0.01700.01970.1865−0.01350.0127
β 30 1.100.5845−0.51550.66811.0430−0.05700.33441.11650.01650.1563
β 40 1.402.17570.77571.63961.61310.21310.68151.43050.03050.2805
Table 2. Simulation results from the fitted EOLLW QR model for τ = 0.50 .
Table 2. Simulation results from the fitted EOLLW QR model for τ = 0.50 .
% θ True Value n = 100 n = 300 n = 500
AEsBiasesMSEsAEsBiasesMSEsAEsBiasesMSEs
0% β 10 1.501.50180.00180.00051.50070.00070.00021.50100.00100.0001
β 11 −1.32−1.3213−0.00130.0008−1.31970.00030.0003−1.3205−0.00050.0002
β 20 0.500.50260.00260.38900.4834−0.01660.06970.4664−0.03360.0700
β 21 0.200.20760.00760.02020.20420.00420.00660.20060.00060.0039
β 30 1.100.9111−0.18890.64341.14570.04570.13171.16940.06940.1291
β 40 1.402.14630.74632.25711.3320−0.06800.23321.3050−0.09500.1874
10% β 10 1.501.50160.00160.00071.50040.00040.00021.50050.00050.0001
β 11 −1.32−1.3203−0.00030.0010−1.31970.00030.0004−1.3200−0.00000.0002
β 20 0.500.57880.07880.29820.4814−0.01860.07870.4676−0.03240.0666
β 21 0.200.20370.00370.02360.20320.00320.00820.1994−0.00060.0046
β 30 1.100.8421−0.25790.56461.14180.04180.14731.16860.06860.1223
β 40 1.401.99850.59851.58041.3647−0.03530.25981.3109−0.08910.1954
50% β 10 1.501.50400.00400.00331.4995−0.00050.00071.4996−0.00040.0004
β 11 −1.32−1.3213−0.00130.0036−1.31850.00150.0008−1.31910.00090.0005
β 20 0.500.82840.32840.40590.50020.00020.27750.4885−0.01150.1185
β 21 0.200.1409−0.05910.07300.1852−0.01480.01950.1931−0.00690.0127
β 30 1.100.5795−0.52050.66601.0593−0.04070.45541.13950.03950.1775
β 40 1.402.04470.64471.48381.58400.18400.67431.3576−0.04240.2460
Table 3. Simulation results from the fitted EOLLW QR model for τ = 0.75 .
Table 3. Simulation results from the fitted EOLLW QR model for τ = 0.75 .
% θ True Value n = 100 n = 300 n = 500
AEsBiasesMSEsAEsBiasesMSEsAEsBiasesMSEs
0% β 10 1.501.4978−0.00220.00061.4974−0.00260.00021.4982−0.00180.0001
β 11 −1.32−1.3202−0.00020.0011−1.31890.00110.0004−1.3200−0.00000.0002
β 20 0.500.4999−0.00010.47780.50310.00310.06300.4692−0.03080.0629
β 21 0.200.20570.00570.01870.1994−0.00060.00640.20030.00030.0038
β 30 1.100.9645−0.13550.72901.14830.04830.11491.18510.08510.1136
β 40 1.402.05720.65722.06731.3201−0.07990.22561.2860−0.11400.1613
10% β 10 1.501.4968−0.00320.00081.4973−0.00270.00031.4981−0.00190.0002
β 11 −1.32−1.3207−0.00070.0011−1.31920.00080.0004−1.3200−0.00000.0003
β 20 0.500.57070.07070.30920.50120.00120.06660.4817−0.01830.0553
β 21 0.200.20450.00450.02330.20460.00460.00690.20040.00040.0043
β 30 1.100.9004−0.19960.53991.14520.04520.11791.17600.07600.0991
β 40 1.401.97650.57651.73091.3391−0.06090.23261.2787−0.12130.1641
50% β 10 1.501.4909−0.00910.00371.4949−0.00510.00101.4953−0.00470.0007
β 11 −1.32−1.31340.00660.0043−1.31770.00230.0011−1.31720.00280.0007
β 20 0.500.82830.32830.56210.52140.02140.22100.52340.02340.0790
β 21 0.200.1300−0.07000.07780.1920−0.00800.01840.1904−0.00960.0120
β 30 1.100.6175−0.48250.81541.0842−0.01580.34761.14190.04190.1129
β 40 1.402.06800.66801.61981.50140.10140.63401.3051−0.09490.1893
Table 4. CPs for the 95% nominal level from the fitted EOLLW QR regression model when τ = 0.25 , 0.50 and 0.75 and approximate censoring percentages 0%, 10% and 50%.
Table 4. CPs for the 95% nominal level from the fitted EOLLW QR regression model when τ = 0.25 , 0.50 and 0.75 and approximate censoring percentages 0%, 10% and 50%.
τ θ 0 % ( n ) 10 % ( n ) 50 % ( n )
(100)(300)(500)(100)(300)(500)(100)(300)(500)
0.25 β 10 0.9390.9460.9570.9480.9510.9470.9220.9490.953
β 11 0.9540.9490.9590.9460.9420.9660.9370.9510.946
β 20 0.9730.9740.9650.9720.9810.9690.9620.9750.981
β 21 0.9480.9550.9560.9460.9570.9540.9370.9610.954
β 30 0.9801.0001.0000.9780.9981.0000.9540.9810.993
β 40 0.9900.9990.9910.9930.9990.9930.9960.9981.000
0.50 β 10 0.9500.9470.9520.9400.9350.9510.9070.9390.968
β 11 0.9500.9390.9560.9470.9370.9500.9130.9330.996
β 20 0.9690.9720.9590.9550.9770.9620.9590.9750.973
β 21 0.9590.9670.9640.9530.9480.9570.9450.9560.998
β 30 0.9860.9991.0000.9700.9991.0000.9410.9760.995
β 40 0.9860.9990.9870.9950.9990.9950.9950.9960.989
0.75 β 10 0.9500.9560.9630.9410.9590.9580.8950.9550.938
β 11 0.9460.9560.9590.9540.9580.9630.9140.9640.949
β 20 0.9520.9680.9660.9520.9740.9690.9390.9720.991
β 21 0.9680.9690.9650.9620.9680.9680.9360.9700.959
β 30 0.9800.9991.0000.9791.0001.0000.9520.9870.996
β 40 0.9800.9990.9890.9910.9980.9960.9860.9930.999
Table 5. Descriptive analysis of gastric cancer data.
Table 5. Descriptive analysis of gastric cancer data.
0.250.500.75Mean
Lauren 14.338.5710.187.73
Lauren 28.3211.0813.3010.39
Pm 07.159.6611.909.24
Pm 110.0610.0610.068.48
Table 6. AIC values for some fitted QR models to gastric cancer data.
Table 6. AIC values for some fitted QR models to gastric cancer data.
Model τ
0.100.250.500.750.90
M 0 EOLLW755.3104760.6965755.1712755.1715755.1759
OLLW773.1060773.1021773.1010773.1017773.1011
Exp-W759.9144759.0376758.4132757.2295757.9724
Weibull813.5743813.5743813.5744813.5744813.5746
M 1 EOLLW755.2259755.1844755.1742755.1753755.1778
OLLW774.2566774.2433774.2239774.2288774.2294
Exp-W762.3150761.8464761.2808760.9295938.5430
Weibull811.0754811.0752811.0754811.0755811.0756
M 2 EOLLW750.3085750.1898750.1712750.1812750.1930
OLLW769.1151769.1514769.2313769.2891769.3191
Exp-W755.8550755.7825755.7881755.8164768.3938
Weibull797.4298797.4352797.4666797.5064797.5421
Table 7. LR statistics for the ELLOW QR model under structure M 2 and some τ values for the gastric cancer data.
Table 7. LR statistics for the ELLOW QR model under structure M 2 and some τ values for the gastric cancer data.
τ
ModelsHypotheses0.100.250.500.750.90
EOLLW vs. OLLW H 0 : λ = 1 vs. H 1 : H 0 is false 20.80(<0.001)20.95(<0.001)21.06(<0.001)21.10(<0.001)21.12(<0.001)
EOLLW vs. Exp-W H 0 : ν = 1 vs. H 1 : H 0 is false 7.54(0.006)7.58(0.005)7.61(0.005)7.63(0.005)20.20(<0.001)
EOLLW vs. Weibull H 0 : λ = ν = 1 vs. H 1 : H 0 is false 51.12(<0.001)51.24(<0.001)51.29(<0.001)51.28(<0.001)51.34(<0.001)
Table 8. Estimation findings from the ELLOW QR model under structure M 2 and τ = 0.10 , 0.25 , 0.50 , 0.75 and 0.90 for the current data.
Table 8. Estimation findings from the ELLOW QR model under structure M 2 and τ = 0.10 , 0.25 , 0.50 , 0.75 and 0.90 for the current data.
τ θ MLEsSEsp-Values
0.10 β 10 0.3720.050<0.01
β 11 0.6910.066<0.01
β 12 −1.1000.206<0.01
β 20 −0.1650.022<0.01
β 21 0.2690.036<0.01
β 22 −0.6520.113<0.01
β 30 2.0920.016<0.01
β 40 −2.0430.017<0.01
0.25 β 10 1.3220.050<0.01
β 11 0.4680.066<0.01
β 12 −0.3500.2100.096
β 20 −0.2260.037<0.01
β 21 0.2710.057<0.01
β 22 −0.6700.176<0.01
β 30 2.1590.026<0.01
β 40 −2.0540.028<0.01
0.50 β 10 1.9900.050<0.01
β 11 0.3080.066<0.01
β 12 0.2110.2120.320
β 20 −0.2450.061<0.01
β 21 0.2690.078<0.01
β 22 −0.6790.131<0.01
β 30 2.1780.034<0.01
β 40 −2.0560.037<0.01
0.75 β 10 2.3620.050<0.01
β 11 0.2200.066<0.01
β 12 0.5330.2130.013
β 20 −0.2390.077<0.01
β 21 0.2660.088<0.01
β 22 −0.6820.105<0.01
β 30 2.1720.034<0.01
β 40 −2.0540.036<0.01
0.90 β 10 2.5650.050<0.010
β 11 0.1720.0660.010
β 12 0.7100.213<0.01
β 20 −0.2310.077<0.01
β 21 0.2640.087<0.01
β 22 −0.6840.094<0.01
β 30 2.1650.033<0.01
β 40 −2.0530.034<0.01
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Rodrigues, G.M.; Ortega, E.M.M.; Cordeiro, G.M.; Vila, R. Quantile Regression with a New Exponentiated Odd Log-Logistic Weibull Distribution. Mathematics 2023, 11, 1518. https://doi.org/10.3390/math11061518

AMA Style

Rodrigues GM, Ortega EMM, Cordeiro GM, Vila R. Quantile Regression with a New Exponentiated Odd Log-Logistic Weibull Distribution. Mathematics. 2023; 11(6):1518. https://doi.org/10.3390/math11061518

Chicago/Turabian Style

Rodrigues, Gabriela M., Edwin M. M. Ortega, Gauss M. Cordeiro, and Roberto Vila. 2023. "Quantile Regression with a New Exponentiated Odd Log-Logistic Weibull Distribution" Mathematics 11, no. 6: 1518. https://doi.org/10.3390/math11061518

APA Style

Rodrigues, G. M., Ortega, E. M. M., Cordeiro, G. M., & Vila, R. (2023). Quantile Regression with a New Exponentiated Odd Log-Logistic Weibull Distribution. Mathematics, 11(6), 1518. https://doi.org/10.3390/math11061518

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop