1. Introduction
In this paper, we consider a regression analysis of multivariate interval-censored failure time data where the censoring may be informative. Interval-censored data arise when the failure time of interest is known or observed only to belong to an interval instead of being observed exactly (Finkelstein, 1986 [1]; Sun, 2006 [2]). It is apparent that one can treat right-censored data as a special case of interval-censored data. Multivariate interval-censored data occur when a failure time study involves more than one related failure time of interest for which only interval-censored data are available. Among others, one often faces multivariate interval-censored data in epidemiological studies and clinical trials. Informative censoring occurs if the censoring mechanism or the underlying process generating observations is related to the failure times of interest (Kalbfleisch and Prentice, 2002 [3]; Sun, 2006 [2]).
An example of informative censoring is given by a clinical trial or periodic follow-up study on a failure event, such as death, for which some symptoms may occur before the event itself. In this situation, a study subject may tend to pay more clinical visits when the symptoms occur rather than following the pre-specified schedule. Many authors have pointed out that, with informative censoring, an analysis that ignores it could lead to seriously biased estimators or misleading analysis results (Wang et al., 2010 [4]; Sun, 2006 [2]; Zhang et al., 2005 [5], 2007 [6]). For example, Sun (1999) [7] studied the issue for univariate current status data, a special case of interval-censored data where the observed interval includes either zero or infinity, and showed that the analysis could yield misleading results if the informative censoring is ignored or treated as non-informative censoring.
A large amount of literature has been established for the regression analysis of multivariate interval-censored failure time data or their special cases, multivariate current status data and bivariate interval-censored data (Chen et al., 2007 [8], 2009 [9], 2013 [10]; Goggins and Finkelstein, 2000 [11]; Shen, 2015 [12]; Tong et al., 2008 [13]; Wang et al., 2008 [14]; Zeng et al., 2017 [15]; Zhang et al., 2009 [16]; Zhou et al., 2017 [17]). For this, three types of methods are commonly used: the copula model approach, the marginal model-based approach and the frailty model-based method. The first employs various copula models to characterize the relationship between the correlated failure times of interest (Wang et al., 2008 [14]; Zhang et al., 2009 [16]); among others, Sun and Ding (2019) [18] discussed this for bivariate cases under the framework of the two-parameter Archimedean copula model. The second mainly focuses on the marginal distribution and places no assumption on the correlation between the failure times of interest (Wei et al., 1989 [19]); authors who developed such methods include Chen et al. (2007) [8], Chen et al. (2013) [10] and Tong et al. (2008) [13].
The frailty model-based approach generally employs the frailty or latent variable to model the correlation between the correlated failure times. It has the advantage of allowing one to directly estimate the correlation. One main shortcoming of most of the existing methods for multivariate interval-censored data is that they assume independent or non-informative interval censoring, and it is apparent that this may not hold in practice as discussed above. In this paper, we will adopt the frailty model-based approach to develop a new estimation procedure that allows for dependent or informative interval censoring.
Several authors have considered the regression analysis of univariate informatively interval-censored failure time data. For example, Zhang et al. (2005) [5], Wang et al. (2010) [4] and Wang et al. (2018) [20] investigated the problem for current status data, case II interval-censored data and case K interval-censored data, respectively. Case II means that each study subject is observed only twice, while case K refers to the situation where each subject is observed at a sequence of observation times, which is much more general (Sun, 2006 [2]). As mentioned above, most of the existing methods for multivariate interval-censored data apply only to the situation with independent interval censoring, except that of Yu et al. (2022) [21], who considered only case II interval-censored data under the additive hazards model. In this paper, the focus will be on case K multivariate interval-censored data with informative censoring, and the proposed methods apply to much more general situations than those of Yu et al. (2022) [21].
More specifically, in Section 2, some notation and assumptions will first be introduced, as well as the data structures. In the proposed method, we focus on the case where the failure time of interest marginally follows a general class of semiparametric transformation models. The proposed approximate maximum likelihood estimation procedure will be presented in Section 3, and for its implementation, a novel EM algorithm will be developed. The asymptotic properties of the resulting estimators of the regression parameters will be given in Section 4. Section 5 will present some simulation results obtained from a study performed to evaluate the performance of the proposed method, and they indicate that it performs as expected. In Section 6, we apply the proposed methodology to a set of real data arising from an AIDS clinical trial, and Section 7 contains some discussion and concluding remarks.
2. Assumptions and Background
In this section, we first introduce some notation and background and then describe the model and data structure. Suppose that there is a failure time study consisting of n independent subjects and concerning M failure events of interest that may be related. Define the failure time of interest and the p-dimensional vector of covariates, both related to the ith subject and event m. Furthermore, for each subject, suppose that there exists a sequence of potential observation times, whose number may differ across subjects, and a follow-up or stopping time. For simplicity, we assume that for each subject, the observation times for the different failure events are the same; the proposed method below can be easily generalized to more general situations.
For subject i, define the point process describing the observation process on the subject, which jumps only at the observation times. Note that for the situation considered here, we have M + 1 processes: the M underlying failure time processes of interest and the observation process, and as mentioned above, the focus below will be on the case where they may be related (Ma et al., 2015 [22]; Wang et al., 2016 [23]; Zhang et al., 2007 [6]). To describe their relationships and the possible covariate effects on them, we assume that there exists a vector of latent variables and another latent variable with mean zero such that, given the covariates and the latent variables, the cumulative hazard function of each failure time has the form of model (1). Here, the transformation function is known and strictly increasing, the baseline cumulative hazard function is unknown, the covariate vector in the model contains 1 and part of the covariates, and the vector of regression parameters is unknown.
For the observation process, it will be assumed to be a non-homogeneous Poisson process whose intensity function, given the covariates and the latent variable, has the form of model (2). Here, the baseline intensity function is unknown and continuous, and the model involves a vector of regression parameters as in model (1). In the following, it will be assumed that, given the covariates and the latent variables, the failure times are independent of one another and independent of the observation process. Moreover, the follow-up time is assumed to be independent of the other quantities. We point out that models (1) and (2) have been commonly used in the analysis of failure time data (Klein and Moeschberger, 2003 [24]) and the analysis of event history data (Cook and Lawless, 2007 [25]), respectively. The parameter linking the two models denotes the degree of correlation between the failure time process and the observation process, and the two processes are independent if this parameter equals zero.
The semiparametric transformation model (1) is quite general and includes many specific models. In particular, one can express the transformation function as a class of frailty-induced transformations, obtained by integrating out a frailty variable with nonnegative support whose density appears in the integrand. By setting this density to be the gamma density with mean 1 and variance r, one obtains the class of logarithmic transformations G_r(x) = log(1 + r x)/r with G_0(x) = x (Chen et al., 2002 [26]). In particular, it yields the proportional odds model with r = 1 and gives the proportional hazards model with r = 0.
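The logarithmic transformation family above can be sketched numerically; the function name and parameterization below are ours, under the assumption that the family takes the Chen et al. (2002) form G_r(x) = log(1 + rx)/r with G_0(x) = x:

```python
import math

def G(x, r):
    """Logarithmic transformation G_r(x) = log(1 + r*x) / r for r > 0,
    with the limiting case G_0(x) = x (proportional hazards)."""
    if r == 0.0:
        return x  # limit as r -> 0
    return math.log(1.0 + r * x) / r
```

Here r = 1 gives G(x) = log(1 + x), the proportional odds model, and small r approaches the identity, the proportional hazards model.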
To describe the observed data, define the indicators of whether each failure time belongs to the intervals formed by successive observation times. In the following, it will be assumed that the observed data consist of these indicators along with the observation times and covariates; that is, we observe case K interval-censored data (Sun, 2006 [2]).
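To make the observed-data structure concrete, the shortest interval bracketing a failure time can be found from a subject's ordered observation times; this helper is purely illustrative and not part of the paper's notation:

```python
import bisect

def bracket(t, obs_times):
    """Return (L, R], the shortest observed interval bracketing the
    failure time t, given the ordered observation times obs_times.
    R = inf encodes a right-censored observation; L = 0 a left-censored one."""
    j = bisect.bisect_left(obs_times, t)   # first index with obs_times[j] >= t
    L = obs_times[j - 1] if j > 0 else 0.0
    R = obs_times[j] if j < len(obs_times) else float("inf")
    return (L, R)
```

For example, a failure at time 1.5 with visits at 1, 2 and 3 yields the interval (1, 2].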
3. Maximum Likelihood Estimation
3.1. Estimation Procedure
Now, we discuss inference about models (1) and (2), and for this, we will propose a two-step, or approximate, maximum likelihood estimation procedure by following Huang and Wang (2004) [27] and Wang et al. (2016) [23]. More specifically, we will first consider the estimation of model (2) and then the estimation of the regression parameters of interest. The first step is based on the following two facts. One is that one can easily show that, given the covariates and the latent variable, the number of observation times follows a Poisson distribution whose mean is determined by model (2). The other is that the observation times can be seen as the order statistics of a set of i.i.d. random variables whose density function depends on neither the covariates nor the latent variable, which suggests that this density can be estimated nonparametrically. In the resulting estimator, the relevant quantities are the ordered and distinct values of the observation times, the number of observation times equal to each distinct value, and the number of observation times satisfying the at-risk condition among all subjects.
Under the assumptions above, it is easy to derive an identity for the conditional mean of the number of observation times. This yields a class of estimating equations for the regression parameters in model (2), with some chosen weights. Let the estimator given by the estimating equations above be obtained; it then suggests a natural plug-in estimator of the baseline mean function of the observation process.
Now, consider the estimation of the regression parameters in model (1). For this, note that if the latent variables were known, it would be natural to maximize the likelihood function based on the observed indicators, in which one factor denotes the density function of the latent variables, assumed to be known up to a vector of parameters. Define, for each failure time, the shortest observed time interval that brackets it; the likelihood function above can then be rewritten in terms of these intervals. By following Huang and Wang (2004) [27] and others, it is natural to estimate the unknown parameters and functions by the values that maximize the approximated likelihood function.
For the maximization of the approximated likelihood, note that it involves the unknown baseline cumulative hazard functions and integrations over the latent variables. To deal with the former, we propose to adopt a nonparametric approach. More specifically, for each failure event, consider the ordered sequence of all observed interval endpoints and assume that the baseline cumulative hazard function is a step function that jumps only at these points, with unknown jump sizes; the approximated likelihood can then be expressed in terms of the jump sizes. In the following, we will develop an EM algorithm for the maximization, with the focus on the situation where the latent vector follows a multivariate normal distribution with a covariance matrix depending on a q-dimensional unknown parameter. The algorithm is valid for other distributions, and some comments on this will be given below. It is worth pointing out that, as mentioned above, the idea discussed here has been used by Huang and Wang (2004) [27] and Wang et al. (2016) [23], among others. However, the problem discussed here is different from, and much more general than, those in the existing literature.
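The step-function representation of a baseline cumulative hazard described above can be sketched as follows; the class name and interface are ours, assuming only that the function jumps at ordered points with nonnegative jump sizes:

```python
import bisect

class StepCumHaz:
    """Nonparametric step-function representation of a baseline
    cumulative hazard: jump sizes jumps[k] at ordered points times[k]."""
    def __init__(self, times, jumps):
        self.times = list(times)   # ordered jump locations
        self.cum = []              # running cumulative sums of the jumps
        s = 0.0
        for j in jumps:
            s += j
            self.cum.append(s)
    def __call__(self, t):
        # value at t is the sum of all jumps at points <= t
        k = bisect.bisect_right(self.times, t)
        return self.cum[k - 1] if k > 0 else 0.0
```

In the EM algorithm below, the jump sizes play the role of the unknown parameters of this representation.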
3.2. EM Algorithm
In this subsection, we will develop an EM algorithm for the maximization of the approximated likelihood, and for this, we first discuss the data augmentation. Let the latent vectors denote a random sample of size n from their assumed density; the observed likelihood function can then be rewritten accordingly. Moreover, introduce latent Poisson random variables, with appropriate means given the latent vectors, defined so that the maximization of (4) is equivalent to maximizing the likelihood function based on the augmented data. Based on this, for the development of the EM algorithm, it is natural to use these latent quantities to augment the observed data. As a consequence, one can derive the resulting pseudo complete-data log-likelihood function.
Now, we consider the E-step of the EM algorithm. At each iteration, given the current parameter estimates, we need to determine the required conditional expectations under the multivariate normal distribution with the current covariance matrix. To calculate the conditional expectations of the augmented quantities given the observed data, we need to employ the joint density of the latent variables given the observed data, which is proportional to the product of the complete-data likelihood contributions; the conditional expectations of the latent Poisson variables given the latent vectors and the observed data have explicit forms. In the M-step, we can employ the Newton-Raphson method to update the regression parameters based on the corresponding score equation. For the jump sizes of the baseline cumulative hazard functions, we have closed-form expressions, and to estimate the covariance parameter, one can maximize the relevant part of the expected complete-data log-likelihood with the jump sizes held at their updated values.
Now, we summarize the EM algorithm described above as follows.
Step 1. Choose initial estimates of the parameters and the jump sizes.
Step 2. At each iteration, calculate the required conditional expectations by using, for example, the Gaussian quadrature method.
Step 3. Update the jump sizes by (6) with the current estimates, then update the regression parameters by maximizing the corresponding objective function, and estimate the covariance parameter by employing the one-step Newton-Raphson method.
Step 4. Repeat Steps 2-3 until convergence, that is, until the absolute difference of the log-likelihood values between two consecutive iterations is less than a given small positive value.
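Steps 1-4 above can be sketched as a generic EM loop; the callables e_step, m_step and loglik are placeholders for the problem-specific computations of Section 3.2, not functions defined in the paper:

```python
def run_em(e_step, m_step, loglik, theta0, tol=1e-6, max_iter=500):
    """Generic EM loop: alternate E- and M-steps until the change in
    the observed-data log-likelihood falls below tol (Step 4 above)."""
    theta = theta0
    ll_old = loglik(theta)
    for _ in range(max_iter):
        expectations = e_step(theta)   # E-step: conditional expectations
        theta = m_step(expectations)   # M-step: parameter update
        ll_new = loglik(theta)
        if abs(ll_new - ll_old) < tol: # convergence criterion of Step 4
            break
        ll_old = ll_new
    return theta
```

In the paper's setting, e_step would evaluate the conditional expectations by Gaussian quadrature and m_step would perform the updates of Step 3.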
4. Asymptotic Properties
Let the quantities defined above denote the proposed estimator and the corresponding true parameter values. In this section, we will establish the asymptotic properties of the proposed estimator, and for this, we first describe the regularity conditions needed.
Condition 1. The true value belongs to a known compact set , where denotes a compact set of , a compact set in , and a compact set of in the domain of γ such that is a positive-definite matrix with eigenvalues bounded away from zero and ∞. In addition, the true value is continuously differentiable with positive derivatives in .
Condition 2. The covariate vector and are bounded in .
Condition 3. For the transformation function , assume that it is twice continuously differentiable on with , and .
Condition 4. Assume that for any smooth function and . Here, denotes the jth derivative of with respect to γ.
Condition 5. If there exists a vector u and some constants such that the stated equality holds for each of these values, then u and the constants are all zero, where the zero denotes a vector of zeros of the appropriate dimension.
Condition 6. Assume that for the follow-up time and latent variable u, the stated moment conditions hold, where the longest study time is finite, the variance of the latent variable is bounded, and there exists a small positive constant such that the relevant bound holds almost surely. Moreover, for the follow-up time and u, the function involved is continuous.
Note that Conditions 1 and 2 are standard in survival analysis, and it is easy to check that Condition 3 on the transformation function holds for the logarithmic family and the Box-Cox family. Moreover, Condition 4 holds for modeling multivariate data with frailty models, and Condition 5 is required for the identifiability of the model. In addition, Condition 6 describes the relationship between the latent variable u and the parameters of interest. Most of the conditions above are purely technical and hold in general, in particular for periodic follow-up studies.
In the following, the Euclidean norm is used, together with the usual empirical-process notation for a function f and a random variable X with distribution P. The following two theorems give the asymptotic properties of the proposed estimators.
Theorem 1. Suppose that Conditions 1-6 hold. Then, as the sample size tends to infinity, the proposed estimators are consistent almost surely.
Theorem 2. Suppose that Conditions 1-6 hold. Then, as the sample size tends to infinity, the proposed estimator of the regression parameters is asymptotically normal, with the covariance matrix given in Appendix A.
We will sketch the proofs of the results described above in Appendix A. For inference about the regression parameters, it is apparent that one needs to estimate the covariance matrix, and one can see from Appendix A that it would be difficult to derive a consistent plug-in estimator. Thus, we propose to employ the profile likelihood approach to estimate the covariance matrix (Murphy and van der Vaart, 2000 [28]). Specifically, consider the set of all step functions with nonnegative jumps at the observed time points and define the profile log-likelihood by maximizing over this set. Then, one can estimate the covariance matrix by the negative inverse of the matrix whose (j, k)th element is the second-order difference of the profile log-likelihood, computed with perturbations along the jth and kth canonical vectors and a step size of order n^{-1/2}. Note that to calculate the profile log-likelihood at each perturbed parameter value, one can reuse the proposed EM algorithm with the regression parameters held fixed, in which case the only step is to explicitly evaluate the quantities needed to update the jump sizes. The iteration generally converges quickly when the proposed estimator is used as the initial value.
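Assuming a routine pl for the profile log-likelihood is available (e.g., by running the EM algorithm with the regression parameters held fixed), the second-difference curvature estimator above can be sketched as follows; the function name and arguments are illustrative:

```python
import numpy as np

def profile_info(pl, theta_hat, n, c=1.0):
    """Estimate minus the Hessian of the profile log-likelihood pl at
    theta_hat by second-order finite differences with step h = c/sqrt(n).
    The covariance estimate is the inverse of the returned matrix."""
    p = len(theta_hat)
    h = c / np.sqrt(n)
    e = np.eye(p)
    info = np.empty((p, p))
    pl0 = pl(theta_hat)
    for j in range(p):
        for k in range(p):
            # (j,k)th second-order difference of the profile log-likelihood
            info[j, k] = -(pl(theta_hat + h * e[j] + h * e[k])
                           - pl(theta_hat + h * e[j])
                           - pl(theta_hat + h * e[k]) + pl0) / h**2
    return info
```

For a quadratic log-likelihood the finite differences are exact, which gives a simple sanity check of the implementation.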
5. A Simulation Study
In this section, we present some of the simulation results obtained from a study performed to evaluate the finite-sample performance of the proposed method, with the focus on the estimation of the regression parameters. In the study, we considered the situation with two correlated failure times of interest and two covariates. For the covariates, it was assumed that the first covariate follows the Bernoulli distribution with success probability 0.5 and the second covariate follows a uniform distribution. To generate the true failure times, we first set the relevant variance parameter to one and generated the latent variables from normal distributions, one with mean 0 and variance 1. Then, given the covariates and the latent variables, the failure times were generated under model (1) for the two choices of the transformation function.
For the generation of the observation process and the observed data, we first assumed that the follow-up times follow the uniform distribution over the interval [2, 3] and generated the numbers of observation times from the Poisson distribution given the follow-up times and latent variables. Given the numbers of observation times, we took the observation times to be the order statistics of random samples from the uniform distribution over the follow-up period. In the following, we considered two sets of true values for the regression parameters, corresponding to the two model specifications. The results given below are based on sample sizes of up to 400 with 1000 replications.
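Under the setup just described, one subject's observation scheme might be generated as follows; the mean parameter mu stands in for the intensity expression in the text, whose exact form is not reproduced here:

```python
import numpy as np

rng = np.random.default_rng(1)

def gen_observation_times(mu, low=2.0, high=3.0):
    """Generate one subject's observation scheme as in the simulation:
    follow-up time C ~ Uniform[low, high], number of observations
    K ~ Poisson(mu) (taken >= 1), and observation times equal to the
    order statistics of K draws from Uniform(0, C)."""
    C = rng.uniform(low, high)
    K = max(1, rng.poisson(mu))
    U = np.sort(rng.uniform(0.0, C, size=K))
    return C, U
```

Each replication of the study would repeat this generation for every subject, with mu depending on that subject's covariates and latent variable.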
Table 1 gives the results on the estimation of the regression parameters given by the proposed estimation procedure under the setup described above. Here, we calculated the estimated bias (Bias), given by the average of the estimates minus the true value, the sample standard error (SSE) of the estimates, the average of the estimated standard errors (ESE) and the empirical coverage probability (CP). The results suggest that the proposed estimator of the regression parameters is essentially unbiased and that the variance estimation based on the profile likelihood approach is reasonable. Furthermore, the results on the empirical coverage probabilities indicate that the normal approximation to the distribution of the proposed estimator appears to be appropriate. In addition, the results improved in general with increasing sample size, as expected.
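The four summary metrics just defined can be computed from the replication results as follows; this is a sketch, and the function and argument names are ours:

```python
import numpy as np

def summarize(estimates, ses, true_value, level=1.96):
    """Summary metrics of the simulation tables: Bias, SSE, ESE and the
    empirical coverage probability (CP) of the 95% normal CI."""
    est = np.asarray(estimates)
    se = np.asarray(ses)
    bias = est.mean() - true_value                       # Bias
    sse = est.std(ddof=1)                                # sample SD of estimates
    ese = se.mean()                                      # average estimated SE
    cp = np.mean(np.abs(est - true_value) <= level * se) # coverage
    return {"Bias": bias, "SSE": sse, "ESE": ese, "CP": cp}
```

Agreement between SSE and ESE, together with CP near the nominal level, is what the tables are read for.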
As mentioned before, the proposed estimation procedure can be applied to any distribution for the latent variables. To see this, we repeated the study above, except that we generated the latent variables from a uniform distribution; Table 2 presents the obtained results on the estimation of the regression parameters. One can see that they are similar to those given in Table 1 and again suggest that the proposed approach works well for the situations considered. To assess the performance of the proposed approach with different types of covariates, we also repeated the study giving Table 1, except that both covariates were assumed to follow the standard normal distribution, and present the obtained results in Table 3. They indicate that the proposed estimation procedure seems to be robust to different types of covariates.
Note that the proposed estimation procedure assumes that the observation process is a non-homogeneous Poisson process, and one may be interested in the performance of the proposed method when this assumption does not hold. To see this, we repeated the study giving Table 1, except that the observation processes were assumed to be mixed Poisson processes, with the frailties in the intensity functions generated from a gamma distribution. Table 4 presents the results on the estimation of the regression parameters given by the proposed approach, and they indicate that the approach seems to be robust to such observation processes.
For the initial values in the EM algorithm, we set the parameters and jump sizes to simple starting values. It is worth pointing out that we did try other initial values, and the proposed EM algorithm seems to be robust with respect to their selection; in other words, we did not encounter any non-convergence issues in the simulation study. We also considered some other setups, including multivariate cases and the case with more than one covariate, and obtained similar results.
6. An Application
In this section, the estimation procedure proposed in the previous sections is applied to the set of bivariate interval-censored data arising from an AIDS clinical trial, AIDS Clinical Trials Group 181, described in Goggins and Finkelstein (2000) [11]. The study concerns the opportunistic infection cytomegalovirus (CMV) and examined the study patients periodically. At each clinical visit or observation, among other information, blood and urine samples were collected and tested to detect the presence of the CMV virus in the sample, commonly referred to as the shedding of the virus. In addition, for each patient, the CD4 count, which indicates the status of a person's immune system and is commonly used to measure the stage of HIV infection, was recorded at the entry time. For the analysis here, we are mainly interested in whether the baseline CD4 count, an indicator of the initial stage of HIV disease, is related to the CMV shedding risk in either blood or urine.
The data set consists of 204 subjects, who belong to two groups based on their baseline CD4 counts: either less than 75 or otherwise. More specifically, the two groups have 111 and 93 patients, respectively. On the observation of the CMV shedding times, some patients gave left-censored observations and some gave right-censored observations; the others provided interval-censored observations, given by the last negative and first positive test dates. That is, we have bivariate interval-censored data on the CMV shedding times in the blood and urine. The percentage of right-censored observations for the CMV shedding times is higher in the blood than in the urine, which indicates that the CMV shedding risk in the urine may be higher than that in the blood. For the application of the proposed estimation procedure, let the two failure times denote the CMV shedding times in the blood and urine associated with the ith patient, respectively, and define the covariate to be 1 if the ith subject's baseline CD4 count was less than 75 and 0 otherwise. As in the simulation study, we used the same model specifications for all patients.
Table 5 presents the estimation results given by the proposed approach for different combinations of the transformation parameters, including the estimated covariate effects, the estimated standard errors (SE) and the p-values for testing no covariate effect (P). In addition, we calculated the Akaike Information Criterion (AIC; Akaike, 1973 [29]) and the Bayesian Information Criterion (BIC; Schwarz, 1978 [30]) for the selection of the optimal model. One can see from the table that the AIC and BIC values are quite close for all combinations of the transformation parameters, and the same is true for the estimated effects. Choosing the combination that corresponds to the proportional hazards models for both failure times, the estimated covariate effects and their estimated standard errors suggest that the patients with lower CD4 counts at baseline experienced CMV shedding in both blood and urine significantly earlier. To provide a graphical view of the difference between the CMV shedding in the blood and urine, Figure 1 presents the estimates of the baseline marginal survival functions given by the proposed method for the CMV shedding times in the blood and urine, respectively. They suggest that, as discussed above, the CMV shedding in the urine occurred much earlier than in the blood.
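For reference, the AIC and BIC used for the model selection above reduce to simple formulas in the maximized log-likelihood, the number of free parameters k and the sample size n:

```python
import math

def aic(loglik, k):
    """Akaike Information Criterion: 2k - 2 * loglik."""
    return 2.0 * k - 2.0 * loglik

def bic(loglik, k, n):
    """Bayesian Information Criterion: k * log(n) - 2 * loglik."""
    return k * math.log(n) - 2.0 * loglik
```

Lower values of either criterion indicate a preferred model; BIC penalizes extra parameters more heavily for the sample sizes considered here.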
In addition, the proposed method yielded estimates of the correlation parameters with estimated standard errors indicating that the observation process was significantly correlated with the CMV shedding times in both blood and urine; that is, we had dependent or informative censoring. To investigate the effects of informative censoring on the estimated covariate effects, we also fitted the model under independent interval censoring and obtained the corresponding estimates and standard errors, along with the p-values for testing no covariate effects. Although these results are similar to those given above, it is apparent that the independent-censoring analysis underestimated the effects of the baseline CD4 count on the risks of CMV shedding.
7. Discussion and Concluding Remarks
In the preceding sections, the regression analysis of case K multivariate interval-censored failure-time data was discussed under a general class of semiparametric transformation models in the presence of informative censoring. For the problem, an approximate maximum likelihood estimation procedure was proposed and the resulting estimators of the regression parameters were shown to be consistent and asymptotically normal. In the method, the frailty approach was employed to characterize the informative censoring as well as the relationship among the correlated failure times of interest. To implement the proposed approach, a novel EM algorithm was developed and the numerical studies indicated that the proposed method works well in practical situations. In addition, it was applied to a set of real bivariate interval-censored data arising from an AIDS clinical trial.
The proposed approach can be seen as a generalization of the method given by Zeng et al. (2017) [15] that allows for informative interval censoring, which can occur quite often, as discussed above and in the literature. In particular, it has been shown that in the presence of informative censoring, an analysis that ignores it could lead to biased or misleading results and conclusions. The proposed method has the advantages that it does not impose a particular assumption on the distribution of the latent variables and that it is quite flexible and can be easily implemented. Moreover, the type of data considered here includes most types of failure time data discussed in the literature as special cases, and model (1) gives many commonly used models.
As discussed above, although model (1) is quite flexible, it may not be straightforward to choose an optimal model for a given set of data, and one commonly used procedure for this is to apply the AIC or BIC. As an alternative, one may prefer to develop a model-checking or data-driven technique. However, this may be difficult, and such a method does not seem to exist even for simple types of multivariate interval-censored data. It is worth noting that instead of the proposed approximate maximum likelihood estimation method, one may consider a full maximum likelihood estimation procedure. For this, however, one would need to specify or postulate some distributions for the latent variables, which may be hard to verify, and the implementation would also be much more complicated.