2.1. Covariate Adjustment Methods
To compare outcome means between treatment groups, we use ANOVA (when we do not perform covariate adjustment) or ANCOVA (when we perform covariate adjustment), assuming that the error terms are independent, normally distributed, and with equal variance. For sensitivity analysis purposes, we may also want to use alternative statistical methods that do not make these parametric assumptions, to evaluate how robust the results of the ANOVA/ANCOVA methods are to their specific assumptions. In our paper, the covariate adjustment was performed using three GEL methods—EL estimation, ET estimation, and CUE—in addition to the ANCOVA method. The technical details regarding the GEL methods and the three nonparametric covariate adjustment methods based on the EL, ET, and CUE methods are included in the sections of the Appendix. Here, we are providing only a simplified description of these covariate adjustment methods to allow the reader to understand the main ideas underlying them.
For simplicity, let us consider a randomized study where we have two treatment groups—one outcome, and one covariate. We want to estimate the outcome mean difference between the two treatment groups with adjustment for the covariate. The GEL-based covariate adjustment methods start with all observations having uniform weights , where n is the total sample size. To estimate the outcome mean difference, we reweigh the observations as little as possible, as measured by a “distance” between the uniform weights and the new weights, such that the weighted means (using the new weights) for the covariate for the two treatment groups are equal (i.e., covariate balance). The estimate of the outcome mean difference is the difference between the weighted means (using the new weights that provide covariate balance) for the outcome.
To construct the 95% confidence interval for the outcome mean difference by using the test inversion method, for each hypothesized value for the outcome mean difference, we reweigh the observations to achieve covariate balance and to have the outcome (weighted) mean difference equals the hypothesized value. If the new weights are “too far” from the uniform weights, we do not include that specific hypothesized value (for the outcome mean difference) in the 95% confidence interval. Conceptually, to construct the 95% confidence interval, we perform this for all possible values for the outcome mean difference. It is important to note that the only difference between the three GEL-based covariate adjustment methods is the specific measure used to quantify the “distance” between the uniform weights and the new weights.
2.2. Simulation Study
The simulation study had two goals. The first goal was to estimate the root mean squared error (RMSE) for each method using 10,000 simulations for each scenario. The second goal was to evaluate how well the nominal 95% CIs for the difference between means constructed by these methods cover the true mean difference (0, in our simulation study) by calculating the empirical coverage based on 10,000 simulations. The point estimates and corresponding 95% confidence intervals for the difference between means using the EL, ET, and CUE methods were constructed using the R package
gmm [
7,
8]. These confidence intervals for the GEL methods that are constructed based on test inversion are only available starting with version 1.6 of the R package
gmm.
Our simulation study is divided into three parts. In the first part, we consider situations involving equal sample sizes for the treatment groups, homoscedasticity, and no interaction between covariates and the treatment group. In the second part, we consider situations involving unequal sample sizes for the treatment groups, heteroscedasticity, and/or interactions between covariates and the treatment group. For both the first and the second part of the simulation study, we consider only the case when the true outcome mean difference is zero. In the third part of the simulation study, we use real data from Lanphear
et al. [
4] to investigate situations involving equal sample sizes for the treatment groups, homoscedasticity, and no interaction between covariates and the treatment group, similar to the first part of the simulation study, while considering situations where the true outcome mean difference is different from zero. We note that our simulation study is comprehensive by covering a broad range of possible situations and also by including simulations based on real data.
The general setup for the simulation study was as follows:
We estimated the difference between means and constructed corresponding 95% CIs, without adjustment and with adjustment for one covariate or two covariates;
We performed 10,000 simulations for each scenario under investigation;
We considered a sample size of 200 from which are assigned to group 1 (z = 0) and are assigned to group 2 (z = 1), where δ is between 0 and 1. Without loss of generality, the vector z is generated by setting the first elements to 0 and the remaining ones to 1;
For the underlying distributions of the data, we considered the following three types of multivariate distributions for
, where
y is the outcome and
and
are the covariates:
- (a)
Normal (generated using the R package
mvtnorm [
9]);
- (b)
t with three degrees of freedom (generated using the R package
mnormt [
10]);
- (c)
Centered lognormal (generated using the R package
mvtnorm [
9]).
For each distribution, , , and the three variables have mean 0. For the lognormal, which is the exponential of a multivariate normal with mean 0 and covariance matrix Σ, the multivariate normal was selected as to obtain the desired variances and correlations. We also subtracted from each variable its expected value.
In the simulation, we want to evaluate different scenarios. In particular, we want to allow for unequal assignment to the treatment groups,
, and/or
. In order to accomplish that, after generating the 200 observations, the outcome is modified as follows: Every
with
is multiplied by
, and then
is added, where
is a parameter that affects the variance of
y when
, and
is another parameter that affects the correlation between
y and the covariates when
. This modification has no effect on
y when
, but it affects the variance of
y and its correlation with the covariates when
in the following way:
2.2.1. Equal Sample Sizes, Homoscedasticity, and No Interaction
For the first part of our simulation, we set , , and , which implies and for the two treatment groups. In this set of simulations, we want to compare the properties of the four methods for different values of the correlation coefficient ρ. In particular, we consider ρ being equal to one of the following values: .
We note that the simulated data satisfies the moment conditions for the GEL methods for all three distributions considered. The data simulated using the normal distribution satisfies the ANOVA/ANCOVA assumptions. The data simulated using the t distribution with three degrees of freedom and the lognormal distribution satisfies the ANOVA/ANCOVA assumptions except the normality assumption for the error terms, although the use of treatment groups with equal sample sizes makes the ANOVA/ANCOVA method robust to violations of the normality assumption, see [
5] and [
11]. Because of the randomization, there is no confounding due to the covariates. We are adjusting for covariates only to increase the efficiency of our estimators for the outcome mean difference between the two treatment groups.
2.2.2. Unequal Sample Sizes, Heteroscedasticity, and/or Interaction
For the second part of the simulation study, we consider scenarios involving unbalanced treatment groups, heteroscedasticity, and/or interactions between covariates and treatment group. For each distribution, we consider five different combinations of the parameters : Case 1: , Case 2: , Case 3: , Case 4: , and Case 5: . The correlation coefficient ρ is set to 0.5 for all these five cases.
Specifically, Case 1 involves unequal group sizes, heteroscedasticity and interaction ( and ), Case 2 involves equal group sizes, homoscedasticity and interaction ( and ), Case 3 involves unequal group sizes, homoscedasticity and no interaction ( and ), Case 4 involves equal group sizes, heteroscedasticity and no interaction ( and ), and Case 5 involves unequal group sizes, heteroscedasticity and no interaction ( and ). We note that the validity of the GEL moment conditions is not affected by these changes, while the validity of the ANCOVA assumptions (i.e., homoscedasticity, no covariate by treatment interaction) is affected.
2.2.3. Real Data and Non-Null Effect Sizes
To enhance the paper, we have used real data from the randomized controlled trial described in Lanphear
et al. [
4] to perform additional simulations that are close to a real life situation, and also to illustrate the use of the four covariate adjustment methods with real data. We have used for this paper the data for the 169 children for whom both six-months baseline and 48-months follow-up blood lead concentrations are available. This includes 89 children randomized to the intervention group (group 2 or
, using the above terminology) and 80 children randomized to the control group (group 1 or
). Similar to the original study, we have used the natural log transformed blood lead concentration values instead of the original blood lead concentration values. The outcome for this experimental study was the natural log transformed blood lead concentration at the 48-month follow-up, while the covariate was the natural log transformed blood lead concentration at the six-months baseline.
For the third part of the simulation study, we have used descriptive statistics (means, standard deviations, and correlation coefficient) from the real data, to consider scenarios where the mean difference is not null, to allow us to compare the statistical power of the four different methods. We have expressed the mean difference in standard deviation units. The setup of the simulations was as follows: for each treatment group, is a bivariate normal with mean and covariance matrix Σ, with , , and , where is the element of Σ on the row and column, and the number of replications equals to 10,000. For the third part of the simulation study, we have , , and , which implies the same variance and correlation for the two groups. The correlation between x and y is therefore equal to 0.35. Here when (i.e., the control group or group 1) and when (i.e., the intervention group or group 2), where . For , we evaluate the size of the statistical tests, while, for all other values, we estimate the statistical power. We have simulated normal data because the distributions of the natural log transformed blood lead concentrations for the two treatment groups were approximately normal.