Next Article in Journal
A Spatial–Temporal Bayesian Model for a Case-Crossover Design with Application to Extreme Heat and Claims Data
Previous Article in Journal
Some Notes on the Gini Index and New Inequality Measures: The nth Gini Index
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Communication

Comparing the Relative Efficacy of Generalized Estimating Equations, Latent Growth Curve Modeling, and Area Under the Curve with a Repeated Measures Discrete Ordinal Outcome Variable

1
Department of Urban Public Health and Nutrition, School of Nursing and Health Sciences, La Salle University, Philadelphia, PA 18901, USA
2
International Research Institute of North Carolina, USA
*
Author to whom correspondence should be addressed.
Stats 2024, 7(4), 1366-1378; https://doi.org/10.3390/stats7040079
Submission received: 13 October 2024 / Revised: 10 November 2024 / Accepted: 15 November 2024 / Published: 18 November 2024

Abstract

:
Researchers are often interested in how changes in one variable influence changes in a second variable, requiring the repeated measures of two variables. There are several multivariate statistical methods appropriate for this research design, including generalized estimating equations (GEE) and latent growth curve modeling (LGCM). Both methods allow for variables that are not continuous in measurement level and not normally distributed. More recently, researchers have begun to employ area under the curve (AUC) as a potential alternative when the nature of change is less important than the overall effect of time on repeated measures of a random variable. The research showed that AUC is an acceptable alternative to LGCM with repeated measures of a continuous and a zero-inflated Poisson random variable. However, less is known about its performance relative to GEE and LGCM when the repeated measures are ordinal random variables. Further, to our knowledge, no study has compared AUC to LGCM or GEE when there are two longitudinal processes. We thus compared AUC to LGCM and GEE, assessing the effects of repeated measures of psychological distress on repeated measures of smoking. Results suggest AUC performed equally well with both methods, although missing data management is an issue with both AUC and GEE.

1. Introduction

Statistical analysis options for repeated measures designs vary depending upon the nature of the outcome variable assessed. When the outcome variable (dependent variable) is continuous and normally distributed (multivariate normal), there are more analysis options than when the dependent variable is discrete. Most students taking statistics courses in undergraduate or even some graduate programs learn little about the different statistical analysis methods available for dealing with repeated measures designs, especially when the dependent variable is discrete. The longitudinal data analysis method they are most likely to encounter is repeated measures ANOVA (RM-ANOVA). Although providing some flexibility, such as the ability to assess differences among time points and shape of change over time in a post hoc analysis (e.g., linear or quadratic), RM-ANOVA has limitations when compared to more advanced multivariate statistical methods, such as generalized estimating equations (GEE) and latent growth curve modeling (LGCM). Both of these methods, however, have advantages and disadvantages as well. A more recent method beginning to be employed in the assessment of repeated measures of a variable of interest is area under the curve (AUC). Researchers are now assessing its comparative effectiveness in relation to these methods.
GEE is a multivariate method that has several advantages over RM-ANOVA [1,2]. First, unlike RM-ANOVA, GEE permits repeated measure variables to be discrete or continuous. Second, there is no need to specify the multivariate distribution as the aim is parameter estimation instead of model testing, and GEE provides normally distributed and consistent parameter estimates by simply specifying the correct mean structure [3]. RM-ANOVA, by contrast, assumes the repeated measures are multivariate normally distributed [4]. Third, GEE permits the selection of the appropriate correlation structure (i.e., the correlations among the repeated measures), and the efficiency of parameter estimates does not suffer from an incorrectly specified correlation structure. RM-ANOVA assumes the correlations between measures remain constant [5]. Further, GEE permits estimation of more complicated relations in a single model, such as when one desires to explore the relation between two repeated measures variables (e.g., the effect of change in one variable on change in a second variable). Assessing the effects of time-varying covariates is not easily available in RM-ANOVA, although one can format the data to permit the use of a time-varying covariate [6].
Equation (1) presents a basic GEE cumulative logit model with three covariates (x1 through x3; the number of covariates employed in the present study), where the logit is the link function relating the covariates to the repeated-measures outcome variable Yit, j is the level of the ordinal variable Y, where j ranges from 1 to J-1, and t represents time (i.e., the unit of time for the repeated measures). Equation (2) is the general cumulative logit model relating the vectors of predictor variables (X) and parameters (β) to the vector of outcomes (Y), with the mean structure being the probability of the jth level of the ordinal outcome variable y for individual i at time t (Equation (3)) [3].
l o g i t p r y i t j = β 0 j + β 1 x i 1 + β 2 x i 2 + β 3 x i 3 + β 4 t
l o g i t p r Y i t j / X i t = β X i t
E ( Y i t / X i t ; β ) = μ i t 1 , , μ i t J 1 , w h e r e   μ i t ( m u ) =   p r O i t = j / X i t ; β , a n d   O i t   i s   t h e   o r d i n a l   r e s p o n s e   1,2 , 3 , J   f o r   i n d i v i d u a l   i   a t   t i m e   t .
While there are indeed many advantages to GEE over RM-ANOVA, limitations include the inability to test models and compare different models since GEE lacks a likelihood function. Further, the treatment of missing data is a question. With GEE, one assumes that data are missing completely at random (MCAR), meaning that the available cases are a random sample of all cases (absent missing data) [3,7,8]. Although plausible, MCAR is less likely than data being missing at random (MAR). When the data are MAR, this indicates that missingness is related to a predictor variable (or predictor variables) or prior measures of the dependent variable. This is a less restrictive assumption. Although there are extensions to model missing data in GEE for cases that are not MCAR [9,10], these are less likely to be employed in a naive analysis.
Another issue related to modeling discrete dependent variables in GEE is the use of correlation coefficients to represent the relations among the repeated measures. Simulation studies suggest that the best measure of association among repeated measures of discrete data is the local odds ratio (LOR) rather than correlation coefficients [11,12]. Although we did not address this limitation in our GEE analysis in this study, as our aim is to conduct a simple (naive) analysis that most researchers would do, it is noted that there are more efficient ways to estimate associations than correlation coefficients.
Many of the restrictions encountered with GEE are not met when using LGCM with repeated measures data. LGCM is a multivariate method that employs unobserved (latent) variables to represent initial level (baseline) and rate of change from baseline (trend) [13,14]. Equation (4) presents the relations among the latent and observed variables in a prototypical LGCM with ordinal data [15].
y = ν + Λ η + ε , w h e r e   y   i s   a   v e c t o r   o f   o b s e r v e d   o r d i n a l   o u t c o m e s ,   v   i s   a   v e c t o r   o f   r e g r e s s i o n   i n t e r c e p t s , Λ l a m b d a   i s   a   m a t r i x   o f   f a c t o r   l o a d i n g s   r e l a t i n g   t h e   o b s e r v e d   v a r i a b l e s   t o   t h e   v e c t o r   o f l a t e n t   v a r i a b l e s   η   e t a ,   a n d   ε   e p s i l o n   i s   a   v e c t o r   o f   r e s i d u a l s .
Equation (5) clarifies Equation (4) with respect to three repeated measures, i.e., the number of repeated measures employed in the present study.
y i 1 y i 2 y i 3 = 0 0 0 + 1 0 1 1 1 2 × η 0 η 1 + ε i 1 ε i 2 ε i 3
Working from left to right, we have a vector of ordinal outcome variables for participant i at time t for T = 3 time points. On the right side of the equation, we have a vector of regression intercepts, set to zero. The Λ matrix includes the factor loadings for the intercept factor (first column) and the slope factor (second column). The intercept loadings are set to one, indicating an unchanged relation between the intercept and the repeated measures. The unit increasing factor loadings in the second column suggest linear growth over time. The next vector includes the two factors (η), the first representing the intercept factor and the second representing a linear trend factor. If we had proposed a quadratic trend, there would have been a third factor term. For a cubic trend, there would have been a fourth factor term. The final vector represents the residual, one for each time point. These are assumed to be uncorrelated, although at times researchers may correlate them to improve model fit to the data. However, this is generally not recommended [16,17,18].
The latent variables (η) underlying the observed ordinal variables are continuous, not discrete. To capture progression across increasing levels of the ordinal variable, LGCM uses thresholds (τ; tau). Thresholds are cut points in a continuum representing propensities to progress from one ordinal category to the next in an observed variable [15,19]. To assess change over time, these thresholds are constrained to equality across time points. As such, a change in participant proportions across ordinal levels, increasing from a lower to a higher category over time, results in an increased propensity to cross thresholds to higher levels of the dependent variable.
With respect to the latent variables, they are assumed to be multivariate normally distributed, with a mean α (alpha) and variance/covariance ψ (psi; Equation (6)) [15,19]. The residuals are assumed to be normally distributed, with a mean of 0 and a variance θ (theta; Equation (7)). For a very detailed presentation on LGCM with ordinal data, see the Mehta et al. reference [15]. See also Masyn et al. reference [19]. For a general introduction to growth modeling, see Duncan and Duncan [13].
η ~ M V N α , ψ
ε ~ N ( 0 , θ )
LGCM also permits researchers to assess the effect of potential predictor variables on baseline and trend factors. In addition, one can assess how changes in another repeated measures variable influence the repeated measures of the dependent variable, what is commonly known as an associative (parallel) process in LGCM [20]. This could involve, for instance, assessing how a baseline level of one process influences a trend for a second process, as well as trend-to-trend effects. We represent effects on the growth factors in Equation (8).
η = α + Γ ξ + Β η + ζ ,   w h e r e   α   i s   t h e   i n t e r c e p t , Γ ( G a m m a )   i s   a   m a t r i x   o f   c o e f f i c i e n t s   r e l a t i n g   o b s e r v e d   p r e d i c t o r s   t o   t h e   l a t e n t   v a r i a b l e s , ξ X i   i s   t h e   v e c t o r   o f   o b s e r v e d   p r e d i c t o r   v a r i a b l e s , Β   i s   a   m a t r i x   o f   c o e f f i c i e n t s   r e l a t i n g   t h e   l a t e n t   v a r i a b l e s   η   t o   o n e   a n o t h e r , a n d   ζ Z e t a   i s   t h e   v e c t o r   o f   r e s i d u a l s .
Advantages of LGCM include a likelihood function, allowing for model comparisons. In addition, one can test multiple hypotheses, exploring the effects of predictors on repeated measures of a dependent variable. Moreover, one can assess the effects of initial level and rate of change on other outcome variables, whether latent or observed. Further, like GEE, one can assess change on a variety of levels of a dependent variable, whether continuous or discrete. Another advantage of LGCM, not shared by GEE, is how it handles missing data. Unlike GEE, which assumes MCAR, LGCM assumes data are MAR. This is a far more tenable assumption. Using full information maximum likelihood (FIML) estimation, LGCM uses all available data on the dependent variables to estimate parameters [21]. Thus, it has the same sample size as GEE, but with a less restrictive assumption.
Despite the advantages of both methods, conducting GEE and LGCM requires more than a rudimentary understanding of statistics. Moreover, although not necessarily the case for GEE, one needs specific software to conduct LGCM (e.g., Mplus). This can be an unnecessary impediment to researchers if they are interested in the assessment of longitudinal processes but not in the impact of selected covariates on rates of change or the shape of development (e.g., linear, quadratic, or cubic). For instance, a researcher may wish to assess whether males or females differ in psychological distress (PD) over multiple time points, but not whether males differ from females in the rate of change in PD from baseline or whether change in PD is linear or quadratic in nature. Further, one important limitation with LGCM and GEE alike is related to the nature of ordinal variables, the type of discrete random variable we are employing in our analysis. When modeling with an ordinal random outcome variable (Y) with k categories, one assumes proportional odds, such that the odds related to a set of predictor variables (X) remain constant when comparing different levels of the dependent variable [22,23,24,25]. This is seen in Equation (1), where we predict the log odds (logit) of being in category ≤ j vs. > j given a set of predictor variables (x). However, this assumption may not always hold in practice, meaning that multinomial logistic regression instead of ordinal logistic regression is the more appropriate analysis option.
One existing method that researchers are recently beginning to investigate as a viable alternative for longitudinal data analysis is area under the curve (AUC). With this method, one merely calculates the area under the curve generated by the repeated measures. Equation (9) presents the calculation of area under the curve for repeated measures analysis, known as AUC with respect to the ground [26,27,28]. Employing the trapezoid rule for calculating AUC across T time points, we have
A U C g r o u n d =   y 2 + y 1 2 × x 2 x 1 + y 3 + y 2 2 × x 3 x 2 + + y T + y T 1 2 × x T x T 1 , w h e r e   y t   i s   t h e   y     a x i s   v a l u e   a n d   x t   i s   t h e   x     a x i s   v a l u e   a t   t i m e p o i n t   t .
If we let ti represent our intervals xt − xt−1, there will be one less interval than time points (Equation (10)).
A U C g r o u n d = i = 1 T 1 y i + 1 + y i 2 × t i
If the intervals are constant in length, Equation (10) reduces to Equation (11):
A U C g r o u n d = i = 1 T 1 t 2 y i + 1 + y i
Researchers have shown that AUC performs as well as LGCM when the data are continuous or discrete counts [26,27]. However, to our knowledge, no study has yet compared AUC to GEE or LGCM using discrete ordinal random variables.
AUC has several advantages over GEE and LGCM. First, instead of multiple dependent variables to form growth curves, AUC is a single variable that can be employed in simpler statistical methods, such as regression analysis, t-tests, and ANOVAs, as well as multivariate methods such as structural equation modeling (SEM). Second, one does not need advanced statistical knowledge to understand AUC, although some basic knowledge about distributions (e.g., normal or Poisson) is a plus. Further, there is no need to consider issues such as the proportional odds assumption when calculating AUC, as the primary concern is the area created by the different rectangles covering the mass or density for the T-1 intervals.
Despite these advantages, there are some key limitations. First, in calculating AUC without any modifications, individuals missing data at a given time point are deleted listwise, meaning the entire record for that participant is expunged, resulting in a potentially drastic reduction in sample size. As such, AUC assumes data are MCAR, and data imputation methods are necessary to reduce the number of missing cases. Second, one must write syntax to calculate AUC for repeated measures designs at this point, and this may be difficult for those with limited experience with syntax and coding. Acknowledging these limitations, though, AUC may provide a useful alternative to other repeated measures statistical methods. As such, the purpose of the present study was to assess the relative efficacy of AUC compared to GEE and LGCM using real data from a publicly available data source. Using data from the Panel Study of Income Dynamics (PSID) Transition to Adulthood (TA) study and all three data analysis methods, we assessed the impact of repeated measures of a continuous predictor variable, psychological distress, on repeated measures of an ordinal dependent variable, nicotine use, both measured in 2017, 2019, and 2021.
The rationale for selecting these variables is as follows: In this study, we assessed the relation between psychological distress and smoking in older adolescents and young adults (ages 18–24). Smoking is the leading cause of preventable disease and death in the United States [29], and while combustible cigarette smoking has decreased in recent years, e-cigarette smoking (vaping) has been on the rise, and both remain serious risk factors for poor health outcomes [30,31]. A potential reason for smoking in older adolescents and young adults is the rising prevalence of mental health issues [32]. Nicotine can temporarily relieve symptoms of mental illness, such as anxiety and depression, and many people suffering from these issues may use it to self-medicate, leading to nicotine dependence [33,34,35,36]. In the long term, however, smoking can increase susceptibility to anxiety and the severity of depression, exacerbating poor mental health [36,37]. Thus, studying the relation between psychological distress and smoking in older adolescents and young adults may help to find solutions to both problems. Finally, we also controlled for race and education level, as these are two factors that are known to affect smoking status [38].

2. Materials and Methods

Participants and procedures: Participants were drawn from a total sample size of n = 4222 young adults (18–24 years old) at each of three data collection waves (2017, 2019, and 2021) who were taking part in the Panel Study of Income Dynamics (PSID) Transition to Adulthood (TA) supplement. Started in 1968, the PSID is the longest continuously running cohort study [39]. Although its original purpose was to understand the intergenerational transmission of poverty, subsequent data collections have included a variety of variables covering health and other experiential domains. The final sample sizes per analysis differed due to missing data and its treatment in each statistical method, with 2510 participants for the GEE and 2511 participants for the LGCM and AUC, after multiple imputation for the AUC analysis. The large number of participants missing per wave resulted from the transitional nature of this supplement. Participants age out, and new participants enter the TA sample during each data collection wave. However, we were only interested in following the same participants (cohort) across the data collection waves to meet our aim. We found that the three waves selected maximized our cohort size, as adding additional waves resulted in a drastic decrease in sample size.
Instrumentation: To estimate the use of nicotine across time, we generated a four-level ordinal variable from two PSID questions related to one’s nicotine use behavior (i.e., “Do you smoke cigarettes?” and “Have you ever vaped?”). Values are 0—does not use a nicotine product, neither combustible cigarettes (smoking) nor electronic cigarettes (e-cigarette; vaping), 1—vaped or vapes, 2—smokes, and 3—vaped/vapes and smokes. We assessed psychological distress with a variable that is the sum of six five-point (0 through 4) Likert-style questions asking how often in the past month the respondent felt nervous, hopeless, restless, everything an effort, too sad, and worthless. Scores ranged from 0 to 24 points. Finally, we controlled for race (0—non-White and 1—White) and education (0—greater than a high school education and 1—high school education or less).
Data analysis methods: We conducted GEE, LGCM, and AUC. Our GEE analysis included four predictor variables: the time-varying covariate psychological distress, the two time-invariant covariates education and race (both measured in 2017), and time (i.e., data collection wave). We also included two interaction terms, one each for the time-invariant covariates: race by time and education by time. This allowed us to assess whether the effects of education and race smoking in 2017, if any, changed across time.
For the LGCM, we conducted an associated-processes model, with one process for repeated measures of psychological distress and a second process for smoking. Each LGCM included two factors, one for the intercept and one for a linear trend. In each LGCM, we controlled for the effects of the time-invariant covariates, education and race, on the baseline level and linear trend factors. We also controlled for the effect of the baseline level of the opposite LGCM on each trend factor (i.e., the effect of baseline smoking on the psychological distress trend and the effect of baseline psychological distress on the smoking trend). Finally, we assessed the effect of psychological distress trend on smoking trend. To assess the fit of our LGCM to the data, we used the chi-square test of model fit, the comparative fit index (CFI), the root mean square error of approximation (RMSEA), and the standardized root mean residual (SRMR). Heuristics for good fit include a non-significant chi-square, CFI ≥ 0.95, RMSEA ≤ 0.05, and SRMR ≤ 0.06 [40,41,42].
For AUC, we employed multiple regression analysis to assess the effects of psychological distress AUC on smoking AUC, controlling for race and education, both measured in 2017.

3. Results

3.1. Descriptive Statistics

Table 1 presents the descriptive statistics for our sample, including missing data per variable.

3.2. GEE

Table 2 presents the results of the GEE analysis. Psychological distress and time had significant and positive effects on nicotine use. Converting the logits (log odds) to odds ratios, each unit increase in psychological distress across the three time points was associated with a 5% increase in the odds of progressing to a higher level of smoking, from not smoking to smoking both combustible and e-cigarettes (OR = 1.053; 95% CI = 1.029, 1.078). Compared to baseline (time 1; 2017), time 2 (2019) and time 3 (2021) were associated with a 34% (OR = 1.34; 95% CI = 1.174, 1.531) and an over two-fold (OR = 2.054; 95% CI = 1.711, 2.467) increase in the odds of progressing to a higher level of smoking, respectively.

3.3. LGCM

Table 3 presents the results of the associated-processes LGCM analysis. The model fit the data fairly well: χ2(df=15) = 184.613, p < 0.001; CFI = 0.990; RMSEA = 0.067, 90% CI = 0.059, 0.076; and SRMR = 0.043. These results indicate acceptable fit, with the exception of the significant chi-square, suggesting some model misfit that is amplified by the larger sample size [43]. The RMSEA was above the 0.06 cutoff for good fit, suggesting an acceptable but not necessarily good fit. Regarding model effects, baseline psychological distress had a significant and positive effect on smoking trend (b = 0.037, z = 6.777, p < 0.001). The trend-to-trend path from psychological distress to smoking distress, however, was not significant (p = 0.259).

3.4. AUC

Table 4 presents the results of the AUC multiple regression analysis, for the original sample based on listwise deletion of missing data (n = 1095) and the multiple imputation sample (n = 2511). Figure 1 presents the standardized residual plots for the initial sample (panel a) and each of the five imputations (panels b–f) from the multiple imputation runs. Figure 2 presents the residual plots of the standardized residuals for the initial sample (panel a) and each of the five imputations (panels b–f) from the multiple imputation runs. Looking at the regression assumptions, the standardized residuals peak at lower values, with higher values approaching normality, suggesting most participants have lower levels of smoking (Figure 1), and there is apparent homoscedasticity when looking at the variance of the residuals (Figure 2). Given that, as sample size increases, there is convergence in the distribution of standardized variables to the standard normal distribution (central limit theorem) [44], possible violations of the normality assumption are unlikely. Further, there was no difference in the pattern of results between the two samples (Table 4). However, we will look at the multiple imputation sample, as its size is comparable to the sample sizes for the GEE and the LGCM analyses. Psychological distress had a significant and positive effect on smoking (b = 0.033, t = 7.189, p < 0.0001).

3.5. Comparing the Three Methods

In all three methods, psychological distress had a significant effect on smoking. The only difference was in how the effects are partitioned in LGCM. With LGCM, we were able to explore the effects of baseline psychological distress on smoking trend and the effect of psychological distress trend on smoking trend. Because of this partitioning, we saw that only the effect of baseline distress on the smoking trend was significant. As such, initial levels of psychological distress have the greatest impact on progression to higher levels of smoking, with no further impact from change in distress with time. The two covariates race and education in the LGCM, being White versus non-White and having a high school education or less versus a greater education, were associated with higher levels of smoking at baseline but had the inverse effect on the rate of change from baseline (i.e., a negative effect on smoking trend). This indicates a possible ceiling effect for these variables, meaning that their effect was strong initially, leaving little room for additional growth.
In contrast to the results for the LGCM, neither race nor education had a significant effect on smoking in the GEE analysis. The AUC results were more like those seen with LGCM, with both covariates having significant and positive effects on smoking.

4. Discussion

The aim of the present study was to compare three multivariate methods for analyzing repeated measures ordinal data: GEE, LGCM, and AUC. The results of all three methods agreed on the positive effects of psychological distress on smoking, with LGCM being able to partition the effects between baseline level and trend. This is a key difference among the methods and suggests that researchers interested in understanding how different facets of growth in a longitudinal process, whether initial level or rate of change, affect each other may be best served by LGCM over the other two methods. For researchers interested in overall effects, however, it seems that AUC provides a reasonable choice, as it incorporates the entirety of change within a single variable, whether a predictor or an outcome variable. Nevertheless, the handling of missing data in AUC and GEE can be problematic, especially if one is not well versed in handling missing data using methods such as multiple imputation.
Even though the aim of this study was to compare performance among the three different statistical methods with non-simulated data, the finding that psychological distress was associated with an increase in the propensity to smoke with all three methods is noteworthy. A potential reason for this robust association is that psychological distress may lead some to smoke to self-medicate for symptoms of mental illness. Indeed, psychological distress increased the risk of nicotine use, and the effect persisted across time. This is seen with the significant effect of baseline distress on the LGCM trend, along with a non-significant effect from the psychological distress trend to the smoking trend. This result is mirrored by the effect of psychological distress on smoking in both GEE and AUC, two methods that essentially aggregate change into a single variable (long-format data structure in GEE, calculation in AUC).
Another interesting contrast that mirrors the effects of psychological distress is seen when comparing the effects of education and race on smoking in LGCM and AUC. Both results mirror what is seen with distress, as there appears to be a ceiling effect for race and education in LGCM and a clear positive effect for the two covariates with AUC. This highlights the greater ability to partition effects with LGCM. It is notable that neither variable had a significant effect on smoking using GEE, neither on its own nor as an interaction term with time. More research is needed comparing these different methods to better understand the nature of such differing effects.
Like all studies, there are several limitations to our work. First, our smoking variable was generated based on the available questions in the PSID TA dataset. The questions related to vaping were not as well-defined as those for combustible cigarette smoking, at times failing to delineate past from current use. Creating a similar variable with different questions in another dataset may result in a different pattern of results than found here. Second, the low prevalence of smoking in this sample may have affected the results. As such, the comparisons made here should be made with different samples with a higher prevalence of smoking. This would allow researchers to see if the findings in the analyses here are replicable or merely a product of the sample. Third, we only controlled for two variables, education and race. However, as the aim of this study was to compare the three methods, we did not find it necessary to go beyond two control variables. Researchers interested in better understanding the relation between psychological distress and smoking, however, should include additional control variables. These limitations noted, this study adds to recent studies that suggest that AUC may be a viable alternative method for assessing longitudinal data, especially when the research aim is to understand overall effects rather than partitioning effects by baseline level and trend. Nevertheless, our findings suggest that all three methods are useful to researchers interested in assessing change, with AUC being more accessible because of its simplicity. In our opinion, the next steps are to develop a program to calculate AUC regardless of the number of repeated measures, develop an equation that incorporates baseline to assess change without allowing for negative values, and to add the ability to account for missing data through multiple imputation without the researcher having to write additional code.

Author Contributions

Conceptualization, D.R. and R.V.; methodology, D.R.; software, D.R.; validation, D.R. and J.U.; formal analysis, D.R.; investigation, D.R. and R.V.; resources, D.R., R.V. and J.U.; data curation, D.R.; writing—original draft preparation, D.R., R.V., and J.U.; writing—review and editing, D.R., R.V. and J.U.; visualization, D.R.; supervision, D.R. and J.U.; project administration, D.R. and J.U. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study did not require ethical approval.

Informed Consent Statement

This study used publicly available data and therefore did not require informed consent.

Data Availability Statement

The data employed in this study are from the Panel Study of Income Dynamics (PSID) dataset, https://simba.isr.umich.edu/default.aspx (accessed on 12 July 2024). Direct access to the specific dataset employed for this study is available from the first author.

Acknowledgments

We would like to thank the International Research Institute of North Carolina for providing additional support for article searches and other guidance during this process.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zeger, S.L.; Liang, K.-Y.; Albert, P.S. Models for longitudinal data: A generalized estimating equation approach. Biometrics 1988, 44, 1049–1060. [Google Scholar] [CrossRef] [PubMed]
  2. Liang, K.-Y.; Zeger, S.L. Longitudinal data analysis using generalized linear models. Biometrika 1986, 73, 13–22. [Google Scholar] [CrossRef]
  3. da Silva, J.L.; Colosimo, E.A.; Demarqui, F.N. A general GEE framework for the analysis of longitudinal ordinal missing data and related issues. Stat. Model. 2019, 19, 174–193. [Google Scholar] [CrossRef]
  4. Rodriguez, D. Core Statistic: Practial Knowledge for the Health Sciences, 2nd ed; Kendall Hunt: Dubuque, IA, USA, 2024. [Google Scholar]
  5. Schober, P.; Vetter, T.R. Repeated Measures Designs and Analysis of Longitudinal Data: If at First You Do Not Succeed-Try, Try Again. Anesth. Analg. 2018, 127, 569–575. [Google Scholar] [CrossRef]
  6. How Can I Do Repeated Measures ANOVA with Covariates in SPSS? Available online: https://stats.oarc.ucla.edu/spss/faq/how-can-i-do-repeated-measures-anova-with-covariates-in-spss/ (accessed on 19 September 2024).
  7. Little, R.J.; Rubin, D.B. Statistical Analysis with Missing Data; John Wiley & Sons: Hoboken, NJ, USA, 2019; Volume 793. [Google Scholar]
  8. Lipsitz, S.R.; Fitzmaurice, G.M.; Weiss, R.D. Using Multiple Imputation with GEE with Non-monotone Missing Longitudinal Binary Outcomes. Psychometrika 2020, 85, 890–904. [Google Scholar] [CrossRef]
  9. Robins, J.M.; Rotnitzky, A.; Zhao, L.P. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. J. Am. Stat. Assoc. 1995, 90, 106–121. [Google Scholar] [CrossRef]
  10. Yang, C.; Diao, L.; Cook, R.J. Adaptive response-dependent two-phase designs: Some results on robustness and efficiency. Stat. Med. 2022, 41, 4403–4425. [Google Scholar] [CrossRef] [PubMed]
  11. Zhou, X.; Xu, R.; Elashoff, D. Local odds ratio is more efficient than correlation coefficient for modeling longitudinal ordinal data. In Proceedings of the Joint Statistical Meetings, Baltimore, MD, USA, 29 July–3 August 2017. [Google Scholar]
  12. Touloumis, A.; Agresti, A.; Kateri, M. GEE for multinomial responses using a local odds ratios parameterization. Biometrics 2013, 69, 633–640. [Google Scholar] [CrossRef] [PubMed]
  13. Duncan, T.E.; Duncan, S.C. An introduction to latent growth curve modeling. Behav. Ther. 2004, 35, 333–363. [Google Scholar] [CrossRef]
  14. Duncan, T.E.; Duncan, S.C. The ABC’s of LGM: An introductory guide to latent variable growth curve modeling. Soc. Personal. Psychol. Compass 2009, 3, 979–991. [Google Scholar] [CrossRef]
  15. Mehta, P.D.; Neale, M.C.; Flay, B.R. Squeezing interval change from ordinal panel data: Latent growth curves with ordinal outcomes. Psychol. Methods 2004, 9, 301. [Google Scholar] [CrossRef] [PubMed]
  16. Hermida, R. The problem of allowing correlated errors in structural equation modeling: Concerns and considerations. Comput. Methods Soc. Sci. 2015, 3, 5–17. [Google Scholar]
  17. MacCallum, R.C.; Roznowski, M.; Necowitz, L.B. Model modifications in covariance structure analysis: The problem of capitalization on chance. Psychol. Bull. 1992, 111, 490. [Google Scholar] [CrossRef] [PubMed]
  18. Goffin, R.D. Assessing the adequacy of structural equation models: Golden rules and editorial policies. Personal. Individ. Differ. 2007, 42, 831–839. [Google Scholar] [CrossRef]
  19. Masyn, K.E.; Petras, H.; Liu, W. Growth curve models with categorical outcomes. In Encyclopedia of Criminology and Criminal Justice; Springer: New York, NY, USA, 2014; pp. 2013–2025. [Google Scholar]
  20. Rodriguez, D. Research Methods; Kendall Hunt Publishing Company: Dubuque, IA, USA, 2021. [Google Scholar]
  21. Schminkey, D.L.; von Oertzen, T.; Bullock, L. Handling missing data with multilevel structural equation modeling and full information maximum likelihood techniques. Res. Nurs. Health 2016, 39, 286–297. [Google Scholar] [CrossRef]
  22. Liu, A.; He, H.; Tu, X.M.; Tang, W. On testing proportional odds assumptions for proportional odds models. Gen. Psychiatry 2023, 36, e101048. [Google Scholar] [CrossRef]
  23. Brant, R. Assessing proportionality in the proportional odds model for ordinal logistic regression. Biometrics 1990, 46, 1171–1178. [Google Scholar] [CrossRef]
  24. McCullagh, P. Regression models for ordinal data. J. R. Stat. Soc. Ser. B (Methodol.) 1980, 42, 109–127. [Google Scholar] [CrossRef]
  25. Agresti, A. Categorical Data Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2012; Volume 792. [Google Scholar]
  26. Rodriguez, D. Assessing area under the curve as an alternative to latent growth curve modeling for repeated measures zero-inflated poisson data: A simulation study. Stats 2023, 6, 354–364. [Google Scholar] [CrossRef]
  27. Rodriguez, D. Area under the curve as an alternative to latent growth curve modeling when assessing the effects of predictor variables on repeated measures of a continuous dependent variable. Stats 2023, 6, 674–688. [Google Scholar] [CrossRef]
  28. Pruessner, J.C.; Kirschbaum, C.; Meinlschmid, G.; Hellhammer, D.H. Two formulas for computation of the area under the curve represent measures of total hormone concentration versus time-dependent change. Psychoneuroendocrinology 2003, 28, 916–931. [Google Scholar] [CrossRef] [PubMed]
  29. Cornelius, M.E. Tobacco product use among adults–United States, 2021. MMWR Morb. Mortal. Wkly. Rep. 2023, 72, 475–483. [Google Scholar] [CrossRef] [PubMed]
  30. Pierce, J.P.; Luo, M.; McMenamin, S.B.; Stone, M.D.; Leas, E.C.; Strong, D.; Shi, Y.; Kealey, S.; Benmarhnia, T.; Messer, K. Declines in cigarette smoking among US adolescents and young adults: Indications of independence from e-cigarette vaping surge. Tobacco Control 2023, tc-2022-057907. [Google Scholar] [CrossRef]
  31. Bandi, P.; Star, J.; Minihan, A.K.; Patel, M.; Nargis, N.; Jemal, A. Changes in E-Cigarette Use Among U.S. Adults, 2019–2021. Am. J. Prev. Med. 2023, 65, 322–326. [Google Scholar] [CrossRef] [PubMed]
  32. Becker, T.D.; Arnold, M.K.; Ro, V.; Martin, L.; Rice, T.R. Systematic Review of Electronic Cigarette Use (Vaping) and Mental Health Comorbidity Among Adolescents and Young Adults. Nicotine Tob. Res. 2020, 23, 415–425. [Google Scholar] [CrossRef]
  33. Romm, K.F.; Cohn, A.M.; Wang, Y.; Berg, C.J. Psychosocial predictors of trajectories of dual cigarette and e-cigarette use among young adults in the US. Addict. Behav. 2023, 141, 107658. [Google Scholar] [CrossRef] [PubMed]
  34. Moustafa, A.F.; Testa, S.; Rodriguez, D.; Pianin, S.; Audrain-McGovern, J. Adolescent depression symptoms and e-cigarette progression. Drug Alcohol Depend. 2021, 228, 109072. [Google Scholar] [CrossRef]
  35. Collins, S.; Hoare, E.; Allender, S.; Olive, L.; Leech, R.M.; Winpenny, E.M.; Jacka, F.; Lotfalian, M. A longitudinal study of lifestyle behaviours in emerging adulthood and risk for symptoms of depression, anxiety, and stress. J. Affect. Disord. 2023, 327, 244–253. [Google Scholar] [CrossRef]
  36. Rodriguez, D.; Moss, H.B.; Audrain-McGovern, J. Developmental Heterogeneity in Adolescent Depressive Symptoms: Associations with Smoking Behavior. Psychosom. Med. 2005, 67, 200–210. [Google Scholar] [CrossRef] [PubMed]
  37. Zimmermann, M.; Chong, A.K.; Vechiu, C.; Papa, A. Modifiable risk and protective factors for anxiety disorders among adults: A systematic review. Psychiatry Res. 2020, 285, 112705. [Google Scholar] [CrossRef] [PubMed]
  38. Wang, G.; Wu, L. Healthy people 2020: Social determinants of cigarette smoking and electronic cigarette smoking among youth in the United States 2010–2018. Int. J. Environ. Res. Public Health 2020, 17, 7503. [Google Scholar] [CrossRef] [PubMed]
  39. Panel Study of Income Dynamics, Public Use Dataset, University of Michigan, Ann Arbor, MI, USA. 2012. Available online: https://simba.isr.umich.edu/default.aspx (accessed on 12 July 2024).
  40. Cangür, Ş.; Ercan, I. Comparison of Model Fit Indices Used in Structural Equation Modeling under Multivariate Normality; Wayne State Univ Press: Detroit, MI, USA, 2015. [Google Scholar]
  41. MacCallum, R.C.; Browne, M.W.; Sugawara, H.M. Power analysis and determination of sample size for covariance structure modeling. Psychol. Methods 1996, 1, 130. [Google Scholar] [CrossRef]
  42. Shi, D.; Lee, T.; Maydeu-Olivares, A. Understanding the Model Size Effect on SEM Fit Indices. Educ. Psychol. Meas. 2019, 79, 310–334. [Google Scholar] [CrossRef]
  43. McNeish, D. Should we use F-tests for model fit instead of chi-square in overidentified structural equation models? Organ. Res. Methods 2020, 23, 487–510. [Google Scholar] [CrossRef]
  44. Zhang, X.; Astivia, O.L.O.; Kroc, E.; Zumbo, B.D. How to think clearly about the central limit theorem. Psychol. Methods 2023, 28, 1427. [Google Scholar] [CrossRef]
Figure 1. Standardized residual plots for the original sample with missing data (panel a) and the five imputed samples (panels bf).
Figure 1. Standardized residual plots for the original sample with missing data (panel a) and the five imputed samples (panels bf).
Stats 07 00079 g001aStats 07 00079 g001b
Figure 2. Residual plots for the original sample (panel a) and the five imputed samples from the multiple imputation (panels bf).
Figure 2. Residual plots for the original sample (panel a) and the five imputed samples from the multiple imputation (panels bf).
Stats 07 00079 g002aStats 07 00079 g002b
Table 1. Descriptive statistics.
Table 1. Descriptive statistics.
Discrete Variables
VariableLevelN%
RaceWhite109726.0
Non-White141433.5
System missing171140.5
Education≤High school105525.0
>High school147134.8
System missing169640.2
Nicotine use (2017)Does not smoke167439.6
Vapes/vaped47311.2
Smokes1854.4
Smokes and vapes/vaped1884.5
System missing170240.3
Nicotine use (2019)Does not smoke164238.9
Vapes/vaped64215.2
Smokes1222.9
Smokes and vapes/vaped1423.4
System missing167439.6
Nicotine use (2021)Does not smoke143834.1
Vapes/vaped72917.3
Smokes541.3
Smokes and vapes/vaped882.1
System missing191345.3
Continuous variables
VariableNMeanSD
Distress 201725124.914.00
System missing1710
Distress 201925196.105.35
System missing1703
Distress 202123096.725.73
System missing1913
Table 2. GEE results (n = 2510).
Table 2. GEE results (n = 2510).
VariableBSE95% CI
LowerUpperWaldp-Value
≤High school0.1390.2258−0.3040.5810.3770.539
White race0.1910.2368−0.2730.6550.6510.420
Psychological distress0.0520.01160.0290.07520.274<0.001
Time 30.7200.09320.5370.90359.705<0.001
Time 20.2930.06770.1600.42618.759<0.001
High school*Time 30.1170.1174−0.1130.3470.9940.319
High school*Time 20.0530.0868−0.1170.2230.3780.539
Race*Time 3−0.1430.1161−0.3710.0841.5190.218
Race*Time 2−0.0490.0838−0.2130.1150.3420.559
Table 3. LGCM results (n = 2511).
Table 3. LGCM results (n = 2511).
InterceptSlope
Smoking
bSEz-Valuep-ValuebSEz-Valuep-Value
Distress slope----0.0110.011.1290.259
Distress intercept----0.0370.0056.777<0.001
High school0.3640.057.363<0.001−0.0820.027−3.0630.002
White0.2510.0495.095<0.001−0.0580.027−2.1760.03
Psychological distress
bSEz-valuep-valuebSEz-valuep-value
Smoke slope--------
Smoke intercept----−0.0080.055−0.1380.89
High school0.2950.1591.8520.0640.2260.1521.4880.137
White0.1640.161.0230.3060.3790.1512.5140.012
Table 4. AUC results.
Table 4. AUC results.
Original Data (N = 1095)Multiple Imputation (n = 2511)
PredictorbSEt-Valuep-ValuebSEt-Valuep-Value
Intercept0.3350.0536.314p < 0.0010.4980.0855.8450.000
High school0.4510.0508.989p < 0.0010.5120.0697.3920.000
White0.3170.0506.358p < 0.0010.2860.0664.3090.000
AUC distress0.0330.00311.102p < 0.0010.0330.0057.1890.000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Rodriguez, D.; Verma, R.; Upchurch, J. Comparing the Relative Efficacy of Generalized Estimating Equations, Latent Growth Curve Modeling, and Area Under the Curve with a Repeated Measures Discrete Ordinal Outcome Variable. Stats 2024, 7, 1366-1378. https://doi.org/10.3390/stats7040079

AMA Style

Rodriguez D, Verma R, Upchurch J. Comparing the Relative Efficacy of Generalized Estimating Equations, Latent Growth Curve Modeling, and Area Under the Curve with a Repeated Measures Discrete Ordinal Outcome Variable. Stats. 2024; 7(4):1366-1378. https://doi.org/10.3390/stats7040079

Chicago/Turabian Style

Rodriguez, Daniel, Ryan Verma, and Juliana Upchurch. 2024. "Comparing the Relative Efficacy of Generalized Estimating Equations, Latent Growth Curve Modeling, and Area Under the Curve with a Repeated Measures Discrete Ordinal Outcome Variable" Stats 7, no. 4: 1366-1378. https://doi.org/10.3390/stats7040079

APA Style

Rodriguez, D., Verma, R., & Upchurch, J. (2024). Comparing the Relative Efficacy of Generalized Estimating Equations, Latent Growth Curve Modeling, and Area Under the Curve with a Repeated Measures Discrete Ordinal Outcome Variable. Stats, 7(4), 1366-1378. https://doi.org/10.3390/stats7040079

Article Metrics

Back to TopTop