Next Article in Journal
Generalized Viscosity Implicit Iterative Process for Asymptotically Non-Expansive Mappings in Banach Spaces
Next Article in Special Issue
A Model for Predicting Statement Mutation Scores
Previous Article in Journal
Solving the Systems of Equations of Lane-Emden Type by Differential Transform Method Coupled with Adomian Polynomials
Previous Article in Special Issue
The Modified Beta Gompertz Distribution: Theory and Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry

by
Miltiadis S. Chalikias
Department of Accounting and Finance, School of Business, Economics and Social Sciences, University of West Attica, 12244 Egaleo, Greece
Mathematics 2019, 7(4), 378; https://doi.org/10.3390/math7040378
Submission received: 18 February 2019 / Revised: 9 April 2019 / Accepted: 13 April 2019 / Published: 25 April 2019
(This article belongs to the Special Issue Applied and Computational Statistics)

Abstract

:
In this paper, we construct optimal repeated measurement designs of two treatments for estimating direct effects, and we examine the case of compound symmetry dependency. We present the model and the design that minimizes the variance of the estimated difference of the two treatments. The optimal designs with dependent observations in a compound symmetry model are the same as in the case of independent observations.

1. Introduction

In repeated measurement designs, a sequence of treatments is applied to each experimental unit (e.u.). In particular, one treatment is applied in each period. For example, for two treatments, A and B, and three periods, a possible sequence is ABA, which means that the treatments A, B, and A are respectively applied at the beginning of each of the three periods. The direct effect of a treatment is the effect of the treatment which is applied in the period that is examined. The residual effect is the effect of the treatment which is applied in the period preceding the period that is examined. In the case of two treatments, A and B, the direct τ A and τ B can be estimated. In every period, a treatment is applied, so either τ A or τ B is estimable. In this paper, the parameter of interest is the difference of direct effects τ = τ A τ B .
Most researchers who have investigated repeated measurement designs, such as [1,2,3,4,5,6], have been occupied with universally optimal designs where the observations are independent. However, researchers have also shown interest in designs with dependent observations, as in the cases of [7,8,9,10,11].
The model we use in this paper, and which is presented below, was first introduced by Hedayat and Afsarinejad [12,13]. In previous research [14,15] using this model, the author of this article studied two treatment designs under the assumption that consecutive observations were independent. Building on that previous work, in the present article the author examines the case of compound symmetry dependency. The aim is to find a design that corresponds to a minimum variance estimator.

2. The Model

A compound symmetry model has the following characteristics:
(i)
For each sequence, the variance matrix is of the form Σ m = a I m + b J m , where I m is the unit m × m matrix, and J m is the m × m matrix where all elements are equal to 1 (m is the number of periods).
(ii)
The observations corresponding to different treatment sequences (different e.u.) are independent, and the number of sequences is 2 m .
The goal is to find the design that corresponds to the minimum variance estimator. I show that, in this case, the optimal design regarding the direct effect is the same as in the model of independent observations, and only the variance of the estimator is different.
The model is [12]:
y i j k = μ + τ + π j + δ i , j 1 + γ i + ζ k + e i j k
  • j corresponds to the j-th period, j = 1, 2, …, m;
  • i corresponds to the i-th sequence, i = 0 , 1 , 2 m 1 ;
  • k corresponds to the unit k = 1, 2, …, n;
  • τ A , τ B : are direct effects of treatments A and B;
  • π j : is the effect of the j-th period;
  • δ A , δ B : are the residual effects of A and B;
  • γ i : is the effect of the i-th sequence; and
  • ζ k : is the effect of the k-th e.u. (subject effect), which is a random variable, independent of the error e i j k .
The errors e i j k are assumed to be independent. However, the quantities ζ k + e i j k are independent only between sequences and not within sequences.
The overparameterized model vector form of the above model is written as:
Y = τ A τ A + τ B τ B + δ A δ A + δ B δ B + π 1 π 1 + + π m π m + + γ 0 γ 0 + + γ q γ q + e
where q = 2 m 1 and Y , τ A , τ B , δ A , δ B , π 1 , π m , γ 0 , γ q , e are 1 × m n vectors; the direct effect vector is 1 if the treatment is A, and zero if it is B. For example, for the sequence ABB…, τ A = [ 1 0 0 ] and, in the same way, τ B ,   δ A ,   δ B π ι   and   γ ι are defined so that τ A + τ B = 1 m n , δ A + δ B + π 1 = 1 m n , and π 1 + π 2 + + π m = 1 m n . Also, 1 when the ith unit is employed, and 0 elsewhere, so γ 0 + γ 1 + γ 2 + + γ 2 m 1 = 1 m n . So, in equation (2) there are linearly dependent vectors.
Keeping only the linear independent vector [16], the model (2) is transformed to
E ( Y ) = τ ( τ A τ B ) + δ ( δ A δ B ) + π 1 π 1 + + π m 1 π m 1 + + γ 0 γ 0 + + γ q 1 γ q 1
where q = 2 m 1 . In a vector form:
Y = X b + e Y = ( X 1 X 2 ) ( b 1 b 2 )
where Y is (mn) × 1, the design matrix X is (mn) × s, b is s × 1, e is (mn) × 1, and s is the number of unknown parameters. If we are interested only in some and not in all of the parameters, then we write b = ( b 1 b 2 ) , where b 1 is the r parameters of interest, and b 2 is the s-r remaining parameters.
We assume only one parameter of interest for the difference of the direct effects, τ = τ A τ B , which can be considered as the direct effect of A in the case of τ B = 0 . In order to guarantee the estimability of the model, we postulate the restrictions τ B = 0 , π m = 0 , γ 2 m 1 = 0 .
The matrix X 1 corresponds to the coefficients of τ, and the matrix X 2 corresponds to the coefficients of the rest of the non-random variables. Let us assume V = X 2 ( X 2 T Σ 1 X 2 ) 1 X 2 T is a ( m n ) × ( m n ) matrix, P the projection matrix of X 2 , P = X 2 ( X 2 T X 2 ) 1 X 2 T and Σ are the ( m n ) × ( m n ) variance matrix of the observations.
From the ordinal least-squares equations, we derive the following relation for the estimation of the main effect τ:
( X 1 T Σ 1 X 1 X 1 T Σ 1 V Σ 1 X 1 ) τ ^ = X 1 T Σ 1 ( I P Σ 1 ) Y
We also have
var ( τ ) = σ 2 ( X 1 T Σ 1 X 1 X 1 T Σ 1 V Σ 1 X 1 ) 1 = σ 2 Q 1

3. The Case of Compound Symmetry

The observations are dependent within sequences with variance matrix Σ m . The observations from different sequences are independent, therefore:
Σ = [ Σ m 0 0 0 Σ m 0 0 0 Σ m ]   and   V = [ V m 0 0 0 0 V m 1 0 0 0 V m q ]
where q = 2 m 1 and V m j = X 2 j ( X 2 j T Σ m 1 X 2 j ) 1 X 2 j T .
In order to obtain a sequence enumeration, the binary enumeration system was used, with 0 corresponding to A, and 1 to B. Thus, we obtained the enumerations 0 , 1 , , 2 m 1 . For example, if we have five periods and the sequence BABBA, then this is the 13th sequence, since B A B B A 1 2 0 + 0 2 0 + 1 2 2 + 1 2 3 + 0 2 4 = 13 . For two periods, we have four sequences, that is, A A 0 , B A 1 , A B 2 , B B 3 . For three periods (two treatments) we have eight sequences:
ABABABAB
AABBAABB
AAAABBBB
u0u1u2u3u4u5u6u7
where u i i = 0, 2, 3, 4, 5, 6, 7 is the number of units that received the i-th sequence of treatments. The sequences that we obtain by substituting A for B and vice versa are called dual or reversal designs. Observe that for these sequences, we obtain the enumeration 7 − i, i = 0, 1, 2, 3.
Proposition 1.
For a repeated measurement design with m periods, n experimental units, and a variance matrix Σ that consists of n diagonal block matrices of the form Σ m = a I m + b J m ,
( X 1 T Σ 1 X 1 X 1 T Σ 1 V Σ 1 X 1 ) = 1 a ( X 1 T X 1 X 1 T P X 1 )
where P = X 2 ( X 2 T X 2 ) 1 X 2 T .
Proof. 
Let X ˜ 1 = Σ 1 / 2 X 1 , X ˜ 2 = Σ 1 / 2 X 2 , and Y ˜ = Σ 1 / 2 Y . Then
( X 1 T Σ 1 X 1 X 1 T Σ 1 V Σ 1 X 1 ) = X 1 T X 1 X ˜ 1 T P X ˜ 1
where P ˜ = X ˜ 2 ( X ˜ 2 T X ˜ 2 ) 1 X ˜ 2 T . In other words, P ˜ is the matrix of the orthogonal projection to R ( X ˜ 2 ) . □
X 1 j   j = 0 , 1 , 2 m is the m × 1 matrix of τ in the j-th sequence, and X 2 j   j = 0 , 1 , 2 m is the m x ( m + 2 m ) matrix of the parameters μ , π 1 , π 2 , π m 1 , δ A , δ B , γ 1 , γ 2 , , γ q , where q = 2 m 1 .
For example, for three periods (m = 3), we have the matrices:
X 1 = [ X 10 } u 0 X 11 } u 1 X 17 } u 7 ]   and   X 2 = [ X 20 X 21 X 27 ]
For the linear space R ( X ˜ 2 j ) and for any sequence (m observations) R ( X ˜ 2 j ) = R ( X 2 j ) , we observe the following:
(i) The matrix ( a I m + b J m ) is positive definite, so the matrix ( a I m + b J m ) 1 2 is also positive definite, and we conclude that:
( a I m + b J m ) 1 2 = 1 a ( I m b a + b m J m ) ( a I m + b J m ) 1 2 = 1 a ( I m δ J m )
where δ = a a + b m m , 1 δ m > 0 , and we have X ˜ 2 j = 1 α ( I m δ J m ) 1 X 2 j .
(ii) The coefficients of the general mean are 1, so 1 m R ( X 2 j ) and.
1 a ( I m δ J m ) 1 m = 1 a + b m 1 m 1 m R ( X ˜ 2 j )
(iii) If z is another column vector, and z R ( X 2 j ) , then
1 a ( I m δ J m ) z = 1 a ( z δ ( 1 m T z ) 1 m ) z R ( X ˜ 2 j ) R ( X ˜ 2 j ) = R ( X 2 j )
(iv) If P ˜ m is the matrix of the orthogonal projection to the linear space R ( X ˜ 2 j ) , then P ˜ m j = P m j , where P m j = X 2 j ( X 2 j T X 2 j ) 1 X 2 j T is the matrix of the orthogonal projection to R ( X 2 j ) and P m j 1 m = 1 m P m j J m = J m . From the above, we conclude that:
( X ˜ 1 j T X ˜ 1 j X ˜ 1 j T P ˜ m j X ˜ 1 j ) = X ˜ 1 j T X ˜ 1 j X ˜ 1 j T P ˜ m j X ˜ 1 j = 1 a ( X 1 j T X 1 j X 1 j T P m j X 1 j ) ( I m P ˜ m j ) X ˜ 1 j = 1 a ( I m P m j ) ( I m δ J m ) X 1 j = 1 a ( I m P m j ) X 1 j ( X ˜ 1 T X ˜ 1 X ˜ 1 T P ˜ X ˜ 1 ) = j = 0 q ( X ˜ 1 j T X ˜ 1 j X ˜ 1 j T P ˜ m j X ˜ 1 j ) = 1 a ( X 1 T X 1 X 1 T P X 1 )
Corollary 1.
The designs that result in the estimators with the minimum variance, i.e., min var ( τ ^ ) are exactly the optimal designs of the model with independent observations. In this case, the variance var ( τ ^ ) is multiplied by α:
var ( τ ) = σ 2 ( X 1 T X 1 X 1 T P X 1 ) 1 = σ 2 a ( Q ) 1
σ 2 ( Q ) 1 is the variance of the optimal designs in the model with independent observations).
Proof. 
From the previous proof, we conclude that the variance of the estimator of the direct effect, which is given by Formula (3), equals to
var ( τ ) = σ 2 a ( Q ) 1
Comments: (1) If we consider that an observation can influence another observation, the e.u are correlated, and the correlation is given by ρ, −1 < ρ < 1. Dependent observations are often considered observations of the same cluster [17]. A simple example of dependency appears when children of the same mother are included in a sample. Due to their common household environment and genes, it is expected that these children have a greater chance of having the same characteristics.
(2) In the case of compound symmetry, the variance matrix of each sequence observations is Σ m = ( 1 ρ ) I m + ρ J m , so α = 1 − ρ, and b = ρ. In order for the matrix to be positive definite, the condition 1 m 1 < ρ < 1 is necessary. If ρ = 0, then we obtain the model with independent observations and α = 1.
(3) The variance of the estimator of the direct effect, var ( τ ^ ) , decreases when the correlation coefficient ρ increases and it approaches 0, when ρ approaches 1, since α = 1 − ρ.
(4) For two periods with dependent observations, the 2 × 2 variance matrix of the observations in the compound symmetry model is Σ 2 = ( 1 ρ ) I 2 + ρ J 2 . The optimal design for this model is the same as the optimal design for independent observations for every ρ, −1 < ρ < 1.
For an even n, such an optimal design is obtained when to the sequences AA and AB correspond to n/2 e.u, while for an odd n, the optimal design is obtained when to the sequences AA and AB correspond to (n − 1)/2 and (n + 1)/2 e.u., respectively [11]. The reverse sequences BB, BA also correspond to an optimal design with: var ( τ ) = σ 2 ( 1 ρ ) ( Q ) 1
(5) As illustrated, the examined model with dependent observations is also associated with variance matrices Σ for which the optimal designs are the same as the ones of the model with independent observations [14,18].

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Carriere, K.C.; Reinsel, G.C. Optimal two period repeated measurement designs with two or more treatments. Biometrika 1993, 80, 924–929. [Google Scholar] [CrossRef]
  2. Chalikias, M.; Kounias, S. Extension and necessity of Cheng and Wu conditions. J. Stat. Plan. Infer. 2012, 142, 1794–1800. [Google Scholar] [CrossRef]
  3. Cheng, C.S.; Wu, C.F. Balanced repeated measurements designs. Ann. Stat. 1980, 11, 29–50. [Google Scholar] [CrossRef]
  4. Hedayat, A.S.; Yang, M. Universal Optimality of Selected Crossover Designs. J. Am. Stat. Assoc. 2004, 99, 461–466. [Google Scholar] [CrossRef]
  5. Hedayat, A.S.; Zheng, W. Optimal and efficient crossover designs for test-control study when subject effects are random. J. Am. Stat. Assoc. 2010, 105, 1581–1592. [Google Scholar] [CrossRef]
  6. Stufken, J. Some families of optimal and efficient repeated measurements designs. J. Stat. Plan. Infer. 1991, 27, 75–83. [Google Scholar] [CrossRef]
  7. Hedayat, A.S.; Yan, Z. Crossover designs based on type I orthogonal arrays for a self and simple mixed carryover effects model with correlated errors. J. Stat. Plan. Infer. 2008, 138, 2201–2213. [Google Scholar] [CrossRef]
  8. Kounias, S.; Chalikias, M.S. An algorithm applied to designs of repeated measurements. J. Appl. Stat. Sci. 2005, 14, 243–250. [Google Scholar]
  9. Kushner, H.B. Allocation rules for adaptive repeated measurements designs. J. Stat. Plan. Infer. 2003, 113, 293–313. [Google Scholar] [CrossRef]
  10. Laska, E.M.; Meisner, M. A variational approach to optimal two treatment crossover designs: Application to carryover effect models. J. Am. Stat. Assoc. 1985, 80, 704–710. [Google Scholar] [CrossRef]
  11. Matthews, J.N.S. Optimal crossover designs for the comparison of two treatments in the presence of carryover effects and autocorrelated errors. Biometrika 1987, 74, 311–320. [Google Scholar] [CrossRef]
  12. Hedayat, A.; Afsarinejad, K. Repeated measurements designs, I. Survey Stat. Des. Linear Models 1975, 229–242. Available online: http://ani.stat.fsu.edu/techreports/M261.pdf (accessed on 23 April 2019).
  13. Hedayat, A.S.; Afsarinejad, K. Repeated measurements designs II. Ann. Stat. 1978, 18, 1805–1816. [Google Scholar] [CrossRef]
  14. Kounias, S.; Chalikias, M. Optimal and Universally Optimal Two Treatment Repeated Measurement Designs; Vonta, F., Nikulin, M., Eds.; Statistics for industry and technology Birkhauser: Boston, MA, USA; Basel, Switzerland; Berlin, Germany, 2008; pp. 465–477. [Google Scholar]
  15. Kounias, S.; Chalikias, M.S. Optimal two treatment repeated measurement designs with treatment-period interaction in the model. Util. Math. 2015, 96, 243–261. [Google Scholar]
  16. Kounias, S.; Chalikias, M. Estimability of Parameters in a Linear Model. Stat. Probab. Lett. 2008, 28, 2437–2439. [Google Scholar] [CrossRef]
  17. Liang, K.Y.; Zeger, S.L. Regression analysis for correlated data. Annu. Rev. Pub. Health 1993, 14, 43–68. [Google Scholar] [CrossRef] [PubMed]
  18. Chalikias, M.; Kounias, S. Optimal two Treatment Repeated Measurement Designs for three Periods. Commun. Stat. Theory Methods 2017, 46, 200–209. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Chalikias, M.S. Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry. Mathematics 2019, 7, 378. https://doi.org/10.3390/math7040378

AMA Style

Chalikias MS. Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry. Mathematics. 2019; 7(4):378. https://doi.org/10.3390/math7040378

Chicago/Turabian Style

Chalikias, Miltiadis S. 2019. "Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry" Mathematics 7, no. 4: 378. https://doi.org/10.3390/math7040378

APA Style

Chalikias, M. S. (2019). Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry. Mathematics, 7(4), 378. https://doi.org/10.3390/math7040378

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop