Next Article in Journal
SNPD: Semi-Supervised Neural Process Dehazing Network with Asymmetry Pseudo Labels
Next Article in Special Issue
Asymmetry Model Based on Quasi Local Odds Symmetry for Square Contingency Tables
Previous Article in Journal
Approximate Analytic–Numeric Fuzzy Solutions of Fuzzy Fractional Equations Using a Residual Power Series Approach
Previous Article in Special Issue
Families of Generalized Quasisymmetry Models: A ϕ-Divergence Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quasi Association Models for Square Contingency Tables with Ordinal Categories

Faculty of Science and Technology, Tokyo University of Science, Chiba 278-8510, Japan
*
Author to whom correspondence should be addressed.
Symmetry 2022, 14(4), 805; https://doi.org/10.3390/sym14040805
Submission received: 25 February 2022 / Revised: 5 April 2022 / Accepted: 11 April 2022 / Published: 12 April 2022
(This article belongs to the Special Issue Advances in Quasi-Symmetry Models)

Abstract

:
The analysis of contingency tables focuses on a statistical model instead of independence when the independence between row and column variables does not hold. Many association models have been proposed to indicate the structure of odds ratios. Additionally, symmetry and asymmetry models have been proposed to analyze the cell probabilities of square contingency tables with symmetric or asymmetric structures. This paper proposes an asymmetry plus association model for square contingency tables with ordinal categories and partitioning of the test statistic for goodness-of-fit using our proposed model.

1. Introduction

A categorical variable distinguishes a set of categories. It is employed in diverse fields such as social sciences, medical sciences, engineering, and education. Here, we consider a categorical variable with r categories and another one with c categories. The outcome for two variables has r c possible combinations, which can be denoted by a rectangular table with r rows and c columns, where the cells illustrate the r c possible outcomes. This is called a contingency table (for more details, see [1,2]). A contingency table illustrates the joint frequencies by combination of two categorical variables. When analyzing a contingency table, only the observed frequencies are seen, but the true distribution is unknown. One of the aims of analyzing a contingency table is to estimate an unknown probability distribution from the observed frequencies. The confidence level of the estimated distribution is higher when fewer parameters are used to describe the data. Sometimes, we need to consider a parsimonious model. Traditionally, a contingency table is used to evaluate whether classifications are associated. That is, the analysis determines whether two variables are statistically independent.
If two variables take the same categorical values, the table is called a “square” contingency table. When the observed frequencies are concentrated in the main diagonal cells, the two variables are dependent. Even if the observations are not concentrated on the main diagonal but we have one large frequency and several small frequencies in each row and each column, then there is a strong association between the categories of a variable and those of the other, and hence a strong dependence. This is a common situation in real world data and, since the case of independence is infrequent and unrealistic, a suitable model for representing dependence data is important. Consequently, many statisticians consider various statistical models instead of an independence model and study the method of estimation and hypothesis testing based on a statistical model.When statistical independence between two variables does not hold, association models, which indicate the structure of odds ratios, have been considered to analyze contingency tables. On the other hand, symmetry or asymmetry models, which indicate the structure of ratios for cell probabilities in symmetric positions, are often used to analyze square contingency tables.
This study proposes a model with characteristics of both an association model and asymmetry model. Our model is more parsimonious than many association or asymmetry models. Hence, our model may better estimate the distribution than conventional association models and asymmetry models.
This paper is organized as follows. Section 2 introduces previous research and proposes an asymmetry plus association model. Section 3 describes the necessary and sufficient condition to use our model. Section 4 provides the methods to evaluate model-fitting based on goodness-of-fit. Section 5 concludes this paper.

2. Models

For an r × r square contingency table with ordinal categories, let π i j denote the probability that an observation will fall in the ith row and jth column of the contingency table ( i = 1 , , r ; j = 1 , , r ). Goodman [3,4,5] considered many association models in a contingency table. For example, the quasi-uniform association (QU) model is defined as
π i j = μ α i β j θ i j ( i j ) , ψ i i ( i = j ) .
Without loss of generality, we impose α r = β r = 1 . The odds ratio for rows i and j ( > i), and columns s and t ( > s) are denoted by ϕ ( i j ; s t ) . That is,
ϕ ( i j ; s t ) = π i s π j t π j s π i t .
Using the odds ratios, the QU model can be expressed as
ϕ ( i j ; s t ) = θ ( j i ) ( t s ) ( i s , i t , j s , j t ) .
The QU model with θ = 1 is the quasi-independence (QI) model (see p. 426 in Agresti [6]). That is,
π i j = μ α i β j ( i j ) , ψ i i ( i = j ) .
On the other hand, many statisticians have analyzed square contingency tables using a symmetric structure or an asymmetric structure for cell probabilities. Bowker [7] proposed the symmetry (S) model, which is defined as
π i j = ψ i j ( i = 1 , , r ; j = 1 , , r ) ,
where ψ i j = ψ j i . This model indicates the symmetric structure for cell probabilities.
Stuart [8] proposed the marginal homogeneity (MH) model, which is defined as
π i + = π + i ( i = 1 , , r ) ,
where π i + = j = 1 r π i j and π + i = j = 1 r π j i . The MH model indicates that the row marginal distribution is equivalent to the column marginal distribution.
Caussinus [9] proposed the quasi-symmetry (QS) model, which is defined as
π i j = μ α i β j ψ i j ( i = 1 , , r ; j = 1 , , r ) ,
where ψ i j = ψ j i . This model is identical to the S model when α i = β i . The QS model can be expressed as
ϕ ( i j ; s t ) = ϕ ( s t ; i j ) ( i < j ; s < t ) .
The QS model indicates the symmetric structure of the odds ratios. The QU model implies the QS model. That is, the QS model holds when the QU model holds.
When the S model does not hold, asymmetry models, with a weaker restriction than the S model, have been proposed. For example, Tahata and Tomizawa [10] proposed the kth linear asymmetry (LS k ) model, which is defined for a fixed k ( k = 1 , , r 1 ) as
π i j = μ l = 1 k α l i l β l j l ψ i j ( i = 1 , , r ; j = 1 , , r ) ,
where ψ i j = ψ j i . Note that when α l = β l , this model is the S model. As k increases, the LS k model is less restrictive, and the LS r 1 model is the QS model. Namely, the LS k model is the intermediate model between the S model and QS model. The LS k model can be expressed as
π i j π j i = l = 1 k γ l j l i l ( i j ) .
The LS k model includes the linear diagonals-parameter symmetry model [11] and the extended linear diagonals-parameter symmetry model [12].
Goodman [4] introduced the symmetry plus quasi-independence (SQI) model, which is defined as
π i j = μ α i α j ( i j ) , ψ i i ( i = j ) .
This model is a special case of the S model and the QI model when ψ i j = μ α i α j and α i = β i for i j , respectively.
Yamamoto and Tomizawa [13] proposed the symmetry plus quasi-uniform association (SQU) model, which is defined as
π i j = μ α i α j θ i j ( i j ) , ψ i i ( i = j ) .
The SQU model implies the S model and QU model. Note that the SQU model is identical to the SQI model when θ = 1 .
Association models and asymmetry models have been proposed independently. However, an asymmetry plus association model, which considers both the structure of asymmetry for cell probabilities and odds ratios, is rarely considered.
Here, we propose a new model defined for a fixed k ( k = 1 , , r 1 ) as
π i j = μ α i α j l = 1 k δ l j l i l θ i j ( i j ) , ψ i i ( i = j ) .
Without loss of generality, we set α r = 1 . This model is called the kth linear asymmetry plus quasi-uniform association (LSQU k ) model. When θ = 1 , it is called the kth linear asymmetry plus quasi-independence (LSQI k ) model.
If the LSQU k model holds, then
π i j π j i = l = 1 k δ l 2 ( j l i l ) ( i j ) .
The LS k model holds by γ l = δ l 2 in Equation (14). Additionally,
ϕ ( i j ; s t ) = θ ( j i ) ( t s ) ( i s , i t , j s , j t ) .
Therefore, the LSQU k model shows characteristics of both the LS k model and the QU model.
This model with δ l = 1 for l = 1 , , k is the SQU model. When k = r 1 , the LSQU k model implies
π i j π j i = l = 1 r 1 γ l j l γ l i l ( i j ) .
On the other hand, the QU model implies
π i j π j i = λ j λ i ( i j ) ,
where λ j = β j / α j . Setting λ j = l = 1 r 1 γ l j l provides a one-to-one relation between { λ 1 , , λ r 1 } and { γ 1 , , γ r 1 } . This means that the LSQU r 1 model is equivalent to the QU model. The LSQU k ( k < r 1 ) model is a special case of the QU model since the LSQU r 1 model with δ l = 1 for l = k + 1 , , r 1 is the LSQU k model. Hence, the LSQU k model is an intermediate model between the SQU and QU models. Similarly, the LSQI r 1 model is equivalent to the QI model. That is, the LSQI k model is an intermediate model between the SQI and QI models (For more details, see Figure 1).

3. Necessary and Sufficient Condition for the SQU Model

Caussinus [9] introduced the necessary and sufficient condition for the S model. This condition separates the S model into multiple models with a weaker restriction than the S model. Assuming that model M 1 holds if and only if both models M 2 and M 3 hold, then analyzing models M 2 and M 3 should elucidate a more detailed structure of the cell probabilities. Here, we are interested in deriving a necessary and sufficient condition for the SQU model using the LSQU k model.
Yamamoto and Tomizawa [13] provided the following necessary and sufficient condition for the SQU model.
Theorem 1.
The SQU model holds if and only if both the QU model and the MH model hold.
Let X and Y denote the row and column variables, respectively, and consider a model defined for a fixed k ( k = 1 , , r 1 ), which is given as
E ( X l ) = E ( Y l ) ( l = 1 , , k ) ,
where E ( X l ) = i j i l π i j and E ( Y l ) = i j j l π i j . This model can be referred to as the marginal kth moment equality (ME k ) model. This leads to the following theorem.
Theorem 2.
For any k ( k = 1 , , r 1 ) , the SQU model holds if and only if both the LSQU k model and the ME k model hold.
Proof. 
If the SQU model holds, the LSQU k model holds because the LSQU k model with δ l = 1 ( l = 1 , , k ) is the SQU model. Since the SQU model implies the S model, we can see that
E ( X l ) = i j i l π i j = i j i l π j i = E ( Y l ) ( l = 1 , , k ) .
The ME k model also holds. The necessity is proved.
Conversely, if both the LSQU k model and the ME k model hold, we can prove that the SQU model holds. If the LSQU k model holds, from Equation (14), we obtain
log π i j log π j i = 2 l = 1 k ( j l i l ) log δ l ( i j ) .
The ME k model can also be expressed as
i j ( j l i l ) π i j = 0 ( l = 1 , , k ) .
From the LSQU k model and the ME k model, we obtain
i j ( π i j π j i ) ( log π i j log π j i ) = 2 l = 1 log δ l i j ( j l i l ) ( π i j π j i ) = 0 .
Since the logarithmic function is strictly increasing, then for any i j
( π i j π j i ) ( log π i j log π j i ) 0 .
Equation (22) with π i j = π j i holds, that is, the S model holds. When the S model holds, the MH model holds. Additionally, the LSQU k model is a special case of the QU model. From Theorem 1, the SQU model holds. The proof is complete. □
Theorem 2 is a generalization of Yamamoto and Tomizawa’s result because the ME r 1 model is equivalent to the MH model (see [14]). This leads to the following corollary.
Corollary 1.
For any k ( k = 1 , , r 1 ) , the SQI model holds if and only if both the LSQI k model and the ME k model hold.

4. Partition of Test Statistics

Here, we describe a method to evaluate the model fitting. We consider a test of hyphothesis, where the null hypothesis is that model M holds, and the alternative hypothesis is that model M does not hold. Let n i j denote the observed frequency in the ( i , j )th cell of the table and m i j indicate the corresponding expected frequency with n = i j n i j ( i = 1 , , r ; j = 1 , , r ) . Assume that { n i j } has a multinomial distribution. Then m ^ i j denotes the maximum likelihood estimate (MLE) of m i j under a model. The likelihood ratio chi-squared statistic for the goodness-of-fit of the model M is defined as
G 2 ( M ) = 2 i = 1 r j = 1 r n i j log n i j m ^ i j .
The numbers of degrees of freedom (df) for testing the goodness-of-fit under the SQU, LSQU k , and ME k models are r 2 2 r 1 , r 2 2 r 1 k , and k, respectively. The number of df for the SQU model is equal to the sum of those for the LSQU k and ME k models.
Previous studies have discussed the separability of a model [15,16,17,18,19]. Separability means that a test statistic for the goodness-of-fit of model M 1 is asymptotically equivalent to the sum of the test statistics for model M 2 and model M 3 when model M 1 can be separated into model M 2 and model M 3 . If it holds, the incompatible situation, where both model M 2 and model M 3 are accepted but model M 1 is rejected, would not arise. This leads to the following theorem.
Theorem 3.
For any k ( k = 1 , , r 1 ) , the test statistic G 2 ( S Q U ) is asymptotically equivalent to the sum of G 2 ( L S Q U k ) and G 2 ( M E k ) .
Proof. 
For a fixed k ( k = 1 , , r 1 ), the LSQU k model can be expressed as
log π i j = μ + α i + α j + l = 1 k ( j l i l ) δ l + i j θ ( i j ) , μ + α i + α i + ψ i i ( i = j ) .
Without loss of generality, we can impose α r = 0 . Let
π = ( π 11 , , π 1 r , π 21 , , π 2 r , , π r r ) T ,
and
β = ( μ , β 1 , β 2 , β 12 ) T ,
where “T" denotes the transpose,
β 1 = ( α 1 , , α r 1 ) , β 2 = ( δ 1 , , δ k ) ,
and
β 12 = ( θ , ψ 11 , , ψ r r ) .
The LSQU k model can also be expressed as
log π = X β = ( 1 r 2 , X 1 , X 2 , X 12 ) β ,
where log π = ( log π 11 , , log π r r ) T , X is the r 2 × ( 2 r + 1 + k ) matrix, and 1 s is the s × 1 vector of the 1 element. Additionally,
X 1 = I r 1 1 r O r , r 1 + 1 r I r 1 0 r 1 T
X 2 = ( x 1 , , x k ) ,
where
x l = 1 r J r l J r l 1 r ( l = 1 , , k ) ,
and X 12 is the r 2 × ( r + 1 ) matrix determined from Equation (25). Note that O s t is the s × t zero matrix, 0 s is the s × 1 zero vector, J r l = ( 1 l , , r l ) T , and “⊗" represents the Kronecker product. The matrix X has a full column rank, which is K = 2 r + 1 + k .
We denote the linear space spanned by the columns of the matrix X by S ( X ) with dimension K. Let U be an r 2 × d 1 full column rank matrix, where d 1 = r 2 2 r 1 k , such that S ( U ) is the orthogonal complement of space S ( X ) . Hence, U T X = O d 1 , K .
Let h 1 ( π ) be a vector of functions defined by h 1 ( π ) = U T log π . Moreover, let h 2 ( π ) be a vector of functions defined by h 2 ( π ) = X 2 T π , and note that X 2 T U = O d 2 , d 1 where d 2 = k because X 2 belongs to space S ( X ) .
From Equation (25), the LSQU k model is equivalent to the hypothesis h 1 ( π ) = 0 d 1 . Additionally, the ME k model is equivalent to the hypothesis h 2 ( π ) = 0 d 2 . From Theorem 2, the SQU model is equivalent to the hypothesis h 3 ( π ) = 0 d 3 where h 3 = ( h 1 T , h 2 T ) T and d 3 = d 1 + d 2 = r 2 2 r 1 .
We derive the Wald statistic for the SQU model in an analogous mannar to Bhapkar [20]. Let H s ( s = 1 , 2 , 3 ) denote the d s × r 2 matrix of partial derivatives of h s ( π ) with respect to π . Namely, H s ( π ) = h s ( π ) / π T . Let Σ ( π ) = d i a g ( π ) π π T , where d i a g ( π ) denotes a diagonal matrix with the ith component of π as the ith diagonal component. Additionally, let p i j denote a sample proportion of the ( i , j ) cell. That is, p i j = n i j / n , and p = ( p 11 , , p 1 r , p 21 , , p 2 r , , p r r ) T . The central limit theorem indicates that n ( p π ) has an asymptotic normal distribution with mean 0 r 2 and covariance matrix Σ ( π ) . Using the delta method, n ( h 3 ( p ) h 3 ( π ) ) has an asymptotic normal distribution with mean 0 d 3 and covariance matrix
H 3 ( π ) Σ ( π ) H 3 T ( π ) = H 1 ( π ) Σ ( π ) H 1 T ( π ) H 1 ( π ) Σ ( π ) H 2 T ( π ) H 2 ( π ) Σ ( π ) H 1 T ( π ) H 2 ( π ) Σ ( π ) H 2 T ( π ) .
Since H 1 ( π ) π = U T 1 r 2 = 0 d 1 , H 1 ( π ) d i a g ( π ) = U T , and H 2 ( π ) = X 2 T , we obtain
H 1 ( π ) Σ ( π ) H 2 T ( π ) = U T X 2 = O d 1 , d 2 .
Under each hypothesis, h s ( π ) = 0 d s ( s = 1 , 2 , 3 ) , we see
W 3 = W 1 + W 2 ,
where
W s = n h s ( p ) T ( H s ( p ) Σ ( p ) H s T ( p ) ) 1 h s ( p ) .
The Wald statistic W s has an asymptotic chi-squared distribution with d s df. That is, (i) W 1 is the Wald statistic for the LSQU k model, (ii) W 2 is that for the ME k model, and (iii) W 3 is that for the SQU model. The proof is completed using the asymptotic equivalence of the Wald statistic and the likelihood ratio statistic as proved by Rao [21]. □
Theorem 3 is also a generalization of Yamamoto and Tomizawa’s result since this theorem is identical to Yamamoto and Tomizawa’s result when k = r 1 . Moreover, we obtain the following corollary.
Corollary 2.
For any k ( k = 1 , , r 1 ) , the test statistic G 2 ( S Q I ) is asymptotically equivalent to the sum of G 2 ( L S Q I k ) and G 2 ( M E k ) .

5. Example

Table 1 shows the data cited by [22]. This data described 59 matched pairs using 4 dose levels of conjugated estrogen. The models described herein are used to analyze this data. Table 2 shows the value of G 2 ( M ) for each model applied to the data in Table 1. That is, for model M, the null hypothesis is that model M holds, and the alternative hypothesis is that model M does not hold. From Table 2, the SQI, SQU, S, and ME k models do not fit well, and the LSQI k , LSQU k , and LS k models are accepted at the 0.05 significant level ( k = 1 , 2 , 3 ) . We choose the most appropriate model in these models. If model M 1 is a special case of model M 2 , a test based on the difference between the likelihood ratio chi-squared statistic can compare the model fitting of two nested models. Let d 1 and d 2 denote the degrees of freedom for the models M 1 and M 2 , respectively. Assuming that model M 2 holds, a likelihood ratio chi-squared statistic under model M 1 is given as G 2 ( M 1 | M 2 ) = G 2 ( M 1 ) G 2 ( M 2 ) . This statistic is an asymptotically chi-squared distribution with d 1 d 2 degrees of freedom. When we use it at the 0.05 significant level, the LSQI 1 model is the most appropriate model.
Table 3 shows the estimated expected frequencies from the LSQI 1 model for the data in Table 1. The value of maximum likelihood estimator of δ 1 for the LSQI 1 model is 0.71. We estimate the ratio between two probabilities as π ^ i j / π ^ j i = 0 . 71 2 ( j i ) for i < j . Therefore, the probability distribution for the average dose for a case tends to be stochastically higher than the probability distribution for the average dose for control because δ ^ 1 < 1 .
Finally, we are interested in inferring the reason for the poor fit of the SQI model. According to Corollary 1, the SQI model is separated into the LSQI 1 model and the ME 1 model. Since the LSQI 1 model fits very well, but the ME 1 model fits very poorly, we deduce that the lack of structure of the ME 1 model is responsible for the poor fit of the SQI model.

6. Conclusions

Herein we describe an asymmetry plus association model. This model indicates the asymmetry structures for cell probabilities between symmetric position and odds ratios. Our model is an intermediate model between the SQU model and the QU model. If the QU (LSQU r 1 ) model holds but the SQU model does not, the LSQU k model for k < r 1 may hold. In this case, the QU model may be overfitting. That is, our model may realize a better fit than the QU model under these conditions. In practice, the LSQI 1 model fits well when the SQU model fits poorly and the QU model fits for the data in Table 1. Additionally, a theorem with respect to the necessary and sufficient condition for the SQU model is represented using our model. Using this theorem, we show the asymptotic separability for the SQU model. Namely, the likelihood ratio chi-squared statistic for the SQU model is equivalent to the sum of those for the separated models, which helps deduce the reason that the SQU model does not hold.

Author Contributions

Conceptualization, K.T.; methodology, K.F.; formal analysis, K.F; writing—–original draft preparation, K.F.; writing—review and editing, K.F. and K.T.; project administration, K.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We would like to thank the three anonymous referees for their helpful comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
QUQuasi-uniform association
QIQuasi-independence
SSymmetry
MHMarginal homogeneity
QSQuasi-symmetry
LS k kth linear asymmetry
SQISymmetry plus quasi-independence
SQUSymmetry plus quasi-uniform association
LSQU k kth linear asymmetry plus quasi-uniform association
LSQI k kth linear asymmetry plus quasi-independence
ME k Marginal kth moment equality
dfDegrees of freedom

References

  1. Bishop, Y.M.M.; Fienberg, S.E.; Holland, P.W. Discrete Multivariate Analysis: Theory and Practice; The MIT Press: Cambridge, MA, USA, 1975. [Google Scholar]
  2. Kateri, M. Contingency Table Analysis: Methods and Implementation Using R; Birkhäuser (Springer): New York, NY, USA, 2014. [Google Scholar]
  3. Goodman, L.A. Simple models for the analysis of association in cross-classifications having ordered categories. J. Am. Stat. Assoc. 1979, 74, 537–552. [Google Scholar] [CrossRef]
  4. Goodman, L.A. The analysis of cross-classified data having ordered and/or unordered categories: Association models, correlation models, and asymmetry models for contingency tables with or without missing entries. Ann. Stat. 1985, 13, 10–69. [Google Scholar] [CrossRef]
  5. Goodman, L.A. Some useful extensions of the usual correspondence analysis approach and the usual log-linear models approach in the analysis of contingency tables. Int. Stat. Rev. 1986, 54, 243–309. [Google Scholar] [CrossRef]
  6. Agresti, A. Categorical Data Analysis, 2nd ed.; Wiley: New York, NY, USA, 2002. [Google Scholar]
  7. Bowker, A.H. A test for symmetry in contingency tables. J. Am. Stat. Assoc. 1948, 43, 572–574. [Google Scholar] [CrossRef] [PubMed]
  8. Stuart, A. A test for homogeneity of the marginal distributions in a two-way classification. Biometrika 1955, 42, 412–416. [Google Scholar] [CrossRef]
  9. Caussinus, H. Contribution à l’analyse statistique des tableaux de corrélation. Ann. Fac. Des. Sci. L’Univ. Toulouse 1965, 29, 77–182. [Google Scholar] [CrossRef]
  10. Tahata, K.; Tomizawa, S. Generalized linear asymmetry model and decomposition of symmetry for multiway contingency tables. J. Biom. Biostat. 2011, 2, 1–6. [Google Scholar] [CrossRef] [Green Version]
  11. Agresti, A. A simple diagonals-parameter symmetry and quasi-symmetry model. Stat. Probab. Lett. 1983, 1, 313–316. [Google Scholar] [CrossRef]
  12. Tomizawa, S. An extended linear diagonals-parameter symmetry model for square contingency tables with ordinal categories. Metron 1991, 49, 401–409. [Google Scholar]
  13. Yamamoto, K.; Tomizawa, S. Symmetry plus quasi uniform association model and its orthogonal decomposition for square contingency tables. J. Mod. Appl. Stat. Methods 2010, 9, 255–262. [Google Scholar] [CrossRef] [Green Version]
  14. Tahata, K.; Tomizawa, S. Generalized marginal homogeneity model and its relation to marginal equimoments for square contingency tables with ordered categories. Adv. Data Anal. Classif. 2008, 2, 295–311. [Google Scholar] [CrossRef]
  15. Aitchison, J. Large-sample restricted parametric tests. J. R. Stat. Soc. Ser. B 1962, 24, 234–250. [Google Scholar] [CrossRef]
  16. Darroch, J.N.; Silvey, S.D. On testing more than one hypothesis. Ann. Math. Stat. 1963, 34, 555–567. [Google Scholar] [CrossRef]
  17. Lang, J.B.; Agresti, A. Simultaneously modeling joint and marginal distributions of multivariate categorical responses. J. Am. Stat. Assoc. 1994, 89, 625–632. [Google Scholar] [CrossRef]
  18. Lang, J.B. On the partitioning of goodness-of-fit statistics for multivariate categorical response models. J. Am. Stat. Assoc. 1996, 91, 1017–1023. [Google Scholar] [CrossRef]
  19. Tomizawa, S.; Tahata, K. The analysis of symmetry and asymmetry: Orthogonality of decomposition of symmetry into quasi-symmetry and marginal symmetry for multi-way tables. J. Soc. Fr. Stat. 2007, 148, 3–36. [Google Scholar]
  20. Bhapkar, V.P. A note on the equivalence of two test criteria for hypotheses in categorical data. J. Am. Stat. Assoc. 1966, 61, 228–235. [Google Scholar] [CrossRef]
  21. Rao, C.R. Linear Statistical Inference and Its Applications, 2nd ed.; Wiley: New York, NY, USA, 1973. [Google Scholar]
  22. Breslow, N.E.; Day, N.E. Statistical Methods in Cancer Research, Vol. I: The Analysis of Case-Control Studies; International Agency for Research on Cancer: Lyon, France, 1980. [Google Scholar]
Figure 1. Relationships among the models (A → B indicates that model A is a special case of model B).
Figure 1. Relationships among the models (A → B indicates that model A is a special case of model B).
Symmetry 14 00805 g001
Table 1. Average doses of conjugated estrogen used by cases and matched controls: Los Angeles endometrial cancer study [22].
Table 1. Average doses of conjugated estrogen used by cases and matched controls: Los Angeles endometrial cancer study [22].
Average Dose for Case (mg/day)Average Dose for Control (mg/day)
0 (1)0.1–0.299 (2)0.3–0.625 (3)0.625+ (4)Total
0 (1)623112
0.1–0.299 (2)942116
0.3–0.625 (3)923115
0.625+ (4)1212116
Total36910459
Table 2. The values of likelihood ratio chi-squared statistics for models applied to Table 1.
Table 2. The values of likelihood ratio chi-squared statistics for models applied to Table 1.
Modeldf G 2 ( M )
SQI819.98 *
SQU719.86 *
LSQI 1 73.62
LSQI 2 62.98
LSQI 3 (QI)50.77
LSQU 1 63.61
LSQU 2 52.98
LSQU 3 (QU)40.69
S619.27 *
LS 1 52.97
LS 2 42.33
LS 3 (QS)30.46
ME 1 116.43 *
ME 2 217.08 *
ME 3 (MH)319.12 *
Note * Significant at the 0.05 level.
Table 3. Estimated expected frequencies from the LSQI 1 model.
Table 3. Estimated expected frequencies from the LSQI 1 model.
Average Dose for Case (mg/day)Average Dose for Control (mg/day)
0 (1)0.1–0.299 (2)0.3–0.625 (3)0.625+ (4)
0 (1)63.582.641.42
0.1–0.299 (2)7.0741.130.61
0.3–0.625 (3)10.342.2430.89
0.625+ (4)10.962.371.761
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fujisawa, K.; Tahata, K. Quasi Association Models for Square Contingency Tables with Ordinal Categories. Symmetry 2022, 14, 805. https://doi.org/10.3390/sym14040805

AMA Style

Fujisawa K, Tahata K. Quasi Association Models for Square Contingency Tables with Ordinal Categories. Symmetry. 2022; 14(4):805. https://doi.org/10.3390/sym14040805

Chicago/Turabian Style

Fujisawa, Kengo, and Kouji Tahata. 2022. "Quasi Association Models for Square Contingency Tables with Ordinal Categories" Symmetry 14, no. 4: 805. https://doi.org/10.3390/sym14040805

APA Style

Fujisawa, K., & Tahata, K. (2022). Quasi Association Models for Square Contingency Tables with Ordinal Categories. Symmetry, 14(4), 805. https://doi.org/10.3390/sym14040805

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop