Next Article in Journal
On Centralizers of Idempotents with Restricted Range
Next Article in Special Issue
Partially Nonclassical Method and Conformal Invariance in the Context of the Lie Group Method
Previous Article in Journal
Four-Point Bending of Basic Rails: Theory and Experimental Verification
Previous Article in Special Issue
Symmetric U-Net Model Tuned by FOX Metaheuristic Algorithm for Global Prediction of High Aerosol Concentrations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Diagonals–Parameter Symmetry Model and Its Property for Square Contingency Tables with Ordinal Categories

Department of Information Sciences, Tokyo University of Science, Chiba 278-8510, Japan
*
Author to whom correspondence should be addressed.
Symmetry 2024, 16(6), 768; https://doi.org/10.3390/sym16060768
Submission received: 2 May 2024 / Revised: 11 June 2024 / Accepted: 14 June 2024 / Published: 19 June 2024
(This article belongs to the Special Issue Symmetry in Mathematical Models)

Abstract

:
The diagonals–parameter symmetry (DPS) model is a proposed method for analyzing square contingency tables with ordinal categories. Previously, it was stated that the generalized DPS (DPS[f]) model was equivalent to the DPS model for any function f, but the proof was not provided. This paper presents the derivation of the DPS[f] model and the proof of the relationship between the two models. The findings offer various interpretations of the DPS model. Additionally, a new model is considered, and it is shown that the proposed model and the DPS[f] model are separable.

1. Introduction

A contingency table with identical categories for rows and columns can be produced when a categorical variable is repeatedly measured. Observations in this type of table tend to concentrate on the cells along the main diagonal. Our research focuses on applying symmetry instead of assuming independence between row and column categories. Several studies have addressed symmetry issues, such as [1,2,3,4,5,6,7,8,9].
Let X and Y represent the row and column variables for an r × r contingency table with ordinal categories. Additionally, let π i j represent the probability of an observation falling into the ( i , j ) th cell, where i = 1 , , r and j = 1 , , r . The diagonals–parameter symmetry (DPS) model proposed by Goodman [10] is defined as follows.
π i j = d k ψ i j ( i < j ) , ψ i j ( i j ) ,
where ψ i j = ψ j i and k = j i . The parameter d k in the DPS model represents the odds of an observation falling into cells ( i , j ) where j i = k and i < j , rather than cells ( j , i ) for k = 1 , , r 1 . Moreover, the ratio between π i j and π j i can be expressed as the constant d k for j i = k and i < j . This ratio depends solely on the distance from the main diagonal cells.
When d 1 = d 2 = = d r 1 = 1 in Equation (1), the DPS model reduces to the symmetry (S) model proposed by Bowker [1]. When d k is independent of i and j in Equation (1), with d 1 = = d r 1 , the DPS model reduces to the conditional symmetry (CS) model proposed by McCullagh [11].
Using the f-divergence, Kateri and Papaioannou [2] proposed the generalized DPS (DPS[f]) model, defined as
π i j = π i j S F 1 ( Δ k + ζ i j ) ( i = 1 , , r ; j = 1 , , r ) ,
where k = i j , π i j S = ( π i j + π j i ) / 2 , ζ i j = ζ j i and Δ k + Δ k = 0 . It should be noted that the function f is twice-differentiable and strictly convex. Additionally, F ( t ) = f ( t ) , f ( 1 ) = 0 , f ( 0 ) = lim t 0 f ( t ) , 0 · f ( 0 / 0 ) = 0 , and 0 · f ( a / 0 ) = a lim t [ f ( t ) / t ] . The model derivation is not included in their paper. They did mention that the DPS model is the closest to symmetry regarding the Kullback–Leibler distance under some conditions and that the DPS[f] model is equivalent to the DPS model. In this study, we will derive the DPS[f] model and provide proof of the relation between the two models. We can obtain various interpretations of the DPS model from the result. We discuss the necessary and sufficient condition for the S model and the property between test statistics for goodness of fit.
The paper is organized as follows: Section 2 derives Equation (2) and interprets the model from an information theory viewpoint. The proof is given that the DPS[f] model is equivalent to the DPS model regardless of the function f. Section 3 considers a new model and proves that the proposed model and the DPS[f] model are separable. A numerical example is provided in Section 4. Finally, Section 5 summarizes the paper.

2. Properties of the DPS[f] Model

Kateri and Papaioannou [2] noted that the DPS[f] model is the closest model to the S model in terms of the f-divergence under the conditions where j i = k π i j (and i j = k π i j ) for k= 1 , , r 1 and the sums π i j + π j i for i , j = 1 , , r are given. Similar research has been conducted in, for example, Ireland et al. [12], Kateri and Agresti [3], and Tahata [5]. This section derives the DPS[f] model and describes its properties.
We can obtain the following theorem, although the proof of Theorem 1 is given in Appendix A.1.
Theorem 1. 
In the class of models with given i j = k π i j , k 0 , and π i j + π j i ( i = 1 , , r ; j = 1 , , r ) , the model
π i j = π i j S F 1 ( Δ k + ζ i j ) ( i = 1 , , r ; j = 1 , , r )
with k = i j , ζ i j = ζ j i and Δ k + Δ k = 0 , is the model closest to the complete symmetry model in terms of the f-divergence.
The DPS [ f ] model can be expressed as
F ( 2 π i j c ) = γ i j + a k ( i < j ) , γ i j ( i j ) ,
where k = j i , γ i j = γ j i and π i j c = π i j / ( π i j + π j i ) . It should be noted that π i j c represents the conditional probability of an observation falling in the ( i , j ) cell, given that it falls in either the ( i , j ) cell or the ( j , i ) cell. Namely, the DPS[f] model indicates that
F ( 2 π i j c ) F ( 2 π j i c ) = a k ( i < j ) .
When a 1 = = a r 1 = 0 , the DPS[f] model is reduced to the S model.
If f ( x ) = x log ( x ) , x > 0 , then the f-divergence is reduced to the KL divergence. When we set f ( x ) = x log ( x ) , Equation (3) is reduced to
π i j = π i j S exp ( γ i j + a k 1 ) ( i < j ) , π i j S exp ( γ i j 1 ) ( i j ) ,
where k = j i and γ i j = γ j i . We shall refer to this model as the DPSKL model. Under the DPSKL model, the ratios of π i j and π j i for i < j are expressed as
π i j π j i = d k KL ( i < j ) ,
where d k KL = exp ( a k ) and k = j i . Since Equation (5) indicates that the ratio of π i j and π j i depends on the distance of k = j i , the DPSKL model is equivalent to the DPS model proposed by Goodman [10]. Namely, the DPS model is the closest model to the S model in terms of the KL divergence under the conditions where i j = k π i j , k 0 , and the sums π i j + π j i for i = 1 , , r ; j = 1 , , r are given. This is a special case of Theorem 1.
If f ( x ) = log ( x ) , x > 0 , then the f-divergence is reduced to the reverse KL divergence. Then, the DPS[f] model is reduced to
π i j = π i j S 1 γ i j + a k ( i < j ) , π i j S 1 γ i j ( i j ) ,
where k = j i and γ i j = γ j i . We shall refer to this model as the DPSRKL model. This model is the closest to the S model when the divergence is measured by the reverse KL divergence and can be expressed as
1 π i j c 1 π j i c = d k RKL ( i < j ) ,
where d k RKL = 2 a k and k = j i . This model indicates that the difference between inverse probabilities 1 / π i j c and 1 / π j i c depends on the distance of k = j i .
If f ( x ) = ( 1 x ) 2 , then the f-divergence is reduced to the χ 2 -divergence (Pearsonian distance). Then, the DPS[f] model is reduced to
π i j = π i j S γ i j + a k 2 + 1 ( i < j ) , π i j S γ i j 2 + 1 ( i j ) ,
where k = j i and γ i j = γ j i . We shall refer to this model as the DPSP model. This model is the closest to the S model when the divergence is measured by the χ 2 -divergence and can be expressed as
π i j c π j i c = d k P ( i < j ) ,
where d k P = a k / 4 and k = j i . This model indicates that the difference between π i j c and π j i c depends on the distance of k = j i .
Moreover, if f ( x ) = ( λ ( λ + 1 ) ) 1 ( x λ + 1 x ) , x > 0 , where λ is a real-valued parameter, then the f-divergence is reduced to the power-divergence [13]. Then, the DPS [ f ] model is reduced to
π i j = π i j S λ γ i j + a k + 1 λ + 1 1 λ ( i < j ) , π i j S λ γ i j + 1 λ + 1 1 λ ( i j ) ,
where k = j i and γ i j = γ j i . We shall refer to this model as the DPSPD(λ) model. This model is the closest to the S model when the power-divergence measures the divergence and can be expressed as
( π i j c ) λ ( π j i c ) λ = d k PD ( λ ) ( i < j ) ,
where d k PD ( λ ) = ( λ a k ) / 2 λ and k = j i . This model indicates that the difference between the symmetric conditional probabilities to the power of λ depends on the distance of k = j i . When we apply the DPSPD(λ) model, we should set the value of λ .
Kateri and Papaioannou [2] reported that the DPS[f] model is equivalent to the DPS model regardless of f. That is, all the models described above (i.e., DPSKL, DPSRKL, DPSP, and DPSPD(λ)) are equivalent to the DPS model surprisingly. However, the proof was not given. We prove the following theorem.
Theorem 2. 
The DPS[f] model is equivalent to the DPS model regardless of f.
The poof is given in Appendix A.2. Theorem 2 states that the DPS model holds if and only if the DPS[f] model holds. If the DPS model fits the given dataset, we obtain various interpretations for the data.
When a 1 = = a r 1 , the DPS[f] model is reduced to the conditional symmetry model based on the f-divergence (CS[f]) model. The CS[f] model is described previously Kateri and Papaioannou [2]. Additionally, Fujisawa and Tahata [14] proposed the generalization of the CS[f] model. Similarly, when d 1 = = d r 1 , the DPS model is reduced to the CS model proposed by McCullagh [11]. The CS[f] model is equivalent to the CS model regardless of f (Kateri and Papaioannou [2]). Hence, Theorem 2 leads to the following result.
Corollary 1. 
The CS[f] model is equivalent to the CS model regardless of f.

3. Equivalence Conditions for Symmetry

Here, the equivalence conditions of the S model are discussed. If the S model holds, then the DPS[f] model with a 1 = = a r 1 = 0 holds. Conversely, if the DPS[f] model holds, then the S model does not hold generally. Therefore, we are interested in considering an additional condition to obtain the S model when the DPS[f] model holds. Other studies have discussed such conditions; see Read [15] and Tahata et al. [16].
We consider the distance global symmetry (DGS) model defined as
δ k U = δ k L ( k = 1 , , r 1 ) ,
where δ k U = j i = k π i j , δ k L = i j = k π i j . For k = 1 , , r 1 , this model indicates that the sum of probabilities which are apart distance k = j i from main diagonal cells is equal to the sum of probabilities which are apart distance k = i j from main diagonal cells. We obtain the following theorem. (The proof is given in Appendix A.3.)
Theorem 3. 
The S model holds if and only if both the DPS[f] and DGS models hold.
Next, we consider the global symmetry (GS) model, which is defined as
i < j π i j = i < j π j i .
It should be noted that the DGS model implies the GS model. Read [15] noted that the S model holds if and only if both the CS and GS models hold. Fujisawa and Tahata [14] proved that the S model holds if and only if the CS[f] and GS models hold. These statements are the same as those from Corollary 1. In addition, a refined estimator for measures associated with the S, CS, and GS models was introduced by [17]. The result has a significant connection to decomposing the S model and separating the goodness-of-fit test statistic of the S model. According to Corollary 1, the refined estimator for the measure of CS can be utilized to gauge the extent of deviation from the CS[f] model.
This section proves the separation of the test statistics for the S model into those for the DPS[f] model and the DGS model. Consider a square contingency table of size r × r where n i j denotes the observed frequency in the cell located at the ( i , j ) position. Assume this contingency table adheres to a multinomial distribution. In this context, let m i j represent the expected frequency in the ( i , j ) cell, and m ^ i j be its corresponding maximum likelihood estimate under a specified model. To test each model’s goodness of fit, we can employ the likelihood ratio chi-square statistic, denoted by G 2 ( M ) . This statistic is computed using the following formula:
G 2 ( M ) = 2 i = 1 r j = 1 r n i j log n i j m ^ i j .
This statistic follows a chi-square distribution with the corresponding degrees of freedom (df).
It is supposed that model M3 holds if and only if both models M1 and M2 hold. In this case, if the analyst has found hypothesis M3 unacceptable, their attention will move to examining components M1 and M2. For these three models, Aitchison [18] discussed the properties of the Wald test statistics, and Darroch and Silvey [19] described the properties of the likelihood ratio chi-square statistics. Assume that the following equivalence holds:
T ( M 3 ) = T ( M 1 ) + T ( M 2 ) ,
where T is the goodness of fit test statistic and the number of df for M3 is equal to the sum of numbers of df for M1 and M2. If both M1 and M2 are accepted with a high probability (at the α significance level), then M3 is accepted. However, when (6) does not hold, an incompatible situation where both M1 and M2 are accepted with a high probability but M3 is rejected may arise. In fact, Darroch and Silvey [19] showed such an interesting example. The partitions of chi-squared test statistics are also discussed in, for example, [20,21].
From Theorem 3, the S model holds if and only if the DPS[f] model and the DGS model hold. In addition, df for the DPS[f] model is ( r 1 ) ( r 2 ) / 2 and that for DGS model is ( r 1 ) . The df for the S model can be obtained by adding the degrees of freedom for the DPS[f] model and the DGS model. Thus, we consider partitioning test statistics.
Theorem 2 confirms that the DPS[f] model is equivalent to the DPS model. Therefore, the maximum likelihood estimates (MLEs) under the DPS[f] model are given by
m ^ i j = n k U n k U + n k L n i j + n j i ( i < j ) , m ^ i j = n i j ( i = j ) , m ^ i j = n k L n k U + n k L n i j + n j i ( i > j ) ,
where k = | j i | , n k U = k = j i n i j , and n k L = k = j i n j i (Goodman [10]).
Next, we consider the MLEs under the DGS model using the Lagrange function. Since the kernel of the log likelihood is i = 1 r j = 1 r n i j log π i j , Lagrange function L is written as
L = i = 1 r j = 1 r n i j log π i j + λ i = 1 r j = 1 r π i j 1 + k = 1 r 1 λ k k = j i π i j π j i .
Equating the derivation of L to 0 with respect to π i j , λ , and λ k gives
m ^ i j = ( n k U + n k L ) n i j 2 n k U ( i < j ) , m ^ i j = n i j ( i = j ) , m ^ i j = ( n k U + n k L ) n i j 2 n k L ( i > j ) ,
where k = | j i | . It is important to note that the DPS and DGS models do not remain the same when the row and column categories are permuted. Therefore, these models should be used with data from an ordinal category.
We obtain the following equivalence from Equations (7) and (8):
G 2 ( S ) = G 2 ( D P S [ f ] ) + G 2 ( D G S ) ,
because the MLEs under the S model are m ^ i j = ( n i j + n j i ) / 2 . Therefore, the DPS[f] model and the DGS model are separable and exhibit independence.
Let W ( M ) denote the Wald statistic for model M. We obtain the following theorem and prove it in Appendix A.4.
Theorem 4. 
W ( S ) is equal to the sum of W ( D P S [ f ] ) and W ( D G S ) .

4. Numerical Example

Table 1, which is taken from Smith et al. [22], describes the amount of influence religious leaders and medical leaders should have in government funding for decisions on stem cell research when surveying 871 people. The influence levels are divided into four categories: (1) Great influence, (2) Some influence, (3) A little influence, and (4) No influence.
The values of the likelihood ratio chi-square statistics G 2 and the corresponding p values for the models applied to these data are shown in Table 2. Table 2 indicates that the sum of the test statistics DPS (i.e., DPS[f]) model and DGS model is equal to that of the S model. The S model fits the data very poorly. We can infer that the marginal distribution for religious leaders is not equal to that for medical leaders. On the other hand, the DPS model fits the data very well. The likelihood-ratio test for the null hypothesis H 0 : d 1 = d 2 = d 3 = 1 uses a test statistic which is the difference between G 2 for the S model and the DPS model. The resulting test statistic is 545.15 2.45 = 542.70 with three degrees of freedom. This indicates strong evidence of at least one difference from 1. Additionally, the DGS model fits the data poorly. From Theorem 3, the reason of the poor fit of S model is caused by the poor fit of the DGS model rather than the DPS model.
The values of MLEs of ( d 1 , d 2 , d 3 ) in Equation (1) are ( 0.15 , 0.05 , 0.06 ) . It should be noted that ( d 1 , d 2 , d 3 ) is equal to ( d 1 KL , d 2 KL , d 3 KL ) in the DPSKL model. Let ( i , j ) denote the pair that the amount of influence religious leaders is ith level and that of medical leaders is jth level. When k = j i ( k = 1 , 2 , 3 ), a pair ( i , j ) is d ^ k times as likely as a pair ( j , i ) on condition that a pair is ( i , j ) or ( j , i ) . From d ^ k < 1 ( k = 1 , 2 , 3 ), the probability distribution for religious leaders is stochastically higher than the probability distribution of medical leaders. That is, the medical leaders rather than the religious leaders should have influence in government funding for decisions on stem cell research.
Moreover, from Theorem 2, we can obtain various interpretations. Since the DPS model holds, the DPSRKL, DPSP, and DPSPD(λ) models also hold. For example, we obtain
( d ^ 1 RKL , d ^ 2 RKL , d ^ 3 RKL ) = ( 6.37 , 19.22 , 18.09 ) , ( d ^ 1 P , d ^ 2 P , d ^ 3 P ) = ( 0.73 , 0.90 , 0.90 ) ,
and for λ = 3 ,
( d ^ 1 PD ( 3 ) , d ^ 2 PD ( 3 ) , d ^ 3 PD ( 3 ) ) = ( 0.65 , 0.86 , 0.85 ) .
When k = j i ( k = 1 , 2 , 3 ), we can infer that (i) the difference between the reciprocal of conditional probability that a pair is ( i , j ) and the reciprocal of conditional probability that a pair is ( j , i ) is d ^ k RKL on condition that the pair is ( i , j ) or ( j , i ) from the DPSRKL model, (ii) the difference between the conditional probability that a pair is ( i , j ) and the conditional probability that a pair is ( j , i ) is d ^ k P under the same condition from the DPSP model, and (iii) the difference between the conditional probability that a pair is ( i , j ) to the third power and the conditional probability that a pair is ( j , i ) to the third power is d ^ k PD ( 3 ) under the same condition from the DPSPD(3) model.

5. Concluding Remarks

This paper proves that the DPS[f] model is equivalent to the DPS model proposed by Goodman [10]. This result provides various interpretations of the DPS model. The separation of the test statistic for the S model is considered. The DPS[f] and DGS models are separable and exhibit independence. Kateri and Papaioannou [2], Kateri and Agresti [3], Tahata [5] and Fujisawa and Tahata [14] considered models based on the f-divergence for the analysis of square contingency tables with ordinal categories. In the future, it should be studied whether the model based on the f-divergence is equivalent to the conventional model.

Author Contributions

Conceptualization, K.T.; methodology, K.T.; software, K.M.; validation, K.T. and K.M.; formal analysis, K.T.; investigation, K.M.; resources, Tahata, K.; data curation, K.M.; writing—original draft preparation, K.M.; writing—review and editing, K.T.; visualization, K.M.; supervision, K.T.; project administration, K.T.; funding acquisition, K.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by JSPS KAKENHI (Grant Number 20K03756).

Data Availability Statement

“The General Social Survey” at https://gss.norc.org/ (accessed on 1 June 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

This section provides the proofs of theorems.

Appendix A.1

In a similar manner to Tahata [5], we prove Theorem 1. Let I C ( π : π S ) denote the f-divergence between ( π i j ) and ( π i j S ). That is
I C ( π : π S ) = i = 1 r j = 1 r π i j S f π i j π i j S ,
where f satisfies the conditions described in Section 1. Now minimize (A1) under the conditions where the restraints
π i j + π j i = t i j = t j i ( i = 1 , , r ; j = 1 , , r )
and
δ k U = i j = k π i j , δ k L = i j = k π i j ( k = 1 , , r 1 )
are given. The Lagrange function is written as
L = I C ( π : π S ) + i = 1 r j = 1 r λ i j π i j + π j i t i j + k = 1 r 1 Δ ¯ k i j = k π i j δ k U + Δ ¯ k i j = k π i j δ k L .
By taking the partial derivative of L with respect to π i j and setting it to zero, we obtain the following equation:
f π i j π i j S + Δ ¯ k + λ i j + λ j i = 0 ( i < j ) , f π i j π i j S + λ i j + λ j i = 0 ( i = j ) , f π i j π i j S + Δ ¯ k + λ i j + λ j i = 0 ( i > j ) .
Let f denote F, and let π i j * denote the solution satisfying (A2), (A3), and (A4). Given that f is a strictly convex function, it follows that F ( x ) = f ( x ) > 0 for all x. Thus, F is strictly monotonic, ensuring the existence of F 1 . We represent ζ i j as ( λ i j + λ j i ) and Δ l as Δ ¯ l . From Equation (A4), we obtain
π i j * = π i j S F 1 Δ k + ζ i j ( i < j ) , π i j * = π i j S F 1 ζ i j ( i = j ) , π i j * = π i j S F 1 Δ k + ζ i j ( i > j ) ,
where ζ i j = ζ j i and Δ k + Δ k = 0 . The minimum value of I C ( π : π S ) is obtained for π i j * , where ζ i j and Δ l are selected to ensure that π i j * complies with the constraints (A2) and (A3). Thus, the DPS[f] model represents the optimal approximation to the S model in terms of f-divergence under these specified conditions.

Appendix A.2

Let function G be defined as
G ( x ) = F 2 x 1 + x F 2 1 + x ( x > 0 ) ,
where F = f . Then, the derivative of G is
G ( x ) = 2 ( 1 + x ) 2 F 2 x 1 + x + F 2 1 + x .
Since the function f is twice-differential and strictly convex G ( x ) > 0 for x > 0 , hence G is a strictly increasing function, and G 1 exists.
If the DPS model holds, π i j / π j i = d k holds for i < j from Equation (1), where k = j i . Then we can see that for i < j ,
G ( d k ) = F 2 d k 1 + d k F 2 1 + d k , = F 2 π i j c F 2 π j i c .
This is equivalent to Equation (4). Namely, the DPS[f] model holds.
On the other hand, if the DPS[f] model holds, Equation (4) holds. We can see that for i < j ,
G π i j π j i = a k .
Since G 1 exists, we obtain
π i j π j i = G 1 a k .
Namely, the DPS model holds. The proof is complete.

Appendix A.3

It is obvious that if the S model holds, the DPS[f] model and the DGS model simultaneously hold. Assuming that both the DPS[f] and the DGS models hold, we show that the S model holds. From Theorem 2, the DPS[f] model is equivalent to π i j / π j i = d k for i < j with k = j i . Since the DGS model holds, we obtain
j i = k d k 1 π j i = 0 ( k = 1 , , r 1 ) .
Since π j i > 0 , we get d k = 1 ( k = 1 , , r 1 ). Namely, the S model holds.

Appendix A.4

Theorem 2 shows that the DPS[f] model is equivalent to the DPS model. Let
π = ( π 11 , , π 1 r , π 21 , , π 2 r , , π r 1 , , π r r ) t , β = ( ρ 1 , , ρ r 1 , ε ) t ,
where ε = ( ε 11 , , ε 1 r , ε 22 , , ε 2 r , , ε r r ) . Then, from Equation (1), the DPS model is expressed as
log π = X β = ( x 1 , , x r 1 , x 11 , , x 1 r , x 22 , , x 2 r , , x r r ) β ,
where x l = ( w l + 1 , , w r , 0 , , 0 ) t is a r 2 × 1 vector ( l = 1 , , r 1 ) . Here, w h ( 1 × r vector) is 1 for the hth element and 0 otherwise. For example, when r = 4 ,
x 1 = ( w 2 , w 3 , w 4 , 0 , , 0 ) t = ( 0 , 1 , 0 , 0 , 0 , 0 , 1 , 0 , 0 , 0 , 0 , 1 , 0 , 0 , 0 , 0 ) t .
Additionally, x i j ( i j ) is the r 2 × 1 vector shouldering ε i j . Note that the r 2 × K matrix X is a full column rank where K = ( r 1 ) + r ( r + 1 ) / 2 .
We define the linear space spanned by the columns of the matrix X as S ( X ) , which has dimension K. This space, S ( X ) , is a subspace of R r 2 . Consider an r 2 × d 1 matrix U with full column rank, such that the linear space S ( U ) , spanned by the columns of U , serves as the orthogonal complement of S ( X ) . Note that d 1 is calculated as d 1 = r 2 ( ( r 1 ) + r ( r + 1 ) / 2 ) = ( r 1 ) ( r 2 ) / 2 . Given that U t X = O d 1 , K , where O d 1 , K denotes the d 1 × K zero matrix, the DPS model can be expressed as h 1 ( π ) = U t log π = 0 d 1 , with 0 s representing the s × 1 zero vector.
Additionally, the DGS model can be expressed as h 2 ( π ) = M π = 0 d 2 where
M = ( g 1 , , g r 1 ) t ,
and d 2 = r 1 . Here, g l = 2 x l j i = l x i j . Note that M t belongs to the space S ( X ) . That is, S ( M t ) S ( X ) .
Let p denote π with π i j replaced by p i j , where p i j = n i j / n with n = n i j . From Theorem 3, the S model is equivalent to h 3 ( π ) = 0 d 3 , where h 3 = ( h 1 t , h 2 t ) t and d 3 = d 1 + d 2 = r ( r 1 ) / 2 . In an analogous manner to Tahata [5], we obtain that n ( h 3 ( p ) h 3 ( π ) ) has an asymptotically normal distribution with mean 0 d 3 and covariance matrix
H 3 ( π ) Σ ( π ) H 3 t ( π ) = H 1 ( π ) Σ ( π ) H 1 t ( π ) O d 1 , d 2 O d 2 , d 1 H 2 ( π ) Σ ( π ) H 2 t ( π ) ,
where H s ( π ) = h s ( π ) / π t and Σ ( π ) = d i a g ( π ) π π t . Here, d i a g ( π ) denotes a diagonal matrix with the ith component of π as the ith diagonal component. Therefore, W 3 = W 1 + W 2 holds, where
W s = n h s t ( p ) ( H s ( p ) Σ ( p ) H s t ( p ) ) 1 h s ( p ) .
The Wald statistic for the DPS[f] model (i.e., W ( D P S [ f ] ) ) is W 1 , that for the DGS model (i.e., W ( D G S ) ) is W 2 , and that for the S model (i.e., W ( S ) ) is W 3 . The proof is complete.

References

  1. Bowker, A.H. A test for symmetry in contingency tables. J. Am. Stat. Assoc. 1948, 43, 572–574. [Google Scholar] [CrossRef]
  2. Kateri, M.; Papaioannou, T. Asymmetry models for contingency tables. J. Am. Stat. Assoc. 1997, 92, 1124–1131. [Google Scholar] [CrossRef]
  3. Kateri, M.; Agresti, A. A class of ordinal quasi-symmetry models for square contingency tables. Stat. Probab. Lett. 2007, 77, 598–603. [Google Scholar] [CrossRef]
  4. Tahata, K.; Tomizawa, S. Generalized linear asymmetry model and decomposition of symmetry for multiway contingency tables. J. Biom. Biostat. 2011, 2, 1–6. [Google Scholar] [CrossRef]
  5. Tahata, K. Separation of symmetry for square tables with ordinal categorical data. Jpn. J. Stat. Data Sci. 2020, 3, 469–484. [Google Scholar] [CrossRef]
  6. Tahata, K. Advances in Quasi-Symmetry for Square Contingency Tables. Symmetry 2022, 14, 1051. [Google Scholar] [CrossRef]
  7. Beh, E.J.; Lombardo, R. Visualising Departures from Symmetry and Bowker’s X2 Statistic. Symmetry 2022, 14, 1103. [Google Scholar] [CrossRef]
  8. Altun, G.; Saraçbaşı, T. Determination of model fitting with power-divergence-type measure of departure from symmetry for sparse and non-sparse square contingency tables. Commun. Stat.-Simul. Comput. 2022, 51, 4087–4111. [Google Scholar] [CrossRef]
  9. Ando, S. Generalized Sum-Asymmetry Model and Orthogonality of Test Statistic for Square Contingency Tables. Austrian J. Stat. 2024, 53, 99–108. [Google Scholar] [CrossRef]
  10. Goodman, L.A. Multiplicative Models for Square Contingency Tables with Ordered Categories. Biometrika 1979, 66, 413–418. [Google Scholar] [CrossRef]
  11. McCullagh, P. A class of parametric models for the analysis of square contingency tables with ordered categories. Biometrika 1978, 65, 413–418. [Google Scholar] [CrossRef]
  12. Ireland, C.T.; Ku, H.H.; Kullback, S. Symmetry and Marginal Homogeneity of an r × r Contingency Table. J. Am. Stat. Assoc. 1969, 64, 1323–1341. [Google Scholar] [CrossRef]
  13. Read, C.B.; Cressie, N. Goodness-of-Fit Statistics for Discrete Multivariate Data; Springer: New York, NY, USA, 1988. [Google Scholar]
  14. Fujisawa, K.; Tahata, K. Asymmetry model based on f-divergence and orthogonal decomposition of symmetry for square contingency tables with ordinal categories. SUT J. Math. 2020, 56, 39–53. [Google Scholar] [CrossRef]
  15. Read, C.B. Partitioning chi-squape in contingency tables: A teaching approach. Commun. Stat.-Theory Methods 1977, 6, 553–562. [Google Scholar] [CrossRef]
  16. Tahata, K.; Naganawa, M.; Tomizawa, S. Extended linear asymmetry model and separation of symmetry for square contingency tables. J. Jpn. Stat. Soc. 2016, 46, 189–202. [Google Scholar] [CrossRef]
  17. Tahata, K.; Auchi, R.; Ando, S.; Tomizawa, S. Separation of the refined estimator of the measure for symmetry in square contingency tables. Commun. Stat.-Simul. Comput. 2023, 1–17. [Google Scholar] [CrossRef]
  18. Aitchison, J. Large-sample restricted parametric tests. J. R. Stat. Soc. Ser. B-Stat. Methodol. 1962, 24, 234–250. [Google Scholar] [CrossRef]
  19. Darroch, J.N.; Silvey, S.D. On testing more than one hypothesis. Ann. Math. Stat. 1963, 34, 555–567. [Google Scholar] [CrossRef]
  20. Lang, J.B.; Agresti, A. Simultaneously modeling joint and marginal distributions of multivariate categorical responses. J. Am. Stat. Assoc. 1994, 89, 625–632. [Google Scholar] [CrossRef]
  21. Lang, J.B. On the partitioning of goodness-of-fit statistics for multivariate categorical response models. J. Am. Stat. Assoc. 1996, 91, 1017–1023. [Google Scholar] [CrossRef]
  22. Smith, T.W.; Marsden, P.; Hout, M.; Kim, J. General Social Surveys, 1972–2014 [Machine-Readable Data File]; NORC at the University of Chicago: Chicago, IL, USA, 2006. [Google Scholar]
Table 1. How much influence should religious leaders and medical leaders have in government funding for decisions on stem cell research? [22].
Table 1. How much influence should religious leaders and medical leaders have in government funding for decisions on stem cell research? [22].
ReligiousMedical Leaders
LeadersGreat (1)Fair (2)Little (3)None (4)Total
Great (1)36167766
(36.00) a(11.96) a(6.22) a(7.00) a
(36.00) b(60.19) b(70.95) b(67.00) b
Fair (2)7496224196
(78.04) a(96.00) a(26.05) a(4.78) a
(42.67) b(96.00) b(82.76) b(40.55) b
Little (3)119174484345
(119.78) a(169.95) a(48.00) a(3.99) a
(62.59) b(100.34) b(48.00) b(15.05) b
None (4)127932618264
(127.00) a(92.22) a(26.01) a(18.00) a
(67.00) b(48.91) b(14.99) b(18.00) b
Total35637910333871
a MLEs under the DPS model; b MLEs under the DGS model.
Table 2. Likelihood ratio chi-square values G 2 for the models applied to Table 1.
Table 2. Likelihood ratio chi-square values G 2 for the models applied to Table 1.
Modelsdf G 2 p-Value
S6545.15< 0.0001
DPS32.450.4847
DGS3542.70< 0.0001
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tahata, K.; Matsuda, K. Diagonals–Parameter Symmetry Model and Its Property for Square Contingency Tables with Ordinal Categories. Symmetry 2024, 16, 768. https://doi.org/10.3390/sym16060768

AMA Style

Tahata K, Matsuda K. Diagonals–Parameter Symmetry Model and Its Property for Square Contingency Tables with Ordinal Categories. Symmetry. 2024; 16(6):768. https://doi.org/10.3390/sym16060768

Chicago/Turabian Style

Tahata, Kouji, and Kohei Matsuda. 2024. "Diagonals–Parameter Symmetry Model and Its Property for Square Contingency Tables with Ordinal Categories" Symmetry 16, no. 6: 768. https://doi.org/10.3390/sym16060768

APA Style

Tahata, K., & Matsuda, K. (2024). Diagonals–Parameter Symmetry Model and Its Property for Square Contingency Tables with Ordinal Categories. Symmetry, 16(6), 768. https://doi.org/10.3390/sym16060768

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop