Next Article in Journal
Coarse-Grained Column Agglomeration Parallel Algorithm for LU Factorization Using Multi-Threaded MATLAB
Previous Article in Journal
Return Strategies of Competing E-Sellers: Return Freight Insurance vs. Return Pickup Services
Previous Article in Special Issue
Wild Bootstrap-Based Bias Correction for Spatial Quantile Panel Data Models with Varying Coefficients
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation for Two Sensitive Variables Using Randomization Response Model Under Stratified Random Sampling

1
Department of Children Welfare, Woosuk University, Wanju 55338, Jeonbuk, Republic of Korea
2
Department of Computer Science, Dongshin University, Naju 58245, Jeonnam, Republic of Korea
3
Division of Public Affairs & Police Administration, Dongguk University, Gyeongju 38066, Gyeongbuk, Republic of Korea
4
Department of Applied Statistics, Dongguk University, Gyeongju 38066, Gyeongbuk, Republic of Korea
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(2), 297; https://doi.org/10.3390/math13020297
Submission received: 18 December 2024 / Revised: 10 January 2025 / Accepted: 14 January 2025 / Published: 17 January 2025
(This article belongs to the Special Issue Statistical Theory and Application, 2nd Edition)

Abstract

:
When direct survey are about sensitive characteristics such as addiction to drugs, alcoholism, proneness to tax invasion and sexual violence, nonresponse bias and response bias become serious problems because people oftentimes do not wish to give true information. In this study, when the population is composed of strata such as gender, region, age group, we consider the simple model and crossed model by applying stratified random sampling which can estimate not only the domain population proportion but also the whole population proportion for two sensitive attributes such as drug use and sexual violence in the same time. In addition, when the size of each population stratum is unknown in stratified random sampling, we propose the simple model and crossed model by using stratified double sampling method. In each proposed survey design, the sample allocation of each stratum is dealt with in consideration of proportional allocation and optimal one. We compare the efficiency between the simple model and the crossed model according to the proposed stratified random sampling design.

1. Introduction

The surveys on sensitive attributes as drug use, tax evasion or criminal violence, respondents may avoid responding or give false answers because of the risk of exposing their personal confidentiality, which can lead to non-response bias or response bias. To overcome this problem, the randomized response model (RRM) was first proposed by [1] that could obtain sensitive information while protecting the identity or confidentiality of the respondent through an indirect response using a randomization device and an unrelated question randomized response model was suggested by [2] that improved the Warner’s related question model. Since then many researchers have suggested various randomized response models to improve the estimation quality [3].
For the stratified random sampling design of RRM, a stratified unrelated question randomized response model was suggested by [4] to estimate the whole and domain population proportion, a stratified Warner model was published by [5] and it have developed a discrete quantitative randomized response model using stratified random sampling [6].
Especially, a survey for the two sensitive attributes was suggested by [7] to estimate the proportion of two sensitive characteristics at the same time by having the respondents selected by the simple random sampling with replacement using two randomization devices. They provided two shuffled decks of cards to each respondent in the sample and each deck was comprised two sorts of cards that indicate whether or not the respondent possesses a particular sensitive attributes.
In Deck-I, they structured the first randomization device as “Do you have a sensitive attribute A?” and “Do you have a nonsensitive attribute A c ?”, and the second randomization device as “Do you have a sensitive attribute B?” and “Do you have a nonsensitive attribute B c ?”, and proposed a simple model that estimates the proportion of the population with sensitive attribute A, sensitive attribute B, and both sensitive attributes A and B simultaneously. In Deck-II, they structured the first randomization device as “Do you have a sensitive attribute A?” and “Do you have a nonsensitive attribute B c ?”, and the second randomization device as “Do you have a sensitive attribute B?” and “Do you have a nonsensitive attribute A c ?”, and proposed a crossed model that estimates the proportion of the population with sensitive attribute A, sensitive attribute B, and both sensitive attributes A and B at the same time.
In this study, when the population is composed of strata, we extend the simple model and crossed model by applying stratified random sampling which can estimate not only the stratum population proportion but also the whole population proportion for two sensitive attributes. When the size of each stratum is unknown in stratified random sampling, we propose the simple model and crossed model by using stratified double sampling method at the same time. In each proposed model, the sample allocation of each stratum is dealt with in consideration of proportional allocation and optimal one. We are to compare the efficiency between the simple model and the crossed model according to the proposed stratified random sampling method.
In Section 2 we consider the stratified estimation with simple model, and then we consider the crossed model with stratified sampling design in Section 3. In Section 4, we show the efficiency comparison between the stratified random sampling and the stratified double sampling by simple model and crossed model. In Section 5, we make a concluding remarks.

2. Stratified Estimation for Two Sensitive Attributes with the Simple Model

2.1. A Simple Model with the Stratified Random Sampling

In this section, we consider that the application of stratified sampling design to a simple model for estimating two sensitive attributes when the sizes of each stratum are known. We assume a population with size N composed of mutually exclusive strata, each with N h a known size. From each stratum N h ( h = 1 ,   2 ,   ,   L ) , a sample of size n h is drawn using simple random sampling with replacement (SRSWR). The ith respondent from each stratum h responds to a survey selected by two randomization devices as following Table 1 and Table 2, answering either “yes” or “no” by the following devices, with selection probabilities P h 0.5 and T h 0.5 .
Therefore, the probability that the respondent in stratum h ( = 1 ,   2 ,   ,   L ) answers (yes, yes) is given by
θ h 11 = ( 2 P h 1 ) ( 2 T h 1 ) π h A B + ( 2 P h 1 ) ( 1 T h ) π h A + ( 1 P h ) ( 2 T h 1 ) π h B + ( 1 P h ) ( 1 T h ) .
And the probability that the respondent in stratum h answers (yes, no) is given by
θ h 10 = ( 2 P h 1 ) ( 2 T h 1 ) π h A B + ( 2 P h 1 ) T h π h A ( 1 P h ) ( 2 T h 1 ) π h B + ( 1 P h ) T h .
The probability that the respondent in stratum h answers (no, yes) is given by
θ h 01 = ( 2 P h 1 ) ( 2 T h 1 ) π h A B ( 2 P h 1 ) ( 1 T h ) π h A + P h ( 2 T h 1 ) π h B + P h ( 1 T h ) .
Finally, the probability that the respondent in stratum h answers (no, no) is given by
θ h 00 = ( 2 P h 1 ) ( 2 T h 1 ) π h A B ( 2 P h 1 ) T h π h A P h ( 2 T h 1 ) π h B + P h T h ,
where π h A and π h B are a population proportion of sensitive attribute A and sensitive attribute B and π h A B is a population proportion of sensitive attribute A and B, and i = 0 1 j = 0 1 θ h i j = 1 for a stratum h, respectively.
Let the number of person who answered (yes, yes) out of n h respondents sampled from h stratum is n h 11 , and answered (yes, no) is n h 10 , and answered (no, yes) is n h 01 , and answered (no, no) is n h 00 , then the sample proportions are
θ ^ h 11 = n h 11 n h , θ ^ h 10 = n h 10 n h , θ ^ h 01 = n h 01 n h , θ ^ h 00 = n h 00 n h ,
respectively.
We can define the minimum distance function between the population proportion θ h i j and observed sample proportion θ ^ h i j for stratum h, as follows
D h = 1 2 i = 0 1 j = 0 1 θ h i j θ ^ h i j 2 .
Setting for minimizing the distance function,
D h π h A = 0 , D h π h B = 0 , D h π h A B = 0 ,
then using the method of moment we can find the estimators of population proportion π h A , π h B , and π h A B for stratum h as following
π ^ h A = θ ^ h 11 + θ ^ h 10 θ ^ h 01 θ ^ h 00 + ( 2 P h 1 ) 2 ( 2 P h 1 ) , π ^ h B = θ ^ h 11 θ ^ h 10 + θ ^ h 01 θ ^ h 00 + ( 2 T h 1 ) 2 ( 2 T h 1 ) , π ^ h A B = ( P h + T h ) θ ^ h 11 + ( T h P h ) θ ^ h 10 ( P h T h ) θ ^ h 01 2 ( 2 P h 1 ) ( 2 T h 1 ) + ( 2 P h T h ) θ ^ h 00 T h ( 1 P h ) P h ( 1 T h ) 2 ( 2 P h 1 ) ( 2 T h 1 ) .
And the population proportions of sensitive attribute are respectively,
π A = h = 1 L W h π h A , π B = h = 1 L W h π h B , π A B = h = 1 L W h π h A B ,
where W h = N h / N .
Theorem 1. 
The stratified estimators π ^ A , π ^ B and π ^ A B are an unbiased estimator for the population proportions of sensitive attribute, π A , π B , and π A B , respectively.
π ^ A = h = 1 L W h θ ^ h 11 + θ ^ h 10 θ ^ h 01 θ ^ h 00 + ( 2 P h 1 ) 2 ( 2 P h 1 ) ,
π ^ B = h = 1 L W h θ ^ h 11 θ ^ h 10 + θ ^ h 01 θ ^ h 00 + ( 2 T h 1 ) 2 ( 2 T h 1 ) ,
π ^ A B = h = 1 L W h ( P h + T h ) θ ^ h 11 + ( T h P h ) θ ^ h 10 ( P h T h ) θ ^ h 01 2 ( 2 P h 1 ) ( 2 T h 1 ) + ( 2 P h T h ) θ ^ h 00 T h ( 1 P h ) P h ( 1 T h ) 2 ( 2 P h 1 ) ( 2 T h 1 ) .
Proof. 
Because E ( θ ^ h 11 ) = θ h 11 , E ( θ ^ h 10 ) = θ h 10 , E ( θ ^ h 01 ) = θ h 01 , E ( θ ^ h 00 ) = θ h 00 , then
E ( π ^ A ) = E h = 1 L W h θ ^ h 11 + θ ^ h 10 θ ^ h 01 θ ^ h 00 + ( 2 P h 1 ) 2 ( 2 P h 1 ) = h = 1 L W h E ( θ ^ h 11 ) + E ( θ ^ h 10 ) E ( θ ^ h 01 ) E ( θ ^ h 00 ) + ( 2 P h 1 ) 2 ( 2 P h 1 ) = h = 1 L W h θ h 11 + θ h 10 θ h 01 θ h 00 + ( 2 P h 1 ) 2 ( 2 P h 1 ) = h = 1 L W h π h A = π A .
In the same way, it can prove that E ( π ^ B ) = π B , E ( π ^ A B ) = π A B respectively. □
Theorem 2. 
The variances of stratified estimators π ^ A , π ^ B and π ^ A B are given by
V ( π ^ A ) = h = 1 L W h 2 n h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 ,
V ( π ^ B ) = h = 1 L W h 2 n h π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 ,
V ( π ^ A B ) = h = 1 L W h 2 n h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 .
Proof. 
V ( π ^ A ) = h = 1 L W h 2 V ( π ^ h A ) = h = 1 L W h 2 V θ ^ h 11 + θ ^ h 10 θ ^ h 01 θ ^ h 00 + ( 2 P h 1 ) 2 ( 2 P h 1 ) = h = 1 L W h 2 4 ( 2 P h 1 ) 2 V ( θ ^ h 11 ) + V ( θ ^ h 10 ) + 2 C o v ( θ ^ h 11 , θ ^ h 10 ) = h = 1 L W h 2 4 ( 2 P h 1 ) 2 θ h 11 ( 1 θ h 11 ) n h + θ h 10 ( 1 θ h 10 ) n h 2 θ h 11 θ h 10 n h = h = 1 L W h 2 n h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 ,
V ( π ^ B ) = h = 1 L W h 2 V ( π ^ h B ) = h = 1 L W h 2 V θ ^ h 11 θ ^ h 10 + θ ^ h 01 θ ^ h 00 + ( 2 T h 1 ) 2 ( 2 T h 1 ) = h = 1 L W h 2 4 ( 2 T h 1 ) 2 V ( θ ^ h 11 ) + V ( θ ^ h 01 ) + 2 C o v ( θ ^ h 11 , θ ^ h 01 ) = h = 1 L W h 2 4 ( 2 T h 1 ) 2 θ h 11 ( 1 θ h 11 ) n h + θ h 01 ( 1 θ h 01 ) n h 2 θ h 11 θ h 01 n h = h = 1 L W h 2 n h π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 ,
and
V ( π ^ A B ) = h = 1 L W h 2 V ( π ^ h A B ) = h = 1 L W h 2 V ( P h + T h ) θ ^ h 11 + ( T h P h ) θ ^ h 10 ( P h T h ) θ ^ h 01 2 ( 2 P h 1 ) ( 2 T h 1 ) + ( 2 P h T h ) θ ^ h 00 T h ( 1 P h ) P h ( 1 T h ) 2 ( 2 P h 1 ) ( 2 T h 1 ) = h = 1 L W h 2 n h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 .
The covariances of stratified estimator π ^ A , π ^ B and π ^ A B are given by
C o v ( π ^ A , π ^ B ) = h = 1 L W h 2 n h π h A B π h A π h B ,
C o v ( π ^ A B , π ^ A ) = h = 1 L W h 2 n h π h A B ( 1 π h A ) + P h ( 1 P h ) π h B ( 2 P h 1 ) 2 ,
C o v ( π ^ A B , π ^ B ) = h = 1 L W h 2 n h π h A B ( 1 π h B ) + T h ( 1 T h ) π h A ( 2 T h 1 ) 2 .
Next, we consider the sample alloction problem for the stratified sampling design, which are the proportional allocation and the optimum allocation. In the proportional allocation the size of straum h is n h = n ( N h / N ) then the variances of the stratified estimator are given by
V P ( π ^ A ) = 1 n h = 1 L W h 2 π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 ,
V P ( π ^ B ) = 1 n h = 1 L W h 2 π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 ,
V P ( π ^ A B ) = 1 n h = 1 L W h 2 π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 .
In the optimum allocation, for a fixed cost, the stratum size n h can be determined to minimize the variance V ( π ^ A ) , V ( π ^ B ) , V ( π ^ A B ) for each estimator. Let the cost function can be defined as follows;
C = c 0 + h = 1 L n h c h ,
where c 0 be a fixed cost and c h be a survey cost for unit in stratum h.
Using the Cauchy-Schwartz inequality, we can find the optimum sample size n h which minimize the variance V ( π ^ A ) , V ( π ^ B ) , V ( π ^ A B ) for a stratum h as following,
n h A = n × W h c h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 1 / 2 h = 1 L W h c h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 1 / 2 ,
n h B = n × W h c h π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 1 / 2 h = 1 L W h c h π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 1 / 2 ,
n h A B = n × W h c h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 1 / 2 h = 1 L W h c h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 1 / 2 .
Thus, we can find that the variances due to the optimum allocation are given by
V m i n ( π ^ A ) = 1 n h = 1 L W h c h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 1 / 2 × h = 1 L W h c h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 1 / 2 ,
V m i n ( π ^ B ) = 1 n h = 1 L W h c h π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 1 / 2 × h = 1 L W h c h π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 1 / 2 ,
V m i n ( π ^ A B ) = 1 n h = 1 L W h c h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 1 / 2 × h = 1 L W h c h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 1 / 2 .

2.2. A Simple Model with the Stratified Double Sampling

In Section 2.1, the problem of applying the stratified random sampling method to the simple model was discussed, assuming that the sizes of each stratum in the population are precisely known. However, when information about the stratification variables is not available, it is necessary to initially take a large sample to understand the information about the stratification variables. Stratified double sampling is a method after stratifying, a subsample is selected to obtain information about sensitive attributes.
In this section, we consider that the stratified double sampling is applied to the simple model in estimating the proportion of a sensitive attribute in sensitive surveys. Also, we deal the problem of allocating samples to each stratum.
When the sizes of each stratum are unknown, in the first stage, respondents are directly questioned about surveys that match the stratification criteria to classify the strata. In the second stage, the simple model is applied to sensitive questions. The survey involves directly questioning n respondents, who were randomly and uniformly selected in the first stage from a population of size N, which is composed of L strata.
  • Question: Do you belong to the h stratum?
In the first stage, let’s classify the sample into h stratum, and denote the number of individuals in each stratum as n h . Then, W h and w h can be expressed as follows.
  •    W h = N h / N , ( h = 1 ,   2 ,   ,   L ) : the population proportion of stratum h,
  •    w h = n h / n , ( h = 1 ,   2 ,   ,   L ) : the sample proportion of stratum h,
where w h is the unbiased estimator of W h .
In the second stage, the respondents od size n h ( n = h L n h ) by simple random sample with replacement selected from the first-stage sample of n h individuals respond using the randomization device of the simple model of [7].
Among the respondents sampled from stratum h, n h , the number of person who answered (yes, yes) is n h 11 , who answered (yes, no) is n h 10 , and who answered (no, yes) is n h 01 , and who answered (no, no) is n h 00 , then the sample proportions θ ^ h 11 , θ ^ h 10 , θ ^ h 01 and θ ^ h 00 are
θ ^ h 11 = n h 11 n h , θ ^ h 10 = n h 10 n h , θ ^ h 01 = n h 01 n h , θ ^ h 00 = n h 00 n h .
The estimators for the population proportion of sensitive attribute π h A , π h B , and π h A B ara given by
π ^ h A d = θ ^ h 11 + θ ^ h 10 θ ^ h 01 θ ^ h 00 + ( 2 P h 1 ) 2 ( 2 P h 1 ) ,
π ^ h B d = θ ^ h 11 θ ^ h 10 + θ ^ h 01 θ ^ h 00 + ( 2 T h 1 ) 2 ( 2 T h 1 ) ,
π ^ h A B d = ( P h + T h ) θ ^ h 11 + ( T h P h ) θ ^ h 10 ( P h T h ) θ ^ h 01 2 ( 2 P h 1 ) ( 2 T h 1 ) + ( 2 P h T h ) θ ^ h 00 T h ( 1 P h ) P h ( 1 T h ) 2 ( 2 P h 1 ) ( 2 T h 1 ) ,
respectively.
Theorem 3. 
If the estiamtors π ^ h A d , π ^ h B d , and π ^ h A B d are unbiased estimator for π h A , π h B and π h A B in stratum h, then the stratified double estimators π ^ A d , π ^ B d and π ^ A B d are unbiased for π A , π B and π A B , respectively
π ^ A d = h = 1 L w h π ^ h A d = h = 1 L w h θ ^ h 11 + θ ^ h 10 θ ^ h 01 θ ^ h 00 + ( 2 P h 1 ) 2 ( 2 P h 1 ) ,
π ^ B d = h = 1 L w h π ^ h B d = h = 1 L w h θ ^ h 11 θ ^ h 10 + θ ^ h 01 θ ^ h 00 + ( 2 T h 1 ) 2 ( 2 T h 1 ) ,
π ^ A B d = h = 1 L w h π ^ h A B d = h = 1 L w h ( P h + T h ) θ ^ h 11 + ( T h P h ) θ ^ h 10 ( P h T h ) θ ^ h 01 2 ( 2 P h 1 ) ( 2 T h 1 ) + ( 2 P h T h ) θ ^ h 00 T h ( 1 P h ) P h ( 1 T h ) 2 ( 2 P h 1 ) ( 2 T h 1 ) .
Proof. 
Since π ^ h A d , π ^ h B d and π ^ h A B d are unbiased estimator of π h A , π h B and π h A B , respectively, we get
E ( π ^ A d ) = E 1 E 2 h = 1 L w h π ^ h A d | w h = E 1 h = 1 L w h π h A = h = 1 L W h π h A = π A ,
E ( π ^ B d ) = E 1 E 2 h = 1 L w h π ^ h B d | w h = E 1 h = 1 L w h π h B = h = 1 L W h π h B = π B ,
E ( π ^ A B d ) = E 1 E 2 h = 1 L w h π ^ h A B d | w h = E 1 h = 1 L w h π h A B = h = 1 L W h π h A B = π A B .
Theorem 4. 
If the respondents are selected with simple random sampling from different strata, the variance of the stratified double estimator π ^ A d , π ^ B d , and π ^ A B d are given as following, respectively,
V ( π ^ A d ) = 1 n h = 1 L W h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 + h = 1 L W h ( π h A π A ) 2 + h = 1 L W h n 1 v h 1 π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 ,
V ( π ^ B d ) = 1 n h = 1 L W h π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 + h = 1 L W h ( π h B π B ) 2 + h = 1 L W h n 1 v h 1 π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 ,
and
V ( π ^ A B d ) = 1 n h = 1 L W h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A + P h ( 1 P h ) ( 2 T h 1 ) 2 ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 + h = 1 L W h ( π h A B π A B ) 2 + h = 1 L W h n 1 v h 1 π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 ,
where 0 v h ( = n h / n h ) 1 .
Proof. 
By [8] we can rewrite the estimator π ^ A d as follows
π ^ A d = h = 1 L w h π ^ h A d = h = 1 L w h π ^ h A + h = 1 L w h ( π ^ h A d π ^ h A )
The variance of the first term on right-hand side of equation is
V h = 1 L w h π ^ h A = 1 n h = 1 L w h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 + h = 1 L w h ( π h A π A ) 2 ,
and the variance of the second term on right-hand side of equation is
E 1 V 2 h = 1 L w h ( π ^ h A d π ^ h A ) = E 1 h = 1 L 1 n h 1 n h w h 2 π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 ,
Since n h = v h n h = v h w h n , then it becomes
= E 1 h = 1 L w h n 1 v h 1 π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2
= h = 1 L W h n 1 v h 1 π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 .
It can make (27) from these equation, (28) and (29) of the variance π ^ B d , and π ^ A B d in the same way. □
Next, we consider the proportional and the optimum allocation as methods of allocating overall sample of n to each stratum of n h and check variance for each case. If the first samples n and n h are used instead of the population size N and N h , the sample size of stratum h in proportional distribution is n h = n ( n h / n ) , so the variances of π ^ A d , π ^ B d and π ^ A B d are as follows respectively.
V P ( π ^ A d ) = 1 n h = 1 L W h ( π h A π A ) 2 + 1 n h = 1 L W h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 ,
V P ( π ^ B d ) = 1 n h = 1 L W h ( π h B π B ) 2 + 1 n h = 1 L W h π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 ,
V P ( π ^ A B d ) = 1 n h = 1 L W h ( π h A B π A B ) 2 + 1 n h = 1 L W h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 .
We can find the optimum values of n and v h to minimize V ( π ^ A d ) , V ( π ^ B d ) and V ( π ^ A B d ) for a specified cost. The cost function is
C = c n + h = 1 L c h n h ,
where c is a classification cost for unit and c h is the cost per unit in stratum h.
Since n h is a random variable, we should minimize the expected cost E ( C ) to obtain the optimum values of n and v h .
E ( C ) = C * = c n + h = 1 L c h E ( n h ) = c n + n h = 1 L c h v h W h .
The optimum values of v h that minimize the products of the expected cost (34) and variances (27)–(29) are obtained using Cauchy-Schwartz inequality respectively.
v h A = c c h π h A ( 1 π h A ) + π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 h = 1 L W h ( π h A π A ) 2 1 / 2 ,
v h B = c c h π h B ( 1 π h B ) + π h B ( 1 π h B ) + T h ( 1 T h ) ( 2 T h 1 ) 2 h = 1 L W h ( π h B π B ) 2 1 / 2 ,
v h A B = c c h π h A B ( 1 π h A B ) + π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 h = 1 L W h ( π h A B π A B ) 2 1 / 2 .
The optimum values of n A , n B and n A B can be obtained as follows by substituting the values of v h A , v h B and v h A B into the expected cost (34).
n A = C * c + h = 1 L c h W h c c h π h A ( 1 π h A ) + π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 h = 1 L W h ( π h A π A ) 2 1 / 2
n B = C * c + h = 1 L c h W h c c h π h B ( 1 π h B ) + π h B ( 1 π h B ) + P h ( 1 P h ) ( 2 P h 1 ) 2 h = 1 L W h ( π h B π B ) 2 1 / 2
n A B = C * c + h = 1 L c h W h c c h π h A B ( 1 π h A B ) + ( 2 P h 1 ) 2 T h ( 1 T h ) π h A + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 h = 1 L W h ( π h A B π A B ) 2 1 / 2 .
The minimum variances of π ^ A , π ^ B and π ^ A B are obtained by substituting the optimum values of n A , n B and n A B and v h A , v h B and v h A B into (27), (28) and (29) respectively.
V m i n ( π ^ A d ) = 1 C * c h = 1 L W h ( π h A π A ) 2 + h = 1 L W h c h π h A ( 1 π h A ) + P h ( 1 P h ) ( 2 P h 1 ) 2 2 ,
V m i n ( π ^ B d ) = 1 C * c h = 1 L W h ( π h B π B ) 2 + h = 1 L W h c h π h B ( 1 π h B ) + P h ( 1 P h ) ( 2 P h 1 ) 2 2 ,
V m i n ( π ^ A B d ) = 1 C * c h = 1 L W h ( π h A B π A B ) 2 + h = 1 L W h c h ( 2 P h 1 ) 2 T h ( 1 T h ) π h A + P h ( 1 P h ) ( 2 T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 2 P h 1 ) 2 ( 2 T h 1 ) 2 2 .

3. Stratified Estimation of Two Sensitive Attributes with the Crossed Model

3.1. Crossed Model with the Stratified Random Sampling

In this section, we will take the problem of applying stratified sampling to crossed model for estimating two sensitive attributes in the case where the size of each stratum for the population is precisely known. Let the population with size N be divided into disjoint L strata with size N h ( h = 1 ,   2 ,   ,   L ) and assume each stratum size is known. In stratum h, n h ( n = h = 1 L n h ) respondents are selected by simple random sampling with replacement. Each respondent is asked to answer “yes” or “no” according to his/her status using the following two randomization devices as following Table 3 and Table 4 without reporting the selected question to the interviewer with selection probability subject to P h + T h = 1 .
Hence, the probability of getting the response (yes, yes) from respondents in stratum h is given by
θ h 11 * = π h A B { P h T h + ( 1 P h ) ( 1 T h ) } π h A ( 1 P h ) ( 1 T h ) π h B ( 1 P h ) ( 1 T h ) + ( 1 P h ) ( 1 T h ) .
Similarly, the probability of getting the response (yes, no) from respondents in stratum h is given by
θ h 10 * = π h A B { P h T h + ( 1 P h ) ( 1 T h ) } π h A { ( 1 P h ) T h 1 } π h B ( 1 P h ) T h + ( 1 P h ) T h .
And, the probability of getting the response (no, yes) from respondents in stratum h is given by
θ h 01 * = π h A B { P h T h + ( 1 P h ) ( 1 T h ) } π h A P h ( 1 T h ) π h B { P h ( 1 T h ) T h 1 } + P h ( 1 T h ) .
Finally, the probability of getting the response (no, no) from respondents in stratum h is given by
θ h 00 * = π h A B { P h T h + ( 1 P h ) ( 1 T h ) } π h A P h T h π h B P h T h + P h T h ,
where π h A is the population proportion of respondents in stratum h possessing sensitive attribute A, π h B is the population proportion of respondents in stratum h possessing sensitive attribute B, π h A B is the population proportion of respondents in stratum h possessing both sensitive attributes A and B, and that θ h 11 * + θ h 10 * + θ h 01 * + θ h 00 * = 1 .
If n h 11 * , n h 10 * , n h 01 * and n h 00 * are the numbers of respondents responding (yes, yes), (yes, no), (no, yes), and (no, no) respectively from n h sampled respondents in stratum h, the observed proportions θ h 11 * , θ h 10 * , θ h 01 * and θ h 00 * are given by, respectively,
θ ^ h 11 * = n h 11 * n h , θ ^ h 10 * = n h 10 * n h , θ ^ h 01 * = n h 01 * n h , θ ^ h 00 * = n h 00 * n h .
The estimators of π h A , π h B and π h A B , the population proportions of sensitive attribute for stratum h are given by
π ^ h A * = 1 2 + ( T h P h + 1 ) ( θ ^ h 11 * θ ^ h 00 * ) + ( P h + T h 1 ) ( θ ^ h 10 * θ ^ h 01 * ) 2 ( P h + T h 1 ) ,
π ^ h B * = 1 2 + ( P h T h + 1 ) ( θ ^ h 11 * θ ^ h 00 * ) + ( P h + T h 1 ) ( θ ^ h 01 * θ ^ h 10 * ) 2 ( P h + T h 1 ) ,
and
π ^ h A B * = P h T h θ ^ h 11 * ( 1 P h ) ( 1 T h ) θ ^ h 00 * { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) .
Thus the stratified estimators of the proportion of sensitive attribute A, B, and both A and B are
π ^ A * = h = 1 L W h π ^ h A * ,
π ^ B * = h = 1 L W h π ^ h B * ,
π ^ A B * = h = 1 L W h π ^ h A B * ,
where W h = N h / N .
The variances of the stratified estimators π ^ A * , π ^ B * and π ^ A B * are given by
V ( π ^ A * ) = h = 1 L W h 2 n h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
V ( π ^ B * ) = h = 1 L W h 2 n h π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
V ( π ^ A B * ) = h = 1 L W h 2 n h π h A B ( 1 π h A B ) + π h A B { P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 } { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ,
And, the covariances of stratum estimators π ^ A * , π ^ B * , and π ^ A B * are given by
C o v ( π ^ A * , π ^ B * ) = h = 1 L W h 2 n h ( π h A B π h A π h B ) + 2 P h T h ( 1 P h ) ( 1 T h ) ( 1 + 2 π h A B π h A π h B ) ( P h + T h 1 ) 2 ,
C o v ( π ^ A B * , π ^ A * ) = h = 1 L W h 2 n h π h A B ( 1 π h A ) + π h A B T h ( 1 P h ) ( P h T h + 1 ) ( P h + T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( T h P h + 1 ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ,
C o v ( π ^ A B * , π ^ B * ) = h = 1 L W h 2 n h π h A B ( 1 π h B ) + π h A B P h ( 1 T h ) ( T h P h + 1 ) ( P h + T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( P h T h + 1 ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ,
In proportional allocation for stratified random sampling, the size of sample in stratum h is n h = n ( N h / N ) , so the variances of the stratified estimators π ^ A * , π ^ B * and π ^ A B * are given by
V P ( π ^ A * ) = 1 n h = 1 L W h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
V P ( π ^ B * ) = 1 n h = 1 L W h π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
V P ( π ^ A B * ) = 1 n h = 1 L W h π h A B ( 1 π h A B ) + π h A B [ P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ] { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ,
We use Cauchy-Schwartz inequality to obtain n h to minimize V ( π ^ A * ) , V ( π ^ B * ) and V ( π ^ A B * ) for specified cost function C in (17).
n h A = n W h c h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 1 / 2 h = 1 L W h c h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 1 / 2 ,
n h B = n W h c h π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 1 / 2 h = 1 L W h c h π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 1 / 2 ,
n h A B = n W h c h π h A B ( 1 π h A B ) + π h A B [ P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ] { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 1 / 2 h = 1 L W h c h π h A B ( 1 π h A B ) + π h A B [ P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ] { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 1 / 2 .
Hence. the minimum variances are
V m i n ( π ^ A * ) = 1 n h = 1 L W h c h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 1 / 2 × h = 1 L W h c h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 1 / 2 ,
V m i n ( π ^ B * ) = 1 n h = 1 L W h c h π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 1 / 2 × h = 1 L W h c h π h A ( 1 π h A ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 1 / 2 ,
V m i n ( π ^ A B * ) = 1 n h = 1 L W h c h π h A B ( 1 π h A B ) + π h A B [ P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ] { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 1 / 2 × h = 1 L W h c h π h A B ( 1 π h A B ) + π h A B [ P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ] { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 1 / 2 ,

3.2. Crossed Model with the Stratified Double Sampling

In the previous section, we addressed the problem of applying stratified sampling to crossed model for estimating two sensitive attributes in the case where the size of each stratum for the population is precisely known.
In this section, we apply stratified double sampling to crossed model, and we are going to deal with the problem of allocating samples to each stratum. The first sample is used to classify stratum by asking direct question and the second sample is used to estimate a sensitive attribute by using crossed model. According to the procedure in Section 2.2, the first sample of n respondents is selected by SRSWR from the population of size N consisted of L strata, and is asked to answer to direct question such as “Are you in stratum h?”. Then we classify the first sample into h stratum. In second sample, n h respondents are selected by SRSWR from first sample and asked to answer according to the randomization device of crossed model.
Hence, the probability of getting the response (yes, yes) from respondents in stratum h is the same as (44), the probability of getting the response (yes, no) is the same as (45), the probability of getting the response (no, yes) is the same as (46), and the probability of getting the response (no, no) is the same as (47).
If n h 11 , n h 10 , n h 01 and n h 00 are the numbers of respondents responding (yes, yes), (yes, no), (no, yes), and (no, no) respectively from n h sampled respondents in stratum h, the observed proportions θ ^ h 11 * , θ ^ h 10 * , θ ^ h 01 * and θ ^ h 00 * are given by
θ ^ h 11 * = n h 11 n h , θ ^ h 10 * = n h 10 n h , θ ^ h 01 * = n h 01 n h , θ ^ h 00 * = n h 00 n h .
The estimators of the population proportion of sensitive attributes in stratum h π h A , π h B and π h A B are given by
π ^ h A * d = 1 2 + ( T h P h + 1 ) ( θ ^ h 11 * θ ^ h 00 * ) + ( P h + T h 1 ) ( θ ^ h 10 * θ ^ h 01 * ) 2 ( P h + T h 1 ) ,
π ^ h B * d = 1 2 + ( P h T h + 1 ) ( θ ^ h 11 * θ ^ h 00 * ) + ( P h + T h 1 ) ( θ ^ h 01 * θ ^ h 10 * ) 2 ( P h + T h 1 ) ,
π ^ h A B * d = P h T h θ ^ h 11 * ( 1 P h ) ( 1 T h ) θ ^ h 00 * { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) .
Theorem 5. 
The stratified double estimators π ^ A * d , π ^ B * d , and π ^ A B * d using the crossed model for sensitive attributes A, B and both A and B are unbiased estimators of π A , π B , and π A B , respectively,
π ^ A * d = h = 1 L w h π ^ h A * d = h = 1 L w h 1 2 + ( T h P h + 1 ) ( θ ^ h 11 * θ ^ h 00 * ) + ( P h + T h 1 ) ( θ ^ h 10 * θ ^ h 01 * ) 2 ( P h + T h 1 ) ,
π ^ B * d = h = 1 L w h π ^ h B * d = h = 1 L w h 1 2 + ( P h T h + 1 ) ( θ ^ h 11 * θ ^ h 00 * ) + ( P h + T h 1 ) ( θ ^ h 01 * θ ^ h 10 * ) 2 ( P h + T 1 1 ) ,
π ^ A B * d = h = 1 L w h π ^ h A B * d = h = 1 L w h P h T h θ ^ h 11 * ( 1 P h ) ( 1 T h ) θ ^ h 00 * { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) .
Proof. 
Since π ^ h A * d , π ^ h B * d and π ^ h A B * d are unbiased estimators of π h A , π h B and π h A B , respectively, we
E ( π ^ A * d ) = E 1 E 2 h = 1 L w h π ^ h A * d | w h = E 1 h = 1 L w h π h A = h = 1 L W h π h A = π A ,
E ( π ^ B * d ) = E 1 E 2 h = 1 L w h π ^ h B * d | w h = E 1 h = 1 L w h π h B = h = 1 L W h π h B = π B ,
E ( π ^ A B * d ) = E 1 E 2 h = 1 L w h π ^ h A B * d | w h = E 1 h = 1 L w h π h A B = h = 1 L W h π h A B = π A B ,
Theorem 6. 
The variances of π ^ A * d , π ^ B * d and π ^ A B * d are given by
V ( π ^ A * d ) = 1 n h = 1 L W h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 + h = 1 L W h ( π h A π A ) 2 + h = 1 L W h n 1 v h 1 π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
V ( π ^ B * d ) = 1 n h = 1 L W h π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 + h = 1 L W h ( π h B π B ) 2 + h = 1 L W h n 1 v h 1 π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
V ( π ^ A B * d ) = 1 n h = 1 L W h π h A B ( 1 π h A B ) + π h A B [ P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ] { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 + h = 1 L W h ( π h A B π A B ) 2 + h = 1 L W h n 1 v h 1 π h A B ( 1 π h A B ) + π h A B [ P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ] { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 ,
where 0 v h ( = n h / n h ) 1 .
Proof. 
By [8], if we redefine the stratified double estimator π ^ A * d of π A for sensitive attribute A using crossed model as follow;
π ^ A * d = h = 1 L w h π ^ h A * d = h = 1 L w h π ^ h A * + h = 1 L w h ( π ^ h A * d π ^ h A * )
The variance of the first term on the right hand of equation is
V ( h = 1 L w h π ^ h A * ) = 1 n h = 1 L W h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 + h = 1 L W h ( π h A π A ) 2 ,
and the variance of the second term is
E 1 V 2 h = 1 L w h ( π ^ h A * d π ^ h A * ) = E 1 h = 1 L 1 n h 1 n h w h 2 π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 .
Since n h = v h n h = v h w h n
= E 1 h = 1 L w h n 1 v h 1 π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 , = h = 1 L W h n 1 v h 1 π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
We have (69) from those equations. We can obtain (70) and (71) in the same way.
Next, we consider the proportional and the optimum allocation as methods of allocating overall sample of to each stratum of n and check variance for each case. If the first samples n and n h are used instead of the population size N and N h , the sample size of stratum h in proportional distribution is n h = n ( n h / n ) , so the variances of π ^ A * d , π ^ B * d and π ^ A B * d are as follows, respectively.
V P ( π ^ A * d ) = 1 n h = 1 L W h ( π h A π A ) 2 + 1 n h = 1 L W h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
V P ( π ^ B * d ) = 1 n h = 1 L W h ( π h B π B ) 2 + 1 n h = 1 L W h π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 ,
and
V P ( π ^ A B * d ) = 1 n h = 1 L W h ( π h A B π A B ) 2 + 1 n h = 1 L W h π h A B ( 1 π h A B ) + π h A B { P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 } { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 + P h T h ( 1 P h ) ( 1 T h ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 .
We have to find the optimum values of n and v h that minimize V ( π ^ A * d ) , V ( π ^ B * d ) and V ( π ^ A B * d ) for a specified cost. The optimum values of v h that minimize the products of the expected cost (34) and the variances (72)–(74) are obtained using Cauchy-Schwartz inequality respectively.
v h A = c c h π h A ( 1 π h A ) + π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 h = 1 L W h ( π h A π A ) 2 1 / 2 ,
v h B = c c h π h B ( 1 π h B ) + π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 h = 1 L W h ( π h B π B ) 2 1 / 2 ,
v h A B = c c h π h A B ( 1 π h A B ) + π h A B ( 1 π h A B ) + π h A B { P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 } + P h T h ( 1 P h ) ( 1 T h ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 h = 1 L W h ( π h A B π A B ) 2 1 / 2 ,
The optimum values of n . (78)–(80) can be obtained as follows by substituting the values v h of (75), (76), and (77) into (34).
n A = C * c + h = 1 L c h W h c c h π h A ( 1 π h A ) + π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 h = 1 L W h ( π h A π A ) 2 1 / 2 ,
n B = C * c + h = 1 L c h W h c c h π h B ( 1 π h B ) + π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 h = 1 L W h ( π h B π B ) 2 1 / 2 ,
n A B = C * c + h = 1 L c h W h c c h π h A B ( 1 π h A B ) + π h A B ( 1 π h A B ) + π h A B { P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 } + P h T h ( 1 P h ) ( 1 T h ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 h = 1 L W h ( π h A B π A B ) 2 1 / 2 .
The minimum variances of π ^ A * d , π ^ B * d and π ^ A B * d are obtained by substituting the optimum values of n A , n B and n A B , v h A , v h B and v h A B into (72), (73) and (74), respectively.
V m i n ( π ^ A * d ) = 1 C * c h = 1 L W h ( π h A π A ) 2 + h = 1 L W h c h π h A ( 1 π h A ) + ( 1 P h ) T h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 2 ,
V m i n ( π ^ B * d ) = 1 C * c h = 1 L W h ( π h B π B ) 2 + h = 1 L W h c h π h B ( 1 π h B ) + ( 1 T h ) P h { P h T h + ( 1 P h ) ( 1 T h ) } ( 1 π h A π h B + 2 π h A B ) ( P h + T h 1 ) 2 2 ,
V m i n ( π ^ A B * d ) = 1 C * c h = 1 L W h ( π h A B π A B ) 2 + h = 1 L W h c h π h A B { P h 2 T h 2 ( 1 P h ) 2 ( 1 T h ) 2 { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 } + P h T h ( 1 P h ) ( 1 T h ) ( 1 π h A π h B ) { P h T h + ( 1 P h ) ( 1 T h ) } ( P h + T h 1 ) 2 2 .

4. Efficiency Comparisons

In this section, we compare the efficiency of the simple model and the crossed model by the stratified sampling method proposed in Section 2 and Section 3, respectively. We will compare the variances such as between (8) and (51), (9) and (52), and (10) and (53). Let N = 10,000   ( N 1 = 7000 ,   N 2 = 3000 ) , n = 1000   ( n 1 = 700 ,   n 2 = 300 ) , W 1 = 0.7 , W 2 = 0.3 and we calculate the relative efficiencies (REs) such that
R E 1 = V ( π ^ A ) V ( π ^ A * ) , R E 2 = V ( π ^ B ) V ( π ^ B * ) , R E 3 = V ( π ^ A B ) V ( π ^ A B * ) ,
We consider changing π h A , π h B and π h A B and in cases of selection probability for the first and second randomization device (1) P h = T h , (2) P h > T h , and (3) P h < T h , respectively. The values of over than 1 indicate that the crossed model of [7] by using stratified sampling method is more efficient than the simple model of [7].
In first case P h = T h , Table 5 shows that the means of relative efficiency are symmetrically on the left and right based on P h = T h = 0.5 . For example, when P h = T h = 0.4 , π h A B = 0.1 , π h A = 0.2 , π h B = 0.1 then the mean value of R E 1 is 2.411 in A, R E 2 is 2.411 in B, and R E 3 is 9.9 in AB, which is the same as when P h = T h = 0.6 , indicating that the efficiency of the cross model is higher than that of the simple model. As shown in Table 5, in all possible cases where P h = T h , the crossed model is more efficient than the simple model. In second case P h > T h , Table 6 shows the relative efficiency of the cross model compared to the simple model when P h is relatively larger than T h . When P h and T h have similar value, i.e., P h = 0.7 , T h = 0.6 , the efficiency of the crossed model is high, and the other case of the difference between P h and T h is relatively large, P h = 0.7 , T h = 0.2 , the relative efficiency of the crossed model is low. In third case P h < T h , Table 7 shows the relative efficiency of the cross model compared to the simple model when P h is relatively smaller than T h . When P h and T h have similar value, i.e., P h = 0.2 , T h = 0.4 , the efficiency of the crossed model is high, and the other case of the difference between P h and T h is relatively large, P h = 0.2 , T h = 0.7 , the relative efficiency of the crossed model is low.
From the numerical study, we can find that the proposed crossed model with stratified random sampling is more efficient than the simple model when the selection probability of randomization device I, P h , is similar with that of randomization device II, T h , that is, the difference between P h and T h is small.

5. Concluding Remarks

When the population is composed of strata as gender, region, or age group, we consider the simple model and crossed model by applying stratified random sampling which can estimate not only the stratum population proportion but also the whole population proportion for two sensitive attributes such as criminal violence and tax evasion at the same time. In addition, when the size of each stratum is unknown in stratified random sampling, we propose the simple model and crossed model by using stratified double sampling method. In each proposed model, the sample allocation of each stratum is dealt with in consideration of proportional allocation and optimal one. We compare the efficiency between the simple model and the crossed model according to the proposed stratified random sampling method. As a result, it was found that the crossed model by the stratified sampling method was more efficient than the simple model by the stratified sampling method when the difference between P h and T h is small. In particular, the bigger the difference between π h A , π h B , and π h A B and the smaller difference between P h and T h , then the crossed model is more efficient than the simple model under stratified random sampling.
When trying to estimate the proportion of domestic violence and drug use experience in a social survey at the same time, a randomized response model that considers two sensitive variables can be used. At this time, the stratified random sampling design proposed in this paper can be applied to secure the representativeness of the sample, and has the advantage of enabling estimation for the entire group and estimation by demographic characteristics.
Finally based on this study, we will consider the two sensitive variables by applying the probability proportional to size design as larger sampling strategy.

Author Contributions

Conceptualization, G.-S.L.; methodology, C.-K.S.; writing—original draft preparation, K.-H.H. and S.-H.K.; writing—review and editing, C.-K.S.; project administration and funding acquisition, G.-S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea(NRF) grant funded by the Ministry of Education (2022R1F1A1063263).

Data Availability Statement

Data are contained within the article.

Acknowledgments

We would like to thank the anonymous reviewers for their very careful reading and valuable comments/suggestions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Warner, S.L. Randomized response: A survey technique for eliminating evasive answer bias. J. Am. Stat. Assoc. 1965, 60, 63–69. [Google Scholar] [CrossRef] [PubMed]
  2. Greenberg, B.G.; Abul-Ela, A.-L.A.; Simmons, W.R.; Horvitz, D.G. The unrelated question randomized response model: Theoretical framework. J. Am. Stat. Assoc. 1969, 64, 520–539. [Google Scholar] [CrossRef]
  3. Le, T.-N.; Lee, S.-M.; Tran, P.-L.; Li, C.-S. Randomized Response Techniques: A Systematic Review from the Pioneering Work of Warner (1965) to the Present. Mathematics 2023, 11, 1718. [Google Scholar] [CrossRef]
  4. Ahn, S.C.; Lee, G.S. A stratified unrelated question model. J. Korean Data Anal. Soc. 2003, 5, 853–864. (In Korean) [Google Scholar]
  5. Kim, J.M.; Warde, W.D. A stratified Warner’s randomized response model. J. Stat. Plan. Inference 2004, 120, 155–165. [Google Scholar] [CrossRef]
  6. Ahn, S.C.; Lee, G.S. A stratified discrete quantitative randomized response model. J. Korean Data Anal. Soc. 2004, 6, 181–191. (In Korean) [Google Scholar]
  7. Lee, C.S.; Sedory, S.A.; Singh, S. Estimating at least seven measures of qualitative variables from a sample using randomized response technique. Stat. Probab. Lett. 2013, 83, 399–409. [Google Scholar] [CrossRef]
  8. Cochran, W.G. Sampling Techniques, 3rd ed.; John Wiley and Sons: New York, NY, USA, 1977. [Google Scholar]
Table 1. Randomization Device I of the simple model in stratum h.
Table 1. Randomization Device I of the simple model in stratum h.
QuestionSelection Probability
Question1Do you have sensitive attribute A h ? P h
Question2Do you have non-sensitive attribute A h c ? 1 P h
Table 2. Randomization Device II of the simple model in stratum h.
Table 2. Randomization Device II of the simple model in stratum h.
QuestionSelection Probability
Question1Do you have sensitive attribute B h ? T h
Question2Do you have non-sensitive attribute B h c ? 1 T h
Table 3. Randomization Device I of crossed model in stratum h.
Table 3. Randomization Device I of crossed model in stratum h.
QuestionSelection Probability
Question1Do you have a rare sensitive attribute A h ? P h
Question2Do you have non-sensitive attribute B h c ? 1 P h
Table 4. Randomization Device II of crossed model in stratum h.
Table 4. Randomization Device II of crossed model in stratum h.
QuestionSelection Probability
Question1Do you have sensitive attribute B h ? T h
Question2Do you have non-sensitive attribute A h c ? 1 T h
Table 5. The means and standard deviations of R E for the simple and crossed model when P h = T h .
Table 5. The means and standard deviations of R E for the simple and crossed model when P h = T h .
T h
P h = T h 0.4 0.6 0.7
π hAB π hA π hB RE 1 RE 2 RE 3 RE 1 RE 2 RE 3 RE 1 RE 2 RE 3
0.10.20.22.4112.4119.92.4112.4119.91.961.9613.884
(0.201)(0.201)(0.952)(0.201)(0.201)(0.952)(0.13)(0.133)(0.419)
0.32.6382.65210.8262.6382.65210.8262.1092.1594.154
(0.241)(0.243)(1.139)(0.241)(0.243)(1.139)(0.15)(0.157)(0.479)
0.42.9112.93811.9442.9112.93811.9442.2822.3684.465
(0.294)(0.297)(1.387)(0.294)(0.297)(1.387)(0.176)(0.185)(0.554)
0.53.253.28313.3253.253.28313.3252.4862.5924.828
(0.367)(0.371)(1.729)(0.367)(0.371)(1.729)(0.208)(0.220)(0.648)
0.30.22.6132.59810.932.6132.59810.932.0562.0094.264
(0.235)(0.234)(1.149)(0.235)(0.234)(1.149)(0.14)(0.140)(0.492)
0.32.882.8812.062.882.8812.062.2162.2164.584
(0.286)(0.286)(1.400)(0.286)(0.286)(1.400)(0.162)(0.165)(0.569)
0.43.2083.21913.4543.2083.21913.4542.4032.4374.956
(0.356)(0.357)(1.745)(0.356)(0.357)(1.745)(0.19)(0.196)(0.665)
0.53.6223.63915.223.6223.63915.222.6262.6745.395
(0.455)(0.457)(2.24)(0.455)(0.457)(2.240)(0.227)(0.234)(0.789)
0.20.20.22.0582.0588.192.0582.0588.191.7181.7193.068
(0.147)(0.147)(0.652)(0.147)(0.147)(0.652)(0.101)(0.103)(0.258)
0.32.2212.2338.812.2212.2338.811.8311.8753.231
(0.171)(0.172)(0.754)(0.171)(0.172)(0.754)(0.114)(0.119)(0.286)
0.42.4112.4339.5332.4112.4339.5331.962.0343.412
(0.201)(0.203)(0.882)(0.201)(0.203)(0.882)(0.13)(0.138)(0.319)
0.52.6382.66410.3862.6382.66410.3862.1092.1983.615
(0.241)(0.244)(1.047)(0.241)(0.244)(1.047)(0.150)(0.160)(0.358)
0.30.22.2052.1938.8952.2052.1938.8951.7971.7563.315
(0.167)(0.167)(0.761)(0.167)(0.167)(0.761)(0.107)(0.107)(0.294)
0.32.3922.3929.6252.3922.3929.6251.9181.9183.501
(0.197)(0.197)(0.89)(0.197)(0.197)(0.890)(0.122)(0.125)(0.327)
0.42.6132.62210.4862.6132.62210.4862.0562.0853.71
(0.235)(0.236)(1.057)(0.235)(0.236)(1.057)(0.14)(0.144)(0.367)
0.52.882.89311.5192.882.89311.5192.2162.2573.945
(0.286)(0.288)(1.275)(0.286)(0.288)(1.275)(0.162)(0.168)(0.415)
0.30.20.21.7961.7967.0011.7961.7967.0011.531.532.565
(0.112)(0.112)(0.477)(0.112)(0.112)(0.477)(0.08)(0.082)(0.179)
0.31.9181.9297.4481.9181.9297.4481.6191.6572.676
(0.127)(0.128)(0.54)(0.127)(0.128)(0.540)(0.089)(0.094)(0.195)
0.42.0582.0777.9572.0582.0777.9571.7181.7832.798
(0.147)(0.148)(0.615)(0.147)(0.148)(0.615)(0.101)(0.107)(0.213)
0.52.2212.2438.542.2212.2438.541.8311.9092.931
(0.171)(0.173)(0.708)(0.171)(0.173)(0.708)(0.114)(0.121)(0.234)
0.30.21.9081.8977.521.9081.8977.521.5961.5592.745
(0.125)(0.125)(0.545)(0.125)(0.125)(0.545)(0.085)(0.085)(0.200)
0.32.0462.0468.0332.0462.0468.0331.691.692.87
(0.144)(0.144)(0.621)(0.144)(0.144)(0.621)(0.095)(0.097)(0.218)
0.42.2052.2138.6232.2052.2138.6231.7971.8223.007
(0.167)(0.168)(0.715)(0.167)(0.168)(0.715)(0.107)(0.111)(0.239)
0.52.3922.4039.3062.3922.4039.3061.9181.9533.157
(0.197)(0.198)(0.832)(0.197)(0.198)(0.832)(0.122)(0.127)(0.264)
(Note: The first and second row values in each cell represent the mean and standard deviation of RE).
Table 6. The means and standard deviations of R E for the simple and crossed model when P h > T h .
Table 6. The means and standard deviations of R E for the simple and crossed model when P h > T h .
T h
P h = 0 . 7 0.2 0.4 0.6
π hAB π hA π hB RE 1 RE 2 RE 3 RE 1 RE 2 RE 3 RE 1 RE 2 RE 3
0.10.20.20.7760.0380.1150.3390.4140.5781.4844.2266.066
(0.065)(0.004)(0.01)(0.03)(0.037)(0.059)(0.108)(0.327)(0.649)
0.30.8460.0440.1260.3720.4580.6361.6064.6246.569
(0.077)(0.005)(0.013)(0.036)(0.046)(0.07)(0.127)(0.39)(0.757)
0.40.9320.050.140.4120.5110.7061.755.0887.164
(0.093)(0.006)(0.016)(0.045)(0.057)(0.086)(0.15)(0.471)(0.897)
0.51.0360.0570.1570.4620.5760.7941.9225.647.879
(0.115)(0.008)(0.02)(0.056)(0.072)(0.109)(0.181)(0.578)(1.081)
0.30.20.8490.0420.1280.3770.4540.6611.5844.486.825
(0.075)(0.004)(0.013)(0.037)(0.045)(0.073)(0.12)(0.368)(0.785)
0.30.9330.0490.1420.4180.5070.7341.724.9297.443
(0.091)(0.006)(0.016)(0.045)(0.056)(0.089)(0.142)(0.443)(0.929)
0.41.0350.0570.1590.4680.5730.8251.8815.468.186
(0.112)(0.008)(0.02)(0.056)(0.072)(0.112)(0.169)(0.542)(1.12)
0.51.1630.0660.1810.5320.6570.9432.0766.1019.098
(0.141)(0.01)(0.026)(0.073)(0.094(0.146)(0.206)(0.678)(1.38)
0.20.20.20.6650.0320.0980.2880.350.4871.2893.6374.902
(0.048)(0.003)(0.007)(0.022)(0.027)(0.042)(0.082)(0.242)(0.429)
0.30.7160.0370.1060.3110.3810.5271.383.9325.222
(0.055)(0.003)(0.009)(0.026)(0.032)(0.049)(0.094)(0.282)(0.484)
0.40.7760.0410.1150.3390.4170.5741.4844.2645.587
(0.065)(0.004)(0.01)(0.03)(0.038)(0.058)(0.108)(0.33)(0.551)
0.50.8460.0460.1260.3720.460.631.6064.6456.008
(0.077)(0.005)(0.012)(0.036)(0.046)(0.069)(0.127)(0.391)(0.634)
0.30.20.720.0350.1070.3160.3780.5471.3683.8245.425
(0.054)(0.003)(0.009)(0.026)(0.031)(0.051)(0.09)(0.268)(0.501)
0.30.780.040.1170.3440.4150.5961.4684.155.804
(0.064)(0.004)(0.01)(0.031)(0.037)(0.06)(0.104)(0.314)(0.571)
0.40.8490.0450.1280.3770.4580.6551.5844.5216.242
(0.075)(0.005)(0.013)(0.037)(0.046)(0.071)(0.12)(0.371)(0.657)
0.50.9330.0510.1420.4180.510.7261.724.9516.751
(0.091)(0.006)(0.015)(0.045)(0.056)(0.087)(0.142)(0.445)(0.765)
0.30.20.20.5810.0280.0860.250.3030.4211.143.1934.137
(0.037)(0.002)(0.006)(0.017)(0.02)(0.032)(0.065)(0.187)(0.31)
0.30.620.0310.0910.2680.3270.451.213.424.361
(0.042)(0.002)(0.006)(0.019)(0.023)(0.036)(0.073)(0.213)(0.343)
0.40.6650.0350.0980.2880.3530.4841.2893.674.612
(0.048)(0.003)(0.007)(0.022)(0.027)(0.042)(0.082)(0.245)(0.381)
0.50.7160.0380.1060.3110.3830.5231.383.9494.893
(0.055)(0.003)(0.009)(0.026)(0.032)(0.048)(0.094)(0.283)(0.426)
0.30.20.6260.030.0930.2720.3240.4681.2043.3364.53
(0.041)(0.002)(0.006)(0.019)(0.023)(0.038)(0.07)(0.204)(0.355)
0.30.670.0340.0990.2930.3510.5031.2813.5844.79
(0.047)(0.003)(0.007)(0.022)(0.027)(0.043)(0.079)(0.234)(0.394)
0.40.720.0380.1070.3160.3820.5431.3683.8595.082
(0.054)(0.003)(0.009)(0.026)(0.032)(0.05)(0.09)(0.27)(0.441)
0.50.780.0420.1170.3440.4170.5911.4684.1685.413
(0.064)(0.004)(0.01)(0.031)(0.038)(0.058)(0.104)(0.315)(0.497)
(Note: The first and second row values in each cell represent the mean and standard deviation of RE).
Table 7. The means and standard deviations of R E for the simple and crossed model when P h < T h .
Table 7. The means and standard deviations of R E for the simple and crossed model when P h < T h .
T h
P h = 0 . 2 0.4 0.6 0.7
π hAB π hA π hB RE 1 RE 2 RE 3 RE 1 RE 2 RE 3 RE 1 RE 2 RE 3
0.10.20.20.60112.3885.3930.1487.2631.1010.0380.7760.079
(0.047)(0.652)(0.682)(0.014)(0.491)(0.131)(0.004)(0.065)(0.008)
0.30.6513.2415.7520.1627.8781.20.0420.8660.087
(0.055)(0.74)(0.768)(0.017)(0.574)(0.153)(0.004)(0.079)(0.01)
0.40.70914.1786.1620.1798.5781.3190.0460.9670.096
(0.065)(0.845)(0.873)(0.02)(0.678)(0.182)(0.006)(0.097)(0.012)
0.50.77915.2156.6370.2019.3891.4640.0521.080.108
(0.078)(0.971)(1.003)(0.025)(0.812)(0.22)(0.007)(0.12)(0.015)
0.30.20.66312.256.2660.1697.4991.3070.0440.830.092
(0.054)(0.637)(0.825)(0.017)(0.523)(0.164)(0.005)(0.074)(0.01)
0.30.7213.0856.7130.1878.1541.4360.0490.9330.102
(0.063)(0.722)(0.939)(0.021)(0.615)(0.195)(0.006)(0.091)(0.013)
0.40.78813.9997.2310.218.9061.5940.0551.0490.115
(0.075)(0.824)(1.08)(0.026)(0.731)(0.237)(0.007)(0.114)(0.016)
0.50.87115.0097.8360.2399.7841.7920.0631.1840.131
(0.091)(0.945)(1.259)(0.034)(0.882)(0.295)(0.01)(0.144)(0.02)
0.20.20.20.52111.0794.2070.1256.3410.9090.0320.6650.068
(0.036)(0.522)(0.432)(0.01)(0.374)(0.093)(0.003)(0.048)(0.006)
0.30.55811.7634.4190.1366.8090.9750.0350.7330.074
(0.041)(0.585)(0.471)(0.012)(0.429)(0.105)(0.003)(0.057)(0.007)
0.40.60112.5014.6530.1487.3291.050.0380.8050.08
(0.047)(0.658)(0.517)(0.014)(0.495)(0.12)(0.004)(0.067)(0.008)
0.50.6513.3014.9140.1627.9131.1390.0420.8820.088
(0.055)(0.743)(0.57)(0.017)(0.577)(0.138)(0.004)(0.08)(0.01)
0.30.20.57210.9684.810.1426.521.0610.0370.7040.078
(0.041)(0.512)(0.503)(0.012)(0.396)(0.112)(0.003)(0.054)(0.007)
0.30.61411.6395.0660.1557.0141.1430.040.780.085
(0.046)(0.573)(0.552)(0.014)(0.455)(0.127)(0.004)(0.064)(0.009)
0.40.66312.3615.3490.1697.5671.2390.0440.8610.093
(0.054)(0.643)(0.61)(0.017)(0.528)(0.147)(0.005)(0.077)(0.01)
0.50.7213.1445.6670.1878.1911.3530.0490.950.103
(0.063)(0.726)(0.678)(0.021)(0.618)(0.173)(0.006)(0.093)(0.012)
0.30.20.20.46110.023.490.1095.6270.7770.0280.5810.06
(0.029)(0.428)(0.31)(0.008)(0.295)(0.071)(0.002)(0.037)(0.005)
0.30.48910.5823.6320.1175.9970.8230.030.6350.064
(0.032)(0.474)(0.332)(0.009)(0.333)(0.078)(0.002)(0.043)(0.005)
0.40.52111.1793.7880.1256.3990.8760.0320.690.069
(0.036)(0.527)(0.357)(0.01)(0.378)(0.087)(0.003)(0.05)(0.006)
0.50.55811.8163.9570.1366.840.9360.0350.7460.074
(0.041)(0.587)(0.385)(0.012)(0.431)(0.097)(0.003)(0.058)(0.007)
0.30.20.5039.9293.9520.1225.7670.8960.0310.6110.068
(0.032)(0.42)(0.353)(0.009)(0.31)(0.083)(0.002)(0.041)(0.006)
0.30.53510.4824.1210.1316.1550.9530.0340.670.073
(0.036)(0.465)(0.38)(0.011)(0.351)(0.092)(0.003)(0.048)(0.006)
0.40.57211.0684.3050.1426.5791.0180.0370.730.079
(0.041)(0.516)(0.41)(0.012)(0.399)(0.103)(0.003)(0.056)(0.007)
0.50.61411.6924.5060.1557.0461.0930.040.7940.086
(0.046)(0.575)(0.445)(0.014)(0.457)(0.117)(0.004)(0.065)(0.008)
(Note: The first and second row values in each cell represent the mean and standard deviation of RE).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lee, G.-S.; Hong, K.-H.; Kim, S.-H.; Son, C.-K. Estimation for Two Sensitive Variables Using Randomization Response Model Under Stratified Random Sampling. Mathematics 2025, 13, 297. https://doi.org/10.3390/math13020297

AMA Style

Lee G-S, Hong K-H, Kim S-H, Son C-K. Estimation for Two Sensitive Variables Using Randomization Response Model Under Stratified Random Sampling. Mathematics. 2025; 13(2):297. https://doi.org/10.3390/math13020297

Chicago/Turabian Style

Lee, Gi-Sung, Ki-Hak Hong, Sung-Hwan Kim, and Chang-Kyoon Son. 2025. "Estimation for Two Sensitive Variables Using Randomization Response Model Under Stratified Random Sampling" Mathematics 13, no. 2: 297. https://doi.org/10.3390/math13020297

APA Style

Lee, G.-S., Hong, K.-H., Kim, S.-H., & Son, C.-K. (2025). Estimation for Two Sensitive Variables Using Randomization Response Model Under Stratified Random Sampling. Mathematics, 13(2), 297. https://doi.org/10.3390/math13020297

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop