Next Article in Journal
Influence of the Aqueous Environment on Protein Structure—A Plausible Hypothesis Concerning the Mechanism of Amyloidogenesis
Previous Article in Journal
Fuzzy Shannon Entropy: A Hybrid GIS-Based Landslide Susceptibility Mapping Method
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Propositions for Confidence Interval in Systematic Sampling on Real Line

by
Mehmet Niyazi Çankaya
Department of Statistics, Faculty of Arts and Science, University of Uşak, Ankara-İzmir Karayolu 8.Km. 1.Eylül Kampüsü, UŞAK 64200, Turkey
Entropy 2016, 18(10), 352; https://doi.org/10.3390/e18100352
Submission received: 14 June 2016 / Revised: 14 September 2016 / Accepted: 21 September 2016 / Published: 28 September 2016
(This article belongs to the Section Information Theory, Probability and Statistics)

Abstract

:
Systematic sampling is used as a method to get the quantitative results from tissues and radiological images. Systematic sampling on a real line ( R ) is a very attractive method within which biomedical imaging is consulted by practitioners. For the systematic sampling on R , the measurement function ( M F ) occurs by slicing the three-dimensional object equidistant systematically. The currently-used covariogram model in variance approximation is tested for the different measurement functions in a class to see the performance on the variance estimation of systematically-sampled R . An exact calculation method is proposed to calculate the constant λ ( q , N ) of the confidence interval in the systematic sampling. The exact value of constant λ ( q , N ) is examined for the different measurement functions, as well. As a result, it is observed from the simulation that the proposed M F should be used to check the performances of the variance approximation and the constant λ ( q , N ) . Synthetic data can support the results of real data.

1. Introduction

Systematic sampling, often used in the area of biomedical imaging, is a design-based approach for estimating a parameter Q of the geometrical quantities, such as volume, area, surface area and length. In the systematic sampling principle, geometrical objects are sampled with probes, such as lines, a regular grid or designed patterns. The probes superimposed on the geometrical objects are tools for us to get the quantitative values of geometrical objects. If we have an equidistant systematic sampling on geometrical objects, we will have the estimated values for each replication of the sampling of these objects. The estimated values obtained from each replication of this sampling on geometrical objects produce a fluctuation [1,2,3,4,5]. Therefore, this fluctuation is modeled by Fourier transformation F 1 , which was considered by [6,7]. Since we have the estimated values for parameter Q, the variance estimation is needed. For the systematic sampling on R , also called Cavalieri sampling, previous studies shed some light on the variance estimation for the systematic sampling on a real line [8,9,10,11,12,13,14,15]. The main source in these studies is the inspiration of Matheron’s theory. Matheron proposed his covariogram model g ( h ) intuitively. In this study, we aim to focus on testing the performance of this covariogram model for the different measurement functions in the regular and irregular forms. It is important to test the capability of fitting the performance of his covariogram model, because the sampling directions (axises), which are axial, coronal or sagittal, produce the different measurement functions, as implied by [2]. There is no study addressing the performance evaluation of the covariogram model for the different measurement functions. In this context, the works in Ref. [1,2,3,4,5,6,7,8,9,10,11,13,14,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31] used Matheron’s covariogram model for the different measurement functions.
After having the estimated value for the parameter Q, said as Q ^ , we should focus on the confidence interval for Q ^ . The work in Ref. [2] proposes two approaches for the coefficient of the confidence interval. The first one is based on statistical theory. In this study, the second proposition of [2] is examined. The main motivation is to find a calculation method for the constant λ ( q , N ) , which is used to construct the confidence interval for Q ^ . In this sense, we will give the exact calculation method for λ ( q , N ) , so we have more accurate information for the confidence interval. We also test the exact values of λ ( q , N ) for the different measurement functions.
The paper is organized as follows. Section 2 introduces the materials in the systematic sampling on R and describes the exact calculation method for constant λ ( q , N ) , as well. A simulation study and real data examples are given in Section 3. Section 4 considers the discussions of the results.

2. Materials and Methods

The estimation for volume Q, the empirical true variance, the true variance, the variance approximation and the confidence interval for systematic sampling on R with the variance approximation formula are introduced briefly.

2.1. Estimations for Volume Q and the Variance of Q ^

2.1.1. Estimation for Volume Q

Suppose that we have a three-dimensional geometrical object. This object is a fixed, bounded, non-void with a piecewise smooth boundary of finite surface area of Q, except fractals. The volume value of the object is parameter Q. Let parameter Q be estimated. To get the estimation value for the parameter Q, Cavalieri planes called measurement function f are used. The function f has to represent the shape of the geometrical object. The mathematical expression for the volume estimation with Cavalieri planes is given in the following form,
Q ^ ( u , T ) = T j = 0 n 1 f ( ( u + j ) T ) .
T is a constant distance among the slices obtained from the three-dimensional object. u is a uniform random variable in the interval [ 0 , T ) . Q ^ depends on the random variable u and the slice thickness or the number of systematic samplings, n.
The problem is about predicting V a r ( Q ^ ) = E ( Q ^ Q ) 2 = E ( Q ^ 2 ) Q 2 . Since the uniform distribution defined at [ 0 , T ) is used, E ( Q ^ 2 ) = 1 / T 0 T Q ^ 2 d u . In following subsections, we will introduce some important formulae for the variance estimation and its counterparts [6,7,8,9,10,11,12,13,14,15,18,19,20,21,22].

2.1.2. Variance of Q ^

The behavior of the variance of the Cavalieri estimator is strongly connected to analytical properties of the measurement function f. An aspect coming from Matheron’s transitive theory is given in [8] for these properties. The variance of Cavalieri sampling changes with a fractional power of T [2,9]. The fact that there is a fractional power for T is not pointed out by [13,14]. In this sense, [9] is an extension of [13]. For this reason, we want to test the performance of the covariogram model in Equation (12) and the variance extension term in Equation (20) for the different measurement functions.
For practical purposes, since the true M F is not known, we do not get the true variance of systematic sampling on R . For this reason, the variance approximation known as a variance extension ( V a r E ( Q ^ ) ) term is given. It is in the two forms coming from Fourier transformation basically given in [8] and the properties of measurement functions proposed by [9].
Firstly, we will introduce the empirical true variance for Q ^ . Secondly, the true variance for Q ^ is defined in Equation (5). It includes the covariogram function. After the covariogram function is defined, the measurement functions will be introduced.
The empirical true variance of systematic sampling on R is calculated by the formula given below,
V a r ( Q ^ ) = 1 m r = 1 m ( Q ^ r Q ) 2 ,
where m is the number of replication for the systematic sampling on R . Q ^ r is an estimated value at a uniformly random starting point u for the area. Q is the true value for the area under the M F [16,18,32].
The empirical true variance of systematic sampling on R is also:
C E 2 ( Q ^ ) = V a r ( Q ^ / Q ) ,
which is in the context of the coefficient of the error square.
The coefficient of error for the empirical true variance is calculated by:
C E ( Q ^ ) = V a r ( Q ^ ) / Q 2 .
In the simulation section, Equation (4) is used to test the performance of the variance approximation formula in Equation (20). The true variance is calculated by a formula in Equation (5). A calculation of the true variance by means of Equation (5) is intractable (see Ref. [8] for more details). The true variance is comprised of three components, which are the variance extension term, Zitterbewegung and the higher-order terms, respectively. These terms were obtained in Ref. [9] (see Section 6 in Ref. [9] for detailed expressions).
V a r ( Q ^ ) = T k = g ( k T ) g ( h ) d h ,
where
g ( h ) = R f ( x ) · f ( x + h ) d x , h R
is a covariogram function of the measurement function f. It is proposed by Matheron’s transitive theory and is known to be the convolution of f with its reflection, G ( t ) = F 1 g = ( F 1 f ) ( F 1 ¯ f ) ; F 1 expresses the Fourier transform defined as:
G ( t ) = F 1 g ( h ) = g ( h ) e x p ( 2 π i t h ) d h .
Note that the subscript is for the dimension. In this case, we are interested in the one-dimensional systematic sampling.
Suppose that we have measurement functions in Equations (8)–(11). These functions are used for the simulation. They can represent biological objects. They are given with the following forms, and the function f has positive values for each value of variable x, namely f : R R + .
f ( x ) = ( 1 x 2 ) q , x [ 1 , 1 ] , q [ 0 , 1 ] ,
f ( x ) = ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q , x [ 1 , 1 ] , q [ 0 , 1 ] ,
where q is a parameter of the smoothness constant.
f ( x ) = e x p ( s i n ( x 3 ) ) , x [ 38 π / 100 , 53 π / 100 ] ,
and
f ( x ) = ( 5 / 112 ) ( 54 x 4 25 x 3 + 48 x 2 + 25 x + 6 ) , x [ 1 , 1 ] .
They represent the area measured on the each slice of three-dimensional biological objects. The measurement functions in Equations (8) and (11) are used by [9,33], respectively. Each function f and its covariogram function can be an example for the representation of q, even though the function f does not have the parameter q. The two approximately same functions can be proposed, f i f j , where i j , i , j = N , which have different mathematical expressions; however, they can be a neighborhood of each other. For example, the measurement functions in Equations (10) and (11) can be a more modified version of M F in Equation (9). M F in Equation (9) can be a modified version of M F in Equation (8).
Since the covariogram functions of measurement functions in Equations (8)–(11) (displayed by Figure 1 and Figure 2) cannot be calculable to get the integral values, a model for the covariogram functions should be proposed. In this sense, the covariogram functions can be modeled by a polynomial with the fractional power,
g ( h ) = b 0 + b j | h | j + b 2 h 2 , j = 2 q + 1 , q [ 0 , 1 ] .
Equation (12) is defined to be a covariogram model [6,7,8,9,18].

2.1.3. Approximate Variance of Q ^

By using Equation (5) and the properties of G ( t ) , we finally get:
V a r ( Q ^ ) = T 1 0 T Q ^ 2 d u Q 2 = k = G ( k / T ) G ( 0 ) = 2 k = 1 G ( k / T )
(detailed expressions are given in Ref. [6,7,8,9]).
The Fourier transformations of parts h 0 and h 2 in the covariogram model in Equation (12) are zero. Thus, the variance extension term of systematic sampling on R is obtained by using the formula given in Equations (9.1) and (9.2) in [8] as follows,
F 1 | h | j = b ( j , 1 ) ϱ ( j + 1 ) , ( j > 1 , non-even )
which is the Fourier transformation of | h | j for the one-dimensional systematic sampling. b ( j , 1 ) = π j 1 / 2 Γ ( j + 1 2 ) Γ ( j 2 ) .
By using Equations (12)–(14), Equation (15) is obtained.
V a r E ( Q ^ ) = 2 b j b ( j , 1 ) T j + 1 ζ ( j + 1 ) V a r E ( Q ^ ) = α ( q ) ( 3 g ( 0 ) 4 g ( T ) + g ( 2 T ) ) V a r E ( Q ^ ) = α ( q ) ( 3 C 0 4 C 1 + C 2 ) T 2
where
α ( q ) = 2 π ( 2 q + 3 / 2 ) Γ ( q + 1 ) ζ ( 2 q + 2 ) ( 2 2 q + 1 4 ) Γ ( 2 q + 1 2 ) , q R ,
g ^ ( k T ) = T C k ,
Γ, ζ are Gamma and Riemann’s Zeta functions in [34], respectively. g ^ ( k T ) is unbiased point estimators of g ( k T ) , k = 0 , 1 , 2 , . . . .
Equation (16) is obtained according to Fourier transformation basically given in [8]. However, Equation (18) is obtained according to the generalized version of the refined Euler–MacLaurin, proposed by [9].
α ( q ) = Γ ( 2 q + 2 ) ζ ( 2 q + 2 ) c o s ( q π ) ( 2 π ) 2 q + 2 ( 1 2 2 q 1 ) , q [ 0 , 1 ] ,
V a r E ( Q ^ ) = 2 π ( 2 q + 3 / 2 ) Γ ( q + 1 ) ζ ( 2 q + 2 ) ( 2 2 q + 1 4 ) Γ ( 2 q + 1 2 ) ( 3 C 0 4 C 1 + C 2 ) T 2 .
Note that Equations (16) and (18) give the same results, because the connection between the generalized version of the refined Euler–MacLaurin summation formula and Matheron’s transitive theory is expressed by [9].
We calculate the estimated coefficient of error for the systematic sampling on R by means of Equation (20) given below,
v a r E ^ ( Q ^ ) / Q 2 = α ( q ) ( 3 C 0 4 C 1 + C 2 ) T 2 / Q 2 c e ^ ( Q ^ ) = α ( q ) ( 3 C 0 4 C 1 + C 2 ) ( i = 1 n f i ) 1
where
C k = i = 1 n k f i · f i + k , k = 0 , 1 , , n 1 .
This is defined to be a coefficient of error of Matheron’s covariogram model. Equations (6) and (21) have in fact inheritance. n 2 k + 1 observations are required due to Equation (21) [6,7,8,9,17,20,21].
The estimation values of parameter q from ( 1 x 2 ) q , ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q , e x p ( s i n ( x 3 ) ) and ( 5 / 112 ) ( 54 x 4 25 x 3 + 48 x 2 + 25 x + 6 ) measurement functions will be obtained at the simulation section. The works in Ref. [1,2,3,4,5,6,7,8,9,10,11,13,14,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31] used the covariogram model for the different measurement functions. We aim to show the performance of the covariogram model in Equation (12) for the different measurement functions. In other words, the capability of the covariogram model for fitting performance on the covariogram functions is tested, so the information gained by this model is displayed by the simulation results. By using this model and the variance extension term in Equation (15), the estimation formula of parameter q is given in the following subsection.

2.1.4. Estimation Formula of Parameter q

The estimation formula of smoothness constant q is proposed by [9]. In this subsection, we will get it via the variance extension term in Equation (15) in the framework of the Fourier transformation. The formula we got for q is the same as the formula in [9].
The covariogram model g can be declared by integer k values. If h = i T is near zero, Equation (7) has more information. By using Equations (12) and (15), respectively,
g ( i T ) = b 0 + b 2 q + 1 ( | i T | ) 2 q + 1 + b 2 ( i T ) 2 , i = 0 , k , 2 k ,
V a r E ( Q ^ ) = 2 b 2 q + 1 b ( 2 q + 1 , 1 ) T 2 q + 2 ζ ( 2 q + 2 ) V a r E ( Q ^ ) = 2 π ( 2 q + 3 / 2 ) Γ ( q + 1 ) Γ ( 2 q + 1 2 ) ζ ( 2 q + 2 ) T · 3 g ( 0 ) 4 g ( k T ) + g ( 2 k T ) k 2 q + 1 ( 2 2 q + 1 4 )
can be obtained. By using Equations (19) and (22),
1 k 2 q + 1 [ 3 g ( 0 ) 4 g ( k T ) + g ( 2 k T ) ] = [ 3 g ( 0 ) 4 g ( T ) + g ( 2 T ) ]
is obtained and then:
q = 1 2 l o g ( k ) · l o g 3 g ( 0 ) 4 g ( k T ) + g ( 2 k T ) 3 g ( 0 ) 4 g ( T ) + g ( 2 T ) 1 2 ,
where, k = 2 , 3 , . . . .
By using Equation (17),
q ^ = 1 2 l o g ( k ) · l o g 3 C 0 4 C k + C 2 k 3 C 0 4 C 1 + C 2 1 2 , k = 2 , 3 , . . . .
The estimator q ^ is obtained when the M F is obtained by planimetry. The planimetry is also called automatic pixel counting given in the section for real data.

2.2. Confidence Interval in Systematic Sampling on ℝ

We will give an exact calculation of λ ( q , N ) in a generalized version of the refined Euler–MacLaurin summation formula with a fractional power of measurement functions. A brief introduction and some tools will be given in the next subsection to get the formula of λ ( q , N ) for the confidence interval of Q ^ .

2.2.1. Confidence Interval of Q ^ : λ ( q , N )

Some definitions expressed in Ref. [2,9] will be given. A theorem will be proposed for a tool in the constant λ ( q , N ) .
A bounded interval for the difference ( Q ^ Q ) , which is defined as a generalized version of the refined Euler–MacLaurin summation formula, is:
| Q ^ Q | T q + 1 P q + 1 * i = 1 N | S f ( q ) ( a i ) |
where
P q + 1 * = m a x { Δ , β } { 2 ( 2 π ) q + 1 j = 1 1 j q + 1 c o s ( 2 π Δ j π 2 ( q + 1 ) + β ) } ,
Δ [ 0 , 1 ) and β [ 0 , 2 π ] . { a i , i = 1 , 2 , , N } is the set of a real line at which function f ( q ) exhibits discontinuities. N is the number of sets. The difference Q ^ Q and ( S f ( q ) ( a i ) ) 2 are given by Equations (6) and (11) in Ref. [2]. If N is greater than 2, the information in Q ^ Q decreases (for detailed expressions, see Ref. [2]). The preliminaries were given. Now, we will use the tools in Ref. [2,9]. After some straightforward calculations in the following steps, the formula of λ ( q , N ) will be obtained.
By means of the Cauchy–Schwarz inequality ( i = 1 N | y i | ) 2 N i = 1 N y i 2 , where y i = S f ( q ) ( a i ) ,
i = 1 N | S f ( q ) ( a i ) | 2 N i = 1 N S f ( q ) ( a i ) 2 .
When Equations (26) and (28) are used,
| Q ^ Q | T q + 1 P q + 1 * i = 1 N | S f ( q ) ( a i ) | | Q ^ Q | T q + 1 P q + 1 * N i = 1 N S f ( q ) ( a i ) 2
is obtained.
The following formula having the parameter q is proposed by [9],
V a r E ( Q ^ ) = T 2 q + 2 P 2 q + 2 , T ( 0 ) c o s ( π q ) i = 1 N S f ( q ) ( a i ) 2 .
When i = 1 N S f ( q ) ( a i ) 2 in Equation (30) is written in Equation (29), Equation (31) is obtained.
| Q ^ Q | P q + 1 * N c o s ( π q ) P 2 q + 2 , T ( 0 ) V a r E ( Q ^ ) , | Q ^ Q | λ ( q , N ) V a r E ( Q ^ ) ,
λ ( q , N ) = P q + 1 * N c o s ( π q ) P 2 q + 2 , T ( 0 ) , q [ 0 , 1 ] .
λ ( q , N ) is a function of q and N. In the following steps, we will give a definition for the function P k , T in [34]. The function P q + 1 * in Equation (27) is required to apply Theorem 1 for the exact calculation of λ ( q , N ) .
From Equation (31), for true parameter Q, a bounded interval (or 100 % confidence interval) is given as [2,3]:
Q ^ λ ( q , N ) V a r E ( Q ^ ) , Q ^ + λ ( q , N ) V a r E ( Q ^ ) .
We need the following periodic function with period T to get the values of constant λ ( q , N ) [9,34],
P k , T ( x ) = P k x T x T = 2 ( 2 π ) k j = 1 c o s ( 2 π j ( x / T ) ( 1 / 2 ) π k ) j k ,
where x R , k = 2 , 3 , . . . .
From Equation (34), P 2 q + 2 , T ( 0 ) is obtained,
P 2 q + 2 , T ( 0 ) = 2 ( 2 π ) 2 q + 2 j = 1 c o s ( 2 π j ( 0 / T ) ( 1 / 2 ) π ( 2 q + 2 ) ) j 2 q + 2 P 2 q + 2 , T ( 0 ) = 2 ( 2 π ) 2 q + 2 ζ ( 2 q + 2 ) c o s ( π q ) .
We will give the following theorem to get the values P q + 1 * exactly.
Theorem 1.
1.
Supposing that we have a right-open side interval [ a , b ) , the maximum value of the interval is:
lim h 0 ( b h ) = lim h ( b 1 / h ) .
2.
Supposing that we have a left-open side interval ( a , b ] , the maximum value of the interval is:
lim h 0 ( a + h ) = lim h ( a + 1 / h ) .
3.
Supposing that we have a left-open side and right-open side interval ( a , b ) , the maximum value of the interval is:
lim h 0 ( a + h ) = lim h ( a + 1 / h ) ,
lim h 0 ( b h ) = lim h ( b 1 / h ) .
In order to calculate P q + 1 * in Equation (27) exactly, Δ is replaced with 1 h , because we must take the maximum value of Δ. When h 0 , we get the result for the function P q + 1 * given in the following form,
P q + 1 * = lim h 0 2 ( 2 π ) q + 1 j = 1 1 j q + 1 c o s ( 2 π ( 1 h ) j π 2 ( q + 1 ) + 2 π ) .
Now, we can explain how to get the values of constant λ ( q , N ) below. In the following lines, we will give the limit values done by Mathematica 7, 8 or 9.
When N and P 2 q + 2 , T ( 0 ) given in Equation (32) are replaced with 2 and Equation (35), respectively, we get an equation,
λ ( q , N ) = P q + 1 * 2 ( 2 π ) 2 q + 2 2 ζ ( 2 q + 2 ) .
The codes written in Mathematica 7 or higher versions 8 and 9 do not compute the value of λ ( 0 , 2 ) and give infinity. Therefore, for the calculation of q = 0 , we use Equation (38) in order to give the result.
By means of Equation (37), we get λ ( 0 , 2 ) :
P q + 1 * = lim h 0 2 ( 2 π ) j = 1 1 j c o s ( 2 π ( 1 h ) j + 3 π 2 ) = 1 / 2 λ ( 0 , 2 ) = 1 2 · ( 2 π ) 2 ζ ( 2 ) = 2 . 44949 .
By using Equation (37), for q = 0 . 1 ,
λ ( 0 . 1 , 2 ) = lim h 0 2 ( 2 π ) 1 . 1 j = 1 1 j 1 . 1 c o s ( 2 π ( 1 h ) j π 2 ( 1 . 1 ) + 2 π ) · ( 2 π ) 2 . 2 ζ ( 2 . 2 ) = 2 . 71243
is obtained. For other q values, a similar procedure is performed. By using the formula given in Equation (32), the values in Table 1 are obtained for N = 2 , N = 3 and N = 4 [35].
It is attested that l i m h ( 1 1 / h ) and l i m h 0 ( 1 h ) give the same values for λ ( q , N ) , which is used to define the right-open side interval.
Moreover, in order to construct the confidence interval for the dataset,
Q ^ λ ( q , N ) · c e ^ ( Q ^ ) · Q ^ Q Q ^ + λ ( q , N ) · c e ^ ( Q ^ ) · Q ^
are used, where λ ( q , N ) is the coefficient of the confidence interval. c e ^ is an estimated coefficient of error [2,3].

2.2.2. Exact and Computational Values of λ ( q , N )

We got the values of λ ( q , N ) for different q and N values by using the exact calculation. This calculation was done by the Mathematica 7 or the higher Versions 8 and 9. One can get the values of λ ( q , N ) for the different q values with N = 2 , N = 3 and N = 4 . When the number of sequences in codes prepared by [2] is increased, the computational values of λ ( q , N ) converge to the exact values of λ ( q , N ) (see Table 1 and Table 2). Because getting the computational values of λ ( q , N ) for N = 2 would be unlikely to be useful, we give up computing some of them. S i : the number of sequences increased by a user, i : 1 , 2 , 3 , 4 .

3. Results

This section will give the simulation results for the estimation of parameter q in the variance extension term in Equation (20). Together with the estimated q ^ with k = 2 for Equations (8)–(11), it is planned to see whether or not the confidence interval includes the true value of volume. While constructing the confidence interval, the approximate variance based on Matheron’s covariogram model and the empirical true variance estimations are used. Real data examples are given to test the performance of the confidence interval.

3.1. Simulation

3.1.1. Plan and Output of Simulation

C E M C , C E E T , C P C E M C and C P C E E T are the abbreviations for the coefficient of error of Matheron’s covariogram model, the coefficient of error for the empirical true variance, the coverage probability of confidence interval based on the coefficient of error for the Matheron’s covariogram model and the coverage probability of confidence interval based on the coefficient of error for the empirical true variance, respectively. The last two abbreviations are for the coverage probabilities of confidence intervals based on C E M C and C E E T . If the true value Q is in the confidence interval, the confidence interval is successful. The numbers of successes for each replication of sampling are counted and divided by the number of replications. Thus, the values of the coverage probability of the confidence interval for a parameter Q are computed. V a r ^ ( q ^ ) and M S E ^ ( q ^ ) are the simulated variance and the simulated mean squared error of the estimator q ^ , respectively. The k in q formula was taken to be two, because h = i T must be near zero in order to increase the information in Equation (7).
In the simulation performed, the number of replication is 3000, and the numbers of systematic samplings, n, on R are from 5–20 for the measurement functions in Equations (8)–(11). The number of systematic samplings is taken to be 20 at most for practical purposes.
The following lines represent the produced x values of M F , the values of M F , the estimated values of parameter Q and the coefficient of error for the empirical true variance in three blocks given below.
FOR i FROM 1 TO the number of replicationFOR j FROM 0 TO the number of systematic sampling minus one
  T(j)=Length of interval of domain of function / the number of systematic sampling
 FOR k FROM 1 TO the number of systematic sampling
   x(k,j,i)=Left end point of the domain of the measurement function+(u(j,i)+j)T(j)
   y(k,j,i)=Measurement function(x(k,j,i))
 END FOREND FOR
END FORFOR i FROM 1 TO the number of replicationFOR j FROM 1 TO the number of systematic sampling
   Estimated Q in replicated sampling=(Summation of y in replicated sampling)T(j)END FOR
END FORFOR i FROM 1 TO the number of systematic sampling
 Summation of empirical variance in each number of systematic sampling is assigned to be zeroFOR j FROM 1 TO the number of replication
   Empirical variance in each replication=(Estimated Q in each replication-True value)^2;
   Summation of empirical variance in each replicationEND FOR
 Mean of empirical variance=Summation of empirical variance in each replication/The number of replication
 The coefficient of error for the empirical true variance=The root of mean of empirical variance/True value
END FOR

3.1.2. Estimations for Parameter q and Variance Approximation of Q ^

As implied by p. 319 in [8], it is obvious that the changing of M F affects the variance extension term given in Equation (15). In this sense, the covariogram model in Equation (12) can be adopted for some measurement functions. The simulation results in Table A1, Table A2, Table A3, Table A4, Table A5, Table A6, Table A7, Table A8, Table A9, Table A10, Table A11 and Table A12 show that the covariogram model can be used for the measurement functions given in Equations (8), (9) and (11). Note that the integral of the function in Equation (9) is computed by means of the numerical integration in MATLAB 2013 a , but the integrals of the functions in Equation (8), (10) and (11) are computed by means of the ‘int’ function, which is a function for the exact calculation of the integral in MATLAB 2013 a . The covariogram model in Equation (12) is proposed intuitively by [6,7]. We want to use it to check the performance of variance estimation in Equation (20). By using the covariogram model in Equation (12), the estimation of q can be said to be an open problem for the measurement functions in Equations (8)–(11), except Equation (8) with q = 0 . 4 . It is observed from the simulation results that the performance of the q formula depends on the covariogram model in Equation (12) and the variance extension term in Equation (20). For irregular M F , such as Equations (10) and (11), the estimations of q are unstable, namely, they change according to the sample size. It should be noted that for each sample size, the information on the equidistant systematic sampling version of f changes. When we have regular patterns, such as Equations (8) and (9), the estimations of q are stable according to the sample size.
Suppose that we have a flexible balloon filled with gas. This three-dimensional balloon can have different shapes after modification done by hand. Thus, each measurement function can be in a class for the functions with parameter q. As is seen, Equations (10) and (11) do not have the parameter q, but we want to estimate it to get the values of variance estimation precisely for them. The true parameter values of them are accepted to be q = 0 . 95 , q = 0 . 9 , respectively, because it seems that the estimated values of parameter q are around these values under Equations (12) and (20).
The estimated values for q of the measurement functions in Equation (8) with q = 0 . 8 and also Equation (9) with q = 0 . 4 and q = 0 . 8 would be around the true parameter values. It is seen that the covariogram model in Equation (12) cannot be a very good representative for these q values of two measurement functions. However, when it is considered on the performance of variance estimation, the values of C E M C and C E E T for the systematic sampling of Equations (8), (9) and (11) contain the same numbers of zero digits in the fraction part the most times under Equations (12) and (20). It should be noted that even though M F in Equation (11) does not have the parameter q, the values of C E M C and C E E T contain the same number of zeros in the fraction part when the sample sizes are from 6–20. When n = 5 , there is not enough information due to the irregular form of M F . It is seen from the simulation results that the values of C E M C for M F in Equation (11) are more approximate than that of Equations (8) and (9) generally, which shows evidence that we can propose a function without a parameter q, which is a neighborhood of functions with parameter q. In other words, we have a family for the covariogram functions. This family can be modeled by the covariogram model. This is one of the reasons why the proposed M F must be used. The efficiency for the systematic sampling of M F in Equation (10) is not as good as that of Equation (11) because of Equations (12) and (20). The covariogram function of Equation (10) cannot be modeled by the covariogram model. The precision of the systematic sampling of M F in Equation (10) should be improved. This shows us that the proposed measurement functions are also used while making research into the applied science when Equation (20) is used. As a result, for Equations (8), (9) and (11), Equations (12) and (20) give the values around C E E T ; however, for Equation (10), the values of C E M C and C E E T do not contain the same number of zero digits in the fraction part even if the sample size increases. The fluctuation of C E E T is an expected result, because the idea of estimation for parameter Q via the systematic sampling in Equation (1) is used.

3.1.3. Confidence Interval of Q ^ : Empirical True and Approximation for V a r ( Q ^ )

The theoretical coverage probability of the confidence interval for a parameter Q was 100 % . In this sense, the coverage probabilities of confidence intervals based on the coefficient of error for Matheron’s covariogram model are 100 % ; but the coverage probabilities of the confidence intervals based on the coefficient of error for the empirical true variance cannot be 100 % . It is observed that the coverage probabilities of confidence intervals based on the coefficient of error for the empirical true variance are 95 % approximately. The benefit of using the covariogram model with parameter q is that the bandwidth of confidence can be more accurate for a class of measurement functions, such as Equations (8), (9) and (11).

3.1.4. Confidence Interval of Q ^ : λ ( q , N )

C E M C can be approximated to C E E T for Equations (8), (9) and (11), and so, the coverage probabilities of the confidence intervals based on C E M C and C E E T , which are called C P C E M C and C P C E E T , respectively, can be considered to have similar values, which shows that our investigation on the performance of λ ( q , N ) values is reasonable. In other words, when we look at the performance of the confidence interval for whether or not it includes the true value, we should focus on C P C E M C and C P C E E T , which must give similar results together. To this point, the trustiness of the confidence interval is acceptable, because C P C E E T is at least 95 % approximately. In all performed simulations, N in λ ( q , N ) is taken to be two, which shows us that the confidence interval keeps itself in the optimal sense. This optimality is supported by C P C E M C and C P C E E T , because they have similar results. In other words, suppose that if N is taken to be three, the bandwidths of the confidence interval in the λ ( q , 3 ) case are larger than that of λ ( q , 2 ) . In such a case, we would have non-useful information for the confidence band of Q ^ . Another point we should focus on is the values in Table 1 and Table 2, because the exact values of constant λ ( q , N ) provide useful information for the confidence band. For M F in Equation (10), since C P C E E T is 100 % , it seems the constant λ ( q , N ) produces the confidence intervals, which includes the true value of the area. However, the accurateness of the constant λ ( q , N ) with the empirical true variance and the approximate variance estimations should be examined together. For the same values of λ ( q , N ) , the bandwidths of the confidence intervals with C E E T are smaller than that of C E M C , which shows us that the bandwidths of confidence intervals constructed with C E E T are more accurate than those of C E M C . When we look at Table A12, C P C E M C and C P C E E T have the same values, except that the number of samplings n is equal to five. The more number of samplings means the more information gained from the M F . This is one of the reasons why the proposed M F must be used. For M F in Equation (11), it is important to take a value for q, such that we can not only estimate the variance of systematic sampling more precisely, but also choose the right value for the constant λ ( q , N ) . Finally, proposing a class of measurement functions for research helps us to control the trustworthiness of the research in the applied science if the experts in the area can propose them correctly.
A final comment for the simulation section is that the covariogram model in Equation (12) and the variance extension term in Equation (20) can be used. However, the proposed measurement functions, such as Equations (8)–(11), should be systematically sampled, so that one can get a more precise decision on the application for biomedical imaging. As another solution, Equation (20) may be reformulated in the framework of the fractional Fourier transformation proposed by [36,37].

3.2. Real Data

Five different sheep brains, which were 12–18 months old, were removed from their skull via craniotomy in the laboratory for anatomy. These brains were immersed in formalin (5%) for 10 days. Brains were scanned with standard T 2 -weighted 0 . 5 Tesla magnetic resonance imaging (MRI) in the coronal plane with a 5-mm slice thickness. The true values of volume for each brain were obtained by using the Archimedean principle repeated six times. The arithematic mean of six results for each brain was used as a true value of the volume of a brain. After MRI scanned the sheep brains, the area of each digital image for slices of brains was computed by pixel counting. While doing the pixel counting, the edges of each digital image were detected by experts in the anatomy area. After that, they were estimated by the slices in the coronal plane. The results are given in Table 3 and Table 4. Table 3 and Table 4 show the true values for the volume (Q), the estimated volume ( Q ^ ), the estimated smoothness constant ( q ^ ), the estimated coefficient of error ( c e ^ ), the lower bound of the estimated volume ( Q ^ l o w e r ), the upper bound of the estimated volume ( Q ^ u p p e r ) and the number of slices (Num. Slc.). The area values of each slice obtained from the coronal axis are depicted in the Figure 3.
When N in Equation (32) is replaced with three, λ ( 0 . 3 , 3 ) = 3 . 82689 and λ ( 0 . 34 , 3 ) = 3 . 90405 are found. The confidence intervals of Brain 2 for q = 0 . 3 and q = 0 . 34 are (87,055.68, 94,420.32) and (86,981.44, 94,494.56), respectively. These confidence intervals include the true value of volume for Brain 2.
In Table 4, when the estimated q ^ with k = 2 is taken, the confidence interval includes the true values of volume for each non-vivo brain. It is seen that estimating accurately the parameter q affects the variance estimation and the confidence interval, as well. For this reason, λ ( q , N ) values for N = 2 , 3 , 4 in Table 1 are computed. The simulation results show that N should be two. The more precise measurement of the slice area is needed for the real data case, because measuring the area of slices more precisely produces the shape of M F more accurately. Having a more accurate shape of M F means that the estimated values of parameter q can be determined more accurately. However, it should be noted that the estimation of parameter q except Equation (8) with q = 0 . 4 is an open problem, as well, even if the exact form of M F is known. Taking the values of k in q formula and N in the constant λ ( q , N ) as bigger than two is not useful due to the loss of information; however, we do not have a solution for the real data case. In the real data case, we have to try different values of k and N, and also, the proposed M F must be used to increase the trustworthiness of research in the applied science. As a result, using two propositions is required.

4. Conclusions and Discussions

The works in Ref. [6,7,8,9,10,11,12,13,14,15,19,20,21,22] focused on the variance estimation for systematic sampling. As implied by [9,17,21], the estimation of q is important to avoid the biasedness of the variance estimator in systematic sampling on R . The unbiasedness of the variance extension term estimator leads to having accurate lower and upper bounds of the confidence interval for the systematic sampling on R . In fact, the true variance in Equation (5) is the combination of three components, so we always have biased variance estimation; however, for practical purposes, we are only interested in the variance extension term. It is observed from the simulation results that C E M C and C E E T have values that contain the same number of zero digits in the fraction part for the measurement functions in Equations (8), (9) and (11) at most times under Equations (12) and (20). However, the same performance of the formula in Equation (20) is not observed for the variance approximation of the systematic sampling of M F in Equation (10) even if the number of sample sizes increases. Therefore, the formulae in Equation (12) and Equation (20) are not appropriate to use. It should be noted that due to the definition of systematic sampling given in Equation (1), a fluctuation can be observed, so evaluating the performance of C E M C and C E E T according to the behavior of the sample size cannot be sensible. However, when the sample size increases, it is logical to expect the information coming from M F to be increased. In this context, Table A1, Table A2, Table A3, Table A4, Table A5, Table A6, Table A7, Table A8, Table A9, Table A10, Table A11 and Table A12 show that when the sample size increases, the values of C E M C and C E E T are decreased. As implied by p. 319 in [8], if a measurement function has irregularities, such as Equations (9)–(11), the values of C E M C are high when the sample size is small, because the information is not enough to catch the approximate shape of the three-dimensional object. For the measurement functions in Equations (9)–(11), Table A6, Table A8, Table A10 and Table A12 support this implication on p. 319 of [8]. The C E E T values of M F in Equation (9) with q = 0 . 4 can approve the fluctuation when the sample sizes are 5–20. For other measurement functions, the fluctuation on C E E T is not observed; in fact, there is a fluctuation; however, the effect of sample size covers the fluctuation. Otherwise, proposing a Fourier transformation to model the estimated values of parameter Q would not give the results for C E M C that are approximate to C E E T .
A three-dimensional object is represented by a flexible balloon filled with gas. The shape of this balloon can be modified by means of hands. The volume of the balloon can change after each modification; however, it is not important whether or not there is a change of the volume of the balloon. The modified versions of the balloon are represented by the measurement functions given in Equations (8)–(11). The measurement functions in Equations (10) and (11) do not have the parameter q; however, they can be a neighborhood of M F having parameter q. Since we have different forms of the balloon, we will have many forms of the measurement functions representing the three-dimensional object exactly. In the real data case, knowing the exact form of the three-dimensional object is impossible. The covariogram functions coming from the M F without q will be the neighborhood of the covariogram functions coming from M F with q. This is why the covariogram model has the parameter q. The true M F with q cannot be known exactly due to fact that we can have M F without q, which is a neighborhood of M F with q. In other words, we can have approximately the same functions, which are different mathematical expressions.
A method showing how to calculate constant λ ( q , N ) is proposed. It is expected that this method can be used as a new tool in mathematics when it is needed. A package program written in MATLAB 2013 a gives the lower and upper levels of confidence and the quantitative values of stereology when the data obtained from a single replication is typed. The program can be supplied on request. The estimation of q is an open problem, even if we know the exact form of the measurement functions, except Equation (8) with q = 0 . 4 . In the other perspective of our discussion, the covariogram model cannot be a very good approximation for the covariogram functions of measurement functions. However, the values of C E M C and C E E T contain the same numbers of zero digits in the fraction part at most times for the measurement functions in Equations (8), (9) and (11) under Equations (12) and (20).
The values of C P C E M C and C P C E E T are the other criteria to approve the performance of the exact values of the constant λ ( q , N ) , as observed from the simulation results. A numerical computation for the constant λ ( q , N ) of the confidence interval was done by [2]. The more precise the values of the constant λ ( q , N ) means the more precise the confidence interval. It is obvious that the proposition of exact calculation has to be preferred, because the computation of the constant λ ( q , N ) proposed by [2] is not as good as the results displayed by Table 1 and Table 2, as seen by the conformity between C P C E M C and C P C E E T for exact values of λ ( q , N ) , especially for the measurement functions in Equations (8), (9) and (11).
It is observed that the synthetic data can approve the real data for non-vivo brains if the real data have an exactly similar form as the synthetic data. Since the real data cannot be represented by the exactly similar measurement functions, we should prefer to use different N values. For this reason, we chose the different k values for the estimation of q in real data. Equation (25) can produce the negative estimated q ^ value for the area function of the real data given in Figure 3. The constant λ ( q , N ) values for different N values must be given, because the real data shows that we need constant λ ( q , N ) values with different N. In this context, the edges of digital images have to be found more precisely. Especially, when the edges of images are more irregular, the precision of getting area values for each image has to be decreased significantly. To do this, new edge detection methods proposed by [38] can be used.
The proposed measurement functions should be systematically sampled while conducting research on biomedical imaging to increase the information in the decision rule for now. To be able to increase the performance of the variance approximation, the fractional Fourier transformation may be applied to get a new variance estimation of the equidistant systematically sampled on R . Other types of M F may also be used. In this case, we need the computational integral techniques. The studies done by [39,40] will be enlightening for the computational integration of other M F types that we will use. For the construction of M F via digital images, which have coronal, axial or sagittal directions, the new edge detection methods proposed by [38] may be applied to get more precise area values of slices. We will prepare a free statistical software R package with a macro for Mathematica for all methodologies given here and in the future.

Acknowledgments

I am very grateful to Luis M. Cruz-Orive who sincerely invited me, to Marta García-Fiñana for sending the codes in R and to Niyazi Acer for kindly providing the data. I am also so indebted to the Council of Higher Education for giving the funding for my research. I am indebted to James F. Peters from the University of Manitoba, Winnipeg, Canada, for critical reading. I would like to thank sincerely the four referees. Without their help, this paper would not have been improved. I also wish to thank my great parents. In the memory of my great brother.

Author Contributions

Mehmet Niyazi Çankaya finished the manuscript. The author has read and approved the final manuscript.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Table A1. ( 1 x 2 ) q : q = 0 . 4 .
Table A1. ( 1 x 2 ) q : q = 0 . 4 .
n q ^ Var ^ ( q ^ ) MSE ^ ( q ^ )
5 0 . 469211 0 . 000755 0 . 005545
6 0 . 449412 0 . 001009 0 . 003450
7 0 . 438921 0 . 000994 0 . 002509
8 0 . 430876 0 . 000992 0 . 001945
9 0 . 426702 0 . 001029 0 . 001742
10 0 . 422139 0 . 001089 0 . 001580
11 0 . 419202 0 . 001029 0 . 001398
12 0 . 417282 0 . 001049 0 . 001348
13 0 . 414414 0 . 001213 0 . 001421
14 0 . 413815 0 . 001090 0 . 001281
15 0 . 411180 0 . 001210 0 . 001335
16 0 . 411575 0 . 001060 0 . 001194
17 0 . 409010 0 . 001232 0 . 001314
18 0 . 409631 0 . 001119 0 . 001211
19 0 . 409595 0 . 001086 0 . 001178
20 0 . 408593 0 . 001079 0 . 001153
Table A2. ( 1 x 2 ) q : q = 0 . 4 .
Table A2. ( 1 x 2 ) q : q = 0 . 4 .
n CEMC CEET CPCEMC % CPCEET %
5 0 . 031387 0 . 027150 100 . 0 98 . 8
6 0 . 024935 0 . 021804 100 . 0 99 . 1
7 0 . 020275 0 . 017246 100 . 0 99 . 1
8 0 . 016988 0 . 014385 100 . 0 99 . 3
9 0 . 014433 0 . 011948 100 . 0 98 . 7
10 0 . 012535 0 . 010502 100 . 0 99 . 1
11 0 . 011000 0 . 009178 100 . 0 99 . 5
12 0 . 009751 0 . 008085 100 . 0 99 . 0
13 0 . 008768 0 . 007458 100 . 0 99 . 0
14 0 . 007891 0 . 006564 100 . 0 99 . 1
15 0 . 007211 0 . 006121 100 . 0 99 . 3
16 0 . 006562 0 . 005408 100 . 0 99 . 3
17 0 . 006069 0 . 005181 100 . 0 99 . 2
18 0 . 005580 0 . 004672 100 . 0 99 . 5
19 0 . 005167 0 . 004271 100 . 0 99 . 4
20 0 . 004820 0 . 003961 100 . 0 99 . 3
Table A3. ( 1 x 2 ) q : q = 0 . 8 .
Table A3. ( 1 x 2 ) q : q = 0 . 8 .
n q ^ Var ^ ( q ^ ) MSE ^ ( q ^ )
5 0 . 798607 0 . 014187 0 . 014189
6 0 . 821274 0 . 015347 0 . 015800
7 0 . 827312 0 . 016704 0 . 017450
8 0 . 838035 0 . 016471 0 . 017918
9 0 . 835330 0 . 017350 0 . 018599
10 0 . 840301 0 . 017530 0 . 019154
11 0 . 840649 0 . 017641 0 . 019294
12 0 . 839555 0 . 018128 0 . 019693
13 0 . 842700 0 . 017997 0 . 019821
14 0 . 837016 0 . 018080 0 . 019450
15 0 . 838411 0 . 018319 0 . 019794
16 0 . 840756 0 . 018341 0 . 020002
17 0 . 837451 0 . 018479 0 . 019881
18 0 . 831963 0 . 019301 0 . 020323
19 0 . 839645 0 . 019642 0 . 021214
20 0 . 835757 0 . 019321 0 . 020599
Table A4. ( 1 x 2 ) q : q = 0 . 8 .
Table A4. ( 1 x 2 ) q : q = 0 . 8 .
n CEMC CEET CPCEMC % CPCEET %
5 0 . 018256 0 . 022835 100 . 0 100 . 0
6 0 . 012532 0 . 016332 100 . 0 100 . 0
7 0 . 009405 0 . 012425 100 . 0 100 . 0
8 0 . 007190 0 . 009670 100 . 0 100 . 0
9 0 . 005913 0 . 007842 100 . 0 100 . 0
10 0 . 004853 0 . 006570 100 . 0 100 . 0
11 0 . 004053 0 . 005437 100 . 0 100 . 0
12 0 . 003491 0 . 004684 100 . 0 100 . 0
13 0 . 003021 0 . 004037 100 . 0 100 . 0
14 0 . 002674 0 . 003554 100 . 0 100 . 0
15 0 . 002380 0 . 003191 100 . 0 100 . 0
16 0 . 002116 0 . 002814 100 . 0 100 . 0
17 0 . 001885 0 . 002528 100 . 0 100 . 0
18 0 . 001702 0 . 002280 100 . 0 100 . 0
19 0 . 001549 0 . 002043 100 . 0 100 . 0
20 0 . 001429 0 . 001903 100 . 0 100 . 0
Table A5. ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q : q = 0 . 4 .
Table A5. ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q : q = 0 . 4 .
n q ^ Var ^ ( q ^ ) MSE ^ ( q ^ )
5 0 . 059737 0 . 009231 0 . 220590
6 0 . 257864 0 . 001882 0 . 022085
7 0 . 412975 0 . 000590 0 . 000758
8 0 . 483350 0 . 000779 0 . 007726
9 0 . 501458 0 . 000922 0 . 011216
10 0 . 507921 0 . 001314 0 . 012961
11 0 . 507945 0 . 000837 0 . 012489
12 0 . 507897 0 . 001435 0 . 013076
13 0 . 503777 0 . 000699 0 . 011469
14 0 . 501835 0 . 001445 0 . 011816
15 0 . 497033 0 . 000737 0 . 010153
16 0 . 494786 0 . 001527 0 . 010511
17 0 . 489366 0 . 000710 0 . 008696
18 0 . 488828 0 . 001432 0 . 009323
19 0 . 483590 0 . 000714 0 . 007702
20 0 . 481851 0 . 001531 0 . 008230
Table A6. ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q : q = 0 . 4 .
Table A6. ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q : q = 0 . 4 .
n CEMC CEET CPCEMC % CPCEET %
5 0 . 136986 0 . 035193 100 . 0 94 . 6
6 0 . 075842 0 . 056090 100 . 0 97 . 4
7 0 . 048192 0 . 022847 100 . 0 98 . 0
8 0 . 035892 0 . 036584 100 . 0 99 . 1
9 0 . 029172 0 . 017402 100 . 0 98 . 3
10 0 . 024801 0 . 026910 100 . 0 99 . 5
11 0 . 021386 0 . 013797 100 . 0 98 . 6
12 0 . 018852 0 . 020350 100 . 0 99 . 4
13 0 . 016763 0 . 010922 100 . 0 98 . 9
14 0 . 015121 0 . 016248 100 . 0 99 . 7
15 0 . 013697 0 . 009184 100 . 0 98 . 5
16 0 . 012545 0 . 013334 100 . 0 99 . 4
17 0 . 011526 0 . 007842 100 . 0 98 . 9
18 0 . 010623 0 . 011027 100 . 0 99 . 7
19 0 . 009865 0 . 006687 100 . 0 98 . 5
20 0 . 009211 0 . 009652 100 . 0 99 . 6
Table A7. ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q : q = 0 . 8 .
Table A7. ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q : q = 0 . 8 .
n q ^ Var ^ ( q ^ ) MSE ^ ( q ^ )
5 0 . 127902 0 . 025653 0 . 886655
6 0 . 259307 0 . 009587 0 . 301935
7 0 . 471353 0 . 008843 0 . 116851
8 0 . 591335 0 . 008365 0 . 051906
9 0 . 667866 0 . 009158 0 . 026618
10 0 . 712836 0 . 010045 0 . 017643
11 0 . 743469 0 . 010628 0 . 013824
12 0 . 769775 0 . 011272 0 . 012186
13 0 . 782195 0 . 012278 0 . 012595
14 0 . 794971 0 . 013111 0 . 013137
15 0 . 802343 0 . 013406 0 . 013412
16 0 . 810423 0 . 013491 0 . 013599
17 0 . 814130 0 . 013994 0 . 014194
18 0 . 822037 0 . 014498 0 . 014984
19 0 . 822426 0 . 014351 0 . 014854
20 0 . 822911 0 . 015298 0 . 015823
Table A8. ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q : q = 0 . 8 .
Table A8. ( ( 1 c o s ( x ) ) ( 1 x 2 ) ) q : q = 0 . 8 .
n CEMC CEET CPCEMC % CPCEET %
5 0 . 179574 0 . 070757 100 . 0 95 . 8
6 0 . 098853 0 . 058929 100 . 0 98 . 7
7 0 . 056666 0 . 040635 100 . 0 100 . 0
8 0 . 037906 0 . 035454 100 . 0 100 . 0
9 0 . 027210 0 . 025762 100 . 0 100 . 0
10 0 . 021142 0 . 023491 100 . 0 100 . 0
11 0 . 016973 0 . 018652 100 . 0 100 . 0
12 0 . 014014 0 . 016925 100 . 0 100 . 0
13 0 . 011669 0 . 013700 100 . 0 100 . 0
14 0 . 010083 0 . 012682 100 . 0 100 . 0
15 0 . 008732 0 . 010624 100 . 0 100 . 0
16 0 . 007799 0 . 010070 100 . 0 100 . 0
17 0 . 006862 0 . 008591 100 . 0 100 . 0
18 0 . 006093 0 . 008067 100 . 0 100 . 0
19 0 . 005530 0 . 007070 100 . 0 100 . 0
20 0 . 005030 0 . 006753 100 . 0 100 . 0
Table A9. e x p ( s i n ( x 3 ) ) : q = 0 . 95 .
Table A9. e x p ( s i n ( x 3 ) ) : q = 0 . 95 .
n q ^ Var ^ ( q ^ ) MSE ^ ( q ^ )
5 0 . 088096 0 . 050826 1 . 128469
6 0 . 041988 0 . 031897 0 . 856384
7 0 . 189917 0 . 018652 0 . 596379
8 0 . 320166 0 . 008843 0 . 405534
9 0 . 449291 0 . 007235 0 . 257945
10 0 . 549659 0 . 004972 0 . 165245
11 0 . 639576 0 . 002916 0 . 099279
12 0 . 714919 0 . 001908 0 . 057171
13 0 . 776112 0 . 001817 0 . 032055
14 0 . 827798 0 . 001763 0 . 016697
15 0 . 868178 0 . 001724 0 . 008419
16 0 . 899500 0 . 001604 0 . 004154
17 0 . 928089 0 . 001326 0 . 001806
18 0 . 942607 0 . 001128 0 . 001183
19 0 . 959425 0 . 001025 0 . 001114
20 0 . 967014 0 . 000817 0 . 001106
Table A10. e x p ( s i n ( x 3 ) ) : q = 0 . 95 .
Table A10. e x p ( s i n ( x 3 ) ) : q = 0 . 95 .
n CEMC CEET CPCEMC % CPCEET %
5 0 . 123899 0 . 042779 100 . 0 100 . 0
6 0 . 096852 0 . 020019 100 . 0 100 . 0
7 0 . 068757 0 . 008892 100 . 0 100 . 0
8 0 . 046235 0 . 003329 100 . 0 100 . 0
9 0 . 031395 0 . 001138 100 . 0 100 . 0
10 0 . 022537 0 . 000614 100 . 0 100 . 0
11 0 . 016419 0 . 000538 100 . 0 100 . 0
12 0 . 012289 0 . 000462 100 . 0 100 . 0
13 0 . 009477 0 . 000366 100 . 0 100 . 0
14 0 . 007494 0 . 000282 100 . 0 100 . 0
15 0 . 006081 0 . 000220 100 . 0 100 . 0
16 0 . 005047 0 . 000184 100 . 0 100 . 0
17 0 . 004236 0 . 000158 100 . 0 100 . 0
18 0 . 003677 0 . 000132 100 . 0 100 . 0
19 0 . 003202 0 . 000117 100 . 0 100 . 0
20 0 . 002850 0 . 000103 100 . 0 100 . 0
Table A11. ( 5 / 112 ) ( 54 x 4 25 x 3 + 48 x 2 + 25 x + 6 ) : q = 0 . 9 .
Table A11. ( 5 / 112 ) ( 54 x 4 25 x 3 + 48 x 2 + 25 x + 6 ) : q = 0 . 9 .
n q ^ Var ^ ( q ^ ) MSE ^ ( q ^ )
5 0 . 119701 0 . 021089 0 . 629956
6 0 . 349198 0 . 018577 0 . 321959
7 0 . 506408 0 . 017147 0 . 172062
8 0 . 607155 0 . 017009 0 . 102768
9 0 . 677396 0 . 017249 0 . 066802
10 0 . 720912 0 . 016863 0 . 048935
11 0 . 765776 0 . 016458 0 . 034474
12 0 . 786878 0 . 015944 0 . 028741
13 0 . 815932 0 . 015711 0 . 022778
14 0 . 838310 0 . 015351 0 . 019157
15 0 . 856894 0 . 015623 0 . 017481
16 0 . 865002 0 . 016006 0 . 017231
17 0 . 883260 0 . 015086 0 . 015367
18 0 . 887106 0 . 015674 0 . 015840
19 0 . 896326 0 . 015430 0 . 015444
20 0 . 904643 0 . 014824 0 . 014845
Table A12. ( 5 / 112 ) ( 54 x 4 25 x 3 + 48 x 2 + 25 x + 6 ) : q = 0 . 9 .
Table A12. ( 5 / 112 ) ( 54 x 4 25 x 3 + 48 x 2 + 25 x + 6 ) : q = 0 . 9 .
n CEMC CEET CPCEMC % CPCEET %
5 0 . 131979 0 . 061047 100 . 0 97 . 7
6 0 . 076664 0 . 044234 100 . 0 100 . 0
7 0 . 047065 0 . 031891 100 . 0 100 . 0
8 0 . 032165 0 . 024913 100 . 0 100 . 0
9 0 . 023411 0 . 019979 100 . 0 100 . 0
10 0 . 018146 0 . 015962 100 . 0 100 . 0
11 0 . 014053 0 . 013222 100 . 0 100 . 0
12 0 . 011617 0 . 011093 100 . 0 100 . 0
13 0 . 009464 0 . 009315 100 . 0 100 . 0
14 0 . 007890 0 . 007985 100 . 0 100 . 0
15 0 . 006690 0 . 006992 100 . 0 100 . 0
16 0 . 005872 0 . 006274 100 . 0 100 . 0
17 0 . 005015 0 . 005386 100 . 0 100 . 0
18 0 . 004499 0 . 004952 100 . 0 100 . 0
19 0 . 003983 0 . 004410 100 . 0 100 . 0
20 0 . 003544 0 . 003891 100 . 0 100 . 0

References

  1. Eriksen, N.; Rostrup, E.; Andersen, K.; Lauritzen, M.J.; Larsen, V.A.; Dreier, J.P.; Strong, A.J.; Hartings, J.A.; Fabricius, M.; Pakkenberg, B. Application of stereological estimates in patients with severe head injuries using CT and MR scanning images. Br. Inst. Radiol. 2010, 83, 307–317. [Google Scholar] [CrossRef] [PubMed]
  2. García-Fiñana, M. Confidence intervals in Cavalieri Sampling. J. Microsc. 2006, 222, 146–157. [Google Scholar] [CrossRef] [PubMed]
  3. García-Fiñana, M.; Keller, S.S.; Roberts, N. Confidence Intervals for the Volume of Brain Structures in Cavalieri Sampling with Local Errors. J. Neurosci. Methods 2009, 179, 71–77. [Google Scholar] [CrossRef] [PubMed]
  4. Hussain, Z.; Roberts, N.; Whitehouse, G.H.; García-Fiñana, M.; Percy, D. Estimation of Breast Volume and its Variation During the Menstrual Cycle using MRI and Stereology. Br. J. Radiol. 1999, 72, 236–245. [Google Scholar] [CrossRef] [PubMed]
  5. McNulty, V.; Cruz-Orive, L.M.; Roberts, N.; Holmes, C.J.; Gual-Arnau, X. Estimation of Brain Compartment Volume from MR Cavalieri Slices. J. Comput. Assist. Tomogr. 2000, 24, 466–477. [Google Scholar] [CrossRef] [PubMed]
  6. Matheron, G. Variables Régionalisées et Leur Estimation [Les]; Masson: Paris, France, 1965. [Google Scholar]
  7. Matheron, G. The Theory of Regionalized Variables and Its Applications. In Les Cahires du Centre de Morphologie Mathématique de Fontainebleau, No. 5; Ecole Nationale Supérieure des Mines de Paris: Paris, France, 1971. [Google Scholar]
  8. Cruz-Orive, L.M. On the Precision of Systematic Sampling: A Review of Matheron’s Transitive Methods. J. Microsc. 1989, 153, 315–333. [Google Scholar] [CrossRef]
  9. García-Fiñana, M.; Cruz-Orive, L.M. Improved Variance Prediction for Systematic Sampling on R. Statistics 2004, 38, 243–272. [Google Scholar] [CrossRef]
  10. Gual Arnau, X.; Cruz-Orive, L.M. Variance prediction under systematic sampling with geometric probes. Adv. Appl. Probab. 1998, 30, 889–903. [Google Scholar] [CrossRef]
  11. Gundersen, H.J.G.; Jensen, E.B. The Efficiency of Systematic Sampling in Stereology and its Prediction. J. Microsc. 1987, 147, 229–263. [Google Scholar] [CrossRef] [PubMed]
  12. Kellerer, A.M. Exact formulae for the precision of systematic sampling. J. Microsc. 1989, 153, 285–300. [Google Scholar] [CrossRef] [Green Version]
  13. Kiêu, K. Three Lectures on Systematic Geometric Sampling; Memoirs 13; Department of Theoretical Statistics, University of Aarush: Aarhus, Denmark, 1997; p. 100. [Google Scholar]
  14. Kiêu, K.; Souchet, S.; Istas, J. Precision of Systematic Sampling and Transitive Methods. J. Stat. Plan. Inference 1999, 77, 263–279. [Google Scholar] [CrossRef]
  15. Mattfeldt, T. The accuracy of one-dimensional systematic sampling. J. Microsc. 1989, 153, 301–313. [Google Scholar] [CrossRef]
  16. Roberts, N.; Cruz-Orive, L.M.; Reid, M.K.; Brodie, D.A.; Bourne, M.; Edwards, H.T. Unbiased Estimation of Human Body Composition by the Cavalieri Method Using Magnetic Resonance Imaging. J. Microsc. 1993, 171, 239–253. [Google Scholar] [CrossRef] [PubMed]
  17. García-Fiñana, M.; Cruz-Orive, L.M.; Mackay Clare, E.; Pakkenberg, B.; Roberts, N. Comparison of MR Imaging Against Physical Sectioning to Estimate the Volume of Human Cerebral Compartments. NeuroImage 2003, 18, 505–516. [Google Scholar] [CrossRef]
  18. Baddeley, A.J.; Vedel Jensen, E.B. Stereology for Statisticians; Monographs on Statistics and Applied Probability; Chapman & Hall/CRC: London, UK, 2005; p. 380. [Google Scholar]
  19. Cruz-Orive, L.M. Variance predictors for isotropic geometric sampling, with applications in forestry. Stat. Methods Appl. 2012, 22, 3–31. [Google Scholar] [CrossRef]
  20. Cruz-Orive, L.M. Systematic Sampling in Stereology. In Proceedings of the 49th Session of the International Statistical Institute, Florence, Italy, 25 August–3 Sepetmber 1993; Volume 55, pp. 451–468.
  21. Cruz-Orive, L.M. A General Variance Predictor for Cavalieri Slices. J. Microsc. 2006, 222, 158–165. [Google Scholar] [CrossRef] [PubMed]
  22. Gundersen, H.J.G.; Jensen, E.B.V.; Kiêu, K.; Nielsen, J. The Efficiency of Systematic Sampling in Stereology—Reconsidered. J. Microsc. 1999, 193, 199–211. [Google Scholar] [CrossRef] [PubMed]
  23. Acer, N.; Şahin, B.; Usanmaz, M.; Tatoǧlu, H.; Irmak, Z. Comparison of Point Counting and Planimetry Methods for the Assessment of Cerebellar Volume in Human Using Magnetic Resonance Imaging: A Stereological Study. Surg. Radiol. Anat. 2008, 30, 335–339. [Google Scholar] [CrossRef] [PubMed]
  24. Acer, N.; Şahin, B.; Emirzeoğlu, M.; Uzun, A.; İncesu, L.; Bek, Y.; Bilgiç, S.; Kaplan, S. Stereological Estimation of the Orbital Volume: A Criterion Standard Study. J. Craniofacial Surg. 2009, 20, 921–938. [Google Scholar] [CrossRef] [PubMed]
  25. Acer, N.; Çankaya, M.N.; İşçi, Ö.; Baş, O.; Çamurdanoğlu, M.; Turgut, M. Estimation of Cerebral Surface Area using Vertical Sectioning and Magnetic Resonance Imaging: A Stereological Study. Brain Res. 2010, 1310, 29–36. [Google Scholar] [CrossRef] [PubMed]
  26. Akbaş, H.; Şahin, B.; Eroğlu, L.; Odacı, E.; Bilgiç, S.; Kaplan, S.; Uzun, A.; Ergür, H.; Bek, Y. Estimation of Breast Prosthesis Volume by the Cavalieri Principle Using Magnetic Resonance Images. Aesthet. Plast. Surg. 2004, 28, 275–280. [Google Scholar] [CrossRef] [PubMed]
  27. Howard, M.A.; Roberts, N.; García-Fiñana, M.; Cowell, P.E. Volume Estimation of Prefrontal Cortical Subfields using MRI and Stereology. Brain Res. Protoc. 2003, 10, 125–138. [Google Scholar] [CrossRef]
  28. Maudsley, R.; García-Fiñana, M. Sampling Intensity with Fixed Precision When Estimating Volume of Human Brain Compartments. Image Anal. Stereol. 2008, 27, 143–149. [Google Scholar] [CrossRef]
  29. Roberts, N.; Pubdephat, M.J.; McNulty, V. The Benefit of Stereology for Quantitative Radiology. Br. J. Radiol. 2000, 73, 679–697. [Google Scholar] [CrossRef] [PubMed]
  30. Şahin, B.; Emirzeoğlu, M.; Uzun, A.; İncesu, L.; Bek, Y.; Bilgiç, S.; Kaplan, S. Unbiased Estimation of the Liver Volume by the Cavalieri Principle Using Magnetic Resonance Images. Eur. J. Radiol. 2003, 47, 164–170. [Google Scholar] [CrossRef]
  31. Şahin, B.; Ergür, H. Assessment of the Optimum Section Thickness for the Estimation of Liver Volume Using Magnetic Resonance Images: A Stereological Gold Standard Study. Eur. J. Radiol. 2005, 57, 96–101. [Google Scholar] [CrossRef] [PubMed]
  32. Bellhouse, D.R.; Krishnaiah, P.R.; Rao, C.R. (Eds.) Sampling: Handbook of Statistics; Elsevier Science: Amsterdam, The Netherlands, 1988; Volume 6, pp. 125–145.
  33. Hall, P.; Ziegel, J. Distributions estimators and confidence intervals for stereological volumes. Biometrika 2011, 98, 417–431. [Google Scholar] [CrossRef]
  34. Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions; U.S. Government Printing Office: Washington, DC, USA, 1970; p. 1058.
  35. Çankaya, M.N. Stereological Estimation and Inference with Applications. Master’s Thesis, Council of Higher Education, Ankara, Turkey, 2010; p. 168. [Google Scholar]
  36. Baleanu, D.; Wu, G.C.; Duan, J.S. Some Analytical Techniques in Fractional Calculus: Realities and Challenges. In Discontinuity and Complexity in Nonlinear Physical Systems; The Series Nonlinear Systems and Complexity; Springer: Heidelberg, Germany, 2013; Volume 6, pp. 35–62. [Google Scholar]
  37. Poularikas, A.D. Transforms and Applications Handbook, 3rd ed.; CRC Press, Taylor and Fracis Group: Boca Raton, FL, USA, 2010. [Google Scholar]
  38. Peters, J.F. Computational Proximity Excursions in the Topology of Digital Images; Intelligent Systems Reference Library; Springer: Basel, Switzerland, 2016; Volume 102, p. 433. [Google Scholar]
  39. Qin, Y.M.; Shi, Y.G. A new approximate method to conjugacies between a family of unimodal interval maps. J. Comput. Complex. Appl. 2016, 2, 163–169. [Google Scholar]
  40. Zengn, Y. Approximate solutions of three integral equations by the new Adomian decomposition method. J. Comput. Complex. Appl. 2016, 2, 38–43. [Google Scholar]
Figure 1. Measurement functions with parameter q. (a) Measurement function ( M F ) of Equation (8); (b) M F of Equation (9).
Figure 1. Measurement functions with parameter q. (a) Measurement function ( M F ) of Equation (8); (b) M F of Equation (9).
Entropy 18 00352 g001
Figure 2. Measurement functions without parameter q. (a) M F of Equation (10); (b) M F of Equation (11).
Figure 2. Measurement functions without parameter q. (a) M F of Equation (10); (b) M F of Equation (11).
Entropy 18 00352 g002
Figure 3. Area functions for each brain.
Figure 3. Area functions for each brain.
Entropy 18 00352 g003
Table 1. q and its exact λ ( q , N ) values for different N values.
Table 1. q and its exact λ ( q , N ) values for different N values.
q λ ( q , 2 ) λ ( q , 3 ) λ ( q , 4 )
0 2 . 44949 --
0 . 1 2 . 71243 3.322033.83595
0 . 2 2 . 93821 3.598554.15525
0 . 3 3 . 12464 3.826894.41891
0 . 4 3 . 26925 4.0044.62342
0 . 5 3 . 36968 4.126994.76544
0 . 6 3 . 42394 4.193454.84218
0 . 7 3 . 43064 4.201654.85165
0 . 8 3 . 38906 4.150734.79285
0 . 9 3 . 29929 4.040794.6659
1 3 . 16228 3.872984.47214
Table 2. q and its computational λ ( q , N ) values for N = 2 .
Table 2. q and its computational λ ( q , N ) values for N = 2 .
q S 1 : λ ( q , N ) S 2 : λ ( q , N ) S 3 : λ ( q , N ) S 4 : λ ( q , N )
0 2 . 402871 2.418465--
0 . 1 2 . 671264 ---
0 . 2 2 . 89612 --2.926749
0 . 3 3 . 07818 3.1050913.1119373.114245
0 . 4 3 . 216635 --3.258353
0 . 5 3 . 323898 --3.365038
0 . 6 3 . 397839 3.4088763.4125773.414001
0 . 7 3 . 418973 --3.426855
0 . 8 3 . 383912 -3.387363.387638
0 . 9 3 . 297052 3.298313.2986483.298765
1 3 . 161317 ---
Table 3. Five sheep brain volumes from automatic pixel counting (mm 2 ) after determining the border of slices with a 5-mm thickness and their confidence intervals for N = 2 : arithmetic mean of q ^ values with k = 2 , 3 , 4 , 5 , 6 , 7 in Equation (25). Num. Slc., number of slices.
Table 3. Five sheep brain volumes from automatic pixel counting (mm 2 ) after determining the border of slices with a 5-mm thickness and their confidence intervals for N = 2 : arithmetic mean of q ^ values with k = 2 , 3 , 4 , 5 , 6 , 7 in Equation (25). Num. Slc., number of slices.
BrainsQ Q ^ q ^ c e ^ Q ^ l o w e r Q ^ u p p e r Num. Slc.
185,00087,3000.200.023481,289.0593,310.9512
294,00090,7380.340.010687,731.4093,744.6013
383,00084,6080.500.006682,722.6486,493.3614
488,00088,8460.420.008786,318.1391,372.8714
5100,00099,6750.670.003998,351.66100,998.3415
Table 4. Five sheep brain volumes from automatic pixel counting (mm 2 ) after determining the border of slices with a 5-mm thickness and their confidence intervals for N = 2 : q ^ values with k = 2 in Equation (25).
Table 4. Five sheep brain volumes from automatic pixel counting (mm 2 ) after determining the border of slices with a 5-mm thickness and their confidence intervals for N = 2 : q ^ values with k = 2 in Equation (25).
BrainsQ Q ^ q ^ c e ^ Q ^ l o w e r Q ^ u p p e r Num. Slc.
185,00087,300−0.060.023482,646.9891,953.0212
294,00090,7380.180.013187,249.3194,226.6913
383,00084,6080.410.007582,464.8486,751.1614
488,00088,8460.280.010585,934.1091,756.9014
5100,00099,6750.630.004198,283.46101,066.5415

Share and Cite

MDPI and ACS Style

Çankaya, M.N. Propositions for Confidence Interval in Systematic Sampling on Real Line. Entropy 2016, 18, 352. https://doi.org/10.3390/e18100352

AMA Style

Çankaya MN. Propositions for Confidence Interval in Systematic Sampling on Real Line. Entropy. 2016; 18(10):352. https://doi.org/10.3390/e18100352

Chicago/Turabian Style

Çankaya, Mehmet Niyazi. 2016. "Propositions for Confidence Interval in Systematic Sampling on Real Line" Entropy 18, no. 10: 352. https://doi.org/10.3390/e18100352

APA Style

Çankaya, M. N. (2016). Propositions for Confidence Interval in Systematic Sampling on Real Line. Entropy, 18(10), 352. https://doi.org/10.3390/e18100352

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop