1. Introduction
Control charts, also known as Shewhart control charts [
1,
2,
3], have been used to determine if a manufacturing process is in a state of control. In particular, the
and
S charts have been widely used to monitor or detect the mean and variability of variables. Here, a variable is a quality characteristic measured on a numerical scale. For example, variables include continuous measurement process data such as length, pressure, width, temperature, and volume, in a time-ordered sequence.
Due to their importance and usefulness in real life applications, these traditional types of univariate and control charts have still received much attention in the literature. We observe that these control charts are usually adopted for continuously monitoring numerous data and for solving the problem of process control in the Industry 4.0 framework; see, for example [
4,
5,
6], among others. Of particular note is that [
7] developed a nice
qcr package in R to generate Shewhart-type charts and obtained numerical results of interest for a process quality control. More recently, based on the concept of data depth, Ref. [
8] proposed a novel alternative way for constructing control charts when the critical to quality (CTQ) variables of the process are functional and also developed the Phase I and II control charts for stabilizing and monitoring the processes, respectively.
It deserves mentioning that these traditional Shewhart control charts mentioned above consist of the upper and lower control limits (for short, UCL and LCL) and the center line (CL). It is noteworthy that the American Standard is based on control limits with an ideal false alarm rate (FAR) of 0.27% while the British Standard is based on with an ideal FAR of 0.20%, where denotes the standard error.
The usual requirement behind these control charts is that the sample sizes from the process are all equal. In practice, however, it is often the case that this requirement can not be met due to wrong or missing observations in collecting them. In such setting, the three conventional approaches below are widely used to deal with unequal sample sizes:
- (i)
A weighted average approach in calculating and .
- (ii)
Control limits based on an average sample size.
- (iii)
A weighted average in calculating .
For more details, see Subsection 6.3.2 of Montgomery [
9] and Subsection 3.8.B of ASTM [
10]. The first approach uses variable-width control limits which are determined by the sample-specific values such as
,
,
, and
. To estimate the scale parameter, a weighted average of sample variances is calculated first and then its square-root is taken to estimate the scale parameter. The second approach uses fixed-width control limits which is based on the average of the sample sizes. For more details on these two methods, see Subsection 6.3.2 of Montgomery [
9]. The third approach is very similar to calculating
in the first approach. However, it uses a weighted average of sample standard deviations directly to estimate the scale parameter. For more details, see Subsection 3.8.B of ASTM [
10]. It is known that these three approaches may be satisfactory when the sample sizes are not very different. Given that the average of the sample sizes is not necessarily an integer in general, a practical alternative to the second approach is the use of a modal sample size.
However, when using these ad hoc approaches above, the parameter estimators are biased and they actually underestimate the true population parameters as will be shown in Remark 2 in
Section 2 and Remarks 3 and 4 in
Section 3. These underestimating ad hoc approaches could result in degraded performance of the control charts. Nonetheless, these biased methods are widely covered in many popular textbooks. These observations motivate us to clarify these conventional methods and investigate other existing methods, especially when the samples are not equal in size. Through the rigorous proofs, we provide a guideline for the best selection of the methods to construct the
and
S control charts.
This paper is organized as follows. In
Section 2, we provide two location estimators and four scale estimators with unequal sample sizes and show that they are all unbiased. In
Section 3, we provide the variances of the estimators considered in this paper and show the inequality relations among them. In
Section 4, we provide the relative efficiency of the methods and conduct simulation results to compare the performance of the location and scale estimators. In
Section 5, we illustrate how to construct various Shewhart-type control charts (i.e.,
S,
, and
charts) using the provided estimators. In
Section 6, we provide the empirical estimates of the average run length (ARL) and the standard deviation of the run length (SDRL) through using the extensive Monte Carlo simulations. Three real-data examples are presented in
Section 7 for illustrative purposes. Some concluding remarks are given in
Section 8.
2. Estimation of Process Parameters with Unequal Sample Sizes
In this section, we provide two location estimators and three scale estimators for the process parameters under the assumption that each sample has different sample sizes. In parametric statistical quality control, the underlying distribution is used to construct the control charts. A quality characteristic is assumed to be normally distributed, which is most widely used in most practical cases. Under this assumption, we show that the estimators provided in this section are all unbiased.
We assume that we have m samples and that each sample has different sample sizes. Let be the ith sample (subgroup) of size from a stable manufacturing process, where and . We also assume that are independent and identically distributed as normal with mean and variance .
2.1. Location Parameter
Using the
ith sample above, the sample mean and the sample variance are given by
where
. Montgomery [
9] provides two location estimators of the population mean parameter
in Equations (6.2) and (6.30) in his book, which are given by
and
where
and
.
These grand averages can be used as the CL on the chart. Since for , it is easily seen that and , showing that these two estimators are unbiased. In addition, the variances of and are given by and , which results in . Thus, is preferred to . It should be noted that, for the case of an equal sample size, we have , where .
2.2. Scale Parameter
It is well known that
is an unbiased estimator of
. However,
is not an unbiased estimator of
as below. Since
is distributed as the gamma with shape
and scale 2, we have
This shows that
is actually an unbiased estimator of
. Note that
is the normal-consistent unbiasing factor, which is a function of the sample size and this
notation was originally used in ASQC [
11] to the best of our knowledge. For more details on
, the interested reader is referred to Vardeman [
12]. In
Appendix A, we also provide an approximate calculation of
which can be used for practically easier calculation.
Thus, we can estimate
using
which are are all unbiased. Analogous to Equation (
1), one can use
which is clearly unbiased for
. Since
from Equation (
3), the estimator below
is also unbiased for
. In addition, we consider the following unbiased estimator proposed by Burr [
13]
This estimator is actually the best linear unbiased estimator (BLUE) and we provide the proof below.
Theorem 1. The estimator in Equation (7) is the BLUE. Proof. First, we consider a linear unbiased estimator in the form of
. Then, its variance and expectation are given by
To obtain the BLUE, we need to minimize
with the unbiasedness condition
. Thus, our objective is to minimize
which can be easily solved by using the method of Lagrange multipliers. The auxiliary function with the Lagrange multiplier
is given by
It is immediate from
that
, which results in
Multiplying
to Equation (
8) and then making the sum of the two sides, we have
Since
, we first solve the above for
and then substitute it into Equation (
8), which provides
which results in
. This completes the proof. □
Remark 1. It should be noted that, for the case of an equal sample size (), we can easily show that , where .
One can also incorporate the pooled sample variance in estimating
given by
where
again. However,
is not unbiased for
although
is. This is because
Based on this, Burr [
13] suggested the following unbiased estimator of
,
Remark 2. The weighted average approach introduced in Subsection 6.3.2 of Montgomery [9] uses the pooled sample standard deviation from Equation (9) to estimate σ. However, since , which will be shown in Lemma 1, it is immediate from Equation (10) that clearly underestimates the true parameter σ. We have introduced the four unbiased scale estimators of which are denoted by , , , and . A natural question appears: which of the four estimators should be recommended for estimating in practical applications? In the following section, we clarify this question by providing a guideline in terms of inequalities of their variances of the estimators under consideration.
3. Inequalities of the Variances of the Scale Estimators
We first obtain the variance of
in Equation (
5), which is obtained as
Using Equation (
3) and the unbiasedness property of
, we have
Substituting Equation (
13) into (
12), we have
Similarly, the variance of
in Equation (
6) is easily obtained as
and that of
in Equation (
7) is also obtained as
Finally, for the case of
, we have
Using Equation (
11) and the unbiasedness property of
, we can rewrite Equation (
17) as
Since
is also unbiased for
, we have
Next, as aforementioned, we here answer the question of how to choose the best one among the four estimators by comparing their variances of these four scale estimators. To be more specific, we prove the inequality relations among the four scale estimators as follows.
Lemma 1. The function defined in Equation (4) is monotonically increasing and Proof. Using
, we can rewrite
in Equation (
4) as
where
. Watson [
14] showed that the function
is monotonically decreasing. It is noteworthy that the function
was motivated by Wallis’ famous infinite fraction for
[
15]. We can rewrite
using
in Equation (
20) as
Since both
and
are positive and monotonically decreasing,
is also decreasing. Thus,
is monotonically increasing. Watson [
14] and Mortici [
16] also showed that
Multiplying
on each of the above terms, some algebra shows that
For convenience, we let
. Then, we have
which completes the proof. □
Lemma 2 (Chebyshev’s sum inequality)
. If and , then we haveSimilarly, if and , then we have Proof. First, we will consider the case where both
and
are increasing. In this case, both
and
have the same sign, or at least one of them can have the zero value. Then, the value of
is positive or zero for any
i and
j, which results in
After the tedious algebra of the above, we have
Next, we consider the case when is increasing and is decreasing. In this case, and have different signs, or at least one of them can have the zero value. Then, the value of is negative or zero for any i and j. Similar to the above approach, we have which results in . This completes the proof. □
It should be noted that the above inequality name is coined after Pafnuty Lvovich Chebyshev (1821–1894) who mentioned it in a brief note [
17]. He provided it in an integral form and his original proof can be found in Chebyshev [
18]. For more details, the readers are also referred to Besenyei [
19] and Section 2.17 of Hardy et al. [
20].
Proof. For convenience, we rearrange the sample sizes so that
. Then, it is immediate from Equations (
14) and (
15) that it suffices to show
Since
is increasing from Lemma 1, it is easily seen that
is decreasing. For convenience, let
and
. Then, we have
and
. Thus, we observe from Lemma 2 that
Applying Lemma 2 again with
(increasing) and
(decreasing), we have
which results in
Comparing Equations (
22) and (
23), we have
which results in
This completes the proof. □
Proof. We have the variances of
and
from Equations (
15) and (
16), which are given by
Thus, it suffices to show
For convenience, we let
and
. Then, it is immediate from the Cauchy–Schwarz inequality,
that the inequality in Equation (
24) holds. This completes the proof. □
Proof. It is immediate from Equation (
21) that we have
For convenience, we let
. Then, it suffices to show
is concave. Bustoz and Ismail [
21] showed that
is completely monotonic on
using the representation by Watson [
14]. For more details on complete monotonicity, refer to Section XIII.4 of Feller [
22]. It is well known that completely monotonic functions are log-convex. For example, see Lemma 4.3 of Merkle [
23], Theorem 1 of Fink [
24], Exercise 6 in Section 2.1 of Niculescu and Persson [
25], and Equation (3.4) of van Haeringen [
26] with
and
, among others.
Thus, is convex so is concave. Since is also concave, is concave. This implies that is log-concave. The log-concavity of guarantees that it is concave. This completes the proof. □
Proof. From Equations (
16) and (
18), we have
Thus, it suffices to show
It is immediate from Lemma 3 that
is concave. Then, using Jensen’s inequality, we have
where
. Thus, it suffices to show
Using Equation (
25) in Lemma 3, we have
and
where
is defined in Equation (
20). Comparing Equations (
27) and (
28), we need to show
Since
and
is decreasing from Watson [
14], the above inequality in Equation (
26) holds. This completes the proof. □
In combination with the inequalities in Theorems 2–4, we have the following result:
Lemma 4. The defined in Equation (4) satisfies Proof. First, we will show that
is concave. Taking the logarithm of
in Equation (
19), we have
where
. It is well known that the second derivative of
can be expressed as the sum of the series
For more details, see Merkle [
27] and Section 11.14 (iv) of Schilling [
28].
Using Equation (
30), we have
Thus,
is log-concave which guarantees that
is concave. Then, using Jensen’s inequality, we have
where
. This completes the proof. □
Remark 3. It is worth mentioning that one can argue an alternative way based on an average sample size . For example, see Section 6.3.2 of Montgomery [9]. In such setting, one may consider the following quotient for the estimator of σwhere However, this estimator is not unbiased because Because from Lemma 4, can underestimate the true parameter σ. However, for the case of an equal sample size , we have so that . Thus, in this special case of an equal sample size, is unbiased.
Remark 4. There is another alternative in Subsection 3.8.B of ASTM [10], which is based on the weighted average of sample standard deviations given by It deserves mentioning that the estimator above still underestimates the true parameter σ because from Lemma 1. In addition, for the case of an equal sample size, we have , where .
Theorem 5. Let and . Then, and are the uniform minimum variance unbiased estimators of μ and σ, respectively. Thus, we have Proof. One can obtain the uniform minimum variance unbiased estimator (UMVUE) using complete sufficient statistics as described in Theorem 7.3.23 of Casella and Berger [
29]. For more details on the UMVUE, one can refer to Definition 1.6 of Lehmann and Casella [
30].
The control charts we have developed are under the assumption that are independent and identically distributed as normal with mean and variance which leads to the joint complete sufficient statistics for and given by and , respectively. Since and , and are the UMVUEs of and , respectively. This completes the proof. □
Remark 5. It is noteworthy that, analogous to Equation (18), we havewhich also results in since is increasing from Lemma 1. Remark 6. Although can attain the minimum variance, we do not adopt this estimator to construct the control charts. The main reason can be discussed as follows. Consider the case that are independent and identically distributed as normal with different means for and variance . Then, the pooled sample variance, , is a complete sufficient statistic. See Example 2.3 in Section 2.2 of Lehmann and Casella [30]. This implies that is better under the null hypothesis (in control), whereas is better under the alternative (out of control). Thus, if the process is out of control, the control charts using could have wider control limits, which may result in an increase in the rate incorrectly signaling that the process is in-control when the process is actually out of control. In addition, one can think of the case of non-constancy of σ. In this heteroscedasticity case, Burr [13] mentioned that seems preferable to . We think that this case should be investigated more thoroughly in a sequel paper. 5. Construction of the Control Charts with Unequal Sample Sizes
We briefly introduce how to construct the control charts and then discuss how to implement the estimators provided here to construct the
S chart in
Section 5.1 and improve the
S chart using the probability limits in
Section 5.2. We also discuss the construction of the
chart in
Section 5.3.
In general, we construct statistical quality control charts based on two phases [
9,
38], usually denoted by
Phase-I and
Phase-II. Then, one can establish control limits with a set of stable manufacturing process data in Phase-I. Then, we monitor the process in Phase-II using the control limits obtained in Phase-I. We assume that we have
m samples from a stable manufacturing process (Phase-I) and each sample has different sample sizes, denoted by
where
. Then, we monitor the process with a sample of size
(Phase-II).
5.1. The S Chart
From the statistical asymptotic theory (for example, Corollarys 6–10 of Arnold [
39]), we can have an approximate distribution
where
is the sample standard deviation with sample size
. In order to construct the
control limits, we can set up
. Solving this for
, we have
Since
is generally unknown, we need to estimate
. One can use
in Equation (
5),
in Equation (
6),
in Equation (
7), and
in Equation (
11). Using
, we can construct the
S chart with sample size
as follows:
where we assign zero to
if it is negative. Next, using
, we construct the
S chart as follows:
In addition, using
, we can construct the
S chart as follows:
Of particular note is that, for the case of an equal sample size (
) in Phase-I, it is easily seen that
,
, and
. Thus, we have the
S chart below with sample size
to use in Phase-II
where
. We assign zero to
if it is negative. Furthermore, if we assume
, we have
where
, and
. This is a well-known
S chart introduced in the quality control literature. For example, see the chart in Equation (6.27) of Montgomery [
9]. This indicates that the proposed
S chart includes the existing
S chart as a special case.
Using
, we construct the
S chart as follows:
where
and
. Unlike the previous cases, this control chart is not the same as the one in Equation (36) even for the case of an equal sample size.
5.2. The S and Charts with Probability Limits
We can improve the above
S charts by using probability limits as mentioned in Section 4.7.4 of Ryan [
40]. It follows from the following Chi-square distribution result
that
where
is the
th upper quantile of the Chi-square distribution with
degrees of freedom. Rewriting Equation (37) about
, we then have
Thus, we can construct the
S chart with probability limits such that
,
, and
. In practice, since
is unknown, we need to estimate
. Thus, with the estimator
, we can obtain
In the above, one construct the S chart with probability limits by using , , or instead of .
Next, we consider the construction of the
chart. We rewrite Equation (37) about
and we then have
Using the above along with
where
is the pooled sample variance denoted in Equation (
9), we can also construct the
chart as follows:
Note that none of , , , or is unbiased for , whereas is unbiased. Thus, it is not recommended to use any of the four estimators to construct the chart.
5.3. The Chart
From the statistical asymptotic theory, we have
where
is the sample mean with sample size
. In order to construct the
control limits, we can set up
. Solving this for
, we have
Since
and
are unknown in practice, we need to estimate them. With the estimates
and
, we have
To estimate
, one can use
defined in Equation (
1) or
defined in Equation (
2). To estimate
, one can use any of
,
,
, and
.
As an illustration, using
and
, we have
chart as follows:
Similarly, we can construct various charts using a total of eight combinations of and estimators.
Remark 7. It should be noted that, when is used, the chart is somewhat different from the above chart and is given by As shown in
Section 4,
performs better than
,
, and
. However, to the best of our knowledge, the
chart based on
has not been widely used probably because the calculation of the normal-consistent unbiasing factor
is difficult especially with a large argument value. Most textbooks provide the values of
only for
. In
Appendix A, for an easy calculation, we provide an approximation for
which is highly accurate within one unit in the ninth decimal place for
. Thus, using this, one can easily calculate the LCL and UCL of the
chart based on
.
6. Average Run Length and Standard Deviation of Run Length
To compare the performance of the control charts based on the various estimators, we obtained the empirical estimates of the ARL and the SDRL through using the extensive Monte Carlo simulation method. For this simulation, we considered the
chart. In this chart, we only estimated the location with
because the RE of
is better than that of
as shown in
Section 4. For the scale, we considered seven different estimates (
,
,
,
,
,
,
) and we denoted the charts based on these scale estimates by A, B, C, D, S, S*, and Sw, respectively.
We assume that we have m samples () in Phase-I. Again, let be the ith sample (subgroup) of size . Then, ’s were generated from the normal distribution with mean and standard deviation and we obtained the location estimate () and the seven scale estimates. Using these estimates, we constructed the seven control charts based on control limits with FAR 0.27%. Then, we monitored the process with a new sample of size from the same normal distribution and obtained the run length in Phase-II. We repeated this simulation one million times () to obtain the run lengths and then estimated the ARL and SDRL based on these run lengths. Note that the simulation results are the same as the ones under different parameter values of and . It deserves mentioning that the results are quite reasonable, since the normal distribution is a location-scale family, whereas the results are somewhat dependent on the number of samples and the combination of the sample sizes.
We generated
samples with the five different scenarios. The sample sizes of each scenario are given by
We considered Scenarios I–IV for the cases of unequal sample sizes and Scenario V for the case of an equal sample size as a reference. Upon each of these scenarios, we estimated the ARL and SDRL as described above. The simulation results are provided in
Table 3.
Note that the ideal ARL under the normal distribution is around
with FAR 0.27%. However, this ideal value is obtained when using the true
and
. In practice, we need to estimate these parameters in Phase-I. Thus, with such an uncertainty due to estimation, the target ARL can be different from the ideal ARL of 370. We observed that the empirical results of A, B, C, and S* have the same value under the case of an equal sample size (Scenario V) and these results have the same tendency when the RE measure is used in
Section 4.2. These results are expected as pointed out in Remark 3. The estimated ARL of D (362.58) is very close to that of A, B, C, and S* (364.36). In addition, the estimated SDRL of D (537.31) is very close to that of A, B, C, and S* (541.45). This minor difference may be due to a random phenomenon of the Monte Carlo simulation. Thus, it is quite reasonable to assume that the target ARL is around 360 and the target SDRL is around 540.
In what follows, we analyze and compare our results based on these target values. Note that we did not consider the values from the control charts based on S and Sw because they have much smaller ARL values than the others, which is mainly due to the fact that they have serious negative bias even with the case of an equal sample size as seen in
Table 1.
In Scenario I, there is a serious difference in terms of the sample sizes (3, 10, 17). The ARL values of the charts based on A and B seriously overshoot the target value and their SDRLs are far above the target value. The ARL values of the charts using S and Sw seriously undershoot the target value while the ARL of the chart using S* has a minor underestimate. These results are closely related to the RE. In
Table 2,
and
have noticeable negative values of the biases while
has a decent negative value. This implies that the scale estimates underestimate the true value, which results in a narrower control chart. Thus, using a narrower control chart, one can have a smaller ARL. When it comes to the SDRL, the SDRLs of the charts using A and B seriously overshoot the target and those using S and Sw undershoot the target. The chart using S* overshoots the target. These results are similar to those with the RE.
In Scenarios II–IV, we have similar observations as noticed in Scenario I. Because there is a less serious difference in sample sizes, the observations are not so dramatic as seen in Scenario I, but their tendencies are quite similar. For example, in Scenario IV, the sample sizes are very slightly different, so these results are quite close to those in Scenario V (equal sample size).
In Scenario V, as mentioned earlier, we considered this as a reference. In this case, we can also observe that the charts using S and Sw have the same results and their ARL values seriously undershoot the target value. In Remark 4, we pointed out
for the case of an equal sample size. We can also observe this in
Figure 1a. As mentioned earlier, their underperformance is due to a serious negative bias as seen in
Table 1. The results show that the charts based on C and D perform very well. However, when the samples are very small (small number of samples with small sample size), we expect that the chart based on D can be slightly better than that based on C as shown in
Figure 1. However, in practice, with a decent size of samples, one can use the charts based on either C or D.
8. Conclusions
In this paper, we have considered several unbiased location and scale estimators for the process parameters of the and S control charts when the sample sizes are not necessarily equal. These estimators are essential for constructing the control limits of the Shewhart-type control charts. A natural question is: among these unbiased estimators, which one should be recommended in practical applications? We clarified this question by providing the inequality relations among the variances of these estimators through the rigorous proofs. We also showed that the conventional ad hoc methods could result in degraded performance of the control charts, mainly because the adopted estimators are all biased and they actually tend to underestimate the true scale parameter.
We also provided the relative efficiency of the methods along with the conventional methods and the empirical estimates of the ARL and the SDRL through using the extensive Monte Carlo simulations. We observed that the chart based on outperforms the others under consideration from both theoretical and numerical points of view. The only difficulty of using lies in calculating the normal-consistent unbiasing factor for a large argument value (large sample size) without using a professional software. To resolve this problem, we provided an approximation of the as a function of a sample size which can be easily calculated with a general calculator for the case of a large argument value and is accurate within one unit in the ninth decimal place.
All of the theoretical results revealed an interesting and useful connection between the two fields of statistics and mathematics. For example, the normal-consistent unbiasing factor can be expressed as a function through using the Watson representation, which helps one to understand the behavior of the in depth. We thus expect that the new findings about the can help quality engineers develop more useful results in various engineering statistics fields.
It is noteworthy to mention that all the theoretical and empirical results of this paper require the assumption of the homoscedasticity case of . In an ongoing work, we investigate these estimators more thoroughly for the heteroscedasticity case.