3.1. Generalized Linear Mixed Models
When considering theoretical statistical models, many issues should be evaluated, including (i) first, the distributional assumptions for the dependent variable and random errors; and (ii) second, the type of independent variables involved in the model. With this in mind, GLMMs could be viewed as an extension of Generalized Linear Models (GLMs) [
12,
13] where linear models are enlarged to consider the exponential family for the dependent variable distribution assumptions. At the same time, GLMMs also evaluate the inclusion in the model of random effects, and this feature extends this class of models with respect to mixed linear models [
14]. Therefore, the main theoretical characteristics for GLMMs [
15,
16] can be summarized in two key-points: (i) the distributional assumption for the response variable
Y; and (ii) the evaluation of fixed and random effects involved in the model as independent variables. Obviously, the GLMM theory and the advantages obtained by applying this class of models cannot be restricted to two key-points alone; in [
17], GLMMs were applied to improve the semiconductor yield and to provide significant information when using a large dataset; however, these two key-points were essential for the theory applied in this study.
Let us consider a random variable
Y with the expected value
and variance
; we also defined the
i-th realization of
as
. Moreover, each
was supposed to be distributed according to a density function belonging to the exponential family. The distribution for each
conditioned to the random effects
can be expressed as
where
is the canonical parameter for each
,
is the scale parameter, and
is a function for the mean and is equal to
in the Gaussian case. The variance function
expresses the relation between
and
and is one of the main elements for
when considering the marginal variance in the presence of random effects [
15]. These elements were differentiated according to the specific probability density function (p.d.f.) belonging to Formula (
2). Moreover, the variance function played a relevant role in the 1990s when GLMs were introduced as alternative models for improving Taguchi’s two-step procedure [
18] for modeling the location and dispersion of the response variable
Y, also called the dual-response approach. Details of this well-known approach are presented in
Section 3.3 below.
Given Formula (
2), a GLMM can be defined with respect to a GLM model by adding random effects to the linear predictor
, where the distribution for each
is conditioned to the random effects
. Therefore, a GLMM can be defined through the following three equations:
where
g is the link function that is equal to the identity function when
is assumed to be normally distributed;
is the unknown vector of the coefficients for random effects;
if a diagonal structure is assumed for the variance-covariances (null covariances) of the residual error;
A is a diagonal matrix formed by the variance function
. As in a linear mixed model [
14], the array
G is related to the variances and covariances of the random effects. Moreover, when considering the analysis of experimental variables in order to set a robust design, the inclusion of random effects allowed us to evaluate the effects on process variability due to noises that cannot be directly controlled, even though they are measurable. It must be noted that Formulas (3)–(5) fully define a general GLMM, namely, the model of Formula (
3), the array
G of variance-covariances for random effects (4), and lastly, the array defining the variance-covariances for the response variable
Y, Formula (5).
3.2. The Split-Plot Design and Modeling
Split-plot designs were first defined in the 1950s [
19]. Over the last two decades, following the seminal contributions by [
3,
20,
21,
22,
23,
24], split-plot design has received great attention as a valid experimental design in the technological field and for a robust design approach. In [
3], this experimental design was developed by considering its specific framework in which the key aspect is the distinction between Whole-Plot (WP) and Sub-Plot (SP) factors, and three arrangements were suggested in order to consider the classification and environmental (noise) factors in an efficient setting. Moreover, the bi-randomization of the split-plot design may be useful to study noise and block variables as WP factors, also considered as random effects. Subsequently, the experimental variables, e.g., SP factors, are randomized within the WP units. Furthermore, the inclusion of experimental variables as SP factors guarantees more accurate estimations of these effects. Further theoretical developments have contributed towards expounding the relevance of the split-plot design. In [
23], the equivalence between the Ordinary Least Squares (OLS) and Generalized Least Squares (GLS) estimation methods for fixed effects was established for cases where the split-plot design is planned according to a specific structure.
Next, we consider having WP factors as random variables and SP factors as fixed ones. This choice is strictly related to the case study illustrated below.
In general, a split-plot design is formed by two sets of factors in a bi-randomized structure: the set of Whole-Plot (WP) factors, , and the set of Sub-Plot (SP) factors, . Therefore, within each block (replicate) k, , for a balanced split-plot design, we have n runs, and consequently, the total number of trials is .
For the sake of simplicity, let us start by considering a split-plot design with one WP factor (
) and one SP factor (
). Moreover, let us assume we have
K blocks and that the WP factor is involved in the model as a random variable while the SP factor is included as a fixed effect. Therefore, by also considering the general formulation (
3), the corresponding GLMM for a single observation can be expressed as
where
is the linear predictor;
is the replicate (block) effect; and
and
are the WP and SP errors respectively. Please note that
, e.g., the WP error, is estimated through the first interaction term between replicates and the WP factor. The two unknown coefficients
and
, which are random and fixed respectively, relate to the two effects for the WP and SP factors. It must be noted that
. Furthermore, by considering the distributional assumption of Formula (
6), we can define the array for the variance-covariances of the response variable as follows:
where
R is the array for the variance-covariances of the residual error if the Normal distribution is hypothesized. In general, a diagonal structure can be assumed for the two arrays,
G and
R, i.e., explicitly referring to a variance component structure for both in which all the covariances are assumed to be null. Nevertheless, for a split-plot design/structure,
G and/or
R arrays can be structured differently, e.g., a diagonal structure is often misleading. In fact, no null covariances can occur when considering the residual error and/or random effects conditioned to blocks and/or categorical WP factors, such as a classification variable. In this case, we may assume that a covariance is present among observations belonging to the same Whole-Unit (WU), formed by the level combination of WP factors in the
k block; otherwise the covariance is null.
Therefore, we may consider a partition for array
G by discriminating between
and
:
is the part of
G related to the variance for random effects;
is related to the covariances between random effects. By considering the model in Formula (
6), we can express
V as follows:
The general structure for V (Formulas (7) and (8)) may be better defined for a split-plot design when considering a Compound Symmetry (CS) structure.
Regarding the CS structure, by denoting the array with J for each block/replicate (n dimension), the array G becomes
In Formula (
9), the variances of the random effects are formed by
, and covariances are assumed to be constant and are equal to
. Moreover, the CS structure for the
G array allows the no-null covariances to be evaluated among units within the same WU and the same replicate, while variances are related to the random effects, such as
and the replicate effect of model (
6). The variance component of the WP error is then given by the interaction between the random factor
and the replicates, while the array
R is related to the variance-covariances for the SP error (residual). In the case-study we evaluated a CS structure for
G and a diagonal structure for
R.
3.3. The Dual-Response Generalized Linear Mixed Models Approach
Over the past few decades, response surface methodology (RSM) and GLMs have been widely and permanently introduced to define alternative methods to Taguchi’s two-step procedure an the robust design approach [
18,
25], where two statistical models for the response variable, e.g., location and dispersion models, are defined. RSM has been extensively applied in order to improve Taguchi’s parameter design, especially considering the concept of a robust design and the two-step procedure (dual-response modeling approach). The dual-response approach was initially addressed with consideration of both RSM [
25] and the class of GLMs [
18]. Nevertheless, the sequential nature of the RSM, that is, the experimental design and statistical model, certainly represents an advantage for the application of this methodology [
26]. The combined array is a valid alternative to improve both the two-step procedure and the robust design [
27]. On the other hand, GLMs confirm their important role in this field, and they have been integrated with the local optimal experimental design theory [
28]. In this sense, GLMMs represent a further improvement due to the peculiarities belonging to this class of models that extend GLMs towards the inclusion of random effects; moreover, also with respect to RSM, they could be more suitable for specific experimental designs, such as the split-plot, when considering the split-plot structure and the GLMM features illustrated in
Section 3.1.
Therefore, in this section, the proposal of a dual-response approach arises from the need to account for the residual variability of the response by modeling the residuals of the location model. As a result, we have (i) a main (location) split-plot model including random as well as fixed effects, in which residual weighting is involved in order to account for overdispersion; (ii) a dispersion model, in which the dependent variable is defined by the residuals obtained as a first step, through the estimation of the location model by considering the homoscedasticity assumption. It must be noted that the estimated residual values obtained from the fitting of the dispersion model are called “residual weighting”, and they are used as weights for the subsequent estimated location model in order to evaluate the overdispersion.
Starting from the main split-plot model (Formula (
6)), in general, the dispersion model can be defined according to the following formula, where
is the expected value for the residuals:
and the variance function
. This dual-response modeling, e.g., location and dispersion models, is iteratively performed. The residual values of the dispersion model are used as weights, and the main split-plot model (Formula (
6)) is newly fitted to obtain a better fitting of the model for coefficient estimates, predicted values, diagnostic measures, and analyses. The better fitting is obviously tested through diagnostics, as shown in the following case-study.
Moreover, the dispersion model, Formula (
10), is fitted through a Quasi-Likelihood (QL) estimation method, e.g., by only specifying the mean and the variance function (first two moments) for the dependent variable [
13,
29,
30]. Therefore, the application of the QL estimation method allows for obtaining a better fitting of the weighted location model by involving overdispersion and avoiding the specification of the probability distribution function (p.d.f.) for the dispersion model.
Regarding the kind of residuals chosen for weighting, we applied the Pearson residuals here, e.g., each residual (
was standardized by the square-root of the estimated conditional variance, calculated for each observation: