Step 7. Compute mAIC for each model and choose
the final model with the smallest mAIC
among all models, and all E
j
included in the
final model are considered to be significant
to the response y
0
.
The first modification is in Step 2. Due to the
heredity principle, two-factor interactions are never
be selected as the first PIE, so only the marginal corre-
lations of all main effects are compared for selecting
the first PIE. The second and third modifications are
in Step 5. During the search of the j
th
PIE, not all
two-factor interactions are considered in the compari-
son of marginal correlation. According to the heredity
principle, a two-factor interaction X
ij
is considered in
Step 5(b) if and only if either X
i
or X
j
or both parents
main effects have been included in S
Inf
in the pre-
vious searches. Therefore, the modifications in Step
5 take away a subset of two-factor interactions that
none of their corresponding parent main effects have
been PIEs. The last modification is in Step 6. The re-
duced models built in this step must follow the hered-
ity principle in order to avoid the situation that some
significant two-factor interactions are included in the
reduced model but none of their parent main effects
have been included.
2.2 Two Illustrating Examples
We illustrate the analysis of nonregular FFDs via the
SRRS method step by step using the following two
examples. The Factor Screening procedures is termi-
nated via the noise threshold in the first example and
via the maximum number of PIEs in the second ex-
ample.
Example 1. Consider the cast fatigue experiment
(Wu and Hamada 2000 , section 7.1), a real data set
consisting of seven two-level factors. The design ma-
trix and the response are found in Wu and Hamada
(2000) . When all two-factor interactions are consid-
ered to be as important as the main effects, the design
matrix consists of 21 additional interactions and is su-
persaturated.
In the Factor Screening procedure, the first PIE
being identified is F and its absolute marginal cor-
relation to y
0
is the highest among all main effects
(0.6672). A regression model between y
0
and F is
built and the magnitude of the slope estimate |β
F
| =
0.4576. Then we set the threshold γ = 0.04, about
10% of β
F
.
To search for the second PIE, the new response y
1
is refined by subtracting Fβ
F
from y
0
. Then among
all main effects and all the two-factor interactions that
consist of F, FG (the interaction between main ef-
fects F and G) has the highest absolute marginal cor-
Table 1: Factor Screening of Cast Fatigue Experiment Data.
Marginal Continue
m PIE Correlation |β| or Stop
0 F 0.6672 0.4576 Continue
1 FG −0.8980 0.4588 Continue
2 D −0.4677 0.1183 Continue
3 EF −0.6336 0.1442 Continue
4 C 0.5032 0.0758 Continue
5 E −0.5817 0.0785 Continue
6 AE −0.7667 0.1482 Continue
AE −0.6835 0 Stop
PIEs in S
Inf
after Factor Screening:
C, D, E, F, AE, EF, FG
relation (0.8980) to y
1
and so it is identified as the
second PIE. A regression model between y
1
and FG,
F is built and the magnitude of the slope estimate
|β
FG
| = 0.4588 > γ. This means FG is important
enough to be included in the influential set S
Inf
to-
gether with F.
The procedure continues to search for the next five
PIEs. Table 1 shows every step of the process of Fac-
tor Screening. Note that in the last step, the absolute
magnitude of the slope estimate of AE is close to 0,
so the search stops and seven PIEs are identified in the
Factor Screening procedure.
Since there are 12 observations in the data, the
maximum number of active factors is suggested to be
4. There are totally 98 reduced models up to four-
factors models that are constructed from seven PIEs,
but only 49 of them fulfill the heredity principle. A
comparison of the mAICs of these 49 reduced models
shows that the two-effects model with F and FG has
the lowest mAIC = −27.82. Thus the SRRS method
suggests that F and FG have significant impacts to
the response y
0
. This result is also recommended by
Wu and Hamada (2000, Section 8.4) and the Dantzig
selector (DS) method in Phoa, Pan and Xu (2009) .
Example 2. Consider the high-performance liq-
uid chromatography (HPLC) experiment (Vander-
Heyden et al., 1999), a real data set consisting of eight
two-level factors. The design matrix and the response
are found in Phoa, Wong and Xu (2009) . When all
two-factor interactions are considered to be as impor-
tant as the main effects, the design matrix consists of
28 additional interactions and is supersaturated.
In the Factor Screening procedure, the first PIE
being identified is E and its absolute marginal cor-
relation to y
0
is the highest among all main effects
(0.5019). A regression model between y
0
and E is
built and the magnitude of the slope estimate |β
F
| =
0.5583. Then we set the threshold γ = 0.05, about
10% of β
E
.
To search for the second PIE, the new response y
1
THE STEPWISE RESPONSE REFINEMENT SCREENER (SRRS) AND ITS APPLICATIONS TO ANALYSIS OF
FACTORIAL EXPERIMENTS
159