can use conditional activation probabilities. Condi-
tional activation probabilities can also be obtained by:
P(β
jk
6= 0|β
j
6= 0,β
k
6= 0,D ) (10)
=
P(β
jk
6= 0,β
j
6= 0,β
k
6= 0|D )
P(β
j
6= 0,β
k
6= 0|D )
,
provided that P(β
j
6= 0,β
k
6= 0|D ) > 0. In practice
one should only consider conditional activation prob-
abilities when both P(β
j
|D ) and P(β
k
|D ) are consid-
erably large. In cases where P(β
j
|D ) or P(β
k
|D ) are
small then unreasonably large inflations to the condi-
tional activation probabilities will occur and hence the
result in incorrect inferences.
2.3 Restricted Model Space
A simple approach to defining the neighborhoods of
a model M
c
is to include all models that add an ad-
ditional term or drop an existing term. However, this
violates a model that require both main effect terms
need to be present in the model in order for the cor-
responding two-way interaction to be added. Further-
more, the model need not contain all interaction terms
possible. Notice this creates a large model space.
For the first order models with p predictors the size
of the model space is 2
p
. However with the addi-
tion of interaction terms, the size grows considerably
more. In a dataset with 30 loci, a full model with all
first order terms and two-way interaction terms will
have 465 terms. This can be prohibitively large for
most datasets and algorithms. If the model space is
restricted to r < p predictors and the corresponding
epistasis terms, then any model considered will not
have nearly as many terms. If r is chosen wisely, then
the researcher can ensure that each model under con-
sideration has sufficient degrees of freedom to be es-
timated.
Furthermore, cases where linear dependencies ex-
ist among the predictors estimation can be compli-
cated. One approach to address this issue is to assign
P(M
c
) = 0 to all models where linear dependencies
exist among the predictors. Hence removing all mul-
ticollinear models from consideration. Any time there
are multicollinear terms an index will need to be cre-
ated in order to keep track of any aliased terms. This
aliasing can cause problems when there is a large ef-
fect size for the aliased terms.
The use of restricted model spaces allows for the
assessment of all candidate variables, however it re-
stricts the number of candidate variables that may be
simultaneously considered in a single model. (Yi et
al., 2003), (Yi et al., 2005), (Yi et al., 2007a), (Yi et
al., 2007b) and (Yandell et al., 2007) use two restric-
tions one for the number of main effect terms and one
for the number of epistatic terms allowed in the model
simultaneously. They also give a simple guideline to
determine the size of each restriciton. They suggest
to choose the restriction r = m+ 2
√
m where m is the
a priori expected number of main effects. Similarly
the same formula can be employed where m is the ex-
pected number of epistatic effect.
To search through the restricted model space,
MC
3
can be employed using equation (7). Note that
q(M
t
|M
c
) must be determined to move through the
sample space. Let nbd(M
c
) be all models with one
main effect term more, one valid interaction term
more, one main effect term less and one interaction
term less than model M
l
. Denote adding a main ef-
fect term as AMT, adding an interaction effect term as
AIT, dropping a main effect term as DMT and drop-
ping an interaction effct term as DIT. The probility of
each of these actions depends on the attributes of the
current model M
c
. Let γ
c
and φ
c
be the number of
main effect terms and number of interaction terms in
M
c
, respectively. In order to ensure that all models
in nbd(M
c
) are equally likely, the probability of each
action, AMT, AIT, DMT and DIT need to be deter-
mined. Let Ω = {AMT, AIT,DMT,DIT} be an action
space. Once these probabilities have been calculated,
the following procedure allows for each of the models
in nbd(M
c
) to be sampled to be candidate model. First
determine, P(AMT), P(AIT), P(DMT) and P(DIT),
and choose an action with the corresponding proba-
bility. Then select with equal probability a model that
is in nbd(M
c
) and corresponds to the action. This pro-
cedure ensures that all models in nbd(M
c
) have equal
probability. Having all models in nbd(M
c
) equally
likely will be necessary in computing q(M
c
|M
t
).
For γ
c
= 0, only a main effect term may be added
since no interaction terms are in the model. Hence the
probability distribution for Ω is:
P(AMT) = 1, P(DMT) = 0,
P(AIT) = 0,P(DIT) = 0. (11)
For γ
c
= 1, the one of the p−1 main effect terms
not in the model may be added or the one main effect
term in the model may be droped and no interaction
terms are allowed in this model. Hence the probabil-
ity distribution for Ω is:
P(AMT) =
p−1
p
,P(DMT) =
1
p
,
P(AIT) = 0,P(DIT) = 0. (12)
For 2 ≤ γ
c
≤ r, no restrictions are involved.
Hence, all actions in Ω are allowed. Hence, the prob-
ICAART 2011 - 3rd International Conference on Agents and Artificial Intelligence
74