Two different mRNA samples were tested, one treated
with the contaminant Benzo(a)pyrene (BaP) with a
concentration of 5µM and one with a lower one of
50nM, called T5µM and T50nM in the following.
This contaminant BaP is found in cigarette smoke
and automobile exhaust and is connected to deadly
diseases such as cancer. Geneticists assume that the
contamination of cells with BaP with the high con-
centration of 5µM leads to the cellular process apop-
tosis, programmed cell death, but not the contamina-
tion with the low concentration. Therefore the present
gene data is analyzed with regard to apoptosis.
The apoptotic pathway for mice can be found
in the KEGG database, Kanehisa and Goto (2000),
hosted by Kanehisa Laboratory. From all detected
genes, 78 are, due to the database, known to be in-
volved in the apoptotic pathways. These are extracted
and considered in the following. The database gives
for each gene a set of genes where it may depend on.
This knowledge is taken into account for a first identi-
fication, where these sets are taken as possible clusters
for the identification of the respective gene. Thereby
possible solutions of clusters are a priori reduced.
The identification with canalyzing constraints as pre-
sented in Section 4.2 and without constraints as given
in Lichtenberg and Eichler (2011) is applied. The
maximum number of rows of the resulting OTVLs
is restricted to two. For the identification for each
gene a model for connectivity degree two up to the set
size given in the database is identified with canalyzing
constraints. For the identification without constraints
the maximal connectivity degree for each gene is re-
stricted to 5, although for some genes the database
give a possible larger cluster, since already for 5 the
average calculation time for one possible cluster is
with 71 s more than a minute. And if a gene may
depend on 11 genes, according to the database, with a
connectivity degree of 5 this results in
11
5
= 462 pos-
sible clusters, and thus in more than 546 minutes for
only one gene. In comparison with canalyzing con-
straints, one cluster takes 0.022s for a connectivity
degree of 5. For a connectivitydegree of 11, the maxi-
mum one found in the database, the identification with
canalyzing constraints takes 28.66×10
3
s.
A cutout of the identified network is shown in Fig-
ure 2 for both concentrations. In general the apoptotic
pathways consists of the extrinsic pathway and the in-
trinsic one. Here the extrinsic one is shown in de-
tail. The expectation, that the concentration of T5µM
leads to apoptosis, while that of T50nM does not, is
affirmed here. According to the database the extrinsic
pathway is triggered by engagements at the death lig-
ands, which activate caspase-8. That induces a signal-
ing cascade, resulting in an activation of caspase-3,
T5µM
Fadd
Tradd
Cflar
Capn1
Capn2
Casp8
Casp12
Casp3
Casp7
CAD
Dffb
Dffa
Casp6
T50nM
Fadd
Tradd
Cflar
Capn1
Capn2
Casp8
Casp12
Casp3
Casp7
CAD
Dffb
Dffa
Casp6
s = 0, v = 0 s = 0, v = 1
s = 1, v = 0 s = 1, v = 1
Figure 2: Identified extrinsic pathway for T5µM and
T50nM with given clustering constraints, (canalyzing func-
tions in red canalyzing functions, with no constraints in
black, that with minimum error is shown).
what leads to cell death. This can be seen for T5µM,
where caspase-8 is activated leading to and activation
of caspase-3. In Figure 2 the connections of impor-
tance here are marked in red. For T50nM there is
no connection between caspase-8 and caspase-3 de-
tected. The arcs with circled tail and triangular head,
denote the canalyzing genes, thus the major influenc-
ing one. If the tail is colored, its canalyzing value
is one, if the head is colored, the canalyzing value is
one, and zero otherwise. Thus, for the interconnec-
tion from caspase-8 to caspase-7 in the network of
T5µM this, e.g. means that an activation of caspase-
8 always activates caspase-7, irrespectively of other
genes, whereas a deactivation of Tradd always acti-
vates caspase-3.
An a posteriori analysis of the models identified
by the identification, where no canalyzing constraints
were imposed, shows that a significant ratio of identi-
fied models are canalyzing functions. These ratios of
canalyzing functions compared to all identified func-
tions for a certain connectivity degree are shown in
Figure 3. For comparison the overall ratios of canaly-
zing function in all Boolean functions, as calculated
from Table 1, are given. It is obvious that expect for
the connectivity degree of two the identified models