networks, a member of the family of neural
networks based on the adaptive resonance theory
(ART). The paper is organized as follows:
Section 1 introduces the bankruptcy prediction
problem and outlines previous research in that area.
Section 2 presents the ART neural networks and
discusses the ARTMAP-IC algorithm and features.
Section 3 describes the experimental data and the
preprocessing steps needed to transform data into a
form proper for submission to the neural network.
Section 4 discusses the experimental results and
outlines advantages of the proposed model.
2 ARTMAP-IC NEURAL
NETWORK CLASSIFIER
In an ART-based network, information reverberates
between the network’s layers. Learning is possible
in the network, when resonance of the neuronal
activity occurs. ART1 was developed to perform
clustering on binary-valued patterns. By
interconnecting two ART1 modules, ARTMAP was
the first ART-based architecture suited for
classification tasks. ARTMAP- IC adds to the basic
ARTMAP system new capabilities designed to solve
the problem with inconsistent cases, which arises in
prediction, where similar input vectors correspond to
cases with different outcomes, (Carpenter,
Grossberg, and Reynolds, 1991), (Carpenter and
Markuzon, 1998). It modifies the ARTMAP search
algorithm to allow the network to encode
inconsistent cases (IC).
Figure 1, adapted from (Carpenter and
Markuzon, 1998), shows the architecture of an
ARTMAP-IC network. It consist of fully connected
layers of nodes: an M-node input layer F1, an N-
node competitive layer F2, an N-node instance
counting layer F3, an L-node output layer F
0
b
, and
an L-node map field F
ab
that links F3 and F
0
b
. In
ARTMAP-IC an input a=(a
1
, a
2
, … , a
M
) learns to
predict an outcome b=(b
1
, b
2
, …, b
L
), , where only
one component b
K
=1, placing the input a in class K.
With fast learning,
β
=1, ARTMAP-IC represents
category K as hyper-rectangle
ℜ
K
that just encloses
all the training set patterns a to which it has been
assigned. A set of real weights W={w
ji
: j=1,…,N;
i=1,…,M} is associated with the F1 - F2 layer
connections. Each F2 node j represents a category in
the input space, and stores a prototype vector
w
j
=(w
j1
, w
j2
, …,w
jM
). The F2 layer is connected,
through associative links to F3, which in turn is
connected to the map field F
ab
by associative links
with binary weights W
ab
=(w
jk
ab
:j=1,…,N; k=1,…,L}.
The vector w
j
ab
=(w
j1
ab
, w
j2
ab
, …,w
jL
ab
) relates F2
node j to one of the L output classes. Instance
counting biases distributed predictions according to
the number of training set inputs classified by each
F2 node. During testing the F2->F3 input y
j
is
multiplied by the counting weight c
j
to produce
normalized F3 activity, which projects to the map
field F
ab
for prediction.
2.1 ARTMAP-IC Algorithm
The following algorithm describes the operation of
an ARTMAP-IC classifier in learning mode:
1. Initialisation: Initially, all the neurons of F2
are uncommitted, all weight values w
ji
are initialised
to 1, and all weight values w
jk
of F
ab
are set to 0.
2. Input pattern coding: When a training pair
(a,b) is presented to the network, a undergoes pre-
processing, and yields pattern A=(A
1
,A
2
,…,A
2M
). The
vigilance parameter ρ is reset to its baseline value.
3. Prototype selection: Pattern A activates layer
F1 and is propagated through weighted connections
W to layer F2. Activation of each node j in the F2
layer is determined by the choice function
T
j
(A)=|A
∧
w
j
|/(
α
+|w
j
|). The F2 layer produces a
winner-take-all pattern of activity y=(y
1
,y
2
,…,y
N
)
such that only node j=J with the greatest activation
value remains active (y
J
=1). Node J propagates its
prototype vector w
J
back onto F1 and the vigilance
test |A
∧
w
j
|
≥ρ
M is performed. This test compares the
degree of match between w
J
and A to the vigilance
parameter
ρ∈
[0,1]. If this test is satisfied, node J
remains active and resonance is said to occur.
Otherwise, the network inhibits the active F2 node
and searches for another node J that passes the
vigilance test. If such a node does not exist, an
uncommitted F2 node becomes active and
undergoes learning (step 5).
4. Class prediction: Pattern b is fed directly to
the map field F
ab
, while the F2 activity pattern y is
propagated to the map field via associative
connections W
ab
. The latter input activates F
ab
nodes
according to the prediction function
∑
=
=
N
j
ab
jkj
ab
k
wyyS
1
)(
and the most active F
ab
node K yields the class
prediction (K=k(J)). If node K constitutes an
incorrect class prediction, a match tracking signal
raises vigilance just enough to induce another search
among F2 nodes (step 3). This search continues until
either an uncommitted F2 node becomes active
(learning ensues at step 5), or a node J that has
ICEIS 2008 - International Conference on Enterprise Information Systems
168