difference between discrete and continuous attributes
for each extracted rule.
Another approach of the Re-RX algorithm,
proposed by Chakraborty et al. (Chakraborty, 2018),
is Reverse Engineering Recursive Rule Extraction
(RE-Re-RX). Re-RX cannot achieve the accuracy of
NNs when the output is described as a non-linear
function of continuous attributes, because Re-RX
creates a linear hyperplane for such algorithms.
Because of that, RE-Re-RX uses attribute data angel
to create rules with continuous attributes. As a result,
the rules are simple and understandable and the
nonlinearity on data is managed.
The algorithm described in this paper combines
parts from each of the above-mentioned algorithms.
Data pruning is reduced to a minimum, so that the
final result can be as accurate as possible, as well as
simple and easy to understand. The algorithm also
contains a part that has not been introduced in any of
the previous works. It is a combination of different
approaches for the inclusion or exclusion of rules.
Neural networks can have various applications in
finance. The current application of the observed
network concerns Rating/Scoring systems. In recent
past, automated calculations of credit score have been
developed, providing sophisticated and accrued
results for banking professionals in risk management.
They enable the detection of hidden patterns, which
can be hard to identify for bank experts. Automated
lending risk calculations are currently not being
perfected and incorrectness of the credit score might
lead to the loss of resources in terms of default.
Artificial intelligence (AI) methods have not been
integrated into such systems, because they are
considered too complex and require a lot of time and
effort for result interpretation. This paper introduces
several methods that will enforce the clarification and
interpretation of the NN logic.
3 THE OBSERVED NEURAL
NETWORK
There are different approaches in the field of artificial
intelligence (AI), each with its own characteristics,
advantages and disadvantages. Generally speaking
we can distinguish two main subfields – symbolic and
non-symbolic approaches. Symbolic approach is
based on production systems with rules and facts.
They are mainly used in cases when the theory of
the problem is known and can be described in the
form of sets with mutually dependent rules. Examples
of such systems include medical or technical
diagnosis, monitoring, repairing, etc. One of the
advantages of this approach is that their inference
process can be explained in a way that is
understandable to humans. For this purpose, it is
possible for fired rules to be extracted with the data
that matched their antecedents and the actions
performed in their consequences.
Non-symbolic approaches are more convenient
for problems with no information on the theory of the
problem, but with available examples. It includes
examples such as functional approximation, patterns
recognition, forecasting based on historical data,
voice processing and recognition, etc. One of its main
disadvantages is that, unlike the symbolic approach,
it is not able to explain its inference and reasoning
process. In practical terms this matters greatly, as it is
often necessary for users to have the required
background knowledge. For that reason, the problem
is addressed here by extracting observable and
explainable rules, thereby retaining the symbolic
approach characteristics from a trained neural
network that is in the sub-symbolic approach.
3.1 Training Network
The neural network used here is a multilayer
perceptron, which is a feed forward neural network
and one of the most commonly used neural networks
in practice. The training patterns represent input-
output vectors that are applied in the training stage. In
the generation stage, only input vectors are used and
the trained neural network produces outputs for them.
The training stage consists of two other stages:
forward and backward stage. Figure 1 illustrates the
structure of the network.
Figure 1: The structure of the backpropagation neural
network.
Input vectors are represented as x and output
vectors as y. The number of input neurons is A, of
hidden neurons B and of outputs C. Symbol x
i
is used
for the output values of input neurons, h
j
for hidden
neurons, and o
k
for outputs. Index i represents input
…
Input
neurons
Hidden
neurons
Output
neurons