machine. In this paper, we investigate the structure
of our web-based application.
The next section is devoted to a general presentation
of the advantages e-technology can bring in the
context of machine learning and especially in bio-
informatics. Section 3 briefly recall the main
philosophy of inductive logic programming, as a
sub-topic of machine learning. In section 4, we
describe how we process the data, using XML as a
backbone format, and we present the general
architecture of our system. We conclude in section5.
2 E-TECHNOLOGY
E- Technology has made it possible to carry out lot
of activities, previously locally achieved and are
remotely performed. All these activities constitute
what is usually called E-business. Government and
organisation have come to realise the value of e-
technology hence such projects i.e. e-learning, e-
government, e-commerce, e-health are being
implemented globally. In some sense, bio-
informatics aims to solve biological problems using
computing methods. That is why we think it is
essential to make bio-informatics benefit of these
techniques because the bio-informatics definitions
are changing in accordance with the development of
new areas in science. BITE (UK), IRIT (France),
LAAS (France) and NII (Japan) focus together on
maturing technology to globalise development and
research of bio-informatics.
A global platform [farmer and al.2004],
integrating grid computing, programming tools and
data visualization technologies would enable
scientists to gain greater insight into their research
through direct comparisons of simulations,
experiments and observations. Knowledge
repositories maintaining relationships, concepts, and
inference rules would be accessible through such a
global platform. Starting from these ideas, we
develop a web-based architecture around a machine
learning application : the target machinery is an
Inductive Logic Programming (ILP) system and the
target application is a bio-process leading to ethanol.
The next sections are devoted to a more precise
investigation of these two fields.
But it should be clear that our scheme is open
and that there is no conceptual obstacle to integrate
other inductive engines as well as there is no
obstacle to focus on other fields than bio-informatics
Starting from these ideas, we develop a web-
based architecture around a machine learning
application : the target machinery is an Inductive
Logic Programming (ILP) system and the target
application is a bio-process leading to ethanol. The
next sections are devoted to a more precise
investigation of these two fields.
But it should be clear that our scheme is open and
that there is no conceptual obstacle to integrate other
inductive engines as well as there is no obstacle to
focus on other fields than bio-informatics.
3 INDUCTIVE LOGIC
PROGRAMMING: A BRIEF
REVIEW
Inductive Logic Programming (ILP) is the part of
machine learning where the underlying model is
described in term of first-order logic. We briefly
outline here the standard definitions and notations.
Given a first-order language L with a set of variables
Var, we build the set of terms Term, atoms Atom and
formulas as usual. The set of ground terms is the
Herbrand universe H and the set of ground atoms or
facts is the Herbrand base B. A literal l is just an
atom a (positive literal) or its negation ~a (negative
literal). A substitution s is an application from Var
to Term with inductive extension to Atom. We
denote Subst the set of ground substitutions.
A clause is a finite disjunction of literals and a
Horn clause is a clause with at most one positive
literal. A Herbrand interpretation I is just a subset of
B : I is the set of true ground atomic formulas and its
complementary denotes the set of false ground
atomic formulas.
We can now proceed with the notion of logical
consequence. Given A an atomic formula, I, s |- A
means that s(A) belongs to I. As usual, the
extension to general formulas F uses
compositionality.
• I |- F means : forall s, I, s |- F (we say I is
a model of F).
• |- F means : forall I, I |- F.
• F |- G means that all models of F are
models of G.
Stated in the general context of first-order logic,
the task of induction is to find a set of formulas H
such that:
B U H |- E
Given a background theory B and a set of
observations E (training set), where E, B and H here
denote sets of clauses. In this paper, E is always
ICEIS 2005 - ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS
358