This paper deals with a first step towards the
recognition of intention within scientific
publications in the computer science field. The idea
is that given a model of intention defined in (Al-
Tawki, 2002), we aim at finding a methodology of
matching a text to this model. At the progress state
of this work, the recognition is made “by hand”; we
have found certain regularities in the studied corpus
which constitute the main new result presented here.
The objective of the research conducted here is to
formalize these findings and to implement
algorithms to automate as much as possible the
process of segmentation.
This paper is organized as follows. The next
section presents an overview related to the concepts
of intention in written communication, and to the
field of text segmentation. In section 3, we present
the model of intentional structure. Finally, the article
finishes by a discussion of future developments.
2 STATE OF THE ART
The research undertaken here aims at analyzing
authorial intentions. Each text is segmented into
fragments that correspond to the recognized
intentions associated with these fragments. This
work is related to two main fields: intention and text
segmentation.
2.1 Intention
The concept of intention is omnipresent in any
human action and is particularly important in
communication. Several works attempt to account
for the relations between an action undertaken by a
human being and the mental state which guides this
action. Searle remains a main reference on the
matter (Searle, 1983). He distinguishes between two
types of intention: intentions in the course of action
and former, or pre-formulated, intentions. Former
intentions correspond to the representation of the
initial goal fixed before the beginning of the action
whereas the intention in the course of action
accompanies the action during its execution. This
distinction makes it possible to treat only intentional
actions, and not the "micro actions", or the
movements which are not inevitably intentional.
Intentions in the course of action are those which
represent these intentions, whereas the former
intentions represent a condition of satisfaction of the
intention (Pacherie 2003). Writing is an intentional
action; its characteristic is that it represents two
types of action, the physical actions of using a
medium to transcribe thought by writing and the
actions which aim at modifying the mental state of
the reader, by transmitting information, knowledge,
advice or orders to him. This second type of action
can be accomplished or not depending on the
receiver of the written text: the reader. The concept
of associating intentions with segments of document
was initiated by Grosz and Sidner in (Grosz, Sidner,
1986). This concept consists in describing the
intentions of the author for each segment of the
document. This description will help to read and
consult the document in terms of the intentions of
the author. These intentions are added in the form of
annotations to the documents marked out with XML
tags for example.
According to the theory of Grosz and Sidner
(Grosz, Sidner, 1986), the intentional structure
makes it possible to represent the structure of the
goals. The subjacent objective allows the recognition
of the intentions of the author by the reader. These
authors identified two structural relations between
intentions, fundamental for the analysis of the
structure of the discourse at a basic level: the
relation of dominance and the relation of satisfaction
precedence: an intention I1 dominates an intention
I2 if the satisfaction of I2 contributes to that of I1
and an intention I1 precedes (the satisfaction of) I2 if
I1 must be satisfied before I2. It is not certain if
these two relations are sufficient, on a pragmatic
level, to describe the production process of a
discourse effectively, because what is interesting in
this case, is to be able to associate a finer direction
to the relations between various parts of discourse.
However, the two relations between intentions
suggested by Grosz and Sidner do not account for
the large variety of these intentions and may imply
loss of semantics. On the other hand, this theory is
built so as to depend neither on the domain, nor of
the type of the discourse. Indeed, the studies on the
modeling of the intention derive from the causal
theories of the action. To describe an intention is to
find a rational explanation of the action which was
caused by this intention. This explanation depends
on the context in which the action can be performed.
The concept of association of the intentional
structures is a concept which consists in describing
the intentions of the author for each segment of the
document. This description will be able to help to
browse the document in terms of authorial
intentions.
ICEIS 2007 - International Conference on Enterprise Information Systems
490