2.2 Convention based Approaches
To simplify, a dialog can be considered as a protocol
represented by finite state automata in which
transitions are the possible speech acts of the dialog.
The agent has no internal representation. These
approaches are rather rigid even if some of them
(Sitter and Stein, 1992) use recursive automata.
Another conventional model (Lewis, 1979)
consists in representing information shared during
the dialog (called “common ground”) in a
conversational board. This theory is more
descriptive than predictive and thus is difficult to
integrate into a dialog system.
2.3 Mixed Approaches
Dialog games (Levin and Moore, 1980) are
interested in social conventions between utterances.
They use structures, games for which interactions
are precisely described. Games are stereotypes that
model a communicational situation.
The QUD (Questions Under Discussion) model,
proposed by (Ginzburg, 1996) and totally
implemented in the GoDiS system (Larsson, 2002),
takes into account mainly the transmission of
missing information. The dialog uses both a
conversational board and internal representation of
the agent. This approach is mainly based on the
questions and their responses. Each speech act
(enunciated by the user or the system) modifies the
“information state” (IS), comprising a private part
and a public part.
With the “grounding” theory, (Traum, 1994)
proposes 5 modalities according to which an
utterance is grounded: perception, contact, semantic
understanding, pragmatic understanding, integration.
For each modality, there are speech acts of positive
(resp. negative) grounding if this modality is (resp.
is not) grounded. For example, if the perception is
grounded but not the semantic understanding, the
system can produce a repeating of the utterance to
show that it has been heard and then it can say a
speech act like “not understood”.
This approach is highly capable when it is added
with accommodation effects (Lewis, 1979) like in
GoDiS. When user utterances do not match with the
current plans, the system loads a new relevant plan
to this utterance. Plans can be performed in parallel.
3 CORPUS COLLECTION
At first, we wanted to model the reasoning of the
CISMeF chief librarian, when he was searching in
the CISMeF system. He was asked five questions
from health professionals and his answers have been
recorded. These records showed that the CISMeF
chief librarian has a complete understanding of the
user’s intention and suggests optimal queries.
However, he does not need to converse with the user
to understand his inquiry. We had thus to set up a
new experimentation dealing with the recording of
dialogue between a CISMeF expert and a user.
The users were voluntary members of the LITIS
laboratory (secretary, PhD students, researchers and
teachers) who wanted to obtain responses about
medical inquiries. The experts were two members of
our project, trained to the CISMeF system and
terminology. The experimentation took place as
follows: one expert and one user were facing a
computer using the advanced search interface of the
system and recording all the queries with their
answers in a log. The expert was in charge of
conducting the search by conversing with the user
and verbalizing each action, inquiry and answer. The
experimentation ended when relevant documents
were given to the user or when it seemed that no
answer existed in the system. A textual corpus was
constituted from the transcription of the twenty-one
dialogues recorded.
Moreover, following this experimentation, we
asked the CISMeF chief librarian to answer the
users’ inquiries and to verbalize his search process.
The verbal occurrences were also recorded. Our aim
was to obtain optimal queries to these questions
using the CISMeF terminology. They provide
explanations about the strategies adopted by the
chief librarian.
4 ANALYSIS OF THE CORPUS
We have hand-analyzed the textual corpus. During
the conversations, experts tried to keep control of the
dialog by making the user repeat and confirm his
utterances to avoid ambiguity or contestation. Many
discursive tags (agreement, question, suggestion,
refusal…) lead to interaction. Several iterative loops
ensure the continuity of the dialog.
This analysis brings out a global structure of
dialogs broken down into sub-dialogs and it allows
to build a list of speech acts observed in the corpus.
4.1 Global Structure of Dialogs
In the dialogs, there are a lot of comings and goings
between the initial query of the user and the answers
of the system depending on the results. Moreover,
dialogs can be divided into sub-dialogs. Figure 1
ICEIS 2008 - International Conference on Enterprise Information Systems
228