afterwards to ignore the correct part and focus on the
misunderstanding.
67% of the utterances after a misunderstanding
included two intents. One regarding the
misunderstanding and the other one an answer to the
question. During the interview the participants
confirmed that using a MI is nothing uncommon. The
main issue, mentioned by 23% of the participants
was, that they were not used talking to a system in
such a natural way. A common statement included: “I
wouldn't have used Multi-Intents because I don't think
a system can handle that.”
5 CONCLUSION
If there is a complicated task like resolving a
misunderstanding users should be able to drop all the
other topics and focus on the important task if needed.
The need for such a mechanism was mentioned
during the interview by 61% of the participants. If the
user interrupts the system and drops a topic the
system should allow the interruption and proactively
raise the dropped topic again after the important task
is finished. Another possibility which was mentioned
during the interview is, that the system choses one
topic to focus on and postpone the other one. But the
decision has to be logical and comprehensible.
Users have no difficulties using MIs while talking
to a simulated SDS. They even used MIs to solve
misunderstandings and talk about other things in one
turn. To maintain a consistent dialogue flow, an
adequate meta-dialogue is a useful mechanism. The
results presented in this work emphasise this
statement and also indicate that a spoken meta-
dialogue explaining what topic is about to be started,
is preferred.
A visual realization of this additional information
achieved a similar good rating, but is cognitively
more demanding. In addition, a graphical
representation for meta-dialogue is impracticable in
scenarios with a visually demanding main task like
driving a car.
6 FUTURE WORK
Despite the usefulness of MIs it seems that if the
system uses MIs, too, to add topics or to try to clarify
multiple topics at once, the whole dialogue becomes
cognitive very demanding. In order to reduce the
general high cognitive load, a system-side
prioritisation of one intent could be useful, if the
prioritisation is logical and comprehensible for the
user. In future research, we will take a further look
into prioritising one intent if multiple intents occur in
a single utterance. We will investigate which
parameters can be used to prioritize a topic and how
the dialog and meta-dialog should be designed to
enable intuitive and easily accessible communication
for the user.
REFERENCES
Lemon, O. et al. (2002): Multi-tasking and collaborative
activities in dialogue systems. In Proceedings of the
Third SIGdial Workshop on Discourse and Dialogue.
Bunt, H. (2011). Multifunctionality in dialogue. Computer
Speech & Language, 25(2), pp. 222-245.
Wahlster, W. et al. (1983): Over-answering yes-no
questions: Extended responses in a NL interface to a
vision system. In Proceedings of the Eighth
international joint conference on Artificial intelligence,
pp. 643-646.
Xu, P., & Sarikaya, R. (2013): Exploiting shared
information for multi-intent natural language sentence
classification. In Interspeech, pp. 3785-3789.
Kim, B., Ryu, S., & Lee, G. G. (2017): Two-stage multi-
intent detection for spoken language understanding.
Multimedia Tools and Applications, 76(9), pp. 11377-
11390.
Beaver, I., Freeman, C., & Mueen, A. (2017): An
Annotated Corpus of Relational Strategies in Customer
Service. arXiv:1708.05449.
Clark, H. H., & Brennan, S. E. (1991): Grounding in
communication. Perspectives on socially shared
cognition, 13, pp. 127-149.
Skantze, G. (2003): Exploring human error handling
strategies: Implications for spoken dialogue systems. In
ISCA Tutorial and Research Workshop on Error
Handling in Spoken Dialogue Systems.
Georgiladakis, S. et al. (2016): Root Cause Analysis of
Miscommunication Hotspots in Spoken Dialogue
Systems. In INTERSPEECH, pp. 1156-1160.
Heinroth, T., Koleva, S., & Minker, W. (2011). Topic
switching strategies for spoken dialogue systems. In
Twelfth Annual Conference of the International Speech
Communication Association.
Sidner, C. et al. (2005): Explorations in engagement for
humans and robots. Artificial Intelligence, 166(1-2),
pp. 140-164.
Glas, N., & Pelachaud, C. (2015): Topic Transition
Strategies for an Information-Giving Agent. In
European Workshop on Natural Language Generation
(ENLG), pp. 146-155.
Dahlbäck, N., Jönsson, A., & Ahrenberg, L. (1993): Wizard
of Oz studies—why and how. Knowledge-based
systems, 6(4), pp. 258-266.
Finding a Meta-dialogue Strategy for Multi-intent Spoken Dialogue Systems
175