Table 4: Ablation test on MultiWOZ 2.1 data set.
Feature Joint accuracy
GRU encoder(no memory) 46.55
Bert encoder(no memory) 46.62
Bert+Memory mechanism 47.27
5 CONCLUSION AND
DISCUSSION
In this paper we introduced a novel memory mech-
anism for a dialogue state tracking system. The core
contribution of our work is to incorporate the memory
architecture into the dialogue state tracking system.
We used a vector which will be updated at each turn
of the dialogue, so it will preserve useful historical
information in the model. This model outperforms a
set of benchmark models with joint goal accuracy on
both MultiWOZ 2.0 and MultiWOZ 2.1 data set.
In the domain-specific accuracy table, we can see
that the Hotel and Taxi domains are shown to be more
difficult compared with other domains. The Hotel do-
main has 11 slots to be filled which is the largest of
all domains, so it is reasonable that the Hotel domain
has a lower joint goal accuracy. For the Taxi domain,
as shown in the Appendix, the number possible val-
ues for its state slot is the highest among all domains,
which may lead to the low joint goal accuracy.
REFERENCES
Bohus, D. and Rudnicky, A. (2006). A “k hypotheses+
other” belief updating model.
Budzianowski, P., Wen, T.-H., Tseng, B.-H., Casanueva,
I., Ultes, S., Ramadan, O., and Ga
ˇ
si
´
c, M. (2018).
Multiwoz-a large-scale multi-domain wizard-of-oz
dataset for task-oriented dialogue modelling. arXiv
preprint arXiv:1810.00278.
Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K.
(2018). Bert: Pre-training of deep bidirectional trans-
formers for language understanding. arXiv preprint
arXiv:1810.04805.
Eric, M., Goel, R., Paul, S., Sethi, A., Agarwal, S., Gao,
S., and Hakkani-Tur, D. (2019). Multiwoz 2.1: Multi-
domain dialogue state corrections and state tracking
baselines. arXiv preprint arXiv:1907.01669.
Eric, M. and Manning, C. D. (2017). Key-value retrieval
networks for task-oriented dialogue. arXiv preprint
arXiv:1705.05414.
Gao, S., Sethi, A., Agarwal, S., Chung, T., and Hakkani-
Tur, D. (2019). Dialog state tracking: A neural
reading comprehension approach. arXiv preprint
arXiv:1908.01946.
Goel, R., Paul, S., and Hakkani-T
¨
ur, D. (2019). Hyst: A hy-
brid approach for flexible and accurate dialogue state
tracking. arXiv preprint arXiv:1907.00883.
Kanai, S., Fujiwara, Y., and Iwamura, S. (2017). Prevent-
ing gradient explosions in gated recurrent units. In
Advances in neural information processing systems,
pages 435–444.
Kumar, A., Irsoy, O., Ondruska, P., Iyyer, M., Bradbury,
J., Gulrajani, I., Zhong, V., Paulus, R., and Socher, R.
(2016). Ask me anything: Dynamic memory networks
for natural language processing. In International con-
ference on machine learning, pages 1378–1387.
Merity, S., Xiong, C., Bradbury, J., and Socher, R. (2016).
Pointer sentinel mixture models. arXiv preprint
arXiv:1609.07843.
Pennington, J., Socher, R., and Manning, C. D. (2014).
Glove: Global vectors for word representation. In
Proceedings of the 2014 conference on empirical
methods in natural language processing (EMNLP),
pages 1532–1543.
Qiu, L., Xiao, Y., Qu, Y., Zhou, H., Li, L., Zhang, W.,
and Yu, Y. (2019). Dynamically fused graph network
for multi-hop reasoning. In Proceedings of the 57th
Annual Meeting of the Association for Computational
Linguistics, pages 6140–6150.
Ramadan, O., Budzianowski, P., and Ga
ˇ
si
´
c, M. (2018).
Large-scale multi-domain belief tracking with knowl-
edge sharing. arXiv preprint arXiv:1807.06517.
Rosenfeld, A., Zuckerman, I., Segal-Halevi, E., Drein, O.,
and Kraus, S. (2014). Negochat: a chat-based ne-
gotiation agent. In Proceedings of the 2014 interna-
tional conference on Autonomous agents and multi-
agent systems, pages 525–532.
Sukhbaatar, S., Weston, J., Fergus, R., et al. (2015). End-
to-end memory networks. In Advances in neural in-
formation processing systems, pages 2440–2448.
Weston, J., Chopra, S., and Bordes, A. (2014). Memory
networks. arXiv preprint arXiv:1410.3916.
Wu, C.-S., Madotto, A., Hosseini-Asl, E., Xiong, C.,
Socher, R., and Fung, P. (2019). Transferable multi-
domain state generator for task-oriented dialogue sys-
tems. arXiv preprint arXiv:1905.08743.
Xu, P. and Hu, Q. (2018). An end-to-end approach for han-
dling unknown slot values in dialogue state tracking.
arXiv preprint arXiv:1805.01555.
Zhong, V., Xiong, C., and Socher, R. (2018). Global-locally
self-attentive encoder for dialogue state tracking. In
Proceedings of the 56th Annual Meeting of the Associ-
ation for Computational Linguistics (Volume 1: Long
Papers), pages 1458–1467.
Memory State Tracker: A Memory Network based Dialogue State Tracker
537