Question and Answer Classification in Czech Question Answering Benchmark Dataset

Daša Kušniráková, Marek Medved, Aleš Horák

Abstract

In this paper, we introduce a new updated version of the Czech Question Answering database SQAD v2.1 (Simple Question Answering Database) with the update being devoted to improved question and answer classification. The SQAD v2.1 database contains more than 8,500 question-answer pairs with all appropriate metadata for QA training and evaluation. We present the details and changes in the database structure as well as a new algorithm for detecting the question type and the actual answer type from the text of the question. The algorithm is evaluated with more than 4,000 question answer pairs reaching the F1-measure of 88% for question typed and 85% for answer type detection.

Download


Paper Citation


in Harvard Style

Kušniráková D., Medved M. and Horák A. (2019). Question and Answer Classification in Czech Question Answering Benchmark Dataset.In Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-350-6, pages 701-706. DOI: 10.5220/0007396907010706


in Bibtex Style

@conference{icaart19,
author={Daša Kušniráková and Marek Medved and Aleš Horák},
title={Question and Answer Classification in Czech Question Answering Benchmark Dataset},
booktitle={Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2019},
pages={701-706},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007396907010706},
isbn={978-989-758-350-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - Question and Answer Classification in Czech Question Answering Benchmark Dataset
SN - 978-989-758-350-6
AU - Kušniráková D.
AU - Medved M.
AU - Horák A.
PY - 2019
SP - 701
EP - 706
DO - 10.5220/0007396907010706