number. For example, an integer is a number,
a float is a float, a decimal is a decimal, and a
hexadecimal is a hex. Data Types can also be
used to describe types of numbers. For instance,
if you have a number of digits, you can use a
Data Type to describe what those digits are, and
how they relate to each other. So, for example, if
I have a set of numbers, I can use Data Types to
describe how those numbers relate to one another.
3. No “Data Type” and “Python language” keyword
are used in the same paragraph in the textbook.
iTA still generates an explanatory response that
matches the context of the question. The re-
trieval model has extracted a passage which con-
tains “Data Type” tokens.
Example Question: What are Data Types in
Python language?
Generated Answer: Data types are a way of rep-
resenting data in a way that the programmer can
understand. For example, if you have a list of
numbers, you can use a data type to represent the
number in the list. For instance, if I have a set of
numbers and I want to know how many of those
numbers are in that set, I could use a DataType to
represent that number in a list. You can also use
Data Types to represent data in other ways, for ex-
ample if I wanted to know the age of a person, I
can use the DataType for that
5 CONCLUSIONS
Comprehending large documents is a challenging task
for current machine learning systems. Existing work
focuses on systems that can generate responses given
a small body of text to work with. Most available
generative-based response systems generate one or
two-word replies. Short replies are not always ac-
ceptable, and many questions that a user may want
to have answered will require more than one or two
words. iTA can produce a detailed, multi-sentence
response to user-generated questions by combining
a language comprehension module and a generative-
based response module. Noisy labels are reduced
while selecting relevant paragraphs. Finally, a gen-
erative model replies with an explanatory response.
6 FUTURE WORK
In the future, we will add more documents and allow
users to choose from which document to ask ques-
tions. Moreover, one of iTA’s modules uses LSTM. It
may be possible to significantly increase iTA’s perfor-
mance by using Transformer based methods instead
of LSTM. At present, the response time of iTA is
about 15 seconds. Improvement of the response time
will be the primary focus of future work.
REFERENCES
Adhikari, A. and John DeNero, B. (2019). The foundations
of data science.
Bajaj, P., Campos, D., Craswell, N., Deng, L., Gao, J., Liu,
X., Majumder, R., McNamara, A., Mitra, B., Nguyen,
T., Rosenberg, M., Song, X., Stoica, A., Tiwary, S.,
and Wang, T. (2018). Ms marco: A human generated
machine reading comprehension dataset.
Beltagy, I., Peters, M. E., and Cohan, A. (2020). Long-
former: The long-document transformer.
Cho, K., van Merrienboer, B., Gulcehre, C., Bahdanau, D.,
Bougares, F., Schwenk, H., and Bengio, Y. (2014).
Learning phrase representations using rnn encoder-
decoder for statistical machine translation.
Clark, C. and Gardner, M. (2017). Simple and effective
multi-paragraph reading comprehension.
Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K.
(2019). Bert: Pre-training of deep bidirectional trans-
formers for language understanding.
Donya Rooein, P. P. (2020). Adaptive conversations for
adaptive learning: Sustainable development of educa-
tional chatbots.
Fabio Catania, Micol Spitale, G. C. and Garzotto, F. (2020).
Conversational agents to promote children’s verbal
communication skills.
Fan, A., Jernite, Y., Perez, E., Grangier, D., Weston, J., and
Auli, M. (2019). Eli5: Long form question answering.
Hien, H. T., Cuong, P.-N., Nam, L. N. H., Nhung, H. L.
T. K., and Thang, L. D. (2018). Intelligent assis-
tants in higher-education environments: The fit-ebot,
a chatbot for administrative and learning support. In
Proceedings of the Ninth International Symposium on
Information and Communication Technology, SoICT
2018, page 69–76, New York, NY, USA. Association
for Computing Machinery.
Hobert, S. (2019). Say hello to ‘coding tutor’! design and
evaluation of a chatbot-based learning system support-
ing students to learn to program.
Joshi, M., Choi, E., Weld, D. S., and Zettlemoyer, L. (2017).
Triviaqa: A large scale distantly supervised challenge
dataset for reading comprehension.
Lewis, M., Liu, Y., Goyal, N., Ghazvininejad, M., Mo-
hamed, A., Levy, O., Stoyanov, V., and Zettlemoyer,
L. (2019). Bart: Denoising sequence-to-sequence pre-
training for natural language generation, translation,
and comprehension.
Papineni, K., Roukos, S., Ward, T., and Zhu, W. J. (2002).
Bleu: a method for automatic evaluation of machine
translation.
CSEDU 2021 - 13th International Conference on Computer Supported Education
280