straightforward, but it often lacks at least one of these
properties.
This paper proposes an approach where the
domain experts are able to formulate their queries in
the natural language sentences that contain particular
keywords. The system would then translate the query
into one or more valid queries in the Base Language
that is also based on the natural language and that
already has an efficient implementation. The Base
Language has proven to be very easily readable by
non-IT specialists (i.e. the domain experts of the
medical management domain). Thus the domain
expert would be able to understand the translation
results and to select the correct one, that is, the query
he/she had intended to formulate.
For it to be possible to implement such natural
language-based querying, this paper proposes a data
schema called the Semistar data ontology that
alleviates the process of formulating queries. The
practice has shown that such a data structure is
prevalent in subject-oriented domains such as
hospital management.
To test the base query language, a tool was
developed that allows users to create queries and to
receive answers to them. An experiment was
conducted where the tool was taught to a group of
students. After having worked with the tool and the
Base Language for some time, they acknowledged the
language as very well readable. Therefore, it justifies
the approach of showing the list of the query
translation results in the Base Language back to the
user so that he/she can point out to the correct one. As
a result, the system learns from the user experience so
that the correct query will have higher credibility (i.e.,
smaller entropy) next time.
This paper describes the work in progress that
continues the work described in (Rencis, 2018-2). A
prototype implementing the natural language-based
querying has been developed, as well as the
calculation of the entropy for the query translation
results has been implemented. The user experience-
based learning is a part of the future work that has yet
to be implemented.
ACKNOWLEDGEMENTS
This work is supported by the ERDF PostDoc Latvia
project Nr. 1.1.1.2/16/I/001 under agreement Nr.
1.1.1.2/VIAA/1/16/218 “User Experience-Based
Generation of Ad-hoc Queries From Arbitrary
Keywords-Containing Text”.
REFERENCES
Androutsopoulos, I., Ritchie, G. D., Thanisch, P. 1995.
Natural language interfaces to databases – an
introduction. In: Natural Language Engineering, 1(1),
29-81. DOI: 10.1017/S135132490000005X.
Barzdins, J., Rencis, E., and Sostaks, A. 2014. Data
Ontologies and Ad Hoc Queries: a Case Study. In: H.M.
Haav, A. Kalja, T. Robal (Eds.) Proc. of the 11th
International Baltic Conference, Baltic DB&IS, 55-66,
TUT Press.
Barzdins, J., Rencis, E., Sostaks, A. 2014. Fast Ad Hoc
Queries Based on Data Ontologies. In: H.M. Haav, A.
Kalja, T. Robal (Eds.), Frontiers of AI and
Applications, Vol. 270, Databases and Information
Systems VIII, 43-56, IOS Press.
Barzdins, J., Grasmanis, M., Rencis, E., Sostaks, A.,
Barzdins, J. 2016. Self-service Ad-hoc Querying Using
Controlled Natural Language. In: G. Arnicans et al.
(Eds.) Proc. of the 12th International Baltic
Conference, Baltic DB&IS, 18-34, CCIS 615.
Barzdins, J., Grasmanis, M., Rencis, E., Sostaks, A.,
Barzdins, J. 2016. Ad-hoc Querying of Semistar Data
Ontologies Using Controlled Natural Language. In:
Frontiers in Artificial Intelligence and Applications.
Databases and Information Systems IX, Vol. 291, IOS
Press, 3-16. DOI: 10.3233/978-1-61499-714-6-3.
Barzdins, J., Grasmanis, M., Rencis, E., Sostaks, A.,
Steinsbekk, A. 2016. Towards a more effective
hospital: helping health professionals to learn from their
own practice by developing an easy to use clinical
processes querying language. In: International
Conference on Health and Social Care Information
Systems and Technologies, Procedia Computer Science
Journal, 100, 498-506. DOI: 10.1016/j.procs.2016.
09.188.
Chamberlin, D.D., Boyce, R.F. 1974. SEQUEL: A
structured English query language. In: Proc. ACM
SIGFIDET Workshop, Ann Arbor, Mich., 249-264.
Densen, P. 2011. Challenges and opportunities facing
medical education. In: Transactions of the American
Clinical and Climatological Association, 122, 48-58.
Fei, L., Jagadish, H.V. 2014. NaLIR: An interactive natural
language interface for querying relational databases. In:
Proceedings of the ACM SIGMOD International
Conference on Management of Data. DOI:
10.1145/2588555.2594519.
Gao, T., Dontcheva, M., Adar, E., Liu, Z., Karahalios, K.G.
2015. DataTone: Managing Ambiguity in Natural
Language Interfaces for Data Visualization. In:
Proceedings of the 28th Annual ACM Symposium on
User Interface Software & Technology (UIST '15).
ACM, New York, NY, USA, 489-500. DOI:
10.1145/2807442.2807478.
Kaleske, S. 2011. SAP Query Reporting – Practical Guide,
Galileo Press.
Li, F., Jagadish, H. V. 2014. Constructing an interactive
natural language interface for relational databases. In:
Journal Proceedings of the VLDB Endowment, 8(1),
73-84.
User Experience-based Information Retrieval from Semistar Data Ontologies
425