
straightforward, but it often lacks at least one of these 
properties. 
This  paper  proposes  an  approach  where  the 
domain experts are able to formulate their queries in 
the natural language sentences that contain particular 
keywords. The system would then translate the query 
into one or more valid queries in the Base Language 
that is also based on the natural language and that 
already  has  an  efficient  implementation.  The  Base 
Language has proven to be very easily readable by 
non-IT  specialists  (i.e.  the  domain  experts  of  the 
medical  management  domain).  Thus  the  domain 
expert  would  be  able  to  understand  the  translation 
results and to select the correct one, that is, the query 
he/she had intended to formulate. 
For it to be possible to implement such natural 
language-based querying, this paper proposes a data 
schema  called  the  Semistar  data  ontology  that 
alleviates  the  process  of  formulating  queries.  The 
practice  has  shown  that  such  a  data  structure  is 
prevalent  in  subject-oriented  domains  such  as 
hospital management. 
To  test  the  base  query  language,  a  tool  was 
developed that allows users to create queries and to 
receive  answers  to  them.  An  experiment  was 
conducted where the tool  was taught to a group of 
students. After having worked with the tool and the 
Base Language for some time, they acknowledged the 
language as very well readable. Therefore, it justifies 
the  approach  of  showing  the  list  of  the  query 
translation results in the Base Language back to the 
user so that he/she can point out to the correct one. As 
a result, the system learns from the user experience so 
that the correct query will have higher credibility (i.e., 
smaller entropy) next time. 
This  paper  describes  the  work  in  progress  that 
continues the work described in (Rencis, 2018-2). A 
prototype  implementing  the  natural  language-based 
querying  has  been  developed,  as  well  as  the 
calculation of  the  entropy  for  the query  translation 
results has been implemented. The user experience-
based learning is a part of the future work that has yet 
to be implemented. 
ACKNOWLEDGEMENTS 
This work is supported by the ERDF PostDoc Latvia 
project  Nr.  1.1.1.2/16/I/001  under  agreement  Nr. 
1.1.1.2/VIAA/1/16/218  “User  Experience-Based 
Generation  of  Ad-hoc  Queries  From  Arbitrary 
Keywords-Containing Text”. 
 
REFERENCES 
Androutsopoulos,  I.,  Ritchie,  G.  D.,  Thanisch,  P.  1995. 
Natural  language  interfaces  to  databases  –  an 
introduction. In: Natural Language Engineering, 1(1), 
29-81. DOI: 10.1017/S135132490000005X. 
Barzdins,  J.,  Rencis,  E.,  and  Sostaks,  A.  2014.  Data 
Ontologies and Ad Hoc Queries: a Case Study. In: H.M. 
Haav,  A.  Kalja,  T.  Robal  (Eds.)  Proc.  of  the  11th 
International Baltic Conference, Baltic DB&IS, 55-66, 
TUT Press. 
Barzdins,  J., Rencis, E., Sostaks, A.  2014.  Fast  Ad Hoc 
Queries Based on Data Ontologies. In: H.M. Haav, A. 
Kalja,  T.  Robal  (Eds.),  Frontiers  of  AI  and 
Applications,  Vol.  270,  Databases  and  Information 
Systems VIII, 43-56, IOS Press. 
Barzdins,  J.,  Grasmanis,  M.,  Rencis,  E.,  Sostaks,  A., 
Barzdins, J. 2016. Self-service Ad-hoc Querying Using 
Controlled  Natural  Language.  In:  G.  Arnicans  et  al. 
(Eds.)  Proc.  of  the  12th  International  Baltic 
Conference, Baltic DB&IS, 18-34, CCIS 615. 
Barzdins,  J.,  Grasmanis,  M.,  Rencis,  E.,  Sostaks,  A., 
Barzdins, J. 2016. Ad-hoc Querying of Semistar Data 
Ontologies  Using  Controlled  Natural  Language.  In: 
Frontiers  in  Artificial  Intelligence  and  Applications. 
Databases and Information Systems IX, Vol. 291, IOS 
Press, 3-16. DOI: 10.3233/978-1-61499-714-6-3. 
Barzdins,  J.,  Grasmanis,  M.,  Rencis,  E.,  Sostaks,  A., 
Steinsbekk,  A.  2016.  Towards  a  more  effective 
hospital: helping health professionals to learn from their 
own  practice  by  developing  an  easy  to  use  clinical 
processes  querying  language.  In:  International 
Conference  on  Health  and  Social  Care  Information 
Systems and Technologies, Procedia Computer Science 
Journal,  100,  498-506.  DOI:  10.1016/j.procs.2016. 
09.188. 
Chamberlin,  D.D.,  Boyce,  R.F.  1974.  SEQUEL:  A 
structured  English  query  language.  In:  Proc.  ACM 
SIGFIDET Workshop, Ann Arbor, Mich., 249-264. 
Densen,  P.  2011.  Challenges  and  opportunities  facing 
medical  education. In:  Transactions  of  the American 
Clinical and Climatological Association, 122, 48-58. 
Fei, L., Jagadish, H.V. 2014. NaLIR: An interactive natural 
language interface for querying relational databases. In: 
Proceedings  of  the  ACM  SIGMOD  International 
Conference  on  Management  of  Data.  DOI: 
10.1145/2588555.2594519. 
Gao, T., Dontcheva, M., Adar, E., Liu, Z., Karahalios, K.G. 
2015.  DataTone:  Managing  Ambiguity  in  Natural 
Language  Interfaces  for  Data  Visualization.  In: 
Proceedings of the 28th Annual ACM Symposium on 
User  Interface  Software  &  Technology  (UIST  '15). 
ACM,  New  York,  NY,  USA,  489-500.  DOI: 
10.1145/2807442.2807478. 
Kaleske, S. 2011. SAP Query Reporting – Practical Guide, 
Galileo Press. 
Li, F., Jagadish, H.  V. 2014. Constructing an  interactive 
natural language interface for relational databases. In: 
Journal  Proceedings  of the VLDB  Endowment,  8(1), 
73-84. 
User Experience-based Information Retrieval from Semistar Data Ontologies
425