6 CONCLUSIONS
The present work analyzed functional issues that are
related to the keyword query processing problem.
Such issues represent important aspects that make
such a problem challenging and complex.
The problem involves submitting keyword-based
queries — using an appropriate interface for non-
expert users — to access the content of heterogeneous
data sources with respect to the database category and
the plurality of idioms. Unlike other sources of data
accessible on the Internet, such as documents of pub-
lic content, the problem reaches sensitive databases,
which are usually hidden on the Web.
The main contribution of the work is to advance
the characterization and formalization of the problem
in functional terms, such that it promotes a greater un-
derstanding of its functional requirements and a better
perception of its complexity. Some specific contribu-
tions of this work include:
• evolution of functional issues present in the litera-
ture, by bringing more details and perspectives for
understanding as well as adding new issues not yet
addressed;
• abstraction of problem objects that are actually
necessary responsibilities for modeling the func-
tional requirements;
• construction of a software engineering artifact -
UML Sequence Diagram - describing the prob-
lem domain objects and the interactions between
them; such a diagram also eases keyword-based
query comprehension;
• introduction of definitions in the problem domain
to add more formalism, extend existing terminol-
ogy and promote new perspectives on the func-
tional issues.
Threats to validity are addressed by: (i) building a
software engineering artifact, as it is a systematic rep-
resentation; (ii) grounding the results on functional is-
sues introduced mainly by the research area; and (iii)
producing partial artifacts by random pairs of authors,
and integrating them to consolidate the final artifact of
the complete problem by all authors.
Further studies will refine the functional require-
ments and build (specialise) artifacts tailored to the
solution domain.
REFERENCES
Bergamaschi, S., Domnori, E., Guerra, F., Lado, R. T., and
Velegrakis, Y. (2011). Keyword Search over Rela-
tional Databases: A Metadata Approach. In Proceed-
ings of the ACM SIGMOD International Conference
on Management of Data, pages 565–576. ACM.
Bergamaschi, S., Ferro, N., Guerra, F., and Silvello, G.
(2016). Keyword-Based Search Over Databases: A
Roadmap for a Reference Architecture Paired with an
Evaluation Framework, pages 1–20. Springer Berlin
Heidelberg.
He, H., Wang, H., Yang, J., and Yu, P. S. (2007). Blinks:
Ranked keyword searches on graphs. In Proceed-
ings of the ACM SIGMOD International Conference
on Management of Data, pages 305–316. ACM.
Hormozi, N. (2019). Disambiguation and Result Expan-
sion in Keyword Search Over Relational Databases.
In Proceedings of the IEEE International Conference
on Data Engineering, pages 2101–2105. IEEE Com-
puter Society.
Hristidis, V. and Papakonstantinou, Y. (2002). DISCOVER:
Keyword Search in Relational Databases. In Proceed-
ings of the International Conference on Very Large
Data Bases, pages 670–681. Morgan Kaufmann.
IEEE-Computer-Society, Bourque, P., and Fairley, R. E.
(2014). Guide to the Software Engineering Body of
Knowledge SWEBOK Version 3.0. IEEE Computer
Society Press, Los Alamitos, CA, USA.
Kargar, M., An, A., Cercone, N., Godfrey, P., Szlichta,
J., and Yu, X. (2015). Meaningful Keyword Search
in Relational Databases with Large and Complex
Schema. In Proceedings of the IEEE International
Conference on Data Engineering, pages 411–422.
IEEE Computer Society.
Li, G., Ooi, B. C., Feng, J., Wang, J., and Zhou, L. (2008).
Ease: An effective 3-in-1 keyword search method
for unstructured, semi-structured and structured data.
In Proceedings of the ACM SIGMOD International
Conference on Management of Data, page 903–914.
ACM.
Luo, Y., Wang, W., and Lin, X. (2008). SPARK: A keyword
search engine on relational databases. In Proceedings
of the IEEE International Conference on Data Engi-
neering, pages 1552–1555. IEEE Computer Society.
Pu, K. Q. and Yu, X. (2008). Keyword query cleaning.
Proceedings of the International Conference on Very
Large Data Bases, page 909–920.
Pu, K. Q. and Yu, X. (2009). FRISK: keyword query clean-
ing and processing in action. In Proceedings of the
IEEE International Conference on Data Engineering,
pages 1531–1534. IEEE Computer Society.
Ramada, M. S., da Silva, J. C., and de S
´
a Leit
˜
ao-J
´
unior, P.
(2020). From keywords to relational database content:
A semantic mapping method. Information Systems,
88:101460.
Sayyadian, M., LeKhac, H., Doan, A., and Gravano, L.
(2007). Efficient keyword search across heteroge-
neous relational databases. In Proceedings of the
IEEE International Conference on Data Engineering,
pages 346–355. IEEE.
On Functional Requirements for Keyword-based Query over Heterogeneous Databases on the Web
231