performing a search of scientific articles in databases,
is faced with the problem of having a very large
number of documents, but much of these are not
actually useful and do not attend the user’s need.
It was then created an ontology and a search robot,
and the connection between them was established so
that the the initial goal was achieved.
For testing, in a way to assess the actual operation
of this process, the search robot was implemented
with the ability to extract articles from the IEEE
Xplore database, and the ontology has been built with
the field of database discipline.
After testing, it was observed that the use of
ontology for the search agent is an effective way to
obtain valuable information and be able to meet the
informational needs of the user.
The ontology can be effective in this case, because
it becomes a way of organizing semantic information,
and in this manner, only significant information will
be presented to the user.
Although the Semantic Web term has already
been used for some years, there is still a limitation in
their use, because much of the Web is organized in a
syntactic form in which most pages are created so that
only humans can read what is written without being
structured in a way that computational agents can
extract the data inside a context with the implied
meaning in the HTML.
The extraction agent can extract the documents
from the web, and a program can process information
by using onthology, thereby presenting the most
relevant results.
In this manner, the results obtained by using the
prototype developed can substantially narrow down
the amount of items presented to users. This research
aims, therefore, at making the user get, in a process of
Information Retrieval, more significant, quality
results. Thus, the user can evaluate more information
that is meaningful and does not waste time with data
that does not meet their needs.
Therefore, in order to address the issue of how to
insert intelligence in the recovery of web pages which
do not contextualize their information, the present
research proposes that the process of adding
semantics to these pages takes place outside the Web,
that is, the extraction of pages occurs in a syntactic
way, and from what was extracted, information is
checked by semantically entering into this process.
This method was very efficient since it is in fact able
to make a smarter search, which goes beyond simple
formulas of searches, which observe only the syntax
of the texts, and is able to analyze the context in which
the extracted documents are inserted, and then
visualize if that document fulfill the user’s needs.
ACKNOWLEDGEMENTS
The work presented in the paper was supported by the
CAPES and FAPESP (Fundação de Amparo a
Pesquisa do Estado de São Paulo), process
2015/01517-2.
REFERENCES
Coneglian, Caio Saraiva, and Elvis Fusco. "Recuperação da
Informação em Ambientes Semânticos: uma ferramenta
aplicada à publicações científicas." Journal on
Advances in Theoretical and Applied Informatics
(ISSN 2447-5033) 1.1 (2015): 30-37.
Coneglian, Caio Saraiva, and Elvis Fusco. Semantic Agent
of Informational Extraction on Big Data Ontological
Context, eKNOW 2014: The Sixth Internationl
Conference on information, Process, and Knowledge
Management (2014): 34-37.
Coneglian, Caio Saraiva, Elvis Fusco and Leonardo de
Castro Botega. Using Ontological Semantic Agent to
Data Extraction in the Context of Big Data: a tool
applied to Higher Education Institutions. Proceedings
of the International Conference on Semantic Web and
Web Services (SWWS) (2014): 1-6.
Fleisher, Craig S., Blenkhorn, David L., “Managing
Frontiers in Competitive Intelligence”. Greenwood
Publishing Group, 2001.
Graham-Rowe, D., et. al. “Big data: science in the petabyte
era”. Nature, 455, 1-50, 2008.
Deters, Janice I., Adaime, Silsomar F., “Um estudo
comparativo dos sistemas de busca na web” (“A
comparative study of search systems on the web”),
Anais do V Encontro de Estudantes de Informática do
Tocantins. Palmas, TO, 189-200, 2003.
Douglas, Laney. "The Importance of ‘Big Data’: A
Definition." Gartner (June 2012), 2012.
Silva, Tércio. M. S., “Extração De Informação Para Busca
Semântica Na Web Baseada Em Ontologias”
(“Information Extraction for Semantic Search In Web
Based On Ontology”). Florianópolis, 2003.
Noy, Natalya F., McGuiness, Deborah L., “Ontology
Development 101: A Guide to Creating Your First
Ontology”.
<http://www.ksl.stanford.edu/people/dlm/papers/ontol
ogy-tutorial-noy-mcguinness-abstract.html>
[retrieved: 09/27/2015], 2001.
Mizoguchi, Riichiro, “Tutorial on Ontological
Engineering”. NEW GENERATION COMPUTING-
TOKYO- 21.4, 363-364, 2003.
Protégé. Stanford University. <http://protege.stanford.
edu/> [retrieved: 10/10/2015].
Owl2Java.<http://www.incunabulum.de/projects/it/owl2ja
va> [retrieved: 11/14/2015].