of the ontology of concepts and provides input items
for this domain to be mapped and towards obtaining
the data that is found in each Curriculum Lattes of the
researchers studied.
In the development of the application, already-
existing functionalities were re-used, to extract the
curriculum from the Lattes platform and to create the
ontology. The ontological file is read and consulted in
order to obtain the bibliographical works considered
in the calculation of the similarity, amongst the indivi-
duals selected. The result is exported on a spreadsheet
that lists the profiles under comparison, and displays
a percentage of adherence as found amongst them.
Tests were carried out to verify and validate the fi-
gures obtained. The set of tests was controlled with
the knowledge of the set of profiles that would be
compared. The goal was to attest whether the figures
actually obtained matched that which were observed
in the real world.
In order to better explain the point of this work, it
is structured as follows. Section 2 presents the pro-
posed model, with the basis of the proposal, the pro-
posal in itself, the mathematical formulation, and the
implementation of the algorithm. Section 3 descri-
bes the implementation and the tests that were run. It
also presents the tables, as originated in the execution
of the application, along with the final result and the
percentages of similarity for the comparisons. Finally,
Section 4 concludes the work.
2 MODEL FOR SIMILARITY
AMONGST RESEARCHERS
The model proposed is based on the use of the plat-
form found in the Lattes academic record database as
its data source to obtain information on the academic
careers of people, aimed at making comparisons via
an algorithm. In possession of such data, one can con-
solidate inferences as found amongst the individuals
found in the domain that the Lattes database is about.
In the academic community, both the teaching
staff as the students corpus can have their Curriculum
Lattes, as a way to build a portfolio on one’s academic
path, for different ends. The Curriculum Lattes base,
as found in the Web, holds varied information on the
academic career of any of the individuals registered.
Based on this data a strategy was devised to create an
ontology that would represent this model, as expres-
sed in the Lattes database (Galego and Renata, 2013).
This strategy also included a manner for extracting
data that would allow a relationship amongst the indi-
viduals found in the database. Such a drive stemmed
from a few questions we wanted to see answered:
1. Is it possible to establish a link between different
individuals that have not met, based on the life
they lead in the Academy?
2. Is it possible to establish a quantitative approach
through calculation of how much similarity there
is in a comparison made of individuals hitherto
unknown to each other?
3. Is it possible to make this line of thought automa-
tic with the use of an algorithm? That is, there
is a possibility for drawing aspects that are simi-
lar amongst individuals and to have the process
for that running on a machine, to obtain an index
that allows assessing whether a person is similar
or not to another one, considering the aspects of
one’s trajectory in the Academy.
In the analysis of the Lattes database it is possible
to see that several fields separate the individuals and
characterize them according to a predominance in a
given area of knowledge.
Some fields found in blocks of items can be con-
sidered for the purposes of individualization, of the
characteristic of each individual.
Amongst these items we can cite: academic
background and titles, supplementary qualifications,
professional history, research areas pursued, rese-
arch projects, extension projects, development pro-
jects, reviewing work for periodicals, areas of acti-
vity, awards and titles received, work that contains
the bibliographical production, articles published in
periodicals, books published/organized, or editions,
chapters of books published, text published in jour-
nals/magazines as news, full work published n con-
ference proceedings, expanded abstracts published in
conference proceedings, abstracts published in con-
ference proceedings, presentations of technical pro-
duction work with information on assistance and con-
sultancy, technical work, interviews, round tables,
programs and comments in the media and in the item
that covers other types of technical production, and
other work.
The blocks of information have other elements
that hold information that is semantically relevant and
that can be used. The following elements can be men-
tioned as examples: advice in work for degree course
final project, advice and supervision completed, ad-
vice for MSC theses, end-of-course work in refres-
her/specialization courses, end-of-course work in de-
gree courses, scientific introduction courses, advice
of other kinds, development of teaching or instruction
material, interviews, round tables, programs and com-
ments in the media, organization of events, conferen-
ces, exhibitions and fairs, examination boards in jud-
ges committees in public tests, and other participati-
ons.
ICEIS 2018 - 20th International Conference on Enterprise Information Systems
204