questions concerning the semantic processing of texts
in the natural language require the formulation of a
narrow range of problems to be solved and further
research on the possibility of their resolution. An
example of such a range of problems is the semantic
search (Rashid and Nisar, 2016).
Theoretically, the semantic approach to text
processing is designed to solve the main problem of
lexical search that is huge number of errors during the
incorrect resolution of the polysemy of search query
lexemes.
The possible way of eliminating such errors is the
usage of the ontology-based semantic graph to keep
the knowledge needed to improve the search quality
(Modoni et al. 2014). In their article, the authors offer
the general architecture that has several advantages
regarding the quality of results and the usability to
formulate the queries, but their main focus is on the
data mining needed to collect and fill the knowledge
base. Another way of resolving word-sense
disambiguation is based on the usage of entity linking
in queries following by choice between supervised
and unsupervised alternatives (Hasibi et al, 2016).
As a part of this work, we propose a method for
implementing enterprise search based on the semantic
data retrieved from the ontological network. Using
the semantics of the search query we can significantly
increase the pertinence of the response, and therefore
the proposed method is based on using the semantic
relations of the ontological network, the lexical
information of semantic values and the translingual
data.
2 SEMANTIC NETWORKS AND
LEXICAL INFORMATION
Semantic networks are graph structures with nodes
that store semantic values (senses that represent
concepts), and the edges between nodes indicate the
relative semantic affiliation of one concept with
respect to another. Examples of such relations can be
synonymy, hyponymy, meronymy, and their reverse
relations: antonymy, hypernymy, and holonymy
(Stern D., 2015). These elementary semantic relations
between senses can be used to construct more
complex relations, such as cohyponyms, converses,
and others.
It is important to note that the semantic network
described in this paper doesn’t conform to the LMF
(Lexical Markup Framework) (Francopoulo G.,
2013) or UBY-LMF (Eckle-Kohler J. et al, 2015)
standards, because of some limitations imposed by
the object-oriented model. Instead, we used the
semantic representation based on a labeled oriented
graph structure, where nodes correspond to senses of
several types, and edges provide links between nodes
(Klimenkov et al, 2020). In addition, each node
corresponding to a sense is connected to all possible
lexemes used to represent the sense in different
languages. The ontology is formed from several semi-
structural sources (Pismak et al, 2019) and the
translingual lexemes are collected during the process
of sense-to-sense relation reconstruction (Osika et al,
2017). Such a graph structure allows us to eliminate
the needs for word-sense disambiguation due to usage
of reverse sense-to-lexeme relations while providing
the possibility for a quick search of sense nodes by
lexemes (Pokid et al, 2017). This lexico-semantic
structure contains the following types of nodes:
A semantic node is the type of node for storing
data about semantic values. In its general form it is an
abstract node that does not store specific information
about the meaning of the sense, but only positions it
with respect to other concepts in the semantic
network;
A lexical node is the type of node for storing a
certain lexeme. Lexical nodes are always associated
with a sematic node representing a sense which can
be expressed with a given lexeme. It is important that
lexical nodes also contain information about the
language. It is used for applying the translingual
functions of the semantic search.
The types of relationships are determined by the
set of permissible combinations of node types and can
take the following values:
1. Sense-to-sense-synonymy;
2. Sense-to-sense-antonymy;
3. Sense-to-sense-hyponymy;
4. Sense-to-sense-hypernymy;
5. Sense-to-sense-holonymy;
6. Sense-to-sense-meronymy;
7. Sense-to-lexeme.
One of the advantages of such a lexical-semantic
structure is the elimination of ambiguity resolving.
While working with semantic nodes we use all word
forms that express associated senses. Another benefit
is that sense-to-sense relations can be taken into
account, which makes it possible to refine the
particular concept for a given semantic meaning. And
the last but not the least advantage is the using of
sense-to-lexeme relations to provide translingual
search due to keeping word forms in different
languages.
KEOD 2020 - 12th International Conference on Knowledge Engineering and Ontology Development