Authors:
Cleiton Fernando Lima Sena
;
Rafael Glauber
and
Daniela Barreiro Claro
Affiliation:
Federal University of Bahia (UFBA), Brazil
Keyword(s):
Open Information Extraction, Inference, Transitivity, Symmetry, Portuguese.
Related
Ontology
Subjects/Areas/Topics:
Applications of Expert Systems
;
Artificial Intelligence and Decision Support Systems
;
Enterprise Information Systems
;
Natural Language Interfaces to Intelligent Systems
Abstract:
Open Information Extraction (Open IE) enables the extraction of facts in large quantities of texts written in natural language. Despite the fact that almost research has been doing in English texts, methods and techniques for other languages have been less frequent. However, those languages other than English correspond to 48% of content available on websites around the world. In this work, we propose a method for extracting facts in Portuguese without pre-determining the types of the facts. Additionally, we increased the quantity of those extracted facts by the use of an inference approach. Our inference method is composed of two issues: a transitive and a symmetric mechanism. To the best of our knowledge, this is the first time that inference approach is used to extract facts in Portuguese texts. Our proposal allowed an increase of 36% in quantity of valid facts extracted in a Portuguese Open IE system, and it is compatible in the quality of facts with English approaches.