Authors:
Walter Travassos Sarinho
1
;
Bernadette Farias Lóscio
1
and
Damires Souza
2
Affiliations:
1
Federal University of Pernambuco, Brazil
;
2
Federal Institute of Education and Science and Technology of Paraiba, Brazil
Keyword(s):
Linked Dataset, Quality Information, Completeness Assessment.
Related
Ontology
Subjects/Areas/Topics:
Cloud Computing
;
Databases and Information Systems Integration
;
Enterprise Information Systems
;
Query Languages and Query Processing
;
Semantic Web Technologies
;
Services Science
;
Software Agents and Internet Computing
Abstract:
The huge volume of datasets available on the Web has motivated the development of a new class of Web applications, which allow users to perform complex queries on top of a set of predefined linked datasets. However, given the large number of available datasets and the lack of information about their quality, the selection of datasets for a particular application may become a very complex and time consuming task. In this work, we argue that one possible way of helping the selection of datasets for a given application consists of evaluating the completeness of the dataset with respect to the data considered as important by the application users. With this in mind, we propose an approach to assess the completeness of a linked dataset, which considers a set of specific data requirements and allows saving large amounts of query processing. To provide a more detailed evaluation, we propose three distinct types of completeness: schema, literal and instance completeness. We present the defin
itions underlying our approach and some results obtained with the accomplished evaluation.
(More)