ConvertView
A Tool for Conversion and Visualization of Open Heterogenic Governmental Data
to the RDF Standard
Clóvis H. Nascimento¹, Ricardo A. Afonso², Vinicius C. Garcia¹ and Carlo M. Revoredo da Silva¹
¹ Informatics Center – Cin, Federal University of Pernambuco (UFPE), Caixa Postal 7851, 50.740-560, Recife,
Pernambuco, PE, Brazil
² Federal University of Alagoas (UFAL), Campus Arapiraca, ARACOMP, Arapiraca, Alagoas, AL, Brazil
Keywords: Open Data, Structured Pattern, Data Extraction, Cloud Computing.
Abstract: The diversity and relevance of the information made available by many governments, following the principles
defined by the Open Data manifesto, create an opportunity for the population to contribute to many fields of
governmental administration, such as security, transportation, health care and education. However, most of
this information was generated as unstructured digital files, which makes it hard to use. This paper presents an
approach for structuring these data, improving the processing and extraction of information. To this end, we created
a tool based on a cloud computing environment, which offers high processing and storage capacity, to create files
in a homogeneous and structured format, allowing for both broad and specific queries and thereby improving access
to the information.
1 INTRODUCTION
When the Open Data manifesto (Accar 2009) was
adopted by the governments of many countries, such
as the United States and Brazil, to make their legacy
and current data available to the world through the
internet, it created positive expectations in the
population. The manifesto aimed at total and
permanent transparency of the data produced in
several areas, such as security, transportation, health
and education. In addition, the use of these data
enables a better political and economic contribution
from the population, for example by identifying the
areas that require a greater investment of resources
by the government.
In Brazil’s case, these expectations were not fully
met, not because of the data themselves, but because
of the difficulty in using them. According to Breitman
et al. (2011), the biggest challenge today comes from
the diversity of data sources and the provenance of
these sources. Furthermore, most of the data provided
by the Brazilian government are not in the Linked
Data standard (Bizer, C. 2009), the format
recommended by the W3C consortium, which allows
different data sources to be interconnected. The
available open data are unstructured (documents,
spreadsheets etc.). Despite several government
initiatives supported by the W3C, the results are not
sufficient to define a consistent approach that
would facilitate the creation of an infrastructure of
Governmental Data in the Linked Data standard.
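To make the gap between a spreadsheet and Linked Data concrete, the following minimal sketch turns the rows of a CSV export into RDF triples in N-Triples syntax, using only the Python standard library. It is an illustration under stated assumptions, not the ConvertView implementation: the `http://example.org/opendata/` namespace, the function names and the sample rows are all hypothetical.

```python
import csv
import io
from urllib.parse import quote

# Hypothetical base namespace, chosen only for illustration.
BASE = "http://example.org/opendata/"


def escape_literal(value: str) -> str:
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (value.replace("\\", "\\\\")
                 .replace('"', '\\"')
                 .replace("\n", "\\n")
                 .replace("\r", "\\r"))


def rows_to_ntriples(csv_text: str, id_column: str) -> list[str]:
    """Emit one triple per non-empty cell; the id column names the subject."""
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{BASE}resource/{quote(row[id_column], safe='')}>"
        for column, value in row.items():
            if column == id_column or not value:
                continue  # skip the identifier itself and empty cells
            predicate = f"<{BASE}property/{quote(column, safe='')}>"
            triples.append(f'{subject} {predicate} "{escape_literal(value)}" .')
    return triples


# Made-up sample standing in for a government spreadsheet export.
SAMPLE = "school_id,name,city\n101,Escola Central,Recife\n102,Escola Norte,\n"

if __name__ == "__main__":
    for line in rows_to_ntriples(SAMPLE, "school_id"):
        print(line)
```

Each row becomes a resource with its own URI and each column becomes a property, so records from different files can later be linked by sharing URIs. A real conversion would also map columns to established vocabularies rather than to ad hoc property names.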
In addition to the listed problems, the exponential
growth in available data on the web and in other open
data initiatives creates new challenges, such as
processing large volumes of information and
minimizing the high costs of hardware and software.
To address these problems, new technological
approaches have emerged, such as Big Data (Wu et al.
2013), designed to process large volumes of
distributed, heterogeneous data, and cloud computing
(Qi Zhang et al. 2010), which was used in this work.
Cloud computing provides hardware and software as
a service at a low cost, combined with benefits such
as high processing power, storage capacity and
scalability. According to Simmhan et al. (2010),
scalability and the ease of integrating data from
multiple heterogeneous networks justify the
adoption of cloud computing.
In this context, this paper aims to contribute to
research on converting heterogeneous data to the RDF
(Resource Description Framework) data standard. In
an attempt to fill this gap, we propose ConvertView,
a tool that operates in a cloud computing
environment and contains options for the user to
H. Nascimento C., A. Afonso R., C. Garcia V. and Revoredo da Silva C..
ConvertView - A Tool for Conversion and Visualization of Open Heterogenic Governmental Data to the RDF Standard.
DOI: 10.5220/0005456902530258
In Proceedings of the 11th International Conference on Web Information Systems and Technologies (WEBIST-2015), pages 253-258
ISBN: 978-989-758-106-9
Copyright © 2015 SCITEPRESS (Science and Technology Publications, Lda.)