Authors:
Marcus Vinicius Santana Poletti; Methanias Colaço Junior and André Nascimento
Affiliation:
Federal University of Sergipe, Avenida Marechal Rondon S/N Caixa Postal 49,100-000 São Cristovão, SE, Brazil
Keyword(s):
Open Data, e-Government, Public Transparency, ETL, Data Pipeline.
Abstract:
Context: Government transparency portals are built on ETL (Extract, Transform and Load) processes, which increase the quality and interoperability of data. This makes ETL a critical subsystem of these applications and a natural subject of evaluative research aimed at improvement. Objective: To analyze publications on the use of ETL in transparency portals and characterize them by scenario, impact, empirical method and general bibliometric data. Method: A systematic mapping of the literature was performed using the PICO strategy (Population, Intervention, Comparison and Outcome). Summary of Results: Of 204 publications retrieved, 25 works were selected. Of these, 40% present support for building loads through a graphical interface as the main impact for the portals, followed by connectivity between heterogeneous databases (27%) and the ability to monitor loads (22%). Only 8% and 3% of the works discussed the impacts of actual load automation and load quality control, respectively. Conclusion: The research showed that the use of ETL in transparency portals still lacks comparative and feasibility studies. One open challenge is the lack of replication studies to consolidate and validate the works already published, as evidenced by the scarcity of controlled experiments in the area. Finally, analysis of load quality control was identified as an important gap.