Authors: R. Raminhos 1 and J. Moura-Pires 2

Affiliations: 1 UNINOVA – Desenvolvimento de Novas Tecnologias, Portugal ; 2 CENTRIA/FCT, Portugal

Keyword(s): ETD, ETL, IL, Declarative Language, Semi-Structured Text Files.

Related Ontology Subjects/Areas/Topics: Coupling and Integrating Heterogeneous Data Sources ; Databases and Information Systems Integration ; Enterprise Information Systems

Abstract: The World Wide Web is a major source of textual information in a human-readable, semi-structured format, covering multiple domains, some of them highly complex. Traditional ETL approaches, which rely on developing specific source code for each data source and on repeated interaction between domain and computer-science experts, are an inadequate solution: time consuming and prone to error. This paper presents a novel approach to ETL, based on its decomposition into two phases: ETD (Extraction, Transformation and Data Delivery) and IL (Integration and Loading). The ETD proposal is supported by a declarative language for expressing ETD statements and a graphical application for interacting with the domain expert. Applying ETD requires mainly domain expertise, while computer-science expertise is concentrated in the IL phase, which links the processed data to target system models, enabling a clearer separation of concerns. This paper presents how ETD has been integrated, tested and validated in a space domain project, currently operational at the European Space Agency for the Galileo Mission.

CC BY-NC-ND 4.0

Paper citation in several formats:
Raminhos, R. and Moura-Pires, J. (2007). EXTRACTION AND TRANSFORMATION OF DATA FROM SEMI-STRUCTURED TEXT FILES USING A DECLARATIVE APPROACH. In Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 3: ICEIS; ISBN 978-972-8865-88-7; ISSN 2184-4992, SciTePress, pages 199-205. DOI: 10.5220/0002364201990205

@conference{iceis07,
author={R. Raminhos and J. Moura{-}Pires},
title={EXTRACTION AND TRANSFORMATION OF DATA FROM SEMI-STRUCTURED TEXT FILES USING A DECLARATIVE APPROACH},
booktitle={Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 3: ICEIS},
year={2007},
pages={199-205},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002364201990205},
isbn={978-972-8865-88-7},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 3: ICEIS
TI - EXTRACTION AND TRANSFORMATION OF DATA FROM SEMI-STRUCTURED TEXT FILES USING A DECLARATIVE APPROACH
SN - 978-972-8865-88-7
IS - 2184-4992
AU - Raminhos, R.
AU - Moura-Pires, J.
PY - 2007
SP - 199
EP - 205
DO - 10.5220/0002364201990205
PB - SciTePress