Authors:
Wojciech M. Barczyński (1); Felix Förster (1); Falk Brauer (1) and Daniel Schuster (2)
Affiliations:
(1) SAP Research CEC Dresden, Germany; (2) Technische Universität Dresden, Germany
Keyword(s):
Information extraction, Domain adaption, User interface, Process modeling.
Related Ontology Subjects/Areas/Topics:
Artificial Intelligence; Biomedical Engineering; Business Analytics; Coupling and Integrating Heterogeneous Data Sources; Data Engineering; Data Mining; Databases and Information Systems Integration; Datamining; Enterprise Information Systems; Health Information Systems; Information Systems Analysis and Specification; Knowledge Management; Ontologies and the Semantic Web; Sensor Networks; Signal Processing; Society, e-Business and e-Government; Soft Computing; Software Engineering; Web Information Systems and Technologies
Abstract:
Implementing domain-specific Information Extraction (IE) technologies to retrieve structured information from unstructured data is a challenging and complex task. It requires both IE expertise (e.g., in linguistics) and domain knowledge, provided by a domain expert who is aware of, say, the specifics of the text corpus and the entities of interest. While the IE expert role is addressed by several approaches, less has been done to involve domain experts in the process of IE development. Our approach targets this issue. We provide a base platform for collaboration between experts through IE plan modeling languages, which are used to compose basic IE operators into complex IE flows. We provide each expert with a language adapted to their respective expertise: IE experts work with a fine-grained view and domain experts with a coarse-grained view of IE execution. We use the Model Driven Architecture concept to enable transitions among the languages and the operators provided by an algebraic IE framework. To prove the applicability of our approach, we implemented an Eclipse-based tool, AdaptIE, and demonstrate it in a real-world scenario for the SAP Community Network.