ARHINET - A System for Generating and Processing Semantically-Enhanced Archival eContent

Ioan Salomie, Mihaela Dinsoreanu, Cristina Pop, Sorin Suciu, Tudor Vlad, Ioana Iacob

Abstract

This paper addresses the problem of generating and processing of eContent from archives and digital libraries. We present a system that adds semantic mark-up to the content of historical documents, thus enabling document and knowledge retrieval as response to semantic queries. The system functionality follows two main workflows: eContent generation and knowledge acquisition on one hand and knowledge processing and retrieval on the other hand. Within the first workflow, the relevant domain information is extracted from documents written in natural languages, followed by semantic annotation and domain ontology population. In the second workflow, ontologically guided queries trigger reasoning processes that provide relevant search results.

References

  1. Amardeilh, F., 2007. Web Sémantique et Informatique Linguistique: propositions méthodologiques et réalisation d'une plateforme logicielle. These de Doctorat, Universite Paris X-Nanterrere.
  2. Amardeilh, F., 2006. OntoPop or how to annotate documents and populate ontologies from texts. in Proceedings of the ESWC 2006 Workshop on Mastering the Gap: From Information Extraction to Semantic Representation, Budva, Montenegro, June 12, 2006. CEUR Workshop Proceedings,
  3. ISSN 1613-0073.
  4. Buitelaar, P., Cimiano, P., Racioppa S., Siegel, M., 2006. Ontology-based Information Extraction with SOBA. In Proceedings of the International Conference on Language Resources and Evaluation, pp. 2321-2324.
  5. Laclavik M., Ciglan M, Seleng M, Krajei S., 2007. Ontea: Semi-automatic Pattern based Text Annotation empowered with Information Retrieval Methods. In Tools for acquisition, organisation and presenting of information and knowledge: proceedings in Informatics and Information Technologies, Kosice: Vydavatelstvo STU, Bratislava. ISBN 978-80-227- 2716-7, part 2, pp. 119-129.
  6. Schäfer, U., 2007. Integrating Deep and Shallow Natural Language Processing Components - Representations and Hybrid Architectures. Saarbrücken Dissertations in Computational Linguistics and Language Te, DFKI GmbH and Computational Linguistics Department, Saarland University, Saarbrücken, Germany.
  7. Tablan V., Maynard D., Bontcheva K., Cunningham H., 2004. Gate - An Application Developer's Guide. Available online: http://gate.ac.uk/.
  8. Sandia National Laboratories, 2008. Jess the Rule Engine for the Java Platform, Version 7.1. Available online: http://www.jessrules.com/jess/docs/Jess71.pdf.
  9. Horrocks I., et al., 2004. SWRL: A Semantic Web Rule Language Combining OWL and RuleML. Available online: http://www.w3.org/Submission/SWRL/.
  10. SQWRL: Semantic Query-Enhanced Web Rule Language. Available online: http://protege.cim3.net/cgibin/wiki.pl?SQWRL, Date accessed 01/06/2008.
  11. Horridge, M., et al., 2007. A Practical Guide to Building OWL Ontologies Using Protégé 4 and CO-ODE Tools. Available online: http://www.coode.org/resources/tutorials/ProtegeOWLTutorialp4.0.pdf.
  12. CCNA, Cluj County National Archives, 2008, Online: http://www.clujnapoca.ro/arhivelenationale/.
Download


Paper Citation


in Harvard Style

Salomie I., Dinsoreanu M., Pop C., Suciu S., Vlad T. and Iacob I. (2009). ARHINET - A System for Generating and Processing Semantically-Enhanced Archival eContent . In Proceedings of the Fifth International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8111-81-4, pages 151-158. DOI: 10.5220/0001833401510158


in Bibtex Style

@conference{webist09,
author={Ioan Salomie and Mihaela Dinsoreanu and Cristina Pop and Sorin Suciu and Tudor Vlad and Ioana Iacob},
title={ARHINET - A System for Generating and Processing Semantically-Enhanced Archival eContent},
booktitle={Proceedings of the Fifth International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2009},
pages={151-158},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001833401510158},
isbn={978-989-8111-81-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fifth International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - ARHINET - A System for Generating and Processing Semantically-Enhanced Archival eContent
SN - 978-989-8111-81-4
AU - Salomie I.
AU - Dinsoreanu M.
AU - Pop C.
AU - Suciu S.
AU - Vlad T.
AU - Iacob I.
PY - 2009
SP - 151
EP - 158
DO - 10.5220/0001833401510158