Ontology-Driven Extraction of Contextualized Information from Research Publications

Vayianos Pertsas, Panos Constantopoulos

2023

Abstract

We present transformer-based methods for extracting information about research processes from scholarly publications. We developed a two-stage pipeline comprising a transformer-based text classifier that predicts whether a sentence contains the entities sought in tandem with a transformer-based entity recogniser for finding the boundaries of the entities inside the sentences that contain them. This is applied to extracting two different types of entities: i) research activities, representing the acts performed by researchers, which are entities of complex lexico-syntactic structure, and ii) research methods, representing the procedures used in performing research activities, which are named entities of variable length. We also developed a system that assigns semantic context to the extracted entities by: i) linking them according to the relation employs(Activity,Method) using a transformer-based binary classifier for relation extraction; ii) associating them with information extracted from publication metadata; and iii) encoding the contextualized information at the output into an RDF Knowledge Graph. The entire workflow is ontology-driven, based on Scholarly Ontology, specifically designed for documenting scholarly work. Our methods are trained and evaluated on a dataset comprising 12,626 sentences, manually annotated for the task at hand, and shown to surpass simpler transformer-based methods and baselines.

Download


Paper Citation


in Harvard Style

Pertsas V. and Constantopoulos P. (2023). Ontology-Driven Extraction of Contextualized Information from Research Publications. In Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 2: KEOD; ISBN 978-989-758-671-2, SciTePress, pages 108-118. DOI: 10.5220/0012254100003598


in Bibtex Style

@conference{keod23,
author={Vayianos Pertsas and Panos Constantopoulos},
title={Ontology-Driven Extraction of Contextualized Information from Research Publications},
booktitle={Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 2: KEOD},
year={2023},
pages={108-118},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012254100003598},
isbn={978-989-758-671-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 2: KEOD
TI - Ontology-Driven Extraction of Contextualized Information from Research Publications
SN - 978-989-758-671-2
AU - Pertsas V.
AU - Constantopoulos P.
PY - 2023
SP - 108
EP - 118
DO - 10.5220/0012254100003598
PB - SciTePress