Authors:
Feng Liu; Yifei Chen and Bernard Manderick
Affiliation:
Computational Modeling Lab, Vrije Universiteit Brussel, Belgium
Keyword(s):
Named entity recognition, gene/protein names identification, support vector machine, two-layer structure, boundary check.
Related Ontology Subjects/Areas/Topics:
Artificial Intelligence; Artificial Intelligence and Decision Support Systems; Biomedical Engineering; Business Analytics; Data Engineering; Data Mining; Databases and Information Systems Integration; Datamining; Enterprise Information Systems; Health Information Systems; Natural Language Interfaces to Intelligent Systems; Sensor Networks; Signal Processing; Soft Computing
Abstract:
In this paper, we propose a named entity recognition system for biomedical literature based on two-layer support vector machines. We also employ a post-processing module, the boundary check module, which eliminates some boundary errors and thereby improves system performance. The system does not use any external lexical resources, which keeps it fairly simple. With carefully designed features and the addition of a second layer, it recognizes named entities in biomedical literature with fairly high accuracy: a precision of 83.5%, a recall of 80.8% and a balanced F-score (Fβ=1) of 82.1%, which is close to the state of the art at the time of writing.
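The balanced F-score is the harmonic mean of precision and recall, so the three reported figures can be checked against each other. The following is a minimal sketch of that arithmetic, not the authors' evaluation code; the function name `f_beta` is hypothetical and chosen only for illustration.

```python
def f_beta(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (F-beta score).

    With beta = 1 this reduces to the balanced score 2PR / (P + R).
    """
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when both inputs are zero
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as reported in the abstract.
score = f_beta(0.835, 0.808)
print(f"{score * 100:.1f}")  # prints 82.1
```

The result agrees with the reported balanced score of 82.1%.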