Authors:
Rashmi Burse
;
Michela Bertolotto
and
Gavin McArdle
Affiliation:
University College Dublin, Belfield, Dublin 4, Ireland
Keyword(s):
Quality Assurance, Biomedical Ontologies, SNOMED-CT Templates, Lexical Auditing, Semantic Analysis, Biomedical Named Entity Recognition.
Abstract:
Quality Assurance (QA) of biomedical ontologies is a major challenge in the health-informatics domain. One of the preliminary ways in which we can maintain the quality of a biomedical ontology is by ensuring consistency in the modelling styles of biomedical concepts. Maintaining consistency in the lexical, structural and ontological modelling of biomedical concepts reduces a concept’s susceptibility to errors. SNOMED-CT, which is one of the most widely adopted biomedical ontologies, strives to achieve this consistency by creating templates for logical definitions based on the description of biomedical concept names. The work presented here in based on the observation that the majority of the SNOMED-CT templates contain stopwords (non-medical terms) in their description that indicate a relationship between two medical concepts. We hypothesize that the process of creating SNOMED-CT templates can be automated to a large extent by targeting stopwords. In this work, we present a method th
at exploits stopwords in concept names to create templates for the structural and logical modelling of lexically and semantically similar biomedical concepts. The results have shown promising potential by extracting a multitude of SNOMED-CT templates, exhibiting more than 200 templates for the stopword of. Given the high demand for QA of biomedical ontologies, these results are highly beneficial in automating the existing mechanisms employed in maintaining consistency in the modeling of SNOMED-CT concepts. The presented method can be used as a complementary process to mitigate the manual efforts of SNOMED-CT curators. Furthermore, auditing potentially incomplete definitions of SNOMED-CT concepts using the extracted templates has identified 49-87% inconsistent concepts for the stopwords of and in in the biomedical ontology.
(More)