Construct Semantic Type of “Gene-mutation-disease” Relation by Computer-aided Curation from Biomedical Literature
Dongsheng Zhao, Fan Tong, Zheheng Luo
2019
Abstract
Background: Current semantic type of “gene-mutation-disease” relation lacks fine-grained classification and corresponding relation signal words, which limits its usage in relation extraction from biomedical literature using text mining approach. Methods: We propose a computer-aided curation pipeline in which open relation extraction, signal word clustering, relation type mapping are used to analyze biomedical abstracts for semantic type of “gene-mutation-disease” construction. Coverage metrics are used to evaluate the defined relation type while ClinVar is chosen as a target to test our semantic type’s usability and performance on guiding relation extraction from biomedical literature. Results: We have constructed a 5-layer and 16-category semantic type of “gene-mutation-disease” relation with a vocabulary list containing 58 commonly used relation signal words. The vocabulary list has coverage of 95.08% and the semantic type has coverage of 94.12%. From 25 abstracts linked to 30 ClinVar records, 15 relations are correctly mapped and 8 novel relations are discovered additionally. Conclusion: The results show that our semantic type can cover the main relations between “gene”, “mutation” and “disease” and can achieve good performance on guiding relation extraction from biomedical text even using relatively out-of-date dictionary-based text mining methods.
DownloadPaper Citation
in Harvard Style
Zhao D., Tong F. and Luo Z. (2019). Construct Semantic Type of “Gene-mutation-disease” Relation by Computer-aided Curation from Biomedical Literature. In Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-353-7, SciTePress, pages 123-130. DOI: 10.5220/0007688101230130
in Bibtex Style
@conference{bioinformatics19,
author={Dongsheng Zhao and Fan Tong and Zheheng Luo},
title={Construct Semantic Type of “Gene-mutation-disease” Relation by Computer-aided Curation from Biomedical Literature},
booktitle={Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS},
year={2019},
pages={123-130},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007688101230130},
isbn={978-989-758-353-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS
TI - Construct Semantic Type of “Gene-mutation-disease” Relation by Computer-aided Curation from Biomedical Literature
SN - 978-989-758-353-7
AU - Zhao D.
AU - Tong F.
AU - Luo Z.
PY - 2019
SP - 123
EP - 130
DO - 10.5220/0007688101230130
PB - SciTePress