Sharing Bioinformatic Data for Machine Learning: Maximizing Interoperability through License Selection
Alexander Bernier, Adrian Thorogood
2020
Abstract
Efficient machine learning in bioinformatics requires a large volume of data from different sources. Bioinformatics is shifting from a paradigm of siloed analysis of individual datasets by researchers to the aggregation and analysis of disparate sets of health and biomedical data across from academic, healthcare and commercial settings. Data generating organizations must give thought to selecting legal terms for dataset release that will promote compatibility with other datasets. In releasing bioinformatic data for open use, care must be taken to ensure that the terms of the licenses selected ensure maximum interoperability. The following technical elements should inform the choice of license: License hybridity; waivers of liability, warranties and guarantees; commercial/non-commercial use; attribution and copyleft; granular permission and bilateral or multilateral licensing. Licenses are compared to inform optimal license selection and enable data integration and analysis; consideration is given to an eventual standard license for open sharing of bioinformatic data.
DownloadPaper Citation
in Harvard Style
Bernier A. and Thorogood A. (2020). Sharing Bioinformatic Data for Machine Learning: Maximizing Interoperability through License Selection. In Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-398-8, SciTePress, pages 226-232. DOI: 10.5220/0009179502260232
in Bibtex Style
@conference{bioinformatics20,
author={Alexander Bernier and Adrian Thorogood},
title={Sharing Bioinformatic Data for Machine Learning: Maximizing Interoperability through License Selection},
booktitle={Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS},
year={2020},
pages={226-232},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009179502260232},
isbn={978-989-758-398-8},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS
TI - Sharing Bioinformatic Data for Machine Learning: Maximizing Interoperability through License Selection
SN - 978-989-758-398-8
AU - Bernier A.
AU - Thorogood A.
PY - 2020
SP - 226
EP - 232
DO - 10.5220/0009179502260232
PB - SciTePress