Authors:
Alba Garin-Muga
1
;
2
;
Aurora María Sucre
1
;
2
;
Jordi Torres
2
and
Jon Kerexeta
2
Affiliations:
1
Biodonostia, Bioengineering Area, eHealth Group, Donostia-San Sebastián 20014, Spain
;
2
Vicomtech, eHealth and Biomedical Applications Area, Donostia-San Sebastian 20014, Spain
Keyword(s):
TCGA, Stratification, ML, Visualization, Clinical Data, Genomics
Abstract:
The Cancer Genome Atlas (TCGA) is a collection of freely available data of several human cancer types. TCGA contains over 2.5 petabytes of data, which includes, among others, clinical and genomic data. However, the visualization of such data is cumbersome and tiring for non-expert users. VisualMLTCGA is an intuitive and easy-to-use web tool that allows the automatic download and visualization of TCGA data and the processing of genomic data using GATK. Additionally, the tool allows to create comprehensive decision trees (DT) for prediction of outcomes from clinical and genomic TCGA data and other external datasets. VisualMLTCGA offers a simple web tool to download, process and visualize TCGA data, suitable for researchers and clinicians without any bioinformatics background.