neural-network based survival prediction method, on
pathology imaging data and beyond. Using TCGA
HCC pathology images as the example, we
demonstrate that Cox-nnet is more robust and
accurate at predicting testing dataset, relative to Cox-
PH, the standard method for survival prediction
(which was also the second-best method in the
original RNA-Seq transcriptomic study (T. Ching, et
al., 2018)). Moreover, we propose a new two-stage
complex Cox-nnet model to integrate imaging and
RNA-Seq transcriptomic data, and show case its
outstanding predictive accuracy on testing dataset (C-
index almost as high as 0.90). The two-stage Cox-
nnet model combines the transformed, hidden node
features from the first-stage of Cox-nnet models for
imaging or RNA-Seq based data respectively and use
these combined features as the inputs to train a
second-stage Cox-nnet model.
Rather than using convolutional neural network
(CNN) models that are more complex, we utilized a
less complex but perhaps more biologically relevant
approach, where we extract imaging features using
the tool CellProfiler. These features are then fed in a
relatively simple, two-layer neural network model,
and still achieve credible predictive performance.
Such success argues that in biological domain, it is
possible to use relatively simple neural network
models with have prior biological relevance (such as
in the input features). In summary, our work here not
only extends the previous Cox-nnet model to process
pathological imaging data, but also creatively
addresses the multi-modal data integration challenges
for patient survival prediction.
ACKNOWLEDGEMENTS
LXG would like to thank the support by grants
K01ES025434 awarded by NIEHS through funds
provided by the trans-NIH Big Data to Knowledge
(BD2K) initiative (www.bd2k.nih.gov), R01
LM012373 and LM awarded by NLM, R01
HD084633 awarded by NICHD to L.X. Garmire.
REFERENCES
T. Ching, X. Zhu, and L. X. Garmire, “Cox-nnet: An
artificial neural network method for prognosis
prediction of high-throughput omics data,” PLoS
Comput. Biol., vol. 14, no. 4, p. e1006076, Apr. 2018.
H. Ishwaran and M. Lu, “Random Survival Forests,” Wiley
StatsRef: Statistics Reference Online. pp. 1–13, 2019.
R. D. Bin and R. De Bin, “Boosting in Cox regression: a
comparison between the likelihood-based and the
model-based approaches with focus on the R-packages
CoxBoost and mboost,” Computational Statistics, vol.
31, no. 2. pp. 513–531, 2016.
C. McQuin, A. Goodman, V. Chernyshev, L. Kamentsky,
B. A. Cimini, K. W. Karhohs, M. Doan, L. Ding, S. M.
Rafelski, D. Thirstrup, W. Wiegraebe, S. Singh, T.
Becker, J. C. Caicedo, and A. E. Carpenter,
“CellProfiler 3.0: Next-generation image processing for
biology,” PLoS Biol., vol. 16, no. 7, p. e2005970, Jul.
2018.
K. Chaudhary, O. B. Poirion, L. Lu, and L. X. Garmire,
“Deep Learning-Based Multi-Omics Integration
Robustly Predicts Survival in Liver Cancer,” Clin.
Cancer Res., vol. 24, no. 6, pp. 1248–1259, Mar. 2018.
K. Chaudhary, O. B. Poirion, L. Lu, S. Huang, T. Ching,
and L. X. Garmire, “Multi-modal meta-analysis of 1494
hepatocellular carcinoma samples reveals vast impacts
of consensus driver genes on phenotypes.” .
C. Marinaccio and D. Ribatti, “A simple method of image
analysis to estimate CAM vascularization by APERIO
ImageScope software,” Int. J. Dev. Biol., vol. 59, no. 4–
6, pp. 217–219, 2015.
K.-H. Yu, C. Zhang, G. J. Berry, R. B. Altman, C. Ré, D.
L. Rubin, and M. Snyder, “Predicting non-small cell
lung cancer prognosis by fully automated microscopic
pathology image features,” Nat. Commun., vol. 7, p.
12474, Aug. 2016.
S. Huang, C. Yee, T. Ching, H. Yu, and L. X. Garmire, “A
novel model to combine clinical and pathway-based
transcriptomic information for the prognosis prediction
of breast cancer,” PLoS Comput. Biol., vol. 10, no. 9, p.
e1003851, Sep. 2014.
T. Schulz-Streeck, J. O. Ogutu, and H.-P. Piepho,
“Comparisons of single-stage and two-stage
approaches to genomic selection,” Theor. Appl. Genet.,
vol. 126, no. 1, pp. 69–82, Jan. 2013.
R. Wei, I. De Vivo, S. Huang, X. Zhu, H. Risch, J. H.
Moore, H. Yu, and L. X. Garmire, “Meta-dimensional
data integration identifies critical pathways for
susceptibility, tumorigenesis and progression of
endometrial cancer,” Oncotarget, vol. 7, no. 34, pp.
55249–55263, Aug. 2016.
F. R. Pinu, D. J. Beale, A. M. Paten, et al., “Systems
Biology and Multi-Omics Integration: Viewpoints from
the Metabolomics Research Community,”.
Metabolites. 2019;9(4):76. Published 2019 Apr 18.
Y. Bengio, N. Boulanger-Lewandowski, R. Pascanu,
editors. “Advances in optimizing recurrent networks,”
Acoustics, Speech and Signal Processing (ICASSP),
2013 IEEE International Conference on; 2013: IEEE.
L. Kamentsky, et al. Improved structure, function and
compatibility for CellProfiler: modular high-
throughput image analysis software. Bioinformatics 27,
1179–1180 (2011).