Multiple Instance Learning for Detection of Polyps in Computed
Tomographic Colonography Images
Yunshen Xie
1
, Jianqiang Li
1 a
and Yan Pei
2 b
1
Faculty of Information, Beijing University of Technology, Beijing, 100124, China
2
Computer Science Division, University of Aizu, Aizu-wakamatsu, 965-8580, Japan
Keywords:
Machine Learning, Computed Tomographic Colonography, Computer Aid Diagnosis, Polyps, Bioinformatics.
Abstract:
Colorectal cancer(CRC) is a significant health problem in the world, the incidence of CRC can be largely
preventable by early detection and removal of the polyps before they turn into the malignant structure. Most
existing CAD system for polyps detection rely on fully supervised learning which requires the tedious manual
annotation and precise colon segmentation. This paper proposed a method based on multiple instance learning
and transfer learning. Our scheme firstly extracts many small patches from CTC images by using threshold
segmentation method, then a pre-trained model was applied for feature extracting of instances, next pooling
operator was used to aggregating these instance features into a bag, finally, classification result was obtained
by a classifier. Our proposed method does not rely on accurate colon segmentation and the result show that it
can achieve a high accuracy rate.
1 INTRODUCTION
According to the recent statistics from the American
Cancer Society, both incidence and mortality of col-
orectal cancer(CRC) rank the third among all kinds
of cancers in 2019 (DeSantis et al., 2019). The
majority of CRCs are thought to arise from polyps,
and the process can take 5-15 years for malignant
transformation into cancer. Thus, the incidence of
CRC can be largely preventable by early detection
and removal of the polyps before they turn into the
malignant structure. Nowadays, computed tomogra-
phy colonoscopy(CTC) provides a non-invasive tech-
nique for colorectal cancer screening. However, it
is a time-consuming task to review the result of the
colonoscopy, furthermore, different radiologists often
have different opinions, even for the same patient. To
overcome the limitations, various computer-aided di-
agnosis (CAD) systems were developed for the detec-
tion of polyps in CTC images.
Generally speaking, the CAD systems consist of
three main components: colon segmentation, feature
extraction and classification. Polyp candidates on
the colon surface are identified in colon segmenta-
tion step. Li et al. performed colon segmentation
a
https://orcid.org/0000-0003-1995-9249
b
https://orcid.org/0000-0003-1545-9204
using a two-dimensional region growing algorithm
on each CT slice image(Li et al., 2009). Chowdury
and Whelan developed a method for colon segmenta-
tion using geometric features(Chowdhury and Whe-
lan, 2011). Masutani et al. proposed a method to
realize colon segmentation through thresholding of
CT values and gradient magnitude values(Masutani
et al., 2001). Subsequently, a centerline-based seg-
mentation method was presented and improved the
preformance(Frimmel et al., 2005). Moreover, a
knowledge-based method was used for colon segmen-
tation(Manjunath et al., 2015), and Wyatt et al. ap-
plied 3-D region growing technique to achieve the
goal(Wyatt et al., 2000).
For feature extraction, the distinguishing features
of polyps which are malignant are curvature, size,
haustral folds, shape, colour and texture(Mittal et al.,
2016). Hu et al. used Haralick’s texture features
for 3D space. They applied the Karhunen-Loeve(KL)
transformation on these features to obtain new fea-
tures and classified by the random forest algorithm.
The volumetric curvedness and shape index is used
for polyps detection based on colon segmentation
(Zhu et al., 2009; Wang et al., 2008). Besides, Xu
and Zhao developed an algorithm based on comple-
mentary geodesic distance transformation in consid-
eration of challenges for polyps detection due to haus-
tral folds(Xu and Zhao, 2014). The morphological
236
Xie, Y., Li, J. and Pei, Y.
Multiple Instance Learning for Detection of Polyps in Computed Tomographic Colonography Images.
DOI: 10.5220/0009352002360240
In Proceedings of the 6th International Conference on Information and Communication Technologies for Ageing Well and e-Health (ICT4AWE 2020), pages 236-240
ISBN: 978-989-758-420-6
Copyright
c
2020 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved