In train-set2, which is used to train the classifier,
the classes of blood and background have the fewest
number of samples. However, these are the classes
which perform best. This is probably because these
classes have the least within-class variance, e.g. most
of the tiles have a similar visual appearance.
Urothelium and damaged tissue both perform
well, even though these classes have a substantial vi-
sual variance in the form of colour and texture in the
tiles. The dataset for these classes contains the most
number of patients (25 and 8 patients, respectively),
and therefore contains the most diverse samples in the
dataset, contributing to the good results.
The precision of stroma and recall of muscle is
not performing as good as the rest. The dataset for
these classes contains few patients and are also the
two classes which needed augmentation due to small
amounts of available data. The low recall of muscle
tissue indicates that a large proportion of the muscle
tiles are misclassified as other classes, most proba-
bly urothelium, stroma and damaged tissue (due to the
high precision of blood and background, these are not
likely to include many misclassified tiles). It is im-
portant to note that the muscle class achieves a very
good precision score, and stroma has an acceptable
good recall score.
4.3 Heat Maps
The resulting model was utilised to classify entire
whole-slide images. Each tile in the WSI was classi-
fied and the percentage for each class recorded. These
were then combined to create the probability maps.
These maps were then post-processed in MATLAB by
applying a Gaussian filter kernel with a standard devi-
ation of σ = 0.6 to smooth the images. After filtering,
a thresholding operation was performed on the image
with a limit of 0.8, setting all predictions below this
threshold to zero. This ensures that only predictions
of 0.8 or higher are visible in the final heat maps.
Figure 3 shows three example WSI with their
corresponding heat maps. By visual inspection per-
formed by pathologists, this is considered to look very
promising. However, a quantitative measure for the
WSI ROI extraction is lacking since we do not have
complete WSI manually labelled into the six classes
at the current time.
5 CONCLUSION
This paper proposes a method for automatic classi-
fication of tile-segments of histopathological WSI of
urinary bladder cancer into six different classes us-
ing a CNN-based model. An encoder-decoder struc-
ture is trained on a large set of unlabelled data. After
training, the encoder part of the autoencoder acts as a
feature extractor making low dimensional latent vec-
tors. An encoder-classifier structure is then fine-tuned
on a set of labelled tiles. The finished model is able
to classify input tiles from the WSI into the classes
urothelium, stroma, damaged tissue, muscle, blood
and background. The best model achieved an average
F1-score of 93.4% over all the six classes, an overall
good result. However, future work will include an ef-
fort to improve the classifier. Other methods such as
a multiscale approach are considered.
The model is further used to classify entire WSI
to produce heat maps, which visualises each of the
classes and their location in the image. These maps
can provide useful information to the pathologist dur-
ing visual inspection. Future work consists of using
the above model as an ROI extractor of relevant tissue
in the WSI to make a dataset suitable as training data
for a diagnostic and prognostic classification model.
REFERENCES
Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean,
J., Devin, M., Ghemawat, S., Irving, G., Isard, M.,
et al. (2016). Tensorflow: a system for large-scale
machine learning. In OSDI, volume 16, pages 265–
283.
Ara
´
ujo, T., Aresta, G., Castro, E., Rouco, J., Aguiar,
P., Eloy, C., Pol
´
onia, A., and Campilho, A. (2017).
Classification of breast cancer histology images us-
ing convolutional neural networks. PloS one,
12(6):e0177544.
Baldi, P. (2012). Autoencoders, unsupervised learning, and
deep architectures. In Proceedings of ICML workshop
on unsupervised and transfer learning, pages 37–49.
Eble, J. N., Sauter, G., Epstein, J. I., and Sesterhenn, I. A.
(2004). World Health Organization Classification of
Tumours. Pathology and Genetics of Tumours of the
Urinary System and Male Genital Organs. IARC
Press: Lyon.
Kather, J. N., Weis, C.-A., Bianconi, F., Melchers, S. M.,
Schad, L. R., Gaiser, T., Marx, A., and Z
¨
ollner, F. G.
(2016). Multi-class texture analysis in colorectal can-
cer histology. Scientific reports, 6:27988.
LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learn-
ing. nature, 521(7553):436.
Li, R. and Huang, J. (2015). Fast regions-of-interest de-
tection in whole slide histopathology images. In In-
ternational Workshop on Patch-based Techniques in
Medical Imaging, pages 120–127. Springer.
Lozano, R., Naghavi, M., Foreman, K., and Lim, S. (2012).
Global and regional mortality from 235 causes of
Multiclass Tissue Classification of Whole-Slide Histological Images using Convolutional Neural Networks
325