consuming. In contrast, the proposed MIL technique
requires only manual annotation of the GB images,
the number of which is significantly lower. MIL can
thus leverage surgical image classification to improve
GB vascularity assessment, without the need of
labeling the patches extracted from every surgical
image.
A potential extension of the proposed work is to
apply the MIL concept directly at the patient-level. In
particular, in this study the video-level classification
was based simply on a majority voting of the image
labels from each video, mainly due to the small
number of operations available (only 53). Given a
larger video dataset, one could consider the patient as
the bag, along with its GB vascularity label, and the
patches extracted from the GB images of the video as
the instances. This way, the patient-level GB
classification could be improved, without the need of
labelling the images extracted from every video of the
operation.
As future work, we aim to expand the GBVasc181
dataset by performing more annotations upon the
Cholec80 video collection. Moreover, we aim to
combine the MIL concept with CNNs at the image-
level to improve further the classification
performance. In particular, we currently design a 3D
CNN architecture that takes as input a sequence of
patches extracted from a GB image and outputs the
vascularity label of the image. The generation of
spatial attention maps that allow visualization of GB
wall regions with a variable vascularity is also major
topic of interest for future research work.
ACKNOWLEDGEMENTS
The author thanks Special Account for Research
Grants and National and Kapodistrian University of
Athens for funding to attend the meeting
REFERENCES
Al Abbas, A.I. et al., 2019. Methodology for developing an
educational and research video library in minimally
invasive surgery, Journal of Surgical Education, 76(3),
pp. 745–755.
Amores, J., 2013. Multiple instance classification: Review,
taxonomy and comparative study, Artificial
Intelligence, 201, pp. 81–105.
van Amsterdam, B., Clarkson, M.J. and Stoyanov, D., 2021.
Gesture recognition in robotic surgery: A review, IEEE
Transactions on Biomedical Engineering, 68(6), pp.
2021–2035.
Andrews, S., Tsochantaridis, I. and Hofmann, T., 2003.
Support vector machines for multiple-instance learning,
In Advances in Neural Information Processing Systems,
pp. 1–8.
Baghdadi, A. et al., 2019. A computer vision technique for
automated assessment of surgical performance using
surgeons’ console-feed videos, International Journal of
Computer Assisted Radiology and Surgery, 14(4), pp.
697–707.
Beyersdorffer, P. et al., 2021. Detection of adverse events
leading to inadvertent injury during laparoscopic
cholecystectomy using convolutional neural networks,
Biomedizinische Technik/Biomedical Engineering,
66(4), pp. 413-421.
Bishop, C.M., 2006. Illustration: Variational Mixture of
Gaussians, in Pattern Recognition and Machine
Learning. New York: Springer-Verlag New York, Inc.,
pp. 474–481.
Cheng, K. et al., 2021. Artificial intelligence-based
automated laparoscopic cholecystectomy surgical
phase recognition and analysis’, Surgical Endoscopy,
(ahead of print).
Derathé, A. et al., 2020. Predicting the quality of surgical
exposure using spatial and procedural features from
laparoscopic videos, International Journal of Computer
Assisted Radiology and Surgery, 15(1), pp. 59–67.
Dietterich, T.G., Lathrop, R.H. and Lozano-Pérez, T., 1997.
Solving the multiple instance problem with axis-
parallel rectangles, Artificial Intelligence, 89(1–2), pp.
31–71.
Funke, I. et al., 2019. Video-based surgical skill assessment
using 3D convolutional neural networks, International
Journal of Computer Assisted Radiology and Surgery,
14(7), pp. 1217–1225.
Iwashita, Y. et al., 2016. What are the appropriate
indicators of surgical difficulty during laparoscopic
cholecystectomy? Results from a Japan-Korea-Taiwan
multinational survey, Journal of Hepato-Biliary-
Pancreatic Sciences, 23(9), pp. 533–547.
Loukas, C. et al., 2011. The contribution of simulation
training in enhancing key components of laparoscopic
competence, The American Surgeon, 77(6), pp. 708–
715.
Loukas, C. and Georgiou, E., 2015. Smoke detection in
endoscopic surgery videos: a first step towards retrieval
of semantic events, International Journal of Medical
Robotics and Computer Assisted Surgery, 11(1), pp.
80–94.
Loukas, C. et al., 2018. Keyframe extraction from
laparoscopic videos based on visual saliency detection,
Computer Methods and Programs in Biomedicine, 165,
pp. 13–23.
Loukas, C. and Sgouros, N.P., 2020. Multiinstance
multilabel learning for surgical image annotation, The
International Journal of Medical Robotics and
Computer Assisted Surgery, 16 (e2058), pp. 1–12.
Loukas, C., Frountzas, M. and Schizas, D., 2021. Patch-
based classification of gallbladder wall vascularity
from laparoscopic images using deep learning,