Authors:
Houda Chaabouni-Chouayakh
1
;
2
;
Manel Farhat
1
;
2
and
Achraf Ben-Hamadou
1
;
2
Affiliations:
1
Centre de Recherche en Numérique de Sfax, 3021, Sfax, Tunisia
;
2
Laboratory of Signals, Systems, Artificial Intelligence and Networks (SM@RT), Sfax, Tunisia
Keyword(s):
Endoscopic Images,Deep Learning, Image Feature Matching, Adaptive Triplet Loss.
Abstract:
Image feature matching is a key step in creating endoscopic mosaics of the bladder inner walls, which help urologists in lesion detection and patient follow-up. Endoscopic images, on the other hand, are particularly difficult to match because they are weekly textured and have limited surface area per frame. Deep learning techniques have recently gained popularity in a variety of computer vision tasks. The ability of convolutional neural networks (CNNs) to learn rich and optimal features contributes to the success of these methods. In this paper, we present a novel deep learning based approach for endoscopic image matching. Instead of standard handcrafted image descriptors, we designed a CNN to extract feature vector from local interest points. We propose an efficient approach to train our CNN without manually annotated data. We proposed an adaptive triplet loss which has the advantage of improving the inter-class separability as well as the inter class compactness. The training datas
et is automatically constructed, each sample is a triplet of patches: an anchor, one positive sample (a perspective transformation of the anchor) and one negative sample. The obtained experimental results show at the end of the training step a more discriminative space representation where the anchor becomes closer to the positive sample and farther from the negative one in the embedding space. Comparison with the well-known standard hand-crafted descriptor SIFT in terms of recall and precision showed the effectiveness of the proposed approach, reaching the top recall value for a precision value of 0.97.
(More)