FinSeg: Finger Parts Semantic Segmentation using Multi-scale Feature Maps Aggregation of FCN

Adel Saleh, Hatem A. Rashwan, Mohamed Abdel-Nasser, Vivek K. Singh, Saddam Abdulwahab, Md. Mostafa Kamal Sarker, Miguel Angel Garcia, Domenec Puig

2019

Abstract

Image semantic segmentation is in the center of interest for computer vision researchers. Indeed, huge number of applications requires efficient segmentation performance, such as activity recognition, navigation, and human body parsing, etc. One of the important applications is gesture recognition that is the ability to understanding human hand gestures by detecting and counting finger parts in a video stream or in still images. Thus, accurate finger parts segmentation yields more accurate gesture recognition. Consequently, in this paper, we highlight two contributions as follows: First, we propose data-driven deep learning pooling policy based on multi-scale feature maps extraction at different scales (called FinSeg). A novel aggregation layer is introduced in this model, in which the features maps generated at each scale is weighted using a fully connected layer. Second, with the lack of realistic labeled finger parts datasets, we propose a labeled dataset for finger parts segmentation (FingerParts dataset). To the best of our knowledge, the proposed dataset is the first attempt to build a realistic dataset for finger parts semantic segmentation. The experimental results show that the proposed model yields an improvement of 5% compared to the standard FCN network.

Download


Paper Citation


in Harvard Style

Saleh A., Rashwan H., Abdel-Nasser M., Singh V., Abdulwahab S., Sarker M., Garcia M. and Puig D. (2019). FinSeg: Finger Parts Semantic Segmentation using Multi-scale Feature Maps Aggregation of FCN. In Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP; ISBN 978-989-758-354-4, SciTePress, pages 77-84. DOI: 10.5220/0007382100770084


in Bibtex Style

@conference{visapp19,
author={Adel Saleh and Hatem A. Rashwan and Mohamed Abdel-Nasser and Vivek K. Singh and Saddam Abdulwahab and Md. Mostafa Kamal Sarker and Miguel Angel Garcia and Domenec Puig},
title={FinSeg: Finger Parts Semantic Segmentation using Multi-scale Feature Maps Aggregation of FCN},
booktitle={Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP},
year={2019},
pages={77-84},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007382100770084},
isbn={978-989-758-354-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP
TI - FinSeg: Finger Parts Semantic Segmentation using Multi-scale Feature Maps Aggregation of FCN
SN - 978-989-758-354-4
AU - Saleh A.
AU - Rashwan H.
AU - Abdel-Nasser M.
AU - Singh V.
AU - Abdulwahab S.
AU - Sarker M.
AU - Garcia M.
AU - Puig D.
PY - 2019
SP - 77
EP - 84
DO - 10.5220/0007382100770084
PB - SciTePress