loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Saptakatha Adak and Sukhendu Das

Affiliation: Visualization and Perception Lab, Indian Institute of Technology Madras, Chennai - 600036 and India

Keyword(s): Video Object Segmentation, Generative Adversarial Network (GAN), Deep Learning.

Related Ontology Subjects/Areas/Topics: Computer Vision, Visualization and Computer Graphics ; Image and Video Analysis ; Motion, Tracking and Stereo Vision ; Segmentation and Grouping ; Tracking and Visual Navigation

Abstract: This paper studies the problem of Video Object Segmentation which aims at segmenting objects of interest throughout entire videos, when provided with initial ground truth annotation. Although, variety of works in this field have been done utilizing Convolutional Neural Networks (CNNs), adversarial training techniques have not been used in spite of their effectiveness as a holistic approach. Our proposed architecture consists of a Generative Adversarial framework for the purpose of foreground object segmentation in videos coupled with Intersection-over-union and temporal information based loss functions for training the network. The main contribution of the paper lies in formulation of the two novel loss functions: (i) Inter-frame Temporal Symmetric Difference Loss (ITSDL) and (ii) Intra-frame Temporal Loss (IFTL), which not only enhance the segmentation quality of the predicted mask but also maintain the temporal consistency between the subsequent generated frames. Our end-to-end tra inable network exhibits impressive performance gain compared to the state-of-the-art model when evaluated on three popular real-world Video Object Segmentation datasets viz. DAVIS 2016, SegTrack-v2 and YouTube-Objects dataset. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 52.90.121.17

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Adak, S. and Das, S. (2019). TempSeg-GAN: Segmenting Objects in Videos Adversarially using Temporal Information. In Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 4: VISAPP; ISBN 978-989-758-354-4; ISSN 2184-4321, SciTePress, pages 221-232. DOI: 10.5220/0007254302210232

@conference{visapp19,
author={Saptakatha Adak. and Sukhendu Das.},
title={TempSeg-GAN: Segmenting Objects in Videos Adversarially using Temporal Information},
booktitle={Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 4: VISAPP},
year={2019},
pages={221-232},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007254302210232},
isbn={978-989-758-354-4},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 4: VISAPP
TI - TempSeg-GAN: Segmenting Objects in Videos Adversarially using Temporal Information
SN - 978-989-758-354-4
IS - 2184-4321
AU - Adak, S.
AU - Das, S.
PY - 2019
SP - 221
EP - 232
DO - 10.5220/0007254302210232
PB - SciTePress