Targeted Test Time Adaptation of Memory Networks for Video Object Segmentation

Isidore Dubuisson, Damien Muselet, Christophe Ducottet, Jochen Lang

2025

Abstract

Semi Automatic Video Object Segmentation (SVOS) aims to segment few objects in a video based on the annotation of these particular objects in the first frame only. State-of-the-art methods rely on offline training on a large dataset that may lack specific samples and details directly applicable to the current test video. Common solutions are to use test-time adaptation to finetune the offline model with the single annotated frame or by relying on complex semi-supervised strategies. In this paper, we introduce targeted test-time adaptation of memory-based SVOS providing the benefits of finetuning with much smaller learning effort. Our method targets specific parts of the model to ensure improved results while maintaining robustness of the offline training. We find that targeting the bottleneck features and the masks that are saved in memory provide substantial benefits. The evaluation of our method shows a significant improvement for video segmentation on DAVIS16 and DAVIS17 datasets.

Download


Paper Citation


in Harvard Style

Dubuisson I., Muselet D., Ducottet C. and Lang J. (2025). Targeted Test Time Adaptation of Memory Networks for Video Object Segmentation. In Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP; ISBN 978-989-758-728-3, SciTePress, pages 30-38. DOI: 10.5220/0013108500003912


in Bibtex Style

@conference{visapp25,
author={Isidore Dubuisson and Damien Muselet and Christophe Ducottet and Jochen Lang},
title={Targeted Test Time Adaptation of Memory Networks for Video Object Segmentation},
booktitle={Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP},
year={2025},
pages={30-38},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013108500003912},
isbn={978-989-758-728-3},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP
TI - Targeted Test Time Adaptation of Memory Networks for Video Object Segmentation
SN - 978-989-758-728-3
AU - Dubuisson I.
AU - Muselet D.
AU - Ducottet C.
AU - Lang J.
PY - 2025
SP - 30
EP - 38
DO - 10.5220/0013108500003912
PB - SciTePress