Video Object Segmentation Based on Guided Feature Transfer Learning

Mustansar Fiaz, Arif Mahmood, Sehar Shahzad Farooq, Kamran Ali, Muhammad Shaheryar, Soon Ki Jung

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Video Object Segmentation (VOS) is a fundamental task with many real-world computer vision applications and challenging due to available distractors and background clutter. Many existing online learning approaches have limited practical significance because of high computational cost required to fine-tune network parameters. Moreover, matching based and propagation approaches are computationally efficient but may suffer from degraded performance in cluttered backgrounds and object drifts. In order to handle these issues, we propose an offline end-to-end model to learn guided feature transfer for VOS. We introduce guided feature modulation based on target mask to capture the video context information and a generative appearance model is used to provide cues for both the target and the background. Proposed guided feature modulation system learns the target semantic information based on modulation activations. Generative appearance model learns the probability of a pixel to be target or background. In addition, low-resolution features from deeper networks may not capture the global contextual information and may reduce the performance during feature refinement. Therefore, we also propose a guided pooled decoder to learn the global as well as local context information for better feature refinement. Evaluation over two VOS benchmark datasets including DAVIS2016 and DAVIS2017 have shown excellent performance of the proposed framework compared to more than 20 existing state-of-the-art methods.

Original languageEnglish
Title of host publicationFrontiers of Computer Vision - 28th International Workshop, IW-FCV 2022, Revised Selected Papers
EditorsKazuhiko Sumi, In Seop Na, Naoshi Kaneko
PublisherSpringer Science and Business Media Deutschland GmbH
Pages197-210
Number of pages14
ISBN (Print)9783031063800
DOIs
StatePublished - 2022
Event28th International Workshop on Frontiers of Computer Vision, IW-FCV 2022 - Virtual, Online
Duration: 21 Feb 202222 Feb 2022

Publication series

NameCommunications in Computer and Information Science
Volume1578 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference28th International Workshop on Frontiers of Computer Vision, IW-FCV 2022
CityVirtual, Online
Period21/02/2222/02/22

Keywords

  • Generative appearance model
  • Guided Feature Modulation
  • Guided Pooled Decoder
  • Video Object Segmentation

Fingerprint

Dive into the research topics of 'Video Object Segmentation Based on Guided Feature Transfer Learning'. Together they form a unique fingerprint.

Cite this