TY - GEN
T1 - Video Object Segmentation Based on Guided Feature Transfer Learning
AU - Fiaz, Mustansar
AU - Mahmood, Arif
AU - Shahzad Farooq, Sehar
AU - Ali, Kamran
AU - Shaheryar, Muhammad
AU - Jung, Soon Ki
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Video Object Segmentation (VOS) is a fundamental task with many real-world computer vision applications and challenging due to available distractors and background clutter. Many existing online learning approaches have limited practical significance because of high computational cost required to fine-tune network parameters. Moreover, matching based and propagation approaches are computationally efficient but may suffer from degraded performance in cluttered backgrounds and object drifts. In order to handle these issues, we propose an offline end-to-end model to learn guided feature transfer for VOS. We introduce guided feature modulation based on target mask to capture the video context information and a generative appearance model is used to provide cues for both the target and the background. Proposed guided feature modulation system learns the target semantic information based on modulation activations. Generative appearance model learns the probability of a pixel to be target or background. In addition, low-resolution features from deeper networks may not capture the global contextual information and may reduce the performance during feature refinement. Therefore, we also propose a guided pooled decoder to learn the global as well as local context information for better feature refinement. Evaluation over two VOS benchmark datasets including DAVIS2016 and DAVIS2017 have shown excellent performance of the proposed framework compared to more than 20 existing state-of-the-art methods.
AB - Video Object Segmentation (VOS) is a fundamental task with many real-world computer vision applications and challenging due to available distractors and background clutter. Many existing online learning approaches have limited practical significance because of high computational cost required to fine-tune network parameters. Moreover, matching based and propagation approaches are computationally efficient but may suffer from degraded performance in cluttered backgrounds and object drifts. In order to handle these issues, we propose an offline end-to-end model to learn guided feature transfer for VOS. We introduce guided feature modulation based on target mask to capture the video context information and a generative appearance model is used to provide cues for both the target and the background. Proposed guided feature modulation system learns the target semantic information based on modulation activations. Generative appearance model learns the probability of a pixel to be target or background. In addition, low-resolution features from deeper networks may not capture the global contextual information and may reduce the performance during feature refinement. Therefore, we also propose a guided pooled decoder to learn the global as well as local context information for better feature refinement. Evaluation over two VOS benchmark datasets including DAVIS2016 and DAVIS2017 have shown excellent performance of the proposed framework compared to more than 20 existing state-of-the-art methods.
KW - Generative appearance model
KW - Guided Feature Modulation
KW - Guided Pooled Decoder
KW - Video Object Segmentation
UR - http://www.scopus.com/inward/record.url?scp=85131147376&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-06381-7_14
DO - 10.1007/978-3-031-06381-7_14
M3 - Conference contribution
AN - SCOPUS:85131147376
SN - 9783031063800
T3 - Communications in Computer and Information Science
SP - 197
EP - 210
BT - Frontiers of Computer Vision - 28th International Workshop, IW-FCV 2022, Revised Selected Papers
A2 - Sumi, Kazuhiko
A2 - Na, In Seop
A2 - Kaneko, Naoshi
PB - Springer Science and Business Media Deutschland GmbH
T2 - 28th International Workshop on Frontiers of Computer Vision, IW-FCV 2022
Y2 - 21 February 2022 through 22 February 2022
ER -