Patch-wise Weakly Supervised Learning for Object Localization in Video

Dong Huh, Taekyung Kim, Jaeil Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Object localization in video is to predict the location and image boundaries of objects of interest in sequential scenes. Despite numerous methods being developed for the task, there are still challenging issues, such as labor-intensive data preparation. In this paper, we propose a patch-wise approach with weak supervision to resolve those issues in the object localization. We first train an patch-wise object classifier based on convolutional neural network with simple labeling about object classes, instead of the bounding box annotation. Then, the object regions are estimated using the class activation maps of the classifier for each patch. The patch-wise classifier can learn more relevant features of objects from the patches containing various parts of them. In addition, background patches for weakly-supervised learning can be easily prepared. Experiments using the visual object tracking challenge data set showed that the patch-wise weakly supervised approach is effective in the object localization in video.

Original languageEnglish
Title of host publication1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages263-266
Number of pages4
ISBN (Electronic)9781538678220
DOIs
StatePublished - 18 Mar 2019
Event1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019 - Okinawa, Japan
Duration: 11 Feb 201913 Feb 2019

Publication series

Name1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019

Conference

Conference1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019
Country/TerritoryJapan
CityOkinawa
Period11/02/1913/02/19

Fingerprint

Dive into the research topics of 'Patch-wise Weakly Supervised Learning for Object Localization in Video'. Together they form a unique fingerprint.

Cite this