TY - GEN
T1 - Patch-wise Weakly Supervised Learning for Object Localization in Video
AU - Huh, Dong
AU - Kim, Taekyung
AU - Kim, Jaeil
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/3/18
Y1 - 2019/3/18
N2 - Object localization in video is to predict the location and image boundaries of objects of interest in sequential scenes. Despite numerous methods being developed for the task, there are still challenging issues, such as labor-intensive data preparation. In this paper, we propose a patch-wise approach with weak supervision to resolve those issues in the object localization. We first train an patch-wise object classifier based on convolutional neural network with simple labeling about object classes, instead of the bounding box annotation. Then, the object regions are estimated using the class activation maps of the classifier for each patch. The patch-wise classifier can learn more relevant features of objects from the patches containing various parts of them. In addition, background patches for weakly-supervised learning can be easily prepared. Experiments using the visual object tracking challenge data set showed that the patch-wise weakly supervised approach is effective in the object localization in video.
AB - Object localization in video is to predict the location and image boundaries of objects of interest in sequential scenes. Despite numerous methods being developed for the task, there are still challenging issues, such as labor-intensive data preparation. In this paper, we propose a patch-wise approach with weak supervision to resolve those issues in the object localization. We first train an patch-wise object classifier based on convolutional neural network with simple labeling about object classes, instead of the bounding box annotation. Then, the object regions are estimated using the class activation maps of the classifier for each patch. The patch-wise classifier can learn more relevant features of objects from the patches containing various parts of them. In addition, background patches for weakly-supervised learning can be easily prepared. Experiments using the visual object tracking challenge data set showed that the patch-wise weakly supervised approach is effective in the object localization in video.
UR - http://www.scopus.com/inward/record.url?scp=85063880755&partnerID=8YFLogxK
U2 - 10.1109/ICAIIC.2019.8668987
DO - 10.1109/ICAIIC.2019.8668987
M3 - Conference contribution
AN - SCOPUS:85063880755
T3 - 1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019
SP - 263
EP - 266
BT - 1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019
Y2 - 11 February 2019 through 13 February 2019
ER -