TY - GEN
T1 - Learning Temporal Context of Normality for Unsupervised Anomaly Detection in Videos
AU - Hyun, Wooyeol
AU - Nam, Woo Jeoung
AU - Lee, Jooyeon
AU - Lee, Seong Whan
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Incomplete reconstruction of abnormal samples using convolutional autoencoders trained only on normal samples has been the key principle of anomaly detection. Such detection mechanisms utilize reconstruction error differences between normal and abnormal frames. This is not consistent, however, causing the normal and abnormal samples undistin-guishable. To handle this problem, we propose a shuffle-and-sort strategy for learning the temporal context of normality. The purpose of the strategy is to reconstruct shuffled input frames into an output with the correct order using a self-attention mechanism. Consequently, the proposed method can model the temporal context of normal events, which prevents the successful completion of reconstructing anomalies by the convolutional layers. We demonstrated the detection efficiency of the proposed method using public benchmark datasets: UCSD Pedestrian 2, CUHK Avenue, and ShanghaiTech Campus Datasets.
AB - Incomplete reconstruction of abnormal samples using convolutional autoencoders trained only on normal samples has been the key principle of anomaly detection. Such detection mechanisms utilize reconstruction error differences between normal and abnormal frames. This is not consistent, however, causing the normal and abnormal samples undistin-guishable. To handle this problem, we propose a shuffle-and-sort strategy for learning the temporal context of normality. The purpose of the strategy is to reconstruct shuffled input frames into an output with the correct order using a self-attention mechanism. Consequently, the proposed method can model the temporal context of normal events, which prevents the successful completion of reconstructing anomalies by the convolutional layers. We demonstrated the detection efficiency of the proposed method using public benchmark datasets: UCSD Pedestrian 2, CUHK Avenue, and ShanghaiTech Campus Datasets.
KW - attention mechanism
KW - deep autoencoders
KW - out-of-distribution detection
KW - video anomaly detection
UR - https://www.scopus.com/pages/publications/85142734232
U2 - 10.1109/SMC53654.2022.9945233
DO - 10.1109/SMC53654.2022.9945233
M3 - Conference contribution
AN - SCOPUS:85142734232
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 3261
EP - 3266
BT - 2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022
Y2 - 9 October 2022 through 12 October 2022
ER -