TY - GEN
T1 - Robust Foreground Segmentation in RGBD Data from Complex Scenes Using Adversarial Networks
AU - Sultana, Maryam
AU - Bouwmans, Thierry
AU - Giraldo, Jhony H.
AU - Jung, Soon Ki
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Foreground segmentation is a fundamental problem in many artificial intelligence and computer vision based applications. However, robust foreground segmentation with high precision is still a challenging problem in complex scenes. Currently, many of the existing algorithms process the input data in RGB space only, where the foreground segmentation performance is most likely degraded by various challenges like shadows, color camouflage, illumination changes, out of range camera sensors and bootstrapping. Cameras capturing RGBD data are highly active visual sensors as they provide depth information along with RGB of the given input images. Therefore, to address the challenging problem we propose a foreground segmentation algorithm based on conditional generative adversarial networks using RGB and depth data. The goal of our proposed model is to perform robust foreground segmentation in the presence of various complex scenes with high accuracy. For this purpose, we trained our GAN based CNN model with RGBD input data conditioned on ground-truth information in an adversarial fashion. During training, our proposed model aims to learn the foreground segmentation on the basis of cross-entropy loss and euclidean distance loss to identify between real vs fake samples. While during testing the model is given RGBD input to the trained generator network that performs robust foreground segmentation. Our proposed method is evaluated using two RGBD benchmark datasets that are SBM-RGBD and MULTIVISION kinect. Various experimental evaluations and comparative analysis of our proposed model with eleven existing methods confirm its superior performance.
AB - Foreground segmentation is a fundamental problem in many artificial intelligence and computer vision based applications. However, robust foreground segmentation with high precision is still a challenging problem in complex scenes. Currently, many of the existing algorithms process the input data in RGB space only, where the foreground segmentation performance is most likely degraded by various challenges like shadows, color camouflage, illumination changes, out of range camera sensors and bootstrapping. Cameras capturing RGBD data are highly active visual sensors as they provide depth information along with RGB of the given input images. Therefore, to address the challenging problem we propose a foreground segmentation algorithm based on conditional generative adversarial networks using RGB and depth data. The goal of our proposed model is to perform robust foreground segmentation in the presence of various complex scenes with high accuracy. For this purpose, we trained our GAN based CNN model with RGBD input data conditioned on ground-truth information in an adversarial fashion. During training, our proposed model aims to learn the foreground segmentation on the basis of cross-entropy loss and euclidean distance loss to identify between real vs fake samples. While during testing the model is given RGBD input to the trained generator network that performs robust foreground segmentation. Our proposed method is evaluated using two RGBD benchmark datasets that are SBM-RGBD and MULTIVISION kinect. Various experimental evaluations and comparative analysis of our proposed model with eleven existing methods confirm its superior performance.
KW - Foreground segmentation
KW - Generative adversarial networks
KW - Supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85112697607&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-81638-4_1
DO - 10.1007/978-3-030-81638-4_1
M3 - Conference contribution
AN - SCOPUS:85112697607
SN - 9783030816377
T3 - Communications in Computer and Information Science
SP - 3
EP - 16
BT - Frontiers of Computer Vision - 27th International Workshop, IW-FCV 2021, Revised Selected Papers
A2 - Jeong, Hieyong
A2 - Sumi, Kazuhiko
PB - Springer Science and Business Media Deutschland GmbH
T2 - 27th International Workshop on Frontiers of Computer Vision, IW-FCV 2021
Y2 - 22 February 2021 through 23 February 2021
ER -