TY - JOUR
T1 - CVGG-19
T2 - Customized Visual Geometry Group Deep Learning Architecture for Facial Emotion Recognition
AU - Kim, Jung Hwan
AU - Poulose, Alwin
AU - Han, Dong Seog
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024
Y1 - 2024
N2 - Facial emotion recognition (FER) detects a user's facial expression with the camera sensors and behaves according to the user's emotions. The FER can apply to entertainment, security, and traffic safety. The FER system requires a highly accurate and efficient algorithm to classify the driver's emotions. The-state-of-art architectures for FER, such as visual geometry group (VGG), Inception-V1, ResNet, and Xception, have some level of performance for classification. Nevertheless, the original VGG architectures suffer from the vanishing gradient, limited improvement performance, and expensive computational cost. In this paper, we propose the customized visual geometry group-19 (CVGG-19), which adopts the designs of the VGG, Inception-v1, ResNet, and Xception. Our proposed CVGG-19 architecture outperforms the conventional VGG-19 architecture by 59.29%, reducing the computational cost by 89.5%. Moreover, the CVGG-19 architecture's F1-score, which represents the real-time classifying performance, displays superior to the Inception-V1, ResNet50, and Xception architectures by 3.86%
AB - Facial emotion recognition (FER) detects a user's facial expression with the camera sensors and behaves according to the user's emotions. The FER can apply to entertainment, security, and traffic safety. The FER system requires a highly accurate and efficient algorithm to classify the driver's emotions. The-state-of-art architectures for FER, such as visual geometry group (VGG), Inception-V1, ResNet, and Xception, have some level of performance for classification. Nevertheless, the original VGG architectures suffer from the vanishing gradient, limited improvement performance, and expensive computational cost. In this paper, we propose the customized visual geometry group-19 (CVGG-19), which adopts the designs of the VGG, Inception-v1, ResNet, and Xception. Our proposed CVGG-19 architecture outperforms the conventional VGG-19 architecture by 59.29%, reducing the computational cost by 89.5%. Moreover, the CVGG-19 architecture's F1-score, which represents the real-time classifying performance, displays superior to the Inception-V1, ResNet50, and Xception architectures by 3.86%
KW - Categorical-cross entropy classification
KW - deep learning
KW - facial emotion recognition
KW - ResNet
KW - VGG
KW - VGG module
KW - Xception
UR - http://www.scopus.com/inward/record.url?scp=85188438674&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3377235
DO - 10.1109/ACCESS.2024.3377235
M3 - Article
AN - SCOPUS:85188438674
SN - 2169-3536
VL - 12
SP - 41557
EP - 41578
JO - IEEE Access
JF - IEEE Access
ER -