TY - JOUR
T1 - Comparative Analysis of Deep Learning Architectures for Penetration and Aspiration Detection in Videofluoroscopic Swallowing Studies
AU - Reddy, Chinthala Sreya
AU - Park, Eunhee
AU - Lee, Jong Taek
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2023
Y1 - 2023
N2 - This study concentrates on machine learning, specifically deep learning techniques, to automatically detect the presence of aspiration or penetration in videofluoroscopic swallowing studies (VFSS). A comparative analysis is conducted on various deep learning architectures such as 2D Convolutional Neural Networks (2D-CNN), Long Short-Term Memory (LSTM), and 3D Convolutional Neural Networks (3D-CNN). This comparison assesses the performance, network size, and computational speed of the models. In addition, we present findings derived from multi-label and multi-class classification methods. By evaluating the strengths and weaknesses of each technique, we propose the most effective method for detecting penetration or aspiration in VFSS. Our comprehensive evaluation reveals the superiority of 3D-CNN in the automatic detection of penetration and aspiration in VFSS. This research contributes to the development of a clinically viable automatic detection system, offering potential advancements in the care and management of patients with dysphagia.
AB - This study concentrates on machine learning, specifically deep learning techniques, to automatically detect the presence of aspiration or penetration in videofluoroscopic swallowing studies (VFSS). A comparative analysis is conducted on various deep learning architectures such as 2D Convolutional Neural Networks (2D-CNN), Long Short-Term Memory (LSTM), and 3D Convolutional Neural Networks (3D-CNN). This comparison assesses the performance, network size, and computational speed of the models. In addition, we present findings derived from multi-label and multi-class classification methods. By evaluating the strengths and weaknesses of each technique, we propose the most effective method for detecting penetration or aspiration in VFSS. Our comprehensive evaluation reveals the superiority of 3D-CNN in the automatic detection of penetration and aspiration in VFSS. This research contributes to the development of a clinically viable automatic detection system, offering potential advancements in the care and management of patients with dysphagia.
KW - convolutional networks
KW - dysphagia
KW - long short-term memory
KW - video classification
KW - Videofluoroscopic swallowing study
UR - http://www.scopus.com/inward/record.url?scp=85171530851&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2023.3315342
DO - 10.1109/ACCESS.2023.3315342
M3 - Article
AN - SCOPUS:85171530851
SN - 2169-3536
VL - 11
SP - 102843
EP - 102851
JO - IEEE Access
JF - IEEE Access
ER -