Comparative Analysis of Deep Learning Architectures for Penetration and Aspiration Detection in Videofluoroscopic Swallowing Studies

Chinthala Sreya Reddy, Eunhee Park, Jong Taek Lee

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

This study concentrates on machine learning, specifically deep learning techniques, to automatically detect the presence of aspiration or penetration in videofluoroscopic swallowing studies (VFSS). A comparative analysis is conducted on various deep learning architectures such as 2D Convolutional Neural Networks (2D-CNN), Long Short-Term Memory (LSTM), and 3D Convolutional Neural Networks (3D-CNN). This comparison assesses the performance, network size, and computational speed of the models. In addition, we present findings derived from multi-label and multi-class classification methods. By evaluating the strengths and weaknesses of each technique, we propose the most effective method for detecting penetration or aspiration in VFSS. Our comprehensive evaluation reveals the superiority of 3D-CNN in the automatic detection of penetration and aspiration in VFSS. This research contributes to the development of a clinically viable automatic detection system, offering potential advancements in the care and management of patients with dysphagia.

Original languageEnglish
Pages (from-to)102843-102851
Number of pages9
JournalIEEE Access
Volume11
DOIs
StatePublished - 2023

Keywords

  • convolutional networks
  • dysphagia
  • long short-term memory
  • video classification
  • Videofluoroscopic swallowing study

Fingerprint

Dive into the research topics of 'Comparative Analysis of Deep Learning Architectures for Penetration and Aspiration Detection in Videofluoroscopic Swallowing Studies'. Together they form a unique fingerprint.

Cite this