Skip to main navigation Skip to search Skip to main content

A Hierarchical Topic Modeling Analysis of Twitter Data on COVID-19 Vaccines

  • Samsung
  • Kyungpook National University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study employs hierarchical topic modeling (HTM) to analyze the discourse surrounding COVID-19 vaccines in Korea. We collected and preprocessed 123,041 Korean tweets mentioning major vaccine brands using the Twitter API, spanning from February 2021 to January 2023. Three hierarchical topic models were applied and compared on this dataset. To enhance the hierarchical understanding, the dataset was divided into four periods marked by significant vaccination events, and each HTM was applied to these subsets. Our findings show that hLDA consistently outperformed the other models across all periods. The topic hierarchy generated by the best-performing hLDA model reveals that public discourse in Korea primarily focuses on vaccine safety and effectiveness. These insights are expected to inform policy-making efforts aimed at increasing vaccine acceptance and improving public health outcomes.

Original languageEnglish
Title of host publication2024 Research in Adaptive and Convergent Systems - Proceedings of the 2024 International Conference on Research in Adaptive and Convergent Systems, RACS 2024
PublisherAssociation for Computing Machinery, Inc
Pages150-153
Number of pages4
ISBN (Electronic)9798400706066
DOIs
StatePublished - 8 Oct 2025
Event2024 International Conference on Research in Adaptive and Convergent Systems, RACS 2024 - Pompei, Italy
Duration: 5 Nov 20248 Nov 2024

Publication series

Name2024 Research in Adaptive and Convergent Systems - Proceedings of the 2024 International Conference on Research in Adaptive and Convergent Systems, RACS 2024

Conference

Conference2024 International Conference on Research in Adaptive and Convergent Systems, RACS 2024
Country/TerritoryItaly
CityPompei
Period5/11/248/11/24

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • BERTopic
  • COVID-19
  • hierarchical non-negative matrix factorization
  • hierarchical topic modeling
  • hLDA
  • korean Tweets

Fingerprint

Dive into the research topics of 'A Hierarchical Topic Modeling Analysis of Twitter Data on COVID-19 Vaccines'. Together they form a unique fingerprint.

Cite this