TY - JOUR
T1 - COVID-19 Outbreak Prediction by Using Machine Learning Algorithms
AU - Sher, Tahir
AU - Rehman, Abdul
AU - Kim, Dongsun
N1 - Publisher Copyright:
© 2023 Tech Science Press. All rights reserved.
PY - 2023
Y1 - 2023
N2 - COVID-19 is a contagious disease and its several variants put under stress in all walks of life and economy as well. Early diagnosis of the virus is a crucial task to prevent the spread of the virus as it is a threat to life in the whole world. However, with the advancement of technology, the Internet of Things (IoT) and social IoT (SIoT), the versatile data produced by smart devices helped a lot in overcoming this lethal disease. Data mining is a technique that could be used for extracting useful information from massive data. In this study, we used five supervised ML strategies for creating a model to analyze and forecast the existence of COVID-19 using the Kaggle dataset” COVID-19 Symptoms and Presence.” RapidMiner Studio ML software was used to apply the Decision Tree (DT), Random Forest (RF), K-Nearest Neighbors (K-NNs) and Naive Bayes (NB), Integrated Decision Tree (ID3) algorithms. To develop the model, the performance of each model was tested using 10-fold cross-validation and compared to major accuracy measures, Cohan’s kappa statistics, properly or mistakenly categorized cases and root means square error. The results demonstrate that DT outperforms other methods, with an accuracy of 98.42% and a root mean square error of 0.11. In the future, a devised model will be highly recommendable and supportive for early prediction/diagnosis of disease by providing different data sets.
AB - COVID-19 is a contagious disease and its several variants put under stress in all walks of life and economy as well. Early diagnosis of the virus is a crucial task to prevent the spread of the virus as it is a threat to life in the whole world. However, with the advancement of technology, the Internet of Things (IoT) and social IoT (SIoT), the versatile data produced by smart devices helped a lot in overcoming this lethal disease. Data mining is a technique that could be used for extracting useful information from massive data. In this study, we used five supervised ML strategies for creating a model to analyze and forecast the existence of COVID-19 using the Kaggle dataset” COVID-19 Symptoms and Presence.” RapidMiner Studio ML software was used to apply the Decision Tree (DT), Random Forest (RF), K-Nearest Neighbors (K-NNs) and Naive Bayes (NB), Integrated Decision Tree (ID3) algorithms. To develop the model, the performance of each model was tested using 10-fold cross-validation and compared to major accuracy measures, Cohan’s kappa statistics, properly or mistakenly categorized cases and root means square error. The results demonstrate that DT outperforms other methods, with an accuracy of 98.42% and a root mean square error of 0.11. In the future, a devised model will be highly recommendable and supportive for early prediction/diagnosis of disease by providing different data sets.
KW - algorithms
KW - COVID-19 analysis
KW - COVID-19 prediction
KW - internet of things (IoT)
KW - machine learning (ML)
KW - social IoT (SIoT)
UR - http://www.scopus.com/inward/record.url?scp=85139742683&partnerID=8YFLogxK
U2 - 10.32604/cmc.2023.032020
DO - 10.32604/cmc.2023.032020
M3 - Article
AN - SCOPUS:85139742683
SN - 1546-2218
VL - 74
SP - 1561
EP - 1574
JO - Computers, Materials and Continua
JF - Computers, Materials and Continua
IS - 1
ER -