TY - JOUR
T1 - Machine learning for high-resolution landslide susceptibility mapping
T2 - case study in Inje County, South Korea
AU - Le, Xuan Hien
AU - Eu, Song
AU - Choi, Chanul
AU - Nguyen, Duc Hai
AU - Yeon, Minho
AU - Lee, Giha
N1 - Publisher Copyright:
Copyright © 2023 Le, Eu, Choi, Nguyen, Yeon and Lee.
PY - 2023
Y1 - 2023
N2 - Landslides are a major natural hazard that can significantly damage infrastructure and cause loss of life. In South Korea, the current landslide susceptibility mapping (LSM) approach is mainly based on statistical techniques (logistic regression (LR) analysis). According to previous studies, this method has achieved an accuracy of approximately 75.2%. In this paper, we expand upon this traditional approach by comparing the performance of six machine learning (ML) algorithms for LSM in Inje County, South Korea. The study employed a combination of geographical data gathered from 2005 to 2019 to train and evaluate six algorithms, including LR, Gaussian Naive Bayes (GNB), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Random Forest (RF), and Extreme Gradient Boosting (XGB). The effectiveness of these models was measured by various criteria, such as the percentage of correct classification (PCC) score, F1 score, and Kappa score. The results demonstrated that the PCC and F1 scores of the six models fell between [0.869–0.941] and [0.857–0.940], respectively. RF and XGB had the highest PCC and F1 scores of 0.939 and 0.941, respectively. This study indicates that ML can be a valuable technique for high-resolution LSM in South Korea instead of the current approach.
AB - Landslides are a major natural hazard that can significantly damage infrastructure and cause loss of life. In South Korea, the current landslide susceptibility mapping (LSM) approach is mainly based on statistical techniques (logistic regression (LR) analysis). According to previous studies, this method has achieved an accuracy of approximately 75.2%. In this paper, we expand upon this traditional approach by comparing the performance of six machine learning (ML) algorithms for LSM in Inje County, South Korea. The study employed a combination of geographical data gathered from 2005 to 2019 to train and evaluate six algorithms, including LR, Gaussian Naive Bayes (GNB), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Random Forest (RF), and Extreme Gradient Boosting (XGB). The effectiveness of these models was measured by various criteria, such as the percentage of correct classification (PCC) score, F1 score, and Kappa score. The results demonstrated that the PCC and F1 scores of the six models fell between [0.869–0.941] and [0.857–0.940], respectively. RF and XGB had the highest PCC and F1 scores of 0.939 and 0.941, respectively. This study indicates that ML can be a valuable technique for high-resolution LSM in South Korea instead of the current approach.
KW - disaster management
KW - extreme gradient boosting (XGB)
KW - feature importance
KW - landslide
KW - landslide probability
KW - landslide susceptibility mapping (LSM)
KW - random forest (RF)
KW - risk map
UR - http://www.scopus.com/inward/record.url?scp=85173460773&partnerID=8YFLogxK
U2 - 10.3389/feart.2023.1268501
DO - 10.3389/feart.2023.1268501
M3 - Article
AN - SCOPUS:85173460773
SN - 2296-6463
VL - 11
JO - Frontiers in Earth Science
JF - Frontiers in Earth Science
M1 - 1268501
ER -