TY - JOUR
T1 - Toward practical and plausible counterfactual explanation through latent adjustment in disentangled space
AU - Na, Seung Hyup
AU - Nam, Woo Jeoung
AU - Lee, Seong Whan
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/12/15
Y1 - 2023/12/15
N2 - Extensive research into eXplainable AI (XAI) has raised interest in generating counterfactual (CF) explanations. In the past, minimizing the perturbation of input was considered a priority aspect of CF for the benefit of user practicality. However, closeness to the CF data manifold, indicating plausibility, is now emerging as another important property of CF. Thus, we propose a novel framework for generating practical and plausible CFs by minimally perturbing the semantic information of inputs in a disentangled latent space of a generative adversarial network (GAN). Considering the possibility of linear change of semantic information in a disentangled latent space, we obtain the desired CFs using proposed algorithms that adjust the input latents and reference CF latents derived using an optimization-based GAN inversion method. The results of qualitative and quantitative experiments on several datasets from different domains demonstrate the superiority and versatility of our framework. In comparative experiments, it not only achieves 1.0 Validity for test samples from all datasets but also achieves the minimum values of 0.07 Dissimilarity, 5.96 Rec. Error, 0.94 IM1, and 0.01 Infer. Time for the MNIST dataset.
AB - Extensive research into eXplainable AI (XAI) has raised interest in generating counterfactual (CF) explanations. In the past, minimizing the perturbation of input was considered a priority aspect of CF for the benefit of user practicality. However, closeness to the CF data manifold, indicating plausibility, is now emerging as another important property of CF. Thus, we propose a novel framework for generating practical and plausible CFs by minimally perturbing the semantic information of inputs in a disentangled latent space of a generative adversarial network (GAN). Considering the possibility of linear change of semantic information in a disentangled latent space, we obtain the desired CFs using proposed algorithms that adjust the input latents and reference CF latents derived using an optimization-based GAN inversion method. The results of qualitative and quantitative experiments on several datasets from different domains demonstrate the superiority and versatility of our framework. In comparative experiments, it not only achieves 1.0 Validity for test samples from all datasets but also achieves the minimum values of 0.07 Dissimilarity, 5.96 Rec. Error, 0.94 IM1, and 0.01 Infer. Time for the MNIST dataset.
KW - GAN inversion
KW - Latent interpolation
KW - Post-hoc explanation
UR - http://www.scopus.com/inward/record.url?scp=85165534584&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2023.120982
DO - 10.1016/j.eswa.2023.120982
M3 - Article
AN - SCOPUS:85165534584
SN - 0957-4174
VL - 233
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 120982
ER -