TY - JOUR
T1 - DigBug—Pre/post-processing operator selection for accurate bug localization
AU - Kim, Kisub
AU - Ghatpande, Sankalp
AU - Liu, Kui
AU - Koyuncu, Anil
AU - Kim, Dongsun
AU - Bissyandé, Tegawendé F.
AU - Klein, Jacques
AU - Traon, Yves Le
N1 - Publisher Copyright:
© 2022
PY - 2022/7
Y1 - 2022/7
N2 - Bug localization is a recurrent maintenance task in software development. It aims at identifying relevant code locations (e.g., code files) that must be inspected to fix bugs. When such bugs are reported by users, the localization process become often overwhelming as it is mostly a manual task due to incomplete and informal information (written in natural languages) available in bug reports. The research community has then invested in automated approaches, notably using Information Retrieval techniques. Unfortunately, reported performance in the literature is still limited for practical usage. Our key observation, after empirically investigating a large dataset of bug reports as well as workflow and results of state-of-the-art approaches, is that most approaches attempt localization for every bug report without considering the different characteristics of the bug reports. We propose DIGBUG as a straightforward approach to specialized bug localization. This approach selects pre/post-processing operators based on the attributes of bug reports; and the bug localization model is parameterized in accordance as well. Our experiments confirm that departing from “one-size-fits-all” approaches, DIGBUG outperforms the state-of-the-art techniques by 6 and 14 percentage points, respectively in terms of MAP and MRR on average.
AB - Bug localization is a recurrent maintenance task in software development. It aims at identifying relevant code locations (e.g., code files) that must be inspected to fix bugs. When such bugs are reported by users, the localization process become often overwhelming as it is mostly a manual task due to incomplete and informal information (written in natural languages) available in bug reports. The research community has then invested in automated approaches, notably using Information Retrieval techniques. Unfortunately, reported performance in the literature is still limited for practical usage. Our key observation, after empirically investigating a large dataset of bug reports as well as workflow and results of state-of-the-art approaches, is that most approaches attempt localization for every bug report without considering the different characteristics of the bug reports. We propose DIGBUG as a straightforward approach to specialized bug localization. This approach selects pre/post-processing operators based on the attributes of bug reports; and the bug localization model is parameterized in accordance as well. Our experiments confirm that departing from “one-size-fits-all” approaches, DIGBUG outperforms the state-of-the-art techniques by 6 and 14 percentage points, respectively in terms of MAP and MRR on average.
KW - Bug characteristics
KW - Bug localization
KW - Bug report
KW - Fault localization
KW - Information retrieval
KW - Operator combination
UR - http://www.scopus.com/inward/record.url?scp=85127058392&partnerID=8YFLogxK
U2 - 10.1016/j.jss.2022.111300
DO - 10.1016/j.jss.2022.111300
M3 - Article
AN - SCOPUS:85127058392
SN - 0164-1212
VL - 189
JO - Journal of Systems and Software
JF - Journal of Systems and Software
M1 - 111300
ER -