Which crashes should i fix first? Predicting top crashes at an early stage to prioritize debugging efforts

Dongsun Kim, Xinming Wang, Sunghun Kim, Andreas Zeller, S. C. Cheung, Sooyong Park

Research output: Contribution to journalArticlepeer-review

68 Scopus citations

Abstract

Many popular software systems automatically report failures back to the vendors, allowing developers to focus on the most pressing problems. However, it takes a certain period of time to assess which failures occur most frequently. In an empirical investigation of the Firefox and Thunderbird crash report databases, we found that only 10 to 20 crashes account for the large majority of crash reports; predicting these "top crashes" thus could dramatically increase software quality. By training a machine learner on the features of top crashes of past releases, we can effectively predict the top crashes well before a new release. This allows for quick resolution of the most important crashes, leading to improved user experience and better allocation of maintenance efforts.

Original languageEnglish
Pages (from-to)430-447
Number of pages18
JournalIEEE Transactions on Software Engineering
Volume37
Issue number3
DOIs
StatePublished - 2011

Keywords

  • Crash reports
  • Data mining
  • Machine learning
  • Social network analysis
  • Top crash

Fingerprint

Dive into the research topics of 'Which crashes should i fix first? Predicting top crashes at an early stage to prioritize debugging efforts'. Together they form a unique fingerprint.

Cite this