Reducing Features to Improve Bug Prediction

Recently, machine learning classifiers have emerged as a way to predict the existence of a bug in a change made to a source code file. The classifier is first trained on software history data, and then used to predict bugs. Two drawbacks of existing classifier-based bug prediction are potentially in...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Shivaji, Shivkumar, Jr, E. James Whitehead, Akella, Ram, Kim, Sunghun
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Bug prediction Computer bugs Computing methodologies > Machine learning > Learning paradigms > Supervised learning > Supervised learning by classification Computing methodologies > Machine learning > Machine learning algorithms > Feature selection Computing methodologies > Machine learning > Machine learning approaches > Classification and regression trees Design engineering Feature Selection General and reference > Cross-computing tools and techniques > Verification History Machine learning Prediction algorithms Reliability Scalability Software and its engineering > Software creation and management > Software development process management Software and its engineering > Software creation and management > Software verification and validation > Formal software verification Software and its engineering > Software creation and management > Software verification and validation > Software defect analysis > Software testing and debugging Software and its engineering > Software organization and properties > Software functional properties > Formal methods > Software verification Software engineering Software performance Support vector machine classification Support vector machines Theory of computation > Semantics and reasoning > Program reasoning > Program verification
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Recently, machine learning classifiers have emerged as a way to predict the existence of a bug in a change made to a source code file. The classifier is first trained on software history data, and then used to predict bugs. Two drawbacks of existing classifier-based bug prediction are potentially insufficient accuracy for practical use, and use of a large number of features. These large numbers of features adversely impact scalability and accuracy of the approach. This paper proposes a feature selection technique applicable to classification-based bug prediction. This technique is applied to predict bugs in software changes, and performance of Naive Bayes and Support Vector Machine (SVM) classifiers is characterized.
ISSN:	1938-4300
DOI:	10.1109/ASE.2009.76