Search: onr:"swepub:oai:research.chalmers.se:47944dc5-d901-4b68-9519-3d39b7c8f709" >
Predicting build ou...
Predicting build outcomes in continuous integration using textual analysis of source code commits
-
- Al Sabbagh, Khaled, 1987 (author)
- Göteborgs universitet,University of Gothenburg
-
- Staron, Miroslaw, 1977 (author)
- Göteborgs universitet,University of Gothenburg
-
- Hebig, Regina, 1984 (author)
- Göteborgs universitet,University of Gothenburg
-
(creator_code:org_t)
- 2022-11-09
- 2022
- English.
-
In: PROMISE 2022 - Proceedings of the 18th International Conference on Predictive Models and Data Analytics in Software Engineering, co-located with ESEC/FSE 2022. - New York, NY, USA : ACM. ; , s. 42-51
- Related links:
-
https://research.cha... (primary) (free)
-
show more...
-
https://doi.org/10.1...
-
https://research.cha...
-
show less...
Abstract
Subject headings
Close
- Machine learning has been increasingly used to solve various software engineering tasks. One example of its usage is to predict the outcome of builds in continuous integration, where a classifier is built to predict whether new code commits will successfully compile. The aim of this study is to investigate the effectiveness of fifteen software metrics in building a classifier for build outcome prediction. Particularly, we implemented an experiment wherein we compared the effectiveness of a line-level metric and fourteen other traditional software metrics on 49,040 build records that belong to 117 Java projects. We achieved an average precision of 91% and recall of 80% when using the line-level metric for training, compared to 90% precision and 76% recall for the next best traditional software metric. In contrast, using file-level metrics was found to yield a higher predictive quality (average MCC for the best software metric= 68%) than the line-level metric (average MCC= 16%) for the failed builds. We conclude that file-level metrics are better predictors of build outcomes for the failed builds, whereas the line-level metric is a slightly better predictor of passed builds.
Subject headings
- NATURVETENSKAP -- Data- och informationsvetenskap -- Programvaruteknik (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Software Engineering (hsv//eng)
- TEKNIK OCH TEKNOLOGIER -- Samhällsbyggnadsteknik -- Husbyggnad (hsv//swe)
- ENGINEERING AND TECHNOLOGY -- Civil Engineering -- Building Technologies (hsv//eng)
- TEKNIK OCH TEKNOLOGIER -- Elektroteknik och elektronik -- Datorsystem (hsv//swe)
- ENGINEERING AND TECHNOLOGY -- Electrical Engineering, Electronic Engineering, Information Engineering -- Computer Systems (hsv//eng)
Keyword
- Build Prediction
- Continuous Integration
- Textual Analysis
Publication and Content Type
- kon (subject category)
- ref (subject category)
To the university's database