SwePub

Improving Data Quality for Regression Test Selection by Reducing Annotation Noise

Al Sabbagh, Khaled, 1987 (author)
Gothenburg University (Göteborgs universitet), Department of Computer Science and Engineering, Computing Science (GU)
Staron, Miroslaw, 1977 (author)
Gothenburg University (Göteborgs universitet), Department of Computer Science and Engineering, Software Engineering (GU); Software Center
Hebig, Regina, 1984 (author)
Gothenburg University (Göteborgs universitet), Department of Computer Science and Engineering, Software Engineering (GU)
Meding, Wilhelm, 1970 (author)
Telefonaktiebolaget L M Ericsson,Ericsson
2020
English.
In: Proceedings - 46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020, pp. 191-194
  • Conference paper (peer-reviewed)
Abstract
  • Big data and machine learning models have been increasingly used to support software engineering processes and practices. One example is the use of machine learning models to improve test case selection in continuous integration. However, one of the challenges in building such models is the identification and reduction of noise that often comes in large data. In this paper, we present a noise reduction approach that deals with the problem of contradictory training entries. We empirically evaluate the effectiveness of the approach in the context of selective regression testing. For this purpose, we use a curated training set as input to a tree-based machine learning ensemble and compare the classification precision, recall, and f-score against a non-curated set. Our study shows that using the noise reduction approach on the training instances gives better results in prediction with an improvement of 37% on precision, 70% on recall, and 59% on f-score.
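
The abstract does not spell out how contradictory training entries are detected or resolved. As a minimal sketch only, assuming that a "contradictory" entry means identical feature values paired with different labels, and assuming that every such entry is simply dropped, the curation step could look roughly like the following. The function name `drop_contradictory_entries`, the feature tuples, and the `"pass"`/`"fail"` labels are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def drop_contradictory_entries(rows):
    """Return only the rows whose feature vector maps to a single label.

    rows: list of (features, label) pairs, where features is a hashable tuple.
    Assumption: rows with identical features but different labels are noise,
    and all of them are removed (the paper may resolve them differently).
    """
    labels_by_features = defaultdict(set)
    for features, label in rows:
        labels_by_features[features].add(label)
    # Keep a row only if its feature vector was never seen with another label.
    return [
        (features, label)
        for features, label in rows
        if len(labels_by_features[features]) == 1
    ]


# Hypothetical training set: the first two rows contradict each other.
training = [
    ((1, 0, 1), "fail"),
    ((1, 0, 1), "pass"),
    ((0, 1, 1), "pass"),
    ((0, 1, 1), "pass"),
    ((1, 1, 0), "fail"),
]

curated = drop_contradictory_entries(training)
```

The curated list would then serve as input to a classifier, while the original list plays the role of the non-curated baseline in a comparison of precision, recall, and F-score.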

Subject headings

NATURAL SCIENCES  -- Computer and Information Sciences -- Other Computer and Information Science (hsv//eng)
NATURAL SCIENCES  -- Computer and Information Sciences -- Language Technology (hsv//eng)
NATURAL SCIENCES  -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
NATURAL SCIENCES  -- Computer and Information Sciences -- Software Engineering (hsv//eng)

Keyword

Machine Learning Models
Regression Testing
Annotation Noise

Publication and Content Type

kon (conference paper)
ref (peer-reviewed)
