SwePub

id:"swepub:oai:DiVA.org:mdh-64695"
 


Cross-Version Software Defect Prediction Considering Concept Drift and Chronological Splitting

Kabir, Md Alamgir (author)
Mälardalens universitet, Inbyggda system
Rehman, Atiq Ur (author)
Mälardalens universitet, Inbyggda system; Department of Electrical and Computer Engineering, Pak-Austria Fachhochschule Institute of Applied Sciences and Technology, Haripur, Pakistan
Islam, M. M. M. (author)
Intelligent Systems Research Centre, University of Ulster, Londonderry, United Kingdom
Ali, Nazakat (author)
Mälardalens universitet, Inbyggda system
Baptista, M. L. (author)
Air Transport and Operations, Faculty of Aerospace Engineering, Delft University of Technology (TU Delft), Delft, Netherlands
Multidisciplinary Digital Publishing Institute (MDPI), 2023
English.
In: Symmetry. - : Multidisciplinary Digital Publishing Institute (MDPI). - 2073-8994. ; 15:10
  • Journal article (peer-reviewed)
Abstract
  • Concept drift (CD) refers to a phenomenon where the data distribution within datasets changes over time, and this can have adverse effects on the performance of prediction models in software engineering (SE), including those used for tasks like cost estimation and defect prediction. Detecting CD in SE datasets is difficult, but important, because it identifies the need for retraining prediction models and in turn improves their performance. If the concept drift is caused by symmetric changes in the data distribution, the model adaptation process might need to account for this symmetry to maintain accurate predictions. This paper explores the impact of CD within the context of cross-version defect prediction (CVDP), aiming to enhance the reliability of prediction performance and to make the data more symmetric. A concept drift detection (CDD) approach is further proposed to identify data distributions that change over software versions. The proposed CDD framework consists of three stages: (i) data pre-processing for CD detection; (ii) notification of CD by triggering one of the three flags (i.e., CD, warning, and control); and (iii) providing guidance on when to update an existing model. Several experiments on 30 versions of seven software projects reveal the value of the proposed CDD. Some of the key findings of the proposed work include: (i) An exponential increase in the error-rate across different software versions is associated with CD. (ii) A moving-window approach to train defect prediction models on chronologically ordered defect data results in better CD detection than using all historical data with a large effect size (Formula presented.).
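
The three-flag notification in stage (ii) and the moving-window training in finding (ii) can be illustrated with a minimal sketch. This record does not say which detector or window size the authors use, so the code below assumes a rule in the style of the Drift Detection Method (DDM): the error rate of each version, plus its standard deviation, is compared with the lowest level seen so far, and a warning or CD flag is raised at 2 or 3 standard deviations above it. The function names `drift_flags` and `select_training_versions`, the thresholds, and the sample error rates are all hypothetical and are not taken from the paper.

```python
from math import sqrt


def drift_flags(error_rates, sample_sizes):
    """Flag each software version as 'control', 'warning', or 'CD'.

    Assumed DDM-style rule (not confirmed by the abstract): compare p + s
    for the current version with the lowest p + s seen since the last
    drift, where p is the error rate on n modules and
    s = sqrt(p * (1 - p) / n) is its standard deviation.
    """
    flags = []
    p_min = s_min = None  # lowest level seen since the last drift
    for p, n in zip(error_rates, sample_sizes):
        s = sqrt(p * (1 - p) / n)
        if p_min is None or p + s <= p_min + s_min:
            p_min, s_min = p, s       # new reference point
            flags.append("control")
        elif p + s >= p_min + 3 * s_min:
            flags.append("CD")        # drift: the model should be retrained
            p_min = s_min = None      # start over after retraining
        elif p + s >= p_min + 2 * s_min:
            flags.append("warning")
        else:
            flags.append("control")
    return flags


def select_training_versions(versions, window=None):
    """Pick training versions from a chronologically ordered list.

    window=None uses all historical versions; a positive window=k keeps
    only the k most recent ones (the moving-window setting).
    """
    return list(versions) if window is None else list(versions[-window:])


# Hypothetical error rates of a defect predictor on four successive
# versions, each evaluated on 400 modules.
rates = [0.20, 0.21, 0.225, 0.30]
print(drift_flags(rates, [400] * len(rates)))
# ['control', 'control', 'warning', 'CD']
print(select_training_versions(["1.0", "1.1", "1.2", "1.3"], window=2))
# ['1.2', '1.3']
```

In this sketch a CD flag resets the reference level, which corresponds to stage (iii): the existing model is replaced, and monitoring restarts from the retrained model's error rate.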

Subject headings

NATURAL SCIENCES  -- Computer and Information Sciences (hsv//eng)

Keywords

chronological splitting
concept drift
cross-version defect prediction
software defect prediction

Publication and content type

ref (subject category)
art (subject category)
