Sökning: WFRF:(Abbas Cheddad) >
Clustering-based Ad...
Clustering-based Adaptive Data Augmentation for Class-imbalance in Machine Learning (CADA) : Additive Manufacturing Use-case
-
- Dasari, Siva Krishna, 1988- (författare)
- Blekinge Tekniska Högskola,Institutionen för datavetenskap
-
- Cheddad, Abbas (författare)
- Blekinge Tekniska Högskola,Institutionen för datavetenskap
-
- Palmquist, Jonatan (författare)
- GKN Aerospace Engine Systems
-
visa fler...
-
- Lundberg, Lars, 1962- (författare)
- Blekinge Tekniska Högskola,Institutionen för datavetenskap
-
visa färre...
-
(creator_code:org_t)
- 2022
- 2022
- Engelska.
-
Ingår i: Neural Computing & Applications. - : Springer London. - 0941-0643 .- 1433-3058.
- Relaterad länk:
-
https://doi.org/10.1...
-
visa fler...
-
https://bth.diva-por... (primary) (Raw object)
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- Large amount of data are generated from in-situ monitoring of additive manufacturing (AM) processes which is later used in prediction modelling for defect classification to speed up quality inspection of products. A high volume of this process data is defect-free (majority class) and a lower volume of this data has defects (minority class) which result in the class-imbalance issue. Using imbalanced datasets, classifiers often provide sub-optimal classification results i.e. better performance on the majority class than the minority class. However, it is important for process engineers that models classify defects more accurately than the class with no defects since this is crucial for quality inspection. Hence, we address the class-imbalance issue in manufacturing process data to support in-situ quality control of additive manufactured components. For this, we propose cluster-based adaptive data augmentation (CADA) for oversampling to address the class-imbalance problem. Quantitative experiments are conducted to evaluate the performance of the proposed method and to compare with other selected oversampling methods using AM datasets from an aerospace industry and a publicly available casting manufacturing dataset. The results show that CADA outperformed random oversampling and the SMOTE method and is similar to random data augmentation and cluster-based oversampling. Furthermore, the results of the statistical significance test show that there is a significant difference between the studied methods. As such, the CADA method can be considered as an alternative method for oversampling to improve the performance of models on the minority class.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
Nyckelord
- Class-imbalance
- Melt-pool defects classification
- Aerospace application
- Additive Manufacturing
- Polar Transformation
- Random Forests
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
Till lärosätets databas