Sökning: id:"swepub:oai:DiVA.org:ltu-27045" >
Multivariate simila...
Multivariate similarity-based conformity measure (MSCM : An outlier detection measure for data mining applications
-
- Badawy, Shaimaa Ali (författare)
- University of Western Ontario
-
- Elragal, Ahmed (författare)
- German University in Cairo (GUC)
-
- Gabr, Mahmoud Mohamed Hassan (författare)
- Faculty of Science, Alexandria University
-
(creator_code:org_t)
- Anaheim, Calif : ACTA Press, 2008
- 2008
- Engelska.
-
Ingår i: Proceedings of the IASTED International Conference on Artificial Intelligence and Applications Machine Learning. - Anaheim, Calif : ACTA Press. - 9780889867093 ; , s. 314-320
- Relaterad länk:
-
https://urn.kb.se/re...
Abstract
Ämnesord
Stäng
- Outliers, the odd objects in the dataset, can be viewed from two different perspectives; the outliers as undesirable objects that should be treated or deleted in the data preparation step of the data mining process, and the outliers as interesting objects that are identified for their own interest in the data mining step of the mining process. In the latter case, outliers shouldn't be removed, that's why one of the main categories of tasks performed by data mining techniques is outlier detection. Applications that make use of such detection include credit card fraud detection and network intrusion detection. Most of the available outlier detection techniques rely in a distance measure to compare the objects in the dataset which imposed the restriction of dealing with numeric data. In this paper a new multivariate similarity-based conformity measure (MSCM) is suggested to be used to detect outliers in datasets that contain attributes of different data types. The MSCM satisfies two other desirable features; being a multivariate measure and giving ranking instead of a binary judgment of the object. The measure has been applied on three different datasets in order to be evaluated; the measure has shown good results in these experiments.
Ämnesord
- SAMHÄLLSVETENSKAP -- Medie- och kommunikationsvetenskap -- Systemvetenskap, informationssystem och informatik med samhällsvetenskaplig inriktning (hsv//swe)
- SOCIAL SCIENCES -- Media and Communications -- Information Systems, Social aspects (hsv//eng)
Nyckelord
- Information systems
- Informationssystem
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)
Hitta via bibliotek
Till lärosätets databas