Sökning: onr:"swepub:oai:DiVA.org:uu-431778" >
Monitoring High-Fre...
-
Pelckmans, KristiaanUppsala universitet,Avdelningen för systemteknik,Reglerteknik
(författare)
Monitoring High-Frequency Data Streams in FinTech : FADO Versus K-means
- Artikel/kapitelEngelska2020
Förlag, utgivningsår, omfång ...
Nummerbeteckningar
-
LIBRIS-ID:oai:DiVA.org:uu-431778
-
https://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-431778URI
-
https://doi.org/10.1109/MIS.2020.2977012DOI
Kompletterande språkuppgifter
-
Språk:engelska
-
Sammanfattning på:engelska
Ingår i deldatabas
Klassifikation
-
Ämneskategori:ref swepub-contenttype
-
Ämneskategori:art swepub-publicationtype
Anmärkningar
-
Modern applications of FinTech are challenged by enormous volumes of financial data. One way to handle these is to adopt a streaming setting where data are only available to the algorithms during a very short time. When a new data point (financial transaction) is generated, it needs to be processed directly, and be forgotten immediately after. Especially, ongoing globalization efforts in FinTech require modern methods of fault detection to be able to work efficiently through more than 10 000 financial transactions per second if they are to be deployed as a first line of defence. This article investigates two algorithms able to perform well in this demanding setting: K-means and FADO. Especially, this article provides supports for the claim that “the use of multiple clusters does not necessarily translate into increased detection performance”. To support this claim, results are reported when operating in a quasi-realistic case study of Anti Money Laundering (AML) detection in real-time payment systems. We focus on two prototypical algorithms: the passive aggressive FADO assuming a single cluster, and the well-known K-means algorithm working with K > 1 clusters. We find-in this case-that the use of K-means with multiple clusters is unfavorable as 1) both tuning for K, as well as the need for additional complexity in the K-means algorithm challenges the computational constraints; 2) K-means introduces necessarily added variability (unreliability) in the results; 3) it requires dimensionality reduction, compromising interpretability of the detections; 4) the prevalence of singleton clusters adds unreliability to the outcome. This makes in the presented case FADO favorable over K-means (with K > 1).
Ämnesord och genrebeteckningar
Biuppslag (personer, institutioner, konferenser, titlar ...)
-
Uppsala universitetAvdelningen för systemteknik
(creator_code:org_t)
Sammanhörande titlar
-
Ingår i:IEEE Intelligent Systems35:2, s. 36-421541-16721941-1294
Internetlänk
Hitta via bibliotek
Till lärosätets databas