SwePub
Sök i LIBRIS databas

  Utökad sökning

onr:"swepub:oai:DiVA.org:uu-431778"
 

Sökning: onr:"swepub:oai:DiVA.org:uu-431778" > Monitoring High-Fre...

Monitoring High-Frequency Data Streams in FinTech : FADO Versus K-means

Pelckmans, Kristiaan (författare)
Uppsala universitet,Avdelningen för systemteknik,Reglerteknik
 (creator_code:org_t)
2020
2020
Engelska.
Ingår i: IEEE Intelligent Systems. - 1541-1672 .- 1941-1294. ; 35:2, s. 36-42
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • Modern applications of FinTech are challenged by enormous volumes of financial data. One way to handle these is to adopt a streaming setting where data are only available to the algorithms during a very short time. When a new data point (financial transaction) is generated, it needs to be processed directly, and be forgotten immediately after. Especially, ongoing globalization efforts in FinTech require modern methods of fault detection to be able to work efficiently through more than 10 000 financial transactions per second if they are to be deployed as a first line of defence. This article investigates two algorithms able to perform well in this demanding setting: K-means and FADO. Especially, this article provides supports for the claim that “the use of multiple clusters does not necessarily translate into increased detection performance”. To support this claim, results are reported when operating in a quasi-realistic case study of Anti Money Laundering (AML) detection in real-time payment systems. We focus on two prototypical algorithms: the passive aggressive FADO assuming a single cluster, and the well-known K-means algorithm working with K > 1 clusters. We find-in this case-that the use of K-means with multiple clusters is unfavorable as 1) both tuning for K, as well as the need for additional complexity in the K-means algorithm challenges the computational constraints; 2) K-means introduces necessarily added variability (unreliability) in the results; 3) it requires dimensionality reduction, compromising interpretability of the detections; 4) the prevalence of singleton clusters adds unreliability to the outcome. This makes in the presented case FADO favorable over K-means (with K > 1).

Ämnesord

TEKNIK OCH TEKNOLOGIER  -- Elektroteknik och elektronik -- Reglerteknik (hsv//swe)
ENGINEERING AND TECHNOLOGY  -- Electrical Engineering, Electronic Engineering, Information Engineering -- Control Engineering (hsv//eng)
NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Nyckelord

Clustering algorithms
Intelligent systems
Anomaly detection
Approximation algorithms
Reliability
Law
Fraud detection
fintech
artificial intelligence
filtering

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Hitta mer i SwePub

Av författaren/redakt...
Pelckmans, Krist ...
Om ämnet
TEKNIK OCH TEKNOLOGIER
TEKNIK OCH TEKNO ...
och Elektroteknik oc ...
och Reglerteknik
NATURVETENSKAP
NATURVETENSKAP
och Data och informa ...
och Datavetenskap
Artiklar i publikationen
IEEE Intelligent ...
Av lärosätet
Uppsala universitet

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy