SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Korhonen Topi) "

Sökning: WFRF:(Korhonen Topi)

  • Resultat 1-6 av 6
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Garcia, Johan, 1970-, et al. (författare)
  • DIOPT : Extremely Fast Classification Using Lookups and Optimal Feature Discretization
  • 2020
  • Ingår i: 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN). - : IEEE. - 9781728169262
  • Konferensbidrag (refereegranskat)abstract
    • For low dimensional classification problems we propose the novel DIOPT approach which considers the construction of a discretized feature space. Predictions for all cells in this space are obtained by means of a reference classifier and the class labels are stored in a lookup table generated by enumerating the complete space. This then leads to extremely high classification throughput as inference consists only of discretizing the relevant features and reading the class label from the lookup table index corresponding to the concatenation of the discretized feature bin indices. Since the size of the lookup table is limited due to memory constraints, the selection of optimal features and their respective discretization levels is paramount. We propose a particular supervised discretization approach striving to achieve maximal class separation of the discretized features, and further employ a purpose-built memetic algorithm to search towards the optimal selection of features and discretization levels. The inference run time and classification accuracy of DIOPT is compared to benchmark random forest and decision tree classifiers in several publicly available data sets. Orders of magnitude improvements are recorded in classification runtime with insignificant or modest degradation in classification accuracy for many of the evaluated binary classification tasks.
  •  
2.
  • Garcia, Johan, 1970-, et al. (författare)
  • Efficient Distribution-Derived Features for High-Speed Encrypted Flow Classification
  • 2018
  • Ingår i: NetAI'18 Proceedings of the 2018 Workshop on Network Meets AI & ML. - New York : Association for Computing Machinery (ACM). - 9781450359115 ; , s. 21-27
  • Konferensbidrag (refereegranskat)abstract
    • Flow classification is an important tool to enable efficient network resource usage, support traffic engineering, and aid QoS mechanisms. As traffic is increasingly becoming encrypted by default, flow classification is turning towards the use of machine learning methods employing features that are also available for encrypted traffic. In this work we evaluate flow features that capture the distributional properties of in-flow per-packet metrics such as packet size and inter-arrival time. The characteristics of such distributions are often captured with general statistical measures such as standard deviation, variance, etc. We instead propose a Kolmogorov-Smirnov discretization (KSD) algorithm to perform histogram bin construction based on the distributional properties observed in the data. This allows for a richer, histogram based, representation which also requires less resources for feature computation than higher order statistical moments. A comprehensive evaluation using synthetic data from Gaussian and Beta mixtures show that the KSD approach provides Jensen-Shannon distance results surpassing those of uniform binning and probabilistic binning. An empirical evaluation using live traffic traces from a cellular network further shows that when coupled with a random forest classifier the KSD-constructed features improve classification performance compared to general statistical features based on higher order moments, or alternative bin placement approaches.
  •  
3.
  • Garcia, Johan, 1970-, et al. (författare)
  • On Runtime and Classification Performance of the Discretize-Optimize (DISCO) Classification Approach
  • 2018
  • Ingår i: Performance Evaluation Review. - New york, USA : Association for Computing Machinery (ACM). - 0163-5999 .- 1557-9484. ; 46:3, s. 167-170
  • Tidskriftsartikel (refereegranskat)abstract
    • Using machine learning in high-speed networks for tasks such as flow classification typically requires either very resource efficient classification approaches, large amounts of computational resources, or specialized hardware. Here we provide a sketch of the discretize-optimize (DISCO) approach which can construct an extremely efficient classifier for low dimensional problems by combining feature selection, efficient discretization, novel bin placement, and lookup. As feature selection and discretization parameters are crucial, appropriate combinatorial optimization is an important aspect of the approach. A performance evaluation is performed for a YouTube classification task using a cellular traffic data set. The initial evaluation results show that the DISCO approach can move the Pareto boundary in the classification performance versus runtime trade-off by up to an order of magnitude compared to runtime optimized random forest and decision tree classifiers.
  •  
4.
  • Garcia, Johan, 1970-, et al. (författare)
  • Towards Video Flow Classification at a Million Encrypted Flows Per Second
  • 2018
  • Ingår i: Proceedings of 32nd International Conference on Advanced Information Networking and Applications (AINA). - Krakow : IEEE. - 9781538621967 - 9781538621950
  • Konferensbidrag (refereegranskat)abstract
    • As end-to-end encryption on the Internet is becoming more prevalent, techniques such as deep packet inspection (DPI) can no longer be expected to be able to classify traffic. In many cellular networks a large fraction of all traffic is video traffic, and being able to divide flows in the network into video and non-video can provide considerable traffic engineering benefits. In this study we examine machine learning based flow classification using features that are available also for encrypted flows. Using a data set of several several billion packets from a live cellular network we examine the obtainable classification performance for two different ensemble-based classifiers. Further, we contrast the classification performance of a statistical-based feature set with a less computationally demanding alternate feature set. To also examine the runtime aspects of the problem, we export the trained models and use a tailor-made C implementation to evaluate the runtime performance. The results quantify the trade-off between classification and runtime performance, and show that up to 1 million classifications per second can be achieved for a single core. Considering that only the subset of flows reaching some minimum flow length will need to be classified, the results are promising with regards to deployment also in scenarios with very high flow arrival rates.
  •  
5.
  • Korhonen, Topi, et al. (författare)
  • Computational study of the effect of hypoxia on cancer response to radiation treatment
  • 2020
  • Ingår i: ROMAI Journal. - : The Romanian Society of Applied and Industrial Mathematics. - 1841-5512 .- 2065-7714. ; 16:2, s. 75-86
  • Tidskriftsartikel (refereegranskat)abstract
    • We perform a computational study of the propagation of the oxygen concentration within a two-dimensional slice of a heterogeneous tumour region where the position and shape of the blood vessels are known. Exploiting the parameters space, we explore which effect is noticeable what concerns the formation of hypoxic zones. We use this information to anticipate a patient-specific radiation treatment with a potentially controlled response of the cancer growth.
  •  
6.
  • Korhonen, Topi, et al. (författare)
  • Exploring Ranked Local Selectors for Stable Explanations of ML Models
  • 2021
  • Ingår i: 2021 2nd International Conference on Intelligent Data Science Technologies and Applications, IDSTA 2021. - : Institute of Electrical and Electronics Engineers (IEEE). - 9781665421805 ; , s. 122-129
  • Konferensbidrag (refereegranskat)abstract
    • While complex machine learning methods can achieve great performance, human-interpretable details of their internal reasoning is to a large extent unavailable. Interpretable machine learning can remedy the lack of access to model reasoning but remains an elusive feat to fully achieve. Here we propose ranked selectors as a method for post-hoc explainability of classification outcomes from arbitrary classification models, with an initial emphasis on tabular data of moderate dimensions. The method is based on constructing a set of selectors, or rules, delimiting a local class consistent domain with maximal cover around the item of interest. The extended adjacent feature space is probed to achieve a ranking of the selectors. The method supports the use of an explicit foil class Q, allowing the formulation of contrastive queries in the form 'Why inference P instead of alternative inference Q?'. The answer is given as a short list of disjoint rules, a format previously shown to be amenable to human interpretation. We demonstrate the proposed method in open datasets, and elaborate on its stability aspects relative to other comparable methods.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-6 av 6

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy