Sökning: id:"swepub:oai:DiVA.org:kth-144608" >
Probabilistic Fault...
Probabilistic Fault Management in Networked Systems
-
- Steinert, Rebecca (författare)
- RISE,KTH,Beräkningsbiologi, CB,Swedish ICT SICS,Decisions, Networks and Analytics lab
-
- Lansner, Anders, Professor (preses)
- KTH,Beräkningsbiologi, CB
-
- Gillblad, Daniel, PhD (preses)
- Swedish ICT SICS
-
visa fler...
-
- Holst, Anders, Docent (preses)
- Swedish ICT SICS
-
- Festor, Olivier, Professor (opponent)
- School of Engineering in Information Technology TELECOM Nancy
-
visa färre...
-
(creator_code:org_t)
- ISBN 9789175951140
- Stockholm : KTH Royal Institute of Technology, 2014
- Engelska 61 s.
-
Serie: TRITA-CSC-A, 1653-5723 ; 2014:06
-
Serie: SICS dissertation series, 1101-1335
- Relaterad länk:
-
https://kth.diva-por... (primary) (Raw object)
-
visa fler...
-
http://www.diva-port...
-
https://urn.kb.se/re...
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- Technical advances in network communication systems (e.g. radio access networks) combined with evolving concepts based on virtualization (e.g. clouds), require new management algorithms in order to handle the increasing complexity in the network behavior and variability in the network environment. Current network management operations are primarily centralized and deterministic, and are carried out via automated scripts and manual interventions, which work for mid-sized and fairly static networks. The next generation of communication networks and systems will be of significantly larger size and complexity, and will require scalable and autonomous management algorithms in order to meet operational requirements on reliability, failure resilience, and resource-efficiency.A promising approach to address these challenges includes the development of probabilistic management algorithms, following three main design goals. The first goal relates to all aspects of scalability, ranging from efficient usage of network resources to computational efficiency. The second goal relates to adaptability in maintaining the models up-to-date for the purpose of accurately reflecting the network state. The third goal relates to reliability in the algorithm performance in the sense of improved performance predictability and simplified algorithm control.This thesis is about probabilistic approaches to fault management that follow the concepts of probabilistic network management (PNM). An overview of existing network management algorithms and methods in relation to PNM is provided. The concepts of PNM and the implications of employing PNM-algorithms are presented and discussed. Moreover, some of the practical differences of using a probabilistic fault detection algorithm compared to a deterministic method are investigated. Further, six probabilistic fault management algorithms that implement different aspects of PNM are presented.The algorithms are highly decentralized, adaptive and autonomous, and cover several problem areas, such as probabilistic fault detection and controllable detection performance; distributed and decentralized change detection in modeled link metrics; root-cause analysis in virtual overlays; event-correlation and pattern mining in data logs; and, probabilistic failure diagnosis. The probabilistic models (for a large part based on Bayesian parameter estimation) are memory-efficient and can be used and re-used for multiple purposes, such as performance monitoring, detection, and self-adjustment of the algorithm behavior.
Ämnesord
- TEKNIK OCH TEKNOLOGIER -- Elektroteknik och elektronik -- Kommunikationssystem (hsv//swe)
- ENGINEERING AND TECHNOLOGY -- Electrical Engineering, Electronic Engineering, Information Engineering -- Communication Systems (hsv//eng)
- NATURVETENSKAP -- Data- och informationsvetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences (hsv//eng)
Nyckelord
- probabilistic network management; probabilistic modeling; fault management; fault detection; event-correlation; change detection
- probabilistisk nätverkshantering; probabilistiska modeller; fel- hantering; feldetektion; korrelationsanalys; förändringsdetektion
- Computer Science
- Datalogi
Publikations- och innehållstyp
- vet (ämneskategori)
- dok (ämneskategori)
Hitta via bibliotek
Till lärosätets databas