Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge IoT

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Sökning: onr:"swepub:oai:DiVA.org:kth-325693" > Adaptive Stochastic...

1 av 1
Föregående post
Nästa post
Till träfflistan

LIBRIS Formathandbok (Information om MARC21)

Fältnamn	Indikatorer	Metadata
000		04562naa a2200637 4500
001		oai:DiVA.org:kth-325693
003		SwePub
008		230412s2022 \| \|\|\|\|\|\|\|\|\|\|\|000 \|\|eng\|
024	7	a https://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-3256932 URI
024	7	a https://doi.org/10.1109/JIOT.2022.31870672 DOI
040		a (SwePub)kth
041		a engb eng
042		9 SwePub
072	7	a ref2 swepub-contenttype
072	7	a art2 swepub-publicationtype
100	1	a Lei, Wanluu KTH,Teknisk informationsvetenskap,Interconnection Design in Baseband and Interconnect Department, Ericsson AB, Stockholm, Sweden4 aut0 (Swepub:kth)u1dm05jn
245	1 0	a Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge IoT
264	1	b Institute of Electrical and Electronics Engineers (IEEE),c 2022
338		a print2 rdacarrier
500		a QC 20230412
520		a Edge computing provides a promising paradigm to support the implementation of Internet of Things (IoT) by offloading tasks to nearby edge nodes. Meanwhile, the increasing network size makes it impractical for centralized data processing due to limited bandwidth, and consequently a decentralized learning scheme is preferable. Reinforcement learning (RL) has been widely investigated and shown to be a promising solution for decision-making and optimal control processes. For RL in a decentralized setup, edge nodes (agents) connected through a communication network aim to work collaboratively to find a policy to optimize the global reward as the sum of local rewards. However, communication costs, scalability, and adaptation in complex environments with heterogeneous agents may significantly limit the performance of decentralized RL. Alternating direction method of multipliers (ADMM) has a structure that allows for decentralized implementation and has shown faster convergence than gradient descent-based methods. Therefore, we propose an adaptive stochastic incremental ADMM (asI-ADMM) algorithm and apply the asI-ADMM to decentralized RL with edge-computing-empowered IoT networks. We provide convergence properties for the proposed algorithms by designing a Lyapunov function and prove that the asI-ADMM has O(1/k) + O(1/M) convergence rate, where k and M are the number of iterations and batch samples, respectively. Then, we test our algorithm with two supervised learning problems. For performance evaluation, we simulate two applications in decentralized RL settings with homogeneous and heterogeneous agents. The experimental results show that our proposed algorithms outperform the state of the art in terms of communication costs and scalability and can well adapt to complex IoT environments.
650	7	a TEKNIK OCH TEKNOLOGIERx Elektroteknik och elektronikx Reglerteknik0 (SwePub)202022 hsv//swe
650	7	a ENGINEERING AND TECHNOLOGYx Electrical Engineering, Electronic Engineering, Information Engineeringx Control Engineering0 (SwePub)202022 hsv//eng
653		a Communication efficiency
653		a decentralized edge computing
653		a reinforcement learning (RL)
653		a stochastic alternating direction method of multiplier (ADMM)
653		a Complex networks
653		a Data handling
653		a Decision making
653		a Edge computing
653		a Gradient methods
653		a Internet of things
653		a Job analysis
653		a Lyapunov functions
653		a Random processes
653		a Reinforcement learning
653		a Scalability
653		a Stochastic systems
653		a Alternating directions method of multipliers
653		a Convergence
653		a Decentralised
653		a Optimisations
653		a Reinforcement learnings
653		a Stochastic alternating direction method of multiplier
653		a Stochastics
653		a Task analysis
653		a Optimization
700	1	a Ye, Yuu KTH,Teknisk informationsvetenskap4 aut0 (Swepub:kth)u1y1pv31
700	1	a Xiao, Ming,d 1975-u KTH,Teknisk informationsvetenskap4 aut0 (Swepub:kth)u1iq6n9a
700	1	a Skoglund, Mikael,d 1969-u KTH,Teknisk informationsvetenskap4 aut0 (Swepub:kth)u1dbnyps
700	1	a Han, Z.4 aut
710	2	a KTHb Teknisk informationsvetenskap4 org
773	0	t IEEE Internet of Things Journald : Institute of Electrical and Electronics Engineers (IEEE)g 9:22, s. 22958-22971q 9:22<22958-22971x 2327-4662
856	4 8	u https://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-325693
856	4 8	u https://doi.org/10.1109/JIOT.2022.3187067

Hitta via bibliotek

IEEE Internet of Things Journal (Sök värdpublikationen i LIBRIS)

Till lärosätets databas

1 av 1
Föregående post
Nästa post
Till träfflistan

Hitta mer i SwePub

Av författaren/redakt...: Lei, Wanlu; Ye, Yu; Xiao, Ming, 1975 ...; Skoglund, Mikael ...; Han, Z.

Om ämnet

TEKNIK OCH TEKNOLOGIER: TEKNIK OCH TEKNO ...; och Elektroteknik oc ...; och Reglerteknik

Artiklar i publikationen: IEEE Internet of ...

Av lärosätet: Kungliga Tekniska Högskolan

Sök utanför SwePub

Sök vidare i:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se