Distributed Bandit Online Convex Optimization With Time-Varying Coupled Inequality Constraints


Yi, Xinlei (author): KTH, Reglerteknik

Li, Xiuxian (author): Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore.

Yang, Tao (author): Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China.

Xie, Lihua (author): Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore.

Chai, Tianyou (author): Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China.

Johansson, Karl H., 1967- (author): KTH, Reglerteknik

Institute of Electrical and Electronics Engineers (IEEE), 2021
English.
In: IEEE Transactions on Automatic Control. Institute of Electrical and Electronics Engineers (IEEE). ISSN 0018-9286, 1558-2523. 66:10, pp. 4620-4635

Related links: https://urn.kb.se/re...; https://doi.org/10.1...

Journal article (peer-reviewed)

Abstract

Distributed bandit online convex optimization with time-varying coupled inequality constraints is considered, motivated by a repeated game between a group of learners and an adversary. The learners attempt to minimize a sequence of global loss functions and at the same time satisfy a sequence of coupled constraint functions, where the constraints are coupled across the distributed learners at each round. The global loss and the coupled constraint functions are the sums of local convex loss and constraint functions, respectively, which are adaptively generated by the adversary. The local loss and constraint functions are revealed in a bandit manner, i.e., only the values of the loss and constraint functions are revealed to the learners at the sampling instance, and the revealed function values are held privately by each learner. Both one- and two-point bandit feedback are studied, with a corresponding distributed bandit online algorithm used by the learners in each case. We show that sublinear expected regret and constraint violation are achieved by these two algorithms if the accumulated variation of the comparator sequence also grows sublinearly. In particular, we show that $O(T^{\theta})$ expected static regret and $O(T^{7/4-\theta})$ constraint violation are achieved in the one-point bandit feedback setting, and $O(T^{\max\{\kappa,\,1-\kappa\}})$ expected static regret and $O(T^{1-\kappa/2})$ constraint violation in the two-point bandit feedback setting, where $\theta \in (3/4, 5/6]$ and $\kappa \in (0, 1)$ are user-defined trade-off parameters. Finally, the tightness of the theoretical results is illustrated by numerical simulations of a simple power grid example, which also compare the proposed algorithms to existing algorithms in the literature.
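
To make the difference between one- and two-point bandit feedback concrete, the following is a minimal sketch of the standard sphere-sampling gradient estimators from the bandit optimization literature. The abstract does not state the exact estimators the paper uses, so the function names, the scaling `dim / delta`, and the exploration radius `delta` are all assumptions made for illustration only.

```python
import numpy as np

def unit_sphere_sample(dim, rng):
    """Draw a direction uniformly from the unit sphere in R^dim."""
    u = rng.standard_normal(dim)
    return u / np.linalg.norm(u)

def one_point_estimate(f, x, delta, rng):
    """Gradient estimate from a single function value (one-point feedback).

    Hypothetical helper: only f(x + delta * u) is queried.
    """
    u = unit_sphere_sample(x.size, rng)
    return (x.size / delta) * f(x + delta * u) * u

def two_point_estimate(f, x, delta, rng):
    """Gradient estimate from two function values (two-point feedback).

    Hypothetical helper: f is queried at x + delta * u and at x.
    """
    u = unit_sphere_sample(x.size, rng)
    return (x.size / delta) * (f(x + delta * u) - f(x)) * u
```

Under these assumptions both estimators are unbiased for the gradient of a smoothed version of `f`, but the one-point estimate scales with the function value itself divided by `delta`, whereas the two-point estimate scales with a difference of function values. For a constant function, for instance, the two-point estimate is exactly zero while the one-point estimate is not. This is one intuition for why two-point feedback usually admits tighter bounds than one-point feedback.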
