Sökning: id:"swepub:oai:DiVA.org:kth-330695" >
Reducing the Learni...
Reducing the Learning Time of Reinforcement Learning for the Supervisory Control of Discrete Event Systems
-
- Yang, Junjun (författare)
- School of Electro-Mechanical Engineering, Xidian University, Xi’an, China
-
- Tan, Kaige (författare)
- KTH,Mekatronik och inbyggda styrsystem
-
- Feng, Lei (författare)
- KTH,Mekatronik och inbyggda styrsystem
-
visa fler...
-
- El-Sherbeeny, Ahmed M. (författare)
- Department of Industrial Engineering, College of Engineering, King Saud University, Riyadh, Saudi Arabia
-
- Li, Zhiwu (författare)
- School of Electro-Mechanical Engineering, Xidian University, Xi’an, China
-
visa färre...
-
(creator_code:org_t)
- IEEE, 2023
- 2023
- Engelska.
-
Ingår i: IEEE Access. - : IEEE. - 2169-3536. ; 11, s. 59840-59853
- Relaterad länk:
-
https://doi.org/10.1...
-
visa fler...
-
https://ieeexplore.i...
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- Reinforcement learning (RL) can obtain the supervisory controller for discrete-event systems modeled by finite automata and temporal logic. The published methods often have two limitations. First, a large number of training data are required to learn the RL controller. Second, the RL algorithms do not consider uncontrollable events, which are essential for supervisory control theory (SCT). To address the limitations, we first apply SCT to find the supervisors for the specifications modeled by automata. These supervisors remove illegal training data violating these specifications and hence reduce the exploration space of the RL algorithm. For the remaining specifications modeled by temporal logic, the RL algorithm is applied to search for the optimal control decision within the confined exploration space. Uncontrollable events are considered by the RL algorithm as uncertainties in the plant model. The proposed method can obtain a nonblocking supervisor for all specifications with less learning time than the published methods.
Ämnesord
- TEKNIK OCH TEKNOLOGIER -- Elektroteknik och elektronik -- Reglerteknik (hsv//swe)
- ENGINEERING AND TECHNOLOGY -- Electrical Engineering, Electronic Engineering, Information Engineering -- Control Engineering (hsv//eng)
Nyckelord
- Discrete event system
- linear temporal logic
- supervisory control theory
- reinforcement learning
- Optimeringslära och systemteori
- Optimization and Systems Theory
- Datalogi
- Computer Science
- Industriella informations- och styrsystem
- Industrial Information and Control Systems
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
Till lärosätets databas