1. |
- Landelius, Tomas, et al.
(författare)
-
Reinforcement Learning Adaptive Control and Explicit Criterion Maximization
- 1996
-
Rapport (övrigt vetenskapligt/konstnärligt)abstract
- This paper reviews an existing algorithm for adaptive control based on explicit criterion maximization (ECM) and presents an extended version suited for reinforcement learning tasks. Furthermore, assumptions under which the algorithm convergences to a local maxima of a long term utility function are given. Such convergence theorems are very rare for reinforcement learning algorithms working with continuous state and action spaces. A number of similar algorithms, previously suggested to the reinforcement learning community, are briefly surveyed in order to give the presented algorithm a place in the field. The relations between the different algorithms is exemplified by checking their consistency on a simple problem of linear quadratic regulation (LQR).
|
|