SwePub
Sök i LIBRIS databas

  Extended search

WFRF:(Landelius Tomas)
 

Search: WFRF:(Landelius Tomas) > (1993-1994) > A Dynamic Tree Stru...

  • Landelius, Tomasn/a (author)

A Dynamic Tree Structure for Incremental Reinforcement Learning of Good Behavior

  • BookEnglish1994

Publisher, publication year, extent ...

  • Linköping, Sweden :Linköping University, Department of Electrical Engineering,1994
  • 12 s.
  • electronicrdacarrier

Numbers

  • LIBRIS-ID:oai:DiVA.org:liu-53421
  • https://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-53421URI

Supplementary language notes

  • Language:English
  • Summary in:English

Part of subdatabase

Classification

  • Subject category:vet swepub-contenttype
  • Subject category:rap swepub-publicationtype

Series

  • LiTH-ISY-R,1400-3902 ;1628

Notes

  • This paper addresses the idea of learning by reinforcement, within the theory of behaviorism. The reason for this choice is its generality and especially that the reinforcement learning paradigm allows systems to be designed, which can improve their behavior beyond that of their teacher. The role of the teacher is to define the reinforcement function, which acts as a description of the problem the machine is to solve. Gained knowledge is represented by a behavior probability density function which is approximated with a number of normal distributions, stored in the nodes of a binary tree. It is argued that a meaningful partitioning into local models can only be accomplished in a fused space consisting of both stimuli and responses. Given a stimulus, the system searches for responses likely to result in highly reinforced decisions by treating the sum of the two normal distributions on each level in the tree as a distribution describing the system's behavior at that resolution. The resolution of the response, as well as the tree growing and pruning processes, are controlled by a random variable based on the difference in performance between two consecutive levels in the tree. This results in a system that will never be content but will indefinitely continue to search for better solutions.

Subject headings and genre

  • TECHNOLOGY
  • TEKNIKVETENSKAP

Added entries (persons, corporate bodies, meetings, titles ...)

  • Knutsson, HansLinköpings universitet,Bildbehandling,Tekniska högskolan(Swepub:liu)hankn25 (author)
  • n/aBildbehandling (creator_code:org_t)

Internet link

To the university's database

Find more in SwePub

By the author/editor
Landelius, Tomas
Knutsson, Hans
Parts in the series
LiTH-ISY-R,
By the university
Linköping University

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view