SwePub
Sök i LIBRIS databas

  Utökad sökning

id:"swepub:oai:DiVA.org:umu-31865"
 

Sökning: id:"swepub:oai:DiVA.org:umu-31865" > Model-free learning...

Model-free learning from demonstration

Billing, Erik, 1981- (författare)
Umeå universitet,Institutionen för datavetenskap,Department of Computing Science, Umeå University, Umeå, Sweden
Hellström, Thomas, 1956- (författare)
Umeå universitet,Institutionen för datavetenskap,Department of Computing Science, Umeå University, Umeå, Sweden
Janlert, Lars Erik, 1950- (författare)
Umeå universitet,Institutionen för datavetenskap,Department of Computing Science, Umeå University, Umeå, Sweden
 (creator_code:org_t)
Portugal : INSTICC, 2010
2010
Engelska.
Ingår i: ICAART 2010 - Proceedings of the international conference on agents and artificial intelligence. - Portugal : INSTICC. - 9789896740221 ; , s. 62-71
  • Konferensbidrag (refereegranskat)
Abstract Ämnesord
Stäng  
  • A novel robot learning algorithm called Predictive Sequence Learning (PSL) is presented and evaluated. PSL is a model-free prediction algorithm inspired by the dynamic temporal difference algorithm S-Learning. While S-Learning has previously been applied as a reinforcement learning algorithm for robots, PSL is here applied to a Learning from Demonstration problem. The proposed algorithm is evaluated on four tasks using a Khepera II robot. PSL builds a model from demonstrated data which is used to repeat the demonstrated behavior. After training, PSL can control the robot by continually predicting the next action, based on the sequence of passed sensor and motor events. PSL was able to successfully learn and repeat the first three (elementary) tasks, but it was unable to successfully repeat the fourth (composed) behavior. The results indicate that PSL is suitable for learning problems up to a certain complexity, while higher level coordination is required for learning more complex behaviors.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Nyckelord

Learning from Demonstration
Prediction
Robot Imitation
Motor Control
Model-free Learning
Computer science
Datavetenskap
computer and systems sciences
data- och systemvetenskap

Publikations- och innehållstyp

ref (ämneskategori)
kon (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy