SwePub
Sök i LIBRIS databas

  Utökad sökning

id:"swepub:oai:DiVA.org:oru-4137"
 

Sökning: id:"swepub:oai:DiVA.org:oru-4137" > Learning reactive b...

Learning reactive behaviors with constructive neural networks in mobile robotics

Li, Jun (författare)
Örebro universitet,Institutionen för teknik,Learning Systems Lab
Duckett, Tom (preses)
Ziemke, Tom, Professor (opponent)
University of Skövde, Sweden
 (creator_code:org_t)
ISBN 9176684903
Örebro : Örebro universitetsbibliotek, 2006
Engelska 165 s.
Serie: Örebro Studies in Technology, 1650-8580 ; 23
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)
Abstract Ämnesord
Stäng  
  • This thesis investigates a learning system for acquiring robot behaviors by mapping sensor information directly to motor actions. It addresses the integration of three learning paradigms, namely unsupervised learning (UL), supervised learning (SL), and reinforcement learning (RL). The approach is characterized by the use of constructive artificial neural networks (ANNs). The sensor-motor mappings acquired by the learning system form part of a tight "sense-learn-act" cycle, as opposed to "sense-plan-act", thus allowing the robot to learn concepts within its own sensorimotor experience while avoiding anthropomorphic bias.Novel techniques for robot learning using constructive radial basis function (RBF) networks are introduced. This leads to a self-organizing, incremental and local construction of the sensorimotor space for learning different behaviors with the same basic architecture, thus a great simplification of the engineering design process of the ANN's structure. Integration of the different learning paradigms takes place in a two-layer learning architecture.The lower layer with the UL and SL paradigms is used to quickly construct a controller for the required behavior. The upper layer with the RL paradigm is used for tuning and refining of the controller resulting from the lower layer (or a controller obtained from other prior knowledge) to further improve the robustness and performance of the behavior. Both layers apply constructive RBF networks, taking into account the different requirements of the respective learning paradigms.The learning system is verified by a number of experiments on a real robot. We begin our experiments with the lower layer together with a teaching-by-demonstration approach for acquiring different behaviors. The experimental results show that the lower layer can learn a wide range of robot behaviors, thus demonstrating the task non-specific nature of the architecture. We then demonstrate the necessity of the layered learning architecture for more complex behaviors by a docking behavior requiring precise positioning at a goal location. The results obtained show that learning with only the lower layer can not obtain robust performance and optimal trajectories, while learning with only the upper layer is impractical and even infeasible on the real robot due to the slow learning process of the RL paradigm. With layered learning, however, the upper layer is speeded up by bootstrapping from the learnedcontroller in the lower layer, and a robust and time-optimal controller can be obtained.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Nyckelord

Computer science
Datavetenskap
Datalogi
Computer and Systems Science

Publikations- och innehållstyp

vet (ämneskategori)
dok (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Hitta mer i SwePub

Av författaren/redakt...
Li, Jun
Duckett, Tom
Ziemke, Tom, Pro ...
Om ämnet
NATURVETENSKAP
NATURVETENSKAP
och Data och informa ...
och Datavetenskap
Delar i serien
Örebro Studies i ...
Av lärosätet
Örebro universitet

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy