SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:liu-196015"
 

Search: onr:"swepub:oai:DiVA.org:liu-196015" > Learning Hierarchic...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Learning Hierarchical Policies by Iteratively Reducing the Width of Sketch Rules

Drexler, Dominik, 1993- (author)
Linköpings universitet,Artificiell intelligens och integrerade datorsystem,Tekniska fakulteten
Seipp, Jendrik (author)
Linköpings universitet,Artificiell intelligens och integrerade datorsystem,Tekniska fakulteten
Geffner, Hector (author)
Linköpings universitet,Artificiell intelligens och integrerade datorsystem,Tekniska fakulteten
 (creator_code:org_t)
2023
2023
English.
In: 20th International Conference on Principles of Knowledge Representation and Reasoning, Rhodes, Greece, September 2-8, 2023.
  • Conference paper (peer-reviewed)
Abstract Subject headings
Close  
  • Hierarchical policies are a key ingredient of intelligent behavior, expressing the different levels of abstraction involved in the solution of a problem. Learning hierarchical policies, however, remains a challenge, as no general learning principles have been identified for this purpose, despite the broad interest and vast literature in both model-free reinforcement learning and model-based planning. In this work, we introduce a principled method for learning hierarchical policies over classical planning domains, with no supervision from small instances. The method is based on learning to decompose problems into subproblems so that the subproblems have a lower complexity as measured by their width. Problems and subproblems are captured by means of sketch rules, and the scheme for reducing the width of sketch rules is applied iteratively until the final sketch rules have zero width and encode a general policy. We evaluate the learning method on a number of classical planning domains, analyze the resulting hierarchical policies, and prove their properties. We also show that learning hierarchical policies by learning and refining sketches iteratively is often more efficient than learning flat general policies in one shot.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Keyword

classical planning
learning hierarchical policies
policy sketches language
planning width

Publication and Content Type

ref (subject category)
kon (subject category)

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Find more in SwePub

By the author/editor
Drexler, Dominik ...
Seipp, Jendrik
Geffner, Hector
About the subject
NATURAL SCIENCES
NATURAL SCIENCES
and Computer and Inf ...
and Computer Science ...
Articles in the publication
By the university
Linköping University

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view