Relevance-based Online Planning in Complex POMDPs

Please use this identifier to cite or link to this item:
https://osnadocs.ub.uni-osnabrueck.de/handle/urn:nbn:de:gbv:700-202007173302
Open Access logo originally created by the Public Library of Science (PLoS)
Title: Relevance-based Online Planning in Complex POMDPs
Authors: Saborío Morales, Juan Carlos
ORCID of the author: https://orcid.org/0000-0003-3625-0661
Thesis advisor: Prof. Dr. Joachim Hertzberg
Thesis referee: Prof. Dr. Marc Toussaint
Abstract: Planning under uncertainty is a central topic at the intersection of disciplines such as artificial intelligence, cognitive science and robotics, and its aim is to enable artificial agents to solve challenging problems through a systematic approach to decision-making. Some of these challenges include generating expectations about different outcomes governed by a probability distribution and estimating the utility of actions based only on partial information. In addition, an agent must incorporate observations or information from the environment into its deliberation process and produce the next best action to execute, based on an updated understanding of the world. This process is commonly modeled as a POMDP, a discrete stochastic system that becomes intractable very quickly. Many real-world problems, however, can be simplified following cues derived from contextual information about the relative expected value of actions. Based on an intuitive approach to problem solving, and relying on ideas related to attention and relevance estimation, we propose a new approach to planning supported by our two main contributions: PGS grants an agent the ability to generate internal preferences and biases to guide action selection, and IRE allows the agent to reduce the dimensionality of complex problems while planning online. Unlike existing work that improves the performance of planning on POMDPs, PGS and IRE do not rely on detailed heuristics or domain knowledge, explicit action hierarchies or manually designed dependencies for state factoring. Our results show that this level of autonomy is important to solve increasingly more challenging problems, where manually designed simplifications scale poorly.
URL: https://osnadocs.ub.uni-osnabrueck.de/handle/urn:nbn:de:gbv:700-202007173302
Subject Keywords: Planning under uncertainty; POMDP planning; Monte Carlo Tree Search
Issue Date: 17-Jul-2020
License name: Attribution-NonCommercial-NoDerivs 3.0 Germany
License url: http://creativecommons.org/licenses/by-nc-nd/3.0/de/
Type of publication: Dissertation oder Habilitation [doctoralThesis]
Appears in Collections:FB06 - E-Dissertationen

Files in This Item:
File Description SizeFormat 
thesis_saborio_morales.pdfPräsentationsformat1,02 MBAdobe PDF
thesis_saborio_morales.pdf
Thumbnail
View/Open


This item is licensed under a Creative Commons License Creative Commons