Target Value Criterion in Markov Decision Processes

Moritz, Lars Norman

doi:10.5445/IR/1000047288

Target Value Criterion in Markov Decision Processes

Moritz, Lars Norman

Abstract:

We apply the Target Value Criterion to an MDP with a random planning horizon, derive an optimality equation and prove the existence of an optimal stationary policy in a generalized state space. The structure of the value function is exploited to approximate the target space by a finite subset and to derive upper and lower bounds as well as nearly optimal policies. As an extension we combine the Total Reward Criterion and the Target Value Criterion in a penalty approach.

KITopen-Download

Volltext

DOI: 10.5445/IR/1000047288

Export

Statistiken

Seitenaufrufe: 320
seit 05.05.2018

Downloads: 1089
seit 06.05.2015

Zugehörige Institution(en) am KIT	Institut für Operations Research (IOR)
Publikationstyp	Hochschulschrift
Publikationsjahr	2014
Sprache	Englisch
Identifikator	urn:nbn:de:swb:90-472888 KITopen-ID: 1000047288
Verlag	Karlsruher Institut für Technologie (KIT)
Art der Arbeit	Dissertation
Fakultät	Fakultät für Wirtschaftswissenschaften (WIWI)
Institut	Institut für Operations Research (IOR)
Prüfungsdaten	04.12.2014
Schlagwörter	Markov Decision Processes, Risk Sensitive Optimization, Target Value Criterion
Referent/Betreuer	Waldmann, K.-H.

Repository KITopen

Target Value Criterion in Markov Decision Processes

Abstract: