University of Leicester
Browse

Robust optimal policies for Markov decision processes with safety-threshold constraints

Download (313.1 kB)
conference contribution
posted on 2018-05-30, 11:30 authored by Rayna Dimitrova, Jie Fu, Ufuk Topcu
Abstract: We study the synthesis of robust optimal control policies for Markov decision processes with transition uncertainty (UMDPs) and subject to two types of constraints: (i) constraints on the worst-case, maximal total cost and (ii) safety-threshold constraints that bound the worst-case probability of visiting a set of error states. For maximal total cost constraints, we propose a state-augmentation method and a two-step synthesis algorithm to generate deterministic, memoryless optimal policies given the reward to be maximized. For safety threshold constraints, we introduce a new cost function and provide an approximately optimal solution by a reduction to an uncertain Markov decision process under a maximal total cost constraint. The safety-threshold constraints require memory and randomization for optimality. We discuss the use and the limitations of the proposed solution.

History

Citation

55th IEEE Conference on Decision and Control (CDC), 2016, pp. 7081-7086 (6)

Author affiliation

/Organisation/COLLEGE OF SCIENCE AND ENGINEERING/Department of Informatics

Source

55th IEEE Conference on Decision and Control (CDC), Las Vegas, NV, USA

Version

  • AM (Accepted Manuscript)

Published in

55th IEEE Conference on Decision and Control (CDC)

Publisher

IEEE

issn

0743-1546

isbn

978-1-5090-1837-6

Copyright date

2016

Available date

2018-05-30

Publisher version

https://ieeexplore.ieee.org/document/7799360/

Temporal coverage: start date

2016-12-12

Temporal coverage: end date

2016-12-14

Language

en

Usage metrics

    University of Leicester Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC