Gradient-Descent for Randomized Controllers under Partial Observability (VMCAI 2022)

Sun 16 - Fri 28 January 2022 Philadelphia, Pennsylvania, United States

Who

Jip Spel, Linus Heck, Sebastian Junges, Joshua Moerman, Joost-Pieter Katoen

Track

VMCAI 2022

When

Tue 18 Jan 2022 09:00 - 09:30 at Salon I - Synthesis. Chair(s): Viktor Kunčak

Abstract

Randomization is a powerful technique to create robust controllers, in particular in partially observable settings. The degrees of randomization have a significant impact on the system performance, yet they are intricate to get right. The use of synthesis algorithms for parametric Markov chains (pMCs) is a promising direction to support the design process of such controllers. This paper shows how to define and evaluate gradients of pMCs. Furthermore, it investigates varieties of gradient descent techniques from the machine learning community to synthesize the probabilities in a pMC. The resulting method scales to significantly larger pMCs than before and empirically outperforms the state of the art, often by at least one order of magnitude.

Jip Spel

RWTH Aachen University

Germany

Linus Heck

RWTH Aachen University

Germany

Sebastian Junges

University of California, Berkeley

United States

Joshua Moerman

Open University of the Netherlands

Netherlands

Joost-Pieter Katoen

RWTH Aachen University

Germany

Session Program

Tue 18 Jan
Displayed time zone: Eastern Time (US & Canada)

09:00 - 10:00	Synthesis (VMCAI) at Salon I. Chair(s): Viktor Kunčak (EPFL, Switzerland)

09:00 30m Paper	Gradient-Descent for Randomized Controllers under Partial Observability (In Person, VMCAI). Jip Spel (RWTH Aachen University), Linus Heck (RWTH Aachen University), Sebastian Junges (University of California, Berkeley), Joshua Moerman (Open University of the Netherlands), Joost-Pieter Katoen (RWTH Aachen University)
09:30 30m Paper	Satisfiability and Synthesis Modulo Oracles (Remote, VMCAI). Elizabeth Polgreen (University of Edinburgh), Andrew Reynolds (University of Iowa), Sanjit Seshia (UC Berkeley)