Abstract
We propose a novel framework to controller design in environments with a two-level structure: a known high-level graph (“map”) in which each vertex is populated by a Markov decision process, called a “room”. The framework “separates concerns” by using different design techniques for low- and high-level tasks. We apply reactive synthesis for high-level tasks: given a specification as a logical formula over the high-level graph and a collection of low-level policies obtained together with “concise” latent structures, we construct a “planner” that selects which low-level policy to apply in each room. We develop a reinforcement learning procedure to train low-level policies on latent structures, which unlike previous approaches, circumvents a model distillation step. We pair the policy with probably approximately correct guarantees on its performance and on the abstraction quality, and lift these guarantees to the high-level task. These formal guarantees are the main advantage of the framework. Other advantages include scalability (rooms are large and their dynamics are unknown) and reusability of low-level policies. We demonstrate feasibility in challenging case studies where an agent navigates environments with moving obstacles and visual inputs.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025 |
| Editors | Yevgeniy Vorobeychik, Sanmay Das, Ann Nowe |
| Publisher | International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS) |
| Pages | 574-583 |
| Number of pages | 10 |
| ISBN (Electronic) | 9798400714269 |
| State | Published - 2025 |
| Event | 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025 - Detroit, United States Duration: 19 May 2025 → 23 May 2025 |
Publication series
| Name | Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS |
|---|---|
| ISSN (Print) | 1548-8403 |
| ISSN (Electronic) | 1558-2914 |
Conference
| Conference | 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025 |
|---|---|
| Country/Territory | United States |
| City | Detroit |
| Period | 19/05/25 → 23/05/25 |
Bibliographical note
Publisher Copyright:© 2025 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org).
Keywords
- Controller Synthesis
- Model Checking
- Planning and Reasoning under Uncertainty
- Reinforcement Learning
- Representation Learning
ASJC Scopus subject areas
- Software
- Control and Systems Engineering
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'Composing Reinforcement Learning Policies, with Formal Guarantees AAAI Track'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver