AYMERIC BARBIN

PhD Graduate

PhD program:: XXXVII


supervisor: Alfonso Gerevini
co-supervisor: Federico Cerutti

Thesis title: Learning Symbolic Planning Models from Images

The construction of symbolic planning models, particularly in the PDDL language, remains a longstanding challenge in Artificial Intelligence due to its reliance on manual domain engineering. This thesis explores how such models can be learned automatically and without supervision, from raw visual input. We focus on learning PDDL-style action models for Classical Planning, with an emphasis on planning effectiveness and semantic interpretability. Our work builds on Latplan, a system that learns symbolic representations from images via deep latent-space encoding. In the first part of the thesis we improve Latplan without architectural changes by enforcing logical invariants through fuzzy logic regularization and tuning hyperparameters. While these methods yield improvements in simpler domains, they do not overcome Latplan’s two key limitations: the lack of action interpretability and poor planning performance due to hallucinated plans. To address this, we propose R-Latplan, an architectural extension introducing deterministic action labeling based on visual differences. This enhances planning performance and aligns learned actions with high-level behaviors, even under noise. However, scalability remains a challenge due to the proliferation of generated actions, which increases planning time. To compress the action space, we investigate various techniques, first we use Decision Trees in order to identify patterns among the effects and preconditions of R-Latplan and translate these patterns into higher level actions, but this approach does not enable efficient action models; then we propose RC-latplan, which clusters low-level actions into semantically coherent groups, each translated into a PDDL action. Experiments show that RC-latplan greatly reduces domain size while preserving interpretability. Although it slightly underperforms R-Latplan, a variant, clustering actions with the same effects achieves logical equivalence with R-Latplan. Finally, the thesis explores a complementary research in which I worked, on Visual Re- ward Machines (VRMs), a framework for learning symbolic automata from visual observations to improve learning in sparse-reward environments.

Research products

11573/1729764 - 2024 - Learning Reliable PDDL Models for Classical Planning from Visual Data
Barbin, Aymeric; Cerutti, Federico; Gerevini, Alfonso Emilio - 04b Atto di convegno in volume
conference: 36th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2024 (Herndon; USA)
book: 2024 IEEE 36th International Conference on Tools with Artificial Intelligence (ICTAI) - (9798331527235)

11573/1684340 - 2023 - Visual reward machines
Umili, Elena; Argenziano, Francesco; Barbin, Aymeric; Capobianco, Roberto - 04b Atto di convegno in volume
conference: 17th International Workshop on Neural-Symbolic Learning and Reasoning (La Certosa di Pontignano (SI); Italy)
book: Proceedings of the 17th International Workshop on Neural-Symbolic Learning and Reasoning La Certosa di Pontignano, Siena, Italy, July 3-5, 2023. - ()

11573/1729767 - 2022 - Addressing the Symbol Grounding Problem with Constraints in Neuro-Symbolic Planning
Barbin, A.; Cerutti, F.; Gerevini, A. E. - 04b Atto di convegno in volume
conference: Italian Workshop on Planning and Scheduling (IPS-2022, 10th edition) (Udine; Italia)
book: Proceedings of the 10th Italian workshop on Planning and Scheduling (IPS 2022), RCRA Incontri E Confronti (RiCeRcA 2022), and the workshop on Strategies, Prediction, Interaction, and Reasoning in Italy (SPIRIT 2022) co-located with 21st International Conference of the Italian Association for Artificial Intelligence (AIxIA 2022) - ()

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma