NICHOLAS ROSSETTI

PhD Graduate

PhD program:: XXXVII


supervisor: Alfonso Gerevini
co-supervisor: Ivan Serina

Thesis title: Learning General Policies for Planning through GPT Models

Transformer-based architectures, such as BERT, GPT and T5, have achieved remarkable results across various Natural Language Processing (NLP) tasks. Beyond these linguistic capabilities, these Large Language Models (LLMs) exhibit varying degrees of factual knowledge, common sense reasoning, and even programming capabilities. However, their effectiveness in performing logical inference and automated planning remains an open question. Recent attempts to apply LLMs to classical planning have produced mixed results. In this thesis, we tackle this challenge by introducing \model, a GPT-based model trained from scratch on solved planning instances to learn a general policy for classical planning. By leveraging domain-specific training data and incorporating automated planning knowledge, \model can generate solution plans for unseen problems within the same domain, demonstrating good coverage and performance relative to other deep learning approaches. However, these are no formal guarantees of validity and \model can produce invalid plans that fail to meet all goals or contain actions with unsatisfied preconditions. To mitigate these problems, we propose two approaches. First, we incorporate a validator directly into the generation process, which allows us to prune invalid partial plans on the fly and generate valid solutions. Second, we combine \model with a plan-repair planner, \lpg, which refines invalid or incomplete candidate plans into fully valid solutions. Our empirical evaluations across diverse classical planning domains confirm the efficacy of these strategies. Ultimately, this work demonstrates the potential of integrating learned policies with model-based reasoning.

Research products

11573/1724777 - 2024 - Enhancing GPT-Based Planning Policies by Model-Based Plan Validation
Rossetti, N.; Tummolo, M.; Gerevini, A. E.; Olivato, M.; Putelli, L.; Serina, I. - 04b Atto di convegno in volume
conference: Neural-Symbolic Learning and Reasoning (NeSy 2024) (Barcelona, Spain)
book: Lecture Notes in Computer Science ((LNAI,volume 14980)) - (9783031711695; 9783031711701)

11573/1724772 - 2024 - Learning General Policies for Planning through GPT Models
Rossetti, N.; Tummolo, M.; Gerevini, A. E.; Putelli, L.; Serina, I.; Chiari, M.; Olivato, M. - 04b Atto di convegno in volume
conference: International Conference on Automated Planning and Scheduling, ICAPS, 2024 (Banff, Canada)
book: Proceedings International Conference on Automated Planning and Scheduling - ()

11573/1697118 - 2023 - Recurrent Neural Networks for Daily Estimation of COVID-19 Prognosis with Uncertainty Handling
Rossetti, Nicholas; Gerevini, Alfonso E.; Olivato, Matteo; Putelli, Luca; Chiari, Mattia; Serina, Ivan; Minisci, Davide; Foca, Emanuele - 04b Atto di convegno in volume
conference: 27th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (Athens)
book: Procedia Computer Science, vol 225, 2023 - ()

11573/1671189 - 2022 - Machine Learning Models for Predicting Short-Long Length of Stay of COVID-19 Patients
Olivato, M.; Rossetti, N.; Gerevini, A. E.; Chiari, M.; Putelli, L.; Serina, I. - 04c Atto di convegno in rivista
paper: PROCEDIA COMPUTER SCIENCE (Amsterdam : Elsevier) pp. 1232-1241 - issn: 1877-0509 - wos: (0) - scopus: 2-s2.0-85143348535 (9)
conference: 26th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, KES 2022 (Verona; Italia)

11573/1671193 - 2021 - An Application of Recurrent Neural Networks for Estimating the Prognosis of COVID-19 Patients in Northern Italy
Chiari, M.; Gerevini, A. E.; Olivato, M.; Putelli, L.; Rossetti, N.; Serina, I. - 04b Atto di convegno in volume
conference: 19th International Conference on Artificial Intelligence in Medicine, AIME 2021 (AIME 2021)
book: Artificial Intelligence in Medicine - (978-3-030-77210-9)

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma