LUCA SCOFANO

PhD Graduate

PhD program:: XXXVII


co-supervisor: Fabio Galasso

Thesis title: Decoding Human Dynamics: Explorations in Motion Forecasting, Social Navigation, and Egocentric Perception

Human dynamics—how individuals move, interact, and perceive their environment—pose significant challenges for theoretical understanding and practical implementation in robotics, human-computer interaction, and behavior analysis. Accurate models addressing these challenges are essential for developing intelligent systems capable of effectively collaborating with or understanding humans. This Ph.D. thesis investigates key aspects of human dynamics through Motion Forecasting, Social Navigation, and Egocentric Perception. In Motion Forecasting, we explore both two-body pose prediction and global human motion prediction. We present best practices for improving collaborative motion prediction. We introduce a staged, contact-aware framework for global human motion forecasting that predicts human movements within broader environmental contexts. Our model surpasses existing methods by incorporating contact points and staged motion, enabling more accurate human pose and trajectory predictions. In the context of social dynamics, we investigate the impact of latent variables on forecasting human interactions, especially in team-based settings. Introducing a role-based approach demonstrates that understanding these latent social roles can significantly improve trajectory prediction in multi-agent systems. This concept extends to Social Navigation, where a robot's trajectory planning must account for human movement and be processed in real-time. Human dynamics are incorporated into the robot's reinforcement learning path-planning framework via a social dynamics module. This module distills human trajectories into latent codes, which serve as contextual input for the robot's policy model. We also address challenges in Egocentric Perception and Mistake Detection. By developing a novel method, we tackle the need for real-time online detection of procedural mistakes from egocentric video streams. Our approach, PREGO, introduces an innovative model that recognizes current actions and predicts future ones to identify discrepancies and detect mistakes. We also present an extension of the latter, which offers an in-depth analysis and enhances the framework with an Automatic Chain of Thought mechanism. This addition improves the model’s reasoning capabilities, enabling more nuanced error detection. Additionally, we contribute a framework for estimating social interactions and human meshes using egocentric video, improving pose estimation accuracy by incorporating wearer-interactee interactions. Beyond direct applications to human dynamics, this thesis includes a contribution to Topological Deep Learning. We contributed to a technical paper introducing the first Python framework for Topological Deep Learning, offering new tools for researchers exploring machine learning on non-Euclidean data structures. Overall, this thesis explores human motion forecasting, social interaction modeling, and egocentric perception while advancing methodologies in machine learning. The insights and tools developed contribute to understanding human behavior and pave the way for further research in intelligent systems and interactive environments.

Research products

11573/1728973 - 2024 - PREGO: Online Mistake Detection in PRocedural EGOcentric Videos
Flaborea, A.; D'amely Di Melendugno, G. M.; Plini, L.; Scofano, L.; De Matteis, E.; Furnari, A.; Farinella, G. M.; Galasso, F. - 04b Atto di convegno in volume
conference: IEEE Conference on Computer Vision and Pattern Recognition (Seattle; United States of America)
book: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - (979-8-3503-5300-6; 979-8-3503-5301-3)

11573/1702411 - 2024 - About Latent Roles in Forecasting Players in Team Sports
Scofano, Luca; Sampieri, Alessio; Re, Giuseppe; Almanza, Matteo; Panconesi, Alessandro; Galasso, Fabio - 01a Articolo in rivista
paper: NEURAL PROCESSING LETTERS (Kluwer Academic Publishers:Journals Department, PO Box 322, 3300 AH Dordrecht Netherlands:011 31 78 6576050, EMAIL: frontoffice@wkap.nl, kluweronline@wkap.nl, INTERNET: http://www.kluwerlaw.com, Fax: 011 31 78 6576254) pp. - - issn: 1370-4621 - wos: WOS:001167155000002 (0) - scopus: 2-s2.0-85185659133 (0)

11573/1695410 - 2023 - ICML 2023 topological deep learning challenge. Design and results
Papillon, Mathilde; Hajij, Mustafa; Frantzen, Florian; Hoppe, Josef; Jenne, Helen; Mathe, Johan; Myers, Audun; Papamarkou, Theodore; Schaub, Michael T.; Zamzmi, Ghada; Birdal, Tolga; Dey, Tamal; Doster, Timothy; Emerson, Tegan H.; Gopalakrishnan, Gurusankar; Govil, D.; Grande, Vincent P.; Guzm'an-S'aenz, Aldo; Kvinge, Henry; Livesay, Neal; Meisner, Jan; Mukherjee, Soham; Samaga, Shreyas N.; Natesan Ramamurthy, Karthikeyan; Reddy Karri, Maneel; Rosen, Paul; Sanborn, Sophia; Scholkemper, Michael; Walters, Robin; Agerberg, Jens; Bokman, Georg; Barikbin, Sadrodin; Battiloro, Claudio; Bazhenov, Gleb; Bern('A)Rdez, Guillermo; Brent, Aiden; Escalera, Sergio; Fiorellino, Simone; Gavrilev, Dmitrii; Hassanin, Mohammed; Hausner, Paul; Hoff Gardaa, Odin; Khamis, Abdelwahed; Lecha, M; Magai, German; Malygina, Tatiana; Melnyk, Pavlo; Ballester, Rub('E)N; Varma Nadimpalli, Kalyan; Nikitin, Alexander; Rabinowitz, Abraham; Salatiello, Alessandro; Scardapane, Simone; Scofano, Luca; Singh, Suraj; Sjolund, Jens; Snopov, Paul; Spinelli, Indro; Telyatnikov, Lev; Testa, Lucia; Yang, Maosheng; Yue, Yixiao; Zaghen, Olga; Zia, Ali; Miolane, Nina - 04b Atto di convegno in volume
conference: International Conference on Machine Learning (Honolulu; Hawaii)
book: Proceedings of Machine Learning Research - ()

11573/1686540 - 2023 - Best Practices for 2-Body Pose Forecasting
Rahman, Muhammad Rameez Ur; Scofano, Luca; De Matteis, Edoardo; Flaborea, Alessandro; Sampieri, Alessio; Galasso, Fabio - 04b Atto di convegno in volume
conference: IEEE Conference on Computer Vision and Pattern Recognition (Vancouver, Canada)
book: IEEE Conference on Computer Vision and Pattern Recognition Workshops - ()

11573/1695402 - 2023 - Staged Contact-Aware Global Human Motion Forecasting
Scofano, Luca; Sampieri, Alessio; Schiele, Elisabeth; De Matteis, Edoardo; Leal-Taixé, Laura; Galasso, Fabio - 04b Atto di convegno in volume
conference: British Machine Vision Conference (Aberdeen; United Kingdom)
book: British Machine Vision Conference - ()

11573/1664528 - 2022 - Mesoscale precipitation nowcasting from weather radar data using space-time-separable graph convolutional networks
Trappolini, Daniele; Scofano, Luca; Sampieri, Alessio; Messina, Francesco; Galasso, Fabio; Di Fabio, Saverio; Silvio Marzano, Frank - 04d Abstract in atti di convegno
conference: European Geoscience Union General Assembly (Vienna)
book: EGU General Assembly Conference Abstracts - ()

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma