SANDEEP REDDY SABBELLA

PhD Graduate

PhD program:: XXXVII


supervisor: Daniele Nardi
advisor: Daniele Nardi

Thesis title: Multimodal Communication for Enhancing Human-Robot Interaction: Virtual Simulations to Real Robots

Human-robot interaction (HRI) is a rapidly evolving domain focused on the interaction between humans and robots, exploring robotic systems' design, functionality, and social implications in various environments. Virtual Reality (VR) has emerged as a valuable tool for evaluating HRI solutions before real-world deployment, ensuring safety and scalability. The main objective of this thesis is to present a multimodal interaction framework integrating speech and gesture recognition to enhance collaboration between humans and robots in precision agriculture, particularly in table-grape vineyards under the CANOPIES project, as well as in broader contexts of indoor and outdoor logistics. In collaborative robotics, where human-robot collaboration (HRC) is essential, multimodal communication between humans and robots is crucial during each interaction. To address this challenge, building on a categorization of the information content and speech act classification in the context of HRI in shared environments, Speech and Gesture recognition pipelines were designed and integrated into HRI architecture for cobots. Leveraging virtual reality (VR) as a testbed, the work generates synthetic datasets to train robust gesture and speech recognition models, overcoming the scarcity of real-world data in agricultural contexts. The framework is empirically validated through VR-based user studies and field experiments, demonstrating improved communication reliability in noisy vineyard environments and reduced task completion times. Notably, the system emphasizes modularity, allowing interchangeable components (e.g., pose estimators and speech classifiers) to adapt to dynamic tasks. Key contributions include (i) a standardized gesture taxonomy tailored to agricultural workflows, (ii) open-source datasets produced from both real and synthetic sources, (iii) a synthetic data generation pipeline for pose estimation, and (iv) a multimodal communication architecture augmented by large language models (LLMs) for contextual reasoning using limited computational capacity in agricultural logistics. By bridging virtual simulations and real-world deployment, this research advances human-robot collaboration in precision agriculture, offering interactive solutions for harvesting, pruning, and logistics tasks. The findings underscore the potential of multimodal HRI and immersive technologies to address collaboration between human expertise and robots and enhance safety and efficiency across both indoor and outdoor collaborative environments. Keywords: Human-Robot Interaction (HRI), Human-Robot Collaboration (HRC), Virtual Reality (VR), Synthetic Data Generation, Multimodal Communication, Gesture Recognition, User Evaluation, large language models (LLMs), Precision Agriculture, Collaborative Robotics.

Research products

11573/1699078 - 2024 - Empowering Collaboration: A Pipeline for Human-Robot Spoken Interaction in Collaborative Scenarios
Kaszuba, Sara; Caposiena, Julien; Sabbella, Sandeep Reddy; Leotta, Francesco; Nardi, Daniele - 04b Atto di convegno in volume
conference: 15th International Conference on Social Robotics, ICSR 2023 (Doha; Qatar)
book: Social Robotics - (978-981-99-8718-4)

11573/1699069 - 2024 - Gesture Recognition for Human-Robot Interaction Through Virtual Characters
Sabbella, S. R.; Kaszuba, S.; Leotta, F.; Nardi, D. - 04b Atto di convegno in volume
conference: 15th International Conference on Social Robotics, ICSR 2023 (Doha; Qatar)
book: Social Robotics - (978-981-99-8717-7; 978-981-99-8718-4)

11573/1727061 - 2024 - Generating and Evaluating Synthetic Data in Virtual Reality Simulation Environments for Pose Estimation
Sabbella, Sandeep Reddy; Serrarens, Pascal; Leotta, Francesco; Nardi, Daniele - 04b Atto di convegno in volume
conference: IEEE International Workshop on Robot and Human Communication (ROMAN) 2024 (Pasadena, California, USA)
book: 2024 33rd IEEE International Conference on Robot and Human Interactive Communication (ROMAN) - (9798350375022)

11573/1699075 - 2023 - Speech Act Classification in Collaborative Robotics
Kaszuba, Sara; Sabbella, Sandeep Reddy; Leotta, Francesco; Nardi, Daniele - 04b Atto di convegno in volume
conference: 32nd IEEE International Conference on Robot and Human Interactive Communication, IEEE RO-MAN 2023 (Busan; South Korea)
book: 32nd IEEE International Conference on Robot and Human InteractiveCommunication, RO-MAN 2023, Busan, Republic of Korea, August 28-31,2023 - (979-8-3503-3670-2)

11573/1699068 - 2023 - Virtual Reality Applications for Enhancing Human-Robot Interaction: A Gesture Recognition Perspective
Sabbella, Sandeep Reddy; Kaszuba, Sara; Leotta, Francesco; Nardi, Daniele - 04b Atto di convegno in volume
conference: Intelligent Virtual Agents (Würzburg; Germany)
book: IVA '23: Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents - (978-1-4503-9994-4)

11573/1578596 - 2021 - RoSmEEry: Robotic Simulated Environment for Evaluation and Benchmarking of Semantic Mapping Algorithms
Kaszuba, Sara; Sabbella, Sandeep Reddy; Suriani, Vincenzo; Riccio, Francesco; Nardi, Daniele - 04b Atto di convegno in volume
conference: Autonomous Robots and Multirobot Systems (ARMS) (London, UK)
book: ARMS2021 - ()

11573/1619629 - 2021 - S-AvE: Semantic Active Vision Exploration and Mapping of Indoor Environments for Mobile Robots
Suriani, Vincenzo; Kaszuba, Sara; Sabbella, Sandeep R.; Riccio, Francesco; Nardi, Daniele - 04b Atto di convegno in volume
conference: 10th European Conference on Mobile Robots, ECMR 2021 (Virtual, Bonn, Germany)
book: 2021 European Conference on Mobile Robots (ECMR) - (978-1-6654-1213-1)

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma