ALESSANDRO FLABOREA

PhD graduate

cycle: XXXVI


co-supervisor: Prof. Fabio Galasso

Thesis title: Anomaly Detection Across Different Domains: the role of Generative Models, Self-Supervised Learning, Hyperbolic Neural Networks and Large Language Models

Anomaly detection plays a crucial role in assisting humans across various domains, from healthcare to security. In healthcare, it can identify both observable anomalies, such as heart attacks, and more nuanced ones, such as signs of social isolation or depression. In surveillance, it can detect incidents such as fights or crimes. Procedural mistake detection helps ensure that tasks and procedures are executed safely, preventing the harm that incorrect execution could cause. Recognizing anomalies involves identifying rare and unexpected events or behaviors that deviate from normal patterns. The task is challenging because anomalies are subjective, context-dependent, and open-set, meaning that novel types of anomalies may emerge.

This thesis presents novel methods for detecting anomalies in various scenarios, building on advances in graph neural networks, hyperbolic geometry, and generative models. The proposed approaches address limitations of existing methods, namely uncertainty estimation, the handling of multimodal human actions, and the scarcity of online and egocentric procedural mistake detection methods.

First, HypAD is introduced, a novel method that uses hyperbolic neural networks to estimate uncertainty and detect anomalies in univariate and multivariate time series. HypAD outperforms the state of the art for univariate anomaly detection on established benchmarks based on data from NASA, Yahoo, Numenta, Amazon, and Twitter. HypAD is also tested on a multivariate dataset of anomalous activities in elderly home residences, where it detects anomalies in the daily routine of patients and provides explainable features.

Next, two novel methods for detecting human-related anomalies in videos are presented: COSKAD and MoCoDAD. COSKAD uses graph convolutional networks and three different latent spaces (Euclidean, hyperbolic, and spherical) to model human poses and detect anomalies. All variants of COSKAD surpass the state of the art on the UBnormal dataset, for which we contribute a human-related version with annotated skeletons. MoCoDAD is a novel generative model for video anomaly detection that assumes both normality and abnormality are multimodal. It leverages a diffusion probabilistic model to generate an array of possible future human poses and detect anomalies in human activities from videos. It is validated on four established benchmarks, surpassing state-of-the-art results.

Then, PREGO, the first online one-class classification model for mistake detection in PRocedural EGOcentric videos, is introduced. PREGO uses an online action recognition component to model the current action and a symbolic reasoning module to predict the following actions. A mistake is detected when the predictions of the two modules do not match. PREGO is evaluated on two procedural egocentric video datasets, which we rearrange for online benchmarking of procedural mistake detection.

Finally, collaborative human pose forecasting, which involves predicting the future poses of multiple individuals interacting with each other, is investigated. This work aims to improve anomaly detection systems by supplying additional features and a better understanding of abnormal behavior in human interactions and activities.

In summary, this thesis advances the state of the art in anomaly detection by proposing novel methods and techniques that can assist people in complex environments. We demonstrate the effectiveness and efficiency of our methods on various data modalities and scenarios and provide new datasets and benchmarks for future research. We hope our contributions inspire further research and lead to more robust and reliable anomaly detection systems.
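The mismatch rule described for PREGO in the abstract can be illustrated with a minimal Python sketch. This is not the thesis implementation: the function names `detect_mistakes` and `make_lookup_predictor`, the action labels, and the lookup-table predictor are all hypothetical stand-ins. The sketch assumes the recognition component already outputs one action label per step, and it replaces the symbolic reasoning module with a table of transitions read from correct procedures only, which mirrors the one-class setting.

```python
from typing import Callable, Dict, Iterable, Iterator, List, Sequence, Tuple


def make_lookup_predictor(
    correct_procedures: Sequence[Sequence[str]],
) -> Callable[[List[str]], str]:
    """Build a toy next-action predictor from correct procedures only.

    Hypothetical stand-in for the symbolic reasoning module: it remembers,
    for each action, the first action seen right after it.
    """
    transitions: Dict[str, str] = {}
    for procedure in correct_procedures:
        for current, following in zip(procedure, procedure[1:]):
            transitions.setdefault(current, following)

    def predict_next(history: List[str]) -> str:
        # Unknown contexts map to a placeholder label (an assumption).
        return transitions.get(history[-1], "<unknown>")

    return predict_next


def detect_mistakes(
    recognized_actions: Iterable[str],
    predict_next: Callable[[List[str]], str],
) -> Iterator[Tuple[int, str, str]]:
    """Yield (step, expected, observed) whenever the two modules disagree.

    Actions are consumed one at a time, so the check runs online.
    """
    history: List[str] = []
    for step, observed in enumerate(recognized_actions):
        if history:  # no prediction is possible before the first action
            expected = predict_next(history)
            if expected != observed:
                yield step, expected, observed
        history.append(observed)


predictor = make_lookup_predictor([["take bowl", "crack egg", "whisk", "pour"]])
for step, expected, observed in detect_mistakes(
    ["take bowl", "whisk", "pour"], predictor
):
    print(f"step {step}: expected {expected!r}, observed {observed!r}")
```

In this toy run the "crack egg" step is skipped, so the predictor and the observed action disagree at step 1 and a mistake is flagged there; the remaining steps match.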

Scientific publications

11573/1699647 - 2023 - Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection
Flaborea, A.; Collorone, L.; D'Amely Di Melendugno, G. M.; D'Arrigo, S.; Prenkaj, B.; Galasso, F. - 04b Conference paper in proceedings volume
conference: IEEE International Conference on Computer Vision (Paris, France)
book: 2023 IEEE/CVF International Conference on Computer Vision (ICCV)

11573/1692564 - 2023 - Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection
Flaborea, Alessandro; Collorone, Luca; D'Amely Di Melendugno, Guido Maria; D'Arrigo, Stefano; Prenkaj, Bardh; Galasso, Fabio - 04b Conference paper in proceedings volume
conference: IEEE/CVF International Conference on Computer Vision 2023 (Paris)
book: Proceedings of the IEEE/CVF International Conference on Computer Vision

11573/1692699 - 2023 - Are we certain it’s anomalous?
Flaborea, Alessandro; Prenkaj, Bardh; Munjal, Bharti; Sterpa, Marco Aurelio; Aragona, Dario; Podo, Luca; Galasso, Fabio - 04b Conference paper in proceedings volume
conference: IEEE Conference on Computer Vision and Pattern Recognition (Vancouver, Canada)
book: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops - (979-8-3503-0249-3)

11573/1660534 - 2023 - A self-supervised algorithm to detect signs of social isolation in the elderly from daily activity sequences
Prenkaj, Bardh; Aragona, Dario; Flaborea, Alessandro; Galasso, Fabio; Gravina, Saverio; Podo, Luca; Reda, Emilia; Velardi, Paola - 01a Journal article
journal: ARTIFICIAL INTELLIGENCE IN MEDICINE (Amsterdam; Tecklenburg: Burgverlag; Elsevier Science Publishers) pp. 1-13 - issn: 0933-3657 - wos: WOS:000892449400004 (2) - scopus: 2-s2.0-85145439911 (5)

11573/1686540 - 2023 - Best Practices for 2-Body Pose Forecasting
Rahman, Muhammad Rameez Ur; Scofano, Luca; De Matteis, Edoardo; Flaborea, Alessandro; Sampieri, Alessio; Galasso, Fabio - 04b Conference paper in proceedings volume
conference: IEEE Conference on Computer Vision and Pattern Recognition (Vancouver, Canada)
book: IEEE Conference on Computer Vision and Pattern Recognition Workshops

11573/1656390 - 2022 - Query-guided networks for few-shot fine-grained classification and person search
Munjal, Bharti; Flaborea, Alessandro; Amin, Sikandar; Tombari, Federico; Galasso, Fabio - 01a Journal article
journal: PATTERN RECOGNITION (Elsevier) pp. 109049 - issn: 0031-3203 - wos: WOS:000870987900009 (3) - scopus: 2-s2.0-85138467026 (14)

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma