IRENE CANNISTRACI

PhD Graduate

PhD program:: XXXVI


supervisor: Prof. Danilo Avola
co-supervisor: Prof. Emanuele Rodolà

Thesis title: Improving Neural Networks Efficiency via Representation Similarities

As large-scale Neural Networks (NNs) continue to push the boundaries of performance in different fields ranging from drug discovery to climate science, their computational demands have become a major bottleneck. These models require extensive resources, limiting their accessibility and raising concerns about sustainability. Additionally, the reusability of these models is constrained by the need for costly retraining or fine-tuning when adapting them to new tasks or data. This dissertation presents novel approaches to address these challenges by exploiting similarities between and within NNs, thereby reducing computational and data requirements without compromising performance. The core contribution of this research lies in leveraging latent space representations of NNs to enable model reuse, and reduce computational complexity. First, we introduce a framework for combining latent spaces from different models, facilitating the unification of these neural representations in a meaningful way, allowing for the reuse of existing neural components without the need for further training. Then we exploit the aggregation of latent spaces, that may partially overlap or be entirely disjoint, to unify them in an efficient and meaningful way. Additionally, we develop an optimization method to align neural representations across diverse domains, addressing the limitations of existing methods that often depend on large sets of parallel samples to unify different latent spaces, which is an impractical requirement in many real-world scenarios. Finally, we investigate intra-network similarities to simplify large pretrained models. By identifying redundant computational blocks within individual NNs and approximating them using simpler transformations, our approach reduces the number of parameters and speeds up inference while maintaining the model’s integrity. Our findings demonstrate that leveraging similarities in latent spaces can simplify large-scale models through representation alignment and approximation, making them more efficient, accessible, and sustainable while maintaining their effectiveness. These methods are applicable across various architectures, such as transformers and convolutional networks, and support a wide range of tasks, as well as different data modalities. By enabling the reuse and simplification of NNs, this research contributes to the democratization of Machine Learning technologies and the development of more sustainable and efficient models.

Research products

11573/1713406 - 2024 - MV-MS-FETE: Multi-view multi-scale feature extractor and transformer encoder for stenosis recognition in echocardiograms
Avola, D.; Cannistraci, I.; Cascio, M.; Cinque, L.; Fagioli, A.; Foresti, G. L.; Rodola, E.; Solito, L. - 01a Articolo in rivista
paper: COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE (Elsevier Science Ireland Limited:PO Box 85, Limerick Ireland:011 353 61 709600, 011 353 61 61944, EMAIL: usinfo-f@elsevier.com, INTERNET: http://www.elsevier.com, Fax: 011 353 61 709114) pp. 1-8 - issn: 0169-2607 - wos: WOS:001179046700001 (0) - scopus: 2-s2.0-85185003240 (2)

11573/1712069 - 2024 - From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication
Cannistraci, Irene; Moschella, Luca; Fumero, Marco; Maiorca, Valentino; Rodolà, Emanuele - 04b Atto di convegno in volume
conference: The Twelfth International Conference on Learning Representations (Vienna, Austria)
book: International Conference on Learning Representations - ()

11573/1704983 - 2024 - LOB-Based Deep Learning Models for Stock Price Trend Prediction: A Benchmark Study
Prata, Matteo; Masi, Giuseppe; Berti, Leonardo; Arrigoni, Viviana; Coletta, Andrea; Cannistraci, Irene; Vyetrenko, Svitlana; Velardi, Paola; Bartolini, Novella - 01a Articolo in rivista
paper: ARTIFICIAL INTELLIGENCE REVIEW (-DORDRECHT, NETHERLANDS: SPRINGER VERLAG -Oxford; Exeter: Blackwell Scientific Publications Intellect Limited. -Dordrecht Netherlands: Kluwer Academic Publishers) pp. - - issn: 0269-2821 - wos: WOS:001201489200002 (0) - scopus: 2-s2.0-85190360528 (2)

11573/1696639 - 2023 - Real-time GAN-based model for underwater image enhancement
Avola, D.; Cannistraci, I.; Cascio, M.; Cinque, L.; Diko, A.; Distante, D.; Foresti, G. L.; Mecca, A.; Scagnetto, I. - 04b Atto di convegno in volume
conference: Proceedings of the 22nd International Conference on Image Analysis and Processing, ICIAP 2023 (Udine)
book: Image Analysis and Processing – ICIAP 2023 - (978-3-031-43147-0; 978-3-031-43148-7)

11573/1713815 - 2023 - Bootstrapping Parallel Anchors for Relative Representations
Cannistraci, Irene; Moschella, Luca; Maiorca, Valentino; Fumero, Marco; Norelli, Antonio; Rodolà, Emanuele - 04b Atto di convegno in volume
conference: Tiny Papers Track at ICLR (Kigali, Rwanda)
book: The First Tiny Papers Track at ICLR 2023 - ()

11573/1655215 - 2022 - A novel GAN-based anomaly detection and localization method for aerial video surveillance at low altitude
Avola, D.; Cannistraci, I.; Cascio, M.; Cinque, L.; Diko, A.; Fagioli, A.; Foresti, G. L.; Lanzino, R.; Mancini, M.; Mecca, A.; Pannone, D. - 01a Articolo in rivista
paper: REMOTE SENSING (Basel : Molecular Diversity Preservation International) pp. 1-18 - issn: 2072-4292 - wos: WOS:000845372100001 (13) - scopus: 2-s2.0-85137772162 (25)

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma