ALESSIO DEVOTO

Dottore di ricerca

ciclo: XXXVIII



Titolo della tesi: Adaptive and Efficient Neural Architectures: From Adaptive Computation to Interpretable AI Systems

The rapid advancement of Artificial Intelligence, particularly through deep learning and Large Language Models, has yielded unprecedented performance across diverse domains. However, this success has come at the cost of increasingly demanding computational requirements and diminished transparency. Modern AI systems rely heavily on over-parameterized architectures that, while powerful, are inefficient for deployment in resource-constrained environments and opaque in their decision-making processes, raising critical concerns for adoption in high-stakes applications. This thesis addresses the urgent need to reconcile the capabilities of state-of-the-art AI with the practical demands of real-world deployment by developing systems that are simultaneously adaptive, efficient, and interpretable. Through three interconnected research themes, we advance the foundations of AI architectures. First, we establish a comprehensive framework for adaptive computation, introducing mechanisms that enable neural networks to dynamically allocate computational resources based on input complexity and available resources. Our contributions include Adaptive Computation Modules for granular per-token efficiency, adaptive layer selection for accelerated fine-tuning of Vision Transformers, and adaptive semantic token selection strategies for edge intelligence systems. Second, we tackle critical efficiency bottlenecks in Large Language Model deployment, specifically addressing the memory constraints imposed by the Key-Value cache. We propose multiple novel compression strategies—including $L_2$ norm-based approaches, Q-Filters that exploit query-key geometrical relationships, and Expected Attention methods that leverage future query distributions—enabling practical deployment of LLMs with extended context windows. Third, we advance interpretable AI through mechanistic analysis of model internals and domain-specific applications of explainable AI. Our work encompasses steering knowledge selection behaviors in LLMs, analyzing residual streams under knowledge conflicts, and deploying interpretable models in high-stakes domains including high-energy physics and archaeological classification, demonstrating how transparency enables both trust and scientific insight. Collectively, this body of work demonstrates that efficiency and interpretability need not be traded against performance. By developing adaptive architectures, addressing specific deployment bottlenecks, and opening the black box of neural networks, this thesis provides both theoretical frameworks and practical solutions that move AI systems closer to widespread, trustworthy deployment across diverse real-world applications.

Produzione scientifica

11573/1746903 - 2025 - Towards interpretable deep learning in ceramic petrographic fabric classification through a comparative study of convolutional neural networks and vision transformers
Capriotti, S; Devoto, A; Genovese, D; Mignardi, S; Scardapane, S; Medeghini, L - 04d Abstract in atti di convegno
congresso: Congresso congiunto SGI, SIMP 2025 (Padova, Italia)
libro: Atti di convegno Congresso congiunto SGI, SIMP 2025 - ()

11573/1746900 - 2025 - Explainable vision transformers for the petrographic classification of Levantine ceramics
Capriotti, Sara; Devoto, Alessio; Genovese, Donatella; Mignardi, Silvano; Scardapane, Simone; Medeghini, Laura - 04d Abstract in atti di convegno
congresso: EMAC European Meeting on Ancient Ceramics (Bilbao, Spagna)
libro: Atti di convegno EMAC - ()

11573/1746779 - 2025 - EXPLAINABLE DEEP LEARNING FOR PETROGRAPHIC FABRIC CLASSIFICATION OF LEVANTINE POTTERY
Capriotti, Sara; Devoto, Alessio; Scardapane, Simone; Mignardi, Silvano; Medeghini, Laura - 04d Abstract in atti di convegno
congresso: TECHNART 2025 - International conference on analytical techniques for heritage studies and conservation (Perugia, Italia)
libro: Atti di convegno TECHNART 2025 - ()

11573/1746780 - 2025 - Interpretable classification of Levantine ceramic thin sections via neural networks
Capriotti, Sara; Devoto, Alessio; Scardapane, Simone; Mignardi, Silvano; Medeghini, Laura - 01a Articolo in rivista
rivista: MACHINE LEARNING: SCIENCE AND TECHNOLOGY (Bristol: IOP Publishing) pp. - - issn: 2632-2153 - wos: WOS:001520296100001 (1) - scopus: 2-s2.0-105009844893 (1)

11573/1747501 - 2025 - Deep learning for ceramic fabric classification: A focus on Levantine pottery
Capriotti, Sara; Devoto, Alessio; Scardapane, Simone; Mignardi, Silvano; Medeghini, Laura - 04d Abstract in atti di convegno
congresso: 4th International Conference TMM_CH Transdisciplinary Multispectral Modelling and Cooperation for the Preservation of Cultural Heritage (Athens, Greece)
libro: Atti di convegno TMM_CH - ()

11573/1747187 - 2025 - Mixture-of-experts graph transformers for interpretable particle collision detection
Genovese, D.; Sgroi, A.; Devoto, A.; Valentine, S.; Wood, L.; Sebastiani, C.; Scardapane, S.; D'onofrio, M.; Giagu, S. - 01a Articolo in rivista
rivista: SCIENTIFIC REPORTS (London: Springer Nature London: Nature Publishing Group) pp. - - issn: 2045-2322 - wos: WOS:001542459800031 (0) - scopus: 2-s2.0-105012286552 (0)

11573/1750784 - 2024 - Adaptive Semantic Token Selection for AI-native Goal-oriented Communications
Devoto, Alessio; Petruzzi, Simone; Pomponi, Jary; Di Lorenzo, Paolo; Scardapane, Simone - 04b Atto di convegno in volume
congresso: Globecomm 2024 (Cape Town)
libro: Generative Horizons @ Global Communications Conference 2024 - ()

11573/1717183 - 2024 - Conditional computation in neural networks: Principles and research trends
Scardapane, Simone; Baiocchi, Alessandro; Devoto, Alessio; Marsocci, Valerio; Minervini, Pasquale; Pomponi, Jary - 01a Articolo in rivista
rivista: INTELLIGENZA ARTIFICIALE (Associazione Italiana per l'Intelligenza Artificiale) pp. 175-190 - issn: 1724-8035 - wos: WOS:001301163200013 (3) - scopus: 2-s2.0-85200778975 (5)

11573/1693645 - 2023 - On the robustness of vision transformers for in-flight monocular depth estimation
Ercolino, Simone; Devoto, Alessio; Monorchio, Luca; Santini, Matteo; Mazzaro, Silvio; Scardapane, Simone - 01a Articolo in rivista
rivista: Industrial Artificial Intelligence (Springer) pp. - - issn: 2731-667X - wos: (0) - scopus: (0)

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma