CESARE CAMPAGNANO

PhD Graduate

PhD program: XXXVI


supervisor: Prof. Gabriele Tolomei
co-supervisor: Prof. Fabrizio Silvestri

Thesis title: Foundational Advancements of Large Language Models: Current and Future Implications

The advent of Large Language Models (LLMs) represents a pivotal revolution in the field of Artificial Intelligence. These models have paved the way for a new era in which machines can understand and generate language on par with humans in some tasks, showcasing remarkable proficiency across a wide spectrum of linguistic nuances. However, especially for open models, the focus has been predominantly on the English language. This thesis delves into this transformative landscape, exploring the current state and potential future trajectories of LLMs. Specifically, this work presents a series of contributions: developing and implementing a new foundational model, assessing models' performance on multilingual downstream tasks, and discussing the revolutionary prospects of integrating AI models into Human-Computer Interaction. The first contribution, DanteLLM, highlights the disparity in language model resources and attempts to bridge this gap by introducing an Italian-centric LLM, setting a new standard for language-specific model development. The second work, XL-WA, introduces a novel benchmark for word alignment, facilitating progress in cross-lingual understanding and translation. Furthermore, SRL4E advances the field of structured emotion classification by proposing a standardized formulation and framework for emotion-based semantic role labeling, with a unified emotion taxonomy. Finally, Prompt-to-OS envisions a future where operating systems and user interfaces are fundamentally redefined through the integration of generative AI, emphasizing the transformative potential of LLMs beyond traditional applications. By providing foundational tools for non-English LLMs, this dissertation not only showcases the technical advancements and applications of multilingual LLMs but also aims to lay the initial groundwork for bridging the linguistic gap in Natural Language Processing.
We critically examine the practical, societal and ethical implications of these technologies, hoping to pave the way for a more inclusive, democratized, and ethically aware AI future.

Research products

11573/1716988 - 2024 - DanteLLM: Let’s Push Italian LLM Research Forward!
Bacciu, Andrea; Campagnano, Cesare; Trappolini, Giovanni; Silvestri, Fabrizio - 04b Atto di convegno in volume
conference: LREC-COLING (Turin; Italy)
book: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) - (9782493814104)

11573/1716155 - 2024 - The Power of Noise: Redefining Retrieval for RAG Systems
Cuconasu, Florin; Trappolini, Giovanni; Siciliano, Federico; Filice, Simone; Campagnano, Cesare; Maarek, Yoelle; Tonellotto, Nicola; Silvestri, Fabrizio - 04b Atto di convegno in volume
conference: ACM International Conference on Research and Development in Information Retrieval (Washington D.C.; USA)
book: SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval - (9798400704314)

11573/1723801 - 2024 - Rethinking Relevance: How Noise and Distractors Impact Retrieval-Augmented Generation
Cuconasu, Florin; Trappolini, Giovanni; Siciliano, Federico; Filice, Simone; Campagnano, Cesare; Maarek, Yoelle; Tonellotto, Nicola; Silvestri, Fabrizio - 04b Atto di convegno in volume
conference: Italian Information Retrieval Workshop 2024 (Udine; Italy)
book: Proceedings of the 14th Italian Information Retrieval Workshop (IIR 2024)

11573/1670826 - 2023 - CycleDRUMS: automatic drum arrangement for bass lines using CycleGAN
Barnabò, Giorgio; Trappolini, Giovanni; Lastilla, Lorenzo; Campagnano, Cesare; Fan, Angela; Petroni, Fabio; Silvestri, Fabrizio - 01a Articolo in rivista
paper: DISCOVER ARTIFICIAL INTELLIGENCE (Cham: Springer International Publishing) - issn: 2731-0809 - wos: (0) - scopus: 2-s2.0-85164904687 (4)

11573/1694184 - 2023 - XL-WA: a Gold Evaluation Benchmark for Word Alignment in 14 Language Pairs
Martelli, Federico; Bejgu, Andrei Stefan; Campagnano, Cesare; Čibej, Jaka; Costa, Rute; Gantar, Apolonija; Kallas, Jelena; Koeva, Svetla; Koppel, Kristina; Krek, Simon; Langemets, Margit; Lipp, Veronika; Nimb, Sanni; Olsen, Sussi; Sandford Pedersen, Bolette; Quochi, Valeria; Salgado, Ana; Simon, László; Tiberius, Carole; Ureña-Ruiz, Rafael-J; Navigli, Roberto - 04b Atto di convegno in volume
conference: Ninth Italian Conference on Computational Linguistics (Venice; Italy)
book: Proceedings of the Ninth Italian Conference on Computational Linguistics

11573/1672374 - 2023 - Universal Semantic Annotator
Navigli, R.; Orlando, R.; Campagnano, C.; Conia, S. - 02a Capitolo o Articolo
book: European Language Grid - (978-3-031-17257-1; 978-3-031-17258-8)

11573/1705588 - 2023 - Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models
Tolomei, Gabriele; Campagnano, Cesare; Silvestri, Fabrizio; Trappolini, Giovanni - 04b Atto di convegno in volume
conference: 2023 IEEE 5th International Conference on Cognitive Machine Intelligence (CogMI) (Atlanta; USA)
book: Proceedings of the 2023 IEEE 5th International Conference on Cognitive Machine Intelligence (CogMI) - (979-8-3503-2383-2; 979-8-3503-2384-9)

11573/1654027 - 2022 - SRL4E - Semantic Role Labeling for Emotions: A Unified Evaluation Framework
Campagnano, Cesare; Conia, Simone; Navigli, Roberto - 04b Atto di convegno in volume
conference: Association for Computational Linguistics (Dublin; Ireland)
book: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics - (9781955917216)

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma