Explainable AI (XAI) and Large Language Models (LLM): an impossible pairing?
2024-12-11 , Auditorium

In this talk, we will explore the field of Explainable AI (XAI), with a particular focus on techniques used to gain insights into Large Language Models (LLMs). We will show how XAI can be applied to LLMs, specifically analyzing attention maps and the Transformer architecture. Additionally, we will walk through the process of building an LLM from scratch, using visualization tools to delve into its inner workings. The talk will address key questions surrounding the explainability of LLMs, such as: How can we interpret these vast models? Do LLMs primarily memorize or generalize? What are the emergent capabilities of LLMs, and how do they arise?

Enrico Zimuel, Tech Lead and Principal Software Engineer at Elastic. Adjunct professor of Machine Learning at University of Turin and Applied Statistics at University of Roma Tre. TEDx and international speaker in 130+ conferences. Programmer since 1996 and open source contributors of many projects. MSc in Data Analytics and BSc in Economics and Computer Science.

I'm a PhD in theoretical physics, currently working as the CTO of Cyber Guru with active projects and collaborations in Machine Learning.
I'm the co-author of the book for Springer "Explainable AI with Python" published by Springer