Emanuele Fabbiani

Emanuele is an engineer, researcher, and entrepreneur with a passion for artificial intelligence.

He earned his PhD by exploring time series forecasting in the energy sector and spent time as a guest researcher at EPFL in Lausanne. Today, he is co-founder and Head of AI at xtream, a boutique company that applies cutting-edge technology to solve complex business challenges.

Emanuele is also a contract professor in AI at the Catholic University of Milan. He has published eight papers in international journals and contributed to over 30 international conferences worldwide. His engagements include AMLD Lausanne, ODSC London, WeAreDevelopers Berlin, PyData Berlin, PyData Paris, PyCon Florence, the Swiss Python Summit in Zurich, and Codemotion Milan.

Emanuele has been a guest lecturer at Italian, Swiss, and Polish universities.


Your job title

Head of AI

Your company

xtream


Session

05-27
16:30
40min
Beyond Basic RAG: HyDE, Visual Embeddings, and Other Tricks
Emanuele Fabbiani

The foundational RAG architecture we all know - built on embeddings, vector databases, retrieval algorithms, and generative models - served us well in 2023. Now it's time to explore the powerful enhancements that have emerged to take RAG to the next level.

We'll explore HyDE, an approach that leverages an LLM to generate hypothetical documents, creating a stronger bridge between user queries and relevant content. The result is a marked improvement in retrieval accuracy without requiring additional models.

Visual content handling gets a major upgrade through advanced embedding techniques, particularly with the introduction of ColPali. This retrieval model works directly with document images, using vision-language models to create rich, contextual embeddings from document pages. It's particularly powerful when dealing with PDFs, scanned documents, and other visually structured content that traditional RAG systems struggle with.

For those focused on efficiency, we'll examine SPLADE, a neural retrieval model that creates sparse representations of both documents and queries. By combining the speed advantages of traditional inverted indexes with the accuracy of modern neural approaches, SPLADE offers a compelling solution for production environments.

The discussion isn't just theoretical - we'll ground everything in practical implementation. Through code snippets, engineering best practices, and live demonstrations, you'll see these enhancements in action and understand how to integrate them into your own systems.

Whether you're building your first RAG system or looking to enhance an existing one, this session will equip you with actionable insights and practical solutions to make your retrieval systems more robust and capable of handling real-world challenges.

Sala 7