Scaling Vector Search with PostgreSQL: A Deep Dive into pgvector Optimization
2025-05-27 , Sala 8

Are you building AI applications that need a reliable, scalable vector store? This talk provides a comprehensive guide to configuring PostgreSQL as an optimal vector store for AI applications using the pgvector extension.

We'll begin with a brief introduction to vector stores and PostgreSQL fundamentals before diving into the pgvector extension and its capabilities. The presentation will explore essential vector operations and compare indexing approaches (HNSW vs. IVFFlat), offering practical guidance on hyperparameter tuning for your specific use cases.

We'll also cover advanced optimization techniques including:
- Reduced precision
- Quantization methods
- Various scaling strategies such as partitioning, sharding, and replication to handle growing data volumes

We'll also see a real-world example of a semantic search application using pgvector, demonstrating how to implement and optimize it for production.

Whether you're implementing semantic search, recommendation systems, or other vector-based AI applications, you'll leave with actionable knowledge to configure PostgreSQL as a high-performance, production-ready vector database.

I'm a Software Engineer specialized in AI and Computer Vision, with a strong background in web and AI technologies. Over the past two years, I've focused on building GenAI applications—working with LLMs, RAG, and AI agents.
I'm the co-founder and CTO of three companies:

  • Mònade, a 30-person consultancy delivering custom software and AI Agents
  • Elf Games, a game studio with two internationally recognized titles
  • Elysia, a SaaS platform that helps companies unlock internal knowledge using Generative AI