Skip to content
PyData Warsaw #26

Details

18:00 - **Piotr Migdał, https://p.migdal.pl/** - "Don't use cosine similarity carelessly"
About Topic: How can we find relevant documents using LLMs? Many of us use cosine similarity of embedding without giving it a second thought. Yet, it is a duct tape of AI—a readily accessible tool but not the most robust. I will explore how to improve it for practical applications.
I will show common pitfalls of cosine similarity, from matching
questions to questions rather than answers, to getting distracted by
superficial patterns. We'll examine when cosine similarity works, when
it fails, and practical alternatives like task-specific embeddings and
prompt engineering. Whether you're building a recommendation system or a search engine, you'll learn to be more intentional about similarity metrics and get better results.
About Speaker: Piotr Migdał is an AI consultant focusing on combining deep technology with user-facing applications. He has PhD in quantum physics (ICFO) and was previously co-founder and CTO of Quantum Flytrap. He has extensive experience in deep learning, data visualization, and making complex concepts accessible. His technical blog posts regularly reach the front page of Hacker News, and his open-source projects include livelossplot, a popular Python library for visualizing deep learning model training. He created Data Science PL, Poland's largest data science community.
18:45 - Piotr Wachulec, CloudHackers - "From Documents to Insights: Analysis at Scale with Document Intelligence, AI Search, and OpenAI"
About Topic: Is your organization drowning in documents? Every day, companies waste countless hours manually processing documents while valuable business insights remain locked inside. This session demonstrates how to build an intelligent document processing system that transforms PDFs into actionable insights using Azure's AI capabilities.

After party in Pizza przy Politechnice :)

Venue:
Centrum Innowacji Politechniki Warszawskiej, ul. Rektorska 4
Room 3.12 (3rd Floor)

Photo of PyData Warsaw group
PyData Warsaw
See more events