Improving performance of NLP Encoder Models

Hosted By
Goran S. M.

Details
Improving performance of NLP Encoder Models
Vladimir Ageev
ML Engineer with focus on Natural Language Processing
www.linkedin.com/in/vladimir-ageev-ds
This talk will explore techniques for accelerating the inference of NLP models. It might be interesting to specialists working on retrieval-related tasks, such as text search, recommendations, or Retrieval-Augmented Generation (RAG), who are looking to optimize inference speed on GPUs or CPUs.

Data Science Club
See more events
Startit Centar Beograd
Savska 5 · Beograd
Improving performance of NLP Encoder Models