From Word2Vec to GPT: How AI Learns Language and How We Evaluate Them


Details
We are excited to welcome back our alumna Viviana Márquez. Viviana is the AI Community Lead at Prolific, where she helps AI teams build better machine learning systems by providing fast, self-serve access to diverse human data across the AI lifecycle. She holds a Master’s in Data Science from the University of San Francisco and has worked as a data scientist at HBO, Dataminr, and Royal Caribbean, among others. As an educator, she has trained more than 4,000 learners worldwide, from Fortune 500 teams to graduate students, transforming complex AI concepts into practical skills.
Fun fact: In 2021, Viviana competed in the Miss Universe Colombia contest as Miss Sucre, placing in the top 13.
Description
How do machines learn the meaning of words, and how do we know if they’ve learned it well? In this talk, we will explore the evolution of natural language processing, beginning with early representations such as one-hot vectors, moving through the breakthrough of Word2Vec, and continuing to the rise of transformers and today’s generative models like GPT. Along the way, we will examine the differences between representation models and generative models, and why evaluation is straightforward for one but a major challenge for the other.
The session will then dive into modern evaluation methods, including code-based checks and LLMs-as-judges, with a special focus on human-in-the-loop evaluation. We will explore how real human judgment is used to measure quality, safety, and usefulness in language AI. Attendees will leave with both a historical perspective and a practical framework for thinking about evaluation in their own projects.
🔗 Register now to join us.
#USFDataScience #SpeakerSeries #NaturalLanguageProcessing #GenerativeAI #MachineLearning #ArtificialIntelligence #AIethics #DataScience

From Word2Vec to GPT: How AI Learns Language and How We Evaluate Them