Skip to content

Research Paper review (OpenAI Whisper)

Photo of Imran H
Hosted By
Imran H.
Research Paper review (OpenAI Whisper)

Details

Join us to review OpenAI's groundbreaking paper that revolutionised automatic speech recognition and underpins the Whisper speech-to-text model:

  • Robust Speech Recognition via Large-Scale Weak Supervision

We'll aim to dissect the methodology, architecture, and implications of training on multilingual audio data.

Key Points:

  • Weak supervision approach - How Whisper learned from imperfect, web-scraped data
  • Zero-shot transfer capabilities - Why it works across languages without fine-tuning
  • Architecture deep dive - Transformer encoder-decoder design choices
  • Scaling insights - What 680k hours of data teaches us about speech AI
  • Real-world performance - Robustness to accents, noise, and domain shifts

Please note that the session is not recorded.

Key links:

Discord joining instructions: https://bit.ly/llm-discord

Photo of LLM Reading Club (NYC) group
LLM Reading Club (NYC)
See more events
Online event
Link visible for attendees
FREE