Open AI Whisper - From Paper to Code
Details
Following on from the previous week's unpacking of the Whisper paper (Robust Speech Recognition via Large-Scale Weak Supervision), in this session we switch gears and dive into the code.
🔧 What we’ll look at together:
- Preprocessing of audio data into the token equivalent representations fed into the encoder.,
- Cross-encoder mechanism.
- Learned Positional Encoding,
- Multitask training format specification
Please note that the session is not recorded.
Key links:
Discord joining instructions: https://bit.ly/llm-discord
AI Algorithms
Artificial Intelligence
Machine Learning
Data Science
Machine Learning with Python