Name: Open AI Whisper - From Paper to Code
Start: 2025-08-19T20:00:00+01:00
End: 2025-08-19T21:30:00+01:00

Following on from the previous week's unpacking of the *Whisper* paper (*Robust Speech Recognition via Large-Scale Weak Supervision*), in this session we switch gears and dive into the code.

🔧 What we’ll look at together:

1. Preprocessing of audio data into the token equivalent representations fed into the encoder.,
2. Cross-encoder mechanism.
3. Learned Positional Encoding,
4. Multitask training format specification

Please note that the session is not recorded.

Key links:

* [Code](https://github.com/openai/whisper)
* [Research Paper](https://arxiv.org/abs/2212.04356)
* [Zoom](https://us06web.zoom.us/j/89872165234?pwd=bZKwS4Hy8hMpbKahaqqNyDvIeI9Zqz.1)

Discord joining instructions: [https://bit.ly/llm-discord](https://bit.ly/llm-discord)

Imran H

LLM Reading Club

Technology

Python

AI and Society

Data Science

Data Science using Python

Artificial Intelligence

AI Algorithms

Machine Learning

Machine Learning with Python

Open AI Whisper - From Paper to Code

Online event

Share

LLM Reading Club

Open AI Whisper - From Paper to Code

LLM Reading Club

Details

Related topics

You may also like