Attention Is All You Need
Details
In this session we will cover "Attention Is All You Need" (Vaswani et al., 2017), the paper that introduced the Transformer architecture. We'll break down the key ideas and walk through the code implementation together.
