🔥 New (1h56m) video lecture: "Let's build GPT: from scratch, in code, spelled out."
piped.video/watch?v=kCc8FmEb…
We build and train a Transformer following the "Attention Is All You Need" paper in the language modeling setting and end up with the core of nanoGPT.
Jan 17, 2023 · 5:18 PM UTC
483
2,995
19,842

