🔥 New (1h56m) video lecture: "Let's build GPT: from scratch, in code, spelled out." piped.video/watch?v=kCc8FmEb… We build and train a Transformer following the "Attention Is All You Need" paper in the language modeling setting and end up with the core of nanoGPT.

Jan 17, 2023 · 5:18 PM UTC

483
2,995
19,842