Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve

24 Sep 2025

(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. ai.meta.com/research/publica…

310

1,798

920,808

Gabriel Synnaeve · Jun 9, 2023 · 9:36 AM UTC

Gabriel Synnaeve @syhw

9 Jun 2023

We've just released MusicGen, and there is a @huggingface demo now, here is a thread about me playing with it just right now. huggingface.co/spaces/facebo… A 🧵👇

MusicGen - a Hugging Face Space by facebook

Enter a description of the music you want, optionally upload a short melody to guide it, and set parameters like length. The app then creates a music audio file (and, if chosen, a higher‑quality ve...

huggingface.co

252

1,163

698,189

Gabriel Synnaeve · Oct 9, 2025 · 11:45 AM UTC

Gabriel Synnaeve @syhw

9 Oct 2025

This is an excellent history of LLMs, doesn't miss seminal papers I know. Reminds you we're standing on the shoulders of giants, and giants are still being born today. gregorygundersen.com/blog/20…

110

684

127,697

Gabriel Synnaeve · Oct 4, 2024 · 1:57 PM UTC

Gabriel Synnaeve @syhw

4 Oct 2024

Reinforcement learning with execution feedback (RLEF). Lots of sweat went into this one, but what works in principle works in practice: for code generation we can turn compute into training data: arxiv.org/abs/2410.02089 This works for LLMs, but will lead to world models.

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Large language models (LLMs) deployed as agents solve user-specified tasks over multiple steps while keeping the required manual engagement to a minimum. Crucially, such LLMs need to ground their...

arxiv.org

554

62,864

Gabriel Synnaeve · Sep 30, 2025 · 7:36 PM UTC

Gabriel Synnaeve @syhw

30 Sep 2025

Everything I know in RL in one tweet: exploration>exploitation, easy to leverage off-policy positive rewards, hard to leverage off-policy negative rewards, update the policy often, focus on throughput, self-play or find asymmetric grounding, clip everything but check statistics.

486

33,753

Gabriel Synnaeve · Jun 18, 2024 · 5:10 PM UTC

Gabriel Synnaeve @syhw

18 Jun 2024

Multi-token prediction models are here huggingface.co/facebook/mult…

facebook/multi-token-prediction · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

318

91,521

Gabriel Synnaeve · Oct 21, 2024 · 4:05 PM UTC

Gabriel Synnaeve @syhw

21 Oct 2024

Want to do research in code generation with LLMs and wonky deep learning from the 90s? We're recruiting one Master student (M2) intern for 2025 at FAIR Paris in my team metacareers.com/jobs/1068714…

286

58,039

Gabriel Synnaeve · Dec 15, 2020 · 6:29 PM UTC

Gabriel Synnaeve @syhw

15 Dec 2020

The wav2letter Santa has brought 50k hours of read speech in 8 languages in CC-BY 4.0: - dataset: openslr.org/94/ - paper: arxiv.org/abs/2012.03411 - pretrained models: github.com/facebookresearch/…

275

Gabriel Synnaeve · Apr 17, 2024 · 8:56 PM UTC

Gabriel Synnaeve @syhw

17 Apr 2024

To all the defeatists who think there is nothing else but scale: * 5 years between Self-Attention Is All You Need and FlashAttention * Transformers still require warmup. Researchers: get back to work! The future is bright :)

254

76,807

Gabriel Synnaeve · Apr 5, 2023 · 8:17 AM UTC

Gabriel Synnaeve @syhw

5 Apr 2023

Do you need to quantize models? Try diffq, `pip install diffq` and github.com/facebookresearch/…

GitHub - facebookresearch/diffq: DiffQ performs differentiable quantization using pseudo quantiza...

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off b...

github.com

264

32,909

Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve @syhw

24 Sep 2025

4/ Here is an example of the Code World Model tracing the execution of the piece of code counting the "r"s in "strawberry". Think of it like a neural `pdb` that you can set to any initial frame state, and that reasoning can query as a tool in token space.

259

121,326

Gabriel Synnaeve · Aug 24, 2023 · 3:47 PM UTC

Gabriel Synnaeve @syhw

24 Aug 2023

Happy to be releasing Code Llama! We've built it on Llama 2 and improved it for code use cases. In particular it supports infilling out of the box, and was trained with sequences up to 16k tokens. Looking forward to what the community will build with it! 1/7

248

33,518

Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve @syhw

24 Sep 2025

2/ When humans plan, we imagine the possible outcomes of different actions. When we reason about code we simulate part of its execution in our head. The current generation of LLMs struggles to do this. What kind of research will an explicitly trained code world model enable?

258

33,944

Gabriel Synnaeve · Apr 20, 2021 · 3:36 PM UTC

Gabriel Synnaeve @syhw

20 Apr 2021

Flashlight's v0.3 release: a lightweight, modern C++ deep learning autograd-based library with SOTA models in speech recognition, language modeling, and vision: github.com/flashlight/flashl… dataloading/model/training/docs to follow [1/5]

Release v0.3 · flashlight/flashlight

First stable release post-consolidation. Separates Flashlight into four parts: flashlight/lib contains kernels and standalone utilities for sequence losses, beam search decoding, text processing, ...

github.com

225

Gabriel Synnaeve · Nov 28, 2023 · 3:18 PM UTC

Gabriel Synnaeve @syhw

28 Nov 2023

I'm hiring Master interns at FAIR Paris to work on code generation, to work with me and the awesome CodeGen team (@b_roziere, @jadecopet, @jnsgehring, @adiyossLC et al.). We do Code Llama and research. Candidate at metacareers.com/jobs/1126568… and send me an email or message.

228

77,015

Gabriel Synnaeve · Dec 14, 2024 · 7:10 PM UTC

Gabriel Synnaeve @syhw

14 Dec 2024

Just gave a talk on "Grounding LLMs in Code Execution" at the NeurIPS Hacker-Cup AI Competition, here are the slides docs.google.com/presentation…

[NeurIPS HackerCup 2024] Grounding LLMs in Code Execution

Grounding LLMs in Code Execution Gabriel Synnaeve, Meta, FAIR

docs.google.com

232

46,343

Gabriel Synnaeve · Aug 24, 2023 · 3:47 PM UTC

Gabriel Synnaeve @syhw

24 Aug 2023

We're releasing base models, Python-specialized models, and Instruct(ion following) models, all in sizes 7B, 13B, 34B params. Get the code and weights: github.com/facebookresearch/… Read the research paper: ai.meta.com/research/publica… Read the blog post: ai.meta.com/blog/code-llama-…

200

330,368

Gabriel Synnaeve · Jan 17, 2020 · 3:41 PM UTC

Gabriel Synnaeve @syhw

17 Jan 2020

Thanks to *big* team effort, we released the code and the trained models from our LibriSpeech acoustic model architecture study and SOTA results arxiv.org/abs/1911.08460 here github.com/facebookresearch/…

198

Gabriel Synnaeve · Dec 21, 2018 · 8:47 PM UTC

Gabriel Synnaeve @syhw

21 Dec 2018

The speech team at FAIR is delivering some open source presents: a fully convolutional ASR pipeline, a fast C++ ASR library, and a C++ ML library code.fb.com/ai-research/wav2…

Open sourcing wav2letter++, the fastest state-of-the-art speech system, and flashlight, an ML...

WHAT THE RESEARCH IS: A new fully convolutional approach to automatic speech recognition and wav2letter++, the fastest state-of-the-art end-to-end speech recognition system available. The approach …

engineering.fb.com

192

Gabriel Synnaeve · Nov 10, 2025 · 1:53 PM UTC

Gabriel Synnaeve @syhw

10 Nov 2025

Good overview of Code World Model from @rasbt! sebastianraschka.com/blog/20…

Beyond Standard LLMs

Linear Attention Hybrids, Text Diffusion, Code World Models, and Small Recursive Transformers

magazine.sebastianraschka.com

199

47,087

Gabriel Synnaeve · Aug 2, 2023 · 11:22 PM UTC

Gabriel Synnaeve @syhw

2 Aug 2023

Remember MusicGen? Today we open sourced the training code, as well as for AudioGen, and for EnCodec, and a bunch of goodies (models you can go play with and extend). Big congrats to @jadecopet, @honualx, @adiyossLC and everybody else for this last push! audiocraft.metademolab.com/?…

AudioCraft

AudioCraft is a single-stop code base for all your generative audio needs: music, sound effects, and compression after training on raw audio signals.

audiocraft.metademolab.com

186

33,866

Gabriel Synnaeve · Aug 1, 2025 · 12:06 PM UTC

Gabriel Synnaeve @syhw

1 Aug 2025

In another life I worked on StarCraft: Brood War, doing RL self-play from scratch with a population of agents. Lots of the lessons learned there I still carry to this day.

192

13,010

Gabriel Synnaeve · Dec 13, 2017 · 10:03 AM UTC

Gabriel Synnaeve @syhw

13 Dec 2017

NIPS Workshop "Learn to Code a Paper with State of the Art Frameworks" (I missed it): mltrain.cc/events/nips-highl… code: github.com/vasiloglou/mltrai…

186

Gabriel Synnaeve · Jun 3, 2017 · 12:58 AM UTC

Gabriel Synnaeve @syhw

3 Jun 2017

TorchCraft v1.3-0 is here github.com/TorchCraft/TorchC… C++, Python, Lua clients, for StarCraft control and state serialization. Thanks team!

Release v1.3-0 · TorchCraft/TorchCraft

Major release This a major release of TorchCraft. Major Changes Adds text drawing commands (#94) Adds status flags for units (#95, #96, #98) Separates map into a walkability map, buildability map,...

github.com

174

Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve @syhw

24 Sep 2025

6/ Find the links below: 📊 Tech Report ai.meta.com/research/publica… ⚖️ Models weights ai.meta.com/resources/models… 🤗 On Huggingface huggingface.co/facebook/cwm huggingface.co/facebook/cwm-… huggingface.co/facebook/cwm-… 🧑‍💻 Inference Code github.com/facebookresearch/…

170

17,112

Gabriel Synnaeve · Sep 16, 2017 · 1:35 PM UTC

Gabriel Synnaeve @syhw

16 Sep 2017

A toolkit for controlling Euro Truck Simulator 2 with python to develop self-driving algorithms. github.com/marshq/europilot

GitHub - marsauto/europilot: A toolkit for controlling Euro Truck Simulator 2 with the end-to-end...

A toolkit for controlling Euro Truck Simulator 2 with the end-to-end driving model - marsauto/europilot

github.com

146

Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve @syhw

24 Sep 2025

3/ CWM allows us to study this question. Our model is trained on large amounts of coding data & bespoke Python + Bash world modeling data, allowing it to simulate Python function execution and agentic interactions in Bash environments.

151

21,579

Gabriel Synnaeve · Dec 16, 2023 · 12:05 PM UTC

Gabriel Synnaeve @syhw

16 Dec 2023

My only swag this time, let's wish it becomes vintage but not collector!

143

20,350

Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve @syhw

24 Sep 2025

5/ The team and I can’t wait to see what new research will be enabled with a world model. We release 3 checkpoints under a research license: 1️⃣ CWM pretrained, e.g. for new posttrainings methods, 2️⃣ CWM SFT, e.g. for RL research, 3️⃣ CWM, e.g. for inference time scaling.

150

17,298

Gabriel Synnaeve · Jul 24, 2024 · 7:59 AM UTC

Gabriel Synnaeve @syhw

24 Jul 2024

A short time ago in 10 timezones from California away... While Llama 3.1 is (rightfully) all the rage, some weirdos are making progress on generating all tokens at once with flow matching (a diffusion family process), and testing on the hardest task to get exactly right: codegen!

Felix Kreuk @FelixKreuk

23 Jul 2024

Excited to share our latest work on discrete flow matching! A new framework that achieves SOTA non-autoregressive generation. For example, pass@1 on HumanEval is 6.7/11.6 and 6.7/13.1 on MBPP. Paper: arxiv.org/abs/2407.15595 [1/n]

142

18,607

Gabriel Synnaeve · Nov 9, 2022 · 8:54 AM UTC

Gabriel Synnaeve @syhw

9 Nov 2022

Sir David MacKay had a tremendous influence on me. First with Information Theory, Inference, and Learning Algorithms inference.org.uk/mackay/itil… which is my first read-cover-to-cover ML book, then with WithoutTheHotAir withouthotair.com/

130

Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve @syhw

24 Sep 2025

I'm immensely proud of the work done by my cracked CodeGen team at Meta, with PhD students and veterans, for which nothing is someone else's problem. The broader Meta AI community all pulled together for this. I'm very thankful for the unwavering support of our whole leadership.

135

11,210

Gabriel Synnaeve · Feb 4, 2025 · 12:28 AM UTC

Gabriel Synnaeve @syhw

4 Feb 2025

AGI delayed internally.

127

26,577

Gabriel Synnaeve · Feb 7, 2024 · 3:48 PM UTC

Gabriel Synnaeve @syhw

7 Feb 2024

We’re hiring PhD interns to work on code generation research at FAIR in EMEA! Please apply at metacareers.com/jobs/1126568… if you’re interested by research in Code Llama, LLMs, code generation, compilers, reinforcement learning.

118

34,934

Gabriel Synnaeve · Feb 26, 2025 · 6:58 PM UTC

Gabriel Synnaeve @syhw

26 Feb 2025

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution arxiv.org/abs/2502.18449 by @YuxiangWei9 @sidawxyz and the whole team! Get started with your favorite model here github.com/facebookresearch/…

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open...

The recent DeepSeek-R1 release has demonstrated the immense potential of reinforcement learning (RL) in enhancing the general reasoning capabilities of large language models (LLMs). While...

arxiv.org

119

11,567

Gabriel Synnaeve · Nov 17, 2023 · 8:25 PM UTC

Gabriel Synnaeve @syhw

17 Nov 2023

Science is not a zero-sum game. With more knowledge, we can do more things, with less. Science is not a zero-sum game. With more funding, more labs, we’ll train more students, and grow the pie for all. Science is not a zero-sum game. Publish, I can build on your breakthrough.

106

119,881

Gabriel Synnaeve · Oct 5, 2025 · 3:32 PM UTC

Gabriel Synnaeve @syhw

5 Oct 2025

it's what we do in Code World Model too ai.meta.com/temp/research/pu…

🇺🇦 Dzmitry Bahdanau @DBahdanau

26 Apr 2025

I am excited to open-source PipelineRL - a scalable async RL implementation with in-flight weight updates. Why wait until your bored GPUs finish all sequences? Just update the weights and continue inference! Code: github.com/ServiceNow/Pipeli… Blog: huggingface.co/blog/ServiceN…

114

15,574

Gabriel Synnaeve · Dec 31, 2017 · 7:21 PM UTC

Gabriel Synnaeve @syhw

31 Dec 2017

We just open-sourced wav2letter! facebook.com/ronan.collobert…

116

Gabriel Synnaeve · Dec 14, 2019 · 12:02 AM UTC

Gabriel Synnaeve @syhw

14 Dec 2019

Asja and I are thrilled to announce the list of accepted workshops for ICLR 2020 in Addis Ababa, Ethiopia. (The ICLR website will get updated soon.)

113

Gabriel Synnaeve · May 9, 2017 · 5:03 PM UTC

Gabriel Synnaeve @syhw

9 May 2017

SOTA machine translation with ConvNets code.facebook.com/posts/1978… paper: fb.me/convolutional-s2s.pdf code: github.com/facebookresearch/…

A novel approach to neural machine translation

Visit the post for more.

engineering.fb.com

111

Gabriel Synnaeve · Mar 20, 2024 · 12:59 PM UTC

Gabriel Synnaeve @syhw

20 Mar 2024

The CodeGen team at FAIR *in Paris* is recruiting junior and senior research engineers! metacareers.com/jobs/4176159… Come work with us @jadecopet @b_roziere @qcar_ @FabianGloeckle @KunhaoZ et al., and folks in EMEA @jnsgehring @TacoCohen @adiyossLC @FelixKreuk et al.

102

101,969

Gabriel Synnaeve · Apr 27, 2021 · 3:53 PM UTC

Gabriel Synnaeve @syhw

27 Apr 2021

pip install diffq more at github.com/facebookresearch/… and

Alexandre Défossez @honualx

27 Apr 2021

@adiyossLC @syhw and I are happy to present our work: Differentiable Model Compression with Pseudo Quantization Noise 🗜️💾🤖 Our method, DiffQ, uses additive noise as a proxy for quantization, giving differentiability with no Straight Through Estimator👇 github.com/facebookresearch/…

102

Gabriel Synnaeve · Feb 1, 2022 · 11:16 AM UTC

Gabriel Synnaeve @syhw

1 Feb 2022

STC is CTC with wildcards, easily implemented with WFSTs and benchmarked on ASR and HWR. 😅 arxiv.org/abs/2201.12208 (by @vineelk @awnihannun Ronan)

Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve @syhw

24 Sep 2025

7/ We hope CWM provides a strong testbed for research on improving code generation with world models. We performed multi-task RL, and CWM has competitive performance for its size with 67.6% on LiveCodeBench v5, 76% on AIME24, and 65.8% on SweBench Verified with test time scaling.

11,692

Gabriel Synnaeve · Oct 18, 2025 · 1:10 AM UTC

Gabriel Synnaeve @syhw

18 Oct 2025

AI can both be awesome today, tomorrow, and a ton of work is left to do for a while!

Dwarkesh Patel

@dwarkesh_sp

17 Oct 2025

The @karpathy interview 0:00:00 – AGI is still a decade away 0:30:33 – LLM cognitive deficits 0:40:53 – RL is terrible 0:50:26 – How do humans learn? 1:07:13 – AGI will blend into 2% GDP growth 1:18:24 – ASI 1:33:38 – Evolution of intelligence & culture 1:43:43 - Why self driving took so long 1:57:08 - Future of education Look up Dwarkesh Podcast on YouTube, Apple Podcasts, Spotify, etc. Enjoy!

43,276

Gabriel Synnaeve · Sep 24, 2025 · 9:17 PM UTC

Gabriel Synnaeve @syhw

24 Sep 2025

8/ Additionally, we’re publishing a preparedness report ai.meta.com/research/publica… in line with Meta’s Frontier AI Framework (ai.meta.com/static-resource/…). While CWM is intended for noncommercial research use, Meta makes system-level protections available (llama.com/llama-protections/).

12,023

Gabriel Synnaeve · Nov 7, 2025 · 3:50 PM UTC

Gabriel Synnaeve @syhw

7 Nov 2025

Legend.

Soumith Chintala

@soumithchintala

6 Nov 2025

Leaving Meta and PyTorch I'm stepping down from PyTorch and leaving Meta on November 17th. tl;dr: Didn't want to be doing PyTorch forever, seemed like the perfect time to transition right after I got back from a long leave and the project built itself around me. Eleven years at Meta. Nearly all my professional life. Making many friends for life. Almost eight years leading PyTorch, taking it from nothing to 90%+ adoption in AI. Walking away from this was one of the hardest things I've ever done. But I'm leaving with a full heart. PyTorch handles exascale training now. It powers foundation models that are redefining intelligence. It's in production at virtually every major AI company. It's taught in classrooms from MIT to rural India. The tools I dreamed about making accessible? They are. The barrier to entry I wanted to lower? It's almost gone. To be clear, there’s so much more to do. As long as AI evolves at a breakneck pace, PyTorch will continue to play catch up. Obsessing over the yet-to-come sometimes makes us forget how much we’ve already done. To everyone who built this with me—who believed research should be joyful, that tools should be elegant, that open source changes everything—thank you. This wasn't my journey. It was ours. What's next for me? Something small. Something new. Something I don't fully understand yet. Something uncomfortable. I could have moved to something else inside Meta. But I needed to know what's out there. I needed to do something small again. I couldn't live with the counterfactual regret of never trying something outside Meta. It's very hard to leave. I probably have one of the AI industry’s most leveraged seats, I lead the software layer that powers the entire AI industry. Every major AI company and hardware vendor are on a speed dial. This kind of power is really hard to give up. But curiosity ultimately won out in my head. Keep making AI delicious and accessible. I'll be watching. Probably filing issues. Definitely staying involved. Is PyTorch going to be okay? I don't want to be doing PyTorch forever. I don't want to be like Guido or Linus— bound to a single thing for decades. Last November, coinciding with the birth of my daughter, I started planning my exit with Aparna. My goal was to leave PyTorch in a good and stable place. By this August, during the second half of my parental leave, I knew: Edward, Suo, Alban, Greg, John, Joe and Jana were ready. The team faced hard people, product, technical and organizational problems and didn’t feel the need to lean back on me to solve these for them (unlike in the past). The product story they crafted for the PyTorch Conference was coherent—really coherent. The things I'd flagged red were turning healthy. The project didn't need me anymore. Unlike 2020-2022 (when I stepped down to go do robotics and came back when Lin, Dima and Dwarak left), I have strong confidence that this time PyTorch is truly resilient. The most aligned culture carriers of PyTorch – Greg, Alban, Ed, Jason and Joe are at the decision table now, and people with strong value alignment – Suo, John and Jana have joined them at the table. And there’s a long list of equally value-aligned people willing to sit at the table should any of these people leave. There are many little things that make up my confidence on the people – John worked on Julia and open-source for a very long time (in fact we hacked a Torch.jl in 2015), Suo has been the strongest systems builder and strategic partner I’ve had for the past two years, and Jana worked on resilient core systems for a very long time, I’ve had long technical and organizational discussions with her over the past few months that give me confidence. And the product lineup and execution in 2025 should be sufficient evidence for any remaining doubt. I’m confident that this band of PyTorchers are going to do exceptionally well. PyTorch might change in flavor because I no longer impose my own taste from the top, but I’m confident that the values are going to stay intact and the product is going to be awesome. My time at Meta The early years of FAIR were absolutely magical. I was part of a small family of absolutely brilliant people building state-of-the-art AI out in the open. From working on GANs with Emily Denton, Rob Fergus, Leon Bottou, Martin Arjovsky and the (now legendary) Alec Radford to building Starcraft bots with Gabriel Synnaeve, to building the first FAIR Cluster with Howard Mansell, to working on object detection with Adam Lerer and Piotr Dollar, to building PyTorch. It was more fun than I can describe in words. 2015 and 2016 were probably the most productive and professionally enjoyable years of my life. I’ll probably romanticize this period of my life forever. When I joined FAIR, I had massive impostor syndrome, and the first 3 months were very very difficult. I can’t credit Andrew Tulloch enough for being the most thoughtful, kind and welcoming mentor, without whom I wouldn’t have made it. I’m so damn bullish for Meta just from the fact that he’s back. --- My time on PyTorch was special. I loved every part of building it—designing it, managing it, being the PM, TL, comms lead, doc engineer, release engineer, squashing bugs, growth hacking, turning it into a coherent product with hundreds of people, transitioning it to industry stakeholdership – the whole nine yards. To the core PyTorch team at Meta: the engineers, researchers, open-source maintainers, docs writers, CI infrastructure folks, hardware partners, the community builders. To the hundreds more inside and outside Meta—thank you. You turned a library into a movement. There are too many people to credit and thank, but I can't not mention Adam Paszke, Sam Gross, Greg Chanan, Joe Spisak, Alban Desmaison, Edward Yang, Richard Zou, Tongzhou Wang, Francisco Massa, Luca Antiga, Andreas Köpf, Zach DeVito, Zeming Lin, Adam Lerer, Howard Mansell and Natalia Gimelshein. And Schrep. They made the launch happen. And so many more people became centrally important later: Lu Fang, Xiaodong Wang, Junjie Bai, Nikita Shulga, Horace He, Mark Saroufim, Jason Ansel, Dmytro Dzhulgakov, Yangqing Jia, Geeta Chauhan, Will Constable, Briah Hirsh, Jane Xu, Mario Lezcano, Piotr Balecki, Yinghai Lu, Less Wright, Andrew Tulloch, Bruce Lin, Woo Kim, Helen Suk, Chris Gottbrath, Peng Wu, Joe Isaacson, Eli Uriegas, Tristan Rice, Yanan Cao, Elias Ellison, Animesh Jain, Peter Noordhuis, Tianyu Liu, Yifu Wang, Lin Qiao and hundreds more. It’s criminal of me to not take the space to list out everyone else I should be mentioning here. PyTorch is nothing without its people ❤️. The most joyful moments of building PyTorch was meeting users eager to share their happiness, love and feedback. I remember a grad student coming to me at Neurips 2017, in a slurring emotional voice he said he’d been trying to make progress on his research for 3 years but within 3 months of using PyTorch he made so much progress that he was ready to graduate. That moment made it tangible that what we do matters, a lot, to a lot of people, even if you don't constantly hear from them. I do miss the intimacy of the PyTorch community, with a 300 person conference that felt like an extended family gathering, but I feel that’s a small price to pay considering the scale of impact PyTorch is truly having today – yes the Conference is now 3,000 people where market-moving deals get brokered, but it’s helping orders of magnitude more people to do their best AI work. I miss the intimacy, but I'm proud of that growth. --- To Mark Zuckerberg and Mike Schroepfer, who believed that open-sourcing is fundamentally important and is a sound business strategy. This is so hard to understand for most people within the course of business, but we’ve run lock-step on this strategy without ever having to discuss it. Without you two, neither FAIR nor PyTorch would’ve happened. And those mean so much to me. To Yann LeCun and Rob Fergus, for building the magical early FAIR that I so revere. To Aparna Ramani, a leader that I find so rare at Meta in her ability to hold a really high bar for the org, technically brilliant with the span to discuss deep infra systems and industry-strategy within the same conversation and for being an absolute execution-machine! I’ve learned so much from you. To Santosh, Kaushik, Delia, Oldham and Ben for being so welcoming to Infra. For someone coming over from FAIR with a wildly different culture, you all made me feel at home and made me part of the family, and thank you for that. To all my managers who've championed me through the PSC video game – Serkan, Howard, Jerome, Abhijit, Yoram, Joelle, Aparna and Damien – I owe you a lifetime of drinks. --- Signing off for now. —Soumith

15,007

Gabriel Synnaeve · Aug 9, 2014 · 1:25 PM UTC

Gabriel Synnaeve @syhw

9 Aug 2014

I finally did this "harsh/hands-on intro/tips to deep learning" blog post: snippyhollow.github.io/blog/… #deeplearning #machinelearning

Gabriel Synnaeve · Jun 4, 2017 · 12:28 AM UTC

Gabriel Synnaeve @syhw

4 Jun 2017

The CfP for our Video Games and Machine Learning workshop at ICML2017 has arrived syhw.github.io/vgml_workshop… (cc @togelius @OriolVinyalsML)

Gabriel Synnaeve · Nov 28, 2018 · 6:22 PM UTC

Gabriel Synnaeve @syhw

28 Nov 2018

We open sourced TorchCraftAI, a modular bot framework for StarCraft: Brood War AI research facebook.com/notes/gabriel-s…

Gabriel Synnaeve · Apr 18, 2024 · 7:41 AM UTC

Gabriel Synnaeve @syhw

18 Apr 2024

Algorithmic progress is faster than hardware progress. arxiv.org/abs/2403.05812

8,329

Gabriel Synnaeve · Oct 10, 2023 · 7:33 PM UTC

Gabriel Synnaeve @syhw

10 Oct 2023

- I don’t believe in AI extinction scenarios (more worried by climate change), - I believe we have agency on any AI development, - I think the best way to do so is through open source AI platforms, that provide democratic access.

Yann LeCun

@ylecun

10 Oct 2023

The heretofore silent majority of AI scientists and engineers who - do not believe in AI extinction scenarios or - believe we have agency in making AI powerful, reliable, and safe and - think the best way to do so is through open source AI platforms NEED TO SPEAK UP !

16,148

Gabriel Synnaeve · Apr 25, 2025 · 1:44 AM UTC

Gabriel Synnaeve @syhw

25 Apr 2025

3,398

Gabriel Synnaeve · Feb 12, 2024 · 1:37 PM UTC

Gabriel Synnaeve @syhw

12 Feb 2024

Come to the next ParisAI meetup! Happy to chat about codegen and LLMs!

ParisAI @parisai

5 Feb 2024

Join us for our next meetup on Tues 5 March, feat. talks on research and applications of AI technology from @instadeepai @GoogleDeepMind @ScientaLab @Meta Register/more information--> paris.ai

41,929

Gabriel Synnaeve · May 7, 2018 · 9:23 PM UTC

Gabriel Synnaeve @syhw

7 May 2018

Slides for my "Introduction to (Deep) RL", a bad attempt at going from zero to implementing A2C in 1h30. pdf: dropbox.com/s/nx0gpb01thqi0r… pptx: dropbox.com/s/ivgqivgxnoyk7t…

DRL101_NYU_notes.pdf

Shared with Dropbox

dropbox.com

Gabriel Synnaeve · Jun 9, 2023 · 9:37 AM UTC

Gabriel Synnaeve @syhw

9 Jun 2023

What will you do with it? The paper is here: arxiv.org/abs/2306.05284 Code here: github.com/facebookresearch/… And if you want to have more control, but still generate in your browser, Colab here: colab.research.google.com/dr…

Simple and Controllable Music Generation

We tackle the task of conditional music generation. We introduce MusicGen, a single Language Model (LM) that operates over several streams of compressed discrete music representation, i.e.,...

arxiv.org

6,440

Gabriel Synnaeve · Dec 9, 2020 · 11:54 AM UTC

Gabriel Synnaeve @syhw

9 Dec 2020

Reminder that Fukushima is (still) causing 3690 death *per year* "in Germany". (24.6 death/TWh [1] for coal and 150TWh [2]) [1] ourworldindata.org/safest-so… [2] en.wikipedia.org/wiki/Energy…

What are the safest and cleanest sources of energy?

Fossil fuels are the dirtiest and most dangerous energy sources, while nuclear and modern renewable energy sources are vastly safer and cleaner.

ourworldindata.org

Gabriel Synnaeve · Mar 7, 2015 · 8:46 AM UTC

Gabriel Synnaeve @syhw

7 Mar 2015

LT: feel free to follow along with: github.com/deeplearningparis… nbviewer.ipython.org/github/… nbviewer.ipython.org/github/… and ask questions here. :)

GitHub - deeplearningparis/dl-machine: Scripts to setup a GPU / CUDA-enabled compute server with...

Scripts to setup a GPU / CUDA-enabled compute server with libraries for deep learning - deeplearningparis/dl-machine

github.com

Gabriel Synnaeve · Dec 5, 2018 · 3:46 PM UTC

Gabriel Synnaeve @syhw

5 Dec 2018

Come see poster 128 at #NeurIPS2018 (upstairs poster room)!

Gabriel Synnaeve · Dec 9, 2016 · 4:10 PM UTC

Gabriel Synnaeve @syhw

9 Dec 2016

Our poster is ready at the deep RL workshop at #nips2016 come see us 5:30pm onwards 😊

Gabriel Synnaeve · Jul 23, 2024 · 3:09 PM UTC

Gabriel Synnaeve @syhw

23 Jul 2024

"Open Source AI is the Path Forward" llama.meta.com/

4,423

Gabriel Synnaeve · Dec 3, 2016 · 9:30 PM UTC

Gabriel Synnaeve @syhw

3 Dec 2016

Excited about TorchCraft's release github.com/TorchCraft/TorchC… Big thanks to @nntsn @ebetica for the last push. More at: facebook.com/gabriel.synnaev…

GitHub - TorchCraft/TorchCraft: Connecting Torch to StarCraft

Connecting Torch to StarCraft. Contribute to TorchCraft/TorchCraft development by creating an account on GitHub.

github.com

Gabriel Synnaeve · Jul 17, 2017 · 3:19 PM UTC

Gabriel Synnaeve @syhw

17 Jul 2017

Spatially-sparse convolutions for Torch and PyTorch facebook.com/benjamin.thomas… Code: github.com/facebookresearch/…

Benjamin Graham

Source code release for: Submanifold Sparse Convolutional Networks https://github.com/facebookresearch/SparseConvNet Benjamin Graham and Laurens van der Maaten SparseConvNet: A Torch/PyTorch...

facebook.com

Gabriel Synnaeve · Aug 24, 2023 · 3:47 PM UTC

Gabriel Synnaeve @syhw

24 Aug 2023

And beyond just code completion and code generation, it can help you finding bugs or pair program in general.

65,141

Gabriel Synnaeve · Jul 1, 2019 · 6:49 PM UTC

Gabriel Synnaeve @syhw

1 Jul 2019

Growing Action Spaces, a "curriculum" that is always available (no need to modify the environment), from @greg_far and team!

Greg Farquhar @greg_far

1 Jul 2019

Progressively growing the action space creates a great curriculum for learning agents -- check out our paper: arxiv.org/abs/1906.12266 + code: github.com/TorchCraft/TorchC…. Great working with Laura Gustafson @ebetica @shimon8282 Nicolas Usunier @syhw

Gabriel Synnaeve · Nov 30, 2018 · 4:23 PM UTC

Gabriel Synnaeve @syhw

30 Nov 2018

We (with @ebetica and the team) are publishing "Forward Modeling for Partial Observation Strategy Games – A StarCraft Defogger" at NeurIPS 2018, here is a 3min video piped.video/watch?v=L1uHMAW6… Come talk to us at poster #128 on Wednesday morning, paper link: papers.nips.cc/paper/8272-fo…

StarCraft_Defogger_NeurIPS_2018

http://papers.nips.cc/paper/8272-forward-modeling-for-partial-obser...

youtube.com

Gabriel Synnaeve · Feb 21, 2020 · 3:43 AM UTC

Gabriel Synnaeve @syhw

21 Feb 2020

The front cover of the MacTeX manual aims at producing panic attacks in graphic designers...

Gabriel Synnaeve · Sep 21, 2017 · 8:44 PM UTC

Gabriel Synnaeve @syhw

21 Sep 2017

Deep Reinforcement Learning that Matters arxiv.org/abs/1709.06560

Deep Reinforcement Learning that Matters

In recent years, significant progress has been made in solving challenging problems across various domains using deep reinforcement learning (RL). Reproducing existing work and accurately judging...

arxiv.org

Gabriel Synnaeve · Nov 8, 2023 · 3:35 PM UTC

Gabriel Synnaeve @syhw

8 Nov 2023

Replying to @ylecun @giffmana @boazbaraktcs

Compulsory piped.video/MFzDaBzBlL0?si=7yoE…

The Backwards Brain Bicycle - Smarter Every Day 133

Get your own here ⇒ http://bit.ly/BuyBackwardsBike ⇐ Shirt: https:...

youtube.com

143,609

Gabriel Synnaeve · Jun 18, 2024 · 5:08 PM UTC

Gabriel Synnaeve @syhw

18 Jun 2024

A bunch of releases from Meta FAIR today: ai.meta.com/blog/meta-fair-r…

Sharing new research, models, and datasets from Meta FAIR

Meta FAIR is releasing several new research artifacts. Our hope is that the research community can use them to innovate, explore, and discover new ways to apply AI at scale.

ai.meta.com

3,748

Gabriel Synnaeve · Mar 2, 2017 · 1:37 PM UTC

Gabriel Synnaeve @syhw

2 Mar 2017

Faiss: A library for efficient similarity search and clustering of dense vectors. github.com/facebookresearch/… CPU and GPU large kNN SOTA!

Gabriel Synnaeve · Nov 3, 2020 · 12:12 PM UTC

Gabriel Synnaeve @syhw

3 Nov 2020

We just moved back to France from NYC to be closer to family, but my heart is with America today: I wish you the best, and hope for a big blue wave.

Gabriel Synnaeve · Mar 22, 2024 · 11:46 PM UTC

Gabriel Synnaeve @syhw

22 Mar 2024

Before enlightenment: add layers, train longer. After enlightenment: add layers, train longer.

4,271

Gabriel Synnaeve · Sep 26, 2025 · 2:37 PM UTC

Gabriel Synnaeve @syhw

26 Sep 2025

Replying to @giffmana

Thanks Lucas. Yeah and we're not saying we're there yet, we've just opened this research direction. I'd say we're at the first "CoT" paper, "o1" still to be discovered.

1,284

Gabriel Synnaeve · Nov 17, 2023 · 5:53 PM UTC

Gabriel Synnaeve @syhw

17 Nov 2023

Dream team

kyutai @kyutai_labs

17 Nov 2023

Replying to @kyutai_labs

Our founding team is covering many AI fields from vision, with Patrick Pérez and Hervé Jégou (@hjegou) to LLMs with Edouard Grave (@EXGRV), audio with Neil Zeghidour (@neilzegh) and Alexandre Défossez (@honualx) and infra with Laurent Mazaré (@lmazare).

3,912

Gabriel Synnaeve · Nov 9, 2024 · 10:05 PM UTC

Gabriel Synnaeve @syhw

9 Nov 2024

Playing StarCraft for the first time in ages with my teenager-years team, we're 2% of the EU players online on BNet :-D

5,096

Gabriel Synnaeve · Jun 9, 2023 · 9:36 AM UTC

Gabriel Synnaeve @syhw

9 Jun 2023

Then I asked it to generate "Swinged folk balad about the happy times of a paper release. 4/4 drums that are not too loud" with this audio conditioning. All first tries, no cheating!

6,409

Gabriel Synnaeve · Feb 23, 2016 · 12:39 AM UTC

Gabriel Synnaeve @syhw

23 Feb 2016

Practical Black-Box Attacks against Deep Learning Systems using Adversarial Examples arxiv.org/abs/1602.02697

Gabriel Synnaeve · Jan 25, 2017 · 8:28 PM UTC

Gabriel Synnaeve @syhw

25 Jan 2017

Dermatologist-level classification of skin cancer with deep neural networks (the paper:) nature.com/articles/nature21…

Gabriel Synnaeve · Dec 2, 2016 · 3:53 AM UTC

Gabriel Synnaeve @syhw

2 Dec 2016

Nice vulgarization of SGD, backprop and ConvNets code.facebook.com/posts/3848…

Artificial intelligence, revealed

Visit the post for more.

engineering.fb.com

Gabriel Synnaeve · Aug 24, 2023 · 3:47 PM UTC

Gabriel Synnaeve @syhw

24 Aug 2023

Here are some fun things we did with it: > I have a pandas dataframe with the columns "decoding", "Capabilities", "Fine-tuning", "Model size", "HE pass@1", "MBPP pass@1". I want a seaborn figure with two scatterplots side-by-side. [...]

49,376

Gabriel Synnaeve · Dec 4, 2019 · 8:42 PM UTC

Gabriel Synnaeve @syhw

4 Dec 2019

"A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning" by @alcinos26 et al. is going to be presented at NeurIPS next week, come talk to us! ai.facebook.com/blog/using-m…

Gabriel Synnaeve · Jun 9, 2023 · 9:37 AM UTC

Gabriel Synnaeve @syhw

9 Jun 2023

Made with ❤️ by @FelixKreuk, @jadecopet, @itai_gat, Tal Remez, David Kant, @adiyossLC, @honualx (make sure to follow them!) Videos generated with github.com/adefossez/seewav

GitHub - adefossez/seewav: Audio waveform visualisation, converts any audio to a nice video

Audio waveform visualisation, converts any audio to a nice video - adefossez/seewav

github.com

5,171

Gabriel Synnaeve · Sep 4, 2019 · 2:24 PM UTC

Gabriel Synnaeve @syhw

4 Sep 2019

The ICLR 2020 call for workshops is online, workshops will take place on April 26th, 2020, in Addis Ababa, Ethiopia iclr.cc/Conferences/2020/Cal…

Gabriel Synnaeve · Apr 25, 2017 · 8:37 AM UTC

Gabriel Synnaeve @syhw

25 Apr 2017

Our poster is up at ICLR in C30 (first floor) 😊

Gabriel Synnaeve · Dec 9, 2024 · 11:03 PM UTC

Gabriel Synnaeve @syhw

9 Dec 2024

Gonna be at NeurIPS starting tomorrow afternoon. See you there, in particular if you want to talk about codegen and (post-)LLM research!

3,166

Gabriel Synnaeve · Dec 8, 2023 · 1:13 PM UTC

Gabriel Synnaeve @syhw

8 Dec 2023

piped.video/watch?v=lowvJWnX… piped.video/watch?v=xCE0xO1Y… by @NimOne510 @BigDaddyCh0p We're just at the beginning, we had lil' data But A.I. ain't replacing artists anytime A.I. ain't those guys creativity I tell ya They even turn A.I. brain farts into arts h/t @Thom_Wolf for the link

9,331

Gabriel Synnaeve · Dec 5, 2023 · 11:55 PM UTC

Gabriel Synnaeve @syhw

5 Dec 2023

Replying to @awnihannun

Awesome!! Congrats to the whole team 😊

2,850

Gabriel Synnaeve · Aug 24, 2023 · 3:47 PM UTC

Gabriel Synnaeve @syhw

24 Aug 2023

We showed Code Llama can be easily fine-tuned to SOTA (e.g. on MBPP below), so we hope it'd be a good foundation model for code. It's also state-of-the-art for multiple languages. There is already a nice thread by @b_roziere with some more details

Baptiste Rozière

@b_roziere

24 Aug 2023

Today, we release CodeLlama, a collection of base and instruct-finetuned models with 7B, 13B and 34B parameters. For coding tasks, CodeLlama 7B is competitive with Llama 2 70B and CodeLlama 34B is state-of-the-art among open models. Paper and weights: ai.meta.com/research/publica…

5,252

Gabriel Synnaeve · Jun 9, 2023 · 9:36 AM UTC

Gabriel Synnaeve @syhw

9 Jun 2023

"Free jazz with electronic saxophone played by somebody who enjoys counterpoint"

9,718

Gabriel Synnaeve · Jul 7, 2023 · 8:53 AM UTC

Gabriel Synnaeve @syhw

7 Jul 2023

An hommage thread!

5,321

Gabriel Synnaeve · Jan 2, 2025 · 9:37 AM UTC

Gabriel Synnaeve @syhw

2 Jan 2025

EnCodec running in ffmpeg piped.video/watch?v=5wlNAGep…

ZML x FFmpeg

This demo shows EnCodec (https://ai.honu.io/papers/encodec/samples....

youtube.com

4,368

Gabriel Synnaeve · Jul 18, 2014 · 2:17 PM UTC

Gabriel Synnaeve @syhw

18 Jul 2014

How do I debug my neural nets? I plot gradients and updates over epochs (time), as a video piped.video/watch?v=Fj_0R1Yn…

Gabriel Synnaeve · May 9, 2017 · 5:04 PM UTC

Gabriel Synnaeve @syhw

9 May 2017

and pre-trained models! github.com/facebookresearch/…

Gabriel Synnaeve · Dec 21, 2018 · 7:11 PM UTC

Gabriel Synnaeve @syhw

21 Dec 2018

Nevergrad: A Python toolbox for performing gradient-free optimization, from FAIRies github.com/facebookresearch/…

GitHub - facebookresearch/nevergrad: A Python toolbox for performing gradient-free optimization

A Python toolbox for performing gradient-free optimization - facebookresearch/nevergrad

github.com

Gabriel Synnaeve · Oct 10, 2016 · 11:35 PM UTC

Gabriel Synnaeve @syhw

10 Oct 2016

Improving Monte Carlo Tree Search Policies in StarCraft via Probabilistic Models Learned from Replay Data nova.wolfwork.com/papers/aii…

Gabriel Synnaeve · Feb 28, 2017 · 8:49 PM UTC

Gabriel Synnaeve @syhw

28 Feb 2017

Pre-trained word vectors for several languages in fastText: facebook.com/groups/11745472…

FastText Users | We are publishing pre-trained word vectors for 90...

We are publishing pre-trained word vectors for 90 languages, trained on Wikipedia with fastText. The list of languages and links to download the models are available here:...

facebook.com

Gabriel Synnaeve · Jun 3, 2017 · 7:26 PM UTC

Gabriel Synnaeve @syhw

3 Jun 2017

"in Go, [...] pattern macthing algos are not yet at the strategical level of human players" 2012,my thesis aged fast emotion.inrialpes.fr/people/…

Gabriel Synnaeve · Nov 6, 2015 · 9:35 PM UTC

Gabriel Synnaeve @syhw

6 Nov 2015

"Autograd for Torch" has arrived blog.twitter.com/2015/autogr… Thanks to the people from Twitter Cortex!

Gabriel Synnaeve · Mar 9, 2022 · 5:50 PM UTC

Gabriel Synnaeve @syhw

9 Mar 2022

For my ASR twitter, interested in a single multi-modal code-switching model? @lorenlugosch made a good thread about the models release of his internship's work nitter.app/lorenlugosch/status/15… As he puts it: "Death To Tokenizers!", just forward() this model on whatever speech you have...

Loren Lugosch @lorenlugosch

9 Mar 2022

We're releasing an open-source massively multilingual speech recognizer! Repo (+ colab notebook): github.com/flashlight/wav2le… It's a 1-billion-parameter CTC transformer. This is a very cool model, for a few reasons: