Sean Welleck · Jan 15, 2025 · 3:55 PM UTC

Sean Welleck

@wellecks

15 Jan 2025

Excited to teach Advanced NLP at CMU this semester! Slides will be posted on the course page as the course proceeds: cmu-l3.github.io/anlp-spring… Lectures will be uploaded to YouTube: piped.video/playlist?list=PL…

175

1,023

759,449

Sean Welleck · Feb 19, 2025 · 1:23 AM UTC

Sean Welleck

@wellecks

19 Feb 2025

Lecture 11: Reinforcement Learning piped.video/disWB7qwcOk
- RL basics
- Reward functions for NLP
- Optimizing rewards (policy gradient)
- Stabilizing learning (e.g., KL penalty, PPO)

Sean Welleck

@wellecks

15 Jan 2025

783

85,230

Sean Welleck · Sep 25, 2024 · 6:38 PM UTC

Sean Welleck

@wellecks

25 Sep 2024

Slides for my recent talk on: "Reasoning with inference-time compute" wellecks.com/data/welleck202…
Papers:
- Lean-STaR: arxiv.org/abs/2407.10040
- Easy-to-hard: arxiv.org/abs/2403.09472
- Compute-optimal inference: arxiv.org/abs/2408.00724
- Meta-generation: arxiv.org/abs/2406.16838

111

710

64,517

Sean Welleck · Apr 4, 2025 · 12:41 PM UTC

Sean Welleck

@wellecks

4 Apr 2025

Lecture 20: Advanced Post-Training piped.video/yuJUkR2vvJM
- Supervised Fine-tuning
- Reward Modeling
- Reinforcement Learning
- Direct Preference Optimization

CMU Advanced NLP Spring 2025 (20): Advanced Post-Training

This lecture (by Sean Welleck) for CMU CS 11-711, Advanced NLP cove...

youtube.com

Sean Welleck

@wellecks

15 Jan 2025

664

72,504

Sean Welleck · Mar 21, 2025 · 2:10 PM UTC

Sean Welleck

@wellecks

21 Mar 2025

Lecture 16: Parallelism and Scaling piped.video/Mpg1YJfAEH0
- Basics of training on one device
- Parallelization on multiple devices (e.g., data, tensor, pipeline parallel)
- Combining and comparing strategies

CMU Advanced NLP Spring 2025 (16): Parallelism and Scaling

This lecture (by Sean Welleck) for CMU CS 11-711, Advanced NLP cove...

youtube.com

Sean Welleck

@wellecks

15 Jan 2025

638

68,428

Sean Welleck · Jan 29, 2025 · 1:46 AM UTC

Sean Welleck

@wellecks

29 Jan 2025

Lecture 5: Transformers
- Attention
- Transformers
- Improved transformers
piped.video/bN6YylvZCzM

Sean Welleck

@wellecks

15 Jan 2025

524

42,175

Sean Welleck · Jun 27, 2024 · 2:00 PM UTC

Sean Welleck

@wellecks

27 Jun 2024

What do nucleus sampling, tree-of-thought, and PagedAttention have in common? They're all part of our new survey: "From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models" arxiv.org/abs/2406.16838

112

530

69,826

Sean Welleck · Nov 22, 2023 · 8:10 PM UTC

Sean Welleck

@wellecks

22 Nov 2023

Announcing the L3 Lab at CMU! cmu-l3.github.io/
We focus on Learning, Language, and Logic, including:
- Principles of ML for language
- ML in high-trust areas, such as verifying math and programs
- ML systems that improve over time
Recruiting PhD students for fall 2024!

512

70,917

Sean Welleck · Mar 31, 2025 · 1:03 PM UTC

Sean Welleck

@wellecks

31 Mar 2025

Lecture 19: Efficient Inference piped.video/jbHgzU4r7yU
- Basics of efficient LLM inference
- Speeding up single-token and sequence generation
- Speeding up meta-generation strategies

CMU Advanced NLP Spring 2025 (19): Efficient Inference

This lecture (by Sean Welleck) for CMU CS 11-711, Advanced NLP cove...

youtube.com

Sean Welleck

@wellecks

15 Jan 2025

509

43,130

Sean Welleck · Mar 14, 2025 · 1:18 AM UTC

Sean Welleck

@wellecks

14 Mar 2025

Lecture 15: Quantization (Guest lecture by @Tim_Dettmers) piped.video/YXZZaje76r4
- Quantization basics
- Quantized foundation models: LLM.int8()
- Finetuning foundation models: QLoRA
- Quantization and users

Sean Welleck

@wellecks

15 Jan 2025

486

58,144

Sean Welleck · Feb 12, 2025 · 1:39 PM UTC

Sean Welleck

@wellecks

12 Feb 2025

Lecture 9: Fine-tuning
- Fine-tuning basics
- Instruction tuning
- Knowledge distillation
- Efficient fine-tuning
piped.video/watch?v=3qW996ux…

Sean Welleck

@wellecks

15 Jan 2025

448

41,705

Sean Welleck · Mar 26, 2025 · 1:31 AM UTC

Sean Welleck

@wellecks

26 Mar 2025

Lecture 18: Advanced Inference Strategies piped.video/jNpeYvZtJkw
- Parallel, tree search, refinement strategies
- Long chain-of-thought
- Inference scaling laws

Sean Welleck

@wellecks

15 Jan 2025

439

41,369

Sean Welleck · Jul 29, 2024 · 9:35 PM UTC

Sean Welleck

@wellecks

29 Jul 2024

Interested in LLMs and Lean? Check out LLMLean, a tool for using LLMs to suggest proof steps and complete proofs in Lean: github.com/cmu-l3/llmlean Here's an example of using LLMLean with GPT-4o to solve problems from Mathematics in Lean:

262

28,935

Sean Welleck · Jan 21, 2024 · 11:29 PM UTC

Sean Welleck

@wellecks

21 Jan 2024

Teaching a new course on Neural Code Generation with @dan_fried! cmu-codegen.github.io/s2024/ Here is the lecture on pretraining and scaling laws: cmu-codegen.github.io/s2024/…

391

37,822

Sean Welleck · Feb 28, 2025 · 2:06 AM UTC

Sean Welleck

@wellecks

28 Feb 2025

Lecture 14: Agents piped.video/4_kbc0_J_U0
- What is an agent?
- Agent environments
- Agent patterns

CMU Advanced NLP Spring 2025 (14): Agents

This lecture (by Sean Welleck) for CMU CS 11-711, Advanced NLP cove...

youtube.com

Sean Welleck

@wellecks

15 Jan 2025

375

38,255

Sean Welleck · Apr 9, 2025 · 10:08 PM UTC

Sean Welleck

@wellecks

9 Apr 2025

Had a fun time giving the tutorial at @SimonsInstitute! Here are the materials:
Transformers for Mathematics Tutorial
- Slides: wellecks.com/transformers4ma…
- Code/exercises: github.com/wellecks/transfor…

Sean Welleck

@wellecks

8 Apr 2025

Excited to give a tutorial on Transformers for Mathematics at @SimonsInstitute tomorrow! Part of the wonderful Workshop on AI for Mathematics and Theoretical Computer Science simons.berkeley.edu/workshop…

374

34,686

Sean Welleck · Jun 4, 2025 · 1:49 PM UTC

Sean Welleck

@wellecks

4 Jun 2025

New paper by Andre He: Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening arxiv.org/abs/2506.02355 Tired of sharpening the distribution? Try unlikeliness reward to learn new things from the roads less traveled

358

32,044

Sean Welleck · Nov 20, 2024 · 12:23 AM UTC

Sean Welleck

@wellecks

20 Nov 2024

I was honored to give a talk at Simons Institute on inference-time algorithms and meta-generation! simons.berkeley.edu/talks/se… It was a sneak-preview subset of our NeurIPS tutorial: cmu-l3.github.io/neurips2024…

345

27,371

Sean Welleck · Apr 8, 2025 · 4:45 AM UTC

Sean Welleck

@wellecks

8 Apr 2025

328

47,422

Sean Welleck · Apr 18, 2025 · 12:31 AM UTC

Sean Welleck

@wellecks

18 Apr 2025

And to finish off, Lectures 21 - 23:
- AI for Mathematics: piped.video/ToY57HgQKXA
- Multimodal I (CLIP / Llava): piped.video/5uI5WOpq8LQ
- Multimodal II (VQVAE / Chameleon): piped.video/VismiXpCs_Y

Sean Welleck

@wellecks

15 Jan 2025

317

28,160

Sean Welleck · Dec 6, 2024 · 3:46 PM UTC

Sean Welleck

@wellecks

6 Dec 2024

Curious about inference-time scaling, the #1 trending topic in LLMs? Come to our NeurIPS tutorial: Beyond Decoding: Meta-Generation Algorithms for LLMs (Tue. @ 1:30)! cmu-l3.github.io/neurips2024…

307

73,624

Sean Welleck · Aug 20, 2023 · 11:20 PM UTC

Sean Welleck

@wellecks

20 Aug 2023

A tutorial on neural theorem proving: github.com/wellecks/ntptutor…
Interactive notebooks for learning about combining neural language models with formal proof assistants.
Part I) Build and evaluate a next-step suggestion tool
Part II) LLM cascades and Draft, Sketch, Prove

295

65,247

Sean Welleck · Dec 10, 2023 · 5:24 PM UTC

Sean Welleck

@wellecks

10 Dec 2023

I was honored to give a talk at UW Mathematics on "Language models and formal mathematics", covering:
- Neural theorem proving tutorial: github.com/wellecks/ntptutor…
- LLMstep: mathai2023.github.io/papers/…
- Llemma: arxiv.org/abs/2310.10631
Slides are here! wellecks.com/data/welleck202…

272

41,015

Sean Welleck · Jan 31, 2025 · 1:30 AM UTC

Sean Welleck

@wellecks

31 Jan 2025

Lecture 6: Pretraining
- Pretraining objectives
- Data: quantity, quality, coverage
- Compute and scaling laws
piped.video/qUAkjz3-VFg

Sean Welleck

@wellecks

15 Jan 2025

272

19,175

Sean Welleck · Dec 18, 2020 · 8:33 PM UTC

Sean Welleck

@wellecks

18 Dec 2020

Successfully defended my PhD dissertation! Thank you to the committee members (@kchonyc, @hhexiy, @jaseweston, @zz_aws_nyush, Keith Ross) and all of those who made this possible. Excited to join the University of Washington as a postdoc with @YejinChoinka's group in early 2021

238

Sean Welleck · Mar 7, 2025 · 2:47 PM UTC

Sean Welleck

@wellecks

7 Mar 2025

New paper by @PranjalAggarw16: L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning arxiv.org/abs/2503.04697 We train L1: a reasoning model with controllable thinking length, allowing for precisely trading off test-time compute for improved reasoning

228

14,885

Sean Welleck · Aug 14, 2019 · 1:03 AM UTC

Sean Welleck

@wellecks

14 Aug 2019

our new paper: "Neural Text d̶e̶Generation with Unlikelihood Training" is now on arxiv! (w/ @uralik1, @stephenroller, Emily Dinan, @kchonyc, @jaseweston) arxiv.org/pdf/1908.04319.pdf A step towards solving the case of neural text degeneration 🔎

216

Sean Welleck · Apr 15, 2025 · 6:23 PM UTC

Sean Welleck

@wellecks

15 Apr 2025

I was honored to give a talk on AI for theorem proving for the Berkeley Advanced LLM Agents course! "Bridging Informal and Formal Mathematical Reasoning with AI"
YouTube: piped.video/live/Gy5Nm17l9oo
Slides: wellecks.com/data/welleck202…
It covers three themes from our recent work:
- Informal thoughts: Lean-STaR
- Informal proofs: Draft-Sketch-Prove, LeanHammer 👀
- Research-level math: miniCTX, LLMLean

Dawn Song

@dawnsongtweets

14 Apr 2025

📣 Today 4/14 at 4:10 PM PT, join us for the 10th Advanced LLM Agents MOOC lecture on Advanced Topics in Neural Theorem Proving by @wellecks @CarnegieMellon. 🌐 Join the thriving community of the LLM Agents MOOC series, with 23K+ registered learners & 10K+ members on Discord! 🚀 Register NOW for the AgentX Competition by @BerkeleyRDI @UCBerkeley, w. sponsors @Amazon @huggingface @LambdaAPI @MistralAI @Google @GroqInc @schmidtsciences, and VC partners @Accel @BainCapVC @BessemerVP @lightspeedvp @MayfieldFund @NEA! Exciting announcements on prizes/credits/resources and more coming soon!

223

18,868

Sean Welleck · Mar 26, 2021 · 6:40 PM UTC

Sean Welleck

@wellecks

26 Mar 2021

new paper: "NaturalProofs: Mathematical Theorem Proving in Natural Language" As a step towards systems that understand and use natural mathematical language, we develop a dataset of mathematical statements+proofs and a reference retrieval task. wellecks.github.io/naturalpr… (1/7)

211

Sean Welleck · Jul 19, 2024 · 1:35 PM UTC

Sean Welleck

@wellecks

19 Jul 2024

How can informal reasoning improve formal theorem proving? New paper: "Lean-STaR: Learning to Interleave Thinking and Proving" arxiv.org/abs/2407.10040 We introduce a framework for learning to interleave informal thoughts with steps of formal proving. 46.3% on miniF2F 🔥

215

25,032

Sean Welleck · May 26, 2022 · 5:17 AM UTC

Sean Welleck

@wellecks

26 May 2022

New paper: arxiv.org/abs/2205.12910 Theorem proving in natural mathematical language (the mix of symbolic and natural language used by humans) tests reasoning and plays a central role in mathematical education. Can language models prove theorems & help us when we're stuck? 1/N

205

Sean Welleck · Feb 11, 2025 · 2:40 PM UTC

Sean Welleck

@wellecks

11 Feb 2025

New paper by Weihua Du (@StigLidu): Optimizing Temperature for Language Models with Multi-Sample Inference arxiv.org/abs/2502.05234 We develop TURN, which automatically finds the optimal temperature for inference strategies such as best-of-N or majority voting (1/5)
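For readers unfamiliar with the two multi-sample strategies named above, here is a minimal Python sketch. It only illustrates majority voting and best-of-N selection over answers that have already been sampled. It is not TURN and does not tune temperature; the function names and the toy scoring function are hypothetical and chosen for illustration.

```python
from collections import Counter
from typing import Callable, Sequence


def majority_vote(answers: Sequence[str]) -> str:
    """Return the most frequent answer; ties go to the answer seen first."""
    if not answers:
        raise ValueError("need at least one sampled answer")
    return Counter(answers).most_common(1)[0][0]


def best_of_n(answers: Sequence[str], score: Callable[[str], float]) -> str:
    """Return the answer with the highest score under a caller-supplied scorer."""
    if not answers:
        raise ValueError("need at least one sampled answer")
    return max(answers, key=score)


# Toy data standing in for N samples drawn from a model at some temperature.
samples = ["42", "41", "42", "7", "42"]

print(majority_vote(samples))  # prints 42
# Hypothetical scorer: prefer answers numerically close to 40.
print(best_of_n(samples, score=lambda a: -abs(int(a) - 40)))  # prints 41
```

In both strategies the sampling temperature controls how diverse the candidate answers are, which is the quantity the tweet says TURN selects automatically.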

204

20,538

Sean Welleck · Feb 7, 2025 · 10:58 AM UTC

Sean Welleck

@wellecks

7 Feb 2025

Lecture 8: Prompting
- Prompting basics
- Few-shot prompting
- Prompt engineering
- Prompting patterns (e.g., chain-of-thought, prompt chaining)
piped.video/watch?v=hq5kld3k…

Sean Welleck

@wellecks

15 Jan 2025

198

17,353

Sean Welleck · Dec 15, 2024 · 11:06 PM UTC

Sean Welleck

@wellecks

15 Dec 2024

Our Inference Scaling Laws paper received an Outstanding Paper Award at NeurIPS Math-AI! Congrats to Yangzhen (@WYZ0402), Zhiqing (@EdwardSun0909), Shanda (@Shanda_Li_2000) and Yiming!

Sean Welleck

@wellecks

13 Dec 2024

Replying to @wellecks

2. Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models arxiv.org/abs/2408.00724 Oral presentation at Math-AI! Saturday, West Meeting Room 118-120

199

22,672

Sean Welleck · Jan 13, 2021 · 8:25 PM UTC

Sean Welleck

@wellecks

13 Jan 2021

My PhD thesis, "Order and Learning in Sequential Neural Structured Prediction" is now online at cs.nyu.edu/media/publication…

Sean Welleck

@wellecks

18 Dec 2020

185

Sean Welleck · Jan 23, 2025 · 2:27 PM UTC

Sean Welleck

@wellecks

23 Jan 2025

4 papers accepted at ICLR 2025!
- ImProver: arxiv.org/abs/2410.04753
- Inference Scaling Laws: arxiv.org/abs/2408.00724
- Lean-STaR: arxiv.org/abs/2407.10040
- miniCTX: arxiv.org/abs/2408.03350

ImProver: Agent-Based Automated Proof Optimization

Large language models (LLMs) have been used to generate formal proofs of mathematical theorems in proof assistants such as Lean. However, we often want to optimize a formal proof with respect to...

arxiv.org

186

15,043

Sean Welleck · Nov 4, 2024 · 9:42 PM UTC

Sean Welleck

@wellecks

4 Nov 2024

Excited to give a NeurIPS tutorial on LLM inference strategies, inference-time scaling laws & more with @mattf1n and @haileysch__ ! "Beyond Decoding: Meta-Generation Algorithms for Large Language Models" More details soon, check out arxiv.org/abs/2406.16838 in the meantime!

From Decoding to Meta-Generation: Inference-time Algorithms for...

One of the most striking findings in modern research on large language models (LLMs) is that scaling up compute during training leads to better results. However, less attention has been given to...

arxiv.org

Andrew M. Dai

@AndrewDai

4 Nov 2024

We’re excited to share the list of accepted tutorials for @NeurIPSConf ! Thanks to everyone who put in the time to submit a proposal. Check out the lineup and let us know which tutorials you’re most looking forward to! blog.neurips.cc/2024/10/17/i… with @irenetrampoline & @GalChechik

178

20,909

Sean Welleck · Aug 4, 2025 · 6:31 PM UTC

Sean Welleck

@wellecks

4 Aug 2025

Excited about CMU's new Institute for Computer-Aided Reasoning in Mathematics (ICARM), a new NSF Mathematical Sciences Research Institute. I'm honored to serve as an Assistant Director focusing on machine learning and mathematics.

Carnegie Mellon University @CarnegieMellon

4 Aug 2025

A new federally funded national institute at CMU will help mathematicians use AI to make mathematical reasoning faster and more reliable in solving pressing challenges across science, security and the economy. Read more, and scroll for further details: cmu.is/NSF-institute

172

25,204

Sean Welleck · Feb 27, 2025 · 2:30 PM UTC

Sean Welleck

@wellecks

27 Feb 2025

New paper by Pranjal Aggarwal (@PranjalAggarw16): Programming with Pixels: Computer-Use Meets Software Engineering arxiv.org/abs/2502.18525 We reframe agentic software engineering as interacting with an IDE using visual observations and simple actions like clicking and typing

6,887

Sean Welleck · Dec 13, 2024 · 6:52 PM UTC

Sean Welleck

@wellecks

13 Dec 2024

5 papers upcoming at NeurIPS:
1. Easy-to-Hard Generalization (TODAY 4:30-7:30, East Exhibit Hall A-C #2806) arxiv.org/abs/2403.09472
2. Inference Scaling Laws (Oral @ Math-AI) arxiv.org/abs/2408.00724
3. Lean-STaR @ Math-AI arxiv.org/abs/2407.10040
4. miniCTX @ Math-AI arxiv.org/abs/2408.03350
5. miniCodeProps @ Safe Generative AI arxiv.org/abs/2406.11915

166

18,390

Sean Welleck · Oct 23, 2024 · 1:49 PM UTC

Sean Welleck

@wellecks

23 Oct 2024

Code generation graduated from self-contained problems to complex codebases. Neural theorem proving should too! Introducing miniCTX, a new benchmark that tests a model's ability to prove theorems from complex, real Lean projects cmu-l3.github.io/minictx/ arxiv.org/abs/2408.03350

158

24,229

Sean Welleck · Dec 11, 2024 · 5:02 AM UTC

Sean Welleck

@wellecks

11 Dec 2024

Thank you for coming to the tutorial!
The recording is already up for those with NeurIPS registrations:
- neurips.cc/virtual/2024/tuto…
All of the materials are here for further reference/study:
- cmu-l3.github.io/neurips2024…
Also check out our code examples:
- github.com/cmu-l3/neurips202…

Sean Welleck

@wellecks

6 Dec 2024

Curious about inference-time scaling, the #1 trending topic in LLMs? Come to our NeurIPS tutorial: Beyond Decoding: Meta-Generation Algorithms for LLMs (Tue. @ 1:30)! cmu-l3.github.io/neurips2024…

151

10,946

Sean Welleck · Jun 21, 2024 · 3:14 PM UTC

Sean Welleck

@wellecks

21 Jun 2024

Can LLMs prove that code is correct? New paper: "miniCodeProps: a Minimal Benchmark for Proving Code Properties" arxiv.org/abs/2406.11915 miniCodeProps tests LLMs' ability to prove properties of simple Lean programs. Despite its simplicity, it's challenging! Led by Evan Lohn

145

20,757

Sean Welleck · Sep 16, 2019 · 1:11 PM UTC

Sean Welleck

@wellecks

16 Sep 2019

code and pre-trained models for "Neural Text Generation with Unlikelihood Training" now available!
- Train and fine-tune LMs with unlikelihood
- 🚨 fine-tune a GPT-2 model from pytorch-transformers with unlikelihood
github.com/facebookresearch/…

GitHub - facebookresearch/unlikelihood_training: Neural Text Generation with Unlikelihood Training

Neural Text Generation with Unlikelihood Training. Contribute to facebookresearch/unlikelihood_training development by creating an account on GitHub.

github.com

Sean Welleck

@wellecks

14 Aug 2019

142

Sean Welleck · Feb 5, 2025 · 3:37 PM UTC

Sean Welleck

@wellecks

5 Feb 2025

Lecture 7: Decoding algorithms (guest lecture by @abertsch72)
- decoding as optimization
- sampling algorithms
- constrained generation
piped.video/cN8yX_ZZWJw

CMU Advanced NLP Spring 2025 (7): Decoding Algorithms

This lecture (by Amanda Bertsch) for CMU CS 11-711, Advanced NLP co...

youtube.com

Sean Welleck

@wellecks

15 Jan 2025

142

13,418

Sean Welleck · Feb 10, 2020 · 2:07 AM UTC

Sean Welleck

@wellecks

10 Feb 2020

new paper "Consistency of a Recurrent Language Model With Respect to Incomplete Decoding" arxiv.org/pdf/2002.02492.pdf we show that common decoding algorithms can yield infinite-length, zero-probability strings from neural LMs♾ w/@uralik1, Jaedeok Kim, Richard Pang, @kchonyc (1/6)

138

Sean Welleck · Jun 8, 2020 · 2:29 AM UTC

Sean Welleck

@wellecks

8 Jun 2020

new paper w/ @kchonyc: “MLE-guided parameter search for task loss minimization in neural sequence modeling” arxiv.org/pdf/2006.03158.pdf Sequence-level training based on random search around the current parameters and the MLE gradient

131

Sean Welleck · Dec 20, 2019 · 1:06 PM UTC

Sean Welleck

@wellecks

20 Dec 2019

Our paper “Neural Text Generation with Unlikelihood Training” was accepted to #ICLR2020! w/ @uralik1 @stephenroller @em_dinan @kchonyc @jaseweston

Sean Welleck

@wellecks

16 Sep 2019

125

Sean Welleck · Dec 9, 2024 · 1:13 AM UTC

Sean Welleck

@wellecks

9 Dec 2024

In Vancouver for NeurIPS but don't have Taylor Swift tickets? You can still spend the day going through our tutorial reading list: - cmu-l3.github.io/neurips2024… Tuesday December 10, 1:30-4:00pm @ West Exhibition Hall C, NeurIPS

Sean Welleck

@wellecks

6 Dec 2024

Curious about inference-time scaling, the #1 trending topic in LLMs? Come to our NeurIPS tutorial: Beyond Decoding: Meta-Generation Algorithms for LLMs (Tue. @ 1:30)! cmu-l3.github.io/neurips2024…

128

15,150

Sean Welleck · Jan 17, 2025 · 1:55 PM UTC

Sean Welleck

@wellecks

17 Jan 2025

Lecture 2: Neural Text Representation and Classification piped.video/watch?v=2eJ3S1gX…
Includes:
- tokenization
- token embeddings
- minimizing cross entropy loss with neural networks

Sean Welleck

@wellecks

15 Jan 2025

124

9,107

Sean Welleck · Apr 5, 2024 · 10:53 PM UTC

Sean Welleck

@wellecks

5 Apr 2024

Version II of the tutorial on neural theorem proving: github.com/cmu-l3/ntptutoria…
Some new additions:
- Train a model that gets 29.5% on miniF2F
- Data extraction in Lean, based on lean-training-data
- LLMLean tool (github.com/cmu-l3/llmlean)

Sean Welleck

@wellecks

20 Aug 2023

117

21,302

Sean Welleck · Jan 22, 2025 · 2:06 PM UTC

Sean Welleck

@wellecks

22 Jan 2025

Lecture 3: Language Modeling Fundamentals
- What is a language model?
- Bigram, ngram, feedforward neural language model
- Connecting maximum likelihood, KL divergence, and cross entropy loss
piped.video/9JuMXy-5Y0E?si=bZAH…
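The connection between maximum likelihood, KL divergence, and cross-entropy named in the lecture outline above can be sketched in one line. The notation here is an assumption of this note, not taken from the lecture slides: p_* is the data distribution and p_θ is the model.

```latex
\[
H(p_*, p_\theta)
  = -\mathbb{E}_{x \sim p_*}\big[\log p_\theta(x)\big]
  = H(p_*) + D_{\mathrm{KL}}\big(p_* \,\|\, p_\theta\big)
\]
\[
\arg\max_\theta \; \mathbb{E}_{x \sim p_*}\big[\log p_\theta(x)\big]
  = \arg\min_\theta \; H(p_*, p_\theta)
  = \arg\min_\theta \; D_{\mathrm{KL}}\big(p_* \,\|\, p_\theta\big)
\]
```

The last equality holds because the entropy H(p_*) does not depend on θ, so minimizing cross-entropy loss, maximizing expected log-likelihood, and minimizing the KL divergence from the data distribution select the same parameters.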

Sean Welleck

@wellecks

15 Jan 2025

114

8,182

Sean Welleck · Aug 26, 2021 · 3:18 AM UTC

Sean Welleck

@wellecks

26 Aug 2021

Excited to announce “Math AI for Education: Bridging the Gap Between Research and Smart Education" (MathAI4Ed) A NeurIPS 2021 workshop on the intersection of AI, mathematics, and education. mathai4ed.github.io/ Now accepting submissions! (due Oct 06, 2021) (1/6)

103

Sean Welleck · Oct 16, 2020 · 1:21 AM UTC

Sean Welleck

@wellecks

16 Oct 2020

Thanks to @chelseabfinn for the great conversation about her work on meta-learning and robotics -- check it out below!

The Thesis Review Podcast @thesisreview

16 Oct 2020

Episode 10 of The Thesis Review: Chelsea Finn (@chelseabfinn), "Learning to Learn with Gradients" We discuss meta-learning, her work on MAML and its applications, and the future of robotics research soundcloud.com/thesis-review…

Sean Welleck · Dec 10, 2024 · 5:40 PM UTC

Sean Welleck

@wellecks

10 Dec 2024

We present AlphaVerus, which enables LLMs to generate provably correct Rust code via a new tree search and self-improvement loop.
Very excited about AlphaVerus as a starting point for truly trustworthy code generation. Amazing work by @PranjalAggarw16! alphaverus.github.io/

Pranjal Aggarwal ✈️ ICLR'26

@PranjalAggarw16

10 Dec 2024

LLMs often generate incorrect code. Instead, what if they can generate provably correct code? Presenting AlphaVerus: A self-reinforcing method that automatically learns to generate mathematically correct code using inference-time search and verifier feedback. 🧵

16,690

Sean Welleck · Jun 17, 2022 · 3:09 PM UTC

Sean Welleck

@wellecks

17 Jun 2022

Honored to receive a Best Paper award at NAACL 2022 for NeuroLogic A*esque Decoding, with an awesome team @GXiming @PeterWestTM @liweijianglw @wittgen_ball @DanielKhashabi @Ronan_LeBras @Lianhuiq @YoungjaeYu3 @rown @nlpnoah @YejinChoinka ! arxiv.org/abs/2112.08726

NeuroLogic A*esque Decoding: Constrained Text Generation with...

The dominant paradigm for neural text generation is left-to-right decoding from autoregressive language models. Constrained or controllable generation under complex lexical constraints, however,...

arxiv.org

William Wang

@WilliamWangNLP

17 Jun 2022

#NAACL2022 Best Paper Session schedule is out. 👀 👀 👀

Sean Welleck · Aug 4, 2024 · 10:15 PM UTC

Sean Welleck

@wellecks

4 Aug 2024

Our CMU-MATH team placed 2nd in the AIMO progress prize! (1st academic team 😎)
The solution combines inference algorithms, clever reward model training, and program-aided reasoning.
Code: github.com/AIMO-CMU-MATH/CMU…
Models: huggingface.co/AIMO-CMU-MATH
Blog: blog.ml.cmu.edu/2024/07/29/c…

GitHub - AIMO-CMU-MATH/CMU_MATH-AIMO

Contribute to AIMO-CMU-MATH/CMU_MATH-AIMO development by creating an account on GitHub.

github.com

ML@CMU @mlcmublog

29 Jul 2024

🔥Our CMU-MATH team proudly clinched 2nd place in the Artificial Intelligence Mathematical Olympiad (AIMO) out of 1,161 participating teams with the best performance of an academic team! Dive into our blog to discover our winning formula: blog.ml.cmu.edu/2024/07/29/c…

12,560

Sean Welleck · Apr 16, 2025 · 2:48 AM UTC

Sean Welleck

@wellecks

16 Apr 2025

Cool to see our L1 (arxiv.org/abs/2503.04697) methodology used here! And a nice insight about using the controllable reasoning budget to enable more efficient use of inference hardware

Prime Intellect

@PrimeIntellect

15 Apr 2025

Replying to @PrimeIntellect

With INTELLECT-2 we aim for frontier reasoning performance with a controllable thinking budget. By incorporating length rewards into our training run, users can specify how long the model should reason for a given task. primeintellect.ai/blog/intel…

11,365

Sean Welleck · Dec 10, 2024 · 5:36 PM UTC

Sean Welleck

@wellecks

10 Dec 2024

Our LLM inference tutorial is happening TODAY! cmu-l3.github.io/neurips2024… Tuesday December 10, 1:30-4:00pm @ West Exhibition Hall C, NeurIPS See you there!

Sean Welleck

@wellecks

6 Dec 2024

Curious about inference-time scaling, the #1 trending topic in LLMs? Come to our NeurIPS tutorial: Beyond Decoding: Meta-Generation Algorithms for LLMs (Tue. @ 1:30)! cmu-l3.github.io/neurips2024…

10,770

Sean Welleck · Sep 15, 2022 · 12:53 AM UTC

Sean Welleck

@wellecks

15 Sep 2022

Three papers accepted at #NeurIPS2022 - looking forward to chatting about reasoning & generation in New Orleans! 1. NaturalProver, neural (informal) theorem proving with language models w/ @liujc1998, @GXiming, @HannaHajishirzi, @YejinChoinka

Sean Welleck

@wellecks

26 May 2022

Sean Welleck · Oct 7, 2021 · 4:02 PM UTC

Sean Welleck

@wellecks

7 Oct 2021

new paper: "Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics" Sequence models show amazing performance on many tasks. Does perfect test accuracy tell the full story? w/ @PeterWestTM, @JizeCao, @YejinChoinka arxiv.org/abs/2109.13986

Sean Welleck · Jan 24, 2025 · 2:33 PM UTC

Sean Welleck

@wellecks

24 Jan 2025

Lecture 4: Recurrent Neural Networks
- Recurrent neural networks
- Vanishing gradients and other recurrent architectures
- Encoder-decoder
- Attention
piped.video/MDYywCo3-rM?si=toXN…

Sean Welleck

@wellecks

15 Jan 2025

5,625

Sean Welleck · May 22, 2025 · 5:15 PM UTC

Sean Welleck

@wellecks

22 May 2025

New COLM workshop on test-time scaling and reasoning models! Submit your papers by June 23, more info: scalr-workshop.github.io/

Muhammad Khalifa

@MKhalifaaaa

22 May 2025

🚨Announcing SCALR @ COLM 2025 — Call for Papers!🚨 The 1st Workshop on Test-Time Scaling and Reasoning Models (SCALR) is coming to @COLM_conf in Montreal this October! This is the first workshop dedicated to this growing research area. 🌐 scalr-workshop.github.io

9,822

Sean Welleck · Mar 21, 2024 · 3:53 PM UTC

Sean Welleck

@wellecks

21 Mar 2024

It's often said that "evaluation is easier than generation"... We go one step further: strong evaluators enable generalizing to harder problems! New paper led by @EdwardSun0909 and @scut_longhui Using supervision only on easy problems, 52.5 on MATH with Llemma-34b + re-ranking

Zhiqing Sun

@EdwardSun0909

21 Mar 2024

🌟Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision 🌟 arxiv.org/abs/2403.09472 How can we keep improving AI systems when their capabilities surpass those of human supervisors? (1/n)

19,537

Sean Welleck · Aug 7, 2024 · 6:55 PM UTC

Sean Welleck

@wellecks

7 Aug 2024

How do we optimally use compute at inference time? New paper led by @WYZ0402: "An Empirical Analysis of Compute-Optimal Inference with LMs" arxiv.org/abs/2408.00724 We study scaling laws of inference, finding that smaller models with sophisticated inference are compute-optimal.

10,499

Sean Welleck · Dec 1, 2021 · 12:01 AM UTC

Sean Welleck

@wellecks

1 Dec 2021

MAUVE has received an Outstanding Paper Award at NeurIPS 2021! Honored to be part of a great team -- and an extra congrats to first author @KrishnaPillutla

Gautam Kamath @thegautamkamath

30 Nov 2021

Replying to @thegautamkamath

Outstanding Paper Award 4. MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers, by @KrishnaPillutla, @swabhz, @rown, @jwthickstun, @wellecks, @YejinChoinka, Zaid Harchaoui arxiv.org/abs/2102.01454 (4/n)

Sean Welleck · May 1, 2025 · 4:25 PM UTC

Sean Welleck

@wellecks

1 May 2025

AlphaVerus has been accepted at #ICML2025! alphaverus.github.io/ arxiv.org/abs/2412.06176 We've seen in math that good verification (e.g., Lean) unlocks surprising capabilities–why not for code too? AlphaVerus puts LLMs & Rust’s Verus verifier into a self-improving loop–lots of untapped potential and open problems in this direction!

Sean Welleck

@wellecks

10 Dec 2024

6,885

Sean Welleck · Oct 17, 2023 · 2:15 AM UTC

Sean Welleck

@wellecks

17 Oct 2023

Llemma: open language models for mathematics We train 7B and 34B models on Proofpile II, a 55B token dataset of code, web text, and papers. We make everything publicly available: models, code, data, and evaluation. Excited to have a new platform for research in AI+math!

Zhangir Azerbayev

@zhangir_azerbay

17 Oct 2023

We release Llemma: open LMs for math trained on up to 200B tokens of mathematical text. The performance of Llemma 34B approaches Google's Minerva 62B despite having half the parameters. Models/data/code: github.com/EleutherAI/math-l… Paper: arxiv.org/abs/2310.10631 More ⬇️

17,008

Sean Welleck · May 1, 2025 · 5:02 PM UTC

Sean Welleck

@wellecks

1 May 2025

TURN has been accepted at ICML! arxiv.org/abs/2502.05234 Automatically select the temperature for inference strategies like best-of-N and majority voting

Sean Welleck

@wellecks

11 Feb 2025

6,095

Sean Welleck · Mar 7, 2025 · 5:47 PM UTC

Sean Welleck

@wellecks

7 Mar 2025

The recent Claude 3.7 model from Anthropic lets you control the budget for thinking—how might this work? Check out L1, our fully open recipe for training reasoning models with controllable thinking budgets!

Pranjal Aggarwal ✈️ ICLR'26

@PranjalAggarw16

7 Mar 2025

What if you could control how long a reasoning model “thinks”? Presenting L1-1.5B, an RL-trained reasoning model with:
- controllable thinking length via a prompt
- better performance per token than S1
- better short CoT performance than GPT-4o
cmu-l3.github.io/l1 🧵

10,035

Sean Welleck · Oct 25, 2022 · 5:36 PM UTC

Sean Welleck

@wellecks

25 Oct 2022

new paper: Draft, Sketch, and Prove arxiv.org/abs/2210.12283 A step towards bridging informal and formal mathematical reasoning via language models! LLMs can be used to draft natural mathematical proofs and autoformalize them into high-level sketches that guide a formal prover.

Albert Jiang @AlbertQJiang

25 Oct 2022

Large language models can write informal proofs, translate them into formal ones, and achieve SoTA performance in proving competition-level maths problems! LM-generated informal proofs are sometimes more useful than the human ground truth 🤯 Preprint: arxiv.org/abs/2210.12283 🧵

Sean Welleck · Apr 23, 2025 · 11:46 PM UTC

Sean Welleck

@wellecks

23 Apr 2025

🚨TODAY🚨: Jiewen (@Jiewenhu02) and Thomas (@hanwen_zhu) are presenting miniCTX as an oral presentation at ICLR! miniCTX: Neural Theorem Proving with (Long-)Contexts arxiv.org/abs/2408.03350 Theorem proving beyond competition problems: research-level mathematics, scientific projects, and beyond

4,831

Sean Welleck · May 29, 2025 · 10:08 AM UTC

Sean Welleck

@wellecks

29 May 2025

I was honored to give a talk and a tutorial at the 2nd Conference on Foundation Models and AI Agents for Science (SciFM 2025)!
- Talk: Bridging Informal and Formal Mathematical Reasoning
- Tutorial: Test-Time Scaling for Mathematical Reasoning
Slide links are below

3,468

Sean Welleck · Apr 19, 2024 · 1:41 PM UTC

Sean Welleck

@wellecks

19 Apr 2024

Llama 3 70B in LLMLean! Suggests proofs or next steps that are checked in Lean Try it out with a @togethercompute API key: github.com/cmu-l3/llmlean

8,224

Sean Welleck · Mar 26, 2025 · 4:56 PM UTC

Sean Welleck

@wellecks

26 Mar 2025

New paper on scaling evaluation-time compute — thinking longer leads to better evaluation. Led by @seungonekim! Excited about this new dimension for taking advantage of inference-time compute and reasoning models. arxiv.org/abs/2503.19877

Seungone Kim

@seungonekim

26 Mar 2025

#NLProc New paper on "evaluation-time scaling", a new dimension to leverage test-time compute! We replicate the test-time scaling behaviors observed in generators (e.g., o1, r1, s1) with evaluators by forcing them to generate additional reasoning tokens. arxiv.org/abs/2503.19877

Sean Welleck · Nov 7, 2024 · 2:30 PM UTC

Sean Welleck

@wellecks

7 Nov 2024

Easy-to-Hard Generalization was accepted to NeurIPS! Congrats to @EdwardSun0909 and @scut_longhui! Check out the updated camera-ready version here: openreview.net/pdf?id=qwgfh2…

Sean Welleck · Jan 6, 2025 · 9:58 PM UTC

Sean Welleck

@wellecks

6 Jan 2025

Excited about our new ICLR workshop on AI + Verification! In the age of increasingly capable models, trusting outputs and getting high-quality feedback to improve models are becoming central bottlenecks. Our workshop explores how AI can be combined with formal systems (e.g. program verifiers and proof assistants) or other kinds of verification to bring correctness and high-quality learning signals to code generation, mathematical reasoning, and beyond. Open for submissions! verifai-workshop.github.io/

Wenting Zhao

@wzhao_nlp

6 Jan 2025

📣Announcing VerifAI: AI Verification in the Wild, a workshop at #ICLR2025 VerifAI will gather researchers to explore topics at the intersection of genAI/trustworthyML and verification: verifai-workshop.github.io/ @celine_ylee @theo_olausson @ameeshsh @wellecks @taoyds

Sean Welleck · Feb 11, 2025 · 5:00 PM UTC

Sean Welleck

@wellecks

11 Feb 2025

Very exciting course on LLM Agents!! Looking forward to giving a lecture for the course in April

Dawn Song

@dawnsongtweets

11 Feb 2025

Really excited to announce our Advanced LLM Agents MOOC (Spring 2025)! Building on the success of our LLM Agents MOOC from Fall 2024 (15K+ registered learners, ~9K Discord members, 200K+ lecture views on YouTube), we are excited to extend the MOOC this semester to cover some more advanced topics:
→ Reasoning & planning
→ Multimodal Agents
→ Coding agents, web agents
→ AI for mathematics and theorem proving
→ Agent safety & security, and more
🎥 LIVE every Monday @ 4:10PM PT
✨ Whether you're a student, researcher, developer, practitioner, or AI enthusiast, join us on this exciting journey of shaping the future of LLM Agents!

Sean Welleck · Feb 26, 2025 · 5:28 PM UTC

Sean Welleck

@wellecks

26 Feb 2025

Will future SWE agents be computer-use agents? We explore this shift in Programming with Pixels: an agent environment where agents learn to use an IDE's existing functionality rather than relying on hand-designed tool APIs programmingwithpixels.com/

Pranjal Aggarwal ✈️ ICLR'26

@PranjalAggarw16

26 Feb 2025

What if AI agents did software engineering like humans—seeing the screen & using any developer tool? Introducing Programming with Pixels: an SWE environment where agents control VSCode via screen perception, typing & clicking to tackle diverse tasks. programmingwithpixels.com 🧵

Sean Welleck · Nov 2, 2021 · 3:47 PM UTC

Sean Welleck

@wellecks

2 Nov 2021

Interested in reasoning, scientific discovery, and/or the intersection of NLP & mathematics? Vote for 𝐌𝐚𝐭𝐡𝐍𝐋𝐏: 𝟏𝐬𝐭 𝐖𝐨𝐫𝐤𝐬𝐡𝐨𝐩 𝐨𝐧 𝐌𝐚𝐭𝐡𝐞𝐦𝐚𝐭𝐢𝐜𝐚𝐥 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐏𝐫𝐨𝐜𝐞𝐬𝐬𝐢𝐧𝐠 to appear at a 2022 *CL conference! docs.google.com/forms/d/e/1F…

Attendance Survey: ACL-NAACL-COLING-EMNLP 2022 Workshops

Please help us organize an effective set of workshops for the 2022 *ACL conferences at the right location by completing this survey! The simple survey is due by Wednesday, NOVEMBER 3rd (anywhere on...

docs.google.com

Sean Welleck · Jul 17, 2020 · 3:14 AM UTC

Sean Welleck

@wellecks

17 Jul 2020

I had a great time talking with @seb_ruder about his work on transfer learning - check out Episode 3 of the Thesis Review below!

The Thesis Review Podcast @thesisreview

17 Jul 2020

Episode 3 of The Thesis Review: Sebastian Ruder (@seb_ruder), "Neural Transfer Learning for Natural Language Processing" We discuss transfer learning, including cross-lingual learning & sequential transfer learning, and advice for researchers cs.nyu.edu/~welleck/episode3…

Sean Welleck · Jun 18, 2021 · 4:10 PM UTC

Sean Welleck

@wellecks

18 Jun 2021

New multi-domain NaturalProofs for theorem proving in natural mathematical language:
Statements+proofs from broad (ProofWiki), in-depth (Stacks), real-world (textbook) sources
New retrieval baselines and generation task
arxiv.org/abs/2104.01112 github.com/wellecks/naturalp… (1/8)

Sean Welleck · Dec 7, 2024 · 6:24 PM UTC

Sean Welleck

@wellecks

7 Dec 2024

Pleased to see our tutorial featured on the Institute for Foundations of Machine Learning (IFML, @MLFoundations) webpage! ifml.institute/events/neurip… - Tutorial: cmu-l3.github.io/neurips2024… Tuesday December 10, 1:30-4:00pm @ West Exhibition Hall C, NeurIPS

Sean Welleck

@wellecks

6 Dec 2024

Curious about inference-time scaling, the #1 trending topic in LLMs? Come to our NeurIPS tutorial: Beyond Decoding: Meta-Generation Algorithms for LLMs (Tue. @ 1:30)! cmu-l3.github.io/neurips2024…

Sean Welleck · Jun 19, 2020 · 12:30 AM UTC

Sean Welleck

@wellecks

19 Jun 2020

new project: "The Thesis Review Podcast" I'll interview researchers from around the field, focusing on their PhD thesis work and how their research and perspective have evolved since. Follow along at @thesisreview, I hope you enjoy the conversations!

The Thesis Review Podcast @thesisreview

19 Jun 2020

New podcast🎙️ The Thesis Review brings you interviews with machine learning researchers, with each conversation centered around their PhD thesis. We've got a great set of guests, from newly minted PhDs to senior researchers!

Sean Welleck · Sep 15, 2020 · 12:32 PM UTC

Sean Welleck

@wellecks

15 Sep 2020

our paper on Consistency of a Recurrent LM was accepted to #emnlp2020! (w/ @uralik1, Jaedeok Kim, @yzpang97, @kchonyc ) Stay tuned for the updated version ♾

Sean Welleck · Nov 24, 2021 · 9:46 PM UTC

Sean Welleck

@wellecks

24 Nov 2021

MAUVE -- an automatic evaluation metric for open-ended generation -- will appear at NeurIPS as an oral presentation! Check out our new paper, code, and the summary below 👇 arxiv.org/abs/2102.01454 w/ @KrishnaPillutla, @swabhz, @rown, @jwthickstun, @YejinChoinka, Zaid Harchaoui

MAUVE: Measuring the Gap Between Neural Text and Human Text using...

As major progress is made in open-ended text generation, measuring how close machine-generated text is to human language remains a critical open problem. We introduce MAUVE, a comparison measure...

arxiv.org

Krishna Pillutla @KrishnaPillutla

24 Nov 2021

How can we measure the gap between machine text and human text? We introduce MAUVE, a new comparison measure for open-ended text generation, in our upcoming oral presentation at NeurIPS 2021. Paper: arxiv.org/abs/2102.01454 Pip package: github.com/krishnap25/mauve 1/n

Sean Welleck · May 30, 2022 · 2:49 AM UTC

Sean Welleck

@wellecks

30 May 2022

check out Quark, our new [un]learning algorithm for adjusting & aligning language models!

@_akhaliq

30 May 2022

Quark: Controllable Text Generation with Reinforced Unlearning abs: arxiv.org/abs/2205.13636 introduce Quantized Reward Konditioning (Quark), an algorithm for optimizing a reward function that quantifies an (un)wanted property, while not straying too far from the original model

Sean Welleck · May 25, 2020 · 1:00 PM UTC

Sean Welleck

@wellecks

25 May 2020

"Stolen Probability: A Structural Weakness of Neural Language Models" arxiv.org/pdf/2005.02433.pdf
Embedding norms influence token probabilities due to the pre-softmax dot product.
Tokens inside the convex hull of the token embeddings receive smaller probabilities.
by David Demeter et al
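
The convex-hull claim can be checked on a toy example. The sketch below is not taken from the paper: it assumes random Gaussian embeddings and hidden states, and every name in it is hypothetical. It relies only on the fact that a logit is the dot product of a hidden state with a token embedding. If one embedding is a convex combination of the others, its logit is a weighted average of their logits, so it should never exceed the largest of them, whatever the hidden state is.

```python
import random


def dot(u, v):
    """Plain dot product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def interior_token_never_wins(num_trials=1000, dim=8, vocab=5, seed=0):
    """Return True if a token placed inside the convex hull of the other
    embeddings never gets the strictly largest logit in any trial.

    Toy setup (an assumption, not the paper's experiment): embeddings and
    hidden states are drawn from a standard Gaussian.
    """
    rng = random.Random(seed)

    # Embeddings of the "outer" tokens.
    outer = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(vocab)]

    # Convex weights: non-negative and summing to one.
    weights = [rng.random() for _ in range(vocab)]
    total = sum(weights)
    weights = [w / total for w in weights]

    # The interior token is a convex combination of the outer embeddings.
    interior = [
        sum(w * e[d] for w, e in zip(weights, outer)) for d in range(dim)
    ]

    for _ in range(num_trials):
        hidden = [rng.gauss(0, 1) for _ in range(dim)]
        interior_logit = dot(hidden, interior)
        best_outer_logit = max(dot(hidden, e) for e in outer)
        # Small tolerance for floating-point rounding.
        if interior_logit > best_outer_logit + 1e-9:
            return False
    return True


if __name__ == "__main__":
    print(interior_token_never_wins())
```

Because softmax is monotonic in the logits, the same ordering carries over to probabilities: in this toy setting the interior token is never the most probable one.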

Sean Welleck · May 3, 2021 · 3:07 AM UTC

Sean Welleck

@wellecks

3 May 2021

"How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks" #ICLR by @KeyuluXu et al.
Proves MLPs quickly converge to linear functions outside of the training data range.
Can extrapolate when the task is linear & training data is diverse.
openreview.net/forum?id=UH-c…

Sean Welleck · Mar 31, 2020 · 2:43 PM UTC

Sean Welleck

@wellecks

31 Mar 2020

"TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
Investigates the effect of hard vs. easy tokens on repetition with a variant of the focal loss.
by @Shaojie_Jiang, @Thom_Wolf, @c_monz, @mdr arxiv.org/pdf/2003.11963.pdf

Sean Welleck · Oct 24, 2024 · 11:23 PM UTC

Sean Welleck

@wellecks

24 Oct 2024

Replying to @srush_nlp @gneubig

We also covered this in neural code generation: cmu-codegen.github.io/s2024/…

Sean Welleck · Feb 7, 2019 · 1:52 AM UTC

Sean Welleck

@wellecks

7 Feb 2019

Our new paper "Non-Monotonic Sequential Text Generation" (with @xkianteb, @haldaume3, and @kchonyc) - generating text in learned, non left-to-right orders arxiv.org/abs/1902.02192

Non-Monotonic Sequential Text Generation

Standard sequential generation methods assume a pre-specified generation order, such as text generation methods which generate words from left to right. In this work, we propose a framework for...

arxiv.org

Sean Welleck · Sep 10, 2024 · 7:20 PM UTC

Sean Welleck

@wellecks

10 Sep 2024

We'll also have a NeurIPS 2024 tutorial based on the survey! Stay tuned for more details 👀

Graham Neubig

@gneubig

10 Sep 2024

The Information reports that OpenAI's new "strawberry" product will be in ~2 weeks, using 10-20 seconds of inference time compute: theinformation.com/articles/… If you want to study up on methods for inference time compute, our survey could be useful! arxiv.org/abs/2406.16838

Sean Welleck · Nov 3, 2019 · 8:55 PM UTC

Sean Welleck

@wellecks

3 Nov 2019

"An Empirical Study of Generation Order for Machine Translation" (arxiv.org/pdf/1910.13437.pdf) Nice paper studying effects of varying the generation order used to train an Insertion Transformer by William Chan, Mitchell Stern, Jamie Kiros, Jakob Uszkoreit

Sean Welleck · Jul 27, 2022 · 12:20 AM UTC

Sean Welleck

@wellecks

27 Jul 2022

Check out our NeurIPS workshop on AI for math & reasoning! "Math-AI : Toward Human-Level Mathematical Reasoning" mathai2022.github.io/ excited to co-organize with @lupantech @Swarooprm7 @Yuhu_ai_ @HannaHajishirzi @percyliang

Yuhuai (Tony) Wu

@Yuhu_ai_

27 Jul 2022

🚨We are organizing the 2nd MATHAI workshop at NeurIPS! Check it out if you're interested in AI for math, and machine reasoning in general🤯! We have a great lineup of speakers & panelists! See more in call for papers: 👇 mathai2022.github.io/cfp/

Sean Welleck · Oct 15, 2024 · 1:34 AM UTC

Sean Welleck

@wellecks

15 Oct 2024

We're back! New Thesis Review episode with @niloofar_mire on privacy and LLMs

The Thesis Review Podcast @thesisreview

15 Oct 2024

Episode 47 of The Thesis Review: Niloofar Mireshghallah (@niloofar_mire), "Auditing and Mitigating Safety Risks in Large Language Models" We discuss her journey into research, PhD work on privacy in LLMs, and memorization vs generalization. soundcloud.com/thesis-review…

Sean Welleck · Dec 9, 2019 · 6:46 PM UTC

Sean Welleck

@wellecks

9 Dec 2019

NeurIPS tutorial on "Imitation Learning and its Application to Natural Language Generation" by @haldaume3 and @kchonyc slideslive.com/38921527/imit…

Sean Welleck · Dec 2, 2021 · 1:20 AM UTC

Sean Welleck

@wellecks

2 Dec 2021

our paper on generalization in symbolic mathematics was accepted to #AAAI2022!

Sean Welleck · Jul 17, 2020 · 5:58 PM UTC

Sean Welleck

@wellecks

17 Jul 2020

nice papers on set generation/modeling at the ICML Object-Oriented Learning workshop:
Conditional Set Generation with Transformers slideslive.com/38930876/cond… arxiv.org/pdf/2006.16841.pdf
Generative Adversarial Set Transformers slideslive.com/38930872/gene… github.com/oolworkshop/oolwo…

Adam R. Kosiorek, Hyunjik Kim, Danilo Rezende · Conditional Set Generation with Transformers

slideslive.com

Sean Welleck · Nov 26, 2020 · 1:58 AM UTC

Sean Welleck

@wellecks

26 Nov 2020

Thanks to @adjiboussodieng for the great conversation about her work on deep probabilistic models -- listen below!

The Thesis Review Podcast @thesisreview

26 Nov 2020

Episode 13 of The Thesis Review: Adji Bousso Dieng (@adjiboussodieng), "Deep Probabilistic Graphical Modeling" We discuss models and algorithms for deep PGMs, interpretability & applications, and having an impact through research. soundcloud.com/thesis-review…