Lester Mackey (@LesterMackey) | nitter

Pinned Tweet

Lester Mackey @LesterMackey

Feb 11

@tobias_schrdr and I are excited to share WildCat: Near-Linear Attention in Theory and Practice arxiv.org/abs/2602.10056 By attending over a spectrally-accurate optimally-weighted coreset, WildCat approximates exact attention with super-polynomial error decay in near-linear time

4

11

64

9,001

Lester Mackey @LesterMackey

3 Nov 2023

If you're a PhD student interested in interning with me or one of my amazing colleagues at Microsoft Research New England (@MSRNE, @MSFTResearch) this summer, please apply here jobs.careers.microsoft.com/g…

12

80

395

86,491

Lester Mackey @LesterMackey

20 Oct 2025

If you're a PhD student interested in interning with me or one of my amazing colleagues at Microsoft Research New England (@MSRNE, @MSFTResearch) this summer, please apply here jobs.careers.microsoft.com/g… (If you'd like to work with me, please include my name in your cover letter!)

8

67

424

52,213

Lester Mackey @LesterMackey

30 Oct 2024

If you're a PhD student interested in interning with me or one of my amazing colleagues at Microsoft Research New England (@MSRNE, @MSFTResearch) this summer, please apply here jobs.careers.microsoft.com/g…

6

65

308

48,081

Lester Mackey @LesterMackey

15 Aug 2024

I just want to return my package to Whole Foods 😭

10

16

292

36,848

Lester Mackey @LesterMackey

21 Oct 2022

If you're a PhD student interested in interning with me or one of my amazing colleagues at MSR New England this summer, please apply here careers.microsoft.com/us/en/…

4

65

273

Lester Mackey @LesterMackey

15 Oct 2021

If you're a PhD student interested in interning with me or one of my amazing colleagues at MSR New England this summer, please apply here careers.microsoft.com/us/en/…

6

79

229

Lester Mackey @LesterMackey

12 Feb 2025

Why permute when you can cheaply permute?

2

21

196

28,660

Lester Mackey @LesterMackey

8 Dec 2022

Introducing CheatGPT

5

13

146

Lester Mackey @LesterMackey

14 Dec 2023

If you’d like to join Microsoft Research New England as a researcher in AI / ML / statistics, please apply here: jobs.careers.microsoft.com/g… @MSFTResearch @MSRNE

1

24

129

23,200

Lester Mackey @LesterMackey

18 Feb 2025

New guarantees for approximating attention, accelerating SGD, and testing sample quality in near-linear time

https://arxiv.org/abs/2502.12063

1

13

136

25,762

Lester Mackey @LesterMackey

23 Nov 2022

Call for Machine Learning, AI, & Statistics Researchers at Microsoft Research New England @MSRNE : careers.microsoft.com/us/en/…

2

39

116

Lester Mackey @LesterMackey

29 Dec 2023

Interested in helping to organize @NeurIPSConf 2024? Let us know here: docs.google.com/forms/u/1/d/…

NeurIPS 2024 Organizer Nomination

Please use this form to nominate yourself as an organizer (a.k.a. chair) for the next NeurIPS conference in Dec 2024. Ideal organizers have attended NeurIPS in the past, have organized similar events...

docs.google.com

33

107

43,639

Lester Mackey @LesterMackey

16 Nov 2021

If you're a recent or graduating PhD student interested in postdocing with the ML / stats team at MSR New England, please apply here aka.ms/ml-postdoc-msrne

1

37

93

Lester Mackey @LesterMackey

10 Oct 2024

If you're an undergraduate interested in interning with me or one of my amazing colleagues at Microsoft Research New England (@MSRNE, @MSFTResearch) this summer, please apply here microsoft.com/en-us/research…

Undergraduate Research Internship – Computing - Microsoft Research

Accepting applications for 12-week summer research internships for juniors & senior undergrads w/ demonstrated leadership in diversity.

4

20

76

14,884

Lester Mackey @LesterMackey

7 Dec 2022

Call for machine learning and statistics postdocs at Microsoft Research New England @MSFTResearch careers.microsoft.com/us/en/…

18

72

Lester Mackey @LesterMackey

17 Dec 2024

See you next year @NeurIPSConf !

1

3

53

8,865

Lester Mackey @LesterMackey

15 Dec 2023

Thanks @NeurIPSConf — it’s been a blast. See you all at next year’s conference!

1

37

4,034

Lester Mackey @LesterMackey

3 Jul 2024

Replying to @srush_nlp

For me the Netflix Prize was a bellwether of many ML trends: all of the leading teams used SGD and low-rank approximation for scalable non-convex optimization, trained neural networks for both modeling and ensembling, and fit billion-parameter models to get the best performance

1

4

36

9,918

Lester Mackey @LesterMackey

15 Jun 2023

New @NatureComms work with Soukayna Mouatadid, Paulo Orenstein, @GFlaspohler, @judah47, Miruna Oprescu, Ernest Fraenkel, @UofT, @MSFTResearch, @MSRNE, @AER_inc, @MIT, @SpringerNature Adaptive bias correction for improved subseasonal forecasting rdcu.be/deAJW

1

6

33

6,037

Lester Mackey @LesterMackey

14 Aug 2023

Replying to @bremen79

You can also use “empirical Berry Esseen bounds” to get both finite sample validity and asymptotic efficiency (matching the optimal CLT-based width in the limit) arxiv.org/abs/2208.09922

Efficient Concentration with Gaussian Approximation

Concentration inequalities for the sample mean, like those due to Bernstein, Hoeffding, and Bentkus, are valid for any sample size but overly conservative, yielding confidence intervals that are...

1

2

29

4,470

Lester Mackey @LesterMackey

22 Nov 2021

If you're a PhD student interested in a spring internship exploring fairness in clinical trials and A/B experimentation, please apply here careers.microsoft.com/studen…

7

26

Lester Mackey @LesterMackey

14 Feb 2024

I've decided to solve the most pressing problems of our time using language models, for example,

1

20

2,163

Lester Mackey @LesterMackey

23 Jul 2025

Replying to @docmilanfar

This follow-up article also surveys recent applications in probabilistic inference, computational statistics, and machine learning arxiv.org/pdf/2105.03481

3

19

3,010

Lester Mackey @LesterMackey

18 Oct 2024

Replying to @minilek

I’ll never be too old for this:

1

17

1,207

Lester Mackey @LesterMackey

7 Oct 2024

If you’re attending @COLM_conf this week, check out Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation (w/ @ericzelikman @AdamKalai)

1

17

1,198

Lester Mackey @LesterMackey

11 Jun 2024

🤔

1

14

1,423

Lester Mackey @LesterMackey

15 Aug 2024

Replying to @LesterMackey @elmelis

Well, it was worth a try

1

1

14

1,350

Lester Mackey @LesterMackey

27 Aug 2021

Check out this amazing work by Myra Cheng! (@mariadearteaga, @adamfungi, and I helped too)

Maria De-Arteaga @mariadearteaga

27 Aug 2021

📣New research📣"Social Norm Bias: Residual Harms of Fairness-Aware Algorithms” led by undergrad Myra Cheng (applying to PhDs soon!) w/ @adamfungi @lestermackey🧵 arxiv.org/abs/2108.11056

1

12

Lester Mackey @LesterMackey

15 Aug 2024

Replying to @aminkarbasi

Yeah! It somehow knew that I wanted to buy a statistics for social good t-shirt, mug, sticker, and book

11

1,498

Lester Mackey @LesterMackey

5 Oct 2023

Don’t just improve your code; improve your code improver!

Eric Zelikman

@ericzelikman

5 Oct 2023

“Recursive self-improvement” (RSI) is one of the oldest ideas in AI. Can language models write code that recursively improves itself? Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation w/@elianalorch, @LesterMackey, @adamfungi (1/n)

Pipeline figure for STOP. On the left, improver_0 improves itself to become improver_1, etc. until improver_T. On the right, improver_0 is expanded to visualize that improver_0, the seed improver, takes a program and returns the best improvement the language model generates.

4

11

5,787

Lester Mackey @LesterMackey

8 Mar 2022

Thanks @MayaAjmera ! #TuesdayTrailblazers

Maya Ajmera @MayaAjmera

8 Mar 2022

For this edition of #TuesdayTrailblazers, meet 2003 Science Talent Search and International Science and Engineering Fair alum, @LesterMackey

1

11

Lester Mackey @LesterMackey

5 Oct 2023

Replying to @peteratmsr

Thanks Peter!

3

66

Lester Mackey @LesterMackey

15 Aug 2024

Also curious how @amazon placed me at Morehouse 🤔

8

1,618

Lester Mackey @LesterMackey

18 Aug 2021

Exciting new work with @raazdwivedi; let us know what you think!

Raaz Dwivedi

@raazdwivedi

18 Aug 2021

Super excited to present “Kernel Thinning”, a new procedure for compressing a distribution more effectively than i.i.d. sampling or standard MCMC thinning. w/ @LesterMackey, today 2PM ET@#MCM, tomorrow 10AM/Noon@#COLT2021 arxiv.org/abs/2105.05842 Video: learningtheory.org/colt2021/…

1

9

Lester Mackey @LesterMackey

11 Oct 2025

Only in LA 🐢

1

9

521

Lester Mackey @LesterMackey

13 Dec 2022

Could this be our future? #ChADGPT

1

1

9

Lester Mackey @LesterMackey

15 Jun 2021

My first tweet 😀

Genevieve Flaspohler @GFlaspohler

15 Jun 2021

Our #ICML2021 paper unifies optimistic and delayed online learning to develop optimal algorithms with no hyperparameters to tune (w/ applications in subseasonal forecasting) arxiv.org/abs/2106.06885 w/ @bremen79 @judah47 S Mouatadid @MirunaOprescu P Orenstein @LesterMackey 🧵

1

7

Lester Mackey @LesterMackey

1 Oct 2025

I’m pretty sure this is what they designed Sora 2 for (sound on) @raazdwivedi @AShettyV

1

8

755

Lester Mackey @LesterMackey

18 Feb 2025

Replying to @LesterMackey @annabelle_cs @GongAlbert @AShettyV @raazdwivedi

Inspired by the groundbreaking work of @insu_han @daliri__majid @aminkarbasi @WentaoGuo7 @chrismdesa @afedercooper @cdomingoenrich

1

1

8

1,775

Lester Mackey @LesterMackey

15 Sep 2025

Come intern with us at @MSRNE!

Microsoft Research

@MSFTResearch

12 Sep 2025

The Microsoft Research Undergraduate Internship Program offers 12-week internships in our Redmond, NYC, or New England labs for rising juniors and seniors who are passionate about technology. Apply by October 6: msft.it/6015scgSJ

1

8

1,521

Lester Mackey @LesterMackey

3 Jul 2024

Replying to @LesterMackey @srush_nlp

The competition also ushered in the new era of parallel and distributed ML. My teammates and I bought some of the first Intel quad-core processors to work on the competition, and I remember writing some of the first Spark code with @matei_zaharia to scale beyond that.

8

542

Lester Mackey @LesterMackey

16 Sep 2024

Replying to @sp_monte_carlo

There’s a related inequality in arxiv.org/abs/1201.6002

8

519

Lester Mackey @LesterMackey

4 Apr 2023

Replying to @KevinKaichuang

Do you mean overall or per day?

1

6

497

Lester Mackey @LesterMackey

15 Dec 2023

6

250

Lester Mackey @LesterMackey

8 Dec 2022

Inspired in no small way by ChaatGPT @matei_zaharia

5

Lester Mackey @LesterMackey

5 Oct 2022

Replying to @risteski_a

This looks great @risteski_a ! See also Minimum Stein Discrepancy Estimators (arxiv.org/pdf/1906.08283.pdf) for an analysis of score matching (and related procedures like diffusion score matching)

2

6

Lester Mackey @LesterMackey

14 Dec 2024

Replying to @immonicax @elonmusk @drfeifei @IValeraM @WilliamWangNLP

@immonica please see the response from @NeurIPSConf here:

NeurIPS Conference

@NeurIPSConf

14 Dec 2024

NeurIPS acknowledges that the cultural generalization made by the keynote speaker today reinforces implicit biases by making generalisations about Chinese scholars. This is not what NeurIPS stands for. NeurIPS is dedicated to being a safe space for all of us. We want to address the comment made during the invited talk this afternoon, as it is something that NeurIPS does not condone and it doesn't align with our code of conduct. We are addressing this issue with the speaker directly. NeurIPS is dedicated to being a diverse and inclusive place where everyone is treated equally.

1

6

1,410

Lester Mackey @LesterMackey

27 Apr 2022

Replying to @KevinKaichuang @MSFTResearch

Wow, I can’t believe it’s been two years… especially because I still haven’t seen you at work

1

4

Lester Mackey @LesterMackey

23 Jul 2025

Replying to @docmilanfar

Andrew Barbour’s generator method also gives you a practical way to create Stein operators for any distribution

6

306

Lester Mackey @LesterMackey

3 Aug 2023

Replying to @BernoulliSoc

Thanks @BernoulliSoc!

1

6

281

Lester Mackey @LesterMackey

16 Oct 2025

Replying to @zdhnarsil

Here’s the rap version (sound on)

2

1

6

542

Lester Mackey @LesterMackey

18 Feb 2025

Code coming soon! github.com/microsoft/thinfor… github.com/microsoft/khsgd github.com/microsoft/deepctt

5

1,138

Lester Mackey @LesterMackey

12 Dec 2023

Replying to @irenetrampoline

Wait, how do you get a mug?

1

5

650

Lester Mackey @LesterMackey

19 May 2022

Replying to @nfusi

Thanks @nfusi!

1

5

Lester Mackey @LesterMackey

9 Dec 2022

Well this is surprising #ChatGPT

3

5

Lester Mackey @LesterMackey

14 Jul 2025

Replying to @annabelle_cs

1

5

336

Lester Mackey @LesterMackey

12 Dec 2023

So, which posters should I check out at NeurIPS Tuesday poster session 2?

2

4

1,749

Lester Mackey @LesterMackey

12 Feb 2025

Replying to @LesterMackey @elmelis @roydanroy @cdomingoenrich @raazdwivedi

I was watching the season finale of Severance and didn’t want the screenshot noise to ruin the experience

4

321

Lester Mackey @LesterMackey

29 Sep 2023

Undergrads, get your MSR internship applications in by November 6!

Kevin K. Yang 楊凱筌 @KevinKaichuang

28 Sep 2023

Come do an undergraduate research internship at MSR! microsoft.com/en-us/research…

3

1,037

Lester Mackey @LesterMackey

4 Sep 2024

Replying to @abeirami

This reminds me of the tool that Joel Tropp used to derive "intrinsic dimension" concentration inequalities for random matrices in apps.dtic.mil/sti/tr/pdf/ADA…

1

4

295

Lester Mackey @LesterMackey

25 Aug 2024

Replying to @LihongLi20

I have a feature request 😀

Lester Mackey @LesterMackey

15 Aug 2024

I just want to return my package to Whole Foods 😭

3

960

Lester Mackey @LesterMackey

4 Nov 2023

Replying to @Avipartho1 @MSRNE @MSFTResearch

These letters often come from a PhD advisor or other collaborators or research mentors who can speak to your research experience

1

3

3,383

Lester Mackey @LesterMackey

11 Aug 2022

Replying to @KLdivergence @BetsyOgburn @nataliexdean @daniela_witten

Love this! Congratulations again! (Also, this is tweet number 34)

4

Lester Mackey @LesterMackey

15 Dec 2023

Replying to @MuratAErdogdu

Great suggestion!

3

94

Lester Mackey @LesterMackey

31 Oct 2023

Replying to @GesineReinert @BernoulliSoc

Congratulations Gesine; your work is inspirational!

1

3

140

Lester Mackey @LesterMackey

24 Sep 2024

Replying to @fx_briol

Is this pronounced “Doctor MMD”? (I hope so)

1

3

124

Lester Mackey @LesterMackey

8 Oct 2021

Replying to @roydanroy

MSR New Englanders are exceptionally collaborative, so my colleagues are my collaborators!

3

Lester Mackey @LesterMackey

14 Nov 2022

Replying to @KevinKaichuang

Sometimes I wear jeans with a belt

3

3

Lester Mackey @LesterMackey

1 Oct 2025

Imagining Ne Zha 3

1

3

526

Lester Mackey @LesterMackey

17 May 2024

Replying to @daniela_witten

Reminds me of when someone opened our package of microwave popcorn and left half of the bags for us to enjoy

3

562

Lester Mackey @LesterMackey

16 Aug 2024

Replying to @daniela_witten @amazon

They can make up whatever backstory they’d like if it saves me a trip to Kohls.

2

273

Lester Mackey @LesterMackey

5 Oct 2023

Replying to @songsteven2 @macfound

Thanks @songsteven2!

1

20

Lester Mackey @LesterMackey

8 Mar 2025

Replying to @lorin_crawford

Thanks Lorin!

3

143

Lester Mackey @LesterMackey

5 Oct 2023

Replying to @franklinleonard @netflix @macfound

I still remember our conversations, Franklin, and I’m glad to see all of the success that The Black List has had!

1

211

Lester Mackey @LesterMackey

12 Feb 2025

Replying to @elmelis @roydanroy @cdomingoenrich @raazdwivedi

I’d like to pretend that I added it on purpose, but, in reality, I accidentally took the screenshot while pressing the mute button

1

3

356

Lester Mackey @LesterMackey

9 Jun 2023

Call for NeurIPS Ethics Reviewers neurips.cc/Conferences/2023/…

2

1

1,506

Lester Mackey @LesterMackey

12 Feb 2025

Replying to @roydanroy @cdomingoenrich @raazdwivedi

Thanks! I'm surprised no one has commented on the Mute symbol 😀

1

3

688

Lester Mackey @LesterMackey

7 Nov 2023

Replying to @lmeyerov @TheOfficialACM @splashcon

I’m proud to say that I’ve shared two houses and two initials with this guy

1

2

241

Lester Mackey @LesterMackey

20 May 2022

Replying to @nancybaym

Thanks @nancybaym! I'm looking forward to h̶a̶n̶g̶i̶n̶g̶ ̶o̶u̶t̶ working with you in person one day soon

3

Lester Mackey @LesterMackey

14 Feb 2024

In the #realworldproblem motivating this question, my only option was a Church's Chicken in Costa Rica

1

79

Lester Mackey @LesterMackey

13 Dec 2022

Replying to @lizbwood

@KevinKaichuang

3

Lester Mackey @LesterMackey

10 Sep 2024

Replying to @KevinKaichuang

I think you mean a whole nother level

3

1,052

Lester Mackey @LesterMackey

12 Dec 2022

Introducing CandidGPT

3

Lester Mackey @LesterMackey

5 Oct 2023

Replying to @lmeyerov

Thanks LM1! I still miss our houses

2

142

Lester Mackey @LesterMackey

25 Jul 2025

Replying to @james_y_zou

Go James go!

1

230

Lester Mackey @LesterMackey

16 Oct 2025

Replying to @YuanqiD @zdhnarsil

I’ve been searching for the killer app, and I think rapping about CCDD might be it!

2

43

Lester Mackey @LesterMackey

4 Oct 2022

How to catch a man (according to LaMDA) #aitestkitchen

1

1

2

Lester Mackey @LesterMackey

29 Nov 2022

Replying to @JohnCLangford @jordan_t_ash @koulanurag @awjuliani

I'm very curious about the <mystery> hire!

2

Lester Mackey @LesterMackey

5 Nov 2023

Replying to @ErickRodOrd @MSRNE @MSFTResearch

Yes, it is open to international students

1

1

732

Lester Mackey @LesterMackey

15 Aug 2024

Replying to @elmelis

Brilliant! I’ll try that next

1

2

1,131

Lester Mackey @LesterMackey

15 Jun 2023

tl;dr: A low-cost machine learning correction for physics-based dynamical models improves subseasonal forecasting of temperature and precipitation two to six weeks ahead.

2

269

Lester Mackey @LesterMackey

5 Oct 2023

Replying to @kklmmr

Thanks Konstantin!

1

133

Lester Mackey @LesterMackey

5 Oct 2023

Replying to @BeEngelhardt @lorin_crawford

Thanks Barbara!

1

62

Lester Mackey @LesterMackey

20 Jul 2022

Replying to @KevinKaichuang

On a related note, would you mind hiring me as an intern?

1

2

Lester Mackey @LesterMackey

8 Dec 2022

Replying to @KevinKaichuang

The leaf sheep finally has some competition! boredpanda.com/leaf-sheep-se…

2

Lester Mackey @LesterMackey

24 Mar 2023

Replying to @sigact

Congratulations @minilek !

1

450

Lester Mackey @LesterMackey

13 Dec 2022

2

Lester Mackey @LesterMackey

1 Nov 2023

Replying to @shortstein

The matrix Hoeffding inequality (Cor. 4.2 of arxiv.org/pdf/1201.6002.pdf) would give you P( twonorm(sum_i X_i) >= t ) <= (d+1) exp( - t^2 / sum_i c_i^2 ) . There are also ways to get rid of that d multiplier

2

628

Lester Mackey @LesterMackey

9 Aug 2023

Replying to @KevinKaichuang

You will be missed!

1

2

268

Lester Mackey @LesterMackey

6 Aug 2021

Replying to @KevinKaichuang

Oh he knows -- they always know

1

2