Ben Poole · May 19, 2026 · 5:57 PM UTC

Ben Poole

Pinned Tweet

Ben Poole

@poolio

May 19

Real-world models are here! Stoked to share how we're bringing real-world locations to life by integrating Street View into Genie. Try it now at labs.google/fx/projectgenie and read the blog for more info: blog.google/innovation-and-a…

618

222,251

Ben Poole · Sep 29, 2022 · 8:01 PM UTC

Ben Poole

@poolio

29 Sep 2022

Happy to announce DreamFusion, our new method for Text-to-3D! dreamfusion3d.github.io We optimize a NeRF from scratch using a pretrained text-to-image diffusion model. No 3D data needed! Joint work w/ the incredible team of @BenMildenhall @ajayj_ @jon_barron #dreamfusion

127

1,388

5,458

Ben Poole · Dec 13, 2024 · 10:24 PM UTC

Ben Poole

@poolio

13 Dec 2024

How to upset the (few remaining) neuroscientists at NeurIPS 101

130

2,116

296,131

Ben Poole · Oct 5, 2022 · 5:35 PM UTC

Ben Poole

@poolio

5 Oct 2022

Stoked to share our work on Imagen Video! Diffusion models continue to unlock new possibilities for generative creativity: 3D with #DreamFusion last week, video with #ImagenVideo today 😎

Jonathan Ho @hojonathanho

5 Oct 2022

Excited to announce Imagen Video, our new text-conditioned video diffusion model that generates 1280x768 24fps HD videos! #ImagenVideo imagen.research.google/video… Work w/ @wchan212 @Chitwan_Saharia @jaywhang_ @RuiqiGao @agritsenko @dpkingma @poolio @mo_norouzi @fleet_dj @TimSalimans

450

Ben Poole · Jun 25, 2018 · 2:20 PM UTC

Ben Poole

@poolio

25 Jun 2018

Happy to announce that I’m officially a doctor and will be joining Google Brain today! After an awesome time adventuring, I’m excited to get back to work on understanding and advancing artificial intelligence.

413

Ben Poole · Oct 11, 2022 · 12:08 PM UTC

Ben Poole

@poolio

11 Oct 2022

Text-to-3D synthesis from "a DSLR photo of a hand drawing a picture of a hand with a pencil" Doesn't quite master the recursion, but love the quality of the floating hand. #dreamfusion

396

Ben Poole · Oct 27, 2017 · 7:35 PM UTC

Ben Poole

@poolio

27 Oct 2017

Ridiculously good looking GAN results from Karras et al. (NVIDIA) by progressively growing the network: research.nvidia.com/sites/de…

176

390

Ben Poole · Oct 4, 2022 · 1:09 PM UTC

Ben Poole

@poolio

4 Oct 2022

What does research look like when you can no longer read all the relevant research papers? I used to read the daily feed, but now it's a full-time job just to read all the abstracts.

Mario Krenn

@MarioKrenn6240

4 Oct 2022

The number of AI papers on arXiv per month grows exponentially with doubling rate of 24 months. How can we cope with this? AI itself can help, by predicting & suggesting new research directions. Predicting the Future of AI with AI: arxiv.org/abs/2210.00881

381

Ben Poole · Jan 30, 2025 · 6:30 PM UTC

Ben Poole

@poolio

30 Jan 2025

Brush🖌️ is now a competitive 3D Gaussian Splatting engine for real-world data and supports dynamic scenes too! Check out the release notes here: github.com/ArthurBrussee/bru…

383

28,842

Ben Poole · Nov 28, 2024 · 2:45 AM UTC

Ben Poole

@poolio

28 Nov 2024

Stop watching videos, start interacting with worlds. Stoked to share CAT4D, our new method for turning videos into dynamic 3D scenes that you can move through in real-time!

362

45,953

Ben Poole · May 17, 2024 · 2:47 AM UTC

Ben Poole

@poolio

17 May 2024

Excited to share our work on image-to-3D scene generation: cat3d.github.io CAT3D uses a multi-view diffusion model to generate novel views, and just inputs these to NeRF/3DGS. Create anything in 3D in 1 minute!

367

48,148

Ben Poole · Mar 6, 2017 · 2:44 AM UTC

Ben Poole

@poolio

6 Mar 2017

Evolution is catching up to intelligent design for neural net architectures (94.6% vs. 96.7% on CIFAR-10): arxiv.org/abs/1703.01041

144

320

Ben Poole · Dec 19, 2017 · 1:08 AM UTC

Ben Poole

@poolio

19 Dec 2017

Successfully defended my PhD today! Thanks to @SuryaGanguli @drfeifei @ermonste @dyamins and Tom Clandinin for taking it easy on me :)

323

Ben Poole · Jun 23, 2017 · 1:19 AM UTC

Ben Poole

@poolio

23 Jun 2017

Free lunch theorem: for any idea, there exists a dataset where that idea performs well.

127

318

Ben Poole · Aug 29, 2020 · 12:17 AM UTC

Ben Poole

@poolio

29 Aug 2020

no thank you #neuralink

293

Ben Poole · Jul 8, 2020 · 2:25 AM UTC

Ben Poole

@poolio

8 Jul 2020

TIL tf.image.resize != torchvision.transforms.Resize unless you set antialias=True. Something to check when porting and comparing models between frameworks 🙃

306

Ben Poole · Jan 16, 2017 · 7:06 PM UTC

Ben Poole

@poolio

16 Jan 2017

We just released an example notebook for unrolled GANs on github! Very easy to implement using TF's graph_replace: github.com/poolio/unrolled_g…

136

304

Ben Poole · Jun 25, 2019 · 7:50 PM UTC

Ben Poole

@poolio

25 Jun 2019

Want to estimate or optimize mutual information using neural networks and the latest variational bounds? Check out our Colab notebook for implementations and experiments! Colab: colab.research.google.com/gi… Paper: arxiv.org/abs/1905.06922

303

Ben Poole · Jan 5, 2021 · 10:09 PM UTC

Ben Poole

@poolio

5 Jan 2021

🤯

293

Ben Poole · Jun 4, 2019 · 3:42 AM UTC

Ben Poole

@poolio

4 Jun 2019

Big hierarchical VQ-VAEs with autoregressive priors do amazing things. Awesome work from @catamorphist @avdnoord @OriolVinyalsML: arxiv.org/abs/1906.00446

300

Ben Poole · Jul 8, 2019 · 5:57 AM UTC

Ben Poole

@poolio

8 Jul 2019

BigBiGAN shows that "progress in image generation quality translates to substantially improved representation learning performance." Competitive w/self-supervised approaches on ImageNet. The cycle from generative models to other methods and back again continues.

Brundage Bot @BrundageBot

8 Jul 2019

Replying to @BrundageBot

Large Scale Adversarial Representation Learning. Jeff Donahue and Karen Simonyan arxiv.org/abs/1907.02544

286

Ben Poole · Mar 21, 2023 · 8:21 PM UTC

Ben Poole

@poolio

21 Mar 2023

Excited to share that DreamFusion has won an Outsanding Paper Award at #ICLR2023: blog.iclr.cc/2023/03/21/anno… Thanks to amazing coauthors @BenMildenhall @ajayj_ @jon_barron and great feedback from colleagues and reviewers that improved the paper. See y'all in Rwanda!

Ben Poole

@poolio

29 Sep 2022

282

33,156

Ben Poole · Feb 7, 2018 · 10:03 PM UTC

Ben Poole

@poolio

7 Feb 2018

Cool work on opening closed eyes with GANs: bdol.github.io/exemplar_gans… Would love to see this productionized so I don't need to worry about staring into the sun, blinking, or sleeping in lectures.

263

Ben Poole · Nov 4, 2025 · 4:08 AM UTC

Ben Poole

@poolio

4 Nov 2025

Join our team and build the future of generative worlds! We are at an incredibily exciting moment where research prototypes are becoming useful technology for capture, creation, and interaction in 3D worlds.

Jon Barron

@jon_barron

3 Nov 2025

We're hiring for full-time roles in NYC and SF, link to the listing is below.

283

43,882

Ben Poole · Nov 5, 2019 · 5:54 PM UTC

Ben Poole

@poolio

5 Nov 2019

peer review in machine learning is broken #ICLR2020

262

Ben Poole · Dec 21, 2018 · 6:19 AM UTC

Ben Poole

@poolio

21 Dec 2018

unfortunate ICLR metareview typo: "slightly under the acceptance trashhold"

263

Ben Poole · Aug 5, 2025 · 2:21 PM UTC

Ben Poole

@poolio

5 Aug 2025

Stoked to share our work on realtime interactive video models 🌎🕹️🎉

Google DeepMind

@GoogleDeepMind

5 Aug 2025

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

270

14,240

Ben Poole · Jan 19, 2019 · 6:39 AM UTC

Ben Poole

@poolio

19 Jan 2019

Overly useful ML hack: to increase the gradient for a parameter w by a factor k, divide the initial value by k and scale w by k before using it: w = Variable(w0) → w = k * Variable(w0/k)

256

Ben Poole · Sep 30, 2022 · 7:47 AM UTC

Ben Poole

@poolio

30 Sep 2022

"bear trying a too soft mattress" #dreamfusion

240

Ben Poole · Jun 14, 2022 · 10:24 PM UTC

Ben Poole

@poolio

14 Jun 2022

"a photo of a red panda reading the research paper 'attention is all you need'" #Imagen

246

Ben Poole · May 20, 2025 · 10:01 PM UTC

Ben Poole

@poolio

20 May 2025

critical capybara #veo3

242

23,044

Ben Poole · Nov 26, 2024 · 2:57 AM UTC

Ben Poole

@poolio

26 Nov 2024

Come intern at Google DeepMind in 2025! We've got a rad generative 3D crew in SF 🤓

Jon Barron

@jon_barron

25 Nov 2024

Our group at Google DeepMind is now accepting intern applications for summer 2025. Attached is the official "call for interns" email; the links and email aliases that got lost in the screenshot are below.

248

37,001

Ben Poole · May 22, 2024 · 7:48 PM UTC

Ben Poole

@poolio

22 May 2024

time for a nap

237

16,638

Ben Poole · Feb 15, 2024 · 8:57 PM UTC

Ben Poole

@poolio

15 Feb 2024

the best current method for text-to-3d scenes is text-to-video followed by 3D reconstruction

Ben Mildenhall

@BenMildenhall

15 Feb 2024

will it nerf? yep ✅ congrats to @_tim_brooks @billpeeb and colleagues, absolutely incredible results!!

229

31,126

Ben Poole · Dec 7, 2018 · 7:07 PM UTC

Ben Poole

@poolio

7 Dec 2018

Interested in deep learning, mutual information, and variational bounds? Come check out my poster w/ @sherjilozair @avdnoord @alemi and @georgejtucker at 17:30 in the #NeurIPS2018 Bayesian Deep Learning workshop!

230

Ben Poole · Jul 16, 2019 · 12:23 AM UTC

Ben Poole

@poolio

16 Jul 2019

It's impossible to keep up with papers, but please take a few days to review literature and ask around before spending months on a research project #reviewer2 #neurips2019

230

Ben Poole · Feb 21, 2018 · 2:15 AM UTC

Ben Poole

@poolio

21 Feb 2018

speech cloning + few shot image generation = unlimited fake video content of anyone doing anything. deepfakes was just the tip of the iceberg.

Baidu Research

@BaiduResearch

21 Feb 2018

Our neural network based system learned to "clone" a voice with less than a minute of audio data from the speaker. Check out our paper to find out more about this latest breakthrough in speech synthesis. #DeepLearning #MachineLearning #AI bit.ly/2GvGhBP

120

222

Ben Poole · Dec 9, 2017 · 8:12 PM UTC

Ben Poole

@poolio

9 Dec 2017

A voice of reason at the BigNeuro panel: "we are very very far from human-level AI... maybe decades or centuries" - Yoshua Bengio #NIPS2017

216

Ben Poole · Aug 31, 2022 · 11:41 PM UTC

Ben Poole

@poolio

31 Aug 2022

Replying to @ericjang11

For diffusion models you can just combine the score functions! See e.g. arxiv.org/abs/2206.01714 I'm guessing this is how MJ integrated SD so quickly: classifier/score-based guidance makes for easy composability of models and signals.

202

Ben Poole · Sep 29, 2022 · 8:17 PM UTC

Ben Poole

@poolio

29 Sep 2022

DreamFusion generates 3D models from diverse text prompts. Check out our gallery of hundreds of 3D models: dreamfusion3d.github.io/gall…

191

Ben Poole · Nov 2, 2017 · 5:07 AM UTC

Ben Poole

@poolio

2 Nov 2017

New paper on an information-theoretic framework for understanding VAEs! Points to challenges and new directions. arxiv.org/abs/1711.00464

191

Ben Poole · Dec 23, 2016 · 10:09 PM UTC

Ben Poole

@poolio

23 Dec 2016

Denoising autoencoders fit to medical records learn a representation that predicts patient outcome: nature.com/articles/srep2609…

187

Ben Poole · Dec 14, 2022 · 2:44 AM UTC

Ben Poole

@poolio

14 Dec 2022

Congrats to the Luma team on reproducing and launching DreamFusion just 2 months after paper release! Looking forward to seeing what folks create. Checkout our work to learn about the method behind this tech: dreamfusion3d.github.io

DreamFusion: Text-to-3D using 2D Diffusion

We combine neural rendering with a multi-modal text-to-2D image diffusion generative model to synthesize diverse 3D objects from text.

dreamfusion3d.github.io

Luma

@LumaLabsAI

14 Dec 2022

✨ Introducing Imagine 3D: a new way to create 3D with text! Our mission is to build the next generation of 3D and Imagine will be a big part of it. Today Imagine is in early access and as we improve we will bring it to everyone captures.lumalabs.ai/imagine

187

Ben Poole · Nov 8, 2018 · 4:43 PM UTC

Ben Poole

@poolio

8 Nov 2018

Derive GANs from mutual information! Y ~ Bernoulli(1/2), M = Y * P + (1-Y) * Q (equal mixture of P and Q). JS(P; Q) = I(Y; M) = H(Y) - H(Y|M) = log(2) - E[log p(y|m)] = log(2) - E[log q(y|m)] + E[KL(p(y|m) || q(y|m))] >= log(2) - E[log q(y|m)]

183

Ben Poole · Jan 5, 2021 · 9:34 PM UTC

Ben Poole

@poolio

5 Jan 2021

Reviewer 2: proposed method shows no improvement on ImageNet, weak reject

189

Ben Poole · May 12, 2021 · 3:24 AM UTC

Ben Poole

@poolio

12 May 2021

Diffusion models rock: stable training, high quality samples, improved diversity, and moderately fast sampling. Awesome work from @prafdhar and @unixpickle showing that improved diffusion architectures + classifier guidance outperforms GANs on ImageNet.

Aran Komatsuzaki

@arankomatsuzaki

12 May 2021

Diffusion Models Beat GANs on Image Synthesis Achieves 3.85 FID on ImageNet 512×512 and matches BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. arxiv.org/abs/2105.05233

185

Ben Poole · Jan 26, 2017 · 9:50 PM UTC

Ben Poole

@poolio

26 Jan 2017

New notebook implementing Adversarial Variational Bayes in TensorFlow: gist.github.com/poolio/b71eb…

181

Ben Poole · Dec 16, 2024 · 11:00 PM UTC

Ben Poole

@poolio

16 Dec 2024

donut house flythrough #veo2

175

11,246

Ben Poole · May 20, 2025 · 8:06 PM UTC

Ben Poole

@poolio

20 May 2025

we have liftoff #veo3

183

10,989

Ben Poole · Oct 13, 2023 · 11:32 PM UTC

Ben Poole

@poolio

13 Oct 2023

So many incredible generative 3D papers submitted to ICLR! One year after DreamFusion and folks have improved efficiency to 20 seconds: instant-3d.github.io/ 🤯 Happy weekend reading 🙃 openreview.net/group?id=ICLR…

181

41,216

Ben Poole · Oct 6, 2022 · 3:15 PM UTC

Ben Poole

@poolio

6 Oct 2022

Happy one week anniversary to #DreamFusion (dreamfusion3d.github.io/) 🥳 Thanks to GitHub user ashawkey, you can try it out now: github.com/ashawkey/stable-d… Amazed at the speed of the open source community, and power of open diffusion models. Can't wait to see what people create!

@_akhaliq

6 Oct 2022

A implementation of text-to-3D dreamfusion, powered by stable diffusion github: github.com/ashawkey/stable-d…

169

Ben Poole · Dec 13, 2024 · 10:29 PM UTC

Ben Poole

@poolio

13 Dec 2024

10 years ago I was working on deep models for single neurons and couldn't believe this slide. Crazy how right @ilyasut has been about AI progress, but neuroscience is still so hard.

162

19,004

Ben Poole · May 20, 2025 · 8:59 PM UTC

Ben Poole

@poolio

20 May 2025

knock knock #veo3

171

18,722

Ben Poole · Dec 3, 2020 · 3:59 AM UTC

Ben Poole

@poolio

3 Dec 2020

This ain't right. Timnit is one of the most amazing researchers and authentic humans in our field, and we were blessed to have her at Google. We have to do better.

@timnitGebru (@dair-community.social/bsky.social)

@timnitGebru

3 Dec 2020

Apparently my manager’s manager sent an email my direct reports saying she accepted my resignation. I hadn’t resigned—I had asked for simple conditions first and said I would respond when I’m back from vacation. But I guess she decided for me :) that’s the lawyer speak.

150

Ben Poole · Oct 6, 2022 · 3:31 PM UTC

Ben Poole

@poolio

6 Oct 2022

We have been calling this issue where the learned 3D model has multiple faces the Janus problem (en.wikipedia.org/wiki/Janus) h/t @jon_barron View-dependent prompting helps, but doesn't solve it in all cases as seen with the DreamFusion model of the squirrel below.

@_akhaliq

6 Oct 2022

Replying to @_akhaliq

Failure cases: "A DSLR photo of a squirrel"

159

Ben Poole · Dec 2, 2024 · 5:00 PM UTC

Ben Poole

@poolio

2 Dec 2024

Woohoo, big congrats to the World Labs team! Tech looks similar to CAT3D (cat3d.github.io): multi-view diffusion model + 3DGS, maybe w/360 data + depth priors. To bring these worlds to life with dynamics, check out our new work on CAT4D: cat-4d.github.io 😺

CAT3D: Create Anything in 3D with Multi-View Diffusion Models

Advances in 3D reconstruction have enabled high-quality 3D capture, but require a user to collect hundreds to thousands of images to create a 3D scene. We present CAT3D, a method for creating...

cat3d.github.io

World Labs

@theworldlabs

2 Dec 2024

We’ve been busy building an AI system to generate 3D worlds from a single image. Check out some early results on our site, where you can interact with our scenes directly in the browser! worldlabs.ai/blog 1/n

157

15,762

Ben Poole · Dec 11, 2020 · 5:50 PM UTC

Ben Poole

@poolio

11 Dec 2020

"Racism is a well-oiled machine. It really doesn't require people to proactively do much at this point, it can perpetuate itself... AI has lubricated that, it's like using WD-40 on this extremely well-lubricated machine to begin with." - @red_abebe at @ResistanceAI panel

134

Ben Poole · Jun 12, 2019 · 9:53 PM UTC

Ben Poole

@poolio

12 Jun 2019

Come learn about variational bounds of mutual information tomorrow (Thursday) at #ICML2019, 4:40pm in the Grand Ballroom or drop by poster #86 at 6:30pm! Joint work w/awesome collaborators @sherjilozair @avdnoord @alemi @georgejtucker arxiv.org/abs/1905.06922

158

Ben Poole · Oct 24, 2017 · 6:23 AM UTC

Ben Poole

@poolio

24 Oct 2017

Woohoo! This is Google's fork of IPython notebooks w/ multiple users, remote kernels, and more goodies. Hope it merges back into Jupyter.

Iwona Bialynicka-Birula ⏩

@iwonabb

24 Oct 2017

One of my favorite internal Google tools is now available externally colab.research.google.com

150

Ben Poole · Feb 14, 2018 · 6:08 AM UTC

Ben Poole

@poolio

14 Feb 2018

Folks, please spend more time searching for prior work. This paper on the flipped adversarial autoencoder (arxiv.org/abs/1802.04504) is the same as InfoGAN (arxiv.org/abs/1606.03657). We need better ways to summarize, distill, and distribute research so this stops happening.

148

Ben Poole · Nov 16, 2018 · 6:45 AM UTC

Ben Poole

@poolio

16 Nov 2018

When arguing with reviewers that they misunderstand prior work, keep in mind that they may be the author of that prior work.

143

Ben Poole · Nov 18, 2023 · 7:55 AM UTC

Ben Poole

@poolio

18 Nov 2023

Logging into Twitter after the CVPR deadline...

ALT 혼파망 피자 GIF

145

19,552

Ben Poole · May 17, 2017 · 7:06 PM UTC

Ben Poole

@poolio

17 May 2017

Learning to learn has gone from a fringe research area to a Google I/O keynote in just 1 year. The pace of progress in ML is insane.

145

Ben Poole · Oct 17, 2017 · 1:16 AM UTC

Ben Poole

@poolio

17 Oct 2017

Exciting topic, bold claims: "This paper explains why deep learning can generalize well...". Looking forward to reading!

Stat.ML Papers @StatMLPapers

17 Oct 2017

Generalization in Deep Learning. (arXiv:1710.05468v1 [stat.ML]) ift.tt/2hKCZ30

138

Ben Poole · Dec 1, 2020 · 5:07 PM UTC

Ben Poole

@poolio

1 Dec 2020

Super duper excited to share our new paper on score-based generative modeling! Stable training, exact likelihoods, high resolution samples, and much much more! Amazing work from @YSongStanford's internship with us at @GoogleAI 🧵👇

Yang Song

@DrYangSong

1 Dec 2020

Happy to announce our new work on score-based generative modeling: high quality samples, exact log-likelihoods, and controllable generation, all available through score matching and Stochastic Differential Equations (SDEs)! Paper: arxiv.org/abs/2011.13456

139

Ben Poole · Feb 23, 2018 · 5:22 AM UTC

Ben Poole

@poolio

23 Feb 2018

Cool work on Machine Theory of Mind from Neil Rabinowitz et al. (@DeepMindAI): arxiv.org/abs/1802.07740 Learns a system that can build models of other agents from observations alone. Neat direction for human-machine interaction and understanding of artificial agents!

136

Ben Poole · Sep 29, 2022 · 10:48 PM UTC

Ben Poole

@poolio

29 Sep 2022

The 3D model we generate is an improved NeRF that produces a 3D volume with density, color, and surface normals:

135

Ben Poole · Apr 12, 2017 · 12:56 AM UTC

Ben Poole

@poolio

12 Apr 2017

Optimize for simplest mask that confuses classifier to get interpretable explanations. Neat work from Fong&Vedaldi: arxiv.org/abs/1704.03296

134

Ben Poole · May 10, 2017 · 2:08 AM UTC

Ben Poole

@poolio

10 May 2017

Jointly train classifier & adversarial example generator, GAN-style -> improved adv. robustness & generalization arxiv.org/abs/1705.03387

131

Ben Poole · Dec 20, 2023 · 8:16 AM UTC

Ben Poole

@poolio

20 Dec 2023

One paper can change your life. But which one? Overproductivity doesn't just come from paper counting, but from the desperate acts of young researchers under extreme pressure to be part of that one paper.

Ben Recht @beenwrekt

19 Dec 2023

Since we just wrapped up an AI megaconference, it felt like a good day to plead for fewer papers. argmin.net/p/too-much-inform…

134

44,788

Ben Poole · Jul 22, 2025 · 6:57 PM UTC

Ben Poole

@poolio

22 Jul 2025

veo team is hiring, join the fun :) the yeti videos are cool, but there's still so much unknown in how to build spatial intelligence and useful creative tools!

Dumitru Erhan

@doomie

22 Jul 2025

Want to be part of a team redefining SOTA for generative video models? Excited about building models that can reach billions of users? The Veo team is hiring! We are looking for amazing researchers and engineers, in North America and Europe. Details below:

126

11,719

Ben Poole · Oct 30, 2018 · 3:31 AM UTC

Ben Poole

@poolio

30 Oct 2018

Humans are still the best lossy image compressors: arxiv.org/abs/1810.11137 Human describer views input image and communicates text to human reconstructor who uses image editing software to recreate an image. Fun paper led by 3 *high school* students interning at Stanford!

119

Ben Poole · Jan 15, 2019 · 7:09 PM UTC

Ben Poole

@poolio

15 Jan 2019

Want to learn latent variable models with powerful decoders? Check out our #iclr2019 accepted paper w/Ali Razavi, @avdnoord, @OriolVinyalsML that prevents posterior collapse withs δ-VAEs: arxiv.org/abs/1901.03416 Key idea: choose family of q(z) such that KL(q(z) || p(z)) > δ

120

Ben Poole · Feb 25, 2017 · 11:28 PM UTC

Ben Poole

@poolio

25 Feb 2017

Check out time-warped PCA, an unsupervised approach to aligning neural data, tonight @ #cosyne17 poster III-14 w/@niru_m @ItsNeuronal #twpca

117

Ben Poole · Dec 10, 2024 · 5:49 PM UTC

Ben Poole

@poolio

10 Dec 2024

Every time I fire a l̵i̵n̵g̵u̵i̵s̵t̵ graphics researcher, performance goes up.

Chris Offner

@chrisoffner3d

10 Dec 2024

"Sora is a data-driven physics engine."

116

17,579

Ben Poole · Oct 18, 2017 · 3:40 AM UTC

Ben Poole

@poolio

18 Oct 2017

TL;DR: use x * sigmoid(x) for neural net activations. Multiplicative interactions strike again!

Miles Brundage

@Miles_Brundage

18 Oct 2017

Replying to @Miles_Brundage

"Swish: a Self-Gated Activation Function," Ramachandran et al.: arxiv.org/abs/1710.05941

114

Ben Poole · Sep 30, 2022 · 1:29 PM UTC

Ben Poole

@poolio

30 Sep 2022

Nothing like waking up to see the 3D models we generated yesterday 3D printed in the real world today 😍 #dreamfusion

Mike S @mikiexx

30 Sep 2022

Replying to @poolio @BenMildenhall @ajayj_ @jon_barron

Paging Nurse Cogi

116

Ben Poole · Dec 16, 2024 · 7:41 PM UTC

Ben Poole

@poolio

16 Dec 2024

the sweater frogs can moooove #veo2

115

5,764

Ben Poole · May 19, 2020 · 10:55 PM UTC

Ben Poole

@poolio

19 May 2020

no i'm eating

113

Ben Poole · Nov 24, 2020 · 4:01 AM UTC

Ben Poole

@poolio

24 Nov 2020

Woohoo, code is now available for the kickass image VAE work from... *drumroll* ... the team @OpenAI!

@_akhaliq

24 Nov 2020

code released for Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images pdf: openreview.net/pdf?id=RLRXCV… github: github.com/openai/vdvae

114

Ben Poole · Mar 17, 2021 · 8:34 PM UTC

Ben Poole

@poolio

17 Mar 2021

MNIST is down: yann.lecun.com/exdb/mnist/ Maybe it's a sign I should try out some other datasets...

111

Ben Poole · Apr 28, 2023 · 8:56 PM UTC

Ben Poole

@poolio

28 Apr 2023

Headed to Kigali for #ICLR2023! Excited to meet folks and share the research that generated these 3D models

111

12,434

Ben Poole · Dec 6, 2023 · 2:51 AM UTC

Ben Poole

@poolio

6 Dec 2023

ReconFusion = 3D Reconstruction + Diffusion prior for novel view synthesis reconfusion.github.io Better NeRFs, less data.

Aleksander Holynski

@holynski_

6 Dec 2023

Excited to share ReconFusion! 3D reconstruction of real-world scenes from only a few photos, powered by diffusion priors: reconfusion.github.io w/ amazing team @ChrisWu6080 @BenMildenhall @philipphenzler @KeunhongP @RuiqiGao @watson_nn @_pratul_ @dorverbin @jon_barron @poolio

106

17,929

Ben Poole · Dec 2, 2024 · 6:56 PM UTC

Ben Poole

@poolio

2 Dec 2024

diffusion = flow matching great blog post from the experts on the synergy of these frameworks. parameterization and weighting matters, and i love how the choices in flow matching lead to simpler implementations compared to our early score-based SDE/diffusion model work!

Ruiqi Gao

@RuiqiGao

2 Dec 2024

A common question nowadays: Which is better, diffusion or flow matching? 🤔 Our answer: They’re two sides of the same coin. We wrote a blog post to show how diffusion models and Gaussian flow matching are equivalent. That’s great: It means you can use them interchangeably.

110

15,263

Ben Poole · Jan 10, 2018 · 8:44 PM UTC

Ben Poole

@poolio

10 Jan 2018

Love this work from Justin Gilmer et al. (Google Brain) on "adversarial spheres." Presents a tractable toy model for adversarial examples and proves that small misclassification error yields adversarial examples in high dimensions: arxiv.org/abs/1801.02774

105

Ben Poole · Dec 16, 2024 · 9:28 PM UTC

Ben Poole

@poolio

16 Dec 2024

Wild to compare #veo2 below to Imagen Video (SOTA from just 2 years ago):

Weizhe Hua

@hua_weizhe

16 Dec 2024

A cat jumps on a couch. #veo2

103

19,161

Ben Poole · Oct 5, 2022 · 5:38 PM UTC

Ben Poole

@poolio

5 Oct 2022

Progressive distillation (arxiv.org/abs/2202.00512) is awesome! Generation time can be reduced from 10 minutes to 30 seconds with minimal loss in quality.

Progressive Distillation for Fast Sampling of Diffusion Models

Diffusion models have recently shown great promise for generative modeling, outperforming GANs on perceptual quality and autoregressive models at density estimation. A remaining downside is their...

arxiv.org

Jonathan Ho @hojonathanho

5 Oct 2022

Replying to @hojonathanho

With the help of progressive distillation, Imagen Video can generate high quality videos using just 8 diffusion steps per sub-model. This speeds up video generation time substantially, by a factor of ~18x.

107

Ben Poole · May 1, 2023 · 9:52 AM UTC

Ben Poole

@poolio

1 May 2023

Excited to present our award-winning DreamFusion research today at #ICLR2023! Talk at 3:40pm in AD12, and poster #73 at 4:30pm. Have a few souvenirs to distribute too 🐸👻🐷🐶

104

9,810

Ben Poole · Dec 16, 2024 · 10:19 PM UTC

Ben Poole

@poolio

16 Dec 2024

catpacking #veo2

108

20,954

Ben Poole · Jan 19, 2024 · 9:34 PM UTC

Ben Poole

@poolio

19 Jan 2024

The highest quality 3D reconstruction pipeline is now open source!

Jon Barron

@jon_barron

19 Jan 2024

We just finished a joint code release for CamP (camp-nerf.github.io/) and Zip-NeRF (jonbarron.info/zipnerf/). As far as I know, this code is SOTA in terms of image quality (but not speed) among all the radiance field techniques out there. Have fun! github.com/jonbarron/camp_zi…

101

10,027

Ben Poole · Sep 30, 2022 · 1:16 AM UTC

Ben Poole

@poolio

30 Sep 2022

This was an incredibly fun team effort w/ NeRF wizards @BenMildenhall & @jon_barron, and NeRF + diffusion expert @ajayj_ (graduating this year!). We're excited to incorporate our methods with open source models and enable a new future for 3D generation! 🚀 #dreamfusion

103

Ben Poole · Jan 24, 2023 · 3:29 AM UTC

Ben Poole

@poolio

24 Jan 2023

StyleGAN-T generates faster and better samples than diffusion models at lower resolution (64x64) but underperforms at higher res (256x256). Excited to learn some new GAN tricks and for more diversity in research ideas around generative models :)

@_akhaliq

24 Jan 2023

StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis significantly improves over previous GANs and outperforms distilled diffusion models in terms of sample quality and speed abs: arxiv.org/abs/2301.09515 project page: sites.google.com/view/styleg…

103

28,295

Ben Poole · Feb 14, 2018 · 6:11 AM UTC

Ben Poole

@poolio

14 Feb 2018

We need a @schmidhubered consulting service. Tell them your idea, get back a list of references you have missed. Honestly, it would be a great service to the community and useful at the early stages of research projects.

Ben Poole · Dec 6, 2016 · 4:28 PM UTC

Ben Poole

@poolio

6 Dec 2016

Check out our poster on expressivity of random neural networks! Tonight 6-9pm #95 #nips2016

102

Ben Poole · Oct 28, 2017 · 3:40 AM UTC

Ben Poole

@poolio

28 Oct 2017

One day after complaints of the NIPS capsules paper, we've got a shiny new one! Patience, DL community. Patience. openreview.net/pdf?id=HJWLfG…

100

Ben Poole · Dec 4, 2020 · 5:36 PM UTC

Ben Poole

@poolio

4 Dec 2020

"Timnit responded with an email requiring that a number of conditions be met in order for her to continue working at Google" - @JeffDean (platformer.news/p/the-wither…) Geez, must be some pretty wild demands to lose such an amazing colleague. Wonder what they are? 😠

The withering email that got an ethical AI researcher fired at Google

"Stop writing your documents because it doesn’t make a difference": Timnit Gebru's final message to her peers

platformer.news

@timnitGebru (@dair-community.social/bsky.social)

@timnitGebru

4 Dec 2020

Replying to @debarrosmarcelo

Easy. 1 Tell us exactly the process that led to retraction order and who exactly was involved. 2. Have a series of meetings with the ethical ai team about process. 3 have an understanding of research parameters, what can be done/not, who can make these censorship decisions etc.

Ben Poole · Jan 5, 2017 · 6:51 AM UTC

Ben Poole

@poolio

5 Jan 2017

New paper from Krotov & Hopfield shows that dense associative memory models are robust to adversarial inputs: arxiv.org/abs/1701.00939

102

Ben Poole · Oct 26, 2020 · 4:13 PM UTC

Ben Poole

@poolio

26 Oct 2020

BYOL works even without batch statistics! arxiv.org/abs/2010.10241 surprising result that refutes the critical role of BN as implicit contrastive learning (untitled-ai.github.io/unders…, arxiv.org/abs/2010.00578) so... why does it work?

Ben Poole · Oct 12, 2022 · 8:23 PM UTC

Ben Poole

@poolio

12 Oct 2022

In spite of the limitations of current generative models, they can create something that really feels like AI Magic! To think I was pretty darn proud of these samples 7 years ago...

Runway

@runwayml

12 Oct 2022

Introducing AI Magic Tools Dozens of creative tools to edit and generate content like never before. New tools added every week. Available now: runwayml.com

Ben Poole · Nov 1, 2022 · 2:03 AM UTC

Ben Poole

@poolio

1 Nov 2022

"a skeleton juggling pumpkins" happy halloween from #dreamfusion!

Ben Poole · Dec 11, 2020 · 4:47 PM UTC

Ben Poole

@poolio

11 Dec 2020

Great perspective from @BachFrancis Q&A: “For science it’s not number of viewers, it’s not even number of citations, it’s something more complicated. Whether your work tackles important questions… and that can’t be seen simply by a single number or social media.”