Tim Dockhorn · Nov 21, 2023 · 8:39 PM UTC

Tim Dockhorn

Tim Dockhorn

@timudk

21 Nov 2023

Over the last few months I have spent a lot of time sampling from this model. Some tips: 1) You can generate videos even with small GPUs (just decrease number of frames you decode at a time as this eats most VRAM). 14 frames (decoding one at a time) should be less than 20GB VRAM

@_akhaliq

21 Nov 2023

Stability releases Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets model: huggingface.co/stabilityai/s… present Stable Video Diffusion — a latent video diffusion model for high-resolution, state-of-the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained for 2D image synthesis have been turned into generative video models by inserting temporal layers and finetuning them on small, high-quality video datasets. However, training methods in the literature vary widely, and the field has yet to agree on a unified strategy for curating video data. In this paper, we identify and evaluate three different stages for successful training of video LDMs: text-to-image pretraining, video pretraining, and high-quality video finetuning.

276

117,357

Tim Dockhorn · Nov 21, 2023 · 7:25 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Stable Video Diffusion: code & chekpoints out! On my way to get some vacation now 🌴🌻

123

18,457

Tim Dockhorn · Sep 23, 2024 · 5:43 PM UTC

Tim Dockhorn

@timudk

23 Sep 2024

come and build the best models with us

Black Forest Labs

@bfl_ai

23 Sep 2024

We are actively hiring across several roles, check out our website or job board for detailed job descriptions.

122

10,899

Tim Dockhorn · Aug 14, 2024 · 6:33 AM UTC

Tim Dockhorn

@timudk

14 Aug 2024

FLUX.1 🤝 Grok-2

Igor Babuschkin

@ibab

14 Aug 2024

Huge thank you to the @bfl_ml team, who scaled up their FLUX.1 inference API to support the Grok-2 release today!

119

8,816

Tim Dockhorn · Oct 12, 2022 · 12:26 PM UTC

Tim Dockhorn

@timudk

12 Oct 2022

📢📢 Introducing GENIE: Higher-Order Denoising Diffusion Solvers. nv-tlabs.github.io/GENIE/ GENIE distills higher-order score terms into a small neural network and uses them for accelerated diffusion model sampling. 💨 Fun project with @karsten_kreis & @ArashVahdat! (1/6)

@_akhaliq

12 Oct 2022

GENIE: Higher-Order Denoising Diffusion Solvers abs: buff.ly/3yAaZaq project page: buff.ly/3yAb3aa Higher-Order Denoising Diffusion Solvers: Based on truncated Taylor methods, we derive a novel higher-order solver that significantly accelerates synthesis

114

Tim Dockhorn · Aug 1, 2024 · 1:41 PM UTC

Tim Dockhorn

@timudk

1 Aug 2024

so this happened

Black Forest Labs

@bfl_ai

1 Aug 2024

We are excited to announce the launch of Black Forest Labs. Our mission is to develop and advance state-of-the-art generative deep learning models for media and to push the boundaries of creativity, efficiency and diversity.

102

4,359

Tim Dockhorn · Oct 19, 2022 · 12:57 PM UTC

Tim Dockhorn

@timudk

19 Oct 2022

📢 Excited to announce Differentially Private Diffusion Models! 🔒 nv-tlabs.github.io/DPDM We train diffusion models with strict differential privacy guarantees and outperform previous methods by large margins. w/ @tianshi_cao, @ArashVahdat, @karsten_kreis (1/n)

@_akhaliq

19 Oct 2022

Differentially Private Diffusion Models abs: arxiv.org/abs/2210.09929 project page: nv-tlabs.github.io/DPDM

Tim Dockhorn · Nov 24, 2023 · 10:51 PM UTC

Tim Dockhorn

@timudk

24 Nov 2023

Two examples of how lower motion score can give you more object motion (left 255, right 31):

30,865

Tim Dockhorn · Apr 19, 2023 · 5:51 PM UTC

Tim Dockhorn

@timudk

19 Apr 2023

📢📢 Glad we can finally share our work on (text-to-)video generation. TL;DR: Take Stable Diffusion, insert additional temporal layers and fine-tune them on video data while keeping the spatial layers fixed. w/ @andi_blatt, @robrombach, @HuanLing6, @FidlerSanja, @karsten_kreis

@_akhaliq

19 Apr 2023

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models turn the publicly available, state-of-the-art text-to-image LDM Stable Diffusion into an efficient and expressive text-to-video model with resolution up to 1280 x 2048 abs: arxiv.org/abs/2304.08818 project page: research.nvidia.com/labs/tor…

20,411

Tim Dockhorn · Aug 24, 2023 · 6:31 PM UTC

Tim Dockhorn

@timudk

24 Aug 2023

After two failed conference attempts, DPDM has been accepted at TMLR (openreview.net/forum?id=ZPpQ…). I am super happy about the discussions around DPDMs and follow-up works despite "only being an arxiv paper" for almost a year.

Differentially Private Diffusion Models

While modern machine learning models rely on increasingly large training datasets, data is often limited in privacy-sensitive domains. Generative models trained with differential privacy (DP) on...

openreview.net

Tim Dockhorn

@timudk

19 Oct 2022

15,221

Tim Dockhorn · Feb 1, 2022 · 1:08 PM UTC

Tim Dockhorn

@timudk

1 Feb 2022

Bored of overdamped Langevin dynamics in diffusion models? Why not introduce velocity variables and speed-up the diffusion process with a Hamiltonian component. That's exactly what we did in our #iclr2022 (spotlight) paper (w/ @karsten_kreis,@ArashVahdat): nv-tlabs.github.io/CLD-SGM/

Tim Dockhorn · Jul 22, 2023 · 6:17 PM UTC

Tim Dockhorn

@timudk

22 Jul 2023

Thank you for the kind words @thegautamkamath, and thank you to all the committee members, in particular Yaoliang for all his support during my PhD and for being an amazing supervisor. I am also very grateful to @driainmurray for his consistent guidance from afar.

Gautam Kamath @thegautamkamath

22 Jul 2023

Congrats to Dr. Tim Dockhorn (@timudk) who defended his PhD thesis yesterday! Tim is a world expert in diffusion models (DMs), and is going on to work at @StabilityAI. His work on privatizing DMs is an incredible leap forward (arxiv.org/abs/2210.09929). timudk.github.io/

ALT Dr. Tim Dockhorn and his thesis committee

10,152

Tim Dockhorn · Dec 15, 2021 · 6:19 PM UTC

Tim Dockhorn

@timudk

15 Dec 2021

Super excited to finally reveal what I have been working on with @karsten_kreis and @ArashVahdat during my internship at nvidia. I am also very happy to announce that I will stay on this amazing team led by @FidlerSanja as an intern, and push score-based models even further.

Karsten Kreis @karsten_kreis

15 Dec 2021

📢 Score-based Generative Modeling with Critically-Damped Langevin Diffusion! nv-tlabs.github.io/CLD-SGM/ We propose a novel diffusion using auxiliary velocity variables for more efficient denoising and higher quality generative models. w/ the amazing @timudk & @ArashVahdat! (1/n)

Tim Dockhorn · Mar 3, 2022 · 2:13 PM UTC

Tim Dockhorn

@timudk

3 Mar 2022

📢 The code and checkpoints for our Critically-Damped Diffusion paper has been released: github.com/nv-tlabs/CLD-SGM We also made some colabs so you can play 🎮 with sampling and likelihood computation.

GitHub - nv-tlabs/CLD-SGM: Score-Based Generative Modeling with Critically-Damped Langevin Diffusion

Score-Based Generative Modeling with Critically-Damped Langevin Diffusion - nv-tlabs/CLD-SGM

github.com

@_akhaliq

3 Mar 2022

Replying to @_akhaliq

github: github.com/nv-tlabs/CLD-SGM

Tim Dockhorn · Sep 17, 2024 · 2:39 AM UTC

Tim Dockhorn

@timudk

17 Sep 2024

join us for food, drinks, and discussions on diffusion models @bfl_ml x @FAL lu.ma/r1ntmmh2

fal x BFL Present: Acceleration by the Bay · Luma

Join us for drinks and hors d'oeuvres, and meet the developers shaping the future of creative tools through generative AI.

luma.com

4,679

Tim Dockhorn · Feb 28, 2023 · 4:49 PM UTC

Tim Dockhorn

@timudk

28 Feb 2023

Awesome work showing that you can scale DPDMs to CIFAR-10 using public pre-training. Congrats to @SGhalebikesabi and the team!

@_akhaliq

28 Feb 2023

Differentially Private Diffusion Models Generate Useful Synthetic Images By privately fine-tuning ImageNet pre-trained diffusion models with more than 80M parameters, obtain SOTA results on CIFAR-10 and Camelyon17 abs: arxiv.org/abs/2302.13861

13,573

Tim Dockhorn · Sep 18, 2024 · 6:35 PM UTC

Tim Dockhorn

@timudk

18 Sep 2024

I am in SF for a few days: - Sep 18 / Sep 19 @PyTorch conference - Sep 18 @bfl_ml x @FAL dinner event - Sep 19 @huggingface Flare Party - Sep 21 CUDA MODE IRL We are also hiring at @bfl_ml for several roles, please come and chat with us job-boards.greenhouse.io/bla…

1,983

Tim Dockhorn · Jun 18, 2020 · 12:39 PM UTC

Tim Dockhorn

@timudk

18 Jun 2020

Looking for yet another application of normalizing flows? We show that you can fit them given only noisy observations and the statistics of the noise distribution. This is joint work with @jmsrtch, @driainmurray and Yaoliang Yu. arxiv.org/abs/2006.09396

Tim Dockhorn · Nov 21, 2023 · 8:39 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

6) Lastly I want to say that this is just the beginning and we have a lot of ideas on how to improve video models. I am also super excited to see what finetunes/inference tricks the community can come up with; that's the best part about releasing weights!

1,874

Tim Dockhorn · Mar 28, 2023 · 4:42 PM UTC

Tim Dockhorn

@timudk

28 Mar 2023

We released code & models for our Differentially Private Diffusion Models (DPDMs): github.com/nv-tlabs/DPDM Check it out and train your own DPDMs.

GitHub - nv-tlabs/DPDM: Differentially Private Diffusion Models

Differentially Private Diffusion Models. Contribute to nv-tlabs/DPDM development by creating an account on GitHub.

github.com

Tim Dockhorn

@timudk

19 Oct 2022

Replying to @timudk

🔒 Despite DP generative modeling being incredibly challenging, we hope that our results can stimulate future work in this important field. Project page: nv-tlabs.github.io/DPDM/ arXiv: arxiv.org/abs/2210.09929 Code will be released soon! Stay tuned! (12/12)

3,548

Tim Dockhorn · Oct 3, 2024 · 4:33 PM UTC

Tim Dockhorn

@timudk

3 Oct 2024

🫐🫐🫐

844

Tim Dockhorn · Oct 30, 2023 · 3:06 PM UTC

Tim Dockhorn

@timudk

30 Oct 2023

The diffusion tutorial dream team is back. Don't miss it.

Karsten Kreis @karsten_kreis

30 Oct 2023

📢 Planning your NeurIPS'23 trip? Interested in *Latent* Diffusion Models? @RuiqiGao, @ArashVahdat and I will present the tutorial "Latent Diffusion Models: Is the Generative AI Revolution Happening in Latent Space?" Monday, Dec 11, New Orleans. neurips2023-ldm-tutorial.git… (1/n)

5,038

Tim Dockhorn · Nov 25, 2022 · 9:20 PM UTC

Tim Dockhorn

@timudk

25 Nov 2022

I will be at @NeurIPSConf from Saturday-Saturday. 📨 DM if you want to chat about Diffusion models or if you need a buddy to watch the world cup ⚽️

Tim Dockhorn · Nov 21, 2023 · 8:39 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

2) The fps conditioning and motion conditioning can greatly influence results. You don't necessarily need to choose the fps conditioning = fps rendering! I have gotten very good results with high fps / high motion conditionings rendered at lower fps.

3,049

Tim Dockhorn · Dec 10, 2023 · 7:28 PM UTC

Tim Dockhorn

@timudk

10 Dec 2023

On my way to New Orleans for NeurIPS! Excited to chat about all things generative modeling, especially efficient and scalable video generation 📽️🎞️

2,134

Tim Dockhorn · May 10, 2021 · 5:43 PM UTC

Tim Dockhorn

@timudk

10 May 2021

TAing this class was super fun and I learned lots. My favorite parts were learning about convergence of proximal descent for non-convex functions (Remark 4.22) and the connection between dual averaging and the generalized conditional gradient (HW5).

Gautam Kamath @thegautamkamath

10 May 2021

Want to learn optimization? Start with my @UWCheritonCS colleague Yaoliang Yu's course "Optimization for Data Science"! 20 excellent lectures, starting from the basics. cs.uwaterloo.ca/~y328yu/myco…

Tim Dockhorn · Feb 16, 2023 · 3:28 AM UTC

Tim Dockhorn

@timudk

16 Feb 2023

Replying to @lipmanya @qsh_zh @sedielem @RickyTQChen @helibenhamu @mnick @lematt1991

If I am not mistaken, one can recover OT flow matching (your (21) + (9)) exactly using the diffusion v-prediction (SNR+1 loss) from arxiv.org/abs/2202.00512l with alpha_t = 1 and sigma_t = 1 -t. Credits to @RiversHaveWings who originally found this.

2,832

Tim Dockhorn · Dec 1, 2022 · 3:46 PM UTC

Tim Dockhorn

@timudk

1 Dec 2022

📢📢 Presenting: Latent Space Diffusion Models of Cryo-EM Structures We are training diffusion models in the latent space of a cryo-EM autoencoder. Huge potential for downstream applications such as protein generative modeling from cryo-EM data. 🔥

Ellen Zhong

@ZhongingAlong

1 Dec 2022

In a fantastic collaboration with @karsten_kreis, @timudk, and Zihao Li, we extend cryoDRGN ❄️🐉 for generative sampling of cryo-EM structures via latent diffusion models. We'll be presenting this work @workshopmlsb @NeurIPSConf Sat, 9am! #EZlab Paper: arxiv.org/abs/2211.14169 1/

Tim Dockhorn · Jan 4, 2022 · 5:04 PM UTC

Tim Dockhorn

@timudk

4 Jan 2022

I will give a talk on score-based models and our CLD-SGM model on Thursday 4pm EST. Tune in by registering here vectorinstitute.zoom.us/meet…

Karsten Kreis @karsten_kreis

15 Dec 2021

Tim Dockhorn · Apr 5, 2021 · 1:33 PM UTC

Tim Dockhorn

@timudk

5 Apr 2021

What do people think about this comparison between generative models? Source: arxiv.org/abs/2103.04922

Tim Dockhorn · Nov 21, 2023 · 8:39 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

7) This was a a great collaborative project and I am deeply grateful for co-leads @andi_blatt @sumith1896 and the rest of the team

1,795

Tim Dockhorn · Nov 21, 2023 · 8:39 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

3) The guidance scale can also have a big impact on results. We actually increase the guidance scale linearly from w_min to w_max over the frame axis. More guidance will lead to better consistency but may result in oversaturation. For best results play with w_min/w_max.

2,011

Tim Dockhorn · Nov 6, 2024 · 8:51 PM UTC

Tim Dockhorn

@timudk

6 Nov 2024

🔥🔥🔥

fofr

@fofrAI

6 Nov 2024

Flux 1.1 pro ultra is now on Replicate. 4 megapixels (2096x2096) in 10 seconds 🔥 replicate.com/black-forest-l…

1,195

Tim Dockhorn · Nov 21, 2023 · 8:39 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

4) The model was only trained for resolution 576x1024 and you will likely observe artifacts when changing the aspect ratio considerably. If you still want to try, it may help to increase the conditioning augmentation noise.

1,894

Tim Dockhorn · Mar 14, 2023 · 1:00 AM UTC

Tim Dockhorn

@timudk

14 Mar 2023

Replying to @arankomatsuzaki

" Unlike previous methods, our approach can remove concepts from a diffusion model permanently rather than modifying the output at the inference time, so it cannot be circumvented even if a user has access to model weights." - are weights released though?

1,465

Tim Dockhorn · Mar 5, 2024 · 7:18 AM UTC

Tim Dockhorn

@timudk

5 Mar 2024

Replying to @iScienceLuvr

This is a great summary Tanishq

1,590

Tim Dockhorn · Dec 9, 2021 · 4:59 PM UTC

Tim Dockhorn

@timudk

9 Dec 2021

Working with @karsten_kreis and @ArashVahdat on score-based generative models has been nothing but great. Excited to share what we have come up with soon. If you like to push SOTA generative models and their applications consider applying!

Karsten Kreis @karsten_kreis

8 Dec 2021

This was an exciting project - I continue to be amazed by the capabilities of modern deep generative models! If you are interested in working with us on generative models and their applications, please reach out. We are looking for exceptional interns at NVIDIA's Toronto AI Lab.

Tim Dockhorn · Mar 9, 2023 · 6:48 PM UTC

Tim Dockhorn

@timudk

9 Mar 2023

Better late than never: We released our code (github.com/nv-tlabs/GENIE) and all checkpoints (drive.google.com/drive/folde…).

GitHub - nv-tlabs/GENIE: GENIE: Higher-Order Denoising Diffusion Solvers

GENIE: Higher-Order Denoising Diffusion Solvers. Contribute to nv-tlabs/GENIE development by creating an account on GitHub.

github.com

540

Tim Dockhorn · Dec 3, 2020 · 10:13 PM UTC

Tim Dockhorn

@timudk

3 Dec 2020

Replying to @sp_monte_carlo

Looking for a textbook on SDEs mostly for their application in probability theory/mcmc (fokker-planck eq, langevin dynamics)? Any recommendations?

Tim Dockhorn · Nov 21, 2023 · 10:07 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Replying to @rbhuta95

Increasing motion_bucket_id should lead to more overall motion in the generated video

941

Tim Dockhorn · Nov 21, 2023 · 9:02 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Replying to @cocktailpeanut @streamlit @EMostaque

You should be able to get it below 20GB VRAM by decoding one frame at a time

944

Tim Dockhorn · Oct 12, 2022 · 12:26 PM UTC

Tim Dockhorn

@timudk

12 Oct 2022

GENIE is a higher-order solver that is based on the second truncated Taylor method (TTM). Intuitively, the higher-order terms in GENIE capture the local curvature of the ODE and enable larger step sizes when compared to DDIM (first TTIM). (2/6)

Tim Dockhorn · Dec 4, 2023 · 5:16 PM UTC

Tim Dockhorn

@timudk

4 Dec 2023

Replying to @sedielem

I guess it's time for yet another diffusion circle ⛱️

422

Tim Dockhorn · May 10, 2020 · 7:04 PM UTC

Tim Dockhorn

@timudk

10 May 2020

Personal update: I am very excited to start my PhD (tomorrow!) with Yaoliang Yu at @UWaterloo and @VectorInst. I will broadly be working on combining machine learning and probabilistic modeling.

Tim Dockhorn · Aug 26, 2019 · 1:44 PM UTC

Tim Dockhorn

@timudk

26 Aug 2019

Given a dataset {(x_i, y_i)}_{i=1}^m ⊂ R^2 how many parameters does a (deep) neural network need to achieve training error below epsilon (given the optimal solution is found by a magical optimizer and we are ok with overfitting)? Can we do better than O(m)? @roydanroy @mpd37

Tim Dockhorn · Nov 21, 2023 · 7:26 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Code: github.com/Stability-AI/gene… Checkpoints: huggingface.co/stabilityai/s… huggingface.co/stabilityai/s…

547

Tim Dockhorn · Oct 12, 2022 · 12:26 PM UTC

Tim Dockhorn

@timudk

12 Oct 2022

Project page: nv-tlabs.github.io/GENIE/ arXiv: arxiv.org/abs/2210.05475 Code will be released soon! Stay tuned! (6/6)

Tim Dockhorn · Sep 21, 2024 · 3:26 AM UTC

Tim Dockhorn

@timudk

21 Sep 2024

Replying to @cloneofsimo @karpathy

i love this

2,351

Tim Dockhorn · Jun 24, 2023 · 10:10 PM UTC

Tim Dockhorn

@timudk

24 Jun 2023

"Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" - SD-XL 0.9

Stability AI

@StabilityAI

22 Jun 2023

Introducing the latest release from Stability AI: Breaking barriers with #SDXL 0.9! SDXL 0.9 produces massively improved text-to-image and composition detail over the beta release and provides a leap in use cases for generative AI imagery. #StabilityAI Unleash your creativity today! → bit.ly/3Xn12bI

1,802

Tim Dockhorn · Nov 21, 2023 · 8:39 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

5) Increasing conditioning augmentation noise is also necessary when applying the model to images that have heavy compression artifacts.

1,713

Tim Dockhorn · Feb 15, 2024 · 8:34 PM UTC

Tim Dockhorn

@timudk

15 Feb 2024

Replying to @_tim_brooks @billpeeb @OpenAI

amazing work

288

Tim Dockhorn · Jan 7, 2020 · 11:23 AM UTC

Tim Dockhorn

@timudk

7 Jan 2020

(1/6) My master's thesis is now available online: Generative Modeling with Neural Ordinary Differential Equations uwspace.uwaterloo.ca/handle/…

Tim Dockhorn · Dec 11, 2020 · 2:47 PM UTC

Tim Dockhorn

@timudk

11 Dec 2020

Replying to @sp_monte_carlo

Seems like you have had a look at quite a few books. Do you have recommendations/tips how to read them? I presume you did not have time to read them end to end and taking elaborate all throughout? I am struggling to find an efficient strategy.

Tim Dockhorn · Sep 26, 2019 · 5:45 PM UTC

Tim Dockhorn

@timudk

26 Sep 2019

Replying to @JeffDean @hardmaru @geoffreyhinton @OriolVinyalsML

Would be interesting to see how popular it would have become if it was published by some less popular researchers. "Just" publishing on arxiv when you are not well-known is difficult, i.e., you might not get much attention.

Tim Dockhorn · Mar 19, 2020 · 3:11 AM UTC

Tim Dockhorn

@timudk

19 Mar 2020

Replying to @jm_alexia

Thank you for tweeting about something else than covid!!

Tim Dockhorn · Mar 19, 2020 · 3:15 AM UTC

Tim Dockhorn

@timudk

19 Mar 2020

Replying to @zacharylipton

Similar pattern: new adversarial attack vs new adversarial defense

Tim Dockhorn · Jan 24, 2020 · 4:50 PM UTC

Tim Dockhorn

@timudk

24 Jan 2020

Replying to @lxuechen

In my Master's thesis, I derived the adjoint method for Neural ODEs and for Continuous normalizing flows from a constrained optimization framework. I didn't know about LeCun's paper when I derived the results but fortunately was pointed there before submitting the final version.

Tim Dockhorn · Nov 21, 2023 · 9:10 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Replying to @timudk @cocktailpeanut @streamlit @EMostaque

python scripts/sampling/simple_video_sample.py --decoding_t 1 --version svd_image_decoder

215

Tim Dockhorn · Dec 16, 2021 · 4:34 AM UTC

Tim Dockhorn

@timudk

16 Dec 2021

Replying to @chinwei_h @ArashVahdat @karsten_kreis

Thanks Chin-Wei :) Your Augmented Normalizing Flow paper was part of the motivation for our work.

Tim Dockhorn · Aug 7, 2021 · 3:22 PM UTC

Tim Dockhorn

@timudk

7 Aug 2021

Spotted in a high schooler's NeurIPS 2021 checklist: Q: Did you include the total amount of compute and the type of resources used? A: Our models were trained for a total of 1378 CPU-years on a TI-84 Plus.

Chanin Nantasenamat @thedataprof

7 Aug 2021

Texas Instruments just released the TI-84 Plus CE Color Graphing Calculator. The crazy thing is it now supports #Python! Reminiscent of high school years, the TI has come a long way. amzn.to/2VBxyv7

Tim Dockhorn · Nov 6, 2021 · 2:31 PM UTC

Tim Dockhorn

@timudk

6 Nov 2021

I wanted to get my feet wet with Schrödinger Bridge GMs for a while. This work made the journey quite comfortable by neatly connecting to SGMs (via Forward-Backward SDEs). IMO the major advantage compared to SGMs is that you don't have to craft the forward process yourself.

Guan-Horng Liu @guanhorng_liu

26 Oct 2021

Score-based generative models are implicit optimal transport models; lifting them to accept fully nonlinear diffusion yields Schrödinger Bridge generative models. Check out our latest work on log-likelihood training of Schrödinger Bridge 🌉! arxiv.org/pdf/2110.11291.pdf (1/3)

Tim Dockhorn · Jul 16, 2020 · 4:46 PM UTC

Tim Dockhorn

@timudk

16 Jul 2020

Come by to hear about our work on density deconvolution with flows invertibleworkshop.github.io…

Ricky T. Q. Chen @RickyTQChen

16 Jul 2020

Please join us *Saturday at #ICML2020 for the INNF+ workshop for invited talks by @wellingmax, @eric_nalisnick, @emidup, Cheng Zhang, @adjiboussodieng, @KyleCranmer and Martin Jankowiak. Starts 5:25 EDT / 11:25 CET / 18:25 JST invertibleworkshop.github.io… icml.cc/virtual/2020/worksho…

Tim Dockhorn · Dec 6, 2023 · 5:06 PM UTC

Tim Dockhorn

@timudk

6 Dec 2023

Replying to @thegautamkamath @WenhuChen @UWCheritonCS

*Cries in Canada*

246

Tim Dockhorn · Sep 18, 2024 · 6:06 PM UTC

Tim Dockhorn

@timudk

18 Sep 2024

Replying to @fal @FAL @PyTorch @isidentical @cloneofsimo @chamini2 @gorkem @burkaygur

new logo who this

214

Tim Dockhorn · Oct 12, 2022 · 12:26 PM UTC

Tim Dockhorn

@timudk

12 Oct 2022

Cascaded DM pipelines and DM-based super-resolution have become crucial ingredients in large-scale image generation. We also explore the applicability of GENIE in this setting. Our GENIE upsampler only uses five function evaluations to generate the cats below. (5/6)

Tim Dockhorn · Feb 5, 2021 · 1:50 PM UTC

Tim Dockhorn

@timudk

5 Feb 2021

Replying to @jm_alexia

I guess it ultimately depends on what your goal is. I would have liked to see an actual application (or motivation) where this is useful.

Tim Dockhorn · Dec 22, 2023 · 11:54 AM UTC

Tim Dockhorn

@timudk

22 Dec 2023

Replying to @seungkim0123

Great stuff as always from you guys

180

Tim Dockhorn · Feb 9, 2022 · 4:50 PM UTC

Tim Dockhorn

@timudk

9 Feb 2022

Very excited to dig into this. I have been thinking about this problem for a while and I am very glad somebody did the math for me.

Arnaud Doucet @ArnaudDoucet1

9 Feb 2022

Diffusion models go Riemannian arxiv.org/abs/2202.02763 - Time reversal + score-matching on compact manifolds - Sampling and likelihood computation with SOTA results - Solves Schrodinger bridges on manifolds @ValentinDeBort1 @MathieuEmile @MHutchinson141 @JamesTThorn @yeewhye

Tim Dockhorn · Oct 28, 2021 · 12:39 AM UTC

Tim Dockhorn

@timudk

28 Oct 2021

Excited to finally share our work "Demystifying and Generalizing BinaryConnect". This is joint work between UWaterloo and Huawei Noah's Ark Lab. arxiv.org/abs/2110.13220

Demystifying and Generalizing BinaryConnect

BinaryConnect (BC) and its many variations have become the de facto standard for neural network quantization. However, our understanding of the inner workings of BC is still quite limited. We...

arxiv.org

Tim Dockhorn · Oct 28, 2024 · 7:56 PM UTC

Tim Dockhorn

@timudk

28 Oct 2024

Replying to @thegautamkamath @WaterlooMath

haha

Tim Dockhorn · Dec 4, 2019 · 2:43 PM UTC

Tim Dockhorn

@timudk

4 Dec 2019

Replying to @tallinzen @roydanroy @adjiboussodieng @hannawallach @NeurIPSConf

That's definitely how most people in Germany would pronounce it.

Tim Dockhorn · Aug 1, 2019 · 10:38 PM UTC

Tim Dockhorn

@timudk

1 Aug 2019

Replying to @andyblarsen @lichess

@ThomasMBury @saptarshipal49

Tim Dockhorn · Dec 16, 2021 · 11:06 PM UTC

Tim Dockhorn

@timudk

16 Dec 2021

Excited to try out this beast!! Great work as always @RiversHaveWings

Rivers Have Wings @RiversHaveWings

16 Dec 2021

My 602M parameter CLIP conditioned diffusion model trained on Conceptual 12M is out at github.com/crowsonkb/v-diffu…! It can generate images matching the prompt quickly using its CLIP conditioning, but still requires CLIP guidance for best results.

Tim Dockhorn · Nov 21, 2023 · 10:14 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Replying to @fofrAI

Awesome - btw you can even generate more than 25 frames. Depending on the input, I could sometimes get good results for up to 40 frames - even more results will deteriorate quality.

132

Tim Dockhorn · Oct 12, 2022 · 12:26 PM UTC

Tim Dockhorn

@timudk

12 Oct 2022

During training, we propose to extract the necessary higher-order terms from the diffusion model (DM) via automatic differentiation. The higher-order terms are then distilled into a small neural network on top of the DM, allowing for efficient inference. (3/6)

Tim Dockhorn · Jan 3, 2022 · 4:36 PM UTC

Tim Dockhorn

@timudk

3 Jan 2022

Replying to @karsten_kreis @ArashVahdat

@gaetan_hadjeres the revised version is now on arxiv

Tim Dockhorn · Nov 21, 2023 · 9:10 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Replying to @cocktailpeanut @streamlit @EMostaque

python scripts/sampling/simple_video_sample.py decoding_t 1 --version svd_image_decoder (If you decode one frame at a time you may as well use the standard image decoder)

217

Tim Dockhorn · Sep 18, 2024 · 6:36 PM UTC

Tim Dockhorn

@timudk

18 Sep 2024

Replying to @timudk @PyTorch @bfl_ml @FAL @huggingface

tweet format stolen from @danielhanchen 🤫

292

Tim Dockhorn · Nov 25, 2023 · 12:54 AM UTC

Tim Dockhorn

@timudk

25 Nov 2023

Replying to @gjzhang1

We are still early in video generative modeling. Generating 20s-1min videos should be the next goal.

180

Tim Dockhorn · Nov 13, 2021 · 6:15 PM UTC

Tim Dockhorn

@timudk

13 Nov 2021

Replying to @RiversHaveWings

Last picture looks like an ocean wave crashing against the W Barcelona.

Tim Dockhorn · Aug 26, 2019 · 1:56 PM UTC

Tim Dockhorn

@timudk

26 Aug 2019

Replying to @timudk @roydanroy @mpd37

@zacharylipton @suzatweet @DeepSpiker @pfau @hardmaru

Tim Dockhorn · Jan 20, 2021 · 11:08 PM UTC

Tim Dockhorn

@timudk

20 Jan 2021

Just reread arxiv.org/abs/1904.12083 by @daibond_alpha et al. on training EBMs by learning a "dual sampler". Their work generalizes more or less all EBM training methods. Very impressive work.

Tim Dockhorn · Oct 19, 2022 · 12:57 PM UTC

Tim Dockhorn

@timudk

19 Oct 2022

🔒 (ii) Results! Training CNN classifiers with synthesized data from our DMs performs on par with CNN classifiers trained directly w/ DP-SGD. This is initial proof that DP generative models can eventually be used as effective data sharing media of sensitive data. (11/n)

Tim Dockhorn · Jan 19, 2020 · 10:00 PM UTC

Tim Dockhorn

@timudk

19 Jan 2020

Replying to @IntuitMachine @citnaj

Pointers?

Tim Dockhorn · Sep 18, 2019 · 7:13 PM UTC

Tim Dockhorn

@timudk

18 Sep 2019

I really liked @carlhenrikek's explanation of variational inference (piped.video/watch?v=qLyIGnS-…). Most often, VI is motivated by finding a good approximation q to the posterior, but we actually want is a lower bound of the marginal likelihood.

Tim Dockhorn · Oct 17, 2023 · 2:26 AM UTC

Tim Dockhorn

@timudk

17 Oct 2023

Replying to @giannis_daras @ArashVahdat

Congrats and very good choice !!

346

Tim Dockhorn · Jun 17, 2021 · 1:32 PM UTC

Tim Dockhorn

@timudk

17 Jun 2021

I haven’t really tried anything but Adam and SGD for NN training, and I don’t plan to do so. Seems like that’s ok.

Frank Schneider @frankstefansch1

16 Jun 2021

📣 #ICML 2021 Paper 📣 Overwhelmed by the flood of optimizers for deep learning? We felt the same and performed an extensive benchmark. Joint work with @robinschmidt_ & @PhilippHennig5. Paper: arxiv.org/abs/2007.01547 Results: github.com/SirRob1997/Crowde… Video: piped.video/cz9RzlstFdE

Tim Dockhorn · Feb 16, 2023 · 5:12 AM UTC

Tim Dockhorn

@timudk

16 Feb 2023

Replying to @timudk @lipmanya @qsh_zh @sedielem @RickyTQChen @helibenhamu @mnick @lematt1991 @RiversHaveWings

Correct link: arxiv.org/abs/2202.00512

261

Tim Dockhorn · Oct 19, 2022 · 12:57 PM UTC

Tim Dockhorn

@timudk

19 Oct 2022

🔒 (iii) Why DMs? GANs are currently predominantly used in DP generative modeling. They are difficult to optimize and prone to mode collapse which is problematic during noisy DP-SGD training. In contrast, DMs are trained with a robust and scalable regression-like loss. (8/n)

Tim Dockhorn · Oct 19, 2022 · 1:01 PM UTC

Tim Dockhorn

@timudk

19 Oct 2022

Replying to @_arohan_ @_akhaliq

Thanks Rohan :) We average the gradients for different noise levels of the same training example *before* clipping the gradients. This induces no additional privacy cost. Our approach is motivated by "augmentation multiplicity" in arxiv.org/abs/2204.13650.

Tim Dockhorn · Aug 26, 2019 · 7:35 PM UTC

Tim Dockhorn

@timudk

26 Aug 2019

Replying to @BrynElesedy @roydanroy @mpd37

The authors show here exact representation using O(m) parameters; I am more interested in getting some error bounds when we use less parameters. Results like this exist for polynomial interpolation: en.wikipedia.org/wiki/Polyno… see Chapter 7.

Polynomial interpolation - Wikipedia

en.wikipedia.org

Tim Dockhorn · Oct 3, 2024 · 4:54 PM UTC

Tim Dockhorn

@timudk

3 Oct 2024

Replying to @isidentical

hahaha

Tim Dockhorn · Oct 19, 2022 · 12:57 PM UTC

Tim Dockhorn

@timudk

19 Oct 2022

Tim Dockhorn · Nov 21, 2023 · 9:52 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Replying to @cocktailpeanut @streamlit @EMostaque

python scripts/sampling/simple_video_sample.py --decoding_t 1 --version svd_image_decoder Runs with VRAM spike of 20GB on A100 (PyTorch 2.0.1+cu117)

171

Tim Dockhorn · Oct 28, 2021 · 12:39 AM UTC

Tim Dockhorn

@timudk

28 Oct 2021

@yubai01 et al. pointed out the similarity of BC to Dual Averaging (DA). Our main contribution is a refinement thereof: BC is a nonconvex counterpart of DA, and more importantly, DA itself is the generalized conditional gradient algorithm applied to a smoothened dual problem.

Tim Dockhorn · Jul 24, 2023 · 3:46 PM UTC

Tim Dockhorn

@timudk

24 Jul 2023

Replying to @tetraduzione @driainmurray

Thanks for everything Iain!!

Tim Dockhorn · Jan 7, 2020 · 11:23 AM UTC

Tim Dockhorn

@timudk

7 Jan 2020

(3/6) I show how to make training of CNFs more efficient by scheduling the numerical solver tolerances. The inspiration for this comes from inexact newton methods. I am currently working on a paper that will extent this work to "adaptive tolerance schedulers". Stay tuned!

Tim Dockhorn · Nov 21, 2023 · 10:18 PM UTC

Tim Dockhorn

@timudk

21 Nov 2023

Replying to @fofrAI

Yes, should decrease gradually up to some limit. Too many frames generally lead to repetition/ back-forth movements.

Tim Dockhorn · Oct 19, 2022 · 12:57 PM UTC

Tim Dockhorn

@timudk

19 Oct 2022

🔒 (i) Why DMs? In DP-SGD, the amount of injected noise also depends on the model size: more parameters, more noise! The denoiser, the learnable component in DMs, is less complex than the network learned by a GAN or the end-to-end sampling process of the DM itself. (6/n)

Tim Dockhorn · Oct 14, 2022 · 2:37 AM UTC

Tim Dockhorn

@timudk

14 Oct 2022

Replying to @timudk @PatrickKidger @karsten_kreis @ArashVahdat

Btw, GENIE and other recent DM acceleration works build on the DDIM ODE, which is considerably more easy to solve than the Probability Flow ODE. We discuss this in our paper but also see the excellent arxiv.org/abs/2206.00364 (3/n)