Vincent Sitzmann · Jun 8, 2026 · 4:28 PM UTC

Vincent Sitzmann

Pinned Tweet

Vincent Sitzmann

@vincesitzmann

Jun 8

Introducing MilliVid, our new method for long-context video generation! MilliVid creates videos that are consistent over long time spans, without using retrieval heuristics or 3D maps! (1/n) davidcharatan.com/millivid/#

402

54,501

Vincent Sitzmann · Jun 7, 2021 · 2:56 PM UTC

Vincent Sitzmann

@vincesitzmann

7 Jun 2021

In personal news, I’m thrilled to announce that I’ll be joining @MIT as tenure-track assistant professor in July 2022! My lab will investigate neural scene representations, inverse graphics, neural rendering, and their applications in vision, graphics, robotics, and AI! (1/n)

1,441

Vincent Sitzmann · Jun 18, 2020 · 12:17 AM UTC

Vincent Sitzmann

@vincesitzmann

18 Jun 2020

Excited to share our work on "Implicit Neural Representations with Periodic Activations" vsitzmann.github.io/siren We show how to fit complex signals, such as room-scale SDFs, video, & audio, and supervise implicit reps via their gradients to solve boundary value problems! (1/n)

220

910

Vincent Sitzmann · Jun 24, 2020 · 9:38 PM UTC

Vincent Sitzmann

@vincesitzmann

24 Jun 2020

We released the code for SIREN! vsitzmann.github.io/siren We also wrote a comprehensive Colab notebook with a no-frills implementation that reproduces image, audio, and poisson experiments, and explores initialization- and shift-invariance properties! colab.research.google.com/gi…

explore_siren.ipynb

Run, share, and edit Python notebooks

colab.research.google.com

162

622

Vincent Sitzmann · Apr 24, 2024 · 4:09 PM UTC

Vincent Sitzmann

@vincesitzmann

24 Apr 2024

Introducing “FlowMap”, the first self-supervised, differentiable structure-from-motion method that is competitive with conventional SfM like Colmap! cameronosmith.github.io/flow… IMO this solves a major missing piece for internet-scale training of 3D Deep Learning methods. 1/n

102

607

128,581

Vincent Sitzmann · Dec 10, 2021 · 4:59 PM UTC

Vincent Sitzmann

@vincesitzmann

10 Dec 2021

Introducing “Neural Descriptor Fields: SE(3)-Equivariant Object Representations for Manipulation”! yilundu.github.io/ndf/ (w/ video!) NDFs are an object representation for robotic manipulation enabling imitation of pick-and-place tasks with pose generalization guarantees (1/n)

554

Vincent Sitzmann · Dec 28, 2020 · 6:43 PM UTC

Vincent Sitzmann

@vincesitzmann

28 Dec 2020

Implicit neural representations have recently gotten a lot of attention. I have compiled a reading list that I give students to get started in this area, inspired by the awesome-computer-vision list with extra commentary & notes. Check it out! github.com/vsitzmann/awesome…

GitHub - vsitzmann/awesome-implicit-representations: A curated list of resources on implicit neural...

A curated list of resources on implicit neural representations. - vsitzmann/awesome-implicit-representations

github.com

126

558

Vincent Sitzmann · Jun 7, 2021 · 1:23 PM UTC

Vincent Sitzmann

@vincesitzmann

7 Jun 2021

Introducing "Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering"! vsitzmann.github.io/lfns (w/ video!) LFNs are the first fully implicit neural scene representation with real-time rendering, without post-processing / hybrid data-structures! (1/n)

117

530

Vincent Sitzmann · Nov 2, 2021 · 5:18 PM UTC

Vincent Sitzmann

@vincesitzmann

2 Nov 2021

I am hiring graduate students for my new lab at MIT, where I will start as faculty in July 2022! If you want to push what's possible with neural scene representations & inverse graphics please apply under: gradapply.mit.edu/eecs/apply… Deadline is Dec 15th!

111

529

Vincent Sitzmann · Jul 3, 2024 · 5:16 PM UTC

Vincent Sitzmann

@vincesitzmann

3 Jul 2024

Introducing Diffusion Forcing, a new way of training sequence generative models that unifies next-token prediction (think LLM) and full-sequence diffusion (think video diffusion models)! I’m super excited about this - it has a number of unique skills! (1/n)

Boyuan Chen

@BoyuanChen0

3 Jul 2024

Introducing Diffusion Forcing, which unifies next-token prediction (eg LLMs) and full-seq. diffusion (eg SORA)! It offers improved performance & new sampling strategies in vision and robotics, such as stable, infinite video generation, better diffusion planning, and more! (1/8)

519

64,437

Vincent Sitzmann · Aug 8, 2024 · 3:58 PM UTC

Vincent Sitzmann

@vincesitzmann

8 Aug 2024

Introducing Neural Jacobian Fields, robot 3D kinematic models learned only from vision! They can model & control robots from just a single RGB camera, even those w/ intractable kinematics & no embedded sensors such as soft, 3D-printed pneumatic hands! sizhe-li.github.io/publicati… 1/n

500

54,204

Vincent Sitzmann · Jun 8, 2023 · 5:03 PM UTC

Vincent Sitzmann

@vincesitzmann

8 Jun 2023

Introducing “FlowCam: Training Generalizable 3D Radiance Fields w/o Camera Poses via Pixel-Aligned Scene Flow”! We train a generalizable 3D scene representation self-supervised on datasets of raw videos, without any pre-computed camera poses or SFM! cameronosmith.github.io/flow… 1/n

468

88,512

Vincent Sitzmann · Aug 25, 2023 · 4:29 PM UTC

Vincent Sitzmann

@vincesitzmann

25 Aug 2023

Introducing “Diffusion with Forward Models”, 𝗮 𝗺𝗼𝗱𝗲𝗹 𝘁𝗵𝗮𝘁 𝗰𝗮𝗻 𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗲 𝗱𝗶𝘃𝗲𝗿𝘀𝗲, 𝗿𝗲𝗮𝗹 𝟯𝗗 𝘀𝗰𝗲𝗻𝗲𝘀 𝗳𝗿𝗼𝗺 𝗮 𝘀𝗶𝗻𝗴𝗹𝗲 𝗶𝗺𝗮𝗴𝗲, 𝘁𝗿𝗮𝗶𝗻𝗲𝗱 𝘄𝗶𝘁𝗵 𝗶𝗺𝗮𝗴𝗲𝘀 𝘄/𝗼 𝗮𝗻𝘆 𝟯𝗗 𝗱𝗮𝘁𝗮! diffusion-with-forward-model… 1/n

476

88,724

Vincent Sitzmann · Jun 2, 2022 · 1:18 PM UTC

Vincent Sitzmann

@vincesitzmann

2 Jun 2022

NeRFs will transform computer graphics. But we need to be able to edit them! In “Decomposing NeRF for Editing via Feature Field Distillation” we use Image and Image/Language foundation models for easy, query-based editing via language- and patch queries! pfnet-research.github.io/dis…

450

Vincent Sitzmann · Jun 18, 2020 · 10:42 PM UTC

Vincent Sitzmann

@vincesitzmann

18 Jun 2020

In personal news, I graduated Stanford with my thesis on "Self-supervised Scene Representation Learning"! My next stop will be a postdoc at MIT, with Josh Tenenbaum, Fredo Durand, and Bill Freeman, starting in August. If you're around & wanna talk neural scene reps, reach out!

434

Vincent Sitzmann · Nov 9, 2021 · 3:51 AM UTC

Vincent Sitzmann

@vincesitzmann

9 Nov 2021

For folks prepping their grad school documents right now, I published my SoPs for my PhD, my MSc, and my MSc scholarship here: github.com/vsitzmann/phd-mas… Definitely don't copy it (I'd probably write it differently today), but maybe it'll give you some inspiration for your app docs!

GitHub - vsitzmann/phd-master-application-docs: A collection of the application documents I used to...

A collection of the application documents I used to apply to universities in the US. - vsitzmann/phd-master-application-docs

github.com

364

Vincent Sitzmann · Nov 30, 2023 · 9:20 PM UTC

Vincent Sitzmann

@vincesitzmann

30 Nov 2023

I am looking to hire a PhD student with a background in representation theory and an interest in geometric deep learning *for vision*. If that is you, please apply to MIT (deadline Dec 15th) and mention me in your application, I would love to chat!

274

59,380

Vincent Sitzmann · Dec 21, 2023 · 12:26 AM UTC

Vincent Sitzmann

@vincesitzmann

21 Dec 2023

Introducing pixelSplat: feed-forward Gaussian splats from image pairs! Led by @DavidCharatan and @sizhe_lester_li, collaborating with @taiyasaki! We propose a memory-efficient, fast and editable alternative to pixelNeRF based on 3D Gaussian Splatting! davidcharatan.com/pixelsplat… 1/n

270

47,612

Vincent Sitzmann · Dec 9, 2022 · 3:49 PM UTC

Vincent Sitzmann

@vincesitzmann

9 Dec 2022

Considering a PhD and interested in differentiable rendering, self-supervised representation learning in vision, and 3D scene representations? Apply to MIT and consider my research group! scenerepresentations.org/ Application under gradapply.mit.edu/eecs/apply…!

223

Vincent Sitzmann · Jun 19, 2020 · 11:17 PM UTC

Vincent Sitzmann

@vincesitzmann

19 Jun 2020

Many people have asked us how SIREN is different from positional encodings (ReLU P.E.). First, SIREN fits complex signals, such as images, audio, video, etc. better - see video, website, paper! (1/n)

214

Vincent Sitzmann · Apr 27, 2023 · 4:58 PM UTC

Vincent Sitzmann

@vincesitzmann

27 Apr 2023

Methods such as PixelNeRF can synthesize novel views given few input images. However, they are limited to simple scenes and small baselines. In our CVPR paper, we present a method for high-quality novel view synthesis given only two distant observations: yilundu.github.io/wide_basel…

207

31,036

Vincent Sitzmann · Dec 7, 2021 · 3:34 PM UTC

Vincent Sitzmann

@vincesitzmann

7 Dec 2021

Our NeurIPS spotlight on using neural light fields instead of 3D neural fields for scene representation has been covered by MIT news! Neural light fields allow rendering without ray-marching. IMO, light fields are the way to go for learning at scale! news.mit.edu/2021/3-d-image-…

Technique enables real-time rendering of scenes in 3D

Researchers at MIT and elsewhere have demonstrated a novel technique that vastly increases the speed of rendering 3D scenes from images by using a neural network to reconstruct the 360-degree light...

news.mit.edu

201

Vincent Sitzmann · Jun 19, 2024 · 4:03 PM UTC

Vincent Sitzmann

@vincesitzmann

19 Jun 2024

Wow - we are honored that pixelSplat wins the "Best Paper Runner-Up" at CVPR 2024! Congratulations to @DavidCharatan and @sizhe_lester_li who made this happen in their first year of their PhD - you guys rock!! Also thanks @taiyasaki for the fun collab :)

Vincent Sitzmann

@vincesitzmann

21 Dec 2023

168

16,302

Vincent Sitzmann · Dec 24, 2021 · 1:40 PM UTC

Vincent Sitzmann

@vincesitzmann

24 Dec 2021

Welcome Ayush Tewari (@_atewari) and Krishna Murthy (@_krishna_murthy), who have joined MIT as post-docs! Thrilled to have you as colleagues :) Lots more cool work on Neural Rendering, Scene Representations, and ML for Physics to come :D

171

Vincent Sitzmann · Aug 14, 2025 · 3:08 PM UTC

Vincent Sitzmann

@vincesitzmann

14 Aug 2025

Sometime in the next few weeks, we will do an explainer video on world models, video gen models, and embodied intelligence. If you have any questions you'd like me to discuss, please post them in the replies!! First time I'm doing something like that, I hope it'll be interesting!

MIT CSAIL

@MIT_CSAIL

12 Aug 2025

Ask us your questions about embodied intelligence or AI systems that interact w/the world. We’re featuring a few in an upcoming explainer w/MIT prof. Vincent Sitzmann (@vincesitzmann). For more on his work: vincentsitzmann.com/

159

17,535

Vincent Sitzmann · Jun 27, 2025 · 4:25 PM UTC

Vincent Sitzmann

@vincesitzmann

27 Jun 2025

Our paper on learning controllable 3D robot models from vision is published in Nature! Huge congrats to Lester and the team, @annan__zhang, @BoyuanChen0, Hanna Matusik, Chao Liu, and Daniela Rus!! Learning joint world models for the environment & the agent is super exciting :)

Lester Li

@sizhe_lester_li

27 Jun 2025

Now in Nature! 🚀 Our method learns a controllable 3D model of any robot from vision, enabling single-camera closed-loop control at test time! This includes robots previously uncontrollable, soft, and bio-inspired, potentially lowering the barrier of entry to automation! Paper: nature.com/articles/s41586-0… (1/n)

152

20,115

Vincent Sitzmann · Sep 20, 2023 · 11:05 PM UTC

Vincent Sitzmann

@vincesitzmann

20 Sep 2023

If you have experience in generative modeling and differentiable rendering and are looking to join a fun team, I've recently co-founded a stealth startup in this space and we're looking for 1-2 ML experts still. Reach out w/ email & summary of what you've worked on in the past!

152

31,840

Vincent Sitzmann · May 5, 2024 · 9:31 PM UTC

Vincent Sitzmann

@vincesitzmann

5 May 2024

My student @DavidCharatan published all the code to make figures for both pixelSplat and FlowMap in the respective github repositories, and just published the Figma file that generates the figures in the paper as well: figma.com/file/WLHx9d6qDRol9…

FlowMap Figures

Created with Figma

figma.com

144

13,764

Vincent Sitzmann · Oct 31, 2025 · 5:08 PM UTC

Vincent Sitzmann

@vincesitzmann

31 Oct 2025

Our new method for diffusion stitching allows us to generate ultra-long video sequences that follow a long, pre-defined camera trajectory! All segments are generated in parallel (not auto-regressive) and so the model never generates walls that it has to later step through!

Chonghyuk (ND) Song @ndsong95

31 Oct 2025

Introducing Generative View Stitching (GVS), a non-autoregressive sampling method for length extrapolation of video diffusion models. GVS enables collision-free camera-guided video generation for predefined trajectories, including Oscar Reutersvärd's Impossible Staircase (1/9).

154

22,017

Vincent Sitzmann · Dec 14, 2023 · 4:47 PM UTC

Vincent Sitzmann

@vincesitzmann

14 Dec 2023

How can we learn to generate 3D scenes directly with diffusion models if we only have images, no ground-truth 3d scenes? Ayush, Tianwei and George will tell you at our poster “diffusion with Forward Models”, #202!

145

13,646

Vincent Sitzmann · Nov 23, 2021 · 10:01 PM UTC

Vincent Sitzmann

@vincesitzmann

23 Nov 2021

In other news, I have a first version of the website for my future MIT research group! scenerepresentations.org/ The name I have decided on for now is the "Scene Representation Group". There's even a logo! Thanks to the amazing @ludwigschubert for opinions & hands-on help :)

140

Vincent Sitzmann · Jan 25, 2024 · 5:30 PM UTC

Vincent Sitzmann

@vincesitzmann

25 Jan 2024

Excited to introduce DittoGym @ ICLR, in which we study the control of a neat new kind of robot: soft shape-shifters! This is work done by @SuningHuan44558 during his visit at my group at MIT, jointly with my student @BoyuanChen0! Project page: dittogym.github.io/ 1/n

144

29,963

Vincent Sitzmann · Apr 11, 2024 · 11:49 PM UTC

Vincent Sitzmann

@vincesitzmann

11 Apr 2024

My student Boyuan always joked he would open a Boba shop someday - he actually made it happen, and now students on our floor are always supplied with (free) Boba 😀

Boyuan Chen

@BoyuanChen0

11 Apr 2024

I quit PhD (for a day) and opened a boba shop at @MIT - Generative Boba! It’s a huge success - right next to our office so all the AI researchers are enjoying it. Checkout our boba diffusion algorithm in the poster to understand why boba generation is so important to @MIT_CSAIL !

140

24,547

Vincent Sitzmann · Nov 3, 2025 · 10:00 PM UTC

Vincent Sitzmann

@vincesitzmann

3 Nov 2025

Introducing XFactor: the first pose- and geometry-free method capable of true Novel View Synthesis (NVS). We re-think NVS and the concept of camera poses completely without concepts from multi-view geometry as a pure representation learning problem! mitchel.computer/xfactor/ (1/n)

151

9,078

Vincent Sitzmann · Sep 12, 2022 · 7:54 PM UTC

Vincent Sitzmann

@vincesitzmann

12 Sep 2022

Recordings of our CVPR 2022 workshop on neural fields are now public - check it out!

Srinath Sridhar @drsrinathsridha

12 Sep 2022

Neural fields are emerging as useful signal representations in computer vision & beyond. Our full-day introductory @CVPR tutorial on the topic is now public. Video: piped.video/PeRRp1cFuH4 Slides: drive.google.com/drive/folde… Web: neuralfields.cs.brown.edu/cv…

136

Vincent Sitzmann · Sep 22, 2023 · 5:20 PM UTC

Vincent Sitzmann

@vincesitzmann

22 Sep 2023

Thrilled to share that this paper was accepted to NeurIPS as a spotlight! Code coming soon as well!

Vincent Sitzmann

@vincesitzmann

25 Aug 2023

135

31,396

Vincent Sitzmann · Oct 12, 2023 · 3:27 PM UTC

Vincent Sitzmann

@vincesitzmann

12 Oct 2023

📢📢📢 Code for our 3D generative model that learns to generate 3D appearance and geometry from just a single image is out now! It's trained just from real-world multi-view images, and generates scenes directly w/o score distillation! github.com/ayushtewari/DFM/

134

16,988

Vincent Sitzmann · Jul 12, 2024 · 5:51 AM UTC

Vincent Sitzmann

@vincesitzmann

12 Jul 2024

We have released the code for Video Diffusion via 3D UNet and Temporal Attention trained with Diffusion Forcing! The results are really cool - the model can roll out far beyond the training horizon :) Thanks to our UROP @kiwhansong0 who is really outstanding!!

Boyuan Chen

@BoyuanChen0

12 Jul 2024

Diffusion Forcing Update: code & ckpt for 3D-UNet + Temporal Attention version is released thanks to my amazing undergrad mentee @kiwhansong0! See project github for more info. I also added suggested future directions to the our website boyuan.space/diffusion-forci…. Check them out!

132

16,119

Vincent Sitzmann · Jun 17, 2022 · 1:33 PM UTC

Vincent Sitzmann

@vincesitzmann

17 Jun 2022

At CVPR and interested in INRs / Neural Fields / Neural Scene Representations? Come to our tutorial on Monday! We'll cover fundamental techniques and latest advances in Neural Fields, and reflect on what's next together with exciting invited speakers! neuralfields.cs.brown.edu/cv… 1/4

CVPR 2022 Tutorial: Neural Fields in Computer Vision

neuralfields.cs.brown.edu

130

Vincent Sitzmann · Nov 23, 2021 · 9:40 PM UTC

Vincent Sitzmann

@vincesitzmann

23 Nov 2021

Our review article on neural fields is out! arxiv.org/abs/2111.11426 W/ @yiheng @yongyuanxi @psyth91 @orlitany Shiqin Yan (bit.ly/3l2obhK) Numair Khan (bit.ly/30VuRY6) @fedassa Sunny Li (bit.ly/3cJtrCa) @jtompkin @drsrinathsridha (equal advising). (1/n)

124

Vincent Sitzmann · Aug 16, 2021 · 9:11 PM UTC

Vincent Sitzmann

@vincesitzmann

16 Aug 2021

(1/4) Announcing the 2nd 3DReps Workshop at ICCV on "Learning 3D Representations for Shape and Appearance"! We will bring together researchers across disciplines, from vision to graphics, neuroscience, robotics, and geometry! ivl.cs.brown.edu/3DReps/

119

Vincent Sitzmann · Feb 11, 2025 · 8:23 PM UTC

Vincent Sitzmann

@vincesitzmann

11 Feb 2025

We wrote a new video diffusion paper! @kiwhansong0 and @BoyuanChen0 and co-authors did absolutely amazing work here. Apart from really working, the method of "variable-length history guidance" is really cool and based on some deep truths about sequence generative modeling....

Boyuan Chen

@BoyuanChen0

11 Feb 2025

Announcing Diffusion Forcing Transformer (DFoT), our new video diffusion algorithm that generates ultra-long videos of 800+ frames. DFoT enables History Guidance, a simple add-on to any existing video diffusion models for a quality boost. Website: boyuan.space/history-guidanc… (1/7)

121

13,009

Vincent Sitzmann · Oct 1, 2025 · 3:49 PM UTC

Vincent Sitzmann

@vincesitzmann

1 Oct 2025

Finally get to tweeting about this: @BoyuanChen0, my student co-advised with @RussTedrake, graduated recently! Boyuan has done groundbreaking work on video generative modeling and video as the "language" of robotics. He is off to OpenAI where I am sure he will do amazing things!

123

18,973

Vincent Sitzmann · Mar 19, 2025 · 1:32 PM UTC

Vincent Sitzmann

@vincesitzmann

19 Mar 2025

Great results - congrats to the team! I think that this is already very close to outperforming any differentiable-rendering based NVS method. While NVS has always been somewhat of a toy problem, I think pose-conditioned diffusion models plus large data have essentially solved it.

Jensen Zhou @jensenzhoujh

18 Mar 2025

Hi there, 🎉 We are thrilled to introduce Stable Virtual Camera, a generalist diffusion model designed to address the exciting challenge of Novel View Synthesis (NVS). With just one or a few images, it allows you to create a smooth trajectory video from any viewpoint you desire. We’re naming this model in tribute to the Virtual Camera cinematography technology. @StabilityAI 🏠 Project Page: stable-virtual-camera.github… 📄 Paper: stable-virtual-camera.github… 📃 Blog: stability.ai/news/introducin… 💻 Code: github.com/Stability-AI/stab… 🤗 Model Card: huggingface.co/stabilityai/s… 🚀 Gradio Demo: huggingface.co/spaces/stabil… 🎬 Video: piped.video/channel/UCLLlVDc…

113

14,168

Vincent Sitzmann · Apr 2, 2024 · 8:55 PM UTC

Vincent Sitzmann

@vincesitzmann

2 Apr 2024

David & Lester have just updated the pixelSplat code, now supporting 3-view training in addition to 2-view and with some improvements suggested to us by reviewers, including new pre-trained checkpoints! davidcharatan.com/pixelsplat… More exciting work coming very soon, stay tuned!

pixelSplat: 3D Gaussian Splats from Image Pairs

Novel view synthesis via feed-forward 3D Gaussian inference from two images.

davidcharatan.com

117

12,032

Vincent Sitzmann · Nov 13, 2021 · 7:38 PM UTC

Vincent Sitzmann

@vincesitzmann

13 Nov 2021

Concerned about terminology of neural implicit reps / neural coordinate-based reps / etc.? We have a review article forthcoming (on arXiv soon!) that argues that the appropriate term is "neural field", i.e., neural light field, neural signed distance field, etc. (1/n)

114

Vincent Sitzmann · Jun 15, 2024 · 12:53 AM UTC

Vincent Sitzmann

@vincesitzmann

15 Jun 2024

Introducing Neural Isometries where we show how to exploit equivariant ML even for transformations that are “nasty”, e.g. non-compact, projective, nonlinear, or not even a group action! arxiv.org/abs/2405.19296 Collab w/ the amazing Tommy Mitchel @twmitchel and Mike Taylor! 1/n

Neural Isometries: Taming Transformations for Equivariant ML

Real-world geometry and 3D vision tasks are replete with challenging symmetries that defy tractable analytical expression. In this paper, we introduce Neural Isometries, an autoencoder framework...

arxiv.org

115

16,004

Vincent Sitzmann · Jun 9, 2021 · 3:59 PM UTC

Vincent Sitzmann

@vincesitzmann

9 Jun 2021

I have made a slideshare account and will start uploading slides for some of my presentations / talks / courses, starting with the slides for the introduction to novel view synthesis @siggraph 2021. Feel free to re-use them! Find them here: slideshare.net/VincentSitzma…

Vincent Sitzmann · Oct 22, 2025 · 11:49 PM UTC

Vincent Sitzmann

@vincesitzmann

22 Oct 2025

Strongly agree with Jon - thanks for putting it together so concisely. IMO there is no point in either this form or the NeurIPS checklist - they come from a good place but ultimately serve only to make the whole experience even more cumbersome and stressful, with no upside.

Jon Barron

@jon_barron

22 Oct 2025

It looks like @CVPR has implemented a new mandatory "Compute Reporting Form" that must be submitted alongside any paper submission. Though I am sympathetic to the motivations for this change, I am opposed to it for a variety of reasons:

105

18,911

Vincent Sitzmann · Sep 22, 2023 · 5:20 PM UTC

Vincent Sitzmann

@vincesitzmann

22 Sep 2023

This paper was accepted to NeurIPS! More stuff coming soon :)

Vincent Sitzmann

@vincesitzmann

8 Jun 2023

103

15,885

Vincent Sitzmann · Nov 5, 2024 · 9:47 AM UTC

Vincent Sitzmann

@vincesitzmann

5 Nov 2024

Really happy to see this study! Always wanted to do something like this myself, if only to support calming words to grad students: current-gen generative models have nothing to do with intelligence, and AI research remains fascinating and unsolved!

Bingyi Kang

@bingyikang

5 Nov 2024

Curious whether video generation models (like #SORA) qualify as world models? We conduct a systematic study to answer this question by investigating whether a video gen model is able to learn physical laws. Three are three key messages to take home: 1⃣The model generalises perfectly for in-distribution data, but fails to do out-of-distribution generalization. For combinatorial scenarios, scaling law is observed. 2⃣The models fail to abstract general rules and instead tries to mimic the closest training example. 3⃣The model prioritizes different attributes when referencing training data: color > size > velocity > shape. This work is a joint effort with our outstanding intern @YangYue_THU. Paper: arxiv.org/abs/2411.02385 Webpage: phyworld.github.io/

102

9,841

Vincent Sitzmann · Dec 14, 2023 · 8:54 PM UTC

Vincent Sitzmann

@vincesitzmann

14 Dec 2023

For anyone at NeurIPS: at our startup for 3D asset generation for gaming, we are still looking for one Research Scientist and one ML Engineer with a background in 3D generation, mesh processing, animation, etc. Please reach out and let's chat if that is you!

102

24,172

Vincent Sitzmann · Jan 18, 2021 · 6:39 PM UTC

Vincent Sitzmann

@vincesitzmann

18 Jan 2021

Our NeurIPS 2020 paper "MetaSDF" shows how to rapidly fit neural implicit representations with gradient-based meta-learning. We wrote a Colab with a stand-alone implementation of MAML, Siren, and ReLU MLPs so you can jump right in and easily extend it! colab.research.google.com/gi…

MetaSDF.ipynb

Run, share, and edit Python notebooks

colab.research.google.com

Vincent Sitzmann · Dec 1, 2024 · 10:18 PM UTC

Vincent Sitzmann

@vincesitzmann

1 Dec 2024

If you are looking to do a PhD on inverse graphics, 3D computer vision, differentiable rendering, etc, please apply to Ayush's lab at the University of Cambridge! He is brilliant, very patient, and a kind human :)

Ayush Tewari @_atewari

1 Dec 2024

I am looking for graduate students for my new lab at the University of Cambridge! Join me to understand and build models of visual perception.

11,950

Vincent Sitzmann · Oct 7, 2024 · 5:06 PM UTC

Vincent Sitzmann

@vincesitzmann

7 Oct 2024

This looks great! I think in general, it seems that any reconstruction problem that is reasonably well-determined given the input data can be solved with supervised learning for in-distribution test data (where it's of course interesting to ask what constitutes in-distribution)!

Junyi Zhang

@junyi42

7 Oct 2024

Excited to share MonST3R! -- a simple way to estimate geometry from unposed video of dynamic scene We achieve competitive results on several downstreams (video depth, camera pose) and believe this is a promising step toward feed-forward 4D reconstruction monst3r-project.github.io

102

9,737

Vincent Sitzmann · Nov 21, 2023 · 10:00 PM UTC

Vincent Sitzmann

@vincesitzmann

21 Nov 2023

Me and some members of my research group (@_atewari, @GCazenavette, @omcamsmith) will be at NeurIPS - talk to us about our work on training 3D diffusion models only from images (diffusion-with-forward-model…) and pixelNeRF without camera poses (scenerepresentations.org/pub…)! #NeurIPS2023

13,776

Vincent Sitzmann · Oct 17, 2021 · 1:03 AM UTC

Vincent Sitzmann

@vincesitzmann

17 Oct 2021

Come to our ICCV workshop on 3D representations - lots of fantastic talks and panels, and a poster session with some of the coolest recent work on 3D reps and neural rendering!

Srinath Sridhar @drsrinathsridha

17 Oct 2021

We are organizing the Second #3DReps workshop at @ICCV_2021 to bring together researchers working on learned 3D representations. We have an amazing lineup of invited speakers and posters. Please join us tomorrow (Oct 17)! Full schedule here: ivl.cs.brown.edu/3DReps

Vincent Sitzmann · Mar 20, 2020 · 4:38 PM UTC

Vincent Sitzmann

@vincesitzmann

20 Mar 2020

Amazing work by Ben Mildenhall et al: NeRF! They clearly demonstrate that implicit representations can achieve photorealism, by using a volume renderer instead of a raymarcher and with a smart positional encoding. Let's tackle generalization next! matthewtancik.com/nerf

Vincent Sitzmann · Dec 7, 2024 · 10:41 AM UTC

Vincent Sitzmann

@vincesitzmann

7 Dec 2024

Super exciting work by friends at MIT! Auto-regressive video generative models are the way to go :) Stay tuned, we are cooking, too - if you're at NeurIPS, make sure to come to our Diffusion Forcing poster!

Tianwei Yin

@TianweiY

7 Dec 2024

Video diffusion models generate high-quality videos but are too slow for interactive applications. We @MIT_CSAIL @AdobeResearch introduce CausVid, a fast autoregressive video diffusion model that starts playing the moment you hit "Generate"! A thread 🧵

10,993

Vincent Sitzmann · Oct 19, 2025 · 4:58 PM UTC

Vincent Sitzmann

@vincesitzmann

19 Oct 2025

Wish I could be at ICCV but alas can't make it due to a wedding :( Have fun everyone and see you at @NeurIPSConf 2025!

10,812

Vincent Sitzmann · Mar 20, 2022 · 3:23 PM UTC

Vincent Sitzmann

@vincesitzmann

20 Mar 2022

At the Dagstuhl Seminar on morphable models and beyond - looking forward to finally meeting graphics folks in person again! ⁦@dagstuhl⁩

Vincent Sitzmann · Feb 1, 2025 · 8:34 PM UTC

Vincent Sitzmann

@vincesitzmann

1 Feb 2025

Really cool! Loved the below paragraph in particular, a cool way to think about gradients!

Jeremy Bernstein @jxbz

31 Jan 2025

I ran this experiment to show that duality-based optimizers like Muon are not only *fast* but also *numerically different* to vanilla gradient descent. In particular, the weights move a qualitatively different amount in the same number of training steps. (1/4)

14,508

Vincent Sitzmann · May 9, 2024 · 2:57 PM UTC

Vincent Sitzmann

@vincesitzmann

9 May 2024

We are excited to have Andreessen Horowitz on board, who funded our seed round with $5M!

GamesBeat @GamesBeat

9 May 2024

.@yellow_3d_ has raised $5M from @a16zGames to further develop its Gen AI-powered 3D character modeling tool venturebeat.com/games/yellow…

23,960

Vincent Sitzmann · Apr 27, 2022 · 4:34 PM UTC

Vincent Sitzmann

@vincesitzmann

27 Apr 2022

If you're at Eurographics and interested in inverse graphics, neural fields, and neural rendering, check out the STAR presentations happening tomorrow: 1. Neural Fields in Visual Computing and Beyond 2. Advances in Neural Rendering eg2022.univ-reims.fr/pr-star…

Vincent Sitzmann · Dec 3, 2020 · 7:41 PM UTC

Vincent Sitzmann

@vincesitzmann

3 Dec 2020

Fantastic neural rendering results leveraging SIREN! They leverage a SIREN in a NERF-like neural rendering framework. The SIREN is conditioned on a random latent code z, and a Holo-GAN like adversarial loss provides the training signal.

Eric Chan

@ericryanchan

3 Dec 2020

Introducing pi-GAN: Periodic Implicit GANs for 3D-Aware Image Synthesis. Trained on unlabeled images, π-GAN generates 3D representations and synthesizes images from arbitrary poses. Website: marcoamonteiro.github.io/pi-… @monteiroamarco Petr Kellnhofer @GordonWetzstein @jiajunwu_cs

Vincent Sitzmann · Jun 5, 2019 · 2:05 PM UTC

Vincent Sitzmann

@vincesitzmann

5 Jun 2019

Check out Scene Representation Networks: piped.video/6vMEBWD8O20 Our new continuous 3D-aware scene representation reconstructs appearance and geometry just from posed images, generalizes across scenes for single-shot reconstruction, and naturally handles non-rigid deformation!

Tomasz Malisiewicz @quantombone

5 Jun 2019

Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations. The idea of differentiable ray-marching looks promising! arxiv.org/abs/1906.01618 #computervision

Vincent Sitzmann · Jun 26, 2021 · 9:35 AM UTC

Vincent Sitzmann

@vincesitzmann

26 Jun 2021

This project, led by UBC's Daniel Rebain, was a lot of fun - we propose a novel parameterization of 3D scene geometry via the medial field, which intuitively parameterizes the local "thickness" of a 3D shape. This has a variety of cool applications in physics & graphics!

@_akhaliq

8 Jun 2021

Deep Medial Fields pdf: arxiv.org/pdf/2106.03804.pdf abs: arxiv.org/abs/2106.03804 an implicit representation of the local thickness, that expands the capacity of implicit representations for 3D geometry

Vincent Sitzmann · Sep 12, 2024 · 4:38 PM UTC

Vincent Sitzmann

@vincesitzmann

12 Sep 2024

We're beginning the beta-testing phase of our first product, a text-to-3D-character mesh generator. More cool stuff coming soon :)

Daz 3D

@daz3d

12 Sep 2024

Announcing the AI Character Shape Generator by @yellow_3d_ 🟡 This handy Daz Studio plug-in allows you to generate character mesh shapes from simple text prompts, transforming the character creation process.

8,877

Vincent Sitzmann · Dec 13, 2023 · 11:08 PM UTC

Vincent Sitzmann

@vincesitzmann

13 Dec 2023

Find us at poster 226 where Cameron is presenting his cool work “FlowCam”!

10,232

Vincent Sitzmann · Oct 4, 2022 · 11:16 PM UTC

Vincent Sitzmann

@vincesitzmann

4 Oct 2022

Very cool work by friends at Google on diffusion models for novel view synthesis! Goes to show, no ray-marching or volume rendering necessary... It's all light fields ;)

Daniel Watson @watson_nn

4 Oct 2022

Excited to announce our work on novel view synthesis with diffusion models! Our model can lift a single 2d image into 3d. 3d-diffusion.github.io Joint work w/ @wchan212 @rmbrualla @hojonathanho @taiyasaki @mo_norouzi

Vincent Sitzmann · Aug 21, 2024 · 10:20 PM UTC

Vincent Sitzmann

@vincesitzmann

21 Aug 2024

I'm very excited to be part of a new NSF center: HAND, which is focused on developing the next generation of dexterous robots! A key motivation for vision for me has always been embodied, intelligent agents, and I think that vision & robots are now closer than ever before!

HAND ERC

@HAND_ERC

21 Aug 2024

The @NSF announced the Human AugmentatioN via Dexterity (HAND) Engineering Research Center (ERC)! Our Center will revolutionize how #robots augment human labor using dexterous robot hands with AI-powered skills and intuitive interfaces. new.nsf.gov/news/nsf-announc…

5,789

Vincent Sitzmann · Jun 13, 2024 · 1:00 PM UTC

Vincent Sitzmann

@vincesitzmann

13 Jun 2024

I'm thrilled that many members of the Scene Representation Group will be at CVPR! Catch them and chat about their work :) I shout them out below in no particular order:

14,151

Vincent Sitzmann · Jul 9, 2021 · 3:17 PM UTC

Vincent Sitzmann

@vincesitzmann

9 Jul 2021

In this project, led by Yunzhu and Shuang Li, we explored the application of 3D implicit representations to visuomotor control. The idea is to use the latent of a conditional 3D implicit representation as a state representation, and use the resulting state space for control (1/n)

Yunzhu Li

@YunzhuLiYZ

9 Jul 2021

Introducing “3D Neural Scene Representations for Visuomotor Control”! bit.ly/2VoHt71 (w/ video!) We combine implicit neural scene representations with intuitive physics models, enabling visuomotor control of dynamic 3D scenes from out-of-distribution viewpoints. (1/7)

Vincent Sitzmann · Jun 24, 2021 · 8:13 AM UTC

Vincent Sitzmann

@vincesitzmann

24 Jun 2021

This is a great example of how neural implicit representations enable principled implementations of symmetries - here, rotation and translation equivariance. Cool work!

@_akhaliq

23 Jun 2021

Alias-Free GAN pdf: nvlabs-fi-cdn.nvidia.com/ali… project page: nvlabs.github.io/alias-free-… networks match the FID of StyleGAN2 but differ dramatically in their internal representations, and they are fully equivariant to translation and rotation even at subpixel scales

Vincent Sitzmann · Nov 27, 2022 · 6:54 PM UTC

Vincent Sitzmann

@vincesitzmann

27 Nov 2022

I will be at @NeurIPSConf in New Orleans - hit me up if you'd like to chat about differentiable rendering, 3D representation learning, neural scene representation, etc :)

Vincent Sitzmann · Jul 31, 2024 · 4:14 PM UTC

Vincent Sitzmann

@vincesitzmann

31 Jul 2024

Some students have approached me for guidance on how to deal with a review of their NeurIPS submission that was clearly LLM generated. Apart from sending a message to the AC, I was just looking through all the NeurIPS guidelines (Reviewer Guide, AC Guide), and couldn't find...

17,589

Vincent Sitzmann · Nov 12, 2021 · 9:33 PM UTC

Vincent Sitzmann

@vincesitzmann

12 Nov 2021

Check out our NeurIPS paper! This was a cool little project. Two key ideas are generative modeling of multi-modal data via neural implicit representations / neural fields, and auto-decoder based manifold learning instead of GANs/VAEs. @du_yilun and @katie_m_collins killed it!

Yilun Du @du_yilun

12 Nov 2021

Check out generative manifold learning! yilundu.github.io/gem/ We show how to capture the manifold of any signal modality (including cross-modal ones), by representing each signal as a neural field. We can then traverse the latent space between signals and generate new samples!

Vincent Sitzmann · Oct 19, 2022 · 2:10 PM UTC

Vincent Sitzmann

@vincesitzmann

19 Oct 2022

Very cool work - it is remarkable how much more robust normal estimation is compared to depth estimation, I think there is more to be done here!

Gwangbin Bae @BaeGwangbin

18 Oct 2022

Introducing IronDepth, a framework that uses surface normal and its uncertainty to iteratively refine the predicted depth map (to appear in #bmvc2022). Visit baegwangbin.github.io/IronDe… for more detail. Joint work with @IgnasBud and @robertocipolla.

Vincent Sitzmann · Jul 21, 2022 · 4:19 PM UTC

Vincent Sitzmann

@vincesitzmann

21 Jul 2022

Last week, I was at the #ICVSS2022 summer school in Sicily. It was my first summer school, and it was so much fun and very inspiring! It was amazing to chat with students, and also to meet, exchange ideas, and receive great advice from fellow faculty...

Vincent Sitzmann · Jun 18, 2020 · 12:41 AM UTC

Vincent Sitzmann

@vincesitzmann

18 Jun 2020

Another one on *generalizing* implicit representations: “MetaSDF: Meta-Learning Signed Distance Functions” vsitzmann.github.io/metasdf We identify a key connection between learning of implicit function spaces and meta-learning, and reconstruct SDFs faster & more accurately! (1/n)

Vincent Sitzmann · Oct 1, 2025 · 11:58 PM UTC

Vincent Sitzmann

@vincesitzmann

1 Oct 2025

I've always been a big fan of the work of Danijar and co-authors on the "Dreamer" line of publications. I'm hence hyped that Diffusion Forcing plays a part in their most recent paper - exciting work, as always!

Danijar Hafner

@danijarh

30 Sep 2025

Replying to @danijarh

▶️ Shortcut forcing builds on diffusion forcing and shortcut models, training a sequence model with both the noise level and requested step size as inputs This enables much faster frame-by-frame generations than diffusion forcing, without needing a distillation phase ⏱️

10,132

Vincent Sitzmann · Dec 13, 2024 · 4:47 PM UTC

Vincent Sitzmann

@vincesitzmann

13 Dec 2024

If you’re looking for me at NeurIPS, I am sitting in my AirBnB with a sore throat and blocked sinuses… of course I get sick immediately upon traveling again 😭

10,401

Vincent Sitzmann · Jun 7, 2021 · 3:00 PM UTC

Vincent Sitzmann

@vincesitzmann

7 Jun 2021

Meanwhile, I am already at MIT, and will be looking for incoming graduate students and postdocs who want to work on the big questions in this area :) You can find more on my research under vsitzmann.github.io - if that's you, get in touch, and let’s chat! (2/n)

Vincent Sitzmann · Jul 13, 2024 · 9:11 AM UTC

Vincent Sitzmann

@vincesitzmann

13 Jul 2024

We have released the code for our paper "Neural Isometries"! All the links here: scenerepresentations.org/pub… Amazing work by @twmitchel :)

9,592

Vincent Sitzmann · Feb 28, 2024 · 6:06 PM UTC

Vincent Sitzmann

@vincesitzmann

28 Feb 2024

Peter's paper on intrinsic image est. with diffusion models was accepted to CVPR! The core insight is that the task of intrinsic image decomposition is inherently uncertain, and that hence, diffusion models yield significant improvements. Fun collab! Web: peter-kocsis.github.io/Intri…

Matthias Niessner

@MattNiessner

27 Feb 2024

Our group has fourteen papers accepted at #CVPR'2024! Exciting topics: lots of diffusion & transformers focusing on generative AI for image synthesis, geometry generation, and many more - check it out! I'm so proud of everyone involved - let's go 🚀🚀 niessnerlab.org/publications…

13,106

Vincent Sitzmann · Oct 18, 2024 · 3:26 PM UTC

Vincent Sitzmann

@vincesitzmann

18 Oct 2024

This is really cool work!

Jihyeon Je @JihyeonJe

11 Oct 2024

Symmetries are everywhere — from butterfly’s wings to Greek temples. But detecting them in noisy data? That’s a challenge. 🦋🏛 Our #SIGGRAPHAsia2024 paper, Robust Symmetry Detection via Riemannian Langevin Dynamics, tackles this: symmetry-langevin.github.io/ 🧵(1/n)

8,634

Vincent Sitzmann · Apr 25, 2024 · 6:19 PM UTC

Vincent Sitzmann

@vincesitzmann

25 Apr 2024

Really cool work from Anagh, with David & Gordon from my old lab - amazing to see light propagate in 3D :)

Anagh Malik @anagh_malik

10 Apr 2024

📢📢📢 A pulse of light takes ~3ns to pass through a Coke bottle—100 million times less than it takes you to blink. Our work lets you fly around this 3D scene at the speed of light, revealing propagating wavefronts of light that are invisible to the naked eye—from any viewpoint! Flying with Photons: Rendering Novel Views of Propagating Light 🌐 Website: anaghmalik.com/FlyingWithPho… ⌨️ Code: github.com/anaghmalik/Flying… w/ @NoahJuravsky, @Po_lhr, @GordonWetzstein, @kyroskutulakos and @DaveLindell

14,762

Vincent Sitzmann · May 9, 2024 · 2:50 PM UTC

Vincent Sitzmann

@vincesitzmann

9 May 2024

Thrilled to share the first milestone of our startup @yellow_3d_! We are set on dramatically lowering the barrier-to-entry of making video games and telling stories using 3D content in general! 1/n

You’re unable to view this Post because this account owner limits who can view their Posts.

8,990

Vincent Sitzmann · Nov 10, 2023 · 1:59 AM UTC

Vincent Sitzmann

@vincesitzmann

10 Nov 2023

Cool paper by Ana, co-advised with Justin Solomon - her first MIT paper, and a great one at that :)

Ana Dodik

@ana_dodik

9 Nov 2023

Can neural fields help unlock new understandings of an old geometry problem? Excited to announce our latest SIGGRAPH Asia work: Variational Barycentric Coordinates! 🧵 - with @OdedStein @vincesitzmann @JustinMSolomon

13,350

Vincent Sitzmann · Feb 17, 2025 · 6:37 PM UTC

Vincent Sitzmann

@vincesitzmann

17 Feb 2025

Check out the Huggingface demo that @kiwhansong0 and @BoyuanChen0 made - you can play with Diffusion Forcing and history guidance to get a feeling for how, together, they allow generating ultra-long, consistent video! huggingface.co/spaces/kiwhan…

Diffusion Forcing Transformer - a Hugging Face Space by kiwhansong

Generate a video from any number of images

huggingface.co

10,873

Vincent Sitzmann · Mar 1, 2022 · 7:09 PM UTC

Vincent Sitzmann

@vincesitzmann

1 Mar 2022

We’ve put a lot of time into this review paper - it’s likely useful to you if you’re writing on neural fields!

Srinath Sridhar @drsrinathsridha

1 Mar 2022

If you're working on a neural field/coordinate-based neural net paper for @eccvconf, you may want to use our review to help with your related work section. #eccv2022 #neuralfields @neural_fields

Vincent Sitzmann · Oct 18, 2024 · 3:14 PM UTC

Vincent Sitzmann

@vincesitzmann

18 Oct 2024

A thread and video by MIT CSAIL about our Diffusion Forcing paper!

MIT CSAIL

@MIT_CSAIL

17 Oct 2024

Sequence models have skyrocketed in popularity for their ability to analyze data & predict what to do next. MIT’s "Diffusion Forcing" method combines the strengths of next-token prediction (like w/ChatGPT) & video diffusion (like w/Sora), training neural networks to handle corrupted data while predicting the next steps. This flexible, reliable sequence model helps produce higher-quality artificial videos and guides more precise decision-making for robots & AI agents: bit.ly/3BK2wWC

5,776

Vincent Sitzmann · Jul 1, 2024 · 4:31 PM UTC

Vincent Sitzmann

@vincesitzmann

1 Jul 2024

Amazing work, congrats, Congyue!! We love it :)

Congyue Deng @CongyueD

1 Jul 2024

Our paper "Zero-Shot Image Feature Consensus with Deep Functional Maps" is accepted at #ECCV2024! @eccvconf Want better image correspondences with noisy and inaccurate features? Let's go to the spectral space with Laplacian eigenfunctions! ArXiv: arxiv.org/abs/2403.12038

11,393

Vincent Sitzmann · Apr 9, 2020 · 4:33 PM UTC

Vincent Sitzmann

@vincesitzmann

9 Apr 2020

Neural Rendering offers new approaches to computer graphics, inverse graphics, and computer vision. In our state-of-the-art report, we compiled an overview over this emerging field - check it out! arxiv.org/pdf/2004.03805.pdf #EGEV2020 #NeuralRendering

Vincent Sitzmann · Dec 9, 2021 · 8:37 PM UTC

Vincent Sitzmann

@vincesitzmann

9 Dec 2021

Come join me in the gather hall 6: representation learning at poster A0 to chat about Light Field Networks / neural light fields! I just joined, and it's basically just me... nips.cc/virtual/2021/poster/…

Vincent Sitzmann · Jul 19, 2024 · 12:33 PM UTC

Vincent Sitzmann

@vincesitzmann

19 Jul 2024

If you are interested in geometry processing, I highly recommend applying to Silvia's lab - she is brilliant and kind and will be an amazing adviser :)

This Post is from an account that no longer exists.

7,961

Vincent Sitzmann · Oct 10, 2024 · 4:31 PM UTC

Vincent Sitzmann

@vincesitzmann

10 Oct 2024

I'm planning my trip to NeurIPS right now - excited to meet everyone again after (sadly) having to miss ECCV for visa reasons...

7,275

Vincent Sitzmann · Dec 8, 2019 · 5:50 PM UTC

Vincent Sitzmann

@vincesitzmann

8 Dec 2019

Scene Representation Networks receive an honorable mention in the „Outstanding New Directions“ category! I’m super happy and grateful that folks find this line of work promising.

Hugo Larochelle

@hugo_larochelle

8 Dec 2019

Want to know which NeurIPS papers were selected for an award, and how the selection was done? Check out our latest blog post on the subject: medium.com/@NeurIPSConf/neur…

Vincent Sitzmann · Sep 23, 2022 · 8:13 AM UTC

Vincent Sitzmann

@vincesitzmann

23 Sep 2022

Awesome work! It’s really great to do such a principled study to generate a reliable empirical insight that others can build on 😊 Also wonder if we’ll see more work like this going forward following the establishment of @TmlrOrg !

Andrea Tagliasacchi @CVPR @taiyasaki

23 Sep 2022

📢📢📢 𝗔𝘁𝘁𝗲𝗻𝘁𝗶𝗼𝗻 𝗕𝗲𝗮𝘁𝘀 𝗖𝗼𝗻𝗰𝗮𝘁𝗲𝗻𝗮𝘁𝗶𝗼𝗻 𝗳𝗼𝗿 𝗖𝗼𝗻𝗱𝗶𝘁𝗶𝗼𝗻𝗶𝗻𝗴 𝗡𝗲𝘂𝗿𝗮𝗹 𝗙𝗶𝗲𝗹𝗱𝘀 We ran A LOT of experiments to find the best way to make neural fields generalize... so you don’t have to! arxiv.org/abs/2209.10684

Vincent Sitzmann · Sep 20, 2021 · 9:10 AM UTC

Vincent Sitzmann

@vincesitzmann

20 Sep 2021

Congrats again, Sergey - awesome to collaborate w/ you, Stanford, and the other folks at TRI!

Sergey Zakharov

@ZakharovSergeyN

16 Sep 2021

Proud to announce that our paper “Single-Shot Scene Reconstruction” is accepted to #CoRL2021! We use transformers and implicit representations to infer a fully editable 3D scene from a single image. Collaboration between @ToyotaResearch, @Stanford and @MIT.

Vincent Sitzmann · May 1, 2021 · 6:21 AM UTC

Vincent Sitzmann

@vincesitzmann

1 May 2021

Really great work - equivariance in implicit representations has been a long-standing question for me, and this is significant progress!

Andrea Tagliasacchi @CVPR @taiyasaki

30 Apr 2021

📢📢📢 Introducing "Vector Neurons" Want a network (and latent space) that act by construction in an equivariant way w.r.t. SO(3) transformations? All you need is to do is to generalize the scalar non-linearity to a vector one (e.g. Vector ReLU) cs.stanford.edu/~congyue/vnn…

The Vector Neuron networks provide equivariance to rotations: if the input point cloud rotates, the features also rotate (by construction). Overall, this removes the need for SO(3) data augmentations from 3D deep learning pipelines.

ALT The Vector Neuron networks provide equivariance to rotations: if the input point cloud rotates, the features also rotate (by construction). Overall, this removes the need for SO(3) data augmentations from 3D deep learning pipelines.