Siddharth Karamcheti · Jun 18, 2025 · 5:16 PM UTC

Siddharth Karamcheti

Pinned Tweet

Siddharth Karamcheti

@siddkaramcheti

18 Jun 2025

Thrilled to share that I'll be starting as an Assistant Professor at Georgia Tech (@ICatGT / @GTrobotics / @mlatgt) in Fall 2026. My lab will tackle problems in robot learning, multimodal ML, and interaction. I'm recruiting PhD students this next cycle – please apply/reach out!

565

61,394

Siddharth Karamcheti · Jul 23, 2020 · 5:17 AM UTC

Siddharth Karamcheti

@siddkaramcheti

23 Jul 2020

Since getting academic access, I’ve been thinking about GPT-3’s applications to grounded language understanding — e.g. for robotics and other embodied agents. In doing so, I came up with a new demo: Objects to Affordances: “what can I do with an object?” cc @gdb

479

Siddharth Karamcheti · Feb 27, 2023 · 5:02 PM UTC

Siddharth Karamcheti

@siddkaramcheti

27 Feb 2023

How can we use language supervision to learn better visual representations for robotics? Introducing Voltron: Language-Driven Representation Learning for Robotics! Paper: arxiv.org/abs/2302.12766 Models: github.com/siddk/voltron-rob… Evaluation: github.com/siddk/voltron-eva… 🧵👇(1 / 12)

ALT Voltron Framework – Balancing language conditioning and generation to shape visual representation learning.

390

146,128

Siddharth Karamcheti · Aug 24, 2021 · 3:49 PM UTC

Siddharth Karamcheti

@siddkaramcheti

24 Aug 2021

We're excited to open-source Mistral 🚀 - a codebase for accessible large-scale LM training, built as part of Stanford's CRFM (crfm.stanford.edu/). We're releasing 10 GPT-2 Small & Medium models with different seeds & 600+ checkpoints per run! github.com/stanford-crfm/mis… [1/4]

101

365

Siddharth Karamcheti · Oct 20, 2021 · 7:32 PM UTC

Siddharth Karamcheti

@siddkaramcheti

20 Oct 2021

Thrilled to announce I'm joining @huggingface 🤗 as a research intern while doing my PhD! Step 1: scalable training that's accessible and transparent w/ @StasBekman and Thomas Wang. Step 2+: multimodality, robotics, RL! Huge thanks to @Thom_Wolf and team for the opportunity!

286

Siddharth Karamcheti · Feb 13, 2024 · 7:05 PM UTC

Siddharth Karamcheti

@siddkaramcheti

13 Feb 2024

What design choices matter when developing a visually-conditioned language model (VLM)? Check out our paper – Prismatic VLMs – and open-source training code, evaluation suite, and 42 pretrained VLMs at the 7B-13B scale! 📜 arxiv.org/abs/2402.07865 ⚙️ + 🤗 github.com/TRI-ML/prismatic-…

Our investigation of different axes for developing visually-conditioned language models (VLMs) spans four different axes.

[Top Left] 1. Optimization procedure - should we freeze model components, or do we need multi-stage training?

[Bottom Left] 2. Image processing and visual representations - how should we choose pretrained representations?

[Top Right] 3. Language Models - how do base or instruct-tuned LMs affect performance? Does co-training on language-only data help?

[Bottom Right] 4. Scaling Properties - are we undertraining our models? What type of added data helps?

194

62,015

Siddharth Karamcheti · Jun 28, 2022 · 6:26 PM UTC

Siddharth Karamcheti

@siddkaramcheti

28 Jun 2022

Super honored (and very embarrassed) that @karpathy took the time to look at some of our code and fix an inefficiency (*cough*, bug, *cough*) I introduced 😅. Loving the @huggingface open source community today 🤗!

Julien Chaumond

@julien_c

27 Jun 2022

I'm gonna memorize commit hash `e02037b3524686b57c5a861ea49ac751f15568af` forever ❤️❤️🔥

160

Siddharth Karamcheti · Jul 5, 2021 · 6:38 PM UTC

Siddharth Karamcheti

@siddkaramcheti

5 Jul 2021

Incredibly excited (and still a bit in shock) that our #ACL2021 paper with the amazing @RanjayKrishna @drfeifei and @chrmanning won an Outstanding Paper award! This paper has a fun story that doesn’t quite fit in 8 pages; blog post, paper, and all code up soon!

Sameer Singh @sameer_

5 Jul 2021

The best paper awards for ACL 2021 are out! @aclmeeting #NLProc #ACL2021 2021.aclweb.org/program/acce…

155

Siddharth Karamcheti · Oct 13, 2020 · 6:47 PM UTC

Siddharth Karamcheti

@siddkaramcheti

13 Oct 2020

How do we build adaptive language interfaces that learn through interaction with real human users? New work w/ my amazing advisors @DorsaSadigh and @percyliang, to be presented at the @intexsempar2020 workshop at #emnlp2020. Link: arxiv.org/abs/2010.05190 A thread 🧵(1 / N).

Learning by Decomposition: First, a user tried to complete a high-level task ("wash the coffee mug") with a model pre-trained on simple instructions (e.g. "go to the cup." "pick it up," etc.).

Over the course of interaction, users use high-level language ("wash the coffee mug") that the robot doesn't understand, and have to simplify to get the robot to complete the task. After succeeding at a task, users "teach" these high-level instructions by decomposing them in terms of the low-level actions the robot was able to execute.

This "interact, then decompose" framework is pretty general, and lets us update our system efficiently between tasks, so users can use what they just taught immediately, when completing a new task.

132

Siddharth Karamcheti · Nov 8, 2021 · 2:10 PM UTC

Siddharth Karamcheti

@siddkaramcheti

8 Nov 2021

Proud to share "LILA: Language-Informed Latent Actions," our paper at #CoRL2021. How can we build assistive controllers by fusing language & shared autonomy? Jointly authored w/ @megha_byte, with my advisors @percyliang & @DorsaSadigh. 📜: arxiv.org/abs/2111.03205 🧵: (1 / 10)

Siddharth Karamcheti · Jan 9, 2023 · 4:21 PM UTC

Siddharth Karamcheti

@siddkaramcheti

9 Jan 2023

Want to build robots that adapt to language corrections in real-time? Introducing "No, to the Right – Online Language Corrections for Manipulation via Shared Autonomy" (arxiv.org/abs/2301.02555) w/ @YuchenCui1, Raj, Nidhya, @percyliang & @DorsaSadigh at #HRI2023 - 🧵👇 (1/N).

Left panel: a robot manipulator trying to insert a book into a shelf, but getting stuck.

Right panel: a human user identifying that the robot is stuck, and issuing a correction – "No, to the right!"

28,528

Siddharth Karamcheti · Jul 9, 2023 · 10:48 PM UTC

Siddharth Karamcheti

@siddkaramcheti

9 Jul 2023

Incredibly honored that Voltron is one of the nominees for Best Paper at #RSS2023! Just landed in Daegu, South Korea and can’t wait to present on Tuesday (catch our talk in Session 4 at 3 PM KST). Excited to meet everyone, and stoked for a week of awesome talks and demos!

Siddharth Karamcheti

@siddkaramcheti

27 Feb 2023

ALT Voltron Framework – Balancing language conditioning and generation to shape visual representation learning.

47,876

Siddharth Karamcheti · Apr 12, 2022 · 3:37 PM UTC

Siddharth Karamcheti

@siddkaramcheti

12 Apr 2022

Diverse, representative data is becoming increasingly important for building generalizable robotic systems. We're organizing the Workshop on Learning from Diverse, Offline Data (L-DOD) at RSS 2022 (NYC/hybrid) to come together and discuss this! sites.google.com/view/l-dod-…

ALT Workshop on Learning from Diverse, Offline Data in NYC on June 27th, as part of Robotics: Science & Systems 2022.

Siddharth Karamcheti · Jun 26, 2023 · 7:14 PM UTC

Siddharth Karamcheti

@siddkaramcheti

26 Jun 2023

I've been struggling to put words to Professor Charniak's passing. I can count on one hand the people in my life that have single-handedly reshaped the trajectory of my life – he's at the top of that list. He was a man of honor, humility, and boundless energy.

Brown CS @BrownCSDept

16 Jun 2023

@BrownCSDept is mourning the loss of University Professor Emeritus of Computer Science and Cognitive Science Eugene Charniak, one of our founding faculty members. He passed away on June 13, just a few days after his seventy-seventh birthday. (1 of 3)

23,126

Siddharth Karamcheti · May 4, 2021 · 4:46 PM UTC

Siddharth Karamcheti

@siddkaramcheti

4 May 2021

How do we build visually guided controllers that help humans operate complex robots? Thrilled to share our #L4DC paper "Learning Visually Guided Latent Actions for Assistive Teleoperation" with Albert Zhai, @loseydp, and @DorsaSadigh! Paper: arxiv.org/abs/2105.00580 A 🧵 [1/7]

Visually Guided Latent Actions: We build models that leverage perception (via a learned object detector) to learn low-dimensional, structured "latent actions" for task-conditional control.

Siddharth Karamcheti · May 19, 2022 · 3:02 PM UTC

Siddharth Karamcheti

@siddkaramcheti

19 May 2022

How do we conduct ethical research from the *start*? At Hugging Face, we've started working on multimodal pretraining (💬, 📸, 🎤, 🎥), involving collecting a dataset & training models. Ethics can't be an afterthought! A 🧵⬇️ (1/3)

Saulnier Lucile @LucileSaulnier

19 May 2022

How do we integrate ethical principles into the ML research cycle? A few months ago, we kicked off a project at Hugging Face on multimodal datasets and models.🐙 Instead of discussing ethics at the end, we wrote down our ethical values from the start! huggingface.co/blog/ethical-…

Siddharth Karamcheti · May 17, 2024 · 9:38 PM UTC

Siddharth Karamcheti

@siddkaramcheti

17 May 2024

Extremely honored to be named an RSS Pioneer! Thrilled to get to know the rest of the cohort in Delft this summer — thank you @RSSPioneers for organizing such a wonderful event!

RSS Pioneers @RSSPioneers

17 May 2024

We are thrilled to announce our #RSSPioneers2024 cohort! 🎉sites.google.com/view/rsspio… Congratulations to these 30 rising stars in robotics! We thank all of our applicants for their inspiring submissions, and our selection committee and reviewers for their participation and insight.

33,085

Siddharth Karamcheti · Jun 18, 2024 · 2:51 PM UTC

Siddharth Karamcheti

@siddkaramcheti

18 Jun 2024

Thrilled to announce OpenVLA (openvla.github.io/) – a vision-language-action policy for robotic control! Shout out to my co-leads @moo_jin_kim & @KarlPertsch; see their threads for overviews of our work. Here though, I want to talk about observations & next steps! 🧵⬇️

OpenVLA: An Open-Source Vision-Language-Action Model

openvla.github.io

Moo Jin Kim @moo_jin_kim

14 Jun 2024

✨ Introducing 𝐎𝐩𝐞𝐧𝐕𝐋𝐀 — an open-source vision-language-action model for robotics! 👐 - SOTA generalist policy - 7B params - outperforms Octo, RT-2-X on zero-shot evals 🦾 - trained on 970k episodes from OpenX dataset 🤖 - fully open: model/code/data all online 🤗 🧵👇

13,405

Siddharth Karamcheti · Dec 8, 2022 · 8:39 PM UTC

Siddharth Karamcheti

@siddkaramcheti

8 Dec 2022

After a few days of playing with #ChatGPT by @OpenAI, I'm inspired by the potential for enriching systems for human-robot interaction! > "Let's role play: You're the "brain" behind an assistive robot that may not be perfect, and I'm a human trying to work with you..." 🧵👇

#ChatGPT by @OpenAI Screenshot.

User: "Let's role play: You're the "brain" behind an assistive robot that may not be perfect, and I'm a human trying to work with you. We're going to try and make my life easier, and you're going to help me figure out what our physical robot can and cannot do.

Please acknowledge that you understand, and describe who you are."

ChatGPT: "I understand. I am Assistant, a large language model trained by OpenAI. I am not a physical robot, but I am here to help you understand what a hypothetical assistive robot might be capable of.... Please let me know if there is anything specific you would like to know about the capabilities of an assistive robot."

Siddharth Karamcheti · Apr 23, 2021 · 8:56 PM UTC

Siddharth Karamcheti

@siddkaramcheti

23 Apr 2021

Huge congratulations to @ethayarajh for being named a Facebook AI PhD Fellow in NLP! Kawin is a brilliant researcher, collaborator, and friend who has taught me so much; this is incredibly well-deserved and I'm so proud! research.fb.com/fellows/etha…

Kawin Ethayarajh - Meta Research | Meta Research

I’m a PhD student in the Stanford NLP Group, advised by Dan Jurafsky. My research interests are broadly in model...

research.facebook.com

Siddharth Karamcheti · Mar 8, 2023 · 3:45 AM UTC

Siddharth Karamcheti

@siddkaramcheti

8 Mar 2023

This is such a cool paper; really simple idea at its core, and incredible results! Must read for anyone working on imitation learning for robotics!

@_akhaliq

8 Mar 2023

Diffusion Policy: Visuomotor Policy Learning via Action Diffusion abs: arxiv.org/abs/2303.04137 project page: diffusion-policy.cs.columbia…

14,925

Siddharth Karamcheti · Jul 23, 2020 · 5:17 AM UTC

Siddharth Karamcheti

@siddkaramcheti

23 Jul 2020

Finally, I’d like to thank @sh_reya and @notsleepingturk for putting together an incredibly easy-to-use GitHub repo (github.com/shreyashankar/gpt…) for building these interactive demos. What an awesome resource!

GitHub - shreyashankar/gpt3-sandbox: The goal of this project is to enable users to create cool web...

The goal of this project is to enable users to create cool web demos using the newly released OpenAI GPT-3 API with just a few lines of Python. - shreyashankar/gpt3-sandbox

github.com

Siddharth Karamcheti · Jun 25, 2022 · 10:11 PM UTC

Siddharth Karamcheti

@siddkaramcheti

25 Jun 2022

Our #RSS2022 workshop on Learning from Diverse, Offline Data (#LDOD) is on Monday 6/27! Amazing set of papers, incredible speakers (below), and a full panel moderated by @chelseabfinn & @DorsaSadigh! Sneak Peek: 1-1 chats between students & speakers: sites.google.com/view/l-dod-…!

Our incredible speakers for the L-DOD Workshop at RSS 2022.

Left-to-right: Kristen Grauman, Abhinav Gupta, Cathy Wu, Benjamin Sapp, Eric Jang, Sergey Levine, and Davide Scaramuzza.

Siddharth Karamcheti · Nov 6, 2021 · 11:37 AM UTC

Siddharth Karamcheti

@siddkaramcheti

6 Nov 2021

Just touched down in London for #CoRL2021 — what a beautiful city! Looking forward to meeting lots of new folks, please email/DM me if you’re down to chat robotics & NLP or shared autonomy for manipulation or (better yet) both! Super excited 🗣🤖!

Siddharth Karamcheti · Feb 22, 2022 · 8:13 PM UTC

Siddharth Karamcheti

@siddkaramcheti

22 Feb 2022

Professor Charniak gave me my start in ML. Each week, I’d come to his office and we’d talk ideas — no judgement, no feeling I needed to prove anything. He’d always meet me with patience and joy. There’s no way I’d be where I am without those meetings. Congrats on retirement!

Brown CS @BrownCSDept

22 Feb 2022

First seen in the pages of Conduit, the annual @BrownCSDept magazine, we're excited to share an extensive look back at Professor Eugene Charniak's work and life as he enters retirement: bit.ly/3HaziNP

Siddharth Karamcheti · Jan 12, 2022 · 8:03 PM UTC

Siddharth Karamcheti

@siddkaramcheti

12 Jan 2022

S4 (by the amazing @_albertgu and @krandiash) is a new sequence model that can reliably scale to *huge* contexts. To dive into how it works, @srush_nlp and I wrote a code library (~200 lines of JAX) and blog post: "The Annotated S4" (srush.github.io/annotated-s4…). Check it out!

Sasha Rush

@srush_nlp

12 Jan 2022

The Annotated S4 (github.com/srush/annotated-s… w/ @siddkaramcheti) A step-by-step guide for building your own 16,000-gram language model...

Siddharth Karamcheti · Nov 6, 2024 · 7:06 AM UTC

Siddharth Karamcheti

@siddkaramcheti

6 Nov 2024

Thrilled to introduce Vocal Sandbox (arxiv.org/abs/2411.02599) – our new framework for situated human-robot collaboration. We'll be @corl_conf all week; don't miss @jenngrannen's oral today (Session 3 @ 5 PM) or our poster tomorrow! Why am I so proud of this paper? 🧵👇

Vocal Sandbox: Continual Learning and Adaptation for Situated...

We introduce Vocal Sandbox, a framework for enabling seamless human-robot collaboration in situated environments. Systems in our framework are characterized by their ability to adapt and...

arxiv.org

Jenn Grannen @jenngrannen

6 Nov 2024

Introducing 🆚Vocal Sandbox: a framework for building adaptable robot collaborators that learns new 🧠high-level behaviors and 🦾low-level skills from user feedback in real-time. ✅ Appearing today at @corl_conf as an Oral Presentation (Session 3, 11/6 5pm). 🧵(1/6)

15,905

Siddharth Karamcheti · Aug 18, 2021 · 10:11 PM UTC

Siddharth Karamcheti

@siddkaramcheti

18 Aug 2021

Check out the Robotics section (§2.3), discussing opportunities in applying #foundationmodels across the robotics pipeline. Challenges await! Collecting the right data, ensuring safety are crucial. But tackling these problems now – *before* building models – is key!

Stanford HAI

@StanfordHAI

18 Aug 2021

NEW: This comprehensive report investigates foundation models (e.g. BERT, GPT-3), which are engendering a paradigm shift in AI. 100+ scholars across 10 departments at Stanford scrutinize their capabilities, applications, and societal consequences. bit.ly/3xZPFYK

Siddharth Karamcheti · Aug 4, 2021 · 5:19 PM UTC

Siddharth Karamcheti

@siddkaramcheti

4 Aug 2021

Honored to be presenting our work "Mind Your Outliers" (arxiv.org/abs/2107.02331) at the #ACL2021NLP Best Paper Session today at 4 PM PDT (23:00 UTC+0). ACL-Internal link w/ Q&A: underline.io/events/167/sess… Video (I'll present a "punchier" version tonight): piped.video/L0f9mZMn5GM

Stanford NLP Group

@stanfordnlp

7 Jul 2021

Congrats to @siddkaramcheti, @RanjayKrishna, @drfeifei & @chrmanning for #ACL2021NLP Outstanding Paper Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering. arxiv.org/abs/2107.02331 Code github.com/siddk/vqa-outlier… #NLProc

Siddharth Karamcheti · Jul 7, 2021 · 5:11 PM UTC

Siddharth Karamcheti

@siddkaramcheti

7 Jul 2021

Paper and code are out! Getting to this point, this paper – which is fundamentally about a negative result – wasn't a linear path, but one that took months. Excited to share that story (the one that didn't make it into the paper) with y'all. Stay tuned for the blog post!

Stanford NLP Group

@stanfordnlp

7 Jul 2021

Siddharth Karamcheti · May 20, 2019 · 6:33 PM UTC

Siddharth Karamcheti

@siddkaramcheti

20 May 2019

Incredibly honored to be named a 2019 Open Philanthropy AI Fellow. Thanks so much to all my advisors, mentors, friends, and family who helped me get this far!

Coefficient Giving

@coeff_giving

20 May 2019

Excited to announce the 2019 class of the Open Phil AI Fellowship. Eight machine learning students will collectively receive up to $2 million in PhD fellowship support over the next five years. Meet the 2019 fellows: openphilanthropy.org/focus/g…

Siddharth Karamcheti · Jun 30, 2021 · 8:20 PM UTC

Siddharth Karamcheti

@siddkaramcheti

30 Jun 2021

This is amazing news, and truly incredible. I'm so lucky to have @DorsaSadigh as an advisor, and really looking forward to the awesome work our brilliant lab will come out with over the next few years! Congratulations @DorsaSadigh!!!

Stanford AI Lab

@StanfordAILab

30 Jun 2021

Congratulations to @StanfordAILab faculty Dorsa Sadigh on receiving an MIT Tech Review TR-35 award for her work on teaching robots to be better collaborators with people technologyreview.com/innovat…

Siddharth Karamcheti · Nov 25, 2019 · 5:22 PM UTC

Siddharth Karamcheti

@siddkaramcheti

25 Nov 2019

I just finished the Residency this past August, and it was one of the most enriching experiences I’ve ever had — I got to work with amazing people on really hard research, and I learned a ton! I highly recommend applying — opportunities like this are few and far between!

Yann LeCun

@ylecun

25 Nov 2019

Applications for the Facebook AI Residency program are open. US (NYC, Seattle, Menlo Park): facebook.com/careers/jobs/67… UK (London): facebook.com/careers/jobs/79… Deadline: 2020-01-31 facebook.com/careers/jobs/67…

Siddharth Karamcheti · Jul 23, 2020 · 5:17 AM UTC

Siddharth Karamcheti

@siddkaramcheti

23 Jul 2020

Interestingly, @_eric_mitchell_ and I found that if you “prime” GPT-3 with “natural” (less structured) text, you get less ambiguous action associations (set it to stun vs. stun). Maybe this provides insight on how to structure your “prompts” — the more “natural” the better!

Siddharth Karamcheti · Sep 16, 2019 · 6:10 PM UTC

Siddharth Karamcheti

@siddkaramcheti

16 Sep 2019

How do reading comprehension models select supporting evidence? How does this evidence compare to those chosen by human users? Very excited to share our new #emnlp2019 paper (arxiv.org/abs/1909.05863) w/ @EthanJPerez, Rob Fergus, @jaseweston, @douwekiela, and @kchonyc!

Finding Generalizable Evidence by Learning to Convince Q&A Models

We propose a system that finds the strongest supporting evidence for a given answer to a question, using passage-based question-answering (QA) as a testbed. We train evidence agents to select the...

arxiv.org

Ethan Perez

@EthanJPerez

16 Sep 2019

What evidence do people find convincing? Often, the same evidence that Q&A models find convincing. Check out our #emnlp2019 paper: arxiv.org/abs/1909.05863 And blog post: medium.com/@ethanperez18/wha… w/ @siddkaramcheti Rob Fergus @jaseweston @douwekiela @kchonyc

Siddharth Karamcheti · Feb 15, 2022 · 10:12 PM UTC

Siddharth Karamcheti

@siddkaramcheti

15 Feb 2022

Congrats to my advisor @DorsaSadigh for being named a 2022 Sloan Fellow! I’m incredibly lucky and grateful to be one of your students! sloan.org/fellowships/2022-F…

Siddharth Karamcheti · Nov 3, 2021 · 5:31 PM UTC

Siddharth Karamcheti

@siddkaramcheti

3 Nov 2021

🎉 Stoked to share our #NeurIPS2021 paper "ELLA: Exploration through Learned Language Abstraction." How can language help RL agents solve sparse-reward tasks more efficiently? Led by @suvir_m (applying to PhDs now!), with my advisor @DorsaSadigh! 🔗: arxiv.org/abs/2103.05825

Suvir Mirchandani @suvir_m

3 Nov 2021

Training RL agents to complete language instructions can be difficult & sample-inefficient. A key challenge is exploration. Our method, ELLA, helps guide exploration in terms of simpler subtasks. Paper: arxiv.org/abs/2103.05825 Talk: piped.video/7iDeF5eiyIA #NeurIPS21

Siddharth Karamcheti · Nov 19, 2020 · 7:06 AM UTC

Siddharth Karamcheti

@siddkaramcheti

19 Nov 2020

Very excited for this line-up of amazing speakers, and to be presenting our work "Learning Adaptive Language Interfaces through Decomposition" (arxiv.org/abs/2010.05190) w/ @DorsaSadigh and @percyliang! If you're at #emnlp2020 tomorrow, definitely stop by!

Learning Adaptive Language Interfaces through Decomposition

Our goal is to create an interactive natural language interface that efficiently and reliably learns from users to complete tasks in simulated robotics settings. We introduce a neural semantic...

arxiv.org

Interactive and Executable Semantic Parsing @intexsempar2020

18 Nov 2020

Looking forward to seeing you tomorrow (19/11) at the Interactive and Executable Semantic Parsing workshop, starting at 8:15am PT! On our page you'll find our schedule, list of invited speakers and link to the zoom session: virtual.2020.emnlp.org/works…

Siddharth Karamcheti · Oct 8, 2020 · 5:01 AM UTC

Siddharth Karamcheti

@siddkaramcheti

8 Oct 2020

Stoked to kick-off the 2020 @stanfordnlp seminar series (nlp.stanford.edu/seminar/) with a talk from @_jessethomason_ on "From Human Language to Agent Action." As a student working in #RoboNLP, I can't wait to hear about his recent work/perspective on the field as a whole!

Siddharth Karamcheti · Jul 23, 2020 · 5:17 AM UTC

Siddharth Karamcheti

@siddkaramcheti

23 Jul 2020

“Priming” the model was pretty straightforward — I just picked four random objects, and chose the first few affordances that came to mind:
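The priming step described here amounts to few-shot prompting. A minimal sketch of how such a prompt could be assembled, assuming a plain "Object / Affordances" text layout; the four objects, their affordances, and the `build_affordance_prompt` helper are hypothetical placeholders rather than the examples used in the original demo, and no GPT-3 API call is made:

```python
# Hypothetical priming examples, chosen only for illustration.
# These are NOT the object/affordance pairs from the original demo.
PRIMING_EXAMPLES = [
    ("coffee mug", ["drink from", "wash", "fill"]),
    ("door", ["open", "close", "knock on"]),
    ("book", ["read", "open", "shelve"]),
    ("flashlight", ["turn on", "turn off", "point"]),
]


def build_affordance_prompt(query_object, examples=PRIMING_EXAMPLES):
    """Assemble a few-shot prompt that stops where the model should continue."""
    lines = []
    for obj, affordances in examples:
        lines.append(f"Object: {obj}")
        lines.append(f"Affordances: {', '.join(affordances)}")
        lines.append("")  # blank line between priming examples
    # The final entry names the new object and leaves the affordances open.
    lines.append(f"Object: {query_object}")
    lines.append("Affordances:")
    return "\n".join(lines)


prompt = build_affordance_prompt("soup can")
print(prompt)
```

The resulting string would be sent to a language model as the text to complete; how the completion is requested and parsed depends on the API in use and is left out of this sketch.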

Siddharth Karamcheti · Oct 29, 2021 · 6:18 PM UTC

Siddharth Karamcheti

@siddkaramcheti

29 Oct 2021

The Bay Area Robotics Symposium 2021 is in full swing - we’ve got a full house! #BARS2021 Catch the second set of faculty talks now: piped.video/KxBOxH4CFc8. 1:15 PT: Keynotes by @percyliang and @robreich + buzzing afternoon session w/ more faculty, student, and sponsor talks!

Siddharth Karamcheti · Nov 9, 2021 · 7:58 PM UTC

Siddharth Karamcheti

@siddkaramcheti

9 Nov 2021

Really excited to see this at #CoRL2021 tomorrow! @coreylynch and the entire Google robotics team have really inspired my research (especially with their RoboNLP work). Super stoked to hear more about implicit behavioral cloning — y’all should make it to the poster if you can!

Corey Lynch

@coreylynch

9 Nov 2021

How can robots learn to imitate precise 🎯 and multimodal 🔀 human behaviors? “Implicit Behavioral Cloning” 🦾🔷🟦💛🟨 paper, videos, code: implicitbc.github.io See IBC learning combinatorial sorting and 1mm precision insertion from vision, tasks explicit BC struggles with.

Siddharth Karamcheti · Dec 11, 2024 · 3:30 PM UTC

Siddharth Karamcheti

@siddkaramcheti

11 Dec 2024

Really grateful to @StanfordHAI for covering our work on Vocal Sandbox - a framework for building robots that can seamlessly work with and learn from you in the real world (w/ @jenngrannen @suvir_m @percyliang @DorsaSadigh). In case you missed it: arxiv.org/abs/2411.02599

Vocal Sandbox: Continual Learning and Adaptation for Situated...

We introduce Vocal Sandbox, a framework for enabling seamless human-robot collaboration in situated environments. Systems in our framework are characterized by their ability to adapt and...

arxiv.org

Stanford HAI

@StanfordHAI

10 Dec 2024

A new robot system called Vocal Sandbox is the first of many systems that promise to help integrate robots into our daily lives. Learn about the prototype that @Stanford researchers presented at the 8th annual Conference on Robot Learning. stanford.io/3Bco1jd

8,917

Siddharth Karamcheti · Jun 16, 2021 · 7:21 PM UTC

Siddharth Karamcheti

@siddkaramcheti

16 Jun 2021

🎉 Incredibly thrilled to share our work "Targeted Data Acquisition for Evolving Negotiation Agents" to be presented at #ICML2021, led by my inspirational labmate @MinaeKwon, Mariano-Florentino Cuéllar, and @DorsaSadigh! Reasons why I find this work exciting - a 🧵.

Minae Kwon @MinaeKwon

16 Jun 2021

Excited to share our #ICML2021 paper “Targeted Data Acquisition for Evolving Negotiation Agents” with the amazing @siddkaramcheti, Mariano-Florentino Cuéllar, and @DorsaSadigh! Paper: arxiv.org/abs/2106.07728 Talk: piped.video/watch?v=xxCSim8Y… 🧵👇 [1/6]

Siddharth Karamcheti · Oct 19, 2022 · 6:34 PM UTC

Siddharth Karamcheti

@siddkaramcheti

19 Oct 2022

How can we teach humans to provide better demonstration data for robotic manipulation? Check out our #CoRL2022 paper on "Eliciting Compatible Demonstrations for Multi-Human Imitation Learning" (arxiv.org/abs/2210.08073) w/ @gandhikanishk @madelineliao & @DorsaSadigh – 🧵👇 (1/N).

Eliciting Compatible Demonstrations for Multi-Human Imitation Learning

Imitation learning from human-provided demonstrations is a strong approach for learning policies for robot manipulation. While the ideal dataset for imitation learning is homogenous and...

arxiv.org

Kanishk Gandhi

@gandhikanishk

19 Oct 2022

Are you collecting demonstrations for imitation learning from multiple demonstrators? Naively collecting demonstrations might actually hurt performance!! We present a simple way to teach people to teach robots better! Appearing at CoRL ‘22. 🧵 (1/5)

Siddharth Karamcheti · Mar 27, 2024 · 12:01 PM UTC

Siddharth Karamcheti

@siddkaramcheti

27 Mar 2024

David (@dlwh) is an incredible friend and mentor. Highly recommend following his work — he not only dives deep into understanding *all* the parts of the systems he works with, but also cares about sharing these insights in a way that’s accessible. Levanter is just one example!

Sasha Rush

@srush_nlp

27 Mar 2024

Recommend following David Hall (@dlwh) and the Levanter project from @StanfordCRFM . Just no nonsense details about fixing the pain-points of scaling LLM training, one at a time.

5,399

Siddharth Karamcheti · Dec 3, 2022 · 2:01 PM UTC

Siddharth Karamcheti

@siddkaramcheti

3 Dec 2022

Replying to @kevin_zakka

Never too late to start working on robotics & NLP! It’s such a wonderful time, so many great questions to explore!

Siddharth Karamcheti · Jul 24, 2020 · 8:53 PM UTC

Siddharth Karamcheti

@siddkaramcheti

24 Jul 2020

I have a grounded language joke, but you’d miss the context.

Ida Momennejad @criticalneuro

24 Jul 2020

I have a reinforcement learning joke, but not sure it's rewarding.

Siddharth Karamcheti · Jan 12, 2021 · 2:08 AM UTC

Siddharth Karamcheti

@siddkaramcheti

12 Jan 2021

This is going to be a fantastic talk - can’t wait for Thursday!

Stanford NLP Group

@stanfordnlp

12 Jan 2021

We’re very excited to kick off our 2021 Stanford NLP Seminar series with Ian Tenney (@iftenney) of Google Research presenting on “BERTology and Beyond”! Thursday 10am PT. Open to the public; non-Stanford people register at nlp.stanford.edu/seminar

Siddharth Karamcheti · Aug 26, 2021 · 7:06 PM UTC

Siddharth Karamcheti

@siddkaramcheti

26 Aug 2021

In addition to the codebase, @laurel_orr1 and I wrote up a blog post (with the rest of the Propulsion team!) describing Mistral and our journey in more detail. Check it out here, and we'd love to hear your thoughts: crfm.stanford.edu/blog.html [1/5]

Siddharth Karamcheti

@siddkaramcheti

24 Aug 2021

Siddharth Karamcheti · Jul 23, 2020 · 5:17 AM UTC

Siddharth Karamcheti

@siddkaramcheti

23 Jul 2020

The affordance prediction is fairly good (e.g. interdimensional portal), but it’s not perfect (e.g. soup can). That being said, I think this has potential for text-based games (@jaseweston, @mark_riedl), Nethack (@_rockt, @egrefen), and more importantly robotic manipulation!

Siddharth Karamcheti · Jun 17, 2025 · 7:30 PM UTC

Siddharth Karamcheti

@siddkaramcheti

17 Jun 2025

Introducing ProVox — new work on building proactive and personalized agents for human-robot collaboration! ProVox is the next chapter in the Vocal Sandbox saga, and makes it even easier to deploy adaptive robots alongside real people. See @jenngrannen’s thread for more!

Jenn Grannen @jenngrannen

17 Jun 2025

Meet ProVox: a proactive robot teammate that gets you 🤖❤️‍🔥 ProVox models your goals and expectations before a task starts — enabling personalized, proactive help for smoother, more natural collaboration. All powered by LLM commonsense. Recently accepted at @ieeeras RA-L! 🧵1/7

2,551

Siddharth Karamcheti · Jun 26, 2024 · 3:27 PM UTC

Siddharth Karamcheti

@siddkaramcheti

26 Jun 2024

I'm really loving alphaXiv (from a great team of Stanford students including @rajpalleti314)! Beyond just reading arXiv papers – it's an awesome platform for discussion, collaborative annotation, and note-taking. Give it a try – clear win for open-science...

alphaXiv

@askalphaxiv

25 Jun 2024

How do LLMs learn new facts while pre-training? Excited to have authors @hoyeon_chang and Jinho Park answer questions on their latest paper "How Do Large Language Models Acquire Factual Knowledge During Pretraining?" Leave questions for the authors: alphaxiv.org/abs/2406.11813

4,134

Siddharth Karamcheti · Jul 13, 2020 · 4:15 PM UTC

Siddharth Karamcheti

@siddkaramcheti

13 Jul 2020

I had a great time compiling this post. There’s some really exciting and compelling work coming out of Stanford in a lot of different areas. Very proud to call these people my peers!

Stanford AI Lab

@StanfordAILab

13 Jul 2020

The International Conference on Machine Learning (ICML) 2020 is being hosted virtually this week. We’re excited to share all the work from SAIL that’s being presented, and you’ll find links to papers, videos and more in our latest blog post: ai.stanford.edu/blog/icml-20…

Siddharth Karamcheti · Oct 12, 2022 · 9:48 PM UTC

Siddharth Karamcheti

@siddkaramcheti

12 Oct 2022

This is incredibly cool — I really really like this line of work on learning to quickly deploy language-aware robots to completely new environments. Amazing work!

Mahi Shafiullah 🏠🤖

@notmahi

12 Oct 2022

How can we train data-efficient robots that can respond to open-ended queries like “warm up my lunch” or “find a blue book”? Introducing CLIP-Field, a semantic neural field trained w/ NO human labels & only w/ web-data pretrained detectors, VLMs, and LLMs mahis.life/clip-fields

Siddharth Karamcheti · Jul 8, 2023 · 3:20 AM UTC

Siddharth Karamcheti

@siddkaramcheti

8 Jul 2023

This is the best kind of paper by my labmates @suneel_belkhale and @YuchenCui1 — it starts with a simple punchline (data quality matters for imitation learning) but really really drives home exactly what “good data” looks like. Definitely worth a read!

Suneel Belkhale @suneel_belkhale

7 Jul 2023

In imitation learning (IL), we often focus on better algorithms, but what about improving the data? What does it mean for a dataset to be high quality? Our work takes a first step towards formalizing and analyzing data quality. (1/5) arxiv.org/abs/2306.02437

10,912

Siddharth Karamcheti · Jun 18, 2025 · 5:16 PM UTC

Siddharth Karamcheti

@siddkaramcheti

18 Jun 2025

This was a rough year to be on the academic job market in the US, and I'm so grateful to all of my supporters. I especially want to thank my advisors @DorsaSadigh and @percyliang for their unwavering faith in me, my amazing collaborators, and my family. Onwards!

2,348

Siddharth Karamcheti · Nov 17, 2020 · 10:32 PM UTC

Siddharth Karamcheti

@siddkaramcheti

17 Nov 2020

Thrilled to have @ybisk from @SCSatCMU at this week's @stanfordnlp Seminar (Thursday @ 10 AM PST - open to the public: nlp.stanford.edu/seminar/)! @ybisk's work in #RoboNLP has been truly inspirational - I can't wait to learn from him and get a taste of where the field is moving!

Yonatan Bisk will be presenting his talk "Language Should be Embodied - But what does that mean?" this Thursday 11/19 at 10 AM PT for the Stanford NLP Seminar. The talk is open to the public: please register at nlp.stanford.edu/seminar.


Siddharth Karamcheti · Jun 26, 2023 · 7:14 PM UTC

Siddharth Karamcheti

@siddkaramcheti

26 Jun 2023

He gave me my start in research, shaped the way I think about my work (depth over distance, the value of simple words and ideas), and convinced me that a PhD, a career in research was something that I could do. I will always remember his patience and support for me.

6,202

Siddharth Karamcheti · Apr 22, 2020 · 2:21 AM UTC

Siddharth Karamcheti

@siddkaramcheti

22 Apr 2020

Grounding, embodiment, and interaction - really excited to see this, and can’t wait to explore these areas throughout the rest of my PhD!

Ari Holtzman

@universeinanegg

22 Apr 2020

"You can't learn language from the radio." 📻 Why does NLP keep trying to? In arxiv.org/abs/2004.10151 we argue that physical and social grounding are key because, no matter the architecture, text-only learning doesn't have access to what language is *about* and what it *does*.

Siddharth Karamcheti · Nov 9, 2023 · 10:57 PM UTC

Siddharth Karamcheti

@siddkaramcheti

9 Nov 2023

Really excited by this work from my incredible labmate @priyasun_! Sketches are an intuitive and expressive way of specifying not just a goal, but also *how* to perform a task — can’t wait to see sketches + language + gestures in the context of rich, collaborative robotics!

Priya Sundaresan @priyasun_

3 Nov 2023

We can tell our robots what we want them to do, but language can be underspecified. Goal images are worth 1,000 words, but can be overspecified. Hand-drawn sketches are a happy medium for communicating goals to robots! 🤖✏️Introducing RT-Sketch: rt-sketch.github.io 🧵1/11

4,994

Siddharth Karamcheti · Nov 3, 2020 · 7:37 PM UTC

Siddharth Karamcheti

@siddkaramcheti

3 Nov 2020

Everyone needs a bit of @douwekiela in their life! Tune into this week's Stanford NLP Seminar this Thursday at 10 AM PST (open to the public - register here: nlp.stanford.edu/seminar/) where he'll talk about "Rethinking Benchmarking in AI" and Dynabench (dynabench.org)!

Siddharth Karamcheti · Sep 14, 2020 · 7:20 PM UTC

Siddharth Karamcheti

@siddkaramcheti

14 Sep 2020

Incredible to see LXMERT added to Transformers - it’s a clean and impressive implementation that’s really going to make building vision-and-language applications more accessible and widespread. Excited to see people adopt it!

Mohit Bansal

@mohitban47

14 Sep 2020

Amazing effort by @avalmendoz+@haotan5 & @huggingface @LysandreJik @qlhoest @Thom_wolf on LXMERT demo+backend! Comes with flexible dataset generation via HF/datasets for feat+box predn from bottomup-FRCNN w ultra-fast access; allows extension to other mmodal tasks by community 🤗

Siddharth Karamcheti · May 13, 2021 · 1:56 AM UTC

Siddharth Karamcheti

@siddkaramcheti

13 May 2021

My incredible mother is talking with fellow professionals tomorrow about managing mental health in the midst of the India COVID Crisis on @Radiozindagisfo. This is an incredibly important discussion to be having, for those here and abroad. Please tune in if you can!

Siddharth Karamcheti · Sep 1, 2020 · 12:03 AM UTC

Siddharth Karamcheti

@siddkaramcheti

1 Sep 2020

This is an incredible initiative! In addition, if you want to chat about grad school/applying, please feel free to DM/email me - I couldn’t have gotten into grad school without the support of older students and mentors, and I’d love to do what I can to help out!


Siddharth Karamcheti · Oct 10, 2019 · 8:40 PM UTC

Siddharth Karamcheti

@siddkaramcheti

10 Oct 2019

Very cool work presenting a brand new language-centric collaborative task pairing a human user and a bot situated in a grounded environment! Very thorough evaluation, including a bonafide human eval. Really exciting stuff - we need more tasks like this.


Siddharth Karamcheti · Aug 22, 2023 · 2:16 PM UTC

Siddharth Karamcheti

@siddkaramcheti

22 Aug 2023

Check out IDEFICS - an open vision-language model that can accept sequences of images and text, for use in tasks like visual dialogue, dense captioning, and more! Demo & Models: huggingface.co/spaces/Huggin…


Victor Sanh

@SanhEstPasMoi

22 Aug 2023

Introducing IDEFICS, the first open state-of-the-art visual language model at the 80B scale! The model accepts arbitrary sequences of images and texts and produces text. A bit like a multimodal ChatGPT! Blogpost: huggingface.co/blog/idefics Playground: huggingface.co/spaces/Huggin…

8,764

Siddharth Karamcheti · Feb 21, 2021 · 3:40 AM UTC

Siddharth Karamcheti

@siddkaramcheti

21 Feb 2021

Thank you so much @michellearning and @andrey_kurenkov for everything you’ve done for the @StanfordAILab Blog! I learned so much from you both; really looking forward to continuing what you started with the amazing @nelsonfliu, @jmschreiber91, and @megha_byte!

Michelle Lee

@michellearning

20 Feb 2021

It's official! I am now an "alumni editor" of the @StanfordAILab blog! ai.stanford.edu/blog/about/ It was an amazing journey to have started the blog and led it as the co-editor-in-chief with @andrey_kurenkov, but even more amazing to see the new editorial board take over!

Siddharth Karamcheti · Feb 16, 2022 · 8:10 PM UTC

Siddharth Karamcheti

@siddkaramcheti

16 Feb 2022

It's a wonderful week! Really proud of my advisor @percyliang for being named an AI2050 fellow! Thrilled for you, and ever so grateful to be one of your students! schmidtfutures.com/schmidt-f…

Siddharth Karamcheti · Aug 14, 2019 · 4:40 PM UTC

Siddharth Karamcheti

@siddkaramcheti

14 Aug 2019

Fantasy fans rejoice! Very excited that our paper introducing the LIGHT dialogue dataset was accepted at EMNLP! Can't wait to see others build on the fantasy text adventure platform and develop new grounded agents capable of speaking and acting in the world.

ParlAI @parlai_parley

14 Aug 2019

Accepted at EMNLP! Built in ParlAI. Learning to Speak and Act in a Fantasy Text Adventure Game @JackUrbs Angela Fan @siddkaramcheti Saachi Jain, Samuel Humeau, Emily Dinan @_rockt @douwekiela, Arthur Szlam, @jaseweston arxiv.org/abs/1903.03094 parl.ai/projects/light/

Siddharth Karamcheti · Sep 18, 2020 · 9:11 PM UTC

Siddharth Karamcheti

@siddkaramcheti

18 Sep 2020

I’ve been incredibly honored to be an OpenPhil Fellow and part of the fellows community! It’s a great group of people and a wonderful program, so I highly recommend current (and incoming) PhD students apply!

Coefficient Giving

@coeff_giving

18 Sep 2020

Applications are open for the Open Phil AI Fellowship! This program extends full support to a community of current & incoming PhD students, in any area of AI/ML, who are interested in making the long-term, large-scale impacts of AI a focus of their work. openphilanthropy.org/focus/g…

Siddharth Karamcheti · Feb 1, 2022 · 2:48 AM UTC

Siddharth Karamcheti

@siddkaramcheti

1 Feb 2022

Replying to @nabla_theta

I hear the sqrt might be optional?

Siddharth Karamcheti · Jun 20, 2023 · 4:08 PM UTC

Siddharth Karamcheti

@siddkaramcheti

20 Jun 2023

When faced with a socially ambiguous cleanup task (a half-complete Lego model, a Starbucks cup), what should a robot do? Our approach – iterate an LLM "reasoner" with active perception/VQA: "move above the cup" --> "is it empty?" (yes) --> `cleanup(cup)` See @MinaeKwon's 🧵👇

Minae Kwon @MinaeKwon

20 Jun 2023

How can 🤖s act in a socially appropriate manner without human specification? Our 🤖s reason socially by actively gathering missing info in the real world. We release the MessySurfaces dataset to assess socially appropriate behavior. 🧵👇 arxiv.org/abs/2306.08651

5,460

Siddharth Karamcheti · Mar 17, 2023 · 10:31 AM UTC

Siddharth Karamcheti

@siddkaramcheti

17 Mar 2023

Really grateful to have the chance to present our work at @HRI_Conference this week! Had so much fun in Stockholm - lots of great papers and new friends.

The HRI Conference @HRI_Conference

15 Mar 2023

Takayuki Kanda has just announced the start of the Human-robot communication – 1 session 🗣 Enjoy! ✨ #hri2023 #hri

7,004

Siddharth Karamcheti · Jun 18, 2025 · 5:16 PM UTC

Siddharth Karamcheti

@siddkaramcheti

18 Jun 2025

I can't wait to join the GT community next year. Until then, I'll continue at @ToyotaResearch (in Boston). Check out @RussTedrake's recent talk; beyond results, it describes a philosophy for scientific exploration that I hope to carry forward. piped.video/TN1M6vg4CsQ?si=dHwZ…

2,710

Siddharth Karamcheti · Jan 11, 2021 · 2:12 AM UTC

Siddharth Karamcheti

@siddkaramcheti

11 Jan 2021

Dilip and I met in a class. I was a lonely transfer student, and didn’t really know anyone. In a stroke of fate, @StefanieTellex helped pair us together. Years later, we’re PhD students at the same school, trade ideas all the time, and (COVID-permitting) get KBBQ once a quarter.

Dr. Laura Forlano @laura4lano

9 Jan 2021

Academic love letters: A thread on how you met your closest collaborator, intellectual soulmate, favorite coauthor or other kindred spirit. Do tell.

Siddharth Karamcheti · Feb 25, 2022 · 6:44 PM UTC

Siddharth Karamcheti

@siddkaramcheti

25 Feb 2022

Very lucky to have @ebiyik_ as a labmate and friend. His work is insightful, thorough, and just plain cool! He's also on the academic job market this year 🎉

HRI Pioneers @HRIPioneers

25 Feb 2022

#HRIPioneers2022 Erdem Bıyık is working on “Learning from Humans for Adaptive Interaction” Erdem’s website: stanford.edu/~ebiyik/ Twitter: nitter.app/ebiyik_ And check out our full list of participants on our website: hripioneers.org/participants

Siddharth Karamcheti · Mar 27, 2023 · 5:03 PM UTC

Siddharth Karamcheti

@siddkaramcheti

27 Mar 2023

Incredible work by @tonyzzhao on low-cost, fine-grained bimanual teleoperation. This work is clean, open, and is a game changer for data collection and enabling new, complex tasks. Check out the demos — they’re the real deal.

Tony Zhao

@tonyzzhao

27 Mar 2023

Introducing ALOHA 🏖: 𝐀 𝐋ow-cost 𝐎pen-source 𝐇𝐀rdware System for Bimanual Teleoperation After 8 months iterating @stanford and 2 months working with beta users, we are finally ready to release it! Here is what ALOHA is capable of:

4,280

Siddharth Karamcheti · Jun 28, 2022 · 1:04 PM UTC

Siddharth Karamcheti

@siddkaramcheti

28 Jun 2022

#LDOD was yesterday, and we had a blast! Thanks to everyone for coming out! In case you missed it, congratulations to our outstanding paper - "Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations." Looking forward to next time!

Siddharth Karamcheti

@siddkaramcheti

25 Jun 2022

ALT Our incredible speakers for the L-DOD Workshop at RSS 2022. Left-to-right: Kristen Grauman, Abhinav Gupta, Cathy Wu, Benjamin Sapp, Eric Jang, Sergey Levine, and Davide Scaramuzza.

Siddharth Karamcheti · Sep 27, 2021 · 4:48 AM UTC

Siddharth Karamcheti

@siddkaramcheti

27 Sep 2021

Incredibly excited to see this new paper at @corl_conf - scalable language-conditioned policy learning for manipulation using CLIP + Transporter networks. Also comes with a great suite of benchmark tasks! Stoked to build off this in future work - congrats @mohito1905! #RoboNLP

@_akhaliq

27 Sep 2021

CLIPort: What and Where Pathways for Robotic Manipulation pdf: arxiv.org/pdf/2109.12098.pdf abs: arxiv.org/abs/2109.12098 project page: cliport.github.io/

Siddharth Karamcheti · Aug 24, 2021 · 3:49 PM UTC

Siddharth Karamcheti

@siddkaramcheti

24 Aug 2021

At 10:20 PDT, @laurel_orr1 and I will be talking at the Workshop for #FoundationModels (crfm.stanford.edu/workshop.h…) about Mistral, as well as our journey towards transparent and accessible training. We hope to see you there - bring your questions! [2/4]

Siddharth Karamcheti · Jul 23, 2020 · 5:17 AM UTC

Siddharth Karamcheti

@siddkaramcheti

23 Jul 2020

For example, I’d really love to see a robot (equipped with a robust object detection pipeline) use GPT-3 to figure out how to manipulate new objects!

Siddharth Karamcheti · Jul 19, 2021 · 8:28 PM UTC

Siddharth Karamcheti

@siddkaramcheti

19 Jul 2021

Excited to be at #ICML2021 this week! Catch @MinaeKwon's amazing talk at the "Reinforcement Learning 2" session tomorrow from 7 - 8 AM PDT (lots of other fantastic work too - don't miss it!). We'll also be at our poster from 8 - 11 AM PDT at Section C4! Let's chat negotiation!


Siddharth Karamcheti · Oct 11, 2021 · 7:29 PM UTC

Siddharth Karamcheti

@siddkaramcheti

11 Oct 2021

Join us as we scale Mistral (github.com/stanford-crfm/mis…) and tackle research around responsibly training/understanding large-scale language models! And looking forward – multimodality: models for language + video, robotics, amongst others. Please share & DMs open for questions!

Percy Liang

@percyliang

11 Oct 2021

The Stanford Center for Research on Foundation Models (CRFM) is looking for a research engineer to join our development team! Interested in large-scale training / being immersed in an interdisciplinary research environment? Please apply! crfm.stanford.edu/apply.html

Siddharth Karamcheti · Jun 12, 2024 · 9:06 PM UTC

Siddharth Karamcheti

@siddkaramcheti

12 Jun 2024

John is going to be an incredible advisor — apply apply apply! (And if you’re a Columbia student, take all his classes too!)

John Hewitt @johnhewtt

12 Jun 2024

I’m joining the Columbia Computer Science faculty as an assistant professor in fall 2025, and hiring my first students this upcoming cycle!! There’s so much to understand and improve in neural systems that learn from language — come tackle this with me!

2,349

Siddharth Karamcheti · Jul 1, 2020 · 8:23 PM UTC

Siddharth Karamcheti

@siddkaramcheti

1 Jul 2020

Fantastic work from my labmate @michiyasunaga around leveraging error messages to perform program repair in code generation/editing style tasks! Learning from feedback is a pretty general principle - excited to see other applications of this in related work!


Siddharth Karamcheti · Jan 27, 2022 · 1:22 AM UTC

Siddharth Karamcheti

@siddkaramcheti

27 Jan 2022

Hugging Face is an incredible place to work, and I’ve been so lucky to learn from a diverse and kind group of researchers, engineers, and other interns. We’ve got some great stuff on the horizon; definitely apply!

Douwe Kiela

@douwekiela

27 Jan 2022

🥳 We are hiring researchers and research interns! Apply here: apply.workable.com/huggingfa…. People with characteristics that are underrepresented in tech are especially encouraged to apply. We will also be having a residency program soon @HuggingFace, stay tuned! 🤗

Siddharth Karamcheti · Apr 15, 2024 · 8:16 PM UTC

Siddharth Karamcheti

@siddkaramcheti

15 Apr 2024

Amazing work from @LeoTronchon @HugoLaurencon @SanhEstPasMoi and others at HF on extending VLMs for *interleaved* images and text. Really cool to see the open-source multimodal instruct data (Cauldron), high-res image support, and a super efficient image encoding scheme!

Victor Sanh

@SanhEstPasMoi

15 Apr 2024

New multimodal model in town: Idefics2! 💪 Strong 8B-parameters model: often on par with open 30B counterparts. 🔓Open license: Apache 2.0. 🚀 Strong improvement over Idefics1: +12 points on VQAv2, +30 points on TextVQA while having 10x fewer parameters. 📚 Better data: boosting OCR capabilities with 6TB of documents to transcribe, and improving QA capabilities on charts/figures/diagrams. 🕵️‍♀️ Transparent training data: inspect and build upon all the data (10s of TB of data) we trained on. 🔲 More natural image processing: Incorporating strategies to treat images in their native resolution and native aspect ratio. 📸 High-resolution images: image resolutions up to 980 x 980 and integrating strategies that allow to trade computational efficiency for performance. 😎 2 checkpoints: Releasing both base checkpoint and instruction fine-tuned checkpoint. Chat version to come. huggingface.co/blog/idefics2

1,596

Siddharth Karamcheti · Oct 20, 2021 · 7:32 PM UTC

Siddharth Karamcheti

@siddkaramcheti

20 Oct 2021

The openness and transparency of the HF ecosystem is truly great, as is the drive of the team behind it. Enabling data curation, training, and evaluation (at scale) is fundamental to tackling the problems plaguing large-scale models – I'm hopeful and ready to build these tools.

Siddharth Karamcheti · Feb 13, 2023 · 7:44 PM UTC

Siddharth Karamcheti

@siddkaramcheti

13 Feb 2023

Very excited to be co-organizing the *2nd* Workshop on Learning from Diverse, Offline Data at ICRA this year! Submissions due March 23rd – super excited to see all the amazing work in this area for the second year in a row!

Ted Xiao

@xiao_ted

13 Feb 2023

Announcing the 2nd Workshop on Learning from Diverse, Offline Data (L-DOD) at @ICRA2023 in London on June 2! There has been tremendous progress in scaling AI systems with data - can we apply this paradigm to generalizable robotic systems? sites.google.com/view/ldod20…

2,679

Siddharth Karamcheti · Jan 12, 2022 · 2:54 AM UTC

Siddharth Karamcheti

@siddkaramcheti

12 Jan 2022

Very appreciative of the thoughtful summary of our work! Thanks for the highlight @MosaicML - stoked to follow your progress towards more efficient ML!

Databricks AI Research

@DbrxMosaicAI

12 Jan 2022

New year, new summaries! Let's look at dataset quality and its impact on sample efficiency. This paper (arxiv.org/abs/2107.02331) studies the ineffectiveness of active learning on visual question answering (VQA) datasets and points to *collective outliers* as the culprit. (1/8)

Siddharth Karamcheti · Dec 13, 2024 · 9:03 PM UTC

Siddharth Karamcheti

@siddkaramcheti

13 Dec 2024

Suneel just trained some small, highly-performant VLA models that are hackable and easy to work with! Highly recommend folks try these out!

Suneel Belkhale @suneel_belkhale

13 Dec 2024

Want a smaller VLA that performs better? We just released some core improvements to OpenVLA, like: + MiniVLA: 7x smaller model! + Action chunking using Vector Quantization + Multi-image support Blog: ai.stanford.edu/blog/minivla… Code: github.com/Stanford-ILIAD/op… (1/5) More below! 👇

1,064

Siddharth Karamcheti · Feb 27, 2023 · 5:03 PM UTC

Siddharth Karamcheti

@siddkaramcheti

27 Feb 2023

This project was a huge endeavor; one that would not have been possible without amazing collaborators and mentors – @SurajNair_1 @_anniechen_ @tkollar @chelseabfinn @DorsaSadigh and @percyliang. Further thanks to @ToyotaResearch, @stanfordnlp, and the @StanfordAILab ! (11/12)

1,085

Siddharth Karamcheti · Aug 26, 2021 · 7:06 PM UTC

Siddharth Karamcheti

@siddkaramcheti

26 Aug 2021

Big thanks to everyone who helped us build Mistral -- from @Thom_Wolf & @StasBekman who helped us navigate @huggingface Transformers, to @carey_phelps for providing support with @wandb. Also huge shoutout to @BlancheMinerva from #EleutherAI for providing feedback! [3/5]

Siddharth Karamcheti · Dec 19, 2020 · 5:13 AM UTC

Siddharth Karamcheti

@siddkaramcheti

19 Dec 2020

Phenomenal work by my amazing labmates. Really excited to see this paper go public!

Sang Michael Xie

@sangmichaelxie

18 Dec 2020

🍔🍟"In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness" arxiv.org/abs/2012.04550 Real-world tasks (crop yield prediction from satellites) are often label-scarce. Only some countries have labels - how do we generalize globally?

Siddharth Karamcheti · Dec 8, 2020 · 3:34 AM UTC

Siddharth Karamcheti

@siddkaramcheti

8 Dec 2020

Ranjay is not only a phenomenal Computer Vision / HCI researcher, but an incredible and supportive mentor. I'm so grateful to be learning from him, and I know that he'll be a strong addition to any department out there. Best of luck @RanjayKrishna!

Ranjay Krishna

@RanjayKrishna

7 Dec 2020

🎓 I'm on the faculty job market this year! Please send me a message if your department (or one you know) is interested in a Computer Vision / HCI researcher who designs models inspired by human perception and social interaction! My application materials: ranjaykrishna.com

Siddharth Karamcheti · Jul 19, 2020 · 5:36 PM UTC

Siddharth Karamcheti

@siddkaramcheti

19 Jul 2020

These GPT-3 results have me really excited about exploring its potential for spatial reasoning and grounded language understanding, including applications for robot navigation given textual representations of state. @gdb - would love an invite!

Siddharth Karamcheti · Oct 20, 2021 · 7:32 PM UTC

Siddharth Karamcheti

@siddkaramcheti

20 Oct 2021

Finally, a big thanks to my advisors @percyliang and @DorsaSadigh for their support and faith in me! I'm really excited to see what the next year looks like, both in terms of research and open-source. And robots! Don't forget the robots!

Siddharth Karamcheti · Feb 27, 2023 · 5:02 PM UTC

Siddharth Karamcheti

@siddkaramcheti

27 Feb 2023

The Voltron framework offers a simple way to use language supervision to shape representation learning, building off of prior work in representations for robotics like MVP (arxiv.org/abs/2210.03109) and R3M (arxiv.org/abs/2203.12601). The secret is *balance* (3/12)


1,508

Siddharth Karamcheti · Mar 2, 2023 · 10:46 AM UTC

Siddharth Karamcheti

@siddkaramcheti

2 Mar 2023

For almost two years, I’ve been incredibly lucky to learn from the @AiEleuther community — from sharing tips around training LLMs, to discussing open research problems. Huge congrats to my friend @BlancheMinerva and the entire community! Can’t wait to see what’s up next!

EleutherAI @AiEleuther

2 Mar 2023

Over the past two and a half years, EleutherAI has grown from a group of hackers on Discord to a thriving open science research community. Today, we are excited to announce the next step in our evolution: the formation of a non-profit research institute. blog.eleuther.ai/year-two-pr…

2,051