Yevgen Chebotar · Sep 7, 2023 · 10:15 PM UTC

Yevgen Chebotar

@YevgenChebotar

7 Sep 2023

Offline RL strikes back! In our new Q-Transformer paper, we introduce a scalable framework for offline reinforcement learning using Transformers and autoregressive Q-Learning to learn from mixed-quality datasets! Website and paper: q-transformer.github.io 🧵

103

522

210,591

Yevgen Chebotar · Mar 11, 2024 · 9:02 PM UTC

Yevgen Chebotar

@YevgenChebotar

11 Mar 2024

Some personal updates! Excited to join the team @Figure_robot to help build AI for the robot age! 🤖

164

66,357

Yevgen Chebotar · Jun 14, 2021 · 6:53 PM UTC

Yevgen Chebotar

@YevgenChebotar

14 Jun 2021

Excited to present our work on Actionable Models at #ICML! Find the camera-ready version at arxiv.org/abs/2104.07749. In this work, we learn a functional understanding of the world through goal-conditioned Q-learning and use it for reaching visual goals or learning downstream tasks.

107

Yevgen Chebotar · Jul 28, 2023 · 6:15 PM UTC

Yevgen Chebotar

@YevgenChebotar

28 Jul 2023

Excited to present RT-2, a large unified Vision-Language-Action model! By converting robot actions to strings, we can directly train large visual-language models to output actions while retaining their web-scale knowledge and generalization capabilities! robotics-transformer2.github…

Google DeepMind

@GoogleDeepMind

28 Jul 2023

Today, we announced 𝗥𝗧-𝟮: a first of its kind vision-language-action model to control robots. 🤖 It learns from both web and robotics data and translates this knowledge into generalised instructions. Find out more: dpmd.ai/introducing-rt2

24,186
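The idea of "converting robot actions to strings" in the RT-2 post above can be illustrated with a minimal sketch. It is not the RT-2 implementation: the function names, the action range of [-1, 1], and the choice of 256 uniform bins per dimension are assumptions made here for illustration.

```python
def action_to_string(action, low=-1.0, high=1.0, num_bins=256):
    """Map each continuous action dimension to an integer bin and join as text.

    The range and bin count are illustrative assumptions, not values
    taken from the post.
    """
    tokens = []
    for value in action:
        # Clip to the assumed valid range, then scale to [0, num_bins - 1].
        clipped = min(max(value, low), high)
        bin_index = round((clipped - low) / (high - low) * (num_bins - 1))
        tokens.append(str(bin_index))
    return " ".join(tokens)


def string_to_action(text, low=-1.0, high=1.0, num_bins=256):
    """Invert action_to_string, up to discretization error."""
    return [low + int(tok) / (num_bins - 1) * (high - low) for tok in text.split()]
```

With actions written as plain text, a language model can be trained to emit them like any other token sequence, and the decoded string is mapped back to a continuous command at execution time.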

Yevgen Chebotar · Apr 19, 2021 · 8:46 PM UTC

Yevgen Chebotar

@YevgenChebotar

19 Apr 2021

Excited to present our new work on Actionable Models, an approach for learning a functional understanding of the world via goal-conditioned Q-functions in a fully offline setting! Paper: arxiv.org/abs/2104.07749 Website: actionable-models.github.io piped.video/watch?v=S3SCR7iY…

Actionable Models: Unsupervised Offline Reinforcement Learning of...

We consider the problem of learning useful robotic skills from previously collected offline data without access to manually specified rewards or additional online exploration, a setting that is...

arxiv.org

Yevgen Chebotar · May 19, 2019 · 5:26 PM UTC

Yevgen Chebotar

@YevgenChebotar

19 May 2019

Excited to present our work on closing the sim-to-real loop at ICRA in Montreal! Please visit our poster and talk on Tuesday! “Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience” Paper: arxiv.org/abs/1810.05687 Video: piped.video/watch?v=nilcJY5K…

Yevgen Chebotar · Mar 6, 2024 · 5:07 PM UTC

Yevgen Chebotar

@YevgenChebotar

6 Mar 2024

RT-H learns a hierarchy all the way from high-level tasks through low-level “language motions” to robot actions! ✅ Improved performance and generalization through better data sharing ✅ Automated grounded “bottom-up” labeling ✅ Ability to intervene and correct with language

Suneel Belkhale @suneel_belkhale

6 Mar 2024

Is language capable of representing low-level *motions* of a robot? RT-Hierarchy learns an action hierarchy using motions described in language, like “move arm forward” or “close gripper” to improve policy learning. 📜: arxiv.org/abs/2403.01823 🏠: rt-hierarchy.github.io (1/10)

4,643

Yevgen Chebotar · Oct 16, 2018 · 12:30 AM UTC

Yevgen Chebotar

@YevgenChebotar

16 Oct 2018

Improve simulation-to-reality robotic skill transfer by closing the sim-to-real loop and adjusting simulation randomization! Paper: arxiv.org/abs/1810.05687 piped.video/nilcJY5Kdt8

Yevgen Chebotar · Jun 14, 2019 · 3:31 PM UTC

Yevgen Chebotar

@YevgenChebotar

14 Jun 2019

Our new work on performing meta-learning using learned loss functions! Also visit our talk and poster at the Multi-Task and Lifelong Reinforcement Learning Workshop at ICML tomorrow! Paper: arxiv.org/abs/1906.05374 w/ @amolchanov86, S. Bechtle, @ludo_righetti, @_kainoa_, G. Sukhatme

Yevgen Chebotar · Nov 9, 2023 · 6:09 PM UTC

Yevgen Chebotar

@YevgenChebotar

9 Nov 2023

Presenting RT-2 poster at CoRL! robotics-transformer2.github…

RT-2: Vision-Language-Action Models

Project page for RT-2

robotics-transformer2.github.io

Quan Vuong

@QuanVng

9 Nov 2023

Pictures taken at the RT-2 poster at @DannyDriess's request ;) @YevgenChebotar We miss you @TianheYu CC @hausman_k

3,821

Yevgen Chebotar · May 16, 2024 · 4:53 PM UTC

Yevgen Chebotar

@YevgenChebotar

16 May 2024

Congrats everyone, 170+ authors and contributors, great to see the robotics field coming together!

Karl Pertsch

@KarlPertsch

16 May 2024

Our OpenX paper won best paper at ICRA! Congrats to all my co-authors! 🎉🎉 This is an ongoing effort, we recently added new datasets from the community that double the size of the OpenX dataset -- keep 'em coming! :) Check datasets & how to contribute: robotics-transformer-x.githu…

2,819

Yevgen Chebotar · Sep 7, 2023 · 10:15 PM UTC

Yevgen Chebotar

@YevgenChebotar

7 Sep 2023

By using autoregressive Bellman updates, conservative regularization, Monte Carlo and n-step returns, we are able to combine human demonstrations and autonomously collected data to learn multi-task language-conditioned policies from both successful and failed examples.

2,537
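Two of the ingredients named in the post above, Monte Carlo returns and n-step returns, are standard quantities that can be sketched in a few lines. This is a generic illustration, not the Q-Transformer code: the function names, the discount factor, and the `values` list (a hypothetical per-state value estimate) are assumptions, and how the paper combines these targets with the autoregressive Bellman update is not shown here.

```python
def monte_carlo_returns(rewards, gamma=0.98):
    """Discounted return-to-go for every step of one finished episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def n_step_target(rewards, values, t, n, gamma=0.98):
    """n-step target from step t: up to n discounted rewards plus a bootstrap.

    values[k] is a hypothetical value estimate for the state at step k.
    If the episode ends within n steps, no bootstrap term is added.
    """
    end = min(t + n, len(rewards))
    target = sum(gamma ** (k - t) * rewards[k] for k in range(t, end))
    if end < len(rewards):
        target += gamma ** (end - t) * values[end]
    return target
```

When n reaches past the end of the episode, the n-step target coincides with the Monte Carlo return from the same step.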

Yevgen Chebotar · Sep 7, 2023 · 10:15 PM UTC

Yevgen Chebotar

@YevgenChebotar

7 Sep 2023

Our real robot policies significantly improve upon RT-1 and other baselines when trained on a limited amount of human demonstrations by leveraging autonomously collected negatives and the dynamic programming properties of Q-learning.

2,116

Yevgen Chebotar · Mar 7, 2024 · 6:05 PM UTC

Yevgen Chebotar

@YevgenChebotar

7 Mar 2024

Turns out classification loss works surprisingly well for value-based RL, also some nice gains when used with Q-Transformer!

Aviral Kumar

@aviral_kumar2

7 Mar 2024

Super simple code change to get value-based deep RL scale *much* better w/ big models across the board on Atari games, robotic manipulation w/ transformers, LLM + text games, & even Chess! Just use classification loss (i.e., cross entropy), not MSE!! arxiv.org/abs/2403.03950 🧵⬇️

5,799
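The "classification loss, not MSE" idea above can be sketched by turning a scalar value target into a distribution over fixed bins and training with cross-entropy. The version below uses two-hot encoding, which is one simple way to build such a target; the linked paper may construct its targets differently. The function names, value range, and bin count are assumptions for illustration.

```python
import math


def two_hot(target, v_min, v_max, num_bins):
    """Spread a scalar target over its two nearest bin centers (num_bins >= 2)."""
    target = min(max(target, v_min), v_max)
    width = (v_max - v_min) / (num_bins - 1)
    position = (target - v_min) / width
    # Keep the lower index at most num_bins - 2 so lower + 1 stays in range.
    lower = min(int(position), num_bins - 2)
    upper_weight = position - lower
    probs = [0.0] * num_bins
    probs[lower] = 1.0 - upper_weight
    probs[lower + 1] = upper_weight
    return probs


def cross_entropy(logits, target_probs):
    """Cross-entropy between softmax(logits) and a target distribution."""
    max_logit = max(logits)
    # Log-sum-exp with the max subtracted for numerical stability.
    log_norm = max_logit + math.log(sum(math.exp(x - max_logit) for x in logits))
    return -sum(p * (x - log_norm) for x, p in zip(logits, target_probs))
```

In this setup the network outputs one logit per bin instead of a single scalar, and the predicted value is recovered as the probability-weighted average of the bin centers.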

Yevgen Chebotar · Oct 4, 2023 · 5:33 PM UTC

Yevgen Chebotar

@YevgenChebotar

4 Oct 2023

Exciting times for Robot Learning! 60 datasets from 22 different robots and 21 institutions combined in a single Open-X Embodiment data repository, resulting in over 1 million episodes and improved RT-X models! An amazing and very important collaboration across the world! 🤖🌐

Quan Vuong

@QuanVng

3 Oct 2023

RT-X: generalist AI models lead to 50% improvement over RT-1 and 3x improvement over RT-2, our previous best models. 🔥🥳🧵 Project website: robotics-transformer-x.githu…

1,429

Yevgen Chebotar · Sep 7, 2023 · 10:15 PM UTC

Yevgen Chebotar

@YevgenChebotar

7 Sep 2023

Joint work with @QuanVng, @AlexIrpan, @hausman_k, @xf1280, @Yao__Lu, @aviral_kumar2, @TianheYu, @AlexHerzog001, @KarlPertsch, @keerthanpg, @julianibarz, @ofirnachum, @Kanishka_Rao, @chelseabfinn, @svlevine

1,858

Yevgen Chebotar · Apr 19, 2021 · 8:46 PM UTC

Yevgen Chebotar

@YevgenChebotar

19 Apr 2021

Blogpost: ai.googleblog.com/2021/04/mu… Together with @hausman_k, Yao Lu, @xiao_ted, Dmitry Kalashnikov, Jake Varley, @AlexIrpan, @ben_eysenbach, @ryancjulian, @chelseabfinn, @svlevine

Yevgen Chebotar · Jun 14, 2021 · 9:37 PM UTC

Yevgen Chebotar

@YevgenChebotar

14 Jun 2021

Joint work with @hausman_k, Yao Lu, @xiao_ted, Dmitry Kalashnikov, Jake Varley, @AlexIrpan, @ben_eysenbach, @ryancjulian, @chelseabfinn, @svlevine

Yevgen Chebotar · Apr 19, 2021 · 8:46 PM UTC

Yevgen Chebotar

@YevgenChebotar

19 Apr 2021

Our method enables a real-world robotic system to accomplish a wide range of visually indicated tasks and acquires rich representations that can be used to accelerate learning of downstream tasks.

Yevgen Chebotar · Oct 9, 2017 · 3:17 AM UTC

Yevgen Chebotar

@YevgenChebotar

9 Oct 2017

Happy to be part of the exciting project on robot learning from videos! arxiv.org/abs/1704.06888 piped.video/watch?v=b1UTUQpx…

Yevgen Chebotar · Oct 17, 2023 · 11:23 PM UTC

Yevgen Chebotar

@YevgenChebotar

17 Oct 2023

Replying to @_shydrie

There was a technical problem with the old website address; the updated website is at qtransformer.github.io/