Surya Bhupatiraju · Feb 21, 2024 · 2:53 PM UTC

Surya Bhupatiraju

Surya Bhupatiraju @suryabhupa

21 Feb 2024

Thrilled to see Gemma released today, loved working on post-training with the team!

Jeff Dean

@JeffDean

21 Feb 2024

Introducing Gemma - a family of lightweight, state-of-the-art open models for their class, built from the same research & technology used to create the Gemini models. Blog post: blog.google/technology/devel… Tech report: goo.gle/GemmaReport This thread explores some of the performance characteristics of these models.

20,793

Surya Bhupatiraju · Jun 16, 2019 · 7:37 AM UTC

Surya Bhupatiraju @suryabhupa

16 Jun 2019

Thanks to all who came to our workshop in Exploration in RL! The videos, slides, and papers are now available: sites.google.com/corp/view/e…. Thanks again to our speakers and panelists @pabbeel Doina Precup, @white_martha, Emo Todorov, @RaiaHadsell, @pulkitology, and @jeffclune! :)

Home

Thanks for attending! ERL 2019 has concluded. The videos are available here: Introduction, Keynote by Doina Precup, and Author Spotlights Invited Talk by Emo Todorov, Best Paper Awards, and Invited...

sites.google.com

Surya Bhupatiraju · Aug 16, 2018 · 3:22 AM UTC

Surya Bhupatiraju @suryabhupa

16 Aug 2018

Check some new work I got to work on with @catherineols and others!

Catherine Olsson

@catherineols

16 Aug 2018

Our paper "Skill Rating for Generative Models" is now up! arxiv.org/abs/1808.04888 tl;dr: A new idea & proof-of-concept for evaluating generative models. Train a bunch of GANs. Have the generators "play against" all the discriminator snapshots. Rate them like chess players. 1/n

Surya Bhupatiraju · Apr 8, 2024 · 2:36 PM UTC

Surya Bhupatiraju @suryabhupa

8 Apr 2024

Gemma v1.1 Instruct 2B and “7B” :) are out! See @robdadashi’s thread for details, featuring improvements in multi-turn, fixing some overly-chatty features, and new RL. Even more to come soon!

Robert Dadashi @robdadashi

8 Apr 2024

I am very happy to announce that Gemma 1.1 Instruct 2B and “7B” are out! Here are a few details about the new models: 1/11

2,792

Surya Bhupatiraju · Feb 28, 2018 · 5:57 PM UTC

Surya Bhupatiraju @suryabhupa

28 Feb 2018

Check out our newest paper! Perhaps biased policy gradients really are the future of RL...

George Tucker @georgejtucker

28 Feb 2018

We looked at the sources of variance in policy gradient estimators for some common continuous control tasks, and I was surprised by the results: arxiv.org/abs/1802.10031.

Surya Bhupatiraju · Dec 6, 2018 · 2:16 PM UTC

Surya Bhupatiraju @suryabhupa

6 Dec 2018

We've launched our first DFL Fellowship to help support more people in creating high-quality ML curricula -- please apply and share! :)

Depth First Learning @DepthFirstLearn

6 Dec 2018

We’re thrilled to announce the DFL fellowship, generously funded by @JaneStreetGroup. Have curriculum ideas? We are offering 4 fellows a $4000 grant each to build a 6 week curriculum and run weekly on-line discussions. Learn more and apply at fellowship.depthfirstlearnin…! (1/3)

Surya Bhupatiraju · Nov 6, 2018 · 5:02 PM UTC

Surya Bhupatiraju @suryabhupa

6 Nov 2018

Ben Eysenbach and I organized a workshop at ICML 2018 and decided to write about our experience, what we learned, and what we would've tried differently -- check it out! We hope it helps anyone looking to try organizing a workshop: medium.com/@erl.leads/hitchh… (also go vote!)

Hitchhiker’s Guide to Organizing an Academic Workshop

Introduction

medium.com

Surya Bhupatiraju · Jun 20, 2018 · 5:51 PM UTC

Surya Bhupatiraju @suryabhupa

20 Jun 2018

Check out our guide about TRPO that I helped co-write for @DepthFirstLearn! Feedback and suggestions are welcome :)

Depth First Learning @DepthFirstLearn

20 Jun 2018

We just released our newest study guide! Learn all about TRPO from professors @kumarkagrawal and @suryabhupa → depthfirstlearning.com/2018/….

Surya Bhupatiraju · Dec 9, 2017 · 2:38 AM UTC

Surya Bhupatiraju @suryabhupa

9 Dec 2017

Just watched an AI named Ousia absolutely CRUSH a team of very qualified Quiz Bowl veterans at Quiz Bowl in the Human-Computer Question Answering competition track! #NIPS2017

Surya Bhupatiraju · Aug 2, 2017 · 10:58 PM UTC

Surya Bhupatiraju @suryabhupa

2 Aug 2017

Replying to @hardmaru

Related: how different probability metrics are related cf "On Choosing and Bounding Probability Metrics" (2002):

Surya Bhupatiraju · Jul 11, 2018 · 3:46 PM UTC

Surya Bhupatiraju @suryabhupa

11 Jul 2018

@anishathalye presenting adversarial turtles at ICML 2018 with @logan_engstrom @andrew_ilyas @antimatter15!

Surya Bhupatiraju · Jul 15, 2018 · 7:09 AM UTC

Surya Bhupatiraju @suryabhupa

15 Jul 2018

Our workshop, Exploration in RL, is starting momentarily in Room T1 at #ICML — come by and hear some amazing speakers talk about solving exploration!

Surya Bhupatiraju · Oct 2, 2017 · 9:34 PM UTC

Surya Bhupatiraju @suryabhupa

2 Oct 2017

The new Google AI Residency has been announced! It's been simply fantastic so far -- please consider applying! research.google.com/teams/br…

Surya Bhupatiraju · Jun 16, 2019 · 7:37 AM UTC

Surya Bhupatiraju @suryabhupa

16 Jun 2019

Huge thanks to the other co-organizers, including Ben Eysenbach, @shaneguML, @HarriLEdwards, @white_martha, @pyoudeyer, @EmmaBrunskill, @kenneth0stanley and sponsors @DeepMindAI and @GoogleAI!

Surya Bhupatiraju · Dec 20, 2023 · 4:09 PM UTC

Surya Bhupatiraju @suryabhupa

20 Dec 2023

Replying to @johnma2006 @_albertgu @tri_dao

Super clean implementation!

2,870

Surya Bhupatiraju · Jun 6, 2018 · 4:00 PM UTC

Surya Bhupatiraju @suryabhupa

6 Jun 2018

Check out our new educational effort!

Depth First Learning @DepthFirstLearn

6 Jun 2018

Announcing DepthFirstLearning.com! We are building a repository of study guides targeting consequential papers. Check it out, learn something in-depth, and help us build the next one. @avitaloliver @suryabhupa @kumarkagrawal @cinjoncin

Surya Bhupatiraju · Apr 9, 2024 · 11:28 AM UTC

Surya Bhupatiraju @suryabhupa

9 Apr 2024

Gemma 1.1 improves on lmsys!

Arena.ai

@arena

9 Apr 2024

Exciting news - the latest Arena result are out! @cohere's Command R+ has climbed to the 6th spot, matching GPT-4-0314 level by 13K+ human votes! It's undoubtedly the **best** open model on the leaderboard now🔥 Big congrats to @cohere's incredible work & valuable contribution to the open community! More exciting updates: - Qwen1.5-32B-Chat almost top-10 - Gemma-1.1-7B-it shows great improvement (1044 -> 1088, on par with Llama-2-70b) - Starling-7B-Beta still the best 7B with over 13K votes!

611

Surya Bhupatiraju · Mar 6, 2024 · 7:51 PM UTC

Surya Bhupatiraju @suryabhupa

6 Mar 2024

Replying to @danielhanchen

this is really thoughtfully done, thanks for surfacing all of these so clearly! we're on it :)

1,292

Surya Bhupatiraju · Apr 9, 2017 · 9:36 PM UTC

Surya Bhupatiraju @suryabhupa

9 Apr 2017

I recently gave a deep learning talk for prefrosh at MIT and I spent way too long on this slide (the rest was serious, I promise):

Surya Bhupatiraju · Mar 12, 2024 · 8:16 PM UTC

Surya Bhupatiraju @suryabhupa

12 Mar 2024

this is remarkable stuff, i can't imagine how much progress this team will continue to make!

Cognition

@cognition

12 Mar 2024

Today we're excited to introduce Devin, the first AI software engineer. Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork. Devin is an autonomous agent that solves engineering tasks through the use of its own shell, code editor, and web browser. When evaluated on the SWE-Bench benchmark, which asks an AI to resolve GitHub issues found in real-world open-source projects, Devin correctly resolves 13.86% of the issues unassisted, far exceeding the previous state-of-the-art model performance of 1.96% unassisted and 4.80% assisted. Check out what Devin can do in the thread below.

476

Surya Bhupatiraju · Mar 27, 2017 · 5:31 AM UTC

Surya Bhupatiraju @suryabhupa

27 Mar 2017

A plethora of wonderful ideas/tricks/resources related to self-learning: metacademy.org/roadmaps/rgro….

Surya Bhupatiraju · May 3, 2018 · 5:25 PM UTC

Surya Bhupatiraju @suryabhupa

3 May 2018

Come see our poster at ICLR 2018! :)

George Tucker @georgejtucker

3 May 2018

We decompose the variance of a policy gradient estimator with an action dependent baseline which provides insights into previous methods and new opportunities for improvements. Workshop poster #2 at 11am today. @suryabhupa @shanegu @svlevine #ICLR2018

Surya Bhupatiraju · May 21, 2019 · 4:33 PM UTC

Surya Bhupatiraju @suryabhupa

21 May 2019

Replying to @SmithaMilli @catherineols @danieldewey @open_phil

Huge congrats Smitha! :) Super well-deserved!!

Surya Bhupatiraju · Nov 7, 2017 · 1:15 AM UTC

Surya Bhupatiraju @suryabhupa

7 Nov 2017

Replying to @lishali88 @AmplifyPartners @pabbeel

Congrats! Really exciting stuff :)

Surya Bhupatiraju · Oct 14, 2017 · 8:20 PM UTC

Surya Bhupatiraju @suryabhupa

14 Oct 2017

Such a wonderful read rec'd by @josephwandile: paulgraham.com/hs.html by @paulg -- would've been lovely to have read this years ago.

Surya Bhupatiraju · Apr 21, 2017 · 6:23 PM UTC

Surya Bhupatiraju @suryabhupa

21 Apr 2017

Press release of some of the work in neural program synthesis I did with researchers at @MSFTResearch: microsoft.com/en-us/research….

Deep Learning for Program Synthesis - Microsoft Research

By Rishabh Singh, Jacob Devlin, Abdelrahman Mohamed, and Pushmeet Kohli, Microsoft Research Despite the many advances in computing over the past decades, the actual process of writing computer...

microsoft.com