Alireza Fathi · Jan 7, 2025 · 7:20 PM UTC

Alireza Fathi

@alirezafathi

7 Jan 2025

Our team at Google DeepMind Foundational Research is hiring full-time Research Scientists and Research Interns! Multimodal, Reasoning, self-improving agents, Video Understanding. Looking for candidates with strong papers at top ML and CV conferences. Email: af_hiring@google.com

614

61,548

Alireza Fathi · Sep 17, 2024 · 6:32 AM UTC

Alireza Fathi

@alirezafathi

17 Sep 2024

Our team at Google DeepMind is seeking a Research Scientist with a strong publication record (multiple first-author papers) on multi-modal LLMs in top ML venues like NeurIPS, ICLR, CVPR. Email me at af_hiring@google.com @CordeliaSchmid

377

53,311

Alireza Fathi · Oct 23, 2024 · 7:16 PM UTC

Alireza Fathi

@alirezafathi

23 Oct 2024

✨ Our team at Google DeepMind is hiring Research Interns (Summer 2025)! Multimodal, text-to-3D, Personalized LLMs, Video Understanding and Generation. Looking for candidates with multiple first-author papers in top ML conferences. Email: af_hiring@google.com @CordeliaSchmid

351

43,787

Alireza Fathi · Aug 5, 2025 · 4:48 AM UTC

Alireza Fathi

@alirezafathi

5 Aug 2025

Our team at Google DeepMind Foundational Research has an opening for a full-time Research Scientist! Areas of Interest are Multimodal, 3D and Spatial Reasoning, Self-improving Agents. Looking for candidates with strong publications at top ML and CV conferences. Email: af_hiring@google.com

350

37,627

Alireza Fathi · Sep 10, 2020 · 1:11 AM UTC

Alireza Fathi

@alirezafathi

10 Sep 2020

Robotics at Google has released a very high quality dataset of scanned objects. It could enable interesting research in 3d shape modeling. app.ignitionrobotics.org/Goo…

294

Alireza Fathi · Aug 18, 2025 · 5:55 AM UTC

Alireza Fathi

@alirezafathi

18 Aug 2025

We are hiring job-boards.greenhouse.io/dee…

DeepMind

job-boards.greenhouse.io

280

44,450

Alireza Fathi · Aug 26, 2021 · 6:01 AM UTC

Alireza Fathi

@alirezafathi

26 Aug 2021

Jitendra Malik's thoughts on Foundation Models, in the Stanford HAI workshop piped.video/watch?v=dG628PEN…

177

Alireza Fathi · Feb 11, 2021 · 10:25 PM UTC

Alireza Fathi

@alirezafathi

11 Feb 2021

We have released TensorFlow 3D!

Google AI

@GoogleAI

11 Feb 2021

Announcing the release of TensorFlow 3D, a set of training and evaluation pipelines for state-of-the-art 3D semantic segmentation, object detection and instance segmentation, with support for distributed training. Check it out and download the code at goo.gle/3pchcSG

Alireza Fathi · Feb 27, 2023 · 7:19 PM UTC

Alireza Fathi

@alirezafathi

27 Feb 2023

Augmenting Large Language & Visual models with Retrieval helps the model to answer questions that were not present in the training data. REVEAL is one of the recent works by our team arxiv.org/abs/2212.05221 @acbuller, @ahmetius, @jesu9, @MrZiruiWang , David Ross, @CordeliaSchmid

REVEAL: Retrieval-Augmented Visual-Language Pre-Training with...

In this paper, we propose an end-to-end Retrieval-Augmented Visual Language Model (REVEAL) that learns to encode world knowledge into a large-scale memory, and to retrieve from it to answer...

arxiv.org

3,046

Alireza Fathi · Jul 27, 2020 · 8:32 PM UTC

Alireza Fathi

@alirezafathi

27 Jul 2020

Most of the previous work on 3d object detection uses only one frame of data. In our #eccv2020 paper, we present a 3d sparse LSTM model that achieves more accurate results when applied to a sequence of point clouds. arxiv.org/abs/2007.12392

Alireza Fathi · Dec 16, 2020 · 8:09 PM UTC

Alireza Fathi

@alirezafathi

16 Dec 2020

Our recent work on object-centric neural rendering. Our new formulation makes it possible to move the objects around in the scene and still be able to render high quality images from different views.

Michelle Guo

@mshlguo

16 Dec 2020

We made NeRF compositional! By learning object-centric neural scattering functions (OSFs), we can now compose dynamic scenes from captured images of objects. Website: shellguo.com/osf Joint work with @alirezafathi @jiajunwu_cs Thomas Funkhouser

Alireza Fathi · Feb 2, 2020 · 7:16 PM UTC

Alireza Fathi

@alirezafathi

2 Feb 2020

I am glad that our #cvpr2020 reviews are very positive, but at the same time I am very worried that the quality of the reviews has significantly degraded compared to a few years ago.

Alireza Fathi · Jul 3, 2020 · 6:21 AM UTC

Alireza Fathi

@alirezafathi

3 Jul 2020

Congratulations to Yue Wang (research intern), Rui Huang (AI resident), Wanyue Zhang (AI resident) and @_abhijit_kundu_ for getting their papers accepted to #eccv2020.

Alireza Fathi · Jun 13, 2023 · 10:06 PM UTC

Alireza Fathi

@alirezafathi

13 Jun 2023

Today marks my 7th year at Google! How time flies! Thank you, Google, for giving me the opportunity to work on what I enjoy...

2,767

Alireza Fathi · Oct 11, 2024 · 8:01 AM UTC

Alireza Fathi

@alirezafathi

11 Oct 2024

Tesla’s event did a great job showing how far ahead Waymo is compared to everyone else!

1,745

Alireza Fathi · Aug 18, 2023 · 9:09 PM UTC

Alireza Fathi

@alirezafathi

18 Aug 2023

Here is our Google AI blog post on AVIS, a Large Language Model Agent that achieves state-of-the-art results on visual information seeking tasks. @acbuller @ahmetius @jesu9 @CordeliaSchmid

Google AI

@GoogleAI

18 Aug 2023

Today on the blog, read all about AVIS — Autonomous Visual Information Seeking with Large Language Models — a novel method that iteratively employs a planner and reasoner to achieve state-of-the-art results on visual information seeking tasks → goo.gle/3P2y2mY

ALT AVIS employs a dynamic decision-making strategy to respond to visual information-seeking queries.

5,720

Alireza Fathi · Jul 21, 2020 · 2:34 AM UTC

Alireza Fathi

@alirezafathi

21 Jul 2020

Our ECCV paper on "Pillar-based Object Detection for Autonomous Driving" that achieves state of the art results on 3d object detection on the Waymo Open Dataset. arxiv.org/abs/2007.10323

Alireza Fathi · Jun 1, 2023 · 7:33 PM UTC

Alireza Fathi

@alirezafathi

1 Jun 2023

REVEAL will be a highlight at @CVPR. Looking forward to discussing it in more details there with @acbuller, @ahmetius, @jesu9, @CordeliaSchmid

Google AI

@GoogleAI

1 Jun 2023

Learn how REVEAL, an end-to-end retrieval-augmented visual-language model that learns to use multi-source multi-modal data to answer knowledge-intensive queries, achieves state-of-the-art results on visual question answering and image caption tasks. goo.gle/3qcZwwc

REVEAL is an augmented visual-language model with the ability to retrieve multiple knowledge entries from a diverse set of knowledge sources, which helps generation.

2,166

Alireza Fathi · Apr 4, 2020 · 12:20 AM UTC

Alireza Fathi

@alirezafathi

4 Apr 2020

Another CVPR2020 paper by our group on detecting 3d objects and predicting their 3d shapes arxiv.org/abs/2004.01170

Alireza Fathi · Feb 7, 2019 · 5:45 PM UTC

Alireza Fathi

@alirezafathi

7 Feb 2019

Neural Networks seem to follow a puzzlingly simple strategy to classify images medium.com/bethgelab/neural-…

Alireza Fathi · Jun 28, 2021 · 6:34 PM UTC

Alireza Fathi

@alirezafathi

28 Jun 2021

We are gonna be able to go back to office starting July 12th! Never thought I would be this excited to go back to work in person :)

Alireza Fathi · Jun 8, 2020 · 9:04 AM UTC

Alireza Fathi

@alirezafathi

8 Jun 2020

Having to take shelter in place, I have been spending some time on gardening! Here is how our sour cherry tree is looking like today!

Alireza Fathi · Aug 18, 2020 · 7:25 PM UTC

Alireza Fathi

@alirezafathi

18 Aug 2020

Looking forward to presenting our work on 3d scene understanding in the Deep Learning 2.0 Virtual Summit.

Hollie Jaques @reworkhollie

18 Aug 2020

I am looking forward to Alireza Fathi presenting his research advancements at the Deep Learning 2.0 Virtual Summit, Jan 2021. Alireza is currently working on object detection and segmentation in 3D. Join us, and Alireza in January: re-work.co/summits/deep-lear… #computervision

Alireza Fathi · Jun 4, 2025 · 11:28 PM UTC

Alireza Fathi

@alirezafathi

4 Jun 2025

That is why you need Lidar! It is not enough to detect the event eventually! A hundred milliseconds late in detecting such event will result in a catastrophic crash!

Dmitri Dolgov

@dmitri_dolgov

4 Jun 2025

Kids chasing dogs, chasing balls on the streets of LA… once again, @Waymo AI with advanced sensing making our roads safer.

1,258

Alireza Fathi · Apr 3, 2020 · 8:04 PM UTC

Alireza Fathi

@alirezafathi

3 Apr 2020

Great work Francis Engelman! Our CVPR 2020 paper achieving the state of the art results on 3d instance segmentation in ScanNet and S3DIS :) arxiv.org/abs/2003.13867

Matthias Niessner

@MattNiessner

3 Apr 2020

"3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation" #CVPR2020 piped.video/ifL8yTbRFDk We perform SemInstSeg by proposal aggregation using a GraphConvNet to model higher-order proposal interactions! Great results on ScanNet and S3DIS :) @FrancisEngelman

Alireza Fathi · Jun 19, 2019 · 8:38 PM UTC

Alireza Fathi

@alirezafathi

19 Jun 2019

Vote for CVPR 2023 at Vancouver if you are at #CVPR2019

Greg Mori @greg_mori

14 Jun 2019

It’s hard to think of a better place than #Vancouver for #CVPR 2023. Beyond our strong team, it’s fitting that a conference on vision should take place in one of the most beautiful spots on earth. Check out our awesome bid #AINorth #AI #computervision cs.sfu.ca/~mori/cvpr2023_van…

Alireza Fathi · Nov 9, 2022 · 6:41 PM UTC

Alireza Fathi

@alirezafathi

9 Nov 2022

I am sorry to see colleagues and friends getting affected by mass layoffs in recent days. Please reach out and I would try my best to help with any resources I can think of. Hopefully things will bounce back soon.

Alireza Fathi · Apr 30, 2020 · 4:52 PM UTC

Alireza Fathi

@alirezafathi

30 Apr 2020

One of the sad things during this pandemic is to observe the ugly gap between the rich and the poor. At the same time that the rich stays home and orders groceries online to avoid exposure, the poor shops those groceries in store and delivers them to make a living

Alireza Fathi · Mar 11, 2024 · 3:45 PM UTC

Alireza Fathi

@alirezafathi

11 Mar 2024

Check out our CVPR paper on generative retrieval for web-scale entity recognition!

Mathilde Caron @mcaron31

11 Mar 2024

Happy to introduce GERALD - our new VLM that recognizes 6M+ entities, an exciting step towards Web-scale visual entity recognition! Predictions are simply made by auto-regressively decoding a code representing the entity name. Check out our CVPR24 paper: arxiv.org/abs/2403.02041

920

Alireza Fathi · Nov 8, 2024 · 8:05 AM UTC

Alireza Fathi

@alirezafathi

8 Nov 2024

Our new #Neurips2024 paper explores the power of multimodal LLMs for building better datasets. We demonstrate significant improvements on visual entity recognition with a novel approach to label verification and data enrichment.

Ahmet Iscen @ahmetius

8 Nov 2024

Our new #NeurIPS2024 paper tackles web-scale visual entity recognition by automatically curating a training dataset with a multimodal LLM, achieving SOTA results (+6.9% on OVEN)! Learn how we use multimodal LLMs for label verification and data enrichment: arxiv.org/abs/2410.23676

1,311

Alireza Fathi · Jun 16, 2023 · 3:57 AM UTC

Alireza Fathi

@alirezafathi

16 Jun 2023

🚀Introducing AVIS: a groundbreaking system that couples #LLM powered planning & reasoning with external tools, resulting in #StateOfTheArt performance on VQA datasets that demand external knowledge! 🧠🔍

@_akhaliq

16 Jun 2023

AVIS: Autonomous Visual Information Seeking with Large Language Models paper page: huggingface.co/papers/2306.0… In this paper, we propose an autonomous information seeking visual question answering framework, AVIS. Our method leverages a Large Language Model (LLM) to dynamically strategize the utilization of external tools and to investigate their outputs, thereby acquiring the indispensable knowledge needed to provide answers to the posed questions. Responding to visual questions that necessitate external knowledge, such as "What event is commemorated by the building depicted in this image?", is a complex task. This task presents a combinatorial search space that demands a sequence of actions, including invoking APIs, analyzing their responses, and making informed decisions. We conduct a user study to collect a variety of instances of human decision-making when faced with this task. This data is then used to design a system comprised of three components: an LLM-powered planner that dynamically determines which tool to use next, an LLM-powered reasoner that analyzes and extracts key information from the tool outputs, and a working memory component that retains the acquired information throughout the process. The collected user behavior serves as a guide for our system in two key ways. First, we create a transition graph by analyzing the sequence of decisions made by users. This graph delineates distinct states and confines the set of actions available at each state. Second, we use examples of user decision-making to provide our LLM-powered planner and reasoner with relevant contextual instances, enhancing their capacity to make informed decisions. We show that AVIS achieves state-of-the-art results on knowledge-intensive visual question answering benchmarks such as Infoseek and OK-VQA.

1,891
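The abstract quoted above describes a planner, a reasoner, a working memory, and a transition graph that limits which actions are available in each state. Below is a minimal sketch of that control loop. Everything in it is an assumption made for illustration: the tool names, the contents of the transition graph, and the trivial stand-ins for the LLM-powered planner and reasoner are hypothetical and are not taken from the paper or from any released code.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

# Hypothetical transition graph: current state -> actions allowed next.
# In the paper this graph is derived from a user study; here it is hand-written.
TRANSITIONS: Dict[str, List[str]] = {
    "start": ["image_search", "object_detect"],
    "object_detect": ["image_search", "answer"],
    "image_search": ["web_search", "answer"],
    "web_search": ["answer"],
}


@dataclass
class WorkingMemory:
    """Retains what has been learned and which actions were taken."""
    facts: List[str] = field(default_factory=list)
    trace: List[str] = field(default_factory=list)


Planner = Callable[[str, WorkingMemory, List[str]], str]
Reasoner = Callable[[str, str], Optional[str]]


def run_sketch(question: str,
               tools: Dict[str, Callable[[str], str]],
               planner: Planner,
               reasoner: Reasoner,
               max_steps: int = 5) -> WorkingMemory:
    memory = WorkingMemory()
    state = "start"
    for _ in range(max_steps):
        allowed = TRANSITIONS[state]
        # The planner chooses the next action, restricted to the allowed set.
        action = planner(question, memory, allowed)
        if action not in allowed:
            raise ValueError(f"{action!r} is not allowed from state {state!r}")
        memory.trace.append(action)
        if action == "answer":
            break
        # The reasoner extracts key information from the raw tool output.
        fact = reasoner(question, tools[action](question))
        if fact is not None:
            memory.facts.append(fact)
        state = action
    return memory


# Stand-ins for the LLM components and the external tools (all hypothetical).
def first_allowed(question: str, memory: WorkingMemory, allowed: List[str]) -> str:
    return allowed[0]


def keep_non_empty(question: str, output: str) -> Optional[str]:
    return output.strip() or None


STUB_TOOLS: Dict[str, Callable[[str], str]] = {
    "image_search": lambda q: "landmark: example building",
    "object_detect": lambda q: "object: building",
    "web_search": lambda q: "",
}

demo = run_sketch("What event is commemorated by this building?",
                  STUB_TOOLS, first_allowed, keep_non_empty)
print(demo.trace, demo.facts)
```

In a real system the planner and reasoner would be LLM calls prompted with examples of user decision-making, as the abstract states; the stubs here only make the control flow visible.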

Alireza Fathi · Sep 20, 2018 · 6:28 AM UTC

Alireza Fathi

@alirezafathi

20 Sep 2018

techcrunch.com/2018/09/19/am…

Alireza Fathi · May 3, 2025 · 6:52 PM UTC

Alireza Fathi

@alirezafathi

3 May 2025

Random thought! No company is as undervalued as Waymo! If its self-driving technology is used in 10% of the ~100M global annual car sales (recent Toyota deal) resulting in $5k per vehicle, that’s $50bn in annual revenue! And pair that with Android car OS and the rest of the Google ecosystem!

1,612
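The back-of-envelope figure in the post above can be reproduced in a few lines. This is only an arithmetic check of the numbers as stated in the post (10% adoption, roughly 100M annual car sales, $5k per vehicle); the variable names are illustrative, and none of the inputs are verified here.

```python
# Inputs exactly as stated in the post; integers are used to avoid float rounding.
annual_car_sales = 100_000_000      # ~100M global annual car sales
adoption_percent = 10               # 10% of those vehicles
revenue_per_vehicle_usd = 5_000     # $5k per vehicle

vehicles = annual_car_sales * adoption_percent // 100
annual_revenue_usd = vehicles * revenue_per_vehicle_usd

print(f"{vehicles:,} vehicles -> ${annual_revenue_usd / 1e9:.0f}bn per year")
```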

Alireza Fathi · Feb 10, 2018 · 3:09 AM UTC

Alireza Fathi

@alirezafathi

10 Feb 2018

We have just released the instance segmentation support for the TensorFlow Object Detection API. #TensorFlow #ObjectDetection #Google #API #Segmentation #InstanceSegmentation github.com/tensorflow/models…

Alireza Fathi · Mar 27, 2021 · 6:16 PM UTC

Alireza Fathi

@alirezafathi

27 Mar 2021

Google has launched its best thing for everything guide. No need for consumer reports subscription anymore! shopping.google.com/m/bestth…

Alireza Fathi · Dec 3, 2019 · 9:47 PM UTC

Alireza Fathi

@alirezafathi

3 Dec 2019

Sundar Pichai is now the CEO of Alphabet... blog.google/inside-google/co…

Alireza Fathi · Jan 3, 2021 · 9:52 PM UTC

Alireza Fathi

@alirezafathi

3 Jan 2021

Something interesting that I just learned today! Are green, red, yellow and orange bell peppers different or the same? bbc.co.uk/newsround/45522834

Alireza Fathi · Jun 19, 2019 · 9:47 PM UTC

Alireza Fathi

@alirezafathi

19 Jun 2019

Great job Steven. A network for predicting surface normals running in real-time on a pixel 2 phone @StevenDHickson @aCromulentName Kevin Murphy @irrfaan arxiv.org/abs/1906.06792

Alireza Fathi · Aug 8, 2018 · 3:33 PM UTC

Alireza Fathi

@alirezafathi

8 Aug 2018

After almost a decade and billions in outside investment, Magic Leap's first product is finally on sale for $2,295. Here's what it's like. cnbc.com/2018/08/08/magic-le… #MagicLeap

Alireza Fathi · Jun 10, 2020 · 9:40 PM UTC

Alireza Fathi

@alirezafathi

10 Jun 2020

An interesting blog post on using unity for creating synthetic data for object detection and beyond blogs.unity3d.com/2020/06/10…

Alireza Fathi · Apr 22, 2019 · 7:18 PM UTC

Alireza Fathi

@alirezafathi

22 Apr 2019

Waymo Truck

Alireza Fathi · Apr 12, 2023 · 5:44 PM UTC

Alireza Fathi

@alirezafathi

12 Apr 2023

In this work led by @ahmetius we show that image recognition can benefit when retrieving similar images from a web-scale corpus of image-text pairs.

Ahmet Iscen @ahmetius

12 Apr 2023

New #CVPR2023 paper "Improving Image Recognition by Retrieving from Web-Scale Image-Text Data". arxiv.org/abs/2304.05173 We improve the recognition capabilities of the model by retrieving images/texts from large-scale memory. Joint work with @alirezafathi and @CordeliaSchmid .

1,278

Alireza Fathi · Oct 13, 2020 · 11:00 PM UTC

Alireza Fathi

@alirezafathi

13 Oct 2020

Here is the link if you are interested in applying for the Google Summer Research Internship :) careers.google.com/jobs/resu…

Alireza Fathi · Sep 30, 2019 · 2:48 AM UTC

Alireza Fathi

@alirezafathi

30 Sep 2019

Great course for learning deep reinforcement learning!

Sergey Levine

@svlevine

29 Sep 2019

Want to learn deep RL? My deep RL course now has a permanent course number (CS285) and is being offered this semester: rail.eecs.berkeley.edu/deepr… Lecture videos here (so far, we've gotten through most of model-free RL, model-based RL coming up next): piped.video/playlist?list=PL…

Alireza Fathi · Jul 25, 2020 · 9:27 AM UTC

Alireza Fathi

@alirezafathi

25 Jul 2020

I have a #TensorFlow joke but I need to be in eager mode!

Alireza Fathi · Sep 10, 2019 · 7:18 AM UTC

Alireza Fathi

@alirezafathi

10 Sep 2019

This would be a great resource for software engineers and researchers outside Google

Peyman Milanfar

@docmilanfar

10 Sep 2019

Google's software engineering best practices facilitate consistency & productivity. All code is peer reviewed for clarity, correctness, and adherence to standards. We've just published these practices. Highly recommended for any lab, academic or otherwise. google.github.io/eng-practic…

Alireza Fathi · Sep 24, 2021 · 7:58 PM UTC

Alireza Fathi

@alirezafathi

24 Sep 2021

OpenAI's new model fine-tuned from GPT3 for summarizing books! openai.com/blog/summarizing-…

Alireza Fathi · Sep 5, 2023 · 11:51 PM UTC

Alireza Fathi

@alirezafathi

5 Sep 2023

Happy 25th birthday Google 🎉

Jeff Dean

@JeffDean

5 Sep 2023

Happy 25th Birthday Google! 🎉 I have gotten incredible enjoyment from being along for the ride for 24+ of these years. When I joined, we were a handful of people wedged into a small office area in downtown Palo Alto above what is now a T-Mobile store. 1/

1,167

Alireza Fathi · Aug 1, 2023 · 8:19 PM UTC

Alireza Fathi

@alirezafathi

1 Aug 2023

These short Neurips reviews could be done by LLMs! Probably we don't need reviewers anymore...LLM would write the review and AC makes the decision by looking at the review and the paper!

3,013

Alireza Fathi · Sep 4, 2019 · 4:25 AM UTC

Alireza Fathi

@alirezafathi

4 Sep 2019

Moore's law vs. reality animation. Very cool.

Lionel Page

@page_eco

3 Sep 2019

Fascinating: Moore’s Law predictions vs actual growth in transistor count. by @datagrapha teddit.net/r/dataisbeautiful…

Alireza Fathi · Jan 20, 2019 · 6:04 AM UTC

Alireza Fathi

@alirezafathi

20 Jan 2019

Replying to @realDonaldTrump

Remove travel ban! #RemoveTravelBan

Alireza Fathi · Apr 15, 2019 · 6:25 AM UTC

Alireza Fathi

@alirezafathi

15 Apr 2019

Replying to @elonmusk

One keeps a car for 5 years on average. I promise u there won't be self driving cars in streets five years from now :)

Alireza Fathi · Aug 23, 2019 · 4:56 AM UTC

Alireza Fathi

@alirezafathi

23 Aug 2019

An interesting blog post on transformers in deep learning models

Peter Bloem (@pbloem@sigmoid.social) @pbloemesquire

20 Aug 2019

New blogpost! Transformers from scratch. Modern transformers are super simple, so we can explain them in a really straightforward manner. Includes pytorch code. peterbloem.nl/blog/transform…

Alireza Fathi · Nov 17, 2021 · 5:18 AM UTC

Alireza Fathi

@alirezafathi

17 Nov 2021

Replying to @fdellaert

So you submitted HiNeRF to CVPR? :D

Alireza Fathi · Aug 9, 2019 · 6:58 AM UTC

Alireza Fathi

@alirezafathi

9 Aug 2019

Congrats to Martial Hebert for becoming the new dean of School of Computer Science at CMU ri.cmu.edu/hebert-named-dean…

Hebert Named Dean of Carnegie Mellon’s Top-Ranked School of Computer Science - Robotics Institute...

Acclaimed computer scientist and AI researcher has led Robotics Institute since 2014 PITTSBURGH—Martial Hebert, a leading researcher in computer vision and robotics, has been named dean of Carnegie...

ri.cmu.edu

Alireza Fathi · Jul 28, 2020 · 6:39 PM UTC

Alireza Fathi

@alirezafathi

28 Jul 2020

An interesting podcast with Jitendra Malik on challenges in computer vision piped.video/watch?v=LRYkH-fA…

Alireza Fathi · May 8, 2025 · 6:14 PM UTC

Alireza Fathi

@alirezafathi

8 May 2025

"We continue to see overall query growth in Search. That includes an increase in total queries coming from Apple’s devices and platforms. More generally, as we enhance Search with new features, people are seeing that Google Search is more useful for more of their queries — and they’re accessing it for new things and in new ways, whether from browsers or the Google app, using their voice or Google Lens. We’re excited to continue this innovation and look forward to sharing more at Google I/O." - blog.google/products/search/…

Here's our statement on this morning’s press reports about Search traffic.

blog.google

593

Alireza Fathi · Jan 17, 2024 · 1:32 AM UTC

Alireza Fathi

@alirezafathi

17 Jan 2024

Spread between 2-year and 30-year U.S. Treasury securities over time!

428

Alireza Fathi · Aug 11, 2019 · 5:45 AM UTC

Alireza Fathi

@alirezafathi

11 Aug 2019

'3D' is the most frequently used keyword after 'detection' in CVPR 2019 towardsdatascience.com/lates…

Alireza Fathi · Jan 31, 2025 · 8:36 PM UTC

Alireza Fathi

@alirezafathi

31 Jan 2025

Replying to @anikembhavi @ICCVConference

In CVPR, some reviewers came up with "I just got sick" or "I just realized this paper is not related to my expertise" excuses. Hopefully you will find a way to handle those cases too.

590

Alireza Fathi · Dec 11, 2024 · 11:06 PM UTC

Alireza Fathi

@alirezafathi

11 Dec 2024

Replying to @m__dehghani

Somehow there is a very large jump from step 1.5 to 1.75 :)

1,635

Alireza Fathi · Apr 19, 2025 · 5:14 AM UTC

Alireza Fathi

@alirezafathi

19 Apr 2025

A cool demo of project Astra on Google AR glasses (building on Android XR)! A look at the future piped.video/gElClXpg4J0

The Next Computer? Your Glasses | Shahram Izadi | TED

Picture this: you’re wearing a normal-looking pair of glasses, but ...

youtube.com

1,266

Alireza Fathi · Mar 16, 2019 · 9:45 PM UTC

Alireza Fathi

@alirezafathi

16 Mar 2019

technologyreview.com/s/61311…

Alireza Fathi · Aug 9, 2020 · 8:57 AM UTC

Alireza Fathi

@alirezafathi

9 Aug 2020

Replying to @JeffDean

Maximum possible distance on earth is about 19,000km. So this one is probably very unlikely to beat :) en.m.wikipedia.org/wiki/Extr…

Extremes on Earth - Wikipedia

en.wikipedia.org

Alireza Fathi · Aug 2, 2023 · 5:04 PM UTC

Alireza Fathi

@alirezafathi

2 Aug 2023

Replying to @negar_rz @3scorciav @CVPR @ICCVConference

I was thinking LLM mostly does a summarization and comparison to previous work. Not necessarily scoring the paper. This would make ACs job much easier, but AC would make the final decision by both looking at the summary and the paper itself.

172

Alireza Fathi · Feb 11, 2021 · 11:47 PM UTC

Alireza Fathi

@alirezafathi

11 Feb 2021

Replying to @0xMattGray @GoogleAI

3D object detection and segmentation for self driving cars / robotics, augmented reality, etc.

Alireza Fathi · Dec 4, 2019 · 7:56 AM UTC

Alireza Fathi

@alirezafathi

4 Dec 2019

Interesting to know! Number of deaths by risk factor ourworldindata.org/grapher/n…

Alireza Fathi · Jun 14, 2023 · 2:00 AM UTC

Alireza Fathi

@alirezafathi

14 Jun 2023

Replying to @docmilanfar

That probably is right. But raising $90M in the current environment where most startups are having a hard time raising any money is a very strong signal

772

Alireza Fathi · Jun 9, 2023 · 7:58 PM UTC

Alireza Fathi

@alirezafathi

9 Jun 2023

Replying to @_akhaliq

Everything is now "Everything Everywhere All at Once"!

1,180

Alireza Fathi · Mar 5, 2019 · 2:30 AM UTC

Alireza Fathi

@alirezafathi

5 Mar 2019

GPipe, an Open Source Library for Efficiently Training Large-scale Neural Network Models ai.googleblog.com/2019/03/in…

Alireza Fathi · Sep 30, 2020 · 6:11 PM UTC

Alireza Fathi

@alirezafathi

30 Sep 2020

This is how betting odds changed after last night's debate realclearpolitics.com/electi…

Alireza Fathi · Feb 3, 2021 · 5:18 AM UTC

Alireza Fathi

@alirezafathi

3 Feb 2021

I don't play video games, but if I do, I play FIFA! And it will be great if I have a work related reason to play! engadget.com/ea-fifa-21-foot…

'FIFA 21' comes to Google Stadia on March 17th - Engadget

EA is bringing more than one football game to Stadia. Today, the publisher announced that FIFA 21, the latest entry in 'the beautiful game' franchise, will available to stream from March 17th. It...

engadget.com

Alireza Fathi · Feb 7, 2019 · 6:37 AM UTC

Alireza Fathi

@alirezafathi

7 Feb 2019

Google's plan to build 6,600 houses in Mountain View realestate.withgoogle.com/no…

Alireza Fathi · Aug 8, 2025 · 12:08 AM UTC

Alireza Fathi

@alirezafathi

8 Aug 2025

Which company has best AI model end of August? polymarket.com/event/which-c…

1,330

Alireza Fathi · Nov 23, 2019 · 1:28 AM UTC

Alireza Fathi

@alirezafathi

23 Nov 2019

Pretty exciting project at Google X wired.com/story/alphabets-dr…

Alireza Fathi · Jul 10, 2020 · 6:09 PM UTC

Alireza Fathi

@alirezafathi

10 Jul 2020

Folks in our team have released the Tensorflow 2.0 version of Object Detection API #tensorflow #ObjectDetection blog.tensorflow.org/2020/07/…

Alireza Fathi · Jul 3, 2023 · 5:45 PM UTC

Alireza Fathi

@alirezafathi

3 Jul 2023

"Model the world, not the data"!

510

Alireza Fathi · Jun 17, 2019 · 6:26 AM UTC

Alireza Fathi

@alirezafathi

17 Jun 2019

Rumors that apparently Apple is buying drive ai engadget.com/2019/06/06/appl…

Alireza Fathi · Aug 27, 2019 · 1:32 AM UTC

Alireza Fathi

@alirezafathi

27 Aug 2019

This might be a useful idea for last minute researchers like myself :)

Devi Parikh

@deviparikh

26 Aug 2019

I have a system to plan writing papers for conference deadlines. My students and some collaborators know about it. With the ICLR 2020 deadline coming up, I thought this might be a good time to share this with a wider audience. link.medium.com/XASmjK6ftZ

Alireza Fathi · Jun 27, 2020 · 4:40 AM UTC

Alireza Fathi

@alirezafathi

27 Jun 2020

cnbc.com/2020/06/26/amazon-b…

Alireza Fathi · Jun 6, 2019 · 2:12 AM UTC

Alireza Fathi

@alirezafathi

6 Jun 2019

Replying to @drfeifei @leto__jean @EmmaBrunskill @silviocinguetta

Congratulations @yukez and @drfeifei. Have been lucky to work with both of you

Alireza Fathi · Jul 21, 2019 · 10:38 PM UTC

Alireza Fathi

@alirezafathi

21 Jul 2019

Ego is the anesthesia that deadens the pain of stupidity #famousquotes

Alireza Fathi · Sep 24, 2019 · 4:57 PM UTC

Alireza Fathi

@alirezafathi

24 Sep 2019

Google just publicly released its DeepFakes dataset so all researchers can work on it.

Sundar Pichai

@sundarpichai

24 Sep 2019

Detecting deepfakes is one of the most important challenges ahead of us. Following our release of a synthetic audio dataset in Jan, we're releasing a large dataset of visual deepfakes to support researchers working on synthetic video detection #GoogleAI ai.googleblog.com/2019/09/co…

Alireza Fathi · Aug 22, 2019 · 7:34 AM UTC

Alireza Fathi

@alirezafathi

22 Aug 2019

Waymo open dataset is publicly released. Orders of magnitude larger than Kitti

Waymo

@Waymo

21 Aug 2019

Today, we're launching our Waymo Open Dataset. This high resolution lidar and camera data has been collected by our self-driving cars across a diverse range of situations. We're excited to share it directly with the research community. Download now: waymo.com/open

Alireza Fathi · Aug 3, 2019 · 7:46 AM UTC

Alireza Fathi

@alirezafathi

3 Aug 2019

It is true 🙂

Brent Mittelstadt @bmittelstadt.bsky.social @b_mittelstadt

2 Aug 2019

This might be the perfect overhyped #AI meme. Courtesy of @c_russl

Alireza Fathi · Aug 1, 2018 · 5:53 PM UTC

Alireza Fathi

@alirezafathi

1 Aug 2018

Working from Google SF today! Look at the view... #sf #working #Google #googlesf

Alireza Fathi · Dec 30, 2018 · 8:51 PM UTC

Alireza Fathi

@alirezafathi

30 Dec 2018

I feel so out of touch with the people and what they care about around me. I thought I will look at Google trends to see what people are thinking about politics or economic situation, but I realized the main thing they care about at this moment is #NFL

Alireza Fathi · Aug 2, 2023 · 6:04 PM UTC

Alireza Fathi

@alirezafathi

2 Aug 2023

200 Billion galaxies in the observable universe, and each galaxy has on average 100 Million stars! Don't take your life so serious stressing out for things that do not even matter on multi-galaxy level!

416

Alireza Fathi · Oct 17, 2019 · 1:32 AM UTC

Alireza Fathi

@alirezafathi

17 Oct 2019

Amazing photos from Pixel 4 show how computer vision and machine learning can give a strong boost to the camera hardware cnet.com/google-amp/news/16-…

Alireza Fathi · Jul 19, 2019 · 12:11 AM UTC

Alireza Fathi

@alirezafathi

19 Jul 2019

NeurIPS2019 Competition tracks are released, including a 20K competition on 3d object detection organized by Lyft #NeurIPS #NeurIPS2019 nips.cc/Conferences/2019/Com…

Alireza Fathi · Dec 20, 2018 · 6:12 PM UTC

Alireza Fathi

@alirezafathi

20 Dec 2018

Fill in the blanks! What is your prediction on where this curve is going? #NASDAQ

Alireza Fathi · Oct 22, 2019 · 7:26 AM UTC

Alireza Fathi

@alirezafathi

22 Oct 2019

More than 17 million Americans have more than 1 million dollars in assets! en.m.wikipedia.org/wiki/Mill…

Alireza Fathi · Mar 6, 2019 · 10:13 PM UTC

Alireza Fathi

@alirezafathi

6 Mar 2019

Wow...Go Man U...What a come back...

Alireza Fathi · Aug 7, 2025 · 12:10 AM UTC

Alireza Fathi

@alirezafathi

7 Aug 2025

Replying to @HesamAslan

Try this prompt on a video generation model: “white ball hits other balls and scatters them around on a pool table” and see how good the model is at physics :)

267

Alireza Fathi · Aug 30, 2019 · 4:13 AM UTC

Alireza Fathi

@alirezafathi

30 Aug 2019

On Netflix beginning Sep 20

This tweet is unavailable

Alireza Fathi · Aug 30, 2019 · 1:17 AM UTC

Alireza Fathi

@alirezafathi

30 Aug 2019

Interesting deep learning research at hardware level phys.org/news/2019-08-all-op…

Alireza Fathi · Jul 11, 2020 · 12:21 AM UTC

Alireza Fathi

@alirezafathi

11 Jul 2020

This whole last few months feels like a dream. One weird part of this dream is that everyday I wake up I see stocks going up! #ShelterInPlace

Alireza Fathi · Jun 17, 2024 · 4:29 PM UTC

Alireza Fathi

@alirezafathi

17 Jun 2024

🔥

Ahmet Iscen @ahmetius

14 Jun 2024

🔥 Calling all #CVPR2024 attendees! 🔥 Join us for the 1st Tool-Augmented VIsion (TAVI) Workshop on Monday morning in Summit 321! 💡 5 inspiring keynote talks 🎨 5 invited posters from the main conference Don't miss out! ➡️ More info: sites.google.com/corp/view/t…

565

Alireza Fathi · May 15, 2018 · 12:45 PM UTC

Alireza Fathi

@alirezafathi

15 May 2018

wsj.com/articles/jpmorgan-ta…

Alireza Fathi · Jul 13, 2018 · 6:49 AM UTC

Alireza Fathi

@alirezafathi

13 Jul 2018

wired.com/story/alphabet-goo…