Ali Eslami · May 5, 2022 · 4:24 PM UTC

Ali Eslami

Pinned Tweet

Ali Eslami

@arkitus

5 May 2022

AI is a form of empirical philosophy. I bet that if Plato, Phyrro, Descartes or Wittgenstein were around now, they’d be tinkering with neural networks. With language models, with generative models and with agents.

467

Ali Eslami · Oct 3, 2019 · 2:09 PM UTC

Ali Eslami

@arkitus

3 Oct 2019

Now for something different! Deep RL + GAN training + CelebA = artificial caricature. Agents learn to draw simplified (artistic?) portraits via trial and error. @ #NeurIPS2019 creativity workshop. Animated paper: learning-to-paint.github.io PDF: arxiv.org/abs/1910.01007 Thread.

189

550

Ali Eslami · Jun 14, 2018 · 6:03 PM UTC

Ali Eslami

@arkitus

14 Jun 2018

"Neural scene representation and rendering" now in @sciencemagazine. By training deep networks to predict what scenes look like from new viewpoints, we get them to understand images: deepmind.com/blog/neural-sce… @DeepSpiker @OriolVinyalsML @theophaneweber @demishassabis

160

491

Ali Eslami · Sep 18, 2018 · 10:49 AM UTC

Ali Eslami

@arkitus

18 Sep 2018

Conditional Neural Process implementation by Marta Garnelo! Neural networks meet stochastic processes. Useful for few-shot regression, classification and meta-learning: 1. Repo: github.com/deepmind/conditio… 2. Notebook: github.com/deepmind/conditio… 3. Paper: arxiv.org/abs/1807.01613

178

474

Ali Eslami · Aug 4, 2025 · 11:05 AM UTC

Ali Eslami

@arkitus

4 Aug 2025

This started as a prototype we built back in March 2024 which we called ‘Neural Google’. The core idea: wrap Gemini around Google and let it do the searching for you.

457

54,828

Ali Eslami · Sep 22, 2025 · 11:04 AM UTC

Ali Eslami

@arkitus

22 Sep 2025

This is AGI complete

429

38,617

Ali Eslami · May 6, 2025 · 3:09 PM UTC

Ali Eslami

@arkitus

6 May 2025

Gemini 2.5 Pro just got even better at code ✨ #1 on LMArena with 1448 Elo, #1 on WebDev Arena with 1420 Elo. Also SOTA for video, with 84.8% on VideoMME. @TimBettridge vibe-coded a 3D tour of the Art Institute of Chicago's collection with it, right in @GeminiApp Canvas 🎨

402

27,926

Ali Eslami · Feb 19, 2019 · 12:45 PM UTC

Ali Eslami

@arkitus

19 Feb 2019

Notebooks for Neural Processes (NPs) and Attentive Neural Processes (ANPs) now available. Compared to CNPs (published last year): 1. NPs model the function with a global latent, and 2. ANPs fit the data better. Kudos to @hyunjik11 and Marta Garnelo et al! github.com/deepmind/neural-p…

371

Ali Eslami · May 23, 2019 · 7:29 AM UTC

Ali Eslami

@arkitus

23 May 2019

Getting closer to the dream! A network that uses unlabelled images to boost performance when labels are scarce (new SOTA), and it's no worse than ResNet when labels are plentiful. Also: Unsupervised net + just a linear on top outperforms original AlexNet! arxiv.org/abs/1905.09272

350

Ali Eslami · Feb 26, 2020 · 11:05 AM UTC

Ali Eslami

@arkitus

26 Feb 2020

Introducing PolyGen: an autoregressive model of 3D meshes. arxiv.org/abs/2002.10880 Transformers + Pointer Nets = train on raw mesh data (i.e. variable-length lists of vertices and faces). No need to voxelise or rasterise! with @charlietcnash @yaroslav_ganin @PeterWBattaglia

274

Ali Eslami · Jun 27, 2025 · 7:36 AM UTC

Ali Eslami

@arkitus

27 Jun 2025

better late than never lots of rough edges still but the team is grinding do share feedback github.com/google-gemini/gem…

287

26,021

Ali Eslami · Dec 6, 2023 · 3:41 PM UTC

Ali Eslami

@arkitus

6 Dec 2023

10 years ago I didn't think I'd see this in my lifetime. Welcome to the future. piped.video/watch?v=UIZAiXYc…

The capabilities of multimodal AI | Gemini Demo

Our natively multimodal AI model Gemini is capable of reasoning acr...

youtube.com

260

46,683

Ali Eslami · Dec 12, 2018 · 1:29 PM UTC

Ali Eslami

@arkitus

12 Dec 2018

Clean and performant PyTorch implementation of Generative Query Networks by Shohei Taniguchi: github.com/iShohei220/torch-… Pixyz implementation by Shohei Taniguchi and Masahiro Suzuki: github.com/masa-su/pixyzoo/t… Pixyz (deep generative modeling library): github.com/masa-su/pixyz 👌

260

Ali Eslami · Aug 5, 2025 · 3:07 PM UTC

Ali Eslami

@arkitus

5 Aug 2025

Genie 3 is the most impressive AI demo I've seen since ChatGPT. In 2016 we were working on 'Neural Representation and Rendering' and could already see a vague path to this. But I didn’t think it’d happen so soon.

236

28,927

Ali Eslami · Jul 12, 2020 · 5:43 PM UTC

Ali Eslami

@arkitus

12 Jul 2020

@irinavlh, @DaniloJRezende and I are presenting a tutorial at ICML on 'Representation Learning Without Labels'. Jul 13, 9 AM to 12 PM and 7 PM to 10 PM (BST) icml.cc/virtual/2020/tutoria… icml.cc/Conferences/2020/Sch… Drop by to find out what Plato has to do with VAEs, GANs and SimCLR!

233

Ali Eslami · Sep 22, 2023 · 12:10 PM UTC

Ali Eslami

@arkitus

22 Sep 2023

We're hiring a research scientist to join our Quantum Chemistry and Materials team ⚛️🚨 The team is working on using machine learning to better our understanding of the universe, down at the level of quantum physics. See: deepmind.com/blog/simulating… Share: boards.greenhouse.io/deepmin…

207

53,948

Ali Eslami · Nov 26, 2018 · 1:31 PM UTC

Ali Eslami

@arkitus

26 Nov 2018

This is a fantastic resource. It was written before the deep revolution and therefore provides good context for it all. I spent most of the the first year of my PhD jumping from chapter to chapter of this book.

Microsoft Research @MSFTResearchCam

26 Nov 2018

"Pattern Recognition and Machine Learning" by @ChrisBishopMSFT is now available as a free download. Download your copy today for an introduction to the fields of pattern recognition & machine learning: aka.ms/prml #ML #Insights

198

Ali Eslami · Oct 30, 2018 · 12:24 PM UTC

Ali Eslami

@arkitus

30 Oct 2018

Cool work by Brett Göhre, showing that a GQN trained on synthetic data can be leveraged to transfer to real images. Impressive results! docs.google.com/presentation… @brett_gohre

197

Ali Eslami · May 28, 2024 · 10:13 AM UTC

Ali Eslami

@arkitus

28 May 2024

Love this visualisation. Drives home just how two-dimensional vision is, and how much work our brains do to make it feel 3D. We see the world through a small flat window. @UnitreeRobotics

181

35,507

Ali Eslami · Jul 13, 2020 · 11:45 AM UTC

Ali Eslami

@arkitus

13 Jul 2020

Slides now available to download: drive.google.com/file/d/1Ee2… Thank you all for attending the morning session. We'll be back online tonight. Tune in for more Q&A.

ICML 2020 Tutorial - Slides.pdf

drive.google.com

Ali Eslami

@arkitus

12 Jul 2020

178

Ali Eslami · Jul 5, 2019 · 7:29 AM UTC

Ali Eslami

@arkitus

5 Jul 2019

Probabilistic U-Nets adapted to produce calibrated uncertainties. Very important for clinical deployment of segmentation networks. Cool work!

Daniel Worrall @danielewworrall

4 Jul 2019

NEW PAPER @miccai2019! "Supervised Uncertainty Quantification for Segmentation with Multiple Annotations". We adapt Prob Unet to output epistemic & CALIBRATED aleatoric uncertainties arxiv.org/abs/1907.01949. Work w. Shi Hu, Stefan Knegt, @BasVeeling, Henkjan Huisman & @wellingmax

178

Ali Eslami · Dec 28, 2018 · 4:42 PM UTC

Ali Eslami

@arkitus

28 Dec 2018

A visual introduction to probability and statistics: seeing-theory.brown.edu/inde… 👌

165

Ali Eslami · Oct 17, 2018 · 11:09 AM UTC

Ali Eslami

@arkitus

17 Oct 2018

If you're interested in interning at DeepMind, the deadline for applications is Oct 29th. You need to be in the last two years of your PhD programme, and be available for 14-20 weeks in 2019. It doesn't matter what country you're based in. Get in touch with me! همین الان!

165

Ali Eslami · Mar 25, 2025 · 5:14 PM UTC

Ali Eslami

@arkitus

25 Mar 2025

Introducing Gemini 2.5 Pro 🌀 which thinks natively and is SOTA across a number of key math, reasoning and science benchmarks

161

9,310

Ali Eslami · Aug 26, 2025 · 2:42 PM UTC

Ali Eslami

@arkitus

26 Aug 2025

🍌🤖💀✨

160

14,021

Ali Eslami · Sep 29, 2018 · 5:08 PM UTC

Ali Eslami

@arkitus

29 Sep 2018

Wow, very impressive samples by an ICLR 2019 submission (I had nothing to do with this paper). Crazy to think how much information there is hidden in a collection of images. Enough to allow a model to generalise this convincingly. Paper: openreview.net/pdf?id=B1xsqj…

157

Ali Eslami · Dec 5, 2018 · 3:19 AM UTC

Ali Eslami

@arkitus

5 Dec 2018

Absolutely mind blowing talk about non-neural computation, a.k.a. 'primitive cognition'. Liquefied brains that retain their memories, two headed worms, salamander tails that turn themselves into legs, and much more. Highly recommended, fascinating watch: bit.ly/2Qgls71

150

Ali Eslami · Aug 15, 2025 · 7:43 AM UTC

Ali Eslami

@arkitus

15 Aug 2025

This figure from the impressive DINOv3 paper is fun to think about. Pretend it's 2018 and you're deciding what research to focus on. Self supervised is <40% and supervised >80%. Would you bet on SSL ever catching up? Some people were believers even then. Have faith!

Max Seitzer @maxseitzer

14 Aug 2025

Introducing DINOv3 🦕🦕🦕 A SotA-enabling vision foundation model, trained with pure self-supervised learning (SSL) at scale. High quality dense features, combining unprecedented semantic and geometric scene understanding. Three reasons why this matters…

143

15,455

Ali Eslami · Sep 12, 2022 · 3:58 PM UTC

Ali Eslami

@arkitus

12 Sep 2022

A comprehensive overview of the Neural Process Family: - What do they have to do with Neural Networks? - What do they have to do with Gaussian Processes? - What does it all have to with Meta Learning? - What advances have been made in the last 4 years? arxiv.org/abs/2209.00517

126

Ali Eslami · Aug 9, 2025 · 7:24 AM UTC

Ali Eslami

@arkitus

9 Aug 2025

Look mum, no NeRF! And from a single reference image. Absolutely gorgeous.

Aleksander Holynski

@holynski_

8 Aug 2025

Another one. Already a powerful painting, but moving around it yourself gives a totally different feeling. Jacques Louis David's "The Death of Socrates" => #Genie3

135

9,737

Ali Eslami · Jul 14, 2020 · 3:46 PM UTC

Ali Eslami

@arkitus

14 Jul 2020

Contrastive Training for Improved Out-of-Distribution Detection arxiv.org/abs/2007.05566 Joint (cross entropy + SimCLR) training gives your network a feature space that is better for OOD detection than cross entropy training alone.

112

Ali Eslami · Aug 6, 2018 · 8:39 AM UTC

Ali Eslami

@arkitus

6 Aug 2018

Jesper Wohlert from Technical University of Denmark has implemented Generative Query Networks in PyTorch. Code available to download. 👌 github.com/wohlert/generativ…

GitHub - wohlert/generative-query-network-pytorch: Generative Query Network (GQN) in PyTorch as...

Generative Query Network (GQN) in PyTorch as described in "Neural Scene Representation and Rendering" - wohlert/generative-query-network-pytorch

github.com

105

Ali Eslami · Dec 19, 2024 · 5:02 PM UTC

Ali Eslami

@arkitus

19 Dec 2024

Lateral thinking: "Wait a minute... could I turn one of the numbers upside down?"

Logan Kilpatrick

@OfficialLoganK

19 Dec 2024

Replying to @OfficialLoganK

It’s still an early version, but check out how the model handles a challenging puzzle involving both visual and textual clues: (2/3)

100

15,132

Ali Eslami · Jan 5, 2025 · 8:03 PM UTC

Ali Eslami

@arkitus

5 Jan 2025

I worked with Felix on a paper in 2021. I remember Felix as a consistently kind person, and as a brilliant thinker. I'm sharing his farewell letter, as I think it's what he would have wanted. Please be mindful, it's not an easy read: docs.google.com/document/d/1… Rest in peace.

On mental health, psychedelics and life

On mental health, psychedelics and life This is a story about mental health, psychedelics, psychology and the mind. It is a story about the joy of family, the joy of friends, the joy of being in...

docs.google.com

Felix Hill @FelixHill84

7 Oct 2024

Do you work in AI? Do you find things uniquely stressful right now, like never before? Haver you ever suffered from a mental illness? Read my personal experience of those challenges here: docs.google.com/document/d/1…

13,440

Ali Eslami · Sep 22, 2021 · 1:50 PM UTC

Ali Eslami

@arkitus

22 Sep 2021

Research scientist internship applications are now open for London. First deadline: Oct 4th. Second window: Dec 6th - Dec 17th. Just do it! حتی شما I expect things like e.g. geography / nationality to be less of a blocker than before. deepmind.com/careers/jobs/25…

Ali Eslami · Nov 12, 2018 · 12:38 PM UTC

Ali Eslami

@arkitus

12 Nov 2018

Differentiable Monte Carlo ray tracing. Very cool. "We interface [the method] with PyTorch and show prototype applications in inverse rendering and the generation of adversarial examples for neural networks." Now we just need to make it faster! people.csail.mit.edu/tzumao/…

Ali Eslami · Jul 13, 2018 · 12:37 PM UTC

Ali Eslami

@arkitus

13 Jul 2018

Josh Tenenbaum on artificial intelligence @icmlconf

Ali Eslami · Sep 5, 2025 · 3:02 PM UTC

Ali Eslami

@arkitus

5 Sep 2025

1 click address2watercolour brought to you by 🍌 the Edinburgh flat i grew up in all those years ago 🥲 going to send this to my parents!

8,374

Ali Eslami · Jun 30, 2018 · 3:54 PM UTC

Ali Eslami

@arkitus

30 Jun 2018

Very understandable yet detailed explanation of GQN 👌

Timothy B. Lee @binarybits

29 Jun 2018

Replying to @binarybits

Mind-blowing stuff from Google's DeepMind. arstechnica.com/science/2018…

Ali Eslami · Jul 5, 2018 · 1:48 PM UTC

Ali Eslami

@arkitus

5 Jul 2018

GQN + time: Instead of predicting what a scene looks like from a new viewpoint, predict what it will look like at a new timestamp. For consistent samples, introduce global latent variable. Very cool work by @ananyaku @DeepSpiker @mpshanahan et al. deepmind.com/documents/227/c…

Ali Eslami · Apr 28, 2022 · 4:07 PM UTC

Ali Eslami

@arkitus

28 Apr 2022

DALL-E 2 and Flamingo are the most impressive AI demos that I've ever seen. I wouldn't have predicted that we'd be here if you'd asked me 2 years ago. Not even in the best case scenario.

Google DeepMind

@GoogleDeepMind

28 Apr 2022

Introducing Flamingo 🦩: a generalist visual language model that can rapidly adapt its behaviour given just a handful of examples. Out of the box, it's also capable of rich visual dialog. Read more: dpmd.ai/dm-flamingo 1/

Ali Eslami · May 19, 2022 · 2:55 PM UTC

Ali Eslami

@arkitus

19 May 2022

2007. Me watching Jobs' iPhone keynote: "Dumbest idea ever. Browsing on the go? No keyboard? Is he high?" 2012. Me watching the AlexNet talk: "Dumbest idea ever. NNs can't do cats vs dogs, why jump to 1000-way classification?" Lesson: try suspension of disbelief once in a while

Ali Eslami · Aug 1, 2024 · 6:16 PM UTC

Ali Eslami

@arkitus

1 Aug 2024

Hundreds of researchers make thousands of discoveries. Most researchers focus only on a specific part of the problem, and yet when all those discoveries are put together, it compounds. Pretty incredible to witness.

Arena.ai

@arena

1 Aug 2024

Exciting News from Chatbot Arena! @GoogleDeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive score of 1300 (!), and also achieving #1 on our Vision Leaderboard. Gemini 1.5 Pro (0801) excels in multi-lingual tasks and delivers robust performance in technical areas like Math, Hard Prompts, and Coding. Huge congrats to @GoogleDeepMind on this remarkable milestone! Gemini (0801) Category Rankings: - Overall: #1 - Math: #1-3 - Instruction-Following: #1-2 - Coding: #3-5 - Hard Prompts (English): #2-5 Come try the model and let us know your feedback! More analysis below👇

18,811

Ali Eslami · Jan 2, 2024 · 3:13 PM UTC

Ali Eslami

@arkitus

2 Jan 2024

predictions for 2024: - vision+language models go continuous and real-time (not just turn-based) - nerfs/splats/etc get strong priors (only 1 image of a complex test scene to get full 3D) - generative video models reach photo-realism - major election scandal powered by an LLM

8,624

Ali Eslami · Oct 3, 2023 · 8:34 AM UTC

Ali Eslami

@arkitus

3 Oct 2023

This is very cool. Impressive action-conditioned, language-conditioned, or un-conditioned rollouts of videos. We worked on this hard at DM back in 2016/2017, but it was very difficult to get working then. Huge progress!

Wayve

@wayve_ai

3 Oct 2023

It’s #GAIA1 world, we just drive in it. Generate realistic driving videos using only prompts. See how it works below 🧵

15,551

Ali Eslami · Dec 9, 2019 · 4:09 PM UTC

Ali Eslami

@arkitus

9 Dec 2019

Exciting updated results for self-supervised representation learning on ImageNet: - 71.5% top-1 with a *linear* classifier - 77.9% top-5 with only *1%* of the labels - 76.6 mAP when transferred to PASCAL VOC-07 (better than *fully-supervised's* 74.7 mAP) arxiv.org/abs/1905.09272

Ali Eslami · Jul 23, 2018 · 5:17 PM UTC

Ali Eslami

@arkitus

23 Jul 2018

Computer vision is far from solved. Nice slides by Thomas Funkhouser precisely describing a few of the open problems, along with supervised learning solutions. Question is, how can we learn all of these capabilities with less/no supervision? cs.princeton.edu/~funk/bridg…

Ali Eslami · May 23, 2022 · 10:11 PM UTC

Ali Eslami

@arkitus

23 May 2022

Mind-blowing. Between this and DALL-E, I genuinely believe that our relationship with the concept of an 'image' is changing, forever. There will now be a period of human history before such models, and a period after. Amazing work @Chitwan_Saharia, @wchan212, @mo_norouzi et al.

@_akhaliq

23 May 2022

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding project page: gweb-research-imagen.appspot… sota FID(7.27 on COCO), without ever training on COCO, human raters find Imagen samples to be on par with the COCO data itself in image-text alignment

Ali Eslami · Dec 3, 2018 · 2:16 AM UTC

Ali Eslami

@arkitus

3 Dec 2018

Very cool. Nice to see generative models used in new and innovative ways... deepmind.com/blog/alphafold/

Ali Eslami · Aug 12, 2018 · 11:13 AM UTC

Ali Eslami

@arkitus

12 Aug 2018

Two reasons why vision is hard: 1. 2D images are always only /projections/ of an underlying 3D reality. 2. Sometimes we're interested in classifying 3D realities that are only subtly different from each other. We're currently better at problem 2 than problem 1.

Ali Eslami · Jul 3, 2018 · 4:27 PM UTC

Ali Eslami

@arkitus

3 Jul 2018

Very impressive to see high-level, multi-player strategy emerging from pure RL on raw pixels. Could this be how we design game AIs of the future? Cool new work by @maxjaderberg et al. deepmind.com/blog/capture-th…

Ali Eslami · Dec 30, 2018 · 6:36 AM UTC

Ali Eslami

@arkitus

30 Dec 2018

"It's true that life is short. The solution isn't to do things quickly, but to do things the way they're meant to be done."

Ali Eslami · Jun 6, 2018 · 9:34 AM UTC

Ali Eslami

@arkitus

6 Jun 2018

Sequential Attend, Infer, Repeat: A generative model of moving objects. arxiv.org/abs/1806.01794 Very cool work by @arkosiorek.

Ali Eslami · Aug 21, 2020 · 3:56 PM UTC

Ali Eslami

@arkitus

21 Aug 2020

Charlie has just released code + colab for PolyGen, a generative model of vertices and faces. Check it out!

Charlie Nash @charlietcnash

21 Aug 2020

We've just released code for PolyGen, our generative model of 3D meshes github: github.com/deepmind/deepmind…

Ali Eslami · Sep 14, 2018 · 3:42 PM UTC

Ali Eslami

@arkitus

14 Sep 2018

Our paper 'A Probabilistic U-Net for Segmentation of Ambiguous Images', led by @saakohl, will be presented at #NIPS2018! Code is available at github.com/SimonKohl/probabi…

GitHub - SimonKohl/probabilistic_unet: A U-Net combined with a variational auto-encoder that is...

A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations. - SimonKohl/probabilistic_unet

github.com

Simon Kohl

@saakohl

14 Sep 2018

Our paper `A Probabilistic U-Net for Segmentation of Ambiguous Images' was accepted at #NIPS2018 as a spotlight presentation! A re-implementation of the code is now available at github.com/SimonKohl/probabi…. Paper arxiv.org/abs/1806.05034 by @DeepMindAI and @mic_dkfz.

Ali Eslami · Jan 5, 2020 · 10:00 AM UTC

Ali Eslami

@arkitus

5 Jan 2020

#IranianCulturalSites

Ali Eslami · Jul 5, 2018 · 1:38 PM UTC

Ali Eslami

@arkitus

5 Jul 2018

GQN + attention: Better likelihoods, faster training, more complex scenes (Minecraft). Given a neural scene representation, can you localise new images? Great work by @danrsm @DeepSpiker @fabiointheuk et al. deepmind.com/documents/229/g…

Ali Eslami · Nov 15, 2021 · 5:36 PM UTC

Ali Eslami

@arkitus

15 Nov 2021

Amazing to see how far we've come in about 10 years. Compare piped.video/watch?v=tk9FTdKO… (~state of the art at CVPR 2012) with the video in the tweet below.

@_akhaliq

11 Nov 2021

Palette: Image-to-Image Diffusion Models abs: arxiv.org/abs/2111.05826 project page: iterative-refinement.github.… a simple and general framework for image-to-image translation using conditional diffusion models

Ali Eslami · Oct 2, 2018 · 8:27 AM UTC

Ali Eslami

@arkitus

2 Oct 2018

If you're ever training a (potentially conditional) VAE and find yourself struggling to keep the KL down, you need to use this. Figure 5 on page 9 is a good start. @DeepSpiker @fabiointheuk

Danilo J. Rezende @DaniloJRezende

2 Oct 2018

Taming VAEs: A theoretical analysis of their properties and behaviour in the high-capacity regime. We also argue for a different way of training these models for robust control of key properties. It was fun thinking about this with @fabiointheuk arxiv.org/abs/1810.00597

Ali Eslami · Oct 14, 2022 · 1:24 PM UTC

Ali Eslami

@arkitus

14 Oct 2022

Surely it's better to self-learn vision from videos than from just images? Surprisingly, that doesn't seem to have been the case... until now. A simple objective (VITO) + a balanced video dataset (VideoNet) shows us a path forwards! @nikparth1, @joaocarreira, @olivierhenaff

Ali Eslami · Jun 16, 2022 · 7:17 AM UTC

Ali Eslami

@arkitus

16 Jun 2022

I'd forgotten how nice it is to talk with real 3D humans in a real 3D room. Grateful to be back at it after 2 long years.

Yonk Shi @iamyonk

15 Jun 2022

Excellent talk on representation learning and neural scene understanding by DeepMind’s Ali Eslami @arkitus at @kth_rpl summer school. #KTHRPLSummerSchool

Ali Eslami · Aug 18, 2022 · 2:59 PM UTC

Ali Eslami

@arkitus

18 Aug 2022

When starting out in a new field: 1. Work on ideas you believe in 2. Work with people you learn from 3. Have fun

Ali Eslami · Jan 29, 2019 · 8:18 AM UTC

Ali Eslami

@arkitus

29 Jan 2019

If you're curious about how/why the aggregation functions of GQNs and NPs work (in particular, the summing aggregation function), see the paper below.

You’re unable to view this Post because this account owner limits who can view their Posts.

Ali Eslami · Jun 29, 2021 · 12:54 PM UTC

Ali Eslami

@arkitus

29 Jun 2021

I would've considered this pure sci-fi not too long ago! We can now teach a model new concepts by showing it a sequence of examples of images and associated text. No param updates required. Concepts: e.g. how to classify, caption or answer Qs Model: pre-trained language model

This Post is from an account that no longer exists.

Ali Eslami · Apr 13, 2022 · 12:42 PM UTC

Ali Eslami

@arkitus

13 Apr 2022

A friend of mine (editor of an influential arts magazine) is looking to hire an intern to apply neural rendering (e.g. NeRF and family) to high-end fashion shoots with professional photography. If this is something you or someone you know might be interested in, DM me 💃🕺

Ali Eslami · Jul 12, 2022 · 7:30 PM UTC

Ali Eslami

@arkitus

12 Jul 2022

Both released today. Left: A @NASAWebb image of Stephan's Quintet, five tightly-bound galaxies 290 million light-years away. Right: @clured's visualisation of 10 million chunks from the data used to train @BigscienceW and @huggingface's BLOOM model. Encoded then UMAPed to 2D.

Ali Eslami · Nov 18, 2020 · 10:50 AM UTC

Ali Eslami

@arkitus

18 Nov 2020

Mason McGough @MasonMMcGough has written a nice piece on generating 3D models with PolyGen and PyTorch. Check it out! towardsdatascience.com/gener…

Ali Eslami · May 9, 2019 · 11:28 AM UTC

Ali Eslami

@arkitus

9 May 2019

If you're interested in state-of-the-art machine learning research AND in positive real-world impact, consider joining the health research team in London. I've been collaborating with this team for a while and they are truly awesome 👌

Alan Karthikesalingam @alan_karthi

9 May 2019

Interested in state-of-art, clinically-applicable deep learning research & positive real-world impact? We are growing our health research team in London, with EHR & Imaging roles for talented deep learning research scientists & engineers- Get in touch if this sounds like you :)

Ali Eslami · Jun 17, 2019 · 5:25 PM UTC

Ali Eslami

@arkitus

17 Jun 2019

Bayesian optimisation is an efficient strategy for optimization of black-box functions without derivatives. Here we show how Neural Processes can be used for this: arxiv.org/abs/1903.11907 With: @schwarzjn_, @agalashov, @hyunjik11, Marta Garnelo, @dwsaxton, @pushmeet, @yeewhye

Google DeepMind

@GoogleDeepMind

17 Jun 2019

Interested in adversarial tests and reinforcement learning? We combine meta-learning in a general probabilistic paradigm to detect failures, helping us build robust algorithms. Includes results on recommender systems and control: arxiv.org/abs/1903.11907 @schwarzjn_ @agalashov

Ali Eslami · Jul 23, 2022 · 2:28 PM UTC

Ali Eslami

@arkitus

23 Jul 2022

As someone who largely ignored biology in school and at uni, this resonated with me deeply. I'm only now beginning to learn about biology and chemistry and it's all so incredibly beautiful. jsomers.net/i-should-have-lo…

Ali Eslami · Jul 11, 2018 · 8:27 AM UTC

Ali Eslami

@arkitus

11 Jul 2018

Nice overview of the "predictive coding" theory of the brain, and how GQN relates to it. By Jordana Cepelewicz @QuantaMagazine quantamagazine.org/to-make-s…

Ali Eslami · Aug 14, 2022 · 4:49 PM UTC

Ali Eslami

@arkitus

14 Aug 2022

A striking vision expressed by a single artist working with a powerful tool. An idea that would have likely taken a team of people to produce previously, and at much greater cost.

Xander Steenbrugge

@xsteenbrugge

13 Aug 2022

"Voyage through Time" is my first artpiece using #stablediffusion and I am blown away with the possibilities... We're crossing a threshold where generative AI is no longer just about novel aesthetics, but evolving into an amazing tool to build powerful, human-centered narratives

Ali Eslami · Apr 10, 2024 · 8:19 AM UTC

Ali Eslami

@arkitus

10 Apr 2024

love the authors list on the CodeGemma paper: storage.googleapis.com/deepm…

2,832

Ali Eslami · Mar 3, 2022 · 1:26 PM UTC

Ali Eslami

@arkitus

3 Mar 2022

If you're interested in AI art or AI+human art, consider submitting an abstract to therai.org.uk/conferences/an…. @elluba, @MirowskiPiotr, @chrisantha_f, @korymath and I are organising a panel for the "Visions of the future of human-machine creative symbiosis" conf on 6-10 June 2022.

Anthropology, AI and the Future of Human Society - Royal Anthropological Institute

Anthropology, AI and the Future of Human Society Virtual Conference 6 -10 June 2022 Anthropology, AI and the Future of Human Society. AI has come to represent multiple causal drivers of change:...

therai.org.uk

Ali Eslami · Oct 3, 2019 · 2:09 PM UTC

Ali Eslami

@arkitus

3 Oct 2019

When sufficiently constrained, agents learn to paint surprisingly abstract images. Some of the paintings remind me of cubist portraits. (Remember: no imitation or supervision). Can you spot any familiar faces? See learning-to-paint.github.io for loads more emergent drawing styles.

Ali Eslami · Sep 11, 2018 · 3:27 PM UTC

Ali Eslami

@arkitus

11 Sep 2018

Excellent tutorial on generative models.

Danilo J. Rezende @DaniloJRezende

11 Sep 2018

The slides of our CCN2018 tutorial can now be found here: tinyurl.com/ydyzvkbd

Ali Eslami · May 5, 2022 · 4:25 PM UTC

Ali Eslami

@arkitus

5 May 2022

Of course it does NOT follow that all neural network tinkerers are therefore great philosophers 🥴

Ali Eslami · Jun 29, 2021 · 3:25 PM UTC

Ali Eslami

@arkitus

29 Jun 2021

Interested in generative models, 3D computer vision or inverse graphics? We use ideas and techniques from these fields to show the possibility of imaging very small objects (e.g. proteins) more effectively. In this setting we cannot fall back to supervised learning!

Olaf Ronneberger @ORonneberger

29 Jun 2021

Proteins are not static bricks! Feasibility study to infer a continuous distribution of all states using an end-to-end model from Cryo-EM images to atom coordinates: arxiv.org/abs/2106.14108. @danrsm, @GarneloMarta, @MichaelZielins, @JonasAAdler, @arkitus, @CarlDoersch, @pushmeet

Ali Eslami · Jul 12, 2022 · 11:32 AM UTC

Ali Eslami

@arkitus

12 Jul 2022

"Persian" x "Robot" #dalle

Ali Eslami · Jun 25, 2019 · 1:53 PM UTC

Ali Eslami

@arkitus

25 Jun 2019

If you've been wondering how Generative Query Networks should be trained in the absence of camera positions (e.g. in a SLAM setting), this paper offers a possible solution. Very cool work!

Danilo J. Rezende @DaniloJRezende

25 Jun 2019

Happy share our work: Shaping Belief States with Generative Environment Models for RL Thanks Karol Gregor, Frederic Besse, Yan Wu, Hamza Merzic and @avdnoord ! arxiv.org/abs/1906.09237v2 piped.video/dOnvAp_wxv0 #RL #SelfSupervised #GenerativeWorldModels #BeliefStates

Ali Eslami · Jul 5, 2018 · 1:52 PM UTC

Ali Eslami

@arkitus

5 Jul 2018

Neural Processes (NPs) generalise GQN’s training regime to other few-shot prediction tasks arxiv.org/abs/1807.01622, arxiv.org/abs/1807.01613. Awesome work by Marta Garnelo who will be presenting at ICML: bit.ly/2MQ8wi3 , tinyurl.com/npaticml

Ali Eslami · Mar 26, 2025 · 2:25 PM UTC

Ali Eslami

@arkitus

26 Mar 2025

Yesterday's dreams are today's reality. Incredible stuff.

Wayve

@wayve_ai

26 Mar 2025

Introducing GAIA-2 🌎Generative world modeling just stepped up a gear. GAIA-2 is the latest development of Wayve’s video-generative world model tailored for driving. GAIA-2 offers richer, more realistic, and highly controllable synthetic driving scenarios, accelerating Wayve’s path to safe driver assistance and automated driving at scale. Learn more about GAIA-2 in our Blog: wayve.ai/thinking/gaia-2/ #GAIA2 #GAIA #EmbodiedAI

1,618

Ali Eslami · Nov 28, 2018 · 6:27 PM UTC

Ali Eslami

@arkitus

28 Nov 2018

For those of you attending NeurIPS, be sure to check out Olaf's talk at the medical imaging workshop. He'll be speaking about state-of-the-art machine learning for medicine, including our work on using probabilistic models to allow such systems to express uncertainty.🤔💉👌

Olaf Ronneberger @ORonneberger

28 Nov 2018

Looking forward to the Medical Imaging meets NeurIPS Workshop next week Saturday (Dec, 8th). At 9:45am I'll present our work on radiotherapy planning (arxiv.org/abs/1809.04430), triaging eye diseases (nature.com/articles/s41591-0…) and the probabilistic u-net (arxiv.org/abs/1806.05034)

Ali Eslami · Jan 29, 2025 · 4:16 PM UTC

Ali Eslami

@arkitus

29 Jan 2025

Reasoning training is getting AIs to be patient with their thoughts: “Don’t rush”. Agent training is getting AIs to be persistent with their actions: “Don’t give up”.

3,110

Ali Eslami · Aug 10, 2018 · 3:20 PM UTC

Ali Eslami

@arkitus

10 Aug 2018

Excellent walk through of Neural Processes by @kasparmartens. Marta Garnelo, Dan Rosenbaum, @DeepSpiker, @schwarzjn_, @yeewhye and others.

Kaspar Märtens 💙💛@kasparmartens

10 Aug 2018

Neural Processes - what they are and how they behave as distributions over functions. This blog post is my attempt to answer these questions: kasparmartens.rbind.io/post/…

Ali Eslami · Jul 25, 2018 · 3:56 PM UTC

Ali Eslami

@arkitus

25 Jul 2018

Abstraction vs realism. With generative models, which use cases require one more than the other?

Ali Eslami · Dec 5, 2022 · 4:33 PM UTC

Ali Eslami

@arkitus

5 Dec 2022

- Earth isn't at the centre of the universe. - Humans and animals share a huge chunk of their DNA. - And it's increasingly looking likely that brains aren't the only intelligent things around. We're not as special as we think.

Ali Eslami · May 23, 2019 · 8:36 AM UTC

Ali Eslami

@arkitus

23 May 2019

Excellent overview of why this topic is important: towardsdatascience.com/the-q…

Ali Eslami · May 5, 2021 · 3:26 PM UTC

Ali Eslami

@arkitus

5 May 2021

Painter = directed search e.g. neuroevolution Critic = img+txt encoder e.g. ALIGN or CLIP Artist = human that sets the txt input to the critic arxiv.org/abs/2105.00162 Creative work by @chrisantha_f with Jean-Baptiste Alayrac, @MirowskiPiotr, Dylan Banarse, @sindero Thread👇

Ali Eslami · Mar 23, 2018 · 4:42 PM UTC

Ali Eslami

@arkitus

23 Mar 2018

New work with colleagues from @DeepMindAI: Kickstarting Deep Reinforcement Learning, proposes a paradigm where 'teacher' agents help train 'student' agents. Benefits include faster research cycles and students that can surpass their teachers: arxiv.org/abs/1803.03835

Ali Eslami · Sep 22, 2021 · 1:50 PM UTC

Ali Eslami

@arkitus

22 Sep 2021

My suggestion: 1. Fill out the form. 2. Send an email with CV attached to 5 researchers at DM you'd like to work with (senior or junior), indicating research interests, mentioning you've already submitted the form. Some researchers won't / can't reply. 3. Resume life as normal.

Ali Eslami · Jan 15, 2024 · 8:00 PM UTC

Ali Eslami

@arkitus

15 Jan 2024

someone needs to make a modern documentary about intelligence for the general public not on AI but intelligence itself we have many great docus on the wonder of the cosmos and not just on spaceships too much emphasis on the artifact and not enough on the phenomenon imo

2,134

Ali Eslami · Jan 3, 2019 · 11:40 AM UTC

Ali Eslami

@arkitus

3 Jan 2019

First high-resolution image of Ultima Thule. The object is miniscule, only 19 km long, but it's over 6 billion km from earth. Read this nitter.app/Alex_Parker/status/107… for a fascinating sneak peak into what it takes to do this kind of research.

Ali Eslami · Aug 4, 2025 · 11:06 AM UTC

Ali Eslami

@arkitus

4 Aug 2025

This is just the start. Coming up: better planning, more features, and more advanced agentic setups. There will be rough edges, but the team is shipping fast. Working on the rollout to more countries and languages. Any feedback let me know!

1,786

Ali Eslami · Jun 12, 2022 · 1:06 PM UTC

Ali Eslami

@arkitus

12 Jun 2022

Start with 2 copies of an LLM. Every day, for each LLM: 1. Ask what it would like to read that day (eg selections of news or new books) 2. Feed it the content 3. Ask for learnings and conclusions and save to disk 4. Fine tune it on all its learnings from its first day until now

Ali Eslami · Jul 11, 2018 · 3:52 PM UTC

Ali Eslami

@arkitus

11 Jul 2018

Come see our poster on Neural Processes at @icmlconf: Hall B poster 130. Marta Garnelo woop woop!

Ali Eslami · Aug 5, 2025 · 3:22 PM UTC

Ali Eslami

@arkitus

5 Aug 2025

Replying to @arkitus @GarneloMarta @DaniloJRezende

It's clear to me that this can 'just' be scaled up now. The simulations will look just as real as the best movies or video or image models, but also feel as interactive as the best video games. It will fundamentally change how we think about simulations and entertainment.

1,900

Ali Eslami · Jan 27, 2019 · 4:47 PM UTC

Ali Eslami

@arkitus

27 Jan 2019

Timely and important paper by @dbalduzzi, Marta Garnelo, @maxjaderberg and others on how you should train agents when there is no single winning strategy, e.g. in StarCraft. Thread below has a good summary.

dbalduzzi @dbalduzzi

27 Jan 2019

Excited to share some new work on learning in games: arxiv.org/abs/1901.08106. The paper is about formulating useful objectives in nontransitive games (e.g. poker or StarCraft), which turns out to be a surprisingly subtle problem.

Ali Eslami · Dec 13, 2023 · 3:51 PM UTC

Ali Eslami

@arkitus

13 Dec 2023

Spend a tiny bit of compute to decide if each datapoint should be trained on or not. Because of all the datapoints you SKIP, training is dramatically more efficient overall.

Olivier Hénaff

@olivierhenaff

13 Dec 2023

So excited to announce what we've been working on for the past ~year or so: Active Learning Accelerates Large-Scale Visual Understanding We show that model-based data selection efficiently and effectively speeds up classification- and multimodal pretraining by up to 50%

3,382