Nenad Tomasev · Jun 24, 2026 · 2:33 PM UTC

Nenad Tomasev

Pinned Tweet

Nenad Tomasev

@weballergy

Jun 24

Excited to share this podcast - where we got to talk about the current trajectory of AI agent research, and open challenges in enabling safe and reliable multi-agent coordination at scale.

Google DeepMind

@GoogleDeepMind

Jun 24

What happens when millions of AI agents start negotiating, transacting, and delegating to one another? @weballergy joined our podcast with @fryrsquared to explore the rise of agentic economies – and how we can diversify agent decision-making to avoid AI groupthink. Timecodes: 00:00 Intro 1:07 Defining AI agents 4:44 Agentic exploration in science and research 15:46 Delegation between agents 22:46 Agentic security and traps 29:31 Building an agentic economy 33:22 Cognitive monoculture 36:29 Distributed intelligence

3,118

Nenad Tomasev · Nov 18, 2021 · 10:33 AM UTC

Nenad Tomasev

@weballergy

18 Nov 2021

Deep learning models are often perceived as black boxes. In our most recent work, Acquisition of Chess Knowledge in AlphaZero arxiv.org/abs/2111.09259 , we try to unpack how AlphaZero represents knowledge, where it resides within the network, and when it is acquired in training

426

Nenad Tomasev · Oct 28, 2024 · 8:35 PM UTC

Nenad Tomasev

@weballergy

28 Oct 2024

I'm happy to share that I got promoted to the role of Senior Staff Research Scientist here at Google DeepMind. It's been an incredibly exciting year, though the truly exciting work, as always, lies ahead.

355

31,357

Nenad Tomasev · Dec 5, 2024 · 7:41 AM UTC

Nenad Tomasev

@weballergy

5 Dec 2024

I'm excited to share a new paper: "Mastering Board Games by External and Internal Planning with Language Models" storage.googleapis.com/deepm… (also soon to be up on Arxiv, once it's been processed there)

344

152,291

Nenad Tomasev · Jun 1, 2017 · 6:45 AM UTC

Nenad Tomasev

@weballergy

1 Jun 2017

'Adversarial Generation of Natural Language': producing realistic sentences arxiv.org/abs/1705.10929 #deeplearning #machinelearning #NLP #AI

132

253

Nenad Tomasev · Aug 3, 2017 · 8:40 AM UTC

Nenad Tomasev

@weballergy

3 Aug 2017

DeepMoji: Predicting emojis for classifying text sentiment/emotion/sarcasm arxiv.org/abs/1708.00524 #NLP #deeplearning #AI

125

247

Nenad Tomasev · Jun 6, 2018 · 7:59 AM UTC

Nenad Tomasev

@weballergy

6 Jun 2018

'Relational recurrent neural networks': performing complex relational reasoning in memory networks. arxiv.org/abs/1806.01822 #DeepLearning #AI #MachineLearning

224

Nenad Tomasev · May 31, 2018 · 8:33 AM UTC

Nenad Tomasev

@weballergy

31 May 2018

"To Trust Or Not To Trust A Classifier" by Google Research arxiv.org/abs/1805.11783 : beyond simple confidence scores. The ability to auto-detect bad predictions in critical for safe deployments in sensitive applications. #MachineLearning #DataScience #AI

To Trust Or Not To Trust A Classifier

Knowing when a classifier's prediction can be trusted is useful in many applications and critical for safely using AI. While the bulk of the effort in machine learning research has been towards...

arxiv.org

214

Nenad Tomasev · Jul 31, 2019 · 5:07 PM UTC

Nenad Tomasev

@weballergy

31 Jul 2019

Proud to share the results of our work on applying deep learning for early prediction of future acute kidney injury from electronic health records in our collaboration with the US Department of Veterans Affairs - just published in Nature: nature.com/articles/s41586-0…

191

Nenad Tomasev · Sep 4, 2017 · 6:54 AM UTC

Nenad Tomasev

@weballergy

4 Sep 2017

'Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning' arxiv.org/abs/1709.00103 #MachineLearning

179

Nenad Tomasev · Jun 14, 2018 · 9:38 AM UTC

Nenad Tomasev

@weballergy

14 Jun 2018

'A Probabilistic U-net for Segmentation of Ambiguous Images': a cool new paper by my colleagues at DeepMind on how to deal with uncertainty in segmentation models. arxiv.org/abs/1806.05034 #DeepLearning #MachineLearning

167

Nenad Tomasev · Apr 6, 2018 · 8:03 AM UTC

Nenad Tomasev

@weballergy

6 Apr 2018

'Hyperbolic Entailment Cones for Learning Hierarchical Embeddings': viewing hierarchical relations as partial orders based on a family of nested geodesically convex cones arxiv.org/abs/1804.01882 #AI #MachineLearning

151

Nenad Tomasev · Oct 4, 2017 · 7:07 AM UTC

Nenad Tomasev

@weballergy

4 Oct 2017

'Dilated Convolutions for Modeling Long-Distance Genomic Dependencies' arxiv.org/abs/1710.01278 #DeepLearning #Genomics #AI

144

Nenad Tomasev · Jul 24, 2017 · 7:29 AM UTC

Nenad Tomasev

@weballergy

24 Jul 2017

'A Distributional Perspective on Reinforcement Learning': modeling the full distribution of return. arxiv.org/abs/1707.06887 #machinelearning

143

Nenad Tomasev · Dec 1, 2021 · 4:28 PM UTC

Nenad Tomasev

@weballergy

1 Dec 2021

Happy to announce our new paper "Advancing mathematics by guiding human intuition with AI" nature.com/articles/s41586-0… that was just published in Nature, and one of the associated maths papers "The signature and cusp geometry of hyperbolic knots" arxiv.org/abs/2111.15323

Advancing mathematics by guiding human intuition with AI

Nature - A framework through which machine learning can guide mathematicians in discovering new conjectures and theorems is presented and shown to yield mathematical insight on important open...

nature.com

126

Nenad Tomasev · Aug 23, 2017 · 6:58 AM UTC

Nenad Tomasev

@weballergy

23 Aug 2017

'Twin Networks: Using the Future as a Regularizer': arxiv.org/abs/1708.06742 #DeepLearning #MachineLearning #AI

126

Nenad Tomasev · Jul 11, 2017 · 7:36 AM UTC

Nenad Tomasev

@weballergy

11 Jul 2017

'MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network' arxiv.org/abs/1707.02485 #deeplearning #AI #medicine

131

Nenad Tomasev · Jul 7, 2017 · 7:13 AM UTC

Nenad Tomasev

@weballergy

7 Jul 2017

'Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks' arxiv.org/abs/1707.01836 #deeplearning #AI #machinelearning

118

Nenad Tomasev · Aug 24, 2017 · 6:31 AM UTC

Nenad Tomasev

@weballergy

24 Aug 2017

'Super Convergence: Very Fast Training of Residual Networks Using Large Learning Rates' arxiv.org/abs/1708.07120 #deeplearning #AI

111

Nenad Tomasev · Jan 3, 2018 · 9:44 AM UTC

Nenad Tomasev

@weballergy

3 Jan 2018

'Character-level Recurrent Neural Networks in Practice: Comparing Training and Sampling Schemes' arxiv.org/abs/1801.00632 #DeepLearning #MachineLearning #AI

108

Nenad Tomasev · Feb 28, 2017 · 8:11 AM UTC

Nenad Tomasev

@weballergy

28 Feb 2017

'Boundary-Seeking Generative Adversarial Networks': Nice new results by Yoshua Bengio's group. arxiv.org/abs/1702.08431 #deeplearning #AI

Boundary-Seeking Generative Adversarial Networks

Generative adversarial networks (GANs) are a learning framework that rely on training a discriminator to estimate a measure of difference between a target and generated distributions. GANs, as...

arxiv.org

111

Nenad Tomasev · May 31, 2017 · 7:12 AM UTC

Nenad Tomasev

@weballergy

31 May 2017

'Deep Learning is Robust to Massive Label Noise': encouraging results. arxiv.org/abs/1705.10694 #deeplearning #machinelearning #AI

101

Nenad Tomasev · Sep 26, 2017 · 8:21 AM UTC

Nenad Tomasev

@weballergy

26 Sep 2017

'The Consciousness Prior': a short write-up by Yoshua Bengio arxiv.org/abs/1709.08568 Thought-provoking, but no experiments given. #AI

Nenad Tomasev · Aug 4, 2017 · 7:51 AM UTC

Nenad Tomasev

@weballergy

4 Aug 2017

'Unsupervised Representation Learning by Sorting Sequences': learning promising visual representations arxiv.org/abs/1708.01246 #deeplearning

Nenad Tomasev · Feb 9, 2021 · 8:49 AM UTC

Nenad Tomasev

@weballergy

9 Feb 2021

Fairness for Unobserved Characteristics: Insights from Technological Impacts on Queer Communities arxiv.org/abs/2102.04257 : highlighting some of the key challenges in ensuring algorithmic fairness for queer communities @jackayline @empiricallykev @Shakir_za

Nenad Tomasev · Oct 17, 2017 · 6:52 AM UTC

Nenad Tomasev

@weballergy

17 Oct 2017

'A systematic study of the class imbalance problem in convolutional neural networks': advocating for oversampling arxiv.org/abs/1710.05381

A systematic study of the class imbalance problem in convolutional...

In this study, we systematically investigate the impact of class imbalance on classification performance of convolutional neural networks (CNNs) and compare frequently used methods to address the...

arxiv.org

Nenad Tomasev · Jan 19, 2018 · 8:45 AM UTC

Nenad Tomasev

@weballergy

19 Jan 2018

'Sparsely Connected Convolutional Networks' arxiv.org/abs/1801.05895 : Introducing SparseNets, a child of DenseNets and ResNets. The authors claim that the method can outperform these with fewer parameters. #DeepLearning #MachineLearning #ComputerVision #AI

Nenad Tomasev · Oct 9, 2017 · 7:40 AM UTC

Nenad Tomasev

@weballergy

9 Oct 2017

'Dilated Recurrent Neural Networks': dilated recurrent skip connections. arxiv.org/abs/1710.02224 #DeepLearning #MachineLearning #AI

Nenad Tomasev · May 24, 2017 · 6:30 AM UTC

Nenad Tomasev

@weballergy

24 May 2017

'Visualizing LSTM Decisions': on interpretability in recurrent neural networks. arxiv.org/abs/1705.08153 #deeplearning #machinelearning #AI

Nenad Tomasev · Apr 17, 2017 · 6:14 AM UTC

Nenad Tomasev

@weballergy

17 Apr 2017

'Stochastic Gradient Descent as Approximate Bayesian Inference': arxiv.org/abs/1704.04289 #machinelearning

Nenad Tomasev · Jan 15, 2018 · 9:20 AM UTC

Nenad Tomasev

@weballergy

15 Jan 2018

'Deep Episodic Memory: Encoding, Recalling, and Predicting Episodic Experiences for Robot Action Execution' arxiv.org/abs/1801.04134 #DeepLearning #Robotics #AI

Nenad Tomasev · Jun 22, 2017 · 6:58 AM UTC

Nenad Tomasev

@weballergy

22 Jun 2017

'Grounded Language Learning in a Simulated 3D World': rewards for following written instructions. arxiv.org/abs/1706.06551 #AI #deeplearning

Nenad Tomasev · Mar 1, 2017 · 8:21 AM UTC

Nenad Tomasev

@weballergy

1 Mar 2017

'Billion-scale similarity search with GPUs': kNN search that is 8.5x faster than prior state-of-the-art. (by FAIR) arxiv.org/abs/1702.08734

Billion-scale similarity search with GPUs

Similarity search finds application in specialized database systems handling complex data such as images or videos, which are typically represented by high-dimensional features and require...

arxiv.org

Nenad Tomasev · Jul 3, 2017 · 6:58 AM UTC

Nenad Tomasev

@weballergy

3 Jul 2017

'Noisy Networks for Exploration': on learning parametric noise for exploration in RL agents. arxiv.org/abs/1706.10295 #deeplearning #AI

Nenad Tomasev · Aug 1, 2022 · 9:17 AM UTC

Nenad Tomasev

@weballergy

1 Aug 2022

Happy to announce our upcoming NeurIPS 2022 workshop on "A Participatory Approach to AI for Mental Health" sites.google.com/view/pai4mh… , hoping to bring experts and communities together to jointly shape the vision for how tech can help with wellbeing and mental health

Call for Papers

Important Dates Abstract Registration deadline: September 25th, 2022, (23:59 AoE) Submission deadline: September 29th, 2022, (23:59 AoE) Paper acceptance notification: October 20th, 2022, (23:59...

sites.google.com

Nenad Tomasev · Nov 22, 2017 · 11:25 AM UTC

Nenad Tomasev

@weballergy

22 Nov 2017

'Non-local Neural Networks' by CMU, FAIR arxiv.org/abs/1711.07971 #DeepLearning #MachineLearning #AI

Nenad Tomasev · Jul 20, 2017 · 6:39 AM UTC

Nenad Tomasev

@weballergy

20 Jul 2017

'The Devil is in the Decoder': the choice of decoder matters in pixel-wise prediction tasks arxiv.org/abs/1707.05847 #deeplearning #AI

Nenad Tomasev · Oct 18, 2017 · 8:16 AM UTC

Nenad Tomasev

@weballergy

18 Oct 2017

'Swish: a Self-Gated Activation Function': a replacement for RELU-s? By Google Brain. arxiv.org/abs/1710.05941 #DeepLearning #MachineLearning

Nenad Tomasev · Feb 14, 2018 · 9:24 AM UTC

Nenad Tomasev

@weballergy

14 Feb 2018

'Learning to Search with MCTSnets' arxiv.org/abs/1802.04697 : learning where, what and how to search. #MachineLearning #AI

Nenad Tomasev · Nov 30, 2023 · 11:40 AM UTC

Nenad Tomasev

@weballergy

30 Nov 2023

I am happy to advertise a position for a student researcher under the Google DeepMind program deepmind.google/about/career… , for a project with @TZahavy , myself, and others, at the intersection of generative modelling, creativity, and planning. Strong programming skills required.

Careers at Google DeepMind

Collaborate with leading thinkers at Google DeepMind. Build AI that benefits humanity.

deepmind.google

14,437

Nenad Tomasev · Jun 21, 2017 · 7:18 AM UTC

Nenad Tomasev

@weballergy

21 Jun 2017

'Programmable Agents': a new paper by colleagues from DeepMind arxiv.org/abs/1706.06383 #deeplearning #AI

Nenad Tomasev · Oct 25, 2017 · 8:42 AM UTC

Nenad Tomasev

@weballergy

25 Oct 2017

'Auto-Differentiating Linear Algebra' arxiv.org/abs/1710.08717 #MachineLearning

Nenad Tomasev · Jan 10, 2018 · 9:21 AM UTC

Nenad Tomasev

@weballergy

10 Jan 2018

'Adversarial Spheres' by Google Brain: towards a better understanding of adversarial examples in #DeepLearning arxiv.org/abs/1801.02774 "the vulnerability of neural networks to small adversarial perturbations is a logical consequence of the amount of test error observed" #AI

Nenad Tomasev · May 18, 2020 · 10:18 AM UTC

Nenad Tomasev

@weballergy

18 May 2020

Applied AI solutions require setting up lasting interdisciplinary partnerships with domain experts, and here we share some thoughts and guidelines on forming these collaborations. "AI for social good: unlocking the opportunity for positive impact" nature.com/articles/s41467-0…

Nenad Tomasev · Oct 23, 2017 · 8:33 AM UTC

Nenad Tomasev

@weballergy

23 Oct 2017

'First-order Methods Almost Always Avoid Saddle Points' arxiv.org/abs/1710.07406 Can't argue with gradient descent. #DeepLearning #AI

First-order Methods Almost Always Avoid Saddle Points

We establish that first-order methods avoid saddle points for almost all initializations. Our results apply to a wide variety of first-order methods, including gradient descent, block coordinate...

arxiv.org

Nenad Tomasev · Apr 3, 2017 · 6:30 AM UTC

Nenad Tomasev

@weballergy

3 Apr 2017

'Factorization tricks for LSTM networks': speeds ups and fewer parameters. arxiv.org/abs/1703.10722 #deeplearning #machinelearning

Factorization tricks for LSTM networks

We present two simple ways of reducing the number of parameters and accelerating the training of large Long Short-Term Memory (LSTM) networks: the first one is "matrix factorization by design" of...

arxiv.org

Nenad Tomasev · Mar 29, 2017 · 7:15 AM UTC

Nenad Tomasev

@weballergy

29 Mar 2017

'Adversarial Transformation Networks: Learning to Generate Adversarial Examples' by Google Research arxiv.org/abs/1703.09387 #deeplearning

Nenad Tomasev · Mar 10, 2017 · 8:09 AM UTC

Nenad Tomasev

@weballergy

10 Mar 2017

'A Structured Self-attentive Sentence Embedding' (by Yoshua Bengio et al.): 2D matrix embedding + self-attention arxiv.org/abs/1703.03130 #AI

A Structured Self-attentive Sentence Embedding

This paper proposes a new model for extracting an interpretable sentence embedding by introducing self-attention. Instead of using a vector, we use a 2-D matrix to represent the embedding, with...

arxiv.org

Nenad Tomasev · Jan 4, 2018 · 8:58 AM UTC

Nenad Tomasev

@weballergy

4 Jan 2018

'Panoptic Segmentation': proposing a new hybrid between instance and semantic segmentation tasks. arxiv.org/abs/1801.00868 #DeepLearning #ComputerVision #MachineLearning #AI

Nenad Tomasev · Nov 30, 2017 · 10:48 AM UTC

Nenad Tomasev

@weballergy

30 Nov 2017

'Do Convolutional Neural Networks Act as Compositional Nearest Neighbors?'; Experiments on semantic segmentation and image-to-image translation. arxiv.org/abs/1711.10683 #MachineLearning #DeepLearning #AI

Patch Correspondences for Interpreting Pixel-level CNNs

We present compositional nearest neighbors (CompNN), a simple approach to visually interpreting distributed representations learned by a convolutional neural network (CNN) for pixel-level tasks...

arxiv.org

Nenad Tomasev · Sep 7, 2017 · 6:57 AM UTC

Nenad Tomasev

@weballergy

7 Sep 2017

'Polar Transformer Networks': invariance to translation, and equivariance to rotation and scale. arxiv.org/abs/1709.01889 #DeepLearning #AI

Nenad Tomasev · May 6, 2021 · 10:10 AM UTC

Nenad Tomasev

@weballergy

6 May 2021

Today we released a detailed protocol of the process that we used to develop acute kidney injury (AKI) risk prediction models: "Use of deep learning to develop continuous-risk models for adverse event prediction from electronic health records" rdcu.be/cj1vf

Nenad Tomasev · Jun 4, 2018 · 8:45 AM UTC

Nenad Tomasev

@weballergy

4 Jun 2018

'Opportunities in Machine Learning for Healthcare' arxiv.org/abs/1806.00388 : many open challenges and a huge potential for impact. #MachineLearning #Health #AI

Nenad Tomasev · May 30, 2017 · 8:30 AM UTC

Nenad Tomasev

@weballergy

30 May 2017

'Contextual Explanation Networks': interpretability as regularization. arxiv.org/abs/1705.10301 #deeplearning #machinelearning #AI

Contextual Explanation Networks

Modern learning algorithms excel at producing accurate but complex models of the data. However, deploying such models in the real-world requires extra care: we must ensure their reliability,...

arxiv.org

Nenad Tomasev · May 31, 2017 · 7:01 AM UTC

Nenad Tomasev

@weballergy

31 May 2017

'Neural Embeddings of Graphs in Hyperbolic Space' arxiv.org/abs/1705.10359 : worth comparing with: arxiv.org/abs/1705.08039 - similar ideas.

Nenad Tomasev · Nov 27, 2017 · 9:01 AM UTC

Nenad Tomasev

@weballergy

27 Nov 2017

'Causal Generative Neural Networks' arxiv.org/abs/1711.08936 : 'Unlike previous approaches, the generative networks used in CGNN allow non-additive noise terms to model flexible conditional distributions' #DeepLearning #MachineLearning #AI

Nenad Tomasev · Jul 29, 2023 · 8:49 PM UTC

Nenad Tomasev

@weballergy

29 Jul 2023

Really enjoying the AI+HCI workshop at @icmlconf - great talks and important topics to reflect on as a field. We need to be thinking much more about complex interactions and wider unintended consequences of AI deployment, and invest more in sociotechnical research.

12,259

Nenad Tomasev · Jul 31, 2017 · 7:10 AM UTC

Nenad Tomasev

@weballergy

31 Jul 2017

'Recurrent Ladder Networks': a recurrent extension of the Ladder network arxiv.org/abs/1707.09219 #deeplearning #machinelearning #AI

Nenad Tomasev · Oct 30, 2017 · 12:56 PM UTC

Nenad Tomasev

@weballergy

30 Oct 2017

'Beyond Finite Layer Neural Networks: Bridging Deep Architectures and Numerical Differential Equations' arxiv.org/abs/1710.10121 #AI

Nenad Tomasev · Aug 22, 2017 · 7:02 AM UTC

Nenad Tomasev

@weballergy

22 Aug 2017

'Neural Block Sampling': a neural approach to automate MC proposal construction. arxiv.org/abs/1708.06040 #machinelearning #statistics

Nenad Tomasev · Jul 18, 2023 · 9:12 PM UTC

Nenad Tomasev

@weballergy

18 Jul 2023

Happy to share that our paper "Detecting shortcut learning for fair medical AI using shortcut testing" has just been published in Nature Communications. We validate our method on clinical ML tasks in radiology and dermatology. nature.com/articles/s41467-0…

Detecting shortcut learning for fair medical AI using shortcut testing

Nature Communications - Diagnosing shortcut learning in clinical models is difficult, as sensitive attributes may be causally linked with disease. Using multitask learning, the authors propose a...

nature.com

4,792

Nenad Tomasev · Jul 4, 2017 · 8:30 AM UTC

Nenad Tomasev

@weballergy

4 Jul 2017

'Variance Regularizing Adversarial Learning': an interesting read. arxiv.org/abs/1707.00309 #deeplearning #machinelearning #AI

Nenad Tomasev · Jan 17, 2018 · 8:24 AM UTC

Nenad Tomasev

@weballergy

17 Jan 2018

'Time Series Segmentation through Automatic Feature Learning' arxiv.org/abs/1801.05394 : Using #DeepLearning to detect abrupt changes in trends in time series data. #MachineLearning #AI

Nenad Tomasev · Aug 13, 2018 · 4:30 PM UTC

Nenad Tomasev

@weballergy

13 Aug 2018

Proud to see our first piece of work at Health Research here at DeepMind appear at Nature Medicine. It's been a long journey and a pleasure to work with quite an inspiring team. #AI #DeepLearning #Healthcare

Google DeepMind

@GoogleDeepMind

13 Aug 2018

Teams at @DeepMind_Health and @Moorfields have developed AI technology that can detect eye disease and prioritise patients. 'Clinically applicable deep learning for diagnosis and referral in retinal OCT' has been published online in @NatureMedicine today: dx.doi.org/10.1038/s41591-01…

Nenad Tomasev · Jun 1, 2017 · 6:52 AM UTC

Nenad Tomasev

@weballergy

1 Jun 2017

GANs + reinforcement learning = OR-GAN; A new paper from Harvard. arxiv.org/abs/1705.10843 #deeplearning #machinelearning #AI

Nenad Tomasev · Jul 20, 2017 · 6:29 AM UTC

Nenad Tomasev

@weballergy

20 Jul 2017

'Imagination-Augmented Agents for Deep Reinforcement Learning' by colleagues at DeepMind: arxiv.org/abs/1707.06203 #deeplearning #AI

Nenad Tomasev · Aug 9, 2018 · 8:40 AM UTC

Nenad Tomasev

@weballergy

9 Aug 2018

Backprop Evolution arxiv.org/abs/1808.02822 : evolutionary approach towards finding alternative update equations to potentially supplant the standard backprop update. #AI #MachineLearning #DeepLearning

Nenad Tomasev · Nov 18, 2021 · 10:33 AM UTC

Nenad Tomasev

@weballergy

18 Nov 2021

As a community, we need to recognize the value of using AI systems not only as tools for 'solving problems' and automating processes - but rather, models about the world from which we ourselves can learn and improve - having them augment our abilities rather than displace them

Nenad Tomasev · Mar 21, 2017 · 7:51 AM UTC

Nenad Tomasev

@weballergy

21 Mar 2017

'Tactics of Adversarial Attack on Deep Reinforcement Learning Agents' arxiv.org/abs/1703.06748 #deeplearning #machinelearning #AI

Nenad Tomasev · Oct 25, 2017 · 8:39 AM UTC

Nenad Tomasev

@weballergy

25 Oct 2017

'Deep Reinforcement Learning from Human Preferences' by DeepMind and OpenAI arxiv.org/abs/1706.03741 #DeepLearning #ReinforcementLearning #AI

Nenad Tomasev · Oct 4, 2017 · 10:54 AM UTC

Nenad Tomasev

@weballergy

4 Oct 2017

'Predicting cancer outcomes from histology and genomics using convolutional networks' biorxiv.org/content/early/20… #DeepLearning #Medicine

Nenad Tomasev · Aug 3, 2017 · 8:33 AM UTC

Nenad Tomasev

@weballergy

3 Aug 2017

'Hidden Physics Models: Machine Learning of Nonlinear Partial Differential Equations' arxiv.org/abs/1708.00588 #physics #machinelearning

Nenad Tomasev · Apr 30, 2024 · 7:33 AM UTC

Nenad Tomasev

@weballergy

30 Apr 2024

I'm really excited to share the work on Med-Gemini, which brings and unites a number of advances in LLM reasoning, search integration, long context utilization, and multimodal understanding into the medical domain, unlocking new opportunities.

Khaled Saab

@_khaledsaab

30 Apr 2024

Introducing Med-Gemini, a family of models that extends the best of Gemini into medicine! ✨⚕️ Highlights of what you can do with Med-Gemini: > Answer medical questions with up-to-date knowledge using agentic web search 🔎❤️‍🩹 > Converse about your medical images, videos, and long multi-visit health records 📷📹📃 > Do a literature search by uploading tens of biomedical papers and asking questions 📚 > And so much more! 🏗️ Development of Med-Gemini included: > Advancing clinical reasoning with self-training and search > Improving multimodal understanding with fine-tuning > Leveraging long-context capabilities with chain-of-reasoning Paper: arxiv.org/abs/2404.18416 Below ⬇️, I talk more about self-training with web search to improve Gemini’s clinical reasoning.

4,221

Nenad Tomasev · Apr 9, 2018 · 7:58 AM UTC

Nenad Tomasev

@weballergy

9 Apr 2018

'Hierarchical Disentangled Representations': on leaning independent factors of variation arxiv.org/abs/1804.02086 #DeepLearning #MachineLearning #AI

Nenad Tomasev · Nov 29, 2017 · 9:51 AM UTC

Nenad Tomasev

@weballergy

29 Nov 2017

'Are GANs Created Equal? A Large-Scale Study' arxiv.org/abs/1711.10337 The study does not find any GAN that consistently outperforms the others. Improvements mostly due to better hyperparam tuning. #DeepLearning #MachineLearning #AI

Nenad Tomasev · Jan 16, 2018 · 9:02 AM UTC

Nenad Tomasev

@weballergy

16 Jan 2018

"Can Computers Create Art?" arxiv.org/abs/1801.04486 : on the interplay between #technology and #art and future prospects for building genuinely creative #AI

Can Computers Create Art?

This essay discusses whether computers, using Artificial Intelligence (AI), could create art. First, the history of technologies that automated aspects of art is surveyed, including photography...

arxiv.org

Nenad Tomasev · Sep 11, 2017 · 7:18 AM UTC

Nenad Tomasev

@weballergy

11 Sep 2017

'Training RNNs as Fast as CNNs': SRUs enable fast parallel computations. arxiv.org/abs/1709.02755 #DeepLearning #MachineLearning #AI

Nenad Tomasev · May 18, 2018 · 7:51 AM UTC

Nenad Tomasev

@weballergy

18 May 2018

"The Blessings of Multiple Causes": performing causal inference in multiple-cause settings. Inferring latent variables for unobserved confounders. arxiv.org/abs/1805.06826 #MachineLearning #DataScience

Nenad Tomasev · Feb 5, 2018 · 8:51 AM UTC

Nenad Tomasev

@weballergy

5 Feb 2018

"How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation": which properties of explanations are most useful to people? arxiv.org/abs/1802.00682 #MachineLearning #AI

Nenad Tomasev · Jul 5, 2019 · 7:33 AM UTC

Nenad Tomasev

@weballergy

5 Jul 2019

'Toward Fairness in AI for People with Disabilities: A Research Roadmap' arxiv.org/abs/1907.02227 highlights the potential of #AI in improving the lives of people with disabilities, while noting that many existing AI systems may not be designed appropriately to achieve this goal.

Toward Fairness in AI for People with Disabilities: A Research Roadmap

AI technologies have the potential to dramatically impact the lives of people with disabilities (PWD). Indeed, improving the lives of PWD is a motivator for many state-of-the-art AI systems, such...

arxiv.org

Nenad Tomasev · Jan 15, 2018 · 9:26 AM UTC

Nenad Tomasev

@weballergy

15 Jan 2018

'MINE: Mutual Information Neural Estimation': Introduces MINE-GAN. Has better coverage, fast convergence and fewer mode collapses. arxiv.org/abs/1801.04062 #DeepLearning #MachineLearning #AI

Nenad Tomasev · Apr 28, 2017 · 7:21 AM UTC

Nenad Tomasev

@weballergy

28 Apr 2017

A nice post on generating high-level document summaries with RNN-s, discussing Pointer-Generator Networks #nlp #deeplearning #AI

Stanford NLP Group

@stanfordnlp

26 Apr 2017

Better neural abstractive summarization—@abigail_e_see. Great blog post abigailsee.com/2017/04/16/ta…—Final @acl2017 paper arxiv.org/abs/1704.04368

Nenad Tomasev · May 22, 2017 · 7:21 AM UTC

Nenad Tomasev

@weballergy

22 May 2017

'Pixel Deconvolutional Networks': improving semantic segmentation. arxiv.org/abs/1705.06820 #deeplearning #machinelearning #computervision

Pixel Deconvolutional Networks

Deconvolutional layers have been widely used in a variety of deep models for up-sampling, including encoder-decoder networks for semantic segmentation and deep generative models for unsupervised...

arxiv.org

Nenad Tomasev · Dec 19, 2017 · 10:10 AM UTC

Nenad Tomasev

@weballergy

19 Dec 2017

'Generating and designing DNA with deep generative models': using #DeepLearning for designing probes for protein binding microarrays. arxiv.org/abs/1712.06148 #AI #Genomics

Nenad Tomasev · Apr 18, 2018 · 10:14 AM UTC

Nenad Tomasev

@weballergy

18 Apr 2018

'Learning Awareness Models': building representations of external objects based only on the internal body signals arxiv.org/abs/1804.06318 #AI #MachineLearning #DeepLearning by CMU, UoM, DeepMind, CIFAR

Nenad Tomasev · Nov 2, 2017 · 8:33 AM UTC

Nenad Tomasev

@weballergy

2 Nov 2017

'Fraternal Dropout' arxiv.org/abs/1711.00066 - an interesting concept. Two RNN copies with different dropout masks forced to be similar.

Nenad Tomasev · Oct 21, 2016 · 6:56 AM UTC

Nenad Tomasev

@weballergy

21 Oct 2016

Non-differentiable, content-addressable memory: 'A Growing Long-term Episodic & Semantic Memory' arxiv.org/abs/1610.06402 #deeplearning #AI

Nenad Tomasev · Apr 2, 2021 · 8:11 AM UTC

Nenad Tomasev

@weballergy

2 Apr 2021

Working towards improving ML explainability in medical applications is key to ensure that the models are not relying merely on spurious correlations - and high-level conceptual expanations can be hepful for clinical interrogation of model behavior.

chilconference @CHILconference

1 Apr 2021

Happy to share our work on demystifying recurrent neural nets for medical applications: Concept-based model explanations for Electronic Health Records arxiv.org/abs/2012.02308 @d_mincu, @shaobohou, @martin_sen, @weballergy, @alan_karthi & @JessicaSchrouff #CHIL2021

Nenad Tomasev · Sep 10, 2020 · 9:44 AM UTC

Nenad Tomasev

@weballergy

10 Sep 2020

Definitely one of the most fun and interesting projects I've had the opportunity to work on! Working with Vladimir Kramnik had been a privilege - seeing how we can use AlphaZero for experimenting with alterations to the rules of chess, exploring some exciting new chess variants.

Google DeepMind

@GoogleDeepMind

10 Sep 2020

In a bid to explore new frontiers in chess, our researchers worked with Vladimir Kramnik to use AlphaZero to test nine new variants of chess. The result? A more creative and collaborative relationship between chess players and machines. bit.ly/32fsmxB via @Wired

Nenad Tomasev · Jun 2, 2017 · 7:34 AM UTC

Nenad Tomasev

@weballergy

2 Jun 2017

'Teaching Machines to Describe Images via Natural Language Feedback' arxiv.org/abs/1706.00130 #deeplearning #machinelearning #AI

Nenad Tomasev · Jan 23, 2018 · 9:10 AM UTC

Nenad Tomasev

@weballergy

23 Jan 2018

"A Deep Reinforcement Learning Chatbot (Short Version)": Introducing MILABOT, a chatbot developed for the Amazon Alexa Prize competition. arxiv.org/abs/1801.06700 #DeepLearning #NLP #AI #MachineLearning

Nenad Tomasev · Aug 7, 2017 · 6:51 AM UTC

Nenad Tomasev

@weballergy

7 Aug 2017

'Independently Controllable Features': learning disentangled representations by interacting with the environment arxiv.org/abs/1708.01289 #AI

Nenad Tomasev · Sep 11, 2017 · 7:14 AM UTC

Nenad Tomasev

@weballergy

11 Sep 2017

'Machine learning & artificial intelligence in the quantum domain': an overview of recent advances. arxiv.org/abs/1709.02779 #AI #physics

Machine learning \& artificial intelligence in the quantum domain

Quantum information technologies, and intelligent learning systems, are both emergent technologies that will likely have a transforming impact on our society. The respective underlying fields of...

arxiv.org

Nenad Tomasev · Mar 19, 2024 · 4:34 PM UTC

Nenad Tomasev

@weballergy

19 Mar 2024

Excited to share our latest piece of work in collaboration with Liverpool FC, on developing a football AI assistant - TacticAI, that can help understand and improve corner kick tactics.

Google DeepMind

@GoogleDeepMind

19 Mar 2024

We're announcing TacticAI: an AI assistant capable of offering insights to football experts on corner kicks. ⚽ Developed with @LFC, it can help teams sample alternative player setups to evaluate possible outcomes, and achieves state-of-the-art results. 🧵 dpmd.ai/49PGq1b

3,569

Nenad Tomasev · Aug 22, 2017 · 7:08 AM UTC

Nenad Tomasev

@weballergy

22 Aug 2017

'A Capacity Scaling Law for Artificial Neural Networks': computing the VC and the MacKay dimension arxiv.org/abs/1708.06019 #deeplearning #AI

Nenad Tomasev · Dec 6, 2024 · 10:52 AM UTC

Nenad Tomasev

@weballergy

6 Dec 2024

If you are as interested as we are in exploring how planning and reasoning with large language models can help master board games and vice versa, Join us this coming Wednesday in Vancouver at the GDM NeurIPS booth, where we will be showing a demo. deepmind.google/discover/eve…

Google DeepMind

Build AI responsibly to benefit humanity

deepmind.google

Nenad Tomasev

@weballergy

5 Dec 2024

5,061

Nenad Tomasev · Jun 2, 2017 · 7:49 AM UTC

Nenad Tomasev

@weballergy

2 Jun 2017

Fader Networks (by FAIR): disentangling image attributes and adjusting their strength arxiv.org/abs/1706.00409 #deeplearning

Nenad Tomasev · Dec 13, 2017 · 9:18 AM UTC

Nenad Tomasev

@weballergy

13 Dec 2017

A nice write-up on an important paper. On how everything can be viewed as a model and how all/most data products (and data structures) can and should be optimized based on the statistical properties of the data. An ML-first dev world.

Delip Rao e/σ

@deliprao

13 Dec 2017

New blog post: On the lessons from the "Learned Index" paper and its impact on product thinking/building. deliprao.com/archives/262

Nenad Tomasev · Jul 5, 2017 · 7:59 AM UTC

Nenad Tomasev

@weballergy

5 Jul 2017

'Multiscale sequence modeling with a learned dictionary' arxiv.org/abs/1707.00762 #deeplearning #machinelearning #AI

Nenad Tomasev · Jun 2, 2017 · 10:00 AM UTC

Nenad Tomasev

@weballergy

2 Jun 2017

On how we recognize faces. (Un)surprisingly similar to modern #AI approaches / encodings.

Jack Rusher @jackrusher

2 Jun 2017

TL;DR Primate brains use what amounts to "face2vec" to encode facial identity + it's decodable from neural activity! cell.com/cell/fulltext/S0092…

Nenad Tomasev · Jul 31, 2017 · 7:17 AM UTC

Nenad Tomasev

@weballergy

31 Jul 2017

'Generator Reversal': modeling natural code distributions in deep generative models. arxiv.org/abs/1707.09241 #deeplearning #machinelearning