Co-Founder and CEO @SakanaAILabs 🎏

Minato-ku, Tokyo
Pinned Tweet
I’m incredibly proud of The AI Scientist team for this milestone publication in @Nature. We started this project to explore if foundation models could execute the entire research lifecycle. Seeing this work validated at this level is a special moment. I truly believe AI will forever change the landscape of how scientific discoveries and scientific progress are made.
The AI Scientist: Towards Fully Automated AI Research, Now Published in Nature Nature: nature.com/articles/s41586-0… Blog: sakana.ai/ai-scientist-natur… When we first introduced The AI Scientist, we shared an ambitious vision of an agent powered by foundation models capable of executing the entire machine learning research lifecycle. From inventing ideas and writing code to executing experiments and drafting the manuscript, the system demonstrated that end-to-end automation of the scientific process is possible. Soon after, we shared a historic update: the improved AI Scientist-v2 produced the first fully AI-generated paper to pass a rigorous human peer-review process. Today, we are happy to announce that “The AI Scientist: Towards Fully Automated AI Research,” our paper describing all of this work, along with fresh new insights, has been published in @Nature! This Nature publication consolidates these milestones and details the underlying foundation model orchestration. It also introduces our Automated Reviewer, which matches human review judgments and actually exceeds standard inter-human agreement. Crucially, by using this reviewer to grade papers generated by different foundation models, we discovered a clear scaling law of science. As the underlying foundation models improve, the quality of the generated scientific papers increases correspondingly. This implies that as compute costs decrease and model capabilities continue to exponentially increase, future versions of The AI Scientist will be substantially more capable. Building upon our previous open-source releases (github.com/SakanaAI/AI-Scien…), this open-access Nature publication comprehensively details our system's architecture, outlines several new scaling results, and discusses the promise and challenges of AI-generated science. This substantial milestone is the result of a close and fruitful collaboration between researchers at Sakana AI, the University of British Columbia (UBC) and the Vector Institute, and the University of Oxford. Congrats to the team! @_chris_lu_ @cong_ml @RobertTLange @_yutaroyamada @shengranhu @j_foerst @hardmaru @jeffclune
78
158
1,230
275,489
This design is better than “𝕏”.
🐦 X @elonmusk
198
9,432
93,223
4,979,624
The Mysteries of the Universe
119
18,635
82,365
Nothing against @JeffBezos but this is the stuff of evil genius villians 🙃
Wevolver
1,433
10,655
68,041
Trolley problem solved:
B作
271
11,199
57,294
Gradient descent is used in many ways at Tesla
175
7,410
50,120
drone pilots
Barstool Sports
161
1,913
8,646
I just ordered this book for my kids.
122
1,838
8,903
😅
29
131
8,436
316,992
AI Twitter these days. 👇🧵
100
1,112
7,952
797,990
The opening line of David Goodstein’s textbook, “States of Matter” 🤯
67
1,118
7,338
830,851
Deploying large language models to production:
45
703
5,244
398,794
AI generated videos out of control 😹
Ring Hyacinth
70
604
5,477
387,688
An auto-encoder with a very strong inductive bias. nitter.app/Damnlnteresting/status…
22
1,015
4,277
Pushing around these little robot soccer players, from DeepMind’s “Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning” paper. arxiv.org/abs/2304.13653 sites.google.com/view/op3-so…
272
665
4,119
1,731,818
We're living in a cyberpunk future: “Fooling automated surveillance cameras: adversarial patches to attack person detection” arxiv.org/abs/1904.08653
49
1,595
3,644
QR codes created using Stable Diffusion and ControlNet. This is Art.
66
442
3,579
534,072
A #StableDiffusion model trained on images of Japanese Kanji characters came up with “Fake Kanji” for novel concepts like Skyscraper, Pikachu, Elon Musk, Deep Learning, YouTube, Gundam, Singularity, etc. They kind of make sense. Not bad!
お絵描きAI(Stable Diffusion)に漢字とその意味を1万字学習させて書き初めをさせました 順に「謹」「賀」「新」「年」です
104
723
3,549
1,266,845
This is what life is like at a Generative AI startup.
64
377
3,446
347,163
Interesting physics analogy of ML from the viewpoint of compression (@elonmusk) “Physics formulas are compression algorithms for reality…If you ran physics simulation of the universe, eventually you will have sentience…At what point from hydrogen to us did it become sentient?”
131
534
3,236
DeepSeek is a side project 🔥
How is Deepseek going to make money?
60
314
3,086
350,031
A fun way to learn about neural networks and AI is to implement a simulation game giving your agents little neural net brains, and training them using a simple method like evolution. This demo trains a small neural network to drive around the track after only a few generations:
29
1,097
3,074
The most important formula in deep learning after 2018
39
495
3,026
I prefer: MANGA 💕💖✨
I propose a new acronym for the AI-ensconced red-hot tech giants: MAGMA Meta, Amazon, Google, Microsoft, Apple.
47
536
3,011
551,522
New Paper: Continuous Thought Machines 🧠 Neurons in brains use timing and synchronization in the way that they compute, but this is largely ignored in modern neural nets. We believe neural timing is key for the flexibility and adaptability of biological intelligence. We propose a new neural architecture, “Continuous Thought Machines” (CTMs), which is built from the ground up to use neural dynamics as a core representation for intelligence. By using neural dynamics as a first-class representational citizen, CTMs naturally perform adaptive computation. Many emergent, interesting behaviors arise as a result: CTMs solve mazes by observing a raw maze image and producing step-by-step instructions directly from its neural dynamics. When tasked with image recognition, the CTM naturally takes multiple steps to examine different parts of the image before making its decision. This step-by-step approach not only makes its behavior more interpretable but also improves accuracy: the longer it “thinks,” the more accurate its answers become. We also found that this allows the CTM to decide to spend less time thinking on simpler images, thus saving energy. When identifying a gorilla, for example, the CTM’s attention moves from eyes to nose to mouth in a pattern remarkably similar to human visual attention. I think this work underscores an important, yet often lost, synergy between neuroscience and AI. While modern AI is ostensibly brain-inspired, the two fields often operate in surprising isolation. By starting with such inspiration and iteratively following the emergent, interesting behaviors, we developed a model with unexpected capabilities, such as its surprisingly strong calibration in classification tasks, a feature that was not explicitly designed for. When we initially asked, “why do this research?”, we hoped the journey of the CTM would provide compelling answers. By embracing light biological inspiration and pursuing the novel behaviors observed, we have arrived at a model with emergent capabilities that exceeded our initial designs. We are committed to continuing this exploration, borrowing further concepts to discover what new and exciting behaviors will emerge, pushing the boundaries of what AI can achieve.
63
549
3,160
257,281
One of the most well-known pieces of software for downloading YouTube videos, “youtube-dl” was removed from GitHub following a takedown notice from the Recording Industry Association of America, or RIAA. Someone encoded the source code into two images and put it on Twitter:
25
890
2,863
Google’s Gemini 2.5 paper has 3295 authors arxiv.org/abs/2507.06261
Google’s Gemini paper has ~1000 authors arxiv.org/abs/2312.11805
82
427
2,994
1,204,866
Facebook AI Research is the OG “Open” AI
92
251
2,939
352,958
Announcing NeurIPS Preschool Track This year, we invite preschoolers to submit machine learning research papers.
62
378
2,833
287,190
This person took ~1700 pages of notes in mathematics lectures using LaTeX and Vim, and documented the workflow: castel.dev/post/lecture-note…
37
872
2,757
Personal Announcement! I’m launching @SakanaAILabs together with my friend, Llion Jones (@YesThisIsLion). sakana.ai is a new R&D-focused company based in Tokyo, Japan. We’re on a quest to create a new kind of foundation model based on nature-inspired intelligence!
141
398
2,742
573,250
data preprocessing
10
368
2,647
Artificial lifeforms are super fascinating to watch. These self-organizing, self-replicating, “lifeforms” emerged from a continuous time cellular automata system called Flow-Lenia. Lenia is a family of CAs generalizing Conway’s Game of Life to continuous space, time and states.
32
486
2,542
568,538
Some personal news: After six years at Google, I decided it was time for me to leave and try something new again. I had a fantastic time at Google Brain, and I’ll miss my friends, collaborators, and hanging out at the microkitchens!
129
42
2,585
"Some rich people lost all their fortunes and became homeless" #StableDiffusion2 #AIart (Source: teddit.net/comments/zy9cmg)
75
273
2,484
413,423
Teams of high school students built bottle-flipping robots for RoboCon 2018 in Japan
28
800
2,474
Anti-hype LLM reading list. Pretty good list. gist.github.com/veekaybee/be…
26
456
2,382
228,694
New blog post: Collective Intelligence for Deep Learning Recently, @yujin_tang and I published a paper about how ideas like swarm behavior, self-organization, emergence are gaining traction in deep learning. I wrote a blog post summarizing the key ideas: blog.otoro.net/2022/10/01/co…

ALT Emergence of encirclement tactics in MAgent, a large scale multi-agent simulator.

60
448
2,322
Using gradient descent for everything nitter.app/danhett/status/1176116…
41
422
2,351
Interesting application combining computer vision and high-powered lasers to eradicate weeds on a farm.
Unlike other weeding technologies, this #robot utilizes high-power lasers to eradicate weeds, without disturbing the soil... And, avoiding the use of herbicides! It leverages #AI to instantly identify and target weeds while rolling, days and night By Carbon Robotics #green
43
374
2,144
Also how deep learning models are trained on a MacBook
This is how I render my animations
40
274
2,103
Proof by meme?
12
274
2,115
Asked #Dalle to generate photographs of the bear stock market in the 1930s
33
346
2,042
Weight Agnostic Neural Networks 🦎 Inspired by precocial species in biology, we set out to search for neural net architectures that can already (sort of) perform various tasks even when they use random weight values. Article: weightagnostic.github.io PDF: arxiv.org/abs/1906.04358
52
626
2,060
Roman Emperor Project Using GAN-based tools to help create photorealistic portraits of Roman Emperors from historical references Project voshart.com/ROMAN-EMPEROR-PR… Article medium.com/@voshart/photorea…
25
516
2,003
MIT offers an excellent course on Deep Learning for Art, Aesthetics, and Creativity. All of the lecture videos are available on YouTube, with a fantastic list of speakers: ali-design.github.io/deepcre…
19
397
1,930
“Oriental Painting of Sun Tzu playing a game of Warcraft II on his desktop compute” generated using #Dalle
20
292
1,859
4 hours of baby play in 2 minutes
50
439
1,916
How do you skim a research paper? I usually read (in order): 1) abstract 2) 1st paragraph of the intro 3) last paragraph of intro (for contributions) 4) 1st paragraph of the conclusion (it's usually one paragraph anyways) 5) figures / tables of results, and read their captions.
44
316
1,909
A GAN trained on accepted @CVPR papers.
18
583
1,937
I'm blown away by the method and results in this paper. Progressive growing neural nets may be a trend we will see in 2018.
14
775
1,860
Machines see objects Humans see ideology
24
359
1,803
If Google doesn’t get their act together and start shipping, they will go down in history as the company who nurtured and trained an entire generation of machine learning researchers and engineers who went on to deploy the technology at other companies… The modern day Bell Labs.
Replying to @hardmaru
What exactly is google going to do with its AI research outcome is the biggest secret in the field 🤣
57
173
1,809
569,487
"Anime scene of Yann Lecun at Bell Labs working on convolutional neural networks." #StableDiffusion
25
136
1,792
I love the diversity of Tokyo’s urban architecture. This Tiny House is designed by Atelier Tekuto.
14
121
1,764
100,909
RIP, John Conway.
18
654
1,777
Infinitely Recursive of Game of Life. Life and civilization emerges from self-organization at different levels of complexity.
saharan / さはら
43
352
1,769
267,989
The most well-dressed snowball fight, colorized using deep learning.
19
310
1,685
uh oh...
20
372
1,735
Neural network video streaming SDK from @NVIDIAAI can compress video conference data like these at ~0.1KB / frame, roughly 1000x better than H.264 (MPEG-4) compression on the same data (~100KB / frame).
23
429
1,645
A series of blog posts on applying machine learning to architecture Experiments: bit.ly/2XDhjtJ Background: bit.ly/2DIeWh4
17
476
1,647
PixelMe: Convert your photo into Pixel Art pixel-me.tokyo/en/ cloud.google.com/blog/produc…
20
364
1,651
Face to Anime using AnimeGANv2 teddit.net/r/MachineLearning…
.@Gradio Demo for AnimeGANv2 Face Portrait v2 now on @huggingface Spaces demo: huggingface.co/spaces/akhali… github: github.com/bryandlee/animega…
11
342
1,640
A Line Rider on Beethoven's 5th Symphony
20
544
1,617
This amazing book on the foundations of machine learning is now available for free from Microsoft as a PDF download. I learned so much from this book over the years, and I feel that much of the material is still relevant. The solutions to the exercises also seem to be available!
"Pattern Recognition and Machine Learning" by @ChrisBishopMSFT is now available as a free download. Download your copy today for an introduction to the fields of pattern recognition & machine learning: aka.ms/prml #ML #Insights
12
532
1,634
This must be where the Bayesians meet up on the weekends
28
195
1,596
Papers with Code: A searchable site that links machine learning papers on ArXiv with code on GitHub. They also tag any framework libraries used, along with other info like GitHub stars. I think such a feature would be a nice addition to ArXiv-Sanity. paperswithcode.com/
7
610
1,608
Oil should learn to code.
22
257
1,542
Fooling Facial Detection with Fashion Nice article surveying common face detection methods, and tests practical implementations of adversarial patches on a face mask for fooling them. h/t @MelMitchell1 bit.ly/2VJbOLc github.com/BruceMacD/Adversa…
14
468
1,515
High resolution inpainting experiment with #StableDiffusion2 Transporting the famous Futaba Sushi restaurant in Ginza, Tokyo, to other cities, countries, planets, and finally, to a galaxy far far away… #StableDiffusion #AIart
26
279
1,556
851,619
Reinforcement Learning for Improving Agent Design: What happens when we let an agent learn a better body design together with learning its task? article: designrl.github.io/ pdf: arxiv.org/abs/1810.03779
24
405
1,527
One year after building rock solid technical infrastructure for your machine learning research project
15
193
1,483
“First Order Motion Model for Image Animation” hooked up to a live camera: github.com/anandpawara/Real_… Original NeurIPS2019 paper / code: aliaksandrsiarohin.github.io…
14
311
1,485
Being able to fool AI detection algorithms IRL will be an important survival skill in the 21st century 👻
In other news, our latest work made its way onto Japanese national TV this week! 🚘
30
225
1,486
316,232
This #StableDiffusion add-on for Blender looks amazing. @AI_Render renders an AI-generated image based on a text prompt and your scene in Blender. github.com/benrugg/AI-Render
17
312
1,457
OpenAI, Google & Anthropic ban the use of the generated output content from their AI models to train other AI models, under their terms-of-service. However, they’ve been using other online content for their own model training. They can’t have it both ways. businessinsider.com/openai-g…
43
325
1,441
300,414
Life before GPUs.
13
158
1,462
172,481
Language is primarily a tool for communication rather than thought nature.com/articles/s41586-0… “Language is a defining characteristic of our species, but the function, or functions, that it serves has been debated for centuries. Here we bring recent evidence from neuroscience and allied disciplines to argue that in modern humans, language is a tool for communication, contrary to a prominent view that we use language for thinking. We begin by introducing the brain network that supports linguistic ability in humans. We then review evidence for a double dissociation between language and thought, and discuss several properties of language that suggest that it is optimized for communication. We conclude that although the emergence of language has unquestionably transformed human culture, language does not appear to be a prerequisite for complex thought, including symbolic thought. Instead, language is a powerful tool for the transmission of cultural knowledge; it plausibly co-evolved with our thinking and reasoning capacities, and only reflects, rather than gives rise to, the signature sophistication of human cognition.”
84
311
1,434
644,768
LIMA, a 65B LLaMa fine-tuned only with supervised learning on 1000 curated examples, without any RLHF, demonstrates remarkably strong performance, generalizes well to unseen tasks not in training data. Comparable to GPT-4, Bard, DaVinc003 in human studies.teddit.net/r/MachineLearning…
21
226
1,464
591,432
An interactive article explaining why weight initialization is so important for training neural nets by @deeplearningai_, written in the distill.pub format. deeplearning.ai/ai-notes/ini…
2
459
1,478
With the right body, no brain is needed.
MachinePix
11
228
1,430
Uber developed a system of nested hexagons to represent space, called H3: eng.uber.com/h3/
64
183
1,432
how probability distributions are related
34
461
1,416
Jupyter notebooks with Python examples for reproducing examples from each chapter of Christopher Bishop's “Pattern Recognition and Machine Learning” textbook (also available for free in link above) github.com/ctgk/PRML
7
405
1,425
I’m super excited to see ideas from complex systems such as swarm intelligence, self-organization, and emergent behavior gain traction again in AI research. We wrote a survey of recent developments that combine ideas from deep learning and complex systems: arxiv.org/abs/2111.14377
23
284
1,413
Academic Torrents is a distributed system for sharing enormous datasets. So far they have made 27.23TB of research data available. academictorrents.com
6
517
1,415
The map of the brain, created by an aerospace engineer. These are the result of six years of research. It’s always interesting to me to view the perspective of one challenging scientific field through the lens of an expert from another field. 🧠 Source: thehighestofthemountains.com…
24
302
1,378
Excited to announce our Series A! We raised more than $100M to grow Sakana AI into a World Class AI Lab in Japan. We’re going to really push the frontiers of what’s possible with AI. As a founder mode startup, we operate much faster than most frontier AI labs at a global level.
117
101
1,439
233,500
After many years, this guy has come back to haunt me.
Replying to @hardmaru
If we remove all design constraints, the optimizer came up with a really tall bipedal walker robot that “solves” the task by simply falling over and landing near the exit.
39
63
1,443
110,062
Self-attention mechanism can be viewed as the update rule of a Hopfield network with continuous states. Deep learning models can take advantage of Hopfield networks as a powerful concept comprising pooling, memory, and attention. arxiv.org/abs/2008.02217 github.com/ml-jku/hopfield-l…
21
349
1,399
i feel seen
17
143
1,388
Decoding the Enigma with RNNs. They trained a LSTM with 3000 hidden units to decode ciphertext with 96%+ accuracy. greydanus.github.io/2017/01/…
11
735
1,394
Using deep learning to implement linear regression
A skilled excavator operator with a Engcon EC206 tiltrotator.
12
291
1,350
Edo period cat meme
16
466
1,327
Dive into Deep Learning: An interactive deep learning book with code, math, and discussions, based on the NumPy interface. I really like the format of the textbook! d2l.ai/
15
324
1,368
MANGA sounds better than FAANG
26
401
1,367
TinyML and Efficient Deep Learning Computing MIT 6.5940 (efficientml.ai) “This course will introduce efficient AI computing techniques that enable powerful deep learning applications on resource-constrained devices. Topics include model compression, pruning, quantization, neural architecture search, distributed training, data/model parallelism, gradient compression, and on-device fine-tuning. It also introduces application-specific acceleration techniques for large language models, diffusion models, video recognition, and point cloud. This course will also cover topics about quantum machine learning. Students will get hands-on experience deploying large language models (e.g., LLaMA 2) on a laptop.”
22
215
1,349
239,527
Conventional thinking: Build a robot to solve the problem. Out-of-the-box thinking: Get the problem to solve itself. Example: Self-solving Rubik's Cube by @takashikaburagi
15
278
1,336